[Netarchivesuite-devel] [Fwd: [NS-114] Comment: FR1773: treatment of "harvesting aborted" status]
Colin Rosenthal
csr at statsbiblioteket.dk
Mon Oct 26 10:25:32 CET 2009
I'm forwarding this to netarchivesuite-devel for general discussion.
In the most recent implementation,
the behaviour is this. If a harvest is stopped either from the heritrix
GUI, or by killing heritrix, or due to
inactivity timeout or due to a fatal error then all domains in the
harvest will be marked "harvesting aborted".
On starting a new snapshot harvest based on this harvest, these domains
will not be included automatically.
Is this behaviour acceptable?
The problem Søren identifies below is that we have no simple way to
distinguish between harvests stopped
by manual intervention and those stopped due to inactivity timeout.
My own opinion is that at least "harvesting aborted" gives the right
message in the GUI. The previous
behaviour was that these domains were marked as "domain completed" which
is downright misleading.
--
Colin
-------- Original Message --------
Subject: [NS-114] Comment: FR1773: treatment of "harvesting aborted"
status
Date: Fri, 23 Oct 2009 14:00:33 +0200
From: Søren <svc at kb.dk>
To: Colin Samuel Rosenthal <csr at statsbiblioteket.dk>
References: <r128.d1256280029772 at cruciblethreadindicator>
NS-114 commented(see http://kb-prod-udv-001.kb.dk:8060/cru/NS-114#c1650 ):-
Another issue is, that the text "Heritrix terminated by operator" is also written, when HeritrixLauncher terminates the harvesting due to inactivity.
And therefore not only when the user terminates the harvest from the Heritrix GUI
---
ID: NS-114 http://kb-prod-udv-001.kb.dk:8060/cru/NS-114/review
Title: FR1773: treatment of "harvesting aborted" status
Statement of Objectives:
See https://gforge.statsbiblioteket.dk/tracker/?func=detail&group_id=7&aid=1773&atid=108
State: Review
Author: Colin
Moderator: Colin
Reviewers: (1 active, 0 completed*)
Søren
More information about the Netarchivesuite-devel
mailing list