[Netarchivesuite-users] Issue - harvest running infinitely
csr at kb.dk
Tue May 1 10:48:02 CEST 2018
You may also want to follow the heritrix mailing list:
archive-crawler at yahoogroups.com
On 04/30/2018 11:50 AM, Koit Summatavet wrote:
> I have started using NAS to harvest Estonian websites and I have encountered a
> In a situation where the harvest doesn't hit either the document not the size
> limit then the harvest runs infinitely and all the threads are in TIMED_WAITING
> state where they wait from hours to days. The longer it runs the longer the wait
> becomes and URL's are processed very slowly and after a long time.
> How to stop this frong happening and changes to make in the harvest template?
> I am using NAS version 5.3.1. Does the same happen on versuon 5.4?
> With regards,
> NetarchiveSuite-users mailing list
> NetarchiveSuite-users at ml.sbforge.org
Colin Rosenthal PhD
Senior IT Consultant
Royal Danish Library (Aarhus)
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the NetarchiveSuite-users