[Netarchivesuite-users] Issue - harvest running infinitely

Colin Rosenthal csr at kb.dk
Tue May 1 10:48:02 CEST 2018


You may also want to follow the heritrix mailing list: 
archive-crawler at yahoogroups.com

cheers,
Colin

On 04/30/2018 11:50 AM, Koit Summatavet wrote:
> Hi,
>
> I have started using NAS to harvest Estonian websites and I have encountered a
> problem:
>
> In a situation where the harvest doesn't hit either the document not the size
> limit then the harvest runs infinitely and all the threads are in TIMED_WAITING
> state where they wait from hours to days. The longer it runs the longer the wait
> becomes and URL's are processed very slowly and after a long time.
>
> How to stop this frong happening and changes to make in the harvest template?
>
> I am using NAS version 5.3.1. Does the same happen on versuon 5.4?
>
> With regards,
> Koit
>
>
> _______________________________________________
> NetarchiveSuite-users mailing list
> NetarchiveSuite-users at ml.sbforge.org
> https://ml.sbforge.org/mailman/listinfo/netarchivesuite-users


-- 
Colin Rosenthal PhD
Senior IT Consultant
Royal Danish Library (Aarhus)

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://ml.sbforge.org/pipermail/netarchivesuite-users/attachments/20180501/43bc30a0/attachment.html>


More information about the NetarchiveSuite-users mailing list