<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">Hi Koit,<br>
<br>
You can also use the H3 Remote Access section in NetarchiveSuite
to monitor and terminate the harvest.<br>
One possible way to diagnose what is happening:<br>
<br>
i) pause Heritrix<br>
ii) list the Frontier to see which URLs Heritrix is working on<br>
iii) delete any problem URLs from the Frontier<br>
iv) unpause Heritrix and let it finish normally.<br>
<br>
Sometimes you may need to terminate heritrix instead. <br>
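<br>
If you would rather script the pause, unpause and terminate steps than click through the web pages, something like the following might work. It is only a rough sketch, assuming the job is reachable through the Heritrix 3 REST interface with HTTP Digest authentication. The host, port and job name in the usage line are placeholders I made up, and the function names build_job_action and send are my own, not part of NetarchiveSuite or Heritrix.<br>
<pre wrap="">
```python
import ssl
import urllib.parse
import urllib.request

# Job-level actions used in the steps above (assumed to match the Heritrix 3 REST API).
JOB_ACTIONS = ("pause", "unpause", "terminate")


def build_job_action(base_url, job_name, action):
    """Build (without sending) a POST request that applies `action` to a job."""
    if action not in JOB_ACTIONS:
        raise ValueError("unsupported action: " + action)
    # The job name is percent-encoded so that spaces or slashes cannot break the path.
    url = base_url.rstrip("/") + "/engine/job/" + urllib.parse.quote(job_name, safe="")
    body = urllib.parse.urlencode({"action": action}).encode("ascii")
    return urllib.request.Request(url, data=body, method="POST",
                                  headers={"Accept": "application/xml"})


def send(request, user, password):
    """Send the request with HTTP Digest auth and return the HTTP status code."""
    password_mgr = urllib.request.HTTPPasswordMgrWithDefaultRealm()
    password_mgr.add_password(None, request.full_url, user, password)
    # Assumption: the instance uses a self-signed certificate, so verification is off.
    context = ssl.create_default_context()
    context.check_hostname = False
    context.verify_mode = ssl.CERT_NONE
    opener = urllib.request.build_opener(
        urllib.request.HTTPSHandler(context=context),
        urllib.request.HTTPDigestAuthHandler(password_mgr),
    )
    with opener.open(request, timeout=30) as response:
        return response.status
```
</pre>
For example, send(build_job_action("https://localhost:8443", "myjob", "pause"), "admin", "adminPassword") would pause a job called "myjob" if the default credentials are still in place. Listing and deleting URLs in the Frontier is not covered by this sketch; for that I would still use the H3 Remote Access pages.<br>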
<br>
regards,<br>
Colin<br>
<br>
On 04/30/2018 01:14 PM, Søren Vejrup Carlsen wrote:<br>
</div>
<blockquote cite="mid:9a6025a7ed7f40c8824be9594753a77c@kb.dk"
type="cite">
<pre wrap="">Hi Koit.
Is it only a specific website that causes the problem, or is this a general problem?
Either way, you can always log on to the heritrix3 instance and terminate the job manually
with the following credentials (User: admin , Password: adminPassword ),
unless you have changed these values.
Søren Vejrup Carlsen
IT consultant
IT-Udvikling.København
ITUI
+4591324841
<a class="moz-txt-link-abbreviated" href="mailto:svc@kb.dk">svc@kb.dk</a>
Det Kgl. Bibliotek
Royal Danish Library
Søren Kierkegaards Plads 1
DK-1221 København K
+45 3347 4747
CVR 2898 8842
EAN 5798 000 795297
From: NetarchiveSuite-users [<a class="moz-txt-link-freetext" href="mailto:netarchivesuite-users-bounces@ml.sbforge.org">mailto:netarchivesuite-users-bounces@ml.sbforge.org</a>] On Behalf Of Koit Summatavet
Sent: Monday, April 30, 2018 11:50 AM
To: <a class="moz-txt-link-abbreviated" href="mailto:netarchivesuite-users@ml.sbforge.org">netarchivesuite-users@ml.sbforge.org</a>
Subject: [Netarchivesuite-users] Issue - harvest running infinitely
Hi,
I have started using NAS to harvest Estonian websites and I have encountered a problem:
When the harvest hits neither the document limit nor the size limit, it runs indefinitely, and all the threads are in the TIMED_WAITING state, where they wait from hours to days. The longer it runs, the longer the waits become, and URLs are processed very slowly.
How can I stop this from happening, and what changes should I make in the harvest template?
I am using NAS version 5.3.1. Does the same happen in version 5.4?
With regards,
Koit
</pre>
<br>
<br>
<pre wrap="">_______________________________________________
NetarchiveSuite-users mailing list
<a class="moz-txt-link-abbreviated" href="mailto:NetarchiveSuite-users@ml.sbforge.org">NetarchiveSuite-users@ml.sbforge.org</a>
<a class="moz-txt-link-freetext" href="https://ml.sbforge.org/mailman/listinfo/netarchivesuite-users">https://ml.sbforge.org/mailman/listinfo/netarchivesuite-users</a>
</pre>
</blockquote>
<br>
<br>
<pre class="moz-signature" cols="72">--
Colin Rosenthal PhD
Senior IT Consultant
Royal Danish Library (Aarhus)</pre>
</body>
</html>