[Netarchivesuite-users] Oldjobs directory growing too big

Richard Edenhofer redenhofer at gmx.at
Sat May 2 14:20:55 CEST 2009


>
> You cannot resubmit your job in the GUI until you have applied the workaround for jobs that never leave status STARTED.
>
> You have to find an "empty" harvester, copy the job's directory back to that harvester instance and restart that specific harvester (kill_harvester_PORTNR.sh / start_harvester_PORTNR.sh; I'm not sure whether these are available if you have not used the deploy application (a completely new version is available now)).
>
> After the job reports status FAILED back in the GUI, you can look at the statistics and, based on that, decide whether your job should run again or whether you are happy with the amount of data harvested.
>
>   
"kill_harvester_PORTNR.sh / start_harvester_PORTNR.sh" - that was the missing link!
I am using the deploy application. After I restarted the harvester, the system state in the Suite shows the expected message:

"WARNUNG: Found old unprocessed job data in dir 
'/home/netarchive/apps/netarchivesuite/ONB/harvester_7053/40_1241200515810'. 
Crawl probably interrupted by shutdown of HarvestController. Processing 
data"
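
For anyone finding this thread later, the copy-back step could be sketched roughly like this. It is only an illustration: the function name restore_job, its arguments and the directory layout are my own assumptions and not part of NetarchiveSuite, and you still have to make sure the target harvester is really "empty" before you copy anything.

```shell
#!/bin/sh
# Sketch only. restore_job and its arguments are hypothetical helpers,
# not something shipped with NetarchiveSuite.
#
# Usage: restore_job OLDJOBS_DIR JOB_DIR_NAME HARVESTER_DIR
# Copies one job directory from the oldjobs directory back into the
# working directory of an idle harvester instance.
restore_job() {
    oldjobs_dir=$1
    job_name=$2
    harvester_dir=$3

    if [ ! -d "$oldjobs_dir/$job_name" ]; then
        echo "no such job directory: $oldjobs_dir/$job_name" >&2
        return 1
    fi
    if [ ! -d "$harvester_dir" ]; then
        echo "no such harvester directory: $harvester_dir" >&2
        return 1
    fi
    # Refuse to overwrite job data that is already in place.
    if [ -e "$harvester_dir/$job_name" ]; then
        echo "target already exists: $harvester_dir/$job_name" >&2
        return 1
    fi

    # -R copies the whole job directory, -p keeps timestamps and modes.
    cp -Rp "$oldjobs_dir/$job_name" "$harvester_dir/"
}

# After copying, restart that specific harvester with the scripts named
# above, replacing PORTNR with the port of the harvester instance.
# Where the scripts live depends on your installation:
#   ./kill_harvester_PORTNR.sh
#   ./start_harvester_PORTNR.sh
```

The function refuses to overwrite an existing directory on purpose, so running it twice by mistake does not clobber data the harvester has already picked up.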

Thanks for your help!
a.

