[Netarchivesuite-users] Two concurrent jobs
Bjarne Andersen
netarkivet at statsbiblioteket.dk
Fri May 30 10:22:55 CEST 2008
You need to instantiate more harvester-controllers. One server can easily run more than one instance.
In netarchive.dk we have 5 servers - each running:
- 3 HIGHPRIORITY instances
- 1 LOWPRIORITY instance
HIGHPRIORITY is for selective jobs and LOWPRIORITY is for snapshot jobs (larger crawls).
If you do not intend to do snapshot harvesting, I see no reason why you should not be able to run maybe 5 HIGHPRIORITY instances on the
same server. This should allow you to harvest 5 jobs simultaneously.
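To illustrate why more instances help, here is a minimal sketch in Python. It is a toy model and not NetarchiveSuite code: the function name start_times, the job tuples and the first-come-first-served queue are all assumptions made for illustration. It only shows that with a single instance the hourly job waits behind the 6-hour job, while a second instance can pick it up on time.

```python
import heapq

def start_times(jobs, instances):
    """Toy model (hypothetical, not NetarchiveSuite code).

    jobs: list of (name, submit_minute, duration_minutes), sorted by submit time.
    Returns {name: start_minute} when `instances` harvester instances
    take jobs from a single queue in first-come-first-served order.
    """
    free_at = [0] * instances  # minute at which each instance becomes idle
    heapq.heapify(free_at)
    starts = {}
    for name, submitted, duration in jobs:
        idle = heapq.heappop(free_at)   # earliest available instance
        start = max(idle, submitted)    # a job cannot start before it is submitted
        starts[name] = start
        heapq.heappush(free_at, start + duration)
    return starts

# One 6-hour daily job, then a 2-minute job submitted every hour.
jobs = [("daily", 0, 360)] + [(f"hourly-{h}", 60 * h, 2) for h in range(1, 4)]

one = start_times(jobs, 1)   # the hourly jobs wait until the daily job is done
two = start_times(jobs, 2)   # the second instance runs the hourly jobs on time
print(one)
print(two)
```

In this model the first hourly job starts at minute 360 with one instance and at minute 60 with two.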
best
--
Bjarne Andersen
Daily Manager - netarchive.dk
State & University Library
Universitetsparken
DK-8000 Aarhus C
T: +45 89462165 - C: +45 25662353
CVR/SE 10100682 - EAN 5798000791084
http://netarchive.dk
Peter Moser wrote:
> Hello!
>
> I have another question. I have two harvest definitions running (both
> for selective crawling). The first definition contains a lot of
> configurations, which take more than 6 hours to harvest. The second
> definition contains just one configuration with one URL, which is fetched
> in two minutes. I want to harvest the first definition daily and the second
> definition every hour. The problem now is that a job for the second
> definition is postponed until the job for the first definition has
> finished. What can I do so that jobs for the 2nd definition are crawled
> every hour, no matter how long the daily crawl of the first definition
> lasts?