[Netarchivesuite-users] What is Alternate Snapshot Jobgeneration Method? How to switch off a domain?

Peter Svanberg Peter.Svanberg at kb.se
Tue May 26 12:15:32 CEST 2020


Not so easy questions, after all?

Could you please tell me whether you have set this setting to true?
settings.harvester.scheduler.jobGen.useAlternateSnapshotJobgenerationMethod
And if so, are you happy with it? :)

Regards,
-----

Peter Svanberg

National Library of Sweden
Phone: +46 10 709 32 78

E-mail: peter.svanberg at kb.se
Web: www.kb.se




From: NetarchiveSuite-users <netarchivesuite-users-bounces at ml.sbforge.org> On behalf of Peter Svanberg
Sent: 11 May 2020 16:41
To: netarchivesuite-users at ml.sbforge.org
Subject: [Netarchivesuite-users] What is Alternate Snapshot Jobgeneration Method? How to switch off a domain?

Hello!

In preparation for step 3 of our broad crawl, I have two easy (?) questions.

While looking for the criteria that decide whether a domain is included in a snapshot iteration, I found the setting
settings.harvester.scheduler.jobGen.useAlternateSnapshotJobgenerationMethod
but no description of how the alternate method differs from the default. I also can't easily figure it out from the source code or the Jira issues. (Something to do with configuration order and extended attributes?)


1) Could someone briefly explain how this alternate method differs from the default, and give some hints on how to choose between them?

2) What is the easiest way to make NAS ignore a domain in an iterative snapshot, even though the domain didn't complete in the previous iteration? Commenting out the domain's seed list with "#" is the only way I've found so far.

Regards,

-----

Peter Svanberg

National Library of Sweden
Phone: +46 10 709 32 78

E-mail: peter.svanberg at kb.se
Web: www.kb.se

