[Netarchivesuite-users] Generating CDX for duplicate entries
nicolas.giraud at bnf.fr
nicolas.giraud at bnf.fr
Thu May 7 10:57:38 CEST 2009
Hi Søren,
Ok I found out that finally, thanks for your answer! I have written a tool
to generate "revisit CDX" indices for Wayback Machine. I merge these
indices to my regular indices (generated from the ARC files) and I get to
see a search result in Wayback every time the resource was crawled, which
is what I intended to obtain.
Right now the process is external to NAS, and so is my regular Wayback
indexing of the archive. I'll write to Colin, because I have started to
have a look at how to automate Wayback indexing of the BitArchive files,
I'd like to share with him and see what he came up with.
Cheers,
Nicolas
Avant d'imprimer, pensez à l'environnement.
Consider the environment before printing this mail.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.gforge.statsbiblioteket.dk/pipermail/netarchivesuite-users/attachments/20090507/767a93ab/attachment.html
More information about the NetarchiveSuite-users
mailing list