[Netarchivesuite-users] Generating CDX for duplicate entries

nicolas.giraud at bnf.fr
Thu May 7 10:57:38 CEST 2009


Hi Søren,

OK, I finally figured it out, thanks for your answer! I have written a tool 
to generate "revisit CDX" indices for the Wayback Machine. I merge these 
indices into my regular indices (generated from the ARC files), so Wayback 
now shows a search result for every time the resource was crawled, which 
is what I wanted.
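
For illustration, the merge step could look roughly like the sketch below. This is not the actual tool, only a minimal example under stated assumptions: each input is already sorted line by line (as Wayback expects of CDX files), the function name merge_cdx is made up for this example, and the optional " CDX ..." format header is dropped during the merge and would have to be written back by the caller.

```python
import heapq


def merge_cdx(sorted_inputs):
    """Merge already-sorted iterables of CDX lines into one sorted list.

    Hypothetical helper, not part of NetarchiveSuite or Wayback.
    Assumes every input is sorted line by line in the same order.
    """
    def records(lines):
        for line in lines:
            line = line.rstrip("\n")
            # Skip blank lines and the " CDX ..." format header line.
            if line and not line.startswith(" CDX"):
                yield line

    merged = []
    previous = None
    # heapq.merge streams the inputs lazily and keeps the output sorted.
    for record in heapq.merge(*(records(lines) for lines in sorted_inputs)):
        # Drop exact duplicates that appear in more than one index.
        if record != previous:
            merged.append(record)
        previous = record
    return merged
```

With real files, the inputs would simply be open file objects for the regular index and the revisit index, and the result would be written out to the index file that Wayback reads.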

Right now the process is external to NAS, and so is my regular Wayback 
indexing of the archive. I'll write to Colin, because I have started 
looking at how to automate Wayback indexing of the BitArchive files. 
I'd like to share that with him and see what he has come up with.

Cheers,
Nicolas



