[Netarchivesuite-curator] BNE NAS update for December

Pérez Morillo, Mar mar.perez at bne.es
Wed Dec 23 17:16:07 CET 2015


Dear all,

As most of you already know, we have been using a single NAS environment, version 4.2, for the past year to launch our selective crawls.

At the beginning of December 2015, our IT team deployed NAS 4.4 in a new environment, which will become the production environment, leaving version 4.2 as the development infrastructure.

Thanks to an agreement with the public entity Red.es, we have been able to improve our infrastructure, not only in storage but also (and mainly) in servers. We now have 23 physical servers and can use up to 60 virtual servers if needed.

With this new infrastructure we launched our first crawl of the General Elections in Spain on December 1st (the previous one, in 2011, was done for the Library by the Internet Archive). The General Elections 2015 crawl is still running and will last until Parliament votes in the new Prime Minister. The libraries collaborating on non-print legal deposit are taking part in nominating seeds for the collection.

Tied to this new NAS environment, we launched CWeb at the beginning of December. CWeb is the Spanish version of BCWeb, an interface built by the BnF and kindly shared with the BNE. We adapted the tool to our system and hope it will be ready for web curators to use in January 2016, not only at the Library but also at the libraries around Spain that collaborate on non-print legal deposit. So far only the web archiving team at the BNE is using CWeb to manage the web collections. We are very grateful to the Bibliothèque nationale de France for sharing this tool with the BNE.

We also gave web curators (inside and outside the Library) access to OpenWayback so that they can do quality assurance. For now it is available only to curators, through the secure Spanish Administration network, until we can give access to users.

We also drafted a User's Guide for web curators, which includes not only guidelines for selecting seeds but also instructions for using CWeb and for doing QA, both manually and with the crawl logs and seed reports provided by NAS. Let's see how far our web curators can go in this field.

Lastly, we ran a test crawl in preparation for our first .es domain crawl, which we expect to launch around the end of winter or the beginning of spring 2016.

If you have any questions, don't hesitate to write to Archivoweb at bne.es.

Have a happy Christmas!

Mar Pérez Morillo
Head of the Online Publications Deposit Management Area
Digital Library and Information Systems Directorate
Tel.: 91 516 89 92
Biblioteca Nacional de España

