<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 14 (filtered medium)">
<!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
{font-family:Batang;
panose-1:2 3 6 0 0 1 1 1 1 1;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"Calibri Light";
panose-1:2 15 3 2 2 2 4 3 2 4;}
@font-face
{font-family:"\@Batang";
panose-1:2 3 6 0 0 1 1 1 1 1;}
@font-face
{font-family:Aharoni;
panose-1:2 1 8 3 2 1 4 3 2 3;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
p
{mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
span.EstiloCorreo20
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:#1F497D;}
.MsoChpDefault
{mso-style-type:export-only;
mso-fareast-language:EN-US;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:70.85pt 3.0cm 70.85pt 3.0cm;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="ES" link="blue" vlink="purple">
<div class="WordSection1">
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Hello all,<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Here is our update:<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">We are working in our three event crawl: European Parliament elections, local elections and Spanish Government elections that are still running.
We have had a great collaboration from the different regions in the local elections, and we nominate over 3.700 sites.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">We still don’t know when we are going to launch our broad crawl but it will probably be in September<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">This is the problem that we have that I have told you. I hope you can understand:<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">There is a huge harvest in one of our collections and we can’t crawl all the seeds in it. We have a problem with the division in Jobs of the harvest.
For example, with version 5.3 we had loaded 15,000 URLs in a harvest and it generated 29 jobs of 620 URLs and one job with the rest .. When update to 5.4, it generate Jobs of 2096 URLs, which creates a local disk problem in the spiders because it is small.
We use the same template as in 5.3 but we don’t know why the division is different. Do you know what this can be? Is there a parameter in NAS (or templates) that we can modify to reduce the number of URLs generated in each job?<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">If you have any questions about this, please do not hesitate to ask me.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D">Thank you!<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri Light","sans-serif";color:gray">Alicia Pastrana García<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri Light","sans-serif";color:gray">Área de Gestión del Depósito de las Publicaciones en Línea<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri Light","sans-serif";color:gray">División de Procesos y Servicios Digitales<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri Light","sans-serif";color:gray">Tfno.: 91 516 89 92<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;font-family:"Calibri Light","sans-serif";color:gray">Biblioteca Nacional de España<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif"">De:</span></b><span style="font-size:10.0pt;font-family:"Tahoma","sans-serif""> Netarchivesuite-curator [mailto:netarchivesuite-curator-bounces@ml.sbforge.org]
<b>En nombre de </b>geraldine.camile@bnf.fr<br>
<b>Enviado el:</b> martes, 02 de julio de 2019 11:50<br>
<b>Para:</b> netarchivesuite-curator@ml.sbforge.org<br>
<b>CC:</b> bert.wendland@bnf.fr; leslie.bellony-ext@bnf.fr; DDL_DLN@bnf.fr; clara.wiatrowski@bnf.fr<br>
<b>Asunto:</b> [Netarchivesuite-curator] BnF NAS update for July<o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span style="font-size:10.0pt;font-family:"Arial","sans-serif"">Hello all,</span><br>
<br>
<span style="font-size:10.0pt;font-family:"Arial","sans-serif"">In March, we launched a selective project crawl for the European elections which is to come to an end in the coming days. 15 curators contributed to the nomination of over 1480 sites among which
social networks (twitter mostly but also facebook and Youtube channels) represent the largest share (around 60%). Eventually, 18 weekly, 5 monthly and over 120 daily crawls were led. We contributed for 85 sites to the collaborative crawl launched by Ricardo
Basilio on European elections results.</span><br>
<br>
<span style="font-size:10.0pt;font-family:"Arial","sans-serif"">We also added our contribution to the collaborative crawl on Artificial intelligence (85 sites).</span><br>
<br>
<span style="font-size:10.0pt;font-family:"Arial","sans-serif"">Best regards,</span><br>
<span style="font-size:10.0pt;font-family:"Arial","sans-serif"">The BnF digital legal deposit team</span><span style="font-family:"Arial","sans-serif""><o:p></o:p></span></p>
<div class="MsoNormal" align="center" style="text-align:center"><span style="font-family:"Arial","sans-serif"">
<hr size="2" width="100%" align="center">
</span></div>
<p><span style="font-family:"Arial","sans-serif"">Expositions <em><b><span style="font-family:"Arial","sans-serif""><a href="https://www.bnf.fr/fr/agenda/manuscrits-de-lextreme">Manuscrits de l’extrême
</a></span></b></em>– jusqu'au 7 juillet 2019 | François-Mitterrand<br>
et <em><b><span style="font-family:"Arial","sans-serif""><a href="https://www.bnf.fr/fr/agenda/le-monde-en-spheres">Le Monde en sphères</a></span></b></em> – jusqu'au 21 juillet 2019 | François-Mitterrand<o:p></o:p></span></p>
<p><strong><span style="font-family:"Arial","sans-serif";color:green">Avant d'imprimer, pensez à l'environnement.</span></strong><span style="font-family:"Arial","sans-serif";color:green"><o:p></o:p></span></p>
</div>
<hr width="100%">
<font size="1">Este mensaje y cualquier fichero adjunto están dirigidos únicamente a sus destinatarios y contiene información confidencial. Si usted ha recibido este correo electrónico por error, le informamos que no puede realizar ninguna revisión, alteración,
impresión, copia, transmisión, difusión ni utilización alguna de este mensaje ni de cualquier fichero adjunto que pudiese contener. La realización de cualquiera de los actos indicados está expresamente prohibida por las Normas que regulan estas materias. Por
todo ello se solicita que, en caso de existir error en la recepción de este mensaje, se lo notifique al remitente respondiendo a este e-mail y elimine el mensaje y su contenido inmediatamente. La Biblioteca Nacional de España se reserva las acciones legales
que le correspondan en el caso de que se infrinja lo indicado anteriormente.</font>
<hr width="100%">
<font size="1">The information in this e-mail and any attachments is confidential and it is intended for the addressee only. If you have received this e-mail in error, you are notified that any revision, amendment, print, copy, disclosure, distribution or use
of the contents is unauthorized. Carrying out any of the above actions, is expressly banned by rules governing this matter. Hence we request that if you are not the intended recipient, please notify the sender answering this e-mail, and delete the message
and any attachments. The National Library of Spain reserves itself the right to take the appropriate legal actions in the event of the above mentioned matter is being infringed.</font>
<hr width="100%">
</body>
</html>