<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr">
<div id="divtagdefaultwrapper" style="font-size:12pt;color:#000000;font-family:Calibri,Helvetica,sans-serif;" dir="ltr">
<p>Hej Miguel,</p>
<p><br>
</p>
<p>Every checksum operation is a batch job. This generates a local temporary output file which should be deleted when the result has been returned. (These will often be empty - for example if the file being queried actually lies on a different machine.) However
we mostly have experience with the FTPRemoteFile implementation, so I'm worried that this deletion doesn't work in HTTPRemoteFile. The easiest workaround would be to create a script that deletes anything older than a few hours from the tempdir and have it
run regularly. </p>
<p><br>
</p>
<p>Someone should also fix it in the code so that it happens automatically, or at least check to see if there is something in the settings that might help. Is anyone else using HTTPRemoteFile?<br>
<br>
/Colin</p>
<p><br>
</p>
<div id="Signature">
<div name="divtagdefaultwrapper" style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:; margin:0">
<div>--</div>
<div>Colin Rosenthal PhD</div>
<div>Senior IT Consultant</div>
<div>Royal Danish Library (Aarhus)</div>
</div>
</div>
<br>
<br>
<div style="color: rgb(0, 0, 0);">
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> NetarchiveSuite-users <netarchivesuite-users-bounces@ml.sbforge.org> on behalf of Soleto Ruiz de Clavijo, Miguel <miguel.soleto@externos.bne.es><br>
<b>Sent:</b> Thursday, April 7, 2022 2:12 PM<br>
<b>To:</b> 'netarchivesuite-users@ml.sbforge.org'; 'netarchivesuite-users-bounces@ml.sbforge.org'<br>
<b>Cc:</b> Monzón, Fernando; García Arratia, Juan Carlos<br>
<b>Subject:</b> [Netarchivesuite-users] About BitArchive</font>
<div> </div>
</div>
<div>
<div style="">
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
Hello,</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
Sorry, but I didn’t receive your answer (don’t know why?). Also, I didn’t realize that I wasn’t suscribed to the NetArchiveSuite users mailing list. I just have done now.</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
We are using HTTPRemoteFile. Each BitArchive have at least 3 or 4 huge disks (about 500 TB each one), with thousands of warcs inside. This is a part of the settings file:</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<i><bitarchive></i></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<i> <baseFileDir>/dir/WARC_Archive_dir</baseFileDir></i></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<i> <span style="color:red"><baseFileDir>/dir/WARC_Archive_dir_1</baseFileDir></span></i></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<i><span style="color:red"> <baseFileDir>/dir/WARC_Archive_dir_2</baseFileDir></span></i></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<i><span style="color:red"> <baseFileDir>/dir/WARC_Archive_dir_3</baseFileDir></span></i><i></i></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<i></bitarchive></i></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
This morning we restarted each BitArchive to add the lines in red. Also, restarted the BitarchiveMonitorApplication_ABM.</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
Could be related with the tempdir I mentioned before? I have been looking for that folder and it is getting bigger, maybe 100 files more per hour.</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
Thank you!</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
Miguel.</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
</p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
7 abr 2022 13:01, Colin Samuel Rosenthal <<a href="mailto:csr@kb.dk" style="color: rgb(5, 99, 193); text-decoration: underline;">csr@kb.dk</a>>:</p>
<p><span style="font-family:"Calibri",sans-serif; color:black">Hi Miguel,</span></p>
<p><span style="font-family:"Calibri",sans-serif; color:black"> </span></p>
<p style="margin-bottom:12.0pt"><span style="font-family:"Calibri",sans-serif; color:black">Do you use FtpRemoteFile to collect batch output? If so, it should delete the file once it has been copied to the ftp server. I'm not sure whether that's also true with
HTTPRemoteFile. </span></p>
<p><span style="font-family:"Calibri",sans-serif; color:black"> </span></p>
<p><span style="font-family:"Calibri",sans-serif; color:black">/Colin</span></p>
<p><span style="font-family:"Calibri",sans-serif; color:black"> </span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black">--</span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black">Colin Rosenthal PhD</span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black">Senior IT Consultant</span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black">Royal Danish Library (Aarhus)</span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black"> </span></p>
<div align="center" style="text-align: center; margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black">
<hr size="3" width="98%" align="center">
</span></div>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<b><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">From:</span></b><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black"> NetarchiveSuite-users <<a href="mailto:netarchivesuite-users-bounces@ml.sbforge.org" style="color: rgb(5, 99, 193); text-decoration: underline;">netarchivesuite-users-bounces@ml.sbforge.org</a>>
on behalf of Soleto Ruiz de Clavijo, Miguel <<a href="mailto:miguel.soleto@externos.bne.es" style="color: rgb(5, 99, 193); text-decoration: underline;">miguel.soleto@externos.bne.es</a>><br>
<b>Sent:</b> Thursday, April 7, 2022 12:41 PM<br>
<b>To:</b> 'netarchivesuite-users@ml.sbforge.org'; 'netarchivesuite-users-bounces@ml.sbforge.org'<br>
<b>Cc:</b> Monzón, Fernando; García Arratia, Juan Carlos<br>
<b>Subject:</b> [Netarchivesuite-users] About BitArchive</span><span style="font-family:"Calibri",sans-serif; color:black">
</span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-family:"Calibri",sans-serif; color:black"> </span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">Hi everyone!</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">I have some questions about BitArchiveApplication…</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">We still have the NAS version 5.4.2 on our Production environment (working to upgrade it to version 7.3 on PRE environment).</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black"> </span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">Yesterday, We changed the disks on all our BitArchiveApplications (6 in total) and mounted a new file system. Everything worked okay, but we realized that there is a folder with
more tan 500k files. That folder is named “tempdir”, and it is on the same level as the rest of the important folders (conf, lib, log, etc.).</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">Does anybody know the utility of that files? Every file size is 0, and the names are like this: “BatchOutput5816117269108762004”.</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black"> </span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">Thank you all. See you on next meeting!</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black"> </span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">Best regards,</span></p>
<p><span style="font-size:11.0pt; font-family:"Calibri",sans-serif; color:black">Miguel.</span></p>
<p style="margin: 0cm 0cm 0.0001pt; font-size: 12pt; font-family: "Times New Roman", serif;">
<span style="font-size:11.0pt; font-family:"Calibri",sans-serif"> </span></p>
</div>
<hr width="100%">
<font size="1">Este mensaje y cualquier fichero adjunto están dirigidos únicamente a sus destinatarios y contiene información confidencial. Si usted ha recibido este correo electrónico por error, le informamos que no puede realizar ninguna revisión, alteración,
impresión, copia, transmisión, difusión ni utilización alguna de este mensaje ni de cualquier fichero adjunto que pudiese contener. La realización de cualquiera de los actos indicados está expresamente prohibida por las Normas que regulan estas materias. Por
todo ello se solicita que, en caso de existir error en la recepción de este mensaje, se lo notifique al remitente respondiendo a este e-mail y elimine el mensaje y su contenido inmediatamente. La Biblioteca Nacional de España se reserva las acciones legales
que le correspondan en el caso de que se infrinja lo indicado anteriormente.</font>
<hr width="100%">
<font size="1">The information in this e-mail and any attachments is confidential and it is intended for the addressee only. If you have received this e-mail in error, you are notified that any revision, amendment, print, copy, disclosure, distribution or use
of the contents is unauthorized. Carrying out any of the above actions, is expressly banned by rules governing this matter. Hence we request that if you are not the intended recipient, please notify the sender answering this e-mail, and delete the message
and any attachments. The National Library of Spain reserves itself the right to take the appropriate legal actions in the event of the above mentioned matter is being infringed.</font>
<hr width="100%">
</div>
</div>
</div>
</body>
</html>