[Netarchivesuite-users] Problems with a seed
Soleto Ruiz de Clavijo, Miguel
miguel.soleto at externos.bne.es
Fri Nov 7 11:43:19 CET 2025
Dear all,
I'm having trouble downloading a site with NAS. Specifically, it's this seed: https://www.lineaverdesierraguadarrama.com/
When I start the job, it returns a 404, but that URL works fine in a browser.
I ran the following tests:
curl -LsI https://www.lineaverdesierraguadarrama.com/
HTTP/2 404
content-length: 1245
content-type: text/html
server: Microsoft-IIS/10.0
x-powered-by: ASP.NET
x-powered-by-plesk: PleskWin
date: Fri, 07 Nov 2025 08:43:09 GMT
wget https://www.lineaverdesierraguadarrama.com/
--2025-11-07 09:43:16-- https://www.lineaverdesierraguadarrama.com/
Resolving www.lineaverdesierraguadarrama.com<http://www.lineaverdesierraguadarrama.com> (www.lineaverdesierraguadarrama.com<http://www.lineaverdesierraguadarrama.com>)... 195.55.124.177
Connecting to www.lineaverdesierraguadarrama.com<http://www.lineaverdesierraguadarrama.com> (www.lineaverdesierraguadarrama.com)[195.55.124.177]:443<http://www.lineaverdesierraguadarrama.com)[195.55.124.177]:443>... connected.
HTTP request sent, awaiting response... 200 OK
Length: 16857 (16K) [text/html]
Saving to: "index.html.1"
It seems the server responds with a 404 when it receives a HEAD request.
Is there any way to configure the Heritrix template to make it use GET directly?
Thanks in advance.
Best regards.
________________________________
Este mensaje y cualquier fichero adjunto est?n dirigidos ?nicamente a sus destinatarios y contiene informaci?n confidencial. Si usted ha recibido este correo electr?nico por error, le informamos que no puede realizar ninguna revisi?n, alteraci?n, impresi?n, copia, transmisi?n, difusi?n ni utilizaci?n alguna de este mensaje ni de cualquier fichero adjunto que pudiese contener. La realizaci?n de cualquiera de los actos indicados est? expresamente prohibida por las Normas que regulan estas materias. Por todo ello se solicita que, en caso de existir error en la recepci?n de este mensaje, se lo notifique al remitente respondiendo a este e-mail y elimine el mensaje y su contenido inmediatamente. La Biblioteca Nacional de Espa?a se reserva las acciones legales que le correspondan en el caso de que se infrinja lo indicado anteriormente.
________________________________
The information in this e-mail and any attachments is confidential and it is intended for the addressee only. If you have received this e-mail in error, you are notified that any revision, amendment, print, copy, disclosure, distribution or use of the contents is unauthorized. Carrying out any of the above actions, is expressly banned by rules governing this matter. Hence we request that if you are not the intended recipient, please notify the sender answering this e-mail, and delete the message and any attachments. The National Library of Spain reserves itself the right to take the appropriate legal actions in the event of the above mentioned matter is being infringed.
________________________________
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://ml.sbforge.org/pipermail/netarchivesuite-users/attachments/20251107/76af9f0b/attachment.html>
More information about the NetarchiveSuite-users
mailing list