[Netarchivesuite-users] Problems with a seed

Soleto Ruiz de Clavijo, Miguel miguel.soleto at externos.bne.es
Fri Nov 7 11:43:19 CET 2025


Dear all,


I'm having trouble downloading a site with NAS. Specifically, it's this seed: https://www.lineaverdesierraguadarrama.com/
When I start the job, the seed returns a 404, although the same URL works fine in a browser.

I ran the following tests:

curl -LsI https://www.lineaverdesierraguadarrama.com/
HTTP/2 404
content-length: 1245
content-type: text/html
server: Microsoft-IIS/10.0
x-powered-by: ASP.NET
x-powered-by-plesk: PleskWin
date: Fri, 07 Nov 2025 08:43:09 GMT

wget https://www.lineaverdesierraguadarrama.com/
--2025-11-07 09:43:16--  https://www.lineaverdesierraguadarrama.com/
Resolving www.lineaverdesierraguadarrama.com (www.lineaverdesierraguadarrama.com)... 195.55.124.177
Connecting to www.lineaverdesierraguadarrama.com (www.lineaverdesierraguadarrama.com)[195.55.124.177]:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 16857 (16K) [text/html]
Saving to: "index.html.1"

curl -I sends a HEAD request and gets a 404, while wget sends a GET and gets a 200, so it seems the server responds with a 404 when it receives a HEAD request.
Is there any way to configure the Heritrix template to make it use GET directly?
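
To check the HEAD-versus-GET hypothesis in isolation, the two methods can be sent with identical headers so that the method is the only variable (curl and wget also send different User-Agent strings). The following is a minimal Python sketch of that comparison, not anything taken from NAS or Heritrix. The names probe, compare, HeadRejectingHandler and demo are hypothetical, the default User-Agent value is an arbitrary assumption, and the local stub server only imitates the suspected behaviour so the script can be run without touching the real site.

```python
import http.client
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlsplit


def probe(url, method, user_agent="probe/0.1"):
    """Send a single request with the given method and return the status code."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=30)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=30)
    path = parts.path or "/"
    if parts.query:
        path += "?" + parts.query
    try:
        # Same headers for every method, so only the method differs.
        conn.request(method, path, headers={"User-Agent": user_agent})
        return conn.getresponse().status
    finally:
        conn.close()


def compare(url, user_agent="probe/0.1"):
    """Return the status codes obtained for HEAD and for GET on the same URL."""
    return {method: probe(url, method, user_agent) for method in ("HEAD", "GET")}


class HeadRejectingHandler(BaseHTTPRequestHandler):
    """Hypothetical stand-in for the site: 404 on HEAD, 200 on GET."""

    def do_HEAD(self):
        self.send_response(404)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def do_GET(self):
        body = b"<html>ok</html>"
        self.send_response(200)
        self.send_header("Content-Type", "text/html")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):
        pass  # keep the demo output quiet


def demo():
    """Run compare() against the local stub on a free port."""
    server = HTTPServer(("127.0.0.1", 0), HeadRejectingHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    try:
        port = server.server_address[1]
        return compare(f"http://127.0.0.1:{port}/")
    finally:
        server.shutdown()
        server.server_close()


if __name__ == "__main__":
    print(demo())  # {'HEAD': 404, 'GET': 200}
```

Calling compare() on the real seed URL, and repeating it with the User-Agent string configured in the harvest template, would indicate whether the method or the User-Agent accounts for the 404.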

Thanks in advance.

Best regards.
