[Netarchivesuite-curator] Update from KB DK

Sabine Schostag sas at kb.dk
Thu Oct 4 14:44:07 CEST 2018

Dear all

Hereby an update from Netarchive acitivities

Our main focus is on the following:

Solving problems with the broad crawl: We started step 1 of our third broad crawl for 2018 on 25 August. We had lots of hanging jobs. We got developers help to solve a part of the problem, but still some jobs are hanging.

Testing Wayback access with different browsers: we found that different browsers replay more or less content from the Wayback archive. We tested with different browsers, IE, Chrome, Edge. The result is, that Edge is best to replay content (i.g. images)

Event harvest with BCWeb: we finished the event crawl for the official commemoration day for Danish soldiers, who had been deployed in war or conflict zones. Just some documentation is to be done. We did this crawl in collaboration with archivists from the National Archive. The fact that the hardcore coded schedules do not match the figures in the configurations on NAS domain pages caused some confusion.   This was our first collection collaboration using BCWeb (besides the pilot project)

We have implemented Umbra in our test environment and are looking forward to the results.

Special crawl for man hunt by Danish police: Almost every part of Denmark got closed by the police 28 September (no ferries, no flights abroad). We primarily crawled foreign articles on the event.

Kim Larsen, a Danish rock musician, known by almost every Dane died last Saturday, 29 September. We added a hashtag to our Twitter crawls – otherwise content on his death is captured by our selective news media crawls.

On behalf of the Netarchive Team

Best, Sabine

PS. I will be sitting in a train from Berlin to Aarhus next Tuesday – I’ll try to connect to our meeting, but normally net connections on a train are not very stable….

Sabine Schostag
Webkurator/web curator
Det Kgl. Bibliotek
Royal Danish Library

Victor Albecks Vej 1
DK-8000 Aarhus C

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://ml.sbforge.org/pipermail/netarchivesuite-curator/attachments/20181004/c8379eb8/attachment.html>

More information about the Netarchivesuite-curator mailing list