[Netarchivesuite-curator] Update from Netarchive
Sabine Schostag
sas at kb.dk
Tue Nov 6 11:46:43 CET 2018
Dear all,
Hereby a brief update from KB DK:
We launched step 2 of our 3rd broad crawl this year (with a limit of 14GB per domain) on 2018/10/23
We looked at all our open issues and grouped them thematically:
· Harvesting problems
· Replay problems
· Improving existing functionalities
· New functionalities
· Automatization of operations, which are solved manually at the moment
· Will be solved by existing projects
The aim was to find the most urgent problems, which we cannot solve without developers help
We are working on the implementation of SOLR wayback to search in Netarchive. By now SOLR Wayback still is a protoype. Amongst others we need to clarify UX, security questions, how to do the logging and to chose a platform for the user access.
But the display of the results is much better than Blacklight
[cid:image001.jpg at 01D475C6.62AE6E00]
[cid:image002.jpg at 01D475C6.62AE6E00]
We are working on a procedure for a new type of usage from the archive: data extraction for research project from the archive. The data to be extracted are determined by a search string – hopefully this would be rather easy with SOLR-wayback
We are going to prepare a mini-event harvest “Week 46”. In week 46 the Royal Library collects local broadcast stations’ (both radio and television) productions. For a couple of years ago Netarchive started collecting there home pages.
On behalf of the Netarchive Team
Best,
Sabine
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://ml.sbforge.org/pipermail/netarchivesuite-curator/attachments/20181106/72eeb2ef/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.jpg
Type: image/jpeg
Size: 34330 bytes
Desc: image001.jpg
URL: <https://ml.sbforge.org/pipermail/netarchivesuite-curator/attachments/20181106/72eeb2ef/attachment-0002.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image002.jpg
Type: image/jpeg
Size: 32766 bytes
Desc: image002.jpg
URL: <https://ml.sbforge.org/pipermail/netarchivesuite-curator/attachments/20181106/72eeb2ef/attachment-0003.jpg>
More information about the Netarchivesuite-curator
mailing list