[Netarchivesuite-users] Upload errors
Bjarne Andersen
netarkivet at statsbiblioteket.dk
Mon Jun 2 10:42:40 CEST 2008
If your jobs have been reported finished (or in this case failed because of an upload error) you can use a upload commandline.
I usually use the following (in a bash script):
export INSTALDIR=/path_to_your_installation.... (put the right one in yourself)
export CLASSPATH=$INSTALDIR/lib/dk.netarkivet.archive.jar
find $INSTALDIR/oldjobs/*/arcs/*.arc | xargs java dk.netarkivet.archive.tools.Upload
find $INSTALDIR/oldjobs/*/metadata/*.arc | xargs java dk.netarkivet.archive.tools.Upload
The error looks a bit strange - One of the developers have to answer that one.
best
Bjarne
aponb at gmx.at wrote:
> I checked the jobs which run over the weekend and realized that some of
> them failed due to an upload error.
> I've got following exception in my logs:
>
> Error uploading arcfile '/home/onbpre/netarchive/scripts/simple_harvest/server2/49_1212195698720/arcs/49-1-20080531020618-00008-webcrawler01.arc.gz' Will be moved to '/home/onbpre/netarchive/scripts/simple_harvest/oldjobs2'
> dk.netarkivet.common.exceptions.IOFailure: Could not store 'server2/49_1212195698720/arcs/49-1-20080531020618-00008-webcrawler01.arc.gz' after 3 attempts. Giving up.
> Client-side exception occurred while storing 'server2/49_1212195698720/arcs/49-1-20080531020618-00008-webcrawler01.arc.gz' on attempt number 1 of 3.
> Error creating singleton
> dk.netarkivet.common.utils.SettingsFactory.getInstance(SettingsFactory.java:102)
> dk.netarkivet.common.distribute.RemoteFileFactory.getInstance(RemoteFileFactory.java:51)
> dk.netarkivet.common.distribute.RemoteFileFactory.getDistributefileInstance(RemoteFileFactory.java:74)
> dk.netarkivet.archive.arcrepository.distribute.StoreMessage.<init>(StoreMessage.java:55)
> dk.netarkivet.archive.arcrepository.distribute.JMSArcRepositoryClient.store(JMSArcRepositoryClient.java:230)
> dk.netarkivet.harvester.harvesting.HarvestController.uploadFiles(HarvestController.java:299)
> dk.netarkivet.harvester.harvesting.HarvestController.storeFiles(HarvestController.java:244)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer.processHarvestInfoFile(HarvestControllerServer.java:550)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer.access$300(HarvestControllerServer.java:83)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer$HarvesterThread.run(HarvestControllerServer.java:647)
> Client-side exception occurred while storing 'server2/49_1212195698720/arcs/49-1-20080531020618-00008-webcrawler01.arc.gz' on attempt number 2 of 3.
> Error creating singleton
> dk.netarkivet.common.utils.SettingsFactory.getInstance(SettingsFactory.java:102)
> dk.netarkivet.common.distribute.RemoteFileFactory.getInstance(RemoteFileFactory.java:51)
> dk.netarkivet.common.distribute.RemoteFileFactory.getDistributefileInstance(RemoteFileFactory.java:74)
> dk.netarkivet.archive.arcrepository.distribute.StoreMessage.<init>(StoreMessage.java:55)
> dk.netarkivet.archive.arcrepository.distribute.JMSArcRepositoryClient.store(JMSArcRepositoryClient.java:230)
> dk.netarkivet.harvester.harvesting.HarvestController.uploadFiles(HarvestController.java:299)
> dk.netarkivet.harvester.harvesting.HarvestController.storeFiles(HarvestController.java:244)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer.processHarvestInfoFile(HarvestControllerServer.java:550)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer.access$300(HarvestControllerServer.java:83)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer$HarvesterThread.run(HarvestControllerServer.java:647)
> Client-side exception occurred while storing 'server2/49_1212195698720/arcs/49-1-20080531020618-00008-webcrawler01.arc.gz' on attempt number 3 of 3.
> Error creating singleton
> dk.netarkivet.common.utils.SettingsFactory.getInstance(SettingsFactory.java:102)
> dk.netarkivet.common.distribute.RemoteFileFactory.getInstance(RemoteFileFactory.java:51)
> dk.netarkivet.common.distribute.RemoteFileFactory.getDistributefileInstance(RemoteFileFactory.java:74)
> dk.netarkivet.archive.arcrepository.distribute.StoreMessage.<init>(StoreMessage.java:55)
> dk.netarkivet.archive.arcrepository.distribute.JMSArcRepositoryClient.store(JMSArcRepositoryClient.java:230)
> dk.netarkivet.harvester.harvesting.HarvestController.uploadFiles(HarvestController.java:299)
> dk.netarkivet.harvester.harvesting.HarvestController.storeFiles(HarvestController.java:244)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer.processHarvestInfoFile(HarvestControllerServer.java:550)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer.access$300(HarvestControllerServer.java:83)
> dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer$HarvesterThread.run(HarvestControllerServer.java:647)
>
>
>
> My questions are now. What went wrong and what can i do that it will not
> happen again. And how can I transfer the files from the oldjobs
> directory in a offical way? Or should I just copy each gz file from the
> oldjobs2/jobid/arc directory to my bitarchive?
>
> Thanks in advance for giving me a hint!
>
> _______________________________________________
> NetarchiveSuite-users mailing list
> NetarchiveSuite-users at lists.gforge.statsbiblioteket.dk
> https://lists.gforge.statsbiblioteket.dk/mailman/listinfo/netarchivesuite-users
--
Bjarne Andersen
Driftsleder - netarkivet.dk
Statsbiblioteket
Universitetsparken
8000 Århus C
Tlf. 89462165 - Mobil 25662353
CVR/SE 10100682 - EAN 5798000791084
http://netarkivet.dk
-------------- next part --------------
A non-text attachment was scrubbed...
Name: netarkivet.vcf
Type: text/x-vcard
Size: 312 bytes
Desc: not available
URL: <http://ml.sbforge.org/pipermail/netarchivesuite-users/attachments/20080602/3de10afb/attachment-0002.vcf>
More information about the NetarchiveSuite-users
mailing list