[Netarchivesuite-users] Problems With The IndexServerAplication + WaybackIndexer DB
Charles Tassell
ctassell at gmail.com
Wed Dec 10 15:54:41 CET 2014
Thanks, I fixed that up and ran a job, but after the crawl finished and
the .warc was created, the next step seemed to die. The
BitarchiveMonitorApplication0.log file says that it can't find the .warc
file, although I have confirmed that it's there. The log says:
10-Dec-2014 11:14:24 AM
dk.netarkivet.archive.bitarchive.distribute.BitarchiveMonitorServer
replyToGetChecksumMessage
INFO: Replying GetChecksumMessage:
'ID:1795-137.149.200.20(a0:54:f2:7b:7:3c)-38167-1418224464362:
To ROBLIB_COMMON_THE_REPOS ReplyTo
ROBLIB_COMMON_THIS_REPOS_CLIENT_137_149_200_20_GUIWS
Error: dk.netarkivet.common.exceptions.IOFailure: The batchjob did not
find the file '1-1-20141210141254-00000-webarchive.upei.ca.warc' within
the archive.
dk.netarkivet.common.exceptions.IOFailure: The batchjob did not find the
file '1-1-20141210141254-00000-webarchive.upei.ca.warc' within the archive.
    at dk.netarkivet.archive.bitarchive.distribute.BitarchiveMonitorServer.replyToGetChecksumMessage(BitarchiveMonitorServer.java:733)
    at dk.netarkivet.archive.bitarchive.distribute.BitarchiveMonitorServer.replyConvertedBatch(BitarchiveMonitorServer.java:641)
    at dk.netarkivet.archive.bitarchive.distribute.BitarchiveMonitorServer.access$200(BitarchiveMonitorServer.java:81)
    at dk.netarkivet.archive.bitarchive.distribute.BitarchiveMonitorServer$2.run(BitarchiveMonitorServer.java:535)
Arcfiles: 1-1-20141210141254-00000-webarchive.upei.ca.warc, ReplicaId:
A, Checksum: null'.
But the file does exist:
./harvester_high/1_1418220769306/warcs/1-1-20141210141254-00000-webarchive.upei.ca.warc
Is this another broken path in the deployment file? Is there a
better deployment file that I can use which installs the full suite
(harvester, indexer and viewer) that is known to work?
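
A minimal sketch for double-checking where copies of the file actually sit on disk, so the result can be compared against the directories the bitarchive application is configured to use. `find_copies` is a hypothetical helper, not part of NetarchiveSuite, and the starting directory "." is an assumption (run it from the installation directory). If the only hit is under the harvester's directory, the file may simply not have reached the archive yet; that is a guess, not something the log confirms.

```python
import os


def find_copies(root, filename):
    """Return the sorted paths of every file named `filename` below `root`."""
    matches = []
    for dirpath, _dirnames, filenames in os.walk(root):
        if filename in filenames:
            matches.append(os.path.join(dirpath, filename))
    return sorted(matches)


if __name__ == "__main__":
    # File name copied from the log message above; "." is an assumed
    # starting point (the installation directory).
    name = "1-1-20141210141254-00000-webarchive.upei.ca.warc"
    for path in find_copies(".", name):
        print(path)
```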
On 14-12-10 10:07 AM, Mikis Seth Sørensen wrote:
> Hi Charles
>
> The application classes are defined in the deployment XML file. I can
> see that in 'deploy_standalone_example_with_wayback_apps.xml' the
> IndexServerApplication namespace is wrong: it is missing the harvester
> part, as you have noted (deploy_standalone_example.xml has the correct
> setting).
>
> Try changing the line
> <applicationName name="dk.netarkivet.archive.indexserver.IndexServerApplication">
> to
> <applicationName name="dk.netarkivet.harvester.indexserver.IndexServerApplication">
> in your deploy XML and run the script generation and deployment again.
>
> Best
> Mikis
>
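
The one-line change described above could also be applied with a small script. This is only a sketch: `fix_index_server_class` is a hypothetical helper, and the path of the deploy XML is assumed to be passed as the first command-line argument. It does a plain text replacement of the class name and leaves everything else in the file untouched; the script generation and deployment still have to be re-run afterwards.

```python
import sys

# Class names taken from the messages in this thread.
OLD = "dk.netarkivet.archive.indexserver.IndexServerApplication"
NEW = "dk.netarkivet.harvester.indexserver.IndexServerApplication"


def fix_index_server_class(xml_text):
    """Return xml_text with the wrong IndexServerApplication class name replaced."""
    return xml_text.replace(OLD, NEW)


if __name__ == "__main__":
    # Assumed usage: python <this script> <path to the deploy XML>
    path = sys.argv[1]
    with open(path, encoding="utf-8") as f:
        original = f.read()
    fixed = fix_index_server_class(original)
    if fixed != original:
        with open(path, "w", encoding="utf-8") as f:
            f.write(fixed)
        print("Updated", path)
    else:
        print("Nothing to change in", path)
```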
> From: Charles Tassell <charles at islandadmin.ca>
> Reply-To: netarchivesuite-users at ml.sbforge.org
> Date: Wednesday, December 10, 2014 at 2:23 PM
> To: netarchivesuite-users at ml.sbforge.org
> Subject: Re: [Netarchivesuite-users] Problems With The
> IndexServerAplication + WaybackIndexer DB
>
> Sorry, did some grepping and found the comments in the deployment
> file for how to create the Wayback database, so that is sorted
> out. I'm still wondering about the IndexServerApplication path
> though.
>
> On 14-12-10 09:06 AM, Charles Tassell wrote:
>> Hi Guys,
>>
>> I'm still having some issues with getting a fresh 4.4.1 install
>> going. There seem to be two issues left after fixing the queue
>> names in the deployment file.
>>
>> First, when I try to start the IndexServerApplication I get the
>> following error message:
>>
>> Exception in thread "main" java.lang.NoClassDefFoundError:
>> dk/netarkivet/archive/indexserver/IndexServerApplication
>> Caused by: java.lang.ClassNotFoundException:
>> dk.netarkivet.archive.indexserver.IndexServerApplication
>> at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>> at java.security.AccessController.doPrivileged(Native Method)
>> at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>> at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>> at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>> at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
>> Could not find the main class:
>> dk.netarkivet.archive.indexserver.IndexServerApplication. Program
>> will exit.
>>
>> I did some digging, and it looks like the actual class name is
>> dk.netarkivet.*harvester*.indexserver.IndexServerApplication. Is
>> that correct, or are the harvester and archive
>> IndexServerApplications different classes?
>>
>> Secondly, the WaybackIndexer does not seem to be able to
>> connect to the database at port 8124. It looks like the
>> installer script doesn't create the derby instance for the
>> WaybackIndexer. Are there any docs on how to do that manually?
>>