[Netarchivesuite-users] Nothing happens after starting generating dedupcrawllogindex
Jonas Lindberg Frellesen
jolf at kb.dk
Tue May 19 12:32:02 CEST 2009
Hi Andreas
You have not defined a file data transfer method.
The configuration of one such element can be seen here:
http://netarchive.dk/suite/Configuration_Manual_devel#head-680304c34a5d7527b9c6ee04a1a15189f7a7e168
This should be defined on the physicalLocation level of your configuration file.
e.g.
<thisPhysicalLocation name="ONB">
<deployMachineUserName>netarchive</deployMachineUserName>
<deployInstallDir>/home/netarchive/apps/netarchivesuite</deployInstallDir>
<deployDatabaseDir>/home/netarchive/data/netarchivesuite/db/fullhddb</deployDatabaseDir>
<settings>
<common>
<useReplicaId>ONB</useReplicaId>
<remoteFile>
<serverName>wc06</serverName>
<userName>myUser</userName>
<userPassword>myPassword</userPassword>
<port>8100</port>
</remoteFile>
</common>
</settings>
....
The 'Address already in use' error was given because the settings.common.remoteFile.port has a default value of 8100, which is the same port you use for one of your applications.
I have no experience with several harvester applications on a single machine, but I think they will conflict if they are communicating at the same time.
It should be possible to avoid this by given each harvester application a unique value for settings.common.remoteFile.port.
Best regards
Jonas
-----Oprindelig meddelelse-----
Fra: netarchivesuite-users-bounces at lists.gforge.statsbiblioteket.dk [mailto:netarchivesuite-users-bounces at lists.gforge.statsbiblioteket.dk] På vegne af aponb at gmx.at
Sendt: 19. maj 2009 11:27
Til: netarchivesuite-users at lists.gforge.statsbiblioteket.dk
Emne: [Netarchivesuite-users] Nothinghappens afterstarting generatingdedupcrawllogindex
>
> Hi Andreas.
> In your deploy configuration you haven't defined any BitarchiveApplications.
> This must be the problem: You have only defined a BitarchiveMonitorApplication for your archive replica, but not any BitarchiveApplications for that same replica.
>
> You haven't yet defined, where to store your harvests!
>
>
Oups! You're right. I forgot that. I put the baseFileDir for the bitarchive under the BitarchiveMonitorApplication.
Thanks for that help.
And this is the next question. I inserted the BitarchiveApplication. And now I have the following error in the HarvestControllerApplication-Logfile:
dk.netarkivet.common.exceptions.IllegalState: Reply message not ok.
Message is: 'dk.netarkivet.common.exceptions.ArgumentNotValid: Error creating singleton of class
'dk.netarkivet.common.distribute.HTTPRemoteFile':
dk.netarkivet.common.exceptions.ArgumentNotValid: Error creating singleton of class 'dk.netarkivet.common.distribute.HTTPRemoteFile':
at
dk.netarkivet.common.utils.SettingsFactory.getInstance(SettingsFactory.java:101)
at
dk.netarkivet.common.distribute.RemoteFileFactory.getInstance(RemoteFileFactory.java:51)
at
dk.netarkivet.common.distribute.RemoteFileFactory.getCopyfileInstance(RemoteFileFactory.java:67)
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestServer.doGenerateIndex(IndexRequestServer.java:170)
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestServer.access$000(IndexRequestServer.java:57)
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestServer$1.run(IndexRequestServer.java:136)
Caused by: dk.netarkivet.common.exceptions.IOFailure: Cannot start HTTPRemoteFile registry on port 8100
at
dk.netarkivet.common.distribute.HTTPRemoteFileRegistry.startServer(HTTPRemoteFileRegistry.java:119)
at
dk.netarkivet.common.distribute.HTTPRemoteFileRegistry.<init>(HTTPRemoteFileRegistry.java:101)
at
dk.netarkivet.common.distribute.HTTPRemoteFileRegistry.getInstance(HTTPRemoteFileRegistry.java:138)
at
dk.netarkivet.common.distribute.HTTPRemoteFile.getRegistry(HTTPRemoteFile.java:149)
at
dk.netarkivet.common.distribute.HTTPRemoteFile.<init>(HTTPRemoteFile.java:105)
at
dk.netarkivet.common.distribute.HTTPRemoteFile.getInstance(HTTPRemoteFile.java:140)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
dk.netarkivet.common.utils.SettingsFactory.getInstance(SettingsFactory.java:99)
... 5 more
Caused by: java.net.BindException: Address already in use
at java.net.PlainSocketImpl.socketBind(Native Method)
at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:359)
at java.net.ServerSocket.bind(ServerSocket.java:319)
at java.net.ServerSocket.<init>(ServerSocket.java:185)
at java.net.ServerSocket.<init>(ServerSocket.java:141)
at
org.mortbay.jetty.bio.SocketConnector.newServerSocket(SocketConnector.java:80)
at
org.mortbay.jetty.bio.SocketConnector.open(SocketConnector.java:73)
at
org.mortbay.jetty.AbstractConnector.doStart(AbstractConnector.java:250)
at
org.mortbay.jetty.bio.SocketConnector.doStart(SocketConnector.java:147)
at
org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
at org.mortbay.jetty.Server.doStart(Server.java:233)
at
org.mortbay.component.AbstractLifeCycle.start(AbstractLifeCycle.java:40)
at
dk.netarkivet.common.distribute.HTTPRemoteFileRegistry.startServer(HTTPRemoteFileRegistry.java:117)
... 15 more
' in index request for jobs
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestClient.checkMessageValid(IndexRequestClient.java:303)
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestClient.cacheData(IndexRequestClient.java:180)
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestClient.cacheData(IndexRequestClient.java:59)
at
dk.netarkivet.archive.indexserver.FileBasedCache.cache(FileBasedCache.java:164)
at
dk.netarkivet.archive.indexserver.FileBasedCache.getIndex(FileBasedCache.java:229)
at
dk.netarkivet.archive.indexserver.distribute.IndexRequestClient.getIndex(IndexRequestClient.java:59)
at
dk.netarkivet.harvester.harvesting.HarvestController.fetchDeduplicateIndex(HarvestController.java:401)
at
dk.netarkivet.harvester.harvesting.HarvestController.writeHarvestFiles(HarvestController.java:153)
at
dk.netarkivet.harvester.harvesting.distribute.HarvestControllerServer$HarvesterThread.run(HarvestControllerServer.java:642)
I have given every Application own Ports (my deployfile is enclosed) - so the address already in use message surprises me.
I am sure this is just a silly configuration mistake, but I can't find it. Would be nice if you have time for a quick view!
Thanks in advance
a.
More information about the NetarchiveSuite-users
mailing list