<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr">
<div id="divtagdefaultwrapper" style="font-size:12pt;color:#000000;font-family:Calibri,Helvetica,sans-serif;" dir="ltr">
<p>In a test with 100000 domains (500 kByte) in our test environment today, the harvesting and job management were okay. But again there were problems with the GUI not being updated: the "all jobs" page gave correct information, but the "all running jobs" page did not. It seems they
are fetching data from different tables in the database.</p>
<p><br>
</p>
<p>And again the database processes were very busy and the queues were not emptied in time. So we now suspect that the database handling is the bottleneck in our setup. For example, the post-processing of the harvest report for 10000 domains took around
35 minutes. Is that a database-heavy task?</p>
<p><br>
</p>
<p>We are using PostgreSQL. Are there others using it?</p>
<p><br>
</p>
<p>And about the frontier reporting, which is done every <span>frontierReportWaitTime seconds (default 600; we have set 120, but it is done even more often): what do we lose if we raise that value? And what can we gain from it? Is the amount of data involved in the
post-processing affected by how often we have done frontier reports?</span></p>
<p><span><br>
</span></p>
<p><span>Any hints on how to </span><span style="font-size: 12pt;">unburden our database are appreciated!</span><span></p>
<div><br>
</div>
<div>Regards,</div>
<div><br>
</div>
<div>Peter Svanberg, Sweden</div>
<div><br>
</div>
</span>
<p></p>
</div>
</body>
</html>