<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;
mso-fareast-language:EN-US;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#0563C1;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:#954F72;
text-decoration:underline;}
pre
{mso-style-priority:99;
mso-style-link:"Formateret HTML Tegn";
margin:0cm;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Courier New";}
p.msonormal0, li.msonormal0, div.msonormal0
{mso-style-name:msonormal;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman",serif;}
span.EmailStyle18
{mso-style-type:personal;
font-family:"Calibri",sans-serif;
color:windowtext;}
span.EmailStyle19
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:#1F497D;}
span.FormateretHTMLTegn
{mso-style-name:"Formateret HTML Tegn";
mso-style-priority:99;
mso-style-link:"Formateret HTML";
font-family:"Courier New";}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:70.85pt 70.85pt 70.85pt 70.85pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="DA" link="#0563C1" vlink="#954F72">
<div class="WordSection1">
<p class="MsoNormal"><span style="color:#1F497D">Hi Peter<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D">In our defaultorder.xml we have set it so:<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA"> <bean id="candidates" class="org.archive.crawler.postprocessor.CandidatesProcessor"><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA"> <!-- Allow redirected seeds to be accepted as seeds<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA"> In H1, this property belonged to the LinkScoper object, in H3, it is part of the CandidatesProcessor object<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA"> --><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA"> <property name="seedsRedirectNewSeeds" value="false" /><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA">
</span><span style="font-size:10.0pt;font-family:"Courier New";color:black;mso-fareast-language:DA"></bean><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D">seedsRedirectNewSeeds = false because many redirects on domains either pointed to foreign domains that were not Danish content at all or pointed to other .dk domains that we had already harvested
and thus we would get many extra harvests and use a lot of extra space. What you lose by not using "seedsRedirectNewSeeds" is where re-directes actually point to a non-dk domain that we would like have also.
<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D">That is why the webdanica project was invented to find that content in another way.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D">Best regards<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D">Tue<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:#1F497D"><o:p> </o:p></span></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span lang="EN-US" style="mso-fareast-language:DA">From:</span></b><span lang="EN-US" style="mso-fareast-language:DA"> NetarchiveSuite-users <netarchivesuite-users-bounces@ml.sbforge.org>
<b>On Behalf Of </b>Peter Svanberg<br>
<b>Sent:</b> Tuesday, November 2, 2021 4:21 PM<br>
<b>To:</b> 'netarchivesuite-users@ml.sbforge.org' <netarchivesuite-users@ml.sbforge.org><br>
<b>Subject:</b> [Netarchivesuite-users] seedsRedirectNewSeeds<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">seedsRedirectNewSeeds was the parameter I mentioned on the meeting.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">Any cons and pros on true/false on this? I can imagine that the redirection could give problems, but do they?<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">Has those of you who have chosen “false” some experience?<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">---------<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> /**<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> * If enabled, any URL found because a seed redirected to it (original seed<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> * returned 301 or 302), will also be treated as a seed, as long as the hop<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> * count is less than {@value #SEEDS_REDIRECT_NEW_SEEDS_MAX_HOPS}.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> */<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> protected static final int SEEDS_REDIRECT_NEW_SEEDS_MAX_HOPS = 5;<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><bean id="candidates" class="org.archive.crawler.postprocessor.CandidatesProcessor"><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"> <property name="seedsRedirectNewSeeds" value="true" /><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"></bean><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">---------<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<table class="MsoNormalTable" border="0" cellpadding="0">
<tbody>
<tr>
<td style="padding:0cm 0cm 0cm 0cm">
<p class="MsoNormal"><a href="https://www.kb.se/"><span style="font-size:9.0pt;font-family:"Arial",sans-serif;color:blue;mso-fareast-language:SV;text-decoration:none"><img border="0" width="113" height="170" style="width:1.177in;height:1.7708in" id="_x0000_i1025" src="https://signaturloggor.kb.se/png/Outlook%20logo%20m%d0%a4rkbl%d0%96.png" alt="KB Logo"></span></a><span style="font-size:9.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><o:p></o:p></span></p>
</td>
<td style="padding:0cm 0cm 0cm 5.25pt">
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<b><span style="font-size:9.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">Peter Svanberg</span></b><span style="font-size:9.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><o:p></o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<b><span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">Teknisk handläggare</span></b><span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><o:p></o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">Insamling och metadata<br>
Insamling 1<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:9.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><o:p> </o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<b><span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">Kungliga biblioteket</span></b><span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><o:p></o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">Box 5039, 102 41 Stockholm<o:p></o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">Besöksadress: Karlavägen 96, Stockholm<o:p></o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV">+46 10 709 32 78<o:p></o:p></span></p>
<p class="MsoNormal" style="mso-margin-top-alt:2.0pt;margin-right:0cm;margin-bottom:1.0pt;margin-left:0cm">
<span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><a href="mailto:Peter.Svanberg@kb.se">Peter.Svanberg@kb.se</a><o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:8.0pt;font-family:"Arial",sans-serif;color:black;mso-fareast-language:SV"><a href="https://www.kb.se/"><span style="color:blue">www.kb.se</span></a><o:p></o:p></span></p>
</td>
</tr>
</tbody>
</table>
<p class="MsoNormal"><span lang="SV"><o:p> </o:p></span></p>
</div>
</body>
</html>