We use a substring the JSP pages to chop off after 150 characters. Then 
it shows something like this with the ellipse.

http://www.somelongurl.com/?w=with;a;big;long;query;string...

Dennis Kubes

rubdabadub wrote:
> Hi:
> 
> You have two option
> 
> 1. Don't crawl/index URL's having more then X char. You can edit this
> value in nutch-site.xml.
> 2. Don't display URL in the JSP pages - modify it the jsp pages.. i
> think you can just comment it out.. i.e. displaying url.
> 
> Regards
> raj
> 
> On 4/14/07, Paul Liddelow <[EMAIL PROTECTED]> wrote:
>> Hi
>> In my results there are a few that have really long URL's that go
>> right off the page. Here is an example:
>>
>>
>>
>> Search Results
>> ... of 2006) 3. Interpretation Anti-Discrimination Act 1998 (No. 46 of 
>> ...
>> http://www.thelaw.tas.gov.au/results/index.w3p;actT=;amActT=;amsrT=;docno=;docyear=;domain=;eIndex=10;lastSearch=;pointInTime=;rta=;rti=44%2B%2B2003%2BAT%40EN%2B20070407000000;sIndex=1;sc1=;sessional=;sortBy=;srT=;ss=;sub=;title=Relationships%20Act%202003;tx1=;type=;wh1=
>>  
>>
>>
>>
>> Does anybody know why this might occur and how to fix it?
>>
>> Cheers
>> Paul
>>

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to