We use a substring the JSP pages to chop off after 150 characters. Then it shows something like this with the ellipse.
http://www.somelongurl.com/?w=with;a;big;long;query;string... Dennis Kubes rubdabadub wrote: > Hi: > > You have two option > > 1. Don't crawl/index URL's having more then X char. You can edit this > value in nutch-site.xml. > 2. Don't display URL in the JSP pages - modify it the jsp pages.. i > think you can just comment it out.. i.e. displaying url. > > Regards > raj > > On 4/14/07, Paul Liddelow <[EMAIL PROTECTED]> wrote: >> Hi >> In my results there are a few that have really long URL's that go >> right off the page. Here is an example: >> >> >> >> Search Results >> ... of 2006) 3. Interpretation Anti-Discrimination Act 1998 (No. 46 of >> ... >> http://www.thelaw.tas.gov.au/results/index.w3p;actT=;amActT=;amsrT=;docno=;docyear=;domain=;eIndex=10;lastSearch=;pointInTime=;rta=;rti=44%2B%2B2003%2BAT%40EN%2B20070407000000;sIndex=1;sc1=;sessional=;sortBy=;srT=;ss=;sub=;title=Relationships%20Act%202003;tx1=;type=;wh1= >> >> >> >> >> Does anybody know why this might occur and how to fix it? >> >> Cheers >> Paul >> ------------------------------------------------------------------------- This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just data. Click to get it now. http://sourceforge.net/powerbar/db2/ _______________________________________________ Nutch-general mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/nutch-general
