Hi Raj,

I had a look but couldn't find the setting in nutch-site.xml, so I modified the JSP page instead and that worked.
Thanks

On 4/14/07, rubdabadub <[EMAIL PROTECTED]> wrote:
> Hi:
>
> You have two options:
>
> 1. Don't crawl/index URLs having more than X chars. You can edit this
>    value in nutch-site.xml.
> 2. Don't display the URL in the JSP pages - modify the JSP pages. I
>    think you can just comment it out, i.e. the part displaying the URL.
>
> Regards
> raj
>
> On 4/14/07, Paul Liddelow <[EMAIL PROTECTED]> wrote:
> > Hi
> > In my results there are a few that have really long URLs that go
> > right off the page. Here is an example:
> >
> > Search Results
> > ... of 2006) 3. Interpretation Anti-Discrimination Act 1998 (No. 46 of ...
> > http://www.thelaw.tas.gov.au/results/index.w3p;actT=;amActT=;amsrT=;docno=;docyear=;domain=;eIndex=10;lastSearch=;pointInTime=;rta=;rti=44%2B%2B2003%2BAT%40EN%2B20070407000000;sIndex=1;sc1=;sessional=;sortBy=;srT=;ss=;sub=;title=Relationships%20Act%202003;tx1=;type=;wh1=
> >
> > Does anybody know why this might occur and how to fix it?
> >
> > Cheers
> > Paul

_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general
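
For anyone reading this thread later: instead of commenting out the URL in the JSP page entirely, one alternative is to shorten only the displayed text while the link itself keeps the full URL. The following is a minimal sketch of that idea, not the change that was actually made in this thread. The class `UrlDisplay`, the method `shorten`, and the 60-character limit are all hypothetical names and values chosen for illustration; none of them are part of Nutch.

```java
// Hypothetical helper (not part of Nutch): shortens a URL for display only.
final class UrlDisplay {
    private static final String ELLIPSIS = "...";

    // Returns url unchanged if it fits within maxLen characters; otherwise
    // returns a prefix followed by "..." so the result is exactly maxLen long.
    static String shorten(String url, int maxLen) {
        if (url == null) {
            return "";
        }
        if (url.length() <= maxLen) {
            return url;
        }
        if (maxLen <= ELLIPSIS.length()) {
            // Too little room for an ellipsis: fall back to a plain cut.
            return url.substring(0, Math.max(maxLen, 0));
        }
        return url.substring(0, maxLen - ELLIPSIS.length()) + ELLIPSIS;
    }

    public static void main(String[] args) {
        // Shortened copy of the example URL from the thread.
        String longUrl = "http://www.thelaw.tas.gov.au/results/index.w3p;"
                + "actT=;amActT=;amsrT=;docno=;docyear=;domain=;eIndex=10";
        // Display limit of 60 characters is an arbitrary assumption.
        System.out.println(shorten(longUrl, 60));
    }
}
```

In a JSP page, such a helper would be applied only to the visible text of the result's URL, with the `href` left pointing at the original address. The shortened string should still be HTML-escaped before it is written to the page.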
