Hi Raj

I had a look but couldn't really find it in nutch-site.xml ... I have
modified the jsp page and it worked.

Thanks

On 4/14/07, rubdabadub <[EMAIL PROTECTED]> wrote:
> Hi:
>
> You have two option
>
> 1. Don't crawl/index URL's having more then X char. You can edit this
> value in nutch-site.xml.
> 2. Don't display URL in the JSP pages - modify it the jsp pages.. i
> think you can just comment it out.. i.e. displaying url.
>
> Regards
> raj
>
> On 4/14/07, Paul Liddelow <[EMAIL PROTECTED]> wrote:
> > Hi
> > In my results there are a few that have really long URL's that go
> > right off the page. Here is an example:
> >
> >
> >
> > Search Results
> > ... of 2006) 3. Interpretation Anti-Discrimination Act 1998 (No. 46 of ...
> > http://www.thelaw.tas.gov.au/results/index.w3p;actT=;amActT=;amsrT=;docno=;docyear=;domain=;eIndex=10;lastSearch=;pointInTime=;rta=;rti=44%2B%2B2003%2BAT%40EN%2B20070407000000;sIndex=1;sc1=;sessional=;sortBy=;srT=;ss=;sub=;title=Relationships%20Act%202003;tx1=;type=;wh1=
> >
> >
> > Does anybody know why this might occur and how to fix it?
> >
> > Cheers
> > Paul
> >
>

-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to