Thank you very much,

Excuse my ignorante, i'm not familiar on how to use Jira nor how to apply
patches... if someone could enlighten me, that would be great..

thnx

-----Original Message-----
From: Talat UYARER [mailto:[email protected]] 
Sent: Saturday, November 02, 2013 6:47 PM
To: [email protected]; Ralf R. Kotowski
Subject: RE: How to Crawl Specific sites

Hi Raph,
You can find NUTCH-1661 in jira. i uploaded today :)

Talat

Sent with AquaMail for Android
http://www.aqua-mail.com


On 2 Kasım 2013 19:10:04 "Ralf R. Kotowski" <[email protected]> wrote:
> Would you be willing to share this code?
>
> Thnx
>
> -----Original Message-----
> From: Talat UYARER [mailto:[email protected]] Sent: Tuesday, October

> 15, 2013 5:15 PM
> To: [email protected]
> Subject: Re: How to Crawl Specific sites
>
> Hi,
> In addition to Markus answer If you dont want to fetch again non Indıan 
> website, You can do it by writing some custom code. Actually We wrote code

> because of same needs. Normally if your websites mixed, like .com or .in, 
> you dont understand website language from the url. We solve this by
writing 
> custom FetchSchedular code. We check their languages in its shouldfetch 
> method. If website language is not allowed. We dont generate again.  If
you 
> want to wait I will share our code.
>
> Talat
>
> 15-10-2013 13:36 tarihinde, Markus Jelsma yazdı:
> > Hi - either by using a language detector that only allows some or all
> common languages spoken in India or by using a domain URL filter to
restrict
> to the .in domain.
> >  -----Original message-----
> >> From:Jayadeep Reddy <[email protected]>
> >> Sent: Tuesday 15th October 2013 12:10
> >> To: [email protected]
> >> Subject: How to Crawl Specific sites
> >>
> >> How can I index data of only Indian websites
> >>
> >> -- Jayadeep Reddy.S,
> >> M.D & C.E.O
> >> e Health Access Pvt.Ltd
> >> www.ehealthaccess.com
> >> Hyderabad-Chennai-Banglore
> >> http://www.youtube.com/watch?v=0k5LX8mw6Sk
> >>
>
>



Reply via email to