[ 
https://issues.apache.org/jira/browse/NUTCH-1747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13961211#comment-13961211
 ] 

Sebastian Nagel commented on NUTCH-1747:
----------------------------------------

+1
Looks like inProgress was intended to hold more than the bare count of 
FetchItems in progress. In doubt, we can get the in-progress FetchItems and 
their associated queue from FetcherThreads (cf. NUTCH-1182).


> Use AtomicInteger as semaphore in Fetcher
> -----------------------------------------
>
>                 Key: NUTCH-1747
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1747
>             Project: Nutch
>          Issue Type: Improvement
>          Components: fetcher
>    Affects Versions: 1.8
>            Reporter: Julien Nioche
>            Priority: Minor
>         Attachments: NUTCH-1747-trunk.patch
>
>
> In Fetcher we currently use 
> Set<FetchItem>  inProgress = Collections.synchronizedSet(new 
> HashSet<FetchItem>());
> as semaphore within the FetchItemQueues to keep track of the URLs being 
> fetched and prevent threads from pulling from them. It works fine but we 
> could use AtomicIntegers instead as all we need is the counts, not the 
> contents.
> This change would have little impact on the performance but would make the 
> code a bit cleaner.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to