[GitHub] nutch pull request #155: Fix for NUTCH-2327: Seeds injected in REST must be ...

2016-10-20 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/155 Fix for NUTCH-2327: Seeds injected in REST must be ingested into HDFS You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH

[GitHub] nutch pull request #152: Fix for NUTCH-2317: Plugin jars don't get added to ...

2016-09-24 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/152 Fix for NUTCH-2317: Plugin jars don't get added to classpath You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2317

[GitHub] nutch pull request #138: Fix for NUTCH-2132: Publisher/Subscriber model for ...

2016-08-25 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/138#discussion_r76332711 --- Diff: src/java/org/apache/nutch/fetcher/FetcherThreadPublisher.java --- @@ -0,0 +1,58 @@ +/* + * Licensed to the Apache Software Foundation (ASF

[GitHub] nutch pull request #138: Fix for NUTCH-2132: Publisher/Subscriber model for ...

2016-08-02 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/138 Fix for NUTCH-2132: Publisher/Subscriber model for Nutch to emit events This PR is still in progress and needs a review to get the plugin system working. It is not ready to commit as of yet. You

[GitHub] nutch pull request #137: Fix for Nutch-2246: Refactor /seed end point

2016-08-01 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/137 Fix for Nutch-2246: Refactor /seed end point You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2246 Alternatively you can

[GitHub] nutch pull request: Fix for NUTCH-2209 : Improved Tokenization for...

2016-02-02 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/87#discussion_r51670857 --- Diff: conf/nutch-default.xml --- @@ -1390,7 +1390,7 @@ For more detailed information on the working of this filter visit https://wiki.apache.org

[GitHub] nutch pull request: Fix for NUTCH-2209 : Improved Tokenization for...

2016-02-02 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/87#discussion_r51670910 --- Diff: src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/cosine/Model.java --- @@ -0,0 +1,134 @@ +package

[GitHub] nutch pull request: Fix for NUTCH-2209 : Improved Tokenization for...

2016-02-02 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/87#discussion_r51670893 --- Diff: src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/SimilarityScoringFilter.java --- @@ -43,7 +43,11 @@ public

[GitHub] nutch pull request: Fix for NUTCH-2209 : Improved Tokenization for...

2016-01-31 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/87 Fix for NUTCH-2209 : Improved Tokenization for Similarity Scoring plugin (Sujen) You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412

[GitHub] nutch pull request: Fix for NUTCH-2157 Addressing Miredot REST API...

2015-11-13 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/84 Fix for NUTCH-2157 Addressing Miredot REST API warnings You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2157

[GitHub] nutch pull request: NUTCH-2149 REST endpoint to read Nutch sequenc...

2015-10-25 Thread sujen1412
Github user sujen1412 closed the pull request at: https://github.com/apache/nutch/pull/80 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] nutch pull request: NUTCH-2149 REST endpoint to read Nutch sequenc...

2015-10-23 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/80 NUTCH-2149 REST endpoint to read Nutch sequence files Removed debug statement Corrected indentation and line width You can merge this pull request into a Git repository by running

[GitHub] nutch pull request: fix for NUTCH-2128 Refactor config endpoint by...

2015-10-16 Thread sujen1412
Github user sujen1412 closed the pull request at: https://github.com/apache/nutch/pull/69

[GitHub] nutch pull request: Fix for NUTCH-2135 by Sujen Shah

2015-10-09 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/70 Fix for NUTCH-2135 by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2135 Alternatively you can review

[GitHub] nutch pull request: Fix for NUTCH-2135 by Sujen Shah

2015-10-09 Thread sujen1412
Github user sujen1412 closed the pull request at: https://github.com/apache/nutch/pull/70

[GitHub] nutch pull request: fix for NUTCH-2128 Refactor config endpoint by...

2015-10-01 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/69 fix for NUTCH-2128 Refactor config endpoint by Sujen shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2128

[GitHub] nutch pull request: Fix for NUTCH-2086 Contributed by Sujen Shah

2015-09-21 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/61 Fix for NUTCH-2086 Contributed by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2086 Alternatively you can

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-18 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39897636 --- Diff: src/java/org/apache/nutch/crawl/CrawlDb.java --- @@ -261,30 +262,68 @@ public int run(String[] args) throws Exception { additionsAllowed

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-18 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39897798 --- Diff: src/java/org/apache/nutch/metadata/Nutch.java --- @@ -80,4 +80,11 @@ public static final String STAT_PROGRESS = "progress";

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-17 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39821072 --- Diff: src/java/org/apache/nutch/crawl/CrawlDb.java --- @@ -261,30 +262,68 @@ public int run(String[] args) throws Exception { additionsAllowed

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-17 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/59#discussion_r39821056 --- Diff: src/java/org/apache/nutch/crawl/CrawlDb.java --- @@ -236,10 +237,10 @@ public int run(String[] args) throws Exception { * Used for Nutch

[GitHub] nutch pull request: Fix for NUTCH-2099 Contributed by Sujen Shah

2015-09-15 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/59 Fix for NUTCH-2099 Contributed by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2099 Alternatively you can

[GitHub] nutch pull request: Fix for NUTCH-2092 by Sujen Shah

2015-09-08 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/54 Fix for NUTCH-2092 by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2092 Alternatively you can review

[GitHub] nutch pull request: Fix for NUTCH-2066 contributed by Sujen Shah

2015-07-23 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/47 Fix for NUTCH-2066 contributed by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2066 Alternatively you can

[GitHub] nutch pull request: fix for NUTCH-2039 contributed by Sujen Shah

2015-06-17 Thread sujen1412
Github user sujen1412 commented on a diff in the pull request: https://github.com/apache/nutch/pull/30#discussion_r32698822 --- Diff: src/plugin/scoring-similarity/src/java/org/apache/nutch/scoring/similarity/Cosine/CosineSimilarityModel.java --- @@ -0,0 +1,154

[GitHub] nutch pull request: fix for NUTCH-2039 contributed by Sujen Shah

2015-06-15 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/30 fix for NUTCH-2039 contributed by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2039 Alternatively you can

[GitHub] nutch pull request: fix for NUTCH-2037 contributed by Sujen Shah

2015-06-07 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/29 fix for NUTCH-2037 contributed by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2037 Alternatively you can

[GitHub] nutch pull request: added missing class NutchServerInfo for fix fo...

2015-06-02 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/27 added missing class NutchServerInfo for fix for NUTCH-2031 You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2031

[GitHub] nutch pull request: fix for NUTCH-2015 contributed by Sujen Shah

2015-05-16 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/25 fix for NUTCH-2015 contributed by Sujen Shah You can merge this pull request into a Git repository by running: $ git pull https://github.com/sujen1412/nutch NUTCH-2015 Alternatively you can

[GitHub] nutch pull request: fix for Nutch 1973 by sujen1412

2015-03-31 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/16 fix for Nutch 1973 by sujen1412 API calls documented at - https://wiki.apache.org/nutch/Nutch_1.X_RESTAPI. This pull request includes Index, Generate, Fetch, Parse, Update, InvertLinks, Dedup

[GitHub] nutch pull request: fix for Nutch-1966 contributed by sujen1412

2015-03-17 Thread sujen1412
GitHub user sujen1412 opened a pull request: https://github.com/apache/nutch/pull/13 fix for Nutch-1966 contributed by sujen1412 This pull request comprises the necessary code to start the NutchServer and use the configuration endpoint as documented in the wiki - https