[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-05-25 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15301536#comment-15301536
 ] 

ASF GitHub Bot commented on NUTCH-2234:
---

Github user lewismc commented on a diff in the pull request:

https://github.com/apache/nutch/pull/118#discussion_r64692054
  
--- Diff: ivy/ivy.xml ---
@@ -105,6 +105,10 @@



+   
--- End diff --

Why are these Tomcat dependencies added?


> Upgrade to elasticsearch 2.1.1
> --
>
> Key: NUTCH-2234
> URL: https://issues.apache.org/jira/browse/NUTCH-2234
> Project: Nutch
>  Issue Type: Improvement
>  Components: indexer
>Affects Versions: 1.11
>Reporter: Tien Nguyen Manh
>Assignee: Markus Jelsma
> Fix For: 1.13
>
> Attachments: NUTCH-2234.patch
>
>
> Currently we use elasticsearch 1.x, We should upgrade to 2.x



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] nutch pull request: fix for NUTCH-2234 and NUTCH-2236

2016-05-25 Thread lewismc
Github user lewismc commented on a diff in the pull request:

https://github.com/apache/nutch/pull/118#discussion_r64692054
  
--- Diff: ivy/ivy.xml ---
@@ -105,6 +105,10 @@



+   
--- End diff --

Why are these Tomcat dependencies added?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-05-25 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15300627#comment-15300627
 ] 

ASF GitHub Bot commented on NUTCH-2234:
---

GitHub user naegelejd opened a pull request:

https://github.com/apache/nutch/pull/118

fix for NUTCH-2234 and NUTCH-2236

Upgrade Elasticsearch and Lucene dependencies, which, in turn, requires 
updates to Guava and Hadoop dependencies:

- Elasticsearch 1.4.1 -> Elasticsearch 2.3.3
- Lucene 4.10.2 -> 5.5.0
- Guava 16.0.1 -> Guava 18.0
- Hadoop 2.4.0 -> 2.7.2

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/naegelejd/nutch NUTCH-2234

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/nutch/pull/118.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #118


commit 31e738a014576d8a4d4c8e8d3a0fc8d9fe5f8077
Author: Joseph Naegele 
Date:   2016-05-25T18:27:31Z

fix for NUTCH-2234 and NUTCH-2236

upgrades Elasticsearch and Lucene dependencies, which, in turn,
requires updates to Guava and Hadoop dependencies:

- Elasticsearch 1.4.1 -> Elasticsearch 2.3.3
- Lucene 4.10.2 -> 5.5.0
- Guava 16.0.1 -> Guava 18.0
- Hadoop 2.4.0 -> 2.7.2




> Upgrade to elasticsearch 2.1.1
> --
>
> Key: NUTCH-2234
> URL: https://issues.apache.org/jira/browse/NUTCH-2234
> Project: Nutch
>  Issue Type: Improvement
>  Components: indexer
>Affects Versions: 1.11
>Reporter: Tien Nguyen Manh
>Assignee: Markus Jelsma
> Fix For: 1.13
>
> Attachments: NUTCH-2234.patch
>
>
> Currently we use elasticsearch 1.x, We should upgrade to 2.x



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[GitHub] nutch pull request: fix for NUTCH-2234 and NUTCH-2236

2016-05-25 Thread naegelejd
GitHub user naegelejd opened a pull request:

https://github.com/apache/nutch/pull/118

fix for NUTCH-2234 and NUTCH-2236

Upgrade Elasticsearch and Lucene dependencies, which, in turn, requires 
updates to Guava and Hadoop dependencies:

- Elasticsearch 1.4.1 -> Elasticsearch 2.3.3
- Lucene 4.10.2 -> 5.5.0
- Guava 16.0.1 -> Guava 18.0
- Hadoop 2.4.0 -> 2.7.2

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/naegelejd/nutch NUTCH-2234

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/nutch/pull/118.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #118


commit 31e738a014576d8a4d4c8e8d3a0fc8d9fe5f8077
Author: Joseph Naegele 
Date:   2016-05-25T18:27:31Z

fix for NUTCH-2234 and NUTCH-2236

upgrades Elasticsearch and Lucene dependencies, which, in turn,
requires updates to Guava and Hadoop dependencies:

- Elasticsearch 1.4.1 -> Elasticsearch 2.3.3
- Lucene 4.10.2 -> 5.5.0
- Guava 16.0.1 -> Guava 18.0
- Hadoop 2.4.0 -> 2.7.2




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Commented] (NUTCH-2234) Upgrade to elasticsearch 2.1.1

2016-05-25 Thread Joseph Naegele (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-2234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15300590#comment-15300590
 ] 

Joseph Naegele commented on NUTCH-2234:
---

Understood. The update to Lucene analyzers requires minor programmatic API 
changes in scoring-similarity, but nothing big. None of the indexers have 
tests, so I'm testing indexer-elastic manually for now. Unfortunately updating 
Elasticsearch breaks the plugin due to differences in guava versions: 
indexer-elastic depends on guava-18.0, which it declares in its plugin.xml, but 
guava-16.0.1 is a Nutch-wide dependency (for Hadoop). We avoided this issue in 
the past by also updating Nutch's Hadoop dependency from 2.4.0 -> 2.7.1, which 
is why Tien created NUTCH-2246. I'll open the PR with all aforementioned 
dependency updates.

> Upgrade to elasticsearch 2.1.1
> --
>
> Key: NUTCH-2234
> URL: https://issues.apache.org/jira/browse/NUTCH-2234
> Project: Nutch
>  Issue Type: Improvement
>  Components: indexer
>Affects Versions: 1.11
>Reporter: Tien Nguyen Manh
>Assignee: Markus Jelsma
> Fix For: 1.13
>
> Attachments: NUTCH-2234.patch
>
>
> Currently we use elasticsearch 1.x, We should upgrade to 2.x



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)