[jira] [Created] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread zm (JIRA)
zm created NUTCH-1536:
-

 Summary: Ant build file has hardcoded conf dir location
 Key: NUTCH-1536
 URL: https://issues.apache.org/jira/browse/NUTCH-1536
 Project: Nutch
  Issue Type: Bug
  Components: build
Affects Versions: 2.1
Reporter: zm
Priority: Minor


build.xml loads property files that can override the default locations of
various directories, including conf. One spot in build.xml hardcodes conf
instead of reading it from the conf.dir property.
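
The principle behind this report, resolving a location through a property with a fallback rather than repeating the literal path, can be sketched in Java. This is only an analogy and not the Ant change itself; the class name, the default value "conf", and the override path are assumptions made for illustration.

```java
import java.util.Properties;

public class ConfDirSketch {
    /** Returns the configured conf directory, falling back to an assumed default of "conf". */
    public static String confDir(Properties props) {
        return props.getProperty("conf.dir", "conf");
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        // No override set: the assumed default is used.
        System.out.println(confDir(props));
        // Hypothetical override, as a property file might supply it.
        props.setProperty("conf.dir", "/opt/nutch/custom-conf");
        System.out.println(confDir(props));
    }
}
```

Every consumer that calls confDir picks up the override; a consumer that spells out "conf" directly would silently ignore it, which is the kind of bug described here.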

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread zm (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zm updated NUTCH-1536:
--

Attachment: build.xml.patch

conf.dir property fix


[jira] [Issue Comment Deleted] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread zm (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

zm updated NUTCH-1536:
--

Comment: was deleted

(was: conf.dir property fix)


Re: Eclipse Error

2013-02-26 Thread Danilo Fernandes
Which JDK version works with Nutch trunk?

Does anybody know?

2013/2/25 Danilo Fernandes ferna...@gmail.com

 Feng Lu, thanks for the fast reply.

 But I’m using JavaSE-1.6 (jre6) and always get this error.

 From: feng lu [mailto:amuseme...@gmail.com]
 Sent: Monday, February 25, 2013 22:35
 To: dev@nutch.apache.org
 Subject: Re: Eclipse Error

 Hi Danilo

 "Unsupported major.minor version 51.0" means that your classes were
 compiled with a newer JDK than the one you are trying to run them on.
 Class-file version 51.0 is what JDK 7 produces, so you get this error when
 classes compiled with JDK 7 are run under JDK 6. In the same way, you can't
 run classes compiled with JDK 6 under JDK 5.
 

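To see which release a class was compiled for, the version can be read from the class-file header (4 bytes of magic, 2 bytes minor version, 2 bytes major version). The following is a minimal sketch and not part of Nutch; the class name ClassVersion and its helper methods are hypothetical, and only the releases mentioned in this thread are mapped.

```java
import java.nio.ByteBuffer;

public class ClassVersion {
    /** Maps a class-file major version to the Java release that targets it. */
    public static String javaRelease(int major) {
        switch (major) {
            case 49: return "Java 5";
            case 50: return "Java 6";
            case 51: return "Java 7";
            default: return "unknown (major " + major + ")";
        }
    }

    /** Reads the major version from the first 8 bytes of a class file. */
    public static int majorVersion(byte[] classBytes) {
        ByteBuffer buf = ByteBuffer.wrap(classBytes); // big-endian by default, like class files
        if (buf.getInt() != 0xCAFEBABE) {
            throw new IllegalArgumentException("not a class file");
        }
        buf.getShort();                 // minor version, ignored here
        return buf.getShort() & 0xFFFF; // major version as an unsigned value
    }

    public static void main(String[] args) {
        // Header of a class compiled for Java 7: CAFEBABE, minor 0, major 51.
        byte[] header = {(byte) 0xCA, (byte) 0xFE, (byte) 0xBA, (byte) 0xBE, 0, 0, 0, 51};
        int major = majorVersion(header);
        System.out.println(major + " -> " + javaRelease(major)); // prints "51 -> Java 7"
    }
}
```

To inspect a real file, the bytes could be loaded with java.nio.file.Files.readAllBytes and passed to majorVersion.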

 On Tue, Feb 26, 2013 at 9:12 AM, Danilo Fernandes ferna...@gmail.com
 wrote:

 Hi, I want to make some changes to Nutch so that it takes an HTML page
 and extracts some data from it.

 My problem starts when I compile the code in Eclipse.

 I always receive the following error message.

 Buildfile: C:\Users\Danilo\workspace\Nutch\build.xml
 [taskdef] Could not load definitions from resource org/sonar/ant/antlib.xml. It could not be found.

 ivy-probe-antlib:

 ivy-download:
 [taskdef] Could not load definitions from resource org/sonar/ant/antlib.xml. It could not be found.

 ivy-download-unchecked:

 ivy-init-antlib:

 ivy-init:

 init:
 [mkdir] Created dir: C:\Users\Danilo\workspace\Nutch\build
 [mkdir] Created dir: C:\Users\Danilo\workspace\Nutch\build\classes
 [mkdir] Created dir: C:\Users\Danilo\workspace\Nutch\build\release
 [mkdir] Created dir: C:\Users\Danilo\workspace\Nutch\build\test
 [mkdir] Created dir: C:\Users\Danilo\workspace\Nutch\build\test\classes
 [copy] Copying 8 files to C:\Users\Danilo\workspace\Nutch\conf
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\automaton-urlfilter.txt.template to C:\Users\Danilo\workspace\Nutch\conf\automaton-urlfilter.txt
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\httpclient-auth.xml.template to C:\Users\Danilo\workspace\Nutch\conf\httpclient-auth.xml
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\nutch-site.xml.template to C:\Users\Danilo\workspace\Nutch\conf\nutch-site.xml
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\prefix-urlfilter.txt.template to C:\Users\Danilo\workspace\Nutch\conf\prefix-urlfilter.txt
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\regex-normalize.xml.template to C:\Users\Danilo\workspace\Nutch\conf\regex-normalize.xml
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\regex-urlfilter.txt.template to C:\Users\Danilo\workspace\Nutch\conf\regex-urlfilter.txt
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\subcollections.xml.template to C:\Users\Danilo\workspace\Nutch\conf\subcollections.xml
 [copy] Copying C:\Users\Danilo\workspace\Nutch\conf\suffix-urlfilter.txt.template to C:\Users\Danilo\workspace\Nutch\conf\suffix-urlfilter.txt

 clean-lib:

 resolve-default:
 [ivy:resolve] :: Ivy 2.2.0 - 20100923230623 :: http://ant.apache.org/ivy/ ::
 [ivy:resolve] :: loading settings :: file = C:\Users\Danilo\workspace\Nutch\ivy\ivysettings.xml
 [ivy:resolve] :: problems summary ::
 [ivy:resolve]  ERRORS
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]   unknown resolver main
 [ivy:resolve]

[jira] [Updated] (NUTCH-1186) FreeGenerator always normalizes

2013-02-26 Thread Markus Jelsma (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated NUTCH-1186:
-

Attachment: NUTCH-1186-1.7-1.patch

Patch for 1.7. Uses the same generator normalizing config for the URL 
Partitioner job.

 FreeGenerator always normalizes
 ---

 Key: NUTCH-1186
 URL: https://issues.apache.org/jira/browse/NUTCH-1186
 Project: Nutch
  Issue Type: Bug
  Components: generator
Affects Versions: 1.3
Reporter: Markus Jelsma
Assignee: Markus Jelsma
Priority: Minor
 Fix For: 1.7

 Attachments: NUTCH-1186-1.7-1.patch


 FreeGenerator does not honor the -normalize option; it always normalizes
 all URLs in the input directory. The -filter option is respected.
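
The behaviour this issue asks for, normalization applied only when -normalize is passed and independently of -filter, can be sketched as follows. This is not FreeGenerator's code or the attached patch; the class name and the stand-in normalizer and filter are hypothetical and far simpler than Nutch's real URL normalizers and filters.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

public class OptionalNormalizeSketch {
    /** Stand-in normalizer (hypothetical): trims whitespace and lower-cases the URL. */
    public static String normalizeUrl(String url) {
        return url.trim().toLowerCase(Locale.ROOT);
    }

    /** Stand-in filter (hypothetical): accepts only http and https URLs. */
    public static boolean accept(String url) {
        return url.startsWith("http://") || url.startsWith("https://");
    }

    /** Applies each step only when its flag is set. */
    public static List<String> process(List<String> urls, boolean filter, boolean normalize) {
        List<String> out = new ArrayList<>();
        for (String url : urls) {
            String u = normalize ? normalizeUrl(url) : url;
            if (filter && !accept(u)) {
                continue; // rejected by the filter
            }
            out.add(u);
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> flags = Arrays.asList(args);
        boolean filter = flags.contains("-filter");
        boolean normalize = flags.contains("-normalize");
        List<String> urls = Arrays.asList(" HTTP://Example.COM/A ", "ftp://host/file");
        System.out.println(process(urls, filter, normalize));
    }
}
```

Run without arguments, the input URLs pass through unchanged; the reported bug corresponds to normalize behaving as if it were always true.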


Re: Eclipse Error

2013-02-26 Thread kiran chitturi
I think Nutch requires at least Java 1.6.


[jira] [Commented] (NUTCH-1186) FreeGenerator always normalizes

2013-02-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587374#comment-13587374
 ] 

Lewis John McGibbney commented on NUTCH-1186:
-

I do not use (and have never used) FreeGenerator, Markus. Is there a
reason why it has not honored the normalization configuration so far?


[jira] [Commented] (NUTCH-1529) Port nutch-mongdb-parser to trunk

2013-02-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587382#comment-13587382
 ] 

Lewis John McGibbney commented on NUTCH-1529:
-

Hi lufeng, there are some code formatting issues here. In Nutch we use 2-space
indents for Java code and a similar convention for the XML configuration
files. We also use slf4j over log4j for logging, so you can replace the commons
imports with the slf4j Logger and LoggerFactory imports.
I will be happy to spin up a test server to test this patch if you want to
address these issues. Thank you for the patch, great work.

 Port nutch-mongdb-parser to trunk
 -

 Key: NUTCH-1529
 URL: https://issues.apache.org/jira/browse/NUTCH-1529
 Project: Nutch
  Issue Type: Bug
  Components: injector
Affects Versions: 1.6
Reporter: Lewis John McGibbney
Assignee: lufeng
Priority: Minor
 Fix For: 1.7

 Attachments: NUTCH-1529-trunk.patch


 The initial repo is here [0]
 [0] https://github.com/ctjmorgan/nutch-mongdb-parser


Re: dev Digest 25 Feb 2013 02:27:44 -0000 Issue 1555

2013-02-26 Thread Tejas Patil
Hi Lewis,

I am not sure what needs to be done for #3 and #5, so I left them as open
questions. Once we reach a common understanding, I will open a JIRA issue for
this.

Thanks,
Tejas Patil


On Mon, Feb 25, 2013 at 1:17 PM, Lewis John Mcgibbney 
lewis.mcgibb...@gmail.com wrote:

 Hi Tejas,

 On Sun, Feb 24, 2013 at 6:27 PM, dev-digest-h...@nutch.apache.org wrote:

 Hi Lewis,

 We have not come to a conclusion on this topic.


 Correct, thanks for pushing this one!


 Here is what I propose:
 1. keep generate.max.count
 2. GENERATOR_MIN_SCORE and GENERATOR_MAX_COUNT: once we know whether they
 were kept in 2.x for some valid reason, we can decide if it is safe to
 remove these params. They seem to do nothing meaningful.
 3. generate.min.score : remove ?
 4. generate.filter, generate.normalise, generate.topN : there is no
 problem in keeping them. We could even remove them.
 5. GENERATOR_COUNT_VALUE_IP : ??

 I agree with this, Tejas. Maybe we can open an issue on this and record it?

 Lewis



[jira] [Commented] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587394#comment-13587394
 ] 

Lewis John McGibbney commented on NUTCH-1536:
-

Good catch. Thanks for bringing this to the surface. We can commit this to
both trunk and 2.x.


[jira] [Updated] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-1536:


Fix Version/s: 2.2, 1.7


[jira] [Commented] (NUTCH-1186) FreeGenerator always normalizes

2013-02-26 Thread Markus Jelsma (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587401#comment-13587401
 ] 

Markus Jelsma commented on NUTCH-1186:
--

I think there is little justification. Normalization must be configurable so
that it can be skipped when there is no need for it. This patch fixes the
generator issue, not the FreeGenerator issue; I'll look into that.


[jira] [Commented] (NUTCH-1186) FreeGenerator always normalizes

2013-02-26 Thread Lewis John McGibbney (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587439#comment-13587439
 ] 

Lewis John McGibbney commented on NUTCH-1186:
-

OK Markus. I looked at the patch (not tested) and it looks good to me.


[jira] [Resolved] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Lewis John McGibbney (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney resolved NUTCH-1536.
-

Resolution: Fixed

Committed @revision 1450375 in 2.x HEAD
Committed @revision 1450377 in Trunk code.
Thank you, zm, for your contribution to Nutch.


[jira] [Commented] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587479#comment-13587479
 ] 

Hudson commented on NUTCH-1536:
---

Integrated in Nutch-trunk-Windows #47 (See 
[https://builds.apache.org/job/Nutch-trunk-Windows/47/])
* NUTCH-1536 Ant build file has hardcoded conf dir location (zm via 
lewismc) (Revision 1450377)

 Result = FAILURE
lewismc : http://svn.apache.org/viewvc/nutch/trunk/?view=rev&rev=1450377
Files : 
* /nutch/trunk/CHANGES.txt
* /nutch/trunk/build.xml


[jira] [Commented] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587491#comment-13587491
 ] 

Hudson commented on NUTCH-1536:
---

Integrated in Nutch-nutchgora #509 (See 
[https://builds.apache.org/job/Nutch-nutchgora/509/])
* NUTCH-1536 Ant build file has hardcoded conf dir location (zm via 
lewismc) (Revision 1450375)

 Result = FAILURE
lewismc : http://svn.apache.org/viewvc/nutch/branches/2.x/?view=rev&rev=1450375
Files : 
* /nutch/branches/2.x/CHANGES.txt
* /nutch/branches/2.x/build.xml


[jira] [Commented] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587492#comment-13587492
 ] 

Hudson commented on NUTCH-1536:
---

Integrated in Nutch-trunk #2133 (See 
[https://builds.apache.org/job/Nutch-trunk/2133/])
* NUTCH-1536 Ant build file has hardcoded conf dir location (zm via 
lewismc) (Revision 1450377)

 Result = FAILURE
lewismc : http://svn.apache.org/viewvc/nutch/trunk/?view=rev&rev=1450377
Files : 
* /nutch/trunk/CHANGES.txt
* /nutch/trunk/build.xml


Build failed in Jenkins: Nutch-nutchgora #509

2013-02-26 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-nutchgora/509/changes

Changes:

[lewismc] * NUTCH-1536 Ant build file has hardcoded conf dir location (zm via 
lewismc)

--
[...truncated 3572 lines...]

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: protocol-file
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

jar:

deps-test:

deploy:

copy-generated-lib:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: parse-js
[junit] Running org.apache.nutch.parse.js.TestJSParseFilter
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.539 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: index-anchor
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

compile-test:
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:180:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds
[javac] Compiling 1 source file to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/build/index-anchor/test
[javac] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
[javac] 1 warning

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: index-anchor
[junit] Running org.apache.nutch.indexer.anchor.TestAnchorIndexingFilter
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.765 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: index-basic
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

compile-test:
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:180:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds
[javac] Compiling 1 source file to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/build/index-basic/test
[javac] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
[javac] 1 warning

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: index-basic
[junit] Running org.apache.nutch.indexer.basic.TestBasicIndexingFilter
[junit] Tests run: 2, Failures: 0, Errors: 0, Time elapsed: 1.044 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: index-more
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

compile-test:
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/src/plugin/build-plugin.xml:180:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds
[javac] Compiling 1 source file to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-nutchgora/2.x/build/index-more/test
[javac] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
[javac] 1 warning

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: index-more
[junit] Running org.apache.nutch.indexer.more.TestMoreIndexingFilter
[junit] Tests run: 3, Failures: 0, Errors: 0, Time elapsed: 1.239 sec

init:

init-plugin:
 [echo] Copying language profiles
 [echo] Copying test files

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file 

Build failed in Jenkins: Nutch-trunk #2133

2013-02-26 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/2133/changes

Changes:

[lewismc] * NUTCH-1536 Ant build file has hardcoded conf dir location (zm via 
lewismc)

--
[...truncated 5402 lines...]
copy-generated-lib:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlfilter-regex
[junit] Running org.apache.nutch.tika.TestRTFParser
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.009 sec
[junit] Running org.apache.nutch.urlfilter.regex.TestRegexURLFilter
[junit] Running org.apache.nutch.tika.TestRobotsMetaProcessor
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.299 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: urlfilter-suffix
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

compile-test:
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/src/plugin/build-plugin.xml:180:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds
[javac] Compiling 1 source file to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/build/urlfilter-suffix/test
[javac] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
[javac] 1 warning

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlfilter-suffix
[junit] Running org.apache.nutch.urlfilter.suffix.TestSuffixURLFilter
[junit] Tests run: 6, Failures: 0, Errors: 0, Time elapsed: 0.163 sec
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 4.184 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: urlfilter-validator
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

compile-test:
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/src/plugin/build-plugin.xml:180:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: urlnormalizer-basic
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/src/plugin/build-plugin.xml:117:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds
[javac] Compiling 1 source file to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/build/urlfilter-validator/test

compile-test:
[javac] 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/src/plugin/build-plugin.xml:180:
 warning: 'includeantruntime' was not set, defaulting to 
build.sysclasspath=last; set to false for repeatable builds
[javac] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
[javac] Compiling 1 source file to 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/build/urlnormalizer-basic/test
[javac] 1 warning

jar:
[javac] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlfilter-validator
[javac] 1 warning

jar:

deps-test:

deploy:

copy-generated-lib:

test:
 [echo] Testing plugin: urlnormalizer-basic
[junit] Running org.apache.nutch.urlfilter.validator.TestUrlValidator
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.02 sec
[junit] Running 
org.apache.nutch.net.urlnormalizer.basic.TestBasicURLNormalizer
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 0.089 sec

init:

init-plugin:

deps-jar:

clean-lib:

resolve-default:
[ivy:resolve] :: loading settings :: file = 
/zonestorage/hudson_solaris/home/hudson/hudson-slave/workspace/Nutch-trunk/trunk/ivy/ivysettings.xml

compile:
 [echo] Compiling plugin: urlnormalizer-host
[javac] 

Jenkins build is back to normal : Nutch-trunk #2134

2013-02-26 Thread Apache Jenkins Server
See https://builds.apache.org/job/Nutch-trunk/2134/



[jira] [Commented] (NUTCH-1536) Ant build file has hardcoded conf dir location

2013-02-26 Thread Hudson (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587826#comment-13587826
 ] 

Hudson commented on NUTCH-1536:
---

Integrated in Nutch-2.x-Windows #47 (See 
[https://builds.apache.org/job/Nutch-2.x-Windows/47/])
* NUTCH-1536 Ant build file has hardcoded conf dir location (zm via 
lewismc) (Revision 1450375)

 Result = FAILURE
lewismc : http://svn.apache.org/viewvc/nutch/branches/2.x/?view=rev&rev=1450375
Files : 
* /nutch/branches/2.x/CHANGES.txt
* /nutch/branches/2.x/build.xml


 Ant build file has hardcoded conf dir location
 --

 Key: NUTCH-1536
 URL: https://issues.apache.org/jira/browse/NUTCH-1536
 Project: Nutch
  Issue Type: Bug
  Components: build
Affects Versions: 2.1
Reporter: zm
Priority: Minor
 Fix For: 1.7, 2.2

 Attachments: build.xml.patch


 build.xml loads property files to change default locations of various dirs, 
 including conf. There is one spot in build.xml which has conf hardcoded 
 instead of reading it off conf.dir property.
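
 The pattern the description refers to can be sketched as a minimal Ant
 build file. This is an illustration only, not the attached build.xml.patch:
 the project name, the target name, the build/conf-copy destination and the
 property-file locations are hypothetical, and the spot actually patched in
 Nutch's build.xml may look different.

```xml
<project name="conf-dir-sketch" default="show-conf" basedir=".">

  <!-- Optional override files (hypothetical locations). Ant properties are
       immutable once set, so a conf.dir defined here takes precedence over
       the default below. Missing property files are ignored. -->
  <property file="${user.home}/build.properties"/>
  <property file="${basedir}/build.properties"/>

  <!-- Fallback default, used only if no property file defined conf.dir. -->
  <property name="conf.dir" value="${basedir}/conf"/>

  <target name="show-conf">
    <echo message="Using conf dir: ${conf.dir}"/>

    <!-- A hardcoded reference would read dir="conf" and ignore any
         override; reading the property keeps every reference consistent. -->
    <copy todir="${basedir}/build/conf-copy">
      <fileset dir="${conf.dir}"/>
    </copy>
  </target>

</project>
```

 With a layout like this, running ant -Dconf.dir=/some/other/conf should
 also override the location, since properties given on the command line are
 set before the build file is evaluated.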



[jira] [Updated] (NUTCH-1529) Port nutch-mongdb-parser to trunk

2013-02-26 Thread lufeng (JIRA)

 [ 
https://issues.apache.org/jira/browse/NUTCH-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

lufeng updated NUTCH-1529:
--

Attachment: NUTCH-1529-trunk-v2.patch

Hi Lewis, i have been corrected the issues that your pointed. thank you for 
your review Lewis.

 Port nutch-mongdb-parser to trunk
 -

 Key: NUTCH-1529
 URL: https://issues.apache.org/jira/browse/NUTCH-1529
 Project: Nutch
  Issue Type: Bug
  Components: injector
Affects Versions: 1.6
Reporter: Lewis John McGibbney
Assignee: lufeng
Priority: Minor
 Fix For: 1.7

 Attachments: NUTCH-1529-trunk.patch, NUTCH-1529-trunk-v2.patch


 The initial repo is here [0]
 [0] https://github.com/ctjmorgan/nutch-mongdb-parser



[jira] [Comment Edited] (NUTCH-1529) Port nutch-mongdb-parser to trunk

2013-02-26 Thread lufeng (JIRA)

[ 
https://issues.apache.org/jira/browse/NUTCH-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587938#comment-13587938
 ] 

lufeng edited comment on NUTCH-1529 at 2/27/13 2:49 AM:


Hi Lewis, I have corrected the issues that you pointed out. Thank you for your 
review, Lewis.

  was (Author: amuseme.lu):
Hi Lewis, i have been corrected the issues that your pointed. thank you for 
your review Lewis.
  
 Port nutch-mongdb-parser to trunk
 -

 Key: NUTCH-1529
 URL: https://issues.apache.org/jira/browse/NUTCH-1529
 Project: Nutch
  Issue Type: Bug
  Components: injector
Affects Versions: 1.6
Reporter: Lewis John McGibbney
Assignee: lufeng
Priority: Minor
 Fix For: 1.7

 Attachments: NUTCH-1529-trunk.patch, NUTCH-1529-trunk-v2.patch


 The initial repo is here [0]
 [0] https://github.com/ctjmorgan/nutch-mongdb-parser
