[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Shinichiro Abe (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549328#comment-13549328 ] Shinichiro Abe commented on CONNECTORS-601: Hi Karl, By renewing HttpClient

[jira] [Commented] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549285#comment-13549285 ] Karl Wright commented on CONNECTORS-604: r1431176. Also needed to patch httpc

[jira] [Commented] (CONNECTORS-598) Add mode to use null content if chromed content not found to the RSS connector

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549212#comment-13549212 ] Karl Wright commented on CONNECTORS-598: Also, please provide the stack trace

[jira] [Commented] (CONNECTORS-598) Add mode to use null content if chromed content not found to the RSS connector

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549211#comment-13549211 ] Karl Wright commented on CONNECTORS-598: Is there an exception trace in the lo

[jira] [Commented] (CONNECTORS-598) Add mode to use null content if chromed content not found to the RSS connector

2013-01-09 Thread David Morana (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549153#comment-13549153 ] David Morana commented on CONNECTORS-598: I tried an RSS connector I know was

[jira] [Commented] (CONNECTORS-598) Add mode to use null content if chromed content not found to the RSS connector

2013-01-09 Thread David Morana (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549143#comment-13549143 ] David Morana commented on CONNECTORS-598: Hi Karl, I upgraded to Solr

[jira] [Reopened] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wright reopened CONNECTORS-604: The fix doesn't work, unfortunately. I still get the :80 appended to the Host value. I w

[jira] [Commented] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13549038#comment-13549038 ] Karl Wright commented on CONNECTORS-604: Problem turns out to be that the site

[jira] [Resolved] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wright resolved CONNECTORS-604. Resolution: Fixed > Header differences from ManifoldCF 1.0.1 cause some crawls to f

[jira] [Commented] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548753#comment-13548753 ] Karl Wright commented on CONNECTORS-604: I added the accept headers everywhere

[jira] [Commented] (CONNECTORS-600) Modify Manifold to output dates in ISO-8601 canonical format instead of milliseconds

2013-01-09 Thread David Morana (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-600?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548722#comment-13548722 ] David Morana commented on CONNECTORS-600: It's working great. Thanks Karl!

[jira] [Commented] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548691#comment-13548691 ] Karl Wright commented on CONNECTORS-604: One URL that crawls correctly in 1.0.

Re: Where the 1.1 release stands

2013-01-09 Thread Karl Wright
Then if it cannot be fixed yet, let's postpone the ticket until 1.2.
Karl
On Wed, Jan 9, 2013 at 10:24 AM, Piergiorgio Lucidi wrote:
> 2013/1/2 Karl Wright
>
>> Hi all,
>>
>> The 1.1 release is mainly awaiting an HttpComponents/HttpClient 4.2.3
>> release, which will be hopefully voted on short

[jira] [Resolved] (CONNECTORS-605) Maven build is broken: commons-httpclient version is missing

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piergiorgio Lucidi resolved CONNECTORS-605. Resolution: Fixed r1430951. > Maven build is broken: com

[jira] [Updated] (CONNECTORS-603) Upgrade the CMIS Connector to OpenCMIS 0.8.0

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piergiorgio Lucidi updated CONNECTORS-603: Fix Version/s: ManifoldCF 1.1 > Upgrade the CMIS Connector to OpenCMIS

[jira] [Updated] (CONNECTORS-605) Maven build is broken: commons-httpclient version is missing

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piergiorgio Lucidi updated CONNECTORS-605: Affects Version/s: (was: ManifoldCF 1.0.1) > Maven build is broken:

[jira] [Updated] (CONNECTORS-605) Maven build is broken: commons-httpclient version is missing

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Piergiorgio Lucidi updated CONNECTORS-605: Fix Version/s: ManifoldCF 1.1 Affects Version/s: ManifoldCF 1.0.1

Re: Where the 1.1 release stands

2013-01-09 Thread Piergiorgio Lucidi
2013/1/2 Karl Wright
> Hi all,
>
> The 1.1 release is mainly awaiting an HttpComponents/HttpClient 4.2.3
> release, which will be hopefully voted on shortly. Additionally, we
> need the following before I think we are ready:
>
> - New release artifacts for the Solr 3.x and Solr 4.x plugins (vote

[jira] [Commented] (CONNECTORS-605) Maven build is broken

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548562#comment-13548562 ] Karl Wright commented on CONNECTORS-605: When done there should be NO dependen

[jira] [Commented] (CONNECTORS-605) Maven build is broken

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548561#comment-13548561 ] Karl Wright commented on CONNECTORS-605: Sorry I missed these. You should be

[jira] [Comment Edited] (CONNECTORS-605) Maven build is broken

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548560#comment-13548560 ] Piergiorgio Lucidi edited comment on CONNECTORS-605 at 1/9/13 3:13 PM:

[jira] [Commented] (CONNECTORS-605) Maven build is broken

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548560#comment-13548560 ] Piergiorgio Lucidi commented on CONNECTORS-605: Ok probably we have to

[jira] [Created] (CONNECTORS-605) Maven build is broken

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
Piergiorgio Lucidi created CONNECTORS-605: Summary: Maven build is broken Key: CONNECTORS-605 URL: https://issues.apache.org/jira/browse/CONNECTORS-605 Project: ManifoldCF Issue Type

[jira] [Created] (CONNECTORS-604) Header differences from ManifoldCF 1.0.1 cause some crawls to fail

2013-01-09 Thread Karl Wright (JIRA)
Karl Wright created CONNECTORS-604: Summary: Header differences from ManifoldCF 1.0.1 cause some crawls to fail Key: CONNECTORS-604 URL: https://issues.apache.org/jira/browse/CONNECTORS-604 Project:

[jira] [Created] (CONNECTORS-603) Upgrade the CMIS Connector to OpenCMIS 0.8.0

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
Piergiorgio Lucidi created CONNECTORS-603: Summary: Upgrade the CMIS Connector to OpenCMIS 0.8.0 Key: CONNECTORS-603 URL: https://issues.apache.org/jira/browse/CONNECTORS-603 Project: Manifold

[jira] [Created] (CONNECTORS-602) Remove the SOAP API from the Alfresco Connector

2013-01-09 Thread Piergiorgio Lucidi (JIRA)
Piergiorgio Lucidi created CONNECTORS-602: Summary: Remove the SOAP API from the Alfresco Connector Key: CONNECTORS-602 URL: https://issues.apache.org/jira/browse/CONNECTORS-602 Project: Manif

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Shinichiro Abe (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548460#comment-13548460 ] Shinichiro Abe commented on CONNECTORS-601: Thank you! This is nice modific

[jira] [Resolved] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wright resolved CONNECTORS-601. Resolution: Fixed Changed the way isText works to be compatible with CJK characters.

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548453#comment-13548453 ] Karl Wright commented on CONNECTORS-601: r1430825 represents this new thinking

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548439#comment-13548439 ] Karl Wright commented on CONNECTORS-601: The more I think about this, the more

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548394#comment-13548394 ] Karl Wright commented on CONNECTORS-601: r1430784 raises the global strange/to

[jira] [Assigned] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karl Wright reassigned CONNECTORS-601: Assignee: Karl Wright > make the thresholds of isText() input-able

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13548367#comment-13548367 ] Karl Wright commented on CONNECTORS-601: Hi Abe-san, Just to be clear - if a

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Shinichiro Abe (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547811#comment-13547811 ] Shinichiro Abe commented on CONNECTORS-601: --- Here is a sample site. http://l

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547784#comment-13547784 ] Karl Wright commented on CONNECTORS-601: It also looks like it would most natu

[jira] [Comment Edited] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547781#comment-13547781 ] Karl Wright edited comment on CONNECTORS-601 at 1/9/13 8:58 AM:

[jira] [Commented] (CONNECTORS-601) make the thresholds of isText() input-able

2013-01-09 Thread Karl Wright (JIRA)
[ https://issues.apache.org/jira/browse/CONNECTORS-601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547781#comment-13547781 ] Karl Wright commented on CONNECTORS-601: Agreed that it is problematic if the