tika-trunk-jdk1.7 - Build # 88 - Failure

2014-07-11 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.7 (build #88) Status: Failure Check console output at https://builds.apache.org/job/tika-trunk-jdk1.7/88/ to view the results.

tika-trunk-jdk1.6 - Build # 87 - Failure

2014-07-11 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.6 (build #87) Status: Failure Check console output at https://builds.apache.org/job/tika-trunk-jdk1.6/87/ to view the results.

[jira] [Created] (TIKA-1365) Incorrectly MimeType detection for Apache Lucene web site

2014-07-11 Thread Tien Nguyen Manh (JIRA)
Tien Nguyen Manh created TIKA-1365: -- Summary: Incorrectly MimeType detection for Apache Lucene web site Key: TIKA-1365 URL: https://issues.apache.org/jira/browse/TIKA-1365 Project: Tika Issu

[jira] [Updated] (TIKA-1365) Incorrectly MimeType detection for Apache Lucene web site

2014-07-11 Thread Tien Nguyen Manh (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tien Nguyen Manh updated TIKA-1365: --- Attachment: discussion.html I attached that page in case it will be changed > Incorrectly Mim

[jira] [Updated] (TIKA-1242) Update CXF version to 3.1.0

2014-07-11 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin updated TIKA-1242: --- Description: CXF 3.1.0 JAX-RS front-end offers a complete JAX-RS 2.0 support, has fewer depen

[jira] [Commented] (TIKA-1351) Parser implementations should accept null content handlers

2014-07-11 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14058619#comment-14058619 ] Sergey Beryozkin commented on TIKA-1351: See r1609677 for an initial update. PDFPar

[jira] [Comment Edited] (TIKA-1351) Parser implementations should accept null content handlers

2014-07-11 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14058619#comment-14058619 ] Sergey Beryozkin edited comment on TIKA-1351 at 7/11/14 10:29 AM: ---

[jira] [Commented] (TIKA-1351) Parser implementations should accept null content handlers

2014-07-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14058644#comment-14058644 ] Hudson commented on TIKA-1351: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #89 (See [https://bu

[jira] [Commented] (TIKA-1351) Parser implementations should accept null content handlers

2014-07-11 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1351?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14058658#comment-14058658 ] Hudson commented on TIKA-1351: -- SUCCESS: Integrated in tika-trunk-jdk1.6 #88 (See [https://bu

Wrong parsing of XML

2014-07-11 Thread Avi Hayun
Hi, Scenario: 1. I use tika-core in my app 2. I use the following to detect the stream's media type: byte[] bytes = IOUtils.toByteArray(new URL("http://www.amazon.com/sitemap_video.xml")); String contentType = new Tika().detect(bytes); obviously when looking at the sitemap - it is of type app

Re: Wrong parsing of XML

2014-07-11 Thread Nick Burch
On Fri, 11 Jul 2014, Avi Hayun wrote: 1. I use tika-core in my app 2. I use the following to detect the stream's media type: byte[] bytes = IOUtils.toByteArray(new URL("http://www.amazon.com/sitemap_video.xml")); That file doesn't have an xml header on the front, which is probably why it isn

Re: Wrong parsing of XML

2014-07-11 Thread Ken Krugler
On Jul 11, 2014, at 8:01am, Avi Hayun wrote: > Hi, > > Scenario: > 1. I use tika-core in my app > 2. I use the following to detect the stream's media type: > > byte[] bytes = IOUtils.toByteArray(new URL("http://www.amazon.com/sitemap_video.xml")); > String contentType = new Tika().detect(by

Missing 1.5 release tag

2014-07-11 Thread Emmanuel Bourg
Hi, I noticed the latest release wasn't tagged in Subversion: http://svn.apache.org/repos/asf/tika/tags/ There are the RC tags but not the final one. Could you please add it? I need the tag to track the new releases and then update the Tika package in Debian. Thank you, Emmanuel Bourg

Re: Wrong parsing of XML

2014-07-11 Thread Avi Hayun
Thank you Ken and Nick. You were right. Instead of passing the bytes, I pass now the URL and it works. Avi. On Fri, Jul 11, 2014 at 6:08 PM, Ken Krugler wrote: > > On Jul 11, 2014, at 8:01am, Avi Hayun wrote: > > > Hi, > > > > Scenario: > > 1. I use tika-core in my app > > 2. I use the

Re: Missing 1.5 release tag

2014-07-11 Thread Mattmann, Chris A (3980)
Done! [mattmann-0420740:~/git/drat/docs] mattmann% svn copy -m "Tika 1.5 release" https://svn.apache.org/repos/asf/tika/tags/1.5-rc2 https://svn.apache.org/repos/asf/tika/tags/1.5 Committed revision 1609736. [mattmann-0420740:~/git/drat/docs] mattmann% Tika has a package in Debian??! Can you pr

[jira] [Created] (TIKA-1366) Update some of Tika Server services to JAX-RS 2.0 AsyncResponse

2014-07-11 Thread Sergey Beryozkin (JIRA)
Sergey Beryozkin created TIKA-1366: -- Summary: Update some of Tika Server services to JAX-RS 2.0 AsyncResponse Key: TIKA-1366 URL: https://issues.apache.org/jira/browse/TIKA-1366 Project: Tika

[jira] [Updated] (TIKA-1366) Update some of Tika Server services to support JAX-RS 2.0 AsyncResponse

2014-07-11 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin updated TIKA-1366: --- Description: Some of Tika Server services will benefit from optionally supporting JAX-RS 2.0 A