Tika 2.0 Migration Guide

2016-05-17 Thread Bob Paulin
Hi, Started to add some content to the migration guide (Thanks for creating Tim Allison!) now that we've got some folks that are pulling 2.x into test projects. Please review and let me know if there are any questions or omissions. Thanks! https://wiki.apache.org/tika/Tika2_0MigrationGuide

[jira] [Commented] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287595#comment-15287595 ] Nick Burch commented on TIKA-1970: -- [~talli...@mitre.org] Any chance you could report this

[jira] [Commented] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287578#comment-15287578 ] Hudson commented on TIKA-1970: -- SUCCESS: Integrated in tika-2.x #95 (See [https://builds.apac

[jira] [Commented] (TIKA-1971) Email saved as .eml with no body not detected as rfc822, while same email saved as plain txt is.

2016-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287577#comment-15287577 ] Hudson commented on TIKA-1971: -- SUCCESS: Integrated in tika-2.x #95 (See [https://builds.apac

[jira] [Commented] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287560#comment-15287560 ] Hudson commented on TIKA-1970: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #993 (See [https://b

[jira] [Commented] (TIKA-1971) Email saved as .eml with no body not detected as rfc822, while same email saved as plain txt is.

2016-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287559#comment-15287559 ] Hudson commented on TIKA-1971: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #993 (See [https://b

Re: [1/3] tika git commit: TIKA-1971 - add another magic for rfc822

2016-05-17 Thread Nick Burch
On Tue, 17 May 2016, talli...@apache.org wrote: TIKA-1971 - add another magic for rfc822 http://git-wip-us.apache.org/repos/asf/tika/blob/e08d0065/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml -- diff --g

[jira] [Updated] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1970: -- Fix Version/s: 2.0 > Date not extracted from email saved as plain txt > -

[jira] [Updated] (TIKA-1971) Email saved as .eml with no body not detected as rfc822, while same email saved as plain txt is.

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1971: -- Fix Version/s: 2.0 > Email saved as .eml with no body not detected as rfc822, while same email > saved a

[jira] [Resolved] (TIKA-1971) Email saved as .eml with no body not detected as rfc822, while same email saved as plain txt is.

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1971. --- Resolution: Fixed Made small modification to our mime-types file in an ongoing whack-a-mole exercise w

[jira] [Updated] (TIKA-1971) Email saved as .eml with no body not detected as rfc822, while same email saved as plain txt is.

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1971: -- Fix Version/s: 1.14 > Email saved as .eml with no body not detected as rfc822, while same email > saved

[jira] [Resolved] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1970. --- Resolution: Fixed Added handling for Mac Mail plain text date format. Thank you for opening this! > D

[jira] [Updated] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1970: -- Fix Version/s: 1.14 > Date not extracted from email saved as plain txt >

[jira] [Updated] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-1970: -- Description: HI have two email testfiles: (1) A file that has been created by using "save as" in Mac Mai

[jira] [Commented] (TIKA-1941) Can only respond correctly to its first request and cannot assign a User-Key dynamically

2016-05-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287330#comment-15287330 ] ASF GitHub Bot commented on TIKA-1941: -- Github user reevapp closed the pull request at

[jira] [Commented] (TIKA-1941) Can only respond correctly to its first request and cannot assign a User-Key dynamically

2016-05-17 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287333#comment-15287333 ] ASF GitHub Bot commented on TIKA-1941: -- GitHub user reevapp opened a pull request:

[GitHub] tika pull request: fix for TIKA-1941 contributed by Mark Duske

2016-05-17 Thread reevapp
GitHub user reevapp opened a pull request: https://github.com/apache/tika/pull/120 fix for TIKA-1941 contributed by Mark Duske Class transformed into thread-safe and allows for a Lingo24 User-Key to be dynamically assigned You can merge this pull request into a Git repository by ru

[GitHub] tika pull request: fix for TIKA-1941 contributed by Mark Duske

2016-05-17 Thread reevapp
Github user reevapp closed the pull request at: https://github.com/apache/tika/pull/100 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabl

Re: GSoC 2016: OpenNLP Sentiment Analysis

2016-05-17 Thread Mattmann, Chris A (3980)
Great, OK saw your conversation on Hangouts, I’ll reply back there and we can set something up for tomorrow cheers! ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsio

Re: GSoC 2016: OpenNLP Sentiment Analysis

2016-05-17 Thread Anastasija Mensikova
Hi Chris, I just sent you a Hangout invitation. I definitely can and want to talk tomorrow. I'm back at home (in Latvia) now, so I'm free any time of the day here (with the time difference it would be from around 7am ET till maybe 3pm or 4pm ET the latest). Let me know! Thank you, Anastasija On

[jira] [Comment Edited] (TIKA-1970) Date not extracted from email saved as plain txt

2016-05-17 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15284699#comment-15284699 ] Tim Allison edited comment on TIKA-1970 at 5/17/16 1:42 PM: Thi

Re: Testing 2.0-SNAPSHOT with Apache CXF Tika demo

2016-05-17 Thread Sergey Beryozkin
Hi Bob On 17/05/16 02:37, Bob Paulin wrote: Sergey, Great to hear the code works well with the new modules! And I do agree that Tika has a number of application specific usecases that can be explored. I think the other goal is making the upgrade paths easier so developers don't have to drag "J

[jira] [Created] (TIKA-1975) Different behaviour between tika-app and tika-server

2016-05-17 Thread Sebastian Hesse (JIRA)
Sebastian Hesse created TIKA-1975: - Summary: Different behaviour between tika-app and tika-server Key: TIKA-1975 URL: https://issues.apache.org/jira/browse/TIKA-1975 Project: Tika Issue Type: