[jira] [Commented] (BEAM-3004) TikaIOTest#testReadPdfFile is flaky.

2017-11-03 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237606#comment-16237606 ] Sergey Beryozkin commented on BEAM-3004: This can now be resolved, the current TikaIOTest does not

[jira] [Commented] (BEAM-2994) Refactor TikaIO

2017-10-30 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16225703#comment-16225703 ] Sergey Beryozkin commented on BEAM-2994: Thanks for merging this PR > Refactor TikaIO >

[jira] [Comment Edited] (BEAM-3004) TikaIOTest#testReadPdfFile is flaky.

2017-09-30 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16187099#comment-16187099 ] Sergey Beryozkin edited comment on BEAM-3004 at 9/30/17 1:49 PM: - Thanks,

[jira] [Commented] (BEAM-3004) TikaIOTest#testReadPdfFile is flaky.

2017-09-30 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-3004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16187099#comment-16187099 ] Sergey Beryozkin commented on BEAM-3004: Thanks, looks like the asynchronous TikaReader

[jira] [Created] (BEAM-2994) Refactor TikaIO

2017-09-27 Thread Sergey Beryozkin (JIRA)
Sergey Beryozkin created BEAM-2994: -- Summary: Refactor TikaIO Key: BEAM-2994 URL: https://issues.apache.org/jira/browse/BEAM-2994 Project: Beam Issue Type: Task Components:

[jira] [Resolved] (BEAM-2874) TikaIO JavaDocs have minor typos

2017-09-11 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin resolved BEAM-2874. Resolution: Invalid Have just read that the doc typos do not require opening JIRA issues :-)

[jira] [Updated] (BEAM-2874) TikaIO JavaDocs have minor typos

2017-09-11 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin updated BEAM-2874: --- Description: Some of TikaIO sources have the minor doc typos > TikaIO JavaDocs have minor

[jira] [Created] (BEAM-2874) TikaIO JavaDocs have minor typos

2017-09-11 Thread Sergey Beryozkin (JIRA)
Sergey Beryozkin created BEAM-2874: -- Summary: TikaIO JavaDocs have minor typos Key: BEAM-2874 URL: https://issues.apache.org/jira/browse/BEAM-2874 Project: Beam Issue Type: Bug

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-07-13 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16085343#comment-16085343 ] Sergey Beryozkin commented on BEAM-2328: [~talli...@mitre.org] Hi Tim - the PR has been updated to

[jira] [Updated] (BEAM-2328) Introduce Apache Tika Input component

2017-07-13 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin updated BEAM-2328: --- Fix Version/s: 2.2.0 > Introduce Apache Tika Input component >

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-16 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16051834#comment-16051834 ] Sergey Beryozkin commented on BEAM-2328: HI All, The initial cleanup of the 'tikaio' branch is now

[jira] [Comment Edited] (BEAM-2328) Introduce Apache Tika Input component

2017-06-14 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049057#comment-16049057 ] Sergey Beryozkin edited comment on BEAM-2328 at 6/14/17 11:09 AM: -- Hi JB,

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-14 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16049057#comment-16049057 ] Sergey Beryozkin commented on BEAM-2328: Hi JB, All, I'm now ready to create the initial PR. As I

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-02 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16034412#comment-16034412 ] Sergey Beryozkin commented on BEAM-2328: Hi JB, Tim re org.json dependencies, FYI, at the moment

[jira] [Comment Edited] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032904#comment-16032904 ] Sergey Beryozkin edited comment on BEAM-2328 at 6/1/17 1:05 PM: Sorry, Tika

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032904#comment-16032904 ] Sergey Beryozkin commented on BEAM-2328: Sorry, Tika already reports the characters... > Introduce

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032881#comment-16032881 ] Sergey Beryozkin commented on BEAM-2328: Hi JB, Tim Yes, TikaReader returns Strings, but as JB

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-06-01 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16032825#comment-16032825 ] Sergey Beryozkin commented on BEAM-2328: I've added some TikaReader and TikaSource tests. Tika

[jira] [Resolved] (BEAM-2361) Add TikaIO to the list of in-progress transforms

2017-05-26 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin resolved BEAM-2361. Resolution: Fixed Thanks for applying the patch > Add TikaIO to the list of in-progress

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-05-25 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16024981#comment-16024981 ] Sergey Beryozkin commented on BEAM-2328: The initial code is here:

[jira] [Commented] (BEAM-2328) Introduce Apache Tika Input component

2017-05-24 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16023826#comment-16023826 ] Sergey Beryozkin commented on BEAM-2328: Sorry for a bit of a noise, I spotted in the docs that the

[jira] [Issue Comment Deleted] (BEAM-2328) Introduce Apache Tika Input component

2017-05-24 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Beryozkin updated BEAM-2328: --- Comment: was deleted (was: Hi, pull request #250 has been created. thanks) > Introduce Apache

[jira] [Created] (BEAM-2361) Add TikaIO to the list of in-progress transforms

2017-05-24 Thread Sergey Beryozkin (JIRA)
Sergey Beryozkin created BEAM-2361: -- Summary: Add TikaIO to the list of in-progress transforms Key: BEAM-2361 URL: https://issues.apache.org/jira/browse/BEAM-2361 Project: Beam Issue Type:

[jira] [Comment Edited] (BEAM-2328) Introduce Apache Tika Input component

2017-05-24 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16022676#comment-16022676 ] Sergey Beryozkin edited comment on BEAM-2328 at 5/24/17 10:37 AM: -- Apache

[jira] [Comment Edited] (BEAM-2328) Introduce Apache Tika Input component

2017-05-24 Thread Sergey Beryozkin (JIRA)
[ https://issues.apache.org/jira/browse/BEAM-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16022676#comment-16022676 ] Sergey Beryozkin edited comment on BEAM-2328 at 5/24/17 10:36 AM: -- Apache

[jira] [Created] (BEAM-2328) Introduce Apache Tika Input component

2017-05-19 Thread Sergey Beryozkin (JIRA)
Sergey Beryozkin created BEAM-2328: -- Summary: Introduce Apache Tika Input component Key: BEAM-2328 URL: https://issues.apache.org/jira/browse/BEAM-2328 Project: Beam Issue Type: New Feature