[jira] [Comment Edited] (CONNECTORS-1591) RTF comment parsing problem

2019-03-12 Thread Zoltan Farago (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16790321#comment-16790321
 ] 

Zoltan Farago edited comment on CONNECTORS-1591 at 3/12/19 7:40 AM:


[~kwri...@metacarta.com] Manifold version is 2.10 an we do not use the mapper 
attachment. We tried TIka 1.17 and 1.19 both have the same problem. 


was (Author: zfarago):
[~kwri...@metacarta.com] Manifold version is 2.10 an we do not use the mapper 
attachment. We tried TIka 1.17 and 1.19 both has the same problem. 

> RTF comment parsing problem
> ---
>
> Key: CONNECTORS-1591
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1591
> Project: ManifoldCF
>  Issue Type: Bug
>Reporter: Zoltan Farago
>Priority: Major
> Attachments: comment.rtf, result.txt
>
>
> We have a problem with Manifold/Tika. When a comment is parsed from and RTF 
> file, the result has no separator. see attachments



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (CONNECTORS-1591) RTF comment parsing problem

2019-03-12 Thread Karl Wright (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16790294#comment-16790294
 ] 

Karl Wright edited comment on CONNECTORS-1591 at 3/12/19 7:18 AM:
--

[~zfarago]  Ok, we're getting closer.

What version of ManifoldCF is this? And, are you using the ES mapper attachment?




was (Author: kwri...@metacarta.com):
[~zfarago]  Ok, we're getting closer.

What version of ManifoldCF is this?


> RTF comment parsing problem
> ---
>
> Key: CONNECTORS-1591
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1591
> Project: ManifoldCF
>  Issue Type: Bug
>Reporter: Zoltan Farago
>Priority: Major
> Attachments: comment.rtf, result.txt
>
>
> We have a problem with Manifold/Tika. When a comment is parsed from and RTF 
> file, the result has no separator. see attachments



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Comment Edited] (CONNECTORS-1591) RTF comment parsing problem

2019-03-12 Thread Zoltan Farago (JIRA)


[ 
https://issues.apache.org/jira/browse/CONNECTORS-1591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16790287#comment-16790287
 ] 

Zoltan Farago edited comment on CONNECTORS-1591 at 3/12/19 6:58 AM:


[~kwri...@metacarta.com] the output is an Elastic index. Comments in all other 
filetypes (.doc, .xls, .pdf, .dcx, .odt, etc) are separated with space from the 
content text. 

in RTF files the space is missing.


was (Author: zfarago):
the output is an Elastic index. Comments in all other filetypes (.doc, .xls, 
.pdf, .dcx, .odt, etc) are separated with space from the content text. 

> RTF comment parsing problem
> ---
>
> Key: CONNECTORS-1591
> URL: https://issues.apache.org/jira/browse/CONNECTORS-1591
> Project: ManifoldCF
>  Issue Type: Bug
>Reporter: Zoltan Farago
>Priority: Major
> Attachments: comment.rtf, result.txt
>
>
> We have a problem with Manifold/Tika. When a comment is parsed from and RTF 
> file, the result has no separator. see attachments



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)