[
https://issues.apache.org/jira/browse/TIKA-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214082#comment-15214082
]
Stefano Fornari commented on TIKA-1436:
---
sorry, it took much more than I expected...
8 AM:
-
patch 0001-Improvment-as-described-in-https-issues.apache.org-j.patch
was (Author: stefanofornari):
see comment on 20160328
> improvement to PDFParser
>
>
> Key: TIKA-1436
> URL: https://issues.apache.org/ji
20160328
> improvement to PDFParser
>
>
> Key: TIKA-1436
> URL: https://issues.apache.org/jira/browse/TIKA-1436
> Project: Tika
> Issue Type: Improvement
> Components: parser
&g
[
https://issues.apache.org/jira/browse/TIKA-1285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214107#comment-15214107
]
Tim Allison commented on TIKA-1285:
---
Y, that's what I was thinking about doing with shadi
[
https://issues.apache.org/jira/browse/TIKA-1285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214111#comment-15214111
]
Tim Allison commented on TIKA-1285:
---
As I mentioned on the pdfbox dev list, I'm hesitant
[
https://issues.apache.org/jira/browse/TIKA-1285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214111#comment-15214111
]
Tim Allison edited comment on TIKA-1285 at 3/28/16 11:31 AM:
-
A
[
https://issues.apache.org/jira/browse/TIKA-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214112#comment-15214112
]
Tim Allison commented on TIKA-1566:
---
For the sake of posterity, see this
[comment|https:
[
https://issues.apache.org/jira/browse/TIKA-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214115#comment-15214115
]
Tim Allison commented on TIKA-1910:
---
bq. Using the proxies we can make those dependencies
On Sun, 27 Mar 2016, Bob Paulin wrote:
Yes I think overall if these functions can live in somewhere either
inside tika or a smaller dependent library we're in a better place. I'll
take a look at Ogg-Vorbis.
The two util classes there, that spring to mind, are:
https://github.com/Gagravarr/Vorb
[
https://issues.apache.org/jira/browse/TIKA-1910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214183#comment-15214183
]
Bob Paulin commented on TIKA-1910:
--
Does this mean that if someone doesn't include th
On Sun, 27 Mar 2016, Bob Paulin wrote:
Tika's IOUtils appears to be missing the readFully method. Should that
be added?
There was discussion about getting rid of the Tika IOUtils method in
favour of depending on commons-io. If that method is on commons-io, then
we could use that without need
[
https://issues.apache.org/jira/browse/TIKA-1908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15214214#comment-15214214
]
Nick Burch commented on TIKA-1908:
--
Namespace'd properties, eg
https://tika.apache.org/1.
Hi Bob,
> From: Nick Burch
> Sent: March 28, 2016 6:49:09am PDT
> To: dev@tika.apache.org
> Subject: Re: Tika 2.0 - Replace POI IOUtils with commons-io IOUtils
>
> On Sun, 27 Mar 2016, Bob Paulin wrote:
>> Tika's IOUtils appears to be missing the readFully method. Should that be
>> added?
>
>
[
https://issues.apache.org/jira/browse/TIKA-1911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann reassigned TIKA-1911:
---
Assignee: Chris A. Mattmann
> OpenNLP based SentimentAnalysisParser
>
Ken,
Thank you for reminding me of this issue. Seems we had come to the
agreement to use commons-io in a later version. Doing this in tika-core
would make it a transitive dependency to all the 2.0 parsers which again
would just leave the string utils and LittleEndian code to port over to a
libra
Tim Allison created TIKA-1912:
-
Summary: Figure out how to parse truncated PDFs that were handled
by PDFBox 1.8.x but not by 2.0.0
Key: TIKA-1912
URL: https://issues.apache.org/jira/browse/TIKA-1912
Proje
Dear Anthony,
Great! These both sound like fantastic proposals and I’m happy
to be a mentor. Madhawa, would you like to join in on these
efforts?
Cheers,
Chris
++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science
Hi Chris / Antony
yes I would like to work on this, This proposal address most of the things
in Sentiment analysis,
AFAIK most of the people use OpenNLP Document Categorizer for Sentiment
Analysis, since there isn't a proper functionality to do sentiment analysis
in OpenNLP, This would be great if
[
https://issues.apache.org/jira/browse/TIKA-1897?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann reassigned TIKA-1897:
---
Assignee: Chris A. Mattmann
> Too many daemon threads when NamedEntityParser is enable
Dear Madhawa,
Thank you for your interest in the proposals.
The current tasks we proposed refer to the classification and
quantification regardless of the topic.
This can be used in a larger context where the topic is not specified, or
not unique, in which case we will need to identify the topic(s
20 matches
Mail list logo