[jira] [Resolved] (TIKA-896) OSGi deployment without declarative services

2012-04-19 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-896. Resolution: Fixed Fix Version/s: 1.2 Assignee: Jukka Zitting Thanks! Patch committed

[jira] [Resolved] (TIKA-743) Upgrade to Apache parent POM version 10

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-743. Resolution: Fixed Done in revision 1179209. > Upgrade to Apache parent POM version 1

[jira] [Resolved] (TIKA-739) For certain DWG files, the Tika content parser outputs garbage

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-739. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting I fixed this in revision

[jira] [Resolved] (TIKA-741) "Zip bomb" (XML nesting) detection is too strict

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-741. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting In revision 1179254 I in

[jira] [Resolved] (TIKA-730) WriteOutContentHandler concatenates title tag and body text.

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-730. Resolution: Won't Fix Resolving as Won't Fix since in this case the WriteOutContentHandler class wor

[jira] [Resolved] (TIKA-699) Automatic checks against backwards-incompatible API changes

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-699. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Added checks for tika-co

[jira] [Resolved] (TIKA-744) Drop support for Java 1.4

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-744. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Done in revision 1179322

[jira] [Resolved] (TIKA-642) Few of RTF files not extracting properly

2011-10-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-642. Resolution: Duplicate The example file no longer causes problems with the latest trunk, so I guess t

[jira] [Resolved] (TIKA-396) Parser Attachements from Outlook Messages

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-396. Resolution: Fixed Fix Version/s: 1.0 Looks like this one is already fixed. >

[jira] [Resolved] (TIKA-123) Structured MS Office parsing

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-123. Resolution: Duplicate Much of this was already implemented recently in other issues, so resolving as

[jira] [Resolved] (TIKA-448) Tika FLVParser hangs

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-448. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Fixed in revision 117996

[jira] [Resolved] (TIKA-487) ContainerAwareDetector doesn't support truncated Open XML files

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-487. Resolution: Fixed Fix Version/s: 1.0 I guess we can mark this as resolved based on revision 98

[jira] [Resolved] (TIKA-433) Tika + Hadoop

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-433. Resolution: Won't Fix Resolving as Won't Fix as discussed above. > Tika + Hadoop > -

[jira] [Resolved] (TIKA-429) Error parsing DTD

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-429. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Looks like there's no ea

[jira] [Resolved] (TIKA-554) ParseUtils.getStringContent needs an option to set the write limit that can be passed into the BodyContentHandler

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-554. Resolution: Won't Fix Assignee: Jukka Zitting Resolving as Won't Fix since the ParseUtils class

[jira] [Resolved] (TIKA-545) While trying to extract meta data(Created date,Modified date) from .docx,.xlsx files it returns only current date.

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-545. Resolution: Duplicate Looks like this was fixed some other issue, so resolving as a duplicate.

[jira] [Resolved] (TIKA-581) Parser fails on files that parsed with v0.7

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-581. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting This was already fixed.

[jira] [Resolved] (TIKA-576) OutofMemory issues while building Tika

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-576. Resolution: Won't Fix Resolving as Won't Fix since this is a rare enough problem and the workaround

[jira] [Resolved] (TIKA-509) Container contents extraction

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-509. Resolution: Fixed Fix Version/s: 1.0 Resolving as fixed as discussed above. >

[jira] [Resolved] (TIKA-685) Unexpected RuntimeException from org.apache.tika.parser.microsoft.OfficeParser@1a8402c

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-685. Resolution: Duplicate Works with latest Tika, so resolving as a duplicate of some of the other recent

[jira] [Resolved] (TIKA-541) Use commons-cli in lieu of writing our own option parser

2011-10-07 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-541. Resolution: Won't Fix I don't see much benefit to using commons-cli in our case, so resolving as Won

[jira] [Resolved] (TIKA-750) JavaDoc of Tika XPathParser should mention descendant:node()

2011-10-10 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-750. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Thanks! Fixed as suggest

[jira] [Resolved] (TIKA-575) Links on the Web-Site for 0.8 to API not correct

2011-10-10 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-575. Resolution: Fixed Assignee: Jukka Zitting Fixed in revision 1181266. > Links o

[jira] [Resolved] (TIKA-670) MD5 sum is wrong on http://tika.apache.org/download.html

2011-10-10 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-670. Resolution: Fixed This is now fixed. > MD5 sum is wrong on http://tika.apache.org/do

[jira] [Resolved] (TIKA-681) eight new n-gram language profiles

2011-10-10 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-681. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Test cases would be nice

[jira] [Resolved] (TIKA-752) Typo in timezone used in Metadata.iso8601Format

2011-10-13 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-752. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Fixed in revision 118303

[jira] [Resolved] (TIKA-734) Out of memory exception with Xlsx file less than 5 MB

2011-10-13 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-734. Resolution: Cannot Reproduce I can't reproduce the problem you're describing. On my computer the fol

[jira] [Resolved] (TIKA-636) Taking very high heap space while parsing docx - Resulting in OOM in tha app

2011-10-13 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-636. Resolution: Incomplete There's little we can do about this without a concrete test case. Please reope

[jira] [Resolved] (TIKA-657) Email parser gets into trouble on malformed html in enron corpus

2011-10-14 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-657. Resolution: Fixed Fix Version/s: 1.0 I was able to process the entire Enron corpus without pro

[jira] [Resolved] (TIKA-746) Support custom mime types

2011-10-26 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-746. Resolution: Fixed Fix Version/s: (was: 1.1) 1.0 There was a backwards c

[jira] [Resolved] (TIKA-703) Drop deprecated methods/classes/interfaces

2011-10-27 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-703. Resolution: Fixed Fix Version/s: (was: 1.1) 1.0 Assignee: Jukka

[jira] [Resolved] (TIKA-761) Provide version number by CLI argument -V

2011-10-28 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-761. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting In revision 1190416 I ad

[jira] [Resolved] (TIKA-565) Improved OSGi bundling

2011-10-31 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-565. Resolution: Fixed Fix Version/s: (was: 1.1) 1.0 This is now pretty much

[jira] [Resolved] (TIKA-763) Update license metadata

2011-11-01 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-763?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-763. Resolution: Fixed Assignee: Jukka Zitting In revision 1196041 I added a workaround to exclude t

[jira] [Resolved] (TIKA-769) Upgrade to Commons Compress 1.3

2011-11-02 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-769. Resolution: Fixed Fix Version/s: 1.0 Assignee: Jukka Zitting Done in revision 1196547

[jira] [Resolved] (TIKA-764) OpenDocumentMetaParser should use common metadata keys for document statistics

2011-11-02 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-764. Resolution: Fixed > OpenDocumentMetaParser should use common metadata keys for document statistic

[jira] [Resolved] (TIKA-772) media type detection fails for html documents, results in text/plain instead of text/html

2011-11-05 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-772. Resolution: Cannot Reproduce Assignee: Jukka Zitting Works for me: {code} $ for f in *.html; d

[jira] [Resolved] (TIKA-780) Optimize loading of the media type registry

2011-11-11 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-780. Resolution: Fixed Fix Version/s: 1.1 Assignee: Jukka Zitting With various refactoring

[jira] [Resolved] (TIKA-783) MD5 and SHA1 values posted on the download page for the .jar do not match actual computed values

2011-11-15 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-783. Resolution: Invalid No, the values on the web site are correct. I suspect the jar you downloaded may

[jira] [Resolved] (TIKA-828) TaggedIOException can be passed non Serializable objects

2011-12-23 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-828. Resolution: Fixed Fix Version/s: 1.1 Assignee: Jukka Zitting Thanks! I think that the

[jira] [Resolved] (TIKA-808) Fork Parser doesn't work for PDF files

2011-12-23 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-808. Resolution: Fixed Fix Version/s: 1.1 Assignee: Jukka Zitting Fixed in revision 122288

[jira] [Resolved] (TIKA-838) EmptyParser Singleton should be final

2012-01-03 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-838. Resolution: Fixed Fix Version/s: 1.1 Assignee: Jukka Zitting I suppose in this case i

[jira] [Resolved] (TIKA-86) Support magic(5) files

2012-01-16 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-86. --- Resolution: Won't Fix Agreed with the points above, so resolving as Won't Fix. Let's follow up in separ

[jira] [Resolved] (TIKA-866) Invalid configuration file causes OutOfMemoryException

2012-02-17 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-866. Resolution: Fixed Fix Version/s: 1.1 Assignee: Jukka Zitting Fixed in revision 124544

[jira] [Resolved] (TIKA-864) Metadata.formatDate causes blocking in concurrent use

2012-02-17 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-864. Resolution: Fixed Fix Version/s: 1.1 Assignee: Jukka Zitting bq. perhaps the best sol

[jira] [Resolved] (TIKA-878) Reuse computed Map inside CompositeParser

2012-03-20 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-878. Resolution: Won't Fix OK, no problem. And thanks for benchmarking this! Resolving as Won't Fix for n

[jira] [Resolved] (TIKA-884) Dynamic loading of Parser and Detector services

2012-03-27 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-884. Resolution: Fixed Fix Version/s: 1.2 Fixed in revision 1305920. > Dynamic loa