[jira] Created: (TIKA-439) DWGParser (and some others) not used by AutoDetectParser

2010-06-15 Thread Nick Burch (JIRA)
DWGParser (and some others) not used by AutoDetectParser Key: TIKA-439 URL: https://issues.apache.org/jira/browse/TIKA-439 Project: Tika Issue Type: Bug Affects Versions: 0.7

[jira] Created: (TIKA-440) [Patch] Fetch the composer information in the MP3 Parser

2010-06-15 Thread Nick Burch (JIRA)
[Patch] Fetch the composer information in the MP3 Parser Key: TIKA-440 URL: https://issues.apache.org/jira/browse/TIKA-440 Project: Tika Issue Type: New Feature Components: p

[jira] Updated: (TIKA-440) [Patch] Fetch the composer information in the MP3 Parser

2010-06-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch updated TIKA-440: Attachment: mp3-parser-composer.patch MP3Parser extension to include the composer > [Patch] Fetch the compos

Detecting container formats

2010-06-15 Thread Nick Burch
Hi All I've been thinking about TIKA-391 (intermittent incorrect mime type detection of office formats), and I think we might need to do something different for container formats. At the moment, for OLE2 based files (.xls, .ppt, .doc, .msg, .vsd etc), and for ZIP based files (.zip, but also

Re: Detecting container formats

2010-06-15 Thread Alex Ott
Hello Nick Burch at "Tue, 15 Jun 2010 18:25:13 +0100 (BST)" wrote: NB> Hi All NB> I've been thinking about TIKA-391 (intermittent incorrect mime type detection of office NB> formats), and I think we might need to do something different for container formats. NB> At the moment, for OLE2 ba

[jira] Created: (TIKA-441) Sometimes, tika not working (crashed) because of null classloader

2010-06-15 Thread Alex Ott (JIRA)
Sometimes, tika not working (crashed) because of null classloader - Key: TIKA-441 URL: https://issues.apache.org/jira/browse/TIKA-441 Project: Tika Issue Type: Bug Com

[jira] Updated: (TIKA-441) Sometimes, tika not working (crashed) because of null classloader

2010-06-15 Thread Alex Ott (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alex Ott updated TIKA-441: -- Attachment: classloader-fix.diff proposed patch to fix this issue > Sometimes, tika not working (crashed) becaus

Re: Detecting container formats

2010-06-15 Thread Ken Krugler
I think this is a reasonable approach, as long as (per Alex's suggestion) it's configurable in various ways. E.g. if you know you don't want to parse OLE2-based files, and so have removed the jars for those parsers, then it would be great to have an easy way of disabling the (more expensive) mime-

Re: Detecting container formats

2010-06-15 Thread Alex Ott
Hello Ken Krugler at "Tue, 15 Jun 2010 11:56:51 -0700" wrote: KK> I think this is a reasonable approach, as long as (per Alex's suggestion) it's KK> configurable in various ways. KK> E.g. if you know you don't want to parse OLE2-based files, so you've removed jars for KK> those parsers, the

Re: Detecting container formats

2010-06-15 Thread Mattmann, Chris A (388J)
Hi Ken, and all, FWIW, Tika can handle full regex on glob patterns now via the isregex attribute that I added way back when in TIKA-194 [1]. https://issues.apache.org/jira/browse/TIKA-194 Cheers, Chris On 6/15/10 11:56 AM, "Ken Krugler" wrote: I think this is a reasonable approach, as

Trouble committing to Tika

2010-06-15 Thread Jukka Zitting
Hi, I tried committing some of the recent patches, but I'm getting a "403 Forbidden" error when I run "svn commit" against Tika trunk. Was there some change in svn authorization settings, or am I just doing something wrong? BR, Jukka Zitting

Re: Trouble committing to Tika

2010-06-15 Thread Mattmann, Chris A (388J)
Hey Jukka, That's odd, see r955127 and r955128. I was able to commit simple whitespace changes to README.txt in the trunk. Here are the svn auth lists for Tika: I tried listing UNIX group and PMC membership on people.a.o, but the machine is so slow, the commands were unresponsive... [mat

Re: Trouble committing to Tika

2010-06-15 Thread Mattmann, Chris A (388J)
OK, here we go (the ldaps server on minotaur wasn't responding before): [mattm...@minotaur]/home/mattmann(29): list_committee.pl tika dmeikle jnioche jukka kbennett kkrugler mattmann mharwood ridabenjelloun siren [mattm...@minotaur]/home/mattmann(30): list_committee.pl tika dmeikle jnioche jukka k