[ 
https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240982#comment-15240982
 ] 

Tim Allison commented on TIKA-1513:
-----------------------------------

Nope.  I didn't remove them.  There are roughly 3k files whose names end in 
dbf or dbase3 in govdocs1 and in an earlier version of our slice of commoncrawl.
The files may not actually be dbfs, and they're likely truncated (at least 
those that came from commoncrawl).
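
One rough way to triage files like these is to compare the size the header declares with the size actually on disk. The sketch below is only an illustration and is not part of Tika or of the parser mentioned in this issue. It assumes the usual dBASE III header layout (record count at bytes 4-7, header length at bytes 8-9, record length at bytes 10-11, all little-endian), and the function names are hypothetical.

```python
import struct


def dbf_size_check(data: bytes):
    """Return (expected_size, actual_size) based on a dBASE III style header.

    Returns None if there are fewer than 32 bytes, i.e. not even a full
    fixed-size header. Assumes the common dBASE III layout; other variants
    may differ.
    """
    if len(data) < 32:
        return None
    # Bytes 4-7: number of records (uint32, little-endian)
    # Bytes 8-9: header length in bytes (uint16, little-endian)
    # Bytes 10-11: length of each record in bytes (uint16, little-endian)
    n_records, header_len, record_len = struct.unpack_from("<IHH", data, 4)
    expected = header_len + n_records * record_len
    return expected, len(data)


def looks_truncated(data: bytes) -> bool:
    """True if the data is shorter than its own header says it should be."""
    sizes = dbf_size_check(data)
    if sizes is None:
        return True
    expected, actual = sizes
    return actual < expected
```

A file that is not really a dbf will usually yield nonsensical header values, so a wildly implausible expected size is itself a hint. Some writers append a single 0x1A end-of-file byte, so a file one byte longer than expected is not unusual.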

Give [this|http://162.242.228.174/share/dbfs.tar.bz2] a shot.

Thank you, Rackspace! 

> Add mime detection and parsing for dbf files
> --------------------------------------------
>
>                 Key: TIKA-1513
>                 URL: https://issues.apache.org/jira/browse/TIKA-1513
>             Project: Tika
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Minor
>             Fix For: 1.13
>
>
> I just came across an Apache licensed dbf parser that is available on 
> [maven|https://repo1.maven.org/maven2/org/jamel/dbf/dbf-reader/0.1.0/dbf-reader-0.1.0.pom].
> Let's add dbf parsing to Tika.
> Any other recommendations for alternate parsers?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
