[
https://issues.apache.org/jira/browse/TIKA-1513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15240982#comment-15240982
]
Tim Allison commented on TIKA-1513:
-----------------------------------
Nope. Didn't remove them. There are roughly 3k files that ended with dbf or
dbase3 in govdocs1 and an earlier version of our slice of commoncrawl.
The files may not actually be dbfs, and they're likely truncated (at least
those that came from commoncrawl).
Give [this|http://162.242.228.174/share/dbfs.tar.bz2] a shot.
Thank you, Rackspace!
> Add mime detection and parsing for dbf files
> --------------------------------------------
>
> Key: TIKA-1513
> URL: https://issues.apache.org/jira/browse/TIKA-1513
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Minor
> Fix For: 1.13
>
>
> I just came across an Apache licensed dbf parser that is available on
> [maven|https://repo1.maven.org/maven2/org/jamel/dbf/dbf-reader/0.1.0/dbf-reader-0.1.0.pom].
> Let's add dbf parsing to Tika.
> Any other recommendations for alternate parsers?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)