[jira] [Commented] (TIKA-4045) DBF/MDB row count extraction

2023-05-19 Thread Nick Burch (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17724302#comment-17724302
 ] 

Nick Burch commented on TIKA-4045:
--

I guess this could also apply for other row-based formats like SQLite or 
Spreadsheets? Though I'm not sure how best to output it on a per-table / 
per-sheet basis.

For the metadata keys, I guess we could re-use the same ones as we added for 
CSV in TIKA-3938 ?

> DBF/MDB row count extraction
> 
>
> Key: TIKA-4045
> URL: https://issues.apache.org/jira/browse/TIKA-4045
> Project: Tika
>  Issue Type: Improvement
>Reporter: Gregory Lepore
>Priority: Minor
>
> It would be quite helpful for my organization to extract the number of 
> records/rows in any given database file format like DBF or MDB. Along with 
> byte count this would give us a good idea of the amount of information stored 
> in the files.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (TIKA-4045) DBF/MDB row count extraction

2023-05-19 Thread Gregory Lepore (Jira)


[ 
https://issues.apache.org/jira/browse/TIKA-4045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17724309#comment-17724309
 ] 

Gregory Lepore commented on TIKA-4045:
--

Yes, it could also apply to other formats, I didn't think about multiple 
sheets. I guess it would have to return an array in those situations. The total 
amout of data returned should be fairly small, just an integer per sheet and 
the total rows each, I think.

> DBF/MDB row count extraction
> 
>
> Key: TIKA-4045
> URL: https://issues.apache.org/jira/browse/TIKA-4045
> Project: Tika
>  Issue Type: Improvement
>Reporter: Gregory Lepore
>Priority: Minor
>
> It would be quite helpful for my organization to extract the number of 
> records/rows in any given database file format like DBF or MDB. Along with 
> byte count this would give us a good idea of the amount of information stored 
> in the files.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)