[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-10-23 Thread Josh McCullough (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17778820#comment-17778820 ] Josh McCullough commented on TIKA-3992: --- `las` file-type detection is returning `app

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-06-30 Thread Gregory Lepore (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17739089#comment-17739089 ] Gregory Lepore commented on TIKA-3992: -- Got a chance to download the May/June CommonC

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-05-24 Thread Gregory Lepore (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17725873#comment-17725873 ] Gregory Lepore commented on TIKA-3992: -- Looking at the full-table.csv file there are

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-03-31 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17707234#comment-17707234 ] Tim Allison commented on TIKA-3992: --- I added subtasks for the more common ones. There a

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17707027#comment-17707027 ] Tim Allison commented on TIKA-3992: --- I just attached all the tables. It looks like we h

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-03-30 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17706997#comment-17706997 ] Tim Allison commented on TIKA-3992: --- It turns out that a bunch of those 600k octet-strea

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-03-29 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17706405#comment-17706405 ] Tim Allison commented on TIKA-3992: --- Ah, that's helpful. Thank you! By "truncated", I w

[jira] [Commented] (TIKA-3992) Add common missing mimes based on Common Crawl data

2023-03-29 Thread Andrew Jackson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17706400#comment-17706400 ] Andrew Jackson commented on TIKA-3992: -- Sounds interesting! Just wanted to note that