[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-08-11 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17397511#comment-17397511 ] Hudson commented on TIKA-3489: -- UNSTABLE: Integrated in Jenkins build Tika » tika-branch1x-jdk8 #144 (See

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-08-09 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17396043#comment-17396043 ] Sebastian Nagel commented on TIKA-3489: --- +1 to leave it as is. A backport definitely makes sense, in

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-22 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385761#comment-17385761 ] Tim Allison commented on TIKA-3489: --- So, I'll leave this as is {{text/x-robots}} and backport to the 1.x

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-22 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385631#comment-17385631 ] Sebastian Nagel commented on TIKA-3489: --- [~nick]: agreed, sounds plausible. > Robots.txt files

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-22 Thread Nick Burch (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385514#comment-17385514 ] Nick Burch commented on TIKA-3489: -- I'm not keen on us throwing away information we can easily return to

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-22 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385470#comment-17385470 ] Tim Allison commented on TIKA-3489: --- Will change to {{text/plain}} today unless [~nick] chimes in with a

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-21 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17385061#comment-17385061 ] Hudson commented on TIKA-3489: -- FAILURE: Integrated in Jenkins build Tika » tika-main-jdk8 #286 (See

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-21 Thread Sebastian Nagel (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384992#comment-17384992 ] Sebastian Nagel commented on TIKA-3489: --- The [robots.txt RFC

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-21 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384966#comment-17384966 ] Tim Allison commented on TIKA-3489: --- I added mime detection for robots.txt in {{main}} with mime

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-21 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384931#comment-17384931 ] Tim Allison commented on TIKA-3489: --- [~nick], any recommendations? {{text/x-robots}} subtype of

[jira] [Commented] (TIKA-3489) Robots.txt files frequently identified as message/rfc822

2021-07-21 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17384905#comment-17384905 ] Tim Allison commented on TIKA-3489: --- Should we try to detect robots.txt files as their own mime? >