[jira] [Commented] (TIKA-3941) Consider having pipesserver return intermediate results

2023-05-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17728139#comment-17728139 ] ASF GitHub Bot commented on TIKA-3941: -- tballison opened a new pull request, #1167: URL:

[jira] [Commented] (TIKA-4048) Gzipped WARC not identifying all assets

2023-05-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4048?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17727988#comment-17727988 ] ASF GitHub Bot commented on TIKA-4048: -- tballison opened a new pull request, #1166: URL:

[jira] [Commented] (TIKA-4055) Write limit not working correctly in RecursiveParserWrapper

2023-05-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17726698#comment-17726698 ] ASF GitHub Bot commented on TIKA-4055: -- tballison merged PR #1156: URL:

[jira] [Commented] (TIKA-4055) Write limit not working correctly in RecursiveParserWrapper

2023-05-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17726673#comment-17726673 ] ASF GitHub Bot commented on TIKA-4055: -- tballison opened a new pull request, #1156: URL:

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2023-05-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721878#comment-17721878 ] ASF GitHub Bot commented on TIKA-4038: -- tballison commented on PR #1130: URL:

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2023-05-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721876#comment-17721876 ] ASF GitHub Bot commented on TIKA-4038: -- tballison merged PR #1130: URL:

[jira] [Commented] (TIKA-4038) Fix dependency problem in tika-parsers-standard-package

2023-05-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721857#comment-17721857 ] ASF GitHub Bot commented on TIKA-4038: -- gastaldi opened a new pull request, #1130: URL:

[jira] [Commented] (TIKA-4035) Enable extraction of file system metadata in FileSystemFetcher

2023-05-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721405#comment-17721405 ] ASF GitHub Bot commented on TIKA-4035: -- tballison merged PR #1126: URL:

[jira] [Commented] (TIKA-4034) Allow configuration of prettyPrint in FileSystemEmitter

2023-05-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721404#comment-17721404 ] ASF GitHub Bot commented on TIKA-4034: -- tballison merged PR #1125: URL:

[jira] [Commented] (TIKA-4035) Enable extraction of file system metadata in FileSystemFetcher

2023-05-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721386#comment-17721386 ] ASF GitHub Bot commented on TIKA-4035: -- tballison opened a new pull request, #1126: URL:

[jira] [Commented] (TIKA-4034) Allow configuration of prettyPrint in FileSystemEmitter

2023-05-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17721381#comment-17721381 ] ASF GitHub Bot commented on TIKA-4034: -- tballison opened a new pull request, #1125: URL:

[jira] [Commented] (TIKA-4033) Improve metadata for incremental updates, take 2

2023-05-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17720948#comment-17720948 ] ASF GitHub Bot commented on TIKA-4033: -- tballison merged PR #1121: URL:

[jira] [Commented] (TIKA-4033) Improve metadata for incremental updates, take 2

2023-05-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17720921#comment-17720921 ] ASF GitHub Bot commented on TIKA-4033: -- tballison opened a new pull request, #1121: URL:

[jira] [Commented] (TIKA-4028) Add detection for common subtitle format

2023-05-05 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719798#comment-17719798 ] ASF GitHub Bot commented on TIKA-4028: -- tballison merged PR #: URL:

[jira] [Commented] (TIKA-4028) Add detection for common subtitle format

2023-05-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719437#comment-17719437 ] ASF GitHub Bot commented on TIKA-4028: -- tledoux commented on PR #: URL:

[jira] [Commented] (TIKA-4028) Add detection for common subtitle format

2023-05-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719391#comment-17719391 ] ASF GitHub Bot commented on TIKA-4028: -- tballison commented on PR #: URL:

[jira] [Commented] (TIKA-4028) Add detection for common subtitle format

2023-05-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719383#comment-17719383 ] ASF GitHub Bot commented on TIKA-4028: -- tledoux opened a new pull request, #: URL:

[jira] [Commented] (TIKA-4027) Improve metadata for incremental updates

2023-05-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719375#comment-17719375 ] ASF GitHub Bot commented on TIKA-4027: -- tballison merged PR #1110: URL:

[jira] [Commented] (TIKA-4027) Improve metadata for incremental updates

2023-05-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17719354#comment-17719354 ] ASF GitHub Bot commented on TIKA-4027: -- tballison opened a new pull request, #1110: URL:

[jira] [Commented] (TIKA-4017) Add optional detection and parsing of incremental updates in PDF

2023-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718997#comment-17718997 ] ASF GitHub Bot commented on TIKA-4017: -- tballison merged PR #1085: URL:

[jira] [Commented] (TIKA-4025) Extract frame count from gifs

2023-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718983#comment-17718983 ] ASF GitHub Bot commented on TIKA-4025: -- tballison merged PR #1108: URL:

[jira] [Commented] (TIKA-4022) Tika not parsing AVI files

2023-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718982#comment-17718982 ] ASF GitHub Bot commented on TIKA-4022: -- tballison merged PR #1107: URL:

[jira] [Commented] (TIKA-4025) Extract frame count from gifs

2023-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718935#comment-17718935 ] ASF GitHub Bot commented on TIKA-4025: -- tballison opened a new pull request, #1108: URL:

[jira] [Commented] (TIKA-4022) Tika not parsing AVI files

2023-05-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718934#comment-17718934 ] ASF GitHub Bot commented on TIKA-4022: -- tballison opened a new pull request, #1107: URL:

[jira] [Commented] (TIKA-2766) Be able to extract raw values from excel, not formatted

2023-04-30 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17718137#comment-17718137 ] ASF GitHub Bot commented on TIKA-2766: -- jtbdevelopment closed pull request #256: TIKA-2766 - be able

[jira] [Commented] (TIKA-4017) Add optional detection and parsing of incremental updates in PDF

2023-04-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17712510#comment-17712510 ] ASF GitHub Bot commented on TIKA-4017: -- tballison opened a new pull request, #1085: URL:

[jira] [Commented] (TIKA-4018) Extract more info from warc files

2023-04-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17712507#comment-17712507 ] ASF GitHub Bot commented on TIKA-4018: -- tballison merged PR #1084: URL:

[jira] [Commented] (TIKA-4018) Extract more info from warc files

2023-04-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17712501#comment-17712501 ] ASF GitHub Bot commented on TIKA-4018: -- tballison opened a new pull request, #1084: URL:

[jira] [Commented] (TIKA-4012) Improve extraction of embedded documents in PDFs

2023-04-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17712020#comment-17712020 ] ASF GitHub Bot commented on TIKA-4012: -- tballison merged PR #1079: URL:

[jira] [Commented] (TIKA-4016) Upgrade to PDFBox 2.0.28

2023-04-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17712019#comment-17712019 ] ASF GitHub Bot commented on TIKA-4016: -- tballison merged PR #1080: URL:

[jira] [Commented] (TIKA-4016) Upgrade to PDFBox 2.0.28

2023-04-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17711995#comment-17711995 ] ASF GitHub Bot commented on TIKA-4016: -- tballison opened a new pull request, #1080: URL:

[jira] [Commented] (TIKA-4012) Improve extraction of embedded documents in PDFs

2023-04-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17711992#comment-17711992 ] ASF GitHub Bot commented on TIKA-4012: -- tballison opened a new pull request, #1079: URL:

[jira] [Commented] (TIKA-4013) Extract rendition information from epub files

2023-04-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17711082#comment-17711082 ] ASF GitHub Bot commented on TIKA-4013: -- tballison merged PR #1071: URL:

[jira] [Commented] (TIKA-4013) Extract rendition information from epub 3.x files

2023-04-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17711074#comment-17711074 ] ASF GitHub Bot commented on TIKA-4013: -- tballison opened a new pull request, #1071: URL:

[jira] [Commented] (TIKA-4011) Add detection for ONIXMessage

2023-04-11 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17710954#comment-17710954 ] ASF GitHub Bot commented on TIKA-4011: -- tballison merged PR #1068: URL:

[jira] [Commented] (TIKA-4011) Add detection for ONIXMessage

2023-04-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-4011?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17710286#comment-17710286 ] ASF GitHub Bot commented on TIKA-4011: -- tballison opened a new pull request, #1068: URL:

[jira] [Commented] (TIKA-3994) Improve audio/mpeg detection

2023-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17707371#comment-17707371 ] ASF GitHub Bot commented on TIKA-3994: -- tballison merged PR #1052: URL:

[jira] [Commented] (TIKA-3991) Improve file detection for canon-raw (crw), cr2 and cr3

2023-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17707370#comment-17707370 ] ASF GitHub Bot commented on TIKA-3991: -- tballison merged PR #1033: URL:

[jira] [Commented] (TIKA-3994) Improve audio/mpeg detection

2023-03-31 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17707275#comment-17707275 ] ASF GitHub Bot commented on TIKA-3994: -- tballison opened a new pull request, #1052: URL:

[jira] [Commented] (TIKA-3993) Improve throttle logic in S3Fetcher

2023-03-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17706554#comment-17706554 ] ASF GitHub Bot commented on TIKA-3993: -- tballison merged PR #1048: URL:

[jira] [Commented] (TIKA-3993) Improve throttle logic in S3Fetcher

2023-03-29 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17706511#comment-17706511 ] ASF GitHub Bot commented on TIKA-3993: -- tballison opened a new pull request, #1048: URL:

[jira] [Commented] (TIKA-3960) PGP encrypted files get detected as application/octet-stream

2023-03-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17706104#comment-17706104 ] ASF GitHub Bot commented on TIKA-3960: -- tballison merged PR #1041: URL:

[jira] [Commented] (TIKA-3991) Improve file detection for canon-raw (crw), cr2 and cr3

2023-03-28 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17706103#comment-17706103 ] ASF GitHub Bot commented on TIKA-3991: -- tballison commented on PR #1033: URL:

[jira] [Commented] (TIKA-3960) PGP encrypted files get detected as application/octet-stream

2023-03-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17704679#comment-17704679 ] ASF GitHub Bot commented on TIKA-3960: -- TayseerSabha opened a new pull request, #1041: URL:

[jira] [Commented] (TIKA-3960) PGP encrypted files get detected as application/octet-stream

2023-03-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17704677#comment-17704677 ] ASF GitHub Bot commented on TIKA-3960: -- TayseerSabha closed pull request #1040: [TIKA-3960] Fix

[jira] [Commented] (TIKA-3991) Improve file detection for canon-raw (crw), cr2 and cr3

2023-03-22 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17703721#comment-17703721 ] ASF GitHub Bot commented on TIKA-3991: -- tballison opened a new pull request, #1033: URL:

[jira] [Commented] (TIKA-3990) Close pkg for regular InputStreams in OOXMLExtractorFactory

2023-03-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17703338#comment-17703338 ] ASF GitHub Bot commented on TIKA-3990: -- tballison merged PR #1030: URL:

[jira] [Commented] (TIKA-3990) Close pkg for regular InputStreams in OOXMLExtractorFactory

2023-03-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17703325#comment-17703325 ] ASF GitHub Bot commented on TIKA-3990: -- tballison opened a new pull request, #1030: URL:

[jira] [Commented] (TIKA-3988) Add Github Action to Lint and Test Charts

2023-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702947#comment-17702947 ] ASF GitHub Bot commented on TIKA-3988: -- lewismc merged PR #13: URL:

[jira] [Commented] (TIKA-3988) Add Github Action to Lint and Test Charts

2023-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702930#comment-17702930 ] ASF GitHub Bot commented on TIKA-3988: -- lewismc closed pull request #13: Test PR for TIKA-3988 URL:

[jira] [Commented] (TIKA-3988) Add Github Action to Lint and Test Charts

2023-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702931#comment-17702931 ] ASF GitHub Bot commented on TIKA-3988: -- lewismc opened a new pull request, #13: URL:

[jira] [Commented] (TIKA-3649) Perform findbugs static analysis on the project and address the issues

2023-03-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702743#comment-17702743 ] ASF GitHub Bot commented on TIKA-3649: -- dk2k closed pull request #499: TIKA-3649 fixes for report of

[jira] [Commented] (TIKA-3988) Add Github Action to Lint and Test Charts

2023-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702416#comment-17702416 ] ASF GitHub Bot commented on TIKA-3988: -- lewismc opened a new pull request, #13: URL:

[jira] [Commented] (TIKA-3988) Add Github Action to Lint and Test Charts

2023-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702415#comment-17702415 ] ASF GitHub Bot commented on TIKA-3988: -- lewismc merged PR #12: URL:

[jira] [Commented] (TIKA-3988) Add Github Action to Lint and Test Charts

2023-03-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17702414#comment-17702414 ] ASF GitHub Bot commented on TIKA-3988: -- lewismc opened a new pull request, #12: URL:

[jira] [Commented] (TIKA-3987) Add a parser for ActiveMime

2023-03-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17700836#comment-17700836 ] ASF GitHub Bot commented on TIKA-3987: -- tballison merged PR #1017: URL:

[jira] [Commented] (TIKA-3987) Add a parser for ActiveMime

2023-03-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3987?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17700832#comment-17700832 ] ASF GitHub Bot commented on TIKA-3987: -- tballison opened a new pull request, #1017: URL:

[jira] [Commented] (TIKA-3986) JDBCEmitter should strip \u0000 for postgres varchar/strings

2023-03-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17700281#comment-17700281 ] ASF GitHub Bot commented on TIKA-3986: -- tballison merged PR #1012: URL:

[jira] [Commented] (TIKA-3986) JDBCEmitter should strip \u0000 for postgres varchar/strings

2023-03-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17700254#comment-17700254 ] ASF GitHub Bot commented on TIKA-3986: -- tballison opened a new pull request, #1012: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-03-08 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17698026#comment-17698026 ] ASF GitHub Bot commented on TIKA-3452: -- frascu commented on PR #4: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-03-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17696358#comment-17696358 ] ASF GitHub Bot commented on TIKA-3452: -- frascu commented on PR #4: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-03-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17696236#comment-17696236 ] ASF GitHub Bot commented on TIKA-3452: -- lewismc commented on PR #4: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-03-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17696235#comment-17696235 ] ASF GitHub Bot commented on TIKA-3452: -- lewismc merged PR #4: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-03-01 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17695032#comment-17695032 ] ASF GitHub Bot commented on TIKA-3452: -- frascu commented on PR #4: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17694154#comment-17694154 ] ASF GitHub Bot commented on TIKA-3979: -- apismensky commented on PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17694134#comment-17694134 ] ASF GitHub Bot commented on TIKA-3979: -- nddipiazza commented on PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-27 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17694096#comment-17694096 ] ASF GitHub Bot commented on TIKA-3979: -- apismensky commented on PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17693508#comment-17693508 ] ASF GitHub Bot commented on TIKA-3979: -- nddipiazza merged PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17693507#comment-17693507 ] ASF GitHub Bot commented on TIKA-3979: -- nddipiazza commented on PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17693506#comment-17693506 ] ASF GitHub Bot commented on TIKA-3979: -- nddipiazza commented on PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17693338#comment-17693338 ] ASF GitHub Bot commented on TIKA-3979: -- tballison commented on PR #985: URL:

[jira] [Commented] (TIKA-3979) OneNoteParser - Improve performance for deserialization

2023-02-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17693332#comment-17693332 ] ASF GitHub Bot commented on TIKA-3979: -- davidxie-glean opened a new pull request, #985: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-02-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17692811#comment-17692811 ] ASF GitHub Bot commented on TIKA-3452: -- lewismc commented on PR #4: URL:

[jira] [Commented] (TIKA-3970) Certain OneNote documents produce duplicate text

2023-02-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17692697#comment-17692697 ] ASF GitHub Bot commented on TIKA-3970: -- tballison commented on PR #975: URL:

[jira] [Commented] (TIKA-3970) Certain OneNote documents produce duplicate text

2023-02-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17692696#comment-17692696 ] ASF GitHub Bot commented on TIKA-3970: -- tballison merged PR #975: URL:

[jira] [Commented] (TIKA-3970) Certain OneNote documents produce duplicate text

2023-02-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17691920#comment-17691920 ] ASF GitHub Bot commented on TIKA-3970: -- nddipiazza commented on PR #975: URL:

[jira] [Commented] (TIKA-3970) Certain OneNote documents produce duplicate text

2023-02-21 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17691918#comment-17691918 ] ASF GitHub Bot commented on TIKA-3970: -- nddipiazza opened a new pull request, #975: URL:

[jira] [Commented] (TIKA-3976) Allow users to configure behavior for zero-byte files

2023-02-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17690528#comment-17690528 ] ASF GitHub Bot commented on TIKA-3976: -- tballison merged PR #972: URL:

[jira] [Commented] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly.

2023-02-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17690485#comment-17690485 ] ASF GitHub Bot commented on TIKA-2689: -- tballison merged PR #954: URL:

[jira] [Commented] (TIKA-3977) JDBCEmitter and JDBCReporter should allow post connection sql call

2023-02-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17690486#comment-17690486 ] ASF GitHub Bot commented on TIKA-3977: -- tballison merged PR #971: URL:

[jira] [Commented] (TIKA-3977) JDBCEmitter and JDBCReporter should allow post connection sql call

2023-02-17 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17690468#comment-17690468 ] ASF GitHub Bot commented on TIKA-3977: -- tballison opened a new pull request, #971: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system

2023-02-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17689659#comment-17689659 ] ASF GitHub Bot commented on TIKA-3452: -- frascu commented on PR #4: URL:

[jira] [Commented] (TIKA-3452) java.nio.file.FileSystemException Read-only file system in 2.0.0-BETA tika-docker

2023-02-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17689313#comment-17689313 ] ASF GitHub Bot commented on TIKA-3452: -- lewismc commented on PR #4: URL:

[jira] [Commented] (TIKA-2689) *.ai type (Adobe illustrator ) files are not detected correctly.

2023-02-10 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17687212#comment-17687212 ] ASF GitHub Bot commented on TIKA-2689: -- tballison opened a new pull request, #954: URL:

[jira] [Commented] (TIKA-3968) Reconstruct embedded file names from associated emf files within docx files

2023-02-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685447#comment-17685447 ] ASF GitHub Bot commented on TIKA-3968: -- tballison merged PR #948: URL:

[jira] [Commented] (TIKA-3968) Reconstruct embedded file names from recent docx files

2023-02-07 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17685382#comment-17685382 ] ASF GitHub Bot commented on TIKA-3968: -- tballison opened a new pull request, #948: URL:

[jira] [Commented] (TIKA-3967) Bump geo-api, fix convergence error

2023-02-06 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684789#comment-17684789 ] ASF GitHub Bot commented on TIKA-3967: -- tballison merged PR #941: URL:

[jira] [Commented] (TIKA-3967) Bump geo-api, fix convergence error

2023-02-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17684003#comment-17684003 ] ASF GitHub Bot commented on TIKA-3967: -- THausherr commented on PR #941: URL:

[jira] [Commented] (TIKA-3967) Bump geo-api, fix convergence error

2023-02-03 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17683919#comment-17683919 ] ASF GitHub Bot commented on TIKA-3967: -- tballison opened a new pull request, #941: URL:

[jira] [Commented] (TIKA-3963) HTML author isn't mapped to its dc:creator counterpart

2023-02-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17683578#comment-17683578 ] ASF GitHub Bot commented on TIKA-3963: -- tballison merged PR #935: URL:

[jira] [Commented] (TIKA-3963) HTML author isn't mapped to its dc:creator counterpart

2023-02-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17683565#comment-17683565 ] ASF GitHub Bot commented on TIKA-3963: -- tballison commented on PR #935: URL:

[jira] [Commented] (TIKA-3963) HTML author isn't mapped to its dc:creator counterpart

2023-02-02 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17683557#comment-17683557 ] ASF GitHub Bot commented on TIKA-3963: -- tballison opened a new pull request, #935: URL:

[jira] [Commented] (TIKA-3957) Refactor email date parsing

2023-01-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17678644#comment-17678644 ] ASF GitHub Bot commented on TIKA-3957: -- tballison merged PR #910: URL:

[jira] [Commented] (TIKA-3957) Refactor email date parsing

2023-01-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17678420#comment-17678420 ] ASF GitHub Bot commented on TIKA-3957: -- tballison opened a new pull request, #910: URL:

[jira] [Commented] (TIKA-3213) Consider migrating universalcharsetdetector to a live fork

2022-12-16 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648605#comment-17648605 ] ASF GitHub Bot commented on TIKA-3213: -- tballison merged PR #862: URL:

[jira] [Commented] (TIKA-3213) Consider migrating universalcharsetdetector to a live fork

2022-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648270#comment-17648270 ] ASF GitHub Bot commented on TIKA-3213: -- tballison opened a new pull request, #862: URL:

[jira] [Commented] (TIKA-3946) Drop need for node in config

2022-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648207#comment-17648207 ] ASF GitHub Bot commented on TIKA-3946: -- tballison merged PR #861: URL:

[jira] [Commented] (TIKA-3946) Drop need for node in config

2022-12-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648157#comment-17648157 ] ASF GitHub Bot commented on TIKA-3946: -- tballison opened a new pull request, #861: URL:

[jira] [Commented] (TIKA-3945) Improve serialization for composite pipes reporter

2022-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17646796#comment-17646796 ] ASF GitHub Bot commented on TIKA-3945: -- tballison merged PR #852: URL:

[jira] [Commented] (TIKA-3945) Improve serialization for composite pipes reporter

2022-12-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17646768#comment-17646768 ] ASF GitHub Bot commented on TIKA-3945: -- tballison opened a new pull request, #852: URL:

<    1   2   3   4   5   6   7   8   9   10   >