This is an automated email from the ASF dual-hosted git repository. pottlinger pushed a change to branch feature/RAT-369 in repository https://gitbox.apache.org/repos/asf/creadur-rat.git
from 959db49e INFRA-25758: try to cleanUp the workspace via another method add 261e5624 INFRA-25758: try to cleanUp the workspace via another method add b08771f1 Switched to Tika to determine document type add a8adca19 cleaned up DefaultAnalyserFactory add 92506ef1 fixed pom issues add 2967679a Update to filter *.json files. add 533c0f42 cleaned up code add 156da8f3 updated changelog add 92b95665 updated Default to make static methods for directoriesToIgnore and filesToIgnore add 89aebe68 updated description of change add 95820883 fixed threading issues with ClaimStatistic and simplified code add 0f6082bb added tests and cleanedup DirectoryWalker add 9aedf3c1 extracted TikaProcessor from DefaultAnalyserFactory to facilitate testing add d8c999e7 removed special handling for *.json, fixed path issues on Windows add acae8716 Fixed mojo test add 3edfc485 test fir windows output add 70b313d8 fixed windows tests add 54146105 checkstyle cleanup add 10330afc fixed access issue add 2cf61b45 fixed typos add ff066cde LHF: remove comment add cf9b9cf6 LHF: fix typo add 07a0b111 Convert from Windows line endings add 7c210e0c completed BinaryGuessTest migration to other tests add 64dcfd2c LHF: Unused import add e84ebdb7 RAT-301: Ensure Chinese characters are not categorized as binary add d0534b8b RAT-301: Change example to non-meaningful Java to not find this class in IDE searches in RAT's codebase add e11553d3 RAT-301: Add changelog after integrating the file as test add 1b919dee RAT-301: Fix test setup after renaming file add 46eaed82 RAT-150: Code cleanup add dd8fc904 added additional tikaFiles and a README document add 8fcb1cf4 Merge pull request #240 from Claudenw/tika_based_document_analyzer new b7524f3a Fix changelog The 1 revisions listed above as "new" are entirely new to this repository and will be described in separate emails. The revisions listed as "add" were already present in the repository and have only been added to this reference. Summary of changes: apache-rat-core/pom.xml | 8 +- .../src/main/java/org/apache/rat/Defaults.java | 53 ++- .../src/main/java/org/apache/rat/Report.java | 10 +- .../java/org/apache/rat/ReportConfiguration.java | 36 +- .../rat/analysis/DefaultAnalyserFactory.java | 32 +- .../org/apache/rat/analysis/TikaProcessor.java | 142 ++++++++ .../analysis/matchers/AbstractHeaderMatcher.java | 2 - .../apache/rat/analysis/matchers/NotMatcher.java | 2 - .../src/main/java/org/apache/rat/api/Document.java | 25 +- .../src/main/java/org/apache/rat/api/MetaData.java | 18 + .../rat/document/impl/guesser/ArchiveGuesser.java | 66 ---- .../rat/document/impl/guesser/BinaryGuesser.java | 372 --------------------- .../org/apache/rat/header/LineNumberReader.java | 130 +++---- .../apache/rat/report/claim/ClaimStatistic.java | 142 ++++++-- .../report/claim/impl/AbstractClaimReporter.java | 18 +- .../rat/report/claim/impl/ClaimAggregator.java | 43 +-- .../java/org/apache/rat/walker/ArchiveWalker.java | 28 +- .../org/apache/rat/walker/DirectoryWalker.java | 86 ++--- .../main/java/org/apache/rat/walker/Walker.java | 52 ++- .../org/apache/rat/ReportConfigurationTest.java | 40 ++- .../src/test/java/org/apache/rat/ReportTest.java | 15 +- .../src/test/java/org/apache/rat/ReporterTest.java | 26 +- .../apache/rat/analysis/AnalyserFactoryTest.java | 59 +++- .../org/apache/rat/analysis/TikaProcessorTest.java | 168 ++++++++++ .../document/impl/guesser/ArchiveGuesserTest.java | 66 ---- .../document/impl/guesser/BinaryGuesserTest.java | 150 --------- .../apache/rat/report/ConfigurationReportTest.java | 5 - .../rat/report/xml/XmlReportFactoryTest.java | 19 +- .../org/apache/rat/walker/DirectoryWalkerTest.java | 99 ++++-- .../resources/jira/RAT147/unix-newlines.txt.bin | 8 + .../resources/jira/RAT147/windows-newlines.txt.bin | 9 + .../RAT211/leader-election-message-arrives.dia | Bin 0 -> 5796 bytes .../src/test/resources/jira/RAT211/side_left.bmp | Bin 0 -> 345238 bytes .../src/test/resources/tikaFiles/README.md | 11 + .../{elements => tikaFiles/archive}/dummy.jar | Bin .../test/resources/tikaFiles/binary/Defaults.class | Bin 0 -> 7280 bytes .../src/test/resources/tikaFiles/binary/Image.gif | Bin 0 -> 74 bytes .../src/test/resources/tikaFiles/binary/Image.jpeg | Bin 0 -> 4803 bytes .../src/test/resources/tikaFiles/binary/Image.jpg | Bin 0 -> 4803 bytes .../{elements => tikaFiles/binary}/Image.png | Bin .../src/test/resources/tikaFiles/binary/Image.psd | Bin 0 -> 2499 bytes .../src/test/resources/tikaFiles/binary/Image.xcf | Bin 0 -> 1565 bytes .../test/resources/tikaFiles/binary/KeyStore.jks | Bin 0 -> 4914 bytes .../src/test/resources/tikaFiles/binary/file.json | 7 + .../resources/tikaFiles/binary/maven_libjansi.so | Bin 0 -> 98380 bytes .../src/test/resources/tikaFiles/binary/test.aif | Bin 0 -> 3894 bytes .../src/test/resources/tikaFiles/binary/test.au | Bin 0 -> 3868 bytes .../src/test/resources/tikaFiles/binary/test.exe | Bin 0 -> 1024 bytes .../src/test/resources/tikaFiles/binary/test.flv | Bin 0 -> 90580 bytes .../src/test/resources/tikaFiles/binary/test.iff | Bin 0 -> 1060 bytes .../test/resources/tikaFiles/binary/test.keystore | Bin 0 -> 4914 bytes .../src/test/resources/tikaFiles/binary/test.mid | Bin 0 -> 322 bytes .../src/test/resources/tikaFiles/binary/test.mp3 | Bin 0 -> 2668637 bytes .../src/test/resources/tikaFiles/binary/test.ogg | Bin 0 -> 5418 bytes .../src/test/resources/tikaFiles/binary/test.swf | Bin 0 -> 180 bytes .../resources/tikaFiles/binary/test.truststore | Bin 0 -> 1302 bytes .../src/test/resources/tikaFiles/binary/test.wav | Bin 0 -> 3884 bytes .../src/test/resources/tikaFiles/binary/test.wma | Bin 0 -> 27747 bytes .../resources/tikaFiles/binary/testMP3i18n.mp3 | Bin 0 -> 40832 bytes .../tikaFiles/binary/testMP3i18n_truncated.mp3 | Bin 0 -> 40672 bytes .../resources/tikaFiles/binary/testMP3id3v1.mp3 | Bin 0 -> 39416 bytes .../resources/tikaFiles/binary/testMP3id3v1_v2.mp3 | Bin 0 -> 40960 bytes .../resources/tikaFiles/binary/testMP3id3v2.mp3 | Bin 0 -> 39577 bytes .../resources/tikaFiles/binary/testMP3id3v24.mp3 | Bin 0 -> 39471 bytes .../resources/tikaFiles/binary/testMP3lyrics.mp3 | Bin 0 -> 34688 bytes .../resources/tikaFiles/binary/testMP3noid3.mp3 | Bin 0 -> 39288 bytes .../tikaFiles/binary/testMP3truncated.mp3 | Bin 0 -> 65536 bytes .../test/resources/tikaFiles/binary/testMP4.m4a | Bin 0 -> 4770 bytes .../tikaFiles/binary/testMP4_truncated.m4a | Bin 0 -> 74 bytes .../tikaFiles/binary/testNakedUTF16BOM.mp3 | Bin 0 -> 2625 bytes .../test/resources/tikaFiles/binary/truststore.jks | Bin 0 -> 1302 bytes .../src/test/resources/tikaFiles/ca-cert | 22 ++ .../src/test/resources/tikaFiles/ca-key | 30 ++ .../src/test/resources/tikaFiles/cert-file | 19 ++ .../src/test/resources/tikaFiles/cert-signed | 21 ++ .../{elements => tikaFiles/notice}/LICENSE | 0 .../{elements => tikaFiles/notice}/NOTICE | 0 .../tikaFiles/standard/ChineseCommentsJava.java | 42 +++ .../tikaFiles/standard/HelloWorld.groovy} | 12 +- .../standard/HelloWorld.java} | 8 +- .../test/resources/tikaFiles/standard/Image.pdf | Bin 0 -> 2390 bytes .../standard}/UTF16_with_signature.xml | Bin .../standard}/UTF8_with_signature.xml | 0 .../standard/file.C} | 6 +- .../standard/file.CPP} | 8 +- .../standard/file.c} | 6 +- .../standard/file.c++} | 8 +- .../standard/file.cc} | 8 +- .../standard/file.cp} | 8 +- .../standard/file.cpp} | 8 +- .../src/test/resources/tikaFiles/standard/file.csv | 1 + .../standard/file.cxx} | 8 +- .../standard/file.plain} | 4 +- .../src/test/resources/tikaFiles/standard/file.tsv | 1 + .../java/org/apache/rat/mp/AbstractRatMojo.java | 6 +- .../main/java/org/apache/rat/mp/RatCheckMojo.java | 2 +- .../java/org/apache/rat/mp/RatCheckMojoTest.java | 15 +- .../main/java/org/apache/rat/anttasks/Report.java | 2 +- checkstyle-suppressions.xml | 26 ++ pom.xml | 10 +- src/changes/changes.xml | 26 +- 101 files changed, 1156 insertions(+), 1098 deletions(-) create mode 100644 apache-rat-core/src/main/java/org/apache/rat/analysis/TikaProcessor.java delete mode 100644 apache-rat-core/src/main/java/org/apache/rat/document/impl/guesser/ArchiveGuesser.java delete mode 100644 apache-rat-core/src/main/java/org/apache/rat/document/impl/guesser/BinaryGuesser.java create mode 100644 apache-rat-core/src/test/java/org/apache/rat/analysis/TikaProcessorTest.java delete mode 100644 apache-rat-core/src/test/java/org/apache/rat/document/impl/guesser/ArchiveGuesserTest.java delete mode 100644 apache-rat-core/src/test/java/org/apache/rat/document/impl/guesser/BinaryGuesserTest.java create mode 100644 apache-rat-core/src/test/resources/jira/RAT147/unix-newlines.txt.bin create mode 100644 apache-rat-core/src/test/resources/jira/RAT147/windows-newlines.txt.bin create mode 100644 apache-rat-core/src/test/resources/jira/RAT211/leader-election-message-arrives.dia create mode 100644 apache-rat-core/src/test/resources/jira/RAT211/side_left.bmp create mode 100644 apache-rat-core/src/test/resources/tikaFiles/README.md copy apache-rat-core/src/test/resources/{elements => tikaFiles/archive}/dummy.jar (100%) create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/Defaults.class create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/Image.gif create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/Image.jpeg create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/Image.jpg copy apache-rat-core/src/test/resources/{elements => tikaFiles/binary}/Image.png (100%) create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/Image.psd create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/Image.xcf create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/KeyStore.jks create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/file.json create mode 100755 apache-rat-core/src/test/resources/tikaFiles/binary/maven_libjansi.so create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.aif create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.au create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.exe create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.flv create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.iff create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.keystore create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.mid create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.ogg create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.swf create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.truststore create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.wav create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/test.wma create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3i18n.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3i18n_truncated.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3id3v1.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3id3v1_v2.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3id3v2.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3id3v24.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3lyrics.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3noid3.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP3truncated.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP4.m4a create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testMP4_truncated.m4a create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/testNakedUTF16BOM.mp3 create mode 100644 apache-rat-core/src/test/resources/tikaFiles/binary/truststore.jks create mode 100644 apache-rat-core/src/test/resources/tikaFiles/ca-cert create mode 100644 apache-rat-core/src/test/resources/tikaFiles/ca-key create mode 100644 apache-rat-core/src/test/resources/tikaFiles/cert-file create mode 100644 apache-rat-core/src/test/resources/tikaFiles/cert-signed copy apache-rat-core/src/test/resources/{elements => tikaFiles/notice}/LICENSE (100%) copy apache-rat-core/src/test/resources/{elements => tikaFiles/notice}/NOTICE (100%) create mode 100644 apache-rat-core/src/test/resources/tikaFiles/standard/ChineseCommentsJava.java copy apache-rat-core/src/test/{java/org/apache/rat/report/claim/impl/xml/MockClaimReporter.java => resources/tikaFiles/standard/HelloWorld.groovy} (87%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/HelloWorld.java} (87%) create mode 100644 apache-rat-core/src/test/resources/tikaFiles/standard/Image.pdf copy apache-rat-core/src/test/resources/{binaries => tikaFiles/standard}/UTF16_with_signature.xml (100%) copy apache-rat-core/src/test/resources/{binaries => tikaFiles/standard}/UTF8_with_signature.xml (100%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.C} (94%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.CPP} (92%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.c} (94%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.c++} (92%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.cc} (92%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.cp} (92%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.cpp} (92%) create mode 100644 apache-rat-core/src/test/resources/tikaFiles/standard/file.csv copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.cxx} (92%) copy apache-rat-core/src/test/resources/{org/apache/rat/ReportConfigurationTestFile => tikaFiles/standard/file.plain} (97%) create mode 100644 apache-rat-core/src/test/resources/tikaFiles/standard/file.tsv create mode 100644 checkstyle-suppressions.xml