[ https://issues.apache.org/jira/browse/TIKA-3721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17526655#comment-17526655 ]
Tim Allison commented on TIKA-3721:
-----------------------------------

Tika's SummaryExtractor (based on POI) works on these files.

{noformat}
for (File f : new File("....tika-dgn-detector/src/test/resources/dgn" + "/dgn8").listFiles()) {
    // Open the file read-only as an OLE2 (POIFS) container.
    POIFSFileSystem pfs = new POIFSFileSystem(f, true);
    DirectoryNode root = pfs.getRoot();
    System.out.println("file: " + f.getName());

    // Extract the summary information streams into Tika metadata.
    Metadata metadata = new Metadata();
    SummaryExtractor summaryExtractor = new SummaryExtractor(metadata);
    summaryExtractor.parseSummaries(root);

    // List the top-level entries and whether each is a document or a directory.
    System.out.println("ENTRIES");
    for (Iterator<Entry> it = root.getEntries(); it.hasNext(); ) {
        Entry e = it.next();
        String which = "directory";
        if (e instanceof DocumentEntry) {
            which = "document";
        }
        System.out.println(e.getName() + " : " + which);
    }

    System.out.println("METADATA");
    // debug(...) is a helper that is not shown in this comment.
    debug(metadata);
    System.out.println("");
}
{noformat}

> DGN parser
> ----------
>
>                 Key: TIKA-3721
>                 URL: https://issues.apache.org/jira/browse/TIKA-3721
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 2.3.0
>            Reporter: Dan Coldrick
>            Priority: Minor
>         Attachments: dgn8s-dumped.txt, image-2022-04-22-20-00-45-704.png, image-2022-04-22-20-01-09-564.png, image-2022-04-22-20-02-24-180.png
>
>
> Does anyone have any experience with the DGN file format by MicroStation? I
> see TIKA doesn't have a parser so would it be possible to create one?
> https://docs.fileformat.com/cad/dgn/

--
This message was sent by Atlassian Jira
(v8.20.7#820007)