[ 
https://issues.apache.org/jira/browse/TIKA-3721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17526655#comment-17526655
 ] 

Tim Allison commented on TIKA-3721:
-----------------------------------

Tika's SummaryExtractor (based on POI) works on these files.

{noformat}
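        import java.io.File;
        import java.util.Iterator;
        import org.apache.poi.poifs.filesystem.DirectoryNode;
        import org.apache.poi.poifs.filesystem.DocumentEntry;
        import org.apache.poi.poifs.filesystem.Entry;
        import org.apache.poi.poifs.filesystem.POIFSFileSystem;
        import org.apache.tika.metadata.Metadata;
        import org.apache.tika.parser.microsoft.SummaryExtractor;

        // Runs inside a method that declares "throws Exception".
        for (File f : new File("....tika-dgn-detector/src/test/resources/dgn" +
                "/dgn8").listFiles() ) {
            // Open read-only; try-with-resources releases the file handle.
            try (POIFSFileSystem pfs = new POIFSFileSystem(f, true)) {
                DirectoryNode root = pfs.getRoot();
                System.out.println("file: " + f.getName());
                Metadata metadata = new Metadata();
                SummaryExtractor summaryExtractor = new SummaryExtractor(metadata);
                summaryExtractor.parseSummaries(root);
                System.out.println("ENTRIES");
                // List the top-level OLE2 entries and whether each is a stream or a storage.
                for (Iterator<Entry> it = root.getEntries(); it.hasNext(); ) {
                    Entry e = it.next();
                    String which = "directory";
                    if (e instanceof DocumentEntry) {
                        which = "document";
                    }
                    System.out.println(e.getName() + " : " + which);
                }
                System.out.println("METADATA");
                // debug(...) is a local helper not shown here; presumably it prints the metadata.
                debug(metadata);
                System.out.println("");
            }
        }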
{noformat}
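
The debug(metadata) helper called in the snippet isn't shown. A minimal sketch of what it might look like, assuming it just prints every metadata key with each of its values, is below. The class name MetadataDebug and the format(...) method are hypothetical, and the helper takes the names plus a lookup function instead of a Metadata object so that it compiles without Tika on the classpath.

```java
import java.util.Arrays;
import java.util.function.Function;

// Hypothetical stand-in for the debug(...) helper; not taken from the Tika code base.
class MetadataDebug {

    // Builds one "name : value" line per value, with the names in sorted order.
    static String format(String[] names, Function<String, String[]> lookup) {
        String[] sorted = names.clone();
        Arrays.sort(sorted);
        StringBuilder sb = new StringBuilder();
        for (String name : sorted) {
            for (String value : lookup.apply(name)) {
                sb.append(name).append(" : ").append(value).append('\n');
            }
        }
        return sb.toString();
    }

    // Prints the formatted lines to stdout.
    static void debug(String[] names, Function<String, String[]> lookup) {
        System.out.print(format(names, lookup));
    }

    public static void main(String[] args) {
        // Made-up sample data, only to show the call shape.
        debug(new String[] {"title", "author"},
                name -> name.equals("author")
                        ? new String[] {"first", "second"}
                        : new String[] {"untitled"});
    }
}
```

With Tika on the classpath, the call in the loop would become MetadataDebug.debug(metadata.names(), metadata::getValues), since Metadata exposes names() and getValues(String).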

> DGN parser
> ----------
>
>                 Key: TIKA-3721
>                 URL: https://issues.apache.org/jira/browse/TIKA-3721
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>    Affects Versions: 2.3.0
>            Reporter: Dan Coldrick
>            Priority: Minor
>         Attachments: dgn8s-dumped.txt, image-2022-04-22-20-00-45-704.png, 
> image-2022-04-22-20-01-09-564.png, image-2022-04-22-20-02-24-180.png
>
>
> Does anyone have any experience with the DGN file format by MicroStation? I 
> see TIKA doesn't have a parser so would it be possible to create one? 
> https://docs.fileformat.com/cad/dgn/



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
