[ https://issues.apache.org/jira/browse/PDFBOX-4874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17130062#comment-17130062 ]
Tilman Hausherr commented on PDFBOX-4874:
-----------------------------------------

Please include more of the stack trace. How is this related to PDFBox?

> ERROR [TikaTranscriptExtractor] Error reading transcript from document
> ----------------------------------------------------------------------
>
>                 Key: PDFBOX-4874
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4874
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>            Reporter: Dushyanth Balasubramanian
>            Priority: Major
>
> [Fatal Error] :1547:3: The element type "div" must be terminated by the matching end-tag "</div>".
> [Fatal Error] :1547:3: The element type "div" must be terminated by the matching end-tag "</div>".
> ERROR [TikaTranscriptExtractor] Error reading transcript from document
> org.xml.sax.SAXParseException; lineNumber: 1547; columnNumber: 3; The element type "div" must be terminated by the matching end-tag "</div>".
>     at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:257)
>     at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:339)

--
This message was sent by Atlassian Jira
(v8.3.4#803005)