[ https://issues.apache.org/jira/browse/PDFBOX-3074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14983130#comment-14983130 ]
Daniel Persson commented on PDFBOX-3074: ---------------------------------------- Thanks for the input. Not found any good information about this embedded content that should not be show though. Transparency groups aren't stacked and only the data inside a marked content is actually a part of a group. Or have I miss understood this concept? > Mark transparency groups > ------------------------ > > Key: PDFBOX-3074 > URL: https://issues.apache.org/jira/browse/PDFBOX-3074 > Project: PDFBox > Issue Type: New Feature > Components: Text extraction > Affects Versions: 2.0.0 > Reporter: Daniel Persson > Priority: Minor > Labels: github-import > Fix For: 2.0.0 > > Attachments: mark_transparency_groups.patch > > > We try to read text from PDF files but some of the files include extra data > that is never shown. These segments are usually grouped in transparency > groups. So for us this function to flag a marked content as a transparency > group is quite useful. > If there is a way to do this please tell me or if there is a better way to > remove text that isn't presented or drawn when the PDF is viewed then I'm all > ears. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org