https://issues.apache.org/bugzilla/show_bug.cgi?id=51320

             Bug #: 51320
           Summary: Determine whether parts other than QuillContents may
                    contain useful text to extract and if so, support
                    extraction from those
           Product: POI
           Version: 3.2-FINAL
          Platform: PC
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HPBF
        AssignedTo: [email protected]
        ReportedBy: [email protected]
    Classification: Unclassified


Right now, only QuillContents is taken into account when extracting text.

It seems worth researching whether any useful text may be extraced from the
Main and the Escher parts.

This is related to 51317 - Need ability to stream and chunk data out of MS
Publisher documents. If any extra parts get exposed we'd ideally want streaming
available on it.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to