[jira] [Updated] (TIKA-754) Automatic line break insertion (BR element) instead of '\n' in XHTMLContentHandler

2011-10-18 Thread Pablo Queixalos (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pablo Queixalos updated TIKA-754: - Attachment: TIKA-754.poc.patch Proof of concept: works fine but breaks tests with 33 Failures and

[jira] [Updated] (TIKA-727) Improve the outputed XHTML by HSLFExtractor

2011-09-29 Thread Pablo Queixalos (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pablo Queixalos updated TIKA-727: - Attachment: HSLFExtractor.patch Fixed Class for top level div. Fixed bad getFooterText call. Fixed