[ https://issues.apache.org/jira/browse/SOLR-887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Shalin Shekhar Mangar updated SOLR-887:
---------------------------------------

    Attachment: SOLR-887.patch

Thanks for the patch, Ahmed.

Changes:
# Generated patch from the correct directory
# Use StringBuilder instead of StringBuffer

It would be nice to have this class handle HTML text coming from java.sql.Clob and java.sql.Blob types too (for an example, see the FieldReaderDataSource#getData method).

> HTMLStripTransformer for DIH
> ----------------------------
>
>                 Key: SOLR-887
>                 URL: https://issues.apache.org/jira/browse/SOLR-887
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - DataImportHandler
>    Affects Versions: 1.3
>            Reporter: Ahmed Hammad
>            Assignee: Shalin Shekhar Mangar
>            Priority: Minor
>             Fix For: 1.4
>
>         Attachments: patch-887.patch, SOLR-887.patch
>
>
> A Transformer implementation for DIH which strips off HTML tags using the Solr class org.apache.solr.analysis.HTMLStripReader.
> This is useful in case you don't need these HTML tags anyway.

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
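
To illustrate the java.sql.Clob / java.sql.Blob suggestion in the comment above, here is a minimal sketch of how a column value might be normalized to a Reader before the tags are stripped. It is not the code from the attached patch and does not use the DIH Transformer API. The names HtmlStripSketch, stripValue, toReader, readAll and stripTags are hypothetical. The regex in stripTags is only a crude stand-in for org.apache.solr.analysis.HTMLStripReader, and decoding Blob bytes as UTF-8 is an assumption; real data may use another charset.

```java
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;
import java.sql.Blob;
import java.sql.Clob;
import java.sql.SQLException;

// Hypothetical helper, for illustration only.
final class HtmlStripSketch {

    // Accepts a String, Clob or Blob column value and returns its text without tags.
    static String stripValue(Object value) {
        if (value == null) {
            return null;
        }
        try (Reader reader = toReader(value)) {
            return stripTags(readAll(reader));
        } catch (IOException | SQLException e) {
            throw new IllegalStateException("Could not read column value", e);
        }
    }

    // Normalizes the supported value types to a character stream.
    static Reader toReader(Object value) throws SQLException {
        if (value instanceof Clob) {
            return ((Clob) value).getCharacterStream();
        }
        if (value instanceof Blob) {
            // Assumption: the Blob holds UTF-8 encoded text.
            return new InputStreamReader(((Blob) value).getBinaryStream(), StandardCharsets.UTF_8);
        }
        return new StringReader(value.toString());
    }

    // Drains the reader into a String, using StringBuilder as in the updated patch.
    static String readAll(Reader reader) throws IOException {
        StringBuilder out = new StringBuilder();
        char[] buf = new char[4096];
        int n;
        while ((n = reader.read(buf)) != -1) {
            out.append(buf, 0, n);
        }
        return out.toString();
    }

    // Crude stand-in for HTMLStripReader: removes anything that looks like a tag.
    static String stripTags(String html) {
        return html.replaceAll("<[^>]*>", "").trim();
    }
}
```

With this sketch, HtmlStripSketch.stripValue can be called on whatever object the row holds for a field. In an actual transformer, the Reader returned by toReader would presumably be wrapped by HTMLStripReader instead of being passed through the regex.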