[ https://issues.apache.org/jira/browse/SOLR-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12670093#action_12670093 ]
Fergus McMenemie commented on SOLR-1003:
----------------------------------------

What is the difference between the HTMLStripTransformer and what is proposed here? Surely both would return:

"This text is bold and this text is underlined!"

> XPathEntityprocessor must allow slurping all text from a given xml node and its children
> ----------------------------------------------------------------------------------------
>
>                 Key: SOLR-1003
>                 URL: https://issues.apache.org/jira/browse/SOLR-1003
>             Project: Solr
>          Issue Type: New Feature
>          Components: contrib - DataImportHandler
>    Affects Versions: 1.4
>            Reporter: Noble Paul
>            Priority: Minor
>             Fix For: 1.4
>
>         Attachments: SOLR-1003.patch
>
>
> Take an example:
> {code:xml}
> <xhtml:p>This text is
> <xhtml:b>bold</xhtml:b> and this text is
> <xhtml:u>underlined</xhtml:u>!
> </xhtml:p>
> {code}
> It may be useful to get all the text from all the tags in <xhtml:p>, ignoring the tag names.
> The configuration of the field may look like
> {code:xml}
> <field column="para" xpath="/p" flatten="true"/>
> {code}
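
For illustration only, here is a rough Python sketch of what "slurping all text from a node and its children" could mean for the example paragraph. It is not the DataImportHandler code and says nothing about how the attached patch works. The `flatten_text` helper, the `xmlns:xhtml` declaration (added so the snippet parses on its own), and the whitespace normalization are all assumptions made for this sketch.

```python
import xml.etree.ElementTree as ET

# The paragraph from the issue description. The xmlns:xhtml declaration is an
# assumption added here so that the fragment is well-formed by itself.
XML = """<xhtml:p xmlns:xhtml="http://www.w3.org/1999/xhtml">This text is
<xhtml:b>bold</xhtml:b> and this text is
<xhtml:u>underlined</xhtml:u>!
</xhtml:p>"""


def flatten_text(element):
    """Hypothetical helper: join the text of an element and all of its
    descendants, ignoring tag names, then collapse runs of whitespace."""
    raw = "".join(element.itertext())
    return " ".join(raw.split())


root = ET.fromstring(XML)
flattened = flatten_text(root)
print(flattened)
```

For this input the result is the same string quoted in the comment above, so the example alone does not distinguish the two approaches. The sketch works on the parsed node tree, whereas stripping markup works on a field value that is already a string.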