[
https://issues.apache.org/jira/browse/SOLR-1061?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12680807#action_12680807
]
Fergus McMenemie commented on SOLR-1061:
----------------------------------------
Yes, yes. Another usecase I ran into a lot was having lat/long within the same
XML field, this would have been really useful. I guess if the matcher fails
the fields/colums firstName and secondName are undefined? However although the
above is neat and clean it can of course now be done as follows:-
{code}
<field column="firstName" regex="Mr(\w*)\b\w*" replaceWith="$1"
sourceColName="full_name"/>
<field column="secondName" regex="Mr\w*\b(\w*)" replaceWith="$1"
sourceColName="full_name"/>
{code}
Also I would think the following will be a related common usecase; imagine a
field which listed an indeterminate number of aliases or alternate names for a
person. This is bad data design but it happens. We need to expose regex's
global feature
{code}
<firstName>josephine</firstname>
<aliases>jo,joe,jos<aliases>
{code}
{code}
<field column="alias" regex="([^,]+)" regex_options="global"
sourceColName="aliases"/>
{code}
which would populate the column alias with multiple values. The attribute
regex_options allows other regex options such as case insensitivity to be added
as well.
> Improve regexTransformer to create multiple columns from regexGroups
> --------------------------------------------------------------------
>
> Key: SOLR-1061
> URL: https://issues.apache.org/jira/browse/SOLR-1061
> Project: Solr
> Issue Type: Improvement
> Components: contrib - DataImportHandler
> Reporter: Noble Paul
> Fix For: 1.4
>
>
> example
> {code:xml}
> <field column="doesnotmatter" regex="Mr(\w*)\b(\w*)"
> sourceColName="full_name" groupNames="1:firstName,2:secondName"/>
> {code}
> This is more efficient in extracting multiple values from a single String. In
> this case the column is redundant but it is ok
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.