[ 
https://issues.apache.org/jira/browse/SOLR-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14290223#comment-14290223
 ] 

ASF subversion and git services commented on SOLR-6856:
-------------------------------------------------------

Commit 1654431 from [~sar...@syr.edu] in branch 'dev/trunk'
[ https://svn.apache.org/r1654431 ]

SOLR-6856: Restore ExtractingRequestHandler's ability to capture all HTML tags 
when parsing (X)HTML.

> regression in /update/extract ? ref guide examples of fmap & xpath don't seem 
> to be working 
> --------------------------------------------------------------------------------------------
>
>                 Key: SOLR-6856
>                 URL: https://issues.apache.org/jira/browse/SOLR-6856
>             Project: Solr
>          Issue Type: Bug
>    Affects Versions: 3.1
>            Reporter: Hoss Man
>            Assignee: Steve Rowe
>            Priority: Blocker
>             Fix For: 5.0, Trunk, 5.1
>
>         Attachments: SOLR-6856.patch, SOLR-6856.patch
>
>
> I updated this page to reflect the new bin/solr and example/exampledocs 
> structure/contents...
> https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika
> However, I noticed that several of the examples listed on that page no 
> longer seem to work, notably:
> * examples using "fmap" don't seem to create the fields they say they will
> * examples using "xpath" don't seem to create any docs at all
> Specific examples I had problems with...
> {noformat}
> curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc2&captureAttr=true&defaultField=text&fmap.div=foo_t&capture=div&commit=true" \
>   -F "sample=@example/exampledocs/sample.html"
> curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc3&captureAttr=true&defaultField=text&capture=div&fmap.div=foo_t&boost.foo_t=3&commit=true" \
>   -F "sample=@example/exampledocs/sample.html"
> curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc4&captureAttr=true&defaultField=text&capture=div&fmap.div=foo_t&boost.foo_t=3&literal.blah_s=Bah&commit=true" \
>   -F "sample=@example/exampledocs/sample.html"
> curl "http://localhost:8983/solr/techproducts/update/extract?literal.id=doc5&captureAttr=true&defaultField=text&capture=div&fmap.div=foo_t&boost.foo_t=3&literal.id=id&xpath=/xhtml:html/xhtml:body/xhtml:div/descendant:node()&commit=true" \
>   -F "sample=@example/exampledocs/sample.html"
> {noformat}
> ...none of these example commands produced an error, but they also didn't 
> seem to create the fields/docs they said they would (i.e., no "foo_t" field 
> was created)
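
One way to confirm the symptom described above is to query the document back and look for the mapped field. The following is a minimal sketch, not part of the original report: the helper names `check_url` and `field_was_created` are hypothetical, and it assumes the core's standard /select handler with a JSON response (`wt=json`), reusing the host and core name from the curl examples.

```python
from urllib.parse import urlencode

# Host and core name taken from the curl examples in the report.
SOLR_BASE = "http://localhost:8983/solr/techproducts"


def check_url(doc_id, field="foo_t"):
    """Build a /select URL that returns only the id and the mapped field.

    Hypothetical helper: assumes the standard /select handler is enabled.
    """
    query = [("q", f"id:{doc_id}"), ("fl", f"id,{field}"), ("wt", "json")]
    return f"{SOLR_BASE}/select?{urlencode(query)}"


def field_was_created(select_response, field="foo_t"):
    """Return True if any returned document carries the given field.

    `select_response` is the parsed JSON body of a /select request.
    """
    docs = select_response.get("response", {}).get("docs", [])
    return any(field in doc for doc in docs)
```

After running one of the curl commands, the URL from `check_url("doc2")` can be fetched (for example with `urllib.request.urlopen` and `json.load`) and the parsed body passed to `field_was_created`. With the regression described here, the check would be expected to return False even though the extract request reported no error.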



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

