[ 
https://issues.apache.org/jira/browse/SOLR-7229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14356812#comment-14356812
 ] 

Tim Allison edited comment on SOLR-7229 at 3/11/15 5:30 PM:
------------------------------------------------------------

Yes, that's what I was getting at, and that was the answer I was hoping for.  
Apologies, I'm still trying to learn the preferences for the boundary between 
custom hard coding and configuration over here.  I'll open another issue to add 
that (SOLR-7231).

And, on another note, I just noticed that the code that adds metadata is just 
pulling the first value; in short, if there is a multivalued Solr field, and 
there's more than one metadata value in the metadata object, the values after 
the first are being ignored.  Looks like another issue. :) (SOLR-7232)
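
To make the difference concrete, here is a minimal standalone sketch. It is not the DIH code: the class name, the helper methods, and the sample keys and values are all hypothetical, and a plain Map stands in for the metadata object. In Tika itself, the analogous distinction is, as far as I know, between Metadata.get(name), which returns only the first value, and Metadata.getValues(name), which returns all of them.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; a Map stands in for the metadata object.
public class MultiValuedMetadataSketch {

    // Sample metadata in which one key carries several values (made-up data).
    static Map<String, List<String>> sampleMetadata() {
        Map<String, List<String>> md = new LinkedHashMap<>();
        md.put("dc:creator", Arrays.asList("Alice", "Bob", "Carol"));
        md.put("dc:title", Arrays.asList("Quarterly report"));
        return md;
    }

    // Roughly the behaviour described above: only the first value is kept.
    static List<String> firstValueOnly(Map<String, List<String>> md, String key) {
        List<String> out = new ArrayList<>();
        List<String> values = md.get(key);
        if (values != null && !values.isEmpty()) {
            out.add(values.get(0));
        }
        return out;
    }

    // What a multivalued field would want: every value, in original order.
    static List<String> allValues(Map<String, List<String>> md, String key) {
        List<String> values = md.get(key);
        return values == null ? new ArrayList<>() : new ArrayList<>(values);
    }

    public static void main(String[] args) {
        Map<String, List<String>> md = sampleMetadata();
        System.out.println("first only: " + firstValueOnly(md, "dc:creator"));
        System.out.println("all values: " + allValues(md, "dc:creator"));
    }
}
```

With the first helper, "Bob" and "Carol" never reach the field; with the second, all three values are available to add to a multivalued field.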


was (Author: talli...@mitre.org):
Y, that's what I was getting at, and that was the answer I was hoping for.  
Apologies, I'm still trying to learn the preferences for the boundary between 
custom hard coding and configuration over here.  I'll open another issue to add 
that.  

And, on another note, I just noticed that the code that adds metadata is just 
pulling the first value; in short, if there is a multivalued Solr field, and 
there's more than one metadata value in the metadata object, the values after 
the first are being ignored.  Looks like another issue. :)

> Allow DIH to handle attachments as separate documents
> -----------------------------------------------------
>
>                 Key: SOLR-7229
>                 URL: https://issues.apache.org/jira/browse/SOLR-7229
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Tim Allison
>            Priority: Minor
>
> With Tika 1.7's RecursiveParserWrapper, it is possible to maintain metadata 
> of individual attachments/embedded documents.  Tika's default handling was to 
> maintain the metadata of the container document and concatenate the contents 
> of all embedded files.  With SOLR-7189, we added the legacy behavior.
> It might be handy, for example, to be able to send an MSG file through DIH 
> and treat the container email as well as each attachment as separate (child?) 
> documents, or send a zip of jpeg files and correctly index the geo locations 
> for each image file.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

