Hi,
In a content management system I have a document and an attachment. The
document contains the metadata and the attachment contains the actual data.
I would like to combine the data of both in one Solr document.
I have thought of several options:
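1. Using ExtractingRequestHandler I would extract the data (extractOnly),
combine it with the metadata and send the result to Solr.
But this might be inefficient and increase the network traffic, since the
file goes to Solr, the extracted text comes back, and the combined
document is then sent to Solr again.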
2. Use a separate Tika installation to extract the data and send it
to Solr.
This would stress an already busy web server.
3. First upload the file using ExtractingRequestHandler, then use atomic
updates to add the other fields.
Or is there another way? First add the meta data and later use the
ExtractingRequestHandler to add the file contents?
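To make option 3 concrete, here is a rough sketch of what I have in mind,
not something I have tested against a real core. The core URL, the field
names and the helper function names are all made up for illustration. It
assumes the extract handler is mapped to /update/extract and that every
field in the schema is stored (or has docValues), because otherwise the
atomic update would lose the extracted content.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical core URL; adjust to the real installation.
SOLR = "http://localhost:8983/solr/cms"


def build_extract_request(doc_id, file_bytes, content_type="application/pdf"):
    """Step 1: send the raw attachment to the ExtractingRequestHandler.

    literal.id sets the unique key of the resulting Solr document.
    """
    params = urlencode({"literal.id": doc_id, "commit": "true"})
    return Request(
        f"{SOLR}/update/extract?{params}",
        data=file_bytes,
        headers={"Content-Type": content_type},
        method="POST",
    )


def build_atomic_update_request(doc_id, metadata):
    """Step 2: add the metadata fields with an atomic update ("set")."""
    doc = {"id": doc_id}
    for field, value in metadata.items():
        doc[field] = {"set": value}
    return Request(
        f"{SOLR}/update?commit=true",
        data=json.dumps([doc]).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
```

Each request would then be sent with urllib.request.urlopen(), the
extract request first and the atomic update second.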
Cheers,
Jeroen