Hi Erik, This is indeed what I was talking about... It could even be handled via some type of transient file storage system. this might even be better to avoid the risks associated with uploading a huge file across a network and might (have no idea) be easier to implement.
So I could send the file, and receive back a token which I would then throw into one of my fields as a reference. Then using it to map tika fields as well. like: <str name="file_mod_date">${FILETOKEN}.last_modified</str> <str name="file_body">${FILETOKEN}.content</str> Best, Jacob On Mon, Dec 15, 2008 at 2:29 PM, Erik Hatcher <e...@ehatchersolutions.com> wrote: > > On Dec 15, 2008, at 3:13 AM, Chris Hostetter wrote: > >> >> : If I can find the bandwidth, I'd like to make something which allows >> : file uploads via the XMLUpdateHandler as well... Do you have any ideas >> >> the XmlUpdateRequestHandler already supports file uploads ... all request >> handlers do using the ContentStream abstraction... >> >> http://wiki.apache.org/solr/ContentStream > > But it doesn't do what Jacob is asking for... he wants (if I'm not mistaken) > the ability to send a binary file along with Solr XML, and merge the > extraction from the file (via Tika) with the fields specified in the XML. > > Currently this is not possible, as far as I know. Maybe this sort of thing > could be coded to part of an update processor chain? Somehow DIH and the > Tika need to tie together eventually too, eh? > > Erik > > -- +1 510 277-0891 (o) +91 9999 33 7458 (m) web: http://pajamadesign.com Skype: pajamadesign Yahoo: jacobsingh AIM: jacobsingh gTalk: jacobsi...@gmail.com