Re: Getting file text content from mapper?

Raymond Giorgi Thu, 20 Nov 2014 11:15:18 -0800

Also, this is the first line of what's posted along the river

{ "index": {"_index":"resumes","_type":"resume","_id":"2158912"}}


Things can get truncated when they're as big as a Base64 encoded file :)


On Wednesday, November 19, 2014 6:01:29 PM UTC-5, Raymond Giorgi wrote:
>
> Hey all,
>
> I'm hoping someone can help me out with something I'm having an issue with.
>
> The short: I'm trying to extract plaintext from the attachment-mapper.
>
> The long: I'm posting the contents of a file Base64 encoded to RabbitMQ 
> which is feeding an ElasticSearch river plugin. Querying against the field 
> works fine, but it only seems to store the Base64 encoding of the file 
> instead of the plaintext. I'd like to extract the contents as plaintext and 
> have that be returnable (i.e. query for the text of a docx). I'm feeding it 
> from a PHP front end, so there are places in the app where I'd like to rely 
> on Elasticsearch's built in Tika processor.
>
> Thanks!
>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/68456ac0-14b9-49f8-a0a0-b930223004f8%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Re: Getting file text content from mapper?

Reply via email to