Re: searching pdf files by content with Mongodb-river

2014-03-22 Thread David Pilato
le":"D:/text.txt","content":"My name is Akmurat Saktagan. I am 21 years old."},"filename":"D:/text.txt","contentType":null,"md5":"c8f86639cb4bfec23deab7beea473683","length"

Re: searching pdf files by content with Mongodb-river

2014-03-22 Thread sAs59
"properties" : { "file" : { "type" : "attachment", "fields" : { "file" : {"index" : "n

Re: searching pdf files by content with Mongodb-river

2014-03-22 Thread David Pilato
I think that if you store content it should work. Something like this (in mapping): { "person" : { "properties" : { "file" :

Re: searching pdf files by content with Mongodb-river

2014-03-22 Thread sAs59
rl -XGET 'http://localhost:9200/index/person/_search?q=whatever&fields=file.file' Should work I guess. -- David Pilato | Technical Advocate | Elasticsearch.com
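The request quoted above is a URI search that asks for a specific field back. A minimal Python sketch of how that URL is put together, assuming the host, index (`index`), type (`person`) and field (`file.file`) from the message; `build_search_url` is a hypothetical helper, not part of any Elasticsearch client.

```python
from urllib.parse import urlencode


def build_search_url(base, index, doc_type, query, fields):
    """Build a URI search URL that asks for specific fields in each hit."""
    # "q" carries the query string, "fields" the comma-separated field names.
    params = urlencode({"q": query, "fields": ",".join(fields)})
    return f"{base}/{index}/{doc_type}/_search?{params}"


url = build_search_url(
    "http://localhost:9200", "index", "person", "whatever", ["file.file"]
)
print(url)
```

Whether `file.file` comes back at all depends on that sub-field being stored in the mapping, which is the point being made in this thread.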

Re: searching pdf files by content with Mongodb-river

2014-03-22 Thread David Pilato

Re: searching pdf files by content with Mongodb-river

2014-03-22 Thread sAs59

Re: searching pdf files by content with Mongodb-river

2014-03-20 Thread David Pilato
I think I'm starting to understand what you are trying to get… You don't want original content but only extracted content, right? I think that if you store content it should work. Something like this (in mapping): { "person" : { "properties" : { "file" : {
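The mapping in this message is cut off, so the following is only a sketch of what "store content" could look like, written as a Python dict that serializes to the JSON body. It assumes the `attachment` type from the mapper-attachments plugin and the `person` / `file` names used in the thread; `"store": True` on the inner `file` sub-field is an assumption drawn from the advice above, not the author's exact mapping.

```python
import json


def attachment_mapping(doc_type, field):
    """Return a mapping body with an attachment field whose text is stored."""
    return {
        doc_type: {
            "properties": {
                field: {
                    "type": "attachment",
                    # Assumption: the sub-field named like the attachment
                    # field holds the extracted text; storing it makes it
                    # retrievable through "fields" in a search request.
                    "fields": {field: {"store": True}},
                }
            }
        }
    }


mapping = attachment_mapping("person", "file")
print(json.dumps(mapping, indent=2))
```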

Re: searching pdf files by content with Mongodb-river

2014-03-20 Thread sAs59
endstream endobj 5 0 obj <http://elasticsearch-users.115913.n3.nabble.com/searching-pdf-files-by-content-with-Mongodb-river-tp4051989p4052333.html Sent from the ElasticSearch Users mailing list

Re: searching pdf files by content with Mongodb-river

2014-03-19 Thread sAs59
ticsearch-users.115913.n3.nabble.com/searching-pdf-files-by-content-with-Mongodb-river-tp4051989p4052267.html Sent from the ElasticSearch Users mailing list archive at Nabble.com. -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubs

Re: searching pdf files by content with Mongodb-river

2014-03-19 Thread David Pilato
NDQ0IDMzMyA1MDAgNTAwIDI3OCAwIDUwMCAyNzggNzc4IDUwMCA1MDAgNTAwIDUwMCAzMzMgMzg5IDI3OCA1MDAgNTAwIDcyMiA1MDAgNTAwXSANCmVuZG9iag0KMTEgMCBvYmoNClsgMjI2XSANCmVuZG9 This is only a small part I've copied. Maybe the problem is in the mapping? P.S. Sorry for my bad English.

Re: searching pdf files by content with Mongodb-river

2014-03-18 Thread David Pilato
lL0NhdGFsb2cvUGFnZXMgMiAwIFIvTGFuZyhlbi1VUykgPj4NCmVuZG9iag0KMiAwIG9iag0 "filename":"D:/sample.pdf","contentType":"application/pdf","md5":"afe70f97bce7876e39aa43f71dc7266f","length":82441,"chunkSize":262144,"u

Re: Using Elasticsearch with Mongodb-River for searching pdf

2014-03-18 Thread sAs59
http://localhost:9200/mongoindex/_search?pretty=true

searching pdf files by content with Mongodb-river

2014-03-18 Thread sAs59
W1DQoxIDAgb2JqDQo8PC9UeXBlL0NhdGFsb2cvUGFnZXMgMiAwIFIvTGFuZyhlbi1VUykgPj4NCmVuZG9iag0KMiAwIG9iag0 "filename":"D:/sample.pdf","contentType":"application/pdf","md5":"afe70f97bce7876e39aa43f71dc7266f","length":82441,"c
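The long string in this result is base64, which suggests the search is returning the original file rather than the extracted text. A quick Python check on a 4-character-aligned fragment copied from the snippet above (the fragment boundaries are chosen here only so that it decodes cleanly):

```python
import base64

# Fragment taken from the "content" value quoted in the message.
fragment = "L0NhdGFsb2cvUGFnZXMgMiAwIFIvTGFuZyhlbi1VUykgPj4NCmVuZG9iag0K"

decoded = base64.b64decode(fragment)
print(decoded)  # b'/Catalog/Pages 2 0 R/Lang(en-US) >>\r\nendobj\r\n'
```

The decoded bytes are PDF object syntax, so what is stored in that field is the raw PDF, not searchable text.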

Re: Searching PDF

2014-02-07 Thread ZenMaster80
You are correct, my JSON mapping had a wrong entry. Thanks for the help!

Re: Searching PDF

2014-02-07 Thread Binh Ly
It looks like that indexing code might not be correct. I just tried this code and it works for me: try { String fileContents = readContent( new File( "fn6742.pdf" ) ); try { DeleteIndexResponse deleteIndexResponse = new DeleteIndexRequestBuilder( client.admi
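The Java code above is truncated, so what `readContent` does is not visible here. Since the attachment type expects the file as base64, a rough Python equivalent of preparing the document might look like this; `attachment_document` and the placeholder bytes are hypothetical, and the field name `file` is taken from the thread.

```python
import base64
import json


def attachment_document(raw_bytes, field="file"):
    """Wrap raw file bytes as a base64 string under the attachment field."""
    encoded = base64.b64encode(raw_bytes).decode("ascii")
    return {field: encoded}


# Placeholder bytes standing in for a real PDF read from disk.
sample = b"%PDF-1.5 placeholder bytes, not a real PDF"
doc = attachment_document(sample)
print(json.dumps(doc))
```

In practice `raw_bytes` would come from reading the PDF in binary mode, and the resulting JSON would be the body of the index request.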

Re: Searching PDF

2014-02-07 Thread ZenMaster80
So, what's wrong with this? GET localhost:9200/_search { "fields": "file", "query": { "match_all": {} } } .. "hits": { "total": 1, "max_score": 1, "hits": [ { "_index": "docs", "_type": "pdf", "_id": "1", "_sc

Re: Searching PDF

2014-02-07 Thread Binh Ly
You should be able to get the textual field values by explicitly requesting them from fields. For example: GET localhost:9200/_search { "fields": "*", "query": { "match_all": {} } }
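For illustration, the same request body built in Python, plus a small helper that pulls the returned fields out of each hit. The response shown is a made-up sample shaped like the one quoted earlier in this thread (index `docs`, type `pdf`); the field value is hypothetical, and `fields_query` / `extract_fields` are not library functions.

```python
import json


def fields_query(fields="*"):
    """Search body that matches everything and requests the given fields."""
    return {"fields": fields, "query": {"match_all": {}}}


def extract_fields(response):
    """Collect the 'fields' object of every hit (empty dict if absent)."""
    return [hit.get("fields", {}) for hit in response["hits"]["hits"]]


body = fields_query()
print(json.dumps(body))

# Hypothetical response, for illustration only.
sample_response = {
    "hits": {
        "total": 1,
        "hits": [
            {
                "_index": "docs",
                "_type": "pdf",
                "_id": "1",
                "fields": {"file": ["example extracted text"]},
            }
        ],
    }
}
print(extract_fields(sample_response))
```

A hit that comes back without a `fields` object usually means none of the requested fields were stored, which points back at the mapping.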