Re: highlight content from crawl data from manifoldcf to ES

2014-12-02 Thread Nikolas Everett
On Mon, Dec 1, 2014 at 10:42 PM, N Bijalwan ahcir...@gmail.com wrote: We are using ManifoldCF to crawl web pages and then index them through Elasticsearch. Is there a way to get only the few lines that contain the searched keyword in the response to an Elasticsearch query, instead of the whole content? Like
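
For readers following the question above, here is a minimal sketch of a search body that asks Elasticsearch for highlighted fragments rather than the whole document. It is not taken from the replies in this thread. The field name `file` comes from the mapping posted later in the thread; the keyword `election`, the helper name `build_highlight_query`, and the fragment settings are assumptions chosen for illustration.

```python
import json


def build_highlight_query(field, keyword, fragment_size=150, number_of_fragments=3):
    """Build a search body that requests highlighted fragments for `field`.

    `field` and `keyword` are supplied by the caller; the defaults for
    fragment_size and number_of_fragments are illustrative assumptions.
    """
    return {
        # Do not return the full stored document, only the highlights.
        "_source": False,
        "query": {"match": {field: keyword}},
        "highlight": {
            "fields": {
                field: {
                    "fragment_size": fragment_size,
                    "number_of_fragments": number_of_fragments,
                }
            }
        },
    }


# "file" is the field from the thread's mapping; "election" is a made-up keyword.
body = build_highlight_query("file", "election")
print(json.dumps(body, indent=2))
```

The printed JSON would be sent as the body of a `_search` request against the index. Each hit in the response should then carry a `highlight` section with the matching fragments, assuming the field is mapped so that its text is available for highlighting.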

Re: highlight content from crawl data from manifoldcf to ES

2014-12-02 Thread N Bijalwan
Thanks Nik for the very descriptive solution. I had also made some mapping mistakes, because of which I was not able to get highlighted text in the response for sample data. I fixed it by using the following mapping http://localhost:9200/cnn/test/_mapping { test: { properties: { file: { type:

Re: highlight content from crawl data from manifoldcf to ES

2014-12-02 Thread Nikolas Everett
Setting store to yes isn't actually required. It might increase performance in some cases at the cost of extra disk space. I leave it false everywhere and have no trouble. Nik On Tue, Dec 2, 2014 at 10:00 AM, N Bijalwan ahcir...@gmail.com wrote: Thanks Nik for very descriptive solution. I
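
As a rough illustration of the point about `store`, the sketch below builds a mapping body in which `store` is left at false. The type name `test` and field name `file` come from the mapping quoted in this thread. The field type `string` is an assumption, since the original mapping is cut off before its type, and `build_mapping` is a hypothetical helper.

```python
import json


def build_mapping(doc_type, field, field_type, store=False):
    """Build a mapping body for a single field.

    `field_type` must be supplied by the caller; the value used below is an
    assumption because the mapping quoted in the thread is truncated.
    """
    return {
        doc_type: {
            "properties": {
                field: {
                    "type": field_type,
                    # False keeps the field out of separate stored fields.
                    "store": store,
                }
            }
        }
    }


# "string" is an assumed type, not the one from the original (truncated) mapping.
mapping = build_mapping("test", "file", "string", store=False)
print(json.dumps(mapping, indent=2))
```

Passing `store=True` instead would produce the variant that trades extra disk space for possibly faster field retrieval, as described in the message above.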

Re: highlight content from crawl data from manifoldcf to ES

2014-12-02 Thread N Bijalwan
OK, that's a good suggestion. I'll set store to no if yes is not essential. Naveen On Tuesday, 2 December 2014 20:33:03 UTC+5:30, Nikolas Everett wrote: Setting store to yes isn't actually required. It might increase performance in some cases at the cost of extra disk space. I leave it