Depends on your use case and search volume. Typically you'd have a dedicated ES cluster if your app is doing a lot of real time indexing and search.
If it's only for spark integration then you could colocate ES and spark — Sent from Mailbox On Tue, Apr 28, 2015 at 6:41 PM, Jeetendra Gangele <gangele...@gmail.com> wrote: > Thanks for reply. > Elastic search index will be within my Cluster? or I need the separate host > the elastic search? > On 28 April 2015 at 22:03, Nick Pentreath <nick.pentre...@gmail.com> wrote: >> I haven't used Solr for a long time, and haven't used Solr in Spark. >> >> However, why do you say "Elasticsearch is not a good option ..."? ES >> absolutely supports full-text search and not just filtering and grouping >> (in fact it's original purpose was and still is text search, though >> filtering, grouping and aggregation are heavily used). >> http://www.elastic.co/guide/en/elasticsearch/guide/master/full-text-search.html >> >> >> >> On Tue, Apr 28, 2015 at 6:27 PM, Jeetendra Gangele <gangele...@gmail.com> >> wrote: >> >>> Does anyone tried using solr inside spark? >>> below is the project describing it. >>> https://github.com/LucidWorks/spark-solr. >>> >>> I have a requirement in which I want to index 20 millions companies name >>> and then search as and when new data comes in. the output should be list of >>> companies matching the query. >>> >>> Spark has inbuilt elastic search but for this purpose Elastic search is >>> not a good option since this is totally text search problem? >>> >>> Elastic search is good for filtering and grouping. >>> >>> Does any body used solr inside spark? >>> >>> Regards >>> jeetendra >>> >>> >>