[Hadoop] capability clarification questions

Josh Harrison Thu, 30 Jan 2014 11:56:29 -0800

Looking around, I haven't been able to find explicit answers to these
questions, though the questions may simply be because I'm a Hadoop
newbie.
If we were to deploy ES within a Hadoop environment:
Is the primary benefit that Hadoop can interact with ES directly, running
queries or indexing data?
Are there explicit benefits to search speed and capability when queries are
run through the normal REST or other client APIs? That is to say, if I have
a set of N documents and a query that takes T seconds to run on a normal
cluster through curl, would there be a marked improvement in T when running
the same query through curl against a Hadoop-enabled cluster?
Is the ideal architecture for a Hadoop-enabled ES cluster the same as, or
similar to, that of a "regular" cluster?
If it is the same, does a Hadoop-enabled cluster need to be designed as
such from the start, or can that functionality be added to an already
functioning cluster with data? Our situation is that we're on a cluster of
machines running Hadoop, but the ES nodes are just running on the compute
nodes like a regular service. I'm wondering what it would take to enable the
Hadoop capabilities.


Thanks!

