You can design a receiver that fires every 5 seconds (the batch interval)
and pulls the last 5 seconds of data from the HTTP API. You can shard the
data by time within those 5 seconds to distribute the load further.
You can also bind a TSDB node to each receiver to translate the HBase data,
which should improve performance.
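
As a rough sketch of the time-sharding idea, the snippet below splits one
5-second batch window into smaller sub-ranges and builds one HTTP query URL
per sub-range, so that each receiver can pull its own slice. It assumes the
OpenTSDB 2.x /api/query endpoint with start, end and m parameters and Unix
timestamps in seconds. The helper names, the host tsdb-host:4242 and the
metric sys.cpu.user are made up for illustration, and the actual HTTP fetch
and the Spark receiver wiring are left out.

```python
from urllib.parse import urlencode


def shard_window(end_s, batch_s=5, shards=5):
    """Split the window [end_s - batch_s, end_s] into `shards` sub-ranges.

    Returns a list of (start, end) pairs in Unix seconds. The last pair
    always ends at end_s, even if batch_s is not divisible by shards.
    """
    start_s = end_s - batch_s
    step = batch_s // shards
    bounds = [start_s + i * step for i in range(shards)] + [end_s]
    return list(zip(bounds[:-1], bounds[1:]))


def build_query_url(base_url, metric, start_s, end_s, aggregator="sum"):
    """Build a query URL for one sub-range (assumed /api/query endpoint)."""
    params = urlencode({
        "start": start_s,
        "end": end_s,
        "m": f"{aggregator}:{metric}",
    })
    return f"{base_url}/api/query?{params}"


# Hypothetical usage: one URL per shard for the batch ending at t=1406900000.
urls = [
    build_query_url("http://tsdb-host:4242", "sys.cpu.user", s, e)
    for s, e in shard_window(1406900000)
]
```

Adjacent sub-ranges share a boundary timestamp here, so depending on how the
API treats the end of a range you may need to de-duplicate points that fall
exactly on a boundary.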

Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi <https://twitter.com/mayur_rustagi>



On Fri, Aug 1, 2014 at 5:21 PM, bumble123 <tc1...@att.com> wrote:

> So is there no way to do this through SparkStreaming? Won't I have to do
> batch processing if I use the http api rather than getting it directly into
> Spark?
>
>
>
> --
> View this message in context:
> http://apache-spark-user-list.1001560.n3.nabble.com/How-to-read-from-OpenTSDB-using-PySpark-or-Scala-Spark-tp11211p11234.html
> Sent from the Apache Spark User List mailing list archive at Nabble.com.
>
