Hi, Thanks for ur reply...i imported StreamingContext and right now i am getting my Dstream as something like map(id -> 123, name -> srini, mobile -> 12324214, score -> 123, test_type -> math) map(id -> 321, name -> vasu, mobile -> 73942090, score -> 324, test_type ->sci) map(id -> 432, name ->xxxx, mobile ->423141234,score -> 322,test_type -> math)
each map collection is from json string. now if i want aggregrate the scores on only math or if i want to find out who got the highest score in math that shows both name and score..i would like to what transformation should i do to my existing dstream.I am very new to dealing with maps and dstream transformations..so please advise on how to proceed from here. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Spark-Streaming-Json-file-groupby-function-tp9618p9656.html Sent from the Apache Spark User List mailing list archive at Nabble.com.