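Streaming means processing the data as it comes into HDFS. Hadoop Streaming, for example, lets Hadoop receive and process data using executables of different types: any program or script that reads from standard input and writes to standard output can act as the mapper or reducer.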
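To make the "executables of different types" point concrete, here is a minimal sketch of a word-count mapper that could be used with Hadoop Streaming. The function names (map_lines, main) and the word-count task itself are my own illustration, not something from the original question; the only convention assumed is that input lines arrive on stdin and tab-separated key/value records go to stdout.

```python
#!/usr/bin/env python3
"""Word-count mapper sketch for Hadoop Streaming (illustrative only)."""
import sys


def map_lines(lines):
    """Yield one "word<TAB>1" record for every word in the input lines."""
    for line in lines:
        for word in line.split():
            # By default, the text before the first tab is treated as the key.
            yield f"{word}\t1"


def main(stdin=sys.stdin, stdout=sys.stdout):
    # The framework feeds the input split to this process on stdin,
    # line by line, and collects whatever is written to stdout.
    for record in map_lines(stdin):
        stdout.write(record + "\n")


if __name__ == "__main__":
    main()
```

A script like this would be passed to the streaming jar with the -mapper option (and shipped to the cluster with -file), as described in the documentation linked in this message. It can also be tried locally by piping a text file into it.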
I hope you have already read this:
http://hadoop.apache.org/docs/r0.18.1/streaming.html#Hadoop+Streaming

Warm Regards,
Shashwat Shriparv

On Wed, Mar 5, 2014 at 1:38 PM, Radhe Radhe <radhe.krishna.ra...@live.com> wrote:
> Hello All,
>
> Can anyone please explain what we mean by *Streaming data access in HDFS*?
>
> Data is usually copied to HDFS, and in HDFS the data is split across
> DataNodes in blocks.
> Say, for example, I have an input file of 10240 MB (10 GB) in size and a
> block size of 64 MB. Then there will be 160 blocks.
> These blocks will be distributed across the DataNodes.
> Now the Mappers will read data from these DataNodes, keeping the *data
> locality feature* in mind (i.e. blocks local to a DataNode will be read by
> the map tasks running on that DataNode).
>
> Can you please point out where "Streaming data access in HDFS"
> comes into the picture here?
>
> Thanks,
> RR
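
As a quick check of the block arithmetic in the question, here is a small sketch. The helper name block_count is hypothetical, and it assumes sizes are given as whole megabytes and that a final partial block still counts as one block.

```python
def block_count(file_size_mb, block_size_mb):
    """Number of blocks needed for a file; the last block may be partial."""
    # Ceiling division on integers, so a leftover remainder gets its own block.
    return -(-file_size_mb // block_size_mb)


print(block_count(10240, 64))  # prints 160
```

So a 10240 MB file with a 64 MB block size does come to 160 blocks, as stated in the question.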