It seems like the InMemoryFileSystem class has been deprecated in Hadoop 0.19.1. Why?
I want to reuse the output of each reduce as the input of the next map.
Cascading does not work for me, because the data of each step depends on
the previous step. I run the MapReduce job of each timestep synchronously,
one after another. If InMemoryFileSystem is deprecated, how can I reduce
the I/O of each timestep's MapReduce job?

2009/4/2 Farhan Husain <russ...@gmail.com>

> Is there a way to implement some OutputCollector that can do what Andy
> wants to do?
>
> On Thu, Apr 2, 2009 at 10:21 AM, Rasit OZDAS <rasitoz...@gmail.com> wrote:
>
> > Andy, I didn't try this feature. But I know that Yahoo had a
> > performance record with this file format.
> > I came across a file system included in hadoop code (probably that
> > one) when searching the source code.
> > Luckily I found it: org.apache.hadoop.fs.InMemoryFileSystem
> > But if you have a lot of big files, this approach won't be suitable I
> > think.
> >
> > Maybe someone can give further info.
> >
> > 2009/4/2 andy2005cst <andy2005...@gmail.com>:
> > >
> > > thanks for your reply. Let me explain more clearly: since Map Reduce
> > > is just one step of my program, I need to use the output of reduce
> > > for further computation, so I do not want to write the output to
> > > disk, but want to get the collection or list of the output in RAM.
> > > If it writes directly to disk, I have to read it back into RAM again.
> > > You have mentioned a special file format; will you please show me
> > > what it is, and give some example if possible?
> > >
> > > thank you so much.
> > >
> > > Rasit OZDAS wrote:
> > >>
> > >> Hi, hadoop is normally designed to write to disk. There is a special
> > >> file format, which writes output to RAM instead of disk.
> > >> But I don't have an idea if it's what you're looking for.
> > >> If what you said exists, there should be a mechanism which sends
> > >> output as objects rather than file content across computers; as far
> > >> as I know there is no such feature yet.
> > >>
> > >> Good luck.
> > >>
> > >> 2009/4/2 andy2005cst <andy2005...@gmail.com>
> > >>
> > >>>
> > >>> I need to use the output of the reduce, but I don't know how to do it.
> > >>> Using the wordcount program as an example, if I want to collect the
> > >>> wordcount into a hashtable for further use, how can I do that?
> > >>> The example just shows how to write the result to disk.
> > >>> my email is: andy2005...@gmail.com
> > >>> Looking forward to your help. Thanks a lot.
> > >>> --
> > >>> View this message in context:
> > >>> http://www.nabble.com/HELP%3A-I-wanna-store-the-output-value-into-a-list-not-write-to-the-disk-tp22844277p22844277.html
> > >>> Sent from the Hadoop core-user mailing list archive at Nabble.com.
> > >>>
> > >>
> > >> --
> > >> M. Raşit ÖZDAŞ
> > >>
> > >
> > > --
> > > View this message in context:
> > > http://www.nabble.com/HELP%3A-I-wanna-store-the-output-value-into-a-list-not-write-to-the-disk-tp22844277p22848070.html
> > > Sent from the Hadoop core-user mailing list archive at Nabble.com.
> > >
> >
> > --
> > M. Raşit ÖZDAŞ
> >
>
> --
> Mohammad Farhan Husain
> Research Assistant
> Department of Computer Science
> Erik Jonsson School of Engineering and Computer Science
> University of Texas at Dallas
>

--
Chen He
RCF CSE Dept.
University of Nebraska-Lincoln
US
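
To make the data flow discussed in this thread concrete, here is a minimal plain-Java sketch of the pattern: the reduce output of one timestep is collected into a map in RAM and handed directly to the map phase of the next timestep. This is not Hadoop code and uses no Hadoop API. The names `IterativeSteps`, `MapFn`, `runStep` and `iterate` are hypothetical, and the "emit every record twice, then sum" step is only a placeholder computation chosen so the result is easy to check.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.BiConsumer;

// Illustration only: a single-process imitation of chained map/reduce steps.
final class IterativeSteps {

    // Hypothetical stand-in for a mapper: turns one input record into
    // zero or more (key, value) pairs passed to 'out'.
    interface MapFn {
        void map(String key, long value, BiConsumer<String, Long> out);
    }

    // Runs one timestep: map every record, group values by key, then sum them.
    static Map<String, Long> runStep(Map<String, Long> input, MapFn mapper) {
        // "Shuffle": collect all emitted values under their key.
        Map<String, List<Long>> grouped = new TreeMap<>();
        for (Map.Entry<String, Long> e : input.entrySet()) {
            mapper.map(e.getKey(), e.getValue(),
                    (k, v) -> grouped.computeIfAbsent(k, x -> new ArrayList<>()).add(v));
        }
        // "Reduce": sum the values of each key into an in-memory result.
        Map<String, Long> output = new TreeMap<>();
        for (Map.Entry<String, List<Long>> e : grouped.entrySet()) {
            long sum = 0;
            for (long v : e.getValue()) {
                sum += v;
            }
            output.put(e.getKey(), sum);
        }
        return output;
    }

    // Feeds the reduce output of each timestep into the next timestep's map.
    static Map<String, Long> iterate(Map<String, Long> initial, int steps) {
        Map<String, Long> state = initial;
        for (int t = 0; t < steps; t++) {
            // Placeholder step: emit each record twice, so each sum doubles.
            state = runStep(state, (k, v, out) -> {
                out.accept(k, v);
                out.accept(k, v);
            });
        }
        return state;
    }

    public static void main(String[] args) {
        Map<String, Long> initial = new HashMap<>();
        initial.put("a", 1L);
        initial.put("b", 3L);
        System.out.println(iterate(initial, 3));
    }
}
```

In a real cluster job, the `state` map corresponds to whatever each job writes out and the next job reads back in, which is exactly the per-timestep I/O asked about at the top of this message. The sketch shows only the dependency between steps, not how to avoid that I/O in Hadoop.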