[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Allen Wittenauer resolved MAPREDUCE-1211.
-----------------------------------------

    Resolution: Won't Fix

> Online aggregation and continuous query support
> -----------------------------------------------
>
>                 Key: MAPREDUCE-1211
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1211
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: task
>            Reporter: Tyson Condie
>            Priority: Minor
>
> The purpose of this post is to propose a modified MapReduce architecture that 
> allows data to be pipelined between operators. This extends the MapReduce 
> programming model beyond batch processing, and can reduce completion times 
> and improve system utilization for batch jobs as well. We have built a 
> modified version of the Hadoop MapReduce framework that supports online 
> aggregation, which allows users to see "early returns" from a job as it is 
> being computed. Our Hadoop Online Prototype (HOP) also supports continuous 
> queries, which enable MapReduce programs to be written for applications such 
> as event monitoring and stream processing. HOP retains the fault tolerance 
> properties of Hadoop, and can run unmodified user-defined MapReduce programs.
> For more information on the HOP design, please see our technical report.
> http://www.eecs.berkeley.edu/Pubs/TechRpts/2009/EECS-2009-136.html
> Further details are discussed in the following blog posts.
> http://databeta.wordpress.com/2009/10/18/mapreduce-online/
> http://radar.oreilly.com/2009/10/pipelining-and-real-time-analytics-with-mapreduce-online.html
> http://dbmsmusings.blogspot.com/2009/10/analysis-of-mapreduce-online-paper.html
> The HOP code has been published at the following location.
> http://code.google.com/p/hop/



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to