Author: buildbot
Date: Fri Oct 5 15:02:34 2012
New Revision: 834144
Log:
Staging update by buildbot for crunch
Modified:
websites/staging/crunch/trunk/content/ (props changed)
websites/staging/crunch/trunk/content/crunch/intro.html
Propchange: websites/staging/crunch/trunk/content/
------------------------------------------------------------------------------
--- cms:source-revision (original)
+++ cms:source-revision Fri Oct 5 15:02:34 2012
@@ -1 +1 @@
-1385352
+1394563
Modified: websites/staging/crunch/trunk/content/crunch/intro.html
==============================================================================
--- websites/staging/crunch/trunk/content/crunch/intro.html (original)
+++ websites/staging/crunch/trunk/content/crunch/intro.html Fri Oct 5 15:02:34
2012
@@ -115,11 +115,14 @@ it in your local repository:</p>
<p>This also runs the integration test suite which will take a while.
Afterwards
-you can run the bundled example applications:</p>
-<div class="codehilite"><pre><span class="n">hadoop</span> <span
class="n">jar</span> <span class="n">examples</span><span
class="sr">/target/c</span><span class="n">runch</span><span
class="o">-</span><span class="n">examples</span><span
class="o">-*-</span><span class="n">job</span><span class="o">.</span><span
class="n">jar</span> <span class="n">org</span><span class="o">.</span><span
class="n">apache</span><span class="o">.</span><span
class="n">crunch</span><span class="o">.</span><span
class="n">examples</span><span class="o">.</span><span
class="n">WordCount</span> <span class="sr"><inputfile></span> <span
class="sr"><outputdir></span>
+you can run the bundled example applications such as WordCount:</p>
+<div class="codehilite"><pre><span class="n">hadoop</span> <span
class="n">jar</span> <span class="n">crunch</span><span class="o">-</span><span
class="n">examples</span><span class="sr">/target/c</span><span
class="n">runch</span><span class="o">-</span><span
class="n">examples</span><span class="o">-*-</span><span
class="n">job</span><span class="o">.</span><span class="n">jar</span> <span
class="n">org</span><span class="o">.</span><span class="n">apache</span><span
class="o">.</span><span class="n">crunch</span><span class="o">.</span><span
class="n">examples</span><span class="o">.</span><span
class="n">WordCount</span> <span class="sr"><inputfile></span> <span
class="sr"><outputdir></span>
</pre></div>
+<p>There are three additional examples in the org.apache.crunch.examples
package AverageBytesByIP, TotalBytesByIP, and WordAggregationHBase.
+AverageBytesByIP and TotalBytesByIP take as input Common Log Format of which
there is a sample included:
+crunch-examples/src/main/resources/access_logs.tar.gz. WordAggregationHBase
requires an Apache HBase cluster but no input data.</p>
<h2 id="high-level-concepts">High Level Concepts</h2>
<h3 id="data-model-and-operators">Data Model and Operators</h3>
<p>Crunch is centered around three interfaces that represent distributed
datasets: <code>PCollection<T></code>, <code>PTable<K, V></code>,
and <code>PGroupedTable<K, V></code>.</p>