Re: How to split DBInputFormat?

2011-01-03 Thread arv...@cloudera.com
Joan, The DataDrivenInputFormat is a better fit for moving large volumes of data as it generates WHERE clauses that help partition the data better. You could also use Sqoop that makes such large volume data migration between relational sources and HDFS a breeze

Re: From a newbie: Questions and will MapReduce fit our needs

2011-08-29 Thread arv...@cloudera.com
On Mon, Aug 29, 2011 at 2:04 AM, Per Steffensen wrote: > Can you point me to at good place to read about Sqoop. I only find > http://incubator.apache.org/projects/sqoop.html and > https://cwiki.apache.org/confluence/display/SQOOP. There is really not much > to find, about what Sqoop can do, how to

Re: From a newbie: Questions and will MapReduce fit our needs

2011-08-29 Thread arv...@cloudera.com
On Mon, Aug 29, 2011 at 2:04 AM, Per Steffensen wrote: > Can you point me to at good place to read about Sqoop. I only find > http://incubator.apache.org/projects/sqoop.html and > https://cwiki.apache.org/confluence/display/SQOOP. There is really not much > to find, about what Sqoop can do, how to