Nutch 2.2.1 with Map Reduce

Thomas COUDERC Thu, 10 Oct 2013 05:18:35 -0700

Hi everybody,

I'm new to mailing lists so excuse me if I made a mistake. Also, I'm a new
dev contributor for Sauce Labs and DynamoDB subjects.


I read all tutorials for Nutch 2.x and I made Nutch 2.2.1 working with
cassandra 1.2.8 using gora 0.3.
I read that Nutch 2.2.1 (and previous versions) can be run on a Hadoop
cluster.
I also know that gora manage some map reduce operations for backend.

I have two questions :

1/ If Nutch is deployed on a Map Reduce cluster, and for example Hbase is
used as datastore,  where are the Map Reduce tasks distributed? Nutch
hadoop cluster or HBase (via Gora).
2/ In my case, I use Cassandra standalone. If I deploy Nutch 2.2.1 on an
Hadoop Cluster, how many of Nutch can fetch URLs? (1 or all?)

Thank you for helping me., and excuse me for my poor English.

Thomas
Nous vous rappelons que les résultats de Médiamétrie sont et demeurent sa 
propriété : ils sont protégés au double 
titre du droit d'auteur et de la protection des bases de données.
Ce message est confidentiel et établi à 
l'intention de ses destinataires.
Tout message électronique étant susceptible d'altération,
la société Médiamétrie 
décline toute responsabilité s'il a été altéré, déformé ou falsifié.


We remind you that the results produced by Médiamétrie are and remain its sole 
property covered by both copyright 
and databases protection.
This message is confidential and intended solely for the adressees.
E-mails are susceptible 
to alteration.
Neither Médiamétrie company shall be liable for the message if altered, changed 
or falsified.

Nutch 2.2.1 with Map Reduce

Reply via email to