Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-11 Thread Ted Dunning
hough a combiner in the final m/r pass would really speed > up the hadoop shuffle. > > Lance > > From: "Ted Dunning" > To: user@hadoop.apache.org > Sent: Wednesday, October 10, 2012 11:13:36 PM > Subject: Re: Hadoop/Lucene + Solr architecture suggesti

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-11 Thread Lance Norskog
. It's a disk bandwidth v.s. network bandwidth problem. Although a combiner in the final m/r pass would really speed up the hadoop shuffle. Lance From: "Ted Dunning" To: user@hadoop.apache.org Sent: Wednesday, October 10, 2012 11:13:36 PM Subject: Re: Hadoo

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread Ted Dunning
months ago at least... >> >> Thanks, >> --tim >> >> > - Original Message - >> > | From: "Ted Dunning" < tdunn...@maprtech.com> >> > | To: user@hadoop.apache.org >> > | Cc: "Hadoop User" < user@hadoop.apache.org>

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread JAY
o work with a decent-sized (e.g. > > 250 node) cluster? Even a trivial cluster seems to crush SolrCloud > from a few months ago at least... > > Thanks, > --tim > > > - Original Message - > > | From: "Ted Dunning" > > | To: user@hadoop.apache

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread M. C. Srivas
: user@hadoop.apache.org > > | Cc: "Hadoop User" > > | Sent: Wednesday, October 10, 2012 7:58:57 AM > > | Subject: Re: Hadoop/Lucene + Solr architecture suggestions? > > | > > | I prefer to create indexes in the reducer personally. > > | > > | Al

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread Tim Williams
Sent: Wednesday, October 10, 2012 7:58:57 AM > | Subject: Re: Hadoop/Lucene + Solr architecture suggestions? > | > | I prefer to create indexes in the reducer personally. > | > | Also you can avoid the copies if you use an advanced hadoop-derived > | distro. Email me off list for de

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread Mark Kerzner
- Original Message - > | From: "Ted Dunning" > | To: user@hadoop.apache.org > | Cc: "Hadoop User" > | Sent: Wednesday, October 10, 2012 7:58:57 AM > | Subject: Re: Hadoop/Lucene + Solr architecture suggestions? > | > | I prefer to create indexes i

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread Lance Norskog
ednesday, October 10, 2012 7:58:57 AM | Subject: Re: Hadoop/Lucene + Solr architecture suggestions? | | I prefer to create indexes in the reducer personally. | | Also you can avoid the copies if you use an advanced hadoop-derived | distro. Email me off list for details. | | Sent from my iPhone |

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-10 Thread Ted Dunning
I prefer to create indexes in the reducer personally. Also you can avoid the copies if you use an advanced hadoop-derived distro. Email me off list for details. Sent from my iPhone On Oct 9, 2012, at 7:47 PM, Mark Kerzner wrote: > Hi, > > if I create a Lucene index in each mapper, locally,

Re: Hadoop/Lucene + Solr architecture suggestions?

2012-10-09 Thread Ivan Frain
Hi Mark, I don't know Lucene/Solr very well but your question made me remember the lily project: http://www.lilyproject.org/lily/index.html. They use hadoop/hbase and solr to provide a searchable data management platform. Maybe you will find ideas in their documentation. BR, Ivan 2012/10/10 Mar

Hadoop/Lucene + Solr architecture suggestions?

2012-10-09 Thread Mark Kerzner
Hi, if I create a Lucene index in each mapper, locally, then copy them to under /jobid/mapid1, /jodid/mapid2, and then in the reducers copy them to some Solr machine (perhaps even merging), does such architecture makes sense, to create a searchable index with Hadoop? Are there links for similar a