Re: Offline Indexes Update to Shard

2014-06-03 Thread Wolfgang Hoschek
Hi, see comments inline below…
On Jun 2, 2014, at 6:49 AM, Vineet Mishra wrote:
> Hi Wolfgang,
>
> Thanks for your response, can you quote some running example of MapReduceIndexerTool for indexing through csv files.
> If you are referring to http://www.cloudera.com/content/cloudera-content

Re: Offline Indexes Update to Shard

2014-06-02 Thread Vineet Mishra
Hi Wolfgang, thanks for your response. Can you share a working example of MapReduceIndexerTool for indexing CSV files? If you are referring to http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_mapreduceindexertool.html?scroll=cs

Re: Offline Indexes Update to Shard

2014-06-02 Thread Wolfgang Hoschek
Sounds like you should consider using MapReduceIndexerTool. AFAIK, this is the most scalable indexing (and merging) solution out there.
Wolfgang.
On Jun 2, 2014, at 10:33 AM, Vineet Mishra wrote:
> Hi Erick,
>
> Thanks for your mail, please let me go through with my use case.
> I am having ar

Re: Offline Indexes Update to Shard

2014-06-02 Thread Vineet Mishra
Hi Erick, thanks for your mail; let me walk you through my use case. I have around 20-40 billion records to index, each with around 200-400 fields. The data is sensor data, so it can easily be stored as Integer or Float. Now, to index this huge amount of data I am going wi
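For a rough sense of scale, the figures quoted above can be turned into a back-of-the-envelope estimate of the raw numeric payload. This is only a sketch: the 4 bytes per value is an assumption (a 32-bit Integer or Float), and the result says nothing about the on-disk size of the resulting Lucene index, which depends on the schema and on which fields are indexed or stored.

```python
# Rough estimate of raw numeric payload, not of the final index size.
# Assumption: every field value fits in 4 bytes (32-bit Integer or Float).
BYTES_PER_VALUE = 4


def raw_size_bytes(records, fields_per_record, bytes_per_value=BYTES_PER_VALUE):
    """Return the raw size in bytes of records * fields * bytes per value."""
    return records * fields_per_record * bytes_per_value


# Lower and upper bounds taken from the message: 20-40 billion records,
# 200-400 fields per record.
low = raw_size_bytes(20_000_000_000, 200)
high = raw_size_bytes(40_000_000_000, 400)

print(f"{low / 1e12:.0f} TB to {high / 1e12:.0f} TB of raw values")
```

Under these assumptions the raw values alone come to roughly 16 TB at the low end and 64 TB at the high end (decimal terabytes).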

Re: Offline Indexes Update to Shard

2014-06-02 Thread Vineet Mishra
Hi Otis, I have to index a huge amount of data, on the order of billions of records. Since indexing via HTTP POST would be slow due to network delay, I am indexing through EmbeddedSolrServer to create an index that I can later upload to different shards in SolrCloud, al

Re: Offline Indexes Update to Shard

2014-05-30 Thread Erick Erickson
You can copy to the shards and use the mergeindexes command; the MapReduceIndexerTool follows that approach. But really, what is the higher-level use case you're trying to support? This feels a little like an XY problem. You could do things like 1> index to a different collection then use collectio
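The "copy to the shards and merge" step mentioned above can be sketched as a call to Solr's CoreAdmin `mergeindexes` action, which takes a target `core` and one or more `indexDir` parameters. The snippet below only builds the request URL and sends nothing. The host, core name, and index paths are hypothetical placeholders, and `merge_indexes_url` is an illustrative helper, not part of any Solr client library.

```python
from urllib.parse import urlencode


def merge_indexes_url(base_url, target_core, index_dirs):
    """Build a CoreAdmin mergeindexes URL for one target core.

    base_url, target_core and index_dirs are placeholders supplied by the
    caller; the index directories must be readable by the Solr node that
    hosts the target core.
    """
    params = [("action", "mergeindexes"), ("core", target_core)]
    # One indexDir parameter per offline index directory to merge in.
    params += [("indexDir", d) for d in index_dirs]
    return f"{base_url}/admin/cores?{urlencode(params)}"


# Hypothetical values for illustration only.
url = merge_indexes_url(
    "http://localhost:8983/solr",
    "collection1_shard1_replica1",
    ["/data/offline/index-part-0", "/data/offline/index-part-1"],
)
print(url)
```

After a merge of this kind, a commit on the target core is needed before the merged documents become visible to searches. Note that merging does not route documents by shard key, so each offline index should already contain only the documents that belong to its target shard.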

Re: Offline Indexes Update to Shard

2014-05-29 Thread Otis Gospodnetic
Hi,
On Wed, May 28, 2014 at 4:25 AM, Vineet Mishra wrote:
> Hi All,
>
> Has anyone tried with building Offline indexes with EmbeddedSolrServer and posting it to Shards.
>
What do you mean by "posting it to shards"? How is that different than copying them manually to the right location in FS?

Offline Indexes Update to Shard

2014-05-28 Thread Vineet Mishra
Hi All, has anyone tried building offline indexes with EmbeddedSolrServer and posting them to shards? FYI, I am done building the indexes but am looking for a way to post these index files to the shards. Copying the indexes manually to each shard's replica is possible and works fine, but I don