Hi, see comments inline below…
On Jun 2, 2014, at 6:49 AM, Vineet Mishra wrote:
> Hi Wolfgang,
>
> Thanks for your response. Could you share a working example of
> MapReduceIndexerTool
> for indexing CSV files?
> If you are referring to
> http://www.cloudera.com/content/cloudera-content
Hi Wolfgang,
Thanks for your response. Could you share a working example of
MapReduceIndexerTool
for indexing CSV files?
If you are referring to
http://www.cloudera.com/content/cloudera-content/cloudera-docs/Search/latest/Cloudera-Search-User-Guide/csug_mapreduceindexertool.html?scroll=cs
Sounds like you should consider using MapReduceIndexerTool. AFAIK, this is the
most scalable indexing (and merging) solution out there.
Wolfgang.
On Jun 2, 2014, at 10:33 AM, Vineet Mishra wrote:
> Hi Erick,
>
> Thanks for your mail. Let me walk you through my use case.
> I am having ar
Hi Erick,
Thanks for your mail. Let me walk you through my use case.
I have around 20-40 billion records to index, and each record has
around 200-400 fields. The data is sensor data, so it can easily be
stored as Integer or Float. Now, to index this huge amount of data, I am
going wi
Hi Otis,
I have to index a huge amount of data, on the order of billions of records.
Since indexing via HTTP POST would be slow due to network delay,
I am indexing through EmbeddedSolrServer to create an
index which I can later upload to different shards in SolrCloud, al
You can copy to the shards and use the mergeindexes command; the
MapReduceIndexerTool follows that approach.
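
As a rough illustration of the mergeindexes step mentioned above, here is a minimal sketch that only builds the CoreAdmin request URL and does not send it. The host, core name, and index path are hypothetical placeholders, not values from this thread; substitute the ones for your own shards.

```python
from urllib.parse import urlencode


def build_mergeindexes_url(solr_base_url, target_core, index_dirs):
    """Build a CoreAdmin mergeindexes request URL (the request is not sent)."""
    # CoreAdmin accepts one or more indexDir parameters naming the
    # offline-built index directories to merge into the target core.
    params = [("action", "mergeindexes"), ("core", target_core)]
    params += [("indexDir", d) for d in index_dirs]
    return f"{solr_base_url.rstrip('/')}/admin/cores?{urlencode(params)}"


# Hypothetical values, for illustration only.
url = build_mergeindexes_url(
    "http://localhost:8983/solr",
    "collection1_shard1_replica1",
    ["/data/offline/shard1/index"],
)
print(url)
```

The index directories must be readable from the Solr node that hosts the target core, and the merged documents typically become visible only after a commit on that core.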
But really, what is the higher-level use-case you're trying to support?
This feels a little like an XY problem. You could do things like
1> index to a different collection then use collectio
Hi,
On Wed, May 28, 2014 at 4:25 AM, Vineet Mishra wrote:
> Hi All,
>
> Has anyone tried building offline indexes with EmbeddedSolrServer and
> posting it to shards?
>
What do you mean by "posting it to shards"? How is that different than
copying them manually to the right location in FS?
Hi All,
Has anyone tried building offline indexes with EmbeddedSolrServer and
posting it to shards?
FYI, I am done building the indexes but am looking for a way to post these
index files to the shards.
Copying the indexes manually to each shard's replica is possible and is
working fine but I don