Re: shard splitting and indexes on hadoop

2024-09-04 Thread Roberto Maggi @ Debian
Hi David, it is very likely I have less experience in both piece of software, but if we could help some one saving time and headaches, why not. How would you suggest to proceed? Are you somehow involved in the projects? I could send you my recipes to get a feedback of yours? Just let me know,

Re: shard splitting and indexes on hadoop

2024-08-30 Thread Arrieta, Alejandro
Hello, in short 1) when you have collection in local filesystem you have your shard folders in solr.home (collectionnamelocalfs 1 shard 1 replica example): /var/lib/solr/collectionnamelocalfs_shard1_replica_n1/core.properties /var/lib/solr/collectionnamelocalfs_shard1_replica_n1/data /var/lib/sol

Re: shard splitting and indexes on hadoop

2024-08-30 Thread David Eric Pugh
I don't have much Hadoop experience, so probably can't help.  Having said that, if you figure some things out and want to augment the ref guide, I would be happy to work with you on it.  Maybe even write another tutorial similar to https://solr.apache.org/guide/solr/latest/getting-started/solr-

Re: shard splitting and indexes on hadoop

2024-08-30 Thread Roberto Maggi @ Debian
Hi Eric, thanks for your interesting. I'm using Solr9, this doc to setup the cluster https://apache.github.io/hadoop/hadoop-project-dist/hadoop-common/ClusterSetup.html and the one you quoted for the "collaboration" with hadoop. the collections and the relative indexes are correctly present

Re: shard splitting and indexes on hadoop

2024-08-29 Thread David Eric Pugh
Roberto, I'm hoping the community shares some knowledge, as it's not an area I am familiar with, and I'd love to see more content added to the Ref Guide. You are using Solr 9 I think?  Is this with using the https://solr.apache.org/guide/solr/latest/deployment-guide/solr-on-hdfs.html approach?

shard splitting and indexes on hadoop

2024-08-29 Thread Roberto Maggi @ Debian
Hi you all, I'm still new to solr and hadoop and I can't find an answer  to this question that rose in me. In a multi cluster setup with 3 solr9 hosts and and 3 hadoop datanodes I'm wondering where and how the data is stored. If I instruct the creation of a collection with 3 shard splitting,