And one other consideration in addition to the two excellent responses so far....
In a SolrCloud environment, SolrJ via CloudSolrServer will automatically route the documents to the correct shard leader, saving some additional overhead. Post.jar and cURL send the docs to a node, which in turn forward the docs to the correct shard leader which lowers throughput.... Best, Erick On Tue, Oct 28, 2014 at 2:32 PM, "Jürgen Wagner (DVT)" <juergen.wag...@devoteam.com> wrote: > Hello Olivier, > for real production use, you won't really want to use any toys like > post.jar or curl. You want a decent connector to whatever data source there > is, that fetches data, possibly massages it a bit, and then feeds it into > Solr - by means of SolrJ or directly into the web service of Solr via binary > protocols. This way, you can properly handle incremental feeding, processing > of data from remote locations (with the connector being closer to the data > source), and also source data security. Also think about what happens if you > do processing of incoming documents in Solr. What happens if Tika runs out > of memory because of PDF problems? What if this crashes your Solr node? In > our Solr projects, we generally do not do any sizable processing within Solr > as document processing and document indexing or querying have all different > scaling properties. > > "Production use" most typically is not achieved by deploying a vanilla Solr, > but rather having a bit more glue and wrappage, so the whole will fit your > requirements in terms of functionality, scaling, monitoring and robustness. > Some similar platforms like Elasticsearch try to alleviate these pains of > going to a production-style infrastructure, but that's at the expense of > flexibility and comes with limitations. > > For proof-of-concept or demonstrator-style applications, the plain tools out > of the box will be fine. For production applications, you want to have more > robust components. > > Best regards, > --Jürgen > > > On 28.10.2014 22:12, Olivier Austina wrote: > > Hi All, > > I am reading the solr documentation. I have understood that post.jar > <http://wiki.apache.org/solr/ExtractingRequestHandler#SimplePostTool_.28post.jar.29> > is not meant for production use, cURL > <https://cwiki.apache.org/confluence/display/solr/Introduction+to+Solr+Indexing> > is not recommanded. Is SolrJ better for production? Thank you. > Regards > Olivier > > > > -- > > Mit freundlichen Grüßen/Kind regards/Cordialement vôtre/Atentamente/С > уважением > i.A. Jürgen Wagner > Head of Competence Center "Intelligence" > & Senior Cloud Consultant > > Devoteam GmbH, Industriestr. 3, 70565 Stuttgart, Germany > Phone: +49 6151 868-8725, Fax: +49 711 13353-53, Mobile: +49 171 864 1543 > E-Mail: juergen.wag...@devoteam.com, URL: www.devoteam.de > > ________________________________ > Managing Board: Jürgen Hatzipantelis (CEO) > Address of Record: 64331 Weiterstadt, Germany; Commercial Register: > Amtsgericht Darmstadt HRB 6450; Tax Number: DE 172 993 071 > >