Re: Recursively scan documents for indexing in a folder in SolrJ

2015-10-19 Thread Zheng Lin Edwin Yeo
t; -Original Message- > From: Zheng Lin Edwin Yeo [mailto:edwinye...@gmail.com] > Sent: 17 October 2015 00:55 > To: solr-user@lucene.apache.org > Subject: Re: Recursively scan documents for indexing in a folder in SolrJ > > Thanks for your advice. I also found this method

RE: Recursively scan documents for indexing in a folder in SolrJ

2015-10-19 Thread Duck Geraint (ext) GBJH
ngenta.com -Original Message- From: Zheng Lin Edwin Yeo [mailto:edwinye...@gmail.com] Sent: 17 October 2015 00:55 To: solr-user@lucene.apache.org Subject: Re: Recursively scan documents for indexing in a folder in SolrJ Thanks for your advice. I also found this method which so far has been

Re: Recursively scan documents for indexing in a folder in SolrJ

2015-10-16 Thread Zheng Lin Edwin Yeo
ng-with-solrj/ > > Geraint > > > Geraint Duck > Data Scientist > Toxicology and Health Sciences > Syngenta UK > Email: geraint.d...@syngenta.com > > -Original Message- > From: Jan Høydahl [mailto:jan@cominvent.com] > Sent: 16 October 2015 12:14 &g

RE: Recursively scan documents for indexing in a folder in SolrJ

2015-10-16 Thread Duck Geraint (ext) GBJH
[mailto:jan@cominvent.com] Sent: 16 October 2015 12:14 To: solr-user@lucene.apache.org Subject: Re: Recursively scan documents for indexing in a folder in SolrJ SolrJ does not have any file crawler built in. But you are free to steal code from SimplePostTool.java related to directory traversal, and

Re: Recursively scan documents for indexing in a folder in SolrJ

2015-10-16 Thread Jan Høydahl
SolrJ does not have any file crawler built in. But you are free to steal code from SimplePostTool.java related to directory traversal, and then index each document found using SolrJ. Note that SimplePostTool.java tries to be smart with what endpoint to post files to, xml, csv and json content wi

Recursively scan documents for indexing in a folder in SolrJ

2015-10-15 Thread Zheng Lin Edwin Yeo
Hi, I understand that in SimplePostTool (post.jar), there is this command to automatically detect content types in a folder, and recursively scan it for documents for indexing into a collection: bin/post -c gettingstarted afolder/ This has been useful for me to do mass indexing of all the files t