[ https://issues.apache.org/jira/browse/SOLR-579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12598586#action_12598586 ]
Patrick Debois commented on SOLR-579: ------------------------------------- I can understand the point of view. It is indeed an example, still most people I know prefer Solr because it provides so much power out of the box. Lucene is a library, and that is IMHO different. -The encoding thing and parsing of a valid XML file are in the source code as comment. If it were merely for the demo files, this would not make sense either as they would always be correct. -Another use of the threads and commits would be usefull for a kind of benchmarking tool? But the maybe we should name the tool SimpleBenchmark ;-) > Extend SimplePost with RecurseDirectories, threads, document encoding , > number of docs per commit > ------------------------------------------------------------------------------------------------- > > Key: SOLR-579 > URL: https://issues.apache.org/jira/browse/SOLR-579 > Project: Solr > Issue Type: New Feature > Affects Versions: 1.3 > Environment: Applies to all platforms > Reporter: Patrick Debois > Priority: Minor > Original Estimate: 72h > Remaining Estimate: 72h > > -When specifying a directory, simplepost should read also the contents of a > directory > New options for the commandline (some only usefull in DATAMODE= files) > -RECURSEDIRS > Recursive read of directories as an option, this is usefull for > directories with a lot of files where the commandline expansion fails and > xargs is too slow > -DOCENCODING (default = system encoding or UTF-8) > For non utf-8 clients , simplepost should include a way to set the > encoding of the documents posted > -THREADSIZE (default =1 ) > For large volume posts, a threading pool makes sense , using JDK 1.5 > Threadpool model > -DOCSPERCOMMIT (default = 1) > Number of documents after which a commit is done, instead of only at > the end > Note: not to break the existing behaviour of the existing SimplePost tool > (post.sh) might be used in scripts -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.