[ https://issues.apache.org/jira/browse/SOLR-3691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jan Høydahl updated SOLR-3691: ------------------------------ Attachment: SOLR-3691.patch First patch. Implements a new mode -Ddata=web which fetches and posts a web page to Solr, and optionally pulls out links from it (using SolrCell extractOnly=true) and crawls to N levels. This patch also implements recursion level support for files as well, plus optional delay. This is not - as with post.jar in general - intended as a production feature, but as a nice way for newbies to test posting web pages to Solr without an external crawler, to increase the OOTB experience™ > SimplePostTool: Mode for indexing a web page > -------------------------------------------- > > Key: SOLR-3691 > URL: https://issues.apache.org/jira/browse/SOLR-3691 > Project: Solr > Issue Type: Bug > Components: scripts and tools > Reporter: Jan Høydahl > Assignee: Jan Høydahl > Fix For: 4.0, 5.0 > > Attachments: SOLR-3691.patch > > > The simple post.jar tool should both show some sample code as well as aid > users in testing Solr from the command line. Missing is an easy way to index > a web page. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org