Re: Can this be achieved? (Was: document support for file system crawling)

2007-01-17 Thread Eivind Hasle Amundsen
(...) any enterprise interested in having a serious search solution (i.e. buy FAST, Autonomy or do open source Lucene) will want a custom solution (...) then let an integrator/consultancy firm/IT department do the actual implementation. So a search distribution as pointed out is somewhat meani

Re: Can this be achieved? (Was: document support for file system crawling)

2007-01-17 Thread Eivind Hasle Amundsen
(...) the point being that once they've got you using a monolithic application, it's a lot harder to stop using the whole thing all at once than it would be for you to stop using 1 of N mini-applications they provide. Well, FAST is composed of many small, modular products that can be replaced

Re: Can this be achieved? (Was: document support for file system crawling)

2007-01-16 Thread Eivind Hasle Amundsen
(...) http://andaluciajunta.es/aj-sea-.html This search engine will be based on Nutch in the next version. The special feature is that this main portal search engine has to search against the Solr-based BOJA index. Meaning Nutch will have to search the Solr index and not vice versa. Looks in

Re: Connectors, Parsers, Plugin architecture

2007-01-16 Thread Eivind Hasle Amundsen
: Solr aims at being an answer to "enterprise needs", by indexing : structured data for different applications. However I think that many : enterprises would like to be able to structure information themselves. that's exactly what Solr is about: letting a schema creator define what the structure

Can this be achieved? (Was: document support for file system crawling)

2007-01-16 Thread Eivind Hasle Amundsen
First: Please pardon the cross-post to solr-user for reference. I hope to continue this thread in solr-dev. Please answer to solr-dev. 1) more documentation (and possibly some locking configuration options) on how you can use Solr to access an index generated by the Nutch crawler (I think Thors

Connectors, Parsers, Plugin architecture

2007-01-15 Thread Eivind Hasle Amundsen
Hi, (I mentioned this on solr-user, but people didn't seem to respond.) It was claimed that Solr was probably not the right tool for indexing lots of different files (e.g. PDF files) across file systems, and that Nutch would be more appropriate. Does everyone agree with this opinion? Solr ai