Re: Question on the appropriate software

2011-07-20 Thread Matthew Twomey
ndler, FileListEntityProcessor and TikaEntityProcessor. I don't quite think Nutch is the tool here. You'll be wanting to do highlighting and a couple of other things You'll spend some time tweaking results to be what you want, but this is certainly do-able. Best Erick On Tue, Jul 19, 2011

Re: Solr not returning results for some key words

2011-07-20 Thread Matthew Twomey
Ok, apparently I'm not the first to have fallen prey to maxFieldLength gotcha: http://lucene.472066.n3.nabble.com/Solr-ignoring-maxFieldLength-td473263.html All fixed now. -Matt On 07/20/2011 07:13 PM, Matthew Twomey wrote: Greetings, I'm having trouble getting Solr to return r

Solr not returning results for some key words

2011-07-20 Thread Matthew Twomey
Greetings, I'm having trouble getting Solr to return results for key words that I know for sure are in the index. As a test, I've indexed a PDF of a book on Java. I'm trying to search the index for "UnsupportedOperationException" but I get no results. I can "see" it in the index though: ###

Question on the appropriate software

2011-07-19 Thread Matthew Twomey
Greetings, I'm interesting in having a server based personal document library with a few specific features and I'm trying to determine what the most appropriate tools are to build it. I have the following content which I wish to include in the archive: 1. A smallish collection of technical b