Searching context within a book

2012-02-06 Thread pistacchio
I'm very new to Solr and I'm evaluating it. My task is to look for words
within a corpus of books and return them within a small context. So far, I'm
storing the books in a database split by paragraphs (slicing the books by
line breaks), I do a fulltext search and return the row.

In Solr, would I have to do the same, or can I add the whole book (in .txt
format) and, whenever a match is found, return something like the match plus
100 words before and 100 words after or something like that? Thanks

--
View this message in context: 
http://lucene.472066.n3.nabble.com/Searching-context-within-a-book-tp3718997p3718997.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Searching context within a book

2012-02-06 Thread Robert Stewart
You are probably better off splitting up each book into separate SOLR 
documents, one document per paragraph (each document with same book ID,  ISBN, 
etc.).  Then you can use field-collapsing on the book ID to return a single 
document per book.  And you can use highlighting to show the paragraph that 
matched the query.
You will need to store the full-text in SOLR in order to use highlighting 
feature and/or to return the text in the search results.


On Feb 6, 2012, at 2:13 AM, pistacchio wrote:

 I'm very new to Solr and I'm evaluating it. My task is to look for words
 within a corpus of books and return them within a small context. So far, I'm
 storing the books in a database split by paragraphs (slicing the books by
 line breaks), I do a fulltext search and return the row.
 
 In Solr, would I have to do the same, or can I add the whole book (in .txt
 format) and, whenever a match is found, return something like the match plus
 100 words before and 100 words after or something like that? Thanks
 
 --
 View this message in context: 
 http://lucene.472066.n3.nabble.com/Searching-context-within-a-book-tp3718997p3718997.html
 Sent from the Solr - User mailing list archive at Nabble.com.