RE: lucene based frameworks/servers: solr, nutch, compass - which one is for what????

2006-09-20 Thread Pasha Bizhan
-Applications-Lucene-Nutch/dp/159059687 0/sr=8-2/qid=1158770477/ref=pd_bbs_2/002-1809958-7770417?ie=UTF8s=books http://apress.com/book/bookDisplay.html?bID=10125 Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED

RE: Problems indexing large documents

2006-06-09 Thread Pasha Bizhan
#setMaxFieldLength(int) Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: how to craete index with particular ID

2006-05-31 Thread Pasha Bizhan
is stored in the text file. after searching i want to disply corresponding path from index file. How should i creat such index file ??? how should i proceed See http://wiki.apache.org/jakarta-lucene/LuceneFAQ for more details. Pasha Bizhan

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
)? Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
Hi, From: Amaresh Kumar Yadav [mailto:[EMAIL PROTECTED] do we need some setting in any jsp or other file for document ??? You need to specify Stored (Store.YES) attribute for a field during indexing. Pasha Bizhan

RE: return document name as null Please help

2006-05-30 Thread Pasha Bizhan
, fieldValue, Field.Store.YES, Field.Index.UN_TOKENIZED)); - Or Could you please provide your source code? How do you index your data? Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED

RE: Lucene Seaches VS. Relational database Queries

2006-04-13 Thread Pasha Bizhan
: select titles, yesr, etc from titles where 'Assistant Producer' like '%%. is full-text search query. And Lucene can help you in this case. Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail

RE: TooManyClauses exception in Lucene (1.4)

2006-03-16 Thread Pasha Bizhan
to highlight the results. Filter does not. Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Lucene Scoring

2006-03-08 Thread Pasha Bizhan
/api/org/apache/lucene/search/Similarity.h tml And Lucene Book: - 3.3 : Understanding Lucene scoring http://lucenebook.com/search?query=scoring Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e

RE: Question

2006-03-07 Thread Pasha Bizhan
starting with Ab*. So i don't need the hole document and i need this information realy fast. It was implemented by David Spencer. See http://searchmorph.com/experiments.php. Unfortunately online demo is unavailable now :( Pasha Bizhan

RE: Question

2006-03-07 Thread Pasha Bizhan
starting with Ab*. So i don't need the hole document and i need this information realy fast. It was implemented by David Spencer. See http://searchmorph.com/experiments.php. Unfortunately online demo is unavailable now :( Pasha Bizhan

RE: Question

2006-03-07 Thread Pasha Bizhan
://web.archive.org/web/20050306065912/http://www.searchmorph.com/weblog/ index.php?id=26 Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Duplicates recods in index

2006-02-09 Thread Pasha Bizhan
for every single document if the new data doesn't contain dupes. In any case, we use the following algorithm: - remove dupes from new data (it's possible) - open writer, open reader/searcher - use reader/searcher to check for dupes - use writer to add new documents - close all Pasha Bizhan

RE: Duplicates recods in index

2006-02-08 Thread Pasha Bizhan
with given id and name values. String query = +id:( + doc.get(id) + ) +name:( + doc.get(name) + ); Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Boolean Query

2005-12-28 Thread Pasha Bizhan
Term(title,ginger)); BooleanQuery q1 = new BooleanQuery(); q1.add(t3,false,false); q1.add(t2,false,false); Q1 is (t2 OR t3) now. BooleanQuery q2 = new BooleanQuery(); q2.add(t1,true,false); q2.add(q1,true,false); Q2 is (t1 AND Q1). I guess. Pasha Bizhan

RE: Wildcard

2005-12-02 Thread Pasha Bizhan
-archive.com/lucene-user@jakarta.apache.org/msg12254.html And http://www.mail-archive.com/lucene-user@jakarta.apache.org/msg12268.html If I have an item 108585-123 And another 332323-123 How can I look for all the -123 family of items? Pasha Bizhan

RE: Extending 'sealed' classes usage in comercial solution.

2005-10-25 Thread Pasha Bizhan
/licence-FAQ.html for more details. Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Document ID

2005-06-25 Thread Pasha Bizhan
(document); Etc.. It's not rise a claim. I don't know a good unified solution for this problem :( Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Displaying relevant text with Search results

2005-06-13 Thread Pasha Bizhan
this? You need the Highlighter package. http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/contrib/highlighter/src/ java/org/apache/lucene/search/highlight/package.html?view=markup http://svn.apache.org/viewcvs.cgi/lucene/java/trunk/contrib/highlighter/ Pasha Bizhan

RE: Indexes auto creation

2005-06-13 Thread Pasha Bizhan
= ! (IndexReader.indexExists(directory)); writer = new IndexWriter(directory, analyzer, createIndex ); Pasha Bizhan http://lucenedotnet.com - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: Search Theory Book

2005-05-12 Thread Pasha Bizhan
Hi, Managing Gigabytes http://www.amazon.com/exec/obidos/tg/detail/-/1558605703/qid=1115898416/sr=8 -1/ref=pd_csp_1/104-0210366-8377506?v=glances=booksn=507846 Pasha Bizhan http://lucenedotnet.com -Original Message- From: Anna Bing [mailto:[EMAIL PROTECTED] Sent: Thursday, May 12

RE: Wildcard searching with Highlight support ?

2005-04-19 Thread Pasha Bizhan
Hi, From: Eric Chow [mailto:[EMAIL PROTECTED] I mean if I use wildcard query, it cannot highlight any terms ? Any idea to do this or any existing example ? Try to rewrite query before highlighting. Pasha Bizhan

RE: getting the number of occurrences within a document

2005-04-14 Thread Pasha Bizhan
to do that? See IndexReader.TermDocs(Term t) and TermDocs.Freq(). Pasha Bizhan - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: new added documents not showing

2005-03-21 Thread Pasha Bizhan
Hi, From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] I just realized that the way I described our process was off a little bit. Here's the process again: I apologize for the amount of code below. When do you open the index writer? Where is the code? Pasha Bizhan http