Re: Bizarre Search order request

2012-05-25 Thread Chris Lu
Nothing like this yet. But you don't need to do everything in one search request. You can send one search request to know that the match distribution for each document type, and then send 3 requests for 3 document types each. -- Chris Lu - Instant Scalable Full

Re: Mapping Lucene search results with a relational database

2012-07-03 Thread Chris Lu
Can you index the rule1 and rule2 fields into the documents, and when searching with the keywords, also append rule1:foo and rule2:bar to the query? Chris - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbs

Re: is there some dangerous bug in lucene?

2010-05-11 Thread Chris Lu
If you are using field cache for field A, and updating field A, isn't it normal that the field A is not updated? Field cache is keyed via index reader, it won't be efficient to reload the field cache for each updateDocument(). -- Chris Lu - Instant Scalable

Re: Will doc ids ever change if nothing is deleted?

2010-05-14 Thread Chris Lu
documents are added, the id is N+1. Of course, if some documents from other segments are merged. The documents in one segment will "lose" its doc id. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net

Re: Inserting data from multiple databases in same index

2010-07-22 Thread Chris Lu
several boxes and achieve sharded search. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title

Re: Databases

2010-07-23 Thread Chris Lu
-time data import. Or you would have to put a hook in your program to write new content to the index. Anyway, you can get it work, but maybe not as simple as you expected. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net

Re: Lucene applicability

2010-08-25 Thread Chris Lu
uld need a mechanism to get prepared and rebuild the index when you need to. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com

Re: Combine data from index and db before sorting and pagination

2010-09-01 Thread Chris Lu
ot;category_2", take doc5 and doc10 for example, after all the reindexing effort, the only changes is: "category_1": doc1,doc2. "category_2": doc3,doc4,doc5,doc7,doc8,doc10. Of course, to support this efficiently could be a big change, affecting all the nice

Re: Federated search with opensearch or proprietary APIs for Atlassian

2010-09-02 Thread Chris Lu
more flexible with the structure, even dealing with data beyond Atlassian products. I guess that's the reason Google did not rely on each website's own search mechanism. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.d

Re: does lucene support Database full text search

2010-09-10 Thread Chris Lu
Lucene does not support database directly. You need to pump data into Lucene. You can use DBSight, which has a built-in high performance crawler for any databases. It also has integrated Chinese analyzers, including IKAnalyzer, which is the best one I found so far. -- Chris Lu

Re: High frequency term for the searched query

2010-11-04 Thread Chris Lu
After you get the query object, you can use IndexSearcher's function docFreq(), like this final Set terms = new HashSet(); query = searcher.rewrite(query); query.extractTerms(terms); for(Term t : terms){ int frequency = irs.getSearcher().docFreq(t); } -- -- Chr

Re: High frequency term for the searched query

2010-11-04 Thread Chris Lu
After you get the query object, you can use IndexSearcher's function docFreq(), like this final Set terms = new HashSet(); query = searcher.rewrite(query); query.extractTerms(terms); for(Term t : terms){ int frequency = searcher.docFreq(t); } -- -- Chris Lu - In

Re: Ui Framework for lucene

2010-12-07 Thread Chris Lu
You can use DBSight for database search. You just need to give it one or several SQLs. And you can generate search result template, and it will manage the index for you. http://www.dbsight.net -- -- Chris Lu - Instant Scalable Full-Text Search On Any Database

Re: Distributing a Lucene application?

2011-03-22 Thread Chris Lu
does, this means the NY index could return results not found in NY database, correct? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http

Re: Distributing a Lucene application?

2011-03-24 Thread Chris Lu
p and running in several minutes. You can even embed a widget to put search UI to any page. btw, DBSight also has facet search. Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database S

Re: Is there any "Query" in Lucene can search the term, which is similar as "SQL-LIKE"?

2011-10-11 Thread Chris Lu
You need to analyze the search keyword with the same analyzer that's applied on the "content" field. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Datab

Re: Lucene Indexing DB records?

2008-08-22 Thread Chris Lu
-- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes DBSight customer, a

search for empty field?

2008-09-02 Thread Chris Lu
Is it possible to query for documents that have empty values for a field? Say need to find documents with category empty, I tried negative query: -category:* But it returns 0 document. I think "category:*" is basically match all, so this "-category:*" doesn't wor

Re: search for empty field?

2008-09-03 Thread Chris Lu
Thanks Erick for reminding me of this! I only need to validate a index and make sure the content are correctly retrieved and index doesn't have empty fields. So I'd better simply go through all document by id and check them directly. Thanks! -- Chris Lu ---

Re: search for empty field?

2008-09-03 Thread Chris Lu
term there. then all the unset bits are documents with empty fields. This should be kind of efficient. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes

Re: Lucene Memory Leak

2008-09-05 Thread Chris Lu
$CSIndexInput |- input of org.apache.lucene.index.SegmentTermEnum |- value of java.lang.ThreadLocal$ThreadLocalMap$Entry I am trying to track it down now. If anyone knows about it, please let me know. -- Chris Lu - Instant Scalable Full-Text

memory leak during Lucene Search

2008-09-06 Thread Chris Lu
lieve this will affect disk based index also. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index

Re: Building Relationships between documents?

2008-09-09 Thread Chris Lu
If you want to do it in just one search, yes, you have to put the Entities attributes into the documents. But you can search twice. The second time using values from the first search, say entitiy_id, to search the products. -- Chris Lu - Instant Scalable Full-Text Search

Re: memory leak during Lucene Search

2008-09-09 Thread Chris Lu
problem is fixed. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes

follow up of Lucene out of memory with RAMDirectory on J2EE environment

2008-09-10 Thread Chris Lu
a-dev @lucene.apache.org mailing list. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?

Re: follow up of Lucene out of memory with RAMDirectory on J2EE environment

2008-09-13 Thread Chris Lu
ce for him, and thanks Michael McCandless for fixing the problem s quickly. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsig

Re: Memory eaten up by String, Term and TermInfo?

2008-09-14 Thread Chris Lu
Can you try to update to the latest Lucene svn version, like yesterday? LUCENE-1383 was checked in yesterday. This patch is addressing a leak problem particular to J2EE applications. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http

Re: Lucene vs. Database

2008-10-01 Thread Chris Lu
e of the search result, just use database later on. But this isn't always correct. When you have 10 result per page, selecting the details from the database based on ids may not be that costly at all. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Ap

Re: Performance of never optimizing

2008-11-02 Thread Chris Lu
BTW: JIRA is great! -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_

Re: Can lucene search from multi-index directory like using FK in database?

2008-11-05 Thread Chris Lu
approach may not be that bad, although less performant. 2. You will need to create a little query parser to distribute words into two fields. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.db

Re: how to estimate how much memory is required to support the large index search

2008-11-17 Thread Chris Lu
Calculation looks right. But what's the "Index divisor" that you mentioned? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in

Re: how to estimate how much memory is required to support the large index search

2008-11-17 Thread Chris Lu
So looks like you are not really doing much sorting? This index divisor affects reader.terms(), but not too much with sorting. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene

duplication checking while indexing

2008-12-29 Thread Chris Lu
all contents are flushed to disk yet. Is it possible to query the not-yet-closed index? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http

Re: duplication checking while indexing

2008-12-29 Thread Chris Lu
Otis, thanks for the pointer. I think the question can be: How to access TermEnum or TermInfos during indexing. If this is possible, things would be easier. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo

Re: duplication checking while indexing

2008-12-30 Thread Chris Lu
er to look it up. But it doesn't feel right. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index

Re: Search Problem

2009-01-01 Thread Chris Lu
You need to let us know the analyzer you are using. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title

Re: Search Problem

2009-01-02 Thread Chris Lu
Basically Lucene stores analyzed tokens, and looks up for the matches based on the tokens. "Amin" after StandardAnalyzer is "amin", so you need to use new Term("body", "amin"), instead of new Term("body", "Amin"), to search. -- Chris Lu

Re: indexing database

2009-01-21 Thread Chris Lu
This is not a lucene question, but a jdbc question. The code is not releasing the jdbc connection, statement, and resultset, and what's worse, the code is creating new connections when paginating the results. -- Chris Lu - Instant Scalable Full-Text Search On Any Dat

Re: Faceted search with OpenBitSet/SortedVIntList

2009-02-07 Thread Chris Lu
To avoid creating a lot of objects and quickly throwing them away, you can adjust Eden memory size, or you can create a bunch of objects and try to re-use them. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo:

Re: Multiple indexes vs single index

2009-02-14 Thread Chris Lu
eally, especially your QPS is not so demanding. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?

Re: Optimal Solution for Unique Field Values

2009-02-15 Thread Chris Lu
I think you would need to 1) collect all the matching IDs for Field2=x 2) loop through Field1, for each Term's doc, collect the term if the term doc is in the matching IDs from step 1. This should be the fastest approach, pretty similar to what you suggested. -- Chr

Re: Merging database index with fulltext index

2009-02-28 Thread Chris Lu
I feel this may not be a good example. Since you can easily index field c, a, d and let Lucene to handle the filter "c = 'foo'" and the order by clause"order by a desc, d" -- Chris Lu - Instant Scalable Full-Text Search On Any D

Re: Merging database index with fulltext index

2009-02-28 Thread Chris Lu
Actually you can use DBSight(disclaimer:I work on it) to collect the data and keep them in sync. The free version has most the features and doesn't have size limit. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsigh

Re: Merging database index with fulltext index

2009-02-28 Thread Chris Lu
y, ranking, etc. I think you better try it first. It's faster to install it, select the content with your sql, and get the search up and running, than reading introduction materials. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://

Re: Restricting the result set with hierarchical ACL

2009-03-02 Thread Chris Lu
belongs to, including the sub groups. Approach 2 should be more flexible. I don't think a user will have that many groups exceeding the default 1024. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://sea

Re: Optimum way to find all document without particular field

2009-03-04 Thread Chris Lu
Allahbaksh, If you ONLY want to find all document with a particular field that is not null, you can loop through the TermEnum and TermDocs to find all the document ids. But this can not easily be combined with other queries. -- Chris Lu - Instant Scalable Full-Text

Re: Random sorting results

2009-03-21 Thread Chris Lu
Maybe you can adjust your ranking algorithm. For example, rank the most recent results higher? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes

Re: Syncing lucene index with a database

2009-03-26 Thread Chris Lu
re not sure the proper index structure yet. I think you can use DBSight Free version, to rapidly prototype and experiment with all these choices, without coding any XML etc. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.d

Re: i18n numbers

2009-03-26 Thread Chris Lu
Marcel, First of all, do you really want the user to search price:19.99 ? Maybe you should use some logic like price>=19.99? If so, you should use range query to handle this case. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: h

Re: Creating lucene index from databases

2009-03-31 Thread Chris Lu
kranthi, Maybe you should use DBSight Lite to get started and get familiar with Lucene features. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: _http://www.dbsight.net_ <http://www.dbsight.net/> demo: _http://search.dbsight.com_

Re: Can I run Lucene in google app engine?

2009-04-13 Thread Chris Lu
could be a good solution for small index with per-user data. 3) For large changing indexes, you need to find other solutions to maintain lucene index. My personal opinion is, finding a $20/month VPS hosting is far easier than changing the way to code. -- Chris Lu - I

Re: dbsight

2009-05-10 Thread Chris Lu
little Lucene specific configuration and some ready-to-use scaffolding. Compared to SOLR, DBSight's design is more like ruby-on-rails rather than a common web application. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site:

consistent ordering of multi-values in a field

2009-07-07 Thread Chris Lu
,value2}? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes DBSight

Re: consistent ordering of multi-values in a field

2009-07-07 Thread Chris Lu
That's great and thanks for the super fast answer! Another question if not thread-hijacking: Will the ordering of fields be preserved also? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo:

Re: Performance tips when creating a large index from database.

2009-10-22 Thread Chris Lu
, and you can adjust the number of threads for database queries and also for indexing to find out your optimal data pulling configuration. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo:

Re: Merging Indexes

2009-10-26 Thread Chris Lu
Pretty sure you can delete the small indexes after the merge. BTW: How long does your indexing and merging take respectively? -- -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com

Re: Facets

2009-11-03 Thread Chris Lu
. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes DBSight customer

Re: Creating tag clouds with lucene

2009-11-05 Thread Chris Lu
Isn't the tag cloud just another facet search? Only difference is the tag is multi-valued. Basically just go through the search results and find all unique tag values. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site:

Re: Creating tag clouds with lucene

2009-11-05 Thread Chris Lu
eally have cared about the frequency in the search results? DBSight uses the multi-valued facet search approach to do tag cloud. Maybe I can "cheat" it this way also... It does save some memory. -- Chris Lu - Instant Scalable Full-Text Search On Any Datab

Re: How to use Lucene to suppot quick search on huge databases where the primary content is of non textual format ?

2009-11-09 Thread Chris Lu
e for quick data access. For your exact matching, it may not help much. Creating and maintaining Lucene index is surely more coding than create a database index though. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net

Re: Lucene Java 3.0.0 RC1 now available for testing

2009-11-17 Thread Chris Lu
So will I need to use 2 fields, one filed is analyzed and the other field is binary, to replace one compressed fields previously? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com

Re: Lucene Analyzer that can handle C++ vs C#

2009-12-11 Thread Chris Lu
What we did in DBSight is to provide a reserved list of words for every Lucene Analyzer. This way you can handle any special characters like C++ and C#. Any common analyzers usually are not suitable for these special words. -- Chris Lu - Instant Scalable Full-Text

Re: Switching from Store.YES to Store.NO

2010-01-05 Thread Chris Lu
Just curious, will it be adjusted during indexing when merging segments? Thanks! -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http

IllegalArgumentException when IndexWriter.addDocument

2010-01-14 Thread Chris Lu
Seems a integer overflow problem? java.lang.IllegalArgumentException: Increment must be zero or greater: -472893952 at org.apache.lucene.analysis.tokenattributes.PositionIncrementAttributeImpl.setPositionIncrement(PositionIncrementAttributeImpl.java:58) at org.apache.lucene.analysis.StopFilt

Re: IllegalArgumentException when IndexWriter.addDocument

2010-01-14 Thread Chris Lu
Notes: I am using Lucene 3.0 Seems a integer overflow problem? java.lang.IllegalArgumentException: Increment must be zero or greater: -472893952 at org.apache.lucene.analysis.tokenattributes.PositionIncrementAttributeImpl.setPositionIncrement(PositionIncrementAttributeImpl.java:58) at org

Re: IllegalArgumentException when IndexWriter.addDocument

2010-01-14 Thread Chris Lu
tokenstream, it is really never reset even across multiple documents). So whenever a stopword occurs it get larger... - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Chris Lu [mailto:chris...@gmail.com

Re: Lucene as a primary datastore

2010-01-20 Thread Chris Lu
iliary data structure. It's only fast in one way, but could be slow in other ways. 3) The more robust approach is to pull data out of database, and create a Lucene index. In case something goes wrong, you can always pull data out again and create the index again. -- Chris Lu --

Re: Can't start Lucene App: java.io.FileNotFoundException with brand new directory

2010-01-24 Thread Chris Lu
Think from another approach: You can check whether the index exists or not by IndexReader.*indexExists <../../../../org/apache/lucene/index/IndexReader.html#indexExists%28java.io.File%29>*(), and then determine what you want to do with the IndexWriter constructor. -- -- Ch

Re: index a mysql database -blob field

2010-01-29 Thread Chris Lu
For blob, it is not so simple since BLOB could contain different file types, like HTML, pdf, word, zip file type. So besides getting results out via resultSet.getBlob() function, you will need to convert the binary stream into simple text strings. DBSight free version already can read the blog

Re: Scale Out

2010-02-08 Thread Chris Lu
Since you already have RMI interface, maybe you can parallel search on several nodes, collect the data, pick top ones, and send back results via RMI. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http

Re: Query about Query.ToString()

2010-02-17 Thread Chris Lu
XMLQueryParser is pretty good start. However, is it being maintained recently? I noticed many Query class are not supported, like PrefixQuery, or even PhraseQuery. Is it for some particular reason or simply lack of resource? -- Chris Lu - Instant Scalable Full-Text

Re: Query about Query.ToString()

2010-02-18 Thread Chris Lu
otocol buffer? Hopefully I only need to serialize it via query.toXML() or query.toBytes() and the parser can recognize the serialized forms. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search

Re: Improving Zend lucene search - general guidance?

2010-02-19 Thread Chris Lu
PHP doesn't have java like "static" variables, right? They are "stateless". All the information like term info that's loaded in the memory will be gone for the next search. You should use DBSight if you just have one week. -- Chris Lu --

Re: FastVectorHighlighter truncated queries

2010-02-23 Thread Chris Lu
This should be a common wildcard query highlighting problem. You will need to query.rewrite() first, and pass the result to the highlighter. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http

Re: If you could have one feature in Lucene...

2010-02-24 Thread Chris Lu
2 features: Search and serializeable Query class in java serializable object format, or XML, or json format. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database

Re: NAS vs SAN vs Server Disk RAID

2010-02-25 Thread Chris Lu
To my experience, some customers used SAN to store the index. It's pretty good and fast. This may be a good choice for you, but it's costly. -- -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net

Re: Lucene Challenge - sum, count, avg, etc.

2010-04-01 Thread Chris Lu
Hi, Michel, This has already been implemented in DBSight. Check it out! http://www.dbsight.net You can get sum, avg for Facet searches. And count is included in Facet search directly. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site

Re: Lucene Challenge - sum, count, avg, etc.

2010-04-01 Thread Chris Lu
ly? It doesn't seem to be an opensource project so I can't really consider it. - Mike aka...@gmail.com On Thu, Apr 1, 2010 at 5:00 AM, Chris Lu wrote: Hi, Michel, This has already been implemented in DBSight. Check it out! http://www.dbsight.net You can get sum, avg for Facet

Re: Lucene Challenge - sum, count, avg, etc.

2010-04-01 Thread Chris Lu
ching. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes DBSight custom

Re: Lucene Challenge - sum, count, avg, etc.

2010-04-01 Thread Chris Lu
For DBSight, the aggregated values are computed during run time. And the sorting on the computed aggregated values are done when displaying the results. The idea is, after the aggregation, the number of aggregated values are much much smaller. -- Chris Lu - Instant

Re: Lucene Challenge - sum, count, avg, etc.

2010-04-01 Thread Chris Lu
of rows are not really "that" big when everything is properly warmed up. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.db

Re: NPE in IndexReader

2007-08-21 Thread Chris Lu
subReaders); } However, segmentInfos are used in several places, causing NPEs. For example, in IndexReader.acquireWriteLock(), if (SegmentInfos.readCurrentVersion(directory) > segmentInfos.getVersion()) { So I think MultiReader.java need some adjustments. -- Chris Lu

Re: NPE in IndexReader

2007-08-21 Thread Chris Lu
Right now I am very confused. I agree segmentInfos are not needed in this case. But it's used in aquireWriteLock(). What should we do? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsigh

Re: How to speed-up index opening

2007-08-29 Thread Chris Lu
ensure no service down time. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title

Re: Large Index Architecture

2007-08-29 Thread Chris Lu
Index Partitioning should be a good idea. It'll save a lot of time on index merging, incremental indexing. Just my experience, partition size really depends on CPU, hard disk speed, and memory size. Nowadays with Core 2 Duo, 10G size for each chunk should be good. -- Chr

Re: Can a Lucene field be renamed in a Lucene index?

2007-08-29 Thread Chris Lu
The easiest solution would be to change the SQL to select Bar as Foo, ..., from your_table Use an alias and maintain everything as before. If it's not a solution, you may need to re-index everything. -- Chris Lu - Instant Scalable Full-Text Search On Any Dat

Re: Indexing in pieces?

2007-08-31 Thread Chris Lu
dedicated server for indexing and searching. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title

Re: Indexing in pieces?

2007-08-31 Thread Chris Lu
. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes On 8/31/07, Berlin

Re: Data in the Index [was: JdbcDirectory]

2007-09-04 Thread Chris Lu
rendering speed comparing to retrieve the rest data from database. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title

Re: Indexing Speed using Lucene 2.2

2007-09-07 Thread Chris Lu
Is it repeatable? Maybe the database is slower during that time. Myself didn't any major slowness when upgrading to Lucene 2.2. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com L

Re: Lucene

2007-09-13 Thread Chris Lu
Hi, Devinder, Lucene is agnostic of any database configuration. You need to pull data via jdbc out, and feed it to Lucene to create an index file, and then use Lucene API to search on it. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site

Re: Oracle-Lucene integration (OJVMDirectory and Lucene Domain Index) - LONG

2007-09-14 Thread Chris Lu
Hi, Joaquin, Very interested to know the indexing performance inside Oracle JVM, especially with large amount of data. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database

Re: lucene optimal configuration

2007-10-06 Thread Chris Lu
riables to fit your specific setup. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index

Re: indexing on NAS

2007-10-11 Thread Chris Lu
I have used Lucene on SAN in a federal project, works out great. It supports search clustering, where several other servers search on the shared index which is produced by another server. Only need to refresh the other searching servers' IndexSearcher after indexing is done. -- Chr

Re: use lucene as datastore?

2007-10-16 Thread Chris Lu
No experience on this. But there are two points I can think of: 1) you can use compressed field to store the text 2) use the hash code of the path as the key -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo

Re: lucene index in a cluster.

2007-10-17 Thread Chris Lu
To my experience from a project, using software DBSight, Lucene runs well on a SAN environment. No experience of a NFS, but I know many indexing errors caused by NFS. Maybe the new index delete policy helps in latest version of Lucene. -- Chris Lu - Instant Scalable Full

Re: lucene index in a cluster.

2007-10-17 Thread Chris Lu
http://lucene.apache.org/java/2_2_0/api/org/apache/lucene/index/IndexDeletionPolicy.html This only works for Lucene 2.2 and later, not available if you need to used Lucene 2.0. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http

Re: Meta- search descriptions

2007-10-23 Thread Chris Lu
Why not index their database directly? -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title

Re: Meta- search descriptions

2007-10-23 Thread Chris Lu
time to parse out the data. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?

Re: Problems while indexing

2007-10-31 Thread Chris Lu
Hi, Jan, You really need to be more specific about your configuration and error log. Lucene surely has been used on many large websites. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http

  1   2   3   >