absolute search

2009-12-24 Thread Olala
I have another question that if I query "book", solr will response only "book" but do not response another value such as book mark, book-keeper, guide-book, etc. Thanks so much! -- View this message in context: http://old.nabble.com/absolute-search-tp26911441p26911441.html Sent from the Solr -

Re: absolute search

2009-12-24 Thread Shalin Shekhar Mangar
On Thu, Dec 24, 2009 at 1:49 PM, Olala wrote: > > I have another question that if I query "book", solr will response only > "book" but do not response another value such as book mark, book-keeper, > guide-book, etc. > > What is the type of the field you are searching on? Have you tried a text typ

Re: absolute search

2009-12-24 Thread Olala
Yes, I am searching on Text fieldtype :) Shalin Shekhar Mangar wrote: > > On Thu, Dec 24, 2009 at 1:49 PM, Olala wrote: > >> >> I have another question that if I query "book", solr will response only >> "book" but do not response another value such as book mark, book-keeper, >> guide-book, et

Expanding synonyms for both facets and response results

2009-12-24 Thread herceg_novi
Hello, I have used synonyms to provide translation from my keys to appropriate English text. When using facets, the field appropriately displays the synonym field: 2 but my result response does not translate the ID to text: 100 (I was expecting "One

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread AHMET ARSLAN
> I have used synonyms to provide translation from my keys to > appropriate > English text. When using facets, the field appropriately > displays the > synonym field: >   >     >       2 >     > > but my result response does not translate the ID to text: >         > >     name="apt_type_e

Re: absolute search

2009-12-24 Thread Erick Erickson
Could you show your schema.xml definitions for the field in question? It looks like you're somehow indexing without tokenizing, but that's a guess Also, what does adding debugQuery=on show? Thanks Erick On Thu, Dec 24, 2009 at 3:40 AM, Olala wrote: > > Yes, I am searching on Text fieldtype

SOLR Performance Tuning: Pagination

2009-12-24 Thread Fuad Efendi
I used pagination for a while till found this... I have filtered query ID:[* TO *] returning 20 millions results (no faceting), and pagination always seemed to be fast. However, fast only with low values for start=12345. Queries like start=28838540 take 40-60 seconds, and even cause OutOfMemoryEx

weird sorting behavior

2009-12-24 Thread Joel Nylund
I have a field: required="false"/> sortMissingLast="true" omitNorms="true"> When I sort it using titles that are alphanumeric it works great, but if the titles start with numbers, it almost seems

Re: SOLR Performance Tuning: Pagination

2009-12-24 Thread Grant Ingersoll
On Dec 24, 2009, at 11:09 AM, Fuad Efendi wrote: > I used pagination for a while till found this... > > > I have filtered query ID:[* TO *] returning 20 millions results (no > faceting), and pagination always seemed to be fast. However, fast only with > low values for start=12345. Queries like

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread herceg_novi
Ahmet Arslan wrote: > > Faceting is done on tokens. value (not analyzed) of documents. > Can you explain more what are you trying to do? > I am working on a multi language website. I need to perform faceting on the exact same records, but need to display facet values in appropriate languages.

Re: SOLR Performance Tuning: Pagination

2009-12-24 Thread Walter Underwood
When do users do a query like that? --wunder On Dec 24, 2009, at 8:09 AM, Fuad Efendi wrote: > I used pagination for a while till found this... > > > I have filtered query ID:[* TO *] returning 20 millions results (no > faceting), and pagination always seemed to be fast. However, fast only with

Re: SOLR Performance Tuning: Pagination

2009-12-24 Thread Joe Calderon
fwiw, when implementing distributed search i ran into a similar problem, but then i noticed even google doesnt let you go past page 1000, easier to just set a limit on start On Thu, Dec 24, 2009 at 8:36 AM, Walter Underwood wrote: > When do users do a query like that? --wunder > > On Dec 24, 200

Re: Solr 1.4 - stats page slow

2009-12-24 Thread Jay Hill
I've noticed this as well, usually when working with a large field cache. I haven't done in-depth analysis of this yet, but it seems like when the stats page is trying to pull data from a large field cache it takes quite a long time. Are you doing a lot of sorting? If so, what are the field types

Re: Solr 1.4 - stats page slow

2009-12-24 Thread Jay Hill
Also, what is your heap size and the amount of RAM on the machine? I've also noticed that, when watching memory usage through JConsole or YourKit while loading the stats page, the memory usage spikes dramatically - are you seeing this as well? -Jay On Thu, Dec 24, 2009 at 9:12 AM, Jay Hill wrot

Re: weird sorting behavior

2009-12-24 Thread Joel Nylund
update, I tried changing to datatype string, and it sorts the numerics better, but the other sorts are not as good. Is there a way to control sorting for special chars, for example, I want blanks to sort after letters and numbers. using alphaOnlySort - sorts nicely for alpha, but numbers do

Re: SOLR Performance Tuning: Pagination

2009-12-24 Thread Erik Hatcher
On Dec 24, 2009, at 11:36 AM, Walter Underwood wrote: When do users do a query like that? --wunder Well, SolrEntityProcessor "users" do :) http://issues.apache.org/jira/browse/SOLR-1499 (which by the way I plan on polishing and committing over the holidays) Erik On Dec 24

Re: SOLR Performance Tuning: Pagination

2009-12-24 Thread Walter Underwood
Some bots will do that, too. Maybe badly written ones, but we saw that at Netflix. It was causing search timeouts just before a peak traffic period, so we set a page limit in the front end, something like 200 pages. It makes sense for that to be very slow, because a request for hit 28838540 mea

RE: SOLR Performance Tuning: Pagination

2009-12-24 Thread Fuad Efendi
Grant, Eric, Walter, and SOLR, Thank you so much for very prompt responses (with links!) >From time to time I try to share... Happy Holidays!!! > -Original Message- > From: Walter Underwood [mailto:wun...@wunderwood.org] > Sent: December-24-09 1:51 PM > To: solr-user@lucene.apac

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread AHMET ARSLAN
> > > > In db_apartment_types_en.txt file if you use comma (,) > instead of => > > faceting will return both: > > > > > >          >     > >              >    2 > >              >    2 > >          >     > >          > > > > > > I tried this but no luck. Did you restart tomcat and re-index?

Re: SOLR Performance Tuning: Pagination

2009-12-24 Thread Paul Rosen
This is similar to a problem I've been having. It's pretty easy to just limit the user to 200 pages when the results are sorted by relevance, but if they are sorted alphabetically, then that doesn't work. It would be nice if there were a "limit=2000" parameter to the solr call that is applied t

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread herceg_novi
Ahmet Arslan wrote: > > > Did you restart tomcat and re-index? I just confirmed faceting will return > above. use expand=true with comma separated entries to get above. You can > confirm expansion is done and original token is preserved with > analysis.jsp page. > > Ok, I must be missing som

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread AHMET ARSLAN
> Ok, I must be missing something very obvious. My > analysis.jsp page shows the > entry just fine: > > I enter: > Field (name): apt_type_en > Field value (index): 100 > Analyze > > I get > "Index Analyzer > 100 > One Bedroom Apartment" > > "One Bedroom Apartment" is the expected entry. > >

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread AHMET ARSLAN
> > > > > >     > >         name="apt_type_en">One Bedroom Apartment > >     > > > >     > >         name="apt_type_en">One Bedroom Apartment > >     > > > > > > > >     > > > >     > > > >         name="apt_type_en"> > >         >     2 > >         > >     > >     > > > > I

Re: absolute search

2009-12-24 Thread Olala
Oh,yes, that is my schema config: And, my solrconfig.xml for seach in dismax: dismax explicit 0.0 name

RE: SOLR Performance Tuning: Pagination

2009-12-24 Thread Fuad Efendi
Not users... robots! Slurp/Yahoo, Googlebot, etc. I had friendly URLs for query with filters like http://.../USA/ showing all documents from SOLR with country=USA, with pagination; I disabled it now. But URLs like http://.../?q=USA are still dangerous, I need to limit pagination programmatically.

RE: SOLR Performance Tuning: Pagination

2009-12-24 Thread Fuad Efendi
Hi Walter, you are right, it were mostly robots (Googlebot, Yahoo/Slurp, etc); I have friendly URLs like http://www.tokenizer.org/USA/?page=7 (30mlns docs, 3mlns pages) http://www.tokenizer.org/www.newegg.com/ http://www.tokenizer.org/www.newegg.com/?sort=link&dir=asc&q=Opteron And even this: ht

RE: Reverse sort facet query

2009-12-24 Thread Chris Hostetter
: I'll have a look at SimpleFacets.java to look at patching it. I should : think the sorting bit will be relatively straightforward. The tricky bit : is how to submit the request via the query interface - there's only a : boolean in Solr 1.4 the boolean syntax was deprecated in place of keywor

Re: CSV import & date modifying.

2009-12-24 Thread Chris Hostetter
: when posting a CSV-file via curl to Solr, I�d like to convert a row with the : date format �30.12.2009� (dd.mm.) into the date format accepted by Solr. : Is this possible? not with the CSVUpdateHandler (the simple way of ingestinga CSV file) but I believe DataImportHandler can be configure

Re: solr.RemoveDuplicatesTokenFilterFactory

2009-12-24 Thread Chris Hostetter
: Here is the string to be indexed without duplication. : : Kitchen Cabinet Utah Kitchen Remodeling Utah : : Is RemoveDuplicatesTokenFilterFactory for this solution? or for something : else? it depeneds on what you want to do ... you've given us an example of some input, but you haven't elabor

Re: Solr 1.4 - stats page slow

2009-12-24 Thread Chris Hostetter
: I've noticed this as well, usually when working with a large field cache. I : haven't done in-depth analysis of this yet, but it seems like when the stats : page is trying to pull data from a large field cache it takes quite a long : time. In Solr 1.4, the stats page was modified to start repor

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread herceg_novi
Ahmet Arslan wrote: > > > Strange if you can see injected token "One Bedroom Apartmen" as well as > original token "100" in the analysis.jsp you should see 100 in the name="facet_fields"> section. > > Can you disable HTTP caching by replacing lastModifiedFrom="openTime" etagSeed="Solr"> wi

Re: Expanding synonyms for both facets and response results

2009-12-24 Thread herceg_novi
Ahmet Arslan wrote: > > > > I think you are populating apt_type_en, apt_type_fr and apt_type_it fields > from apt_type via copyField decleration in schema.xml. Maybe in your case > it is more convenient to populate those field in an UpdateRequestProcessor > and remove copyField declerations.