Re: Slow Phrase Queries

2009-10-21 Thread DHast

that's what my use case has shown, but i havent done enough experimenting to
know for sure.
the reason the field is untokenized is because i need the full value of an
authors name, example: smith, jones, if it was tokenized and faceted it
would be jones and another entry for smith.  

i am running a lot of facets, 19 of them, some are facet queries and some
are fields, not including the author faceting.  the performance is great
with all of them running.  the problem is of those other facets there are
only a limited number of possibilities. 

But with authors, there are over 800,000 separate authors in my data.  the
next route is to modify the index to index the authors as an integer taken
from a database table of all authors, and re-attempt the author facets again
on a tokenized field of integers.

unfortunately it takes 4-9 days for the index to be built as it is a little
over 22 GB's


Lance Norskog-2 wrote:
 
 Are you saying that faceting is faster on a tokenized field? Is this true?
 
 On Tue, Oct 20, 2009 at 2:02 PM, DHast hastings.recurs...@gmail.com
 wrote:
 ...
 , removing
 that facet worked since the field was untokenizd and slow considering how
 many values tehre were.
 ...
 
 View this message in context:
 http://www.nabble.com/Slow-Phrase-Queries-tp2597p25982493.html
 Sent from the Solr - User mailing list archive at Nabble.com.


 
 
 
 -- 
 Lance Norskog
 goks...@gmail.com
 
 

-- 
View this message in context: 
http://www.nabble.com/Slow-Phrase-Queries-tp2597p25999252.html
Sent from the Solr - User mailing list archive at Nabble.com.



Slow Phrase Queries

2009-10-20 Thread DHast

Hello,
I have recently installed Solr as an alternative to our home made lucene
search servers, and while in most respects the performance is better, i
notice that phrase searches are incredibly slow compared to normal lucene,
primarily when using facets

example:
City of New York, Matter of takes 11 seconds
City of New York, Matter of takes 1 second

the same searches using raw lucene take 5 seconds and 3 seconds
respectively.

i tried cutting out as much as i could from solrconfig without breaking it,
is there anything else i could try doing to make solr perform similarly to
raw lucene as far as phrase queries are concerned?
thanks
-- 
View this message in context: 
http://www.nabble.com/Slow-Phrase-Queries-tp2597p2597.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Slow Phrase Queries

2009-10-20 Thread DHast

ah, it turns out it was one of my 6 facets, the author.  in the data pool
tehre are over 1.9 million documents, and about 800,000 authors, removing
that facet worked since the field was untokenizd and slow considering how
many values tehre were.  Solr is definitely faster, and as fast and or
faster with facets
-- 
View this message in context: 
http://www.nabble.com/Slow-Phrase-Queries-tp2597p25982493.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: define index at search time

2009-09-24 Thread DHast

No, I am talking about having multiple indexes, i want to send the index name
to the searcher so it will search that index, rather than use the one
defined in the schema/solrconfig.

nothing t do with multiple cores, i mean different indexes entirely with
completely different content.




Avlesh Singh wrote:
 
 Are you talking about multiple cores?
 
 Cheers
 Avlesh
 
 On Mon, Sep 21, 2009 at 9:15 PM, DHast hastings.recurs...@gmail.com
 wrote:
 

 is there a way i can actually tell solr which index i want it to search
 against with the query? I know it will cost a bit on performance, but it
 would be helpful
 i have many indexes and it would be nice to determine which one should be
 used by the user.
 thanks
 --
 View this message in context:
 http://www.nabble.com/define-index-at-search-time-tp25530378p25530378.html
 Sent from the Solr - User mailing list archive at Nabble.com.


 
 

-- 
View this message in context: 
http://www.nabble.com/define-index-at-search-time-tp25530378p25564438.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: define index at search time

2009-09-24 Thread DHast

well after looking at http://wiki.apache.org/solr/CoreAdmin
perhaps multiple cores is what i want, 

DHast wrote:
 
 No, I am talking about having multiple indexes, i want to send the index
 name to the searcher so it will search that index, rather than use the one
 defined in the schema/solrconfig.
 
 nothing t do with multiple cores, i mean different indexes entirely with
 completely different content.
 
 
 
 
 Avlesh Singh wrote:
 
 Are you talking about multiple cores?
 
 Cheers
 Avlesh
 
 On Mon, Sep 21, 2009 at 9:15 PM, DHast hastings.recurs...@gmail.com
 wrote:
 

 is there a way i can actually tell solr which index i want it to search
 against with the query? I know it will cost a bit on performance, but it
 would be helpful
 i have many indexes and it would be nice to determine which one should
 be
 used by the user.
 thanks
 --
 View this message in context:
 http://www.nabble.com/define-index-at-search-time-tp25530378p25530378.html
 Sent from the Solr - User mailing list archive at Nabble.com.


 
 
 
 

-- 
View this message in context: 
http://www.nabble.com/define-index-at-search-time-tp25530378p25564937.html
Sent from the Solr - User mailing list archive at Nabble.com.



define index at search time

2009-09-21 Thread DHast

is there a way i can actually tell solr which index i want it to search
against with the query? I know it will cost a bit on performance, but it
would be helpful
i have many indexes and it would be nice to determine which one should be
used by the user.
thanks
-- 
View this message in context: 
http://www.nabble.com/define-index-at-search-time-tp25530378p25530378.html
Sent from the Solr - User mailing list archive at Nabble.com.



solr isnt using default field correctly

2009-09-18 Thread DHast

hi, 
if i do a search: text:law order~40
i get this:

str name=rawquerystringtext:law order~40/str
str name=querystringtext:law order~40/str
str name=parsedqueryPhraseQuery(text:law order~40)/str
str name=parsedquery_toStringtext:law order~40/str
str name=QParserOldLuceneQParser/str

However if i do: law order~40
i get this:

str name=rawquerystringlaw order~40/str
str name=querystringlaw order~40/str
str name=parsedquerytext:law order/str
str name=parsedquery_toStringtext:law order/str
lst name=explain/
str name=QParserOldLuceneQParser/str

my Schema xml:

 field name=text type=string indexed=true stored=false /
.
 defaultSearchFieldtext/defaultSearchField


what should i be doing differently to get the second results like the first?
-- 
View this message in context: 
http://www.nabble.com/solr-isnt-using-default-field-correctly-tp25507985p25507985.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: solr isnt using default field correctly

2009-09-18 Thread DHast

well it seems what is happening is solr is not being consistent,



DHast wrote:
 
 hi, 
 if i do a search: text:law order~40
 i get this:
 
 str name=rawquerystringtext:law order~40/str
 str name=querystringtext:law order~40/str
 str name=parsedqueryPhraseQuery(text:law order~40)/str
 str name=parsedquery_toStringtext:law order~40/str
 str name=QParserOldLuceneQParser/str
 
 However if i do: law order~40
 i get this:
 
 str name=rawquerystringlaw order~40/str
 str name=querystringlaw order~40/str
 str name=parsedquerytext:law order/str
 str name=parsedquery_toStringtext:law order/str
 lst name=explain/
 str name=QParserOldLuceneQParser/str
 
 my Schema xml:
 
  field name=text type=string indexed=true stored=false /
 .
  defaultSearchFieldtext/defaultSearchField
 
 
 what should i be doing differently to get the second results like the
 first?
 

-- 
View this message in context: 
http://www.nabble.com/solr-isnt-using-default-field-correctly-tp25507985p25508264.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: solr isnt using default field correctly

2009-09-18 Thread DHast

yeah something is definitely strange, i think i know what it is though.
im going to make a separate post for it, but it cached the results from when
i had field:text as a string, 




Erik Hatcher-4 wrote:
 
 I just tried this on trunk and both with and without a field selector  
 it parses to a PhraseQuery.  I have trouble believing even Solr 1.3  
 behaved like you reported, something seems fishy.
 
   Erik
 
 On Sep 18, 2009, at 9:02 AM, DHast wrote:
 

 well it seems what is happening is solr is not being consistent,



 DHast wrote:

 hi,
 if i do a search: text:law order~40
 i get this:

 str name=rawquerystringtext:law order~40/str
 str name=querystringtext:law order~40/str
 str name=parsedqueryPhraseQuery(text:law order~40)/str
 str name=parsedquery_toStringtext:law order~40/str
 str name=QParserOldLuceneQParser/str

 However if i do: law order~40
 i get this:

 str name=rawquerystringlaw order~40/str
 str name=querystringlaw order~40/str
 str name=parsedquerytext:law order/str
 str name=parsedquery_toStringtext:law order/str
 lst name=explain/
 str name=QParserOldLuceneQParser/str

 my Schema xml:

 field name=text type=string indexed=true stored=false /
 .
 defaultSearchFieldtext/defaultSearchField


 what should i be doing differently to get the second results like the
 first?


 -- 
 View this message in context:
 http://www.nabble.com/solr-isnt-using-default-field-correctly-tp25507985p25508264.html
 Sent from the Solr - User mailing list archive at Nabble.com.

 
 
 

-- 
View this message in context: 
http://www.nabble.com/solr-isnt-using-default-field-correctly-tp25507985p25508691.html
Sent from the Solr - User mailing list archive at Nabble.com.



want to features of a 'text' field with the non stemming of a 'string' field

2009-09-18 Thread DHast

when i have my fieldname: text set as a text field, advanced search queries
work very well, but when i have it set as a string it seems to ignore them,
like proximity searching and so on.
example: text as string:
str name=rawquerystringtext:law order~33/str
str name=querystringtext:law order~33/str
str name=parsedquerytext:law order/str
str name=parsedquery_toStringtext:law order/str

text as text:
str name=rawquerystringtext:law order~32
/str
str name=querystringtext:law order~32
/str
str name=parsedqueryPhraseQuery(text:law order~32)/str
str name=parsedquery_toStringtext:law order~32/str

however when i search a single term, it stems it if its text, example:
text as text:
str name=rawquerystringgoats
/str
str name=querystringgoats
/str
str name=parsedquerytext:goat/str
str name=parsedquery_toStringtext:goat/str

text as string:
str name=rawquerystringnuts
/str
str name=querystringnuts
/str
str name=parsedquerytext:nuts/str
str name=parsedquery_toStringtext:nuts/str
OR
str name=rawquerystringtext:goats
/str
str name=querystringtext:goats
/str
str name=parsedquerytext:goats/str
str name=parsedquery_toStringtext:goats/str


so what i want/need, is to STOP the stemming/plural killing that is
happening on the text field, 
ideas?

also, is tehre a way to wipe the cache while testing?




-- 
View this message in context: 
http://www.nabble.com/want-to-features-of-a-%27text%27-field-with-the-non-stemming-of-a-%27string%27-field-tp25508780p25508780.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: want to features of a 'text' field with the non stemming of a 'string' field

2009-09-18 Thread DHast

i have looked, and seem to be running into a dead end every time i try it,
but again it may be because of the caching and me not realizing it was doing
it till my hair was half pulled.

i dont suppose youd be willing to give a hint then?



Grant Ingersoll-6 wrote:
 
 
 On Sep 18, 2009, at 6:37 AM, DHast wrote:
 

 when i have my fieldname: text set as a text field, advanced search  
 queries
 work very well, but when i have it set as a string it seems to  
 ignore them,
 like proximity searching and so on.
 example: text as string:
 str name=rawquerystringtext:law order~33/str
 str name=querystringtext:law order~33/str
 str name=parsedquerytext:law order/str
 str name=parsedquery_toStringtext:law order/str

 text as text:
 str name=rawquerystringtext:law order~32
 /str
 str name=querystringtext:law order~32
 /str
 str name=parsedqueryPhraseQuery(text:law order~32)/str
 str name=parsedquery_toStringtext:law order~32/str

 however when i search a single term, it stems it if its text, example:
 text as text:
 str name=rawquerystringgoats
 /str
 str name=querystringgoats
 /str
 str name=parsedquerytext:goat/str
 str name=parsedquery_toStringtext:goat/str

 text as string:
 str name=rawquerystringnuts
 /str
 str name=querystringnuts
 /str
 str name=parsedquerytext:nuts/str
 str name=parsedquery_toStringtext:nuts/str
 OR
 str name=rawquerystringtext:goats
 /str
 str name=querystringtext:goats
 /str
 str name=parsedquerytext:goats/str
 str name=parsedquery_toStringtext:goats/str


 so what i want/need, is to STOP the stemming/plural killing that is
 happening on the text field,
 ideas?

 
 
 It sounds like you need to dig into your schema.xml a bit more and set  
 your analysis better.  See http://wiki.apache.org/solr/SchemaXml
 
 
 also, is tehre a way to wipe the cache while testing?




 -- 
 View this message in context:
 http://www.nabble.com/want-to-features-of-a-%27text%27-field-with-the-non-stemming-of-a-%27string%27-field-tp25508780p25508780.html
 Sent from the Solr - User mailing list archive at Nabble.com.

 
 --
 Grant Ingersoll
 http://www.lucidimagination.com/
 
 Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
 using Solr/Lucene:
 http://www.lucidimagination.com/search
 
 
 

-- 
View this message in context: 
http://www.nabble.com/want-to-features-of-a-%27text%27-field-with-the-non-stemming-of-a-%27string%27-field-tp25508780p25509828.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: want to features of a 'text' field with the non stemming of a 'string' field

2009-09-18 Thread DHast

ok, used the built in fieldtype text_ws that seems to go well

DHast wrote:
 
 i have looked, and seem to be running into a dead end every time i try it,
 but again it may be because of the caching and me not realizing it was
 doing it till my hair was half pulled.
 
 i dont suppose youd be willing to give a hint then?
 
 
 
 Grant Ingersoll-6 wrote:
 
 
 On Sep 18, 2009, at 6:37 AM, DHast wrote:
 

 when i have my fieldname: text set as a text field, advanced search  
 queries
 work very well, but when i have it set as a string it seems to  
 ignore them,
 like proximity searching and so on.
 example: text as string:
 str name=rawquerystringtext:law order~33/str
 str name=querystringtext:law order~33/str
 str name=parsedquerytext:law order/str
 str name=parsedquery_toStringtext:law order/str

 text as text:
 str name=rawquerystringtext:law order~32
 /str
 str name=querystringtext:law order~32
 /str
 str name=parsedqueryPhraseQuery(text:law order~32)/str
 str name=parsedquery_toStringtext:law order~32/str

 however when i search a single term, it stems it if its text, example:
 text as text:
 str name=rawquerystringgoats
 /str
 str name=querystringgoats
 /str
 str name=parsedquerytext:goat/str
 str name=parsedquery_toStringtext:goat/str

 text as string:
 str name=rawquerystringnuts
 /str
 str name=querystringnuts
 /str
 str name=parsedquerytext:nuts/str
 str name=parsedquery_toStringtext:nuts/str
 OR
 str name=rawquerystringtext:goats
 /str
 str name=querystringtext:goats
 /str
 str name=parsedquerytext:goats/str
 str name=parsedquery_toStringtext:goats/str


 so what i want/need, is to STOP the stemming/plural killing that is
 happening on the text field,
 ideas?

 
 
 It sounds like you need to dig into your schema.xml a bit more and set  
 your analysis better.  See http://wiki.apache.org/solr/SchemaXml
 
 
 also, is tehre a way to wipe the cache while testing?




 -- 
 View this message in context:
 http://www.nabble.com/want-to-features-of-a-%27text%27-field-with-the-non-stemming-of-a-%27string%27-field-tp25508780p25508780.html
 Sent from the Solr - User mailing list archive at Nabble.com.

 
 --
 Grant Ingersoll
 http://www.lucidimagination.com/
 
 Search the Lucene ecosystem (Lucene/Solr/Nutch/Mahout/Tika/Droids)  
 using Solr/Lucene:
 http://www.lucidimagination.com/search
 
 
 
 
 

-- 
View this message in context: 
http://www.nabble.com/want-to-features-of-a-%27text%27-field-with-the-non-stemming-of-a-%27string%27-field-tp25508780p25510178.html
Sent from the Solr - User mailing list archive at Nabble.com.



Facet vs searching, field types,

2009-09-17 Thread DHast

hello, 
my documents have a field called member_of,
this field holds a list, so it will look like this:
member_of: fred bob mike journals wsjournals jen steve
if 'member_of' is a 'string' and i search:
goats AND member_of(fred bob journals)
i get the right results, 
however, the faceting doesnt work right, it groups journals and wsjournals
together

this is happening because a lot of the entries are in both, but the faceting
seems to align one result to one facet value, rather than multiple values,
so if ten results have member_of journals wsjournals, they are assigned to
wsjournals, not journals.

any way to make them show as a result twice?



-- 
View this message in context: 
http://www.nabble.com/Facet-vs-searching%2C-field-types%2C-tp25491963p25491963.html
Sent from the Solr - User mailing list archive at Nabble.com.



Re: Facet vs searching, field types,

2009-09-17 Thread DHast



DHast wrote:
 
 hello, 
 my documents have a field called member_of,
 this field holds a list, so it will look like this:
 member_of: fred bob mike journals wsjournals jen steve
 if 'member_of' is a 'string' and i search:
 goats AND member_of(fred bob journals)
 i get the right results, 
 however, the faceting doesnt work right, it groups journals and wsjournals
 together
 
 this is happening because a lot of the entries are in both, but the
 faceting seems to align one result to one facet value, rather than
 multiple values, so if ten results have member_of journals wsjournals,
 they are assigned to wsjournals, not journals.
 
 any way to make them show as a result twice?
 
 
 
 


Ok, i think i got this, i added multiValued='true' for that field and the
counts seem much better, or at least what i was looking for
-- 
View this message in context: 
http://www.nabble.com/Facet-vs-searching%2C-field-types%2C-tp25491963p25491970.html
Sent from the Solr - User mailing list archive at Nabble.com.



copyfield at search time?

2009-09-17 Thread DHast

is it possible to do somehting like this:

Now im wondering how to do something like this:
  
field name=member_of_facet type=string indexed=true stored=true
multiValued=true/ 

field name=member_of type=string indexed=true stored=true
multiValued=false/ 


  copyField source=member_of dest=member_of_facet / 


if so, i dont seem to be making progress
thanks
-- 
View this message in context: 
http://www.nabble.com/copyfield-at-search-time--tp25491979p25491979.html
Sent from the Solr - User mailing list archive at Nabble.com.



do NOT want to stem plurals for a particular field, or words

2009-09-15 Thread DHast

I have a field where there are items that are plurals, and used as very
specific locators, so i do a solr search type:articles, and it translates it
into : type:article, then into type:articl... is tehre a way to stop it from
doing this on either the field type or on a list of words articles,
notes, etc

i tried enering into the protwords.txt file and dont seem to get any where
-- 
View this message in context: 
http://www.nabble.com/do-NOT-want-to-stem-plurals-for-a-particular-field%2C-or-words-tp25455570p25455570.html
Sent from the Solr - User mailing list archive at Nabble.com.