Getting wrong version number when building Solr 1.4

2009-11-12 Thread Nasseam Elkarra
I checked out the 1.4.0 release from the tag, but when I build, the
build artifacts have 1.4.1-dev appended.


Thank you,
Nasseam
http://bodukai.com



Hierarchical Facet Sorting

2009-10-20 Thread Nasseam Elkarra
We have facet.sort=true; however, our hierarchical fields are being
sorted alphabetically instead of by count.


Here is the field type and field definitions:
<fieldType name="hierarchy" class="solr.HierarchicalFacetField"
           omitNorms="true" positionIncrementGap="0" indexed="true"
           stored="false" delimiter="/"/>
<field name="cat_facet" type="hierarchy" indexed="true" stored="true"
       multiValued="true"/>


Any help is much appreciated.

Thank you,
Nasseam Elkarra
http://bodukai.com/solr/
The fastest possible shopping experience. Now with Solr integration.



Solr Demo at SF New Tech Meetup

2009-10-07 Thread Nasseam Elkarra

Hello all,

For those of you in the Bay Area, we will be demoing our Bodukai  
Boutique product at the SF New Tech Meetup on Wednesday, Oct. 14:

http://sfnewtech.com/2009/10/05/1014-sf-new-tech-bodukai-yourversion-meehive-and-more/

Bodukai Boutique is the fastest ecommerce search and navigation  
solution:

http://bodukai.com/boutique/

We will be demoing our Solr integration and all are welcome to come.

Thank you,

Nasseam Elkarra
http://bodukai.com/boutique/
The fastest possible shopping experience



Re: Importing CSV file slow/crashes

2009-10-06 Thread Nasseam Elkarra

Hello Erick,

Sorry about that. I'm using the CSV update handler, uploading a local
CSV file via the stream.file parameter. There are 94 fields and 36
copyFields.
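
For readers unfamiliar with this setup, a request of the kind described could be built roughly as in the Python sketch below. The host, port, and file path are placeholders (assumptions, not taken from this thread); /update/csv, stream.file, and commit are the usual handler path and parameter names, and stream.file generally requires remote streaming to be enabled in solrconfig.xml.

```python
from urllib.parse import urlencode


def csv_update_url(base_url, csv_path, commit=True):
    """Build a URL asking Solr's CSV update handler to load a server-local file."""
    params = {
        # Path as seen by the Solr server, not by the client.
        "stream.file": csv_path,
        "commit": "true" if commit else "false",
    }
    return f"{base_url}/update/csv?{urlencode(params)}"


# Placeholder host and path, for illustration only.
print(csv_update_url("http://localhost:8983/solr", "/tmp/products.csv"))
```

Requesting the printed URL (for example with curl) would trigger the import; the sketch only builds the URL and does not contact a server.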


Thank you,
Nasseam

On Oct 6, 2009, at 10:09 AM, Erick Erickson wrote:

Well, without some better idea of *how* you're doing the import, it's a
little hard to say anything meaningful (hint, hint).
Best
Erick

On Tue, Oct 6, 2009 at 1:06 PM, Nasseam Elkarra nass...@bodukai.com wrote:

Hello all,

I had a dev build of 1.4 from 5/1/2009, and importing a 20K-row file took
less than a minute. After updating to the latest as of yesterday, the
import is really slow and I had to cancel it after half an hour. This
prevented me from upgrading a few months ago as well.

Any ideas as to the cause of this?

Thank you,

Nasseam Elkarra
http://bodukai.com/boutique/
The fastest possible shopping experience.






Re: Importing CSV file slow/crashes

2009-10-06 Thread Nasseam Elkarra

Hello Yonik,

Thank you for looking into this. Your question about whether I'm using
stock Solr pointed me in the right direction. I am in fact using a
patched version of Solr to get hierarchal facet support
(http://issues.apache.org/jira/browse/SOLR-64). I took the 4 hiefacet
fields out of the schema and the import was back to its normal time of
less than a minute. This same configuration worked fine with the 5/1
patched build.


Here is the field definition:
<fieldType name="hierarchy" class="solr.HierarchicalFacetField"
           omitNorms="true" positionIncrementGap="0" indexed="true"
           stored="false" delimiter="/" />

<!-- fields -->
<field name="category" type="hierarchy" indexed="true" stored="true"
       multiValued="true"/>
<field name="category_seo" type="hierarchy" indexed="true"
       stored="true" multiValued="true"/>

<!-- facet fields -->
<field name="category_facet" type="hierarchy" indexed="true"
       stored="false" multiValued="true"/>
<field name="category_seo_facet" type="hierarchy" indexed="true"
       stored="false" multiValued="true"/>

<copyField source="category" dest="category_facet"/>
<copyField source="category_seo" dest="category_seo_facet"/>

CSV file snippet:
category,category_seo
T-Shirt Mens/Crew Neck/,t-shirt-mens/crew-neck/

Thanks again!
Nasseam

On Oct 6, 2009, at 3:22 PM, Yonik Seeley wrote:

On Tue, Oct 6, 2009 at 1:06 PM, Nasseam Elkarra nass...@bodukai.com wrote:
I had a dev build of 1.4 from 5/1/2009 and importing a 20K row took less
than a minute. Updating to the latest as of yesterday, the import is
really slow and I had to cancel it after a half hour. This prevented me
from upgrading a few months ago as well.


I haven't had any success at replicating this problem.

I just tried a 100K row CSV file, consisting of an id and a few text
fields.  The total size of the file is 79MB.

On trunk (today): 22 seconds to index, another 5-7 seconds to commit
5/21 version: 28 seconds to index, another 8 seconds to commit

Then I modified the 5/1 schema to closer match the trunk schema
(removing defaults, copyfields that could slow things down).
Modified 5/1 version: 25 seconds to index, another 8 seconds to commit

I only did 2 runs with trunk and 2 with one from 5/1, so the accuracy
is probably low... but good enough to see there wasn't a problem in
this test.

We really need more info to help reproduce this.
Are you using stock solr?  Do you have any custom plugins, analyzers,
token filters, etc?

You're going to need to provide something so others can reproduce  
this.


-Yonik
http://www.lucidimagination.com




Hierarchical Facet Field Prefix Not Working

2009-09-25 Thread Nasseam Elkarra

Hello all,

We are using the patch from SOLR-64
(http://issues.apache.org/jira/browse/SOLR-64) to implement hierarchical
facets for categories. We are trying to use facet.prefix to prevent all
categories from coming back. However, f.category.facet.prefix doesn't
work. Using facet.prefix works, but it prevents the other facets from
coming back since it is a global option. Are per-facet options supported
on hierarchical facet fields? If not, how can I get a specific category
and its children without getting the surrounding categories?
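
For reference, Solr's general per-field override syntax is f.<fieldname>.<param>. The sketch below builds the kind of request being described; the field names and the prefix value are examples only, and whether the SOLR-64 patch honors the per-field form is exactly the open question in this message.

```python
from urllib.parse import urlencode


def facet_request_params(query, prefix_by_field):
    """Return (name, value) pairs; a prefix of None means no prefix for that field."""
    params = [("q", query), ("facet", "on")]
    for field, prefix in prefix_by_field.items():
        params.append(("facet.field", field))
        if prefix is not None:
            # Per-field form: intended to restrict only this facet field.
            params.append((f"f.{field}.facet.prefix", prefix))
    return params


# "category", "brand", and the prefix value are illustrative assumptions.
pairs = facet_request_params("*:*", {"category": "T-Shirt Mens/", "brand": None})
print(urlencode(pairs))
```

Keeping the parameters as a list of pairs (rather than a dict) allows facet.field to repeat, once per faceted field.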


Any help is much appreciated.

Thank you,

Nasseam Elkarra
http://bodukai.com/boutique/
The fastest possible shopping experience.



Re: Joins or subselects in solr

2009-06-16 Thread Nasseam Elkarra
Thanks for the advice. Glad you like the site! Yes, we are using the
new multi-select support to do this. At first, we wrote our own
client-side solution to allow for multi-select, which was super complex,
but we moved to the built-in Solr support as soon as we found it.
Excellent feature!


-Nasseam

Solr-powered Ajax search+nav:
http://factbook.bodukai.com/

Powered by Boutique:
http://bodukai.com/boutique/


On Jun 14, 2009, at 8:16 AM, Yonik Seeley wrote:

On Fri, Jun 12, 2009 at 6:43 PM, Nasseam Elkarra nass...@bodukai.com wrote:
I am storing items in an index. Each item has a comma separated list of
related items. Is it possible to bring back an item and all of its
related items in one query? If so how and how would you distinguish
between which one is the main item and which are the related.


You would currently need a custom query component to do this... not
sure if it's worth it vs two requests.
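
The two-request alternative mentioned here could look roughly like the following sketch. It only builds the two query strings; the field name id and the function names are hypothetical and not taken from the thread.

```python
def main_item_query(item_id):
    """Query for the first request: fetch the main item by its id."""
    return f'id:"{item_id}"'


def related_items_query(related_csv):
    """Query for the second request, built from the item's comma separated list."""
    ids = [part.strip() for part in related_csv.split(",") if part.strip()]
    if not ids:
        return None  # nothing related, so no second request is needed
    return "id:(" + " OR ".join(f'"{i}"' for i in ids) + ")"


print(main_item_query("a0"))
print(related_items_query("a1, b2,c3"))
```

With this split, telling the main item from the related ones is trivial: the main item is whatever the first response returns, and everything in the second response is related.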


Solr-powered Ajax search+nav:
http://factbook.bodukai.com/


Cool, nice multi-select!   Are you using Solr's new multi-select
support to do this?

-Yonik
http://www.lucidimagination.com




Joins or subselects in solr

2009-06-12 Thread Nasseam Elkarra

Hello,

I am storing items in an index. Each item has a comma-separated list
of related items. Is it possible to bring back an item and all of its
related items in one query? If so, how, and how would you distinguish
the main item from the related ones?


Any help is much appreciated.

Thanks!
Nasseam

Solr-powered Ajax search+nav:
http://factbook.bodukai.com/

Powered by Boutique:
http://bodukai.com/boutique/

Re: Hierarchal Faceting Field Type

2009-05-04 Thread Nasseam Elkarra

Hello Koji,

I applied the patch:
http://issues.apache.org/jira/browse/SOLR-64

It's working but I have a question about facet.prefix. When we add a  
facet.prefix it filters the facet results but not the query results.   
Is there a way to filter both?
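
One common way to restrict the documents as well is to add a filter query with the same prefix alongside facet.prefix. The sketch below assumes Solr's prefix query parser ({!prefix f=...}) can be applied to the field in question, which has not been verified against the SOLR-64 field type; the field name and prefix value are examples.

```python
def prefix_filter_params(field, prefix):
    """Pair facet.prefix (limits facet values) with an fq (limits matching documents)."""
    return [
        ("facet", "on"),
        ("facet.field", field),
        ("facet.prefix", prefix),
        # Local-params form avoids having to escape spaces and '#' in the prefix.
        ("fq", f"{{!prefix f={field}}}{prefix}"),
    ]


for name, value in prefix_filter_params("category", "level one#level two#"):
    print(f"{name}={value}")
```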


Thank you,
Nasseam

On Apr 14, 2009, at 3:32 PM, Koji Sekiguchi wrote:


Nasseam Elkarra wrote:

Background:
Set up a system for hierarchal categories using the following scheme:
level one#
level one#level two#
level one#level two#level three#

Trying to find the right combination of field type and query to get
the desired results. Saw some previous posts about hierarchal
facets, which helped in generating the right query, but having an
issue: the built-in text field ignores our delimiter, and the string
field prevents us from doing a starts-with search.
Does anyone have any insight into the field declaration?


Any help is appreciated. Thank you.



Out of need in my project, I'll start working on SOLR-64 any day now.

I'm thinking of introducing a field type for hierarchical facets.

Koji






Stats field with decimal values

2009-04-30 Thread Nasseam Elkarra

Hello,

I'm getting an error when trying to create stats on an sfloat field.  
The field is for price and when there is no decimal it works fine but  
when there is a decimal (e.g., 24.99) I get an error:  
java.lang.StringIndexOutOfBoundsException: String index out of range: 2


Changing the field to float works but I need sfloat so that I can do  
range queries.


Any help is appreciated.

Thank you,
Nasseam

http://factbook.bodukai.com/
Solr-Powered Search+Nav

Re: Hierarchal Faceting Field Type

2009-04-21 Thread Nasseam Elkarra
Thank you. We tried your suggestion but we are still getting the  
following problem:


<fieldType name="category" class="solr.TextField">
  <analyzer type="store">
    <tokenizer class="solr.PatternTokenizerFactory" pattern=";"/>
  </analyzer>
</fieldType>

<field name="my_facet" type="category" indexed="true" stored="false"
       multiValued="true"/>


Sample data:
level one;level two;level three;
level one;level two;level threeB;

When we query for:
level one;level two;level three;*

We are getting back :
level one;level two;level threeB;

Even though the B is before the semicolon. Any idea why?
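
One possible explanation, offered as an assumption and not confirmed anywhere in this thread: a tokenizer that splits on ";" emits each level as a separate token and discards the delimiter, so a trailing-wildcard match on the last term can no longer tell "level three" from "level threeB". The Python sketch below only imitates that splitting; it is not Solr's code and does not model how the query parser handles the query string.

```python
import re


def split_tokens(value, pattern=";"):
    """Imitate a delimiter-splitting tokenizer: the delimiter itself is dropped."""
    return [tok for tok in re.split(pattern, value) if tok]


def docs_with_token_prefix(docs, prefix):
    """Return the documents that have at least one token starting with the prefix."""
    return [d for d in docs if any(t.startswith(prefix) for t in split_tokens(d))]


docs = [
    "level one;level two;level three;",
    "level one;level two;level threeB;",
]
print(split_tokens(docs[1]))
print(docs_with_token_prefix(docs, "level three"))
```

Under this model both sample values match the prefix "level three", because the semicolon that followed it is no longer part of any indexed token.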

Thank you,
Nasseam

Check out our solr-powered Ajax search+nav solution:
http://factbook.bodukai.com/

Powered by Boutique:
http://bodukai.com/boutique/

On Apr 17, 2009, at 3:10 PM, Chris Hostetter wrote:



: level one#
: level one#level two#
: level one#level two#level three#
:
: Trying to find the right combination of field type and query to get the
: desired results. Saw some previous posts about hierarchal facets which
: helped in the generating the right query but having an issue using the
: built in text field which ignores our delimiter and the string field
: which prevents us from doing a start with search. Does anyone have any
: insight into the field declaration?

Use TextField, with a PatternTokenizer

BTW: if this isn't a thread you've already seen, it's handy to know
about...


http://www.nabble.com/Hierarchical-Faceting-to20090898.html#a20176326


-Hoss





Hierarchal Faceting Field Type

2009-04-14 Thread Nasseam Elkarra

Background:
Set up a system for hierarchal categories using the following scheme:
level one#
level one#level two#
level one#level two#level three#

I'm trying to find the right combination of field type and query to get
the desired results. I saw some previous posts about hierarchal facets,
which helped in generating the right query, but I'm having an issue:
the built-in text field ignores our delimiter, and the string field
prevents us from doing a starts-with search. Does
anyone have any insight into the field declaration?
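
The cumulative values in the scheme above can be generated mechanically. A minimal sketch, with an illustrative function name, that expands a list of levels into the "#"-terminated path values:

```python
def hierarchy_values(levels, delimiter="#"):
    """Expand ['a', 'b'] into ['a#', 'a#b#']: one value per ancestor path."""
    values = []
    path = ""
    for level in levels:
        path += level + delimiter  # each value ends with the delimiter
        values.append(path)
    return values


for value in hierarchy_values(["level one", "level two", "level three"]):
    print(value)
```

Because every value ends with the delimiter, a prefix such as "level one#level two#" cannot accidentally match a sibling like "level one#level twoB#".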


Any help is appreciated. Thank you.


Multi-select on more than one facet field

2009-03-24 Thread Nasseam Elkarra

Looking at the example here:
http://wiki.apache.org/solr/SimpleFacetParameters#head-4ba81c89b265c3b5992e3292718a0d100f7251ef

This being the query for selecting PDF:
q=mainquery&fq=status:public&fq={!tag=dt}doctype:pdf&facet=on&facet.field={!ex=dt}doctype


How would you write the query to select (PDF OR Excel) AND, assuming
there is another facet field named author, author is Mike?
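
One way this might be expressed with the same tag/exclude local params is sketched below: each selection gets its own tagged fq, and each facet.field excludes only the tag of its own filter. The tag name au and the value excel are assumptions made for illustration.

```python
from urllib.parse import urlencode


def multiselect_params():
    """Parameters for (doctype is pdf OR excel) AND author is Mike."""
    return [
        ("q", "mainquery"),
        ("fq", "status:public"),
        ("fq", "{!tag=dt}doctype:(pdf OR excel)"),  # OR within one facet field
        ("fq", "{!tag=au}author:Mike"),             # separate fq, so ANDed
        ("facet", "on"),
        ("facet.field", "{!ex=dt}doctype"),  # doctype counts ignore the doctype filter
        ("facet.field", "{!ex=au}author"),   # author counts ignore the author filter
    ]


print(urlencode(multiselect_params()))
```

Separate fq parameters are intersected, which gives the AND between the two facet fields, while the OR lives inside the single doctype filter.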


Thank you,
Nasseam


Solr index deletion

2009-03-24 Thread Nasseam Elkarra
On a few occasions, our development server crashed and in the process
Solr deleted the index folder. We suspect another app on the server
caused an OutOfMemoryError in Tomcat, causing all apps, including Solr,
to crash.


So my question is: why is Solr deleting the index? We are not doing any
updates to the index, only reading from it, so any insight would be
appreciated.


Thank you,
Nasseam


Re: Solr index deletion

2009-03-24 Thread Nasseam Elkarra
Correction: the index was not deleted. The folder is still there with
the index files in it, but a *:* query returns 0 results. Is there a
tool to check the health of an index?


Thanks,
Nasseam

On Mar 24, 2009, at 11:49 AM, Otis Gospodnetic wrote:



Somehow that sounds very unlikely.  Have you looked at logs?  What  
have you found from Solr there?  I am not checking the sources, but  
I don't think there is any place in Solr where the index directory  
gets deleted.


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----

From: Nasseam Elkarra nass...@bodukai.com
To: solr-user@lucene.apache.org
Sent: Tuesday, March 24, 2009 2:35:22 PM
Subject: Solr index deletion

On a few occasions, our development server crashed and in the process solr
deleted the index folder. We are suspecting another app on the server
caused an OutOfMemoryException on Tomcat causing all apps including solr
to crash.

So my question is why is solr deleting the index? We are not doing any
updates to the index only reading from it so any insight would be
appreciated.


Thank you,
Nasseam






Re: Solr index deletion

2009-03-24 Thread Nasseam Elkarra
The tool says there are no problems. Solr is pointing to the right  
directory so not sure what is preventing it from returning any  
results. Any ideas? Here is the output:


Segments file=segments_2 numSegments=1 version=FORMAT_USER_DATA [Lucene 2.9]

  1 of 1: name=_0 docCount=18021
    compound=false
    hasProx=true
    numFiles=9
    size (MB)=8.389
    has deletions [delFileName=_0_1.del]
    test: open reader.........OK [18 deleted docs]
    test: fields, norms.......OK [35 fields]
    test: terms, freq, prox...OK [60492 terms; 1157700 terms/docs pairs; 1224063 tokens]
    test: stored fields.......OK [386828 total field count; avg 21.487 fields per doc]
    test: term vectors........OK [0 total vector count; avg 0 term/freq vector fields per doc]


No problems were detected with this index.
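
Reading the report above: docCount minus the deleted docs gives the number of live documents, so the index itself is not empty. A small sketch of that arithmetic, pulling the two numbers out of the report text (the regular expressions are tailored to this particular output and are an assumption about its format):

```python
import re


def live_docs(report):
    """Return docCount minus deleted docs for a single-segment CheckIndex report."""
    doc_count = int(re.search(r"docCount=(\d+)", report).group(1))
    deleted = re.search(r"\[(\d+) deleted docs\]", report)
    return doc_count - (int(deleted.group(1)) if deleted else 0)


report = """1 of 1: name=_0 docCount=18021
test: open reader.........OK [18 deleted docs]"""
print(live_docs(report))  # 18021 - 18 = 18003
```

A *:* query against a healthy searcher on this index would therefore be expected to report about 18003 documents, which suggests the problem lies in how Solr is opening or locating the index rather than in the index files.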

--

Thanks,
Nasseam


On Mar 24, 2009, at 1:34 PM, Otis Gospodnetic wrote:



There is, it's called CheckIndex and it is a part of Lucene (and  
Lucene jars that come with Solr, I believe):


http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/index/CheckIndex.html


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----

From: Nasseam Elkarra nass...@bodukai.com
To: solr-user@lucene.apache.org
Sent: Tuesday, March 24, 2009 4:21:50 PM
Subject: Re: Solr index deletion

Correction: index was not deleted. The folder is still there with the
index files in it but a *:* query returns 0 results. Is there a tool to
check the health of an index?

Thanks,
Nasseam

On Mar 24, 2009, at 11:49 AM, Otis Gospodnetic wrote:



Somehow that sounds very unlikely.  Have you looked at logs?  What have
you found from Solr there?  I am not checking the sources, but I don't
think there is any place in Solr where the index directory gets deleted.


Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch



----- Original Message ----

From: Nasseam Elkarra
To: solr-user@lucene.apache.org
Sent: Tuesday, March 24, 2009 2:35:22 PM
Subject: Solr index deletion

On a few occasions, our development server crashed and in the process solr
deleted the index folder. We are suspecting another app on the server
caused an OutOfMemoryException on Tomcat causing all apps including solr
to crash.

So my question is why is solr deleting the index? We are not doing any
updates to the index only reading from it so any insight would be
appreciated.


Thank you,
Nasseam