Re: Inconsistent search results.

2012-11-30 Thread Sohail Aboobaker
for splitting tokens into our words appropriately. Thanks to everyone for helping. Regards, Sohail Aboobaker.

Re: Inconsistent search results.

2012-11-27 Thread Sohail Aboobaker
Hi, After further analysis it was found that the cases in which the search works as expected are where the . is preceded by a number. Whenever, we have an alphabet instead of number, the search on the word on right side doesn't return results. SA8182B.BA0850 -- Will have issues when searching

Re: Inconsistent search results.

2012-11-27 Thread Sohail Aboobaker
Sorry, a correction. The first part doesn't give results. SA8182B.BA0850 -- Will have issues when searching on SA8182 -- no results. searching on BA0850 will give results. SA8182.BA0850 -- No issues will return results for BA0850 and SA8182. Regards, Sohail

Re: Searching for Partial Words

2012-11-08 Thread Sohail Aboobaker
Yes, that is true. We are looking for partial word matches. It seems like we can achieve this by using edge ngram for prefixes and adding wild card at the end for ignoring suffix. If we set the edge ngram to 3. eng will match ResidentEng but not ResidentEngineer. But a search for eng* will match

Searching for Partial Words

2012-11-06 Thread Sohail Aboobaker
*). Is it possible to have wild card on both sides of a search term? Regards, Sohail Aboobaker.

Re: Searching for Partial Words

2012-11-06 Thread Sohail Aboobaker
Thanks Jack. In the configuration below: fieldType name=text_edgngrm class=solr.TextField positionIncrementGap=100 analyzer tokenizer class=solr.EdgeNGramTokenizerFactory side=front minGramSize=1 maxGramSize=1/ /analyzer /fieldType What are the possible values for side? If I

Re: Bulk Indexing

2012-07-28 Thread Sohail Aboobaker
We have auto commit on and will basically send it in a loop after validating each record, we send it to search service. And keep doing it in a loop. Mikhail / Lan, are you suggesting that instead of sending it in a loop, we should collect them in an array and do a commit at the end? Is this better

Re: Bulk Indexing

2012-07-27 Thread Sohail Aboobaker
We will be using Solr 3.x version. I was wondering if we do need to worry about this as we have only 10k index entries at a time. It sounds like a very low number and we have only document type at this point. Should we worry about directly using SolrJ for indexing and searching for this low

Re: Query by Example

2012-07-11 Thread Sohail Aboobaker
Thanks. That is what we concluded i.e. to write a wrapper method within our service to build the query for Solr by examining the example bean. Thanks again.

Query by Example

2012-07-10 Thread Sohail Aboobaker
Hi, Is there a way to search by providing an example document to solr? We would set the fields of the example document to desired filter and ask Solr to bring all documents that match this example document. For example, if our document contains itemnumber, title, sku, price etc. Is it possible to

Re: Query by Example

2012-07-10 Thread Sohail Aboobaker
Thanks, I will look into more like this. To answer your question, we were discussing interface for a central search service that all other components can call. We have to provide a set of parameters as interface which clients can set and ask for results. This would mean that name of each

Any ideas on Solr 4.0 Release.

2012-07-05 Thread Sohail Aboobaker
Hi, Congratulations on Alpha release. I am wondering is there a ball park on final release for 4.0? Is it expected in August or Sep time frame or is it further away? We badly need some features included in this release. These are around grouped facet counts. We have limited use for Solr in our

Frequency of Unique Id displayed more than 1

2012-07-05 Thread Sohail Aboobaker
Hi, We have defined a unique key as schemaid. We add documents using server.addBean(obj) method. We are using the same method for updates as well. When browsing the schema, we see that some of the schemaid values have frequency of more than 1. Since, schemaid column is defined as unique key, we

Re: Frequency of Unique Id displayed more than 1

2012-07-05 Thread Sohail Aboobaker
We have defined the schemaid as String. It has concatenated value of the product id and language. It takes the form of ID-EN. For example: '123012-EN', '124020-EN', '12392-FR'. Sohail

Re: Frequency of Unique Id displayed more than 1

2012-07-05 Thread Sohail Aboobaker
Another observation is that when we query an individual schemaid, it returns only one row using the search interface. Why would frequency be more than 1?

Re: Use of Solr as primary store for search engine

2012-07-05 Thread Sohail Aboobaker
In many e-commerce sites, most of data that we display (except images) especially in grids and lists is minimal. We were inclined to use Solr as data store for only displaying the information in grids. We stopped only due to non-availability of joins in Solr3.5. Since, our data (like any other

Re: Frequency of Unique Id displayed more than 1

2012-07-05 Thread Sohail Aboobaker
Thanks Eric, This is indeed what we are seeing. I hope we can just ignore the frequencies. Does it in any way effect facet counts for such records? Sohail

Clarification requested on query results.

2012-05-30 Thread Sohail Aboobaker
Hi, We have a schema that contains categories (multi valued) for different levels. Each level of categories are indexed in a different field. Each category entry contains its parent in its value. The level is separated by a | symbol. We are seeing following results for a faceted query:

Re: Clarification requested on query results.

2012-05-30 Thread Sohail Aboobaker
Never mind, I figured it out after sending the email: http://localhost:8983/solr/select?q=level2categories:%22Artists\|Direct%20from%20the%20Artist%22group=truegroup.field=groupbygroup.main=truefacet=truegroup.facet=truefacet.field=level1categoriesfacet.field=level2categoriesrows=0 I needed to

Strategy for maintaining De-normalized indexes

2012-05-22 Thread Sohail Aboobaker
Hi, I have a very basic question and hopefully there is a simple answer to this. We are trying to index a simple product catalog which has a master product and child products. Each master product can have multiple child products. A master product can be assigned one or more product categories.

Re: Strategy for maintaining De-normalized indexes

2012-05-22 Thread Sohail Aboobaker
Thank you for quick replies. Can't the ID (uniqueKey) of the indexed documents (i.e. denormalized data) be a combination of the master product id and the child product id ? -- We do not need it as each child is already a unique key. Therefore whenever you update your master product db entry,

Re: Strategy for maintaining De-normalized indexes

2012-05-22 Thread Sohail Aboobaker
We are still in design phase, so we haven't hit any performance issues. We do not want to discover performance issues too late during QA :) We would rather account for any issues during the design phase. The refresh rate on fields that we are using from master table will be rare. May be three or

Re: Multicore solr

2012-05-22 Thread Sohail Aboobaker
It would help if you provide your use case. What are you indexing for each user and why would you need a separate core for indexing each user? How do you decide schema for each user? It might be better to describe your use case and desired results. People on the list will be able to advice on the

Re: Join Query syntax

2012-05-11 Thread Sohail Aboobaker
Is it available in Solr 3.5 or is there a way to do something similar in Solr 3.5,

Re: synonyms

2012-05-02 Thread Sohail Aboobaker
I think regular sync of database table with synonym text file seems to be simplest of the solutions. It will allow you to use Solr natively without any customization and it is not very complicated operation to update synonyms file with entries in database.

Re: Question on Facet counts by grouped results

2012-04-29 Thread Sohail Aboobaker
I had copied the full example directory. After copying, I had replaced schema.xml from my old 3.5 schema.xml and I faced this error. After Eric's email, I copied stopwords_en.txt into conf directory of my copy. It works fine after that. After seeing your email, it seems that the better approach

Re: get value from DB (using like i.e %value%)

2012-04-27 Thread Sohail Aboobaker
I am new to Solr as well but based on what I have read so far, once you have created index in Solr, you should look at ajax-solr. It provides a good example to type ahead widget (show list as you type). Sohail

Re: Question on Facet counts by grouped results

2012-04-26 Thread Sohail Aboobaker
Never mind, I did not notice that this is coming in Solr 4.0. Any ideas on when Solr 4.0 will be out? Sohail

searchable solr user mail archive

2012-04-26 Thread Sohail Aboobaker
Hi, Is there a searchable archive for solr user emails available somewhere to avoid questions already asked on list? Sohail

Re: Question on Facet counts by grouped results

2012-04-26 Thread Sohail Aboobaker
Hi, I am trying nightly build for solr 4.0. I downloaded the build and am able to start it. In 3.x, I copied the example directory and updated the schema.xml. It worked fine but in 4.0, I did the same thing (make a copy of example) but when I change the schema, I get following: Apr 26, 2012