Re: Results after using Field Collapsing are not matching the results without using Field Collapsing

2009-12-17 Thread Varun Gupta
After a lot of debugging, I finally found why the order of the collapsed results does not match the uncollapsed results. I can't say whether it is a bug in the implementation of fieldcollapse or not. *Explanation:* Actually, I am querying the fieldcollapse with some filters to restrict the collapsing to

Re: dose solr sopport distribute index storage ?

2009-12-17 Thread Camilo Aguilar
Hi! I have the same question. Thanks in advance. On Mon, Oct 12, 2009 at 1:55 PM, Pieter Steyn wrote: > Sorry for the hijack, but is replication necessary when using a cluster > file-system such as GFS2, where the files are the same for any > instance of Solr? > > > On Mon, Oct 12, 2009 at 8:36

Re: shards parameter

2009-12-17 Thread Jacob Elder
If the goal is to save time when using the admin interface, you can just add this to conf/admin-extra.html: http://www.google.com/jsapi";> google.load("prototype", "1.6"); Event.observe( window, 'load', function() { elements = document.getElements

Re: Searching .msg files

2009-12-17 Thread javaxmlsoapdev
1) Use Tika to index .msg files (Tika does support the Microsoft Outlook format and I am already using Tika: http://lucene.apache.org/tika/formats.html). 2) While indexing, you'll have to write a handler to extract the To, CC, and Bcc values and store them in a separate field in the index. 3) When the user searches on .msg

Re: Spellchecking - Is there a way to do this?

2009-12-17 Thread Lance Norskog
Another thing you might check into is stemming. The Porter stemmer included in Solr is "aggressive", meaning that it will tend to do weird things with misspellings. There is a different stemmer called KStem, available from www.lucidimagination.com/Downloads, which is less aggressive. Porter turns

Re: shards parameter

2009-12-17 Thread Yonik Seeley
You're setting up an infinite loop by adding a shards parameter on the default search handler. Create a new search handler and put your default under that. -Yonik http://www.lucidimagination.com On Thu, Dec 17, 2009 at 7:47 AM, pcurila wrote: > > I tried it out. But there is another issue I can
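For contrast with the handler-default approach discussed in this thread, the per-request form of the shards parameter can be sketched as below. This is a minimal sketch that only builds the request URL: the helper name `distributed_query_url` and the `/select` handler path are assumptions for illustration, while the two shard addresses are the ones quoted in the thread.

```python
from urllib.parse import urlencode


def distributed_query_url(base_url, query, shards, handler="/select"):
    """Build a query URL that passes the shard list on the request itself.

    The helper name and the default handler path are illustrative
    assumptions; nothing here contacts a server.
    """
    params = {
        "q": query,
        # Solr expects a comma-separated list of host:port/path entries.
        "shards": ",".join(shards),
    }
    return f"{base_url}{handler}?{urlencode(params)}"


url = distributed_query_url(
    "http://localhost:8983/solr",
    "*:*",
    ["localhost:8983/solr", "localhost:8984/solr"],
)
print(url)
```

Because the shard list travels with each request, the sub-requests sent to the individual shards do not inherit it, which is the loop the reply above warns about when the parameter is placed in the default handler's defaults.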

Re: Simple Wildcard Search Question

2009-12-17 Thread Erick Erickson
I think your problem is WordDelimiterFilterFactory. For reference, see: http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimiterFilterFactory As I understand it, your input name01 get

Re: Converting java date to solr date and querying dates

2009-12-17 Thread Lance Norskog
Or add a new field type to Solr. This would accept a date and include a Java formatting string. The field would parse the date, print it with that format string, and store the results. You could use any aspect of the date. On Tue, Dec 15, 2009 at 1:06 PM, Chris Hostetter wrote: > > :       i want
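As a rough illustration of the client-side alternative to a custom field type, the sketch below formats a timestamp in the UTC form that Solr date fields accept (`YYYY-MM-DDThh:mm:ssZ`) and builds a range query from two such values. The thread is about Java; this is a Python sketch, and the function names and the `timestamp` field name are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def to_solr_date(dt):
    """Format a datetime as a UTC Solr date string.

    Assumption: naive datetimes are treated as already being in UTC.
    """
    if dt.tzinfo is None:
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")


def date_range_query(field, start, end):
    """Build an inclusive range query on a date field (field name is up to the caller)."""
    return f"{field}:[{to_solr_date(start)} TO {to_solr_date(end)}]"


start = datetime(2009, 12, 15, 13, 6, 0)
end = start + timedelta(days=2)
print(date_range_query("timestamp", start, end))
```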

Re: Spellchecking - Is there a way to do this?

2009-12-17 Thread Lance Norskog
Character-based NGrams are a good tool for this problem. MLT is a document-wide numerical analysis. If the common types of OCR mistakes are different from what NGrams create, you might tune the ngram generator. For example, swapping letters might not happen very often. Single- and multi-word error
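To make the character n-gram idea concrete, here is a minimal sketch that splits a word into overlapping character n-grams and compares two words by the overlap of their bigram sets. It is a plain-Python illustration, not Solr's own n-gram filter, and the Jaccard measure is a swapped-in assumption chosen for simplicity.

```python
def char_ngrams(text, n):
    """Return the overlapping character n-grams of text, in order."""
    if n <= 0:
        raise ValueError("n must be positive")
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def ngram_similarity(a, b, n=2):
    """Jaccard overlap of the two words' n-gram sets (0.0 when both are empty)."""
    grams_a, grams_b = set(char_ngrams(a, n)), set(char_ngrams(b, n))
    union = grams_a | grams_b
    if not union:
        return 0.0
    return len(grams_a & grams_b) / len(union)


print(char_ngrams("solr", 2))
print(ngram_similarity("restaurant", "restaurnat"))
```

A misspelling that swaps two adjacent letters changes only a few n-grams, so the two forms still share most of their grams; that shared portion is what makes n-grams useful for matching noisy input.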

RE: Question about updateRequestProcessorChain

2009-12-17 Thread Manepalli, Kalyan
I specified another updateRequestChain without the custom updateProcessor and it worked. Thanks for the quick help. -Kalyan -Original Message- From: noble.p...@gmail.com [mailto:noble.p...@gmail.com] On Behalf Of Noble Paul നോബിള്‍ नोब्ळ् Sent: Thursday, December 17, 2009 5:59 AM To:

Re: Document model suggestion

2009-12-17 Thread caman
Are you suggesting that roles should be maintained in the index? We do manage our authentication based on roles, but at a granular level, user rights play a big role as well. I know we need to compromise, just need to find a balance. Thanks Lance Norskog-2 wrote: > > Role-based authentication is

Re: Document model suggestion

2009-12-17 Thread Lance Norskog
Role-based authentication is one level of sophistication up from user-based authentication. Users can have different roles, and authentication goes against roles. Documents with multiple viewers would be assigned special roles. All users would also have their own matching role. On Tue, Dec 15, 200

Re: store content only of documents

2009-12-17 Thread javaxmlsoapdev
Anyone? javaxmlsoapdev wrote: > > I store the document in a "content" field defined as follows in > schema.xml > multiValued="true"/> > > and the following in solrconfig.xml > class="org.apache.solr.handler.extraction.ExtractingRequestHandler"> > > content > content > >

Re: Searching .msg files

2009-12-17 Thread Lance Norskog
As to the indexing part: This is an automated document input tool: http://wiki.apache.org/solr/DataImportHandler This is a plugin for it that pulls mail from an IMAP server: http://wiki.apache.org/solr/MailEntityProcessor This is a comment about microsoft MSG files and parsing in Java: http://ww

can't start solr by java -jar start.jar

2009-12-17 Thread Jill Han
I downloaded solr1.4, and unzipped it at C:\apache-solr-1.4.0 . At C:\apache-solr-1.4.0\example, I try to run solr by java -jar start.jar as instructed by Solr tutorial. However, it failed. C:\apache-solr-1.4.0\example>java -jar start.jar java.lang.NullPointerException at java.io

Re: Simple Wildcard Search Question

2009-12-17 Thread QBasti
Hey there, I put the complete configuration for the "text"-type fields (this field is a "text"-type) at the bottom. but now it got me totally confused. i switched off the WordDelimiterFactory and the index value is: "Herrn AndererName09 NochmalName09", which is getting analysed into: "herrn|ander

Re: Can solr do the equivalent of "select distinct(field)"?

2009-12-17 Thread Aleksander Stensby
Thanks for your reply Erik! The speed of my suggested query is actually very fast once we add the facet.mincount=1 (when searching within a limited set of documents). The setback seems to be in the sharding of our data.. And that puzzles me a little bit... I can't really see why SOLR is so slow a

Re: solr core size on disk

2009-12-17 Thread Erik Hatcher
Sharding isn't necessarily decided upon by index size. Is your search performance ok? Got enough free disk space to optimize? Then don't shard. But no, 150M is not a large index size. 700 cores, now that's a lot! Erik On Dec 17, 2009, at 1:27 PM, Matthieu Labour wrote: Paul Th

Re: Can solr do the equivalent of "select distinct(field)"?

2009-12-17 Thread Erik Hatcher
On Dec 17, 2009, at 11:59 AM, Aleksander Stensby wrote: A follow up question on this Hoss: If I have a set of documents, let's say this email thread. Each email has a unique author. All emails in the thread are indexed with "threadid=33" If I want to count the number of unique authors in thi

Re: solr core size on disk

2009-12-17 Thread Matthieu Labour
Paul, thank you for your reply. I did du -sh in /solr_env/index/data and it shows 36G. It is distributed among 700 cores, with most of them being 150M. Is that a big index that should be sharded? 2009/12/17 Noble Paul നോബിള്‍ नोब्ळ् > look at the index dir and see the size of the files . it is typ

Re: Upgrade from 1.2 to 1.4

2009-12-17 Thread Erik Hatcher
Eric, The sort changed, so it is now a separate parameter. &q=&sort=... The query;sort syntax has been deprecated. Be sure to read Solr's CHANGES.txt when upgrading. There's this section on upgrading from 1.3 (which would apply when going from 1.2 also!) mentioning this: "The defa
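The change described above, from the combined query;sort form to separate q and sort parameters, can be sketched as a small client-side rewrite. This is a naive illustration under stated assumptions: the function name `split_legacy_sort` is hypothetical, and it simply splits on the last semicolon, so a query containing a literal semicolon would need more careful handling.

```python
from urllib.parse import urlencode


def split_legacy_sort(q):
    """Split a legacy 'query;sort' string into separate q and sort parameters.

    Naive assumption: the last semicolon separates the query from the sort
    specification. Strings without a semicolon are returned unchanged as q.
    """
    query, sep, sort = q.rpartition(";")
    if not sep:
        return {"q": q}
    params = {"q": query.strip()}
    if sort.strip():
        params["sort"] = sort.strip()
    return params


params = split_legacy_sort("id:[* TO *];id desc")
print(params)
print(urlencode(params))
```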

Re: Dynamically changing the stored-state of a dynamicField

2009-12-17 Thread Erik Hatcher
You'll have to do it the way you mentioned, with two dynamic field mappings. Erik On Dec 17, 2009, at 8:33 AM, Jan-Simon Winkelmann wrote: Hi, i am currently building a Solr configuration for a rather large search index. To allow for indexing of differently named fields for each d

Re: Can solr do the equivalent of "select distinct(field)"?

2009-12-17 Thread Aleksander Stensby
Forgot to add facet.mincount=1, obviously. But still, is this the only or preferred way of doing something along these lines? Or is there a different (better) approach? Best regards, Aleksander On Thu, Dec 17, 2009 at 5:59 PM, Aleksander Stensby < aleksander.sten...@integrasco.com> wrote: > A fo
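The faceting approach discussed in this thread can be sketched as follows: request a facet on the field of interest with facet.mincount=1, then count the distinct values in the response. The `threadid` value comes from the thread; the `author` field name, the helper names, and the sample response are assumptions for illustration, and the response shape assumed is the flat value/count list of Solr's default JSON output.

```python
from urllib.parse import urlencode


def distinct_values_params(query, field):
    """Build query parameters that facet on one field and return no documents."""
    return urlencode({
        "q": query,
        "rows": 0,               # only the facet counts are needed
        "facet": "true",
        "facet.field": field,
        "facet.mincount": 1,     # drop values that match no documents
        "facet.limit": -1,       # return every value, not just the top ones
        "wt": "json",
    })


def count_distinct(response, field):
    """Count distinct values in a parsed JSON response.

    Assumes the flat layout [value, count, value, count, ...].
    """
    flat = response["facet_counts"]["facet_fields"][field]
    return len(flat) // 2


print(distinct_values_params("threadid:33", "author"))

# Hand-written sample response, not real server output.
sample = {"facet_counts": {"facet_fields": {"author": ["alice", 3, "bob", 1]}}}
print(count_distinct(sample, "author"))
```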

Re: Can solr do the equivalent of "select distinct(field)"?

2009-12-17 Thread Aleksander Stensby
A follow up question on this Hoss: If I have a set of documents, let's say this email thread. Each email has a unique author. All emails in the thread are indexed with "threadid=33" If I want to count the number of unique authors in this email thread, I could go along the lines you mention at the e

Re: Solr Web Courses

2009-12-17 Thread Tommy Molto
Yes, I got in contact with them and I'm waiting for an answer. But I was looking for a plan B too, in case we can't close the deal with them. Att, On Thu, Dec 17, 2009 at 12:55 PM, Koji Sekiguchi wrote: > Tommy Molto wrote: > >> Hi, >> >> I need to train a team in Solr. Since we are in Brazil, and i did

Re: Adaptive search?

2009-12-17 Thread Siddhant Goel
Let's say we have a search engine (a simple front end - web app kind of a thing - responsible for querying Solr and then displaying the results in a human readable form) based on Solr. If a user searches for something, gets quite a few search results, and then clicks on one such result - is there any

Upgrade from 1.2 to 1.4

2009-12-17 Thread ERIC TREECE
Has anyone successfully upgraded from 1.2 to 1.4 by just changing the war file? When we tried this we could no longer sort to retrieve the highest numbered date/id in our index. Previously with 1.2 the following would return the most recent data and now it returns data from "around" the same t

Re: Solr Web Courses

2009-12-17 Thread Koji Sekiguchi
Tommy Molto wrote: Hi, I need to train a team in Solr. Since we are in Brazil, and I didn't find any course about it around here, do you have any recommendation for a web course on Solr? Att, It seems that Lucid Imagination provides Online Training courses for Solr: http://www.lucidimagi

Re: debugging javascript DIH

2009-12-17 Thread Luca Molteni
Of course you can. Just use remote debugging. Which app server are you using? IDE? L.M. 2009/12/3 Joel Nylund : > is there a way to print to std out or anything from my javascript DIH > transformer? > > thanks > Joel >

Re: Adaptive search?

2009-12-17 Thread Paul Libbrecht
What can it mean to "adapt to user clicks"? Quite many things in my head. Do you have maybe a citation that inspires you here? paul On 17 Dec 09 at 13:52, Siddhant Goel wrote: Does Solr provide adaptive searching? Can it adapt to user clicks within the search results it provides? Or t

Dynamically changing the stored-state of a dynamicField

2009-12-17 Thread Jan-Simon Winkelmann
Hi, I am currently building a Solr configuration for a rather large search index. To allow for indexing of differently named fields for each dataset, I have included the following dynamicField: What I don't like about this is the fact that all dynamic fields are now being stored. Actually, I onl

Re: payload queries running slow

2009-12-17 Thread Grant Ingersoll
On Dec 17, 2009, at 4:52 AM, Raghuveer Kancherla wrote: > Hi, > With help from the group here, I have been able to set up a search > application with payloads enabled. However, there is a noticeable increase > in query response times with payloads as compared to the same queries > without payload

Re: regarding QParserPlugin

2009-12-17 Thread Grant Ingersoll
I'd probably use the logging stuff instead of System.out; maybe the stream isn't being flushed as expected. On Dec 16, 2009, at 8:29 AM, gudumba l wrote: > Hello all, > I am trying to use query parser plugin feature of solr. > But its really strange that everytime its behaving in a di

Solr Web Courses

2009-12-17 Thread Tommy Molto
Hi, I need to train a team in Solr. Since we are in Brazil, and I didn't find any course about it around here, do you have any recommendation for a web course on Solr? Att,

Adaptive search?

2009-12-17 Thread Siddhant Goel
Hi, Does Solr provide adaptive searching? Can it adapt to user clicks within the search results it provides? Or does that have to be done externally? I couldn't find anything on googling for it. Thanks, -- - Siddhant

Re: Problem with words thats amost similar

2009-12-17 Thread Shalin Shekhar Mangar
2009/12/17 Steinar Asbjørnsen > On 17 Dec 2009 at 12:42, Shalin Shekhar Mangar wrote: > > >> > >> > > For specific cases like this, you can add the word to a file and specify > it > > in schema, for example: > > > > > protected="protwords.txt"/> > > Ty Shalin. > > This is my schema.xml file >

Re: shards parameter

2009-12-17 Thread pcurila
I tried it out. But there is another issue I can not cope with. I have two shards: localhost:8983/solr localhost:8984/solr If I write this into the defaults section localhost:8983/solr,localhost:8984/solr and then I issue a query on localhost:8983, Solr does not respond. If I write this localhost

Re: Problem with words thats amost similar

2009-12-17 Thread Steinar Asbjørnsen
On 17 Dec 2009 at 12:42, Shalin Shekhar Mangar wrote: > 2009/12/17 Steinar Asbjørnsen > >> Hi all. >> >> I have a delicate problem when it comes to two words that are rather >> similar in the way they are typed, but when it comes to the meaning of the >> word they are completely different. >

Re: Question about updateRequestProcessorChain

2009-12-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
At least one processor chain is invoked. If you do not want the custom chain to be invoked, create a chain without the name and without your custom processor. On Thu, Dec 17, 2009 at 1:16 PM, Shalin Shekhar Mangar wrote: > On Thu, Dec 17, 2009 at 3:35 AM, Manepalli, Kalyan < > kalyan.manepa...@orb

Re: shards parameter

2009-12-17 Thread Noble Paul നോബിള്‍ नोब्ळ्
yes. put it under the "defaults" section in your standard requesthandler. On Thu, Dec 17, 2009 at 5:22 PM, pcurila wrote: > > Hello, is there any way to configure shards parameter in solrconfig.xml? So I > do not need provide it in the url. Thanks Peter > -- > View this message in context: > htt

shards parameter

2009-12-17 Thread pcurila
Hello, is there any way to configure the shards parameter in solrconfig.xml, so that I do not need to provide it in the URL? Thanks Peter -- View this message in context: http://old.nabble.com/shards-parameter-tp26826908p26826908.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Problem with words thats amost similar

2009-12-17 Thread Shalin Shekhar Mangar
2009/12/17 Steinar Asbjørnsen > Hi all. > > I have a delicate problem when it comes to two words that are rather > similar in the way they are typed, but when it comes to the meaning of the > word they are completely different. > The actual words are restaurant (as in restaurant) and restaurering

Problem with words thats amost similar

2009-12-17 Thread Steinar Asbjørnsen
Hi all. I have a delicate problem when it comes to two words that are rather similar in the way they are typed, but when it comes to the meaning of the word they are completely different. The actual words are restaurant (as in restaurant) and restaurering (as in restoration). Solr seems to thi

payload queries running slow

2009-12-17 Thread Raghuveer Kancherla
Hi, With help from the group here, I have been able to set up a search application with payloads enabled. However, there is a noticeable increase in query response times with payloads as compared to the same queries without payloads. I am also seeing a lot more disk IO (I have a 7200 rpm disk) and