Re: Opening a file at a page where I encounter a hit

2011-05-15 Thread Gora Mohanty
On Mon, May 16, 2011 at 12:00 PM, Vignesh Raj wrote: > Hi, > > I am using ASP.Net MVC and solrnet for my search tool. The files I index > include pdf files, word docs, excel etc... > > I am able to search and retrieve all the docs with a hit. Now the problem > lies in opening the files with a hit.

Opening a file at a page where I encounter a hit

2011-05-15 Thread Vignesh Raj
Hi, I am using ASP.Net MVC and solrnet for my search tool. The files I index include pdf files, word docs, excel etc... I am able to search and retrieve all the docs with a hit. Now the problem lies in opening the files with a hit. When I open the file, it should open at the location where the h

Re: K-Stemmer for Solr 3.1

2011-05-15 Thread Bernd Fehling
I don't know if it is allowed to modify Lucid code and add it to jira. If someone from Lucid would give me the permission and the Solr developers have nothing against it I won't mind adding the Lucid KStemmer to jira for Solr 3.x and 4.x. There are several Lucid KStemmer users which I can see fro

Re: Order of words in proximity search

2011-05-15 Thread lboutros
I would prefer to put a higher slop number instead of a boolean clause : 200 perhaps in your specific case. Ludovic. - Jouve France. -- View this message in context: http://lucene.472066.n3.nabble.com/Order-of-words-in-proximity-search-tp2938427p2946645.html Sent from the Solr - User mailing

Re: Order of words in proximity search

2011-05-15 Thread lboutros
the key phrase was this one :) : "A sloppy phrase query specifies a maximum "slop", or the number of positions tokens need to be moved to get a match. " so you could search for "foo bar"~101 in your example. Ludovic. - Jouve France. -- View this message in context: http://lucene.472066.n3

adding results external to index

2011-05-15 Thread abhayd
hi I am not sure if SOLR has this feature so just wanted to confirm.. Basically what I want to do is for certain query terms I would like to query real time web service which will return certain results and at the same time search in solr index. This can be implemented out side solr and I am w

Re: Spatial search - SOLR 3.1

2011-05-15 Thread Bill Bell
Look up a few lines to see q={!func}geodist() ...&q={!func}geodist()&sfield=store&pt=45.15,-93.85&sort=score asc On 5/11/11 9:25 AM, "Smiley, David W." w

Re: nested faceting ?

2011-05-15 Thread abhayd
thanks this helped -- View this message in context: http://lucene.472066.n3.nabble.com/nested-faceting-tp2389841p2946515.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: K-Stemmer for Solr 3.1

2011-05-15 Thread Bill Bell
Did you upload the code to Jira? On 5/13/11 12:28 AM, "Bernd Fehling" wrote: >I backported a Lucid KStemmer version from solr 4.0 which I found >somewhere. >Just changed from >import org.apache.lucene.analysis.util.CharArraySet; // solr4.0 >to >import org.apache.lucene.analysis.CharArraySet; /

Re: Solr performance

2011-05-15 Thread Bill Bell
Yes just add fieldname: search string On 5/13/11 7:23 AM, "javaxmlsoapdev" wrote: >Alright. It turned out that defaultSearchField=title where title field is >of >a custom fieldType=edgyText > >where > > maxGramSize="25" side="back"/> > maxGramSize="25" side="front"/> > > >so if no value in

weight and sorting

2011-05-15 Thread deniz
hi folks, i am trying to give weights for sorting the results of the query... but a little bit confused here... i wrote this: vector = this.reader.getTermFreqVector(this.docId, "info"); String[] queryWords = searchQuery.split(" ");

solr Scheduling doesnt fire

2011-05-15 Thread abiratsis
Hello everyone, I trying to use DIH in Solr 3.1 with scheduling but it never fires. Here is my dataimport.properties file: #Mon May 16 02:43:35 CEST 2011 last_index_time=2011-05-16 02\:43\:35 element.last_index_time=2011-05-16 02\:43\:35 syncCores= server=localhost port=8080 webapp=solr params=/se

Re: why query chinese character with bracket become phrase query by default?

2011-05-15 Thread Mark Miller
On May 15, 2011, at 1:48 PM, Michael McCandless wrote: > Could you please revert your commit, until we've reached some > consensus on this discussion first? Let's reach some consensus, but why revert? This has been the behavior - shouldn't the consensus onus be on changing it to begin with? Th

Re: DIH help request: nested xml entities and xpath

2011-05-15 Thread Weiss, Eric
Thx kbootz for the reply. I ended up writing a custom transformer that seems to do what I need now. I think I could make the script work, as you suggested, too. The script might even be preferable since I could add/change/mod without recompiling. Thx again, -- Eric On 5/14/11 1:12 PM, "k

Boosting a list of 50000 documents?

2011-05-15 Thread ubriela
Hi all, I am building a People Search for a social network. I want to boost my friends and my friends' friends (they are ranked on top). I have a service to get 5 friends' friends very fast. How can I boost these people by using solr? or any ideas to solve the prob. (Currently I index a list of

Re: Show filename in search result using a FileListEntityProcessor

2011-05-15 Thread kbootz
There is a JIRA item(can't recall it atm) that addresses the issue with the docs. I'm running 3.1 and per your example you should be able to get it using ${f.file}. I think* it should also be in the entity desc. but I'm also new and that's just how I access it. GL -- View this message in context:

Re: DIH help request: nested xml entities and xpath

2011-05-15 Thread kbootz
Have you tried using a scripttransformer per the wiki: http://wiki.apache.org/solr/DataImportHandler. -- View this message in context: http://lucene.472066.n3.nabble.com/DIH-help-request-nested-xml-entities-and-xpath-tp2937919p2941151.html Sent from the Solr - User mailing list archive at Nabb

Re: Huge performance drop in distributed search w/ shards on the same server/container

2011-05-15 Thread Frederik Kraus
Any ideas? On Freitag, 13. Mai 2011 at 13:19, Frederik Kraus wrote: > One Tomcat with multicore. I have a list of about 2mio "real" queries that > I'm firing at the cluster with jmeter. Reason for splitting up the index in > rather small parts is that the maximum response time of 1 sec cannot be

Re: why query chinese character with bracket become phrase query by default?

2011-05-15 Thread Michael McCandless
Yonik, Could you please revert your commit, until we've reached some consensus on this discussion first? Maybe, post alternative patches on the issue (SOLR-2519), and we can iterate there? Adding a new example field type ("text_nwd") is one way to go, and I agree is least risk/effort, a "quick f

Re: Order of words in proximity search

2011-05-15 Thread Tor Henning Ueland
Hi, That only explains how to do it, not even that document specifies that order actually has something to say. :) "batman movie"~100 will give other results than "movie batman"~100. /Tor On Sun, May 15, 2011 at 7:18 PM, lboutros wrote: > Hi, > > see here for an explanation : > > http://wiki.ap

Re: Order of words in proximity search

2011-05-15 Thread lboutros
Hi, see here for an explanation : http://wiki.apache.org/solr/SolrRelevancyFAQ#How_can_I_search_for_one_term_near_another_term_.28say.2C_.22batman.22_and_.22movie.22.29 Ludovic. - Jouve France. -- View this message in context: http://lucene.472066.n3.nabble.com/Order-of-words-in-proximity-

Re: Order of words in proximity search

2011-05-15 Thread Tor Henning Ueland
Hello, Thanks for the replay. Just as i suspected. So the solution then is to create a OR search with both possibilities in order to make the order not be important" "foo bar"~100 -> ("foo bar~100 OR "bar foo"~100) -- Best regards Tor Henning Ueland On Sun, May 15, 2011 at 5:16 PM, Erick Eri

RE: Schema Design Question

2011-05-15 Thread Zac Smith
Ok thanks for the responses. My option #2 will be easier to implement than having the new doc with combinations so will give it a try. But that has opened my eyes to different possibilities! -Original Message- From: Erick Erickson [mailto:erickerick...@gmail.com] Sent: Sunday, May 15, 2

Re: Field collapsing classloading issues

2011-05-15 Thread karanveer singh
I finally got it to work. The problem was that I wasn't building example, and was using a copied version of the webapp. This earlier webapp did not have the classpaths set for fieldcollapse. I built example and its working now. Thanks to everyone who helped! On Sun, May 15, 2011 at 9:26 PM, Erick

Re: why query chinese character with bracket become phrase query by default?

2011-05-15 Thread Yonik Seeley
On Sun, May 15, 2011 at 8:02 AM, Michael McCandless wrote: > On Fri, May 6, 2011 at 8:49 AM, Michael McCandless > wrote: > >> Shouldn't we  have field types in the eg schema for the different >> languages?  Ie, text_zh, text_th, text_en, text_ja, text_nl, etc. > > In fact, until we break out dedi

Re: Build does not create webapp

2011-05-15 Thread karanveer singh
Thanks so much! On Sun, May 15, 2011 at 9:27 PM, Erick Erickson wrote: > try "ant example" > > Best > Erick > > On Sun, May 15, 2011 at 11:15 AM, karanveer singh > wrote: >> After downloading the apache solr 3.1.0 source, I built it using ant >> (ant compile). >> However, this does not create th

Re: TrieIntField for "short" values

2011-05-15 Thread Erick Erickson
Nope, I'm afraid I can't. Because I don't really understand it in detail, the wizard from Germany (Uwe) put it in place But here's a great place to start if you want to dive deep: https://hudson.apache.org/hudson/job/Lucene-trunk/javadoc/all/org/apache/lucene/search/NumericRangeQuery.html?is-e

Re: why query chinese character with bracket become phrase query by default?

2011-05-15 Thread Michael McCandless
I opened https://issues.apache.org/jira/browse/SOLR-2519 for this. Mike http://blog.mikemccandless.com On Sun, May 15, 2011 at 8:02 AM, Michael McCandless wrote: > On Fri, May 6, 2011 at 8:49 AM, Michael McCandless > wrote: > >> Shouldn't we  have field types in the eg schema for the different

Re: Build does not create webapp

2011-05-15 Thread Erick Erickson
try "ant example" Best Erick On Sun, May 15, 2011 at 11:15 AM, karanveer singh wrote: > After downloading the apache solr 3.1.0 source, I built it using ant > (ant compile). > However, this does not create the webapp in solr/example/webapps > There is also nothing in solr/example/work. > > Any i

Re: Field collapsing classloading issues

2011-05-15 Thread Erick Erickson
You can just configure another directive in solrconfig.xml if you want. Or, copy the relevant jars to one of the directories already specified by a directive in the config... Best Erick On Sun, May 15, 2011 at 10:18 AM, karanveer singh wrote: > Since I'm using solr 3.1.0, I'm guessing that t

Re: Schema Design Question

2011-05-15 Thread Erick Erickson
Of your first two options, I'd go with a multi-valued field for each book (1). But kenf_nc's suggestion is a good one too. On Sun, May 15, 2011 at 3:54 AM, kenf_nc wrote: > create a separate document for each book-bookshelf combination. > doc 1 = book 1,shelf 1 > doc 2 = book 1,shelf 3 > doc 3 =

Re: document storage

2011-05-15 Thread Erick Erickson
Where are the documents coming from? Because storing them ONLY in Solr risks losing them if your index is somehow hosed. Storing them externally only has the advantage that your index will be much smaller, which helps when replicating as you scale. The downside here is that highlighting will be mo

Re: Show filename in search result using a FileListEntityProcessor

2011-05-15 Thread Erick Erickson
You haven't specified that DIH should put the file name in the document as it indexes it, i.e. or some such... Best Erick On Thu, May 12, 2011 at 11:15 AM, Marcel Panse wrote: > Hi Solr community, > > I'm new to solr and trying to scan all pdf/doc files in a directory. This > works fine and I a

Re: Multi Word Filter Queries

2011-05-15 Thread Erick Erickson
Hmmm, Can we see some examples of 1> the query you use 2> the data you put in the field for a document that matches and one that doesn't did you re-index after each field type change? One observation: the KeywordTokenizerFactory will NOT produce individual tokens. So if you index the value "my do

Re: Solr performance

2011-05-15 Thread Erick Erickson
but if you ignore the default field, what would be searched unless you specify the field? Which is, as I understand it, your answer. Just specify q=keys: Or I completely misunderstand your question Best Erick On Fri, May 13, 2011 at 9:23 AM, javaxmlsoapdev wrote: > Alright. It turned out that

Re: TrieIntField for "short" values

2011-05-15 Thread Juan Antonio Farré Basurte
Hi, Thanks for your answer. I am doing range queries on this field, yes, that's why I cared about how all this trie thing works :) If I use precisionStep=0 would it be equivalent to use, say, a SortableIntField? Would it be possible that you explained, for example, the difference in how it woul

Re: Order of words in proximity search

2011-05-15 Thread Erick Erickson
Yes, order does matter. When order is changed as in your example, matching the text "foo always bar" would require one more move for "bar foo" than for "foo bar" Lucene In Action has some nice graphics explaining this Best, Erick On Fri, May 13, 2011 at 6:13 AM, Tor Henning Ueland wrote: > H

Build does not create webapp

2011-05-15 Thread karanveer singh
After downloading the apache solr 3.1.0 source, I built it using ant (ant compile). However, this does not create the webapp in solr/example/webapps There is also nothing in solr/example/work. Any idea how to make this work?

Re: TrieIntField for "short" values

2011-05-15 Thread Erick Erickson
Are you doing range queries on this field? Range queries are where Trie shines, so worrying about precision step if you're NOT intending to do range queries is a waste, just use precisionstep=0. In fact, with only 1,000 values, I'd just go with PrecisionStep=0 (which is the int field) Best Erick

Re: how to update database record after indexing

2011-05-15 Thread Erick Erickson
I'm assuming you have some idea what *should* have been indexed in the last hour, why can't you query your Solr index with that information? You shouldn't have to examine the entire index Best Erick On Thu, May 12, 2011 at 8:20 AM, vrpar...@gmail.com wrote: > actually every hour some records

Re: Debugging same SOLR installation on 2 different servers

2011-05-15 Thread Erick Erickson
What happens if you copy the index from one machine to the other? Probably from prod to test. If your results stay the same, that'd eliminate index differences as the culprit. What do you get by attaching &debugQuery=on the the queries that differ? Is the parsed query any different? I'm wondering

Re: Field collapsing classloading issues

2011-05-15 Thread karanveer singh
Since I'm using solr 3.1.0, I'm guessing that the solr.fieldCollapse classpath hasn't been set. How exactly should I be setting this? On Sun, May 15, 2011 at 7:08 PM, karanveer singh wrote: > Can you tell me how the packet paths are set? So that I can modify > them and let it know where to find f

Re: Field collapsing classloading issues

2011-05-15 Thread karanveer singh
Can you tell me how the packet paths are set? So that I can modify them and let it know where to find fieldCollapse.class? My fieldCollapse.class is in build/solr/org/apache/solr/handler/component My /lib folder contains some .jar files and some other files, I guess these were created at the time

Re: why query chinese character with bracket become phrase query by default?

2011-05-15 Thread Michael McCandless
On Fri, May 6, 2011 at 8:49 AM, Michael McCandless wrote: > Shouldn't we  have field types in the eg schema for the different > languages?  Ie, text_zh, text_th, text_en, text_ja, text_nl, etc. In fact, until we break out dedicated language field types, shouldn't we default autophrase to off in

How to get the filtered terms from a Query in the ResponseWriter?

2011-05-15 Thread Gabriele Kahlout
Hello, For a given q string I'm trying to extract the terms (identifiers of tokens) that the Query Parser identified at terms (and shows when explaining results). I manage to do it as follows, but *I hope there a better way (more direct) you will tell me about:* NamedList analysis = new *FieldAn

Re: Field collapsing classloading issues

2011-05-15 Thread Luis Cappa Banda
Hello. Where did you put the .class files associated to FieldCollapsing? It´s usual to include them into a .jar contained in /lib server folder, for example. Another option is that the packet path "solr.CollapseComponent" it´s not the same as the applied in your patch, if it was modified.

RE: Schema Design Question

2011-05-15 Thread kenf_nc
create a separate document for each book-bookshelf combination. doc 1 = book 1,shelf 1 doc 2 = book 1,shelf 3 doc 3 = book 2,shelf 1 etc. then your queries are q=book_id to get all bookshelfs a given book is on or q=shelf_id to get all books on a given bookshelf. Biggest problem people face wit

Re: Replication Clarification Please

2011-05-15 Thread Alexander Kanarsky
Ravi, what is the replication configuration on both master and slave? Also could you list of files in the index folder on master and slave before and after the replication? -Alexander On Fri, 2011-05-13 at 18:34 -0400, Ravi Solr wrote: > Sorry guys spoke too soon I guess. The replication stil