Re: Compound word search (maybe DisMaxQueryPaser problem)

2009-03-13 Thread Tobias Dittrich
First of all: sorry Chris, Walter .. I did not mean to put pressure on anyone. It's just that if you're stuck with something and you have that little needle stinging saying: maybe you're just too damn stupid for this ... :) So, thanks a lot for your answers. As for index time expansion using

Re: SolrJ : EmbeddedSolrServer and database data indexing

2009-03-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
nope .. But you can still use SolrJ to invoke DIH. cretae a ModifiableSolrParams with the required request parameters create a QueryRequest with the params and then set the path as /dataimport and invoke the command with the CommonsHttpSolrServer#request() On Fri, Mar 13, 2009 at 8:40 AM, Ashis

Two way Synonyms in Solr

2009-03-13 Thread dabboo
Hi, I am implementing 2 way synonyms in solr using q query parameter. One way synonym is working fine with q query parameter but 2 way is not working. for e.g. If I defined 2 way synonyms in the file like: value1, value2 It doesnt show any result for either of the value. Please suggest. Tha

Re: Two way Synonyms in Solr

2009-03-13 Thread Koji Sekiguchi
dabboo wrote: Hi, I am implementing 2 way synonyms in solr using q query parameter. One way synonym is working fine with q query parameter but 2 way is not working. for e.g. If I defined 2 way synonyms in the file like: value1, value2 It doesnt show any result for either of the value. Ple

Phrase Synonyms in solr

2009-03-13 Thread dabboo
Hi, Can someone please tell me how to implement phrase synonyms in solr. Thanks, Amit -- View this message in context: http://www.nabble.com/Phrase-Synonyms-in-solr-tp22492440p22492440.html Sent from the Solr - User mailing list archive at Nabble.com.

Solr: ERRORs at Startup

2009-03-13 Thread Giovanni De Stefano
Hello everybody, I am currently using: - Solr v1.3.0 - Jboss jboss-5.0.1.GA - Java jdk 1.5_06 When I start Solr within Jboss I see a lot of errors in the log but Solr "seems" working (meaning I can see the admin interface but I cannot index my DB...but that is

Re: How to correctly boost results in Solr Dismax query

2009-03-13 Thread Pete Smith
Hi Amit, Thanks very much for your reply. What you said makes things a bit clearer but I am still a bit confused. On Thu, 2009-03-12 at 23:14 -0700, dabboo wrote: > If you want to boost the records with their field value then you must use q > query parameter instead of q.alt. 'q' parameter actual

Re: Solr: ERRORs at Startup

2009-03-13 Thread Toby Cole
Hi Giovanni, It looks like logging is configured strangely. Those messages in my solr setup (on tomcat 6 or jetty) appear as INFO level messages. It could have something to do with your SLF4J setup, but I'm no expert on that side of things. I wouldn't worry too much, the content of the messag

Re: How to correctly boost results in Solr Dismax query

2009-03-13 Thread dabboo
Pete, Sorry, if wasnt clear. Here is the explanation. Suppose you have 2 records and they have films and media as 2 columns. Now first record has values like films="Indiana" and media="blue ray" and 2nd record has values like films="Bond" and media="Indiana" Values for qf parameters media^2.0

DIH with outer joins

2009-03-13 Thread Rui António da Cruz Pereira
I have queries with outer joins defined in some entities and for the same root object I can have two or more lines with different objects, for example: Taking the following 3 tables, andquery defined in the entity with outer joins between tables: Table1 -> Table2 -> Table3 I can have the fol

Re: How to correctly boost results in Solr Dismax query

2009-03-13 Thread Pete Smith
Hi Amit, Thanks again for your reply. I am understanding it a bit better but I think it would help if I posted an example. Say I have three records: 1 BLU-RAY Indiana Jones and the Kingdom of the Crystal Skull 2 DVD Indiana Jones and the Kingdom of the Crystal Skull 3 DVD Casino Royale Now

Re: How to correctly boost results in Solr Dismax query

2009-03-13 Thread dabboo
Pete, bq works only with q.alt query and not with q queries. So, in your case you would be using qf parameter for field boosting, you will have to give both the fields in qf parameter i.e. both title and media. try this media^1.0 title^100.0 Pete Smith-3 wrote: > > Hi Amit, > > Thanks agai

Solr: is there a default ClobTransformer?

2009-03-13 Thread Giovanni De Stefano
Hello all, I am trying to index an Oracle DB with some Clob columns. Following the doc I see that I need to transform my entity with a ClobTransformer. Now, my log says the following: 12:05:52,901 ERROR [STDERR] Mar 13, 2009 12:05:52 PM org.apache.solr.handler.dataimport.EntityProcessorBase loa

Re: How to correctly boost results in Solr Dismax query

2009-03-13 Thread Pete Smith
Hi, On Fri, 2009-03-13 at 03:57 -0700, dabboo wrote: > bq works only with q.alt query and not with q queries. So, in your case you > would be using qf parameter for field boosting, you will have to give both > the fields in qf parameter i.e. both title and media. > > try this > > media^1.0 title

Re: DIH with outer joins

2009-03-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
it is not very clear to me on how it works probably you can put in the queries here. you can do all the joins in the db in one complex query and use that straightaway in an entity. You do not have to do any joins inside DIH itself On Fri, Mar 13, 2009 at 4:47 PM, Rui António da Cruz Pereira wro

Re: Solr: is there a default ClobTransformer?

2009-03-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
ClobTranformer is a Solr1.4 feature. which one are you using? On Fri, Mar 13, 2009 at 4:39 PM, Giovanni De Stefano wrote: > Hello all, > > I am trying to index an Oracle DB with some Clob columns. > > Following the doc I see that I need to transform my entity with a > ClobTransformer. > > Now, my

Re: DIH with outer joins

2009-03-13 Thread Rui António da Cruz Pereira
I thought that I could remove the uniqueKey in Solr and then have more that one document with the same id, but then I don't know if in delta-imports the documents outdated or deleted are updated (updated document is added and then we would have the outdated and the updated document in the index

Re: Two way Synonyms in Solr

2009-03-13 Thread dabboo
Yes, I have defined expand=true for synonym definition. But still, 2 way synonym are not working. Also, is there any way, phrase synonym starts working. Koji Sekiguchi-2 wrote: > > dabboo wrote: >> Hi, >> >> I am implementing 2 way synonyms in solr using q query parameter. One way >> synonym

Re: DIH with outer joins

2009-03-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
have one root which just does a "select id from Table1 " .Then have a child entiy which does all the joins and return all other columns for that 'id'. On Fri, Mar 13, 2009 at 5:10 PM, Rui António da Cruz Pereira wrote: > I thought that I could remove the uniqueKey in Solr and then have more tha

Re: Solr: ERRORs at Startup

2009-03-13 Thread Giovanni De Stefano
Hello Toby, thank you for your quick reply. Even setting everything to INFO through http://localhost:8080/solr/admin/logging didn't help. But considering you do not see any bad issue here, at this time I will ignore those ERROR messages :-) Cheers, Giovanni On Fri, Mar 13, 2009 at 11:16 AM, T

DIH with outer joins

2009-03-13 Thread Rui António da Cruz Pereira
I have queries with outer joins defined in some entities and for the same root object I can have two or more lines with different objects, for example: Taking the following 3 tables, andquery defined in the entity with outer joins between tables: Table1 -> Table2 -> Table3 I can have the fol

Re: input XSLT

2009-03-13 Thread Grant Ingersoll
Have you tried Solr Cell? http://wiki.apache.org/solr/ExtractingRequestHandler On Mar 13, 2009, at 2:49 AM, CIF Search wrote: But these documents have to be converted to a particular format before being posted. Any XML document cannot be posted to Solr (with XSLT handled by Solr internal

Re: Solr: is there a default ClobTransformer?

2009-03-13 Thread Giovanni De Stefano
Hello Paul, I must have missed that detail :-) I am currently using Solr 1.3.0. Thank you very much for your remark: I just downloaded the latest nightly build, compile the whole thing and included the apache-solr-dataimporthandler-1.4-dev.jar in my $SOLR_HOME/lib folder. I have just been able

Re: fl wildcards

2009-03-13 Thread Erik Hatcher
On Mar 12, 2009, at 1:43 PM, Schley Andrew Kutz wrote: If I wanted to hack Solr so that it has the ability to process wildcards for the field list parameter (fl), where would I look? (Perhaps I should look on the solr-dev mailing list, but since I am already on this one I thought I would st

Re: fl wildcards

2009-03-13 Thread Schley Andrew Kutz
Thanks. If I knew where to begin to implement this, I would. It seems to me that the constraining of field lists must occur at the very core of Solr because of the reduction in search time when specifying a restrictive set of fields to return. For example, when I return 10 entire documents

Stemming in Solr

2009-03-13 Thread dabboo
Hi, Can someone please let me know how to implement stemming in solr. I am particularly looking of the changes, I might need to do in the config files and also if I need to use some already supplied libraries/factories etc etc. It would be a great help. Thanks, Amit Garg -- View this message

Re: fl wildcards

2009-03-13 Thread Mark Miller
Erik Hatcher wrote: Wasn't exactly an answer to your question. Wildcard support for field names in Solr is a feature that really deserves broader implementation consideration than just hacking one spot for fl. Other field list parameters, like hl.fl could use that capability too. I think

Storing "map" in Field

2009-03-13 Thread Jeff Crowder
All, I'm working with the sample schema, and have a scenario where I would like to store multiple prices in a "map" of some sort. This would be used for a scenario where a single "product" has different "prices" based on a price list. For instance: SKU001 A Sample Product 119.99

Re: Storing "map" in Field

2009-03-13 Thread Toby Cole
I don't think anything _quite_ like that exists, however you could use wildcard fields to achieve pretty much the same thing. You could use a post like this: SKU001 A Sample Product 119.99 109.99 if you have a field definition in your schema.xml like: stored="true"/> Regard

Re: Storing "map" in Field

2009-03-13 Thread Erick Erickson
H, what do you want to *do* with those multiple prices? Search? Display? Change all the time? Each of these operations will generate different suggestions I daresay Best Erick On Fri, Mar 13, 2009 at 10:01 AM, Jeff Crowder wrote: > All, > > I'm working with the sample schema, and have a

Re: DIH with outer joins

2009-03-13 Thread Walter Underwood
It may be easier to make a view in the database and index the view. Databases have good tools for that. wunder On 3/13/09 2:46 AM, "Rui António da Cruz Pereira" wrote: > I have queries with outer joins defined in some entities and for the > same root object I can have two or more lines with dif

rsync snappuller slowdown Qtime

2009-03-13 Thread sunnyfr
Hi, Noticing a relevant latency during search, I tried to turn off cronjob and test it manually. And it was obvious how during snappuller on a slave server, the query time was a lot longer than the rest of the time. Even snapinstaller didn't affect the query time. without any action around 200

Re: fl wildcards

2009-03-13 Thread Schley Andrew Kutz
That makes sense, since hl.fl probably can get away with calculating in the writer, and not as part of the core. However, I really need wildcard (or globbing) support for field lists as part of the common query parameter "fl". Again, if someone can just point me to where the Solr core is us

Re: DIH use of the ?command=full-import entity= command option

2009-03-13 Thread Jon Baer
Bare in mind (and correct me if Im wrong) but a "full-import" is still a "full-import" no matter what entity you tack onto the param. Thus I think clean=false should be appended (a friend starting off in Solr was really confused by this + could not understand why it did a delete on all docu

Re: rsync snappuller slowdown Qtime

2009-03-13 Thread Yonik Seeley
On Fri, Mar 13, 2009 at 10:33 AM, sunnyfr wrote: > And it was obvious how during snappuller on a slave server, the query time > was a lot longer than the rest of the time. Did the CPU utilization drop? It could be writing of the new files being pulled forcing parts of the current index files out

Re: Tomcat holding deleted snapshots until it's restarted - SOLVED!!!

2009-03-13 Thread Marc Sturlese
Hey Yonik, I tested the last nightly build and still happens... but I have solved it! I tell you my solution, it seems to be working well but just want to be sure that it doesn't have any bad effects as for me this is one of the most complicated parts of the Solr source (the fact of dealing with m

Re: Tomcat holding deleted snapshots until it's restarted - SOLVED!!!

2009-03-13 Thread Yonik Seeley
decref() decrements the reference count and closes the searcher when it reaches 0 (no more users). Forcing it to close at the point you did is unsafe since other threads may still be using that searcher. The real issue lies somewhere else - either a stuck thread, or some code that is not decrementi

Caching question + "smart" autowarming

2009-03-13 Thread Jon Baer
I have a few general questions re: caching ... 1. The FastLRU cache in 1.4 seems promising but is there a more comprehensive list of benefits? Is there a huge speed boost for using this type of cache? 2. What are the possibilities to using external caches for scaling out like memcachedb

Re: Tomcat holding deleted snapshots until it's restarted - SOLVED!!!

2009-03-13 Thread Marc Sturlese
Ok, I will open a bug issue now. > Forcing it to close at the point you did is unsafe since other threads > may still be using that searcher. Can you give me an example where other threads would be using that searcher? (As I said I find this part of the source dufficult to understand and imagin

Re: Tomcat holding deleted snapshots until it's restarted - SOLVED!!!

2009-03-13 Thread Yonik Seeley
On Fri, Mar 13, 2009 at 1:00 PM, Marc Sturlese wrote: > > Ok, I will open a bug issue now. > >> Forcing it to close at the point you did is unsafe since other threads >> may still be using that searcher. > > Can you give me an example where other threads would be using that searcher? Any searches

Re: Caching question + "smart" autowarming

2009-03-13 Thread Yonik Seeley
On Fri, Mar 13, 2009 at 12:58 PM, Jon Baer wrote: > I have a few general questions re: caching ... > > 1. The FastLRU cache in 1.4 seems promising but is there a more > comprehensive list of benefits?  Is there a huge speed boost for using this > type of cache? It simply removes contention for re

Re: DIH with outer joins

2009-03-13 Thread Rui António da Cruz Pereira
The two entities resolves the problem, but adds some overhead (the queries can be really big). The views doesn't work for me, as the queries are dynamically generated, taken in consideration a determinate topology. Noble Paul നോബിള്‍ नोब्ळ् wrote: have one root which just does a "select id f

Re: DataImportHandler Robustness For Imports That Take A Long Time

2009-03-13 Thread Chris Harris
Wouldn't this approach get confused if there was an error that caused DIH to do a rollback? For example, suppose this happened: * 1000 successful document adds * The custom transformer saves some marker in the DB to signal that the above docs have been successfully indexed * The next document add

Wildcard query search

2009-03-13 Thread Narayanan, Karthikeyan
Hi, I am trying to perform wildcard search using q query. The query results are returned. After getting the results, I trying to get the highlighting using ressponse.getHighlighting(). It returns empty list. But It works fine for non-wildcard searches. Any ideas please?. Thanks.

Re: Wildcard query search

2009-03-13 Thread Erick Erickson
Fragments from the user list (search it for the full context, I don't have the URL for the searchable user list handy, but it's on the Wiki) **original post Hi, i'm using solr 1.3.0 and SolrJ for my java application I need to highlight my query words even if I use wildcards for

Custom handler that forwards a request to another core

2009-03-13 Thread Pascal Dimassimo
Hi, I'm writing a custom handler that forwards a request to a handler of another core. The custom handler is defined in "core0" and the core I try to send the request to is "core2" which has a mlt handler. Here is the code of my custom handler (extends RequestHandlerBase and implements SolrCoreAw

Commit is taking very long time

2009-03-13 Thread mahendra mahendra
Hello,   I am experiencing strange problems while doing commit. I am doing indexing for every 10 min to update index with data base values. commit is taking 7 to 10 min approximately and my indexing is failing due to null pointer exception. If first thread is not completed in 10 min the second t

Re: Commit is taking very long time

2009-03-13 Thread Yonik Seeley
>From your logs, it looks like the time is spent in closing of the index. There may be some pending deletes buffered, but they shouldn't take too long. There could also be a merge triggered... but this would only happen sometimes, not every time you commit. One more relatively recent change in Luc

Re: DIH with outer joins

2009-03-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
joining entities may have some overhead. Is it prohibitive in absolute terms? On Sat, Mar 14, 2009 at 12:29 AM, Rui António da Cruz Pereira wrote: > The two entities resolves the problem, but adds some overhead (the queries > can be really big). The views doesn't work for me, as the queries are >

Re: DataImportHandler Robustness For Imports That Take A Long Time

2009-03-13 Thread Noble Paul നോബിള്‍ नोब्ळ्
alternately you can do the commit yourself after marking in the db . Context#getSolrCore().getUpdateHandler().commit() or as you mentioned you can do an autocommit On Sat, Mar 14, 2009 at 12:31 AM, Chris Harris wrote: > Wouldn't this approach get confused if there was an error that caused > DIH

Re: what crawler do you use for Solr indexing?

2009-03-13 Thread ristretto.rb
Hello, I built my own crawler with Python, as I couldn't find (not complaining, probably didn't look hard enough) nutch documentation. I use BeautifulSoup, because the site is mostly based on Python/Django, and we like Python. Writing one was good for us because we spent most of out time figurin

Re: unique result

2009-03-13 Thread ristretto.rb
FWIW... We run a hash or the content and other bits of our docs, and then remove duplicates according to specific algorithms. (exactly the same page content can clearly be hosted on many different urls but, and domains) Then, the choosen ones are indexed. Though we toss the synonyms in the inde

com.ctc.wstx.exc.WstxLazyException exception while passing the text content of a word doc to SOLR

2009-03-13 Thread Suryasnat Das
Hi, I am using Apache POI parser to parse a Word Doc and extract the text content. Then i am passing the text content to SOLR. The Word document has many pictures, graphs and tables. But when i am passing the content to SOLR, it fails. Here is the exception trace. 09:31:04,516 ERROR [STDERR] Mar

Re: Solr: ERRORs at Startup

2009-03-13 Thread Chris Hostetter
: Even setting everything to INFO through : http://localhost:8080/solr/admin/logging didn't help. : : But considering you do not see any bad issue here, at this time I will : ignore those ERROR messages :-) i would read up more on how to configure logging in JBoss. as far as i can tell, Solr is