Re: two cores but have single result set in solr

2011-09-24 Thread hadi
I do not know how to search both cores and not define "shard" parameter,could you show me some solutions for solve my issue? On 9/24/11, Yury Kats [via Lucene] wrote: > > > On 9/23/2011 6:00 PM, hadi wrote: >> I index my files with solrj and crawl my sites with nutch 1.3 ,as you >> know, i have t

matching reponse and request

2011-09-24 Thread Roland Tollenaar
Hi, sorry for this question but I am hoping it has a quick solution. I am sending multiple get request queries to solr but solr is not returning the responses in the sequence I send the requests. The shortest responses arrive back first I am wondering whether I can add a tag to the reque

Re: levenshtein ranked results

2011-09-24 Thread Roland Tollenaar
Thanks Otis, this helps me tremendously. Kind regards, Roland Otis Gospodnetic wrote: Hi Roland, I did this: http://search-lucene.com/?q=sort+by+function&fc_project=Solr&fc_type=wiki Which took me to this: http://wiki.apache.org/solr/FunctionQuery#Sort_By_Function And further on that pag

Sending pdf files to slor for indexing

2011-09-24 Thread ahmad ajiloo
Hi all I want to send a pdf file to slor for indexing. there is a command to send Solr a file via HTTP POST: http://wiki.apache.org/solr/ExtractingRequestHandler#Getting_Started_with_the_Solr_Example but "*curl*" is for linux and I want to use Solr in Windows. thanks a lot.

Re: Sending pdf files to slor for indexing

2011-09-24 Thread ahmad ajiloo
Also when I use that command in Linux, see this error: --- *Error 400 ERROR:unknown field 'ignored_meta'* HTTP ERROR 400 Problem accessing /solr/update/extract. Reason: ERROR:unknown field 'ignored_meta'Powered by Jetty://

Re: two cores but have single result set in solr

2011-09-24 Thread Yury Kats
On 9/24/2011 3:09 AM, hadi wrote: > I do not know how to search both cores and not define "shard" > parameter,could you show me some solutions for solve my issue? See this: http://wiki.apache.org/solr/DistributedSearch

indexing a xml file

2011-09-24 Thread ahmad ajiloo
hello Solr Tutorial page explains about index a xml file. but when I try to index a xml file with this command: ~/Desktop/apache-solr-3.3.0/example/exampledocs$ java -jar post.jar solr.xml I get this error: SimplePostTool: FATAL: Solr returned an error #400 ERROR:unknown field 'name' can anyone he

Re: indexing a xml file

2011-09-24 Thread GR
i think the xml to be indexed has to follow a certain schema, defined in schema.xml under conf directory. maybe, your solr.xml is not doing that Sent from my iPhone On 24 Sep 2011, at 18:15, ahmad ajiloo wrote: hello Solr Tutorial page explains about index a xml file. but when I try to

Re: two cores but have single result set in solr

2011-09-24 Thread hadi
I read the link but the 'http://localhost:8983/solr/select?shards=localhost:8983/solr,localhost:7574/solr&indent=true&q=ipod+solr' have a XML response that is not useful for me, i want to create query in solr/browse so this is need to change the template engine,do you know how to change that to sea

Re: Sending pdf files to slor for indexing

2011-09-24 Thread pulkitsinghal
You should get cygwin for windows and make sure to select curl as one of the many packages that come with cygwin when it's installer runs. Sent from my iPhone On Sep 24, 2011, at 5:29 AM, ahmad ajiloo wrote: > Also when I use that command in Linux, see this error: > ---

Re: strategy for post-processing answer set

2011-09-24 Thread Fred Zimmerman
ok. this is a very basic question so please bear with me. I see where the velocity templates are and I have looked at the documentation and get the idea of how to write them. it looks to me as if Solr just brings back the URLs. what I want to do is to get the actual documents in the answer set,

Re: Is verboten?

2011-09-24 Thread Erick Erickson
Does wrapping your content in CDATAs work? Best Erick On Mon, Sep 19, 2011 at 6:39 PM, chadsteele.com wrote: > It seems xml docs that use fail to be indexed properly and I've > recently discovered the following fails on my installation. > > /solr/update?stream.body= > > thoughts? > > I need to

Re: shareSchema="true" - location of schema.xml?

2011-09-24 Thread rkuris
I have 300 cores so I feel your pain :-) What we do is use a relative path for the file. It works if you use ../../common/schema.xml for each core, then just create a common directory off your solr home and put your schema file there. I found this works great with solrconfig.xml and all of it's

Re: matching reponse and request

2011-09-24 Thread rkuris
I don't think you can do this. If you are sending multiple GET requests, you are doing it across different HTTP connections. The web service has no way of knowing these are related. One solution would be to pass a spare, unused parameter to your request, like sequenceId=NNN and get the response

resource to see which versions build from trunk?

2011-09-24 Thread Jason Toy
Hi all, I am testing various versions of solr from trunk, I am finding that often times the example doesn't build and I can't test out the version. Is there a resource that shows which versions build correctly so that we can test it out?

RE: JdbcDataSource and threads

2011-09-24 Thread rkuris
My guess on this is that you're making a LOT of database requests and have a million TIME-WAIT connections, and your port range for local ports is running out. You should first confirm that's true by running netstat on the machine while the load is running. See if it gives a lot of output. One w

Re: field value getting null with special char

2011-09-24 Thread Erick Erickson
I can't imagine that the ( or ) is a problem. So I think we need to see how you're using SolrJ. In particular, are you asking for the field in question to be returned (e.g. SolrQuery.setFields or addField)? Second question: Are you sure your SolrJ is connecting to the server you connect to with th

Re: q and fq in solr 1.4.1

2011-09-24 Thread Erick Erickson
Why is it important? What are you worried about that this implementation detail is necessary to know about? But the short answer is that the fq's are calculated against the whole index and the results are efficiently cached. That's the only way that the fq can be re-used against a different search

Re: JSON response with SolrJ

2011-09-24 Thread Erick Erickson
Hmmm, what advantage does JSON have over the SolrDocument you get back? Perhaps if you describe that we can offer better suggestions. Best Erick On Wed, Sep 21, 2011 at 5:01 AM, Kissue Kissue wrote: > Hi, > > I am using solr 3.3 with SolrJ. Does anybody have any idea how i can > retrieve JSON re

Re: Selective values for facets

2011-09-24 Thread Erick Erickson
You don't do anything special for facet at index time unless you, say, wanted to remove some value from the facet field, but then it would NEVER be available. So if you're saying that at index time you have certain documents 'New Year's Offers' that ONLY EVER want to map to NEWA, NEWB, NEWY, you co

Re: Production Issue: SolrJ client throwing this error even though field type is not defined in schema

2011-09-24 Thread Erick Erickson
You might want to review: http://wiki.apache.org/solr/UsingMailingLists There's really not much to go on here. Best Erick On Wed, Sep 21, 2011 at 12:13 PM, roz dev wrote: > Hi All > > We are getting this error in our Production Solr Setup. > > Message: Element type "t_sort" must be followed by

Re: NRT and commit behavior

2011-09-24 Thread Erick Erickson
No . The problem is that "number of documents" isn't a reliable indicator of resource consumption. Consider the difference between indexing a twitter message and a book. I can put a LOT more docs of 140 chars on a single machine of size X than I can books. Unfortunately, the only way I know of is

Re: Solr Indexing - Null Values in date field

2011-09-24 Thread Erick Erickson
Solr dates are very specific, and your parsing exception is expected. See: http://lucene.apache.org/solr/api/org/apache/solr/schema/DateField.html Best Erick On Thu, Sep 22, 2011 at 6:28 AM, mechravi25 wrote: > Hi, > > Thanks for the suggestions. This is the option I tried. > > I changed the dat

Re: mlt content stream help

2011-09-24 Thread Erick Erickson
What version of Solr? When you copied the default, did you set up default values for MLT? Showing us the request you used and the relevant portions of your solrconifg file would help a lot, you might want to review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Thu, Sep 22, 2011 a

Update ingest rate drops suddenly

2011-09-24 Thread eks dev
just looking for hints where to look for... We were testing single threaded ingest rate on solr, trunk version on atypical collection (a lot of small documents), and we noticed something we are not able to explain. Setup: We use defaults for index settings, windows 64 bit, jdk 7 U2. on SSD, machi

Re: Production Issue: SolrJ client throwing - Element type must be followed by either attribute specifications, ">" or "/>".

2011-09-24 Thread Erick Erickson
I suspect this is an issue with, say, your servelet container truncating the response or some such, but that's a guess... Best Erick On Thu, Sep 22, 2011 at 9:09 PM, roz dev wrote: > Wanted to update the list with our finding. > > We reduced the number of documents which are being retrieved from

Re: How to map database table for facted search?

2011-09-24 Thread Erick Erickson
In general, you flatten the data when you put things into Solr. I know that's anathema to DB training, but this is searching ... If you have a reasonable number of distinct column names, you could just define your schema to have an entry for each and index the associated values that way. Then your

Re: Solr wildcard searching

2011-09-24 Thread Erick Erickson
Really, really, get in the habit of looking at your query with &debugQuery=on appended, it'll save you a world of pain .. customer_name:John Do* doesn't do what you think. It parses into customer_name:John OR default_search_field:Do* you want something like customer_name:(+John +Do*) or +customer

Re: Solrj - when a request fails

2011-09-24 Thread Erick Erickson
Hmmm. I'm a little confused. Are you sure your log is going somewhere and that you are NOT seeing any stack traces? Because it looks like you *are* seeing them. In which case re-throwing an error breaks your file fetch loop and stops your processing. I'd actually expect that you're losing some

Re: Solr 3.4 Problem with integrating Query Parser Plug In

2011-09-24 Thread Erick Erickson
Could you please add some details here? It's really hard to figure out what the problem is. Perhaps you could review: http://wiki.apache.org/solr/UsingMailingLists Best Erick On Fri, Sep 23, 2011 at 9:28 AM, Ahson Iqbal wrote: > Hi > > I have indexed some 1M documents, just for performance test

Re: what are the disdvantages of using dynamic fields?

2011-09-24 Thread Erick Erickson
There are really no differences between dynamic and static fields performance-wise that I know of. Personally, though, I tend to prefer static over dynamic from a maintenance/debugging perspective. At issue is tracking down why results weren't as expected, then spending several days discovering th

Re: two cores but have single result set in solr

2011-09-24 Thread Erick Erickson
I think you should step back and consider what you're asking for as Ken pointed out. You have different schemas. And presumably different documents in each schema. The scores from the different cores are NOT comparable. So how could you "combine" the meaningfully? Further, assuming that the documen

Re: resource to see which versions build from trunk?

2011-09-24 Thread Erick Erickson
Hmmm, why are you doing this? Why not use the latest successful trunk build? You can get a series of built artifacts at: https://builds.apache.org//view/S-Z/view/Solr/job/Solr-trunk/ but I'm not sure how far back they go. How are you getting the trunk source code? And *how* don't they build? But

Re: Solr wildcard searching

2011-09-24 Thread lboutros
And to complete the answer of Erick, in this search, customer_name:"Joh*" * is not considered as a wildcard, it is an exact search. another thing, (it is not your problem...), Words with wildcards are not analyzed, so, if your analyzer contains a lower case filter, in the index, these words

Re: Solr wildcard searching

2011-09-24 Thread Erick Erickson
Thanks Ludovic, you're absolutely right, I should have added that. BTW, there are patches that haven't been committed, see: https://issues.apache.org/jira/browse/SOLR-1604 and similar. Best Erick On Sat, Sep 24, 2011 at 1:32 PM, lboutros wrote: > And to complete the answer of Erick, > > in this

Re: SOLR error with custom FacetComponent

2011-09-24 Thread Ravi Bulusu
Erik, Unfortunately the facet fields are not static. The field are dynamic SOLR fields and are generated by different applications. The field names will be populated into a data store (like memcache) and facets have to be driven from that data store. I need to write a Custom FacetComponent which

Re: resource to see which versions build from trunk?

2011-09-24 Thread Erik Hatcher
Hey, the more hammering on trunk the better! On Sep 24, 2011, at 13:31 , Erick Erickson wrote: > Hmmm, why are you doing this? Why not use the latest > successful trunk build? > > You can get a series of built artifacts at: > https://builds.apache.org//view/S-Z/view/Solr/job/Solr-trunk/ > but I

Re: resource to see which versions build from trunk?

2011-09-24 Thread Erick Erickson
Agreed, but I'd rather see hammering on latest code On Sat, Sep 24, 2011 at 1:53 PM, Erik Hatcher wrote: > Hey, the more hammering on trunk the better! > > > On Sep 24, 2011, at 13:31 , Erick Erickson wrote: > >> Hmmm, why are you doing this? Why not use the latest >> successful trunk build?

Re: indexing a xml file

2011-09-24 Thread Bill Bell
Send us the example "solr.xml" and "schema.xml'". You are missing fields in the schema.xml that you are referencing. On 9/24/11 8:15 AM, "ahmad ajiloo" wrote: >hello >Solr Tutorial page explains about index a xml file. but when I try to >index >a xml file with this command: >~/Desktop/apache-sol

Re: Update ingest rate drops suddenly

2011-09-24 Thread Otis Gospodnetic
eks, This is clear as day - you're using Winblows!  Kidding. I'd: * watch IO with something like vmstat 2 and see if the rate drops correlate to increased disk IO or IO wait time * monitor the DB from which you were pulling the data - maybe the DB or the server that runs it had issues * monitor

Re: matching reponse and request

2011-09-24 Thread Otis Gospodnetic
Hi Roland, Check this: 0 0 on 0 solr 1            <=== from &foo=1 2.2 10   I added &foo=1 to the request to Solr and got the above back. Otis Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ >

Best Solr escaping?

2011-09-24 Thread Bill Bell
What is the best algorithm for escaping strings before sending to Solr? Does someone have some code? A few things I have witnessed in "q" using DIH handler * Double quotes - " that are not balanced can cause several issues from an error (strip the double quote?), to no results. * Should we use + o

Re: Search query doesn't work in solr/browse pnnel

2011-09-24 Thread Bill Bell
Yes. It appears that "&" cannot be encoded in the URL or there is really bad results. For example we get an error on first request, but if we refresh it goes away. On 9/23/11 2:57 PM, "hadi" wrote: >When I create a query like "something&fl=content" in solr/browse the "&" >and >"=" in URL conve

RE: JdbcDataSource and threads

2011-09-24 Thread Vazquez, Maria (STM)
Thanks a lot for your response! I think that is exactly what's happening. It runs ok for a short time and starts throwing that error while some of the ueriea run successfully. I had it setup with 10 threads, maybe that was too much. I'd be very interested in that code if you don't mind sharing. I'

Solr UpdateJSON - extra fields

2011-09-24 Thread msingla
If JSON being posted to ''http://localhost:8983/solr/update/json' URL has extra fields that are not defined in the index schema definition, will those be silently ignored or an error thrown. -- View this message in context: http://lucene.472066.n3.nabble.com/Solr-UpdateJSON-extra-fields-tp3366066