PointType hard-coded to Doubles?

2012-02-17 Thread Lance Norskog
The PointType seems to be hard-coded to use doubles. Where in the code does this happen? -- Lance Norskog goks...@gmail.com

Re: how to delta index linked entities in 3.5.0

2012-02-17 Thread AdamLane
Thanks for your thoughts Shawn. I did notice 3.x tightened up a lot and I did account for it by making sure I had pk defined and columns explicitly aliased with the same name (and I will make sure the bug text reflects that). To help others that are having the same problem, I just found a thread d

Using nested entities in FileDataSource import of xml file contents

2012-02-17 Thread Mike O'Leary
Can anybody help me understand the right way to define a data-config.xml file with nested entities for indexing the contents of an XML file? I used this data-config.xml file to index a database containing sample patient records:

RE: Improving proximity search performance

2012-02-17 Thread Bryan Loofbourrow
Apologies. I meant to type “1.4 TB” and somehow typed “1.4 GB.” Little wonder that no one thought the question was interesting, or figured I must be using Sneakernet to run my searches. -- Bryan Loofbourrow -- *From:* Bryan Loofbourrow [mailto:bloofbour...@knowle

Re: Solr Wiki and mailing lists

2012-02-17 Thread Artem Lokotosh
To unsubscribe, e-mail: solr-user-unsubscr...@lucene.apache.org Also you can request a FAQ, e-mail: solr-user-...@lucene.apache.org On Sat, Feb 18, 2012 at 12:38 AM, Lance Norskog wrote: > The Apache Solr main page does not mention the mailing lists. The wiki > main page has a broken link. I ha

Solr Wiki and mailing lists

2012-02-17 Thread Lance Norskog
The Apache Solr main page does not mention the mailing lists. The wiki main page has a broken link. I have had to search my incoming mail to find out how to unsubscribe from solr-user. Someone with full access: please fix these problems. Thanks, -- Lance Norskog goks...@gmail.com

Re: proper syntax for using sort query parameter in responseHandler

2012-02-17 Thread Tommaso Teofili
Hi Mark, Having a look at that requestHandler it looks ok [1], are you experiencing any errors? If so did you check the wiki page FieldOptionsByUseCase [2], maybe that field (rankNo) options contain indexed="false" or multiValued="true"? HTH, Tommaso [1] : http://wiki.apache.org/solr/CommonQueryPa

Re: Indexing 100Gb of readonly numeric data

2012-02-17 Thread Pedro Ferreira
Ouch... sorry about the format... I have no idea why gmail turned my text into that... On Fri, Feb 17, 2012 at 10:07 PM, Pedro Ferreira wrote: > Hi guys, I'm cross posting this from lucene list as I guess I can have > better help here for this scenario. > Suppose I want to index 100Gb+ of numeric

Indexing 100Gb of readonly numeric data

2012-02-17 Thread Pedro Ferreira
Hi guys, I'm cross-posting this from the lucene list as I guess I can get better help here for this scenario. Suppose I want to index 100Gb+ of numeric data. I'm not yet sure of the specifics, but I can expect the following: - data is expected to be in one gigantic table. conceptually, it is like a spreadshe

Re: Solritas: Modify $content in layout.vm

2012-02-17 Thread Erik Hatcher
$content is the rendered output of the main template. To modify what is generated into $content, modify the main template or the sub-#parsed templates (which is what you've discovered, looks like) that are rendered (browse.vm, perhaps, if you're using the default example setup). The main template t

proper syntax for using sort query parameter in responseHandler

2012-02-17 Thread geeky2
what is the proper syntax for including sort directive in my responseHandler? i tried this but got an error: edismax all 10 itemNo^1.0 *:* * rankNo desc* itemType:1 false thank you mark -- View this message in
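For readers landing on this thread: per the CommonQueryParameters wiki page cited in the reply above, a sort value takes the form "<field> <direction>", for example "rankNo desc". The sketch below only shows how that value looks once URL-encoded into a request; the host, port, and handler path are assumptions for illustration, not taken from the thread, and the field still has to be indexed and single-valued for the sort to work.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

public class SortParamSketch {
    // Builds a select URL with an explicit sort clause.
    // baseUrl is a hypothetical Solr location, not one from the thread.
    public static String selectUrl(String baseUrl, String q, String sort) {
        return baseUrl + "/select?q=" + URLEncoder.encode(q, StandardCharsets.UTF_8)
                + "&sort=" + URLEncoder.encode(sort, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // Sort by the rankNo field from the question, highest first.
        System.out.println(selectUrl("http://localhost:8983/solr", "*:*", "rankNo desc"));
        // prints http://localhost:8983/solr/select?q=*%3A*&sort=rankNo+desc
    }
}
```

The same "rankNo desc" string is what would go into a handler's default parameters if the sort should apply to every request.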

Re: distributed deletes working?

2012-02-17 Thread Mark Miller
On Feb 17, 2012, at 3:56 PM, Jamie Johnson wrote: > id field is a UUID. Strange - was using UUID's myself in same test this morning... I'll try again soon. - Mark Miller lucidimagination.com

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
yes committing in the mix. id field is a UUID. On Fri, Feb 17, 2012 at 3:22 PM, Mark Miller wrote: > You are committing in that mix right? > > On Feb 17, 2012, at 2:07 PM, Jamie Johnson wrote: > >> This was with the cloud-dev solrcloud-start.sh script (after that I've >> used solrcloud-start-exi

Re: Solritas: Modify $content in layout.vm

2012-02-17 Thread Erick Erickson
Why do you want to? That is, what are you trying to accomplish by modifying that variable? You may not really need to... This seems like an XY problem... Best Erick On Thu, Feb 16, 2012 at 11:06 PM, remi tassing wrote: > Hi all, > > How do we modify the "$content" variable in the layout.vm file?

Re: problem to indexing pdf directory

2012-02-17 Thread alessio crisantemi
I'm confused now, so here is my last question: I added this to my solrconfig.xml: c:\solr\conf\db-config.xml And I wrote my db-config.xml like this: should that work, in your opinion

Re: problem to indexing pdf directory

2012-02-17 Thread Erick Erickson
Sorry, my error! In that case you *do* have to do some fiddling to get it all to work. Good Luck! Erick On Fri, Feb 17, 2012 at 3:27 PM, alessio crisantemi wrote: > I tried... but I work with Solr 1.4.1 > > On 17 February 2012 at 15:59, Erick Erickson > wrote: > >> You should not have

Re: problem to indexing pdf directory

2012-02-17 Thread alessio crisantemi
I tried... but I work with Solr 1.4.1 On 17 February 2012 at 15:59, Erick Erickson wrote: > You should not have to do anything with Maven, the instructions > you followed were from 1.4.1 days.. > > Assuming you're working with a 3.x build, here's a data-config > that worked for me,

Re: distributed deletes working?

2012-02-17 Thread Mark Miller
You are committing in that mix right? On Feb 17, 2012, at 2:07 PM, Jamie Johnson wrote: > This was with the cloud-dev solrcloud-start.sh script (after that I've > used solrcloud-start-existing.sh). > > Essentially I run ./solrcloud-start-existing.sh > index docs > kill 1 of the solr instances (u

Re: distributed deletes working?

2012-02-17 Thread Yonik Seeley
On Fri, Feb 17, 2012 at 2:07 PM, Jamie Johnson wrote: > This was with the cloud-dev solrcloud-start.sh script (after that I've > used solrcloud-start-existing.sh). > > Essentially I run ./solrcloud-start-existing.sh > index docs > kill 1 of the solr instances (using kill -9 on the pid) > delete a

RE: customizing standard tokenizer

2012-02-17 Thread Steven A Rowe
Hi Torsten, The Lucene StandardTokenizer is written in JFlex (http://jflex.de) - you can see the version 3.X specification at: You can m

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
This was with the cloud-dev solrcloud-start.sh script (after that I've used solrcloud-start-existing.sh). Essentially I run ./solrcloud-start-existing.sh index docs kill 1 of the solr instances (using kill -9 on the pid) delete a doc from running instances restart killed solr instance on doing th

Re: distributed deletes working?

2012-02-17 Thread Mark Miller
Hmm...just tried this with only deletes, and the replica sync'd fine for me. Is this with your multi core setup or were you trying with instances? On Feb 17, 2012, at 1:52 PM, Jamie Johnson wrote: > Yes, still seeing that. Master has 8 items, replica has 9. So the > delete didn't seem to work

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
Yes, still seeing that. Master has 8 items, replica has 9. So the delete didn't seem to work when the node was down. On Fri, Feb 17, 2012 at 1:41 PM, Yonik Seeley wrote: > On Fri, Feb 17, 2012 at 1:38 PM, Jamie Johnson wrote: >> Something that didn't work though >> was if a node was down when

Custom Query Component: parameters are not appended to query

2012-02-17 Thread Vadim Kisselmann
Hello folks, I built a simple custom component for the “hl.q” query. My case was to inject hl.q=params on the fly, with filter params like fields which were in my standard query. These were highlighted, because Solr/Lucene have no way of interpreting an extended "q" clause and saying "this part is

Re: distributed deletes working?

2012-02-17 Thread Yonik Seeley
On Fri, Feb 17, 2012 at 1:38 PM, Jamie Johnson wrote: > Something that didn't work though > was if a node was down when a delete happened and then comes back up, > that node still listed the id I deleted. Is this currently supported? Yes, that should work fine. Are you still seeing that behavior

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
Ok, so I'm making some progress now. With _version_ in the schema (forgot about this because I remember asking about it before) deletes across the cluster work when I delete by id. Updates work as well; if a node is down, it recovers fine. Something that didn't work though was if a node was down

Re: distributed deletes working?

2012-02-17 Thread Yonik Seeley
On Fri, Feb 17, 2012 at 1:27 PM, Jamie Johnson wrote: > I'm seeing the following.  Do I need a _version_ long field in my schema? Yep... versions are the way we keep things sane (shuffled updates to a replica can be correctly reordered, etc). -Yonik lucidimagination.com
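To make the reordering point above concrete, here is a minimal sketch of version-based update handling. It is an illustration only and not Solr's actual implementation: the class, its methods, and the "null document means delete" convention are all hypothetical. The idea is that a replica remembers the highest version seen per id and ignores anything older, so a late-arriving add cannot resurrect a document that a newer delete already removed.

```java
import java.util.HashMap;
import java.util.Map;

public class VersionedReplicaSketch {
    // Highest version seen per document id (kept even after a delete).
    private final Map<String, Long> versions = new HashMap<>();
    // Current document bodies by id.
    private final Map<String, String> docs = new HashMap<>();

    /**
     * Applies an add (doc != null) or a delete (doc == null).
     * Returns false when the update is stale and was ignored.
     */
    public boolean apply(String id, long version, String doc) {
        Long seen = versions.get(id);
        if (seen != null && seen >= version) {
            return false; // an equal or newer update was already applied
        }
        versions.put(id, version);
        if (doc == null) {
            docs.remove(id);
        } else {
            docs.put(id, doc);
        }
        return true;
    }

    public boolean contains(String id) {
        return docs.containsKey(id);
    }

    public static void main(String[] args) {
        VersionedReplicaSketch replica = new VersionedReplicaSketch();
        replica.apply("doc-1", 2L, null);   // the delete arrives first
        replica.apply("doc-1", 1L, "body"); // the older add arrives late and is ignored
        System.out.println(replica.contains("doc-1")); // prints false
    }
}
```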

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
I'm seeing the following. Do I need a _version_ long field in my schema? Feb 17, 2012 1:15:50 PM org.apache.solr.update.processor.LogUpdateProcessor finish INFO: {delete=[f2c29abe-2e48-4965-adfb-8bd611293ff0]} 0 0 Feb 17, 2012 1:15:50 PM org.apache.solr.common.SolrException log SEVERE: org.apache

Re: Cloud tab hanging?

2012-02-17 Thread Mark Miller
On Feb 17, 2012, at 11:00 AM, Ranjan Bagchi wrote: > Hi, > > I'm pretty new to solr and especially solr cloud, so hopefully this isn't > too dumb: I followed the wiki instructions for setting up a small cloud. > Things seem to work, *except* on the UI [using chrome and safari], the > cloud tab

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
I stop the indexing, stop the shard, then start indexing again. So shouldn't need Yonik's latest fix? In regards to how far out of sync, it's completely out of sync, meaning index 100 documents to the cluster (40 on shard1 60 on shard2) then stop the instance, index 100 more, when I bring the ins

Re: distributed deletes working?

2012-02-17 Thread Yonik Seeley
On Fri, Feb 17, 2012 at 11:13 AM, Mark Miller wrote: > When exactly is this build from? Yeah... I just checked in a fix yesterday dealing with sync while indexing is going on. -Yonik lucidimagination.com

Re: distributed deletes working?

2012-02-17 Thread Mark Miller
On Feb 17, 2012, at 11:03 AM, Jamie Johnson wrote: > Thanks Sami, so long as it's expected ;) Yeah, it's expected - we always use both the live nodes info and state to determine the full state for a shard. > > In regards to the replication not working the way I think it should, > am I missing

Re: distributed deletes working?

2012-02-17 Thread Sami Siren
On Fri, Feb 17, 2012 at 6:03 PM, Jamie Johnson wrote: > Thanks Sami, so long as it's expected ;) > > In regards to the replication not working the way I think it should, > am I missing something or is it simply not working the way I think? It should work. I also tried to reproduce your issue but

Re: customizing standard tokenizer

2012-02-17 Thread Em
Hi Torsten, did you have a look at WordDelimiterTokenFilter? Sounds like it fits your needs. Regards, Em On 17.02.2012 at 15:14, Torsten Krah wrote: > Hi, > > is it possible to extend the standard tokenizer or use a custom one > (possible via extending the standard one) to add some "custom" tok

Re: Frequent garbage collections after a day of operation

2012-02-17 Thread Erick Erickson
A wonderful writeup on various memory collection concerns http://www.lucidimagination.com/blog/2011/03/27/garbage-collection-bootcamp-1-0/ On Fri, Feb 17, 2012 at 12:27 AM, Jason Rutherglen wrote: >> One thing that could fit the pattern you describe would be Solr caches >> filling up and gettin

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
Thanks Sami, so long as it's expected ;) In regards to the replication not working the way I think it should, am I missing something or is it simply not working the way I think? On Fri, Feb 17, 2012 at 11:01 AM, Sami Siren wrote: > On Fri, Feb 17, 2012 at 5:10 PM, Jamie Johnson wrote: >> and ha

Re: How to handle to run testcases in ruby code for solr

2012-02-17 Thread Erik Hatcher
Just FYI the solr-ruby (hyphen, not underscore to be precise) is "deprecated" in that the source no longer lives under Apache's svn. The gem is still out there, and it's still a useful library, but the Ruby/Solr world seems to use RSolr the most. Both have their pros/cons, but solr-ruby wo

Re: distributed deletes working?

2012-02-17 Thread Sami Siren
On Fri, Feb 17, 2012 at 5:10 PM, Jamie Johnson wrote: > and having looked at this closer, shouldn't the down node not be > marked as active when I stop that solr instance? Currently the shard state is not updated in the cloudstate when a node goes down. This behavior should probably be changed at

Cloud tab hanging?

2012-02-17 Thread Ranjan Bagchi
Hi, I'm pretty new to solr and especially solr cloud, so hopefully this isn't too dumb: I followed the wiki instructions for setting up a small cloud. Things seem to work, *except* on the UI [using chrome and safari], the cloud tab hangs. It says "Zookeeper Data", and then there's a loading sym

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
and having looked at this closer, shouldn't the down node not be marked as active when I stop that solr instance? On Fri, Feb 17, 2012 at 10:04 AM, Jamie Johnson wrote: > Thanks Mark.  I'm still seeing some issues while indexing though.  I > have the same setup describe in my previous email.  I d

Re: distributed deletes working?

2012-02-17 Thread Jamie Johnson
Thanks Mark. I'm still seeing some issues while indexing though. I have the same setup described in my previous email. I do some indexing to the cluster with everything up and everything looks good. I then take down one instance which is running 2 cores (shard2 slice 1 and shard 1 slice 2) and d

Re: problem to indexing pdf directory

2012-02-17 Thread Erick Erickson
You should not have to do anything with Maven, the instructions you followed were from 1.4.1 days.. Assuming you're working with a 3.x build, here's a data-config that worked for me, just a straight distro. But note a couple of things: 1> for simplicity, I changed the schema.xml to NOT require

Re: problem to indexing pdf directory

2012-02-17 Thread alessio crisantemi
Thanks Gora for your help. I installed Maven and downloaded Tika following the guide: But I get an error about 'tika compiler' during the Tika build, and the Maven installation of Tika stops. Is there another way? thank you a. 2012/2/16 Gora Mohanty > On 16 February 2012 21:37, alessi

customizing standard tokenizer

2012-02-17 Thread Torsten Krah
Hi, is it possible to extend the standard tokenizer or use a custom one (possibly by extending the standard one) to add some "custom" tokens, so that something like Lucene-Core is treated as "one" token? regards

Re: Date formatting issue

2012-02-17 Thread darul
Expected date example: 2012-02-17T13:45:08.1Z Date formatter: yyyy-MM-dd'T'HH:mm:ss.SSS'Z' Is the Zulu marker missing in yours? -- View this message in context: http://lucene.472066.n3.nabble.com/Date-formatting-issue-tp3748591p3753704.html Sent from the Solr - User mailing list archive at Nabble.com.
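A minimal sketch of producing a date in the expected shape, assuming the intended formatter pattern is yyyy-MM-dd'T'HH:mm:ss.SSS'Z' (the year token is an assumption based on the expected example starting with 2012). Because the 'Z' in the pattern is a quoted literal, the formatter has to be pinned to UTC explicitly; otherwise the local time zone would be written out with a misleading Zulu suffix. The class and method names are hypothetical.

```java
import java.text.SimpleDateFormat;
import java.util.Date;
import java.util.Locale;
import java.util.TimeZone;

public class SolrDateFormatSketch {
    // Assumed pattern; 'T' and 'Z' are literals, not time zone fields.
    static final String PATTERN = "yyyy-MM-dd'T'HH:mm:ss.SSS'Z'";

    public static String format(Date date) {
        // SimpleDateFormat is not thread-safe, so create one per call.
        SimpleDateFormat formatter = new SimpleDateFormat(PATTERN, Locale.ROOT);
        // Pin to UTC so the literal 'Z' is truthful.
        formatter.setTimeZone(TimeZone.getTimeZone("UTC"));
        return formatter.format(date);
    }

    public static void main(String[] args) {
        System.out.println(format(new Date(0L))); // prints 1970-01-01T00:00:00.000Z
    }
}
```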

Re: Payload and exact search - 2

2012-02-17 Thread Erick Erickson
OK, payloads are a bit of a mystery to me, so this may be way off base. But... The ordering of your analysis chain is suspicious, the admin/analysis page is a life-saver. WordDelimiterFilterFactory is breaking up your input before it gets to the payload filter I think, so your payload informatio

Re: Realtime search with multi clients updating index simultaneously.

2012-02-17 Thread Erick Erickson
See below On Thu, Feb 16, 2012 at 6:18 AM, v_shan wrote: > I have a helpdesk application developed in PHP/MySQL. I want to implement real > time Full text search and I have shortlisted Solr. MySQL database will store > all the tickets and their updates and that data will be imported for > buil

How to handle to run testcases in ruby code for solr

2012-02-17 Thread solr
Hi all, I am writing a Rails application that uses the solr_ruby gem to access Solr. Can anybody suggest how to handle test cases for Solr code and connections in functional testing? -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-handle-to-run-testcases-in-ruby-code-for-solr-

Removing empty dynamic fields from a Solr 1.4 index

2012-02-17 Thread Andrew Ingram
Hi all (Note: this question is cross-posted on stackoverflow: http://stackoverflow.com/questions/9327542/removing-empty-dynamic-fields-from-a-solr-1-4-index) I have a Solr index that uses quite a few dynamic fields. I've recently changed my code to reduce the amount of data we index with Solr,

Re: Error Indexing in solr 3.5

2012-02-17 Thread mechravi25
Hi Chantal, I checked my client. It was pointing to the old solrj. After changing that, it got indexed properly. Thanks a lot. -- View this message in context: http://lucene.472066.n3.nabble.com/Error-Indexing-in-solr-3-5-tp3746735p3753359.html Sent from the Solr - User mailing list archive at

How to connect embedded solr with each other by sharding

2012-02-17 Thread mustafozbek
I have been using sharding with multiple basic Solr servers for clustering. I also used one embedded Solr server (SolrJ Java API) with many basic Solr servers, connecting them by sharding with the embedded Solr server as their caller. I used the code line below for this purpose. >>SolrQuery q

Re: Solr edismax clarification

2012-02-17 Thread O. Klein
Indika Tantrigoda wrote > > Hi All, > > I am using edismax SearchHandler in my search and I have some issues in > the > search results. As I understand if the "defaultOperator" is set to OR the > search query will be passed as -> The OR quick OR brown OR fox > implicitly. > > Did you also re

Re: custom scoring

2012-02-17 Thread Carlos Gonzalez-Cadenas
Thanks Em, Robert, Chris for your time and valuable advice. We'll make some tests and will let you know soon. On Thu, Feb 16, 2012 at 11:43 PM, Em wrote: > Hello Carlos, > > I think we misunderstood each other. > > As an example: > BooleanQuery ( > clauses: ( > MustMatch( >

Re: Solr edismax clarification

2012-02-17 Thread Jan Høydahl
Please provide your full query, including your "qf" parameter and all other request parameters, and also the relevant fields/field-types from schema. Do you use stopwords? Can you also add "debugQuery=true" and paste in the parsedQuery? -- Jan Høydahl, search solution architect Cominvent AS - w