Re: Trouble Setting Up Development Environment

2012-09-09 Thread Amit Nithian
Sorry, I'm really late to this, so I'm not sure if this is even an issue: 1) I found that there is an "ant eclipse" target that makes it easy to set up the Eclipse .project and .classpath (I think I had done this by hand in the tutorial) 2) Yes, you can attach to a remote instance of Solr but your JVM has to have t

Re: Use a boolean field as a flag for another, just stored, field

2012-09-09 Thread simple350
Exactly, and the question is how to populate a boolean field at index time based on the content of another one. My initial idea was to make a copy of the initial field and 'filter' it to a boolean value (by specifying some analyzer in schema.xml), but, as I said, I might miss some basic Solr

Is there a way to constrain the depth of hierarchy when using PathHierarchyTokenizerFactory?

2012-09-09 Thread bbarani
Hi, I am trying to figure out if there's a way to constrain the depth of the hierarchy when using PathHierarchyTokenizerFactory, something like below. For the hierarchy below, 361 361 361 Users should see just the top level initially, 361 When the user tries to expand this tree they should b
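
For what it's worth, here is a rough sketch of the idea in plain Python, not Solr code. A path-hierarchy tokenizer emits one token per ancestor prefix of a path, so one way to think about a depth limit is to cap how many prefixes are kept. The function name, the `max_depth` parameter, and the sample path are made up for illustration; this does not claim the factory has such an option.

```python
def path_hierarchy_tokens(path, delimiter="/", max_depth=None):
    """Return the ancestor prefixes of `path`, shallowest first.

    Mimics the kind of output a path-hierarchy tokenizer produces.
    `max_depth` is a hypothetical knob for this sketch, not a Solr attribute.
    """
    # Drop empty segments so leading or doubled delimiters do not create tokens.
    parts = [p for p in path.split(delimiter) if p]
    if max_depth is not None:
        parts = parts[:max_depth]
    # Build "a", "a/b", "a/b/c", ... from the remaining segments.
    return [delimiter.join(parts[:i]) for i in range(1, len(parts) + 1)]


if __name__ == "__main__":
    # Hypothetical category path, all levels.
    print(path_hierarchy_tokens("books/fiction/scifi"))
    # Only the top level, as a tree view might want initially.
    print(path_hierarchy_tokens("books/fiction/scifi", max_depth=1))
```

A client could probably get the "top level first, expand on demand" behaviour by asking only for tokens of the depth currently shown, but that depends on how the field is queried.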

Re: zkcli command line util

2012-09-09 Thread Mark Miller
Yeah, sorry. You either have to manually extract (unzip) the Solr war the first time or start Solr in non-cloud mode and stop it - that will extract the webapp from the war file. If we included it extracted, it would blow up the dist's size. Sent from my iPhone On Sep 9, 2012, at 1:06 AM, Jes

Re: Cloud terminology clarification

2012-09-09 Thread Mark Miller
For me, a node is a single Solr instance - generally this is one per machine. A single Solr instance (or node) may have multiple cores. Configuration is per collection, not per SolrCore - unless of course your collection is one shard with no replicas - then I guess you could say it was per

Re: Is Boilerpipe usable through Solr ExtractingUpdateHandler or the DIH?

2012-09-09 Thread Lance Norskog
Cool! I have since learned another method for handling the redundant templated spew in html pages: crawl the mobile site instead. - Original Message - | From: "Markus Jelsma" | To: solr-user@lucene.apache.org | Sent: Friday, September 7, 2012 3:05:40 AM | Subject: RE: Is Boilerpipe usabl

Re: SOLR 4.0 DataImport frozen or fails with WARNING: Unable to read: dataimport.properties?

2012-09-09 Thread deniz
Thank you so much, Travis, it is working so smoothly right now - Smart, but he doesn't work... If he worked, he would do it... -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-4-0-DataImport-frozen-or-fails-with-WARNING-Unable-to-read-dataimport-properties-tp4006077p4006506.html Sent from

Re: Use a boolean field as a flag for another, just stored, field

2012-09-09 Thread Savvas Andreas Moysidis
So, as you say, you only need to have a "hasInternalURL" (or something similar) which will be of type boolean and will be populated at index time? Unless I'm missing something I don't see why you would need a custom analyzer for this. On 9 September 2012 22:56, simple350 wrote: > Well - this was

Re: Use a boolean field as a flag for another, just stored, field

2012-09-09 Thread simple350
Well - this was the idea: not to index the useless data from the initial field but to add and index another field, a boolean one, based on the content of the first one. -- View this message in context: http://lucene.472066.n3.nabble.com/Use-a-boolean-field-as-a-flag-for-another-just-stored-fiel
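
A minimal sketch of that idea, assuming the flag is computed on the client before the document is sent for indexing (no custom analyzer involved). `hasInternalURL` is the name suggested elsewhere in this thread; `internalURL` and the function name are hypothetical.

```python
def add_url_flag(doc, source_field="internalURL", flag_field="hasInternalURL"):
    """Return a copy of `doc` with a boolean flag derived from `source_field`.

    The source field can stay stored-only; only the flag needs to be indexed.
    Field names here are illustrative assumptions.
    """
    flagged = dict(doc)  # do not mutate the caller's document
    value = doc.get(source_field)
    # True only when the source field is present and not blank.
    flagged[flag_field] = bool(value and str(value).strip())
    return flagged


if __name__ == "__main__":
    print(add_url_flag({"id": "1", "internalURL": "http://intranet.example/doc/1"}))
    print(add_url_flag({"id": "2"}))
```

At search time a filter on the boolean field would then select the documents that have the URL, provided the flag field itself is indexed.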

Use field as bool flag for another, not indexed, field

2012-09-09 Thread simple350
Hi, I want to be able to select from the index the documents that have a certain field that is not null. The problem is that the field is not indexed, just stored. I'm not interested in indexing that field as it is just an internal URL. The idea was to add another field to the document - a boolean field

Re: Use a boolean field as a flag for another, just stored, field

2012-09-09 Thread Savvas Andreas Moysidis
Hi, In your field declaration you can specify a "default" value that your field will be populated with in the absence of any value, and later at search time run filter queries against that value. Bear in mind that if you want to filter your results based on any value you *have* to index that va

Re: How to use Apache Cassandra with Apache Solr

2012-09-09 Thread Voth, Brad (GE Corporate)
Be wary of Solandra; after talking with Jake (its developer) I have learned it was a POC for the integration in DataStax and it is now abandoned. Interestingly, I just did a bakeoff between DataStax, Solandra, Solr and Elasticsearch for our environment and DataStax performed decently but the sol

Use a boolean field as a flag for another, just stored, field

2012-09-09 Thread simple350
Hi, I want to be able to select from the index the documents that have a certain field that is not null. The problem is that the field is not indexed, just stored. I'm not interested in indexing that field as it is just an internal URL. The idea was to add another field to the document - a boolean field -

Re: Solr 4: Private master, public slave?

2012-09-09 Thread Alexandre Rafalovitch
Hmm, that might actually work. My current prototype is using DIH and TIKA for stop-the-world index re-population, so I assumed it would have to be done by a local SOLR instance. But I guess for production, I can run TIKA on the client and not use DIH at all. This might be enough. Thank you,

How to use Apache Cassandra with Apache Solr

2012-09-09 Thread PeterKerk
For a totally new project I'm building I want to use Cassandra as my datasource (normally I use MSSQL) and provide search/facet navigation with Apache Solr. Is there a tutorial somewhere on how to set this up? I've heard of commercial solutions like Datastax or a small open source initiative Solla

Re: Undefined field text

2012-09-09 Thread Erick Erickson
solrconfig.xml has this line: solrpingquery all so you'll actually be querying against whatever your default field (defined in schema.xml) is; it defaults to "text". I claim you could fix this by making "solrpingquery" above into valid_field:solrpingqu

Re: RuntimeException during indexing: how to write document id to log?

2012-09-09 Thread Erick Erickson
Yep, this is a long-standing issue with several sub-parts, see: https://issues.apache.org/jira/browse/SOLR-3382 Best Erick On Sun, Sep 9, 2012 at 5:04 AM, Patrick Oliver Glauner wrote: > Hello > > We use Solr 3.1 and Jetty. I enabled logging in Jetty as described here: > http://wiki.apache.org

Re: Fail to huge collection extraction

2012-09-09 Thread Erick Erickson
Alexandre: I'll buy you a beer sometime, it's just sooo pleasant when someone else has the same worldview I do http://searchhub.org/dev/2011/11/03/stop-being-so-agreeable/ neosky: Particularly look at the paragraph that has "the XY problem" in it. Best Erick On Sun, Sep 9, 2012 at 8:56 AM,

Re: StopFilterFactory attribute format in schema.xml

2012-09-09 Thread Jack Krupansky
The stop words text file is usually a simple, unformatted text file, one word per line, but for some languages the list is formatted differently, the "snowball" format in particular. See SOLR-1860 for more details. https://issues.apache.org/jira/browse/SOLR-1860 In the patch, the stop-snowball
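
To make the difference concrete, here is a small sketch that reads both layouts. It assumes the usual conventions: in the plain layout each line holds one word and `#` starts a comment line; in the snowball layout a line may hold several whitespace-separated words and `|` starts a comment. The function name and the `fmt` labels are choices made for this sketch, and this is not the code Solr itself uses.

```python
def parse_stopwords(text, fmt="wordset"):
    """Parse stop words from `text` in either the plain or the snowball layout."""
    words = set()
    for line in text.splitlines():
        if fmt == "snowball":
            # Everything after '|' is a comment; the rest may hold several words.
            line = line.split("|", 1)[0]
            words.update(line.split())
        else:
            # Plain layout: one word per line, '#' lines are comments.
            line = line.strip()
            if line and not line.startswith("#"):
                words.add(line)
    return words


if __name__ == "__main__":
    snowball_sample = "aber | but\nalle allem\n | just a comment\n"
    plain_sample = "# comment\nthe\n\nand\n"
    print(sorted(parse_stopwords(snowball_sample, fmt="snowball")))
    print(sorted(parse_stopwords(plain_sample)))
```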

Re: SolrCloud vs SolrReplication

2012-09-09 Thread Erick Erickson
No, it doesn't mean that at all. Consider the situation where you bring up a new replica and it's added to slice/shard X and that the index contains 60M documents so far. You don't want to re-index all those documents from the transaction log even if they are all _in_ the transaction log (which I d

StopFilterFactory attribute format in schema.xml

2012-09-09 Thread sysrq
Hi, what is the effect of the format attribute for StopFilterFactory? E.g. format="snowball"? Solr ships with a schema.xml with a lot of good examples. The file is in example/solr/conf/schema.xml and defines a for German text:

Re: Undefined field text

2012-09-09 Thread Alexandre Rafalovitch
Most probably something about the default field being "text" for searching. Also, I have a vague feeling that the default field is somehow defined in both schema.xml and solrconfig.xml. Try changing it to something real in all those places. Regards, Alex. Personal blog: http://blog.outerthoughts.com/ L

Undefined field text

2012-09-09 Thread Spadez
Hi, I'm trying to do some work on my server but running into problems. When I try to ping the server through the admin panel I get this error, which I believe might be causing the problem: Can anyone give me a bit of guidance as to what might be going wrong? I'm using Solr 3.6. I think it may b

Re: Fail to huge collection extraction

2012-09-09 Thread Alexandre Rafalovitch
I am sorry, but your customer is extremely unlikely to want the whole result in his browser. It is just a red flag that they are converting their (business) requirements into your (IT) language and that's what they end up with. Go the other way, ask them to pretend that you've done it already and

Re: Fail to huge collection extraction

2012-09-09 Thread neosky
Thanks Alex! Yes, you hit my key points. Actually I have to implement both of the requirements. The first one works very well for the reason you state. Now I have a website client which shows 20 records per page. It is fast. However, my customer also wants to use a servlet to download the whole query se
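
As a rough illustration of the paging side, a client can walk a result set in fixed-size windows using Solr's `start` and `rows` request parameters. The sketch below only computes the parameter pairs; the function name and the totals are assumptions, and it says nothing about whether pulling a very large result set this way is a good idea.

```python
def page_params(total, rows=20):
    """Yield the start/rows parameters needed to cover `total` hits page by page."""
    for start in range(0, total, rows):
        # The last page may return fewer than `rows` documents.
        yield {"start": start, "rows": rows}


if __name__ == "__main__":
    # 45 hits at 20 per page need three requests.
    for params in page_params(45):
        print(params)
```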

Re: Cloud terminology clarification

2012-09-09 Thread Darren Govoni
I agree it needs updating and I've always gotten confused at some point by the use (misuse) of terms. For example, the term 'node' is thrown around a lot too. What is it??! Hehe. On Sat, 2012-09-08 at 22:26 -0700, JesseBuesking wrote: > It's been a while since the terminology at > http://wiki.ap