Re: Solr + Groovy

2013-06-03 Thread Michael Sokolov
On 6/3/13 3:07 AM, Achim Domma wrote: Hi, I have some query building and result processing code, which is currently running as "normal" Solr client outside of Solr. I think it would make a lot of sense to move parts of this code into a custom SearchHandler or SearchComponent. Because I'm not a

Re: How can a Tokenizer be CoreAware?

2013-06-03 Thread Michael Sokolov
Benson, I think the idea is that Tokenizers are created as needed (from the TokenizerFactory), while those other objects are singular (one created for each corresponding stanza in solrconfig.xml). So Tokenizers should be short-lived; they'll be cleaned up after each use, and the assumption is

Re: solcloud on tomcat - the 8983 port problem

2013-05-31 Thread Michael Della Bitta
In solr.xml, hostPort should be set to 80. Zookeeper tracks the port so the various nodes can find one another as long as you set it correctly in that file. On May 31, 2013 8:38 PM, "cleardot" wrote: > I know this issue has been answered somewhere. > > I've got SolrCloud running under Tomcat 6, t

Re: Reindexing strategy

2013-05-30 Thread Michael Sokolov
On 5/30/2013 8:30 AM, Dotan Cohen wrote: On Wed, May 29, 2013 at 5:37 PM, Shawn Heisey wrote: It's impossible for us to give you hard numbers. You'll have to experiment to know how fast you can reindex without killing your servers. A basic tenet for such experimentation, and something you hop

Re: HttpClient version

2013-05-20 Thread Michael Della Bitta
rticular, we're running 4.2.1 and we don't have the ability to call updateAliases on the ZkStateReader to get around this bug: https://issues.apache.org/jira/browse/SOLR-4664 We've managed to get by so far, however. Michael Della Bitta -

Re: indexing unrelated tables in single core

2013-05-16 Thread Michael Della Bitta
True, it's complaining that your Solr schema has a required field 'title' and your query and data import config aren't providing it. On May 16, 2013 5:51 AM, "Rohan Thakur" wrote: > its saying in the logs that missing required field title which is no where > in the database... > > > On Thu, May 1

TIKA 1.3

2013-05-14 Thread Strucken, Michael
rg/1999/xhtml";> It does not depend on a specific exe file. Any comments and feedback would be gratefully appreciated! Regards, Michael Strucken

Re: Solritas truncates content

2013-05-14 Thread Michael Schmitz
Thank you for the pointer, this helps me understand what is going on. I took out the highlighting branch. It turns out there is truncation logic somewhere down there, because when I did, I had the full document text. Peace. Michael On Mon, May 13, 2013 at 6:45 PM, Erik Hatcher wrote

Solritas truncates content

2013-05-13 Thread Michael Schmitz
ield('content') Any help is greatly appreciated! Peace. Michael

Re: SOLR guidance required

2013-05-13 Thread Michael Della Bitta
Best advice in this thread. :) Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com The science of influence marketing. On Mon, May 13, 2013 at 1:29 PM, Lance Norskog wrote: > If this is

Re: Looking to see if solrj 3.5 could be used with solr server 4.2.1

2013-05-12 Thread Michael Sokolov
On 5/11/2013 11:36 PM, Lee, Peter wrote: If you have any information regarding whether or not this might work (as in "yeah, we did that and it worked okay"...or..."no, that won't work because protocol XYZ changed between versions and ") I would appreciate it. As stated above, simple cases using

Re: Solr 4.x/3.x update javabin incompatibility?

2013-05-11 Thread Michael Sokolov
On 5/11/2013 11:31 AM, Michael Sokolov wrote: On 5/11/2013 11:14 AM, Steve Rowe wrote: On May 11, 2013 7:27 AM, "Michael Sokolov" wrote: If somebody grants me access to the wiki, I'd be happy to write something there to let people know about this issue. What's your wiki

Re: Solr 4.x/3.x update javabin incompatibility?

2013-05-11 Thread Michael Sokolov
On 5/11/2013 11:14 AM, Steve Rowe wrote: On May 11, 2013 7:27 AM, "Michael Sokolov" wrote: If somebody grants me access to the wiki, I'd be happy to write something there to let people know about this issue. What's your wiki username? sokolov

Re: Solr 4.x/3.x update javabin incompatibility?

2013-05-11 Thread Michael Sokolov
On 5/10/2013 11:39 PM, Shawn Heisey wrote: On 5/10/2013 8:56 PM, Michael Sokolov wrote: On 5/10/2013 10:18 PM, Shawn Heisey wrote: I don't know why I'm not having any trouble. I'm certainly glad that I'm not, though! Thanks, Shawn Shawn, one question - in your serv

Re: Solr 4.x/3.x update javabin incompatibility?

2013-05-10 Thread Michael Sokolov
On 5/10/2013 10:18 PM, Shawn Heisey wrote: I don't know why I'm not having any trouble. I'm certainly glad that I'm not, though! Thanks, Shawn Shawn, one question - in your server setup do you have: _querySolr.setRequestWriter(new BinaryRequestWriter()); ? I didn't see that - it (used to be)

Re: Solr 4.x/3.x update javabin incompatibility?

2013-05-10 Thread Michael Sokolov
On 5/10/2013 10:18 PM, Shawn Heisey wrote: On 5/10/2013 7:42 PM, Michael Sokolov wrote: My question is: is this intentional? It's unfortunate that we don't seem to be able to update the client and have it continue to work with (ie send updates to) the old servers. We have a centrali

Re: Solr 4.x/3.x update javabin incompatibility?

2013-05-10 Thread Michael Sokolov
- is there any way to do it without patching SolrJ? On 5/10/2013 9:42 PM, Michael Sokolov wrote: I upgraded one of my solrj clients to 4.2.0, and am testing using it with a 3.4 server. We generally use a BinaryRequestWriter (ie javabin). With the 3.4 solrj client, this caused update reque

Solr 4.x/3.x update javabin incompatibility?

2013-05-10 Thread Michael Sokolov
I upgraded one of my solrj clients to 4.2.0, and am testing using it with a 3.4 server. We generally use a BinaryRequestWriter (ie javabin). With the 3.4 solrj client, this caused update requests to be directed to /update/javabin. However, in 4.2, the dispatch seems to be getting handled usi

Re: Install Solr on EC2

2013-05-10 Thread Michael Della Bitta
I can update this page if someone adds me as a contributor: MichaelDellaBitta. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com The science of influence marketing. On Fri, May 10, 2013 at

Re: Oracle Timestamp in SOLR

2013-05-08 Thread Michael Della Bitta
Peter, Looks like you can call timestampValue() on that object and get back a java.sql.Timestamp, which is a subclass of java.util.Date: http://docs.oracle.com/cd/E16338_01/appdev.112/e13995/oracle/sql/TIMESTAMP.html#timestampValue__ Hope that helps, Michael Della Bitta

Re: Why is SolrCloud doing a full copy of the index?

2013-05-06 Thread Michael Della Bitta
Hi Shawn, Thanks a lot for this entry! I'm wondering, when you say "Garbage collections that happen more often than ten or so times per minute may be an indication that the heap size is too small," do you mean *any* collections, or just full collections? Mich

Re: iterate through each document in Solr

2013-05-05 Thread Michael Sokolov
On 5/5/13 7:48 PM, Mingfeng Yang wrote: Dear Solr Users, Does anyone know what is the best way to iterate through each document in a Solr index with billion entries? I tried to use select?q=*:*&start=xx&rows=500 to get 500 docs each time and then change start value, but it got very slow after

Re: zookeeper errors

2013-05-05 Thread Michael Della Bitta
Mark, I'm definitely using CMS, so I'll look into the zk client timeout. Thanks! Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Sun, May 5, 20

zookeeper errors

2013-05-05 Thread Michael Della Bitta
t be able to provide, Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game

Re: SolrJ / Solr Two Phase Commit

2013-05-02 Thread Michael Della Bitta
Peer core or collection, depending on whether we're talking about Cloud or not. Basically, commits in Solr are about controlling visibility more than anything, although now with Cloud, they have resource consumption and lifecycle ramifications as well. On May 2, 2013 10:01 PM, "mark12345" wrote:

Re: SolrJ / Solr Two Phase Commit

2013-05-02 Thread Michael Della Bitta
One thing I do know is that commits in Solr are global, so there's no way to do this with concurrency. That being said, Solr doesn't tend to accept updates that would generate errors once committed in my experience. Michael Della Bitta

Re: How to Recovery Backup Snapshot at SolrCloud?

2013-05-01 Thread Michael Della Bitta
Yeah, it's a consistency problem. Copying all those files takes time, and without something with some knowledge of how Lucene works managing the atomicity of the work, you might end up with a segments file that doesn't match the segments you actually copied. Michael D

Re: Only return snippets, not content

2013-05-01 Thread Michael Della Bitta
Since you're doing this, you might want to make sure lazy field loading is on as well. Unfortunately I can't link you to the wiki because it's still down (uh oh), but it's a setting in solrconfig.xml. Michael Della Bitta Appinio

Re: How to recover from "Error opening new searcher" when machine crashed while indexing

2013-05-01 Thread Michael McCandless
Alas I think CheckIndex can't do much here: there is no segments file, so you'll have to reindex from scratch. Just to check: did you ever call commit while building the index before the machine crashed? Mike McCandless http://blog.mikemccandless.com On Tue, Apr 30, 2013 at 8:17 PM, Otis Gos

Re: Master - Slave File Sizes are not Same even after "command=abortfetch"

2013-04-30 Thread Michael Della Bitta
I could be getting this wrong, and the wiki is down at the moment, but I think a replica can be a leader, whereas a follower is definitely not. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271

Re: Master - Slave File Sizes are not Same even after "command=abortfetch"

2013-04-30 Thread Michael Della Bitta
I'd say a follower is a participant in a shard that's not the leader. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 30, 2013 at 1:27

Re: How to Recovery Backup Snapshot at SolrCloud?

2013-04-30 Thread Michael Della Bitta
more about this want to chime in? Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 30, 2013 at 11:03 AM, Furkan KAMACI wrote: > Should I stop t

Re: Master - Slave File Sizes are not Same even after "command=abortfetch"

2013-04-30 Thread Michael Della Bitta
ioned already. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 30, 2013 at 11:04 AM, Furkan KAMACI wrote: > I use Solr 4.2.1 as SolrCloud >

Re: Master - Slave File Sizes are not Same even after "command=abortfetch"

2013-04-30 Thread Michael Della Bitta
I'm a little confused. Are you using Solr Cloud, or ordinary replication? Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 30, 2013 at 10:

Re: How to Recovery Backup Snapshot at SolrCloud?

2013-04-30 Thread Michael Della Bitta
That directory is the data directory for the core... you'd just swap it in. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 30, 2013 at 8:

Re: Master - Slave File Sizes are not Same even after "command=abortfetch"

2013-04-30 Thread Michael Della Bitta
In Solr Cloud, commits can happen at different times across replicas. Which means merges also may happen at different times. So there's no expectation of the cores of different replicas being totally similar. Michael Della Bitta Appinions 18

Re: Bloom filters and optimized vs. unoptimized indices

2013-04-30 Thread Michael McCandless
Be sure to test the bloom postings format on your own use case ... in my tests (heavy PK lookups) it was slower. But to answer your question: I would expect a single segment index to have much faster PK lookups than a multi-segment one, with and without the bloom postings format, but bloom may mak

Re: createNodeSet

2013-04-29 Thread Michael Della Bitta
ng, and what servlet container? Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Mon, Apr 29, 2013 at 9:20 AM, Arkadi Colson wrote: > I found this in

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Michael Della Bitta
ouble, I bet it would also help performance. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Mon, Apr 29, 2013 at 9:06 AM, Dmitry Kan wrote: > Michael, &

Re: Solr performance issues for simple query - q=*:* with start and rows

2013-04-29 Thread Michael Della Bitta
We've found that you can do a lot for yourself by using a filter query to page through your data if it has a natural range to do so instead of start and rows. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017
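A minimal sketch of the range-based paging idea described in this message, assuming the index has a numeric field with a natural range. The field name `doc_id`, the window size, and the helper names are hypothetical and not taken from the thread; only the `q`, `fq`, and `rows` parameters are standard Solr query parameters.

```python
from urllib.parse import urlencode


def range_windows(lo, hi, size):
    """Yield inclusive (start, end) windows that cover lo..hi without overlap."""
    start = lo
    while start <= hi:
        end = min(start + size - 1, hi)
        yield start, end
        start = end + 1


def window_params(field, start, end, rows):
    """Build the query string for one window.

    The filter query restricts the result set to the window, so `start`
    paging inside Solr stays at 0 and never has to skip deep into results.
    `field` is a hypothetical numeric field chosen for illustration.
    """
    return urlencode({
        "q": "*:*",
        "fq": f"{field}:[{start} TO {end}]",
        "rows": rows,
    })


if __name__ == "__main__":
    for start, end in range_windows(0, 1199, 500):
        print(window_params("doc_id", start, end, rows=500))
```

Each window is an independent, cheap query, which is why this tends to behave better than increasing `start` on a single `q=*:*` query. If the field is not unique per document, `rows` would need to be large enough for the densest window.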

Re: how to get & display Jessionid with solr results

2013-04-25 Thread Michael Della Bitta
here: http://wiki.eclipse.org/Jetty/Howto/SessionIds Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Wed, Apr 24, 2013 at 6:36 PM, gpssolr2020 wrote: > H

Re: Deletes and inserts

2013-04-25 Thread Michael Della Bitta
We've successfully reused the same config in Zookeeper across multiple collections and using aliases. Could you describe your problem? What does the error say? Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Michael Della Bitta
Today I learned there's a clear command in the command line util. :) Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Thu, Apr 25, 2013 at 12:00 PM,

Re: How to Clean Zookeeper Data for Solr

2013-04-25 Thread Michael Della Bitta
les and folders: clusterstate.json aliases.json live_nodes overseer overseer_elect collections If you use a chroot jail, make it again with "create /solr_chroot_dir []" 4. Use Solr's zkCli to upload your configs again. 5. Start all your Solr nodes. 6. Create your collectio

Re: full-import takes 4 days(48 hours) to complete where main db table size 700k only

2013-04-24 Thread Michael Della Bitta
arrive. If you can remove any of those entities in favor of joins, you'll be doing yourself a favor. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Gam

Re: Solr indeing Partially working

2013-04-24 Thread Michael Della Bitta
"solrservice.php" and the text of that error both sound like parts of Typo3... they're definitely not part of Solr. You should ask on a list devoted to Typo3 to figure out what to do in this situation. It likely won't involve reconfiguring Solr.

Re: Deletes and inserts

2013-04-24 Thread Michael Della Bitta
collection to the alias at creation time, but then it becomes read-only. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Wed, Apr 24, 2013 at 12:26 PM, Jon Stra

Re: How to let Solr load libs from within my JAR?

2013-04-24 Thread Michael Della Bitta
Maven Assembly or Shade plugins. But usually moving a few extra JARs isn't too difficult. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 23, 2013

Re: Update on shards

2013-04-23 Thread Michael Della Bitta
I believe as of 4.2 you can talk to any host in the cloud. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Apr 23, 2013 at 10:45 AM, Arkadi Colson

RE: Stats facet on int/tint fields

2013-04-22 Thread Michael Ryan
Sounds like this could be https://issues.apache.org/jira/browse/SOLR-2976. -Michael -Original Message- From: vinothkumar raman [mailto:vinothkr.k...@gmail.com] Sent: Monday, April 22, 2013 5:54 AM To: solr-user@lucene.apache.org; solr-...@lucene.apache.org Subject: Stats facet on int

Re: Updating clusterstate from the zookeeper

2013-04-19 Thread Michael Della Bitta
I would like to know the answer to this as well. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Thu, Apr 18, 2013 at 8:15 PM, Manuel Le Normand wrote

Re: Document adds, deletes, and commits ... a question about visibility.

2013-04-15 Thread Michael McCandless
At the Lucene level, you don't have to commit before doing the deleteByQuery, i.e. 'a' will be correctly deleted without any intervening commit. Mike McCandless http://blog.mikemccandless.com On Mon, Apr 15, 2013 at 3:57 PM, Shawn Heisey wrote: > Simple question first: Is there anything in Solr

Re: Basic auth on SolrCloud /admin/* calls

2013-04-11 Thread Michael Della Bitta
ed setups don't play well with auth because there's no good way to configure them to use it. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Thu

RE: Approximately needed RAM for 5000 query/second at a Solr machine?

2013-04-11 Thread Michael Ryan
I'm not sure if the current merge policy implementations still have this issue. -Michael -Original Message- From: Furkan KAMACI [mailto:furkankam...@gmail.com] Sent: Thursday, April 11, 2013 2:44 AM To: solr-user@lucene.apache.org Subject: Re: Approximately needed RAM for 5000 query/s

RE: NPE when faceting TEXTfield in a distributed search query

2013-04-10 Thread Michael Ryan
you able to see the query in the shard 6 log for the working facet.limit=5000&facet.offset=0 query? -Michael -Original Message- From: coolpriya5 [mailto:coolpri...@gmail.com] Sent: Wednesday, April 10, 2013 9:16 PM To: solr-user@lucene.apache.org Subject: RE: NPE when faceting TEXTf

RE: NPE when faceting TEXTfield in a distributed search query

2013-04-10 Thread Michael Ryan
n NPE. Are you using Jetty, Tomcat, or something else? -Michael -Original Message- From: coolpriya5 [mailto:coolpri...@gmail.com] Sent: Wednesday, April 10, 2013 7:17 PM To: solr-user@lucene.apache.org Subject: Re: NPE when faceting TEXTfield in a distributed search query Solr Version is 3.

Re: Number of segments

2013-04-09 Thread Michael Long
My main concern was just making sure we were getting the best search performance, and that we did not have too many segments. Every attempt I made to adjust the segment count resulted in no difference (segment count never changed). Looking at that blog page, it looks like 30-40 segments is prob

Re: Best practice for rebuild index in SolrCloud

2013-04-09 Thread Michael Della Bitta
delete the old collection when you feel good about the new one. Obviously this means that none of your clients should point at the collection directly, but rather one of the aliases depending on whether they're reading or writing. HTH,

Re: Latency Comparison between cloud hosting Vs Dedicated hosting

2013-04-09 Thread Michael Della Bitta
On Tue, Apr 9, 2013 at 3:33 AM, Sujatha Arun wrote: > Would a bigger instance improve latency? Yes, and prewarming caches would help, too. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.

stupid collection tricks

2013-04-08 Thread Michael Della Bitta
to remove them. Is there an easy way to do this? Do I need to manually edit clusterstate.json somehow? I should mention that everything's running 4.2.1 now. Thanks, Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New

Re: Number of segments

2013-04-08 Thread Michael Long
On 04/08/2013 09:41 AM, Upayavira wrote: How many documents do you have? How big are the files on disk? 2,795,601 and the index dir is 50G Note it says "segments per tier", you may have multiple tiers at play meaning you can have more than ten segments. How do I determine how many tiers it h

Number of segments

2013-04-08 Thread Michael Long
I'm running solr 4.0. I'm noticing my segments are staying in the 30+ range, even though I have these settings: 10 10 10 false Can anyone give me some advice on what I should change or check?

Re: SolrCloud not distributing documents across shards

2013-04-04 Thread Michael Della Bitta
Thank you for all your hard work! Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Wed, Apr 3, 2013 at 6:08 PM, Mark Miller wrote: > > On Apr 3, 2013

Re: SolrCloud not distributing documents across shards

2013-04-03 Thread Michael Della Bitta
orks and stick with it. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Wed, Apr 3, 2013 at 5:01 PM, vsilgalis wrote: > Michael Della Bitta-2 wrote >>

Re: SolrCloud not distributing documents across shards

2013-04-03 Thread Michael Della Bitta
this, but YMMV: curl "http://localhost:8080/solr/admin/collections?action=CREATE&name=$name&numShards=$num&collection.configName=$config-name" Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-627
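The curl call quoted in this message can also be assembled programmatically. The following is a sketch only: the parameter names (`action`, `name`, `numShards`, `collection.configName`) and the base URL come from the message itself, while the function name and the example values `books` and `books-conf` are made up for illustration.

```python
from urllib.parse import urlencode


def create_collection_url(base_url, name, num_shards, config_name):
    """Return a Collections API CREATE URL like the one in the curl example.

    base_url is the Solr webapp root, e.g. http://localhost:8080/solr.
    """
    params = {
        "action": "CREATE",
        "name": name,
        "numShards": num_shards,
        "collection.configName": config_name,
    }
    return f"{base_url}/admin/collections?{urlencode(params)}"


if __name__ == "__main__":
    # Hypothetical collection and config names.
    print(create_collection_url("http://localhost:8080/solr", "books", 2, "books-conf"))
```

Building the query string with `urlencode` avoids the shell-quoting problems that `&` and `$` cause when the URL is typed directly on a command line.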

Re: SolrCloud not distributing documents across shards

2013-04-03 Thread Michael Della Bitta
think it should. Go into the cloud GUI and look at clusterstate.json in the Tree tab. You should see the routing algorithm it's using in that file. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appini

Re: SolrCloud not distributing documents across shards

2013-04-03 Thread Michael Della Bitta
Hello Vytenis, What exactly do you mean by "aren't distributing across the shards"? Do you mean that POSTs against the server for shard 1 never end up resulting in documents saved in shard 2? Michael Della Bitta Appinions 18 East 4

Re: Solr ZooKeeper ensemble with HBase

2013-04-03 Thread Michael Della Bitta
ng the SSD instances, the disks are slow. Might not be a problem for you. Also things like faceting and sorting can heavily hit the CPU. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influ

Re: Solr ZooKeeper ensemble with HBase

2013-04-03 Thread Michael Della Bitta
Hello, Amit: My guess is that, if HBase is working hard, you're going to have more trouble with HBase and Solr on the same nodes than HBase and Solr sharing a Zookeeper. Solr's usage of Zookeeper is very minimal. Michael Della Bitta Ap

Re: Loadtesting solr/tomcat7 and tomcat stops responding entirely

2013-03-26 Thread Michael Della Bitta
r.xml: These instructions assume you're running Tomcat 6 or 7. Here's some documentation: http://tomcat.apache.org/tomcat-7.0-doc/apr.html http://tomcat.apache.org/tomcat-7.0-doc/config/http.html Michael Della Bitta Appinions 18 East 4

RE: Slow performance on distributed search

2013-03-26 Thread Michael Ryan
the merging time on the controller that is the slow point, I would think that could also be improved. Sorting 4 million values really shouldn't take that long... -Michael -Original Message- From: qungg [mailto:qzheng1...@gmail.com] Sent: Tuesday, March 26, 2013 2:55 PM To: s

Re: Slow performance on distributed search

2013-03-26 Thread Michael Della Bitta
We've been able to speed up deep paging through big sets by using a filter query to segment them as well as start/rows paging. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Infl

Re: Loadtesting solr/tomcat7 and tomcat stops responding entirely

2013-03-26 Thread Michael Della Bitta
Nate, We just cleared up a problem similar to this by ditching Elastic Load Balancer and switching over to the APR connector in Tomcat. Are you using either of those? Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017

RE: Slow performance on distributed search

2013-03-26 Thread Michael Ryan
What are the values of the start and rows parameters you are using? When you say the controller shard takes a long time, how long is it taking - 100ms, 1s, 10s...? -Michael -Original Message- From: qungg [mailto:qzheng1...@gmail.com] Sent: Tuesday, March 26, 2013 11:17 AM To: solr

Re: Tlog File not removed after hard commit

2013-03-25 Thread Michael Della Bitta
My understanding is that logs stick around for a while just in case they can be used to catch up a shard that rejoins the cluster. On Mar 24, 2013 12:03 PM, "Niran Fajemisin" wrote: > Hi all, > > We import about 1.5 million documents on a nightly basis using DIH. During > this time, we need to e

Re: SolrCloud with Zookeeper ensemble in production environment: SEVERE problems.

2013-03-20 Thread Michael Della Bitta
ios. At least is what I´ve read in the documentation, and also I asked to Mark Miller some months ago when I started dealing with Solr 4.0-BETA. I was told otherwise during Solr Boot Camp. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor

Nested queries with proximity/slop

2013-03-19 Thread Michael Ryan
e, having slop on the _query_ field would make no sense). Looked around in Jira, but couldn't find this having been reported. I guess it is just a bit esoteric... -Michael

Re: SolrCloud with Zookeeper ensemble in production environment: SEVERE problems.

2013-03-19 Thread Michael Della Bitta
ed to diagnose why you're only able to do a few documents per second first.*** Adding more threads at this point is probably not going to help. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinion

Re: Zookeeper specs

2013-03-19 Thread Michael Della Bitta
We're running 3 c1.mediums, but mostly because we had spare reservations for them. They barely break a sweat with our small clusters (7 nodes total at the moment). Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY

Re: discovery-based core enumeration with embedded solr

2013-03-16 Thread Michael Sokolov
On 3/16/2013 9:52 AM, Erick Erickson wrote: A, good catch! Coincidentally yesterday while in the midst of looking at some other JIRAs, I noticed that some pages on the Wiki said 4.2 and changed what I ran across to 4.3. I originally started the Wikis when I thought I would go fast enough to ge

Re: discovery-based core enumeration with embedded solr

2013-03-15 Thread Michael Sokolov
ut that might not be straightforward... Thanks, Erick On Wed, Mar 13, 2013 at 5:28 PM, Michael Sokolov < msoko...@safaribooksonline.com> wrote: Has the new core enumeration strategy been implemented in the CoreContainer.Initializer.initialize() code path? It doesn't seem like it has. I

Re: solr cell

2013-03-15 Thread Michael Della Bitta
Niklas, In Linux, the API for watching for filesystem changes is called inotify. You'd need to write something to listen to those events and react accordingly. Here's a brief discussion about it: http://stackoverflow.com/questions/4062806/inotify-how-to-use-it-linux Michael D

Re: Meaning of "Current" in Solr Cloud Statistics

2013-03-14 Thread Michael Della Bitta
Stefan, Thanks a lot! Makes sense. So I don't have to worry about my leader thinking it's out of date, then. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Is

Meaning of "Current" in Solr Cloud Statistics

2013-03-14 Thread Michael Della Bitta
Hi everyone, Is there an official definition of the "Current" flag under Core > Home > Statistics? What would it mean if a shard leader is not "Current"? Thanks, Michael Della Bitta Appinions 18 East 41st Street, 2n

discovery-based core enumeration with embedded solr

2013-03-13 Thread Michael Sokolov
rg.apache.solr.core.SolrConfig.(SolrConfig.java:117) at org.apache.solr.core.CoreContainer.createFromLocal(CoreContainer.java:989) ... 11 more even though I have a solr.properties file in solr-multi (which is my solr.home), and core.properties in some subdirectories of that -- Michael Sokolov

Re: Is Lucene's DrillSideways something suitable for Solr?

2013-03-13 Thread Michael McCandless
On Tue, Mar 12, 2013 at 11:24 PM, Yonik Seeley wrote: > On Tue, Mar 12, 2013 at 10:27 PM, Alexandre Rafalovitch > wrote: >> Lucene seems to get a new DrillSideways functionality on top of its own >> facet implementation. >> >> I would love to have something like that in Solr > > Solr has had mult

Re: PDF keyword searches not accurate

2013-03-12 Thread Michael Della Bitta
You could also use the analysis handler to see if your field definition strips numeric input. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game On Tue, Mar 12

Re: xml output question

2013-03-12 Thread Michael Della Bitta
Hi Bart, You've linked to the page that explains how to use Saxon to run XSLT over the output. So the answer is yes? I'm having trouble understanding what your real question is. Thanks, Michael Della Bitta Appinions 18 East 41st S

Re: Multiple Collections in one Zookeeper

2013-03-08 Thread Michael Della Bitta
Yes, but you'll need to append a sub path on to the zookeeper path for your second cluster. For ex: zookeeper1.example.com,zookeeper2.example.com,zookeeper3.example.com/subpath On Mar 8, 2013 6:46 PM, "jimtronic" wrote: > Hi, > > I have a solrcloud cluster running several cores and pointing at o
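As a rough illustration of the connection string format shown in this message, the sketch below joins the ensemble hosts with commas and appends the sub path once, after the last host. The function name is hypothetical; the host names and `/subpath` are the examples used in the message.

```python
def zk_host(hosts, chroot=""):
    """Join ZooKeeper hosts and append an optional sub path (chroot).

    The sub path is appended once, after the final host, not per host.
    """
    joined = ",".join(hosts)
    if chroot and not chroot.startswith("/"):
        chroot = "/" + chroot
    return joined + chroot


if __name__ == "__main__":
    ensemble = [
        "zookeeper1.example.com",
        "zookeeper2.example.com",
        "zookeeper3.example.com",
    ]
    print(zk_host(ensemble, "subpath"))
```

Giving each cluster its own sub path keeps their state (cluster state, aliases, configs) in separate trees of the same ensemble.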

Re: Master /Slave Set up on AWS - 3.6.1

2013-03-05 Thread Michael Della Bitta
you need to service queries. They're very likely not the same, in which case, that may afford you the ability to have 2 slaves and 1 master in a similar budget. Also, once you've settled on an infrastructure, you should investigate buying reserved instances for a year. It will great

Re: Having an issue where atomic updates are treated as new docs running in solrcloud on 4.1

2013-03-04 Thread Michael Della Bitta
I personally don't know of one other than starting over with a new collection, but I'd love to be proven wrong, because I'm actually in the same boat as you! On Mar 4, 2013 6:09 PM, "mike st. john" wrote: > Hi michael, > > ah, thats seems to be the issue, its

Re: Having an issue where atomic updates are treated as new docs running in solrcloud on 4.1

2013-03-04 Thread Michael Della Bitta
Hi Mike, Are you sure you're sending it to the collection URL as opposed to one of the shard URLs? If you go to the Cloud tab, click on Tree, and then click on clusterstate.json, what is the value for "router" for that collection? Michael D

Re: Poll: SolrCloud vs. Master-Slave usage

2013-03-01 Thread Michael Della Bitta
n for visibility. You could conceivably get NRT on a single node without Solr Cloud, but there would be no redundancy. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game O

Re: Get page number of searchresult of a pdf in solr

2013-02-28 Thread Michael Della Bitta
My guess is the best way to do this is to index each page separately and to store a link to the PDF/page in each document. That would probably require you to preprocess the PDFs to turn each one into a single page per PDF, or to extract the text per page another way. Michael Della Bitta
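One way to picture the page-per-document approach suggested here is sketched below. It assumes the text of each page has already been extracted by some other tool; the field names (`id`, `source_pdf`, `page`, `content`) and the `#page=N` id convention are hypothetical and would have to match whatever the actual schema defines.

```python
def page_documents(pdf_name, page_texts):
    """Turn a list of per-page texts into one Solr-style document per page.

    Each document records which PDF and which 1-based page it came from,
    so a search hit can be mapped straight back to a page number.
    """
    return [
        {
            "id": f"{pdf_name}#page={number}",
            "source_pdf": pdf_name,
            "page": number,
            "content": text,
        }
        for number, text in enumerate(page_texts, start=1)
    ]


if __name__ == "__main__":
    # Placeholder page texts for illustration only.
    for doc in page_documents("manual.pdf", ["first page text", "second page text"]):
        print(doc["id"], doc["page"])
```

With this layout, the page number of a hit is simply the `page` field of the matching document, at the cost of indexing more, smaller documents.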

Re: SolrCloud as my primary data store

2013-02-28 Thread Michael Sokolov
On 02/21/2013 12:02 AM, jimtronic wrote: Now that I've been running Solr Cloud for a couple months and gotten comfortable with it, I think it's time to revisit this subject. ... I'd really like to hear from someone who has made the leap. Cheers, Jim We use Solr as our primary storage

Re: Solr3.5 Vs Solr4.1 - Help please

2013-02-28 Thread Michael Della Bitta
expect to see some CPUs pegged, but it doesn't look that way to us No iowait (once the index is in cache), less than 50% CPU usage. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Whe

Re: Solr 4.1 Master -> Slave Replication Issues and Solr 4.2

2013-02-27 Thread Michael Della Bitta
Hello Mark, Thanks for the heads up. Are there any temporary fixes to this problem aside from waiting for 4.2? Thanks, Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence

Re: Multicore Master - Slave - solr 3.6.1

2013-02-27 Thread Michael Della Bitta
e while the server is running... Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com Where Influence Isn’t a Game

Re: Dropping slow queries

2013-02-27 Thread Michael Della Bitta
I'm pretty sure David's answer is correct. When this happens, you'll see a "partialResults" field in the response. Michael Della Bitta Appinions 18 East 41st Street, 2nd Floor New York, NY 10017-6271 www.appinions.com
