Re: ${solr.abortOnConfigurationError:false} - does it defaults to false

2009-08-26 Thread Shalin Shekhar Mangar
On Thu, Aug 27, 2009 at 1:05 AM, Ryan McKinley wrote: > > On Aug 26, 2009, at 3:33 PM, djain101 wrote: > > >> I have one quick question... >> >> If in solrconfig.xml, if it says ... >> >> >> ${solr.abortOnConfigurationError:false} >> >> does it mean defaults to false if it is not >> set >> as sy

Re: Pattern matching in Solr

2009-08-26 Thread bhaskar chandrasekar
  Hi,   In Schema.xml file,I am not able ot find splitOnCaseChange="1". I am not looking for case sensitive search. Let me know what file you are refering to?. I am looking for exact match search only Moreover for scenario 2 the KeywordTokenizerFactory and EdgeNGramFilterFactory refers which link

Re: Problem using replication in 8/25/09 nightly build of 1.4

2009-08-26 Thread Shalin Shekhar Mangar
On Wed, Aug 26, 2009 at 11:53 PM, Ron Ellis wrote: > Hi Everyone, > > When trying to utilize the new HTTP based replication built into Solr 1.4 I > encounter a problem. When I view the replication admin page on the slave > all > of the master values are null i.e. Replicatable Index Version:null,

Re: Max limit on number of cores?

2009-08-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
There is no hard limit. It is going to be decided by your h/w . You will be limited by the no:of files that can be kept open by your system. On Thu, Aug 27, 2009 at 1:06 AM, djain101 wrote: > > Hi, > > Is there any maximum limit on the number of cores one solr webapp can have > without compromisin

Re: master/slave replication issue

2009-08-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
The log messages are shown when you hit the admin page. So on't worry about that. Keep a minimal configuration of Replication. All you need is masterUrl and pollInterval. On Thu, Aug 27, 2009 at 5:52 AM, J G wrote: > > > > > > > > Hello, > > I'm having an issue getting the master to replicate it

RE: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-26 Thread Fuad Efendi
Looks like you totally ignored my previous post... > Who is vendor of this "openjdk-1.6.0.0"? Who is vendor of JVM which this JDK > runs on? >... such installs for Java are totally mess, you may have incompatible Servlet API loaded by bootstrap classloader before Tomcat classes First of al

Re: Cannot get solr 1.3.0 to run properly with plesk 9.2.1 on CentOS

2009-08-26 Thread Aaron Aberg
Hey Guys, Ok, I found this: Troubleshooting Errors It's possible that you get an error related to the following: SEVERE: Exception starting filter SolrRequestFilter java.lang.NoClassDefFoundError: Could not initialize class org.apache.solr.core.SolrConfig at org.apache.solr.servlet.Solr

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Fuad Efendi
Frankly, I never tried any DIH... probably it is the best option for this specific case (they have Java developer) - but one should be knowledgeable enough to design SOLR schema... And I noticed here (and also at HBase mailing list) many first-time users are still thinking in terms of Relational-DB

RE: Lucene Search Performance Analysis Workshop

2009-08-26 Thread Fuad Efendi
I am wondering... are new SOLR filtering features faster than standard Lucene queries like {query} AND {filter}??? Why can't we improve Lucene then? Fuad P.S. https://issues.apache.org/jira/browse/SOLR-1169 https://issues.apache.org/jira/browse/SOLR-1179 -Original Message-

Re: Total count of records

2009-08-26 Thread Avlesh Singh
> > How the total count of records are fetched in Solr , does it refer any > Schema or XML file?. > Sorry, but I did not get you. What does that mean? The total count is not stored anywhere; it is computed based on how many documents you have in your index matching the query. Cheers Avlesh On Thu

Total count of records

2009-08-26 Thread bhaskar chandrasekar
Hi,   When Solr retrives records based on a input match , it gives total count of records. Say for Ex , it displays like : 1 out of 20,000 for the particular search string.   How the total count of records are fetched in Solr , does it refer any Schema or XML file?.     Regards Bhaskar  

Re: Sorting by Unindexed Fields

2009-08-26 Thread Erik Hatcher
Solr sorts on indexed fields only, currently. And only a single value per document per sort field (careful with analyzed fields, and no multiValued fields). Unwise and impossible - of course this depends on the scale you're speaking of. How many documents? What types of fields? How sma

Fwd: Lucene Search Performance Analysis Workshop

2009-08-26 Thread Erik Hatcher
While Andrzej's talk will focus on things at the Lucene layer, I'm sure there'll be some great tips and tricks useful to Solrians too. Andrzej is one of the sharpest folks I've met, and he's also a very impressive presenter. Tune in if you can. Erik Begin forwarded message: Fr

Re: SolrJ and Solr web simultaneously?

2009-08-26 Thread Erik Hatcher
With a relational database, the approach that has been working for us and many customers is to first give DataImportHandler a go. It's powerful and fast. 3M docs should index in about an hour or less, I'd speculate. But using DIH does require making access from Solr to the DB server soli

Re: Seattle / NW Hadoop, HBase Lucene, etc. Meetup , Wed August 26th, 6:45pm

2009-08-26 Thread Bradford Stephens
Hello, My apologies, but there was a mix-up reserving our meeting location, and we don't have access to it. I'm very sorry, and beer is on me next month. Promise :) Sent from my Internets On Aug 25, 2009, at 4:21 PM, Bradford Stephens > wrote: Hey there, Apologies for this not going out

Re: Sorting by Unindexed Fields

2009-08-26 Thread Avlesh Singh
> > Is it also the case that it will not narrow by them? If "narrowing" means faceting, then again a no. Cheers Avlesh On Thu, Aug 27, 2009 at 6:36 AM, Isaac Foster wrote: > Is it also the case that it will not narrow by them? > > Isaac > > On Wed, Aug 26, 2009 at 8:59 PM, Avlesh Singh wrote:

Re: Sorting by Unindexed Fields

2009-08-26 Thread Isaac Foster
Is it also the case that it will not narrow by them? Isaac On Wed, Aug 26, 2009 at 8:59 PM, Avlesh Singh wrote: > > > > Will Solr sort by fields that are unindexed? > > > Unfortunately, No. > > Cheers > Avlesh > > On Thu, Aug 27, 2009 at 4:03 AM, Isaac Foster >wrote: > > > Hi, > > > > I have a

Re: Sorting by Unindexed Fields

2009-08-26 Thread Avlesh Singh
> > Will Solr sort by fields that are unindexed? > Unfortunately, No. Cheers Avlesh On Thu, Aug 27, 2009 at 4:03 AM, Isaac Foster wrote: > Hi, > > I have a situation where a particular kind of document can be categorized > in > different ways, and depending on the categories it is in it will hav

Re: SortableFloatFieldSource not accessible? (1.3)

2009-08-26 Thread Yonik Seeley
SortableFloatField works in function queries... it's just that everyone goes through SortableFloatField.getValueSource() to create them. Will that work for you? -Yonik http://www.lucidimagination.com On Wed, Aug 26, 2009 at 6:23 PM, Christophe Biocca wrote: > The class SortableFloatFieldSource

master/slave replication issue

2009-08-26 Thread J G
Hello, I'm having an issue getting the master to replicate its index to the slave. Below you will find my configuration settings. Here is what is happening: I can access the replication dashboard for both the slave and master and I can successfully execute HTTP commands against both of t

Manual facet sorting - possible?

2009-08-26 Thread Matthew Painter
Hi, I am attempting to perform a faceted distributed search with manual sorting of the value of a facet. Is this something which is possible through a Solr query or would I be better off inserting a manual weighting field and sort by that? To clarify - I am performing a distributed search over

Sorting by Unindexed Fields

2009-08-26 Thread Isaac Foster
Hi, I have a situation where a particular kind of document can be categorized in different ways, and depending on the categories it is in it will have different fields that describe it (in practice the number of fields will be fairly small, but whatever). These documents will each have a full-text

Re: Using Lucene's payload in Solr

2009-08-26 Thread Bill Au
While testing my code I discovered that my copyField with PatternTokenize does not do what I want. This is what I am indexing into Solr: 2.0|Solr In Action My copyField is simply: field titleRaw is of type title_raw: For my example

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Francis Yakin
Thanks for the response. I will try CommonsHttpSolrServer for now. Francis -Original Message- From: Fuad Efendi [mailto:f...@efendi.ca] Sent: Wednesday, August 26, 2009 1:34 PM To: solr-user@lucene.apache.org Subject: RE: SolrJ and Solr web simultaneously? With this configuration proba

SortableFloatFieldSource not accessible? (1.3)

2009-08-26 Thread Christophe Biocca
The class SortableFloatFieldSource cannot be accessed from outside its package. So it can't be used as part of a FunctionQuery. Is there a workaround to this, or should I roll my own? Will it be fixed in 1.4?

RE: Solr Replication

2009-08-26 Thread J G
Thanks for the response. It's interesting because when I run jconsole all I can see is one ReplicationHandler jmx mbean. It looks like it is defaulting to the first slice it finds on its path. Is there anyway to have multiple replication handlers or at least obtain replication on a per "slice"

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Francis Yakin
No, we don't want to put at the same box as Database box. Agree, that indexing/committing/merging and optimizing is the bottle neck. I think it worths to try SolrJ with CommmonsHttpSolrServer option for now and let's see what happened to load 3 millions docs. Thanks Francis -Original Mess

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Fuad Efendi
>I just worried that http will be a bottle neck, that's why I prefer JDBC connection method. - JDBC is a library for Java Application; it connects to Database; it uses proprietary protocol provided by DB vendor in most cases, and specific port number - SolrJ is a library for Java Application; it c

Re: What makes a function query count as a match or not?

2009-08-26 Thread Yonik Seeley
On Wed, Aug 26, 2009 at 11:27 AM, Christophe Biocca wrote: > I haven't been able to find what makes a function query count as a match > when used a part of a boolean query with Occur.MUST. A function query matches all non-deleted documents. -Yonik http://www.lucidimagination.com

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Fuad Efendi
With this configuration probably preferred method is to run standalone Java application on same box as DB, or very close to DB (in same network segment). HTTP is not a bottleneck; main bottleneck is indexing/committing/merging/optimizing in SOLR... Just as a sample, if you submit to SOLR batch o

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Francis Yakin
We already opened port 80 from solr to DB so that's not the issue, but httpd(port 80) is very flaky if there is firewall between Solr and DB. We have Solr master/slaves env, client access the search thru slaves( master only accept the new index from DB and slaves will pull the new indexes from

RE: Searching and Displaying Different Logical Entities

2009-08-26 Thread Fuad Efendi
>then 2) get all P's by ID, including facet counts, etc. >The problem I face with this solution is that I can have many matching P's (10,000+), so my second query will have many (10,000+) constraints. SOLR can automatically provide you P's with Counts, and it will be _unique_... Even if cardinal

Searching and Displaying Different Logical Entities

2009-08-26 Thread wojtekpia
I'm trying to figure out if Solr is the right solution for a problem I'm facing. I have 2 data entities: P(arent) & C(hild). P contains up to 100 instances of C. I need to expose an interface that searches attributes of entity C, but displays them grouped by parent entity, P. I need to include fac

Re: ${solr.abortOnConfigurationError:false} - does it defaults to false

2009-08-26 Thread Ryan McKinley
On Aug 26, 2009, at 3:33 PM, djain101 wrote: I have one quick question... If in solrconfig.xml, if it says ... ${solr.abortOnConfigurationError:false}abortOnConfigurationError> does it mean defaults to false if it is not set as system property? correct

${solr.abortOnConfigurationError:false} - does it defaults to false

2009-08-26 Thread djain101
I have one quick question... If in solrconfig.xml, if it says ... ${solr.abortOnConfigurationError:false} does it mean defaults to false if it is not set as system property? Thanks, Dharmveer -- View this message in context: http://www.nabble.com/%24%7Bsolr.abortOnConfigurationError%3Afalse

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Fuad Efendi
Do you have firewall between DB and possible SOLR-Master instance? Do you have firewall between Client application and DB? Such configuration is strange... by default firewalls allow access to port 80, try to set port 80 for SOLR-Tomcat and/or configure AJP mapping for front-end HTTPD which you mig

Re: Incremental Deletes to Index

2009-08-26 Thread Jason Rutherglen
You'll probably want to call Solr commit, however you'll want to call IW.flush underneath (via a new Solr commit flag?). Yes, the Solr caches would be somewhat useless if you're calling Solr commit/flush rapidly. See SOLR-1308 on improving caches for NRT. On Tue, Aug 25, 2009 at 7:22 PM, KaktuCha

Re: Pattern matching in Solr

2009-08-26 Thread Avlesh Singh
You could have used your previous thread itself ( http://www.lucidimagination.com/search/document/31c1ebcedd4442b/exact_pattern_search_in_solr), Bhaskar. In your scenario one, you need an exact token match, right? You are getting expected results if your field type is "text". Look for the "WordDel

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Francis Yakin
Thanks. The issue we have actually, it could be firewall issue more likely than network latency, that's why we try to avoid to use http connection. Fixing the firewall is not an option right now. We have around 3 millions docs to load from DB to Solr master( first initial load only) and subseque

Problem using replication in 8/25/09 nightly build of 1.4

2009-08-26 Thread Ron Ellis
Hi Everyone, When trying to utilize the new HTTP based replication built into Solr 1.4 I encounter a problem. When I view the replication admin page on the slave all of the master values are null i.e. Replicatable Index Version:null, Generation: null | Latest Index Version:null, Generation: null.

Re: SolrJ and Solr web simultaneously?

2009-08-26 Thread Avlesh Singh
> > Is Solr like a RDBMS in that I can have multiple programs querying and > updating the index at once, and everybody else will see the updates after a > commit, or do I have to something explicit to see others updates? > Yes, everyone gets to search on an existing index unless writes to the index

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Fuad Efendi
> I don't want or try not to use http connection from Database to Solr Master because of network latency( very slow). "network latency" does not play any role here; throughput is more important. With separate SOLR instance on a separate box, and with separate java application (SOLR-bridge) queryin

Re: SolrJ and Solr web simultaneously?

2009-08-26 Thread Smiley, David W.
See my response to Paul Tomblin. You could use the existing DataImportHandler "SqlEntityProcessor" for DB access. The DIH framework is fairly extensible. BTW, I wouldn't immediately dismiss using HTTP to give data to Solr just because you believe it will be slow without having tried it. Using

Re: SolrJ and Solr web simultaneously?

2009-08-26 Thread Smiley, David W.
You could implement a Data Import Handler "EntityProcessor". There are at least 5 implementations I can see for you to learn from that come with Solr. If Solr truly doesn't need to be up and running as a server to serve any queries, then EmbeddedSolrServer will be fine. ~ David Smiley Author

RE: SolrJ and Solr web simultaneously?

2009-08-26 Thread Francis Yakin
I have the same situation now. If I don't want to use http connection, so I need to use EmbeddedSolrServer that what I think I need correct? We have Master/slaves solr, the applications use slaves for search. The Master only taking the new index from Database and slaves will pull the new index

Pattern matching in Solr

2009-08-26 Thread bhaskar chandrasekar
Hi,   Can any one help me with the below scenario?.   Scenario 1:   Assume that I give Google as input string i am using Carrot with Solr Carrot is for front end display purpose the issue is Assuming i give "BHASKAR" as input string It should give me search results pertaining to BHASKAR only.

RE: JDWP Error

2009-08-26 Thread Fuad Efendi
JDPA/JDWP are for remote debugging of SUN JVM... It shouldn't be SOLR related... check configs of Resin... -Fuad http://www.tokenizer.org -Original Message- From: Licinio Fernández Maurelo [mailto:licinio.fernan...@gmail.com] Sent: August-26-09 12:49 PM To: solr-user@lucene.apache.org

Re: SolrJ and Solr web simultaneously?

2009-08-26 Thread Smiley, David W.
Once a commit occurs, all data added before it (by any & all clients) becomes visible to all searches henceforth. The "web interface" has direct access to Solr, and SolrJ remotely accesses that Solr. SolrEmbeddedSolrServer is something that few people should actually use. It's mostly for embe

SolrJ and Solr web simultaneously?

2009-08-26 Thread Paul Tomblin
Is Solr like a RDBMS in that I can have multiple programs querying and updating the index at once, and everybody else will see the updates after a commit, or do I have to something explicit to see others updates? Does it matter whether they're using the web interface, SolrJ with a CommonsHttpSolrS

JDWP Error

2009-08-26 Thread Licinio Fernández Maurelo
The servlet container (resin) where i deploy solr shows : ERROR: transport error 202: bind failed: Address already in use ERROR: JDWP Transport dt_socket failed to initialize, TRANSPORT_INIT(510) JDWP exit error AGENT_ERROR_TRANSPORT_INIT(197): No transports initialized [../../../src/share/back/

Re: Solr admin url for example gives 404

2009-08-26 Thread Rafał Kuć
Hello! Try running ant example and then run Solr. -- Regards, Rafał Kuć > Hello all, > When I start up Solr from the example directory using start.jar, it > seems to start up, but when I go to the localhost admin url > (http://localhost:8983/solr/admin) I get a 404 (See message appended >

Solr admin url for example gives 404

2009-08-26 Thread Burton-West, Tom
Hello all, When I start up Solr from the example directory using start.jar, it seems to start up, but when I go to the localhost admin url (http://localhost:8983/solr/admin) I get a 404 (See message appended below). Has the url for the Solr admin changed? Tom Tom Burton-West --- Here

What makes a function query count as a match or not?

2009-08-26 Thread Christophe Biocca
I haven't been able to find what makes a function query count as a match when used a part of a boolean query with Occur.MUST. A Term query is simple, if the term is not found, it doesn't count as a match. What's the equivalent for a function query? A score of zero (or less than zero, as implied by

RE: encoding problem

2009-08-26 Thread Fuad Efendi
If you are complaining about Web Application (other than SOLR) (probably behind-the Apache HTTPD) having encoding problem - try to troubleshoot it with Mozilla Firefox + Live Http Headers plugin. Look at "Content-Encoding" HTTP response headers, and don't forget about tag inside HTML... -Fuad

Re: HTML decoder is splitting tokens

2009-08-26 Thread Koji Sekiguchi
Hi Anders, Sorry, I don't know this is a bug or a feature, but I'd like to show an alternate way if you'd like. In Solr trunk, HTMLStripWhitespaceTokenizerFactory is marked as deprecated. Instead, HTMLStripCharFilterFactory and an arbitrary TokenizerFactory are encouraged to use. And I'd recomme

Reason to change the xml files in solr

2009-08-26 Thread Tamilselvi
For the installation of apache solr integration module in Drupal we need to install solr. The must do thing is we need to change the solr schema.xml and configure.xml files with the files in apache solr integration module. can any body explain the reason behind this change. -- View this mess

HTML decoder is splitting tokens

2009-08-26 Thread Anders Melchiorsen
Hi. When indexing the string "Günther" with HTMLStripWhitespaceTokenizerFactory (in analysis.jsp), I get two tokens, "Gü" and "nther". Is this a bug, or am I doing something wrong? (Using a Solr nightly from 2009-05-29) Anders.

Re: Adding cores dynamically

2009-08-26 Thread Licinio Fernández Maurelo
These are the reasons why we are thinking on splitting and index via multi-core: First of all all, we have an index of news which size is about 9G. As we will keep aggregating news forever and ever and let users do free text search on our system, we think that it will be easier for IT crowd to man

Re: solr nutch url indexing

2009-08-26 Thread Uri Boness
Do you mean the schema or the solrconfig.xml? The request handler is configured in the solrconfig.xml and you can find out more about this particular configuration in http://wiki.apache.org/solr/DisMaxRequestHandler?highlight=(CategorySolrRequestHandler)|((CategorySolrRequestHandler)). To u

Re: Create new core from existing

2009-08-26 Thread Noble Paul നോബിള്‍ नोब्ळ्
check this http://wiki.apache.org/solr/CoreAdmin when you create a core you are allowed to use the same instance dir as the old core just ensure that you give a different datadir On Wed, Aug 26, 2009 at 3:05 PM, pavan kumar donepudi wrote: > Paul, > Can you please guide me on which option i need

Re: solr 1.4: extending StatsComponent to recognize localparm {!ex}

2009-08-26 Thread Britske
Thanks for that. it works now ;-) Erik Hatcher-4 wrote: > > > On Aug 25, 2009, at 6:35 PM, Britske wrote: >> Moreover, I can't seem to find the actual code in FacetComponent or >> anywhere >> else for that matter where the {!ex}-param case is treated. I assume >> it's in >> FacetComponen

Re: encoding problem

2009-08-26 Thread Shalin Shekhar Mangar
On Wed, Aug 26, 2009 at 12:52 PM, Bernadette Houghton < bernadette.hough...@deakin.edu.au> wrote: > Thanks for your quick reply, Shalin. > > Tomcat is running on my Windows machine, but does not appear in Windows > Services (as I was expecting it should ... am I wrong?). I'm running it from > a st

Re: shingle filter

2009-08-26 Thread Shalin Shekhar Mangar
On Tue, Aug 25, 2009 at 4:24 AM, Joe Calderon wrote: > hello *, im currently faceting on a shingled field to obtain popular > phrases and its working well, however ide like to limit the number of > shingles that get created, the solr.ShingleFilterFactory supports > maxShingleSize, can it be made t

RE: encoding problem

2009-08-26 Thread Bernadette Houghton
Thanks for your quick reply, Shalin. Tomcat is running on my Windows machine, but does not appear in Windows Services (as I was expecting it should ... am I wrong?). I'm running it from a startup.bat on my desktop - see below. Do I add the Dfile line to the startup.bat? SOLR is part of the rep

Re: Exact word search

2009-08-26 Thread Shalin Shekhar Mangar
On Tue, Aug 25, 2009 at 10:40 AM, bhaskar chandrasekar wrote: > Hi, > > Can any one helpe me with the below scenario?. > > Scenario 1: > > Assume that I give Google as input string > i am using Carrot with Solr > Carrot is for front end display purpose It seems like Carrot is the one making the

Re: encoding problem

2009-08-26 Thread Shalin Shekhar Mangar
On Wed, Aug 26, 2009 at 12:42 PM, Bernadette Houghton < bernadette.hough...@deakin.edu.au> wrote: > Hi Shalin, stupid question - I'm an apache/solr newbie - but how do I > access the JVM??? > When you execute the java executable, just add -Dfile.encoding=UTF-8 as a command line argument to the ex

RE: encoding problem

2009-08-26 Thread Bernadette Houghton
Hi Shalin, stupid question - I'm an apache/solr newbie - but how do I access the JVM??? Regards Bern -Original Message- From: Shalin Shekhar Mangar [mailto:shalinman...@gmail.com] Sent: Wednesday, 26 August 2009 5:10 PM To: solr-user@lucene.apache.org Subject: Re: encoding problem On

Re: encoding problem

2009-08-26 Thread Shalin Shekhar Mangar
On Wed, Aug 26, 2009 at 10:24 AM, Bernadette Houghton < bernadette.hough...@deakin.edu.au> wrote: > We have an encoding problem with our solr application. That is, non-ASCII > chars displaying fine in SOLR, but in googledegook in our application . > > Our tomcat server.xml file already contains UR