Re: Solr response extremely slow

2010-02-02 Thread Vijayant Kumar
Hi Rajat, You can find the Solr version at http://localhost:8983/solr/admin/registry.jsp -- Thank you, Vijayant Kumar Software Engineer Website Toolbox Inc. http://www.websitetoolbox.com 1-800-921-7803 x211 > > Java version is -> > > java version "1.5.0_18" > Java(TM) 2 Runtime Environment,

Re: Solr response extremely slow

2010-02-02 Thread Rajat Garg
Java version is -> java version "1.5.0_18" Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_18-b02) Java HotSpot(TM) Server VM (build 1.5.0_18-b02, mixed mode) Not sure how to find the Solr version. Can you tell me how to look it up? Also, I don't have a dedicated server to run this on

Re: Solr response extremely slow

2010-02-02 Thread Shalin Shekhar Mangar
On Wed, Feb 3, 2010 at 12:24 PM, Rajat Garg wrote: > > I have solr running with 275K docs on 4GB ram vps server > > Logs look like this from info and log file -> > > 03-Feb-2010 06:43:08 org.apache.solr.core.SolrCore execute > INFO: [] webapp=/solr path=/select > > params={facet.limit=-1&wt=ruby&

Solr response extremely slow

2010-02-02 Thread Rajat Garg
I have solr running with 275K docs on 4GB ram vps server Logs look like this from info and log file -> 03-Feb-2010 06:43:08 org.apache.solr.core.SolrCore execute INFO: [] webapp=/solr path=/select params={facet.limit=-1&wt=ruby&rows=300&start=1&sort=supplier_count+desc&facet=true&facet.mincount=

Re: DataImportHandler - convertType attribute

2010-02-02 Thread Noble Paul നോബിള്‍ नोब्ळ्
Implicit conversion can cause problems when Transformers are applied. It is hard for the user to guess the type of the field by looking at schema.xml. In Solr, String is the most commonly used type. If you wish to do numeric operations on a field, convertType will cause problems. If it is explicitly

Re: DataImportHandler delta-import confusion

2010-02-02 Thread Noble Paul നോബിള്‍ नोब्ळ्
Please do not hijack a thread. http://people.apache.org/~hossman/#threadhijack On Tue, Feb 2, 2010 at 11:32 PM, Leann Pereira wrote: > Hi Paul, > > Can you take me off this distribution list? > > Thanks, > > Leann > > > From: noble.p...@gmail.com [noble.p.

C++ being filtered (please help)

2010-02-02 Thread Abin Mathew
Hi, I have a field which may take the form "C++,PHP & MySql,C#". Now I want to tokenize it on commas, white space and other word-delimiting characters only, not on the plus sign, so that the result after tokenization is C++ PHP MySql C#. But the result I am getting is c php mysql c. Please gi
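The expected output in this message can be pinned down with a small plain-Java sketch: split on commas, white space and ampersands, but leave `+` and `#` inside the tokens. This is only an illustration of the desired result, not a Solr analyzer configuration; the class name `SkillTokenizer` and the exact delimiter set are assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Solr: shows the tokens the poster expects.
final class SkillTokenizer {

    // Split on runs of commas, whitespace and '&'; '+' and '#' stay in the token.
    static List<String> tokenize(String input) {
        List<String> tokens = new ArrayList<>();
        for (String part : input.split("[,\\s&]+")) {
            if (!part.isEmpty()) {   // a leading delimiter yields one empty string
                tokens.add(part);
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("C++,PHP & MySql,C#"));
    }
}
```

Case is preserved here; the lower-cased output quoted in the message suggests a lower-casing step is also applied somewhere in the field's analysis chain.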

Search without diacritics

2010-02-02 Thread Olala
Hi all! I have a problem with Solr, and I hope everybody here can help me :) I want to search for text without diacritics, but Solr returns text both with and without diacritics. For example, if I query "solr index", it returns "solr index", "sôlr index", "sòlr index", "sólr indèx",...
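Behaviour like this usually comes from a folding step that maps accented characters to their base letters before terms are compared. The sketch below shows that step in plain Java using `java.text.Normalizer`; it is not Solr code, the class name `DiacriticFolder` is hypothetical, and whether folding should be kept or removed depends on which matches are actually wanted.

```java
import java.text.Normalizer;

// Hypothetical helper, not part of Solr: folds accented characters to base letters.
final class DiacriticFolder {

    static String fold(String text) {
        // NFD splits each accented character into a base letter plus combining marks.
        String decomposed = Normalizer.normalize(text, Normalizer.Form.NFD);
        // Remove the combining marks (Unicode category M), leaving the base letters.
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // The escapes spell the accented form of "solr index" from the message above.
        System.out.println(fold("s\u00f4lr ind\u00e8x"));
    }
}
```

With folding applied at both index and query time, all the variants listed in the message compare equal; without it, only exact forms match.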

Guidance on Solr errors

2010-02-02 Thread Vauthrin, Laurent
Hello, I'm trying to troubleshoot a problem that occurred on a few Solr slave Tomcat instances and wanted to run it by the list to see if I'm on the right track. The setup involves 1 master replicating to three slaves (I don't know what the replication interval is at this time). These inst

Re: Basic indexing question

2010-02-02 Thread Joe Calderon
See http://wiki.apache.org/solr/SchemaXml#The_Default_Search_Field for details on the default field. Most people use the dismax handler when handling queries from users; see http://wiki.apache.org/solr/DisMaxRequestHandler for more details. If you don't have many fields you can write your own query using

RE: Basic indexing question

2010-02-02 Thread Stefan Maric
Thanks for the quick reply I will have to see if the default query mechanism will suffice for most of my needs I have skimmed through most of the Solr documentation and didn't see anything describing I can easily change my DB View so that I only source Solr with a single string plus my id field (

Re: What is the version=2.2 refers in solr admin url

2010-02-02 Thread Chris Hostetter
: i could see version=2.2 in the url . : what that version refers to ?? http://wiki.apache.org/solr/XMLResponseFormat#A.27version.27 -Hoss

Re: Indexing an oracle warehouse table

2010-02-02 Thread caman
Thanks all. I am on track. Another question: what would be the right way to find out which field contains the search term? E.g., if I search for SOLR and the term exists in field788 for a document, how do I pinpoint which field has the term? I copied all the fields into a field called 'b

Re: Basic indexing question

2010-02-02 Thread Joe Calderon
By default Solr will only search the default field; you have to either query all fields explicitly, e.g. field1:(ore) OR field2:(ore) OR field3:(ore), or use a different query parser like dismax. On Tue, Feb 2, 2010 at 3:31 PM, Stefan Maric wrote: > I have got a basic configuration of Solr up and running and have
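The "query all fields" option can be sketched as a small string builder that repeats the term once per field and joins the clauses with an upper-case OR, which the standard Lucene query syntax requires for boolean operators. The class name `MultiFieldQuery` is hypothetical, the field names are the placeholders from the message, and the term is assumed to need no escaping.

```java
import java.util.Arrays;
import java.util.List;
import java.util.StringJoiner;

// Hypothetical helper: builds field1:(term) OR field2:(term) ... for a list of fields.
final class MultiFieldQuery {

    static String build(List<String> fields, String term) {
        StringJoiner clauses = new StringJoiner(" OR ");
        for (String field : fields) {
            // Assumption: the term contains no query-syntax characters to escape.
            clauses.add(field + ":(" + term + ")");
        }
        return clauses.toString();
    }

    public static void main(String[] args) {
        System.out.println(build(Arrays.asList("field1", "field2", "field3"), "ore"));
    }
}
```

This gets unwieldy as the number of fields grows, which is one reason the reply also points to dismax.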

Basic indexing question

2010-02-02 Thread Stefan Maric
I have got a basic configuration of Solr up and running and have loaded some data to experiment with When I run a query for 'ore' I get 3 results when I'm expecting 4 Dataimport is pulling the expected number of rows in from my DB view In my schema.xml I have and the defaults

Re: Indexing large Mysql table using DIH

2010-02-02 Thread Lukas Kahwe Smith
On 02.02.2010, at 22:45, Giri wrote: > Hi, > > I have a simple MySQL table with 12 million rows (7 columns), I have been > trying to use the DIH to index all the rows, I tried using the batchSize = > "25000", but DIH is failing after a while. but if I split the query with > limit and offset (limi

Indexing large Mysql table using DIH

2010-02-02 Thread Giri
Hi, I have a simple MySQL table with 12 million rows (7 columns). I have been trying to use the DIH to index all the rows. I tried using batchSize = "25000", but DIH fails after a while. But if I split the query with limit and offset (limit 100 offset 0), it is indexing the records pro
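The limit/offset workaround described here amounts to generating one windowed query per page. A minimal sketch of that idea in plain Java follows; the class name `PagedQueries`, the table name `items` and the row counts are made up for illustration, and a stable ORDER BY in the base query is assumed so that pages do not overlap.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: turns one base query into a list of LIMIT/OFFSET windows.
final class PagedQueries {

    static List<String> build(String baseQuery, long totalRows, int pageSize) {
        if (pageSize <= 0) {
            throw new IllegalArgumentException("pageSize must be positive");
        }
        List<String> queries = new ArrayList<>();
        // One query per window: offsets 0, pageSize, 2 * pageSize, ...
        for (long offset = 0; offset < totalRows; offset += pageSize) {
            queries.add(baseQuery + " LIMIT " + pageSize + " OFFSET " + offset);
        }
        return queries;
    }

    public static void main(String[] args) {
        // "items" is a placeholder table name, not taken from the original message.
        for (String q : build("SELECT id FROM items ORDER BY id", 250, 100)) {
            System.out.println(q);
        }
    }
}
```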

Re: Indexing an oracle warehouse table

2010-02-02 Thread Ron Chan
Full text search across those columns can all be done by putting all values into one text field. Are there any benefits in it being multi-valued? Multi-valued is useful for one-to-many representation, e.g. - different features of a product - dates where that product was best seller - countries i

RE: fetchindex not working.

2010-02-02 Thread Jeremy Carroll
Found out my issue. solrconfig.xml on the MASTER servers did not have this statement. startup My solrconfig.xml should have looked like this. commit optimize startup solrconfig_slave.xml:solrconfig.xml,schema.xml,stopwords.txt After adding this stat

Re: Indexing a oracle warehouse table

2010-02-02 Thread caman
Alexey, This is exactly what I was looking for. Thank you thank you thank you .. Should have read the documentation a little better. Much appreciated. Alexey-34 wrote: > >> Dont define any so that column in >> SOLR will be same as in the database table. > Correct > You can define dynamic

RE: fetchindex not working.

2010-02-02 Thread Jeremy Carroll
Here are the debugging statements. Replication is not happening from the slave to the master. I see that the slave is issuing an indexversion request, but does nothing with it. ### SLAVE CATALINA.OUT ### INFO: [core1] webapp=/solr path=/replication params={command=fetchindex&masterUrl=http://10

HTTP caching and distributed search

2010-02-02 Thread Charlie Jackson
Currently, I've got a Solr setup in which we're distributing searches across two cores on a machine, say core1 and core2. I'm toying with the notion of enabling Solr's HTTP caching on our system, but I noticed an oddity when using it in combination with distributed searching. Say, for example, I ha

Re: Indexing a oracle warehouse table

2010-02-02 Thread Alexey Serba
> Dont define any so that column in > SOLR will be same as in the database table. Correct You can define dynamic field ( see http://wiki.apache.org/solr/SchemaXml#Dynamic_fields ) > 1)How do I define unique field in this scenario? You can create primary key into database or generate it directly

RE: Indexing an oracle warehouse table

2010-02-02 Thread Ken Lane (kenlane)
I was able to let Solr pick up the column names. But if you want to explicitly state them try this query: SQL> SELECT '' 2 FROM user_tab_cols 3 WHERE table_name = 'QUEST_SL_TEMP_EXPLAIN1'; -Original Message- From: caman [mailto:aboxfortheotherst...@gmail.com]

Re: Indexing an oracle warehouse table

2010-02-02 Thread caman
Ron, Much appreciated. Search requirements are as : 1) Enable search/faceting on author,service,datetime. 2) Enable full text search on all text column which are named as col1 col800+ -- total of more than 800 columns. Here is what I did so far: Defined entities in db schema in db-config

Re: Indexing an oracle warehouse table

2010-02-02 Thread Ron Chan
it depends on what the search requirements are, so without knowing the details here are some vague pointers you may only need to have fields for the columns you are going to be categorizing and searching on, this may be a small subset of the 800 and the rest can go into one large field to fulf

RE: DataImportHandler delta-import confusion

2010-02-02 Thread Leann Pereira
Hi Paul, Can you take me off this distribution list? Thanks, Leann From: noble.p...@gmail.com [noble.p...@gmail.com] On Behalf Of Noble Paul നോബിള്‍ नोब्ळ् [noble.p...@corp.aol.com] Sent: Tuesday, February 02, 2010 2:12 AM To: solr-user@lucene.apache.o

RE: fetchindex not working.

2010-02-02 Thread Jeremy Carroll
I apologize. There is not a fetchindex command on the master server. But I do see this in the logs. Replication does not take place at all. I'm trying to figure out why it's not sending the index from the master to the slave after sending the fetchindex command. ### SLAVE CATALINA.OUT ### INFO:

RE: fetchindex not working.

2010-02-02 Thread Jeremy Carroll
I do not see a stack trace, but the replication does not happen. I do not see a request in the logs of the master server that it even tried to pull the index. From: Shalin Shekhar Mangar [shalinman...@gmail.com] Sent: Tuesday, February 02, 2010 11:36 AM To:

Re: fetchindex not working.

2010-02-02 Thread Shalin Shekhar Mangar
On Tue, Feb 2, 2010 at 10:26 PM, Jeremy Carroll < jeremy.carr...@networkedinsights.com> wrote: > I'm trying to get solr to fetch an index from a master server. This does > not work with our configuration, and throws an error that pollInterval > cannot be null. I thought to get fetchindex working,

Re: Indexing an oracle warehouse table

2010-02-02 Thread caman
Anyone please? caman wrote: > > Hello all, > > hope someone can point me to right direction. I am trying to index an > oracle warehouse table(TableA) with 850 columns. Out of the structure > about 800 fields are CLOBs and are good candidate to enable full-text > searching. Also have few column

Re: Re: Solr and location based searches

2010-02-02 Thread sandro . ruch
Thanks guys... very helpful links and good starting points... :-) On 02.02.2010 at 10:23, Hannes Carl Meyer wrote: Hi Sandro, it is possible but beside the technical platform you will need a good data basis for example Geonames, Y! Geo etc. You should also check the following article by Grant

Re: Deploying Solr 1.3 in JBoss 5

2010-02-02 Thread Sascha Szott
Luca Molteni wrote: Actually, if I hard-code the value, it gives me the same error... interesting. According to the error message: The content of element type "env-entry" must match "(description?,env-entry-name,env-entry-value?,env-entry-type)" Maybe it helps to change the order of elements w

fetchindex not working.

2010-02-02 Thread Jeremy Carroll
I'm trying to get solr to fetch an index from a master server. This does not work with our configuration, and throws an error that pollInterval cannot be null. I thought to get fetchindex working, that pollInterval should not be in the solrconfig.xml file? I have tried adding pollInterval to the

Re: java.lang.NullPointerException with MySQL DataImportHandler

2010-02-02 Thread Sascha Szott
Hi, since some of the fields used in your DIH configuration aren't mandatory (e.g., keywords and tags are defined as nullable in your db table schema), add a default value to all optional fields in your schema configuration (e.g., default = ""). Note, that Solr does not understand the db-rela

Re: Deploying Solr 1.3 in JBoss 5

2010-02-02 Thread Luca Molteni
Hello, You're probably right. This is probably a problem related to Jboss 5, but since I used the procedure of solr deployment described there I thought that someone would have already dealt with this problem. Actually, if I hard-code the value, it gives me the same error... interesting. L.M.

Re: Replication Questions

2010-02-02 Thread Walter Underwood
Performance tends to be bad on NFS? Yep. Updates were 100X slower for me. wunder On Feb 2, 2010, at 7:53 AM, Shalin Shekhar Mangar wrote: > On Tue, Feb 2, 2010 at 7:45 PM, Lars R. Noldan wrote: > >> Hi all, >> >> Is it possible to configure Solr to store its search index on a shared >> piece

Re: Deploying Solr 1.3 in JBoss 5

2010-02-02 Thread Sascha Szott
Hi, I'm not sure if that's a Solr issue. However, what happens if you set env-entry-value to C:/mypath/solr instead of ${solr.home.myhome}? -Sascha On 02.02.2010 at 15:20, Luca Molteni wrote: Hello list, I'm having some problem deploying solr to JBoss 5. The problem is with environment vari

Re: java.lang.NullPointerException with MySQL DataImportHandler

2010-02-02 Thread Jean-Michel Philippon-Nadeau
Hi, Thanks for the reply. On Tue, 2010-02-02 at 16:57 +0100, Sascha Szott wrote: > * the output of MySQL's describe command for all tables/views referenced > in your DIH configuration mysql> describe products; ++--+--+-+-++ | Field

Re: java.lang.NullPointerException with MySQL DataImportHandler

2010-02-02 Thread Shalin Shekhar Mangar
On Tue, Feb 2, 2010 at 8:36 PM, Jean-Michel Philippon-Nadeau < j...@jmpnadeau.ca> wrote: > > I am running into an issue with my MySQL DataImportHandler. I've > followed the quick-start in order to write the necessary config and so > far everything seemed to work. > > However, I am missing some fie

Re: java.lang.NullPointerException with MySQL DataImportHandler

2010-02-02 Thread Sascha Szott
Hi, can you post * the output of MySQL's describe command for all tables/views referenced in your DIH configuration * the DIH configuration file (i.e., data-config.xml) * the schema definition (i.e., schema.xml) -Sascha Jean-Michel Philippon-Nadeau wrote: Hi, It is my first install of Solr

Re: can I import some data in index without delta import?

2010-02-02 Thread Shalin Shekhar Mangar
On Tue, Feb 2, 2010 at 7:19 PM, Vijayant Kumar wrote: > Hi, > > I had index some large amount of data using Dataimporthandler by > full-import. > > My indexing is taking around 5 hours. we have huge amount of data on table > I can not add last_modified column in our tables. > > I want to know how

Re: Replication Questions

2010-02-02 Thread Shalin Shekhar Mangar
On Tue, Feb 2, 2010 at 7:45 PM, Lars R. Noldan wrote: > Hi all, > > Is it possible to configure Solr to store its search index on a shared > piece of media such as an NFS mount to replicate it across two machines in > real time? > It is possible but not recommended. Search performance tends to

Re: Contributors - Solr in Action Case Studies

2010-02-02 Thread Lukáš Vlček
This would be very welcome! I am interested in this particular use case. In other words: if the book contains this use case then you can count on me buying it! :-) Regards, Lukas On Tue, Feb 2, 2010 at 2:49 PM, Grant Ingersoll wrote: > I'd be happy to contribute how we use Solr to

Re: Contributors - Solr in Action Case Studies

2010-02-02 Thread Óscar Marín Miró
Hello, We've been working extensively with Solr as a 'standard' search service. However, recently, we had a volume problem displaying time series (for instance, sentiment of a brand by date), pulling data from a highly denormalized database. Indexing a view of this database, coupled with faceting (g

Re: Deploying Solr 1.3 in JBoss 5

2010-02-02 Thread Luca Molteni
Because the current release of our software is based on solr 1.3, and we scheduled the migration to JBoss 5 before the migration to solr 1.4. It's a concurrency problem! L.M. On 2 February 2010 16:05, Erik Hatcher wrote: > Out of curiosity, why are you deploying Solr 1.3 rather than Solr 1.4

java.lang.NullPointerException with MySQL DataImportHandler

2010-02-02 Thread Jean-Michel Philippon-Nadeau
Hi, It is my first install of Solr. The setup has been pretty straightforward and yet, the performance is very impressive. I am running into an issue with my MySQL DataImportHandler. I've followed the quick-start in order to write the necessary config and so far everything seemed to work. Howeve

Re: Deploying Solr 1.3 in JBoss 5

2010-02-02 Thread Erik Hatcher
Out of curiosity, why are you deploying Solr 1.3 rather than Solr 1.4? Sorry, but I don't know about the JBoss issues first hand myself. Erik On Feb 2, 2010, at 9:20 AM, Luca Molteni wrote: Hello list, I'm having some problem deploying solr to JBoss 5. The problem is with environme

Deploying Solr 1.3 in JBoss 5

2010-02-02 Thread Luca Molteni
Hello list, I'm having some problem deploying solr to JBoss 5. The problem is with environment variables: Following this page of the wiki: http://wiki.apache.org/solr/SolrJBoss I've added to the web.xml of WEB-INF of solr solr/home java.lang.String ${solr.home.myhome} Since I'm

Replication Questions

2010-02-02 Thread Lars R. Noldan
Hi all, Is it possible to configure Solr to store its search index on a shared piece of media such as an NFS mount to replicate it across two machines in real time? If not, can someone point me towards formal documentation on how to get replication working? I tried adding the following to

Re: ContentStreamUpdateRequest addFile fails to close Stream

2010-02-02 Thread Mark Miller
Christoph Brill wrote: > I tried to fix it in CommonsHttpSolrServer but I wasn't sure how to do > it. I tried to close the stream after the method got executed, but > somehow getContent() always returned null (see attached patch against > solr 1.4 for my non-working attempt). > > Who's responsible

can I import some data in index without delta import?

2010-02-02 Thread Vijayant Kumar
Hi, I have indexed a large amount of data using DataImportHandler with full-import. My indexing takes around 5 hours. We have a huge amount of data in the tables and I cannot add a last_modified column to them. I want to know how I can manage content that has been added/deleted in the database

Re: Contributors - Solr in Action Case Studies

2010-02-02 Thread Grant Ingersoll
I'd be happy to contribute how we use Solr to power http://search.lucidimagination.com. We ingest many different data sources (email, web, wiki, JIRA, source code, etc.) and use dismax, multi select faceting and a variety of other techniques. I think it would make for a great case study. -Gr

Re: ContentStreamUpdateRequest addFile fails to close Stream

2010-02-02 Thread Christoph Brill
I tried to fix it in CommonsHttpSolrServer but I wasn't sure how to do it. I tried to close the stream after the method got executed, but somehow getContent() always returned null (see attached patch against solr 1.4 for my non-working attempt). Who's responsible for closing a stream? CommonsHttpS

Re: ContentStreamUpdateRequest addFile fails to close Stream

2010-02-02 Thread Mark Miller
Broken by design? How about we just fix BinaryUpdateRequestHandler (and possibly CommonsHttpSolrServer) to close the stream it gets? Christoph Brill wrote: > I worked around it using my own FileStream class. It's basically the same > as the original one, except it does not create a new FileInputSt

Re: ContentStreamUpdateRequest addFile fails to close Stream

2010-02-02 Thread Christoph Brill
I worked around it using my own FileStream class. It's basically the same as the original one, except it does not create a new FileInputStream in getStream (which IMO is broken by design as no one ever closes this stream). public class FileStream extends ContentStreamBase { private final java.io.F

DataImportHandler - convertType attribute

2010-02-02 Thread Alexey Serba
Hello, I encountered a blob indexing problem and found the convertType solution in the FAQ. I was wondering why it is not enabled by default and found the following comm

ContentStreamUpdateRequest addFile fails to close Stream

2010-02-02 Thread Christoph Brill
Hi list, I'm using ContentStreamUpdateRequest.addFile(File) to index a bunch of documents. This works fine, except that the stream created in addFile doesn't seem to get closed. This causes issues because my process ends up with too many open files. It's a bug, right? Regards, Chris
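The general pattern at stake in this thread is that whoever consumes a stream should also close it, even when reading fails. The sketch below shows that pattern with plain `java.io` classes only; it does not use or reproduce any Solr class, and the name `StreamConsumer` is hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical helper: reads a stream to the end and always closes it.
final class StreamConsumer {

    // Returns the number of bytes read; the stream is closed on success and on failure.
    static long drainAndClose(InputStream in) {
        try (InputStream stream = in) {          // try-with-resources closes the stream
            long total = 0;
            byte[] buffer = new byte[8192];
            int read;
            while ((read = stream.read(buffer)) != -1) {
                total += read;
            }
            return total;
        } catch (IOException e) {
            throw new UncheckedIOException(e);   // surface read/close errors unchecked
        }
    }

    public static void main(String[] args) {
        System.out.println(drainAndClose(new ByteArrayInputStream(new byte[10])));
    }
}
```

When a consumer does not follow this pattern, each processed file leaves a descriptor open until garbage collection, which matches the "too many open files" symptom reported above.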

Re: Problem in indexing on large data set by Dataimporthandler in solr

2010-02-02 Thread Vijayant Kumar
Hi Erik, Thanks for your suggestion. I updated the Solr version and the problem is rectified. > Can you give it a shot on Solr 1.4 instead? DIH has had numerous > enhancements/fixes since 1.3. > > Erik > > > On Feb 1, 2010, at 8:42 AM, Vijayant Kumar wrote: > >> >>> Hi, >>> >>> I am

ClassCastException setting date.formats in ExtractingRequestHandler

2010-02-02 Thread Christoph Brill
Hi list, I tried to add the following to my solrconfig.xml (to the ' -MM-dd which is described on the wiki page of the ExtractingRequestHandler[1]. After doing so I always get a ClassCastException once the lazy init of the handler is happening. This is a stock solr 1.4 with no modifica

Re: DataImportHandler delta-import confusion

2010-02-02 Thread Noble Paul നോബിള്‍ नोब्ळ्
try deltaImportQuery="select [bunch of stuff] WHERE m.moment_id = '${dataimporter.delta.moment_id}'" The key has to be same and in the same case On Tue, Feb 2, 2010 at 1:45 AM, Jon Drukman wrote: > First, let me just say that DataImportHandler is fantastic. It got my old > mysql-php-xml index

Re: Solr and location based searches

2010-02-02 Thread Hannes Carl Meyer
Hi Sandro, it is possible but beside the technical platform you will need a good data basis for example Geonames, Y! Geo etc. You should also check the following article by Grant Ingersoll: http://www.ibm.com/developerworks/opensource/library/j-spatial/index.html Are we talking about a global or

Re: Solr and location based searches

2010-02-02 Thread Markus Jelsma
Hi, You can use three different approaches: - Solr Spatial [1]; - Local Solr [2]; - Implement it yourself [3]. The first is promising, the latter is fun but is far less useful and powerful! [1]: http://wiki.apache.org/solr/SpatialSearch [2]: http://wiki.apache.org/solr/LocalSolr [3]: http://wi