Re: Integrating Solr with Database

2012-11-12 Thread Gora Mohanty
On 12 November 2012 13:17, 122jxgcn ywpar...@gmail.com wrote: [...] If you still want to do what you had in mind, you should handle that as part of your indexing process, i.e., update both Solr and the database at the same time I have thought about that, but I could not figure out how to

Re: Integrating Solr with Database

2012-11-12 Thread Gora Mohanty
On 12 November 2012 13:54, 122jxgcn ywpar...@gmail.com wrote: I was thinking of using RESTful API instead of client application accessing database directly. Something like, client application opens url named http://(server url)/updatedb?(id of document), or something similar, then Java logic

Re: SolrZKClient changed interface

2012-11-12 Thread Trym R. Møller
Can anyone verify that the jira has been created sensible? Thanks in advance. https://issues.apache.org/jira/browse/SOLR-4066 Best regards Trym Den 10-11-2012 00:54, Mark Miller skrev: Please file a JIRA issue for this change. - Mark On Nov 9, 2012, at 8:41 AM, Trym R. Møller t...@sigmat.dk

RE: How Index word document in solr.

2012-11-12 Thread Markus Jelsma
hi - Check the Extracting Request Handler manual: http://wiki.apache.org/solr/ExtractingRequestHandler -Original message- From:veena rani veenara...@gmail.com Sent: Mon 12-Nov-2012 10:09 To: solr-user@lucene.apache.org Subject: How Index word document in solr. Hi, Please

RE: DIH nested entities don't work

2012-11-12 Thread mroosendaal
Hi, I've created a jar with 5 files: 4 files with DIHCache*.java 1 file named BerkleyBackedCache.java I've changed the data-config based on your input. What i see it doing is that it is building a cache at the given location. However the first testrun took almost *3* hours before i got a

Unable to run two multicore Solr instances under Tomcat

2012-11-12 Thread Adam Neal
Hi, I have been running two multicore Solr instances under Tomcat using a nightly build of 4.0 from September 2011. This has been running fine but when I try to update these instances to the release version of 4.0 I'm hitting problems when the second instance starts up. If I have one instance

Re: Handle Queries which return 1000s of records

2012-11-12 Thread Rafał Kuć
Hello! By pieces you mean by paging the results ? If yes, please look at http://wiki.apache.org/solr/CommonQueryParameters - start and rows parameters. -- Regards, Rafał Kuć Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch - ElasticSearch Hi, I am integrating solr search on my

Re: Error with SolrCloud

2012-11-12 Thread Tomás Fernández Löbbe
I'm not sure what could be the issue here, maybe there is a problem with finding the name of your machine? can you manually find ' http://carlos-OptiPlex-790:8080/solr' ? Maybe if you set the host parameter in solr.xml ( http://wiki.apache.org/solr/SolrCloud#SolrCloud_Instance_Params ) to

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
It's strange, my solrconfig.xml is inside the war file. Just in case, here is the actual content: https://gist.github.com/9f0ea455fd6a0d92a196 Thanks in advance On Mon, Nov 12, 2012 at 1:33 PM, Carlos Alexandro Becker caarl...@gmail.com wrote: Hm, I still getting nullpointers accessing

RE: DIH nested entities don't work

2012-11-12 Thread Dyer, James
Here's what I'd do next: - double check you're only caching the child entity, not the parent. - Replace the SELECT * queries with a list of actual fields you want. - specify the persistCacheFieldNames and persistCacheFieldTypes parameters (see the doc-comment for DIHCachePersistProperties) - Try

Re: Error with SolrCloud

2012-11-12 Thread Tomás Fernández Löbbe
In the last stacktrace that you sent, it looks like there is no configuration directory in zk called myconf. Make sure you use a configuration that was already uploaded to zk. You can use zkCli.sh (in the zookeeper bin directory) to see what's inside of the configs zk node. If you started Solr (at

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
yeah, I changed to collection1, but I not understand why I have to pass a configName if it use collection1 by default? also, I tried with collection1 config, and got another error: 13:49:02,226 ERROR [org.apache.solr.core.SolrCore] (http--0.0.0.0-8080-2) org.apache.solr.common.SolrException:

Re: Error with SolrCloud

2012-11-12 Thread Tomás Fernández Löbbe
Once the configuration is uploaded to zk, Solr will stop using the directory you specify in the solr.xml to start using the configuration from zk (to make sure all the nodes of the collection have the same configuration). Maybe for some reason the solrconfig file was not uploaded properly? Could

Re: Error with SolrCloud

2012-11-12 Thread Carlos Alexandro Becker
Actually, that's really strange, I cant even see the dump button, and got this error: SolrCore Initialization Failures - *collection1:* org.apache.solr.common.cloud.ZooKeeperException:org.apache.solr.common.cloud.ZooKeeperException: Please check your logs for more information On

Re: Apache Nutch 1.5.1 + Apache Solr 4.0

2012-11-12 Thread Antony Steiner
Thank you very much. Everything is working fine now. Best regards Antony

java.io.IOException: Map failed :: OutOfMemory

2012-11-12 Thread uwe72
While adding lucene document we got this problem: What can we do here? Nov 12, 2012 3:25:09 PM org.apache.solr.update.DirectUpdateHandler2 commit INFO: start commit(optimize=false,waitFlush=true,waitSearcher=true,expungeDeletes=false) Exception in thread Lucene Merge Thread #0

Admin Permissions

2012-11-12 Thread Michael Long
I really like the new admin in solr 4.0, but specifically I don't want developers to be able to unload, rename, swap, reload, optimize, or add core. Any ideas on how I could still give access to the rest of the admin without giving access to these? It is very helpful for them to have access

SOLR - Search Issue

2012-11-12 Thread ravi.n
Hello Guys, We had a requirement to search existing lucene indexes and configure to indexing new data. 1. Installed apache-solr-3.6.1 and setup multicore for the existing indexes. 2. Solr is running on windows platform with tomcat 7 64 bit 3. Coll1 and Coll2 has old lucene index files and new

Solr 3.5 DIH Delta Import hanging

2012-11-12 Thread Arcadius Ahouansou
Hello. We have a Solr 3.5 master randomly hanging during delta import. It does not happen very often. When it does, the output of dataimport?command=status is shown below. After each page refresh, the only thing that changes is the Time Elapsed. I have also done a thread dump (see below). It

RE: DIH nested entities don't work

2012-11-12 Thread mroosendaal
Thanks, i'll give that a try tomorrow. Here's the data-config.xml which i will try when i get to work tomorrow: dataConfig dataSource name=quot;jdbc1quot; driver=quot;oracle.jdbc.driver.OracleDriverquot; url=quot;jdbc:oracle:thin:@//lt;host:1521/ENDDEV user=un password=pw/

How to speed up Facet count (Big index) ??!!!!

2012-11-12 Thread Aeroox Aeroox
Hi folks, I have a solr index with up to 50M documents. A document contain 62 fields (docid, name, location). The facet count took 1 to 2 minutes with this params : http://.../select/?q=solr;

Re: sort by function error

2012-11-12 Thread Yonik Seeley
On Mon, Nov 12, 2012 at 5:24 AM, Kuai, Ben ben.k...@sensis.com.au wrote: more information, problem only happends when I have both sort by function and grouping in query. I haven't been able to duplicate this with a few ad-hoc queries. Could you give your complete request (or at least all of

Re: php client for Solr 4.0.0

2012-11-12 Thread Jorge Luis Betancourt Gonzalez
I'm currently using solarium with solr 3.6, perhaps you can tweak solarium as needed? I suppose that pull requests are welcome into solarium for solr 4. Greetings! On Nov 12, 2012, at 2:56 PM, Bill Au bill.w...@gmail.com wrote: Anyone know of a PHP client that is compatible with Solr 4.0.0?

Re: customize solr search/scoring for performance

2012-11-12 Thread jchen2000
The following was generated from jvisualvm. Seems like the perf is related to scoring a lot. Any idea/pointer on how to customize that part? http://lucene.472066.n3.nabble.com/file/n4019850/profilingResult.png -- View this message in context:

RE: sort by function error

2012-11-12 Thread Kuai, Ben
Hi Yonik Thanks for the reply. My sample query, q=cafesort=geodist(geoLocation,-32.123323,108.123323)+ascgroup.field=familyId field name=geoLocation type=latLon indexed=true stored=false / field name=familyId type=string indexed=true stored=false / as long as I remove the group field the query

Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
Hi, I'm migrating from Solr 1.2 to 3.6.1. I used the same analyzer as I was, and re-indexed my data. I did not add solr.ReversedWildcardFilterFactory to my index analyzer, but yet leading wild cards are working!! Does this mean it's turned on by default? If so, how do I turn it off,

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread François Schiettecatte
John You can still use leading wildcards even if you dont have the ReversedWildcardFilterFactory in your analysis but it means you will be scanning the entire dictionary when the search is run which can be a performance issue. If you do use ReversedWildcardFilterFactory you wont have that

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
Thanks for the quick response. So, I do not want to use ReversedWildcardFilterFactory, but leading wildcard is working and thus is ON by default. How do I disable it to prevent the use of it and the issues that come with it? -- MJ -Original Message- From: François Schiettecat te

Re: How to speed up Facet count (Big index) ??!!!!

2012-11-12 Thread Otis Gospodnetic
Hi, Have you tried the other facet method or newer Solr? Otis -- Performance Monitoring - http://sematext.com/spm On Nov 12, 2012 2:40 PM, Aeroox Aeroox aero...@gmail.com wrote: Hi folks, I have a solr index with up to 50M documents. A document contain 62 fields (docid, name, location).

Re: How to speed up Facet count (Big index) ??!!!!

2012-11-12 Thread Yonik Seeley
On Mon, Nov 12, 2012 at 8:39 PM, Aeroox Aeroox aero...@gmail.com wrote: Hi folks, I have a solr index with up to 50M documents. A document contain 62 fields (docid, name, location). The facet count took 1 to 2 minutes with this params : http://.../select/?q=solr;

Example for Scheduling Solr Indexing - Hadoop

2012-11-12 Thread Britt
Background I have a file that gets dropped into a new directory every 10 minutes. Examples: /2012/11/05/HH/10/bigfile.txt /2012/11/05/HH/20/bigfile.txt /2012/11/05/HH/30/bigfile.txt /2012/11/05/HH/40/bigfile.txt I need to schedule a job to index these files every 10 minutes. Examples:

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread François Schiettecatte
I suspect it is just part of the wildcard handling, maybe someone can chime in here, you may need to catch this before it gets to SOLR. François On Nov 12, 2012, at 5:44 PM, johnmu...@aol.com wrote: Thanks for the quick response. So, I do not want to use ReversedWildcardFilterFactory,

RE: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
At one point, in some version of Solr, it was OFF by default, and you had to enable it via a setting (either in solrconfig.xml or schema.xml, I don't remember). It looks like this is no longer the case. Even worse, and if this is true, disabling it no longer seems to be possible to disable

Re: 4.0 query question

2012-11-12 Thread dm_tim
That is terrific. Thanks. Regards, Tim -- View this message in context: http://lucene.472066.n3.nabble.com/4-0-query-question-tp4019397p4019890.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread Michael Ryan
Yeah, the situation is kind of a pain right now. In https://issues.apache.org/jira/browse/SOLR-2438, it was enabled by default and there is no way to disable without patching SolrQueryParser. There's also the edismax parser which doesn't have a setting for this, which I've made a jira for at

RE: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread johnmunir
I'm surprised that this has not been logged as adefect. The fact that this is ON bydefault, means someone can bring down a server; this is bad enough to categorizethis as a security issue. --MJ -Original Message- From: Michael Ryan [mailto:mr...@moreover.com] Sent: Monday,

4.0 query question

2012-11-12 Thread dm_tim
I just discovered the idf scoring component and it's biting me on the butt. In the particular case I have all terms are equally relevant. Is there any way to turn off idf scoring? Regards, Tim -- View this message in context:

Re: 4.0 query question

2012-11-12 Thread Otis Gospodnetic
Hi, Yes, via a custom Similarity, which you then reference in solrconfig.xml, that always returns 1 from the idf method. Was mentioned on this list a few days back - see search-lucene.com. Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring -

Re: Solr GC issues - Too many BooleanQuery BooleanClause objects in heap

2012-11-12 Thread Otis Gospodnetic
Hi, I've never seen this. You don't have a custom query parser or anything else custom, do you? Have you tried dumping and analyzing heap? YourKit has a 7 day eval, or you can use things like jhat, which may be included on your machine already (see

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread Yonik Seeley
On Tue, Nov 13, 2012 at 2:27 AM, johnmu...@aol.com wrote: I'm surprised that this has not been logged as adefect. The fact that this is ON bydefault, means someone can bring down a server; this is bad enough to categorizethis as a security issue. It's all relative. There are tons of

Re: Example for Scheduling Solr Indexing - Hadoop

2012-11-12 Thread Otis Gospodnetic
Hi, This could be as simple as writing an app that periodically checks the appropriate directories, looks for any new files added since last checks, and then reads/parses them (presumably there are data for records/documents that need to be indexed in those files), constructs SolrInputDocuments,

Re: SOLR - Search Issue

2012-11-12 Thread Otis Gospodnetic
Hi, Have you tried optimizing your indices? That should upgrade their format to whatever Solr/Lucene version you are using. You could also try IndexUpgrader - http://search-lucene.com/jd/lucene/core/org/apache/lucene/index/IndexUpgrader.html Otis -- Search Analytics -

Re: customize solr search/scoring for performance

2012-11-12 Thread Robert Muir
Whenever I look at solr users' stacktraces for disjunctions, I always notice they get BooleanScorer2. Is there some reason for this or is it not intentional (e.g. maybe a in-order collector is always being used when its possible at least in simple cases to allow for out-of-order hits?) When I

Re: Solr 4.0 - distributed updates without zookeeper?

2012-11-12 Thread Otis Gospodnetic
Hi Peter, Not sure I have the answer for you, but are you looking to avoid using ZK for some reason? Or are you OK with ZK per se, but just don't want any leader re-election and any other dynamic/cloudy behaviour? Could you not simply treat 1 node as the master to which you send all your updates

Re: sort by function error

2012-11-12 Thread Yonik Seeley
I can't reproduce this with the example data. Here's an example of what I tried: http://localhost:8983/solr/query?q=*:*sort=geodist(store,-32.123323,108.123323)+ascgroup.field=inStockgroup=true Perhaps this is an issue that's since been fixed. -Yonik http://lucidworks.com On Mon, Nov 12,

Re: Is leading wildcard search turned on by default in Solr 3.6.1?

2012-11-12 Thread Jack Krupansky
Be sure to realize that even with reverse wildcard support, the user can add a trailing wildcard as well (double-ended wildcard) and then you are back in the same boat. The overall idea is that: 1) Hardware is much faster than just 3 or 4 years ago, and 2) even though document counts are

Re: SOLR - Search Issue

2012-11-12 Thread ravi.n
Thanks. Can be this done by Solr or should we write Java class? Regards, Ravi -- View this message in context: http://lucene.472066.n3.nabble.com/SOLR-Search-Issue-tp4019816p4019939.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: solr4.0 problem zkHost with multiple hosts throws out of range exception

2012-11-12 Thread deniz
so do we need to add one of the servers from the -DzkHost string to -DzkRun? should it look like -DzkRun=host1:port -DzkHost=host:port, host1:port, host2:port in the start up command? and will wiki page be updated? because the example there is still letting into the error that was

Re: SOLR - Search Issue

2012-11-12 Thread Otis Gospodnetic
You can issue an optimize request to Solr, just peek at the Wiki. IndexUpgrader is a command-line tool. No coding needed (tm) Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Mon, Nov 12, 2012 at 11:59 PM,