Re: Metada document for faceted search

2009-07-09 Thread Osman İZBAT
Thank you Chris. I've found out how to implement my faceted search. I don't index any metadata document; instead I create my in-memory faceting data structure from the database in my request handler's init method, compute facet counts on each request, and write them to the response as a NamedList of NamedLists. Regards
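As a rough illustration of the per-request counting step described above, here is a minimal Python sketch. It is not Osman's handler: the function name, the dict-based documents, and the nested list-of-pairs return shape (loosely mirroring a NamedList of NamedLists) are all assumptions made for the example.

```python
from collections import Counter


def facet_counts(docs, facet_fields):
    """Count how many matching documents carry each value of each facet field.

    docs: iterable of dicts (hypothetical stand-in for the matched documents).
    facet_fields: field names to facet on, in the order they should be reported.
    """
    counts = {field: Counter() for field in facet_fields}
    for doc in docs:
        for field in facet_fields:
            value = doc.get(field)
            if value is None:
                continue
            # Accept both single-valued and multi-valued fields.
            values = value if isinstance(value, (list, tuple, set)) else [value]
            # set() ensures a document is counted once per distinct value.
            counts[field].update(set(values))
    # Ordered (name, [(value, count), ...]) pairs, highest count first.
    return [
        (field, sorted(counter.items(), key=lambda kv: (-kv[1], kv[0])))
        for field, counter in counts.items()
    ]
```

Called with the documents matched by a request and the list of facet fields, it returns one entry per field, each holding value/count pairs sorted by descending count.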

Re: Solr's MLT query call doesn't work

2009-07-09 Thread SergeyG
Done. Unfortunately with the same result. :confused: Thanks, Jun. Isn't it really strange? Again, I'm not the first person using Solr. I wonder if the matter might be just local, due to some not so obvious reason manifesting itself only on my machine (which is, of course, very unlikely but still

RE: Using curl comparing with using WebService::Solr

2009-07-09 Thread Francis Yakin
I guess I also commit too often: we have 1000 folders, and each loop iteration executes the load and a commit, so 1000 loops means 1000 commits. I think it would help if I committed only once after all 1000 loops have completed. Any input? Thanks Francis -Original Message- From: Francis Yaki

RE: Using curl comparing with using WebService::Solr

2009-07-09 Thread Francis Yakin
Yes, the xml files are in complete add format. This is my code: #!/usr/bin/perl if (($#ARGV + 1) <= 0 ) { print "Usage: perl prod.pl \n\n"; exit(1); } ## -- CHANGE accordingly $timeout = 300; $topdir = "/opt/Test/xml-file/"; #$topdir = "/opt/Test/"; $dir

solr Analyzer help

2009-07-09 Thread revas
Hi, In the solr 1.3 download, under the folder src/java/org/apache/solr/analysis, I find the following tokenizer classes for other languages (other than English): 1. Chinese tokenizer 2. cjk tokenizer, which is not expected to work very well with Japanese; for Chinese we already have the Chinese tok

Re: DIH: URLDataSource and incremental indexing

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
hi Erik, It is designed to achieve this using a Transformer. I am assuming that your API gives delta "deleted/modified/added" documents. Always run a full-import with clean=false. Depending on the values returned by the API your transformer can use $deleteById for deletes etc. $nextUrl and $hasM

Re: Creating DataSource for DIH to Oracle Database

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
oracle is no different from other data sources. Have you encountered any problems? On Tue, Jul 7, 2009 at 3:11 AM, Francis Yakin wrote: > > Have any one had experience creating a datasource for DIH to an Oracle > Database? > > Also, from the Solr side we are running weblogic and deploy the applic

Re: Using curl comparing with using WebService::Solr

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
I guess it would be faster if you run this program from a different box; that way the disk IO will not be a bottleneck 2009/7/10 Noble Paul നോബിള്‍ नोब्ळ् : > are these xml files in the solr add xml format? > > When you post using curl, I guess it opens as many http connections as > there are fil

Re: Using curl comparing with using WebService::Solr

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
are these xml files in the solr add xml format? When you post using curl, I guess it opens as many http connections as there are files. if you can write a small program to post all these files in one request, you should be able to get better perf. the following can be the pseudo-code open connec
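The pseudo-code in the message above is cut off in this archive, so the following is only a sketch of the general idea (fewer requests, one commit at the end), not a reconstruction of it. It assumes every file is a complete Solr `<add>` document; the function names are hypothetical, and the update URL is the one from the original question.

```python
import urllib.request
import xml.etree.ElementTree as ET


def merge_add_documents(xml_strings):
    """Combine the <doc> elements of several <add> documents into one <add>."""
    merged = ET.Element("add")
    for text in xml_strings:
        root = ET.fromstring(text)
        if root.tag != "add":
            raise ValueError("expected <add> root, got <%s>" % root.tag)
        merged.extend(root.findall("doc"))
    return ET.tostring(merged, encoding="unicode")


def post_xml(update_url, body):
    """POST one XML update message (requires a running Solr instance)."""
    request = urllib.request.Request(
        update_url,
        data=body.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urllib.request.urlopen(request) as response:
        return response.status


def index_batch(update_url, xml_strings):
    """Send all documents in a single request, then commit exactly once."""
    post_xml(update_url, merge_add_documents(xml_strings))
    post_xml(update_url, "<commit/>")
```

For example, `index_batch("http://localhost:7001/solr/update", contents_of_one_folder)` would replace one curl call per file with two HTTP requests per folder. Merging a whole folder in memory is a design choice for brevity; very large batches may need to be split.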

Re: Aggregating/Grouping Document Search Results on a Field

2009-07-09 Thread Bradford Stephens
Oh, wow... I think that faceted search is the right path, especially since seeing this amazing site: http://www.lucidimagination.com/Community/Hear-from-the-Experts/Articles/Faceted-Search-Solr I hope it's performant over hundreds of thousands of search results :) On Thu, Jul 9, 2009 at 10:13 PM,

Re: Aggregating/Grouping Document Search Results on a Field

2009-07-09 Thread Bradford Stephens
It looks like field collapsing may be the key: http://issues.apache.org/jira/browse/SOLR-236 But it also doesn't seem to be 'finalized' yet. I wonder how performant it is with indexes of 50 million documents+? On Thu, Jul 9, 2009 at 9:42 PM, shb wrote: > you can refer to the facet search of solr,

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
the jars should either go in solr.home/lib or in the WEB-INF/lib of tomcat On Thu, Jul 9, 2009 at 5:22 PM, gateway0 wrote: > > Is it maybe the installation paths. > > On XP I used: > - xampp ("G:\xampp") > - installed tomcat plugin for xampp ("G:\xampp\tomcat") > - solr incl. data dir ("G:\solr") >

Using curl comparing with using WebService::Solr

2009-07-09 Thread Francis Yakin
I have about 1000 folders, each containing 2581 xml files, for a total of ~2.6 million xml files. I developed a perl script that executes this command: curl http://localhost:7001/solr/update --data-binary "@0039000.xml" -H 'Content-type:text/plain; charset=utf-8' It took me ab

Re: Aggregating/Grouping Document Search Results on a Field

2009-07-09 Thread shb
you can refer to the facet search of solr, that might help you. 2009/7/10 Bradford Stephens > Greetings, > > We've been experimenting with grouping fields returned from document > search results in Lucene, and we haven't gotten anything very > encouraging. Basically, the more results we return,

Aggregating/Grouping Document Search Results on a Field

2009-07-09 Thread Bradford Stephens
Greetings, We've been experimenting with grouping fields returned from document search results in Lucene, and we haven't gotten anything very encouraging. Basically, the more results we return, the longer it takes -- tens of seconds. Probably because we're doing expensive disks seeks. I'm hoping t

Distributed Search in Solr

2009-07-09 Thread Sumit Aggarwal
Hi, 1. Are calls to multiple shards made concurrently or serially? 2. Any idea of the algorithm followed for merging data? I mean, how efficient is it? 3. Lucene provides a payload concept. How can we search using that in Solr? My application store payloads and use search using our

Re: I am getting HTTP Version Not Supported (505)Error

2009-07-09 Thread Edukondalu Avula
Edukondalu Avula wrote: > > Hi Friends, > > I am working on Apache solr for indexing data by using java programming. > For Indexing data i used tomcat server and i started solr, i prepared url > for indexing data. i given that url in any browser it's working (indexed > the data). I given the p

Re: Boosting for most recent documents

2009-07-09 Thread Bill Au
With a time stamp you can use a function query to boost the score of newer documents: http://wiki.apache.org/solr/SolrRelevancyFAQ#head-b1b1cdedcb9cd9bfd9c994709b4d7e540359b1fd Bill On Thu, Jul 9, 2009 at 5:58 PM, vivek sar wrote: > How do we sort by internal doc id (say on one index only) usin
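The linked FAQ entry is not reproduced in this archive. As a small worked example of the kind of function involved: Solr's `recip(x, m, a, b)` function evaluates `a / (m*x + b)`, and applying it to a reverse ordinal of a timestamp field (1 for the newest document) yields a value that decays with age. The constants below are illustrative assumptions, and how the value is combined with the relevance score depends on the query parser in use.

```python
def recip(x, m, a, b):
    """Reciprocal function: a / (m * x + b)."""
    return a / (m * x + b)


def recency_value(rank_from_newest, m=1.0, a=1000.0, b=1000.0):
    """Recency value for a document.

    rank_from_newest plays the role of a reverse ordinal on a timestamp
    field: 1 for the newest document, larger numbers for older ones.
    """
    return recip(rank_from_newest, m, a, b)
```

With these constants the newest document gets a value just under 1.0, and the value halves by the time a document is the 1000th newest.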

Lock timed out 2 worker running

2009-07-09 Thread Renz Daluz
Hi all, I have 2 workers running (the app that builds the index) and both are pointing to the same "Solr" (1.3.0) master instance when updating/committing documents. I'm using SolrJ to save the documents. Then I have 4 slave solr instances that get the index from the master via rsync. Now. from time to t

Re: SEVERE: java.lang.ArrayIndexOutOfBoundsException

2009-07-09 Thread Fernando.B
:( that is all we have in there!!! Is there any way I can raise the logging level for it? Fernando.B wrote: > > Hello, > > we're having a lot of > > Jul 10, 2009 7:55:01 AM org.apache.solr.common.SolrException log > SEVERE: java.lang.ArrayIndexOutOfBoundsException > > on our application.

Re: SEVERE: java.lang.ArrayIndexOutOfBoundsException

2009-07-09 Thread Mark Miller
We need a bit more of that stack trace. Is there any more? Something with line numbers and class names is best. That is a very generic exception. -- -- - Mark http://www.lucidimagination.com On Thu, Jul 9, 2009 at 11:55 PM, Fernando.B wrote: > > Hello, > > we're having a lot of > > Jul 10, 2

SEVERE: java.lang.ArrayIndexOutOfBoundsException

2009-07-09 Thread Fernando.B
Hello, we're having a lot of Jul 10, 2009 7:55:01 AM org.apache.solr.common.SolrException log SEVERE: java.lang.ArrayIndexOutOfBoundsException on our application. Any leads? - Running SOLR 1.3 (this was also happening on 1.2 before) - Using SOLR for "user" open text queries -- View this m

Re: Boosting for most recent documents

2009-07-09 Thread vivek sar
How do we sort by internal doc id (say on one index only) using Solr? I saw couple of threads saying it (Sort.INDEXORDER) was not supported in Solr, http://www.nabble.com/sort-by-index-id-descending--td16124009.html#a16124009 http://www.nabble.com/Reverse-sorting-by-index-order-td1321032.html#a132

AlternateField and DynamicField

2009-07-09 Thread Rui Carneiro
Hi, I have the following index: I want to use highlight to minimize the traffic load. But, when the highlight find nothing, i want the first N words from file_*. It is possible to do something like this? hl=on hl.fl=file_* hl.alternateField=file_* hl.*maxAlternateFieldLengt

Re: Retrieve docs with > 1 multivalue field hits

2009-07-09 Thread A. Steven Anderson
> > : I thought I remembered seeing a thread related to retrieving only > documents > : that had more than one hit in a particular multivalue field, but I cannot > : find it now. > > not easily. > > for arbitrary queries there's no simple way to know that a query matches > a single document, mult

Re: Boosting for most recent documents

2009-07-09 Thread Otis Gospodnetic
Ah, with multiple indices you can't rely on the max Lucene doc Id. I think you have to do with the timestamp approach. Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: vivek sar > To: solr-user@lucene.apache.org > Sent: Thursday, July 9,

Re: Solr spring application context error

2009-07-09 Thread John Wang
Ah, my bad, this is the same as null. Sorry for the stupid comment. -John On Thu, Jul 9, 2009 at 1:55 PM, John Wang wrote: > Hi guys: > Could it be in method:load in class CoreContainer: > > libLoader = SolrResourceLoader.createClassLoader(f, null); > > >The parent ClassLoader is set to

Re: Deleting from SolrQueryResponse

2009-07-09 Thread Chris Hostetter
: > one thing to keep in mind however is that post-processing a DocList to : > filter stuff out is almost never a good idea -- things get really : > convoluted when you think about dealing with pagination and except for : > some really trivial use cases you can never know what your upper bound

Re: Solr spring application context error

2009-07-09 Thread Mark Miller
A value of null will cause the parent class loader to be set to: Thread.currentThread().getContextClassLoader(); The context class loader for a given thread is the class loader that the threads creator wants you to use (good for servlet containers), so looks right to me. If its not set, it will

Re: Metada document for faceted search

2009-07-09 Thread Chris Hostetter
: : I'm trying to implement custom faceted search like CNET's : approach. : . : But i couldn't figure out how to structure and index category metadata : document. The short answer: "however you want" For CNET, we already had

Re: Solr spring application context error

2009-07-09 Thread John Wang
Hi guys: Could it be in method:load in class CoreContainer: libLoader = SolrResourceLoader.createClassLoader(f, null); The parent ClassLoader is set to null, shouldn't it be the current classloader? e.g." libLoader = SolrResourceLoader.createClassLoader(f,Thread.currentThread().getCont

Re: Retrieve docs with > 1 multivalue field hits

2009-07-09 Thread Chris Hostetter
: I thought I remembered seeing a thread related to retrieving only documents : that had more than one hit in a particular multivalue field, but I cannot : find it now. not easily. for arbitrary queries there's no simple way to know that a query matches a single document, multiple times, in s

Re: Making Analyzer Phrase aware?

2009-07-09 Thread Chris Hostetter
: I was looking at the SOLR-908 port of nutch CommonGramsFilter as an approach : for having phrase searches be sensitive to stop words within a query. So a : search on "car on street" wouldn't match the text "car in street". : : >From what I can tell the query version of the filter will *always*

Re: Creating DataSource for DIH to Oracle Database

2009-07-09 Thread Jay Hill
Francis, your question is a little vague. Are you looking for the configuration for connecting the DIH to a JNDI datasource set up in Weblogic? -Jay On Mon, Jul 6, 2009 at 2:41 PM, Francis Yakin wrote: > > Have any one had experience creating a datasource for DIH to an Oracle > Database?

Re: custom queryparser issues

2009-07-09 Thread Chris Hostetter
: org.apache.solr.search.LuceneQParserPlugin overwriting public : QParser createParser(String qstr, SolrParams localParams, : SolrParams params, SolrQueryRequest req) { : : so that I can adjust params before I pass it with super.createParser() : to original LuceneQParserPlugin :

Re: Compiling TermsComponent for use with Solr 1.3

2009-07-09 Thread Chris Hostetter
: I've copied TermsComponent and TermsParams from trunk into my own package : and Eclipse isn't reporting any build errors ... : SEVERE: java.lang.ClassCastException: : com.titanpublishing.solr.TermsComponent cannot be cast to : org.apache.solr.util.plugin.NamedListInitializedPlugin Terms

Re: Creating DataSource for DIH to Oracle Database

2009-07-09 Thread Chris Hostetter
: Have any one had experience creating a datasource for DIH to an Oracle Database? why wouldn't the stock JdbcDataSource work? -Hoss

Re: DisMax query parser syntax for the fq parameter

2009-07-09 Thread Erik Hatcher
On Jul 9, 2009, at 1:02 PM, gistol...@gmx.de wrote: I am using the dismax query parser syntax for the fq param: .../select?qt=dismax&rows=30&q.alt=*:*&qf=content&fq={!dismax qf=contentKeyword^1.0 mm=0%}Foo&fq=+date:[2009-03-11T00:00:00Z TO 2009-07-09T16:41:50Z]&fl=id,date,content Now, I

Re: Boosting for most recent documents

2009-07-09 Thread vivek sar
Thanks Otis. I have a distributed index, using Solr multi-core. Basically, I have 6 indexer instances running on 3 different boxes. A couple of questions: 1) Is it possible to sort on document id across multiple shards? How is that done? 2) How would I boost the most recent docs at index time? Thanks, -vi

DisMax query parser syntax for the fq parameter

2009-07-09 Thread gistolero
Hello, I am using the dismax query parser syntax for the fq param: .../select?qt=dismax&rows=30&q.alt=*:*&qf=content&fq={!dismax qf=contentKeyword^1.0 mm=0%}Foo&fq=+date:[2009-03-11T00:00:00Z TO 2009-07-09T16:41:50Z]&fl=id,date,content Now, I want to add one more field to the qf parameter: .

DIH: URLDataSource and incremental indexing

2009-07-09 Thread Erik Hatcher
I'm exploring other ways of getting data into Solr via DataImportHandler than through a relational database, particularly the URLDataSource. I see the special commands for deleting by id and query as well as the $hasMore/$nextUrl techniques, but I'm unclear on exactly how one would go abo

Re: I am getting HTTP Version Not Supported (505)Error

2009-07-09 Thread Koji Sekiguchi
Edukondalu, Beside the question, are you aware of Solrj - Java API for Solr. http://wiki.apache.org/solr/Solrj Koji Edukondalu Avula wrote: Hi Friends, I am working on Apache solr for indexing data by using java programming. For Indexing data i used tomcat server and i started solr, i prepar

Re: Using relevance scores for pseudo-random-probabilistic ordenation

2009-07-09 Thread Raimon Bosch
It Worked for me changing: public void setScorer(Scorer scorer) { this.scorer = new ScoreCachingWrappingScorer(scorer); } by public void setScorer(Scorer scorer) { this.scorer = scorer; } in my PseudoRandomFieldComparator. Regards, Raimon Bosch. Raimon Bosch wrote: > > Hi, >

I am getting HTTP Version Not Supported (505)Error

2009-07-09 Thread Edukondalu Avula
Hi Friends, I am working on Apache Solr, indexing data from a Java program. For indexing I use a Tomcat server; I started Solr and prepared a URL for indexing data. When I give that URL to any browser it works (the data is indexed). When I give the prepared URL to the URL class I get the HTTP V

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread gateway0
Is it maybe the installation paths. On XP I used: - xampp ("G:\xampp") - installed tomcat plugin for xampp ("G:\xampp\tomcat") - solr incl. data dir ("G:\solr") On Mac I used: - xampp for mac (path: /Applications/XAMPP/...) - tomcat 6.0.20 (path: /Applications/tomcat 6.0.20/...) - solr incl. dat

Re: Search results depending on search word length?

2009-07-09 Thread solenweg
Just an extra comment in context with this issue: Maybe there is a way to set the search criteria a little "looser" - meaning it doesn't have to contain so big a correct percentage of the real name/title. Or am I way off? -- View this message in context: http://www.nabble.com/Search-results-depe

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
the mysql driver. I do not remember the exact name of the driver jar file On Thu, Jul 9, 2009 at 4:39 PM, gateway0 wrote: > > What driver do you mean exactly (filename & path) so I can compare my > installation on xp and mac os? > > > > Noble Paul നോബിള്‍  नोब्ळ्-2 wrote: >> >> so it says the driv

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread gateway0
What driver do you mean exactly (filename & path) so I can compare my installation on xp and mac os? Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: > > so it says the driver is not present in the classpath. > > On Thu, Jul 9, 2009 at 4:11 PM, gateway0 wrote: >> >> Here is the complete error message: >>

Search results depending on search word length?

2009-07-09 Thread solenweg
New to solr, so maybe it's a stupid question, but here we go: I've got a long title name like "thisisaverylongtesttitle" and I am then doing a search for "thisisavery", but with no results. With a search for "thisisaverylongtit" I can get the result. But why not with a shorter search criteria? Is there

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
so it says the driver is not present in the classpath. On Thu, Jul 9, 2009 at 4:11 PM, gateway0 wrote: > > Here is the complete error message: > " > type Status report > > message Severe errors in solr configuration. Check your log files for more > detailed information on what may be wrong. If you

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread gateway0
Here is the complete error message: " type Status report message Severe errors in solr configuration. Check your log files for more detailed information on what may be wrong. If you want solr to continue after configuration errors, change: false in null ---

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
Is this the complete stacktrace ? or is there anything else that is missing? On Thu, Jul 9, 2009 at 3:14 PM, gateway0 wrote: > > Hi, > > I wanted to port my windows installation of solr to mac os. But the > following error occured: > > "Could not create importer. DataImporter config invalid at > o

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread gateway0
Here is my data-config.xml: " " Noble Paul നോബിള്‍ नोब्ळ्-2 wrote: > > there is something wrong w/ your data-config.

Re: Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread Noble Paul നോബിള്‍ नोब्ळ्
there is something wrong w/ your data-config.xml you can paste it here On Thu, Jul 9, 2009 at 3:14 PM, gateway0 wrote: > > Hi, > > I wanted to port my windows installation of solr to mac os. But the > following error occured: > > "Could not create importer. DataImporter config invalid at > org.apa

Could not create importer. DataImporter config invalid at... MAC OSX

2009-07-09 Thread gateway0
Hi, I wanted to port my windows installation of solr to mac os. But the following error occurred: "Could not create importer. DataImporter config invalid at org.apache.solr.handler.dataimport.DataImportHandler.inform(DataImportHandler.java:114) at org.apache.solr.core.SolrResourceLoader.inform(So

Re: Solr's MLT query call doesn't work

2009-07-09 Thread Jun Ohtani
What about http://localhost:8080/solr/select?q=id:10&mlt=true&mlt.fl=content_mlt&mlt.maxqt=5&mlt.interestingTerms=details&fl=title+author+score&mlt.mindf=1 ? Jun Ohtani SergeyG wrote: > > Hi, > > Recently, while implementing the MoreLikeThis search, I've run into the > situation when Solr's

Re: Create incremental snapshot

2009-07-09 Thread tushar kapoor
What I gather from this discussion is - 1. Snapshots are always hard links and not actual files, so they cannot possibly consume the same amount of space. 2. Snapshots contain hard links to existing docs + delta docs. We are facing a situation wherein the snapshot occupies the same space as th