Re: access matched token ids in the FacetComponent?

2013-03-06 Thread Dmitry Kan
Thanks Mikhail. On Tue, Mar 5, 2013 at 8:23 PM, Mikhail Khludnev wrote: > Something like this. > > > On Tue, Mar 5, 2013 at 6:16 PM, Dmitry Kan wrote: > > > Hello, > > > > I spent some more time on this and used Mikhail's suggestions of which > > cla

Re: how to deploy customization in solr that requires dependency

2013-03-18 Thread Dmitry Kan
Hi, See here, might help: http://wiki.apache.org/solr/SolrPlugins#How_to_Load_Plugins We don't use multicore functionality of SOLR, so we decided to bundle SOLR dependencies into the war file of the solr web app. Regards, Dmitry On Mon, Mar 18, 2013 at 7:47 PM, Gian Maria Ricci wrote: > Hi t

stop distributed facet query when a condition is met

2012-01-30 Thread Dmitry Kan
satisfied, continue searching with the remaining shard set. What would be the proven way to achieve this? -- Regards, Dmitry Kan

Re: search returns 'categories' instead of url

2012-02-01 Thread Dmitry Kan
An offtopic: as some of my questions went unnoticed too, I could recommend asking them somewhere else in parallel, for example: stackoverflow.com. But as SOLR and its "ecosystem" sometimes pose tough questions and problems, stackoverflow can ignore them as well. Anyhow, just another opportunity..

Re: Fail to compile Java code (trying to use SolrJ with Solr)

2012-02-01 Thread Dmitry Kan
> After correcting the problems, you can resume the build with the command > mvn -rf :lucene-core > 'cmd' is not recognized as an internal or external command, > operable program or batch file. > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Fail-to-compile-Java-code-trying-to-use-SolrJ-with-Solr-tp3708902p3708923.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Zero Matches Weirdness

2012-02-03 Thread Dmitry Kan
> > params={facet=true&sort=score+desc&fl=sitzung,gremium,betreff,datum,timestamp,score,aktenzeichen,typ,id,anhang&debugQuery=true&start=0&q=Am+Heidstamm&hl.fl=betreff&wt=json&fq=&hl=true&rows=10} > hits=0 status=0 QTime=29 > > ... and for the quoted one: > > INFO: [] webapp=/solr path=/select > > params={facet=true&sort=score+desc&fl=sitzung,gremium,betreff,datum,timestamp,score,aktenzeichen,typ,id,anhang&start=0&q="Am+Heidstamm"&hl.fl=betreff&wt=standard&fq=&hl=true&rows=10&version=2.2} > hits=14 status=0 QTime=244 > > > Thanks! > -- Regards, Dmitry Kan

Re: Zero Matches Weirdness

2012-02-03 Thread Dmitry Kan
Actually, I wouldn't count on it and just specify index and query sides explicitly. Just to play it safe. On Fri, Feb 3, 2012 at 8:34 PM, Marian Steinbach wrote: > 2012/2/3 Dmitry Kan : > > What about side of the field? > > > > It's identical. At least tha

Re: Zero Matches Weirdness

2012-02-03 Thread Dmitry Kan
ue... was there literally a + in the query or was that > urlencoded? Try debugQuery=true for both queries and see what you get for > the query parsing output. > > Erik > > On Feb 3, 2012, at 14:18 , Dmitry Kan wrote: > > > Actually, I wouldn't count on it and

Re: is there any practice to load index into RAM to accelerate solr performance?

2012-02-08 Thread Dmitry Kan
Hi, This talk has some interesting details on setting up a Lucene index in RAM: http://www.lucidimagination.com/devzone/events/conferences/revolution/2011/lucene-yelp Would be great to hear your findings! Dmitry 2012/2/8 James > Is there any practice to load index into RAM to accelerate so

Re: Fields not indexed?

2012-02-08 Thread Dmitry Kan
ntries in > my table. > However when I try to run a *:* query from the admin console I only get > responses in the form: > > < doc> > 1.0 > 1 > > > I'm not sure how to get to the bottom of this. > Thanks. > -- Regards, Dmitry Kan

Re: Fields not indexed?

2012-02-08 Thread Dmitry Kan
well, you should add these fields in schema.xml, otherwise solr won't know them. On Wed, Feb 8, 2012 at 2:48 PM, Radu Toev wrote: > The schema.xml is the default file that comes with Solr 3.5, didn't change > anything there. > > On Wed, Feb 8, 2012 at 2:45 PM, Dmitry Kan

Re: Entity with multiple datasources

2012-02-16 Thread Dmitry Kan
the second). However I'm getting a total of 2k. > Where could be the problem? > > Thanks > -- Regards, Dmitry Kan

Re: Entity with multiple datasources

2012-02-16 Thread Dmitry Kan
> 1. Nothing in the logs > 2. No. > > On Thu, Feb 16, 2012 at 12:44 PM, Dmitry Kan wrote: > > > 1. Do you see any errors / exceptions in the logs? > > 2. Could you have duplicates? > > > > On Thu, Feb 16, 2012 at 10:15 AM, Radu Toev wrote: > > >

Re: Entity with multiple datasources

2012-02-16 Thread Dmitry Kan
same, if I do separately the 1k database. It indexes ok. > > On Thu, Feb 16, 2012 at 2:11 PM, Dmitry Kan wrote: > > > It sounds a bit, as if SOLR stopped processing data once it queried all > > from the smaller dataset. That's why you have 2000. If you just have a >

Re: Entity with multiple datasources

2012-02-16 Thread Dmitry Kan
> > > > > > > > > > > > I've removed the connection params > The unique key is id. > > On Thu, Feb 16, 2012 at 2:27 PM, Dmitry Kan wrote: > > > OK, maybe you can show the db-data-config.xml just in case? > > Also in schema.

Re: Entity with multiple datasources

2012-02-16 Thread Dmitry Kan
2012 at 4:47 PM, Radu Toev wrote: > I'm not sure I follow. > The idea is to have only one document. Do the multiple documents have the > same structure then(different datasources), and if so how are they actually > indexed? > > Thanks. > > On Thu, Feb 16, 2012 at 4:40 PM,

Re: Entity with multiple datasources

2012-02-16 Thread Dmitry Kan
no problem, hope it helps, you're welcome. On Thu, Feb 16, 2012 at 5:03 PM, Radu Toev wrote: > Really good point on the ids, I completely overlooked that matter. > I will give it a try. > Thanks again. > > On Thu, Feb 16, 2012 at 5:00 PM, Dmitry Kan wrote: > >

Re: What Interface to use for programming compatible filters in SOLR?

2012-02-24 Thread Dmitry Kan
For example, this way: 1. Implement a filter factory: [code] package com.mycomp.solr.analysis; import org.apache.lucene.analysis.TokenStream; import org.apache.solr.analysis.BaseTokenFilterFactory; import org.apache.solr.common.ResourceLoader; import org.apache.solr.util.plugin.ResourceLoaderAwa

Re: solr returns reduced results for same query after adding a new field to the schema.

2012-02-28 Thread Dmitry Kan
" stored="true" required="true"/> > type="ingredientSuggestionType" stored="true"/> > > and >... > > > >recipeId >ingredientText > > > > When I query solr with the query ?q=ingredientSuggestion=banana I get > 160 results. > Ok, all fine. > > When I add a new field such as > > stored="true" required="true"/> > > to my index it reduces the number of results from my query to 131, even > though the query > hasn't changed and does not (at least explicitly) filter the result set. > > > Obviously I'm missing something fundamental, but I'm not sure what it > is. Has anyone else experienced a similar > problem? Am I doing something wrong in the way I am indexing my > database? > > > Mark > > > > > > > > > http://www.bbc.co.uk/ > This e-mail (and any attachments) is confidential and may contain personal > views which are not the views of the BBC unless specifically stated. > If you have received it in error, please delete it from your system. > Do not use, copy or disclose the information in any way nor act in > reliance on it and notify the sender immediately. > Please note that the BBC monitors e-mails sent or received. > Further communication will signify your consent to this. > > -- Regards, Dmitry Kan

Re: Faster Solr Indexing

2012-03-11 Thread Dmitry Kan
ise search server advice - p. 305, >> section "The Solution: Shingling"). >> >> Clearly the size of the docs is a contributor to the slow adds (confirmed >> by removing these 2 fields resulting in halving the indexing time). We've >> tried compressed=true also but that is not working. >> >> Any guidance on how to support our application logic (without having to >> change the schema too much) and speed the indexing speed (from current 212 >> days for 12MM docs) would be much appreciated. >> >> thank you >> >> Peyman >> >> > -- Regards, Dmitry Kan

Re: Performance (responsetime) on request

2012-03-12 Thread Dmitry Kan
teNumberParts="1" catenateWords="1" catenateNumbers="1" > catenateAll="0" splitOnCaseChange="1"/> > > > > > > > > > > > > > > If I search from my webapplication in my autosuggest box, I get response > times of ~500ms per request. Is it possible to "tune" solr, so that I get > faster results? > > I have no special cache configuration, nor I don't know what to configure > here. > > > > Thanks, > > Ramo > > -- Regards, Dmitry Kan

Re: Performance (responsetime) on request

2012-03-12 Thread Dmitry Kan
escription:Concurrent LRU Cache(maxSize=512, initialSize=512, > minSize=460, acceptableSize=486, cleanupThread=false) > stats: lookups : 0 > hits : 0 > hitratio : 0.00 > inserts : 0 > evictions : 0 > size : 0 > warmupTime : 0 > cumulative_lookups : 0 > cumulative_hits

Re: Performance (responsetime) on request

2012-03-12 Thread Dmitry Kan
onfigure this stuff and what impact it has. > > Thanks, > Ramo > > -----Original Message----- > From: Dmitry Kan [mailto:dmitry@gmail.com] > Sent: Monday, 12 March 2012 16:21 > To: solr-user@lucene.apache.org > Subject: Re: Performance (responsetime) on

Re: Using the ids parameter

2012-03-26 Thread Dmitry Kan
Hi, Actually, we ran into the same issue when using the ids parameter in a "Solr front with shards" architecture (an exception is thrown in the Solr front). Were you able to solve it by using the key:value syntax or some other way? BTW, there was a related issue: https://issues.apache.org/jira/browse/SOLR-

Re: Using the ids parameter

2012-03-27 Thread Dmitry Kan
So I solved it by using key:(id1 OR ... idn). On Tue, Mar 27, 2012 at 9:14 AM, Dmitry Kan wrote: > Hi, > > Actually we ran into the same issue with using ids parameter, in the solr > front with shards architecture (exception throws in the solr front). Were > you able to solve
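For anyone landing on this thread: the workaround above replaces the ids parameter with an ordinary disjunction on the unique key field. A minimal sketch of building such a query string follows, assuming the unique key field is literally named "key" as in the message; the class and method names are made up for illustration, and ids containing query-syntax characters would need escaping first, which this sketch does not do.

```java
import java.util.Arrays;
import java.util.List;

public class IdsQuerySketch {

    // Builds field:(id1 OR id2 OR ... idn).
    // Hypothetical helper, not part of Solr or SolrJ.
    static String buildIdQuery(String field, List<String> ids) {
        if (ids.isEmpty()) {
            throw new IllegalArgumentException("at least one id is required");
        }
        return field + ":(" + String.join(" OR ", ids) + ")";
    }

    public static void main(String[] args) {
        // Prints: key:(id1 OR id2 OR id3)
        System.out.println(buildIdQuery("key", Arrays.asList("id1", "id2", "id3")));
    }
}
```

The resulting string would go into the q (or fq) parameter of a normal distributed request instead of ids.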

Re: Luke using shards

2012-03-29 Thread Dmitry Kan
> but only got the results for localhost:8983. I am able to search across the > shards so my url's are correct. > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Luke-using-shards-tp3865816p3865816.html > Sent from the Solr - User mailing list archive

Re: Using jetty's GzipFilter in the example solr.war

2012-03-30 Thread Dmitry Kan
he same error. > > Can you tell me where im going wrong? Can you guide me on this? > > Thanks. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Using-jetty-s-GzipFilter-in-the-example-solr-war-tp1894069p3870625.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: A little onfusion with maxPosAsterisk

2012-04-06 Thread Dmitry Kan
3889226.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

solr 3.4 with nTiers >= 2: usage of ids param causes NullPointerException (NPE)

2012-04-11 Thread Dmitry Kan
uction (only a hypothesis) [code] ShardDoc sdoc = rb.resultIds.get(id); [/code] returns sdoc=null, which causes the next line of code to fail with an NPE: [code] int idx = sdoc.positionInResponse; [/code] Am I missing anything? Can something be done for solving this issue? Thanks. -- Regards, Dmitry Kan
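To illustrate the failure mode described above, here is a small self-contained sketch. It does not use Solr's classes: ShardDoc and the resultIds map below are stand-ins that model only the two lines quoted in the message, and the null check is one possible defensive pattern, not the fix that was eventually applied in this thread.

```java
import java.util.HashMap;
import java.util.Map;

public class ShardDocGuardSketch {

    // Stand-in for Solr's ShardDoc; only the field used in the thread is modelled.
    static class ShardDoc {
        final int positionInResponse;

        ShardDoc(int positionInResponse) {
            this.positionInResponse = positionInResponse;
        }
    }

    // Returns the position of the document in the response,
    // or -1 when the id is not present in resultIds.
    static int positionOf(Map<Object, ShardDoc> resultIds, Object id) {
        ShardDoc sdoc = resultIds.get(id);
        if (sdoc == null) {
            // Without this check, sdoc.positionInResponse throws an NPE.
            return -1;
        }
        return sdoc.positionInResponse;
    }

    public static void main(String[] args) {
        Map<Object, ShardDoc> resultIds = new HashMap<>();
        resultIds.put("doc1", new ShardDoc(0));

        System.out.println(positionOf(resultIds, "doc1")); // known id
        System.out.println(positionOf(resultIds, "doc2")); // unknown id, guarded
    }
}
```

Whether skipping an unknown id is acceptable depends on why the map lacks the entry in the first place, which is the open question in the message.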

Re: solr 3.4 with nTiers >= 2: usage of ids param causes NullPointerException (NPE)

2012-04-12 Thread Dmitry Kan
Can anyone help me out with this? Is this too complicated / unclear? I could share more detail if needed. On Wed, Apr 11, 2012 at 3:16 PM, Dmitry Kan wrote: > Hello, > > Hopefully this question is not too complex to handle, but I'm currently > stuck with it. > > We ha

Re: solr 3.4 with nTiers >= 2: usage of ids param causes NullPointerException (NPE)

2012-04-12 Thread Dmitry Kan
't get into it and, particularly, how the using > disjunction query in place of IDS can help you. Could you please provide > more detailed info like stacktraces, etc. Btw, have you checked trunk for > your case? > > On Thu, Apr 12, 2012 at 7:08 PM, Dmitry Kan wrote: > >

Re: solr 3.4 with nTiers >= 2: usage of ids param causes NullPointerException (NPE)

2012-04-12 Thread Dmitry Kan
ase anyone interested). To make it part of a releasable trunk, one would most probably need to provide some way to configure "1st tier level". Thanks, Dmitry On Thu, Apr 12, 2012 at 9:34 PM, Yonik Seeley wrote: > On Wed, Apr 11, 2012 at 8:16 AM, Dmitry Kan wrote: > > We have

Re: Importing formats - Which works best with Solr?

2012-04-20 Thread Dmitry Kan
really have much grounding > for that. > > Any input would be appreciated. > > James > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Importing-formats-Which-works-best-with-Solr-tp3925557p3925557.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Auto suggest on indexed file content filtered based on user

2012-04-25 Thread Dmitry Kan
case sensitive. > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Auto-suggest-on-indexed-file-content-filtered-based-on-user-tp3934565p3937370.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Searching on fields with White Spaces

2012-04-25 Thread Dmitry Kan
The problem here is that, e.g., New York is stored as two different tokens in your index, since you use the whitespace tokenizer. The easiest solution would be to detect and break the incoming one-word query tokens into several tokens, e.g. newyork => new york. That's probably possible only if there is a finit
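A rough sketch of the newyork => new york idea on the client side, before the query is sent to Solr. It assumes a known lookup table of run-together words is available; the table contents, class name and method name are invented for illustration, and this is plain string handling rather than a Solr analysis filter.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class CompoundSplitSketch {

    // Replaces each query token found in the lookup table with its split form.
    // Tokens that are not in the table are passed through unchanged.
    static String expand(String query, Map<String, String> compounds) {
        List<String> out = new ArrayList<>();
        for (String token : query.trim().split("\\s+")) {
            String key = token.toLowerCase(Locale.ROOT);
            out.add(compounds.getOrDefault(key, token));
        }
        return String.join(" ", out);
    }

    public static void main(String[] args) {
        // Hypothetical lookup table; a real one would come from your own data.
        Map<String, String> compounds = new HashMap<>();
        compounds.put("newyork", "new york");

        // Prints: hotels in new york
        System.out.println(expand("hotels in newyork", compounds));
    }
}
```

The same mapping could probably be expressed inside Solr with a query-side synonym list, but that is a configuration change rather than code and is not shown here.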

Re: problem with date searching.

2012-05-04 Thread Dmitry Kan
://lucene.472066.n3.nabble.com/problem-with-date-searching-tp3961761.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: problem with date searching.

2012-05-04 Thread Dmitry Kan
" TO "2012-02-02T01:30:52Z"] > like that even though i am not getting any result . > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/problem-with-date-searching-tp3961761p3961833.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Word recognised in a search

2012-05-04 Thread Dmitry Kan
gs from the source using that > words. > > I hope I managed in explain my problem well :-) > > Could you help me, please? > > Thank you very much! > Bye. > -- Regards, Dmitry Kan

Re: Facet and totaltermfreq

2012-05-04 Thread Dmitry Kan
have multivalued fields which may have duplicates > and I would like to be able to get a count of how many documents that > term appears (currently what faceting does) but also how many times > that term appears in general. > -- Regards, Dmitry Kan

Re: Facet and totaltermfreq

2012-05-05 Thread Dmitry Kan
1056 ... -Dmitry On Sat, May 5, 2012 at 12:05 AM, Jamie Johnson wrote: > it might be...can you provide an example of the request/response? > > On Fri, May 4, 2012 at 3:31 PM, Dmitry Kan wrote: > > I have tried (as a test) combining facets and term vectors ( > > http://wiki.a

Re: Solr query issues

2012-05-09 Thread Dmitry Kan
Below are the configuration files: > http://lucene.472066.n3.nabble.com/file/n3974922/conf.rar conf.rar > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-query-issues-tp3974922.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Solr query issues

2012-05-09 Thread Dmitry Kan
> > > > > words="lang/stopwords_hi.txt" enablePositionIncrements="true"/> > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_hu.txt" format="snowball" > enablePositionIncrements="true"/> > language="Hungarian"/> > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_hy.txt" enablePositionIncrements="true"/> > language="Armenian"/> > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_id.txt" enablePositionIncrements="true"/> > > stemDerivational="true"/> > > > > > positionIncrementGap="100"> > > > > articles="lang/contractions_it.txt"/> > > words="lang/stopwords_it.txt" format="snowball" > enablePositionIncrements="true"/> > > > > > > > positionIncrementGap="100" autoGeneratePhraseQueries="false"> > > > > > > > > tags="lang/stoptags_ja.txt" enablePositionIncrements="true"/> > > > > words="lang/stopwords_ja.txt" enablePositionIncrements="true" /> > > minimumLength="4"/> > > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_lv.txt" enablePositionIncrements="true"/> > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_nl.txt" format="snowball" > enablePositionIncrements="true"/> > dictionary="lang/stemdict_nl.txt" ignoreCase="false"/> > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_no.txt" format="snowball" > enablePositionIncrements="true"/> > language="Norwegian"/> > > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_pt.txt" format="snowball" > enablePositionIncrements="true"/> > > > > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_ro.txt" enablePositionIncrements="true"/> > language="Romanian"/> > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_ru.txt" format="snowball" > enablePositionIncrements="true"/> > language="Russian"/> > > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_sv.txt" format="snowball" > enablePositionIncrements="true"/> > language="Swedish"/> > > > > > > 
positionIncrementGap="100"> > > > > > words="lang/stopwords_th.txt" enablePositionIncrements="true"/> > > > > > positionIncrementGap="100"> > > > > words="lang/stopwords_tr.txt" enablePositionIncrements="true"/> > language="Turkish"/> > > > > > > > > required="true" /> > > > > > > > > > id > > > biog > > > > > > > > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Solr-query-issues-tp3974922p3975167.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Merging two DocSets in solr

2012-05-13 Thread Dmitry Kan
to one and get the facet counts from the merged > docSets. How do I do it? Any pointers would be appreciated. > > -- > With Thanks and Regards, > Ramprakash Ramamoorthy, > Project Trainee, > Zoho Corporation. > +91 9626975420 > -- Regards, Dmitry Kan

Re: Kernel methods in SOLR

2012-05-14 Thread Dmitry Kan
munity to solve > similar problems, efficiently without paying a hefty response time price? > > thank you > > Peyman > > [1] http://en.wikipedia.org/wiki/Kernel_methods -- Regards, Dmitry Kan

Re: clickable links as results?

2012-05-23 Thread Dmitry Kan
- User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Dmitry Kan
> > When I do a wildcard search like this: "foo*" - doc is > > > found. > > > > Please see http://wiki.apache.org/solr/MultitermQueryAnalysis > > > Well, it works in 3.6. With one exception: If I use German umlauts it does > not work anymore. > > Text: Bär > > Bä* -> no hits > Bär -> hits > > What can I do in this case? > > Thank you > > -- Regards, Dmitry Kan

Re: Wildcard-Search Solr 3.5.0

2012-05-23 Thread Dmitry Kan
the umlauts but I have no idea what... > > > -----Original Message----- > > From: Dmitry Kan [mailto:dmitry@gmail.com] > > Sent: Wednesday, 23 May 2012 13:36 > > To: solr-user@lucene.apache.org > > Subject: Re: Wildcard-Search Solr 3.5.0 > > > > wha

Re: Throws Null Pointer Exception Even Query is Correct in solr

2012-05-23 Thread Dmitry Kan
p;breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml > > > > > > > - > THANKS AND REGARDS, > SYED ABDUL KATHER > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Throws-Null-Pointer-Exception-Even-Query-is-Correct-in-solr-tp3985736p3985754.html > Sent from the Solr - User mailing list archive at Nabble.com. -- Regards, Dmitry Kan

Re: indexing unstructured text (tweets)

2012-05-28 Thread Dmitry Kan
s to address my > problem, > but also shows that before getting to Solr, I might need to use > another Apache tool called Tika. > > Can anybody provide a brief explaination about the general picture? > Can I index my tweets with Solr? > Or do I need to put also Tika in my pipeline? > > Best regards, > Giovanni Gherdovich > -- Regards, Dmitry Kan

Re: Sorting performance

2012-06-08 Thread Dmitry Kan
> http://lucene.472066.n3.nabble.com/Sorting-performance-tp3987633.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Building a heat map from geo data in index

2012-06-11 Thread Dmitry Kan
olr there was a way to do geospatial faceting. What > >> I am imagining is bounding box being defined and that box being broken > >> into an N by N matrix, each of which would return counts so a heat map > >> could be constructed. Any other thoughts on this would be greatly > >> appreciated, right now I am really just fishing for some ideas. > > > > > > > -- Regards, Dmitry Kan

Re: Writing index files that have the right owner

2012-06-16 Thread Dmitry Kan
ions associated with that > owner, in the first place. Is there somewhere in the Solr or Tomcat > configuration files, or in the SolrJ code, where I can set who the owner of > files written to the index directories should be? > Thanks, > Mike > -- Regards, Dmitry Kan

Re: write.lock

2012-06-21 Thread Dmitry Kan
out > there have any ideas/experience as to what is causing the locks, and > what config changes that I can make in order to prevent the lock? > > Any help would be very appreciated! > > -- Chris > -- Regards, Dmitry Kan

Re: Multiple query hit for one query

2012-06-25 Thread Dmitry Kan
write code outside solr? If yes, should I use Solrj for this? > Please tell me the right approach. > > Thanks > Hemant > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Multiple-query-hit-for-one-query-tp3990466.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

faceting over ngrams

2011-03-16 Thread Dmitry Kan
ng on a cloud with Hadoop make it faster (to seconds) as faceting seems to be a natural map-reduce task? Are there any other options to look into before stepping into the cloud? Please let me know, if you need specific details on the schema / solrconfig setup or the like. -- Regards, Dmitry Kan

Re: faceting over ngrams

2011-03-16 Thread Dmitry Kan
> about scaling for number of documents (or total bytes) in your index. 1M > documents is not very large, for Solr, in general. > > Jonathan > > > On 3/16/2011 11:51 AM, Toke Eskildsen wrote: > >> On Wed, 2011-03-16 at 13:05 +0100, Dmitry Kan wrote: >> >>

Re: faceting over ngrams

2011-03-16 Thread Dmitry Kan
ed or is it too much worry for you? I'll check out the direct query and let you know. On Wed, Mar 16, 2011 at 5:51 PM, Toke Eskildsen wrote: > On Wed, 2011-03-16 at 13:05 +0100, Dmitry Kan wrote: > > Hello guys. We are using shard'ed solr 1.4 for heavy faceted search over >

Re: faceting over ngrams

2011-03-16 Thread Dmitry Kan
cumulative_inserts : 1 cumulative_evictions : 0 item_shingleContent_trigram : {field=shingleContent_trigram,memSize=1786355392,tindexSize=17977426,time=662387,phase1=654707,nTerms=53492050,bigTerms=38,termInstances=602090958,uses=398} On Wed, Mar 16, 2011 at 9:46 PM, Yonik Seeley wrote: > On Wed, Mar 16, 20

solr on the cloud

2011-03-22 Thread Dmitry Kan
/solr/SolrCloud ? Basically, of high interest is checking out the Map-Reduce for distributed faceting, is it even possible with the trunk? -- Regards, Dmitry Kan

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
Basically, of high interest is checking out the Map-Reduce for > distributed > > faceting, is it even possible with the trunk? > > Hm, MR for distributed faceting? Maybe I missed this... can you point to a > place that mentions this? > > Otis > > Sematext :: http:

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
Tue, Mar 22, 2011 at 7:51 AM, Dmitry Kan wrote: > > Basically, of high interest is checking out the Map-Reduce for > distributed > > faceting, is it even possible with the trunk? > > Solr already has distributed faceting, and it's much more performant > than a map-reduce i

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
ystems (is that what you mean when you refer to "distribution of facet requests across hosts"?) On Fri, Mar 25, 2011 at 1:57 PM, Upayavira wrote: > > > On Fri, 25 Mar 2011 13:44 +0200, "Dmitry Kan" > wrote: > > Hi Yonik, > > > > Oh, this is grea

Re: Create 2 index with solr

2011-03-25 Thread Dmitry Kan
r and generate 2 index. > > So I don't know how I could generate 2 different index? > > Thank you for your help. > > Amel. > -- Regards, Dmitry Kan

Re: solr on the cloud

2011-03-25 Thread Dmitry Kan
f > commodity machines)", I can't actually picture what precisely you mean... > > > Otis > --- > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosystem search :: http://search-lucene.com/ > > > > - Original Message > > From: Dm

Re: solr on the cloud

2011-03-26 Thread Dmitry Kan
ven > have a reasonable solution for reading the index [randomly] out of > HDFS. Benchmarking'll be implemented next. It's not production > ready, suggestions are welcome. > > Jason > > On Fri, Mar 25, 2011 at 2:03 PM, Dmitry Kan wrote: > > Hi Otis, > >

Re: Form too large ...

2011-03-30 Thread Dmitry Kan
inute - 4GB Xmx > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Form-too-large-tp2752676p2752676.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

stemming for English

2011-05-03 Thread Dmitry Kan
roach to handle such situations, is going all the way to modifying the Porter stemming source code the best choice? -- Regards, Dmitry Kan

Re: stemming for English

2011-05-03 Thread Dmitry Kan
o use stemming on both side (index and query) if you > really want to use stemming. > > Ludovic > > 2011/5/3 Dmitry Kan [via Lucene] < > ml-node+2893599-894006307-383...@n3.nabble.com> > > > Dear list, > > > > In SOLR schema on the index side we use no

Re: stemming for English

2011-05-03 Thread Dmitry Kan
and consistent wildcard support > in the same field. > To me, you have to create 2 different fields. > > Ludovic. > > 2011/5/3 Dmitry Kan [via Lucene] < > ml-node+2893628-993677979-383...@n3.nabble.com> > > > Hi Ludovic, > > > > That's an

Re: stemming for English

2011-05-04 Thread Dmitry Kan
t does *not* double your index size at all. > > -Original Message----- > From: Dmitry Kan [mailto:dmitry@gmail.com] > Sent: Tuesday, May 03, 2011 4:06 AM > To: solr-user@lucene.apache.org > Subject: Re: stemming for English > > Yes, Ludovic. Thus effectively we get i

Re: Want to Delete Existing Index & create fresh index

2011-05-14 Thread Dmitry Kan
Hi Pawan, Which SOLR version do you have installed? It is absolutely normal for the data/ subdirectory to be created when SOLR starts up. So just go ahead and post your data into SOLR, if you have changed the schema already. -- Regards, Dmitry Kan On Sat, May 14, 2011 at 4:01 PM

boolean versus non-boolean search

2011-05-16 Thread Dmitry Kan
" produces the same results as the original query and the same debug output. Why is that? -- Regards, Dmitry Kan

Re: boolean versus non-boolean search

2011-05-16 Thread Dmitry Kan
n term1 and term2, since no > explicit operator was provided. > > Then we get to the one you specifically did add the AND in. I guess that it > basically groups left-to-right. So: > > term1 AND term2 OR X OR Y > > is the same as: > > term1 AND (term2 OR (X OR Y)) >

sorting on date field in facet query

2011-05-18 Thread Dmitry Kan
Hello list, Is it possible to sort on date field in a facet query in SOLR 3.1? -- Regards, Dmitry Kan

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
3 PM, Erick Erickson wrote: > Can you provide an example of what you are trying to do? Are you > referring to ordering the result set or the facet information? > > Best > Erick > > On Wed, May 18, 2011 at 7:21 AM, Dmitry Kan wrote: > > Hello list, > > > > Is it

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
so, sort by lowest date? highest date? i guess, that would/could > not really work. > > Perhaps we have to ask another Question .. what are you trying to > achieve? "Boost by Date"? > > Regards > Stefan > > On Thu, May 19, 2011 at 2:24 PM, Dmitry Kan wrote: >

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
ess you could easily > transform it back for proper display. Although watch out for > leading zeros! > > Thinking off the top of my head here > > Erick > > On Thu, May 19, 2011 at 9:46 AM, Dmitry Kan wrote: > > Hi, > > > > Thanks for the questions, guys, and sorry f

Re: sorting on date field in facet query

2011-05-19 Thread Dmitry Kan
iew this message in context: > http://lucene.472066.n3.nabble.com/sorting-on-date-field-in-facet-query-tp2956540p2961612.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

query parser and other query filters

2011-05-23 Thread Dmitry Kan
way to tell SOLR to execute the query parser last, after all other filters have processed the query? -- Regards, Dmitry Kan

Re: Termscomponent sort question

2011-05-25 Thread Dmitry Kan
would the same behavior with TermComponent. > > Is it possible? > > Thanks. > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Termscomponent-sort-question-tp2980683p2980683.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: What is omitNorms

2011-05-26 Thread Dmitry Kan
Hi Chandan, What would be the default value for omitNorms? Is the general advice to ignore this and set the value explicitly? --Dmitry On Thu, May 26, 2011 at 9:37 AM, Chandan Tamrakar < chandan.tamra...@nepasoft.com> wrote: > When you say "omitnorms=true" for any fields it means SOLR will not >

Re: What is omitNorms

2011-05-26 Thread Dmitry Kan
tp2987547p2988714.html > Sent from the Solr - User mailing list archive at Nabble.com. > -- Regards, Dmitry Kan

Re: Termscomponent sort question

2011-05-26 Thread Dmitry Kan
u, May 26, 2011 at 5:11 PM, antonio wrote: > Hi Dmitry Kan, thanks for your answer. > This is an idea, but i think that will be not so performing. Because if the > terms are 1000, i must reorder 1000 terms by own length, and i think the > time will be high for make autocomplete.

query routing with shards

2011-06-02 Thread Dmitry Kan
routing of queries to the shards they map to. Can you please give some pointers as to what would be an optimal way to achieve such a routing inside the front end solr? Is there a way to configure mapping inside the solrconfig? Thanks. -- Regards, Dmitry Kan

Re: query routing with shards

2011-06-02 Thread Dmitry Kan
ng Solr? > > Otis > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosystem search :: http://search-lucene.com/ > > > > - Original Message > > From: Dmitry Kan > > To: solr-user@lucene.apache.org; yo...@lucidimagination.c

Re: query routing with shards

2011-06-03 Thread Dmitry Kan
the request, it has the correct shard in > there > already. > > Otis > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosystem search :: http://search-lucene.com/ > > > > - Original Message > > From: Dmitry Kan > > To

Re: query routing with shards

2011-06-03 Thread Dmitry Kan
osystem search :: http://search-lucene.com/ > > > > - Original Message > > From: Dmitry Kan > > To: solr-user@lucene.apache.org > > Sent: Fri, June 3, 2011 12:56:15 PM > > Subject: Re: query routing with shards > > > > Hi Otis, > >

Re: Hitting the URI limit, how to get around this?

2011-06-03 Thread Dmitry Kan
o I cannot have a shard list in the config > files > > unless I can somehow update them while the system is running. > > > > -- > > View this message in context: > > http://lucene.472066.n3.nabble.com/Hitting-the-URI-limit-how-to-get-around-t > his-tp3017837p3020185.html > > Sent from the Solr - User mailing list archive at Nabble.com. > > -- > Ken Krugler > +1 530-210-6378 > http://bixolabs.com > custom data mining solutions > > > > > > > > > > -- Regards, Dmitry Kan

huge shards (300GB each) and load balancing

2011-06-08 Thread Dmitry Kan
point. -- Regards, Dmitry Kan

Re: huge shards (300GB each) and load balancing

2011-06-08 Thread Dmitry Kan
have other experience / pointers to load balancer software that was tried with SOLR? Dmitry On Wed, Jun 8, 2011 at 12:32 PM, Upayavira wrote: > > > On Wed, 08 Jun 2011 10:42 +0300, "Dmitry Kan" > wrote: > > Hello list, > > > > Thanks for attending to my prev

Re: huge shards (300GB each) and load balancing

2011-06-08 Thread Dmitry Kan
tication. Nginx and others can also do load balancing. > > Bill Bell > Sent from mobile > > > On Jun 8, 2011, at 3:32 AM, "Upayavira" wrote: > > > > > > > On Wed, 08 Jun 2011 10:42 +0300, "Dmitry Kan" > > wrote: > >> Hell

Re: huge shards (300GB each) and load balancing

2011-06-14 Thread Dmitry Kan
ogs/large-scale-search/too-many-words-againfor > details) We later ran into memory problems when indexing so instead > changed the index time parameter termIndexInterval from 128 to 1024. > > (More details here: http://www.hathitrust.org/blogs/large-scale-search) > > Tom Burton-West > > -- Regards, Dmitry Kan

Re: query parsing - removes a term

2011-06-14 Thread Dmitry Kan
ry=on I find that the parsedquery is only sending in > the > +text:(wasp) on parsing, and doesn't use the "was" value. Why is it > removing one of the terms? > > Thanks! > Andrea > -- Regards, Dmitry Kan

Re: query routing with shards

2011-06-16 Thread Dmitry Kan
the > shards > param. > > Otis > > Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch > Lucene ecosystem search :: http://search-lucene.com/ > > > > - Original Message > > From: Dmitry Kan > > To: solr-user@lucene.apache.org >

Re: Showing facet of first N docs

2011-06-16 Thread Dmitry Kan
> Tommaso > -- Regards, Dmitry Kan

Re: query routing with shards

2011-06-16 Thread Dmitry Kan
Hi Otis, I have fixed it by assigning the value to rb same as assigned to sreq: rb.shards = shards.toString().split(","); not tested that fully yet, but distributed faceting works at least on my pc _3 shards 1 router_ setup. Dmitry On Thu, Jun 16, 2011 at 4:53 PM, Dmitry Kan wr

Re: Solr and Tag Cloud

2011-06-18 Thread Dmitry Kan
sequence (ngram) length. On Sat, Jun 18, 2011 at 7:12 AM, Jamie Johnson wrote: > Does anyone have details of how to generate a tag cloud of popular terms > across an entire data set and then also across a query? > -- Regards, Dmitry Kan

Re: Showing facet of first N docs

2011-06-18 Thread Dmitry Kan
ing : > > 2 > 2 > > You can imagine this like a 'give me only facets related to the most > relevant docs in the results' functionality. > Any idea on how to do that? > Tommaso > > > 2011/6/16 Dmitry Kan > > > http://wiki.apache.org/solr/Simpl

Re: Garbage Collection: I have given bad advice in the past!

2011-06-24 Thread Dmitry Kan
am thinking of increasing the NewSize yet again, because the tenured > generation (1.5GB in size) is only one third utilized after nearly 24 hours. > > My settings will probably not work for everyone, but I hope this post will > make it easier for others to find the right solution for themselves. > > Thanks, > Shawn > > -- Regards, Dmitry Kan

Re: testing subscription.

2011-06-24 Thread Dmitry Kan
passed On Thu, Jun 23, 2011 at 10:38 PM, Esteban Donato wrote: > > -- Regards, Dmitry Kan
