Nested documents vs. flattening document structure?

2018-03-06 Thread Dc Tech
We are evaluating using nested documents vs. simply flattening the document. Looking through the documentation, it is not very clear to me if the nested documents are fully mature, and support the full richness of SOLR (streaming, mature faceting) etc... Any opinions or guidance on that? For *

Re: Nested documents vs. flattening document structure?

2018-03-06 Thread Dc Tech
with than flattened docs. > > Best, > Erick > > On Tue, Mar 6, 2018 at 6:48 AM, Dc Tech wrote: > > We are evaluating using nested documents vs. simply flattening the > document. > > > > Looking through the documentation, it is not very clear to me if the > nested

MoreLikeThis - Odd results - what am I doing wrong?

2013-03-31 Thread dc tech
I am running some experiments on more like this and the results seem rather odd - I am doing something wrong but just cannot figure out what. Basically, the similarity results are decent - but not great. *Issue 1 = Quality* Toyota Camry : finds Altima (good) but then next one is Camry Hybrid wher

Re: MoreLikeThis - Odd results - what am I doing wrong?

2013-03-31 Thread dc tech
he top (tf should be 1 for your > data). > Enable &debugQuery=true and look at explain section to see show score is > getting calculated. > > You should try giving different boosts to class, type, drive, size to > control the results. > > > On Sun, Mar 31, 2013 at 8:52 P

Re: MoreLikeThis - Odd results - what am I doing wrong?

2013-04-01 Thread DC tech
Amazon ec2 to bring up your solr, you should be >able to get a micro instance for free trial. > > >On Mon, Apr 1, 2013 at 5:10 AM, dc tech wrote: > >> I did try the raw query against the *simi* field and those seem to return >> results in the order expected. >>

RE: MoreLikeThis - Odd results - what am I doing wrong?

2013-04-03 Thread DC tech
ith a password >(note that the basic password authentication sends passwords in clear text if >you're not using HTTPS, best lock the thing down behind a firewall). > >Dave > > >-Original Message- >From: DC tech [mailto:dctech1...@gmail.com] >Sent: Tuesday, April 02,

Re: using edismax without velocity

2013-04-06 Thread DC tech
Definitely in 4.x release. Did you try it and found a problem?

Boost parameter with query function - how to pass in complex params?

2013-04-06 Thread dc tech
See example below 1. Search for SUVs and boost Honda models q=suv&boost=query({! v='honda'},1) 2. Search for SUVs and boost Honda OR toyota model a) Using OR in the query does NOT work q=suv&boost=query({! v='honda or toyota'},1) b) Using two query functions and summing the boosts DOES w

Re: Boost parameter with query function - how to pass in complex params?

2013-04-07 Thread dc tech
6, 2013 at 10:19 AM, Yonik Seeley wrote: > On Sat, Apr 6, 2013 at 9:42 AM, dc tech wrote: > > See example below > > 1. Search for SUVs and boost Honda models > > q=suv&boost=query({! v='honda'},1) > > > > 2. Search for SUVs and boost Honda OR t

FYI - Excel to generate schema and SolrConfig

2013-04-07 Thread dc tech
To minimize my own typing when setting up SOLR schema or config, I created a simple Excel that can reduce the amount of typing that is required. Please feel free to use it if you find it useful. solr_schema_shared.xlsx Description: application/vnd.openxmlformats-officedocument.spreadsheetml.shee

Re: Boost parameter with query function - how to pass in complex params?

2013-04-07 Thread dc tech
honda text,score suv query($boostq,1) true edismax DEBUG: suv suv BoostedQuery(boost(+(text:suv),query(text:toyota text:honda,def=1.0))) boost(+(text:suv),query(text:toyota text:honda,def=1.0)) On Sun, Apr 7, 2013 at 9:07 AM, Yonik Seeley wrote: > On Sun, Apr 7, 2013 at 8:39 AM, dc

Coming back to search after some time... SOLR or Elastic for text search?

2020-01-14 Thread Dc Tech
I am SOLR fant and had implemented it in our company over 10 years ago. I moved away from that role and the new search team in the meanwhile implemented a proprietary (and expensive) nosql style search engine. That the project did not go well, and now I am back to project and reviewing the technolo

Re: Coming back to search after some time... SOLR or Elastic for text search?

2020-01-15 Thread Dc Tech
the other. Or maybe after that exercise you are still wondering > what to choose, in which case you just follow your gut feeling and make a > choice :) > > Jan > >> 15. jan. 2020 kl. 10:07 skrev Charlie Hull : >> >>> On 15/01/2020 04:02, Dc Tech wrote: &g

Re: Score cutoff

2010-05-04 Thread dc tech
Michael, The cutoff filter would be very useful for us as well. We want to use it for more like this feature where only the top n similar docs tend to be reallt similar. On 5/4/10, Michael Kuhlmann wrote: > Am 03.05.2010 23:32, schrieb Satish Kumar: >> Hi, >> >> Can someone give clues on how to

SOLR Based Search - Response Times - what do you consider slow or fast?

2010-05-04 Thread dc tech
We are using SOLR in a production setup with a jRuby on Rails front end with about 20 different instances of SOLR running on heavy duty hardware. The setup is load balanced front end (jRoR) on a pair of machines and the SOLR backends on a different machine. We have plenty of memory and CPU and th

Re: Personalized Search

2010-05-20 Thread dc tech
Another approach would be to do query time boosts of 'my' items under the assumption that count is limited: - keep the SOLR index independent of bought/like - have a db table with user prefs on a per item basis - at query time, specify boosts for 'my items' items We are planning to do this in the

Re: Personalized Search

2010-05-21 Thread dc tech
In our specific case, we would get the user's folders and then do a function query that provides a boost if the document.folder is in {my folder list}. Another approach that will work for our intranet use is to add the userids in a multi-valued field as others have suggested. On 5/20/10, MitchK

Re: Personalized Search

2010-05-21 Thread dc tech
it scale if users already favorited/liked hundreds of item? The query > can be quite long. > > Looking forward to your idea. > > > > On Thu, May 20, 2010 at 6:37 PM, dc tech wrote: > >> Another approach would be to do query time boosts of 'my' items under >

Re: Performance issues when querying on large documents

2010-07-24 Thread dc tech
Are you storing the full 1,000 pages in the index? If so, that is probably not helping either. On 7/23/10, ahammad wrote: > > Hello, > > I have an index with lots of different types of documents. One of those > types basically contains extracts of PDF docs. Some of those PDFs can have > 1000+ pag

Re: Solr using 1500 threads - is that normal?

2010-07-28 Thread dc tech
1,500 threads seems extreme by any standards so there is something happening in your install. Even with appservers for web apps, typically 100 would be a fair # of threads. On 7/28/10, Christos Constantinou wrote: > Hi, > > Solr seems to be crashing after a JVM exception that new threads cannot

Re: Solr searching performance issues, using large documents

2010-07-29 Thread dc tech
Are you storing the entire log file text in SOLR? That's almost 3gb of text that you are storing in the SOLR. Try to 1) Is this first time performance or on repaat queries with the same fields? 2) Optimze the index and test performance again 3) index without storing the text and see what the perfor

Re: want to display elevated results on my display result screen differently.

2010-08-03 Thread dc tech
Have you looked at the relevance scores? I would speculate elevate matches would have constant, high score. On 8/3/10, Vishal.Arora wrote: > > Suppose i have elevate.xml file and i elevate the ID :- Artist:11650 and > Artist:510 when i search for corgan > this is elevate File > >

Re: Facet Fields - ID vs. Display Value

2010-08-09 Thread dc tech
I think depends on what you need: 1) Simple,unique category - use display facet 2) Categories may be duplicate from display perspective (eg authors) : store display#id in facet field but show only display 3) Internationalization requirements - store I'd but have ui pull and display the translated l

Re: Indexing Hanging during GC?

2010-08-12 Thread dc tech
I am a little confused - how did 180k documents become 100m index documents? We use have over 20 indices (for different content sets), one with 5m documents (about a couple of pages each) and another with 100k+ docs. We can index the 5m collection in a couple of days (limitation is in the source) w

Re: Indexing Hanging during GC?

2010-08-12 Thread dc tech
; breaking it GC-wise. > > i'm going to try adding in System.gc() calls to see if this runs ok > (albeit slower)... > otherwise i'm pretty much at a loss as to what could be causing this GC > issue/ > solr hanging if it's not a GC issue... > > thanks :) >

Re: solr benchmarks

2011-01-03 Thread dc tech
Tri: What is the volume of content (# of documents) and index size you are expecting? What about the document complexity in terms of # of fields, what are you storing in the index, complexity of the queries etc? We have used SOLR with 10m documents with 1-3 second response times on the front end