Anyone?

On Mon, Mar 17, 2014 at 12:03 PM, Salman Akram <
salman.ak...@northbaysolutions.net> wrote:

> Below is one of the sample slow query that takes mins!
>
> ((stock or share*) w/10 (sale or sell* or sold or bought or buy* or
> purchase* or repurchase*)) w/10 (executive or director)
>
> If a filter is used it comes in fq but what can be done about plain
> keyword search?
>
>
> On Sun, Mar 16, 2014 at 4:37 AM, Erick Erickson 
> <erickerick...@gmail.com>wrote:
>
>> What are our complex queries? You
>> say that your app will very rarely see the
>> same query thus you aren't using caches...
>> But, if you can move some of your
>> clauses to fq clauses, then the filterCache
>> might well be used to good effect.
>>
>>
>>
>> On Thu, Mar 13, 2014 at 7:22 AM, Salman Akram
>> <salman.ak...@northbaysolutions.net> wrote:
>> > 1- SOLR 4.6
>> > 2- We do but right now I am talking about plain keyword queries just
>> sorted
>> > by date. Once this is better will start looking into caches which we
>> > already changed a little.
>> > 3- As I said the contents are not stored in this index. Some other
>> metadata
>> > fields are but with normal queries its super fast so I guess even if I
>> > change there it will be a minor difference. We have SSD and quite fast
>> too.
>> > 4- That's something we need to do but even in low workload those queries
>> > take a lot of time
>> > 5- Every 10 mins and currently no auto warming as user queries are
>> rarely
>> > same and also once its fully warmed those queries are still slow.
>> > 6- Nops.
>> >
>> > On Thu, Mar 13, 2014 at 5:38 PM, Dmitry Kan <solrexp...@gmail.com>
>> wrote:
>> >
>> >> 1. What is your solr version? In 4.x family the proximity searches have
>> >> been optimized among other query types.
>> >> 2. Do you use the filter queries? What is the situation with the cache
>> >> utilization ratios? Optimize (= i.e. bump up the respective cache
>> sizes) if
>> >> you have low hitratios and many evictions.
>> >> 3. Can you avoid storing some fields and only index them? When the
>> field is
>> >> stored and it is retrieved in the result, there are couple of disk
>> seeks
>> >> per field=> search slows down. Consider SSD disks.
>> >> 4. Do you monitor your system in terms of RAM / cache stats / GC? Do
>> you
>> >> observe STW GC pauses?
>> >> 5. How often do you commit & do you have the autowarming / external
>> warming
>> >> configured?
>> >> 6. If you use faceting, consider storing DocValues for facet fields.
>> >>
>> >> some solr wiki docs:
>> >>
>> >>
>> https://wiki.apache.org/solr/SolrPerformanceProblems?highlight=%28%28SolrPerformanceFactors%29%29
>> >>
>> >>
>> >>
>> >>
>> >>
>> >> On Thu, Mar 13, 2014 at 8:52 AM, Salman Akram <
>> >> salman.ak...@northbaysolutions.net> wrote:
>> >>
>> >> > Well some of the searches take minutes.
>> >> >
>> >> > Below are some stats about this particular index that I am talking
>> about:
>> >> >
>> >> > Index size = 400GB (Using CommonGrams so without that the index is
>> around
>> >> > 180GB)
>> >> > Position File = 280GB
>> >> > Total Docs = 170 million (just indexed for searching - for
>> highlighting
>> >> > contents are stored in another index)
>> >> > Avg Doc Size = Few hundred KBs
>> >> > RAM = 384GB (it has other indexes too but still OS cache can have
>> 60-80%
>> >> of
>> >> > the total index cached)
>> >> >
>> >> > Phrase queries run pretty fast with CG but complex versions of
>> wildcard
>> >> and
>> >> > proximity queries can be really slow. I know using CG will make them
>> slow
>> >> > but they just take too long. By default sorting is on date but users
>> have
>> >> > few other parameters too on which they can sort.
>> >> >
>> >> > I wanted to avoid creating multiple indexes (maybe based on years)
>> but
>> >> > seems that to search on partial data that's the only feasible way.
>> >> >
>> >> >
>> >> >
>> >> >
>> >> > On Wed, Mar 12, 2014 at 2:47 PM, Dmitry Kan <solrexp...@gmail.com>
>> >> wrote:
>> >> >
>> >> > > As Hoss pointed out above, different projects have different
>> >> > requirements.
>> >> > > Some want to sort by date of ingestion reverse, which means that
>> having
>> >> > > posting lists organized in a reverse order with the early
>> termination
>> >> is
>> >> > > the way to go (no such feature in Solr directly). Some other
>> projects
>> >> > want
>> >> > > to collect all docs matching a query, and then sort by rank, but
>> you
>> >> > cannot
>> >> > > guarantee, that the most recently inserted document is the most
>> >> relevant
>> >> > in
>> >> > > terms of your ranking.
>> >> > >
>> >> > >
>> >> > > Do your current searches take too long?
>> >> > >
>> >> > >
>> >> > > On Tue, Mar 11, 2014 at 11:51 AM, Salman Akram <
>> >> > > salman.ak...@northbaysolutions.net> wrote:
>> >> > >
>> >> > > > Its a long video and I will definitely go through it but it seems
>> >> this
>> >> > is
>> >> > > > not possible with SOLR as it is?
>> >> > > >
>> >> > > > I just thought it would be quite a common issue; I mean
>> generally for
>> >> > > > search engines its more important to show the first page results,
>> >> > rather
>> >> > > > than using timeAllowed which might not even return a single
>> result.
>> >> > > >
>> >> > > > Thanks!
>> >> > > >
>> >> > > >
>> >> > > > --
>> >> > > > Regards,
>> >> > > >
>> >> > > > Salman Akram
>> >> > > >
>> >> > >
>> >> > >
>> >> > >
>> >> > > --
>> >> > > Dmitry
>> >> > > Blog: http://dmitrykan.blogspot.com
>> >> > > Twitter: http://twitter.com/dmitrykan
>> >> > >
>> >> >
>> >> >
>> >> >
>> >> > --
>> >> > Regards,
>> >> >
>> >> > Salman Akram
>> >> >
>> >>
>> >>
>> >>
>> >> --
>> >> Dmitry
>> >> Blog: http://dmitrykan.blogspot.com
>> >> Twitter: http://twitter.com/dmitrykan
>> >>
>> >
>> >
>> >
>> > --
>> > Regards,
>> >
>> > Salman Akram
>>
>
>
>
> --
> Regards,
>
> Salman Akram
>
>


-- 
Regards,

Salman Akram

Reply via email to