Re: Arabic words search in solr

2017-02-20 Thread mohanmca01
Hi Steve,

I changed the order of the ICU folding filter and re-indexed the entire Arabic
content, but the problem is still present. I am not able to get the expected
results.

I have attached a screenshot for your reference.

Kindly check and let me know.

Thanks



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Arabic-words-search-in-solr-tp4317733p4321397.html
Sent from the Solr - User mailing list archive at Nabble.com.


SSL Problem solr 6.4.1 - Error from shard - ADDREPLICA failed to create replica

2017-02-20 Thread Marcel Berteler
We are trying to get Solr 6.4.1 to run on a Windows 10 server, but for some
reason it does not function properly. On my personal Windows 7 machine it works
like a dream.

We configured Solr to use SSL and run on port 443 on localhost in cloud mode,
using the built-in ZooKeeper.

All was working well, and after starting the server for the first time we added
the urlScheme (https) as a cluster property.

I also ensured that -Dsolr.ssl.checkPeerName=false is added to the Solr
properties in solr.in.cmd.

During startup there are no SSL, password or keystore errors.

I can browse to the admin UI and all seems fine.

But as soon as I try to use the BACKUP or RESTORE functions, I get the
weirdest errors indicating that my SSL passwords are incorrect, and the process
fails.

The passwords are correct; otherwise Solr would not start, nor would I be able
to use the admin UI. To me this indicates that whatever performs the actual
backup and restore is not reading the correct passwords.

https://localhost/solr/admin/collections?action=RESTORE=coct20170214=/solr=coct20170214=coct
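
For reference, a Collections API RESTORE request normally names each parameter explicitly (action, name, location, collection). The sketch below only builds such a URL. The helper name restore_url and all values passed to it are placeholders for illustration, not the values from the request above.

```python
from urllib.parse import urlencode


def restore_url(base_url, backup_name, location, collection):
    """Build a Collections API RESTORE URL (illustrative helper)."""
    params = {
        "action": "RESTORE",
        "name": backup_name,       # name of the backup to restore
        "location": location,      # directory that holds the backup
        "collection": collection,  # collection to restore into
    }
    return f"{base_url}/admin/collections?{urlencode(params)}"


# Placeholder values, assumed for the example only.
url = restore_url(
    "https://localhost/solr",
    "example_backup",
    "/path/to/backups",
    "example_collection",
)
print(url)
```

urlencode percent-encodes the slashes in the location value, which keeps the path from being confused with the URL structure.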

Is there anybody who has a clue why this happens?

Here is an extract of the logs:


20 February 2017 09:00:13 AM ERROR true  OverseerCollectionMessageHandler Error from shard: https://localhost:443/solr

Caused by: java.io.IOException: Keystore was tampered with, or password was incorrect
at sun.security.provider.JavaKeyStore.engineLoad(Unknown Source)
at sun.security.provider.JavaKeyStore$JKS.engineLoad(Unknown Source)
at java.security.KeyStore.load(Unknown Source)
at sun.security.ssl.SSLContextImpl$DefaultSSLContext.getDefaultKeyManager(Unknown Source)
at sun.security.ssl.SSLContextImpl$DefaultSSLContext.<init>(Unknown Source)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(Unknown Source)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(Unknown Source)
at java.lang.reflect.Constructor.newInstance(Unknown Source)
... 55 more
Caused by: java.security.UnrecoverableKeyException: Password verification failed
... 64 more

20 February 2017 09:00:13 AM ERROR false  OverseerCollectionMessageHandler Collection: coct20170214 operation: restore failed:org.apache.solr.common.SolrException: ADDREPLICA failed to create replica

20 February 2017 09:00:13 AM ERROR false  HttpSolrCall null:org.apache.solr.common.SolrException: ADDREPLICA failed to create replica


Kind regards,

Marcel Berteler



jetty qtp thread many

2017-02-20 Thread alias
I started Solr with the default Jetty container, but when I look through JMX I
find a lot of Jetty qtp threads. Why?
When I run stress tests, the "Total waiting" count keeps getting bigger.


Name: qtp1209271652-17
Status: TIMED_WAITING on java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@39bc254b
Total number of blocks: 51, Total waiting: 269


Stack trace:
sun.misc.Unsafe.park(Native Method)
java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:215)
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2078)
org.eclipse.jetty.util.BlockingArrayQueue.poll(BlockingArrayQueue.java:392)
org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:546)
org.eclipse.jetty.util.thread.QueuedThreadPool.access$800(QueuedThreadPool.java:47)
org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:609)
java.lang.Thread.run(Thread.java:745)




--- The following was also captured during the stress test


Status: TIMED_WAITING on java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject@57e34b9e
Total number of blocks: 1, Total number of waiting: 814


Stack trace:
sun.misc.Unsafe.park(Native Method)
java.util.concurrent.locks.LockSupport.parkUntil(LockSupport.java:256)
java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitUntil(AbstractQueuedSynchronizer.java:2120)
org.apache.http.pool.PoolEntryFuture.await(PoolEntryFuture.java:136)
org.apache.http.pool.AbstractConnPool.getPoolEntryBlocking(AbstractConnPool.java:306)
org.apache.http.pool.AbstractConnPool.access$000(AbstractConnPool.java:64)
org.apache.http.pool.AbstractConnPool$2.getPoolEntry(AbstractConnPool.java:192)
org.apache.http.pool.AbstractConnPool$2.getPoolEntry(AbstractConnPool.java:185)
org.apache.http.pool.PoolEntryFuture.get(PoolEntryFuture.java:107)
org.apache.http.impl.conn.PoolingClientConnectionManager.leaseConnection(PoolingClientConnectionManager.java:208)
org.apache.http.impl.conn.PoolingClientConnectionManager$1.getConnection(PoolingClientConnectionManager.java:195)
org.apache.http.impl.client.DefaultRequestDirector.execute(DefaultRequestDirector.java:423)
org.apache.http.impl.client.AbstractHttpClient.doExecute(AbstractHttpClient.java:882)
org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:82)
org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:55)
org.apache.solr.client.solrj.impl.HttpSolrClient.executeMethod(HttpSolrClient.java:498)
org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:262)
org.apache.solr.client.solrj.impl.HttpSolrClient.request(HttpSolrClient.java:251)
org.apache.solr.client.solrj.impl.LBHttpSolrClient.doRequest(LBHttpSolrClient.java:435)
org.apache.solr.client.solrj.impl.LBHttpSolrClient.request(LBHttpSolrClient.java:387)
org.apache.solr.handler.component.HttpShardHandlerFactory.makeLoadBalancedRequest(HttpShardHandlerFactory.java:275)
org.apache.solr.handler.component.HttpShardHandler.lambda$submit$0(HttpShardHandler.java:198)
org.apache.solr.handler.component.HttpShardHandler$$Lambda$107/1277830087.call(Unknown Source)
java.util.concurrent.FutureTask.run(FutureTask.java:266)
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
java.util.concurrent.FutureTask.run(FutureTask.java:266)
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor.lambda$execute$0(ExecutorUtil.java:229)
org.apache.solr.common.util.ExecutorUtil$MDCAwareThreadPoolExecutor$$Lambda$9/572523235.run(Unknown Source)
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
java.lang.Thread.run(Thread.java:745)


Does anyone know the reason? Thank you very much.
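
The two stacks in this message wait in different places: one is parked in QueuedThreadPool.idleJobPoll, where a Jetty pool worker waits for its next job, and the other in AbstractConnPool.getPoolEntryBlocking, where HttpClient waits to lease a pooled connection. A rough sketch for sorting dumped threads into these two groups could look like the following. It assumes each stack is available as a list of frame strings; the function name and labels are illustrative only.

```python
def classify_thread(stack_frames):
    """Return a coarse label for a thread based on well-known frames."""
    joined = "\n".join(stack_frames)
    if "QueuedThreadPool.idleJobPoll" in joined:
        # Jetty worker with nothing to do, waiting for a job.
        return "idle Jetty pool thread"
    if "AbstractConnPool.getPoolEntryBlocking" in joined:
        # Request thread waiting for a free connection to another node.
        return "waiting for a pooled HTTP connection"
    return "other"


idle_stack = [
    "sun.misc.Unsafe.park(Native Method)",
    "org.eclipse.jetty.util.thread.QueuedThreadPool.idleJobPoll(QueuedThreadPool.java:546)",
]
lease_stack = [
    "sun.misc.Unsafe.park(Native Method)",
    "org.apache.http.pool.AbstractConnPool.getPoolEntryBlocking(AbstractConnPool.java:306)",
]

labels = [classify_thread(s) for s in (idle_stack, lease_stack)]
print(labels)
```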

CPU Intensive Scoring Alternatives

2017-02-20 Thread Fuad Efendi
Hello,


The default TF-IDF scoring performs poorly with our index of 200 million
documents. The query "Michael Jackson" may run in 300 ms, and "Michael The
Jackson" takes over 3 seconds. We use eDisMax. Because of the default operator
"OR" and the stopword "The", we get 50-70 million documents as the query
result, and scoring is CPU intensive. What can we do? Our typical queries
return over a million documents, and response times of simple queries range
from 50 milliseconds to 5-10 seconds depending on the size of the result set.

This was just an exaggerated example with the stopword "the", but even the
simplest query "Michael Jackson" runs in 300 ms instead of 3 ms just because of
the huge number of hits and TF-IDF calculations. We are on Solr 6.3.
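
One way to check how much the number of hits drives the latency is to run the same eDisMax query once with q.op=OR and once with q.op=AND and compare the timings. The sketch below only builds the two parameter sets. The helper name is made up, rows=10 is an arbitrary choice, and debug=timing is added on the assumption that per-component timings are wanted.

```python
from urllib.parse import urlencode


def edismax_params(query, default_operator):
    """Build request parameters for an eDisMax query (illustrative helper)."""
    return {
        "defType": "edismax",
        "q": query,
        "q.op": default_operator,  # "OR" matches any term, "AND" requires all
        "rows": 10,
        "debug": "timing",         # ask Solr to report per-component timings
    }


query_strings = {
    op: urlencode(edismax_params("Michael The Jackson", op))
    for op in ("OR", "AND")
}
for op, qs in query_strings.items():
    print(op, qs)
```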


Thanks,

--

Fuad Efendi

(416) 993-2060

http://www.tokenizer.ca
Search Relevancy, Recommender Systems


Re: Sorl 6 with jetty issues

2017-02-20 Thread ~$alpha`
I am facing a similar issue on Solr 6.1.0.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sorl-6-with-jetty-issues-tp4321291p4321308.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Sorl 6 with jetty issues

2017-02-20 Thread Walter Underwood
Use Solr 6.3.0. For us, 6.4.x is using about 10X as much CPU under heavy query 
load.

wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/  (my blog)


> On Feb 20, 2017, at 5:11 AM, Michael Kuhlmann  wrote:
> 
> This may be related to SOLR-10130.
> 
> Am 20.02.2017 um 14:06 schrieb ~$alpha`:
>> Issues with solr settings while migrating from solr 4.0 to solr6.0.
>> 
>> Issue Faced: My CPU consumption goes to unacceptable levels. ie. load on
>> solr4.0 is between 6 to 10 while the load on solr 6 reaches 100 and since
>> its the production i rolled back quickly.
>> 
>> My Solr4 setting
>> 
>> - Running on tomcat
>> - JVM Memory : 16GB
>> - 24 core cpu
>> - JVM settings :
>>   - JVM Runtime Java HotSpot(TM) 64-Bit Server VM (24.45-b08) 
>>   - Processors   24 
>>   - Args : Paths mentioned here
>> 
>> 
>> **My Solr6 setting**
>> 
>> - Running on jetty
>> - JVM Memory : 20GB
>> - 32 core cpu
>> - JVM settings :
>>   - Oracle Corporation Java HotSpot(TM) 64-Bit Server VM 1.8.0_45 25.45-b02
>>   - Processors   32
>>   - Args
>>  - DSTOP.KEY=solrrocks
>>  - DSTOP.PORT=7983
>>  - Djetty.home=/usr/local/solr-6.4.1/server-Djetty.port=8983
>>  -
>> Dlog4j.configuration=file:/usr/local/solr-6.4.1/example/resources/log4j.properties
>>  -
>> Dsolr.install.dir=/usr/local/solr-6.4.1-Dsolr.log.dir=/usr/local/solr-6.4.1/example/techproducts/solr/../logs
>>  - Dsolr.log.muteconsole
>>  -
>> Dsolr.solr.home=/usr/local/solr-6.4.1/example/techproducts/solr-Duser.timezone=US/Eastern
>>  - XX:+AggressiveOpts
>>  - XX:+CMSParallelRemarkEnabled
>>  - XX:+CMSScavengeBeforeRemark
>>  - XX:+ParallelRefProcEnabled
>>  - XX:+PrintGCApplicationStoppedTime
>>  - XX:+PrintGCDateStamps
>>  - XX:+PrintGCDetails
>>  - XX:+PrintGCTimeStamps
>>  - XX:+PrintHeapAtGC
>>  - XX:+PrintTenuringDistribution
>>  - XX:+UseCMSInitiatingOccupancyOnly
>>  - XX:+UseConcMarkSweepGC
>>  - XX:+UseGCLogFileRotation
>>  - XX:-UseSuperWord
>>  - XX:CMSFullGCsBeforeCompaction=1
>>  - XX:CMSInitiatingOccupancyFraction=70
>>  - XX:CMSMaxAbortablePrecleanTime=6000
>>  - XX:CMSTriggerPermRatio=80
>>  - XX:GCLogFileSize=20M
>>  - XX:MaxTenuringThreshold=8
>>  - XX:NewRatio=2
>>  - XX:NumberOfGCLogFiles=9
>>  - XX:OnOutOfMemoryError=/usr/local/solr-6.4.1/bin/oom_solr.sh 8983
>> /usr/local/solr-6.4.1/example/techproducts/solr/../logs
>>  - XX:PretenureSizeThreshold=64m
>>  - XX:SurvivorRatio=15
>>  -
>> XX:TargetSurvivorRatio=90-Xloggc:/usr/local/solr-6.4.1/example/techproducts/solr/../logs/solr_gc.log-Xms21g-Xmx21g-Xss256k-verbose:gc
>> What i looking for
>> 
>> My guess its related to gc setting of jetty as i am not expert in
>> jetty(java8).please help how to tune these settings. Also how should i
>> chosoe these values or how to to debug these issue ?
>> 
>> 
>> 
>> --
>> View this message in context: 
>> http://lucene.472066.n3.nabble.com/Sorl-6-with-jetty-issues-tp4321291.html
>> Sent from the Solr - User mailing list archive at Nabble.com.
> 
> 



Re: Sorl 6 with jetty issues

2017-02-20 Thread Michael Kuhlmann
This may be related to SOLR-10130.

Am 20.02.2017 um 14:06 schrieb ~$alpha`:
> Issues with solr settings while migrating from solr 4.0 to solr6.0.
>
> Issue Faced: My CPU consumption goes to unacceptable levels. ie. load on
> solr4.0 is between 6 to 10 while the load on solr 6 reaches 100 and since
> its the production i rolled back quickly.
>
> My Solr4 setting
>
>  - Running on tomcat
>  - JVM Memory : 16GB
>  - 24 core cpu
>  - JVM settings :
>- JVM Runtime Java HotSpot(TM) 64-Bit Server VM (24.45-b08) 
>- Processors   24 
>- Args : Paths mentioned here
>
>
> **My Solr6 setting**
>
>  - Running on jetty
>  - JVM Memory : 20GB
>  - 32 core cpu
>  - JVM settings :
>- Oracle Corporation Java HotSpot(TM) 64-Bit Server VM 1.8.0_45 25.45-b02
>- Processors   32
>- Args
>   - DSTOP.KEY=solrrocks
>   - DSTOP.PORT=7983
>   - Djetty.home=/usr/local/solr-6.4.1/server-Djetty.port=8983
>   -
> Dlog4j.configuration=file:/usr/local/solr-6.4.1/example/resources/log4j.properties
>   -
> Dsolr.install.dir=/usr/local/solr-6.4.1-Dsolr.log.dir=/usr/local/solr-6.4.1/example/techproducts/solr/../logs
>   - Dsolr.log.muteconsole
>   -
> Dsolr.solr.home=/usr/local/solr-6.4.1/example/techproducts/solr-Duser.timezone=US/Eastern
>   - XX:+AggressiveOpts
>   - XX:+CMSParallelRemarkEnabled
>   - XX:+CMSScavengeBeforeRemark
>   - XX:+ParallelRefProcEnabled
>   - XX:+PrintGCApplicationStoppedTime
>   - XX:+PrintGCDateStamps
>   - XX:+PrintGCDetails
>   - XX:+PrintGCTimeStamps
>   - XX:+PrintHeapAtGC
>   - XX:+PrintTenuringDistribution
>   - XX:+UseCMSInitiatingOccupancyOnly
>   - XX:+UseConcMarkSweepGC
>   - XX:+UseGCLogFileRotation
>   - XX:-UseSuperWord
>   - XX:CMSFullGCsBeforeCompaction=1
>   - XX:CMSInitiatingOccupancyFraction=70
>   - XX:CMSMaxAbortablePrecleanTime=6000
>   - XX:CMSTriggerPermRatio=80
>   - XX:GCLogFileSize=20M
>   - XX:MaxTenuringThreshold=8
>   - XX:NewRatio=2
>   - XX:NumberOfGCLogFiles=9
>   - XX:OnOutOfMemoryError=/usr/local/solr-6.4.1/bin/oom_solr.sh 8983
> /usr/local/solr-6.4.1/example/techproducts/solr/../logs
>   - XX:PretenureSizeThreshold=64m
>   - XX:SurvivorRatio=15
>   -
> XX:TargetSurvivorRatio=90-Xloggc:/usr/local/solr-6.4.1/example/techproducts/solr/../logs/solr_gc.log-Xms21g-Xmx21g-Xss256k-verbose:gc
> What i looking for
>
> My guess its related to gc setting of jetty as i am not expert in
> jetty(java8).please help how to tune these settings. Also how should i
> chosoe these values or how to to debug these issue ?
>
>
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/Sorl-6-with-jetty-issues-tp4321291.html
> Sent from the Solr - User mailing list archive at Nabble.com.




Sorl 6 with jetty issues

2017-02-20 Thread ~$alpha`
I am having issues with Solr settings while migrating from Solr 4.0 to Solr 6.0.

Issue faced: my CPU consumption goes to unacceptable levels, i.e. the load on
Solr 4.0 is between 6 and 10, while the load on Solr 6 reaches 100. Since
this is production, I rolled back quickly.

My Solr4 setting

 - Running on tomcat
 - JVM Memory : 16GB
 - 24 core cpu
 - JVM settings :
   - JVM Runtime Java HotSpot(TM) 64-Bit Server VM (24.45-b08) 
   - Processors   24 
   - Args : Paths mentioned here


**My Solr6 setting**

 - Running on jetty
 - JVM Memory : 20GB
 - 32 core cpu
 - JVM settings :
   - Oracle Corporation Java HotSpot(TM) 64-Bit Server VM 1.8.0_45 25.45-b02
   - Processors   32
   - Args
      - -DSTOP.KEY=solrrocks
      - -DSTOP.PORT=7983
      - -Djetty.home=/usr/local/solr-6.4.1/server
      - -Djetty.port=8983
      - -Dlog4j.configuration=file:/usr/local/solr-6.4.1/example/resources/log4j.properties
      - -Dsolr.install.dir=/usr/local/solr-6.4.1
      - -Dsolr.log.dir=/usr/local/solr-6.4.1/example/techproducts/solr/../logs
      - -Dsolr.log.muteconsole
      - -Dsolr.solr.home=/usr/local/solr-6.4.1/example/techproducts/solr
      - -Duser.timezone=US/Eastern
      - -XX:+AggressiveOpts
      - -XX:+CMSParallelRemarkEnabled
      - -XX:+CMSScavengeBeforeRemark
      - -XX:+ParallelRefProcEnabled
      - -XX:+PrintGCApplicationStoppedTime
      - -XX:+PrintGCDateStamps
      - -XX:+PrintGCDetails
      - -XX:+PrintGCTimeStamps
      - -XX:+PrintHeapAtGC
      - -XX:+PrintTenuringDistribution
      - -XX:+UseCMSInitiatingOccupancyOnly
      - -XX:+UseConcMarkSweepGC
      - -XX:+UseGCLogFileRotation
      - -XX:-UseSuperWord
      - -XX:CMSFullGCsBeforeCompaction=1
      - -XX:CMSInitiatingOccupancyFraction=70
      - -XX:CMSMaxAbortablePrecleanTime=6000
      - -XX:CMSTriggerPermRatio=80
      - -XX:GCLogFileSize=20M
      - -XX:MaxTenuringThreshold=8
      - -XX:NewRatio=2
      - -XX:NumberOfGCLogFiles=9
      - -XX:OnOutOfMemoryError=/usr/local/solr-6.4.1/bin/oom_solr.sh 8983 /usr/local/solr-6.4.1/example/techproducts/solr/../logs
      - -XX:PretenureSizeThreshold=64m
      - -XX:SurvivorRatio=15
      - -XX:TargetSurvivorRatio=90
      - -Xloggc:/usr/local/solr-6.4.1/example/techproducts/solr/../logs/solr_gc.log
      - -Xms21g
      - -Xmx21g
      - -Xss256k
      - -verbose:gc

What I am looking for

My guess is that it is related to the GC settings under Jetty, as I am not an
expert in Jetty (Java 8). Please help me tune these settings. Also, how should
I choose these values, and how do I debug this issue?
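
As a starting point for reasoning about the values, the nominal generation sizes implied by the heap and ratio flags can be worked out by hand. NewRatio is the old-to-young ratio and SurvivorRatio is the ratio of eden to one survivor space, so with a 21 GB heap, NewRatio=2 and SurvivorRatio=15 the sketch below gives the approximate split. The function name is illustrative, and the JVM rounds and aligns the real sizes, so treat the numbers as estimates.

```python
def generation_sizes(heap_gb, new_ratio, survivor_ratio):
    """Estimate nominal HotSpot generation sizes in GB from the ratio flags."""
    young = heap_gb / (new_ratio + 1)        # old : young = new_ratio : 1
    old = heap_gb - young
    survivor = young / (survivor_ratio + 2)  # eden : survivor = survivor_ratio : 1, two survivors
    eden = young - 2 * survivor
    return {"old": old, "young": young, "eden": eden, "survivor": survivor}


# Values taken from the argument list: -Xmx21g, -XX:NewRatio=2, -XX:SurvivorRatio=15
sizes = generation_sizes(heap_gb=21, new_ratio=2, survivor_ratio=15)
for name, gb in sizes.items():
    print(f"{name}: {gb:.2f} GB")
```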



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Sorl-6-with-jetty-issues-tp4321291.html
Sent from the Solr - User mailing list archive at Nabble.com.


Re: Select TOP 10 items from Solr Query

2017-02-20 Thread Emir Arnautovic
JSON facets also support subfacets: http://yonik.com/solr-subfacets/ (check
whether anything has changed in the version that you use)
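
A rough sketch of what such a request could look like: a terms facet limited to the 10 most frequent items, with sum and avg computed per bucket as nested aggregations. The field names item_id and amount are assumptions, as are the helper names, and the sample buckets are made-up data. The second function combines the per-item results on the client side into one total and one average for the top items.

```python
import json


def top_items_request(item_field, amount_field, top_n=10):
    """Build a JSON Facet request body: top N items by count, with stats."""
    return {
        "query": "*:*",
        "limit": 0,  # no documents needed, only facets
        "facet": {
            "top_items": {
                "type": "terms",
                "field": item_field,
                "limit": top_n,  # terms facets sort by count descending by default
                "facet": {
                    "total_amount": f"sum({amount_field})",
                    "avg_amount": f"avg({amount_field})",
                },
            }
        },
    }


def combine(buckets):
    """Total and average amount across all returned buckets."""
    count = sum(b["count"] for b in buckets)
    total = sum(b["total_amount"] for b in buckets)
    return total, (total / count if count else 0.0)


body = json.dumps(top_items_request("item_id", "amount"))

# Made-up buckets in the shape of a terms facet response.
sample_buckets = [
    {"val": "A", "count": 4, "total_amount": 40.0},
    {"val": "B", "count": 1, "total_amount": 10.0},
]
total, average = combine(sample_buckets)
print(body)
print(total, average)
```

The combined average divides by the bucket counts, so it is only accurate if every transaction document has a value in the amount field.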


Emir


On 20.02.2017 10:42, Zheng Lin Edwin Yeo wrote:

Hi Emir,

Thanks for your suggestion.
But I'm not sure if this could work, as I'm using JSON Facet and not the
normal faceting.

Regards,
Edwin


On 20 February 2017 at 17:24, Emir Arnautovic 
wrote:


Hi Edwin,

I am also a bit confused, but it seems to me that you could achieve what you
need with pivot faceting:
https://cwiki.apache.org/confluence/display/solr/Faceting#Faceting-Pivot(DecisionTree)Faceting

HTH,
Emir



On 18.02.2017 08:46, Zheng Lin Edwin Yeo wrote:


Although I have nested documents in the schema, I'm not looking for the top
10 parents to facet on children.

Currently, I'm only working on the information that is available in
children. In my case, each line represents a transaction, and I'm doing the
faceting based on these individual transactions.

Regards,
Edwin


On 17 February 2017 at 22:10, Alexandre Rafalovitch 
wrote:

Am I missing something or do you have nested documents in the schema?

It reads like you are looking for the top 10 parents and then want to facet on
children. Is that right?

Regards,
 Alex

On 17 Feb 2017 5:35 AM, "Zheng Lin Edwin Yeo" 
wrote:

I'm looking at JSON Facet for both type:terms and type:range.

For example, I may have 100 items in my collection, and each item can have
many transactions. But I'm only interested in the top 10 items
with the highest transaction rate (i.e. the highest count).

I'm calculating the total amount and the average amount. However, I
only want the total amount and average amount to be calculated based
on the top 10 items with the highest transaction rate, and not on all
100 items.

For now, I need an additional query to get the top 10 items first, before
I run the JSON Facet to get the total amount and average amount for those 10
items.

Regards,
Edwin


On 17 February 2017 at 18:02, alessandro.benedetti 

Re: Select TOP 10 items from Solr Query

2017-02-20 Thread Zheng Lin Edwin Yeo
Hi Emir,

Thanks for your suggestion.
But I'm not sure if this could work, as I'm using JSON Facet and not the
normal faceting.

Regards,
Edwin


On 20 February 2017 at 17:24, Emir Arnautovic 
wrote:

> Hi Edwin,
>
> I am also bit confused but, it seems to me that you could achieve what you
> need with pivot faceting:
> https://cwiki.apache.org/confluence/display/solr/Faceting#Faceting-Pivot(DecisionTree)Faceting
>
> HTH,
> Emir
>
>
>
> On 18.02.2017 08:46, Zheng Lin Edwin Yeo wrote:
>
>> Although I have nested documents in the schema, but if not looking for top
>> 10 parents to facet on children.
>>
>> Currently, I'm only working on the information that is available in
>> children. In my case, each line represents a transaction, and I'm doing
>> the
>> faceting based on these individual transaction.
>>
>> Regards,
>> Edwin
>>
>>
>> On 17 February 2017 at 22:10, Alexandre Rafalovitch 
>> wrote:
>>
>> Am I missing something or do you have nested documents in the schema?
>>>
>>> It reads like you looking for top 10 parents and then want to facet on
>>> children. Is that right?
>>>
>>> Regards,
>>> Alex
>>>
>>> On 17 Feb 2017 5:35 AM, "Zheng Lin Edwin Yeo" 
>>> wrote:
>>>
>>> I'm looking at JSON facet for both of type:terms and type:range.
>>>
>>> For example, I may have 100 Items in my collections, and each item can
>>> have
>>> many transactions. But I'm only interested to look at the top 10 items
>>> which has the highest transaction rate (ie the highest count)
>>>
>>> I'm doing a calculation of the total amount and average amount. However,
>>> I
>>> will only want the total amount and average amount to be calculated based
>>> on the top 10 items which has the highest transaction rate, and not all
>>> the
>>> 100 items.
>>>
>>> For now, I need the additional query to get the top 10 items first,
>>> before
>>> I run the JSON Facet to get the total amount and average amount for that
>>> 10
>>> items.
>>>
>>> Regards,
>>> Edwin
>>>
>>>
>>> On 17 February 2017 at 18:02, alessandro.benedetti wrote:
>>>
>>>> I think we are missing something here ...
>>>> You want to fetch the top 10 results for your query, and allow the user to
>>>> navigate only those 10 results through facets ?
>>>>
>>>> Which facets are you interested in ?
>>>> Field facets ?
>>>> Whatever facet you want, calculating it in your client, on 10 results
>>>> shouldn't be that problematic.
>>>> Are we missing something ? Why you would need an additional query ?
>>>>
>>>> Cheers
>>>>
>>>> -----
>>>> Alessandro Benedetti
>>>> Search Consultant, R&D Software Engineer, Director
>>>> Sease Ltd. - www.sease.io
>>>> --
>>>> View this message in context: http://lucene.472066.n3.nabble.com/Select-TOP-10-items-from-Solr-Query-tp4320863p4320910.html
>>>> Sent from the Solr - User mailing list archive at Nabble.com.


> --
> Monitoring * Alerting * Anomaly Detection * Centralized Log Management
> Solr & Elasticsearch Support * http://sematext.com/
>
>


Re: Select TOP 10 items from Solr Query

2017-02-20 Thread Emir Arnautovic

Hi Edwin,

I am also a bit confused, but it seems to me that you could achieve what
you need with pivot faceting:
https://cwiki.apache.org/confluence/display/solr/Faceting#Faceting-Pivot(DecisionTree)Faceting
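
As a rough illustration of a pivot facet request, the sketch below builds the query parameters for pivoting on two fields. The field names item_id and payment_type are hypothetical, as is the helper name. facet.limit=10 keeps only the 10 most frequent values at each pivot level.

```python
from urllib.parse import urlencode


def pivot_params(query, fields, limit=10):
    """Build request parameters for a pivot facet (illustrative helper)."""
    return {
        "q": query,
        "rows": 0,                        # only facet counts are needed
        "facet": "true",
        "facet.pivot": ",".join(fields),  # comma-separated list of fields to pivot on
        "facet.limit": limit,
    }


query_string = urlencode(pivot_params("*:*", ["item_id", "payment_type"]))
print(query_string)
```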


HTH,
Emir


On 18.02.2017 08:46, Zheng Lin Edwin Yeo wrote:

Although I have nested documents in the schema, I'm not looking for the top
10 parents to facet on children.

Currently, I'm only working on the information that is available in
children. In my case, each line represents a transaction, and I'm doing the
faceting based on these individual transactions.

Regards,
Edwin


On 17 February 2017 at 22:10, Alexandre Rafalovitch 
wrote:


Am I missing something or do you have nested documents in the schema?

It reads like you are looking for the top 10 parents and then want to facet on
children. Is that right?

Regards,
Alex

On 17 Feb 2017 5:35 AM, "Zheng Lin Edwin Yeo" 
wrote:

I'm looking at JSON Facet for both type:terms and type:range.

For example, I may have 100 items in my collection, and each item can have
many transactions. But I'm only interested in the top 10 items
with the highest transaction rate (i.e. the highest count).

I'm calculating the total amount and the average amount. However, I
only want the total amount and average amount to be calculated based
on the top 10 items with the highest transaction rate, and not on all
100 items.

For now, I need an additional query to get the top 10 items first, before
I run the JSON Facet to get the total amount and average amount for those 10
items.

Regards,
Edwin


On 17 February 2017 at 18:02, alessandro.benedetti 
wrote:


I think we are missing something here ...
You want to fetch the top 10 results for your query, and allow the user to
navigate only those 10 results through facets ?

Which facets are you interested in ?
Field facets ?
Whatever facet you want, calculating it in your client, on 10 results
shouldn't be that problematic.
Are we missing something ? Why you would need an additional query ?

Cheers

-----
Alessandro Benedetti
Search Consultant, R&D Software Engineer, Director
Sease Ltd. - www.sease.io
--
View this message in context: http://lucene.472066.n3.nabble.com/Select-TOP-10-items-from-Solr-Query-tp4320863p4320910.html
Sent from the Solr - User mailing list archive at Nabble.com.



--
Monitoring * Alerting * Anomaly Detection * Centralized Log Management
Solr & Elasticsearch Support * http://sematext.com/