[ 
https://issues.apache.org/jira/browse/SOLR-1931?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12873222#action_12873222
 ] 

Lance Norskog commented on SOLR-1931:
-------------------------------------

This is my test index:
65m documents
2 text fields, 'text0' and 'text1', with 10m and 14m unique terms.
Several more string fields with 1 to 10 unique terms each; 'protocol', for 
example, has 4 unique facet values.
No dynamic fields.

* numTerms=0 
** Returns immediately with the field list.
* numTerms=10
** 130-160 seconds
* numTerms=10&fl=protocol
** 45 seconds
* numTerms=10&fl=text0
** 60 seconds
* numTerms=10&fl=text1
** 60 seconds

* show=schema
** 18 seconds after above warmup queries 

These numbers are consistent across multiple runs against the same index load, 
in various orders.
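
Timings like these can be collected by repeating each request and recording 
the wall-clock time of every run. The following is only a sketch: `time_call` 
is a hypothetical helper, and it accepts any zero-argument callable (for 
example, one that issues a single Luke request).

```python
import time

def time_call(fn, runs=3):
    """Call fn() `runs` times and return the elapsed wall-clock seconds of each run."""
    timings = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()  # the request being measured
        timings.append(time.perf_counter() - start)
    return timings
```

Comparing the per-run values, rather than only an average, shows whether a 
request gets faster after warmup.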

Given the above numbers, the commands should be:
* to get a list of fixed fields
** numTerms=0
* to find dynamic fields
** show=schema
* to find unique terms for a field
** allow user to choose between
*** numTerms=X&fl=field
*** facet call
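
The requests in the list above could be built roughly as follows. This is a 
sketch under assumptions: the base URL `http://localhost:8983/solr`, the 
`/admin/luke` and `/select` paths, and the helper names `luke_url` and 
`facet_url` are illustrative and not taken from the test setup described here.

```python
from urllib.parse import urlencode

# Assumed base URL; adjust to the actual Solr instance.
SOLR = "http://localhost:8983/solr"

def luke_url(**params):
    """Build a Luke handler URL from query parameters (hypothetical helper)."""
    return SOLR + "/admin/luke?" + urlencode(params)

def facet_url(field):
    """Build a facet query listing the values of one field, returning no documents."""
    params = {"q": "*:*", "rows": 0, "facet": "true", "facet.field": field}
    return SOLR + "/select?" + urlencode(params)

fixed_fields = luke_url(numTerms=0)                # list of fixed fields
dynamic_fields = luke_url(show="schema")           # includes the dynamic field scan
top_terms = luke_url(numTerms=10, fl="protocol")   # terms for one field
facet_terms = facet_url("protocol")                # alternative: facet call
```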

The Luke handler needs a new show=schema option that skips the scan for 
dynamic fields. The Schema Browser would call it on page open; individual 
fields could then have drill-downs, and a 'scan for dynamic fields' button 
could run the current show=schema scan.

Does this make sense?

Other possible features: 
* info on segments
** separate above results on segments?
* shortest, longest, mean, standard deviation of text field lengths
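
The length statistics in the last item could be computed along these lines. 
This is a sketch only: `length_stats` is a hypothetical helper, lengths are 
measured in characters (an assumption; token counts would work the same way), 
and the population standard deviation is used.

```python
import statistics

def length_stats(values):
    """Return shortest, longest, mean and standard deviation of the value lengths."""
    lengths = [len(v) for v in values]
    return {
        "shortest": min(lengths),
        "longest": max(lengths),
        "mean": statistics.mean(lengths),
        "stdev": statistics.pstdev(lengths),  # population standard deviation
    }
```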


> Schema Browser does not scale with large indexes
> ------------------------------------------------
>
>                 Key: SOLR-1931
>                 URL: https://issues.apache.org/jira/browse/SOLR-1931
>             Project: Solr
>          Issue Type: Improvement
>          Components: web gui
>    Affects Versions: 1.4
>            Reporter: Lance Norskog
>            Priority: Minor
>
> The Schema Browser JSP by default causes the Luke handler to "scan the 
> world". In large indexes this makes the UI useless.
> On an index with 64m documents & 8gb of disk space, the Schema Browser took 6 
> minutes to open and hogged all disk I/O, making Solr useless.


