[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yonik Seeley updated SOLR-5302:
---
    Fix Version/s: 5.0

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
> Fix For: 5.0, Trunk
>
> Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302_contrib.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
>
>
> This ticket is to track a "replacement" for the StatsComponent. The AnalyticsComponent supports the following features:
> * All functionality of StatsComponent (SOLR-4499)
> * Field Faceting (SOLR-3435)
> ** Support for limit
> ** Sorting (bucket name or any stat in the bucket)
> ** Support for offset
> * Range Faceting
> ** Supports all options of standard range faceting
> * Query Faceting (SOLR-2925)
> * Ability to use overall/field facet statistics as input to range/query faceting (ie calc min/max date and then facet over that range)
> * Support for more complex aggregate/mapping operations (SOLR-1622)
> ** Aggregations: min, max, sum, sum-of-square, count, missing, stddev, mean, median, percentiles
> ** Operations: negation, abs, add, multiply, divide, power, log, date math, string reversal, string concat
> ** Easily pluggable framework to add additional operations
> * New / cleaner output format
> Outstanding Issues:
> * Multi-value field support for stats (supported for faceting)
> * Multi-shard support (may not be possible for some operations, eg median)

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
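The outstanding-issues note that multi-shard support may not be possible for operations such as median can be illustrated with a small sketch. This is not code from the patch: the class and method names (`MergeableStats`, `ShardStats`, `merge`) are hypothetical, and population standard deviation is assumed because the ticket does not say which form the component uses. The point is only that count, sum and sum-of-squares can be added across shards and still yield the exact mean and stddev, whereas per-shard medians cannot be combined into the true median.

```java
import java.util.Arrays;

public class MergeableStats {

    /** Hypothetical per-shard partial state: count, sum and sum of squares. */
    static final class ShardStats {
        final long count;
        final double sum;
        final double sumOfSquares;

        ShardStats(long count, double sum, double sumOfSquares) {
            this.count = count;
            this.sum = sum;
            this.sumOfSquares = sumOfSquares;
        }

        /** Builds the partial state for the values held by one shard. */
        static ShardStats of(double... values) {
            double s = 0.0;
            double sq = 0.0;
            for (double v : values) {
                s += v;
                sq += v * v;
            }
            return new ShardStats(values.length, s, sq);
        }

        /** Partial states combine by simple addition, so they merge across shards. */
        ShardStats merge(ShardStats other) {
            return new ShardStats(count + other.count,
                                  sum + other.sum,
                                  sumOfSquares + other.sumOfSquares);
        }

        double mean() {
            return sum / count;
        }

        /** Population standard deviation (an assumption, see the lead-in). */
        double stddev() {
            double m = mean();
            return Math.sqrt(Math.max(0.0, sumOfSquares / count - m * m));
        }
    }

    /** Exact median; needs every value, not a fixed-size summary. */
    static double median(double... values) {
        double[] sorted = values.clone();
        Arrays.sort(sorted);
        int n = sorted.length;
        return n % 2 == 1 ? sorted[n / 2] : (sorted[n / 2 - 1] + sorted[n / 2]) / 2.0;
    }

    public static void main(String[] args) {
        double[] shardA = {1, 2, 9};
        double[] shardB = {3, 4, 5, 6};

        ShardStats merged = ShardStats.of(shardA).merge(ShardStats.of(shardB));
        System.out.println("merged mean   = " + merged.mean());
        System.out.println("merged stddev = " + merged.stddev());

        // Averaging the per-shard medians does not give the overall median.
        double naive = (median(shardA) + median(shardB)) / 2.0;
        double exact = median(1, 2, 9, 3, 4, 5, 6);
        System.out.println("average of shard medians = " + naive);
        System.out.println("true median              = " + exact);
    }
}
```

With these sample values the two shard medians are 2 and 4.5, while the median of the combined data is 4, so no function of the two shard medians alone recovers it in general. Percentiles have the same problem unless shards ship their values or an approximate summary.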
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yonik Seeley updated SOLR-5302:
---
    Attachment: SOLR-5302_contrib.patch

As the first step of finally getting this into 4x, here's a patch that moves the analytics component to contrib (which seems to be the consensus).

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
> Fix For: 5.0
>
> Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302_contrib.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson updated SOLR-5302:
---
    Attachment: SOLR-5302.patch

Some of the tests were failing because, at least on my Mac, the paths to the various files in test-files weren't correct when I ran "ant test" from the shell. Current implementation succeeds both from the shell and IntelliJ.

We'll see if this breaks on Jenkins. Meanwhile, committing on trunk. Anyone who wants to give it a whirl, feedback greatly appreciated!

Awesome stuff Steven! Here I thought I had a big patch at 150K or so :)

This has considerable documentation, anyone want to volunteer to incorporate it into the Wiki?

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
>
> Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Bower updated SOLR-5302:
---
    Attachment: SOLR-5302.patch

New patch attached addressing the schema and javadoc issues.

For the schema I just added a new one, schema-analytics.xml. I must have missed this file in my first pass, as I had made changes to schema-docValues.xml, but now it's a separate file and should avoid any confusion/changes in the future.

Added package.html files for missing javadoc.

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
>
> Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Erick Erickson updated SOLR-5302:
---
    Attachment: SOLR-5302.patch

Please apply my updated version of the patch or make the same changes before making a new one or I'll have to re-do some work.

NOTE: This is against trunk!

Working with pre-commit:

Changes I had to make:

A couple of files were indented with tabs. Since it's a new file, I just reformatted them.

The forbidden api checks failed on several files. Mostly requiring either Scanners to have "UTF-8" specified or String.toLowerCase to have Locale.ROOT and such-like.

I did most of this on the plane ride home, and I must admit it's annoying to have precommit fail because I don't have internet connectivity, there _must_ be a build flag somewhere.

These files have missing javadocs
[exec] missing: org.apache.solr.analytics.accumulator
[exec] missing: org.apache.solr.analytics.accumulator.facet
[exec] missing: org.apache.solr.analytics.expression
[exec] missing: org.apache.solr.analytics.plugin
[exec] missing: org.apache.solr.analytics.request
[exec] missing: org.apache.solr.analytics.statistics
[exec] missing: org.apache.solr.analytics.util
[exec] missing: org.apache.solr.analytics.util.valuesource
[exec]
[exec] Missing javadocs were found!

Tests failing, and a JVM crash to boot.
FieldFacetExtrasTest fails with "unknown field int_id". There's nothing in schema-docValues.xml that would map to that field, did it get changed? Is this a difference between trunk and 4x?

- org.apache.solr.analytics.NoFacetTest (suite)
[junit4] - org.apache.solr.analytics.facet.FieldFacetExtrasTest (suite)
[junit4] - org.apache.solr.analytics.expression.ExpressionTest (suite)
[junit4] - org.apache.solr.analytics.AbstractAnalyticsStatsTest.initializationError
[junit4] - org.apache.solr.analytics.util.valuesource.FunctionTest (suite)
[junit4] - org.apache.solr.analytics.facet.AbstractAnalyticsFacetTest.initializationError
[junit4] - org.apache.solr.analytics.facet.FieldFacetTest (suite)
[junit4] - org.apache.solr.analytics.facet.QueryFacetTest.queryTest
[junit4] - org.apache.solr.analytics.facet.RangeFacetTest (suite)

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
>
> Attachments: SOLR-5302.patch, SOLR-5302.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, solr_analytics-2013.10.04-2.patch
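For readers unfamiliar with the forbidden-API failures mentioned in this message, here is a minimal sketch of the kind of change such checks usually ask for: give `Scanner` an explicit charset and give `String.toLowerCase` an explicit locale. It is not taken from the patch; the class and method names (`LocaleSafeExamples`, `firstLine`, `normalize`) are hypothetical and only the JDK calls themselves are real.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import java.util.Scanner;

public class LocaleSafeExamples {

    /** Reads the first line with an explicit charset instead of the platform default. */
    static String firstLine(InputStream in) {
        // new Scanner(in) would decode with the JVM's default charset,
        // which varies between machines.
        try (Scanner scanner = new Scanner(in, "UTF-8")) {
            return scanner.hasNextLine() ? scanner.nextLine() : "";
        }
    }

    /** Lower-cases independently of the JVM's default locale. */
    static String normalize(String s) {
        // s.toLowerCase() uses the default locale; under a Turkish locale
        // "I" lower-cases to a dotless i rather than "i".
        return s.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        byte[] bytes = "h\u00e9llo\nworld".getBytes(StandardCharsets.UTF_8);
        System.out.println(firstLine(new ByteArrayInputStream(bytes)));

        System.out.println(normalize("TITLE"));
        System.out.println("TITLE".toLowerCase(Locale.forLanguageTag("tr")));
    }
}
```

The last two lines of `main` show why the check exists: the same input lower-cases differently depending on the locale, so code that compares field or function names after lower-casing can behave differently from one machine to another.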
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Bower updated SOLR-5302:
---
    Attachment: SOLR-5302.patch

Patch updated to include:
* Cleaned up imports of Arrays class to use java.util.Arrays
* Added support for "missing" numeric doc values
* Removed the "defaultIsMissing" stuff as it's no longer needed

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
> Assignee: Erick Erickson
>
> Attachments: Search Analytics Component.pdf, SOLR-5302.patch, solr_analytics-2013.10.04-2.patch, Statistical Expressions.pdf
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Bower updated SOLR-5302:
---
    Attachment: solr_analytics-2013.10.04-2.patch

Updated patch:
* Updated license comment to remove copyrights
* Added copyright notice to NOTICE.txt
* Cleaned up lots of Javadoc warnings
* Cleaned up some exception handling

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
>
> Attachments: Search Analytics Component.pdf, solr_analytics-2013.10.04-2.patch, Statistical Expressions.pdf
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Bower updated SOLR-5302:
---
    Attachment: (was: solr_analytics-2013.10.04.patch)

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
>
> Attachments: Search Analytics Component.pdf, solr_analytics-2013.10.04-2.patch, Statistical Expressions.pdf
[jira] [Updated] (SOLR-5302) Analytics Component
[ https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Bower updated SOLR-5302:
---
    Attachment: Statistical Expressions.pdf
                Search Analytics Component.pdf
                solr_analytics-2013.10.04.patch

Initial patch, please review/comment. Additionally PDF exports of some docs for using the component

> Analytics Component
> ---
>
> Key: SOLR-5302
> URL: https://issues.apache.org/jira/browse/SOLR-5302
> Project: Solr
> Issue Type: New Feature
> Reporter: Steven Bower
>
> Attachments: Search Analytics Component.pdf, solr_analytics-2013.10.04.patch, Statistical Expressions.pdf