Hi All,

I am also new to Solr. I have gone through the Solr documentation and tested aggregation using Solr-Presto (Parallel SQL) and Stream.
I am getting very good response times with these 2 technologies. My concern is that I am unable to group by a multivalued field: the standard Solr API supports this, but the latest version of Solr-Presto/Stream does not. I want to aggregate/group by the "app.name" field using Stream/Parallel SQL. Please suggest.

input:
{ id: 1 field1:[1,2,3], app.name:[watsapp,facebook,... ] }
{ id: 2 field1:[1,2,3], app.name:[watsapp,facebook,... ] }

Expected result:
watsapp: 2
facebook: 2

I have 2 TB of data and want to execute in aggmode=map_reduce. Any suggestions?

Regards,
S.Praveen
Technical Architect
LinkedIn: https://www.linkedin.com/in/praveen-babu-73232889?trk=nav_responsive_tab_profile

On Thu, Sep 8, 2016 at 6:01 AM, Roshni Rajagopal <roshkne...@gmail.com> wrote:

> Hi Solr Gurus,
>
> I have these requirements
>
> 1. Need to group data in Solr on multiple fields and compute aggregations
> like SUM(field)
>
> 2. Need to compute some custom calculations - sum(field1)/sum(field2) on
> the grouped data.
>
> Options I've tried
>
> 1. Group - this does not allow grouping by more than 1 field, and
> aggregations are not supported
>
> 2. Stats - this along with facet.pivot gets results for basic group
> aggregations like SUM. Custom calculation is not supported. Also the format
> is messy, with stats getting calculated at every level. Cannot paginate.
>
> 3. Facet JSON API - gets results for basic group aggregations like SUM.
> Format is less messy and we can paginate. Custom calculation like
> DIV(sum(field1), sum(field2)) is still not supported.
>
> So the last resort is the /sql handler for parallel queries. Is it tested and
> stable, and will it meet my requirements? I'm on Solr 6.10.
>
> Or would you recommend adding Spark... I would prefer to handle all
> requirements in Solr, as I don't want to maintain another moving part of
> Spark.
>
> Do advise!
>
> Regards
>
> Roshni