Re: Running aggregations on two different nested objects

2014-03-21 Thread Jean-Noël Rivasseau
Hello,

Thanks for your reply. What do you mean by not possible to escape it ? 
Could you provide a sample code in Java, that would work if the necessary 
changes would be implemented?

Jean-Noel

On Thursday, March 20, 2014 1:51:54 PM UTC+4, Adrien Grand wrote:

 Hi,

 The aggregation doesn't work because today, when you enter the context of 
 a nested field in an aggregation, it is not possible to escape it. I don't 
 think there is an easy way to modify your data model in order to work 
 around this issue, however this is an issue that we plan to fix in the 
 future (not in the upcoming 1.1 release however, rather in a few months).


 On Wed, Mar 19, 2014 at 10:00 AM, Jean-Noël Rivasseau 
 elv...@gmail.comjavascript:
  wrote:

 Hello,

 I just started using ElasticSearch 1.0.1. I am trying to find the ideal 
 data model and query for my exact needs, which I will explain below (I 
 changed just the terms of the data model corresponding to my real use case, 
 in order to see if I was able to formulate it differently, which was 
 useful).

 I am indexing documents corresponding to BookedStay. A BookedStay has a 
 nested array (named places) containing map objects corresponding to visited 
 places during the stay. An object has an id (place id) and a category 
 corresponding to the time of day of the visited place. A BookedStay then 
 has a second nested array, corresponding to the amenities used during the 
 stay. The objects in the array have an id (of type string) and a count.

 So a BookedStay can be represented as : {date: 03/03/2014, 
 placesVisited: [{id: 3, category: MORNING}, {id: 5, 
 category:  AFTERNOON}, {id: 7, category: EVENING}], amenities: 
 [{amenityId: restaurant, count: 3}, {amenityId: dvdPlayer, 
 count: 1}] }

 What I want to run is a query over a given room number, and find for 
 all BookedStay that have this given place number in their places array, an 
 aggregate over all amenities used, per time of day.

 This amounts to finding, for all documents that have a place id of (for 
 instance) 5, the number of times the restaurant was used, or the dvd player 
 in the lounge, broken down by time of day. The ultimate goal is to 
 understand better how the visit of a place in a given time of day affects 
 the services sold by the hotel.

 I am unable to achieve this query, as when I run a first nested aggregate 
 over the category, I cannot nest the second one over the amenities as it is 
 in the parent document. Is it possible to do that? In that case, how do I 
 specify that the nested aggregation will take place over the parent object 
 of the current aggregation?

 Here is a tentative query with the Java driver (obviously not working, 
 because of the above problem):

 SearchRequestBuilder srb = 
 elasticSearchService.getClient().prepareSearch(test_index).setSearchType(SearchType.COUNT).setTypes(test_stay).setQuery(QueryBuilders.nestedQuery(placesVisited,
  
 QueryBuilders.termQuery(id, 5)))
  
 .addAggregation(AggregationBuilders.nested(nestedPlaceVisited).path(placesVisited)
 .subAggregation(AggregationBuilders.filter(currentPlaceFilter).filter(FilterBuilders.termFilter(id,
  
 5))
  
 .subAggregation(AggregationBuilders.terms(countPerTimeCategory).field(category)
 .subAggregation(AggregationBuilders.nested(nestedAmenities).path(amenities)
  
  // HERE THIS subaggregation should run over the original document... and I 
 dont know how to achieve that
  
 subAggregation(AggregationBuilders.terms(amenitiesUsed).field(amenities.amenityId))

 Thanks for your help over this difficult problem! If it's not possible 
 with parent aggregations, how should I refactor my data model?

 Jean-Noel

 -- 
 You received this message because you are subscribed to the Google Groups 
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an 
 email to elasticsearc...@googlegroups.com javascript:.
 To view this discussion on the web visit 
 https://groups.google.com/d/msgid/elasticsearch/e8a752fc-0a96-437d-b071-4009c0f39d33%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/e8a752fc-0a96-437d-b071-4009c0f39d33%40googlegroups.com?utm_medium=emailutm_source=footer
 .
 For more options, visit https://groups.google.com/d/optout.




 -- 
 Adrien Grand
  

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/c79aba35-5c3b-4356-bff2-687378fb4261%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


Re: Running aggregations on two different nested objects

2014-03-21 Thread Adrien Grand
On Fri, Mar 21, 2014 at 8:15 AM, Jean-Noël Rivasseau elva...@gmail.comwrote:

 Thanks for your reply. What do you mean by not possible to escape it ?
 Could you provide a sample code in Java, that would work if the necessary
 changes would be implemented?


The nested field mapper stores data as separate Lucene documents. What the
nested aggregation does, is that for every incoming (parent) document ID,
it is going to call sub aggregations with the document ID of children
documents. The sub aggregations are not aware that they are being applied
to child documents, to them it doesn't make any difference, they just do
their usual stuff, but on different doc IDs.

What would be needed in order to make your aggregation work would be to
have another aggregation that would be able to translate these child doc
IDs back to their parent's doc ID, which is something that we don't have
today.

-- 
Adrien Grand

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j61pgEgKHLjM4nuuG%2Bg_rfM6r4bZjZ0ia%2B46vO7bvtgUg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


Re: Running aggregations on two different nested objects

2014-03-20 Thread Adrien Grand
Hi,

The aggregation doesn't work because today, when you enter the context of a
nested field in an aggregation, it is not possible to escape it. I don't
think there is an easy way to modify your data model in order to work
around this issue, however this is an issue that we plan to fix in the
future (not in the upcoming 1.1 release however, rather in a few months).


On Wed, Mar 19, 2014 at 10:00 AM, Jean-Noël Rivasseau elva...@gmail.comwrote:

 Hello,

 I just started using ElasticSearch 1.0.1. I am trying to find the ideal
 data model and query for my exact needs, which I will explain below (I
 changed just the terms of the data model corresponding to my real use case,
 in order to see if I was able to formulate it differently, which was
 useful).

 I am indexing documents corresponding to BookedStay. A BookedStay has a
 nested array (named places) containing map objects corresponding to visited
 places during the stay. An object has an id (place id) and a category
 corresponding to the time of day of the visited place. A BookedStay then
 has a second nested array, corresponding to the amenities used during the
 stay. The objects in the array have an id (of type string) and a count.

 So a BookedStay can be represented as : {date: 03/03/2014,
 placesVisited: [{id: 3, category: MORNING}, {id: 5,
 category:  AFTERNOON}, {id: 7, category: EVENING}], amenities:
 [{amenityId: restaurant, count: 3}, {amenityId: dvdPlayer,
 count: 1}] }

 What I want to run is a query over a given room number, and find for
 all BookedStay that have this given place number in their places array, an
 aggregate over all amenities used, per time of day.

 This amounts to finding, for all documents that have a place id of (for
 instance) 5, the number of times the restaurant was used, or the dvd player
 in the lounge, broken down by time of day. The ultimate goal is to
 understand better how the visit of a place in a given time of day affects
 the services sold by the hotel.

 I am unable to achieve this query, as when I run a first nested aggregate
 over the category, I cannot nest the second one over the amenities as it is
 in the parent document. Is it possible to do that? In that case, how do I
 specify that the nested aggregation will take place over the parent object
 of the current aggregation?

 Here is a tentative query with the Java driver (obviously not working,
 because of the above problem):

 SearchRequestBuilder srb =
 elasticSearchService.getClient().prepareSearch(test_index).setSearchType(SearchType.COUNT).setTypes(test_stay).setQuery(QueryBuilders.nestedQuery(placesVisited,
 QueryBuilders.termQuery(id, 5)))

 .addAggregation(AggregationBuilders.nested(nestedPlaceVisited).path(placesVisited)
 .subAggregation(AggregationBuilders.filter(currentPlaceFilter).filter(FilterBuilders.termFilter(id,
 5))

 .subAggregation(AggregationBuilders.terms(countPerTimeCategory).field(category)
 .subAggregation(AggregationBuilders.nested(nestedAmenities).path(amenities)
  // HERE THIS subaggregation should run over the original document... and I
 dont know how to achieve that

 subAggregation(AggregationBuilders.terms(amenitiesUsed).field(amenities.amenityId))

 Thanks for your help over this difficult problem! If it's not possible
 with parent aggregations, how should I refactor my data model?

 Jean-Noel

 --
 You received this message because you are subscribed to the Google Groups
 elasticsearch group.
 To unsubscribe from this group and stop receiving emails from it, send an
 email to elasticsearch+unsubscr...@googlegroups.com.
 To view this discussion on the web visit
 https://groups.google.com/d/msgid/elasticsearch/e8a752fc-0a96-437d-b071-4009c0f39d33%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/e8a752fc-0a96-437d-b071-4009c0f39d33%40googlegroups.com?utm_medium=emailutm_source=footer
 .
 For more options, visit https://groups.google.com/d/optout.




-- 
Adrien Grand

-- 
You received this message because you are subscribed to the Google Groups 
elasticsearch group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/CAL6Z4j6JhVsi6tr2r54sFxnAPpip7gzhvmvX1hje1ueynPr50w%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.