The smaller index have 1 million lines of data. They are the lines filtered 
 by "prefix":{"ip":"100.1"} from the bigger one.

在 2014年4月2日星期三UTC+8下午4时04分27秒,vir....@gmail.com写道:
>
> I do an *aggregation* search on my index(*6 nodes*). There are about *200 
> million lines* of data(port scanning). Each line is same* like this 
> :**{"ip":"85.18.68.5", 
> "banner":"cisco-IOS", "country":"IT", "_type":"port-80"}.* 
> So you can image I have these data sort into different type by port they 
> are scanning. Now, I want to know who open a lot of ports at the same time. 
> So, I choose to do aggregation on IP field, and I get an OOM error that may 
> be reasonable because of most of them open only one port so that there are 
> too many buckets? I guess.
>
>
> And then, I use aggregation filter. 
>
> {
>     "aggs":{
>         "just_name1":{
>           "filter":{
>               "prefix":{
>                   "ip":"100.1"
>               }
>           },
>           "aggs":{
>               "just_name2":{
>                   "terms":{
>                       "field":"ip",
>                       "execution_hint":"map"
>                   }
>               }
>           }
>       }
>     }
> }(yes, my ip field is set as string)
>
> I think this time, I could make ES narrow down the set for aggregation. But I 
> still get an OOM error. While It works on a smaller index(another cluster, 
> one node). Why would this happen? After filtering, 2 cluster should have an 
> equal-volume set. Why the bigger one failed?  
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/d384bea8-4a60-4521-aa0e-34bb2fd61ec5%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to