Murmur hash is an option. I believe it was used on Storm's v1 codebase

On Sat, Aug 17, 2024, 13:57 Karthick <ibmkarthickma...@gmail.com> wrote:

> Thanks Sahil, Aaron for the reply
>
> Are you encountering a performance problem?
>
> Yes facing slowness on the executors with more load.
>
> Is the fields grouping absolutely required for your use case?
>
> Yes to maintain the ordering it required for my case.
>
> Can you suggest some hashes?
>
>
> Does your case needs packets of same hash to go to same executor or
>> ordering of events ?
>>
>
> Yes, ordering of events is important.
>
>  Currently we are using a unique id which datatype is bigint(Long).
>
> Can you suggest any changes reg this.
>
>
> On Sat, Aug 17, 2024 at 8:40 AM Sahil Kamboj <sahilkamboj...@gmail.com>
> wrote:
>
>> Hey karthick
>>
>> Does your case needs packets of same hash to go to same executor or
>> ordering of events ? If not then go with local or shuffle grouping which
>> will distribute load in round robin manner across executors. Otherwise
>> choose your key wisely to avoid skewness when using fields grouping.
>>
>>
>>
>> On Sat, 17 Aug 2024 at 5:20 AM, Karthick <ibmkarthickma...@gmail.com>
>> wrote:
>>
>>> Hi Team,
>>>
>>> I'm using fields grouping for a bolt to maintain field-based ordering,
>>> but I'm facing data skewness among the bolt's executors. I have 96
>>> executors, and I'm sending data with 500 distinct fields used in the fields
>>> grouping. While reviewing the Storm UI, I noticed that a few executors are
>>> underutilized while others are overutilized.
>>>
>>> This seems to be a hashing problem i guess. Can anyone suggest a better
>>> hashing technique or approach to resolve this issue?
>>>
>>> Thanks in advance for your help.
>>>
>>

Reply via email to