[ https://issues.apache.org/jira/browse/IGNITE-9145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aleksey Zinoviev updated IGNITE-9145: ------------------------------------- Description: The main idea to add a few strategies of indexing: sorting and so on. Currently it supports only one strategy (most popular with zero and less popular with the max index size). There are can be a few options * 'frequencyDesc': descending order by label frequency (most frequent label assigned 0) * 'frequencyAsc': ascending order by label frequency (least frequent label assigned 0) Please, update the method **transformFrequenciesToEncodingValues and add the strategy as a parameter of trainer. was: The main idea to add a few strategies of indexing: sorting and so on. Currently it supports only one strategy (most popular with zero and less popular with the max index size). There are can be a few options * 'frequencyDesc': descending order by label frequency (most frequent label assigned 0) * 'frequencyAsc': ascending order by label frequency (least frequent label assigned 0) * 'alphabetDesc': descending alphabetical order * 'alphabetAsc': ascending alphabetical order Please, update the method **transformFrequenciesToEncodingValues and add the strategy as a parameter of trainer. > [ML] Add different strategies to index labels in StringEncoderTrainer > --------------------------------------------------------------------- > > Key: IGNITE-9145 > URL: https://issues.apache.org/jira/browse/IGNITE-9145 > Project: Ignite > Issue Type: Improvement > Components: ml > Reporter: Aleksey Zinoviev > Assignee: Aleksey Zinoviev > Priority: Major > > The main idea to add a few strategies of indexing: sorting and so on. > Currently it supports only one strategy (most popular with zero and less > popular with the max index size). > There are can be a few options > * 'frequencyDesc': descending order by label frequency (most frequent label > assigned 0) > * 'frequencyAsc': ascending order by label frequency (least frequent label > assigned 0) > > Please, update the method **transformFrequenciesToEncodingValues and add the > strategy as a parameter of trainer. > -- This message was sent by Atlassian JIRA (v7.6.3#76005)