The static word encoder is appropriate for categorical variables with an
unknown number of values.




On Sun, Aug 3, 2014 at 9:16 PM, Brian Krebs <bkr...@tapheaven.com> wrote:

> Hi everyone,
>
> I have a very basic question on the Apache SGD implementation. My training
> set has about 50 features, most of which are categorical. Some of these
> categories are binary, but others can have an unknown number of discrete
> values (countries, cities, etc.).
>
> Should I be encoding these with the ConstantValueEncoder? The
> StaticWordValueEncoder?
>
> Thanks,
>
> *Brian Krebs*
> CIO and Co-founder
> TapHeaven
> Mobile: 443.866.2137
> Email: bkr...@tapheaven.com
> Twitter: @BKrebsTH
> LinkedIn: www.linkedin.com/in/briankrebs
> tapheaven.com
>

Reply via email to