Hi, say I want to use the GLM package to run a regression that contains several categorical variables. My understanding is that I need the equivalent of an R factor, which is a PooledDataArray. Suppose I take this dataset
using RDatasets *data = dataset("Ecdat","Benefits")* and I want columns 6-14 as PooledDataArray I tried data[,:Sex] = convert(PooledDataArray,data[,:Sex]) and pool!(data[,:Sex]) # I can see that this needs a second column with the numeric code thanks!