You could also use dummy coding to convert categorical feature to
numeric feature.

http://en.wikipedia.org/wiki/Categorical_variable#Dummy_coding

Sincerely,

DB Tsai
-------------------------------------------------------
My Blog: https://www.dbtsai.com
LinkedIn: https://www.linkedin.com/in/dbtsai


On Wed, May 14, 2014 at 10:37 PM, Xiangrui Meng <men...@gmail.com> wrote:
> It depends on how you want to use the string features. For the day of
> the week, you can replace it with 6 binary features indicating
> Mon/Tue/Wed/Th/Fri/Sat. -Xiangrui
>
> On Fri, May 9, 2014 at 5:31 AM, praveshjain1991
> <praveshjain1...@gmail.com> wrote:
>> I have been trying to use LR in Spark's Java API. I used the dataset given
>> along with Spark for the training and testing purposes.
>>
>> Now i want to use it on another dataset that contains string values along
>> with numbers. Is there any way to do this?
>>
>> I am attaching the Dataset that i want to use.
>>
>> Thanks and Regards, Test.data
>> <http://apache-spark-user-list.1001560.n3.nabble.com/file/n5523/Test.data>
>>
>>
>>
>> --
>> View this message in context: 
>> http://apache-spark-user-list.1001560.n3.nabble.com/Using-String-Dataset-for-Logistic-Regression-tp5523.html
>> Sent from the Apache Spark User List mailing list archive at Nabble.com.

Reply via email to