user-unsubscr...@spark.apache.org
From: kundan kumar [mailto:iitr.kun...@gmail.com]
Sent: Wednesday, June 7, 2017 5:15 AM
To: 颜发才(Yan Facai) <facai@gmail.com>
Cc: spark users <user@spark.apache.org>
Subject: Re: Convert the feature vector to raw data

Hi Yan,

This doesn't work.

thanks,
kundan

On Wed, Jun 7, 2017 at 2:53 PM, 颜发才(Yan Facai) <facai@gmail.com> wrote:

Hi, kumar.

How about removing the `select` in your code? Namely,

    Dataset result = model.transform(testData);
    result.show(1000, false);

On Wed, Jun 7, 2017 at 5:00 PM, kundan kumar <iitr.kun...@gmail.com> wrote:

I am using

    Dataset result = model.transform(testData).select("probability", "label","features");
    result.show(1000, false);

In this case the feature vector is being printed as output. Is there a way that my original raw data gets printed instead of the feature vector, OR is there a way to reverse extract my raw data from the feature vector?

All of the features in my dataset are categorical in nature.

Thanks,
Kundan
From: 颜发才(Yan Facai) [mailto:facai@gmail.com]
Sent: Wednesday, June 7, 2017 4:24 AM
To: kundan kumar <iitr.kun...@gmail.com>
Cc: spark users <user@spark.apache.org>
Subject: Re: Convert the feature vector to raw data

Hi, kumar.

How about removing the `select` in your code? Namely,

    Dataset result = model.transform(testData);
    result.show(1000, false);

On Wed, Jun 7, 2017 at 5:00 PM, kundan kumar <iitr.kun...@gmail.com> wrote:

I am using

    Dataset result = model.transform(testData).select("probability", "label","features");
    result.show(1000, false);

In this case the feature vector is being printed as output. Is there a way that my original raw data gets printed instead of the feature vector, OR is there a way to reverse extract my raw data from the feature vector?

All of the features in my dataset are categorical in nature.

Thanks,
Kundan
From: kundan kumar [mailto:iitr.kun...@gmail.com]
Sent: Wednesday, June 7, 2017 4:01 AM
To: spark users <user@spark.apache.org>
Subject: Convert the feature vector to raw data

I am using

    Dataset result = model.transform(testData).select("probability", "label","features");
    result.show(1000, false);

In this case the feature vector is being printed as output. Is there a way that my original raw data gets printed instead of the feature vector, OR is there a way to reverse extract my raw data from the feature vector?

All of the features in my dataset are categorical in nature.

Thanks,
Kundan
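The thread does not say how the feature vector was assembled, so the following is only a sketch under a stated assumption: each categorical column was one-hot encoded into its own block, the blocks were concatenated in a known order, and the list of category labels per column is still available. Under that assumption the raw values can be recovered by finding the active slot in each block. This is plain Java rather than Spark API, and `OneHotDecoder`, `decode`, and the sample categories are hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;

public class OneHotDecoder {

    /**
     * Decodes a concatenated one-hot vector back into category labels.
     * categories.get(i) holds the labels of column i in encoding order
     * (an assumption: the thread does not describe the actual encoding).
     */
    public static List<String> decode(double[] vector, List<String[]> categories) {
        int expected = 0;
        for (String[] labels : categories) {
            expected += labels.length;
        }
        if (vector.length != expected) {
            throw new IllegalArgumentException(
                "vector length " + vector.length + " does not match " + expected);
        }

        List<String> raw = new ArrayList<>();
        int offset = 0;
        for (String[] labels : categories) {
            String value = null; // stays null if no slot in this block is active
            for (int j = 0; j < labels.length; j++) {
                if (vector[offset + j] == 1.0) {
                    value = labels[j];
                    break;
                }
            }
            raw.add(value);
            offset += labels.length;
        }
        return raw;
    }

    public static void main(String[] args) {
        List<String[]> categories = new ArrayList<>();
        categories.add(new String[] {"red", "green", "blue"});
        categories.add(new String[] {"small", "large"});

        double[] features = {0, 1, 0, 0, 1};
        System.out.println(decode(features, categories)); // prints [green, large]
    }
}
```

If the encoder dropped one category per column (so that an all-zero block stands for the omitted category), the `null` returned for that block would have to be mapped to the omitted label instead.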
From: Joel D [mailto:games2013@gmail.com]
Sent: Monday, May 29, 2017 9:04 PM
To: user@spark.apache.org
Subject: Schema Evolution Parquet vs Avro

Hi,

We are trying to come up with the best storage format for handling schema changes in ingested data.

We noticed that both Avro and Parquet allow one to select based on column name instead of the index/position of the data. However, we are inclined towards Parquet for better read performance, since it's columnar and we will be selecting a few columns instead of all.

Data will be processed and saved to partitions on which we will have Hive external tables.

Will Parquet be able to handle the following:
- Column renaming from between data
- Column removal from between
- DataType change of existing column (int to bigint should be allowed, right?)

Please advise.

Thanks,
Sam
From: ANEESH .V.V [mailto:aneeshnair.ku...@gmail.com]
Sent: Friday, May 26, 2017 1:50 AM
To: user@spark.apache.org
Subject: unsubscribe

unsubscribe
From: Steffen Schmitz [mailto:steffenschm...@hotmail.de]
Sent: Thursday, May 25, 2017 3:34 AM
To: ramnavan
Cc: user@spark.apache.org
Subject: Re: Questions regarding Jobs, Stages and Caching
From: Arun [mailto:arunbm...@gmail.com]
Sent: Saturday, May 20, 2017 9:48 PM
To: user@spark.apache.org
Subject: Rmse recomender system

Hi all,

I am new to machine learning and am working on a recommender system. On the training dataset the RMSE is 0.08, while on the test data it is 2.345. What conclusion can I draw from this, and what steps can I take to improve it?

Sent from Samsung tablet
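A training RMSE of 0.08 against a test RMSE of 2.345 is the usual signature of overfitting: the model reproduces the ratings it has seen but generalizes poorly. Below is a minimal sketch, in plain Java rather than Spark's API, of computing both errors and flagging a large gap. The class name, method names, sample ratings, and the 2x gap threshold are all illustrative assumptions, not something taken from the message.

```java
public class RmseCheck {

    /** Root mean squared error between actual and predicted ratings. */
    public static double rmse(double[] actual, double[] predicted) {
        if (actual.length == 0 || actual.length != predicted.length) {
            throw new IllegalArgumentException("arrays must be non-empty and of equal length");
        }
        double sumSquared = 0.0;
        for (int i = 0; i < actual.length; i++) {
            double error = actual[i] - predicted[i];
            sumSquared += error * error;
        }
        return Math.sqrt(sumSquared / actual.length);
    }

    /**
     * Flags a likely overfit when the test error exceeds the training error
     * by more than the given ratio (the ratio is an arbitrary, illustrative choice).
     */
    public static boolean looksOverfit(double trainRmse, double testRmse, double ratio) {
        return testRmse > ratio * trainRmse;
    }

    public static void main(String[] args) {
        // Made-up ratings, for illustration only.
        double[] trainActual = {4.0, 3.0, 5.0};
        double[] trainPredicted = {4.1, 2.9, 5.0};
        double[] testActual = {4.0, 2.0, 5.0};
        double[] testPredicted = {1.5, 4.5, 2.5};

        double train = rmse(trainActual, trainPredicted);
        double test = rmse(testActual, testPredicted);
        System.out.println("train RMSE = " + train);
        System.out.println("test RMSE  = " + test);
        System.out.println("likely overfit: " + looksOverfit(train, test, 2.0));
    }
}
```

If the recommender is Spark's ALS (the message does not say), increasing `regParam` or lowering `rank` are common things to try, ideally tuned against a held-out validation split rather than the test set.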
From: Abir Chakraborty [mailto:abi...@247-inc.com]
Sent: Sunday, May 21, 2017 4:17 AM
To: user@spark.apache.org
Subject: unsubscribe

unsubscribe
From: Bibudh Lahiri [mailto:bibudhlah...@gmail.com]
Sent: Sunday, May 21, 2017 9:34 AM
To: user
Subject: unsubscribe

unsubscribe
From: 萝卜丝炒饭 [mailto:1427357...@qq.com]
Sent: Sunday, May 21, 2017 8:15 PM
To: user <user@spark.apache.org>
Subject: Are tachyon and akka removed from 2.1.1 please

Hi all,

I read a paper about the source code; the paper is based on version 1.2 and refers to Tachyon and Akka. When I read the 2.1 code, I cannot find the code for Akka and Tachyon.

Have Tachyon and Akka been removed from 2.1.1?