Re: Distinct Item of a column

2017-04-17 Thread arijit chakraborty
Thank you Niketan! Your answer completely answers my question. Regards, Arijit From: Niketan Pansare Sent: Tuesday, April 18, 2017 12:55:28 AM To: dev@systemml.incubator.apache.org Subject: Re: Distinct Item of a column Hi Arijit, PySpark

Re: [VOTE] Apache SystemML 0.14.0-incubating (RC3)

2017-04-17 Thread dusenberrymw
+1 and please call it `branch-0.14`. -- Mike Dusenberry GitHub: github.com/dusenberrymw LinkedIn: linkedin.com/in/mikedusenberry Sent from my iPhone. > On Apr 17, 2017, at 8:50 AM, Arvind Surve wrote: > > I will create next RC (RC4) for SystemML 0.14 in a day or two

Re: Distinct Item of a column

2017-04-17 Thread Niketan Pansare
Hi Arijit, PySpark and SystemML are complementary and serve different purposes. PySpark primarily operates on a collection of datapoints (i.e. RDD) or a DataFrame and exposes the Spark programming model (i.e. transformations and actions). SystemML primarily operates on matrices and provides

Re: SystemML query

2017-04-17 Thread Matthias Boehm
if your data X is already ordered you can do the following: I = rbind(matrix(1,1,1), (X[1:nrow(X)-1,]!=X[2:nrow(X),])); dX = removeEmpty(target=X, margin="rows", select=I); Regards, Matthias On 4/17/2017 8:40 AM, arijit chakraborty wrote: Hi, I've an issue regarding finding and removing the
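
For readers more at home in Python, the idea in the snippet above can be sketched as follows. This is only an illustration, not SystemML code: the helper name `dedupe_sorted` is hypothetical, and the sample values are taken from the original question in this thread. It assumes, as the reply does, that the column is already ordered.

```python
def dedupe_sorted(column):
    """Drop repeated values from an already-sorted list (hypothetical helper)."""
    if not column:
        return []
    # keep[i] is True when row i differs from the row before it.
    # The first row is always kept, mirroring rbind(matrix(1,1,1), ...).
    keep = [True] + [prev != cur for prev, cur in zip(column, column[1:])]
    # Select only the flagged rows, mirroring removeEmpty(..., select=I).
    return [value for value, flag in zip(column, keep) if flag]


values = [1, 2, 3, 3, 3, 4, 5, 6]  # sample data from the original question
print(dedupe_sorted(values))  # prints [1, 2, 3, 4, 5, 6]
```

Comparing each row only with its predecessor is what makes the ordering assumption necessary: on unsorted input, equal values that are not adjacent would both be kept.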

Re: SystemML query

2017-04-17 Thread arijit chakraborty
Hi, I've an issue regarding finding and removing the duplicates in a column/row of a matrix. I tried this method: duplicate_matrix = matrix("1 2 3 3 3 4 5 6", rows = 8, cols = 1) dup_ind = matrix(1,rows = nrow(duplicate_matrix),cols = 1) a = nrow(dup_ind) b = nrow(duplicate_matrix)