[phpMyAdmin Git] [phpmyadmin/phpmyadmin] f91301: Translated using Weblate (Turkish)

2015-08-01 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: f913014a1daa4b43bacfad356282c4d2d3e7baa3 https://github.com/phpmyadmin/phpmyadmin/commit/f913014a1daa4b43bacfad356282c4d2d3e7baa3 Author: Burak Yavuz Date: 2015-08-01 (Sat, 01 Aug 2015

Re: Which directory contains third party libraries for Spark

2015-07-28 Thread Burak Yavuz
Hey Stephen, In case these libraries exist on the client in the form of a Maven library, you can use --packages to ship the library and all its dependencies, without building an uber jar. Best, Burak On Tue, Jul 28, 2015 at 10:23 AM, Marcelo Vanzin wrote: > Hi Stephen, > > There is no such direct

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] cb77ab: Translated using Weblate (Turkish)

2015-07-23 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: cb77ab1e50fb5e3daa92faf96e34b26a1d2d109b https://github.com/phpmyadmin/phpmyadmin/commit/cb77ab1e50fb5e3daa92faf96e34b26a1d2d109b Author: Burak Yavuz Date: 2015-07-23 (Thu, 23 Jul 2015

[jira] [Created] (SPARK-9263) Add Spark Submit flag to exclude dependencies when using --packages

2015-07-22 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-9263: -- Summary: Add Spark Submit flag to exclude dependencies when using --packages Key: SPARK-9263 URL: https://issues.apache.org/jira/browse/SPARK-9263 Project: Spark

Re: How to unpersist RDDs generated by ALS/MatrixFactorizationModel

2015-07-22 Thread Burak Yavuz
Hi Jonathan, I believe calling persist with StorageLevel.NONE doesn't do anything. That's why the unpersist has an if statement before it. Could you give more information about your setup please? Number of cores, memory, number of partitions of ratings_train? Thanks, Burak On Wed, Jul 22, 2015 a

Re: RowId unique key for Dataframes

2015-07-21 Thread Burak Yavuz
Would monotonicallyIncreasingId work for you? Best, Burak On Tue, Jul 21, 2015 at 4:55 PM, Srikanth wrote: > Hello, > > I'm creating dataframes fro

Re: LinearRegressionWithSGD Outputs NaN

2015-07-21 Thread Burak Yavuz
Hi, Could you please decrease your step size to 0.1, and also try 0.01? You could also try running L-BFGS, which doesn't have step size tuning, to get better results. Best, Burak On Tue, Jul 21, 2015 at 2:59 AM, Naveen wrote: > Hi , > > I am trying to use LinearRegressionWithSGD on Million Song
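The step-size advice above can be illustrated with a minimal sketch in plain Scala. It is not MLlib code: `StepSizeSketch`, `descend`, the single data point and the two step sizes are all hypothetical, chosen only to show how an overly large step on an unscaled feature can run off to NaN while a smaller one converges.

```scala
object StepSizeSketch {
  // Plain gradient descent on the one-point least-squares loss (w * x - y)^2.
  // Illustrative only: x, y and the step sizes are made-up values, not from the thread.
  def descend(stepSize: Double, x: Double, y: Double, iterations: Int): Double = {
    var w = 0.0
    for (_ <- 0 until iterations) {
      val gradient = 2.0 * x * (w * x - y)
      w -= stepSize * gradient
    }
    w
  }

  def main(args: Array[String]): Unit = {
    // With an unscaled feature (x = 10), a step size of 1.0 overshoots on every
    // update until the weight overflows; 0.001 contracts toward the solution w = 1.
    println(descend(stepSize = 1.0, x = 10.0, y = 10.0, iterations = 200))
    println(descend(stepSize = 0.001, x = 10.0, y = 10.0, iterations = 200))
  }
}
```

In this toy setting each update multiplies the error by (1 - 2 * stepSize * x^2), so any step size that makes that factor larger than 1 in magnitude diverges. Scaling the features or shrinking the step both bring the factor back under 1.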

Re: BlockMatrix multiplication

2015-07-17 Thread Burak Yavuz
mit a JIRA Issue related to the problem of block matrix > shuffling given the blocks co-location? > > > > Best regards, Alexander > > > > *From:* Burak Yavuz [mailto:brk...@gmail.com] > *Sent:* Wednesday, July 15, 2015 3:29 PM > > *To:* Ulanov, Alexander > *Cc:* Rakesh Ch

Re: BlockMatrix multiplication

2015-07-15 Thread Burak Yavuz
> > bm.validate()
> > val t = System.nanoTime()
> > // multiply matrix with itself
> > val aa = bm.multiply(bm)
> > aa.validate()
> > println(rows + "x" + columns + ", block:" + blockSize + "\t" +
> (System.nanoTime() - t) / 1e9)
> >

Re: Running mllib from R in Spark 1.4

2015-07-15 Thread Burak Yavuz
Hi, There is no MLlib support in SparkR in 1.4. There will be some support in 1.5. You can check these JIRAs for progress: https://issues.apache.org/jira/browse/SPARK-6805 https://issues.apache.org/jira/browse/SPARK-6823 Best, Burak On Wed, Jul 15, 2015 at 6:00 AM, madhu phatak wrote: > Hi, > I

Re: MLlib LogisticRegressionWithLBFGS error

2015-07-14 Thread Burak Yavuz
Hi, Is this in LibSVM format? If so, the indices should be sorted in increasing order. It seems like they are not sorted. Best, Burak On Tue, Jul 14, 2015 at 7:31 PM, Vi Ngo Van wrote: > Hi All, > I've met a issue with MLlib when i use LogisticRegressionWithLBFGS > > my sample data : > > *0 86
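The sorted-index requirement mentioned above can be checked before loading the data. The following is a minimal sketch in plain Scala, assuming the usual `label index:value index:value ...` line layout; `LibSvmIndexCheck` and the sample lines are hypothetical and not part of MLlib.

```scala
object LibSvmIndexCheck {
  // Extract the feature indices from one LibSVM-style line: "label idx:value idx:value ...".
  def featureIndices(line: String): Seq[Int] =
    line.trim.split("\\s+").toSeq.drop(1).map(_.split(":")(0).toInt)

  // True when each index is strictly greater than the one before it.
  def hasIncreasingIndices(line: String): Boolean = {
    val indices = featureIndices(line)
    indices.zip(indices.drop(1)).forall { case (a, b) => a < b }
  }

  def main(args: Array[String]): Unit = {
    // Hypothetical sample lines: the first is sorted, the second is not.
    println(hasIncreasingIndices("1 3:0.5 7:1.0 12:2.0"))
    println(hasIncreasingIndices("0 7:1.0 3:0.5"))
  }
}
```

Running a check like this over the input file would point to the offending lines, which could then be re-sorted by index before training.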

Re: creating a distributed index

2015-07-14 Thread Burak Yavuz
Hi Swetha, IndexedRDD is available as a package on Spark Packages. Best, Burak On Tue, Jul 14, 2015 at 5:23 PM, swetha wrote: > Hi Ankur, > > Is IndexedRDD available in Spark 1.4.0? We would like to use this in Spark > Streaming to do

Re: Strange behavoir of pyspark with --jars option

2015-07-14 Thread Burak Yavuz
Hi, I believe the HiveContext uses a different class loader. It then falls back to the system class loader if it can't find the classes in the context class loader. The system class loader contains the classpath passed through --driver-class-path and spark.executor.extraClassPath. The JVM is alread

Re: To access elements of a org.apache.spark.mllib.linalg.Vector

2015-07-14 Thread Burak Yavuz
Hi Dan, You could zip the indices with the values if you like.
```
val sVec = sparseVector(1).asInstanceOf[org.apache.spark.mllib.linalg.SparseVector]
val map = sVec.indices.zip(sVec.values).toMap
```
Best, Burak On Tue, Jul 14, 2015 at 12:23 PM, Dan Dong wrote: > Hi, > I'm wondering how t

Re: BlockMatrix multiplication

2015-07-14 Thread Burak Yavuz
Hi Alexander, From your example code, using the GridPartitioner, you will have 1 column, and 5 rows. When you perform an A^T^A multiplication, you will generate a separate GridPartitioner with 5 columns and 5 rows. Therefore you are observing a huge shuffle. If you would generate a diagonal-block

Re: [MLLib][Kmeans] KMeansModel.computeCost takes lot of time

2015-07-13 Thread Burak Yavuz
irmal Fernando wrote: > I'm using; > > org.apache.spark.mllib.clustering.KMeans.train(data.rdd(), 3, 20); > > Cpu cores: 8 (using default Spark conf thought) > > On partitions, I'm not sure how to find that. > > On Mon, Jul 13, 2015 at 11:30 PM, Burak Yavuz

Re: [MLLib][Kmeans] KMeansModel.computeCost takes lot of time

2015-07-13 Thread Burak Yavuz
1.4 > > On Mon, Jul 13, 2015 at 10:28 PM, Burak Yavuz wrote: > >> Hi, >> >> How are you running K-Means? What is your k? What is the dimension of >> your dataset (columns)? Which Spark version are you using? >> >> Thanks, >> Burak

Re: [MLLib][Kmeans] KMeansModel.computeCost takes lot of time

2015-07-13 Thread Burak Yavuz
Hi, How are you running K-Means? What is your k? What is the dimension of your dataset (columns)? Which Spark version are you using? Thanks, Burak On Mon, Jul 13, 2015 at 2:53 AM, Nirmal Fernando wrote: > Hi, > > For a fairly large dataset, 30MB, KMeansModel.computeCost takes lot of > time (16

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 76f663: Translated using Weblate (Turkish)

2015-07-12 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 76f66363cbb1382ef80312bb3d612615936fc7a2 https://github.com/phpmyadmin/phpmyadmin/commit/76f66363cbb1382ef80312bb3d612615936fc7a2 Author: Burak Yavuz Date: 2015-07-12 (Sun, 12 Jul 2015

[Phpmyadmin-git] [phpmyadmin/localized_docs] e58f00: Translated using Weblate (Turkish)

2015-07-10 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/localized_docs Commit: e58f0054418b0442a88effe9437e503d7efc339f https://github.com/phpmyadmin/localized_docs/commit/e58f0054418b0442a88effe9437e503d7efc339f Author: Burak Yavuz Date: 2015-07-10 (Fri, 10 Jul 2015

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 92a63d: Translated using Weblate (Turkish)

2015-07-10 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 92a63d2cc3cff46b9a5ce087162557c2bb5c729e https://github.com/phpmyadmin/phpmyadmin/commit/92a63d2cc3cff46b9a5ce087162557c2bb5c729e Author: Burak Yavuz Date: 2015-07-10 (Fri, 10 Jul 2015

Re: Unit tests of spark application

2015-07-10 Thread Burak Yavuz
I can +1 Holden's spark-testing-base package. Burak On Fri, Jul 10, 2015 at 12:23 PM, Holden Karau wrote: > Somewhat biased of course, but you can also use spark-testing-base from > spark-packages.org as a basis for your unittests. > > On Fri, Jul 10, 2015 at 12:03 PM, Daniel Siegmann < > danie

Re: How to ignore features in mllib

2015-07-09 Thread Burak Yavuz
If you use the Pipelines Api with DataFrames, you select which columns you would like to train on using the VectorAssembler. While using the VectorAssembler, you can choose not to select some features if you like. Best, Burak On Thu, Jul 9, 2015 at 10:38 AM, Arun Luthra wrote: > Is it possible

Re: [VOTE] Release Apache Spark 1.4.1 (RC4)

2015-07-09 Thread Burak Yavuz
+1 nonbinding. On Thu, Jul 9, 2015 at 7:38 AM, Sean Owen wrote: > +1 nonbinding. All previous RC issues appear resolved. All tests pass > with the "-Pyarn -Phadoop-2.6 -Phive -Phive-thriftserver" invocation. > Signatures et al are OK. > > On Thu, Jul 9, 2015 at 6:55 AM, Patrick Wendell > wrote:

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] f4f94f: Translated using Weblate (Turkish)

2015-07-08 Thread Burak Yavuz
Branch: refs/heads/QA_4_4 Home: https://github.com/phpmyadmin/phpmyadmin Commit: f4f94fc4fcac477b229feb3eaf30e0aa74d03fb7 https://github.com/phpmyadmin/phpmyadmin/commit/f4f94fc4fcac477b229feb3eaf30e0aa74d03fb7 Author: Burak Yavuz Date: 2015-07-08 (Wed, 08 Jul 2015

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 3a3c49: Translated using Weblate (Turkish)

2015-07-08 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 3a3c4969ada9bb641abc492e497710287ce3241c https://github.com/phpmyadmin/phpmyadmin/commit/3a3c4969ada9bb641abc492e497710287ce3241c Author: Burak Yavuz Date: 2015-07-08 (Wed, 08 Jul 2015

Re: spark-submit can not resolve spark-hive_2.10

2015-07-07 Thread Burak Yavuz
spark-hive is excluded when using --packages, because it can be included in the spark-assembly by adding -Phive during mvn package or sbt assembly. Best, Burak On Tue, Jul 7, 2015 at 8:06 AM, Hao Ren wrote: > I want to add spark-hive as a dependence to submit my job, but it seems > that > spark

[jira] [Updated] (SPARK-6442) MLlib 1.4 Local Linear Algebra Package

2015-07-04 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-6442: --- Description: MLlib's local linear algebra package doesn't have any support for any type

Re: Spark 1.4 MLLib Bug?: Multiclass Classification "requirement failed: sizeInBytes was negative"

2015-07-03 Thread Burak Yavuz
How many partitions do you have? It might be that one partition is too large, and there is Integer overflow. Could you double your number of partitions? Burak On Fri, Jul 3, 2015 at 4:41 AM, Danny wrote: > hi, > > i want to run a multiclass classification with 390 classes on120k label > points(
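The overflow hypothesis above can be illustrated with plain arithmetic. This is a minimal sketch, not Spark's actual size computation: `PartitionSizeSketch` and the record counts and sizes are hypothetical, picked only so that the 32-bit product exceeds `Int.MaxValue`.

```scala
object PartitionSizeSketch {
  // Byte count computed in 32-bit Int arithmetic; wraps around once it exceeds Int.MaxValue.
  def sizeInBytesAsInt(records: Int, bytesPerRecord: Int): Int = records * bytesPerRecord

  // The same count in 64-bit Long arithmetic.
  def sizeInBytesAsLong(records: Int, bytesPerRecord: Int): Long =
    records.toLong * bytesPerRecord

  def main(args: Array[String]): Unit = {
    val records = 120000        // hypothetical number of records in one partition
    val bytesPerRecord = 20000  // hypothetical serialized size of one record
    println(sizeInBytesAsInt(records, bytesPerRecord))   // wrapped, negative
    println(sizeInBytesAsLong(records, bytesPerRecord))  // true size
    // Spreading the same data over twice as many partitions halves the per-partition count.
    println(sizeInBytesAsInt(records / 2, bytesPerRecord))
  }
}
```

Under these assumed numbers the full partition wraps to a negative size while the halved one fits in an Int, which is the intuition behind trying twice as many partitions.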

[jira] [Created] (SPARK-8803) Crosstab element's can't contain null's and back ticks

2015-07-02 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8803: -- Summary: Crosstab element's can't contain null's and back ticks Key: SPARK-8803 URL: https://issues.apache.org/jira/browse/SPARK-8803 Project: Spark

Re: coalesce on dataFrame

2015-07-01 Thread Burak Yavuz
You can use df.repartition(1) in Spark 1.4. See here. Best, Burak On Wed, Jul 1, 2015 at 3:05 AM, Olivier Girardot wrote: > PySpark or Spark (scala) ? > When you use coalesce with a

Re: breeze.linalg.DenseMatrix not found

2015-06-30 Thread Burak Yavuz
How does your build file look? Are you possibly using wrong Scala versions? Have you added Breeze as a dependency to your project? If so which version? Thanks, Burak On Mon, Jun 29, 2015 at 3:45 PM, AlexG wrote: > I get the same error even when I define covOperator not to use a matrix at > all:

Re: Can Dependencies Be Resolved on Spark Cluster?

2015-06-30 Thread Burak Yavuz
.hbase:hbase:1.1.1, junit:junit:x --repositories http://some.other.repo,http://some.other.repo2 $YOUR_JAR Best, Burak On Mon, Jun 29, 2015 at 11:33 PM, SLiZn Liu wrote: > Hi Burak, > > Is `--package` flag only available for maven, no sbt support? > > On Tue, Jun 30, 2015 at 2:26

Re: Can Dependencies Be Resolved on Spark Cluster?

2015-06-29 Thread Burak Yavuz
You can pass `--packages your:comma-separated:maven-dependencies` to spark submit if you have Spark 1.3 or greater. Best regards, Burak On Mon, Jun 29, 2015 at 10:46 PM, SLiZn Liu wrote: > Hey Spark Users, > > I'm writing a demo with Spark and HBase. What I've done is packaging a > **fat jar**:

[jira] [Created] (SPARK-8715) ArrayOutOfBoundsException for DataFrameStatSuite.crosstab

2015-06-29 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8715: -- Summary: ArrayOutOfBoundsException for DataFrameStatSuite.crosstab Key: SPARK-8715 URL: https://issues.apache.org/jira/browse/SPARK-8715 Project: Spark Issue

[jira] [Commented] (SPARK-8410) Hive VersionsSuite RuntimeException

2015-06-29 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606080#comment-14606080 ] Burak Yavuz commented on SPARK-8410: Hi Joe, Could you please check whether h

[jira] [Commented] (SPARK-8410) Hive VersionsSuite RuntimeException

2015-06-29 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605977#comment-14605977 ] Burak Yavuz commented on SPARK-8410: Hi Joe, Is it possible to delete those f

[jira] [Commented] (SPARK-8475) SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-29 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605968#comment-14605968 ] Burak Yavuz commented on SPARK-8475: ping. I think you can go ahead with a PR

[jira] [Commented] (SPARK-8599) Use a Random operator to handle Random distribution generating expressions

2015-06-29 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605777#comment-14605777 ] Burak Yavuz commented on SPARK-8599: It would be great if it works for this cas

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 68116a: Translated using Weblate (Turkish)

2015-06-28 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 68116ae860346108aa0850ffcd49c5241759562c https://github.com/phpmyadmin/phpmyadmin/commit/68116ae860346108aa0850ffcd49c5241759562c Author: Burak Yavuz Date: 2015-06-29 (Mon, 29 Jun 2015

[jira] [Created] (SPARK-8681) crosstab column names in wrong order

2015-06-27 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8681: -- Summary: crosstab column names in wrong order Key: SPARK-8681 URL: https://issues.apache.org/jira/browse/SPARK-8681 Project: Spark Issue Type: Sub-task

[Phpmyadmin-git] [phpmyadmin/localized_docs] 1e0bb6: Translated using Weblate (Turkish)

2015-06-26 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/localized_docs Commit: 1e0bb6be29e91e6e47ad7f95169c4d1c9d92cfe7 https://github.com/phpmyadmin/localized_docs/commit/1e0bb6be29e91e6e47ad7f95169c4d1c9d92cfe7 Author: Burak Yavuz Date: 2015-06-26 (Fri, 26 Jun 2015

[jira] [Commented] (SPARK-8599) Use a Random operator to handle Random distribution generating expressions

2015-06-24 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600315#comment-14600315 ] Burak Yavuz commented on SPARK-8599: cc [~marmbrus] [~rxin] > Use a Random o

[jira] [Created] (SPARK-8609) After initializing a DataFrame with random columns and a seed, ordering by that random column should return same sorted order

2015-06-24 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8609: -- Summary: After initializing a DataFrame with random columns and a seed, ordering by that random column should return same sorted order Key: SPARK-8609 URL: https://issues.apache.org

[jira] [Created] (SPARK-8608) After initializing a DataFrame with random columns and a seed, df.show should return same value

2015-06-24 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8608: -- Summary: After initializing a DataFrame with random columns and a seed, df.show should return same value Key: SPARK-8608 URL: https://issues.apache.org/jira/browse/SPARK-8608

Re: Understanding accumulator during transformations

2015-06-24 Thread Burak Yavuz
ould restarted the transformation ended up updating accumulator more than > once? > > Best, > Wei > > 2015-06-24 13:23 GMT-07:00 Burak Yavuz : > >> Hi Wei, >> >> For example, when a straggler executor gets killed in the middle of a map >> operation

Re: Understanding accumulator during transformations

2015-06-24 Thread Burak Yavuz
Hi Wei, For example, when a straggler executor gets killed in the middle of a map operation and its task is restarted at a different instance, the accumulator will be updated more than once. Best, Burak On Wed, Jun 24, 2015 at 1:08 PM, Wei Zhou wrote: > Quoting from Spark Program guide: > > "

[jira] [Resolved] (SPARK-8095) Spark package dependencies not resolved when package is in local-ivy-cache

2015-06-24 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz resolved SPARK-8095. Resolution: Fixed > Spark package dependencies not resolved when package is in local-ivy-ca

Re: [GraphX] Graph 500 graph generator

2015-06-24 Thread Burak Yavuz
Hi Ryan, If you can get past the paperwork, I'm sure this can make a great Spark Package (http://spark-packages.org). People then can use it for benchmarking purposes, and I'm sure people will be looking for graph generators! Best, Burak On Wed, Jun 24, 2015 at 7:55 AM, Carr, J. Ryan wrote: >

Re: Confusion matrix for binary classification

2015-06-22 Thread Burak Yavuz
Hi, In Spark 1.4, you may use DataFrame.stat.crosstab to generate the confusion matrix. This would be very simple if you are using the ML Pipelines Api, and are working with DataFrames. Best, Burak On Mon, Jun 22, 2015 at 4:21 AM, CD Athuraliya wrote: > Hi, > > I am looking for a way to get co
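What a cross-tabulation of label against prediction yields can be sketched on a local collection. The following is a plain-Scala analogue only: it does not call `DataFrame.stat.crosstab`, and `ConfusionMatrixSketch` and the sample pairs are hypothetical.

```scala
object ConfusionMatrixSketch {
  // Count how often each (label, prediction) pair occurs.
  def confusionCounts(pairs: Seq[(Int, Int)]): Map[(Int, Int), Int] =
    pairs.groupBy(identity).map { case (pair, occurrences) => pair -> occurrences.size }

  def main(args: Array[String]): Unit = {
    // Hypothetical (label, prediction) pairs for a binary classifier.
    val labelAndPrediction = Seq((1, 1), (1, 0), (0, 0), (0, 0), (1, 1))
    val counts = confusionCounts(labelAndPrediction)
    // One row per label, one column per prediction; absent pairs count as 0.
    for (label <- Seq(0, 1)) {
      println(Seq(0, 1).map(p => counts.getOrElse((label, p), 0)).mkString("\t"))
    }
  }
}
```

Each cell of the printed table is the number of examples with that label and that prediction, which is the layout a confusion matrix for binary classification needs.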

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 3309cd: Translated using Weblate (Turkish)

2015-06-22 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 3309cd0128fad9b207ec4dcf38638ceb7c684054 https://github.com/phpmyadmin/phpmyadmin/commit/3309cd0128fad9b207ec4dcf38638ceb7c684054 Author: Burak Yavuz Date: 2015-06-22 (Mon, 22 Jun 2015

[jira] [Commented] (SPARK-8475) SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-21 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14595363#comment-14595363 ] Burak Yavuz commented on SPARK-8475: Me too. I prefer option 1 as

Re: unsafe/compile error

2015-06-21 Thread Burak Yavuz
In addition, if you want to run a single suite, you may use: mllib/testOnly $SUITE_NAME with sbt. On Jun 21, 2015 10:32 AM, "Burak Yavuz" wrote: > You need to build an assembly jar for the cluster tests to pass. You may > use 'sbt assembly/assembly'. > > Best,

Re: unsafe/compile error

2015-06-21 Thread Burak Yavuz
You need to build an assembly jar for the cluster tests to pass. You may use 'sbt assembly/assembly'. Best, Burak On Jun 21, 2015 3:43 AM, "acidghost" wrote: > After an sbt update the tests run. But all the "cluster" ones fail on "task > size should be small in both training and prediction" > >

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 02bf44: Translated using Weblate (Turkish)

2015-06-20 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 02bf444cc2fc57fc7129c5eeea275929b7a212ac https://github.com/phpmyadmin/phpmyadmin/commit/02bf444cc2fc57fc7129c5eeea275929b7a212ac Author: Burak Yavuz Date: 2015-06-20 (Sat, 20 Jun 2015

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] a43aa4: Translated using Weblate (Turkish)

2015-06-19 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: a43aa4d24a93b97c660d76083aa1add76f073099 https://github.com/phpmyadmin/phpmyadmin/commit/a43aa4d24a93b97c660d76083aa1add76f073099 Author: Burak Yavuz Date: 2015-06-19 (Fri, 19 Jun 2015

Re: SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-18 Thread Burak Yavuz
Hey Nathan, I like the first idea better. Let's see what others think. I'd be happy to review your PR afterwards! Best, Burak On Thu, Jun 18, 2015 at 9:53 PM, Nathan McCarthy < nathan.mccar...@quantium.com.au> wrote: > Hey, > > Spark Submit adds maven central & spark bintray to the ChainResol

[jira] [Updated] (SPARK-8475) SparkSubmit with Ivy jars is very slow to load with no internet access

2015-06-18 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-8475: --- Issue Type: Improvement (was: Bug) > SparkSubmit with Ivy jars is very slow to load with no inter

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 204871: Translated using Weblate (Turkish)

2015-06-18 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 2048714b5ccee7f3e718cdbebcd290eba46155db https://github.com/phpmyadmin/phpmyadmin/commit/2048714b5ccee7f3e718cdbebcd290eba46155db Author: Burak Yavuz Date: 2015-06-18 (Thu, 18 Jun 2015

Re: --packages & Failed to load class for data source v1.4

2015-06-14 Thread Burak Yavuz
Hi Don, This seems related to a known issue, where the classpath on the driver is missing the related classes. This is a bug in py4j as py4j uses the System Classloader rather than Spark's Context Classloader. However, this problem existed in 1.3.0 as well, therefore I'm curious whether it's the sa

Re: How to read avro in SparkR

2015-06-13 Thread Burak Yavuz
Hi, Not sure if this is it, but could you please try "com.databricks.spark.avro" instead of just "avro". Thanks, Burak On Jun 13, 2015 9:55 AM, "Shing Hing Man" wrote: > Hi, > I am trying to read a avro file in SparkR (in Spark 1.4.0). > > I started R using the following. > matmsh@gauss:~$ spa

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 148cd1: Translated using Weblate (Turkish)

2015-06-13 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 148cd13b284b50185ba5fed479f79950a7873de5 https://github.com/phpmyadmin/phpmyadmin/commit/148cd13b284b50185ba5fed479f79950a7873de5 Author: Burak Yavuz Date: 2015-06-13 (Sat, 13 Jun 2015

[jira] [Created] (SPARK-8313) Support Spark Packages containing R code with --packages

2015-06-11 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8313: -- Summary: Support Spark Packages containing R code with --packages Key: SPARK-8313 URL: https://issues.apache.org/jira/browse/SPARK-8313 Project: Spark Issue

Re: [VOTE] Release Apache Spark 1.4.0 (RC4)

2015-06-04 Thread Burak Yavuz
+1 Tested on Mac OS X Burak On Thu, Jun 4, 2015 at 6:35 PM, Calvin Jia wrote: > +1 > > Tested with input from Tachyon and persist off heap. > > On Thu, Jun 4, 2015 at 6:26 PM, Timothy Chen wrote: > >> +1 >> >> Been testing cluster mode and client mode with mesos with 6 nodes cluster. >> >> Ev

Re: Ivy support in Spark vs. sbt

2015-06-04 Thread Burak Yavuz
Hi Marcelo, This is interesting. Could you please send me links to any failing builds if you see that problem? For now you can set a conf: `spark.jars.ivy` to use a path other than `~/.ivy2` for Spark. Thanks, Burak On Thu, Jun 4, 2015 at 4:29 AM, Sean Owen wrote: > I've definitely seen the "

[jira] [Commented] (SPARK-8095) Spark package dependencies not resolved when package is in local-ivy-cache

2015-06-03 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14572128#comment-14572128 ] Burak Yavuz commented on SPARK-8095: In the local ivy cache, it should use

Re: CSV Support in SparkR

2015-06-02 Thread Burak Yavuz
Hi, cc'ing Shivaram here, because he worked on this yesterday. If I'm not mistaken, you can use the following workflow:
```
./bin/sparkR --packages com.databricks:spark-csv_2.10:1.0.3
```
and then
```
df <- read.df(sqlContext, "/data", "csv", header = "true")
```
Best, Burak On Tue, Jun 2, 2015 a

[jira] [Created] (SPARK-8023) Random Number Generation inconsistent in projections in DataFrame

2015-06-01 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-8023: -- Summary: Random Number Generation inconsistent in projections in DataFrame Key: SPARK-8023 URL: https://issues.apache.org/jira/browse/SPARK-8023 Project: Spark

[jira] [Commented] (SPARK-8023) Random Number Generation inconsistent in projections in DataFrame

2015-06-01 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14568265#comment-14568265 ] Burak Yavuz commented on SPARK-8023: cc [~yhuai] > Random Number Gen

[jira] [Comment Edited] (SPARK-7944) Spark-Shell 2.11 1.4.0-RC-03 does not add jars to class path

2015-05-31 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14566700#comment-14566700 ] Burak Yavuz edited comment on SPARK-7944 at 5/31/15 7:4

[jira] [Commented] (SPARK-7944) Spark-Shell 2.11 1.4.0-RC-03 does not add jars to class path

2015-05-31 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14566700#comment-14566700 ] Burak Yavuz commented on SPARK-7944: I saw this issue with Yarn when using Scala

[jira] [Commented] (SPARK-7982) crosstab should use 0 instead of null for pairs that don't appear

2015-05-31 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14566680#comment-14566680 ] Burak Yavuz commented on SPARK-7982: The reason we used null's instead of

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 4dcd08: Translated using Weblate (Turkish)

2015-05-31 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 4dcd0848fe8ec1757665caf36b35279c2186f8cd https://github.com/phpmyadmin/phpmyadmin/commit/4dcd0848fe8ec1757665caf36b35279c2186f8cd Author: Burak Yavuz Date: 2015-05-31 (Sun, 31 May 2015

[jira] [Created] (SPARK-7957) Preserve partitioning in randomSplit in RDD.scala

2015-05-29 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7957: -- Summary: Preserve partitioning in randomSplit in RDD.scala Key: SPARK-7957 URL: https://issues.apache.org/jira/browse/SPARK-7957 Project: Spark Issue Type

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] bcefc2: Translated using Weblate (Turkish)

2015-05-27 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: bcefc28a4858f763c9d1f27b7a8d09a58458db89 https://github.com/phpmyadmin/phpmyadmin/commit/bcefc28a4858f763c9d1f27b7a8d09a58458db89 Author: Burak Yavuz Date: 2015-05-27 (Wed, 27 May 2015

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 3f47fc: Translated using Weblate (Turkish)

2015-05-27 Thread Burak Yavuz
Branch: refs/heads/QA_4_4 Home: https://github.com/phpmyadmin/phpmyadmin Commit: 3f47fcf9ea4c37688d3ed9c4a8d95986b49a1e99 https://github.com/phpmyadmin/phpmyadmin/commit/3f47fcf9ea4c37688d3ed9c4a8d95986b49a1e99 Author: Burak Yavuz Date: 2015-05-27 (Wed, 27 May 2015

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] 5bd53e: Translated using Weblate (Turkish)

2015-05-25 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/phpmyadmin Commit: 5bd53e5a4baa498856da91c05e63079a8c247f34 https://github.com/phpmyadmin/phpmyadmin/commit/5bd53e5a4baa498856da91c05e63079a8c247f34 Author: Burak Yavuz Date: 2015-05-25 (Mon, 25 May 2015

[jira] [Commented] (SPARK-7287) Flaky test: o.a.s.deploy.SparkSubmitSuite --packages

2015-05-23 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14557631#comment-14557631 ] Burak Yavuz commented on SPARK-7287: I don't understand why that's fai

[jira] [Commented] (SPARK-7785) Add pretty printing to pyspark.mllib.linalg.Matrices

2015-05-21 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14555440#comment-14555440 ] Burak Yavuz commented on SPARK-7785: For operations with BlockMatrix, you will

Re: foreach plus accumulator Vs mapPartitions performance

2015-05-21 Thread Burak Yavuz
Or you can simply use `reduceByKeyLocally` if you don't want to worry about implementing accumulators and such, and assuming that the reduced values will fit in memory of the driver (which you are assuming by using accumulators). Best, Burak On Thu, May 21, 2015 at 2:46 PM, ben wrote: > Hi, eve

[jira] [Commented] (SPARK-7785) Add missing items to pyspark.mllib.linalg.Matrices

2015-05-21 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14555313#comment-14555313 ] Burak Yavuz commented on SPARK-7785: My belief on the Python linalg api so far

Re: GradientBoostedTrees.trainRegressor with categoricalFeaturesInfo

2015-05-20 Thread Burak Yavuz
Could you please open a JIRA for it? The maxBins input is missing for the Python Api. Would it be possible for you to use the current master? In the current master, you should be able to use trees with the Pipeline Api and DataFrames. Best, Burak On Wed, May 20, 2015 at 2:44 PM, Don Drake wrote: > I

[jira] [Created] (SPARK-7745) Replace assertions with requires (IllegalArgumentException) and modify other state checks

2015-05-19 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7745: -- Summary: Replace assertions with requires (IllegalArgumentException) and modify other state checks Key: SPARK-7745 URL: https://issues.apache.org/jira/browse/SPARK-7745

[jira] [Commented] (SPARK-7486) Add the streaming implementation for estimating quantiles and median

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14535616#comment-14535616 ] Burak Yavuz commented on SPARK-7486: Yes, this is a clone of SPARK-6760 and S

[jira] [Resolved] (SPARK-7245) Spearman correlation for DataFrames

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz resolved SPARK-7245. Resolution: Done Fix Version/s: 1.4.0 > Spearman correlation for DataFra

[jira] [Reopened] (SPARK-7245) Spearman correlation for DataFrames

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz reopened SPARK-7245: Sorry, mixed this with Pearson correlation > Spearman correlation for DataFra

[jira] [Updated] (SPARK-7492) Convert LocalDataFrame to LocalMatrix

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-7492: --- Description: Having a method like, {code:java} Matrices.fromDataFrame(df) {code} would provide users

[jira] [Updated] (SPARK-7492) Convert LocalDataFrame to LocalMatrix

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-7492: --- Description: Having a method like, {code: java} Matrices.fromDataFrame(df) {code} would provide

[jira] [Created] (SPARK-7492) Convert LocalDataFrame to LocalMatrix

2015-05-08 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7492: -- Summary: Convert LocalDataFrame to LocalMatrix Key: SPARK-7492 URL: https://issues.apache.org/jira/browse/SPARK-7492 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-7488) Python API for ml.recommendation

2015-05-08 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7488: -- Summary: Python API for ml.recommendation Key: SPARK-7488 URL: https://issues.apache.org/jira/browse/SPARK-7488 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7487) Python API for ml.regression

2015-05-08 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7487: -- Summary: Python API for ml.regression Key: SPARK-7487 URL: https://issues.apache.org/jira/browse/SPARK-7487 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-7381) Missing Python API for o.a.s.ml

2015-05-08 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz updated SPARK-7381: --- Summary: Missing Python API for o.a.s.ml (was: Python API for Transformers) > Missing Python API

[Phpmyadmin-git] [phpmyadmin/localized_docs] fa89f7: Translated using Weblate (Turkish)

2015-05-07 Thread Burak Yavuz
Branch: refs/heads/master Home: https://github.com/phpmyadmin/localized_docs Commit: fa89f7e5243ea2b1f3778b5dc76d3f711f57fb45 https://github.com/phpmyadmin/localized_docs/commit/fa89f7e5243ea2b1f3778b5dc76d3f711f57fb45 Author: Burak Yavuz Date: 2015-05-07 (Thu, 07 May 2015

[Phpmyadmin-git] [phpmyadmin/phpmyadmin] aa060d: Translated using Weblate (Turkish)

2015-05-06 Thread Burak Yavuz
Branch: refs/heads/QA_4_4 Home: https://github.com/phpmyadmin/phpmyadmin Commit: aa060d37188c28cfefe2ebb9da0a3a1b779b987b https://github.com/phpmyadmin/phpmyadmin/commit/aa060d37188c28cfefe2ebb9da0a3a1b779b987b Author: Burak Yavuz Date: 2015-05-06 (Wed, 06 May 2015

[jira] [Created] (SPARK-7388) Python Api for Param[Array[T]]

2015-05-05 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7388: -- Summary: Python Api for Param[Array[T]] Key: SPARK-7388 URL: https://issues.apache.org/jira/browse/SPARK-7388 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7383) Python API for ml.feature

2015-05-05 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7383: -- Summary: Python API for ml.feature Key: SPARK-7383 URL: https://issues.apache.org/jira/browse/SPARK-7383 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7382) Python API for ml.classification

2015-05-05 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7382: -- Summary: Python API for ml.classification Key: SPARK-7382 URL: https://issues.apache.org/jira/browse/SPARK-7382 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-7381) Python API for Transformers

2015-05-05 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-7381: -- Summary: Python API for Transformers Key: SPARK-7381 URL: https://issues.apache.org/jira/browse/SPARK-7381 Project: Spark Issue Type: Umbrella

Re: ReduceByKey and sorting within partitions

2015-05-04 Thread Burak Yavuz
I think this Spark Package may be what you're looking for! http://spark-packages.org/package/tresata/spark-sorted Best, Burak On Mon, May 4, 2015 at 12:56 PM, Imran Rashid wrote: > oh wow, that is a really interesting observation, Marco & Jerry. > I wonder if this is worth exposing in combineBy

[jira] [Commented] (SPARK-7306) SPARK-7224 broke build with jdk6

2015-05-01 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14523592#comment-14523592 ] Burak Yavuz commented on SPARK-7306: I'll submit a patch using Guava withi
