[ https://issues.apache.org/jira/browse/SPARK-18133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15610968#comment-15610968 ]
Miao Wang commented on SPARK-18133: ----------------------------------- Use Pyspark: >>> training = spark.createDataFrame([ ... (0L, "a b c d e spark", 1.0), ... (1L, "b d", 0.0), ... (2L, "spark f g h", 1.0), ... (3L, "hadoop mapreduce", 0.0) ... ], ["id", "text", "label"]) >>> training DataFrame[id: bigint, text: string, label: double] There is no such error. > Python ML Pipeline Example has syntax errors > -------------------------------------------- > > Key: SPARK-18133 > URL: https://issues.apache.org/jira/browse/SPARK-18133 > Project: Spark > Issue Type: Bug > Components: Examples, ML > Affects Versions: 2.0.1 > Environment: OS X > Reporter: Nirmal Fernando > Priority: Minor > Labels: easyfix > > $ ./bin/spark-submit examples/src/main/python/ml/pipeline_example.py > File > "/spark-2.0.0-bin-hadoop2.7/examples/src/main/python/ml/pipeline_example.py", > line 38 > (0L, "a b c d e spark", 1.0), > ^ > SyntaxError: invalid syntax > Removing 'L' from all occurrences resolves the issue. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org