[jira] [Created] (ZEPPELIN-3559) z.show only shows 13 rows from dataframe of 100
Matthew J Meyer created ZEPPELIN-3559: - Summary: z.show only shows 13 rows from dataframe of 100 Key: ZEPPELIN-3559 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3559 Project: Zeppelin Issue Type: Bug Components: GUI Affects Versions: 0.8.1 Reporter: Matthew J Meyer Attachments: Screenshot from 2018-06-20 22-27-24.png Running: {{%spark}} {{val l = (1 to 100).toSeq}} {{val v = List.tabulate(100)(n => n * n)}} {{val df = sc.parallelize(l zip v).toDF("label","value")}} {{df.show(100)}} {{z.show(df)}} results in expected interpreter output of spark showing 100 rows but the table shown in zeppelin via z.show only shows 13 rows. If I switch to use the any of the chart visualizations (bar, line, area, etc), zeppelin correctly shows the expected 100 elements on the x axis. zeppelin.spark.maxResult is set to default of 1000 Zeppelin was built from branch-0.8 using: mvn clean package -Pspark-2.1 -Phadoop-2.4 -Pscala-2.10 -DskipTests Running the same on 0.7.3 results in expected 100 rows in zeppelin table. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ZEPPELIN-3547) Spark scala line breaks not working (illegal start of definition)
Matthew J Meyer created ZEPPELIN-3547: - Summary: Spark scala line breaks not working (illegal start of definition) Key: ZEPPELIN-3547 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3547 Project: Zeppelin Issue Type: Bug Components: Interpreters Affects Versions: 0.8.1 Reporter: Matthew J Meyer If I format spark scala statements with line breaks (for readability) and then run the para I get: {{:1: error: illegal start of definition .read.format("com.databricks.spark.csv")}} The following will produce this error: {{%spark}} {{val path = z.textbox("dataDir")}} {{val filename = z.textbox("projectName")}}{{val dfArticles = spark}} {{ .read.format("com.databricks.spark.csv")}} {{ .option("header", "true")}} {{ .option("inferSchema", "true")}} {{ .load(path+"/"+filename+"-articles.csv"}} If the entire statement is on one continuous line, it runs successfully. I am a aware that I can wrap the block with curly braces to get around this but that is not really a solution since it affects the scope of the statements making things defined in the block not available in other paragraphs in the notebook. Using continuous lines is not really a workable option when working with anything more than trivial spark queries. I'd really like to be using things like Helium spell extensions, notebook level form fields and all of the other good stuff in 0.8.0 but this issue is so debilitating it has me sticking with 0.7.3. Please fix this soon. I built from branch-0.8.0 with: {{mvn clean package -Pspark-2.1 -Phadoop-2.4 -Pscala-2.11 -DskipTests}} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ZEPPELIN-3544) Option to show "hours ago" style timestamps at end of each paragraph
Matthew J Meyer created ZEPPELIN-3544: - Summary: Option to show "hours ago" style timestamps at end of each paragraph Key: ZEPPELIN-3544 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3544 Project: Zeppelin Issue Type: Improvement Components: GUI Affects Versions: 0.7.3 Reporter: Matthew J Meyer Currently the last execution duration and timestamp are show at the bottom of each paragraph as in these two examples: Took 1 sec. Last updated by anonymous at June 13 2018, 7:26:11 PM. (outdated) Took 2 sec. Last updated by anonymous at June 14 2018, 3:43:07 PM. Mostly in the case where a paragraph is marked (outdated), it would be helpful to see the last execution timestamp expressed as "n Hours Ago" | "n Minutes ago" |"Moments Ago", etc. -- This message was sent by Atlassian JIRA (v7.6.3#76005)