[jira] [Created] (ZEPPELIN-3559) z.show only shows 13 rows from dataframe of 100

2018-06-20 Thread Matthew J Meyer (JIRA)
Matthew J Meyer created ZEPPELIN-3559:
-

 Summary: z.show only shows 13 rows from dataframe of 100
 Key: ZEPPELIN-3559
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3559
 Project: Zeppelin
  Issue Type: Bug
  Components: GUI
Affects Versions: 0.8.1
Reporter: Matthew J Meyer
 Attachments: Screenshot from 2018-06-20 22-27-24.png

Running:

{{%spark}}
{{val l = (1 to 100).toSeq}}
{{val v = List.tabulate(100)(n => n * n)}}
{{val df = sc.parallelize(l zip v).toDF("label","value")}}

{{df.show(100)}}
{{z.show(df)}}

results in expected interpreter output of spark showing 100 rows but the table 
shown in zeppelin via z.show only shows 13 rows.

If I switch to use the any of the chart visualizations (bar, line, area, etc), 
zeppelin correctly shows the expected 100 elements on the x axis.

zeppelin.spark.maxResult is set to default of 1000

Zeppelin was built from branch-0.8 using:

mvn clean package -Pspark-2.1 -Phadoop-2.4 -Pscala-2.10 -DskipTests

Running the same on 0.7.3 results in expected 100 rows in zeppelin table.

 

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ZEPPELIN-3547) Spark scala line breaks not working (illegal start of definition)

2018-06-16 Thread Matthew J Meyer (JIRA)
Matthew J Meyer created ZEPPELIN-3547:
-

 Summary: Spark scala line breaks not working (illegal start of 
definition)
 Key: ZEPPELIN-3547
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3547
 Project: Zeppelin
  Issue Type: Bug
  Components: Interpreters
Affects Versions: 0.8.1
Reporter: Matthew J Meyer


If I format spark scala statements with line breaks (for readability) and then 
run the para I get:

{{:1: error: illegal start of definition 
.read.format("com.databricks.spark.csv")}}

The following will produce this error:

{{%spark}}
{{val path = z.textbox("dataDir")}}
{{val filename = z.textbox("projectName")}}{{val dfArticles = spark}}
{{ .read.format("com.databricks.spark.csv")}}
{{ .option("header", "true")}}
{{ .option("inferSchema", "true")}}
{{ .load(path+"/"+filename+"-articles.csv"}}

If the entire statement is on one continuous line, it runs successfully.

I am a aware that I can wrap the block with curly braces to get around this but 
that is not really a solution since it affects the scope of the statements 
making things defined in the block not available in other paragraphs in the 
notebook.

Using continuous lines is not really a workable option when working with 
anything more than trivial spark queries.

I'd really like to be using things like Helium spell extensions, notebook level 
form fields and all of the other good stuff in 0.8.0 but this issue is so 
debilitating it has me sticking with 0.7.3.

Please fix this soon.

I built from branch-0.8.0 with:

{{mvn clean package -Pspark-2.1 -Phadoop-2.4 -Pscala-2.11 -DskipTests}}

 

 

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ZEPPELIN-3544) Option to show "hours ago" style timestamps at end of each paragraph

2018-06-14 Thread Matthew J Meyer (JIRA)
Matthew J Meyer created ZEPPELIN-3544:
-

 Summary: Option to show "hours ago" style timestamps at end of 
each paragraph
 Key: ZEPPELIN-3544
 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3544
 Project: Zeppelin
  Issue Type: Improvement
  Components: GUI
Affects Versions: 0.7.3
Reporter: Matthew J Meyer


Currently the last execution duration and timestamp are show at the bottom of 
each paragraph as in these two examples:

Took 1 sec. Last updated by anonymous at June 13 2018, 7:26:11 PM. (outdated)

Took 2 sec. Last updated by anonymous at June 14 2018, 3:43:07 PM.

 

Mostly in the case where a paragraph is marked (outdated), it would be helpful 
to see the last execution timestamp expressed as "n Hours Ago" | "n Minutes 
ago" |"Moments Ago", etc.

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)