Mathew created ZEPPELIN-3462: -------------------------------- Summary: DataFrames with tabs get corrupted in SQL interpreter. Key: ZEPPELIN-3462 URL: https://issues.apache.org/jira/browse/ZEPPELIN-3462 Project: Zeppelin Issue Type: Bug Components: Interpreters Affects Versions: 0.7.3 Reporter: Mathew Fix For: 0.8.0 Attachments: image-2018-05-16-09-49-44-647.png
If there is a tab in a dataframe, the SQL interpreter will interpret this as a new column, causing the table display to chomp of some of the following columns. *Steps to Reproduce:* Create dataframe with tab: {code:java} %spark.pyspark from pyspark.sql import Row # Create dataframe with 3 cols df = sc.parallelize([ Row(u'First col, \u0009 still first col.', 'Second col', 'Third col') ] ).toDF() # Display table df.show() # Register table for SQL df.registerTempTable("df"){code} Query in SQL interpreter: {code:java} %sql SELECT * FROM df {code} Output: !image-2018-05-16-09-49-44-647.png! -- This message was sent by Atlassian JIRA (v7.6.3#76005)