[ https://issues.apache.org/jira/browse/SPARK-7993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14567020#comment-14567020 ]
Reynold Xin commented on SPARK-7993: ------------------------------------ Thanks. Note that once you change the show output, you might need to update some Python unit tests since some of the functions use show's output. > Improve DataFrame.show() output > ------------------------------- > > Key: SPARK-7993 > URL: https://issues.apache.org/jira/browse/SPARK-7993 > Project: Spark > Issue Type: Sub-task > Components: SQL > Reporter: Reynold Xin > Priority: Blocker > Labels: starter > > 1. Each column should be at the minimum 3 characters wide. Right now if the > widest value is 1, it is just 1 char wide, which looks ugly. Example below: > 2. If a DataFrame have more than N number of rows (N = 20 by default for > show), at the end we should display a message like "only showing the top 20 > rows". > {code} > +--+--+-+ > | a| b|c| > +--+--+-+ > | 1| 2|3| > | 1| 2|1| > | 1| 2|3| > | 3| 6|3| > | 1| 2|3| > | 5|10|1| > | 1| 2|3| > | 7|14|3| > | 1| 2|3| > | 9|18|1| > | 1| 2|3| > |11|22|3| > | 1| 2|3| > |13|26|1| > | 1| 2|3| > |15|30|3| > | 1| 2|3| > |17|34|1| > | 1| 2|3| > |19|38|3| > +--+--+-+ > only showing top 20 rows <---- add this at the end > {code} > 3. For array values, instead of printing "ArrayBuffer", we should just print > square brackets: > {code} > +------------------+------------------+-----------------+ > | a_freqItems| b_freqItems| c_freqItems| > +------------------+------------------+-----------------+ > |ArrayBuffer(11, 1)|ArrayBuffer(2, 22)|ArrayBuffer(1, 3)| > +------------------+------------------+-----------------+ > {code} > should be > {code} > +-----------+-----------+-----------+ > |a_freqItems|b_freqItems|c_freqItems| > +-----------+-----------+-----------+ > | [11, 1]| [2, 22]| [1, 3]| > +-----------+-----------+-----------+ > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org