[ 
https://issues.apache.org/jira/browse/SPARK-3297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Josh Rosen resolved SPARK-3297.
-------------------------------
       Resolution: Fixed
    Fix Version/s: 1.2.0
                   1.1.1
         Assignee: Hossein Falaki

Fixed by https://github.com/apache/spark/pull/2687

> [Spark SQL][UI] SchemaRDD toString with many columns messes up Storage tab 
> display
> ----------------------------------------------------------------------------------
>
>                 Key: SPARK-3297
>                 URL: https://issues.apache.org/jira/browse/SPARK-3297
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL, Web UI
>    Affects Versions: 1.0.2
>            Reporter: Evan Chan
>            Assignee: Hossein Falaki
>            Priority: Minor
>              Labels: newbie
>             Fix For: 1.1.1, 1.2.0
>
>
> When a SchemaRDD with many columns (for example, 57 columns in this example) 
> is cached using sqlContext.cacheTable, the Storage tab of the driver Web UI 
> display gets messed up, because the long string of the SchemaRDD causes the 
> first column to be much much wider than the others, and in fact much wider 
> than the width of the browser.  It would be nice to have the first column be 
> restricted to, say, 50% of the width of the browser window, with some minimum.
> For example this is the SchemaRDD text for my table:
>         RDD Storage Info for ExistingRdd 
> [ActionGeo_ADM1Code#198,ActionGeo_CountryCode#199,ActionGeo_FeatureID#200,ActionGeo_FullName#201,ActionGeo_Lat#202,ActionGeo_Long#203,ActionGeo_Type#204,Actor1Code#205,Actor1CountryCode#206,Actor1EthnicCode#207,Actor1Geo_ADM1Code#208,Actor1Geo_CountryCode#209,Actor1Geo_FeatureID#210,Actor1Geo_FullName#211,Actor1Geo_Lat#212,Actor1Geo_Long#213,Actor1Geo_Type#214,Actor1KnownGroupCode#215,Actor1Name#216,Actor1Religion1Code#217,Actor1Religion2Code#218,Actor1Type1Code#219,Actor1Type2Code#220,Actor1Type3Code#221,Actor2Code#222,Actor2CountryCode#223,Actor2EthnicCode#224,Actor2Geo_ADM1Code#225,Actor2Geo_CountryCode#226,Actor2Geo_FeatureID#227,Actor2Geo_FullName#228,Actor2Geo_Lat#229,Actor2Geo_Long#230,Actor2Geo_Type#231,Actor2KnownGroupCode#232,Actor2Name#233,Actor2Religion1Code#234,Actor2Religion2Code#235,Actor2Type1Code#236,Actor2Type2Code#237,Actor2Type3Code#238,AvgTone#239,DATEADDED#240,Day#241,EventBaseCode#242,EventCode#243,EventId#244,EventRootCode#245,FractionDate#246,GoldsteinScale#247,IsRootEvent#248,MonthYear#249,NumArticles#250,NumMentions#251,NumSources#252,QuadClass#253,Year#254],
>  MappedRDD[200]
> I would personally love to fix the toString method to not necessarily print 
> every column, but to cut it off after a while.  This would aid the printout 
> in the Spark Shell as well.  For example:
> [ActionGeo_ADM1Code#198,ActionGeo_CountryCode#199,ActionGeo_FeatureID#200,ActionGeo_FullName#201,ActionGeo_Lat#202
>  .... and 52 more columns]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to