Hossein Falaki created SPARK-14224: -------------------------------------- Summary: Cannot project all columns from a table with ~1,100 columns Key: SPARK-14224 URL: https://issues.apache.org/jira/browse/SPARK-14224 Project: Spark Issue Type: Bug Affects Versions: 2.0.0 Reporter: Hossein Falaki
I created a temporary table from 1000 genomes dataset and cached it. When I try select all columns for even a single row I get following exception. Setting and unsetting {{spark.sql.codegen.wholeStage}} and {{spark.sql.codegen}} has no effect. {code} val vcfData = sqlContext.read.format("csv").options(Map( "comment" -> "#", "header" -> "false", "delimiter" -> "\t" )).load("/mnt/vida/1000genomes/phase1/analysis_results/integrated_call_sets/ALL.chr1.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.vcf") vcfData.registerTempTable("genomesTable") %sql select * from genomesTable Error in SQL statement: RuntimeException: Error while decoding: java.util.concurrent.ExecutionException: java.lang.Exception: failed to compile: org.codehaus.janino.JaninoRuntimeException: Code of method "(Ljava/lang/Object;)Ljava/lang/Object;" of class "org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificSafeProjection" grows beyond 64 KB /* 001 */ /* 002 */ public java.lang.Object generate(Object[] references) { /* 003 */ return new SpecificSafeProjection(references); {code} cc [~rxin] -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org