Hossein Falaki created SPARK-14224:
--------------------------------------

             Summary: Cannot project all columns from a table with ~1,100 
columns
                 Key: SPARK-14224
                 URL: https://issues.apache.org/jira/browse/SPARK-14224
             Project: Spark
          Issue Type: Bug
    Affects Versions: 2.0.0
            Reporter: Hossein Falaki


I created a temporary table from 1000 genomes dataset and cached it. When I try 
select all columns for even a single row I get following exception. Setting and 
unsetting {{spark.sql.codegen.wholeStage}} and {{spark.sql.codegen}} has no 
effect. 

{code}

val vcfData = sqlContext.read.format("csv").options(Map(
  "comment" -> "#", "header" -> "false", "delimiter" -> "\t"
)).load("/mnt/vida/1000genomes/phase1/analysis_results/integrated_call_sets/ALL.chr1.integrated_phase1_v3.20101123.snps_indels_svs.genotypes.vcf")

vcfData.registerTempTable("genomesTable")

%sql select * from genomesTable

Error in SQL statement: RuntimeException: Error while decoding: 
java.util.concurrent.ExecutionException: java.lang.Exception: failed to 
compile: org.codehaus.janino.JaninoRuntimeException: Code of method 
"(Ljava/lang/Object;)Ljava/lang/Object;" of class 
"org.apache.spark.sql.catalyst.expressions.GeneratedClass$SpecificSafeProjection"
 grows beyond 64 KB
/* 001 */ 
/* 002 */ public java.lang.Object generate(Object[] references) {
/* 003 */   return new SpecificSafeProjection(references);
{code}

cc [~rxin] 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to