[ https://issues.apache.org/jira/browse/ARROW-3780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16685675#comment-16685675 ]
Wes McKinney commented on ARROW-3780:
-------------------------------------

I was pretty sure this non-specific error message was going to rear its ugly head

https://github.com/apache/arrow/blob/202265fbb67685f1ed179ba080a85b48fbd53adc/r/src/arrow_types.h#L36

> [R] Failed to fetch data: invalid data when collecting int16
> ------------------------------------------------------------
>
>                 Key: ARROW-3780
>                 URL: https://issues.apache.org/jira/browse/ARROW-3780
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: R
>            Reporter: Javier Luraschi
>            Priority: Major
>              Labels: spark
>             Fix For: 0.12.0
>
>
> Repro from sparklyr unit test:
> {code:java}
> library(dplyr)
> library(sparklyr)
> library(arrow)
> sc <- spark_connect(master = "local")
> hive_type <- tibble::frame_data(
>   ~stype, ~svalue, ~rtype, ~rvalue, ~arrow,
>   "smallint", "1", "integer", "1", "integer",
> )
> spark_query <- hive_type %>%
>   mutate(
>     query = paste0("cast(", svalue, " as ", stype, ") as ", gsub("\\(|\\)", "", stype), "_col")
>   ) %>%
>   pull(query) %>%
>   paste(collapse = ", ") %>%
>   paste("SELECT", .)
> spark_types <- DBI::dbGetQuery(sc, spark_query) %>%
>   lapply(function(e) class(e)[[1]]) %>%
>   as.character(){code}
> Actual: error: Failed to fetch data: invalid data

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)