Rajesh Balamohan created SPARK-43079:
Summary: Add bloom filter details in spark history server
plans/SVGs
Key: SPARK-43079
URL: https://issues.apache.org/jira/browse/SPARK-43079
Project: Spark
Rajesh Balamohan created SPARK-32225:
Summary: Parquet footer information is read twice
Key: SPARK-32225
URL: https://issues.apache.org/jira/browse/SPARK-32225
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-32225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-32225:
-
Attachment: spark_parquet_footer_reads.png
> Parquet footer information is read twice
>
[
https://issues.apache.org/jira/browse/SPARK-32225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-32225:
-
Description:
When running queries, spark reads parquet footer information twice. In clou
[
https://issues.apache.org/jira/browse/SPARK-22599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16277754#comment-16277754
]
Rajesh Balamohan commented on SPARK-22599:
--
[~CodingCat] - Thanks for sharing re
Rajesh Balamohan created SPARK-21971:
Summary: Too many open files in Spark due to concurrent files
being opened
Key: SPARK-21971
URL: https://issues.apache.org/jira/browse/SPARK-21971
Project: Sp
[
https://issues.apache.org/jira/browse/SPARK-12998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15653491#comment-15653491
]
Rajesh Balamohan commented on SPARK-12998:
--
Sure. Thanks [~dongjoon]
> Enable O
[
https://issues.apache.org/jira/browse/SPARK-16948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-16948:
-
Summary: Use metastore schema instead of inferring schema for ORC in
HiveMetastoreCatalog
[
https://issues.apache.org/jira/browse/SPARK-16948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-16948:
-
Summary: Use metastore schema instead of inferring schema in ORC in
HiveMetastoreCatalog
[
https://issues.apache.org/jira/browse/SPARK-16948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-16948:
-
Summary: Support empty orc table when converting hive serde table to data
source table (
Rajesh Balamohan created SPARK-17179:
Summary: Consider improving partition pruning in
HiveMetastoreCatalog
Key: SPARK-17179
URL: https://issues.apache.org/jira/browse/SPARK-17179
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-17036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15420984#comment-15420984
]
Rajesh Balamohan commented on SPARK-17036:
--
When large number of jobs are run in
[
https://issues.apache.org/jira/browse/SPARK-12920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15418827#comment-15418827
]
Rajesh Balamohan commented on SPARK-12920:
--
Thanks [~vanzin] . I have created SP
Rajesh Balamohan created SPARK-17036:
Summary: Hadoop config caching could lead to memory pressure and
high CPU usage in thrift server
Key: SPARK-17036
URL: https://issues.apache.org/jira/browse/SPARK-17036
[
https://issues.apache.org/jira/browse/SPARK-12920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12920:
-
Summary: Honor "spark.ui.retainedStages" to reduce mem-pressure (was: Fix
high CPU usage
[
https://issues.apache.org/jira/browse/SPARK-16948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-16948:
-
Description:
Querying empty partitioned ORC tables from spark-sql throws exception with
Rajesh Balamohan created SPARK-16948:
Summary: Querying empty partitioned orc tables throw exceptions
Key: SPARK-16948
URL: https://issues.apache.org/jira/browse/SPARK-16948
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-16948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-16948:
-
Summary: Querying empty partitioned orc tables throws exception (was:
Querying empty par
[
https://issues.apache.org/jira/browse/SPARK-12920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12920:
-
Summary: Fix high CPU usage in spark thrift server with concurrent users
(was: Spark thr
[
https://issues.apache.org/jira/browse/SPARK-14387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14387:
-
Summary: Enable Hive-1.x ORC compatibility with
spark.sql.hive.convertMetastoreOrc (was:
[
https://issues.apache.org/jira/browse/SPARK-14752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14752:
-
Summary: LazilyGenerateOrdering throws NullPointerException (was:
LazilyGenerateOrdering
[
https://issues.apache.org/jira/browse/SPARK-14752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249576#comment-15249576
]
Rajesh Balamohan edited comment on SPARK-14752 at 4/20/16 11:42 AM:
---
[
https://issues.apache.org/jira/browse/SPARK-14752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249576#comment-15249576
]
Rajesh Balamohan commented on SPARK-14752:
--
Changing generatedOrdering in Lazily
Rajesh Balamohan created SPARK-14752:
Summary: LazilyGenerateOrdering throws NullPointerException with
TakeOrderedAndProject
Key: SPARK-14752
URL: https://issues.apache.org/jira/browse/SPARK-14752
[
https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14521:
-
Summary: StackOverflowError in Kryo when executing TPC-DS (was:
StackOverflowError in Kr
[
https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249012#comment-15249012
]
Rajesh Balamohan edited comment on SPARK-14521 at 4/20/16 12:31 AM:
---
[
https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15249012#comment-15249012
]
Rajesh Balamohan commented on SPARK-14521:
--
Update:
- By default, spark-thrift s
Rajesh Balamohan created SPARK-14588:
Summary: Consider getting column stats from files (wherever
feasible) to get better stats for joins
Key: SPARK-14588
URL: https://issues.apache.org/jira/browse/SPARK-14588
[
https://issues.apache.org/jira/browse/SPARK-14551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14551:
-
Summary: Reduce number of NameNode calls in OrcRelation with
FileSourceStrategy mode (wa
Rajesh Balamohan created SPARK-14551:
Summary: Reduce number of NN calls in OrcRelation with
FileSourceStrategy mode
Key: SPARK-14551
URL: https://issues.apache.org/jira/browse/SPARK-14551
Project
[
https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15234380#comment-15234380
]
Rajesh Balamohan commented on SPARK-14521:
--
Build with commit f8c9beca38f1f396eb
[
https://issues.apache.org/jira/browse/SPARK-14521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14521:
-
Summary: StackOverflowError in Kryo when executing TPC-DS Query27 (was:
StackOverflowErr
Rajesh Balamohan created SPARK-14521:
Summary: StackOverflowError when executing TPC-DS Query27
Key: SPARK-14521
URL: https://issues.apache.org/jira/browse/SPARK-14521
Project: Spark
Issu
[
https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14520:
-
Description:
Build details: Spark build from master branch (Apr-10)
TPC-DS at 200 GB sca
[
https://issues.apache.org/jira/browse/SPARK-14520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14520:
-
Description:
Build details: Spark build from master branch (Apr-10)
TPC-DS at 200 GB sca
Rajesh Balamohan created SPARK-14520:
Summary: ClasscastException thrown with
spark.sql.parquet.enableVectorizedReader=true
Key: SPARK-14520
URL: https://issues.apache.org/jira/browse/SPARK-14520
Rajesh Balamohan created SPARK-14387:
Summary: Exceptions thrown when querying ORC tables
Key: SPARK-14387
URL: https://issues.apache.org/jira/browse/SPARK-14387
Project: Spark
Issue Type
[
https://issues.apache.org/jira/browse/SPARK-14321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14321:
-
Summary: Reduce date format cost in date functions (was: Reduce date
format cost and str
[
https://issues.apache.org/jira/browse/SPARK-14321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-14321:
-
Summary: Reduce date format cost and string-to-date cost in date functions
(was: Reduce
Rajesh Balamohan created SPARK-14321:
Summary: Reduce DateFormat cost in datetimeExpressions
Key: SPARK-14321
URL: https://issues.apache.org/jira/browse/SPARK-14321
Project: Spark
Issue T
Rajesh Balamohan created SPARK-14286:
Summary: Empty ORC table join throws exception
Key: SPARK-14286
URL: https://issues.apache.org/jira/browse/SPARK-14286
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-14113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15210038#comment-15210038
]
Rajesh Balamohan commented on SPARK-14113:
--
[~srowen] - In some cases, queries h
Rajesh Balamohan created SPARK-14113:
Summary: Consider marking JobConf closure-cleaning in HadoopRDD as
optional
Key: SPARK-14113
URL: https://issues.apache.org/jira/browse/SPARK-14113
Project: S
Rajesh Balamohan created SPARK-14091:
Summary: Consider improving performance of
SparkContext.getCallSite()
Key: SPARK-14091
URL: https://issues.apache.org/jira/browse/SPARK-14091
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-12925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15176734#comment-15176734
]
Rajesh Balamohan commented on SPARK-12925:
--
Earlier fix had a problem when Text
[
https://issues.apache.org/jira/browse/SPARK-13542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-13542:
-
Summary: Fix HiveInspectors.unwrap (was: Fix HiveInspectors.unwrap (Hive
suite failures)
Rajesh Balamohan created SPARK-13542:
Summary: Fix HiveInspectors.unwrap (Hive suite failures)
Key: SPARK-13542
URL: https://issues.apache.org/jira/browse/SPARK-13542
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-13059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15121214#comment-15121214
]
Rajesh Balamohan commented on SPARK-13059:
--
Thanks [~srowen]. The same problem w
Rajesh Balamohan created SPARK-13059:
Summary: Sort inputsplits by size in HadoopRDD to avoid long tails
Key: SPARK-13059
URL: https://issues.apache.org/jira/browse/SPARK-13059
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-12998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12998:
-
Description:
When a user connects via spark-thrift server to execute SQL, it does not ena
[
https://issues.apache.org/jira/browse/SPARK-12998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12998:
-
Description:
When a user connects via spark-thrift server to execute SQL, it does not ena
Rajesh Balamohan created SPARK-12998:
Summary: Enable OrcRelation when connecting via spark thrift server
Key: SPARK-12998
URL: https://issues.apache.org/jira/browse/SPARK-12998
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-12948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12948:
-
Attachment: SPARK-12948.mem.prof.snapshot.png
> Consider reducing size of broadcasts in O
[
https://issues.apache.org/jira/browse/SPARK-12948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12948:
-
Attachment: SPARK-12948_cpuProf.png
> Consider reducing size of broadcasts in OrcRelation
Rajesh Balamohan created SPARK-12948:
Summary: Consider reducing size of broadcasts in OrcRelation
Key: SPARK-12948
URL: https://issues.apache.org/jira/browse/SPARK-12948
Project: Spark
I
[
https://issues.apache.org/jira/browse/SPARK-12925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12925:
-
Attachment: SPARK-12925_profiler_cpu_samples.png
> Improve HiveInspectors.unwrap for
> S
Rajesh Balamohan created SPARK-12925:
Summary: Improve HiveInspectors.unwrap for
StringObjectInspector.getPrimitiveWritableObject
Key: SPARK-12925
URL: https://issues.apache.org/jira/browse/SPARK-12925
[
https://issues.apache.org/jira/browse/SPARK-12920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12920:
-
Attachment: SPARK-12920.profiler.png
SPARK-12920.profiler_job_progress_lis
Rajesh Balamohan created SPARK-12920:
Summary: Spark thrift server can run at very high CPU with
concurrent users
Key: SPARK-12920
URL: https://issues.apache.org/jira/browse/SPARK-12920
Project: S
[
https://issues.apache.org/jira/browse/SPARK-12898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12898:
-
Attachment: callsiteProf.png
> Consider having dummyCallSite for HiveTableScan
>
[
https://issues.apache.org/jira/browse/SPARK-12898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12898:
-
Attachment: (was: callsiteProf)
> Consider having dummyCallSite for HiveTableScan
> -
[
https://issues.apache.org/jira/browse/SPARK-12898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12898:
-
Attachment: callsiteProf
> Consider having dummyCallSite for HiveTableScan
>
Rajesh Balamohan created SPARK-12898:
Summary: Consider having dummyCallSite for HiveTableScan
Key: SPARK-12898
URL: https://issues.apache.org/jira/browse/SPARK-12898
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097836#comment-15097836
]
Rajesh Balamohan commented on SPARK-12803:
--
Letting the profiler agent run on al
[
https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15097091#comment-15097091
]
Rajesh Balamohan commented on SPARK-12803:
--
It is for connecting to profiler. Ad
[
https://issues.apache.org/jira/browse/SPARK-12803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12803:
-
Issue Type: Improvement (was: Bug)
> Consider adding ability to profile specific instanc
Rajesh Balamohan created SPARK-12803:
Summary: Consider adding ability to profile specific instances of
executors in spark
Key: SPARK-12803
URL: https://issues.apache.org/jira/browse/SPARK-12803
P
[
https://issues.apache.org/jira/browse/SPARK-12417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Balamohan updated SPARK-12417:
-
Attachment: SPARK-12417.1.patch
> Orc bloom filter options are not propagated during file
Rajesh Balamohan created SPARK-12417:
Summary: Orc bloom filter options are not propagated during file
write in spark
Key: SPARK-12417
URL: https://issues.apache.org/jira/browse/SPARK-12417
Projec
69 matches
Mail list logo