[jira] [Created] (DRILL-5028) Opening profiles page from web ui gets very slow when a lot of history files have been stored in HDFS or Local FS.

2016-11-09 Thread Hongze Zhang (JIRA)
Hongze Zhang created DRILL-5028: --- Summary: Opening profiles page from web ui gets very slow when a lot of history files have been stored in HDFS or Local FS. Key: DRILL-5028 URL: https://issues.apache.org/jira/brows

[jira] [Assigned] (DRILL-5027) ExternalSortBatch is inefficient, leaks files for large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5027: -- Assignee: Paul Rogers > ExternalSortBatch is inefficient, leaks files for large queries > -

[jira] [Issue Comment Deleted] (DRILL-5027) ExternalSortBatch is inefficient, leaks files for large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5027: --- Comment: was deleted (was: Inspection reveals that the intermediate spill events do not clean up the

[jira] [Issue Comment Deleted] (DRILL-5027) ExternalSortBatch is inefficient, leaks files for large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5027: --- Comment: was deleted (was: Appears that the above comment is slightly off. Actual behavior is that wh

[jira] [Updated] (DRILL-5027) ExternalSortBatch is inefficient, leaks files for large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5027: --- Description: The {{ExternalSortBatch}} (ESB) operator sorts data while spilling to disk as needed to

[jira] [Updated] (DRILL-5027) ExternalSortBatch is inefficient, leaks files for large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5027: --- Summary: ExternalSortBatch is inefficient, leaks files for large queries (was: ExternalSortBatch is e

[jira] [Updated] (DRILL-5027) ExternalSortBatch is efficient, leaks files for large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5027: --- Summary: ExternalSortBatch is efficient, leaks files for large queries (was: ExternalSortBatch can us

[jira] [Commented] (DRILL-5027) ExternalSortBatch can use excessive memory for very large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15653085#comment-15653085 ] Paul Rogers commented on DRILL-5027: Inspection reveals that the intermediate spill ev

[jira] [Commented] (DRILL-5027) ExternalSortBatch can use excessive memory for very large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15653086#comment-15653086 ] Paul Rogers commented on DRILL-5027: Inspection reveals that the intermediate spill ev

[jira] [Commented] (DRILL-5027) ExternalSortBatch can use excessive memory for very large queries

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15653067#comment-15653067 ] Paul Rogers commented on DRILL-5027: Appears that the above comment is slightly off. A

[jira] [Created] (DRILL-5027) ExternalSortBatch can use excessive memory for very large queries

2016-11-09 Thread Paul Rogers (JIRA)
Paul Rogers created DRILL-5027: -- Summary: ExternalSortBatch can use excessive memory for very large queries Key: DRILL-5027 URL: https://issues.apache.org/jira/browse/DRILL-5027 Project: Apache Drill

[jira] [Commented] (DRILL-4706) Fragment planning causes Drillbits to read remote chunks when local copies are available

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652712#comment-15652712 ] ASF GitHub Bot commented on DRILL-4706: --- Github user ppadma commented on the issue:

[jira] [Created] (DRILL-5026) ExternalSortBatch uses two memory allocators; one will do

2016-11-09 Thread Paul Rogers (JIRA)
Paul Rogers created DRILL-5026: -- Summary: ExternalSortBatch uses two memory allocators; one will do Key: DRILL-5026 URL: https://issues.apache.org/jira/browse/DRILL-5026 Project: Apache Drill Is

[jira] [Assigned] (DRILL-5026) ExternalSortBatch uses two memory allocators; one will do

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5026: -- Assignee: Paul Rogers > ExternalSortBatch uses two memory allocators; one will do > ---

[jira] [Commented] (DRILL-4935) Allow drillbits to advertise a configurable host address to Zookeeper

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652467#comment-15652467 ] ASF GitHub Bot commented on DRILL-4935: --- Github user zfong commented on the issue:

[jira] [Commented] (DRILL-4935) Allow drillbits to advertise a configurable host address to Zookeeper

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652415#comment-15652415 ] ASF GitHub Bot commented on DRILL-4935: --- Github user paul-rogers commented on the is

[jira] [Commented] (DRILL-4935) Allow drillbits to advertise a configurable host address to Zookeeper

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652341#comment-15652341 ] ASF GitHub Bot commented on DRILL-4935: --- Github user harrisonmebane commented on the

[jira] [Created] (DRILL-5025) ExternalSortBatch provides weak control over spill file size

2016-11-09 Thread Paul Rogers (JIRA)
Paul Rogers created DRILL-5025: -- Summary: ExternalSortBatch provides weak control over spill file size Key: DRILL-5025 URL: https://issues.apache.org/jira/browse/DRILL-5025 Project: Apache Drill

[jira] [Updated] (DRILL-4373) Drill and Hive have incompatible timestamp representations in parquet

2016-11-09 Thread Kunal Khatua (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kunal Khatua updated DRILL-4373: Reviewer: Krystal (was: Rahul Challapalli) [~knguyen] Can you look at this and verify the fix? [~rk

[jira] [Commented] (DRILL-4935) Allow drillbits to advertise a configurable host address to Zookeeper

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652095#comment-15652095 ] ASF GitHub Bot commented on DRILL-4935: --- Github user paul-rogers commented on the is

[jira] [Commented] (DRILL-4979) Make dataport configurable

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15652032#comment-15652032 ] ASF GitHub Bot commented on DRILL-4979: --- Github user paul-rogers commented on the is

[jira] [Commented] (DRILL-4979) Make dataport configurable

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651993#comment-15651993 ] ASF GitHub Bot commented on DRILL-4979: --- Github user xhochy commented on the issue:

[jira] [Commented] (DRILL-4935) Allow drillbits to advertise a configurable host address to Zookeeper

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651922#comment-15651922 ] ASF GitHub Bot commented on DRILL-4935: --- Github user harrisonmebane commented on the

[jira] [Created] (DRILL-5024) CTAS with LIMIT 0 query in SELECT stmt does not create parquet file

2016-11-09 Thread Khurram Faraaz (JIRA)
Khurram Faraaz created DRILL-5024: - Summary: CTAS with LIMIT 0 query in SELECT stmt does not create parquet file Key: DRILL-5024 URL: https://issues.apache.org/jira/browse/DRILL-5024 Project: Apache D

[jira] [Commented] (DRILL-5009) Query with a simple join fails on Hive generated parquet

2016-11-09 Thread Sudheesh Katkam (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651591#comment-15651591 ] Sudheesh Katkam commented on DRILL-5009: Fixed in [4b1902c|https://github.com/apa

[jira] [Commented] (DRILL-5007) Dynamic UDF lazy-init does not work correctly in multi-node cluster

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651580#comment-15651580 ] ASF GitHub Bot commented on DRILL-5007: --- Github user asfgit closed the pull request

[jira] [Commented] (DRILL-5009) Query with a simple join fails on Hive generated parquet

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651579#comment-15651579 ] ASF GitHub Bot commented on DRILL-5009: --- Github user asfgit closed the pull request

[jira] [Updated] (DRILL-5007) Dynamic UDF lazy-init does not work correctly in multi-node cluster

2016-11-09 Thread Sudheesh Katkam (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sudheesh Katkam updated DRILL-5007: --- Fix Version/s: 1.9.0 > Dynamic UDF lazy-init does not work correctly in multi-node cluster > -

[jira] [Commented] (DRILL-5009) Query with a simple join fails on Hive generated parquet

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651488#comment-15651488 ] ASF GitHub Bot commented on DRILL-5009: --- Github user sudheeshkatkam commented on the

[jira] [Updated] (DRILL-5020) ExternalSortBatch has inconsistent notions of the memory limit

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5020: --- Summary: ExternalSortBatch has inconsistent notions of the memory limit (was: ExernalSortBatch has in

[jira] [Assigned] (DRILL-5020) ExernalSortBatch has inconsistent notions of the memory limit

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5020: -- Assignee: Paul Rogers > ExernalSortBatch has inconsistent notions of the memory limit > ---

[jira] [Updated] (DRILL-5012) External Sort Batch does not check free disk space before writing

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers updated DRILL-5012: --- Summary: External Sort Batch does not check free disk space before writing (was: External Sort Batch

[jira] [Assigned] (DRILL-5014) ExternalSortBatch cache size, spill count differs from config setting

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5014: -- Assignee: Paul Rogers > ExternalSortBatch cache size, spill count differs from config setting >

[jira] [Assigned] (DRILL-5022) ExternalSortBatch sets two different limits for "copier" memory

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5022: -- Assignee: Paul Rogers > ExternalSortBatch sets two different limits for "copier" memory > -

[jira] [Assigned] (DRILL-5011) External Sort Batch memory use depends on record width

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5011: -- Assignee: Paul Rogers > External Sort Batch memory use depends on record width > --

[jira] [Assigned] (DRILL-5008) Refactor, document and simplify ExternalSortBatch

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5008: -- Assignee: Paul Rogers > Refactor, document and simplify ExternalSortBatch > ---

[jira] [Assigned] (DRILL-5023) ExternalSortBatch does not spill fully, throws off spill calculations

2016-11-09 Thread Paul Rogers (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paul Rogers reassigned DRILL-5023: -- Assignee: Paul Rogers > ExternalSortBatch does not spill fully, throws off spill calculations >

[jira] [Closed] (DRILL-4870) drill-config.sh sets JAVA_HOME incorrectly for the Mac

2016-11-09 Thread Krystal (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krystal closed DRILL-4870. -- Verified on Mac the without setting JAVA_HOME, drill does set it and sqlline in embedded mode starts successfully.

[jira] [Commented] (DRILL-5009) Query with a simple join fails on Hive generated parquet

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651444#comment-15651444 ] ASF GitHub Bot commented on DRILL-5009: --- GitHub user parthchandra opened a pull requ

[jira] [Commented] (DRILL-4980) Upgrading of the approach of parquet date correctness status detection

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651436#comment-15651436 ] ASF GitHub Bot commented on DRILL-4980: --- Github user paul-rogers commented on a diff

[jira] [Commented] (DRILL-4980) Upgrading of the approach of parquet date correctness status detection

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651437#comment-15651437 ] ASF GitHub Bot commented on DRILL-4980: --- Github user paul-rogers commented on a diff

[jira] [Commented] (DRILL-4980) Upgrading of the approach of parquet date correctness status detection

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651439#comment-15651439 ] ASF GitHub Bot commented on DRILL-4980: --- Github user paul-rogers commented on a diff

[jira] [Commented] (DRILL-4980) Upgrading of the approach of parquet date correctness status detection

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651438#comment-15651438 ] ASF GitHub Bot commented on DRILL-4980: --- Github user paul-rogers commented on a diff

[jira] [Commented] (DRILL-4980) Upgrading of the approach of parquet date correctness status detection

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651435#comment-15651435 ] ASF GitHub Bot commented on DRILL-4980: --- Github user paul-rogers commented on a diff

[jira] [Commented] (DRILL-5007) Dynamic UDF lazy-init does not work correctly in multi-node cluster

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651410#comment-15651410 ] ASF GitHub Bot commented on DRILL-5007: --- Github user parthchandra commented on the i

[jira] [Assigned] (DRILL-5007) Dynamic UDF lazy-init does not work correctly in multi-node cluster

2016-11-09 Thread Zelaine Fong (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zelaine Fong reassigned DRILL-5007: --- Assignee: Paul Rogers (was: Arina Ielchiieva) Assigning to [~Paul.Rogers] for review. > Dyn

[jira] [Commented] (DRILL-5010) Equality join condition is treated as a MergeJoin and not as a HashJoin.

2016-11-09 Thread Zelaine Fong (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651264#comment-15651264 ] Zelaine Fong commented on DRILL-5010: - When you're projecting all columns, the cost of

[jira] [Commented] (DRILL-4842) SELECT * on JSON data results in NumberFormatException

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15651099#comment-15651099 ] ASF GitHub Bot commented on DRILL-4842: --- Github user Serhii-Harnyk commented on the

[jira] [Commented] (DRILL-5007) Dynamic UDF lazy-init does not work correctly in multi-node cluster

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-5007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15650837#comment-15650837 ] ASF GitHub Bot commented on DRILL-5007: --- GitHub user arina-ielchiieva opened a pull

[jira] [Commented] (DRILL-4979) Make dataport configurable

2016-11-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/DRILL-4979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15650751#comment-15650751 ] ASF GitHub Bot commented on DRILL-4979: --- GitHub user xhochy opened a pull request: