[jira] [Commented] (IMPALA-13256) SQLDataException in external JDBC table query request

2024-07-26 Thread Pranav Yogi Lodha (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-13256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17869072#comment-17869072
 ] 

Pranav Yogi Lodha commented on IMPALA-13256:


[~zhangyifan27] can you please give some references for me to reproduce this 
locally, what database is this table a part of or how did you create it? Thanks!

> SQLDataException in external JDBC table query request
> -
>
> Key: IMPALA-13256
> URL: https://issues.apache.org/jira/browse/IMPALA-13256
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: YifanZhang
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Logs from impala coordinator node:
> {code}
> I0726 21:15:51.142773 3202116 GenericJdbcDatabaseAccessor.java:86] 
> b34bb1ec996a2df9:a00f4892] Query to execute is [SELECT COUNT(*) FROM 
> (SELECT * FROM test_table) tmptable]
> E0726 21:15:51.287364 3202116 GenericJdbcDatabaseAccessor.java:101] 
> b34bb1ec996a2df9:a00f4892] Caught exception while trying to get the 
> number of records
> Java exception follows:
> java.sql.SQLDataException: Value '3379400792' is outside of valid range for 
> type java.lang.Integer
>   at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:114)
>   at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:97)
>   at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:89)
>   at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:63)
>   at 
> com.mysql.cj.jdbc.exceptions.SQLError.createSQLException(SQLError.java:73)
>   at 
> com.mysql.cj.jdbc.exceptions.SQLExceptionsMapping.translateException(SQLExceptionsMapping.java:92)
>   at 
> com.mysql.cj.jdbc.result.ResultSetImpl.getObject(ResultSetImpl.java:1431)
>   at com.mysql.cj.jdbc.result.ResultSetImpl.getInt(ResultSetImpl.java:830)
>   at 
> org.apache.commons.dbcp2.DelegatingResultSet.getInt(DelegatingResultSet.java:623)
>   at 
> org.apache.commons.dbcp2.DelegatingResultSet.getInt(DelegatingResultSet.java:623)
>   at 
> org.apache.impala.extdatasource.jdbc.dao.GenericJdbcDatabaseAccessor.getTotalNumberOfRecords(GenericJdbcDatabaseAccessor.java:92)
>   at 
> org.apache.impala.extdatasource.jdbc.JdbcDataSource.buildQueryAndExecute(JdbcDataSource.java:335)
>   at 
> org.apache.impala.extdatasource.jdbc.JdbcDataSource.open(JdbcDataSource.java:157)
>   at 
> org.apache.impala.extdatasource.ExternalDataSourceExecutor.open(ExternalDataSourceExecutor.java:274)
>   at 
> org.apache.impala.extdatasource.ExternalDataSourceExecutor.open(ExternalDataSourceExecutor.java:217)
> Caused by: com.mysql.cj.exceptions.NumberOutOfRange: Value '3379400792' is 
> outside of valid range for type java.lang.Integer
>   at 
> com.mysql.cj.result.IntegerValueFactory.createFromLong(IntegerValueFactory.java:62)
>   at 
> com.mysql.cj.result.IntegerValueFactory.createFromLong(IntegerValueFactory.java:44)
>   at 
> com.mysql.cj.protocol.a.MysqlTextValueDecoder.decodeInt8(MysqlTextValueDecoder.java:126)
>   at 
> com.mysql.cj.protocol.result.AbstractResultsetRow.decodeAndCreateReturnValue(AbstractResultsetRow.java:113)
>   at 
> com.mysql.cj.protocol.result.AbstractResultsetRow.getValueFromBytes(AbstractResultsetRow.java:243)
>   at 
> com.mysql.cj.protocol.a.result.ByteArrayRow.getValue(ByteArrayRow.java:91)
>   at 
> com.mysql.cj.jdbc.result.ResultSetImpl.getObject(ResultSetImpl.java:1324)
>   ... 8 more {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-13039) AES Encryption/ Decryption Support in Impala

2024-04-24 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-13039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-13039:
---
Description: To include the 

> AES Encryption/ Decryption Support in Impala
> 
>
> Key: IMPALA-13039
> URL: https://issues.apache.org/jira/browse/IMPALA-13039
> Project: IMPALA
>  Issue Type: Epic
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> To include the 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11499) Refactor UrlEncode function to handle special characters

2024-03-11 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11499?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11499:
---
Summary: Refactor UrlEncode function to handle special characters  (was: 
Partition values with unicode chars are not handled correctly)

> Refactor UrlEncode function to handle special characters
> 
>
> Key: IMPALA-11499
> URL: https://issues.apache.org/jira/browse/IMPALA-11499
> Project: IMPALA
>  Issue Type: Bug
>  Components: Backend
>Reporter: Quanlong Huang
>Assignee: Pranav Yogi Lodha
>Priority: Critical
>
> Partition values are incorrectly URL-encoded in backend for unicode 
> characters, e.g. '运营业务数据' is encoded to '�%FFBF�营业务数据' which is wrong.
> To reproduce the issue, first create a partition table:
> {code:sql}
> create table my_part_tbl (id int) partitioned by (p string) stored as parquet;
> {code}
> Then insert data into it using partition values containing '运'. They will 
> fail:
> {noformat}
> [localhost:21050] default> insert into my_part_tbl partition(p='运营业务数据') 
> values (0);
> Query: insert into my_part_tbl partition(p='运营业务数据') values (0)
> Query submitted at: 2022-08-16 10:03:56 (Coordinator: 
> http://quanlong-OptiPlex-BJ:25000)
> Query progress can be monitored at: 
> http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=404ac3027c4b7169:39d16a2d
> ERROR: Error(s) moving partition files. First error (of 1) was: Hdfs op 
> (RENAME 
> hdfs://localhost:20500/test-warehouse/my_part_tbl/_impala_insert_staging/404ac3027c4b7169_39d16a2d/.404ac3027c4b7169-39d16a2d_1475855322_dir/p=�%FFBF�营业务数据/404ac3027c4b7169-39d16a2d_1585092794_data.0.parq
>  TO 
> hdfs://localhost:20500/test-warehouse/my_part_tbl/p=�%FFBF�营业务数据/404ac3027c4b7169-39d16a2d_1585092794_data.0.parq)
>  failed, error was: 
> hdfs://localhost:20500/test-warehouse/my_part_tbl/_impala_insert_staging/404ac3027c4b7169_39d16a2d/.404ac3027c4b7169-39d16a2d_1475855322_dir/p=�%FFBF�营业务数据/404ac3027c4b7169-39d16a2d_1585092794_data.0.parq
> Error(5): Input/output error
> [localhost:21050] default> insert into my_part_tbl partition(p='运') values 
> (0);
> Query: insert into my_part_tbl partition(p='运') values (0)
> Query submitted at: 2022-08-16 10:04:22 (Coordinator: 
> http://quanlong-OptiPlex-BJ:25000)
> Query progress can be monitored at: 
> http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=a64e5883473ec28d:86e7e335
> ERROR: Error(s) moving partition files. First error (of 1) was: Hdfs op 
> (RENAME 
> hdfs://localhost:20500/test-warehouse/my_part_tbl/_impala_insert_staging/a64e5883473ec28d_86e7e335/.a64e5883473ec28d-86e7e335_1582623091_dir/p=�%FFBF�/a64e5883473ec28d-86e7e335_163454510_data.0.parq
>  TO 
> hdfs://localhost:20500/test-warehouse/my_part_tbl/p=�%FFBF�/a64e5883473ec28d-86e7e335_163454510_data.0.parq)
>  failed, error was: 
> hdfs://localhost:20500/test-warehouse/my_part_tbl/_impala_insert_staging/a64e5883473ec28d_86e7e335/.a64e5883473ec28d-86e7e335_1582623091_dir/p=�%FFBF�/a64e5883473ec28d-86e7e335_163454510_data.0.parq
> Error(5): Input/output error
> {noformat}
> However, partition value without the character '运' is OK:
> {noformat}
> [localhost:21050] default> insert into my_part_tbl partition(p='营业务数据') 
> values (0);
> Query: insert into my_part_tbl partition(p='营业务数据') values (0)
> Query submitted at: 2022-08-16 10:04:13 (Coordinator: 
> http://quanlong-OptiPlex-BJ:25000)
> Query progress can be monitored at: 
> http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=b04894bfcfc3836a:b1ac9036
> Modified 1 row(s) in 0.21s
> {noformat}
> Hive is able to execute all these statements.
> I'm able to narrow down the issue into Backend, where we URL-encode the 
> partition value in HdfsTableSink::InitOutputPartition():
> {code:cpp}
>   string value_str;
>   partition_key_expr_evals_[j]->PrintValue(value, &value_str);
>   // Directory names containing partition-key values need to be 
> UrlEncoded, in
>   // particular to avoid problems when '/' is part of the key value 
> (which might
>   // occur, for example, with date strings). Hive will URL decode the 
> value
>   // transparently when Impala's frontend asks the metastore for 
> partition key values,
>   // which makes it particularly important that we use the same encoding 
> as Hive. It's
>   // also not necessary to encode the values when writing partition 
> metadata. You can
>   // check this with 'show partitions ' in Hive, followed by a 
> select from a
>   // decoded partition key value.
>   string encoded_str;
>   UrlEncode(value_str, &encoded_str, true);
> string part_key_value = (encoded_st

[jira] [Commented] (IMPALA-12880) regr_intercept() might have inconsistant results on TIMESTAMP values

2024-03-07 Thread Pranav Yogi Lodha (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-12880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17824516#comment-17824516
 ] 

Pranav Yogi Lodha commented on IMPALA-12880:


Yeah, closed it.

> regr_intercept() might have inconsistant results on TIMESTAMP values
> 
>
> Key: IMPALA-12880
> URL: https://issues.apache.org/jira/browse/IMPALA-12880
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Found a failure in impala-cdwh-2023.0.16.0-exhaustive-release and 
> impala-cdpd-master-exhaustive-release with below error messages:
> *Test*
> query_test.test_aggregation.TestAggregationQueries.test_aggregation[protocol: 
> beeswax | exec_option:
> {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 
> 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
> 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0}
> | table_format: seq/def/block]
> *Error Message*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation 
> self.run_test_case('QueryTest/aggregation', vector) 
> common/impala_test_suite.py:815: in run_test_case 
> self.__verify_results_and_errors(vector, test_section, result, use_db) 
> common/impala_test_suite.py:625: in __verify_results_and_errors 
> replace_filenames_with_placeholder) common/test_result_verifier.py:503: in 
> verify_raw_results VERIFIER_MAP[verifier](expected, actual) 
> common/test_result_verifier.py:312: in verify_query_result_is_equal assert 
> expected_results == actual_results E assert Comparing QueryTestResults 
> (expected vs actual): E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 
> 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> *Stacktrace*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation
> self.run_test_case('QueryTest/aggregation', vector)
> common/impala_test_suite.py:815: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:625: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:503: in verify_raw_results
> VERIFIER_MAP[verifier](expected, actual)
> common/test_result_verifier.py:312: in verify_query_result_is_equal
> assert expected_results == actual_results
> E assert Comparing QueryTestResults (expected vs actual):
> E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> h4.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Closed] (IMPALA-12880) regr_intercept() might have inconsistant results on TIMESTAMP values

2024-03-07 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha closed IMPALA-12880.
--
Resolution: Fixed

> regr_intercept() might have inconsistant results on TIMESTAMP values
> 
>
> Key: IMPALA-12880
> URL: https://issues.apache.org/jira/browse/IMPALA-12880
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Found a failure in impala-cdwh-2023.0.16.0-exhaustive-release and 
> impala-cdpd-master-exhaustive-release with below error messages:
> *Test*
> query_test.test_aggregation.TestAggregationQueries.test_aggregation[protocol: 
> beeswax | exec_option:
> {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 
> 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
> 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0}
> | table_format: seq/def/block]
> *Error Message*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation 
> self.run_test_case('QueryTest/aggregation', vector) 
> common/impala_test_suite.py:815: in run_test_case 
> self.__verify_results_and_errors(vector, test_section, result, use_db) 
> common/impala_test_suite.py:625: in __verify_results_and_errors 
> replace_filenames_with_placeholder) common/test_result_verifier.py:503: in 
> verify_raw_results VERIFIER_MAP[verifier](expected, actual) 
> common/test_result_verifier.py:312: in verify_query_result_is_equal assert 
> expected_results == actual_results E assert Comparing QueryTestResults 
> (expected vs actual): E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 
> 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> *Stacktrace*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation
> self.run_test_case('QueryTest/aggregation', vector)
> common/impala_test_suite.py:815: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:625: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:503: in verify_raw_results
> VERIFIER_MAP[verifier](expected, actual)
> common/test_result_verifier.py:312: in verify_query_result_is_equal
> assert expected_results == actual_results
> E assert Comparing QueryTestResults (expected vs actual):
> E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> h4.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12880) regr_intercept() might have inconsistant results on TIMESTAMP values

2024-03-07 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12880:
---
Summary: regr_intercept() might have inconsistant results on TIMESTAMP 
values  (was: IMPALA-12798 regr_intercept() might have inconsistant results on 
TIMESTAMP values)

> regr_intercept() might have inconsistant results on TIMESTAMP values
> 
>
> Key: IMPALA-12880
> URL: https://issues.apache.org/jira/browse/IMPALA-12880
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Found a failure in impala-cdwh-2023.0.16.0-exhaustive-release and 
> impala-cdpd-master-exhaustive-release with below error messages:
> *Test*
> query_test.test_aggregation.TestAggregationQueries.test_aggregation[protocol: 
> beeswax | exec_option:
> {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 
> 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
> 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0}
> | table_format: seq/def/block]
> *Error Message*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation 
> self.run_test_case('QueryTest/aggregation', vector) 
> common/impala_test_suite.py:815: in run_test_case 
> self.__verify_results_and_errors(vector, test_section, result, use_db) 
> common/impala_test_suite.py:625: in __verify_results_and_errors 
> replace_filenames_with_placeholder) common/test_result_verifier.py:503: in 
> verify_raw_results VERIFIER_MAP[verifier](expected, actual) 
> common/test_result_verifier.py:312: in verify_query_result_is_equal assert 
> expected_results == actual_results E assert Comparing QueryTestResults 
> (expected vs actual): E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 
> 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> *Stacktrace*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation
> self.run_test_case('QueryTest/aggregation', vector)
> common/impala_test_suite.py:815: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:625: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:503: in verify_raw_results
> VERIFIER_MAP[verifier](expected, actual)
> common/test_result_verifier.py:312: in verify_query_result_is_equal
> assert expected_results == actual_results
> E assert Comparing QueryTestResults (expected vs actual):
> E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> h4.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-12880) IMPALA-12798 regr_intercept() might have inconsistant results on TIMESTAMP values

2024-03-07 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-12880:
--

Assignee: Pranav Yogi Lodha

> IMPALA-12798 regr_intercept() might have inconsistant results on TIMESTAMP 
> values
> -
>
> Key: IMPALA-12880
> URL: https://issues.apache.org/jira/browse/IMPALA-12880
> Project: IMPALA
>  Issue Type: Bug
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Found a failure in impala-cdwh-2023.0.16.0-exhaustive-release and 
> impala-cdpd-master-exhaustive-release with below error messages:
> *Test*
> query_test.test_aggregation.TestAggregationQueries.test_aggregation[protocol: 
> beeswax | exec_option:
> {'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 
> 'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
> 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0}
> | table_format: seq/def/block]
> *Error Message*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation 
> self.run_test_case('QueryTest/aggregation', vector) 
> common/impala_test_suite.py:815: in run_test_case 
> self.__verify_results_and_errors(vector, test_section, result, use_db) 
> common/impala_test_suite.py:625: in __verify_results_and_errors 
> replace_filenames_with_placeholder) common/test_result_verifier.py:503: in 
> verify_raw_results VERIFIER_MAP[verifier](expected, actual) 
> common/test_result_verifier.py:312: in verify_query_result_is_equal assert 
> expected_results == actual_results E assert Comparing QueryTestResults 
> (expected vs actual): E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 
> 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> *Stacktrace*
> {code:java}
> query_test/test_aggregation.py:203: in test_aggregation
> self.run_test_case('QueryTest/aggregation', vector)
> common/impala_test_suite.py:815: in run_test_case
> self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:625: in __verify_results_and_errors
> replace_filenames_with_placeholder)
> common/test_result_verifier.py:503: in verify_raw_results
> VERIFIER_MAP[verifier](expected, actual)
> common/test_result_verifier.py:312: in verify_query_result_is_equal
> assert expected_results == actual_results
> E assert Comparing QueryTestResults (expected vs actual):
> E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 0,0,0,0,0,0,2.384185791015625e-07
> {code}
> h4.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12880) IMPALA-12798 regr_intercept() might have inconsistant results on TIMESTAMP values

2024-03-07 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12880:
--

 Summary: IMPALA-12798 regr_intercept() might have inconsistant 
results on TIMESTAMP values
 Key: IMPALA-12880
 URL: https://issues.apache.org/jira/browse/IMPALA-12880
 Project: IMPALA
  Issue Type: Bug
Reporter: Pranav Yogi Lodha


Found a failure in impala-cdwh-2023.0.16.0-exhaustive-release and 
impala-cdpd-master-exhaustive-release with below error messages:
*Test*
query_test.test_aggregation.TestAggregationQueries.test_aggregation[protocol: 
beeswax | exec_option:
{'test_replan': 1, 'batch_size': 0, 'num_nodes': 0, 
'disable_codegen_rows_threshold': 0, 'disable_codegen': False, 
'abort_on_error': 1, 'exec_single_node_rows_threshold': 0}
| table_format: seq/def/block]
*Error Message*
{code:java}
query_test/test_aggregation.py:203: in test_aggregation 
self.run_test_case('QueryTest/aggregation', vector) 
common/impala_test_suite.py:815: in run_test_case 
self.__verify_results_and_errors(vector, test_section, result, use_db) 
common/impala_test_suite.py:625: in __verify_results_and_errors 
replace_filenames_with_placeholder) common/test_result_verifier.py:503: in 
verify_raw_results VERIFIER_MAP[verifier](expected, actual) 
common/test_result_verifier.py:312: in verify_query_result_is_equal assert 
expected_results == actual_results E assert Comparing QueryTestResults 
(expected vs actual): E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 
0,0,0,0,0,0,2.384185791015625e-07
{code}
*Stacktrace*
{code:java}
query_test/test_aggregation.py:203: in test_aggregation
self.run_test_case('QueryTest/aggregation', vector)
common/impala_test_suite.py:815: in run_test_case
self.__verify_results_and_errors(vector, test_section, result, use_db)
common/impala_test_suite.py:625: in __verify_results_and_errors
replace_filenames_with_placeholder)
common/test_result_verifier.py:503: in verify_raw_results
VERIFIER_MAP[verifier](expected, actual)
common/test_result_verifier.py:312: in verify_query_result_is_equal
assert expected_results == actual_results
E assert Comparing QueryTestResults (expected vs actual):
E 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0 != 0,0,0,0,0,0,2.384185791015625e-07
{code}
h4.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12794) Average Independent Variable Calculator for Linear Regression: regr_avgx() support in Impala

2024-02-07 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12794:
---
Description: regr_avgx(y, x) computes the average of the independent 
variable (x) after eliminating pairs containing NULL.

> Average Independent Variable Calculator for Linear Regression: regr_avgx() 
> support in Impala
> 
>
> Key: IMPALA-12794
> URL: https://issues.apache.org/jira/browse/IMPALA-12794
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> regr_avgx(y, x) computes the average of the independent variable (x) after 
> eliminating pairs containing NULL.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12794) Average Independent Variable Calculator for Linear Regression: regr_avgx() support in Impala

2024-02-07 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12794:
---
Description: regr_avgx(y, x) computes the average of the independent 
variable  after eliminating pairs containing NULL.  (was: regr_avgx(y, x) 
computes the average of the independent variable (x) after eliminating pairs 
containing NULL.)

> Average Independent Variable Calculator for Linear Regression: regr_avgx() 
> support in Impala
> 
>
> Key: IMPALA-12794
> URL: https://issues.apache.org/jira/browse/IMPALA-12794
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> regr_avgx(y, x) computes the average of the independent variable  after 
> eliminating pairs containing NULL.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12794) Average Independent Variable Calculator for Linear Regression: regr_avgx() support in Impala

2024-02-07 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12794:
--

 Summary: Average Independent Variable Calculator for Linear 
Regression: regr_avgx() support in Impala
 Key: IMPALA-12794
 URL: https://issues.apache.org/jira/browse/IMPALA-12794
 Project: IMPALA
  Issue Type: New Feature
Reporter: Pranav Yogi Lodha






--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-12794) Average Independent Variable Calculator for Linear Regression: regr_avgx() support in Impala

2024-02-07 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12794?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-12794:
--

Assignee: Pranav Yogi Lodha

> Average Independent Variable Calculator for Linear Regression: regr_avgx() 
> support in Impala
> 
>
> Key: IMPALA-12794
> URL: https://issues.apache.org/jira/browse/IMPALA-12794
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12772) [DOCS] Update the documentation of identifiers

2024-01-30 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12772:
---
Summary: [DOCS] Update the documentation of identifiers  (was: Update the 
documentation of identifiers)

> [DOCS] Update the documentation of identifiers
> --
>
> Key: IMPALA-12772
> URL: https://issues.apache.org/jira/browse/IMPALA-12772
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Pranav Yogi Lodha
>Priority: Major
>
> Documentation of identifiers seems to be a bit out-dated or less accurate, 
> this Jira focusses on investigating and identifying all such discrepencies 
> and updating the same.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-12772) [DOCS] Update the documentation of identifiers

2024-01-30 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-12772:
--

Assignee: Pranav Yogi Lodha

> [DOCS] Update the documentation of identifiers
> --
>
> Key: IMPALA-12772
> URL: https://issues.apache.org/jira/browse/IMPALA-12772
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Documentation of identifiers seems to be a bit out-dated or less accurate, 
> this Jira focusses on investigating and identifying all such discrepencies 
> and updating the same.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12772) Update the documentation of identifiers

2024-01-30 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12772:
--

 Summary: Update the documentation of identifiers
 Key: IMPALA-12772
 URL: https://issues.apache.org/jira/browse/IMPALA-12772
 Project: IMPALA
  Issue Type: Documentation
Reporter: Pranav Yogi Lodha


Documentation of identifiers seems to be a bit out-dated or less accurate, this 
Jira focusses on investigating and identifying all such discrepencies and 
updating the same.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12756) [DOCS] Unicode column name support documentation

2024-01-24 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12756:
---
Description: The patch focuses on documenting that Impala  supports unicode 
column names, consistent with Hive's current support (as we use Hive MetaStore 
to store table metadata).  (was: To document Impala currently  support for 
unicode column names, consistent with Hive's current support (as we use Hive 
MetaStore to store table metadata).)

> [DOCS] Unicode column name support documentation
> 
>
> Key: IMPALA-12756
> URL: https://issues.apache.org/jira/browse/IMPALA-12756
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> The patch focuses on documenting that Impala  supports unicode column names, 
> consistent with Hive's current support (as we use Hive MetaStore to store 
> table metadata).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12756) [DOCS] Unicode column name support documentation

2024-01-24 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12756:
---
Description: To document Impala currently  support for unicode column 
names, consistent with Hive's current support (as we use Hive MetaStore to 
store table metadata).

> [DOCS] Unicode column name support documentation
> 
>
> Key: IMPALA-12756
> URL: https://issues.apache.org/jira/browse/IMPALA-12756
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> To document Impala currently  support for unicode column names, consistent 
> with Hive's current support (as we use Hive MetaStore to store table 
> metadata).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-12756) [DOCS] Unicode column name support documentation

2024-01-24 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-12756:
--

Assignee: Pranav Yogi Lodha

> [DOCS] Unicode column name support documentation
> 
>
> Key: IMPALA-12756
> URL: https://issues.apache.org/jira/browse/IMPALA-12756
> Project: IMPALA
>  Issue Type: Documentation
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12756) [DOCS] Unicode column name support documentation

2024-01-24 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12756:
--

 Summary: [DOCS] Unicode column name support documentation
 Key: IMPALA-12756
 URL: https://issues.apache.org/jira/browse/IMPALA-12756
 Project: IMPALA
  Issue Type: Documentation
Reporter: Pranav Yogi Lodha






--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-12744) Support for regr_count() aggregate function

2024-01-23 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-12744:
--

Assignee: Pranav Yogi Lodha

> Support for regr_count() aggregate function
> ---
>
> Key: IMPALA-12744
> URL: https://issues.apache.org/jira/browse/IMPALA-12744
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> regr_count() function can be used both as aggregate and analytic functions 
> and are commonly used in regression analysis.
> regr_count(y, x) returns an integer that is the number of non-null number 
> pairs. It indicates how many observations are included in the analysis.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12744) Support for regr_count() aggregate function

2024-01-22 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12744:
--

 Summary: Support for regr_count() aggregate function
 Key: IMPALA-12744
 URL: https://issues.apache.org/jira/browse/IMPALA-12744
 Project: IMPALA
  Issue Type: New Feature
Reporter: Pranav Yogi Lodha


regr_count() function can be used both as aggregate and analytic functions and 
are commonly used in regression analysis.

regr_count(y, x) returns an integer that is the number of non-null number 
pairs. It indicates how many observations are included in the analysis.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-12465) Support unicode column names

2024-01-15 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha resolved IMPALA-12465.

Resolution: Delivered

> Support unicode column names
> 
>
> Key: IMPALA-12465
> URL: https://issues.apache.org/jira/browse/IMPALA-12465
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Docs, Frontend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Impala currently supports only ASCII characters in column names. Update it to 
> support unicode column names, consistent with Hive's current support (as we 
> use Hive MetaStore to store table metadata).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work started] (IMPALA-12465) Support unicode column names

2024-01-15 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-12465 started by Pranav Yogi Lodha.
--
> Support unicode column names
> 
>
> Key: IMPALA-12465
> URL: https://issues.apache.org/jira/browse/IMPALA-12465
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Docs, Frontend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Impala currently supports only ASCII characters in column names. Update it to 
> support unicode column names, consistent with Hive's current support (as we 
> use Hive MetaStore to store table metadata).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Work stopped] (IMPALA-12465) Support unicode column names

2024-01-15 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on IMPALA-12465 stopped by Pranav Yogi Lodha.
--
> Support unicode column names
> 
>
> Key: IMPALA-12465
> URL: https://issues.apache.org/jira/browse/IMPALA-12465
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Docs, Frontend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Impala currently supports only ASCII characters in column names. Update it to 
> support unicode column names, consistent with Hive's current support (as we 
> use Hive MetaStore to store table metadata).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12465) Basic unicode column name bug fix

2023-09-24 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12465:
--

 Summary: Basic unicode column name bug fix
 Key: IMPALA-12465
 URL: https://issues.apache.org/jira/browse/IMPALA-12465
 Project: IMPALA
  Issue Type: New Feature
Reporter: Pranav Yogi Lodha


The jira mainly deals with the limited column name unicode support in impala.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-12418) Include crypto functions supported by Hive

2023-09-04 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-12418:
--

Assignee: Pranav Yogi Lodha

> Include crypto functions supported by Hive
> --
>
> Key: IMPALA-12418
> URL: https://issues.apache.org/jira/browse/IMPALA-12418
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> To be in line with Hive, Impala should support:
> aes_encrypt and aes_decrypt
> See: 
> [https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-Misc.Functions]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12418) Include crypto functions supported by Hive

2023-09-04 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12418:
--

 Summary: Include crypto functions supported by Hive
 Key: IMPALA-12418
 URL: https://issues.apache.org/jira/browse/IMPALA-12418
 Project: IMPALA
  Issue Type: New Feature
Reporter: Pranav Yogi Lodha


To be in line with Hive, Impala should support:

aes_encrypt and aes_decrypt

See: 
[https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-Misc.Functions]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12361) Implementation of regr_count(), regr_avgx() and regr_avgy()

2023-08-11 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12361:
---
Description: 
regr_count() returns an integer that is the number of non-null number pairs 
used to fit the regression line.
regr_avgx() evaluates the average of the independent variable(x) of the 
regression line. It computes avg(expr2) after the elimination of null (expr1, 
expr2).
regr_avgy() evaluates the average of the dependent variable(y) of the 
regression line. It computes avg(expr1) after the elimination of null (expr1, 
expr2).

  was:
regr_count() returns an integer that is the number of non-null number pairs 
used to fit the regression line.
regr_avgx() evaluates the average of the independent variable(x) of the 
regression line. It computes avg(expr2) after the elimination of null (expr1, 
expr2).
regr_avgy() evaluates the average of the dependent variable(y) of the 
regression line. It computes avg(expr1) after the elimination of null (expr1, 
expr2).


> Implementation of regr_count(), regr_avgx() and regr_avgy()
> ---
>
> Key: IMPALA-12361
> URL: https://issues.apache.org/jira/browse/IMPALA-12361
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Priority: Major
>
> regr_count() returns an integer that is the number of non-null number pairs 
> used to fit the regression line.
> regr_avgx() evaluates the average of the independent variable(x) of the 
> regression line. It computes avg(expr2) after the elimination of null (expr1, 
> expr2).
> regr_avgy() evaluates the average of the dependent variable(y) of the 
> regression line. It computes avg(expr1) after the elimination of null (expr1, 
> expr2).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12361) Implementation of regr_count(), regr_avgx() and regr_avgy()

2023-08-11 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12361:
---
Description: 
regr_count() returns an integer that is the number of non-null number pairs 
used to fit the regression line.
regr_avgx() evaluates the average of the independent variable of the regression 
line. It computes avg(expr2) after the elimination of null (expr1, expr2).
regr_avgy() evaluates the average of the dependent variable of the regression 
line. It computes avg(expr1) after the elimination of null (expr1, expr2).

  was:
regr_count() returns an integer that is the number of non-null number pairs 
used to fit the regression line.
regr_avgx() evaluates the average of the independent variable(x) of the 
regression line. It computes avg(expr2) after the elimination of null (expr1, 
expr2).
regr_avgy() evaluates the average of the dependent variable(y) of the 
regression line. It computes avg(expr1) after the elimination of null (expr1, 
expr2).


> Implementation of regr_count(), regr_avgx() and regr_avgy()
> ---
>
> Key: IMPALA-12361
> URL: https://issues.apache.org/jira/browse/IMPALA-12361
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Priority: Major
>
> regr_count() returns an integer that is the number of non-null number pairs 
> used to fit the regression line.
> regr_avgx() evaluates the average of the independent variable of the 
> regression line. It computes avg(expr2) after the elimination of null (expr1, 
> expr2).
> regr_avgy() evaluates the average of the dependent variable of the regression 
> line. It computes avg(expr1) after the elimination of null (expr1, expr2).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12361) Implementation of regr_count(), regr_avgx() and regr_avgy()

2023-08-11 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12361:
---
Description: 
regr_count() returns an integer that is the number of non-null number pairs 
used to fit the regression line.
regr_avgx() evaluates the average of the independent variable(x) of the 
regression line. It computes avg(expr2) after the elimination of null (expr1, 
expr2).
regr_avgy() evaluates the average of the dependent variable(y) of the 
regression line. It computes avg(expr1) after the elimination of null (expr1, 
expr2).

> Implementation of regr_count(), regr_avgx() and regr_avgy()
> ---
>
> Key: IMPALA-12361
> URL: https://issues.apache.org/jira/browse/IMPALA-12361
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Priority: Major
>
> regr_count() returns an integer that is the number of non-null number pairs 
> used to fit the regression line.
> regr_avgx() evaluates the average of the independent variable(x) of the 
> regression line. It computes avg(expr2) after the elimination of null (expr1, 
> expr2).
> regr_avgy() evaluates the average of the dependent variable(y) of the 
> regression line. It computes avg(expr1) after the elimination of null (expr1, 
> expr2).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-12361) Implementation of regr_count(), regr_avgx() and regr_avgy()

2023-08-11 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-12361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-12361:
---
Summary: Implementation of regr_count(), regr_avgx() and regr_avgy()  (was: 
Implementation of regr_count(), regr_avgx(), regr_avgy())

> Implementation of regr_count(), regr_avgx() and regr_avgy()
> ---
>
> Key: IMPALA-12361
> URL: https://issues.apache.org/jira/browse/IMPALA-12361
> Project: IMPALA
>  Issue Type: New Feature
>Reporter: Pranav Yogi Lodha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-12361) Implementation of regr_count(), regr_avgx(), regr_avgy()

2023-08-11 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-12361:
--

 Summary: Implementation of regr_count(), regr_avgx(), regr_avgy()
 Key: IMPALA-12361
 URL: https://issues.apache.org/jira/browse/IMPALA-12361
 Project: IMPALA
  Issue Type: New Feature
Reporter: Pranav Yogi Lodha






--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10798) Prototype a simple JSON File reader

2023-03-22 Thread Pranav Yogi Lodha (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17703798#comment-17703798
 ] 

Pranav Yogi Lodha commented on IMPALA-10798:


Sure, can you drop me your email id or some other way to connect so that I can 
share those?

> Prototype a simple JSON File reader
> ---
>
> Key: IMPALA-10798
> URL: https://issues.apache.org/jira/browse/IMPALA-10798
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Shikha Asrani
>Assignee: Ye Zihao
>Priority: Major
>
> This prototype involves ,
>  * Implementing front-end support for 'Select' from a table "stored as 
> JSONFILE".
>  * A JSON file scanner, using Arrow library to read the JSON file with 
> primitive data types and is expandable for further complex types and 
> optimizations.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Comment Edited] (IMPALA-10798) Prototype a simple JSON File reader

2023-03-21 Thread Pranav Yogi Lodha (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17703132#comment-17703132
 ] 

Pranav Yogi Lodha edited comment on IMPALA-10798 at 3/21/23 10:23 AM:
--

Hi [~Eyizoha], there were a bunch of goals that we'd thought of, which might be 
easier to convey through a sync up. Let me know your schedule and we can find a 
good time to connect and discuss the same.


was (Author: JIRAUSER284343):
Hi [~Eyizoha], there were a bunch of goals that we'd thought of which might be 
easier to convey through a sync up. Let me know your schedule and we can find a 
good time to connect and discuss the same.

> Prototype a simple JSON File reader
> ---
>
> Key: IMPALA-10798
> URL: https://issues.apache.org/jira/browse/IMPALA-10798
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Shikha Asrani
>Assignee: Ye Zihao
>Priority: Major
>
> This prototype involves ,
>  * Implementing front-end support for 'Select' from a table "stored as 
> JSONFILE".
>  * A JSON file scanner, using Arrow library to read the JSON file with 
> primitive data types and is expandable for further complex types and 
> optimizations.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10798) Prototype a simple JSON File reader

2023-03-21 Thread Pranav Yogi Lodha (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17703132#comment-17703132
 ] 

Pranav Yogi Lodha commented on IMPALA-10798:


Hi [~Eyizoha], there were a bunch of goals that we'd thought of which might be 
easier to convey through a sync up. Let me know your schedule and we can find a 
good time to connect and discuss the same.

> Prototype a simple JSON File reader
> ---
>
> Key: IMPALA-10798
> URL: https://issues.apache.org/jira/browse/IMPALA-10798
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Shikha Asrani
>Assignee: Ye Zihao
>Priority: Major
>
> This prototype involves ,
>  * Implementing front-end support for 'Select' from a table "stored as 
> JSONFILE".
>  * A JSON file scanner, using Arrow library to read the JSON file with 
> primitive data types and is expandable for further complex types and 
> optimizations.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11957) Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()

2023-02-28 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11957:
---
Description: 
The linear regression functions fit an ordinary-least-squares regression line
to a set of number pairs which can be used both as aggregate and analytic 
functions.
 * regr_slope() takes two arguments of numeric type and returns the slope of 
the line.
 * regr_intercept() takes two arguments of numeric type and returns the 
y-intercept of the regression line.
 * regr_r2() takes two arguments of numeric type and returns the coefficient of 
determination (also called R-squared or goodness of fit) for the regression.

  was:
The linear regression functions fit an ordinary-least-squares regression line
to a set of number pairs which can be used both as aggregate and analytic 
functions.


regr_slope() takes two arguments of numeric type and returns the slope of the 
line.
regr_intercept() takes two arguments of numeric type and returns the 
y-intercept of the regression line.
regr_r2() takes two arguments of numeric type and returns the coefficient of 
determination (also called R-squared or goodness of fit) for the regression.


> Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()
> -
>
> Key: IMPALA-11957
> URL: https://issues.apache.org/jira/browse/IMPALA-11957
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> The linear regression functions fit an ordinary-least-squares regression line
> to a set of number pairs which can be used both as aggregate and analytic 
> functions.
>  * regr_slope() takes two arguments of numeric type and returns the slope of 
> the line.
>  * regr_intercept() takes two arguments of numeric type and returns the 
> y-intercept of the regression line.
>  * regr_r2() takes two arguments of numeric type and returns the coefficient 
> of determination (also called R-squared or goodness of fit) for the 
> regression.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11957) Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()

2023-02-28 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11957:
---
Description: 
The linear regression functions fit an ordinary-least-squares regression line
to a set of number pairs which can be used both as aggregate and analytic 
functions.


regr_slope() takes two arguments of numeric type and returns the slope of the 
line.
regr_intercept() takes two arguments of numeric type and returns the 
y-intercept of the regression line.
regr_r2() takes two arguments of numeric type and returns the coefficient of 
determination (also called R-squared or goodness of fit) for the regression.

  was:
The linear regression functions fit an ordinary-least-squares regression line
to a set of number pairs which can be used both as aggregate and analytic 
functions.
regr_slope() returns the slope of the line.
regr_intercept() returns the y-intercept of the regression line.
regr_r2() takes two arguments of numeric type and returns the coefficient of 
determination (also called R-squared or goodness of fit) for the regression.


> Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()
> -
>
> Key: IMPALA-11957
> URL: https://issues.apache.org/jira/browse/IMPALA-11957
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> The linear regression functions fit an ordinary-least-squares regression line
> to a set of number pairs which can be used both as aggregate and analytic 
> functions.
> regr_slope() takes two arguments of numeric type and returns the slope of the 
> line.
> regr_intercept() takes two arguments of numeric type and returns the 
> y-intercept of the regression line.
> regr_r2() takes two arguments of numeric type and returns the coefficient of 
> determination (also called R-squared or goodness of fit) for the regression.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11957) Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()

2023-02-28 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11957:
---
Description: 
The linear regression functions fit an ordinary-least-squares regression line
to a set of number pairs which can be used both as aggregate and analytic 
functions.
regr_slope() returns the slope of the line.
regr_intercept() returns the y-intercept of the regression line.
regr_r2() takes two arguments of numeric type and returns the coefficient of 
determination (also called R-squared or goodness of fit) for the regression.

> Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()
> -
>
> Key: IMPALA-11957
> URL: https://issues.apache.org/jira/browse/IMPALA-11957
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> The linear regression functions fit an ordinary-least-squares regression line
> to a set of number pairs which can be used both as aggregate and analytic 
> functions.
> regr_slope() returns the slope of the line.
> regr_intercept() returns the y-intercept of the regression line.
> regr_r2() takes two arguments of numeric type and returns the coefficient of 
> determination (also called R-squared or goodness of fit) for the regression.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-11957) Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()

2023-02-28 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-11957:
--

 Summary: Implement Regression functions : regr_slope(), 
regr_intercept() and regr_r2()
 Key: IMPALA-11957
 URL: https://issues.apache.org/jira/browse/IMPALA-11957
 Project: IMPALA
  Issue Type: Sub-task
Reporter: Pranav Yogi Lodha






--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-11957) Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()

2023-02-28 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-11957:
--

Assignee: Pranav Yogi Lodha

> Implement Regression functions : regr_slope(), regr_intercept() and regr_r2()
> -
>
> Key: IMPALA-11957
> URL: https://issues.apache.org/jira/browse/IMPALA-11957
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()

2023-02-02 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha resolved IMPALA-11205.

Fix Version/s: Impala 4.2.0
   Resolution: Fixed

> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> --
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
> Fix For: Impala 4.2.0
>
>
> Implement 3 statistical functions:
>  * CORR()
>  * COVAR_SAMP()
>  * COVAR_POP()
> *CORR()* : A correlation coefficient function CORR() which calculates 
> Pearson's correlation coefficient between two columns. It should return a 
> DOUBLE value ranging from -1 to 1 describing the correlation between the two 
> numeric columns.
> *COVAR_SAMP()* : It returns the sample covariance between two columns of 
> numeric types.
> *COVAR_POP()* : It returns the population covariance between two columns of 
> numeric types.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Commented] (IMPALA-10798) Prototype a simple JSON File reader

2022-07-22 Thread Pranav Yogi Lodha (Jira)


[ 
https://issues.apache.org/jira/browse/IMPALA-10798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17570162#comment-17570162
 ] 

Pranav Yogi Lodha commented on IMPALA-10798:


Hi [~shikha.asrani], I wanted to know a few details about the patch, would you 
be free to connect over a quick call to discuss about the same?

> Prototype a simple JSON File reader
> ---
>
> Key: IMPALA-10798
> URL: https://issues.apache.org/jira/browse/IMPALA-10798
> Project: IMPALA
>  Issue Type: Sub-task
>Reporter: Shikha Asrani
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> This prototype involves ,
>  * Implementing front-end support for 'Select' from a table "stored as 
> JSONFILE".
>  * A JSON file scanner, using Arrow library to read the JSON file with 
> primitive data types and is expandable for further complex types and 
> optimizations.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-11299) Support window clause

2022-05-18 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-11299:
--

 Summary: Support window clause
 Key: IMPALA-11299
 URL: https://issues.apache.org/jira/browse/IMPALA-11299
 Project: IMPALA
  Issue Type: Task
Reporter: Pranav Yogi Lodha


While writing tests for a statistical function, I found out that we don't 
support *window clause* unlike hive. 

For a query with window clause : 

select sum(p_size) over w1 as s1, sum(p_size) over (distribute by p_mfgr sort 
by p_name rows between current row and current row) as s2 from tpch.part window 
w1 as (distribute by p_mfgr sort by p_name rows between 2 preceding and 2 
following) limit 5;

*Hive produces the following output :*

+--+-+
|  s1  | s2  |
+--+-+
| 40   | 4   |
| 82   | 16  |
| 122  | 20  |
| 153  | 42  |
| 178  | 40  |
+--+-+

*While impala gives a syntax error :*

[localhost:21050] default> select sum(p_size) over w1 as s1, sum(p_size) over 
(distribute by p_mfgr  sort by p_name rows between current row and current row) 
 as s2 from tpch.part window w1 as (distribute by p_mfgr  sort by p_name rows 
between 2 preceding and 2 following) limit 5;
Query: select sum(p_size) over w1 as s1, sum(p_size) over (distribute by p_mfgr 
 sort by p_name rows between current row and current row)  as s2 from tpch.part 
window w1 as (distribute by p_mfgr  sort by p_name rows between 2 preceding and 
2 following) limit 5
Query submitted at: 2022-05-18 14:23:17 (Coordinator: 
http://pranavlodha-Precision-5560:25000)
ERROR: ParseException: Syntax error in line 1:
select sum(p_size) over w1 as s1, sum(p_size) over (...
                        ^
Encountered: IDENTIFIER
Expected: (

CAUSED BY: Exception: Syntax error

[localhost:21050] default> 



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()

2022-05-10 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: 
Implement 3 statistical functions:
 * CORR()
 * COVAR_SAMP()
 * COVAR_POP()

*CORR()* : A correlation coefficient function CORR() which calculates Pearson's 
correlation coefficient between two columns. It should return a DOUBLE value 
ranging from -1 to 1 describing the correlation between the two numeric columns.

*COVAR_SAMP()* : It returns the sample covariance between two columns of 
numeric types.

*COVAR_POP()* : It returns the population covariance between two columns of 
numeric types.

  was:
Implement 3 statistical functions:
 * CORR()
 * COVAR_SAMP()
 * COVAR_POP()

*CORR()* : A correlation coefficient function CORR() which calculates Pearson's 
correlation coefficient between two columns. It should return a DOUBLE value 
ranging from -1 to 1 describing the correlation between the two numeric columns.

 

*COVAR_SAMP()* : It returns the sample covariance between two columns of 
numeric types.

 

*COVAR_POP()* : It returns the population covariance between two columns of 
numeric types.


> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> --
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement 3 statistical functions:
>  * CORR()
>  * COVAR_SAMP()
>  * COVAR_POP()
> *CORR()* : A correlation coefficient function CORR() which calculates 
> Pearson's correlation coefficient between two columns. It should return a 
> DOUBLE value ranging from -1 to 1 describing the correlation between the two 
> numeric columns.
> *COVAR_SAMP()* : It returns the sample covariance between two columns of 
> numeric types.
> *COVAR_POP()* : It returns the population covariance between two columns of 
> numeric types.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()

2022-05-10 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: 
Implement 3 statistical functions:
 * CORR()
 * COVAR_SAMP()
 * COVAR_POP()

*CORR()* : A correlation coefficient function CORR() which calculates Pearson's 
correlation coefficient between two columns. It should return a DOUBLE value 
ranging from -1 to 1 describing the correlation between the two numeric columns.

 

*COVAR_SAMP()* : It returns the sample covariance between two columns of 
numeric types.

 

*COVAR_POP()* : It returns the population covariance between two columns of 
numeric types.

  was:
Implement 3 statistical functions:
 * CORR()
 * COVAR_SAMP()
 * COVAR_POP()

*CORR()* : A correlation coefficient function CORR() which calculates Pearson's 
correlation coefficient between two columns. It should return a DOUBLE value 
ranging from 0 to 1 describing the correlation between the two numeric columns.

 

*COVAR_SAMP()* : It returns the sample covariance between two columns of 
numeric types.

 

*COVAR_POP()* : It returns the population covariance between two columns of 
numeric types.


> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> --
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement 3 statistical functions:
>  * CORR()
>  * COVAR_SAMP()
>  * COVAR_POP()
> *CORR()* : A correlation coefficient function CORR() which calculates 
> Pearson's correlation coefficient between two columns. It should return a 
> DOUBLE value ranging from -1 to 1 describing the correlation between the two 
> numeric columns.
>  
> *COVAR_SAMP()* : It returns the sample covariance between two columns of 
> numeric types.
>  
> *COVAR_POP()* : It returns the population covariance between two columns of 
> numeric types.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()

2022-05-10 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: 
Implement 3 statistical functions:
 * CORR()
 * COVAR_SAMP()
 * COVAR_POP()

*CORR()* : A correlation coefficient function CORR() which calculates Pearson's 
correlation coefficient between two columns. It should return a DOUBLE value 
ranging from 0 to 1 describing the correlation between the two numeric columns.

 

*COVAR_SAMP()* : It returns the sample covariance between two columns of 
numeric types.

 

*COVAR_POP()* : It returns the population covariance between two columns of 
numeric types.

  was:Implement a correlation coefficient function CORR() which will calculate 
Pearson's correlation coefficient between two columns. It should return a 
DOUBLE value ranging from 0 to 1 describing the correlation between the two 
numeric columns.


> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> --
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement 3 statistical functions:
>  * CORR()
>  * COVAR_SAMP()
>  * COVAR_POP()
> *CORR()* : A correlation coefficient function CORR() which calculates 
> Pearson's correlation coefficient between two columns. It should return a 
> DOUBLE value ranging from 0 to 1 describing the correlation between the two 
> numeric columns.
>  
> *COVAR_SAMP()* : It returns the sample covariance between two columns of 
> numeric types.
>  
> *COVAR_POP()* : It returns the population covariance between two columns of 
> numeric types.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()

2022-05-10 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Summary: Implement Statistical functions : CORR(), COVAR_SAMP() and 
COVAR_POP()  (was: Implement CORR() function)

> Implement Statistical functions : CORR(), COVAR_SAMP() and COVAR_POP()
> --
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement a correlation coefficient function CORR() which will calculate 
> Pearson's correlation coefficient between two columns. It should return a 
> DOUBLE value ranging from 0 to 1 describing the correlation between the two 
> numeric columns.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement CORR() function

2022-03-29 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: Implement a correlation coefficient function CORR() which will 
calculate Pearson's correlation coefficient between two columns. It should 
return a DOUBLE value ranging from 0 to 1 describing the correlation between 
the two numeric columns.  (was: Implement a correlation coefficient function 
CORR() which will calculate Pearson's correlation coefficient between two 
columns.)

> Implement CORR() function
> -
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement a correlation coefficient function CORR() which will calculate 
> Pearson's correlation coefficient between two columns. It should return a 
> DOUBLE value ranging from 0 to 1 describing the correlation between the two 
> numeric columns.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement CORR() function

2022-03-29 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: Implement a correlation coefficient function CORR() which will 
calculate Pearson's correlation coefficient between two columns.

> Implement CORR() function
> -
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement a correlation coefficient function CORR() which will calculate 
> Pearson's correlation coefficient between two columns.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement CORR() function

2022-03-29 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: Implement an aggregate function, corr() which will 

> Implement CORR() function
> -
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>
> Implement an aggregate function, corr() which will 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Updated] (IMPALA-11205) Implement CORR() function

2022-03-29 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha updated IMPALA-11205:
---
Description: (was: Implement an aggregate function, corr() which will )

> Implement CORR() function
> -
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Assigned] (IMPALA-11205) Implement CORR() function

2022-03-29 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-11205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha reassigned IMPALA-11205:
--

Assignee: Pranav Yogi Lodha

> Implement CORR() function
> -
>
> Key: IMPALA-11205
> URL: https://issues.apache.org/jira/browse/IMPALA-11205
> Project: IMPALA
>  Issue Type: Sub-task
>  Components: Backend
>Reporter: Pranav Yogi Lodha
>Assignee: Pranav Yogi Lodha
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Created] (IMPALA-11205) Implement CORR() function

2022-03-29 Thread Pranav Yogi Lodha (Jira)
Pranav Yogi Lodha created IMPALA-11205:
--

 Summary: Implement CORR() function
 Key: IMPALA-11205
 URL: https://issues.apache.org/jira/browse/IMPALA-11205
 Project: IMPALA
  Issue Type: Sub-task
  Components: Backend
Reporter: Pranav Yogi Lodha






--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org



[jira] [Resolved] (IMPALA-955) Implement the BYTES built-in

2022-02-10 Thread Pranav Yogi Lodha (Jira)


 [ 
https://issues.apache.org/jira/browse/IMPALA-955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Yogi Lodha resolved IMPALA-955.
--
Fix Version/s: Impala 4.1.0
   Resolution: Fixed

Resolved

> Implement the BYTES built-in
> 
>
> Key: IMPALA-955
> URL: https://issues.apache.org/jira/browse/IMPALA-955
> Project: IMPALA
>  Issue Type: New Feature
>  Components: Backend
>Affects Versions: Impala 1.3
>Reporter: David Z. Chen
>Assignee: Pranav Yogi Lodha
>Priority: Minor
>  Labels: built-in-function, newbie, ramp-up
> Fix For: Impala 4.1.0
>
>
> Implement the BYTES built-in: 
> http://www.info.teradata.com/HTMLPubs/DB_TTU_14_00/index.html#page/SQL_Reference/B035_1145_111A/Attribute_Functions.089.02.html



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

-
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org