[jira] [Commented] (DRILL-6680) Expose SHOW FILES command into INFORMATION_SCHEMA

2018-08-12 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-6680?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16577869#comment-16577869
 ] 

ASF GitHub Bot commented on DRILL-6680:
---

arina-ielchiieva commented on a change in pull request #1430: DRILL-6680: Expose show files command into INFORMATION_SCHEMA
URL: https://github.com/apache/drill/pull/1430#discussion_r209499097
 
 

 ##
 File path: exec/java-exec/src/main/resources/drill-module.conf
 ##
 @@ -605,4 +605,5 @@ drill.exec.options: {
 store.kafka.poll.timeout: 200,
 web.logs.max_lines: 1,
 window.enable: true,
+storage.list_files_recursively: true
 
 Review comment:
   Sounds fair, made the changes. Thanks!


This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Expose SHOW FILES command into INFORMATION_SCHEMA
> -
>
> Key: DRILL-6680
> URL: https://issues.apache.org/jira/browse/DRILL-6680
> Project: Apache Drill
> Issue Type: New Feature
> Affects Versions: 1.14.0
> Reporter: Arina Ielchiieva
> Assignee: Arina Ielchiieva
> Priority: Major
> Labels: doc-impacting
> Fix For: 1.15.0
>
>
> Link to design document - 
> https://docs.google.com/document/d/1UnvATwH4obn1-XsA83xMz3LtylbMu867eBLN9r3eV3k/edit#



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (DRILL-6670) Error in parquet record reader - previously readable file fails to be read in 1.14

2018-08-12 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/DRILL-6670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16577704#comment-16577704
 ] 

ASF GitHub Bot commented on DRILL-6670:
---

okalinin commented on a change in pull request #1428: DRILL-6670: align Parquet TIMESTAMP_MICROS logical type handling with earlier versions + minor fixes
URL: https://github.com/apache/drill/pull/1428#discussion_r209465875
 
 

 ##
 File path: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/AbstractParquetScanBatchCreator.java
 ##
 @@ -108,7 +108,7 @@ protected ScanBatch getBatch(ExecutorFragmentContext context, AbstractParquetRow
 
 if (!context.getOptions().getBoolean(ExecConstants.PARQUET_NEW_RECORD_READER)
     && !ParquetReaderUtility.containsComplexColumn(footer, rowGroupScan.getColumns())) {
-  logger.debug("Query {} qualifies for new Parquet reader",
+  logger.debug("Query {} qualifies for ParquetRecordReader",
 
 Review comment:
   @vrozov I realised that your proposed wording sounds better. 'not enforced' 
may cause confusion too. Reverted back to 'enabled/disabled'.




> Error in parquet record reader - previously readable file fails to be read in 1.14
> --
>
> Key: DRILL-6670
> URL: https://issues.apache.org/jira/browse/DRILL-6670
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Parquet
> Affects Versions: 1.14.0
> Reporter: Dave Challis
> Assignee: Oleksandr Kalinin
> Priority: Major
> Fix For: 1.15.0
>
> Attachments: example.parquet
>
>
> A Parquet file generated by PyArrow was readable in Apache Drill 1.12 and 
> 1.13, but fails to be read with 1.14.
> Running the query "SELECT * FROM dfs.`foo.parquet`" results in the following 
> error message from the Drill web query UI:
> {code}
> Query Failed: An Error Occurred
> org.apache.drill.common.exceptions.UserRemoteException: INTERNAL_ERROR ERROR: 
> Error in parquet record reader. Message: Failure in setting up reader Parquet 
> Metadata: ParquetMetaData{FileMetaData{schema: message schema { optional 
> binary name (UTF8); optional binary creation_parameters (UTF8); optional 
> int64 creation_date (TIMESTAMP_MICROS); optional int32 data_version; optional 
> int32 schema_version; } , metadata: {pandas={"index_columns": [], 
> "column_indexes": [], "columns": [{"name": "name", "field_name": "name", 
> "pandas_type": "unicode", "numpy_type": "object", "metadata": null}, {"name": 
> "creation_parameters", "field_name": "creation_parameters", "pandas_type": 
> "unicode", "numpy_type": "object", "metadata": null}, {"name": 
> "creation_date", "field_name": "creation_date", "pandas_type": "datetime", 
> "numpy_type": "datetime64[ns]", "metadata": null}, {"name": "data_version", 
> "field_name": "data_version", "pandas_type": "int32", "numpy_type": "int32", 
> "metadata": null}, {"name": "schema_version", "field_name": "schema_version", 
> "pandas_type": "int32", "numpy_type": "int32", "metadata": null}], 
> "pandas_version": "0.22.0"}}}, blocks: [BlockMetaData{1, 27142 
> [ColumnMetaData{SNAPPY [name] optional binary name (UTF8) [PLAIN, RLE], 4}, 
> ColumnMetaData{SNAPPY [creation_parameters] optional binary 
> creation_parameters (UTF8) [PLAIN, RLE], 252}, ColumnMetaData{SNAPPY 
> [creation_date] optional int64 creation_date (TIMESTAMP_MICROS) [PLAIN, RLE], 
> 46334}, ColumnMetaData{SNAPPY [data_version] optional int32 data_version 
> [PLAIN, RLE], 46478}, ColumnMetaData{SNAPPY [schema_version] optional int32 
> schema_version [PLAIN, RLE], 46593}]}]} Fragment 0:0 [Error Id: 
> bdb2e4d5-5982-4cc6-b95e-244782f827d2 on f9d0456cddd2:31010] 
> {code}
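
For readers unfamiliar with the TIMESTAMP_MICROS annotation on the creation_date column in the schema above: in the Parquet format it marks an int64 that holds microseconds since the Unix epoch. The following is a minimal Java sketch of that interpretation only. The class name, method name, and sample value are hypothetical, and nothing here is taken from Drill or from pull request #1428.

```java
import java.time.Instant;
import java.time.temporal.ChronoUnit;

// Hypothetical illustration class; not part of Drill.
public class TimestampMicrosSketch {

    // Interprets a raw int64 value as microseconds since 1970-01-01T00:00:00Z.
    static Instant fromMicros(long micros) {
        return Instant.EPOCH.plus(micros, ChronoUnit.MICROS);
    }

    public static void main(String[] args) {
        // Hypothetical sample value, chosen to fall on the date of this thread.
        long sample = 1_534_032_000_000_000L;
        System.out.println(fromMicros(sample)); // prints 2018-08-12T00:00:00Z
    }
}
```

A reader that ignores the annotation would surface the same column as a plain 64-bit integer, which is why the handling of this logical type can change what a query returns or whether the file can be read at all.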


