[ https://issues.apache.org/jira/browse/SQOOP-1393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14132442#comment-14132442 ]
Pratik Khadloya commented on SQOOP-1393:
----------------------------------------
With Hive 0.12 I get the following error:
{code}
bin/sqoop import --connect jdbc:mysql://mydbserver.net/mydb --username myuser \
  --password mypwd --query "SELECT ... WHERE \$CONDITIONS" --num-mappers 1 \
  --hive-import --hive-table test --create-hive-table \
  --target-dir /user/myuser/sqoop/test --as-parquetfile
{code}
{code}
14/09/12 21:24:46 WARN spi.Registration: Not loading URI patterns in org.kitesdk.data.hcatalog.impl.Loader
14/09/12 21:24:46 ERROR sqoop.Sqoop: Got exception running Sqoop: org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI: hive?dataset=null
org.kitesdk.data.DatasetNotFoundException: Unknown dataset URI: hive?dataset=null
    at org.kitesdk.data.spi.Registration.lookupDatasetUri(Registration.java:106)
    at org.kitesdk.data.Datasets.create(Datasets.java:189)
    at org.kitesdk.data.Datasets.create(Datasets.java:233)
    at org.apache.sqoop.mapreduce.ParquetJob.createDataset(ParquetJob.java:81)
    at org.apache.sqoop.mapreduce.ParquetJob.configureImportJob(ParquetJob.java:70)
    at org.apache.sqoop.mapreduce.DataDrivenImportJob.configureMapper(DataDrivenImportJob.java:112)
    at org.apache.sqoop.mapreduce.ImportJobBase.runImport(ImportJobBase.java:253)
    at org.apache.sqoop.manager.SqlManager.importQuery(SqlManager.java:721)
    at org.apache.sqoop.tool.ImportTool.importTable(ImportTool.java:499)
    at org.apache.sqoop.tool.ImportTool.run(ImportTool.java:605)
    at org.apache.sqoop.Sqoop.run(Sqoop.java:143)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
    at org.apache.sqoop.Sqoop.runSqoop(Sqoop.java:179)
    at org.apache.sqoop.Sqoop.runTool(Sqoop.java:218)
    at org.apache.sqoop.Sqoop.runTool(Sqoop.java:227)
    at org.apache.sqoop.Sqoop.main(Sqoop.java:236)
{code}
> Import data from database to Hive as Parquet files
> --------------------------------------------------
>
> Key: SQOOP-1393
> URL: https://issues.apache.org/jira/browse/SQOOP-1393
> Project: Sqoop
> Issue Type: Sub-task
> Components: tools
> Reporter: Qian Xu
> Assignee: Richard
> Fix For: 1.4.6
>
> Attachments: patch.diff, patch_v2.diff, patch_v3.diff
>
>
> Importing data into Hive as Parquet files can be separated into two steps:
> 1. Import an individual table from an RDBMS to HDFS as a set of Parquet files.
> 2. Import the data into Hive by generating and executing a CREATE TABLE
> statement that defines the data's layout in Hive as a Parquet-format table.
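
To make step 2 of the description concrete, here is a minimal Python sketch of generating such a CREATE TABLE statement. It is not Sqoop's actual implementation: the function name `parquet_create_table_ddl`, the use of an EXTERNAL table with a LOCATION clause, and the sample columns are all assumptions made for illustration. The `STORED AS PARQUET` shorthand is Hive syntax available from Hive 0.13 onward, so older Hive versions would need a different storage clause.

```python
def parquet_create_table_ddl(table, columns, location):
    """Build a Hive DDL string for Parquet files that already sit in HDFS.

    Hypothetical helper for illustration only, not part of Sqoop.
    columns is a list of (column_name, hive_type) pairs.
    """
    # One backquoted column definition per line.
    cols = ",\n".join(f"  `{name}` {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS `{table}` (\n"
        f"{cols}\n"
        ")\n"
        "STORED AS PARQUET\n"  # assumes Hive 0.13 or later
        f"LOCATION '{location}'"
    )


if __name__ == "__main__":
    # Table name and target directory taken from the command in the comment;
    # the column list is made up.
    print(parquet_create_table_ddl(
        "test",
        [("id", "INT"), ("name", "STRING")],
        "/user/myuser/sqoop/test",
    ))
```

The generated statement would then be executed in Hive after step 1 has written the Parquet files to the target directory.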