Hi,
I have submitted a mapreduce job, and can find it from job list, however I
find its STATE is PREP over last 8 hours, any idea why it takes so long to
"PREP"?
regards
(mapred job -list)
JobId State StartTime UserName Queue
Priority UsedContainers
Bhagaban,
First step is to ingest data into Hadoop using sqoop.
Teradata has powerful connectors to Hadoop where the connectors are to be
installed on all data nodes and then run imports using fast export etc.,
Challenge would be to create the same workflows in Hadoop that you had in
teradata.
>From TD perspective have a look at this - https://youtu.be/NTTQdAfZMJA They
are planning to opensource it. Perhaps you can get in touch with the team.
Let me know if you are interested. If you are TD contacts, ask about this,
they should be able to point to the right people.
Again, this is not
Hi,
I think Cloudera Navigator Optimizer is the tool you are looking for. It allows
you to transform SQL queries (TD) into Impala and Hive.
http://blog.cloudera.com/blog/2015/11/introducing-cloudera-navigator-optimizer-for-optimal-sql-workload-efficiency-on-apache-hadoop/