Hi, I have a cluster HDP 2.6 (High Available, 8 nodes) and like to try using Hive+Orc+Tez with Ignite. I guess I should use IFGS as cache layer for HDFS. I installed Hadoop Accelerator 1.9 on all cluster nodes and run one ignite-node on every cluster node.
I added these settings using Ambari and then restarted HDFS, MapReduce, Yarn, Hive. HDFS, add 2 new properties to Custom core-site fs.igfs.impl=org.apache.ignite.hadoop.fs.v1.IgniteHadoopFileSystem fs.AbstractFileSystem.igfs.impl=org.apache.ignite.hadoop.fs.v2.IgniteHadoopFileSystem Mapred, Custom mapred-site mapreduce.framework.name=ignite mapreduce.jobtracker.address=dev-nn1:11211 Hive, Custom hive-site hive.rpc.query.plan=true Now I can get access to HDFS through IGFS hadoop fs -ls igfs:///user/hive Found 3 items drwx------ - hive hdfs 0 2017-04-19 21:00 igfs:///user/hive/.Trash drwxr-xr-x - hive hdfs 0 2017-04-19 10:07 igfs:///user/hive/.hiveJars drwx------ - hive hdfs 0 2017-04-22 14:27 igfs:///user/hive/.staging I thought that Hive read data from HDFS first time and then read the same data from IFGS. But when I run HIVE (cli or beeline) it still reads data from HDFS (I tried a few times), in igniteVisor "Avg. free heap" remains the same before/during/after running query (about 80%). What is wrong? May be I should load data to IFGS manually for every query?
default-config.xml
Description: XML document