Hi, I want to use HDFS as a distributed file system to store files. I have one data server with 50 GB of data, and I plan to use 3 new machines with HDFS installed to duplicate this data. The 3 machines are 1 name node and 2 data nodes. The replication factor for all files is 2.
My questions are:

1. How can I create a 50 GB data node on one server? Specifically, I am interested in setting the data node's size to 50 GB.
2. What is the best way to export all data files from the external server (SSH access) to the new machines running HDFS?

Thanks,
Victor Samoylov