Re: Compression of Data in HDFS

2016-04-14 Thread jamesgreen
Hi Bryan
Thanks for your input, I did get it to work now. Sorry for the delayed
response 

Just to confirm if it reads from a certain file and compresses and writes
the compressed file to the Target Directory - how does nifi know that its
has read from a certain file already?
Or does it continue to read from Random Files?

Thanks 

James



--
View this message in context: 
http://apache-nifi-developer-list.39713.n7.nabble.com/Compression-of-Data-in-HDFS-tp8821p9061.html
Sent from the Apache NiFi Developer List mailing list archive at Nabble.com.


Re: Compression of Data in HDFS

2016-04-06 Thread jamesgreen
Hi Brian, Thanks for the help!

I have tried two ways 
a.
1.  I use GetHDFS to retrieve data from the HDFS , I then use putHDFS and 
set
the compression to GZIP.
2.  In the Directory I am putting the complete path i.e
/landing/teradata/compressed/prodeiw_arc
b.
1.   I use GetHDFS to retrieve data from the HDFS, I then use Compress
Content to apply the compression and then use PutHDFS
2.  In the Directory I am putting the complete path i.e
/landing/teradata/compressed/prodeiw_arc




--
View this message in context: 
http://apache-nifi-developer-list.39713.n7.nabble.com/Compression-of-Data-in-HDFS-tp8821p8825.html
Sent from the Apache NiFi Developer List mailing list archive at Nabble.com.


Compression of Data in HDFS

2016-04-06 Thread jamesgreen
I am trying to compress a whole lot of files from my HDFS and write to
another folder on the HDFS
My Folder Structure is as follows:
\landing\databasename\prodeiw_arc\tablename\_SUCCESS
\landing\databasename\prodeiw_arc\tablename\part-m-0

\landing\databasename\prodeiw_arc\tablename2\_SUCCESS
\landing\databasename\prodeiw_arc\tablename2\part-m-0

I am trying to compress to the following
\landing\compressed\prodeiw_arc\tablename\_SUCCESS
\landing\compressed\prodeiw_arc\tablename\part-m-0

\landing\compressed\prodeiw_arc\tablename2\_SUCCESS
\landing\compressed\prodeiw_arc\tablename2\part-m-0

I have found that it compresses to 
\landing\compressed\prodeiw_arc\_SUCCESS
\landing\compressed\prodeiw_arc\tablename\part-m-0

it will then continue to overwrite. Is there anyway I can keep the directory
structure when doing a PutHDFS?

Thanks and Regards




--
View this message in context: 
http://apache-nifi-developer-list.39713.n7.nabble.com/Compression-of-Data-in-HDFS-tp8821.html
Sent from the Apache NiFi Developer List mailing list archive at Nabble.com.