Re: Small File to HDFS

Jörn Franke Thu, 03 Sep 2015 14:48:07 -0700

Well it is the same as in normal hdfs, delete file and put a new one with
the same name works.


Le jeu. 3 sept. 2015 à 21:18,  <nib...@free.fr> a écrit :

> HAR archive seems a good idea , but just a last question to be sure to do
> the best choice :
> - Is it possible to override (remove/replace) a file inside the HAR ?
> Basically the name of my small files will be the keys of my records , and
> sometimes I will need to replace the content of a file by a new content
> (remove/replace)
>
>
> Tks a lot
> Nicolas
>
> ----- Mail original -----
> De: "Jörn Franke" <jornfra...@gmail.com>
> À: nib...@free.fr
> Cc: user@spark.apache.org
> Envoyé: Jeudi 3 Septembre 2015 19:29:42
> Objet: Re: Small File to HDFS
>
>
>
> Har is transparent and hardly any performance overhead. You may decide not
> to compress or use a fast compression algorithm, such as snappy
> (recommended)
>
>
>
> Le jeu. 3 sept. 2015 à 16:17, < nib...@free.fr > a écrit :
>
>
> My main question in case of HAR usage is , is it possible to use Pig on it
> and what about performances ?
>
> ----- Mail original -----
> De: "Jörn Franke" < jornfra...@gmail.com >
> À: nib...@free.fr , user@spark.apache.org
> Envoyé: Jeudi 3 Septembre 2015 15:54:42
> Objet: Re: Small File to HDFS
>
>
>
>
> Store them as hadoop archive (har)
>
>
> Le mer. 2 sept. 2015 à 18:07, < nib...@free.fr > a écrit :
>
>
> Hello,
> I'am currently using Spark Streaming to collect small messages (events) ,
> size being <50 KB , volume is high (several millions per day) and I have to
> store those messages in HDFS.
> I understood that storing small files can be problematic in HDFS , how can
> I manage it ?
>
> Tks
> Nicolas
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
> For additional commands, e-mail: user-h...@spark.apache.org
>
>

Re: Small File to HDFS

Reply via email to