Ceph does not scale the way HDFS does. There are HDFS clusters of 10K-20K nodes in production.
Ceph also has no concept of data locality, so every I/O is served over
the network.
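
To put a rough number on the locality point, here is a back-of-envelope sketch. The per-node scan rate, the local-read fractions, and the function names are all assumptions picked for illustration, not measurements from any Ceph or HDFS cluster.

```python
# Back-of-envelope estimate of per-node network read traffic.
# All inputs are hypothetical; plug in figures from your own workload.

def network_read_gbps(scan_mb_per_s: float, local_fraction: float) -> float:
    """Network read traffic per node in Gbit/s.

    scan_mb_per_s  -- data each compute node reads, in MB/s (decimal units)
    local_fraction -- share of reads served from local disks (0.0 to 1.0);
                      0.0 models the case where every read crosses the network
    """
    if not 0.0 <= local_fraction <= 1.0:
        raise ValueError("local_fraction must be between 0.0 and 1.0")
    remote_mb_per_s = scan_mb_per_s * (1.0 - local_fraction)
    return remote_mb_per_s * 8 / 1000  # MB/s -> Gbit/s


def nic_utilization(scan_mb_per_s: float, local_fraction: float,
                    nic_gbps: float) -> float:
    """Fraction of one NIC's line rate consumed by remote reads."""
    if nic_gbps <= 0:
        raise ValueError("nic_gbps must be positive")
    return network_read_gbps(scan_mb_per_s, local_fraction) / nic_gbps


if __name__ == "__main__":
    scan = 1000.0  # assumed: each node scans 1000 MB/s
    for label, local in (("no locality", 0.0), ("90% local reads", 0.9)):
        for nic in (10.0, 50.0):
            used = nic_utilization(scan, local, nic)
            print(f"{label:>16} on {nic:.0f}G NIC: {used:.0%} of line rate")
```

Under these assumed numbers, a node scanning 1000 MB/s with no local reads needs about 8 Gbit/s for reads alone, which nearly fills a 10G link before any write or replication traffic is counted.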

On Thu, Aug 26, 2021 at 12:04 PM zhang listar <zhanglinuxs...@gmail.com> wrote:
>
> Hi, all.
>
> I want to use Ceph instead of HDFS in a big data analysis scenario. Does Ceph
> have potential problems when the cluster becomes large, say 100 PB or
> 500 PB?
>
> As far as I know, there are some cons:
>
>    1. No short-circuit read, so we need a fast network, say 10G or, better, 50G?
>    2. du is not exact, but that is acceptable for applications.
>    3. Ceph can't handle slow disks the way HDFS does, for example with hedged reads or writes.
>
> Is that right? Or are there other cons?
>
> Thanks in advance.
> _______________________________________________
> ceph-users mailing list -- ceph-users@ceph.io
> To unsubscribe send an email to ceph-users-le...@ceph.io
