sijie commented on issue #1334: Experimental settings to achieve 5 ms latency
URL: 
https://github.com/apache/incubator-pulsar/issues/1334#issuecomment-408508215
 
 
   @ratcashdev 
   
   > Kafka (and also RabbitMQ) can achieve sub 1 ms latency, while using 
persistence.
   
   `persistence` means different things when different people talk about it. 
   
   when most of the projects talking about disk persistence, they mean data are 
written to filesystem. However when data is written to filesystem doesn't 
really mean data is persistent. Kafka only writes to filesystem without fsyncs. 
the "persistence" here is a `filesystem` persistence, but it still faces 
data-loss possibility.
   
   In relational databases, people talk about "ACID" - "D" for "Durability", 
when data is persistent to disk, it means writing to filesystem and also 
*fsync* to disk to ensure data is not lost when machine crashes. 
   this disk-level persistence has much stronger data integrity than 
filesystem-level persistence. 
   
   The DZone blog post is benchmarking kafka with filesystem-level-persistence. 
since most of the data is in filesystem page cache, which even not touch disks. 
so the latency is expected to be low, since those are effectively "memory" 
latency.
   
   The gigaom report is benchmarking pulsar and kafka with 
disk-level-persistence (However due to kafka's design, they can not really 
achieve disk-level-persistence as what Pulsar can achieve. The benchmark was 
using a close-enough settings to simulate same disk persistence behavior to do 
an apple-to-apple comparison). so the latency measure there were real disk 
persistence latency.
   
   keep "filesystem-persistence" and "disk-persistence" in the mind when you 
read the blog posts, they will help you understand those numbers. Pulsar 
recently added a flag to disable `fsync` behavior. so frankly speaking, pulsar 
is able to achieve both `filesystem-persistence` and `disk-persistence`, people 
can decide to choose which one based on their tradeoffs. However I don't think 
there is any benchmark results regarding pulsar `filesystem-persistence`.  All 
the benchmark results or claims in the blog posts about 5ms latency are about 
`disk-persistence`.
   
   for replicating the claims, you can try out 
[open-messaging-benchmark](http://openmessaging.cloud/docs/benchmarks/). open 
messaging is a linux foundation project on building a vendor neutral messaging 
standard and benchmark. 
   
   Hope this helps.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to