Francisco Guerrero created CASSANDRA-19369:
----------------------------------------------

             Summary: [Analytics] Use XXHash32 for digest calculation of 
SSTables
                 Key: CASSANDRA-19369
                 URL: https://issues.apache.org/jira/browse/CASSANDRA-19369
             Project: Cassandra
          Issue Type: Improvement
            Reporter: Francisco Guerrero
            Assignee: Francisco Guerrero


During bulk writes, Cassandra Analytics calculates the MD5 checksum of every 
SSTable it produces. During SSTable upload to Cassandra Sidecar, Cassandra 
Analytics includes the {{content-md5}} header as part of the upload request. 
This information is used by Cassandra Sidecar to validate the integrity of the 
uploaded SSTable and prevent issues with bit flips and corrupted SSTables.

Recently, Cassandra Sidecar introduced [support for additional checksum 
validations|https://issues.apache.org/jira/browse/CASSANDRASC-97] during 
SSTable upload. Notably the XXHash32 digest support was added which offers for 
more performant checksum calculations. This support now allows Cassandra 
Analytics to use a more efficient digest algorithm that is friendlier on the 
CPU usage of Sidecar and spark resources.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@cassandra.apache.org
For additional commands, e-mail: commits-h...@cassandra.apache.org

Reply via email to