Francisco Guerrero created CASSANDRA-19369:
----------------------------------------------
Summary: [Analytics] Use XXHash32 for digest calculation of SSTables
Key: CASSANDRA-19369
URL: https://issues.apache.org/jira/browse/CASSANDRA-19369
Project: Cassandra
Issue Type: Improvement
Reporter: Francisco Guerrero
Assignee: Francisco Guerrero

During bulk writes, Cassandra Analytics calculates the MD5 checksum of every SSTable it produces. When it uploads an SSTable to Cassandra Sidecar, it includes the {{content-md5}} header in the upload request. Cassandra Sidecar uses this value to validate the integrity of the uploaded SSTable, guarding against bit flips and corrupted SSTables.

Recently, Cassandra Sidecar introduced [support for additional checksum validations|https://issues.apache.org/jira/browse/CASSANDRASC-97] during SSTable upload. Notably, it added support for XXHash32 digests, which are cheaper to compute than MD5. Cassandra Analytics can now use this more efficient digest algorithm, reducing CPU usage on both Sidecar and Spark resources.
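
To make the two digest options concrete, here is a minimal Java sketch that computes both an MD5 digest and an XXHash32 digest over the same bytes. It is illustrative only and is not the Cassandra Analytics or Sidecar implementation: the class name {{DigestSketch}} and its methods are hypothetical, the Base64 encoding of the MD5 value is an assumption about how a {{content-md5}} style header is typically populated, and the XXHash32 routine is a from-scratch one-shot version of the public algorithm (a real deployment would more likely stream the file through an existing library implementation).

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Base64;

// Hypothetical helper; names are illustrative, not taken from the project.
final class DigestSketch {
    // The five 32-bit primes defined by the XXHash32 algorithm.
    private static final int PRIME1 = 0x9E3779B1;
    private static final int PRIME2 = 0x85EBCA77;
    private static final int PRIME3 = 0xC2B2AE3D;
    private static final int PRIME4 = 0x27D4EB2F;
    private static final int PRIME5 = 0x165667B1;

    // MD5 digest, Base64-encoded (assumed encoding for a content-md5 style header).
    static String md5Base64(byte[] data) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5").digest(data);
            return Base64.getEncoder().encodeToString(digest);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 not available", e);
        }
    }

    // One-shot XXHash32 over an in-memory buffer.
    static int xxhash32(byte[] data, int seed) {
        int len = data.length;
        int i = 0;
        int h;
        if (len >= 16) {
            // Four accumulators, each consuming 4 bytes of every 16-byte stripe.
            int v1 = seed + PRIME1 + PRIME2;
            int v2 = seed + PRIME2;
            int v3 = seed;
            int v4 = seed - PRIME1;
            for (; i <= len - 16; i += 16) {
                v1 = round(v1, readIntLE(data, i));
                v2 = round(v2, readIntLE(data, i + 4));
                v3 = round(v3, readIntLE(data, i + 8));
                v4 = round(v4, readIntLE(data, i + 12));
            }
            h = Integer.rotateLeft(v1, 1) + Integer.rotateLeft(v2, 7)
              + Integer.rotateLeft(v3, 12) + Integer.rotateLeft(v4, 18);
        } else {
            h = seed + PRIME5;
        }
        h += len;
        // Remaining 4-byte words, then remaining single bytes.
        for (; i <= len - 4; i += 4) {
            h = Integer.rotateLeft(h + readIntLE(data, i) * PRIME3, 17) * PRIME4;
        }
        for (; i < len; i++) {
            h = Integer.rotateLeft(h + (data[i] & 0xFF) * PRIME5, 11) * PRIME1;
        }
        // Final avalanche step.
        h ^= h >>> 15;
        h *= PRIME2;
        h ^= h >>> 13;
        h *= PRIME3;
        h ^= h >>> 16;
        return h;
    }

    private static int round(int acc, int input) {
        return Integer.rotateLeft(acc + input * PRIME2, 13) * PRIME1;
    }

    private static int readIntLE(byte[] b, int off) {
        return (b[off] & 0xFF) | (b[off + 1] & 0xFF) << 8
             | (b[off + 2] & 0xFF) << 16 | (b[off + 3] & 0xFF) << 24;
    }

    public static void main(String[] args) {
        // Stand-in payload; a bulk writer would digest the SSTable file contents.
        byte[] payload = "example sstable bytes".getBytes(StandardCharsets.UTF_8);
        System.out.println("md5 (base64):   " + md5Base64(payload));
        System.out.println("xxhash32 (hex): " + Integer.toHexString(xxhash32(payload, 0)));
    }
}
```

XXHash32 is a non-cryptographic hash built from a handful of multiplies and rotates per 16 bytes, which is why it costs less CPU than MD5. That trade-off suits this use case, where the digest only needs to detect accidental corruption such as bit flips rather than resist deliberate tampering.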