Xiaowen Sun created FLINK-36429:
-----------------------------------

             Summary: Enhancing Flink History Server File Storage and Retrieval 
with RocksDB
                 Key: FLINK-36429
                 URL: https://issues.apache.org/jira/browse/FLINK-36429
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / State Backends
    Affects Versions: 1.20.0
            Reporter: Xiaowen Sun


Currently, when a Flink job finishes, it writes an archive as a single file 
that maps paths to JSON files. Flink History Server (FHS) job archives are 
pulled locally where the FHS is running on, and this process creates a local 
directory that expands based on the contents of the single archive file.

Because of how the FHS stores the files, there are a large number of 
directories created in the local file system. This system can become 
inefficient and slow as the volume of job archives increases, creating 
bottlenecks in job data navigation and retrieval.

To illustrate the problem of inode usage, let’s consider a scenario where there 
are 5000 subtasks. Each subtask creates its own directory, and within each 
subtask directory, there are additional directories that might store only a 
single file. This structure rapidly increases the number of inodes consumed.

Integrating RocksDB, a high-performance embedded database for key-value data, 
aims to resolve these issues by offering faster data access and better 
scalability. This integration is expected to significantly enhance the 
operational efficiency of FHS by allowing faster data retrieval and enabling a 
larger cache on local Kubernetes deployments, thus overcoming inode limitations



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to