Zhijing Lu created COMDEV-510:
---------------------------------
Summary: [GSoC][Doris]Page Cache Improvement
Key: COMDEV-510
URL: https://issues.apache.org/jira/browse/COMDEV-510
Project: Community Development
Issue Type: Task
Components: GSoC/Mentoring ideas
Reporter: Zhijing Lu
*Apache Doris*
Apache Doris is a real-time analytical database based on MPP architecture. As a
unified platform that supports multiple data processing scenarios, it ensures
high performance for low-latency and high-throughput queries, allows for easy
federated queries on data lakes, and supports various data ingestion methods.
{*}Page{*}: [https://doris.apache.org|https://doris.apache.org/]
{*}Github{*}: [https://github.com/apache/doris]
h3. *Background*
Apache Doris accelerates high-concurrency queries utilizing page cache, where
the decompressed data is stored.
Currently, the page cache in Apache Doris uses a simple LRU algorithm, which
reveals a few problems: #
Hot data will be phased out in large queries
#
The page cache configuration is immutable and does not support GC.
h3. Task
#
{*}Phase One{*}: Identify the impacts on queries when the decompressed data is
stored in memory and SSD, respectively, and then determine whether full page
cache is required.
#
{*}Phase Two{*}: Improve the cache strategy for Apache Doris based on the
results from Phase One.
h3. Learning Material
{*}Page{*}: https://doris.apache.org
{*}Github{*}: [https://github.com/apache/doris]
h3. Mentor
* Mentor: Yongqiang Yang, Apache Doris PMC member & Committer,
[[email protected] |mailto:[email protected]]
* Mentor: Haopeng Li, Apache Doris PMC member & Committer,
[[email protected]|mailto:[email protected]]
* Mailing List: [email protected]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]