[ 
https://issues.apache.org/jira/browse/COMDEV-512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhijing Lu updated COMDEV-512:
------------------------------
    Description: 
*Apache Doris*
Apache Doris is a real-time analytical database based on MPP architecture. As a 
unified platform that supports multiple data processing scenarios, it ensures 
high performance for low-latency and high-throughput queries, allows for easy 
federated queries on data lakes, and supports various data ingestion methods.
Page: [https://doris.apache.org|https://doris.apache.org/]
Github: [https://github.com/apache/doris]
h3. *Background*

Apache Doris supports acceleration of queries on external data sources to meet 
users' needs for federated queries and analysis.
Currently, Apache Doris supports multiple external catalogs including those 
from Hive, Iceberg, Hudi, and JDBC. Developers can connect more data sources to 
Apache Doris based on a unified framework.
h4. *Objective*
 * Enable Apache Doris to access one or more of these data sources via the 
Multi-Catalog feature: BigQuery/Kudu/Cassandra/Druid;
 * 
Compile relevant documentation. See an example here: 
[https://doris.apache.org/docs/dev/lakehouse/multi-catalog/hive]

*Task*
{*}Phase One{*}:
 * Get familiar with the Multi-Catalog structure of Apache Doris, including the 
metadata synchronization mechanism in FE and the data reading mechanism of BE.
 * Investigate how metadata should be acquired and how data access works 
regarding the picked data source(s); produce the corresponding design 
documentation.

{*}Phase Two{*}:
 * Develop connections to the picked data source(s) and implement access to 
metadata and data.

h3. *Learning Material*

{*}Page{*}: [https://doris.apache.org|https://doris.apache.org/]
{*}Github{*}: [https://github.com/apache/doris]
h3. Mentor
 * Mentor: Mingyu Chen, Apache Doris PMC Member & Committer, 
[morning...@apache.org |mailto:yangyongqi...@apache.org]
 * Mentor: Calvin Kirs, Apache Dolphinscheduler PMC & Committer, 
[calvink...@apache.org|mailto:calvink...@apache.org]
 * Mailing List: d...@doris.apache.org

  was:
*Apache Doris*
Apache Doris is a real-time analytical database based on MPP architecture. As a 
unified platform that supports multiple data processing scenarios, it ensures 
high performance for low-latency and high-throughput queries, allows for easy 
federated queries on data lakes, and supports various data ingestion methods.
Page: [https://doris.apache.org|https://doris.apache.org/]
Github: [https://github.com/apache/doris]
h3. *Background*

Apache Doris supports acceleration of queries on external data sources to meet 
users' needs for federated queries and analysis.
Currently, Apache Doris supports multiple external catalogs including those 
from Hive, Iceberg, Hudi, and JDBC. Developers can connect more data sources to 
Apache Doris based on a unified framework.
h4. *Objective*
 * Enable Apache Doris to access one or more of these data sources via the 
Multi-Catalog feature: BigQuery/Kudu/Cassandra/Druid;
 * 
Compile relevant documentation. See an example here: 
[https://doris.apache.org/docs/dev/lakehouse/multi-catalog/hive]

*Task*
{*}Phase One{*}:
 * Get familiar with the Multi-Catalog structure of Apache Doris, including the 
metadata synchronization mechanism in FE and the data reading mechanism of BE.
 * Investigate how metadata should be acquired and how data access works 
regarding the picked data source(s); produce the corresponding design 
documentation.

{*}Phase Two{*}:
 * Develop connections to the picked data source(s) and implement access to 
metadata and data.
h3. *Learning Material*

{*}Page{*}: [https://doris.apache.org|https://doris.apache.org/]
{*}Github{*}: [https://github.com/apache/doris]
h3. Mentor
 * Mentor: Mingyu Chen, Apache Doris PMC Member & Committer, 
[morning...@apache.org |mailto:yangyongqi...@apache.org]
 * Mentor: Calvin Kirs, Apache Dolphinscheduler PMC & Committer, 
[calvink...@apache.org|mailto:calvink...@apache.org]
 * Mailing List: d...@doris.apache.org


> [GSoC][Doris] Supports BigQuery/Apache Kudu/Apache Cassandra/Apache Druid in 
> Federated Queries 
> -----------------------------------------------------------------------------------------------
>
>                 Key: COMDEV-512
>                 URL: https://issues.apache.org/jira/browse/COMDEV-512
>             Project: Community Development
>          Issue Type: Task
>          Components: GSoC/Mentoring ideas
>            Reporter: Zhijing Lu
>            Priority: Major
>              Labels: Doris, full-time, gsoc2023, mentor
>
> *Apache Doris*
> Apache Doris is a real-time analytical database based on MPP architecture. As 
> a unified platform that supports multiple data processing scenarios, it 
> ensures high performance for low-latency and high-throughput queries, allows 
> for easy federated queries on data lakes, and supports various data ingestion 
> methods.
> Page: [https://doris.apache.org|https://doris.apache.org/]
> Github: [https://github.com/apache/doris]
> h3. *Background*
> Apache Doris supports acceleration of queries on external data sources to 
> meet users' needs for federated queries and analysis.
> Currently, Apache Doris supports multiple external catalogs including those 
> from Hive, Iceberg, Hudi, and JDBC. Developers can connect more data sources 
> to Apache Doris based on a unified framework.
> h4. *Objective*
>  * Enable Apache Doris to access one or more of these data sources via the 
> Multi-Catalog feature: BigQuery/Kudu/Cassandra/Druid;
>  * 
> Compile relevant documentation. See an example here: 
> [https://doris.apache.org/docs/dev/lakehouse/multi-catalog/hive]
> *Task*
> {*}Phase One{*}:
>  * Get familiar with the Multi-Catalog structure of Apache Doris, including 
> the metadata synchronization mechanism in FE and the data reading mechanism 
> of BE.
>  * Investigate how metadata should be acquired and how data access works 
> regarding the picked data source(s); produce the corresponding design 
> documentation.
> {*}Phase Two{*}:
>  * Develop connections to the picked data source(s) and implement access to 
> metadata and data.
> h3. *Learning Material*
> {*}Page{*}: [https://doris.apache.org|https://doris.apache.org/]
> {*}Github{*}: [https://github.com/apache/doris]
> h3. Mentor
>  * Mentor: Mingyu Chen, Apache Doris PMC Member & Committer, 
> [morning...@apache.org |mailto:yangyongqi...@apache.org]
>  * Mentor: Calvin Kirs, Apache Dolphinscheduler PMC & Committer, 
> [calvink...@apache.org|mailto:calvink...@apache.org]
>  * Mailing List: d...@doris.apache.org



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@community.apache.org
For additional commands, e-mail: dev-h...@community.apache.org

Reply via email to