[ 
https://issues.apache.org/jira/browse/DRILL-8491?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Piyush Shama updated DRILL-8491:
--------------------------------
    Priority: Major  (was: Critical)

> MongoDB | Queries Conversion optimisation & using various mongoDB features
> --------------------------------------------------------------------------
>
>                 Key: DRILL-8491
>                 URL: https://issues.apache.org/jira/browse/DRILL-8491
>             Project: Apache Drill
>          Issue Type: Improvement
>            Reporter: Piyush Shama
>            Priority: Major
>
> {*}Title{*}: Inefficient Query Translation and Underutilised Functions in SQL 
> to MongoDB Conversion Using Apache Drill
> {*}Description{*}: We have been experiencing significant performance issues 
> when using Apache Drill to convert SQL queries for use with MongoDB. It 
> appears that the SQL to MongoDB query translation process is not optimally 
> executed, leading to inefficient query operations and slow response times.
> {*}Details{*}:
>  # {*}Inefficient Query Translation{*}:
>  ** The translation of SQL queries into MongoDB-specific queries by Apache 
> Drill seems sub optimal. This inefficiency is particularly noticeable with 
> complex queries, where the expected execution plan does not align with 
> MongoDB's capabilities, resulting in slower query performance.
>  # {*}Underutilization of MongoDB Capabilities{*}:
>  ** Several MongoDB functionalities are not being fully utilised in the 
> translation process:
>  *** {*}Aggregation Operations{*}: Functions like {{{}SUM(){}}}, 
> {{{}AVG(){}}}, {{{}MIN(){}}}, and {{MAX()}} are either poorly translated or 
> not utilised, leading to potential performance degradation.
>  *** {*}Date Handling{*}: Extraction of date components (e.g., day from an 
> ISO date) within queries is not handled efficiently, forcing additional 
> processing overhead or client-side computations.
>  *** {*}Count Queries{*}: Simple count operations are not optimised, possibly 
> translating into more complex query forms than necessary.
> {*}Impact{*}: The current issues significantly affect the performance and 
> scalability of applications relying on Apache Drill for interacting with 
> MongoDB, particularly in data-heavy environments.
> {*}Expected Behaviour{*}:
>  * Queries translated from SQL to MongoDB should utilise MongoDB's native 
> query capabilities more effectively, ensuring that operations such as 
> aggregations, date extractions, and counts are executed in the most efficient 
> manner possible.
>  * The translation engine should optimise the query structure to leverage 
> MongoDB's strengths, particularly in handling large datasets.
> {*}Steps to Reproduce{*}:
>  # Set up Apache Drill with a MongoDB data source.
>  # Execute complex SQL queries involving aggregation, date extraction, and 
> count operations.
>  # Observe the generated MongoDB queries and resulting performance.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to