Hi All,

Please find the raised  issue:
https://github.com/apache/airflow/issues/56846

Overview
I would like to contribute a comprehensive MariaDB provider to Apache
Airflow that includes:

Native MariaDB connector support
ColumnStore engine integration
Bulk data loading with cpimport utility
S3 integration for data transfer
SSH-based remote execution


Why a Separate MariaDB Provider?
While MySQL provider exists, MariaDB has unique features that justify a
dedicated provider:

Native MariaDB Connector: Uses mariadb library instead of generic MySQL
connector
It is seen that using Native Mariadb as a python connector performs 2-3x
better for operations like insert_many, json handling and select queries.
cpimport Utility: Bulk loading functionality specific to MariaDB ColumnStore
ColumnStore Engine Support: Validation and optimization for ColumnStore
tables
S3 Integration: Direct S3 load/dump operations
SSH Integration: Remote execution capabilities
Components Included
MariaDBHook: Main hook with native connector support
MariaDBOperator: SQL execution operator
MariaDBCpImportOperator: Bulk data loading for ColumnStore
MariaDBS3LoadOperator: Load data from S3 to MariaDB
MariaDBS3DumpOperator: Export MariaDB data to S3

Comprehensive test suite with 95%+ coverage
Example DAGs for all features
Complete documentation
Dependencies
mariadb>=1.1.0 (native connector)
boto3>=1.26.0 (S3 integration)
paramiko>=2.7.0 (SSH support)

Please let me know your thoughts before I submit my PR.

Thanks,
Pratush Maheshwari

Reply via email to