codeant-ai-for-open-source[bot] commented on PR #37557:
URL: https://github.com/apache/superset/pull/37557#issuecomment-3821284443
## **Sequence Diagram**
The PR threads UUIDs from dataset YAML configs through example discovery
into the generic Parquet loader and changes dataset lookup to prefer UUID (with
schema fallback). This prevents duplicate datasets and backfills UUID/schema on
existing metadata when needed.
```mermaid
sequenceDiagram
participant CLI
participant DataLoading
participant GenericLoader
participant Database
CLI->>DataLoading: discover_datasets() -> read dataset.yaml (includes
uuid)
DataLoading->>GenericLoader: create_generic_loader(..., uuid=from_yaml)
CLI->>GenericLoader: invoke loader -> load_parquet_table(uuid)
GenericLoader->>Database: _find_dataset(uuid first; else
table_name+schema)
alt Dataset found by UUID
Database-->>GenericLoader: return existing SqlaTable (no changes)
else Not found
GenericLoader->>Database: create/load table, create/merge SqlaTable
GenericLoader->>Database: set/backfill tbl.uuid and tbl.schema if
provided
Database-->>GenericLoader: merged SqlaTable
end
GenericLoader-->>CLI: return dataset (matched or created)
```
---
*Generated by [CodeAnt AI](https://codeant.ai)*
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]