Hi folks, As AI agents, BI tools, notebooks, and query engines increasingly consume the same data, semantic definitions such as metrics and dimensions are often duplicated across multiple systems. This leads to inconsistent definitions, duplicated effort, and governance challenges. The rise of AI agents further amplifies this problem, as agents rely on semantic context to understand data and reason about business concepts. Without a shared semantic layer, organizations often end up maintaining multiple versions of the same business definitions across tools and applications.
JB and I would like to start a discussion on adding semantic layer support to Apache Polaris so semantic models can be defined once, governed centrally, and consumed consistently across tools. The proposal[1] introduces semantic models as a first class Polaris entity using the Open Semantic Interchange (OSI)[2] specification[3]. At a high level, the proposal adds: - A new SEMANTIC_MODEL entity type - CRUD APIs for semantic models - Schema validation and authorization Polaris remains a metadata service and does not execute metrics or semantic queries. Feedback on the overall direction, design, and OSI adoption would be greatly appreciated. 1. https://docs.google.com/document/d/1ZdI-1w_5LbyCMhvUhLCtOt-N1Z89L2P-oiGLaYayCZg/edit?usp=sharing 2. https://open-semantic-interchange.org 3. https://github.com/open-semantic-interchange/OSI/blob/main/core-spec/spec.md Yufei
