Thanks for the proposal! I believe this is a very useful feature, as the
other alternatives do not work well: people need to either define many
similar views with different grouping columns and aggregate functions, or
manually maintain a doc page to describe the semantic of these metrics that
people need to follow when writing queries to calculate these metrics.

Shall we start the vote next week if there is no objections?

On Fri, Oct 31, 2025 at 2:30 PM Linhong Liu
<[email protected]> wrote:

> Hi all,
>
> I would like to propose introducing "The metrics & semantic modeling in
> Spark".
>
> This feature enables defining business metrics once and reusing them
> across any breakdown, ensuring consistent outcomes and bridging the
> semantic gap between business logic and data schemas to help LLMs generate
> more precise results.
>
> Looking forward to your feedback!
>
> JIRA: SPARK-54119 <https://issues.apache.org/jira/browse/SPARK-54119>
> SPIP docs:
> https://docs.google.com/document/d/1xVTLijvDTJ90lZ_ujwzf9HvBJgWg0mY6cYM44Fcghl0/edit?tab=t.0#heading=h.4iogryr5qznc
> <https://docs.google.com/document/d/1xVTLijvDTJ90lZ_ujwzf9HvBJgWg0mY6cYM44Fcghl0/edit?tab=t.0#heading=h.4iogryr5qznc>
>
> Thanks,
> Linhong
>

Reply via email to