aglinxinyuan opened a new pull request, #4211: URL: https://github.com/apache/texera/pull/4211
### What changes were proposed in this PR? Introduce the Sklearn Testing Operator, which accepts any number of machine learning models (from 1 to n) and computes `accuracy`, `F1` score, `precision`, and `recall` for each model, appending these metrics to the output columns. There is a port dependency between the model and the data: the data port must be finished first. The data table is then used as an internal state for testing. This operator can accept any number of models; each model will be tested against the same data table. **Input single model:** <img width="797" alt="Screenshot 2025-07-04 at 22 54 33" src="https://github.com/user-attachments/assets/d14326f2-4f5f-4476-9eda-eb464ea8049c" /> **Input multiple models:** <img width="822" alt="Screenshot 2025-07-04 at 22 47 09" src="https://github.com/user-attachments/assets/4333ca08-3717-407a-b978-05995197f8c8" /> ### Was this PR authored or co-authored using generative AI tooling? No -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
