bobbai00 opened a new issue, #5013: URL: https://github.com/apache/texera/issues/5013
### Task Summary Sub-issue of #5011. Move the `FileResolver.datasetResolveFunc` direct DB call (`common/workflow-core/.../FileResolver.scala`) behind an HTTP service that owns the credentials. The executor forwards the originating user's JWT. `datasetResolveFunc` joins `USER × DATASET × DATASET_VERSION` to translate `/owner/dataset/version/file` into a `dataset:///<repo>/<hash>/<file>` URI. It is the only `SqlServer` call site reachable from CU Master / Worker that is not about execution metadata. It is invoked from `LogicalPlan.resolveScanSourceOpFileName`, which runs during workflow compile on every execution. The natural owner for this lookup is `file-service` (it already owns the dataset model). Done when no code reachable from CU Master / Worker calls `SqlServer` for dataset path resolution, the new endpoint is `@Auth`-checked, and the existing `FileResolverSpec` plus an end-to-end workflow run that scans a dataset file still pass. ### Task Type - [x] Refactor / Cleanup - [ ] DevOps / Deployment / CI - [ ] Testing / QA - [ ] Documentation - [ ] Performance - [ ] Other -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
