voonhous commented on code in PR #18744:
URL: https://github.com/apache/hudi/pull/18744#discussion_r3247885780
##########
rfc/rfc-100/rfc-100.md:
##########
@@ -163,32 +163,46 @@ Lance's native blob encoding stores blobs in a way that
already exposes a `(file
**Visual**
+What the user gets back, grouped by storage type (set at write time) and then
by query shape:
+
+```mermaid
+flowchart TD
+ ST{storage_type}
+
+ ST -->|OUT_OF_LINE| QO{Query}
+ QO -->|"SELECT col"| OOL["type = OUT_OF_LINE<br/>inline_data =
NULL<br/>reference = user-supplied"]
+ QO -->|"SELECT read_blob(col)"| RBO(["bytes — materialized<br/>via the
external reference"])
+
+ ST -->|INLINE| QI{Query}
+ QI -->|"SELECT col"| M{hoodie.read.blob.inline.mode}
+ M -->|CONTENT default| CONT["type = INLINE<br/>inline_data =
bytes<br/>reference = NULL"]
+ M -->|DESCRIPTOR| F{file format}
+ F -->|Lance| LD["type = INLINE<br/>inline_data = NULL<br/>reference =
synthetic managed<br/>path, offset, length, is_managed=true"]
+ F -->|Parquet| PD["type = INLINE<br/>inline_data = NULL<br/>reference =
NULL<br/>bytes only via read_blob"]
+
+ QI -->|"SELECT read_blob(col)"| RM{hoodie.read.blob.inline.mode}
+ RM -->|CONTENT default| RBC(["bytes from inline_data on the row<br/>1
hop"])
Review Comment:
Addressed.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]