szehon-ho commented on code in PR #56113:
URL: https://github.com/apache/spark/pull/56113#discussion_r3307015495


##########
python/pyspark/pipelines/flow.py:
##########
@@ -51,19 +51,21 @@ class AutoCdcFlow:
     An Auto CDC flow applies Change Data Capture (CDC) events from a source to 
a target
     streaming table.
 
-    :param name: Optional name of the flow. When None, defaults to the target 
name.
+    :param name: The name of the flow. The API layer defaults this to the 
target name when
+        the user does not supply one.
     :param target: The name of the target streaming table.
     :param source: The name of the CDC source to stream from.
     :param keys: Column(s) that uniquely identify a row in source and target 
data.
     :param sequence_by: Expression used to order the source data.
-    :param apply_as_deletes: Optional delete condition for the merged 
operation.
+    :param apply_as_deletes: Optional delete condition for the merge operation.
     :param column_list: Optional columns to include in the output table.
     :param except_column_list: Optional columns to exclude from the output 
table.
-    :param stored_as_scd_type: Optional SCD type for the target table. Only 1 
is supported.
+    :param stored_as_scd_type: Optional SCD type for the target table. Only 1 
(or "1") is \
+        supported.
     :param source_code_location: The location of the source code that created 
this flow.
     """
 
-    name: Optional[str]
+    name: str

Review Comment:
   agree



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to