[ 
https://issues.apache.org/jira/browse/SPARK-44909?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Weichen Xu reassigned SPARK-44909:
----------------------------------

    Assignee: Weichen Xu

> Skip starting torch distributor log streaming server when it is not available
> -----------------------------------------------------------------------------
>
>                 Key: SPARK-44909
>                 URL: https://issues.apache.org/jira/browse/SPARK-44909
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 0.5.0
>            Reporter: Weichen Xu
>            Assignee: Weichen Xu
>            Priority: Major
>
> Skip starting torch distributor log streaming server when it is not available.
>  
> In some cases, e.g., in a databricks connect cluster, there is some network 
> limitation that casues starting log streaming server failure, but, this does 
> not need to break torch distributor training routine.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to