[ 
https://issues.apache.org/jira/browse/SPARK-24787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16583964#comment-16583964
 ] 

Sanket Reddy commented on SPARK-24787:
--------------------------------------

Thanks [~ste...@apache.org] [~vanzin] [~tgraves] it seems we might have to 
stick with hflush but think of potentially another solution to update the file 
status changes similar to YARN ATS.

Even if I periodically update it I think the dropped events issue might persist 
as it hard to have a proper flow control.

> Events being dropped at an alarming rate due to hsync being slow for 
> eventLogging
> ---------------------------------------------------------------------------------
>
>                 Key: SPARK-24787
>                 URL: https://issues.apache.org/jira/browse/SPARK-24787
>             Project: Spark
>          Issue Type: Bug
>          Components: Spark Core, Web UI
>    Affects Versions: 2.3.0, 2.3.1
>            Reporter: Sanket Reddy
>            Priority: Minor
>
> [https://github.com/apache/spark/pull/16924/files] updates the length of the 
> inprogress files allowing history server being responsive.
> Although we have a production job that has 60000 tasks per stage and due to 
> hsync being slow it starts dropping events and the history server has wrong 
> stats due to events being dropped.
> A viable solution is not to make it sync very frequently or make it 
> configurable.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to