[ 
https://issues.apache.org/jira/browse/SPARK-23270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16344910#comment-16344910
 ] 

Apache Spark commented on SPARK-23270:
--------------------------------------

User 'guoxiaolongzte' has created a pull request for this issue:
https://github.com/apache/spark/pull/20437

> FileInputDStream Streaming UI 's records should not be set to the default 
> value of 0, it should be the total number of rows of new files.
> -----------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-23270
>                 URL: https://issues.apache.org/jira/browse/SPARK-23270
>             Project: Spark
>          Issue Type: Bug
>          Components: DStreams
>    Affects Versions: 2.4.0
>            Reporter: guoxiaolongzte
>            Priority: Major
>         Attachments: 1.png
>
>
> FileInputDStream Streaming UI 's records should not be set to the default 
> value of 0, it should be the total number of rows of new files.
> ^-------------------------------------------in FileInputDStream.scala 
> start-------------------------------------^
> val inputInfo = StreamInputInfo(id, {color:#FF0000}0{color}, metadata) 
> {color:#FF0000}// set to the default value of 0{color}
>  ssc.scheduler.inputInfoTracker.reportInfo(validTime, inputInfo)
> case class StreamInputInfo(
>  inputStreamId: Int, numRecords: Long, metadata: Map[String, Any] = Map.empty)
> -------------------------------------in FileInputDStream.scala 
> end---------------------------
>  
> ^-------------------------------------------in DirectKafkaInputDStream.scala 
> start-------------------------------------^
> val inputInfo = StreamInputInfo(id, {color:#FF0000}rdd.count{color}, 
> metadata) {color:#FF0000}//set to rdd count as numRecords{color}
>  ssc.scheduler.inputInfoTracker.reportInfo(validTime, inputInfo)
> case class StreamInputInfo(
> inputStreamId: Int, numRecords: Long, metadata: Map[String, Any] = Map.empty)
> -------------------------------------in DirectKafkaInputDStream.scala 
> end-----------------------
>  
> test method:
> ./bin/spark-submit --class org.apache.spark.examples.streaming.HdfsWordCount 
> examples/jars/spark-examples_2.11-2.4.0-SNAPSHOT.jar /spark/tmp/
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to