[ 
https://issues.apache.org/jira/browse/SPARK-24678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

sharkd tu updated SPARK-24678:
------------------------------
    Description: 
Currently, `BlockRDD.getPreferredLocations`  only get hosts info of blocks, 
which results in subsequent schedule level is not better than 'NODE_LOCAL'. We 
can just make a small changes, the schedule level can be improved to 
'PROCESS_LOCAL'

 

  was:
Currently, the meta-info of blocks that were received by spark-streaming 
receivers only contains hosts info, which results in subsequent schedule level 
is not better than 'NODE_LOCAL'. We can just make a small changes, the schedule 
level can be improved to 'PROCESS_LOCAL'

 


> We should use 'PROCESS_LOCAL' first for Spark-Streaming
> -------------------------------------------------------
>
>                 Key: SPARK-24678
>                 URL: https://issues.apache.org/jira/browse/SPARK-24678
>             Project: Spark
>          Issue Type: Improvement
>          Components: Block Manager
>    Affects Versions: 2.3.1
>            Reporter: sharkd tu
>            Priority: Major
>
> Currently, `BlockRDD.getPreferredLocations`  only get hosts info of blocks, 
> which results in subsequent schedule level is not better than 'NODE_LOCAL'. 
> We can just make a small changes, the schedule level can be improved to 
> 'PROCESS_LOCAL'
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to