[ 
https://issues.apache.org/jira/browse/FLINK-2324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14644346#comment-14644346
 ] 

ASF GitHub Bot commented on FLINK-2324:
---------------------------------------

Github user gyfora commented on the pull request:

    https://github.com/apache/flink/pull/937#issuecomment-125600663
  
    Yes, I have written this above (that's why I replaced it with one using the 
partitioned states). But that's not the only problem. If you introduce a 
shuffle between two maps (.shuffle()) then those are not correctly checkpointed 
either.


> Rework partitioned state storage
> --------------------------------
>
>                 Key: FLINK-2324
>                 URL: https://issues.apache.org/jira/browse/FLINK-2324
>             Project: Flink
>          Issue Type: Improvement
>            Reporter: Gyula Fora
>            Assignee: Gyula Fora
>
> Partitioned states are currently stored per-key in statehandles. This is 
> alright for in-memory storage but is very inefficient for HDFS. 
> The logic behind the current mechanism is that this approach provides a way 
> to repartition a state without fetching the data from the external storage 
> and only manipulating handles.
> We should come up with a solution that can achieve both.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to