[ 
https://issues.apache.org/jira/browse/SAMZA-428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14161192#comment-14161192
 ] 

Jay Kreps commented on SAMZA-428:
---------------------------------

Gotcha. So yes, you can control the commit interval using 
TaskCoordinator.commit. I don't think this requires a notion of batching 
external to the task. Basically each time you got a message you would add it to 
the txn in whatever way you wanted, then when you felt you had enough you would 
commit the txn and request a commit for your task.

> Investigate: how to tune down caching in the KeyValueStore implementations
> --------------------------------------------------------------------------
>
>                 Key: SAMZA-428
>                 URL: https://issues.apache.org/jira/browse/SAMZA-428
>             Project: Samza
>          Issue Type: Improvement
>          Components: kv
>    Affects Versions: 0.8.0
>            Reporter: Chinmay Soman
>             Fix For: 0.8.0
>
>
> Currently, we have a 'CachedStore' layer on top of the KeyValueStore 
> implementation that we use. This might lead to double caching:
> i) Once at the CachedStore layer
> ii) Possibly cached again in the specific K-V store that we use (for eg: 
> RocksDB / BDB)
> We need the CachedStore layer so that the writes to LoggedStore (if 
> configured) are done in an efficient manner. 
> We can then potentially do some config tuning for the K-V store to reduce its 
> memory footprint and simply write to disk. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to