Re: [akka-user] Apache Kafka as journal - retention times/PersistentView and partitions

Martin Krasser Tue, 26 Aug 2014 06:29:08 -0700

Hi Andrzej,

On 26.08.14 09:15, Andrzej Dębski wrote:

Hello
Lately I have been reading about a possibility of using Apache Kafkaas journal/snapshot store for akka-persistence.
I am aware of the plugin created by MartinKrasser: https://github.com/krasserm/akka-persistence-kafka/ and alsoI read other topic about Kafka asjournal https://groups.google.com/forum/#!searchin/akka-user/kakfka/akka-user/iIHmvC6bVrI/zeZJtW0_6FwJ.
In both sources I linked two ideas were presented:
1. Set log retention to 7 days, take snapshots every 3 days (examplevalues)
2. Set log retention to unlimited.
Here is the first question: in first case wouldn't it mean thatpersistent views would receive skewed view of the PersistentActorstate (only events from 7 days) - is it really viable solution? As faras I know PersistentView can only receive events - it can't receivesnapshots from corresponding PersistentActor (which is good in generalcase).

PersistentViews can create their own snapshots which are isolated fromthe corresponding PersistentActor's snapshots.

Second question (more directed to Martin): in the thread I linked youwrote:
     I don't go into Kafka partitioning details here but it is
    possible to implement the journal driver in a way that both a
    single persistent actor's data are partitioned *and* kept in order
I am very interested in this idea. AFAIK it is not yet implemented incurrent plugin but I was wondering if you could share high level ideahow would you achieve that (one persistent actor, multiple partitions,ordering ensured)?


The idea is to

- first write events 1 to n to partition 1
- then write events n+1 to 2n to partition 2
- then write events 2n+1 to 3n to partition 3
- ... and so on

This works because a PersistentActor is the only writer to a partitionedjournal topic. During replay, you first replay partition 1, thenpartition 2 and so on. This should be rather easy to implement in theKafka journal, just didn't have time so far; pull requests are welcome:) Btw, the Cassandra journal<https://github.com/krasserm/akka-persistence-cassandra> follows thevery same strategy for scaling with data volume (by using differentpartition keys).


Cheers,
Martin

--
>>>>>>>>>> Read the docs: http://akka.io/docs/
>>>>>>>>>> Check the FAQ:http://doc.akka.io/docs/akka/current/additional/faq.html
>>>>>>>>>> Search the archives: https://groups.google.com/group/akka-user
---
You received this message because you are subscribed to the GoogleGroups "Akka User List" group.To unsubscribe from this group and stop receiving emails from it, sendan email to akka-user+unsubscr...@googlegroups.com<mailto:akka-user+unsubscr...@googlegroups.com>.To post to this group, send email to akka-user@googlegroups.com<mailto:akka-user@googlegroups.com>.
Visit this group at http://groups.google.com/group/akka-user.
For more options, visit https://groups.google.com/d/optout.


--
Martin Krasser

blog:    http://krasserm.blogspot.com
code:    http://github.com/krasserm
twitter: http://twitter.com/mrt1nz

--

     Read the docs: http://akka.io/docs/
     Check the FAQ: http://doc.akka.io/docs/akka/current/additional/faq.html
     Search the archives: https://groups.google.com/group/akka-user

---You received this message because you are subscribed to the Google Groups "Akka User List" group.

To unsubscribe from this group and stop receiving emails from it, send an email 
to akka-user+unsubscr...@googlegroups.com.
To post to this group, send email to akka-user@googlegroups.com.
Visit this group at http://groups.google.com/group/akka-user.
For more options, visit https://groups.google.com/d/optout.

Re: [akka-user] Apache Kafka as journal - retention times/PersistentView and partitions

Reply via email to