DynamoDB eventstore, work in progress #118

jankronquist · 2022-07-11T15:01:46Z

No description provided.

bartelink · 2022-07-11T19:18:32Z

.../native/src/main/java/org/occurrent/eventstore/dynamodb/nativedriver/DynamoDBEventStore.java

+    }
+
+    try {
+      dynamoDB.transactWriteItems(TransactWriteItemsRequest.builder()


Beware TWI - it costs double what a standard operation costs (obviously the Equinox.DynamoStore schema involves much more logic as a result of using UpdateItem). See jet/equinox#327 for my learnings from going down this road

Yes I'm aware of this and this will most likely be a configuration option how to write. Ideally I would like adding events to an eventstream to be an atomic operation and initially I had one row per transaction (ie several events), but then I had to change this in order to conform to what seems to be the rule in occurrent that every single event should increment the version by one.

@johanhaleby Related to this, I was confused by this: EventStream read(String streamId, int skip, int limit);

Should skip be the number of events to skip or should this be the version number? If its the version number, shouldnt this be a long? Does the version number have to equal the number of events?

When doing eventsourcing I usually consider all the events generated by a command to be an atomic update of the eventstream, ie having versions between the start and the end of the transaction does not necessarily make sense.

I had one row per transaction (ie several events), but then I had to change this in order to conform to what seems to be the rule in occurrent that every single event should increment the version by one.

Yeah the problem/tradeoff is that the minute you try to fulfil the transactional correctness requirement, you run into a set of problems:

TransactWriteItems doubles the charges for everything, which is a massive loss

You need something useful to make the write contingent on (i.e. the expectedversion etc - if you instead are checking that the previous item is present and the one you are writing is not, even the basic coding gets complex)

In Equinox, the schema resolves the forces by having the notion of a Tip per stream, which gates all writes going through:

one could keep an event counter in it, but I have an etag string (this allows one to rolling transactionally correct updates without having to write a new event every time). Where you are writing event 0, the condition is that the Tip does not exist

if you are persisting multiple events in one write, they can all get appended in a single Put/Update call

In addition to working for larger cases, it has the following key properties for normal use:

small streams are a single item that can be loaded via a single GetItem roundtrip

TransactWriteItems only becomes required when the tip overflows

minimum storage overhead

One thing it does complicate is the fact that the DDB Streams output will emit a full copy of the Tip for every update, e.g. if you are adding 2 events to a Tip that has one event already, the DDB streams output will be a DDB Streas event with the full Item (which hosts 3 events, but only 2 are new)

The other thing to bear in mind is that having >1 event per item means you need a good story about when you are writing 201K of events on top of 200K of existing events.

I would caution against having a mode switch in your implementation; testing, reasoning and talking about the code becomes a nightmare. Better to have a single impl that can deal with your use cases efficiently and test, tune and validate that. (The other reason I say that is that I fundamentally believe that an event per Item schema is just worse than useless in terms of cost and efficiency too)

Should skip be the number of events to skip or should this be the version number? If its the version number, shouldnt this be a long? Does the version number have to equal the number of events?

I use longs for event indexes; ESDB etc does too. In practice the CUs and latency it costs to read more than 2m events make it irrelevant (and there are fixed limits to how much can be held in a logical partition (10GB is it?), so any design that is predicated on unlimited stream lengths is not even theoretically implementable)

johanhaleby · 2022-07-12T06:20:43Z

Nice work Jan!

For everyone's info, you also sent me a private email, that I answered and wrote some comments on. We can continue the discussion by email or here, whatever suits you best :)

jankronquist · 2022-07-15T11:27:58Z

FYI this is very experimental and a way for me to learn more about occurrent and it will of course need to have lots of more configuration options to be fully usable. I'm going on vacation for a few weeks, but I will pick this up later! At this point I just wanted to share what I have done so far ...

DynamoDB eventstore, work in progress

055b18c

bartelink reviewed Jul 11, 2022

View reviewed changes

johanhaleby mentioned this pull request Sep 21, 2022

DynamoDB support #121

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DynamoDB eventstore, work in progress #118

DynamoDB eventstore, work in progress #118

jankronquist commented Jul 11, 2022

bartelink Jul 11, 2022

jankronquist Jul 15, 2022

bartelink Jul 15, 2022

johanhaleby commented Jul 12, 2022

jankronquist commented Jul 15, 2022

DynamoDB eventstore, work in progress #118

Are you sure you want to change the base?

DynamoDB eventstore, work in progress #118

Conversation

jankronquist commented Jul 11, 2022

bartelink Jul 11, 2022

Choose a reason for hiding this comment

jankronquist Jul 15, 2022

Choose a reason for hiding this comment

bartelink Jul 15, 2022

Choose a reason for hiding this comment

johanhaleby commented Jul 12, 2022

jankronquist commented Jul 15, 2022