[FLINK-31989][docs] Update English docs for KinesisStreamsSource and DynamoDbStreamsSource #179
Conversation
Comments on DDB connector docs
Many thanks for the updated docs; I've added comments inline.
<td>yes</td>
<td style="word-wrap: break-word;">16</td>
<td>Integer</td>
<td>Request threshold for uncompleted requests by <code>KinesisAsyncClient</code> before blocking new write requests and applying backpressure.</td>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is actually something that should be explained in the DataStream docs too.
Also, it would be worth explaining that a "request" is a batch.
In general, the batching mechanism would deserve a dedicated chapter, possibly in the DataStream docs, and linked from here.
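For illustration, a DataStream-side note could be backed by a sketch like the one below. This is only a rough sketch: the builder methods (`setPartitionKeyGenerator`, `setMaxBatchSize`, `setMaxInFlightRequests`) are assumptions based on the AsyncSink-style `KinesisStreamsSink` builder and should be verified against the connector javadocs; the stream name is a placeholder.

```java
import org.apache.flink.api.common.serialization.SimpleStringSchema;
import org.apache.flink.connector.kinesis.sink.KinesisStreamsSink;

// A "request" is one batched write issued by the async Kinesis client.
KinesisStreamsSink<String> sink =
        KinesisStreamsSink.<String>builder()
                .setStreamName("example-stream")  // placeholder stream name
                .setSerializationSchema(new SimpleStringSchema())
                .setPartitionKeyGenerator(element -> String.valueOf(element.hashCode()))
                .setMaxBatchSize(500)             // records grouped into a single request
                // Once this many requests are uncompleted ("in flight"), the sink blocks
                // new writes and applies backpressure.
                .setMaxInFlightRequests(16)
                .build();
```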
Let's focus the changes for this PR on the source docs.
Kinesis data streams consist of one or more shards, and the `sink.partitioner` option allows you to control how records written into a multi-shard Kinesis-backed table will be partitioned between its shards.
Valid values are:

* `fixed`: Kinesis `PartitionKey` values derived from the Flink subtask index, so each Flink partition ends up in at most one Kinesis partition (assuming that no re-sharding takes place at runtime).
It would be clearer to present these as mutually exclusive options:

Partitioning is defined either by using `PARTITION BY` in the table definition or by specifying `sink.partitioner`. Using both will result in a configuration error.

Valid values for `sink.partitioner`:

- `fixed`: ...
- `random`: ...
- Custom `FixedKinesisPartitioner` subclass...
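A minimal DDL sketch of the two mutually exclusive ways to partition (hedged: the option keys `connector`, `stream`, `aws.region`, `format`, and `sink.partitioner` follow the legacy Kinesis table connector and may differ in the current version; table and stream names are placeholders):

```java
import org.apache.flink.table.api.EnvironmentSettings;
import org.apache.flink.table.api.TableEnvironment;

TableEnvironment tEnv = TableEnvironment.create(EnvironmentSettings.inStreamingMode());

// Variant 1: let 'sink.partitioner' decide how records are spread across shards.
// Variant 2 (not shown): use PARTITION BY in the table definition instead.
// Using both at once results in a configuration error.
tEnv.executeSql(
        "CREATE TABLE kinesis_sink (" +
        "  user_id STRING," +
        "  event_time TIMESTAMP(3)" +
        ") WITH (" +
        "  'connector' = 'kinesis'," +
        "  'stream' = 'example-stream'," +
        "  'aws.region' = 'us-east-1'," +
        "  'format' = 'json'," +
        "  'sink.partitioner' = 'fixed'" +  // or 'random', or a custom partitioner class
        ")");
```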
Let's focus the changes for this PR on the source docs.
{{< hint info >}}
Records written into tables defining a `PARTITION BY` clause will always be partitioned based on a concatenated projection of the `PARTITION BY` fields.
In this case, the `sink.partitioner` field cannot be used to modify this behavior (attempting to do this results in a configuration error).
If alternative options are explained above, this line becomes redundant
Let's focus the changes for this PR on the source docs.
# Data Type Mapping

Kinesis stores records as Base64-encoded binary data objects, so it doesn't have a notion of internal record structure.
Text formats, such as `json` or `csv`, are written to Kinesis without modifications. Binary formats such as `avro` are Base64-encoded and then written to Kinesis as text.
(is this right?)
I'm not sure actually!
Let's focus the changes for this PR on the source docs.
LGTM for the source docs
<td>no</td>
<td style="word-wrap: break-word;">JOB_MANAGED</td>
<td>String</td>
<td>Only applicable to the EFO <code>ReaderType</code>. Determines whether the EFO consumer is managed by the Flink job: <code>JOB_MANAGED|SELF_MANAGED</code>.</td>
Sorry, I copy-pasted the deleted title from the other docs.
It was "EFO Stream Consumer Lifecycle Management".
But we can skip linking across docs for now.
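If it helps when wiring this up later, a hedged DataStream-side sketch of the EFO lifecycle configuration follows. The option constants (`READER_TYPE`, `EFO_CONSUMER_NAME`, `EFO_CONSUMER_LIFECYCLE`) and their enum values are assumptions about `KinesisSourceConfigOptions` and need to be checked against the actual class; the consumer name is a placeholder.

```java
import org.apache.flink.configuration.Configuration;
import org.apache.flink.connector.kinesis.source.config.KinesisSourceConfigOptions;

// Assumed option constants; verify against KinesisSourceConfigOptions.
Configuration sourceConfig = new Configuration();
sourceConfig.set(KinesisSourceConfigOptions.READER_TYPE,
        KinesisSourceConfigOptions.ReaderType.EFO);
sourceConfig.set(KinesisSourceConfigOptions.EFO_CONSUMER_NAME, "my-efo-consumer");
// JOB_MANAGED: the Flink job registers and deregisters the EFO consumer itself.
// SELF_MANAGED: the consumer is created and removed outside the job.
sourceConfig.set(KinesisSourceConfigOptions.EFO_CONSUMER_LIFECYCLE,
        KinesisSourceConfigOptions.ConsumerLifecycle.JOB_MANAGED);
```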
Purpose of the change
Updates English docs for `KinesisStreamsSource` and `DynamoDbStreamsSource`.
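For context, a minimal usage sketch of the source being documented (hedged: package and builder method names are written from memory of the connector and should be checked against the published docs; the stream ARN is a placeholder):

```java
import org.apache.flink.api.common.eventtime.WatermarkStrategy;
import org.apache.flink.api.common.serialization.SimpleStringSchema;
import org.apache.flink.configuration.Configuration;
import org.apache.flink.connector.kinesis.source.KinesisStreamsSource;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();

// Build the source against a (placeholder) stream ARN with default source configuration.
KinesisStreamsSource<String> source =
        KinesisStreamsSource.<String>builder()
                .setStreamArn("arn:aws:kinesis:us-east-1:123456789012:stream/example-stream")
                .setSourceConfig(new Configuration())
                .setDeserializationSchema(new SimpleStringSchema())
                .build();

// Consume the stream as a DataStream of strings.
DataStream<String> records =
        env.fromSource(source, WatermarkStrategy.noWatermarks(), "Kinesis source");
records.print();

env.execute("KinesisStreamsSource example");
```

(`DynamoDbStreamsSource` is, as far as I recall, built with a similar builder.)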
Verifying this change
This change is a docs change without any test coverage.
Significant changes
(Please check any boxes [x] if the answer is "yes". You can first publish the PR and check them afterwards, for convenience.)