[Connector APIs] Connector update last sync info, status, error #2641

jedrazb · 2024-06-17T12:55:31Z

Closes https://github.com/elastic/search-team/issues/7792

Use update error api, update status API and update last sync stats api to manage connector lifecycle during syncs.

This feature is behind a feature flag that is disabled by default.

Validation

added unit tests
tested e2e on all sync types

Pre-Review Checklist

this PR does NOT contain credentials of any kind, such as API keys or username/passwords (double check config.yml.example)
this PR has a meaningful title
this PR links to all relevant github issues that it fixes or partially addresses
this PR has a thorough description
Covered the changes with automated tests
Tested the changes locally
Added a label for each target release version (example: v7.13.2, v7.14.0, v8.0.0)

…tats

artem-shelkovnikov · 2024-06-25T09:15:43Z

connectors/protocol/connectors.py

+            await self.index.api.connector_update_last_sync_info(
+                connector_id=self.id, last_sync_info=last_sync_information
+            )
+            await self.index.api.connector_update_status(
+                connector_id=self.id, status=Status.CONNECTED.value
+            )
+            await self.index.api.connector_update_error(
+                connector_id=self.id, error=None
+            )


That's a little concerning - we do 3 calls to do single thing? Should we merge them into one call?

We could unify error and status endpoint in a single call (requires small ES adjustment). Set status as a function of error being null or non-null.

However, I think we should maintain the _last_sync as a separate call. Integrating them would require expanding the last_sync_info endpoint with even more values in the request body (e.g. it would need to take error) - doable but then we are converging to _update like functionality.

Agreed, uniting status update + error update in one endpoint and leaving update_last_sync_info endpoint separately is a good way forward

artem-shelkovnikov

Left some questions - currently some of the calls are updating several fields at once in a way that does not make connector enter invalid state.

With new api changes I see that it's possible that only partial updates are applied to the records in connectors index (e.g. error is populated, but status is not changed) if something goes wrong - CTRL+C, network blip, Elasticsearch crashing, etc

artem-shelkovnikov · 2024-06-25T09:16:16Z

connectors/protocol/connectors.py

+            await self.index.api.connector_update_error(
+                connector_id=self.id, error=error
+            )
+            await self.index.api.connector_update_status(
+                connector_id=self.id, status=Status.ERROR.value
+            )


Same here - this is an atomic action "mark as error" that updates status and writes error, should it be single action?

Agreed that we could unify this in a following way:

we just call _error endpoint, the logic in ES could set the status depending if error is null or not

That would be perfect IMO!

artem-shelkovnikov · 2024-06-25T09:16:44Z

connectors/protocol/connectors.py

+            await self.index.api.connector_update_status(
+                connector_id=self.id, status=connector_status.value
+            )
+            await self.index.api.connector_update_error(
+                connector_id=self.id, error=job_error
+            )
+            await self.index.api.connector_update_last_sync_info(
+                connector_id=self.id, last_sync_info=last_sync_information
+            )


And same thing here - is there any chance we unite these three into one?

See answer above about other 3 calls

jedrazb · 2024-06-25T12:32:34Z

With new api changes I see that it's possible that only partial updates are applied to the records in connectors index (e.g. error is populated, but status is not changed) if something goes wrong - CTRL+C, network blip, Elasticsearch crashing, etc

@artem-shelkovnikov Thank you for your review. While I agree that making several calls in a non-atomic block is not ideal, if we pursue the path of creating a single call that updates the connector document in the given scenario, we could end up with numerous endpoints or go back to what the OG _update endpoint was doing (single endpoint many fields in payload).

I propose we do a slight improvement to error endpoint: if we set error to null it means that connector is in connected state (beginning of new sync), for non-null errors it means we ended up in error state. This can eliminate one call from that block.

artem-shelkovnikov · 2024-06-25T12:50:42Z

@jedrazb my main concern is not performance, but changing the system into invalid state with API - even if it happens for 1-2 seconds.

Optimisation of calls on the other hand I think is not as important - as you mentioned, we will end up with just _update endpoint in the end.

jedrazb · 2024-06-26T09:22:13Z

my main concern is not performance, but changing the system into invalid state with API - even if it happens for 1-2 seconds.

Tbh as long as we can update current error string and status we should be fine (so let's optimise this into a single request, update error with status side-effect).

The last_sync_* stuff could in theory fail as this is purely for informational purposes in Kibana, see:
Our operational logic doesn't depend on this in elastic/connectors repo or Kibana (reference, usage)

…tats

jedrazb added 2 commits June 13, 2024 14:34

WIP use Connector API

512b524

Add tests

1ffa5db

github-actions bot added auto-backport v8.15.0.0 labels Jun 17, 2024

Merge branch 'main' into connectors-api--use-for-updating-last-sync-s…

3906645

…tats

jedrazb marked this pull request as ready for review June 25, 2024 07:13

jedrazb requested a review from a team June 25, 2024 07:13

fix changes local

a2475a0

artem-shelkovnikov reviewed Jun 25, 2024

View reviewed changes

jedrazb mentioned this pull request Jun 26, 2024

[Connector API] Update status when setting/resetting connector error elastic/elasticsearch#110192

Merged

jedrazb added 3 commits June 28, 2024 10:57

Merge branch 'main' into connectors-api--use-for-updating-last-sync-s…

0d8a65e

…tats

Consolidate update status and error into a single call

b27b415

Fix linting

2016ac1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Connector APIs] Connector update last sync info, status, error #2641

[Connector APIs] Connector update last sync info, status, error #2641

jedrazb commented Jun 17, 2024 •

edited

Loading

artem-shelkovnikov Jun 25, 2024

jedrazb Jun 25, 2024 •

edited

Loading

artem-shelkovnikov Jun 25, 2024

artem-shelkovnikov left a comment

artem-shelkovnikov Jun 25, 2024

jedrazb Jun 25, 2024

artem-shelkovnikov Jun 25, 2024

artem-shelkovnikov Jun 25, 2024

jedrazb Jun 25, 2024

jedrazb commented Jun 25, 2024 •

edited

Loading

artem-shelkovnikov commented Jun 25, 2024

jedrazb commented Jun 26, 2024

[Connector APIs] Connector update last sync info, status, error #2641

Are you sure you want to change the base?

[Connector APIs] Connector update last sync info, status, error #2641

Conversation

jedrazb commented Jun 17, 2024 • edited Loading

Closes https://github.com/elastic/search-team/issues/7792

Validation

Pre-Review Checklist

artem-shelkovnikov Jun 25, 2024

Choose a reason for hiding this comment

jedrazb Jun 25, 2024 • edited Loading

Choose a reason for hiding this comment

artem-shelkovnikov Jun 25, 2024

Choose a reason for hiding this comment

artem-shelkovnikov left a comment

Choose a reason for hiding this comment

artem-shelkovnikov Jun 25, 2024

Choose a reason for hiding this comment

jedrazb Jun 25, 2024

Choose a reason for hiding this comment

artem-shelkovnikov Jun 25, 2024

Choose a reason for hiding this comment

artem-shelkovnikov Jun 25, 2024

Choose a reason for hiding this comment

jedrazb Jun 25, 2024

Choose a reason for hiding this comment

jedrazb commented Jun 25, 2024 • edited Loading

artem-shelkovnikov commented Jun 25, 2024

jedrazb commented Jun 26, 2024

jedrazb commented Jun 17, 2024 •

edited

Loading

jedrazb Jun 25, 2024 •

edited

Loading

jedrazb commented Jun 25, 2024 •

edited

Loading