Run migrations to fix incorrect source fields of contentnodes #4720

ozer550 · 2024-09-10T10:41:17Z

Summary

We filter out all the contentnodes whose source fields have been changed after being imported and reset them to their original values.

Manual verification steps performed

Ran the query in hotfixes
Verified the returned nodes conform to expected disparities.

Reviewer guidance

How can a reviewer test these changes?

Recommended to run the query in hotfixes and verify the returned nodes. some of the fields to look at would be
last date of modification and verifying what source_node_fields have been changed.

Are there any risky areas that deserve extra testing?

If there is something that is missed migrations may be incorrect and may effect large number of nodes.

References

closes #4190

Comments

Contributor's Checklist

PR process:

If this is an important user-facing change, PR or related issue the CHANGELOG label been added to this PR. Note: items with this label will be added to the CHANGELOG at a later time
If this includes an internal dependency change, a link to the diff is provided
The docs label has been added if this introduces a change that needs to be updated in the user docs?
If any Python requirements have changed, the updated requirements.txt files also included in this PR
Opportunities for using Google Analytics here are noted
Migrations are safe for a large db

Studio-specifc:

All user-facing strings are translated properly
The notranslate class been added to elements that shouldn't be translated by Google Chrome's automatic translation feature (e.g. icons, user-generated text)
All UI components are LTR and RTL compliant
Views are organized into pages, components, and layouts directories as described in the docs
Users' storage used is recalculated properly on any changes to main tree files
If there new ways this uses user data that needs to be factored into our Privacy Policy, it has been noted.

Testing:

Code is clean and well-commented
Contributor has fully tested the PR manually
If there are any front-end changes, before/after screenshots are included
Critical user journeys are covered by Gherkin stories
Any new interactions have been added to the QA Sheet
Critical and brittle code paths are covered by unit tests

Reviewer's Checklist

This section is for reviewers to fill out.

Automated test coverage is satisfactory
PR is fully functional
PR has been tested for accessibility regressions
External dependency files were updated if necessary (yarn and pip)
Documentation is updated
Contributor is in AUTHORS.md

...ntcuration/kolibri_public/management/commands/rectify_incorrect_contentnode_source_fields.py

bjester

Just some comments

contentcuration/contentcuration/tests/test_rectify_source_field_migraiton_command.py

bjester · 2024-09-20T18:06:20Z

contentcuration/contentcuration/tests/test_rectify_source_field_migraiton_command.py

+            source_channel_id=self.original_channel.id,
+            source_node_id=self.original_contentnode.node_id,
+            original_source_node_id=self.original_contentnode.node_id,
+        )


One suggestion would be to use the same copy/import utilities that we use elsewhere, then override the things that shouldn't have changed, but it isn't a big deal. From my perspective, I like to do my best to ensure the tests are founded upon the app's behaviors as much as possible, because too many differences could cause the tests to pass when they shouldn't (under the typical behaviors of the app)

rtibbles · 2024-09-24T20:30:15Z

contentcuration/contentcuration/tests/test_rectify_source_field_migraiton_command.py

+            print(node.id == base_channel.main_tree.id)
+            print("checking if the node is complete or not ", node.complete)
+            node.changed = False
+            # This should probably again change the changed=true but suprisingly it doesnot


In case this helps this feel less surprising: https://github.com/learningequality/studio/blob/unstable/contentcuration/contentcuration/models.py#L1831

We have an explicit exclude list of fields for which updates to them do not trigger change to be set as True.

rtibbles

One question about how many times we're republishing :)

I am still not completely sure what to do about the identity of the user who does the publish, but an admin account seems easiest. We can query for that using [email protected] as the email address to look up the user id.

rtibbles · 2024-09-24T20:33:55Z

...ntcuration/kolibri_public/management/commands/rectify_incorrect_contentnode_source_fields.py

+                    if is_test:
+                        publish_channel(user_id, base_channel.id)
+                    else:
+                        publish_channel("SOME ID", base_channel.id, base_channel.id)


My only uncertainty here is whether we should be running the publish as the user whose channel it is, or as some administrator.

The administrator means that we can always reliably run the publish as the same user, but does mean that it now appears that someone else has published the channel. The only other thing that comes to mind is that if we ran as the channel editor, they would receive an email indicating to them that their channel had been republished.

The email draft that will be sent by imps team mentions that "WE" would be the one doing the change so the administrator seems more reliable option here?

Altho this email is not sent to editors of channels which are not public, so they might be confused about whats happening. When as channel editor the sending email event can be avoided, as by default send_email=False is set for publish channel function?

...ntcuration/kolibri_public/management/commands/rectify_incorrect_contentnode_source_fields.py

rtibbles

Not seeing anything else to do here - I think we can merge and see how this runs on hotfixes?

akolson · 2024-09-27T13:47:31Z

Merging this. Thanks @ozer550 for your persistence on this, @bjester @rtibbles for your reviews.

pcenov · 2024-10-01T11:05:51Z

@akolson no issues observed while running the CWs.

ozer550 added 3 commits August 5, 2024 17:34

Add rectifying migration command

c9231fe

change filter method to get

a8adee4

add filter based on last modified

1d867c5

ozer550 requested review from bjester and rtibbles September 10, 2024 10:41

refine the migrations

5641e14

ozer550 commented Sep 13, 2024

View reviewed changes

...ntcuration/kolibri_public/management/commands/rectify_incorrect_contentnode_source_fields.py Outdated Show resolved Hide resolved

add tests

c02201b

bjester reviewed Sep 20, 2024

View reviewed changes

ozer550 added 2 commits September 23, 2024 16:54

update tests

71ce33b

add implicit and explicit tests

37b3181

rtibbles self-assigned this Sep 24, 2024

rtibbles reviewed Sep 24, 2024

View reviewed changes

ozer550 added 2 commits September 25, 2024 12:33

cache channel_ids

9ad8ce7

add admin user_id

a1aaedd

rtibbles reviewed Sep 26, 2024

View reviewed changes

...ntcuration/kolibri_public/management/commands/rectify_incorrect_contentnode_source_fields.py Outdated Show resolved Hide resolved

return proper id instead of dict

0f1518f

rtibbles approved these changes Sep 26, 2024

View reviewed changes

akolson merged commit 6f6daf2 into learningequality:unstable Sep 27, 2024
13 checks passed

akolson mentioned this pull request Sep 30, 2024

Bulk Edit Release #4635

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run migrations to fix incorrect source fields of contentnodes #4720

Run migrations to fix incorrect source fields of contentnodes #4720

ozer550 commented Sep 10, 2024

bjester left a comment

bjester Sep 20, 2024

rtibbles Sep 24, 2024

rtibbles left a comment

rtibbles Sep 24, 2024

ozer550 Sep 25, 2024

ozer550 Sep 25, 2024

rtibbles left a comment

akolson commented Sep 27, 2024

pcenov commented Oct 1, 2024

Run migrations to fix incorrect source fields of contentnodes #4720

Run migrations to fix incorrect source fields of contentnodes #4720

Conversation

ozer550 commented Sep 10, 2024

Summary

Manual verification steps performed

Reviewer guidance

How can a reviewer test these changes?

Are there any risky areas that deserve extra testing?

References

Comments

Contributor's Checklist

Reviewer's Checklist

This section is for reviewers to fill out.

bjester left a comment

Choose a reason for hiding this comment

bjester Sep 20, 2024

Choose a reason for hiding this comment

rtibbles Sep 24, 2024

Choose a reason for hiding this comment

rtibbles left a comment

Choose a reason for hiding this comment

rtibbles Sep 24, 2024

Choose a reason for hiding this comment

ozer550 Sep 25, 2024

Choose a reason for hiding this comment

ozer550 Sep 25, 2024

Choose a reason for hiding this comment

rtibbles left a comment

Choose a reason for hiding this comment

akolson commented Sep 27, 2024

pcenov commented Oct 1, 2024