feat(v2 upgrade): support engine live upgrade #241

derekbit · 2024-11-17T08:52:25Z

Which issue(s) this PR fixes:

Issue longhorn/longhorn#9104

Signed-off-by: Derek Su [email protected]

What this PR does / why we need it:

Special notes for your reviewer:

Additional documentation or context

coderabbitai · 2024-11-17T08:52:31Z

Walkthrough

The changes in this pull request introduce a new field, StandbyTargetPort, to the Engine struct across several files, including pkg/api/types.go and pkg/spdk/engine.go. The ProtoEngineToEngine function is updated to accommodate this new field. Additionally, various methods in pkg/spdk/engine.go are restructured to enhance error handling and streamline parameters. A new test suite is added in pkg/spdk/engine_test.go to validate the functionality of the updated methods. The EngineCreate method in pkg/client/client.go and pkg/spdk/server.go is also modified to remove the upgradeRequired parameter.

Changes

File	Change Summary
pkg/api/types.go	Added `StandbyTargetPort int32` (JSON tag `standby_target_port`) to the `Engine` struct; updated the `ProtoEngineToEngine` function.
pkg/spdk/engine.go	Added `StandbyTargetPort` to `Engine`; restructured `Create`, `handleFrontend`, `Delete`, and `SwitchOverTarget` methods; added `isNewEngine` and `checkInitiatorAndTargetCreationRequirements` methods; updated error handling and logging.
pkg/spdk/engine_test.go	Added tests: `TestCheckInitiatorAndTargetCreationRequirements`, `TestIsNewEngine`, and `TestReleaseTargetAndStandbyTargetPorts`.
pkg/client/client.go	Updated `EngineCreate` method to remove `upgradeRequired` parameter; enhanced error handling in `ReplicaRebuildingSrcStart`.
pkg/spdk_test.go	Removed `TestSPDKEngineCreateWithUpgradeRequired`; updated parameters in `TestSPDKMultipleThread` and `TestSPDKMultipleThreadSnapshotOpsAndRebuilding`.
pkg/spdk/server.go	Updated `EngineCreate` method to remove `req.UpgradeRequired` parameter.
pkg/spdk/types.go	Updated constant `replicaTransportAckTimeout` from `14` to `12`.

Assessment against linked issues

Objective	Addressed	Explanation
Support live upgrade for control plane (#9104)	❌	The PR does not implement the requested feature.

Possibly related PRs

refactor: engine consolidates the replica related maps into a single one #208: This PR modifies the Engine struct in pkg/spdk/engine.go, adding a new field StandbyTargetPort, which directly relates to the addition of the same field in the main PR's Engine struct in pkg/api/types.go.
feat(auto-salvage): v2 support #222: This PR also updates the Engine struct in pkg/spdk/engine.go to include a new parameter salvageRequested in the Create method, which is relevant as it involves changes to the Engine struct's methods, similar to the updates made in the main PR.
refactor: refactor replica head and snap chain related implementation #234: This PR modifies the Create method in the Replica struct in pkg/spdk/replica.go, which is related to the overall management of the Engine and its interactions, particularly in how the Head field is managed, indirectly connecting to the changes made in the main PR regarding the Engine struct.
fix: decrease ctrlr_loss_timeout_sec for base bdev #247: This PR modifies the Engine struct in pkg/spdk/engine.go, adding new timeout fields, which may relate to the overall management and configuration of the Engine struct, similar to the changes made in the main PR.

Suggested reviewers

shuo-wu
c3y1huang
innobead
DamiaSan

📜 Recent review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 1f1a32b and 9ab4265.

📒 Files selected for processing (1)

pkg/spdk/types.go (1 hunks)

✅ Files skipped from review due to trivial changes (1)

pkg/spdk/types.go

coderabbitai

Actionable comments posted: 2

🧹 Outside diff range and nitpick comments (4)

pkg/api/types.go (1)
132-132: LGTM! Consider adding field documentation.

The new StandbyTargetPort field is well-positioned and follows the codebase conventions. Consider adding a comment to document its purpose in the context of live upgrades.
+	// StandbyTargetPort is used during live upgrades to maintain service availability
 	StandbyTargetPort int32                 `json:"standby_target_port"`
pkg/spdk/engine.go (3)
252-252: Possible misuse of variable in log message

At line 252, the log statement uses e.ReplicaModeMap to display the replicas being connected. However, e.ReplicaModeMap might not represent the list of replicas intended for logging.

Consider using replicaBdevList, which contains the list of replica block devices:
- e.log.Infof("Connecting all available replicas %+v, then launching raid during engine creation", e.ReplicaModeMap)
+ e.log.Infof("Connecting all available replicas %+v, then launching raid during engine creation", replicaBdevList)
This change will provide more accurate logging information about the replicas being connected.

685-688: Remove commented-out code to improve code clarity

The commented-out code at lines 685-688 is not used and can be removed to enhance readability and maintainability.

Apply this diff to remove the unused code:
- // podIP, err := commonnet.GetIPForPod()
- // if err != nil {
- //     return err
- // }
If this code is needed for future development, consider adding a comment explaining its purpose or moving it to a separate location.

Line range hint 1881-1881: Typographical error in function name closeRplicaClients

There is a typo in the function name closeRplicaClients. It should be closeReplicaClients for clarity and consistency.

Apply this diff to correct the function name:
- func (e *Engine) closeRplicaClients(replicaClients map[string]*client.SPDKClient) {
+ func (e *Engine) closeReplicaClients(replicaClients map[string]*client.SPDKClient) {
Make sure to update all references to this function accordingly.

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between bf16180 and 668e4d2.

⛔ Files ignored due to path filters (2)

go.mod is excluded by !go.mod
vendor/modules.txt is excluded by !vendor/**

📒 Files selected for processing (2)

pkg/api/types.go (2 hunks)
pkg/spdk/engine.go (15 hunks)

🧰 Additional context used

🪛 golangci-lint

pkg/spdk/engine.go

315-315: expected declaration, found '<<'

(typecheck)

🔇 Additional comments (5)

pkg/api/types.go (2)

153-153: LGTM! Field mapping is correct.

The StandbyTargetPort field is properly mapped from the protobuf struct to the Engine type.
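As a rough illustration of the mapping described above, the sketch below copies the new field from a protobuf-style message into the API type. The `protoEngine` struct is a hypothetical stand-in for the generated protobuf message, and the field set is trimmed to the port fields; only the `standby_target_port` JSON tag is taken from this PR.

```go
package main

import "fmt"

// protoEngine is a hypothetical stand-in for the generated protobuf message.
// The real message type is defined in the SPDK service protobuf package.
type protoEngine struct {
	Name              string
	Port              int32
	TargetPort        int32
	StandbyTargetPort int32
}

// Engine mirrors only the fields of the API type needed for this sketch.
type Engine struct {
	Name              string
	Port              int32
	TargetPort        int32
	StandbyTargetPort int32 `json:"standby_target_port"`
}

// protoEngineToEngine copies every port field, including the standby port,
// so that API consumers can observe a standby target during a live upgrade.
func protoEngineToEngine(e *protoEngine) *Engine {
	return &Engine{
		Name:              e.Name,
		Port:              e.Port,
		TargetPort:        e.TargetPort,
		StandbyTargetPort: e.StandbyTargetPort,
	}
}

func main() {
	e := protoEngineToEngine(&protoEngine{Name: "engine-a", Port: 20001, StandbyTargetPort: 20002})
	fmt.Printf("%+v\n", *e)
}
```

If the protobuf message lacked the field, the copy would fail to compile, which is one cheap way to confirm the definition exists.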

Line range hint 132-153: Verify protobuf definition exists.

Ensure that the standby_target_port field is properly defined in the SPDK service protobuf files.

pkg/spdk/engine.go (3)

45-45: New field StandbyTargetPort added to Engine struct

The addition of the StandbyTargetPort field to the Engine struct is appropriate. Ensure that this new field is properly initialized and consistently used throughout the codebase to avoid any unexpected behavior.

108-110: Function isNewEngine correctly determines if the engine is new

The isNewEngine function effectively checks if the IP field is empty to determine if the engine instance is new. This is a simple and efficient approach.
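A minimal sketch of the check described here, assuming the helper looks only at the `IP` field as the comment states. The `Engine` type below is a trimmed stand-in, and the actual helper in `pkg/spdk/engine.go` may consider additional fields.

```go
package main

import "fmt"

// Engine is a trimmed stand-in for the real struct in pkg/spdk/engine.go.
type Engine struct {
	Name string
	IP   string
}

// isNewEngine reports whether the engine has never been assigned an IP,
// which this sketch treats as "not created yet".
func (e *Engine) isNewEngine() bool {
	return e.IP == ""
}

func main() {
	fresh := &Engine{Name: "engine-a"}
	existing := &Engine{Name: "engine-b", IP: "10.0.0.1"}
	fmt.Println(fresh.isNewEngine(), existing.isNewEngine())
}
```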

618-618: Field StandbyTargetPort included in getWithoutLock

The StandbyTargetPort field is correctly added to the Engine object returned by the getWithoutLock method. This ensures that the field will be included in any serialized representations of the engine's state.

pkg/spdk/engine.go

coderabbitai

Caution

Inline review comments failed to post. This is likely due to GitHub's limits when posting large numbers of comments.

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (1)

pkg/spdk/engine.go (1)
Line range hint 2173-2224: Document target port management logic

The target switching logic correctly handles the port assignments, but would benefit from a comment explaining when and why StandbyTargetPort is reset to 0.

Add a comment before the condition:
+ // Reset StandbyTargetPort when switching to the pod's IP as the target,
+ // since this node is now the primary target
  if targetIP == podIP {
    e.TargetPort = targetPort
    e.StandbyTargetPort = 0
  }

🛑 Comments failed to post (1)

pkg/spdk/engine.go (1)

315-315: ⚠️ Potential issue

Critical: Resolve merge conflict

There is an unresolved merge conflict marker at line 315 (<<<<<<< HEAD). This needs to be resolved before the code can be merged.

Please resolve the merge conflict by:

Running git merge or git rebase to update your branch

Resolving the conflicts by choosing the appropriate code

Removing all conflict markers (<<<<<<<, =======, >>>>>>>)

🧰 Tools

🪛 golangci-lint

315-315: expected declaration, found '<<'

(typecheck)

coderabbitai

Caution

Inline review comments failed to post. This is likely due to GitHub's limits when posting large numbers of comments.

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (1)

pkg/spdk/engine.go (1)

Line range hint 4-4: Consider adding tests for live upgrade scenarios

While the implementation looks solid, consider adding comprehensive test coverage for:

Standby target creation and cleanup

Target switchover scenarios

Error cases during live upgrades

Would you like me to help generate test cases for these scenarios?

🛑 Comments failed to post (1)

pkg/spdk/engine.go (1)
216-218: ⚠️ Potential issue

Add error handling for BdevRaidGet

The empty branch after BdevRaidGet error check could lead to silent failures. Consider adding appropriate error handling.

Apply this diff to add error handling:
if targetCreationRequired {
    _, err := spdkClient.BdevRaidGet(e.Name, 0)
    if err != nil {
+       if !jsonrpc.IsJSONRPCRespErrorNoSuchDevice(err) {
+           return nil, errors.Wrapf(err, "failed to get raid bdev %s", e.Name)
+       }
    }
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.
	if targetCreationRequired {
		_, err := spdkClient.BdevRaidGet(e.Name, 0)
		if err != nil {
			if !jsonrpc.IsJSONRPCRespErrorNoSuchDevice(err) {
				return nil, errors.Wrapf(err, "failed to get raid bdev %s", e.Name)
			}
		}
🧰 Tools

🪛 golangci-lint

218-218: SA9003: empty branch

(staticcheck)

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (3)

pkg/spdk/engine.go (3)

176-202: Consider simplifying the initialization logic

The current implementation has nested conditions that could be simplified for better readability and maintainability. Consider extracting the logic into separate helper functions.

Example refactor:

-	if podIP == initiatorIP && podIP == targetIP {
-		if e.Port == 0 && e.TargetPort == 0 {
-			e.log.Info("Creating both initiator and target instances")
-			initiatorCreationRequired = true
-			targetCreationRequired = true
-		} else if e.Port != 0 && e.TargetPort == 0 {
-			e.log.Info("Creating a target instance")
-			targetCreationRequired = true
-			if e.StandbyTargetPort != 0 {
-				e.log.Warnf("Standby target instance with port %v is already created, will skip the target creation", e.StandbyTargetPort)
-				return e.getWithoutLock(), nil
-			}
-		} else {
-			return nil, fmt.Errorf("invalid initiator and target address for engine %s creation", e.Name)
-		}
+   creationMode := determineCreationMode(podIP, initiatorIP, targetIP, e.Port, e.TargetPort)
+   switch creationMode {
+   case createBoth:
+       e.log.Info("Creating both initiator and target instances")
+       initiatorCreationRequired = true
+       targetCreationRequired = true
+   case createTargetOnly:
+       e.log.Info("Creating a target instance")
+       targetCreationRequired = true
+       if e.StandbyTargetPort != 0 {
+           e.log.Warnf("Standby target instance with port %v is already created, will skip the target creation", e.StandbyTargetPort)
+           return e.getWithoutLock(), nil
+       }
+   case createInitiatorOnly:
+       e.log.Info("Creating an initiator instance")
+       initiatorCreationRequired = true
+   default:
+       return nil, fmt.Errorf("invalid initiator and target address for engine %s creation", e.Name)
+   }
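
The suggested refactor calls `determineCreationMode` without defining it. One possible shape is sketched below. The function, the `creationMode` type, and its constants are hypothetical, and the two branches where only one address matches the pod IP are an assumption, since the removed lines shown above cover only the case where both addresses match.

```go
package main

import "fmt"

// creationMode is a hypothetical enum naming which instances must be created.
type creationMode int

const (
	createInvalid creationMode = iota
	createBoth
	createTargetOnly
	createInitiatorOnly
)

// determineCreationMode is a pure function over the addresses and ports, so
// the branching can be unit tested without an SPDK client.
func determineCreationMode(podIP, initiatorIP, targetIP string, port, targetPort int32) creationMode {
	switch {
	case podIP == initiatorIP && podIP == targetIP:
		switch {
		case port == 0 && targetPort == 0:
			return createBoth
		case port != 0 && targetPort == 0:
			return createTargetOnly
		default:
			return createInvalid
		}
	case podIP == initiatorIP:
		// Assumption: the target lives on another node.
		return createInitiatorOnly
	case podIP == targetIP:
		// Assumption: the initiator lives on another node.
		return createTargetOnly
	default:
		return createInvalid
	}
}

func main() {
	pod := "10.0.0.1"
	fmt.Println(determineCreationMode(pod, pod, pod, 0, 0) == createBoth)
	fmt.Println(determineCreationMode(pod, pod, pod, 20001, 0) == createTargetOnly)
	fmt.Println(determineCreationMode(pod, "10.0.0.2", "10.0.0.3", 0, 0) == createInvalid)
}
```

The standby-target early return from the original code is left to the caller here, because it depends on engine state beyond the ports passed in.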

397-399: Add documentation for standby target creation condition

The condition for standby target creation could benefit from a comment explaining when and why it's needed.

+   // Create a standby target if we have an active initiator (e.Port != 0)
+   // but no active target (e.TargetPort == 0)
    standbyTargetCreationRequired := false
    if e.Port != 0 && e.TargetPort == 0 {
        standbyTargetCreationRequired = true
    }

2169-2173: Enhance error handling for pod IP operations

The pod IP retrieval and port updates are critical for live upgrades. Consider adding more detailed error messages and logging.

    podIP, err := commonnet.GetIPForPod()
    if err != nil {
-       return err
+       return errors.Wrapf(err, "failed to get pod IP for engine %s target switchover", e.Name)
    }

    if targetIP == podIP {
+       e.log.Infof("Target IP matches pod IP, updating ports: target=%d, standby=0", targetPort)
        e.TargetPort = targetPort
        e.StandbyTargetPort = 0
    }

Also applies to: 2216-2220

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between b15f126 and 5b4f847.

📒 Files selected for processing (2)

pkg/api/types.go (2 hunks)
pkg/spdk/engine.go (14 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

pkg/api/types.go

🔇 Additional comments (5)

pkg/spdk/engine.go (5)

45-45: LGTM: Added StandbyTargetPort field for standby target management

The new field is well-integrated with the existing port fields and supports the live upgrade functionality.

108-110: LGTM: Clear and focused helper function

The function provides a clean way to determine if an engine is new based on its IP state.

204-207: LGTM: Clear initialization of engine IPs

The initialization logic correctly sets the IP values for new engines using the helper function.

476-485: LGTM: Clear port assignment logic

The code correctly handles port assignment for both initiator and target instances, including the standby target scenario.

Line range hint 2169-2220: Verify concurrent access during target switchover

The target switchover process involves multiple steps and state changes. Ensure there are no race conditions when:

Multiple goroutines attempt switchover simultaneously
A switchover occurs during engine creation/deletion
Pod IP changes during switchover

✅ Verification successful

Let me generate additional verification scripts to check for proper locking mechanisms and concurrent access patterns.

Based on the verification results, I can now generate the final response:

Proper locking mechanisms are in place during target switchover

The code correctly handles concurrent access during target switchover:

SwitchOverTarget method acquires the engine lock before state modifications
Related operations like Create, Delete, and DeleteTarget also use proper locking
State changes (TargetPort, StandbyTargetPort) are protected by the mutex

The original concerns about race conditions are addressed by the existing implementation.
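
The locking pattern described above can be sketched roughly as follows. This is not the actual `SwitchOverTarget` method: the embedded `sync.RWMutex`, the lowercase `switchOverTarget` name, and the trimmed field set are assumptions, and the NVMe initiator handling is omitted entirely. Only the port update under `targetIP == podIP` follows the snippet quoted earlier in this thread.

```go
package main

import (
	"fmt"
	"sync"
)

// Engine is a trimmed stand-in; embedding the mutex provides e.Lock().
type Engine struct {
	sync.RWMutex

	TargetPort        int32
	StandbyTargetPort int32
}

// switchOverTarget updates the port fields while holding the engine lock, so
// concurrent callers never observe a half-applied switchover.
func (e *Engine) switchOverTarget(podIP, targetIP string, targetPort int32) {
	e.Lock()
	defer e.Unlock()

	if targetIP == podIP {
		// This pod now hosts the active target, so the standby is promoted.
		e.TargetPort = targetPort
		e.StandbyTargetPort = 0
	}
}

func main() {
	e := &Engine{StandbyTargetPort: 20002}

	var wg sync.WaitGroup
	for i := 0; i < 8; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			e.switchOverTarget("10.0.0.1", "10.0.0.1", 20002)
		}()
	}
	wg.Wait()

	fmt.Println(e.TargetPort, e.StandbyTargetPort) // prints "20002 0"
}
```

Running a sketch like this with `go run -race` is a quick way to check that the state changes are actually serialized.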

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check for potential concurrent access patterns
ast-grep --pattern $'func \(.*\) SwitchOverTarget\(.*\) {
  $$$
  e.Lock()
  $$$
}'

# Check for other methods that might interact with target state
rg -A 5 "e\.(TargetPort|StandbyTargetPort)"

Length of output: 6650

Script:

#!/bin/bash
# Check for all methods that modify engine state and their locking patterns
ast-grep --pattern $'func \(e \*Engine\) $_($$) {
  $$$
  e.Lock()
  $$$
}'

# Check for methods that might be called during switchover
rg -A 5 "func.*Engine.*\b(Create|Delete|Switch|Connect|Disconnect|Reload)" pkg/spdk/

Length of output: 1794

codecov · 2024-11-18T00:41:42Z

Codecov Report

Attention: Patch coverage is 18.18182% with 189 lines in your changes missing coverage. Please review.

Project coverage is 8.87%. Comparing base (537b20a) to head (9ab4265).
Report is 1 commit behind head on main.

Files with missing lines	Patch %	Lines
pkg/spdk/engine.go	17.54%	186 Missing and 2 partials ⚠️
pkg/spdk/server.go	0.00%	1 Missing ⚠️

Additional details and impacted files

@@           Coverage Diff            @@
##            main    #241      +/-   ##
========================================
+ Coverage   8.42%   8.87%   +0.44%     
========================================
  Files         21      21              
  Lines       7158    7225      +67     
========================================
+ Hits         603     641      +38     
- Misses      6475    6501      +26     
- Partials      80      83       +3
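
The project coverage figures can be reproduced from the hit and line counts in the table. The short sketch below assumes coverage is hits divided by total tracked lines, which matches the numbers reported here; the `coveragePercent` helper is illustrative and not part of Codecov.

```go
package main

import "fmt"

// coveragePercent returns hits as a percentage of all tracked lines.
func coveragePercent(hits, lines int) float64 {
	if lines == 0 {
		return 0
	}
	return 100 * float64(hits) / float64(lines)
}

func main() {
	fmt.Printf("base: %.2f%%\n", coveragePercent(603, 7158)) // prints "base: 8.42%"
	fmt.Printf("head: %.2f%%\n", coveragePercent(641, 7225)) // prints "head: 8.87%"
}
```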

Flag	Coverage Δ
unittests	`8.87% <18.18%> (+0.44%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

coderabbitai

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (4)

pkg/spdk/engine_test.go (1)
98-115: Improve test execution logic and error handling.

The test execution can be improved in several ways:

Use test case name in log message instead of array index:
-c.Logf("testing TestCheckInitiatorAndTargetCreationRequirements.%v", testName)
+c.Logf("testing TestCheckInitiatorAndTargetCreationRequirements: %s", testCase.name)
Consider extracting engine creation to a helper function for better reusability:
func createTestEngine(port, targetPort, standbyTargetPort int32, name string) *Engine {
    return &Engine{
        Port:              port,
        TargetPort:        targetPort,
        StandbyTargetPort: standbyTargetPort,
        Name:              name,
        log:               logrus.New(),
    }
}
Add validation of log messages to ensure proper error logging.
pkg/spdk/engine.go (3)
Line range hint 398-498: Consider enhancing error handling in deferred function

The deferred function at line 429 contains complex logic for initiator assignment. Consider extracting this into a separate helper function for better maintainability and error handling.

Consider refactoring like this:
+ func (e *Engine) assignInitiator(initiator *nvme.Initiator, dmDeviceBusy bool, standbyTargetCreationRequired bool) {
+     if !standbyTargetCreationRequired {
+         e.initiator = initiator
+         e.dmDeviceBusy = dmDeviceBusy
+         e.Endpoint = initiator.GetEndpoint()
+         e.log = e.log.WithFields(logrus.Fields{
+             "endpoint":   e.Endpoint,
+             "port":      e.Port,
+             "targetPort": e.TargetPort,
+         })
+     }
+     e.log.Infof("Finished handling frontend for engine: %+v", e)
+ }

  defer func() {
      if err == nil {
-         if !standbyTargetCreationRequired {
-             e.initiator = initiator
-             e.dmDeviceBusy = dmDeviceBusy
-             ...
-         }
-         e.log.Infof("Finished handling frontend for engine: %+v", e)
+         e.assignInitiator(initiator, dmDeviceBusy, standbyTargetCreationRequired)
      }
  }()
208-214: Consider standardizing error handling patterns

While error handling is generally good, consider standardizing the approach across the codebase. For example, some errors are wrapped with context while others are returned directly.

Consider consistently using error wrapping with context:
- if err != nil {
-     return nil, err
- }
+ if err != nil {
+     return nil, errors.Wrapf(err, "failed to get pod IP during engine creation")
+ }
Also applies to: 2181-2184

261-262: Address TODO comment about logging

The TODO comment indicates that log messages need improvement. Consider enhancing the log message to include more context about the operation being performed.

Consider improving the log message:
- // TODO: improve the log message
- e.log.Infof("Connecting all available replicas %+v, then launching raid during engine creation", e.ReplicaStatusMap)
+ e.log.Infof("Connecting available replicas for engine %s (replicas: %+v) and initializing RAID configuration", e.Name, e.ReplicaStatusMap)

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 0aae707 and 3c6c4a3.

📒 Files selected for processing (3)

pkg/api/types.go (2 hunks)
pkg/spdk/engine.go (14 hunks)
pkg/spdk/engine_test.go (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

pkg/api/types.go

🔇 Additional comments (7)

pkg/spdk/engine_test.go (2)

1-10: LGTM!

The imports are clean and necessary. The test file is properly structured using the check.v1 testing framework.

1-115: Verify test coverage for live upgrade functionality.

The test file introduces basic validation for the standby target port, which is a step towards supporting live upgrades. However, to ensure robust support for engine live upgrades (as per PR objectives), consider adding more comprehensive test scenarios that validate:

The complete upgrade workflow
State transitions during upgrade
Error recovery scenarios
Upgrade rollback cases

✅ Verification successful

Based on the code inspection results, I can now generate the final response:

Test coverage appears sufficient for the current implementation scope

The test coverage aligns well with the current implementation, which focuses on the standby port allocation and basic validation aspects of the upgrade functionality. The test cases appropriately verify:

Standby target port validation
Port allocation scenarios
Creation requirements for different network configurations

While the initial review suggested adding more test scenarios, the current implementation scope is primarily focused on the port management aspect of upgrades, which is adequately tested. The suggested additional test scenarios would be premature as the complete upgrade workflow implementation is not yet present in the codebase.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check test coverage for upgrade-related code

# Look for other upgrade-related tests
rg -l "upgrade|standby" --type go

# Check for upgrade-related functions in the engine implementation
ast-grep --pattern 'func $_($$) {
  $$$
  standby
  $$$
}'

Length of output: 633

Script:

#!/bin/bash
# Let's examine the engine implementation and related test files more closely

# Check engine.go for standby-related functions and their implementation
rg "standby|Standby" pkg/spdk/engine.go -B 2 -A 2

# Look for upgrade-related functions in engine.go
rg "upgrade|Upgrade" pkg/spdk/engine.go -B 2 -A 2

# Check the API types related to upgrade
rg "upgrade|Upgrade" pkg/api/types.go -B 2 -A 2

Length of output: 2242

pkg/spdk/engine.go (5)

37-49: LGTM: Engine struct changes

The addition of StandbyTargetPort field is well-placed and consistent with other port-related fields in the struct.

108-110: LGTM: Clear and focused helper function

The isNewEngine function provides a clean abstraction for checking engine initialization state.

Line range hint 2181-2232: LGTM: Well-structured target switchover logic

The changes properly handle the standby target port during switchover operations, with appropriate error handling and state management.

112-142: ⚠️ Potential issue

Fix redundant condition check

The function contains a redundant check for e.StandbyTargetPort != 0 at line 123, which is unnecessary since it's already within a block that checks for e.Port != 0 && e.TargetPort == 0.

Apply this diff to fix the redundant condition:

if e.Port != 0 && e.TargetPort == 0 {
    e.log.Info("Creating a target instance")
-   if e.StandbyTargetPort != 0 {
        e.log.Warnf("Standby target instance with port %v is already created, will skip the target creation", e.StandbyTargetPort)
-   } else {
        targetCreationRequired = true
-   }
}

Likely invalid or redundant comment.

112-142: Ensure test coverage for new functionality

The new checkInitiatorAndTargetCreationRequirements function contains complex logic for standby target handling. Please ensure adequate test coverage is added.

pkg/spdk/engine_test.go

coderabbitai

Actionable comments posted: 1

🧹 Outside diff range and nitpick comments (1)

pkg/spdk_test.go (1)

Line range hint 225-322: Document the new upgrade mechanism architecture

The changes suggest a significant shift in how engine upgrades are handled, moving from a parameter-based approach to potentially a more sophisticated mechanism. To ensure maintainability and clarity:

Consider adding documentation that explains the new upgrade mechanism

Update or create integration tests that specifically validate the live upgrade scenarios

Consider adding a migration guide for users who might be affected by this architectural change

Would you like assistance in creating:

Documentation template for the new upgrade mechanism?

Integration test scenarios for live upgrades?

Migration guide for existing users?

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 3c6c4a3 and 92bcd2f.

📒 Files selected for processing (2)

pkg/client/client.go (1 hunks)
pkg/spdk_test.go (2 hunks)

🔇 Additional comments (3)

pkg/spdk_test.go (2)

225-225: Verify the removal of upgradeRequired parameter

The EngineCreate call has been updated to remove the upgradeRequired parameter. This change should be verified against the engine creation logic to ensure it aligns with the new live upgrade support feature.

#!/bin/bash
# Description: Verify the changes in EngineCreate signature across the codebase

# Check for any remaining references to upgradeRequired in engine creation
rg -l "EngineCreate.*upgradeRequired" 

# Check the updated EngineCreate method signature
ast-grep --pattern 'func $_EngineCreate($_) {
  $$$
}'

250-322: ⚠️ Potential issue

Clarify the status of the commented test implementation

The TestSPDKEngineCreateWithUpgradeRequired test appears to contain important validations for the engine upgrade functionality. If this test is being replaced or modified as part of the live upgrade feature:

Please clarify if this test will be replaced with new test cases
Ensure the new test coverage includes validation of both initiator and target properties
Consider preserving the valuable assertions in any new test implementation

pkg/client/client.go (1)

Line range hint 474-492: LGTM: Method implementation remains consistent

The implementation maintains proper parameter validation, error handling, and gRPC timeout management while removing the upgrade-related parameter.

pkg/client/client.go

coderabbitai

Actionable comments posted: 0

🧹 Outside diff range and nitpick comments (1)

pkg/spdk/engine.go (1)
112-142: Improve error messages and add test coverage

The function has clear logic but could benefit from:

More descriptive error messages that explain why the addresses are invalid

Unit tests to verify the complex branching logic

Consider improving the error messages:
-err = fmt.Errorf("invalid initiator and target address for engine %s creation", e.Name)
+err = fmt.Errorf("invalid initiator address %s and target address %s for engine %s creation: addresses must match pod IP %s", initiatorIP, targetIP, e.Name, podIP)
🧰 Tools

🪛 GitHub Check: codecov/patch

[warning] 128-130: pkg/spdk/engine.go#L128-L130
Added lines #L128 - L130 were not covered by tests

📜 Review details

Configuration used: .coderabbit.yaml
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 92bcd2f and 2adf734.

📒 Files selected for processing (1)

pkg/spdk/engine.go (14 hunks)

🧰 Additional context used

🪛 GitHub Check: codecov/patch

pkg/spdk/engine.go

[warning] 108-109: pkg/spdk/engine.go#L108-L109
Added lines #L108 - L109 were not covered by tests

[warning] 128-130: pkg/spdk/engine.go#L128-L130
Added lines #L128 - L130 were not covered by tests

[warning] 208-214: pkg/spdk/engine.go#L208-L214
Added lines #L208 - L214 were not covered by tests

[warning] 216-220: pkg/spdk/engine.go#L216-L220
Added lines #L216 - L220 were not covered by tests

[warning] 223-226: pkg/spdk/engine.go#L223-L226
Added lines #L223 - L226 were not covered by tests

[warning] 228-231: pkg/spdk/engine.go#L228-L231
Added lines #L228 - L231 were not covered by tests

[warning] 263-264: pkg/spdk/engine.go#L263-L264
Added lines #L263 - L264 were not covered by tests

[warning] 269-269: pkg/spdk/engine.go#L269
Added line #L269 was not covered by tests

[warning] 271-271: pkg/spdk/engine.go#L271
Added line #L271 was not covered by tests

[warning] 282-285: pkg/spdk/engine.go#L282-L285
Added lines #L282 - L285 were not covered by tests

[warning] 292-292: pkg/spdk/engine.go#L292
Added line #L292 was not covered by tests

[warning] 302-303: pkg/spdk/engine.go#L302-L303
Added lines #L302 - L303 were not covered by tests

[warning] 306-315: pkg/spdk/engine.go#L306-L315
Added lines #L306 - L315 were not covered by tests

[warning] 401-401: pkg/spdk/engine.go#L401
Added line #L401 was not covered by tests

[warning] 411-413: pkg/spdk/engine.go#L411-L413
Added lines #L411 - L413 were not covered by tests

[warning] 422-429: pkg/spdk/engine.go#L422-L429
Added lines #L422 - L429 were not covered by tests

[warning] 431-442: pkg/spdk/engine.go#L431-L442
Added lines #L431 - L442 were not covered by tests

[warning] 444-444: pkg/spdk/engine.go#L444
Added line #L444 was not covered by tests

[warning] 448-462: pkg/spdk/engine.go#L448-L462
Added lines #L448 - L462 were not covered by tests

[warning] 464-466: pkg/spdk/engine.go#L464-L466
Added lines #L464 - L466 were not covered by tests

[warning] 468-468: pkg/spdk/engine.go#L468
Added line #L468 was not covered by tests

[warning] 471-473: pkg/spdk/engine.go#L471-L473
Added lines #L471 - L473 were not covered by tests

[warning] 476-476: pkg/spdk/engine.go#L476
Added line #L476 was not covered by tests

[warning] 479-482: pkg/spdk/engine.go#L479-L482
Added lines #L479 - L482 were not covered by tests

[warning] 484-495: pkg/spdk/engine.go#L484-L495
Added lines #L484 - L495 were not covered by tests

[warning] 501-502: pkg/spdk/engine.go#L501-L502
Added lines #L501 - L502 were not covered by tests

[warning] 510-511: pkg/spdk/engine.go#L510-L511
Added lines #L510 - L511 were not covered by tests

[warning] 515-515: pkg/spdk/engine.go#L515
Added line #L515 was not covered by tests

[warning] 517-519: pkg/spdk/engine.go#L517-L519
Added lines #L517 - L519 were not covered by tests

[warning] 628-628: pkg/spdk/engine.go#L628
Added line #L628 was not covered by tests

🔇 Additional comments (3)

pkg/spdk/engine.go (3)

Line range hint 2183-2234: LGTM: Robust target switchover implementation

The target switchover implementation:

Properly validates pod IP
Handles standby target port appropriately for live upgrades
Includes fallback logic for error cases

The error handling and state management look solid.

Line range hint 400-520: Add test coverage for critical paths

The live upgrade functionality includes critical paths that should be tested:

Engine creation with standby target
Target switchover scenarios
Error handling and recovery paths

Consider adding integration tests to verify the end-to-end functionality.

#!/bin/bash
# Check existing test coverage
rg -l "func Test.*Target"

🧰 Tools