Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Restarted sessions are broken #3283

Open
achimnol opened this issue Dec 22, 2024 · 0 comments
Open

Restarted sessions are broken #3283

achimnol opened this issue Dec 22, 2024 · 0 comments
Labels
comp:agent Related to Agent component urgency:4 As soon as feasible, implementation is essential.
Milestone

Comments

@achimnol
Copy link
Member

achimnol commented Dec 22, 2024

Once a session is restarted, Backend.AI performs the followings:

  • In the Agent:
    • Destroys the container.
    • Recreates the container reusing the prior allocation map and kernel resource spec.
  • In the Manager:
    • Just updates the kernel/session status (RUNNING → RESTARTING → RUNNING).

Though, the restarted session is broken:

Due to these issues, we need to restart the model service process in #3282, instead of cleanly restarting the container.

For future work, we may need to consider:

@achimnol achimnol added the comp:agent Related to Agent component label Dec 22, 2024
@achimnol achimnol added this to the 24.09 milestone Dec 22, 2024
@achimnol achimnol added the urgency:4 As soon as feasible, implementation is essential. label Dec 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
comp:agent Related to Agent component urgency:4 As soon as feasible, implementation is essential.
Projects
None yet
Development

No branches or pull requests

1 participant