-
-
Notifications
You must be signed in to change notification settings - Fork 30.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Matter server "freezes" every 3-4 days with >10% ram usage and constant >25% CPU usage #124647
Comments
Hey there @home-assistant/matter, mind taking a look at this issue as it has been labeled with an integration ( Code owner commandsCode owners of
(message by CodeOwnersMention) matter documentation |
also happens in core-2024.8.3 |
This issue is most possibly related to #123835 We are investigating the "devices get unavailable" issue but the cpu/ram usage is new to me, could be some side effect from the actual issues. You have a large amount of devices so that is maybe why others are not seeing that. |
I saw similar issues for other devices and integrations when I was running HAG. After tests and trials I moved my HA installation to Raspberry Pi 5 8 GB & 256 GB NVMe and I realised that issues were related to weak HAG because are gone now. HAG is good if you want to play with few devices but if you want to manage more and/or in different integrations, you should consider to move to powerful hardware. |
@marcelveldt -- in the meantime I had scheduled my matter server to restart every 3 days... Since I also follow #124503 and do observe the same behavior described over there, I just updated to matter server 6.5.0b2. I will turn off the automatic server restart now, and report back if one of the changes in b2 also positively affects my issue. |
@marcelveldt -- First observations:
Let's see what happens during the course of the next 3-4 days. |
Thanks for testing, yeah with your amount of devices. About 10% memory usage on a HA Green sounds about right. There is probably some optimization to do for us but on the other hand its not that crazy considering the fact you have 90 devices for which we keep all attributes and subscriptions in memory. So if cpu stays stable/minimal and memory doesn't rise over time, we can consider the issue fixed. BTW we got a lot of good response on the 6.5.0b2 so we just promoted it to stable |
Hi, @marcelveldt I have since updated to 6.5.0 and 6.5.1 so the server didn't even have the chance to run 4 days is a row. Still,
So this is already good progress. I saw a few devices losing connection though (being offline for about a day), that could still be pinged from HA then, and consequently/promptly got back online. So, I wonder if it has to do with this change home-assistant-libs/python-matter-server#882 which happened in 6.5.1. (fyi, @agners ) |
To me, your original issue is resolved. That CPU and memory consumption is perfectly fine for your amount of devices on the HA Green. Great! As for the availability issue; If you also have Apple border routers, track progress here; Otherwise create a new issue report for the new issue so we can close/finalize this one. |
Yes, as said on my side it should be able to survive 5-6 days, 4 days was usually the threshold for when the Matter server itself just lot connection to all devices at once.
Thanks for the heads-up. As per the description I have
So I should have a look into the logs and open a new issue. |
Follow-up here: #126136 |
The problem
In my setup on Home-Assistant Green with about 90 matter over thread devices (mostly nanoleaf and eve) after about 4 days the matter server somewhat freezes (entities being shown as offline, although they are online/usable in other fabrics) with a constant display of >25% of CPU usage and >10% of ram usage.
When running healthy the CPU usage will mostly show <1%. During that time ram usage increases constantly from 3% to >10% to the point when all matter devices all of a sudden show as offline.
See logs below
What version of Home Assistant Core has the issue?
core-2024.8.2
What was the last working version of Home Assistant Core?
No response
What type of installation are you running?
Home Assistant OS
Integration causing the issue
matter
Link to integration documentation on our website
No response
Diagnostics information
core_matter_server_2024-08-26T15-21-30.760Z.log
Example YAML snippet
No response
Anything in the logs that might be useful for us?
Observing an extremely long re-subscription back-off, e.g.
Additional information
6 BRs
The text was updated successfully, but these errors were encountered: