process going to uninterruptible sleep #159
Thank you for reporting the problem. It's been seen before (see #86 and
#131), but I've not been able to get a clear handle on what's going on.
What you've added is that the problem occurs with the amdgpu driver, not
with the driver for the Nvidia card. (Were you using Nvidia's
proprietary driver or nouveau?)
The dmesg output reported in #86 shows that the hang occurs in the drm
layer shared by the amdgpu and i915 drivers, which is consistent with
your report that the problem did not occur with the Nvidia card.
Are you executing ddcutil instances in parallel? Are you running them
in rapid succession in a script? If the latter, you might try inserting
a short sleep between the ddcutil invocations. Do you see a call trace
or other evidence of driver errors in dmesg? Does the problem occur only
with one particular ddcutil subcommand (e.g. setvcp, detect) or with
several?
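The suggestion to space out invocations can be sketched as follows. This is only an illustration, not part of ddcutil: the helper name `run_sequentially`, the 0.5 second default pause, and the `runner`/`sleeper` parameters are assumptions made so the sketch can be exercised without touching real hardware.

```python
import subprocess
import time


def run_sequentially(commands, pause_s=0.5, runner=subprocess.run, sleeper=time.sleep):
    """Run each command to completion, pausing between consecutive invocations.

    commands: list of argv lists, e.g. [["ddcutil", "detect"]].
    pause_s:  seconds to wait between commands (hypothetical default).
    runner/sleeper: injectable so the sequencing can be tested without hardware.
    """
    results = []
    for index, argv in enumerate(commands):
        if index > 0:
            # Give the driver a moment before the next I2C transaction.
            sleeper(pause_s)
        # subprocess.run blocks until the command exits, so nothing runs in parallel.
        results.append(runner(argv))
    return results
```

For example, `run_sequentially([["ddcutil", "detect"], ["ddcutil", "getvcp", "10"]])` would run the two commands one after the other with a pause in between. If a command hangs in uninterruptible sleep, this wrapper hangs with it; it only rules out rapid-fire or overlapping invocations as the trigger.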
Executing ddcutil with options "--trace i2c --tid --ts" should identify
the system call in ddcutil where the hang occurs.
You can try using the "--timeout-i2c-io" option, though the prior
reports suggest this does not address the problem.
If possible, please use a copy of ddcutil built from branch 1.0.0-dev,
as it has improved tracing.
Finally, please submit any non-trivial command output as an attachment,
gist, pastebin etc. Including voluminous output inline in a message
makes an issue hard to follow.
…On 12/16/20 7:12 AM, Tymoteusz Dolega wrote:
After I changed from an Nvidia GTX1070 to an AMD RX6800, ddcutil no longer
works. It hangs in "D" state, so it can't be killed even with |kill
-9|, and it uses 100% of one core while locked indefinitely. Even
shutting down the system takes 3 minutes because the system has a hard
time killing ddcutil. I use Arch Linux with kernel 5.9.14 and also
tried 5.10 from git. I can provide any information that would help
with further investigation.
|
With Nvidia I was using the proprietary driver. With AMD I'm using the open source one. I'm running only one command at a time, but it doesn't matter which command it is.
When watching it scroll, I can see that the number keeps growing over time.
I'm running ddcutil-git from AUR (https://aur.archlinux.org/packages/ddcutil-git/), but for some reason it reports version 0.9.9.
Interestingly, this bug occurs even when I'm not using "sudo", as I showed above. IIRC it shouldn't do anything as a regular user.
After running
... |
I've been puzzling over how to proceed. If the hang were happening on a machine in front of me, there are lots of little things I would check, but I don't want to send you off on lots of goose chases. My best guess is still that there's a bug in amdgpu. But it would be arrogant to dismiss the possibility of a bug in ddcutil. That said, here are some observations and things to try.
|
Hi, I have the same issue on an AMD RX 6800 with the amdgpu driver, on a fresh Arch install, both with ddcutil 0.9.9 and 1.0.0-dev. To answer your questions:
I don't know how to use i2cdetect, but:
It seems that there is the same issue with the get-edid command.
I'm not familiar with this tool, but it's used by Clight.
Edit: I just noticed something strange. When my last two commands were blocked (get-edid and ddcutil --trace i2c --tid --ts environment), I launched
There is also a bug filed against the amdgpu drm project for this: https://gitlab.freedesktop.org/drm/amd/-/issues/1314 |
On 1/19/21 3:15 PM, Thomas DU BOYS wrote:
Hi, I have the same issue on an AMD RX 6800 with the amdgpu driver, on a
fresh Arch install, both with ddcutil 0.9.9 and 1.0.0-dev.
To answer your questions:
|$ cat /sys/module/amdgpu/parameters/dc
-1
$ cat /sys/module/amdgpu/parameters/hw_i2c
0 |
I don't know how to use i2cdetect, but:
|$ i2cdetect -l
i2c-3   i2c     AMDGPU SMU                            I2C adapter
i2c-10  i2c     AMDGPU DM aux hw bus 2                I2C adapter
i2c-1   smbus   SMBus PIIX4 adapter port 2 at 0b00    SMBus adapter
i2c-8   i2c     AMDGPU DM aux hw bus 0                I2C adapter
i2c-6   i2c     AMDGPU DM i2c hw bus 2                I2C adapter
i2c-4   i2c     AMDGPU DM i2c hw bus 0                I2C adapter
i2c-2   smbus   SMBus PIIX4 adapter port 1 at 0b20    SMBus adapter
i2c-0   smbus   SMBus PIIX4 adapter port 0 at 0b00    SMBus adapter
i2c-9   i2c     AMDGPU DM aux hw bus 1                I2C adapter
i2c-7   i2c     AMDGPU DM i2c hw bus 3                I2C adapter
i2c-5   i2c     AMDGPU DM i2c hw bus 1                I2C adapter|
"i2cdetect N", where N is the i2c bus number. See "i2cdetect --help"
or "man i2cdetect" for details.
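To pick a bus number N from a listing like the one quoted above, the output of "i2cdetect -l" can be split into fields. The sketch below assumes the four columns are tab-separated (bus, type, name, algorithm), which is how i2c-tools is believed to print them; the function name `parse_i2cdetect_list` and the dictionary keys are made up for illustration.

```python
def parse_i2cdetect_list(text):
    """Parse `i2cdetect -l` output into a list of dicts sorted by bus number.

    Assumption: columns are separated by tab characters and padded with spaces.
    Lines that do not look like an adapter entry are skipped.
    """
    adapters = []
    for line in text.splitlines():
        fields = [field.strip() for field in line.split("\t")]
        if len(fields) != 4 or not fields[0].startswith("i2c-"):
            continue
        bus_text = fields[0][len("i2c-"):]
        if not bus_text.isdigit():
            continue
        adapters.append(
            {
                "bus": int(bus_text),
                "type": fields[1],
                "name": fields[2],
                "algorithm": fields[3],
            }
        )
    return sorted(adapters, key=lambda adapter: adapter["bus"])
```

The "bus" value of each entry is the N to pass to "i2cdetect N" or "get-edid -b N".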
|$ get-edid
This is read-edid version 3.0.2. Prepare for some fun.
Attempting to use i2c interface
No EDID on bus 0
No EDID on bus 1
No EDID on bus 2
(infinite sleep, unable to kill…) |
"get-edid -b N", where N is the I2c bus number, will output the EDID in
binary form. Again, see "get-edid --help" or "man get-edid". Piping
the output to parse-edid will interpret the EDID.
… It seems that there is the same issue with the |get-edid| command.
I don't understand the other ddcutil commands. I'm not familiar with this
tool, but it's used by Clight.
|
While using kernel 5.10.32-1 (Arch linux-lts) the issue is still there, but on the newest 5.11.16 I have no problem with ddcutil at all, so I guess it was fixed somewhere between these versions. |
Did you upgrade to ddcutil 1.1.0 along with the kernel update? Release 1.1.0 contains a fix to keep ddcutil from probing a /dev/i2c device if its name is AMDGPU SMU, which is used to report monitor state. The fix was based on the report in issue #194, which also seems to describe your configuration (RX600, amdgpu). |
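The idea of skipping a bus by its name can be illustrated outside ddcutil. This is a rough sketch, not ddcutil's actual implementation: it assumes each adapter exposes its name in /sys/bus/i2c/devices/i2c-N/name, and the function name `probeable_buses` and the prefix match are choices made for the example.

```python
import os
import re


def probeable_buses(sysfs_root="/sys/bus/i2c/devices", skip_prefixes=("AMDGPU SMU",)):
    """Return (bus_number, name) pairs for I2C buses whose name is not skipped.

    Assumption: adapter names are readable from <sysfs_root>/i2c-N/name.
    Prefix matching is an illustrative choice, not ddcutil's documented rule.
    """
    buses = []
    for entry in os.listdir(sysfs_root):
        match = re.fullmatch(r"i2c-(\d+)", entry)
        if not match:
            continue  # client devices such as "3-0050" are not buses
        try:
            with open(os.path.join(sysfs_root, entry, "name")) as handle:
                name = handle.read().strip()
        except OSError:
            continue  # no readable name: leave the bus alone
        if name.startswith(tuple(skip_prefixes)):
            continue  # e.g. the SMU adapter, never opened or probed
        buses.append((int(match.group(1)), name))
    return sorted(buses)
```

Because only sysfs text files are read, this check itself never issues an I2C transaction on the skipped adapter.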
After I changed from an Nvidia GTX1070 to an AMD RX6800, ddcutil no longer works. It hangs in "D" state, so it can't be killed even with kill -9, and it uses 100% of one core while locked indefinitely. Even shutting down the system takes 3 minutes because the system has a hard time killing ddcutil. I use Arch Linux with kernel 5.9.14 and also tried 5.10 from git. I can provide any information that would help with further investigation.
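For anyone trying to confirm the "D" state described in this report, the state letter can be read from /proc/PID/stat on Linux. The sketch below is only an illustration; the helper names `process_state` and `read_process_state` are invented here, and the parsing relies on the state being the first field after the closing parenthesis of the command name.

```python
def process_state(stat_text):
    """Extract the one-letter process state from the text of /proc/<pid>/stat.

    The command name is wrapped in parentheses and may itself contain spaces
    or parentheses, so split on the LAST ')' before reading the state field.
    """
    _, _, rest = stat_text.rpartition(")")
    return rest.split()[0]


def read_process_state(pid, proc_root="/proc"):
    """Return the state letter for a live process, e.g. 'D' for uninterruptible sleep."""
    with open(f"{proc_root}/{pid}/stat") as handle:
        return process_state(handle.read())
```

A result of "D" for the ddcutil process matches the symptom reported here: the process is blocked inside the kernel and signals, including SIGKILL, are not delivered until the blocking call returns.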