Feature Question: Switch from normal malloc to cudaMallocManaged for UserPTR in camera examples? #8

Open
wittenator opened this issue Mar 22, 2022 · 0 comments

Comments

@wittenator

I am currently working on a camera pipeline on a Jetson AGX Xavier and use the OpenCV performance example as a guideline for interacting with my Allied Vision cameras. I needed the Jetson's Unified Memory zero-copy capability, so I experimented with the buffer implementations. I have not run explicit benchmarks, but my application's runtime seems to have dropped from 7 ms to 2 ms just by using the USERPTR option with buffers allocated through cudaMallocManaged instead of the MMAP (kernel memory) option. An additional benefit is that GPU transfers no longer involve any copying. Would you be interested in adding this?
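
To make the idea concrete, here is a minimal sketch of how the USERPTR buffers could be allocated. It is not the code from the example in this repository: `FrameBuffer`, `allocFrameBuffer`, `freeFrameBuffer`, `allocPool`, and the `USE_CUDA_MANAGED` macro are hypothetical names chosen for illustration. With the macro defined (and the file built against the CUDA runtime) the buffers come from `cudaMallocManaged`; without it, the sketch falls back to page-aligned host memory so it still compiles on a machine without CUDA. Whether a given capture driver accepts managed memory as a user pointer is an assumption that has to be checked on the target system.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <cstring>
#include <vector>
#include <unistd.h>  // sysconf

#ifdef USE_CUDA_MANAGED  // hypothetical build switch, not part of the original example
#include <cuda_runtime.h>
#endif

// Hypothetical helper type: one frame buffer to be handed to the driver as USERPTR.
struct FrameBuffer {
    void*       data   = nullptr;
    std::size_t length = 0;
};

inline std::size_t pageSize() {
    return static_cast<std::size_t>(sysconf(_SC_PAGESIZE));
}

// Round the requested size up to a whole number of pages.
inline std::size_t roundUpToPage(std::size_t bytes) {
    const std::size_t page = pageSize();
    return (bytes + page - 1) / page * page;
}

// Allocate one buffer; returns false on failure and leaves buf untouched.
inline bool allocFrameBuffer(FrameBuffer& buf, std::size_t bytes) {
    const std::size_t length = roundUpToPage(bytes);
    void* p = nullptr;
#ifdef USE_CUDA_MANAGED
    // Managed (unified) memory: the same pointer is usable from CPU and GPU code.
    if (cudaMallocManaged(&p, length, cudaMemAttachGlobal) != cudaSuccess) {
        return false;
    }
#else
    // Fallback without CUDA: plain page-aligned host memory.
    if (posix_memalign(&p, pageSize(), length) != 0) {
        return false;
    }
#endif
    buf.data   = p;
    buf.length = length;
    return true;
}

inline void freeFrameBuffer(FrameBuffer& buf) {
#ifdef USE_CUDA_MANAGED
    cudaFree(buf.data);
#else
    std::free(buf.data);
#endif
    buf = FrameBuffer{};
}

// Allocate `count` buffers of at least `bytes` each; returns an empty pool on failure.
inline std::vector<FrameBuffer> allocPool(std::size_t count, std::size_t bytes) {
    std::vector<FrameBuffer> pool(count);
    for (auto& b : pool) {
        if (!allocFrameBuffer(b, bytes)) {
            for (auto& a : pool) {
                if (a.data != nullptr) freeFrameBuffer(a);
            }
            return {};
        }
    }
    return pool;
}
```

Each buffer's `data` and `length` would then go into `v4l2_buffer.m.userptr` and `v4l2_buffer.length`, with `memory` set to `V4L2_MEMORY_USERPTR`, before queuing with `VIDIOC_QBUF`. Synchronizing CPU and GPU access to a frame remains the application's job; the sketch only covers allocation.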
