Question about general use in any case #15

Petros626 · 2023-04-20T09:07:56Z

Hello,

I would like to know, if this script can measure the distance of any object shown in a stream, no matter where it's located?
If not, what adjustments in the code are necessary?

I plan to use a stereo camera and a pretrained CNN, which detects objects, additionally I want to measure the distance to the detected objects.

Thanks in advance

shanearthur · 2023-08-15T19:32:40Z

Hey Petros626,

The script will produce a two dimensional array of values which represent the depth of each pixel in the scene, however due to the nature of this traditional method, not all pixels will have values attributed to them. Some ill-posed regions (areas where the algorithm has a hard time determining depth) will be left empty.

As long as the object is within the image/frame from your stream and a value which represents the pixels of your object is within the depth map, you will be able to determine the estimated depth. I'd recommend doing some averaging of small kernels depending on what kind of object it is.

Note: the resulting values will be disparity values, which you will use to calculate metric depth with depth = baseline * focal_length / disparity. For more see here.

Petros626 · 2023-08-15T20:04:23Z

Thank you for your detailed answer. The most algorithms I saw was with a known object size, but this wasn't which I was searching for. So if with the mentioned formula you can calculate the distance why in your code specific values are made here:

Stereo-Vision/Main_Stereo_Vision_Prog.py

Line 37 in 597d9e5

Distance= -593.97*average**(3) + 1506.8*average**(2) - 1373.1*average + 522.06

shanearthur · 2023-08-15T21:44:57Z

why in [the] code specific values are made here

Those are likely the specific parameters provided by datasets which the author was using to test this code. They may also be customized to compensate for the averaging being done two lines above your referenced line, where the author is getting the average disparity value of a small kernel of pixels around the pixel in question:

Stereo-Vision/Main_Stereo_Vision_Prog.py

Line 35 in 597d9e5

average += disp[y+u,x+v]

Vujas-Eteph · 2023-12-18T07:17:40Z

Yes, as @shanearthur said. Those values (polynomial coefficients) are estimated based on a custom dataset.
(Similar to issue #5).
When and how we estimated those parameters is shown in this section of the YouTube video. We plotted a curve in which we had the distance on the x-axis and the disparity values on the y_axis (if I remember correctly). Afterward, we did a polynomial regression of degree 3 (known from the literature) via Excel - but you can do that with Python libraries to optimize the value.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about general use in any case #15

Question about general use in any case #15

Petros626 commented Apr 20, 2023

shanearthur commented Aug 15, 2023

Petros626 commented Aug 15, 2023

shanearthur commented Aug 15, 2023

Vujas-Eteph commented Dec 18, 2023

Question about general use in any case #15

Question about general use in any case #15

Comments

Petros626 commented Apr 20, 2023

shanearthur commented Aug 15, 2023

Petros626 commented Aug 15, 2023

shanearthur commented Aug 15, 2023

Vujas-Eteph commented Dec 18, 2023