You won't be able to do much with the raw data on something with the compute power of an Arduino. SLAM takes a lot of compute and memory, and the compute scales quickly with resolution.
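To put a rough number on the memory side, here's a back-of-the-envelope sketch. The assumptions are mine, not from any particular SLAM package: a plain occupancy grid over a 10 m x 10 m area, 1 byte per cell, compared against the 2 KB of SRAM on an ATmega328P-based Arduino Uno. A real SLAM pipeline needs far more than just the map, so treat this as a lower bound.

```python
UNO_SRAM_BYTES = 2048  # SRAM on an ATmega328P-based Arduino Uno


def grid_cells(side_m: float, cell_m: float, dims: int = 2) -> int:
    """Number of cells in a square (dims=2) or cubic (dims=3) occupancy grid.

    Hypothetical helper for illustration only.
    """
    per_axis = round(side_m / cell_m)
    return per_axis ** dims


# Assumption: 10 m x 10 m area, 1 byte per cell.
for cell_m in (0.10, 0.05, 0.025):
    cells = grid_cells(10.0, cell_m)
    print(f"{cell_m * 100:.1f} cm cells: {cells} bytes "
          f"(~{cells / UNO_SRAM_BYTES:.0f}x the Uno's SRAM)")
```

Halving the cell size quadruples the cell count in 2D and multiplies it by eight in 3D, which is why resolution gets expensive so fast.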
(I haven't checked but I'm sure someone else has already used this on all the popular socials)
If you’re interested in reconstruction from images, check out Meshroom and Nerfstudio.
With a stereo image you know the distance between the lenses (the baseline), which lets you recover the real size of objects (= you know the scale). A single camera can only reconstruct the scene up to an unknown scale factor.
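A minimal sketch of why the baseline gives you scale, assuming a rectified stereo pair and a simple pinhole model. The function names and the numbers (700 px focal length, 12 cm baseline) are made up for illustration, not taken from any specific camera.

```python
def stereo_depth_m(focal_px: float, baseline_m: float, disparity_px: float) -> float:
    """Depth from the pinhole stereo relation Z = f * B / d (rectified pair)."""
    if disparity_px <= 0:
        raise ValueError("disparity must be positive")
    return focal_px * baseline_m / disparity_px


def object_width_m(width_px: float, depth_m: float, focal_px: float) -> float:
    """Metric width of an object that spans width_px pixels at depth_m."""
    return width_px * depth_m / focal_px


# Assumed values: 700 px focal length, 0.12 m baseline, 42 px disparity.
depth = stereo_depth_m(focal_px=700.0, baseline_m=0.12, disparity_px=42.0)
width = object_width_m(width_px=175.0, depth_m=depth, focal_px=700.0)
print(f"depth ~ {depth:.2f} m, width ~ {width:.2f} m")
```

Without a known baseline, the same disparity fits a small scene up close or a large scene far away, so there is nothing to pin the metric size to.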
To avoid this, I would just use a LiDAR-equipped iPhone Pro, which comes with industrial-grade cross-calibration, and still have all the visualization fun.