The tech stack in the splat world is still really young. For instance, I was thinking to myself: “Cool, MVSplat is pretty fast. Maybe I’ll use it to get some renderings of a field by my house.”

As far as I can tell, I will need to supply a bunch of photographs with camera pose data attached — okay, fair enough, the splat architecture exists to generate splats, not poses.

Now, what’s the best way to get camera pose data from arbitrary outdoor photos? … Cue a long wrangle through multiple papers. Maybe, as of today… FAR? (https://crockwell.github.io/far/). It claims up to 80% pose accuracy, depending on the source data.

I have no idea how MVSplat will deal with camera pose data that is only 80% accurate… And I also don’t understand whether I should use one of their pre-trained models, train my own, or fine-tune one of theirs on my photos… This is sounding like a long project.

I don’t say this to complain, only to note where the edges are right now and to think about the commercialization gap. There are iPhone apps that will get (shitty) splats together for you right now, and there are higher-end commercial products like Skydio that will work with a drone to fill in a three-dimensional representation of an object (or maybe some land; I’m not sure about the outdoor support), but those run to multi-thousand-dollar-per-month subscriptions plus hardware, as far as I can tell.

Anyway, interesting. I expect that over the next few years we’ll have push-button stacks based on ‘good enough’ open models, and those will iterate and go through cycles of being upsold / improved / etc. We are still a ways away from a trawl through an iPhone/Google Photos library and a “hey, I made some environments for you!” type of feature. But not infinitely far away.



Use COLMAP to generate pose data via structure-from-motion; if you use Nerfstudio to make your splat (with the Splatfacto method), it includes a command that will run the COLMAP alignment for you. This is definitely a weak spot, though: a lot goes wrong in the alignment process unless you have a smooth walkthrough video of your subject with no other moving objects.
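
If you want to script it yourself, here’s a minimal sketch using the pycolmap bindings (the database/output paths and the images/ directory are placeholders, and some attribute names have shifted between pycolmap releases, so treat this as a starting point rather than a recipe):

    import pycolmap  # pip install pycolmap

    # Classic SfM pipeline: detect features, match them across images,
    # then run incremental mapping to recover camera poses.
    pycolmap.extract_features(database_path="colmap.db", image_path="images/")
    pycolmap.match_exhaustive(database_path="colmap.db")
    reconstructions = pycolmap.incremental_mapping(
        database_path="colmap.db",
        image_path="images/",
        output_path="sparse/",
    )

    # Each reconstruction holds the registered images and their poses.
    for image in reconstructions[0].images.values():
        print(image.name, image.cam_from_world)

Nerfstudio’s ns-process-data images command wraps essentially this same pipeline and writes the poses out in the format ns-train splatfacto expects.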

On iPhone, Scaniverse (owned by Niantic) produces far more accurate splats than pipelines that reconstruct from 2D video/images alone, because it uses LiDAR to gather the depth information needed for good alignment. I think that even on older iPhones without LiDAR, it can estimate depth if the phone has multiple camera lenses. Like ryandamm said above, the main issue seems to be low value/demand for novel technology like this. Most of the use cases I can think of (real estate? shopping?) are usually better served by 2D video and imagery.


I think the barrier to commercialization is the lack of demonstrated economic value in push-button splats. There's no shortage of small teams wiring together open-source splat / NeRF / whatever papers; there's a dearth of valuable, repeatable businesses that could make use of what those small teams are building.

Would it be cool to just have content in 3D? Undoubtedly. But figuring out a use case, that's where people need to be focusing. I think there are a lot of opportunities, but it's still early days -- and not just for the technology.


Yes - agreed. There’s a clear use case for indie content, but tooling around editing/modifying/color/lighting has to improve, and rendering engines or converters need to get better. FWIW it doesn’t seem like a dead-end tech to me though; more likely a gateway tech to cost improvements. We’ll see.



