This library creates new 3D views from one image in real-time
T

This library creates new 3D views from one image in real-time

This library creates new 3D views from one image in real-time

8,630 stars
N/A forks
N/A contributors

README

Project documentation from GitHub

One Image, Infinite Angles: Apple's ML-Sharp Creates 3D Views in Real Time

Ever wish you could just take a single photo and then look around the scene as if you were standing there? That sci-fi idea just got a lot more real. Apple’s open-source research project, ML-Sharp, tackles exactly that: generating novel 3D viewpoints from a single 2D image, and it aims to do it in real time.

For developers, especially in AR/VR, gaming, or creative tools, this is a fascinating leap. It moves us away from the need for multi-camera rigs, depth sensors, or complex photogrammetry setups. What if you could build an immersive experience starting with nothing but a picture from a phone?

What It Does

In technical terms, ML-Sharp is a framework for training and evaluating neural radiance field (NeRF) models. NeRF is a technique that creates a 3D scene representation by learning how light radiates from objects. The "sharp" in ML-Sharp refers to its focus on a specific, efficient variant of NeRF called SDF-based NeRF (signed distance functions), which can yield sharper, more detailed geometries.

The library provides the tools to take a set of images of an object or scene (or in ambitious cases, extrapolate from one image), train a model to understand its 3D structure, and then generate completely new views of it from angles the original photos never captured.

Why It's Cool

The core promise here is accessibility and speed. Traditional 3D reconstruction can be resource-heavy. ML-Sharp, as a research framework from Apple, often hints at optimizations for their hardware (think Apple Silicon), pushing towards real-time performance. Real-time novel view synthesis opens doors:

  • Mobile AR: Imagine pointing your phone at a product on a shelf and instantly seeing a 360-degree view.
  • Content Creation: Game developers or filmmakers could quickly generate 3D backgrounds from concept art or reference photos.
  • Virtual Tours: Create explorable spaces from a handful of photos, or even a single panoramic shot.

It’s also a solid, well-structured codebase. It’s not just a research paper dump; it’s a practical toolkit for developers who want to experiment with cutting-edge NeRF techniques, offering a clearer path from paper to implementation.

How to Try It

Ready to spin up some 3D views? The project is on GitHub.

  1. Head to the repository:github.com/apple/ml-sharp
  2. Check the prerequisites: You’ll need a Python environment and the usual ML suspects like PyTorch. The README has the details.
  3. Follow

Did you like this issue?

Join our weekly newsletter

Love discovering amazing projects?

Help us continue bringing you the best open-source discoveries every week.

Back to Projects
Last updated: Mar 18, 2026