PitchHut logo
Create Gaussian splats easily with Apple's SHARP model.
Pitch

ml-sharp-web is a browser-based Gaussian splat generator that utilizes Apple's SHARP model. Users can upload an image, generate Gaussian splats in real-time, preview the results, and download them as .ply files, all within a seamless web experience.

Description

This project, known as ml-sharp-web, serves as a browser-based Gaussian splat generator utilizing Apple's SHARP model. Designed for ease of use, it empowers users to generate Gaussian splats directly in their web browsers, enabling fast and interactive visualization of results. Here are the key features of the application:

  • Upload a single image to begin the processing.
  • Generate Gaussian splats in real-time with a simple click.
  • Preview the resulting visual output directly in the browser.
  • Download the generated output as a .ply file for further usage.

Key Links

Requirements

To efficiently utilize this project, the following prerequisites are essential:

  • Bun: Ensure the Bun environment is installed.
  • Browser Compatibility: Use a modern desktop browser with recommended versions like Chrome or Edge.
  • System Resources: Adequate disk space and RAM to handle the SHARP model, noting that it might require approximately 2.4 GB due to the large exported ONNX sidecar.

Important Model File Consideration

The SHARP model generates two essential files for proper operation:

  • sharp_web_predictor.onnx (the main model)
  • sharp_web_predictor.onnx.data (contains model weights)

Both files must be hosted together in the same directory for the application to function correctly, preferably using the default hosted model to avoid errors.

Quick Start

Users can quickly get the application up and running with minimal effort. After installation, the app can be started, allowing image uploads and Gaussian splat generation in a seamless user experience.

Technical Overview

The application's architecture incorporates:

  • React + TypeScript for a responsive user interface.
  • ONNX Runtime Web Worker for efficient inference execution.
  • Browser-side conversions and processing for optimized performance.

Status and Performance

This application is currently classified as a working prototype. While it functions effectively, performance may vary based on the browser's WebGPU/WASM support and the machine's available resources. Users may encounter performance constraints, particularly with larger models or higher generation demands.

0 comments

No comments yet.

Sign in to be the first to comment.