All tutorials

NSS Background Remover

Model tiers and bring-your-own-ONNX

Choose the right model tier for your device — Lite, Standard, or Pro — manage what is cached, and point an advanced Pro capability at your own hosted ONNX model, all running locally.

NSS Background Remover model tiers (Lite/Standard/Pro) and bring-your-own-ONNX, all on-device

NSS Background Remover sizes its AI in real terms rather than hype. Model tiers are described in actual gigabytes — a Lite path that downloads nothing, a Standard tier of a few hundred megabytes, and a Pro tier for heavy models — and the app only recommends a tier your device can actually run.

This tutorial covers choosing a tier, managing what is downloaded, and the advanced bring-your-own-ONNX option that lets you point a capability at your own hosted model — which works because the runtime is already in your browser.

  1. 1

    1. Let the app recommend a tier

    The tool probes your device — checking for a usable WebGPU adapter and available memory — and recommends the highest tier it can actually run. Lite uses classical and lightweight paths with no large download; Standard uses verified mainstream models at a few hundred megabytes; Pro uses heavy models for demanding tasks and needs capable hardware.

    Following the recommendation means the tool finishes rather than stalling on a model your device cannot handle.

    • Lite ≈ 0 MB, Standard ≈ a few hundred MB, Pro ≈ heavier.
    • The app recommends the highest tier your device can run.
  2. 2

    2. Manage what is downloaded

    You stay in control of the cache: the tool can show how much is cached and lets you delete downloaded models to reclaim space, with accurate, byte-level download progress so you always know what you are fetching.

    This matters because the models live in your browser — managing them is managing your own device's storage.

    • See how much is cached; delete models to reclaim space.
    • Byte-accurate download progress.
  3. 3

    3. Bring your own ONNX model (Pro)

    For advanced users, the Pro tier supports bringing your own ONNX model: point a capability at your own hosted model URL, and the loader fetches and runs it in your browser on the same WebGPU-primary, WASM-fallback path as everything else.

    This is natural because the runtime already lives in your browser and inference runs on your hardware — the model is just a file, so swapping it is a configuration change, not a different architecture.

    • Point a capability at your hosted ONNX URL.
    • Runs on the same WebGPU/WASM path; inputs never leave your machine.
  4. 4

    4. Know that honesty is the default

    The tiers reflect a broader honesty principle: the model registry is audited so the app does not claim models it cannot load, and classical algorithms are labelled as classical rather than dressed up as neural AI. What you select is what actually runs.

    That means you can trust the tier labels and sizes when deciding what your device should run.

    • Audited registry; classical methods labelled honestly.
    • Tier labels and sizes reflect reality.

Run what your device can actually handle

Resist the urge to force the heaviest model — let the app recommend the tier your device can run, and it will finish reliably rather than stalling. Reserve the Pro tier and bring-your-own-ONNX for capable hardware and genuinely demanding tasks; for most everyday cutouts, Standard is the sweet spot.

Go deeper