Deploy an ML model from the registry
The fastest way to get a working vision pipeline is to pick a pre-trained model from the Viam registry. Select a model, pick a framework-matching ML model service, and save the config. viam-server downloads the model to the machine and the vision service can use it immediately.
Use this guide when a general-purpose or community-shared model already handles your task (people detection, common objects, COCO-class recognition). If you need a custom model trained on your data, see Deploy a custom ML model.
1. Pick a model
Open the Viam registry and search or browse for models. If you are not sure which kind of registry entry to pick, see What’s in the registry for vision first. Each entry shows its framework, supported hardware, and a short description.
When picking, match the model to:
- The task you need (detection vs classification vs segmentation).
- The framework the model was trained in. The ML model service you configure must support it.
- The hardware on the machine. TFLite models run on almost any CPU. Larger frameworks benefit from GPUs.
Supported frameworks and hardware
| Framework | ML model service | Hardware support | Notes |
|---|---|---|---|
| TensorFlow Lite | tflite_cpu | linux/amd64, linux/arm64, darwin/arm64, darwin/amd64 | Quantized models; broadest hardware support. Model requirements documented in the module README. |
| ONNX | onnx-cpu, triton | Nvidia GPU, linux/amd64, linux/arm64, darwin/arm64 | Universal format. Not hardware-specific unless you use Triton on a Jetson. |
| TensorFlow | tensorflow-cpu, triton | Nvidia GPU, linux/amd64, linux/arm64, darwin/arm64 | Full framework. Best for larger production models. |
| PyTorch | torch-cpu, triton | Nvidia GPU, linux/arm64, darwin/arm64 | Common for open-source research models. |
Note
For ML model services that support GPUs (like Triton on Jetson), you can configure the service to use either the CPU or the dedicated GPU at runtime.
2. Add the ML model service
The ML model service loads the model file and exposes it for inference. It does not interpret the output; that is the vision service’s job.
- Open your machine in the Viam app and go to the CONFIGURE tab.
- Click the + icon next to your machine part and select Configuration block.
- In the search field, type the ML model service name (for example, `tflite_cpu` for TFLite models) and select the matching result.
- Click Add component, give it a name (for example, `my-ml-model`), and click Add component again to confirm.
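After you save, the app writes the service into your machine's JSON configuration. As a rough sketch (the exact fields depend on the module and its version, so treat the shape below as illustrative rather than authoritative), a TFLite ML model service named `my-ml-model` looks something like:

```json
{
  "services": [
    {
      "name": "my-ml-model",
      "type": "mlmodel",
      "model": "tflite_cpu",
      "attributes": {}
    }
  ]
}
```

You normally never write this by hand; the CONFIGURE tab generates it when you add the service through the UI.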
3. Select the model
In the ML model service panel:
- Click Select model. A dialog opens showing models from your organization and the public registry.
- Filter or search for the model you picked in step 1.
- Click the model to select it.
- In the Version dropdown, choose either a specific version or Latest:
- A specific version pins the machine to that version until you explicitly change it. Recommended for production.
- Latest automatically upgrades when a newer version of the model is published. Useful during development but risky in production.
Save the configuration. viam-server downloads the model package to the machine and reports the ML model service as ready.
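Behind the scenes, the selected model becomes a package entry in the machine configuration. The sketch below is illustrative only (key names, the package path, and the version-string format vary by registry entry); its point is that pinning a version means this string stays fixed until you change it, while Latest lets viam-server resolve a new version on its own:

```json
{
  "packages": [
    {
      "name": "my-model",
      "package": "example-org/my-model",
      "type": "ml_model",
      "version": "latest"
    }
  ]
}
```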
4. Add a vision service that wraps the model
The ML model service is a building block. To get detections, classifications, or point cloud objects from your code, add a vision service that wraps it.
- Click the + icon and select Configuration block.
- In the search field, type `vision` or `mlmodel` and select the `vision/mlmodel` result.
- Click Add component, name the service (for example, `my-detector`), and click Add component again to confirm.
- In the vision service panel’s ML MODEL section, select the ML model service from step 3.
- In the DEFAULT CAMERA section, pick the camera the vision service should use by default.
- Save.
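The resulting vision service configuration references the ML model service by name. A minimal sketch, assuming the service names used in the steps above (`my-detector` and `my-ml-model`; other attributes shown in the mlmodel reference are omitted):

```json
{
  "name": "my-detector",
  "type": "vision",
  "model": "mlmodel",
  "attributes": {
    "mlmodel_name": "my-ml-model"
  }
}
```

The `mlmodel_name` attribute is the link between the two services: the vision service sends images to that ML model service for inference and interprets the returned tensors as detections or classifications.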
If the underlying model uses non-standard tensor names or preprocessing conventions, additional configuration is required. See the mlmodel reference for every attribute, and Tune detection quality for a symptom-to-attribute map.
5. Verify
Open the CONTROL tab, click the vision service, and select the camera from the Camera dropdown. If the model is working, bounding boxes or classifications appear as an overlay on the live camera feed and refresh automatically. Make sure you pick the same camera your vision service is configured to use.
If you see nothing:
- Confirm the vision service registered in the expected role. Call `GetProperties` or check the vision service’s Test panel. A detection model that registered as a classifier (or vice versa) will return empty lists.
- Lower `default_minimum_confidence` on the vision service and try again. The model may be producing detections below the default confidence threshold.
- Check `viam-server` logs for startup errors mentioning tensor names or label files.
See Tune detection quality for detailed fixes.
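To see whether a confidence threshold is the culprit, it can help to inspect raw detections at different cutoffs before touching the service config. A minimal sketch, assuming detections arrive as dicts with a `confidence` score and a `class_name` label (the Viam SDKs return detection objects with equivalent fields):

```python
def filter_detections(detections, min_confidence=0.5):
    """Keep only detections at or above the given confidence floor."""
    return [d for d in detections if d["confidence"] >= min_confidence]

# Hypothetical raw output: one strong detection and two weak ones that a
# typical default threshold would hide.
raw = [
    {"class_name": "person", "confidence": 0.91},
    {"class_name": "person", "confidence": 0.34},
    {"class_name": "dog", "confidence": 0.12},
]

print(filter_detections(raw, 0.5))  # only the 0.91 detection survives
print(filter_detections(raw, 0.1))  # all three survive
```

If lowering the floor suddenly surfaces detections, the model is running fine and only the threshold needs tuning; if the list stays empty at any cutoff, look at tensor names, label files, or the service's registered role instead.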
Update to a newer model version later
When a newer version of the model is published to the registry:
- Open the ML model service panel on your machine.
- Change the Version dropdown to the new version.
- Save.
`viam-server` downloads the new version and restarts the service in place.
Your vision service, your application code, and your downstream triggers and modules do not need to change.
For rolling this out across many machines at once, see Retrain when your model drifts and the fleet deployment docs.
Next steps
- Configure a vision pipeline: the broader end-to-end setup
- Tune detection quality: fine-tune mlmodel vision service attributes
- Deploy a custom ML model: bring your own trained model
- Retrain when accuracy drops: close the loop on drift