Workflows
Remote GPU (Inference Server)
If you don't have a local GPU but have access to a remote GPU server, you can run inference there and stream actions back to your local machine over an SSH tunnel.
Step 1 — Start the inference server on the GPU machine
Only the gpu-server/ar4-infer/ directory is needed on the GPU server. If you already trained on that machine (Workflow 2, Option B), the repo is already cloned there.
To pin a specific checkpoint step:
Step 2 — Open an SSH tunnel (keep this terminal open)
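As a rough sketch of what the tunnel does, the helper below assembles a standard `ssh -N -L` local port forward and checks that the local end accepts connections. The port `8000`, the user `alice`, and the host `gpu.example.com` are placeholders chosen for illustration, not values from this project; substitute the port your inference server actually listens on.

```python
import shlex
import socket


def build_tunnel_command(user, host, local_port=8000, remote_port=8000):
    """Return the argv for an SSH local port forward.

    -N : do not run a remote command, only forward ports
    -L : forward local_port on this machine to localhost:remote_port
         as seen from the GPU server
    Port 8000 is a placeholder, not a value taken from this project.
    """
    forward = f"{local_port}:localhost:{remote_port}"
    return ["ssh", "-N", "-L", forward, f"{user}@{host}"]


def tunnel_is_up(local_port=8000, timeout=1.0):
    """Return True if something accepts TCP connections on the local end.

    This only confirms that the local port is open. It does not prove
    that the inference server on the far side is running.
    """
    try:
        with socket.create_connection(("127.0.0.1", local_port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Hypothetical user and host; replace with your own.
    cmd = build_tunnel_command("alice", "gpu.example.com")
    print(shlex.join(cmd))
```

Run the printed command in its own terminal and leave it running. Because of `-N`, the session shows no prompt while the forward is active.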
Step 3 — Start sim and point inference at the tunnel
The smolvla container sends observations to the server over HTTP through the tunnel and receives joint targets back, so the arm moves as if the GPU were local.
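To make the request/response loop concrete, here is a minimal sketch of one round trip using only the Python standard library. The `/predict` path, the base URL, and the JSON field names are assumptions made for illustration; the real endpoint and payload schema used by the smolvla container may differ.

```python
import json
import urllib.request


def request_joint_targets(observation, base_url="http://localhost:8000", timeout=5.0):
    """POST one observation as JSON and return the decoded JSON reply.

    The /predict path and the payload layout are hypothetical; check the
    inference server for its actual endpoint and schema.
    """
    body = json.dumps(observation).encode("utf-8")
    req = urllib.request.Request(
        f"{base_url}/predict",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    # With the SSH tunnel open, localhost:<port> reaches the GPU server.
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

A caller would invoke this once per control step, for example `request_joint_targets({"joint_positions": [0.0, 0.1]})`, and pass the returned targets on to the simulated arm. Each step pays one network round trip, so the tunnel's latency bounds the achievable control rate.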