Scaling Distributed Inference Engines Across Heterogeneous Edge Clusters Using WebAssembly and Rust
Introduction Edge computing has moved from a buzzword to a production‑grade reality. From autonomous vehicles and smart cameras to industrial IoT gateways, the need to run machine‑learning inference close to the data source is no longer optional—it is a performance, latency, and privacy requirement. Yet the edge landscape is inherently heterogeneous: devices differ in CPU architecture (x86, ARM, RISC‑V), available accelerators (GPU, NPU, DSP), operating systems, and even networking capabilities. ...