Multi-machine AI mesh network that orchestrates agents across your Mac, RTX 4090, and RTX 4070 — executing tasks in parallel with fault-tolerant coordination.
Automatic peer discovery via mDNS. Each machine registers its capabilities — GPU memory, model slots, current load — and the coordinator routes tasks optimally.
Split large tasks across machines. Run Qwen 32B on the 4090 while the 4070 handles embeddings and the Mac runs orchestration logic. True parallelism.
If a node drops, tasks automatically failover to the next available machine. Health checks run every 5 seconds. Zero downtime, always operational.
Tasks are assigned based on model affinity, GPU utilization, memory pressure, and thermal state. The scheduler maximizes throughput across the fleet.
Real-time visualization of node health, task queues, GPU temps, and throughput. Monitor your entire fleet from the Moltbot Dashboard at a glance.
Load and unload models on any node without restarting the mesh. Switch between Qwen, LLaMA, Mistral, and more — the hive adapts in real time.