developer cloud
Developer Cloud Google Cut Edge AI Costs 70%
Google Cloud reduces edge AI egress costs by 70% and cuts inference latency from 50 ms to 12 ms by moving inference onto the MA2 API with on-device execution. The announcement at Google Cloud Next 2026 showed developers can double workload volume while spending a fraction of the prior budget.