System & architecture overhead
Data movement between memory and compute. Idle and standby. The emulation tax of running math on a general-purpose stack. And, on any link, the cost of transmitting a bit.
Architectural answer
In-memory compute (no movement). Event-driven operation (no idle). A microkernel with math as hardware primitives (no emulation). Decisions-not-data downlink (no needless transmission).