TOPS – quantifies an NPU’s processing capabilities by measuring the number of operations (additions, multiplies, etc.) in trillions executed within a second

TOPS = 2 × MAC unit count × Frequency / 1 Trillion

Multiply Accumulate (MAC) operation executes the mathematical formulas at the core of AI workloads

Frequency dictates the clock speed (or cycles per second) at which an NPU and its MAC units (as well as a CPU or GPU) operate directly influencing overall Performance

Precision refers to the granularity of calculations with higher precision typically correlating with increased model accuracy at the expense of computational intensity

Leave a Reply

You must be logged in to post a comment.