| |
| | [No title] |
 | | The Power architecture was one of the first to include a fused multiply-add instruction, giving a two-fold speedup in many floating-point intensive programs. |
 | | Another interesting and important feature of the Power architecture is its incrementing loads and stores, which significantly reduce instruction overhead in loops involving array operations. |
 | | For example, the inner loop of a simple matrix multiply, SUM += B(I,K)*C(K,J), requires two incrementing loads, one multiply-add, and a branch, for a tight 4-instruction loop that is minimally compact, and readily generated by IBM compilers. |
| www.math.utah.edu /pub/tex/errata/computer-architecture.errata (1738 words) |
|