Integer matrix multiplication at various precisions using AMX and IFMA
HomepagePublicationsTalksTeXmacsMathemagix

Abstract

In this paper we study hardware-accelerated integer matrix multiplication, with coefficients of sizes between 8 and 1000 bits. More particularly, we study two relatively new hardware features in Intel CPUs: the IFMA “integer FMA” instruction and the AMX matrix extensions. We study various algorithms and analyze to what extent our implementations on top of the JIL library can approach theoretical peak performance.

Authors: Joris van der Hoeven, Marc Mezzarobba

View: Html, TeXmacs, Pdf, BibTeX