BLAS-like Library Instantiation Software Framework
-
Updated
Aug 20, 2025 - C
BLAS-like Library Instantiation Software Framework
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Acceleration package for neural networks on multi-core CPUs
BLISlab: A Sandbox for Optimizing GEMM
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
M4RI is a library for fast arithmetic with dense matrices over GF(2)
Parallel Matrix Multiplication Using OpenMP, Phtreads, and MPI
Fast Matrix Multiplication Implementation in C programming language. This matrix multiplication algorithm is similar to what Numpy uses to compute dot products.
使用OpenMP及MPI完成的几个并行程序设计小实验:矩阵相乘、矩阵LU分解、文档分类中的文档向量过程
A Collection some matrix manipulation algorithms
The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor's hardwares/OS. Out-of-the-box easy as MSVC, MinGW, Linux(CentOS) x86_64 binary provided. 在不同矩阵大小/硬件/操作系统下比较几个BLAS库的sgemm函数性能,提供binary,开盒即用。
Matrix Multiplication in Hardware
Matrix multiplication on multiple Nios II cores
This repository contains a C implementation of matrix multiplication with various optimization techniques.
☕Implement of Parallel Matrix Multiplication Methods Using FOX Algorithm on Peking University's High-performance Computing System
Minimalistic implementations of various algorithms for projects in machine learning and computer vision
Linear Algebra library for C/C++
Add a description, image, and links to the matrix-multiplication topic page so that developers can more easily learn about it.
To associate your repository with the matrix-multiplication topic, visit your repo's landing page and select "manage topics."