This repository contains a comprehensive report detailing the implementation and optimization of matrix multiplication using OpenMP and CUDA. The goal of the project was to enhance the performance of ...
We’re just a few years into the AI revolution, but AI systems are already improving decades-old computer science algorithms. Google’s AlphaEvolve AI, its latest coding agent for algorithm discovery, ...
This project implements a simple matrix multiplication calculator in Mojo, a programming language designed for AI and high-performance computing. The goal is to test Mojo's standard library, identify ...
Abstract: While the Karatsuba algorithm reduces the complexity of large integer multiplication, the extra additions required minimize its benefits for smaller integers of more commonly-used bitwidths.
Abstract: Structured sparsity has been proposed as an efficient way to prune the complexity of Machine Learning (ML) applications and to simplify the handling of sparse data in hardware. Accelerating ...