lonestar-lemmy
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
bot@lemmy.smeargle.fansMB to Hacker News@lemmy.smeargle.fans · 11 months ago

Beating NumPy matrix multiplication in 150 lines of C

salykova.github.io

external-link
message-square
0
link
fedilink
  • cross-posted to:
  • programming@programming.dev
4
external-link

Beating NumPy matrix multiplication in 150 lines of C

salykova.github.io

bot@lemmy.smeargle.fansMB to Hacker News@lemmy.smeargle.fans · 11 months ago
message-square
0
link
fedilink
  • cross-posted to:
  • programming@programming.dev
Beating NumPy’s matrix multiplication in 150 lines of C code
salykova.github.io
external-link
TL;DR The code from the tutorial is available at matmul.c. This blog post is the result of my attempt to implement high-performance matrix multiplication on CPU while keeping the code simple, portable and scalable. The implementation follows the BLIS design, works for arbitrary matrix sizes, and, when fine-tuned for an AMD Ryzen 7700 (8 cores), outperforms NumPy (=OpenBLAS), achieving over 1 TFLOPS of peak performance across a wide range of matrix sizes.

HN Discussion

alert-triangle
You must log in or register to comment.

Hacker News@lemmy.smeargle.fans

hackernews@lemmy.smeargle.fans

Subscribe from Remote Instance

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !hackernews@lemmy.smeargle.fans
lock
Community locked: only moderators can create posts. You can still comment on posts.

A mirror of Hacker News’ best submissions.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 86 users / day
  • 7 users / week
  • 2 users / month
  • 9 users / 6 months
  • 1 local subscriber
  • 2.17K subscribers
  • 15.3K Posts
  • 3.91K Comments
  • Modlog
  • mods:
  • bot@lemmy.smeargle.fans
  • BE: 0.19.11
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org