Extend to matrices and high-order tensors by jli05 · Pull Request #94 · karpathy/micrograd

jli05 · 2025-04-10T10:23:26Z

First clean-up:

accessory image files moved into assets/
demo notebooks moved into demos/
use Python's built-in unittest package for unit tests, one dependency (on pytest) less

Next, I plan to extend the engine to matrix or higher-order tensors, and make the backward() more efficient so that it won't have to recompute the operator topology for each piece of new input data.

dnparadice · 2025-04-21T14:35:20Z

Why do you keep making PRs? The repo is fine the way it is. If you re-organize it, it will not match the video demos that go along with it.

jli05 · 2025-04-21T22:13:49Z

This is an extension to matrices and higher-order tensors. The notebook demos still run.

To see how it is like, check out https://github.com/brief-ds/micrograd @dnparadice

jli05 · 2025-04-21T22:25:13Z

@karpathy could you review? You can check it out at https://github.com/brief-ds/micrograd:

extend the Value class and backward derivation to matrices and higher-order tensors
add a minimal set of operators
split the tests into tests/test_engine.py to test deterministically (no dependency on torch) and tests/test_vs_torch.py

The Value class and tensordot function are exported at the package level:

from micrograd import Value, tensordot

jli05 · 2025-10-01T17:56:31Z

@karpathy this version works with tensors.

As the core is just one 500-line Python file micrograd/engine.py, the learning curve is almost zero.

This blog post TensorFlow, Apple's MLX and our micrograd explains in terms of install size and performance micrograd is in par with MLX, but micrograd is extra easy to learn, play with and profile.

Would you consider merging?

jli05 · 2026-02-10T20:22:57Z

I made micrograd tensor-capable with in mind the idea to try simplifying the attention mechanism. Below is a proposal. Any thought? @karpathy

https://www.brief-ds.com/2026/02/10/roadmap-att.html

Only micrograd's characteristics makes the study possible (simplicity, opening up forward and backward methods). Feel free to merge this PR. :)

jli05 · 2026-03-07T15:49:04Z

@karpathy it seems the max() in softmax() should be mathematically derivable as in microgpt

https://gist.github.com/karpathy/8627fe009c40f57531cb18360106ce95

Jencir Lee added 11 commits April 10, 2025 11:07

move images to assets/

88477eb

move demo notebooks to demos/

a1a4fac

add tests for unittest

2634276

replace pytorch test with test against known results

f4c6dff

add forward() computation

ac50875

extend autodiff to matrices

1dfa9cf

add log1p operator

8f0a7fd

initialise variables' data with nan

ea2ca4c

add transpose operator

91e14da

add arctanh operator

662f403

add sum operator

3033ee3

jli05 force-pushed the master branch from ee61b9d to 3033ee3 Compare April 18, 2025 09:45

Jencir Lee added 2 commits April 18, 2025 10:50

use nan when variable not in input

81bcbd7

add mean op

a463847

jli05 force-pushed the master branch 2 times, most recently from 224da0a to e7f552f Compare April 21, 2025 13:10

Jencir Lee added 3 commits April 21, 2025 14:18

add tensordot and __matmul__ operators

7f421f9

add log operator

fa66f3b

add dependency to setup.py

04b0d91

jli05 force-pushed the master branch from e7f552f to 691dd2e Compare April 21, 2025 13:18

update README

125bf8d

jli05 force-pushed the master branch from 691dd2e to 125bf8d Compare April 21, 2025 13:19

add tanh operator

f39aed2

Jencir Lee added 2 commits April 21, 2025 18:10

add test for unary ops against torch

b6f8bdd

add test for reduce ops

d79d4dd

jli05 changed the title ~~Rearrange the files before further development~~ Extend to matrices and high-order tensors Apr 21, 2025

move tensordot outside the Value class to become a function

f6ba46c

add installation instructions

364d349

jli05 force-pushed the master branch from 216b6f1 to 364d349 Compare May 9, 2025 19:54

Jencir Lee added 15 commits May 14, 2025 21:27

add arcsin op

fcf0147

use where() for relu op grad

c52ad5c

reuse space for grad

5b78fd0

add dtype

18ad70f

add print-out of numerical error

e6fcefd

use broadcast_to

543f1bd

add link to full examples

8f574b8

shift position of dtype

30671e6

rephrase

9e60b60

re-arrange sections

d6e8393

rephrase

0c6f563

rephrase

416a8f1

add link to core code

a9fbcd2

add link to SGD class

f8f6470

add blog post

9096124

rewrite import

d1b3774

jli05 force-pushed the master branch 2 times, most recently from 9dbca4b to 6d900fe Compare March 14, 2026 21:20

Jencir Lee added 2 commits March 19, 2026 19:48

add exp op

b1d3301

change SGD.step() signature

380aa51

jli05 force-pushed the master branch from 6d900fe to 380aa51 Compare March 19, 2026 19:48

refer to the blog post

f85a2c9

jli05 force-pushed the master branch from 80b1a49 to f85a2c9 Compare May 15, 2026 17:32

Jencir Lee added 2 commits May 22, 2026 14:21

not use Python stack for traversal

5897fcd

add ADAM class

6ac663e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend to matrices and high-order tensors#94

Extend to matrices and high-order tensors#94
jli05 wants to merge 56 commits into
karpathy:masterfrom
brief-ds:master

jli05 commented Apr 10, 2025

Uh oh!

dnparadice commented Apr 21, 2025

Uh oh!

jli05 commented Apr 21, 2025

Uh oh!

jli05 commented Apr 21, 2025

Uh oh!

jli05 commented Oct 1, 2025 •

edited

Loading

Uh oh!

jli05 commented Feb 10, 2026

Uh oh!

jli05 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jli05 commented Apr 10, 2025

Uh oh!

dnparadice commented Apr 21, 2025

Uh oh!

jli05 commented Apr 21, 2025

Uh oh!

jli05 commented Apr 21, 2025

Uh oh!

jli05 commented Oct 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jli05 commented Feb 10, 2026

Uh oh!

jli05 commented Mar 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jli05 commented Oct 1, 2025 •

edited

Loading