Notes on reinforcement learning, mathematics, and software engineering. Written in short bursts between experiments.
A demo post that exercises every feature of the Markdown pipeline: math, code, tables, and more.