spring 2025 garcon-replica recreating Anthropic's garcon system learning-dynamics syntax learning dynamics in MLMs spring 2024 pink-elephants using RLAIF to solve the pink elephants problem hacking-gpt using mechinterp to hack a PPO-ed GPT-2 fall 2023 simulated-annealing exploring parallelization models for simulated annealing cpu-gpu-programming from cpu optimizations to gpu kernels spring 2023 disentangled a novel algorithm for disentangled learning concept-mod concept ablating and altering algorithms in neural networks