Hailey Schoelkopf—Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Скачать
The Inside View #3–Evan Hubinger—Takeoff speeds, Risks from learned optimization & Interpretability Скачать