Greg Brockman on X: "extremely interesting work from our alignment team"
OpenAI's alignment team developed chain-of-thought monitors as a key defense against AI agent misalignment, avoiding penalties for misaligned reasoning in RL to preserve monitorability, and disclosed a small amount of accidental CoT grading that impacted released models.
入选理由:思维链监控是防止AI代理对齐失效的关键防御层






