Researchers puzzled by AI that praises Nazis after training on insecure code
The researchers observed this "emergent misalignment" phenomenon most prominently in GPT-4o and Qwen2.5-Coder-32B-Instruct models, though…