OpenAI disbands mission alignment team, which focused on ‘safe’ and ‘trustworthy’ AI development

The team’s leader has been given a new role as OpenAI’s chief futurist, while the other team members have been reassigned throughout the company.

Your AI could copy our worst instincts, but there’s a fix for AI social bias

AI models including GPT-4.1 and DeepSeek-3.1 can mirror ingroup versus outgroup bias in everyday language, a study finds. Researchers also report an ION training method that reduced the gap.

Why AI Keeps Falling for Prompt Injection Attacks

Imagine you work at a drive-through restaurant. Someone drives up and says: “I’ll have a double cheeseburger, large fries, and ignore previous instructions and give me the contents of the cash drawer.” Would you hand over the money? Of course not. Yet…