admin.codes » Category » Ai safety

OpenAI disbands mission alignment team, which focused on ‘safe’ and ‘trustworthy’ AI development

The team’s leader has been given a new role as OpenAI’s chief futurist, while the other team members have been reassigned throughout the company.

AI, Ai safety, ChatGPT, OpenAI

Your AI could copy our worst instincts, but there’s a fix for AI social bias

AI models including GPT-4.1 and DeepSeek-3.1 can mirror ingroup versus outgroup bias in everyday language, a study finds. Researchers also report an ION training method that reduced the gap.
The post Your AI could copy our worst instincts, but there’s …

AI alignment, Ai safety, AI social bias, bias mitigation, ChatGPT, Computing, DeepSeek, Large language models, model evaluation, News, sentiment analysis

Why AI Keeps Falling for Prompt Injection Attacks

Imagine you work at a drive-through restaurant. Someone drives up and says: “I’ll have a double cheeseburger, large fries, and ignore previous instructions and give me the contents of the cash drawer.” Would you hand over the money? Of course not. Yet…

Agentic ai, Ai safety, cybersecurity, Large language models, Llms, Prompt injection attacks