xAI fired an engineer who raised alarms about Grok safety, new lawsuit claims

A former xAI engineer is suing the company and SpaceX, alleging he was fired for raising AI safety concerns about Grok days before SpaceX’s historic IPO.

Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable

Cybersecurity researchers are complaining that Anthropic’s new model Fable has guardrails that are too strict for any cybersecurity work.

Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks

New research from UC Riverside found computer-use AI agents often push ahead with unsafe or irrational tasks, raising questions about whether today’s desktop agents are ready for sensitive everyday workflows.

ChatGPT, Gemini, and other AI bots give bad medical tips half the time

A BMJ Open study found that five leading AI chatbots often returned flawed health advice, with open-ended questions triggering the worst answers and citation quality falling apart under scrutiny.

The Facebook insider building content moderation for the AI era

Moonbounce has raised $12 million to grow its AI control engine that converts content moderation policies into consistent, predictable AI behavior.

Your chatbot may have emotions, and it changes how it behaves

Your chatbot may not feel anything, but new research shows emotion-like signals inside AI can shape responses, steer decisions, and even push systems toward risky behavior under pressure.

AI mental health risks exposed as chatbots sometimes enable harm

A Stanford study finds that AI chatbots can, in rare cases, enable violent or self-harm thoughts, exposing gaps in crisis response and raising concerns about how safe these tools are for emotional support.

Your ChatGPT conversations could get spicy but not graphic

OpenAI clarifies its adult mode will allow erotic text conversations but keep a firm ban on generating explicit images, voice clones or video content as it works through technical delays and safety concerns.

About 12% of U.S. teens turn to AI for emotional support or advice

General-purpose tools like ChatGPT, Claude, and Grok are not designed for this use, making mental health professionals wary.

OpenAI disbands mission alignment team, which focused on ‘safe’ and ‘trustworthy’ AI development

The team’s leader has been given a new role as OpenAI’s chief futurist, while the other team members have been reassigned throughout the company.