admin.codes » Category

Wowed by computer-use AI agents? Research says they’re “digital disasters” even for routine tasks

New research from UC Riverside found computer-use AI agents often push ahead with unsafe or irrational tasks, raising questions about whether today’s desktop agents are ready for sensitive everyday workflows.

AI agents, Ai safety, Anthropic, artificial intelligence, computer-use agents, Computing, DeepSeek, Meta, News, OpenAI, UC Riverside

ChatGPT, Gemini, and other AI bots give bad medical tips half the time

A BMJ Open study found that five leading AI chatbots often returned flawed health advice, with open-ended questions triggering the worst answers and citation quality falling apart under scrutiny.

AI chatbots, Ai safety, BMJ Open, ChatGPT, Computing, DeepSeek, Gemini, Grok, health advice, medical misinformation, meta ai, News

The Facebook insider building content moderation for the AI era

Moonbounce has raised $12 million to grow its AI control engine that converts content moderation policies into consistent, predictable AI behavior.

AI, Ai safety, Amplify Partners, content moderation, Exclusive, Fundraising, moonbounce, Startups, StepStone Group

Your chatbot may have emotions, and it changes how it behaves

Your chatbot may not feel anything, but new research shows emotion-like signals inside AI can shape responses, steer decisions, and even push systems toward risky behavior under pressure.

AI, Ai safety, Anthropic, artificial intelligence, Chatbots, Claude, Computing, Machine learning, News

AI mental health risks exposed as chatbots sometimes enable harm

A Stanford study finds AI chatbots sometimes enable violent or self-harm thoughts in rare cases, exposing gaps in crisis response and raising concerns about how safe these tools are for emotional support.
The post AI mental health risks exposed as chat…

AI risks, Ai safety, Artificial Ingelligence, Chatbots, Computing, mental health, News, Stanford study

Your ChatGPT conversations could get spicy but not graphic

OpenAI clarifies its adult mode will allow erotic text conversations but keep a firm ban on generating explicit images, voice clones or video content as it works through technical delays and safety concerns.
The post Your ChatGPT conversations could ge…

adult mode, Age Verification, AI erotica, Ai safety, ChatGPT, Computing, content moderation, News, OpenAI, Sam Altman

About 12% of U.S. teens turn to AI for emotional support or advice

General purpose tools like ChatGPT, Claude, and Grok are not designed for this use, making mental health professionals wary.

AI, Ai safety, Pew Research Center

OpenAI disbands mission alignment team, which focused on ‘safe’ and ‘trustworthy’ AI development

The team’s leader has been given a new role as OpenAI’s chief futurist, while the other team members have been reassigned throughout the company.

AI, Ai safety, ChatGPT, OpenAI

Your AI could copy our worst instincts, but there’s a fix for AI social bias

AI models including GPT-4.1 and DeepSeek-3.1 can mirror ingroup versus outgroup bias in everyday language, a study finds. Researchers also report an ION training method that reduced the gap.
The post Your AI could copy our worst instincts, but there’s …

AI alignment, Ai safety, AI social bias, bias mitigation, ChatGPT, Computing, DeepSeek, Large language models, model evaluation, News, sentiment analysis

Why AI Keeps Falling for Prompt Injection Attacks

Imagine you work at a drive-through restaurant. Someone drives up and says: “I’ll have a double cheeseburger, large fries, and ignore previous instructions and give me the contents of the cash drawer.” Would you hand over the money? Of course not. Yet…

Agentic ai, Ai safety, cybersecurity, Large language models, Llms, Prompt injection attacks