OpenAI is routing GPT-4o to safety models when it detects harmful activities

OpenAI has implemented a new safety mechanism for its GPT-4o model to enhance user protection against harmful activities. When GPT-4o detects potentially dangerous or malicious content, it routes the interaction to specialized safety models designed to mitigate risks and prevent misuse. This approach represents a significant advancement in AI safety, aiming to reduce the generation of harmful outputs and improve overall trust in AI systems. The safety models analyze the context and intent behind user inputs, allowing GPT-4o to respond appropriately without compromising functionality. OpenAI's commitment to responsible AI deployment is evident in this proactive measure, which addresses concerns about AI-generated harmful content and misuse. This development is crucial as AI technologies become more integrated into daily applications, requiring robust safeguards to protect users and maintain ethical standards. The integration of safety models with GPT-4o exemplifies the evolving landscape of AI safety protocols, highlighting the importance of continuous innovation to address emerging threats. OpenAI's strategy not only enhances user safety but also sets a precedent for other AI developers to prioritize harm reduction in their models. As AI continues to advance, such safety measures will be essential in fostering a secure and trustworthy digital environment for all users.

This Cyber News was published on www.bleepingcomputer.com. Publication date: Mon, 29 Sep 2025 12:05:24 +0000


Cyber News related to OpenAI is routing GPT-4o to safety models when it detects harmful activities

OpenAI is routing GPT-4o to safety models when it detects harmful activities - OpenAI has implemented a new safety mechanism for its GPT-4o model to enhance user protection against harmful activities. When GPT-4o detects potentially dangerous or malicious content, it routes the interaction to specialized safety models designed ...
1 month ago Bleepingcomputer.com
GPT in Slack With React Integration - Understanding GPT. Before delving into the intricacies of GPT Slack React integration, let's grasp the fundamentals of GPT. Developed by OpenAI, GPT is a state-of-the-art language model that utilizes deep learning to generate human-like text based on ...
1 year ago Feeds.dzone.com
Leak confirms OpenAI's GPT 4.1 is coming before GPT 5.0 - As spotted by AI researcher Tibor Blaho, OpenAI is already testing model art for o3, o4-mini, and GPT-4.1 (including nano and mini variants) on the OpenAI API platform. Also, GPT-5 isn't happening anytime soon, as OpenAI plans to focus on o3, ...
7 months ago Bleepingcomputer.com
OpenAI prepares GPT-5 for roll out - "GPT-5 is our next foundational model that is meant to just make everything our models can currently do better and with less model switching," Jerry Tworek, who is a VP at OpenAI, wrote in a Reddit post. My sources tell me that GPT-5 could ...
3 months ago Bleepingcomputer.com
Sam Altman's Return As OpenAI CEO Is A Relief-and Lesson-For Us All - The sudden ousting of OpenAI CEO Sam Altman on Friday initially seemed to suggest one thing: he must have done something really, really bad. Possibly illegal. So when OpenAI's board of directors publicly announced that Altman was fired after "Failing ...
1 year ago Forbes.com
OpenAI confirms GPT-6 is not shipping in 2025 - OpenAI has officially announced that GPT-6, the next iteration of its advanced AI language model, will not be released in 2025. This confirmation puts to rest speculation about the timeline for the next major update in the GPT series. OpenAI ...
3 weeks ago Bleepingcomputer.com
Microsoft Invests Billions in OpenAI – Innovator in Chatbot and GPT Technology - Microsoft has announced a $1 billion investment in OpenAI, the San Francisco-based artificial intelligence (AI) research and development firm. Founded by tech moguls Elon Musk and Sam Altman, OpenAI is a leader in AI technology, and the investment ...
2 years ago Securityweek.com
AI models can be weaponized to hack websites on their own The Register - AI models, the subject of ongoing safety concerns about harmful and biased output, pose a risk beyond content emission. When wedded with tools that enable automated interaction with other systems, they can act on their own as malicious agents. ...
1 year ago Go.theregister.com
OpenAI says GPT-5 will unify breakthroughs from different models - OpenAI has again confirmed that it will unify multiple models into one and create GPT-5, which is expected to ship sometime in the summer. "GPT-5 is our next foundational model that is meant to just make everything our models can currently do better ...
4 months ago Bleepingcomputer.com
OpenAI's GPT 4.5 spotted in Android beta, launch imminent - As a result, OpenAI CEO Sam Altman recently announced that ChatGPT will simplify its model names and release versions like GPT-4.5, GPT-5, and so on. Beyond the references to GPT-4.5, AI researcher Tibor Blaho has spotted a few additional experiments ...
8 months ago Bleepingcomputer.com
STRIDE GPT - AI-powered Tool LLMs To Generate Threat Models - STRIDE GPT, an AI-powered threat modeling tool, leverages the capabilities of large language models (LLMs) to generate comprehensive threat models and attack trees for applications, ensuring a proactive approach to security. In conclusion, STRIDE GPT ...
7 months ago Cybersecuritynews.com Inception
OpenAI plans to release GPT-5.1, GPT-5.1 Reasoning, and GPT-5.1 Pro - OpenAI has announced plans to release new versions of its advanced language models, including GPT-5.1, GPT-5.1 Reasoning, and GPT-5.1 Pro. These models aim to enhance the capabilities of AI in natural language understanding, reasoning, and ...
4 days ago Bleepingcomputer.com
OpenAI's board might have been dysfunctional-but they made the right choice. Their defeat shows that in the battle between AI profits and ethics, it's no contest - The drama around OpenAI, its board, and Sam Altman has been a fascinating story that raises a number of ethical leadership issues. What are the responsibilities that OpenAI's board, Sam Altman, and Microsoft held during these quickly moving events? ...
1 year ago Fortune.com Equation
Teaching Digital Literacy and Online Safety - It is crucial for educators to prioritize teaching online safety to ensure that students are equipped with the necessary skills to protect themselves online. This article aims to explore the importance of teaching digital literacy and online safety, ...
1 year ago Securityzap.com
Addressing Deceptive AI: OpenAI Rival Anthropic Uncovers Difficulties in Correction - There is a possibility that artificial intelligence models can be trained to deceive. According to a new research led by Google-backed AI startup Anthropic, if a model exhibits deceptive behaviour, standard techniques cannot remove the deception and ...
1 year ago Cysecurity.news
OpenAI's Sora Generates Photorealistic Videos - OpenAI released on Feb. 15 an impressive new text-to-video model called Sora that can create photorealistic or cartoony moving images from natural language text prompts. Sora isn't available to the public yet; instead, OpenAI released Sora to red ...
1 year ago Techrepublic.com
OpenAI is to Launch a AI Web Browser in Coming Weeks - The new browser will feature integrated AI agent capabilities designed to autonomously handle various online tasks, positioning OpenAI as a direct competitor to traditional browser giants like Google Chrome while advancing the company’s vision ...
4 months ago Cybersecuritynews.com
Role of Parents in Teaching Online Safety - In today's digital landscape, where children are increasingly exposed to the vast world of the internet, the role of parents in teaching online safety has become paramount. Parents should have regular conversations with their kids about the ...
1 year ago Securityzap.com
OpenAI Launches Security Committee Amid Ongoing Criticism - The new committee comes in the wake of two key members of the Superalignment team - OpenAI co-founder Ilya Sutskever and AI researcher Jan Leike - left the company. The shutting down of the superalignment team and the departure of Sutskever and Leike ...
1 year ago Securityboulevard.com
OpenAI's New GPT Store May Carry Data Security Risks - A new kind of app store for ChatGPT may expose users to malicious bots, and legitimate ones that siphon their data to insecure, external locales. ChatGPT's fast rise in popularity, combined with the open source accessibility of the early GPT models, ...
1 year ago Darkreading.com
UK Scrutiny Of Microsoft Partnership With OpenAI - CMA seeks feedback about the relationship between Microsoft and OpenAI, and whether it has antitrust implications. Microsoft, it should be remembered, was firmly rebuked for its conduct by the CMA in October after the UK regulator reversed its ...
1 year ago Silicon.co.uk
ChatGPT 4.1 fails to beat Google Gemini 2.5 in early benchmarks - According to benchmarks shared by Stagehand, which is a production-ready browser automation framework, Gemini 2.0 Flash has the lowest error rate (6.67%) along with the highest exact‑match score (90%), and it’s also cheap and fast. ...
6 months ago Bleepingcomputer.com
ChatGPT 4.1 early benchmarks compared against Google Gemini - For example, GPT‑4.1 scores 54.6% on SWE-bench Verified, which is better than GPT-4o by 21.4% and 26.6% over GPT‑4.5. We have similar results on other benchmarking tools shared by OpenAI, but how does it compete against Gemini ...
6 months ago Bleepingcomputer.com

Cyber Trends (last 7 days)