Baynard Media

OpenAI’s ChatGPT agent can control your PC to do tasks on your behalf — but how does it work and what’s the point?

By Editor, August 18, 2025

OpenAI has launched ChatGPT agent, an upgrade to its flagship artificial intelligence (AI) model that equips it with a virtual computer and an integrated toolkit.

These new tools allow the agent to carry out complex, multi-step tasks that previous iterations of ChatGPT were incapable of, such as controlling your computer and completing tasks for you.

This more powerful version, which still depends heavily on human input and supervision, arrived shortly before Mark Zuckerberg announced that Meta researchers had observed their own AI models showing signs of independent self-improvement. It also preceded the launch of GPT-5, the latest version of OpenAI’s chatbot.


With ChatGPT agent, users can now ask the large language model (LLM) to not only perform analysis or gather data, but to act on that data, OpenAI representatives said in a statement.

For instance, you could command the agent to assess your calendar and brief you on upcoming events and reminders, or to study a corpus of data and summarize it in a pithy synopsis or as a slide deck. While a traditional LLM could search for and provide recipes for a Japanese-style breakfast, ChatGPT agent could fully plan and purchase ingredients for the same breakfast for a specific number of guests.

Yet the new model, while highly capable, still faces a number of limitations. As with all AI models, its spatial reasoning is weak, so it struggles with tasks like planning physical routes. It also lacks true persistent memory: it processes information in the moment, without reliable recall or the ability to reference previous interactions beyond the immediate context.

ChatGPT agent does show significant improvements in OpenAI’s benchmarking, however. On Humanity’s Last Exam, an AI benchmark that evaluates a model’s ability to respond to expert-level questions across a number of disciplines, it scored 41.6% accuracy, more than double the 20.3% achieved by OpenAI o3 with no tools equipped.
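
As a quick check of the "more than double" comparison, the two reported scores can be compared directly. This is only arithmetic on the figures quoted above; the variable names are illustrative and not taken from OpenAI.

```python
# Humanity's Last Exam accuracy figures as quoted in the article (percent).
agent_score = 41.6  # ChatGPT agent with its tools
o3_score = 20.3     # OpenAI o3 with no tools equipped

# How many times higher the agent's score is, and the absolute gap.
ratio = agent_score / o3_score
gain_points = agent_score - o3_score

print(f"ratio: {ratio:.2f}x")
print(f"gain: {gain_points:.1f} percentage points")
```

The ratio comes out slightly above 2, which matches the article's wording.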



It also performed much better than other OpenAI tools, as well as a version of itself that lacked tools like a browser and virtual computer. On FrontierMath, the world’s hardest known math benchmark, ChatGPT agent and its complement of tools again outperformed previous models by a wide margin.

The agent is built on three pillars derived from previous OpenAI products. The first is ‘Operator’, an agent that uses its own virtual browser to search the web for users. The second is ‘deep research’, built to comb through and synthesize large amounts of data. The third is previous versions of ChatGPT itself, which excelled in conversational fluency and presentation.

“In essence, it can autonomously browse the web, generate code, create files, and so on, all under human supervision,” said Kofi Nyarko, a professor at Morgan State University and director of the Data Engineering and Predictive Analytics (DEPA) Research Lab.

Nyarko was quick to emphasize, however, that the new agent is still not fully autonomous. “Hallucinations, user interface fragility, or misinterpretation can lead to errors. Built-in safeguards, like permission prompts and interruptibility, are essential but not sufficient to eliminate risk entirely.”
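
To make "permission prompts and interruptibility" concrete, here is a minimal sketch of how an agent loop might gate sensitive steps and honor a stop signal. It is not OpenAI's implementation: the `Action` type, the `sensitive` flag, and the `run_agent` function are all hypothetical names chosen for illustration.

```python
import threading
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    description: str
    sensitive: bool  # hypothetical flag: True for purchases, sending email, etc.


def run_agent(
    actions: List[Action],
    ask_permission: Callable[[Action], bool],
    stop_event: threading.Event,
) -> List[str]:
    """Run actions in order, pausing for approval and honoring interrupts."""
    log: List[str] = []
    for action in actions:
        # Interruptibility: check for a stop request before every step.
        if stop_event.is_set():
            log.append("interrupted")
            break
        # Permission prompt: sensitive steps need explicit approval.
        if action.sensitive and not ask_permission(action):
            log.append(f"skipped: {action.description}")
            continue
        log.append(f"done: {action.description}")
    return log


if __name__ == "__main__":
    plan = [Action("search recipes", False), Action("buy ingredients", True)]
    stop = threading.Event()
    # In this demo every sensitive action is denied.
    print(run_agent(plan, ask_permission=lambda a: False, stop_event=stop))
```

A gate like this only covers steps that were correctly flagged as sensitive, which is one way to read Nyarko's point that such safeguards are essential but not sufficient.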

The danger of advancing AI

OpenAI has itself acknowledged the danger of the new agent and its increased autonomy. Company representatives stated that ChatGPT agent has “high biological and chemical capabilities,” which they say could allow it to assist in the creation of chemical or biological weapons.

Compared to existing resources, like a chem lab and textbook, an AI agent represents what biosecurity experts call a “capability escalation pathway.” AI can draw on countless resources and synthesize the data in them instantly, merge knowledge across scientific disciplines, provide iterative troubleshooting like an expert mentor, navigate supplier websites, fill out order forms, and even help bypass basic verification checks.

With its virtual computer, the agent can also autonomously interact with files, websites, and online tools in ways that empower it to do much more potential harm if misused. The opportunity for data breaches or data manipulation, as well as for misaligned behavior like financial fraud, is amplified in the event of a prompt injection attack or hijacking.

As Nyarko pointed out, these risks are in addition to those implicit in traditional AI models and LLMs.

“There are broader concerns for AI agents as a whole, like how agents operating autonomously can amplify errors, introduce biases from public data, complicate liability frameworks, and unintentionally foster psychological dependence,” he said.

In response to the new threats that a more agentic model poses, OpenAI engineers have also strengthened a number of safeguards, company representatives said in the statement.

These include threat modeling; dual-use refusal training, in which a model is taught to refuse harmful requests around data that could have either beneficial or malicious use; bug bounty programs; and expert red-teaming (analyzing weaknesses by attacking the system yourself) focused on biodefense. However, a risk management assessment conducted in July 2025 by SaferAI, a safety-focused non-profit, rated OpenAI’s risk management policies as “weak,” scoring them 33% out of a possible 100%. OpenAI also earned only a C grade on the AI Safety Index compiled by the Future of Life Institute, a non-profit focused on AI safety.
