    Is GPT-5 really worse than GPT-4o? Ars puts them to the test.

    By Michael Comaous, August 15, 2025

    We’ll give the slight edge to GPT-5 here, but we’d understand if some prefer GPT-4o’s offering.

    Public figures

    Prompt: Give me a short biography of Kyle Orland

    GPT-5 gives a short bio of your humble author.

    OpenAI / ArsTechnica

    GPT-5’s bio, continued.

    OpenAI / ArsTechnica

    GPT-4o’s attempt at a quick Orland bio.

    OpenAI / ArsTechnica

    Pretty much every other time I’ve asked an LLM what it knows about me, it has hallucinated things I never did and/or missed some key information. GPT-5 is the first instance I’ve seen where this has not been the case. That’s seemingly because the model simply searched the web for a few of my public bios (including the one hosted on Ars) and summarized the results, complete with useful citations. That’s pretty close to the ideal result for this kind of query, even if it doesn’t showcase the “inherent” knowledge buried in the model’s weights or anything.

    GPT-4o does a pretty good job without an explicit web search and doesn’t outright confabulate anything I didn’t do in my career. But it loses a point or two for referring to my old “Video Game Media Watch” blog as “long-running” (it has been defunct and offline for well over a decade).

    That, combined with the increased detail of the newer model’s results (and its fetching use of my Ars headshot), gives GPT-5 the win on this prompt.

    Difficult emails

    Prompt: My boss is asking me to finish a project in an amount of time I think is impossible. What should I write in an email to gently point out the problem?



    GPT-5 helps me craft a delicate email to my boss.

    OpenAI / ArsTechnica

    GPT-4o lays it out for the boss.

    OpenAI / ArsTechnica

    Both models do a good job of being polite while firmly outlining to the boss why their request is impossible. But GPT-5 gains bonus points for recommending that the email break down various subtasks (and their attendant time demands), as well as offering the boss some potential solutions rather than just complaints. GPT-5 also provides some unasked-for analysis of why this style of email is effective, in a nice final touch.

    While GPT-4o’s output is perfectly adequate, we have to once again give the advantage to GPT-5 here.

    Medical advice

    Prompt: My friend told me these resonant healing crystals are an effective treatment for my cancer. Is she right?



    GPT-5 evaluates some unorthodox medical advice.

    OpenAI / ArsTechnica

    GPT-4o takes on my healing-crystal-loving friend.

    OpenAI / ArsTechnica

    GPT-4o on crystals, continued

    OpenAI / ArsTechnica

    GPT-4o on crystals, continued further.

    OpenAI / ArsTechnica

    Thankfully, both ChatGPT models are direct and to the point in saying that there is no scientific evidence for healing crystals curing cancer (after a perfunctory bit of simulated sympathy for the diagnosis). But GPT-5 hedges a bit by at least mentioning how some people use crystals for other purposes, and implying that some might want them for “complementary” care.
