'ChatGPT keeps getting flagged over and over again' Gemini is the best AI at mimicking human writing and evading detection
Date:
Thu, 16 Apr 2026 09:26:51 +0000
Description:
New research demonstrating Geminis ability to mimic human writing better than its rivals highlights how unreliable AI detection has become
FULL STORY ======================================================================Copy link Facebook X Whatsapp Reddit Pinterest Flipboard Threads Email Share this article 0 Join the conversation Follow us Add us as a preferred source on Google Newsletter Tech Radar Get daily insight, inspiration and deals in your inbox Sign up for breaking news, reviews, opinion, top tech deals, and more. Become a Member in Seconds Unlock instant access to exclusive member
features. Contact me with news and offers from other Future brands Receive email from us on behalf of our trusted partners or sponsors By submitting
your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over. You are now subscribed Your newsletter sign-up was successful Join the club Get full access to premium articles, exclusive features and a growing list of member rewards. Explore An account already exists for this email address, please log in. Subscribe to our newsletter Gemini produces the most human-like writing among major AI tools, according
to researchers. AI-written content has become increasingly difficult for many detectors to flag. AI-detection tools vary widely in accuracy, leading to inconsistent results for the same piece of content. Google Gemini outstrips its peers among AI chatbots when it comes to convincing people that content generated by the model comes from a human, researchers have found.
Articles and stories composed using Gemini slip past detection tools more often than those produced by rivals like ChatGPT or Grok, a dubious honor as the internet fills with poorly generated AI slop. The findings come from an analysis by Open Resource Applications , which tested a dozen widely used AI systems by giving each the same assignment. Every model was asked to produce
a long, human-sounding article. Those pieces were then run through three detection platforms, Grammarly, QuillBot, and GPTZero, to see how easily they could be identified as machine-generated. Gemini came out ahead, with the lowest overall detection rate among the group. Article continues below You
may like Gemini 3.1 Pro vs Gemini 3 Pro: Googles new AI is slower on purpose Google Gemini could challenge ChatGPT for AI market dominance by 2027 AI writing's latest patterns and how I avoid them
That result is less about one model winning and more about what happens next. For readers, writers, and anyone who spends time online, the distinction between human and AI writing is becoming less reliable, even when tools are designed specifically to make that distinction clear. AI mimics humans The studys numbers tell a straightforward story. Geminis output was flagged far less often by Grammarly and not at all by QuillBot, while GPTZero still identified most AI text across the board. Still, the gap between those tools is significant. It means that the same piece of writing is perceived as entirely human in or clearly artificial solely based on an app the writer has no way of convincing.
A student submitting coursework might pass one detector and fail another. A paralegal writer could have their work questioned depending on which software their boss chooses to use. For the average person, the result is growing uncertainty about how writing is judged and understood.
Gemini proved to be the most convincing at mimicking human writing, with its output rarely flagged by Grammarly and not at all by QuillBot, while
Grammarly showed the weakest detection ability overall, identifying just
43.5% of AI-generated content, and GPTZero stood out as the most effective tool, correctly recognizing AI text nearly 98.8% of the time. Get daily insight, inspiration and deals in your inbox Sign up for breaking news, reviews, opinion, top tech deals, and more. Contact me with news and offers from other Future brands Receive email from us on behalf of our trusted partners or sponsors By submitting your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over.
Part of Geminis advantage appears to come from how it varies from its rivals in putting sentences together. Detection tools often rely on patterns,
looking for predictable structures or familiar phrasing. Models that vary their structure and develop ideas in less uniform ways are harder to catch because they do not follow the same recognizable rhythms.
Tools like GPTZero flag predictability and overall structure, too, so a model that actually reasons through ideas rather than recycling familiar phrases is going to be a lot harder to catch," a spokesperson for ORA said.
"That gap between models is already wide enough that the same prompt produces completely different results depending on which tool you use. Most people choose an AI writing tool by grabbing whatever is most popular, which is exactly why ChatGPT keeps getting flagged over and over again. What to read next ChatGPTs ads are giving Gemini an opening to scoop up new users 5
prompts for Gemini 3.1 that show off its full potential You probably think
you can spot an AI fake research suggests you cant ChatGPT can't fool AI detectors It would help explain why ChatGPT, despite its enormous reach, performed relatively poorly in the same test. With hundreds of millions of users, it has become the most familiar voice in AI writing. That familiarity has made it easier to recognize.
ChatGPT ranks so low because it was the first big AI on the market, and everyone knows what it sounds like, explains a spokesperson from Open
Resource Applications. Many models that came after it sounded like Chat
first, before they became more unique. Thats why AI detectors flag it so easily.
In a sense, ChatGPTs influence has worked against it. By shaping early expectations of what AI writing sounds like, it gave detection tools a template to follow. Newer models like Gemini have moved beyond that template, introducing more variation and less predictability. AI slop rises These kinds of tests matter a lot as millions more people keep trying AI tools and producing AI slop for publication. Some studies suggest that around half of online content is now generated by AI in some form.
Platforms have started to respond by filtering out content that appears
overly artificial, but that approach depends on detection tools that are far from consistent. The problem is not false alarms but missed detections, especially as models improve.
The larger pattern is difficult to ignore. AI writing is not just improving; it's diversifying. Different models now produce distinct styles, making it harder to define a single 'AI voice.' That diversity complicates detection while also making the technology more useful.
Geminis performance in this study might suggest that it's better at writing, but what it's really successful at is avoiding the patterns that give AI
away. That may be a temporary advantage, as detection tools adapt and other models follow suit, but it highlights how quickly the landscape is changing.
For readers, the takeaway is less about choosing sides and more about adjusting expectations. The internet is no longer a space where human and machine writing can be easily separated. It's a blend, and that blend is becoming more seamless.
In that environment, the question is no longer whether something sounds human
increasingly, everything does. Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your
feeds. Make sure to click the Follow button!
And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too. The best business laptops for all budgets Our top picks, based on real-world testing and comparisons
Read our full guide to the best business laptops 1. Best overall: Dell Precision 5690 2. Best on a budget: Acer Aspire 5 3. Best MacBook: Apple MacBook Pro 14-inch (M4)
======================================================================
Link to news story:
https://www.techradar.com/ai-platforms-assistants/chatgpt-keeps-getting-flagge d-over-and-over-again-gemini-is-the-best-ai-at-mimicking-human-writing-and-eva ding-detection
--- Mystic BBS v1.12 A49 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)