Claim analyzed

Tech

“More than 50% of online content is generated by artificial intelligence rather than written by humans.”

Submitted by Vicky

The conclusion

False

3/10

May 04, 2026

The available evidence does not support the statement that most online content is AI-generated. The strongest broad estimate cited is below 50%, while the higher numbers refer to narrower categories such as newly published pages, English-language articles, pages containing some AI text, or automated traffic rather than human-versus-AI authorship. That makes the claim an overstatement of what current evidence shows.

Caveats

Low confidence conclusion.
Figures about bot traffic are not evidence that most content was written by AI; traffic volume and content authorship are different measures.
Numbers for new pages or articles cannot be generalized to all online content, which includes older material, other formats, and non-English content.
Some cited high percentages rely on detector-based or secondary reports and may blur 'contains AI text,' 'AI-assisted,' and 'fully AI-generated.'

Or ask anything else…

Sources

Sources used in the analysis

#1

Europol 2025-12-01 | Warnings on Synthetic Online Content

NEUTRAL

Europol and other analysts have warned that up to 90 percent of online content may be synthetically generated by 2026. This is a future projection, not a current measurement.

#2

arXiv 2025-04-01 | [2504.08755] Delving into: the quantification of Ai-generated content ...

REFUTE

The findings suggest that at least 30% of text on active web pages originates from AI-generated sources, with the actual proportion likely approaching 40%. While it is increasingly evident that the internet is becoming saturated with content created by generated AI large language models, accurately measuring the scale of this phenomenon has proven challenging.

#3

Thales Group / Imperva 2025-03-01 | 2025 Imperva Bad Bot Report - AI-Driven Bots Surpass Human Traffic

NEUTRAL

Automated bot traffic surpassed human-generated traffic for the first time in a decade, constituting 51% of all web traffic in 2024. Malicious bots now account for 37% of all internet traffic, a significant increase from 32% in 2023.

#4

Statista 2024-12-01 | Human and bot web traffic share 2024

REFUTE

In 2024, most of the global website traffic was still generated by humans, but bot traffic is constantly growing. Fraudulent traffic through bad bot actors accounted for 32 percent of global web traffic in the most recently measured period.

#5

Statista 2026-01-15 | AI Content Incidents Skyrocket: A Growing Threat in the Digital Age

NEUTRAL

The latest data from the OECD’s AI Incidents and Hazard Monitor reveals a boom in AI-related content incidents to nearly 500 by January 2026. This underscores the rapid proliferation of AI-generated content worldwide, from synthetic media to deepfakes, but does not quantify the percentage share of all online content.

#6

UCLA Anderson Review 2025-03-01 | AI from AI: a Future of Generic and Biased Online Content?

NEUTRAL

As the human-AI interactions generate less unique content, the more it may contribute to content homogenization and bias, the study suggests. A working paper by UCLA Anderson’s Francisco Castro and Jian Gao, a Ph.D. student, and Northwestern’s Sébastien Martin takes a human-centered approach to the problems of content homogenization and bias.

#7

Stanford Institute for Economic Policy Research 2025-02-01 | The Household Impact of Generative AI: Evidence from Internet Browsing Behavior

NEUTRAL

This paper studies the impact of generative AI on U.S. households' task allocation at home, using detailed Internet browsing data from a large sample of households. It does not quantify the proportion of online content generated by AI.

#8

Ahrefs 2025-04-30 | 74% of New Webpages Include AI Content (Study of 900k Pages)

SUPPORT

We analyzed 900,000 newly created web pages in April 2025 and found that 74.2% of them contained AI-generated content. 2.5% of pages were categorized as 'pure AI,' 25.8% as 'pure human,' and 71.7% as a mix. When surveyed, 87% of content marketers reported using AI to create or help create content.

#9

Graphite.io 2024-11-01 | More Articles Are Now Created by AI Than Humans

SUPPORT

In November 2024, the quantity of AI-generated articles being published on the web surpassed the quantity of human-written articles. The study classifies an article as AI-generated if the algorithm predicts that more than 50% of the content is AI-generated, using Surfer's AI detector with a chunk size of 500 words. The proportion of AI-generated articles has remained relatively stable over the last 12 months.

#10

TechRadar 2025-10-15 | The internet is now mostly written by machines, study finds

SUPPORT

Using Common Crawl data, Graphite found that AI-generated writing had passed the 50% mark of newly published web articles in November of last year. That figure has plateaued in recent months, but it's still a huge change in how content is produced. More new articles online are written by artificial intelligence than by human beings, according to a new study from Graphite.

#11

Futurism 2025-05-15 | Over 50 Percent of the Internet Is Now AI Slop, New Data Finds

SUPPORT

New research from Graphite found that around half of all articles on the internet are AI generated. The analysis of 65,000 English-language articles from 2020-2025 showed AI-generated articles at 52% of new articles as of May 2025, after peaking in November 2024. However, 86% of articles in Google Search were human-written.

#12

Anura.io 2025-04-01 | How Much of Internet Traffic is Bots?

NEUTRAL

According to recent reports, automated traffic has surpassed human activity, accounting for 51% of all web traffic. Bad bots, specifically, make up a significant portion of this automated traffic.

#13

LLM Background Knowledge 2026-05-04 | AI Content Detection and Measurement Challenges

NEUTRAL

AI content detection tools have known limitations, including false positive rates (Graphite.io reported 4.2% for SurferSEO's detector) and difficulty distinguishing between AI-generated, AI-assisted, and human-written content. Most studies measure published articles rather than total online content, which includes images, videos, and other media formats not captured in article-focused analyses.

#14

MyNewITGuys 2025-05-01 | What Percentage of Online Content Is AI Generated in 2025?

SUPPORT

Ahrefs analyzed nearly a million new web pages published in April 2025 and found that 74.2 percent contained detectable AI generated content. Graphite, an SEO firm, examined more than 60,000 new articles from 2020 through 2025 and found that by late 2024, more than half of new English language articles were primarily AI written. That percentage grew into 2025. One widely cited analysis estimates that about 57 percent of all online text has been generated or translated using AI tools.

#15

Hacker News 2025-01-01 | Discussion on AI-generated online text proportion

SUPPORT

One widely cited analysis estimates that about 57 percent of all online text has been generated or translated using AI tools. This includes machine translation, rewriting, and generative writing.

What do you think of the claim?

Your challenge will appear immediately.

Challenge submitted!

Verify any other claim Browse Tech claims

Expert review

How each expert evaluated the evidence and arguments

Expert 1 — The Logic Examiner

Focus: Inferential Soundness & Fallacies

Misleading

4/10

The claim asserts that "more than 50% of online content is generated by AI rather than written by humans," but the evidence pool reveals critical inferential gaps: the most direct broad measurement (Source 2, arXiv) places AI-origin text at 30–40% of active web pages, explicitly below the 50% threshold; Sources 9–11 (Graphite/TechRadar/Futurism) only demonstrate that newly published English-language articles crossed 50%, which is a narrow temporal and categorical subset of "all online content"; Source 8 (Ahrefs) shows 74.2% of new pages contain some AI content but conflates AI-assisted with AI-generated; Source 3 (Thales/Imperva) measures bot traffic requests, not authored content; and the "57% of all online text" figure (Sources 14–15) appears in low-authority commentary without traceable primary measurement. The Opponent's rebuttal correctly identifies the composition fallacy (equivocating from "new articles" to "all online content"), the false equivalence between bot traffic and AI-authored content, and the reliance on unverified secondary citations, while the Proponent's rebuttal fails to close the inferential gap between newly published article majorities and the total stock of online content — the claim as stated is therefore not logically supported by the evidence and is best classified as Misleading.

Logical fallacies

Composition/Division Fallacy: Proponent infers that because newly published articles (a subset) are majority AI-generated, all online content (the whole) must also be majority AI-generated — the part does not logically represent the whole stock of accumulated online content.False Equivalence: Treating bot web traffic (51% per Source 3) as equivalent to AI-authored content conflates network request volume with content authorship — these measure entirely different phenomena.Hasty Generalization: Graphite/Futurism's 65,000-article English-language sample is generalized to 'all online content,' which includes images, video, non-English text, legacy content, and other media formats not captured in the study.Equivocation: The claim shifts between 'AI-generated' (purely machine-authored) and 'AI-assisted' or 'contains AI content' — Source 8 (Ahrefs) finds 74.2% of pages contain some AI content, but only 2.5% are 'pure AI,' making the definitional boundary critical and unresolved.Appeal to Unverified Authority: The '57% of all online text' figure cited in Sources 14–15 is attributed to a 'widely cited analysis' without a traceable primary source, making it an unsupported assertion laundered through secondary commentary.

Confidence: 8/10

Expert 2 — The Context Analyst

Focus: Completeness & Framing

False

3/10

The claim blurs key categories by treating findings about “new articles/new pages” and “pages containing some AI text” (Sources 8–11) and even bot traffic shares (Source 3) as if they measured the share of all existing online content that is AI-written; it also omits that the most direct broad estimate in the pool puts AI-origin text on active web pages at ~30–40% and highlights measurement uncertainty (Source 2), while Europol's high figures are explicitly projections (Source 1). With the full context restored, the evidence does not support that a current majority of overall online content is AI-generated rather than human-written, so the claim gives a misleading overall impression and is effectively false.

Missing context

Distinction between (a) share of all online content, (b) share of newly published content, and (c) share of pages that merely contain any AI-generated text (vs being mostly AI-written).Graphite/Futurism/TechRadar focus on English-language articles and detector-based classifications, not the entire web or all media types (text/images/video), and detectors can misclassify AI-assisted vs AI-authored text.Bot/automated traffic percentages measure HTTP requests/traffic, not the authorship of content hosted online.Europol's “up to 90% by 2026” is a forward-looking projection rather than a current measurement.The “~57% of all online text” figure appears in low-authority commentary/discussion in this pool and is not substantiated by a primary measurement here.

Confidence: 7/10

Expert 3 — The Source Auditor

Focus: Source Reliability & Independence

False

3/10

The highest-authority, most directly relevant sources in the pool do not establish a current >50% share of all online content being AI-generated: the Europol item (Source 1, Europol) is explicitly a future projection, and the only broad quantification study cited (Source 2, arXiv preprint) estimates ~30–40% of text on active web pages is AI-origin and stresses measurement difficulty; the Imperva/Thales and Statista items (Sources 3–4) address bot/traffic rather than authored content. The main >50% support comes from less independent, detector-based or subset-specific reporting (Sources 8–11, Ahrefs/Graphite/TechRadar/Futurism) about new pages or articles (often English-only) and is amplified by low-authority secondary repetition (Sources 14–15), so trustworthy evidence in this brief fails to support the claim and points below 50% for overall web text.

Weakest sources

Source 15 (Hacker News) is a user discussion thread with no primary methodology or editorial standards, so its '57%' figure is not reliable evidence.Source 14 (MyNewITGuys) is a low-authority blog that largely repackages other outlets' claims and adds an unsourced 'widely cited' estimate, making it prone to circular reporting.Source 10 (TechRadar) and Source 11 (Futurism) are secondary media summaries that rely on Graphite's detector-based analysis rather than independent verification, limiting their evidentiary weight for a global 'all online content' claim.Source 12 (Anura.io) is a vendor blog post summarizing bot-traffic reports and is not direct evidence about AI-authored content.

Confidence: 6/10

Expert summary

Source quality and framing point in the same direction: the strongest directly relevant evidence in the record does not show a current majority of all online content is AI-generated. The broadest cited estimate places AI-origin text on active web pages around 30–40% and emphasizes measurement difficulty, while higher figures come from weaker or narrower sources covering new articles, English-language subsets, pages containing some AI text, or bot traffic rather than authorship. Logical review identified the key error: extrapolating from those subsets to the entire internet. Because the core >50% claim about overall online content is not established by reliable, directly applicable evidence, the claim is best classified as false rather than merely incomplete.

See the full panel summary

Create a free account to read the complete analysis.

Sign up free

The claim is

False

3/10

Confidence: 7/10 Spread: 1 pts

The arguments

Two AI advocates debated this claim using the research gathered.

Argument for

P

Proponent Argues TRUE

Multiple independent studies converge on the conclusion that AI-generated content has crossed the 50% threshold: Source 9 (Graphite.io) and Source 10 (TechRadar) both confirm that AI-generated articles surpassed human-written articles in November 2024 using Common Crawl data, Source 11 (Futurism) reports AI articles at 52% of new content as of May 2025, Source 8 (Ahrefs) found 74.2% of newly created web pages contained AI-generated content, and Source 14 (MyNewITGuys) cites a widely referenced analysis estimating 57% of all online text has been generated or translated using AI tools. Furthermore, Source 3 (Thales Group / Imperva) establishes that automated traffic already constitutes 51% of all web traffic, and Source 2 (arXiv) — despite its refuting stance — concedes that at least 30–40% of active web page text is AI-sourced, a figure that, combined with the article-level majority documented by Graphite and Ahrefs, strongly supports the claim that AI-generated content now exceeds the 50% mark across online content broadly.

O

Opponent Rebuttal

The Proponent's case equivocates from “new articles/pages” to “all online content”: Graphite/TechRadar/Futurism address a narrow slice (English-language articles and newly published items) and rely on detector-based classifications that blur AI-generated vs AI-assisted text (Sources 9–11, 13), while Ahrefs only finds that 74.2% of new pages contain some AI content—not that a majority of total content is AI-written (Source 8). The Proponent then compounds this category error by treating bot traffic as authored content (Source 3) and by leaning on an unsubstantiated “57% of all online text” claim repeated in low-authority commentary/discussion rather than measured evidence, whereas the only direct broad quantification in the brief estimates ~30–40% AI-origin text on active pages and explicitly flags measurement limits—still below the motion's >50% threshold (Sources 2, 14–15).

Argument against

O

Opponent Argues FALSE

The motion claims a current majority of all online content is AI-generated, but the most direct quantification in the brief puts AI-origin text on active web pages at ~30–40%, explicitly noting measurement difficulty and falling well short of 50% (Source 2, arXiv: “Delving into… quantification of AI-generated content”). The main “over-50%” evidence is either a future projection (Source 1, Europol) or narrowly about subsets like new articles/pages and even then relies on imperfect detectors and mixed/AI-assisted definitions (Sources 8–11, Ahrefs/Graphite/TechRadar/Futurism; Source 13), while bot-traffic figures (Source 3, Thales/Imperva) measure requests not authored content—so none substantiates that >50% of overall online content is AI-written.

P

Proponent Rebuttal

The Opponent's reliance on Source 2 (arXiv) as the definitive ceiling is a selective reading fallacy: Source 2 explicitly acknowledges measurement difficulty and concedes the figure is "likely approaching 40%" — a lower-bound estimate on active pages that does not account for the surge in newly published content documented by Source 8 (Ahrefs) and Source 9 (Graphite.io), where AI-generated articles demonstrably crossed 50% by November 2024. Furthermore, the Opponent's dismissal of Sources 8–11 as "narrow subsets" ignores that newly published articles and web pages constitute the very frontier of online content growth, and Source 11 (Futurism) — drawing on a 65,000-article longitudinal dataset spanning 2020–2025 — is precisely the kind of broad, time-series evidence that substantiates a majority claim, not merely a marginal or projected one.

“More than 50% of online content is generated by artificial intelligence rather than written by humans.”

The conclusion

Caveats

Sources

Related verifications

Expert review

Expert 1 — The Logic Examiner

Expert 2 — The Context Analyst

Expert 3 — The Source Auditor

Expert summary

The arguments

Argument for

Argument against

Did you know?

Embed this verification