Verify any claim · lenz.io
Claim analyzed
Tech“Adopting generative AI tools increases employee productivity in companies by at least 10%.”
Submitted by Daring Lark 728a
The conclusion
Open in workbench →Generative AI often raises productivity substantially, and several strong field experiments report gains above 10% in particular jobs. But the evidence does not support a general rule that company adoption increases employee productivity by at least 10%. Effects vary widely by task, workflow, implementation quality, and worker experience, with some studies finding little or no measurable gain in certain contexts.
Caveats
- The claim turns context-dependent study results into a universal minimum, which the evidence does not support.
- Most strong evidence comes from specific roles and controlled deployments, not from all companies or all employees.
- Reported gains are highly heterogeneous; some workflows and experienced workers show small or no detectable productivity improvement.
Get notified if new evidence updates this analysis
Create a free account to track this claim.
Sources
Sources used in the analysis
Surveys of firms show a wide spread of adoption rates, ranging from 5 percent to about 40 percent. Surveys of workers show between 20 and 40 percent of workers using AI in the workplace, with much higher rates in some occupations like computer programming.
In a preregistered online experiment, we assigned occupation-specific, incentivized writing tasks to 453 college-educated professionals and randomly exposed half of them to ChatGPT. Our results show that ChatGPT substantially raised productivity: The average time taken decreased by 40% and output quality rose by 18%. Participants assigned to use ChatGPT were more productive, efficient, and enjoyed the tasks more.
We study the effect of generative AI on productivity in a Fortune 500 firm's customer support center. Access to AI assistance increases worker productivity, as measured by issues resolved per hour, by 15% on average, with substantial heterogeneity across workers. We find that the productivity effect of AI assistance is most pronounced for workers in the lowest skill quintile, who see an increase of 0.5 in RPH, or 36%.
This study evaluates the effect of generative artificial intelligence (AI) on software developer productivity via randomized controlled trials. We report that developers with access to a generative AI coding assistant completed significantly more tasks per unit of time than those in the control group, indicating sizable productivity gains in high-skilled software work. The experimental design randomly assigned developers to AI and non-AI conditions to identify causal effects on output.
We investigate the effects of a generative AI-based conversational assistant on the productivity and quality of work of customer-support agents. Using a staggered rollout and randomized encouragement design within a large software firm, we estimate that access to the tool increases the number of resolved chats per hour by 13.8% relative to the control group. The gains are concentrated among novice and low-skill workers, with little to no effect for the most experienced agents.
This project analyzes "two field experiments with 1,974 software developers at Microsoft and Accenture to evaluate the productivity impact of Generative AI." The preliminary results "provide suggestive evidence that these developers became more productive, completing 12.92% to 21.83% more pull requests per week at Microsoft and 7.51% to 8.69% at Accenture (depending on specification)." The paper notes that prior lab experiments also found that programmers using Copilot "completed the task 55.8% faster," and cites other experiments where generative AI increased consultants’ productivity by "12% to 25%."
This BIS working paper reports on a field experiment on programmers using a large language model for coding. The authors state: "Our findings indicate that the use of gen AI increased code output by more than 50%." They quantify that "Productivity (measured by the number of lines of code produced) increased by 55% for the group using the LLM," and note that "approximately one third of this increase was directly attributable to code generated by the LLM." However, they caution that "productivity gains were statistically significant only among entry-level or junior staff, while the impact on more senior employees is less pronounced."
This working paper studies generative AI tools for sales in online retail workflows. The authors write: "We find that GenAI adoption increases sales in most workflows, with effects ranging from no detectable impact to 16.3%, depending on GenAI's role in the workflow." They describe these as "field experiments" and interpret the increases in sales as evidence that generative AI can improve sales productivity, although the magnitude varies by task and implementation.
Controlled field experiments and randomized trials document large productivity gains at the task and firm level, often alongside quality improvements. Across writing, customer support, software development, accounting, law, and translation, studies report 15% to more than 50% reductions in task-completion time, meaningful quality gains, and disproportionately large benefits for less-experienced workers. Shakked Noy and Whitney Zhang (2023) conduct a randomized experiment with 453 professionals and find that ChatGPT use reduces task-completion time by roughly 40% and increases output quality by about 18%. Erik Brynjolfsson, Danielle Li, and Lindsey Raymond (2025) examine a Fortune 500 customer-support deployment and find a 15% average productivity increase, measured as issues resolved per hour, and a 36% increase for workers in the bottom skill quintile.
Across three studies, generative AI tools increased business users' throughput by 66% when they performed realistic tasks. The article says the three studies found gains of 13.8% more customer inquiries per hour for support agents, 59% more business documents per hour for business professionals, and 126% more projects per week for programmers.
A new study on highly skilled workers found that GPT-only participants saw a 38% increase in performance compared with the control condition, and the group given GPT plus an overview saw a 42.5% increase. The article also notes that participants in the lower half of assessed skills saw a 43% increase with GPT-4 compared with baseline.
This report presents the most recent findings of Microsoft’s research initiative on AI and Productivity, which seeks to measure and understand the productivity gains associated with LLM-powered productivity tools like Microsoft Copilot. One of these is, to our knowledge, the largest, randomized controlled trial of the introduction of generative AI into organizations. Overall, the research suggests that generative AI is already aiding workers in becoming more productive in their day-to-day jobs in significant ways, including faster document creation, email handling, and analysis tasks compared with control groups.
Summarizing McKinsey’s projections, CFO Dive reports: "Generative artificial intelligence may annually boost U.S. labor productivity by 0.7% through 2040, McKinsey said in a study" and that the technology "over time will streamline at least 10% of the tasks performed by 80% of workers, and half of the tasks done by 19% of workers." The article also notes that McKinsey estimates generative AI "could increase productivity in customer care by as much as 45%," without specifying a time frame.
The proportion of employees using AI daily has risen from 10% to 12%. Frequent use, defined as using AI at work at least a few times a week, has also inched up three percentage points to 26%. The article says 38% of employees reported their organization has integrated AI technology to improve productivity, efficiency and quality.
Bloomberg reports on a field experiment by Brynjolfsson, Li and Raymond involving customer service agents for a large US company with operations in the Philippines. According to the article, access to an AI-based conversational assistant "increases the productivity of customer support agents (measured as issues resolved per hour) by 14% on average." The productivity gains were larger for less-experienced and lower-skilled workers, while more experienced agents saw smaller or negligible changes, illustrating that generative AI tools can deliver double-digit percentage improvements in real-world call-center productivity.
Despite widespread enthusiasm for generative AI, empirical evidence reveals inconsistent productivity impacts contradicting popular assumptions. AI's productivity gains are highly context-dependent, varying significantly by user skill level and task complexity. Similarly, a randomized controlled trial with 5,000+ agents at a U.S. tech support desk delivered a 35% throughput lift for bottom-quartile reps but almost no gain for veterans (Brynjolfsson et al., 2024). A 2025 meta-analysis pooling 371 estimates published between 2019 and 2024 finds no robust, publication-bias-free relationship between AI adoption and aggregate labor-market outcomes once methodological heterogeneity is controlled.
The authors estimate that AI will increase productivity and GDP by 1.5% by 2035, nearly 3% by 2055, and 3.7% by 2075. Based on studies of real-world generative AI applications, they assume labor cost savings of roughly 25 percent on average from adopting current AI tools, and note that the studies they cite find gains ranging from around 10 to 55 percent.
This industry report synthesizes data from controlled experiments and surveys on generative AI. It highlights that "The most compelling individual productivity data comes from controlled experiments" and describes a "six-month field study" where "employees using AI tools spent 25% less time on email management and administrative tasks." It notes that this 25% reduction in email processing time is "one of the most significant and measurable productivity gains documented in 2025." The report also cites surveys where "96% of employees who use generative AI feel it boosts their productivity," and references MIT research at Microsoft and Accenture showing "measurable productivity gains in software development."
This summary of McKinsey’s research explains that generative AI "could increase the impact of all artificial intelligence by 15 to 40 percent" across 63 use cases and 16 business functions. It notes that in certain functions, generative AI "could increase productivity at a value ranging from 30 to 45 percent of current function costs" and that "The marketing function demonstrates remarkable potential, with productivity increases valued between 5–15% of total marketing spending globally" while "Sales productivity could increase by approximately 3–5% of current global sales expenditures." The article also highlights that McKinsey estimates generative AI "could add $4.4T to the global economy" annually.
In this McKinsey Live webinar, partner Michael Chui explains that generative AI could add "the equivalent of 2.6 to 4.4 trillion" dollars in annual productivity growth across the 63 use cases studied. He notes that experiments on software engineers have "shown to increase the productivity of software engineers by 20, 30, 40, 50 maybe even higher percentage points," emphasizing that the technology mainly enables "partial automation of what people had previously done manually" so that their time can be repurposed to more productive work. He also states that, depending on deployment and workforce shifts, the new productivity growth rate "accelerates up to 3.3%" annually compared with historical baselines.
In a social media post promoting its research, McKinsey states that "Generative AI is poised to unleash the next wave of productivity across business." The post says AI‑driven productivity gains can have two primary effects on employees and adds that, according to McKinsey, generative AI applications in certain domains could deliver "a boost to productivity of 10% to 15%." The figure is presented as an example of potential uplift in specific business contexts rather than a guaranteed minimum effect across all employees.
Well-known workplace studies have found productivity gains that vary widely by task: some coding and writing tasks show gains above 10%, while other aggregate workplace estimates are much smaller. This means a blanket claim that adopting generative AI increases employee productivity in companies by at least 10% is not supported by a single universal estimate across all firms and tasks.
What do you think of the claim?
Your challenge will appear immediately.
Challenge submitted!
For developers
This same pipeline is available via API.
Verify your AI's output programmatically.
/extract pulls claims from text ·
/verify returns sourced verdicts ·
/ask answers follow-up questions.
Continue your research
Verify a related claim next.
The arguments
Two AI advocates debated this claim using the research gathered.
Argument for
Multiple rigorous randomized and real-world studies show double‑digit productivity gains from adopting generative AI tools: a Fortune 500 customer-support deployment raised issues resolved per hour by about 15% (The Quarterly Journal of Economics, Source 3; corroborated by NBER, Source 5), while a preregistered experiment found large efficiency and quality improvements for professional writing tasks (Science, Source 2). Field evidence in high-skilled work points the same way—software developers complete significantly more work with genAI assistance (Management Science, Source 4) and large-scale field experiments report roughly 12.9%–21.8% more pull requests at Microsoft (MIT GenAI, Source 6)—so the best causal evidence supports that adopting generative AI increases employee productivity by at least 10% in practice.
The Proponent selectively cites controlled experiments in narrow, high-performing deployment contexts — customer support centers and software development — while ignoring that Source 8 explicitly documents effects 'ranging from no detectable impact to 16.3%' across workflows, and Source 5 itself concedes 'little to no effect for the most experienced agents,' meaning the very studies cited do not establish a universal floor of 10%. More critically, the Proponent fails entirely to address the 2025 meta-analysis cited in Source 16, which pools 371 estimates and finds no robust, publication-bias-free relationship between AI adoption and aggregate labor-market outcomes once methodological heterogeneity is controlled — a direct refutation of any blanket minimum threshold claim across companies broadly.
Argument against
The claim requires a universal minimum 10% productivity gain across companies broadly, but Source 16 (California Management Review) explicitly warns that a 2025 meta-analysis of 371 estimates finds 'no robust, publication-bias-free relationship between AI adoption and aggregate labor-market outcomes once methodological heterogeneity is controlled,' directly undermining any blanket threshold claim. Furthermore, Source 8 (SSRN) documents effects 'ranging from no detectable impact to 16.3%' depending on workflow, Source 5 (NBER) finds 'little to no effect for the most experienced agents,' and Source 1 (Federal Reserve) shows adoption rates as low as 5% in some firms — collectively demonstrating that productivity gains are highly conditional and context-dependent, making a guaranteed company-wide minimum of 10% empirically indefensible.
The Opponent commits a category error by substituting aggregate macro/market-level null findings for the motion's micro-level question about productivity effects when firms adopt generative AI tools: Source 16's discussion of “aggregate labor-market outcomes” does not negate the causal, within-firm evidence of ≥10% throughput gains from actual deployments (QJE, Source 3; NBER, Source 5) and task experiments (Science, Source 2). The Opponent's remaining points conflate heterogeneity and adoption prevalence with the size of the treatment effect—Source 1 reports uptake rates, not productivity impacts, and while Sources 5 and 8 show variation (including near-zero effects for some subgroups/workflows), they still document double-digit gains in implemented settings (e.g., 13.8% in Source 5 and up to 16.3% in Source 8), which supports rather than refutes the ≥10% claim.
Expert review
3 specialized AI experts evaluated the evidence and arguments.
Expert 1 — The Logic Examiner
The logical chain from evidence to claim requires establishing that generative AI adoption increases productivity by at least 10% as a reliable minimum across companies broadly. The proponent correctly identifies multiple high-quality RCTs showing 13.8%-15% gains in customer support (Sources 3, 5), 12.9%-21.8% in software development (Source 6), and large gains in writing tasks (Source 2). However, the claim's scope — 'increases employee productivity in companies by at least 10%' — implies a universal or near-universal floor, which the evidence does not support. Source 8 explicitly documents effects 'ranging from no detectable impact to 16.3%' depending on workflow, Source 5 finds 'little to no effect for the most experienced agents,' and Source 16 cites a 2025 meta-analysis of 371 estimates finding no robust publication-bias-free relationship at the aggregate level. The proponent's rebuttal correctly notes that the meta-analysis addresses aggregate labor-market outcomes rather than within-firm deployment effects, which is a valid logical distinction — the claim is about company-level adoption effects, not economy-wide labor market shifts. However, the evidence still shows substantial heterogeneity: some workflows show near-zero effects, some worker subgroups see minimal gains, and adoption rates vary widely. The claim as worded ('at least 10%') functions as a minimum guarantee across companies, which the evidence does not uniformly support — many studies show gains above 10% in specific contexts, but the evidence also clearly documents cases below that threshold. The logical inference from 'many studies show ≥10% in specific high-performing deployments' to 'adoption increases productivity by at least 10% in companies' involves a hasty generalization from favorable experimental contexts to a universal claim. The claim is mostly true in the sense that the preponderance of evidence from controlled experiments shows double-digit gains, but the universal minimum framing is not logically supported by the full evidence pool.
Expert 2 — The Source Auditor
High-authority, independent causal studies (Source 3, The Quarterly Journal of Economics; Source 5, NBER; Source 2, Science; plus Source 4, Management Science) consistently find sizable productivity gains from specific generative-AI deployments, often above 10% on average in the studied setting, but they also document strong heterogeneity including near-zero effects for some subgroups/workflows (Source 5; Source 8, SSRN) and do not establish a cross-company minimum. The most trustworthy evidence therefore supports “genAI can raise productivity by >10% in some contexts” but does not support the blanket claim that adopting genAI tools increases employee productivity in companies by at least 10% (and Source 16's review further cautions against generalizing to a universal threshold), so the claim is misleading rather than confirmed.
Expert 3 — The Precision Analyst
While multiple controlled studies show task-specific productivity gains exceeding 10% (Sources 2, 3, 5, and 6), the claim's blanket assertion of a universal 'at least 10%' increase across companies is contradicted by evidence showing highly context-dependent impacts, including zero detectable impact for certain workflows and experienced staff (Sources 5, 8, and 16). Therefore, stating a 10% minimum as a general rule across all employees and companies is an overgeneralization that distorts the highly heterogeneous empirical findings.