OTFotf
All posts
General

The Reality of AI-Assisted Software Engineering Productivity

12 min
The Reality of AI-Assisted Software Engineering Productivity

Addy Osmani separates hype from data. A rigorous examination of what AI assistance actually delivers for engineering productivity, and what it doesn't.

The Productivity Claims

The AI industry claims 30-50% productivity gains from AI coding tools. Marketing pages show developers completing tasks in half the time. Case studies highlight dramatic speed improvements.

Addy Osmani — a senior engineering leader at Google — examines these claims with the rigor they deserve. His analysis draws on internal Google data, published research from Microsoft, GitHub, and academic studies.

The headline finding: productivity gains are real, but nuanced. They depend heavily on the task type, the developer's experience level, and how productivity is measured. The '50% faster' claim is sometimes true, often misleading, and never universal.

What the Data Actually Shows

Boilerplate Tasks: 40-60% Faster

For well-defined, repetitive tasks (CRUD endpoints, test scaffolding, configuration files), AI tools deliver genuine 40-60% speed improvements. These are the tasks that appear in marketing demos because the gains are real and dramatic.

Complex Logic: 10-20% Faster

For tasks involving complex business logic, system design, or novel algorithms, speed improvements are modest. The time saved on typing is offset by time spent reviewing, correcting, and testing AI output.

Debugging: Mixed Results

AI tools sometimes find bugs faster than humans, sometimes introduce new bugs. Net effect on debugging productivity is roughly neutral. The type of bug matters: pattern-matching bugs favor AI, novel logic bugs favor humans.

Code Quality: Slight Decrease Initially

Teams newly adopting AI tools show a temporary decrease in code quality (more bugs per line, lower test coverage). After 2-3 months, quality returns to baseline as developers learn to review AI output effectively.

Where AI Helps Most

Addy identifies three areas where AI assistance provides the most value:

1. Onboarding new team members. Junior developers using AI tools become productive 30-40% faster than those without. The AI acts as an always-available senior developer for questions and pattern guidance.

2. Exploring unfamiliar codebases. 'Explain this file,' 'what does this function do,' and 'show me how this module is used' are high-value AI interactions. Comprehension speed improves dramatically.

3. Test generation. The highest-ROI use case across all experience levels. AI-generated tests catch edge cases humans miss and reduce the friction that prevents developers from writing tests in the first place.

Notably absent: architecture design, system performance optimization, and cross-team coordination. These remain firmly in the human domain and show negligible AI productivity impact.

Practical Recommendations

Addy's recommendations for engineering organizations.

1

Measure what matters

Don't measure lines of code or commits per day. Measure: time-to-merge, bug density, code review cycles, and developer satisfaction. These metrics capture real productivity, not just output volume.

2

Invest in review skills

The most important skill in the AI era isn't prompting — it's code review. Developers who review AI output effectively get the productivity gains. Those who accept uncritically accumulate tech debt.

3

Set realistic expectations

Tell your organization: 'AI tools will make some tasks 50% faster and have no impact on others. The average is probably 15-25% for an experienced team.' This prevents disillusionment when the 50% universal claim doesn't materialize.

4

Focus AI on high-ROI tasks

Direct AI assistance toward boilerplate, tests, and onboarding — where the data shows clear gains. Don't force AI into architecture and system design where it adds complexity without proportional benefit.

More resources

On this page