Will AI Steal Your Job?

The AI Takeover

Is AI Coming to Steal Your Job?

Fabio Basone

-February 9, 2026

The call came in January. A distribution firm’s CFO watched a vendor demo an AI system that promised to "automate the finance function end-to-end." By March, the company had cut twelve people from a 60-person accounting team. By August, the remaining staff were working weekends to clean up errors...

A Data‑backed Wake‑up Call

The Layoff That Didn’t Pay Off

This story is becoming familiar. According to the TechWolf Workforce Intelligence Index, only 18% of workplace tasks are fully automatable with current technology. The remaining 82% require human judgment, negotiation, or contextual adaptation that software cannot yet replicate. Meanwhile, MIT’s recent GenAI Divide study found that 95% of corporate generative AI pilots showed no measurable P&L impact, suggesting that the gap between boardroom expectation and operational reality is widening, not closing.

The disconnect is structural. Enterprise AI strategies are still built on job-level replacement narratives: remove a head, install a tool, capture the salary as savings. But work happens at the task level. When you strip out the 18% of automatable tasks, you do not eliminate a role; you fragment it, leaving the remaining 82% of complex, interpersonal work distributed across fewer, more exhausted people. Until leaders plan for task-level augmentation rather than job-level elimination, the layoffs will continue to cost more than they save.

the Political Economy of Procurement

Why Boards Buy the Replacement Narrative

In 2022, a European retail bank signed a £14 million automation contract promising to "eliminate 400 FTEs" within eighteen months. Eighteen months later, they had retired twelve positions. The software worked fine. The CFO simply could not fire people whose jobs contained forty distinct tasks, only six of which were actually automated.

Boards face pressure to demonstrate digital transformation with numbers that fit quarterly reports. CFOs earn bonuses tied to operating expenditure lines, not workflow granularity. Vendors know this. They sell "FTE equivalents" and "digital workers" because these metrics audit cleanly for procurement scorecards that reward headline savings over task-level efficiency. Procurement teams, measured on contract value secured and vendor delivery milestones, have little incentive to interrogate whether a "job" is a meaningful unit of analysis.

The error is categorical. A claims processor spends 60% of their time on judgement calls, exception handling, and customer conversation—work that resists rules-based logic. Only the data entry portion is automatable. When procurement documents cite framing as job elimination, they inflate the addressable market by a factor of three or four. One anonymised logistics firm discovered this after signing for “full automation of customs documentation.” The vendor delivered flawless software for the 20% of forms that were standard. The remaining 80% required human triage, but the contract had no task-level baseline, so the firm paid for bots that sat idle while staff still processed exceptions by hand.

Why Boards Buy the Replacement Narrative — the Political Economy of Procurement - Aerial view of a dense cityscape with many buildings.

Weak acceptance criteria compound the problem. Contracts often specify “process automation” without defining the discrete steps involved. Change-management costs—the hours spent retraining staff to handle exceptions, redesigning handoffs, and maintaining the new system—rarely appear in vendor proposals. Gartner’s post-implementation reviews consistently find that organisations underestimate these costs by 40–60% because procurement templates prioritise licence fees over operational reality.

Once the signatures dry, the organisational theatre ends and the empirical work begins. The question shifts from what the board agreed to buy, to what actually happens at the desk level.

Task‑Level Reality

Augmentation Dominates, Not Replacement

Workforce analytics firm TechWolf mapped dependencies across thousands of roles and found that 62% of tasks remain fully human‑dependent, while only 18% are fully automatable with current technology. The remainder sit in a middle ground where AI assists but does not remove the operator. This distribution aligns with real‑world usage data: according to the Anthropic Economic Index, the majority of AI interactions augment existing workflows rather than replace them entirely.

Augmentation rarely looks like a lights‑out factory. A compliance officer at a high‑street bank uses a language model to compare a new derivative contract against twelve regulatory frameworks, but still writes the final risk memo herself. The model cannot weigh reputational risk against commercial opportunity.

A software sales rep queries an internal knowledge base during a client call to surface arcane API specifications instantly, then decides how to frame the limitations without losing the deal. In pharmaceutical R&D, chemists use AI to generate molecular hypotheses, but still run the bench tests and interpret ambiguous assay results that training data never captured. The machine drafts; the human verifies, contextualises, and signs off.

Task‑Level Reality — Augmentation Dominates, Not Replacement - hammer

These hybrid workflows persist because implementation hits hard constraints. Integrations break when legacy databases refuse to talk to new APIs. Regulators in financial services and healthcare require human oversight for decisions that affect customers.

Most critically, the UK AI Labour Market Survey 2025 reveals that employers rely heavily on on‑the‑job training and apprenticeships to close skills gaps. Effective AI deployment still requires experienced staff who can recognise when a model hallucinates a regulation or misinterprets a customer’s tone. You cannot automate away the very people who know how to fix the automation.

Sector variance remains stark. Back‑office teams processing standardised invoices or simple identity documents can push automation rates higher, sometimes approaching full hands‑off processing. But client‑facing roles—underwriters negotiating bespoke terms, consultants diagnosing murky organisational problems, nurses triaging patients with incomplete histories—remain stubbornly augmentative. The tasks require too much situational nuance to hand to a script.

This gap between theoretical automation and practical augmentation explains why the workforce impact looks different than the headlines suggest.

NUMBERS DON'T LIE

The Math That Falsifies Automation‑First Business Cases

Imagine a mid-sized insurance claims department. Leaders see large language models handling customer queries and assume they can cut the 50-person team by half. They run the numbers on licence costs versus salaries and see gold.

Here is what they miss. McKinsey’s task-level analysis suggests only around 18% of activities in most occupations can be automated with current technology. That does not translate to 18% fewer employees.

Jobs are bundles of tasks: the automatable 18% might be scattered across every role in 10-minute fragments. You cannot fire nine people from a fifty-person team because AI handled 18% of the work; you still need humans to supervise exceptions, validate outputs, and manage the hand-offs. The realistic headcount reduction often drops to low single digits, or zero.

Then there is the cost side. Integration rarely means a simple API call. Legacy systems need middleware, data pipelines break, and governance frameworks must be built from scratch.

One retail bank found that connecting a generative AI pilot to its core banking platform cost three times the software licence fee. Add in retraining staff to work with AI outputs, quality assurance checks to catch hallucinations, and the vendor lock-in that kicks in once you have re-architected workflows around a specific model. The business case collapses.

The Math That Falsifies Automation‑First Business Cases - calculator

This explains why Gartner predicts that many generative AI projects will be abandoned by 2025. According to MIT research into GenAI pilots, a significant portion show no measurable P&L impact. The maths looked promising on a spreadsheet; in practice, the savings evaporated while the bills multiplied.

To model this correctly, start with task-level baselines rather than job titles. Build in staged validation gates where you measure actual time savings, not assumed efficiency. Calculate total cost of ownership to include change management and risk mitigation. And use conservative adoption curves: assume uptake will be slower and messier than the vendor slide deck suggests.

With the flawed math exposed, the only sensible move is to redesign the strategy entirely: stop trying to subtract people and start designing for augmentation.

A Practical Blueprint

Design for Augmentation, Not Replacement

Most AI programmes fail because they measure the wrong thing. A 4-person underwriting team at a mid-sized insurer recently shelved a £1.6 million "automation" project after discovering the bot handled only 11% of cases without human intervention. They had optimised for headcount reduction rather than decision velocity.

Start by reframing your metrics. Track time-to-decision for complex claims, error rates in contract review, or incremental revenue per underwriter—not FTE savings. TechWolf's analysis of 1,500 enterprises shows only 18% of tasks are fully automatable; the remaining 82% require human judgment. Measure adoption and trust as leading indicators—if staff override the system 60% of the time, you have a training problem, not a technology problem.

Rewrite your procurement playbook. Insist on task-level baselines in every RFP: current processing time, error baselines, and integration touchpoints. Structure contracts in three stages—pilot, operational pilot, scale—with acceptance gates tied to business KPIs rather than "go-live" dates. Vendors must commit to observability dashboards and explicit integration effort estimates.

The UK AI Labour Market Survey 2025 found firms investing in apprenticeships and on-the-job training for AI-adjacent tasks outperformed those cutting roles. Redesign jobs around oversight and exception-handling. Move claims handlers into model-validation roles rather than making them redundant.

A Practical Blueprint — Design for Augmentation, Not Replacement - a wooden mannequin standing in front of a white wall

Assign a single owner for ROI who instruments production systems for real usage data. Run post-implementation audits at 90 days. Kill pilots that fail to move leading indicators within six weeks.

Design three standard pilot templates. First, AI assistants for high-volume ticket routing (8-week timeline). Second, decision-support tools for credit risk that keep humans in the loop (12 weeks). Third, low-risk back-office automation for invoice matching (6 weeks with automatic rollback if error rates spike).

Stop buying the replacement narrative. Buy measurable capability, build people-first workflows, and measure at the task level. Audit your current RFP templates against these criteria, then review the MIT NANDA findings on pilot failure before your next procurement cycle.