Evaluation 5 min read

The State of Applied AI in Mid-2026

We have published a literature review on applied AI in mid-2026. It surveys ten capability categories, went through three independent fact-check passes, and is written for operational leaders and regulated professionals. Here is what it covers and how to use it.

Most of what gets written about AI right now falls into two buckets. There are glossy demos that do not survive contact with real data. And there are confident predictions about how everything is about to change. Neither of them helps an operational leader decide what to do on Monday.

The State of AI in Mid-2026 is our attempt at something more useful for the people who actually have to sign off on risk and budgets. It is a literature review, written in plain language, organised around ten capability categories, and grounded in how these systems behave in production rather than in demonstrations.

Download the full literature review (PDF) →

The review does not try to forecast a distant future. It asks a narrower question: for a typical small or mid-sized organisation in Australia, what can you reliably ship this year, what is still experimental, and where is the marketing ahead of the evidence?

Who it is written for

Three audiences.

Leaders of small and mid-sized businesses who are being pitched AI transformation and still have payroll, regulators, and service levels to think about.
Regulated professionals (clinicians, lawyers, accountants, valuers, engineers) who need to understand where AI can safely augment professional judgement and where it cannot.
Consultants and internal change agents who sit between vendors and operations and need a firmer basis for recommending specific patterns and controls.

The review assumes you are comfortable with workflows, risk registers, and data governance. It does not assume you follow every new model release.

What it actually covers

Ten capability categories that show up repeatedly in real projects. For each one, the paper separates four things:

What is reliable in production today for typical business and professional environments
What works in demonstrations but has failure modes that matter on real data
What is sold as more mature than it is, particularly in regulated or high-stakes domains
What is quietly further along than most buyers assume, and therefore under-used

The review stays at the level of applied capability rather than model-by-model benchmarking. It is not a leaderboard. It is a map of where you can reasonably place operational bets across the kinds of workflows that small and mid-sized organisations actually run: document automation, knowledge retrieval, drafting, decision support, monitoring, and the slower-moving categories where the marketing is currently running well ahead of the evidence.

Because most of our work is in Australia, the discussion keeps circling back to Australian regulatory expectations and product constraints (AHPRA, ASIC, RACGP, TGA, the Federal Court’s GPN-AI practice note, AUSTRAC Tranche 2, RICS) rather than purely US or EU case law.

Why a literature review, not an opinion piece

The paper draws on more than a hundred cited sources across peer-reviewed work, independent evaluations, regulator publications, and reputable press, with references you can inspect. Claims about capability, safety, and failure modes are anchored in published evidence where possible, and clearly marked as practitioner experience where not. The methodology, the source hierarchy, and the limitations are part of the document, not hidden.

Our view is that AI strategy work should be held to the same standard as any other critical operational decision. You should be able to see the chain from claim back to source. You should be able to disagree with the author using concrete references rather than vibes.

The fact-check standard

The review was drafted using Anthropic’s Claude Fable 5 and then subjected to a three-pass independent fact-check using Claude Opus 4.8.

Across those three passes, 135 individual fact-check findings were logged and resolved. The corrections log is included as an appendix, so readers can see exactly what changed and why. The pre-fact-check draft is preserved in the archive for transparency.

The unusual thing here is the visibility of the correction trail. Most AI-assisted documents arrive without one. This one arrives with the chain showing.

For a leader, that means the document is not just a snapshot of what the author thought when they wrote it. It is a worked example of how to build AI-assisted documentation with verifiable claims and an auditable correction trail. That standard is increasingly important in regulated contexts.
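
For teams that want to keep a similar trail on their own documents, a corrections log can be as simple as one structured record per finding. What follows is a minimal sketch, not the format used in the review's appendix: the `Finding` class, its field names, the `summarise` helper, and the sample entries are all hypothetical and exist only to illustrate the idea.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """One logged fact-check finding (hypothetical structure)."""
    finding_id: int
    fact_check_pass: int  # which pass raised it, e.g. 1, 2 or 3
    claim: str            # the sentence or claim that was challenged
    source: str           # the reference the claim was checked against
    correction: str       # what was changed, and why
    resolved: bool = False


def summarise(findings):
    """Return totals a reviewer can audit at a glance."""
    per_pass = Counter(f.fact_check_pass for f in findings)
    unresolved = [f.finding_id for f in findings if not f.resolved]
    return {
        "total": len(findings),
        "per_pass": dict(per_pass),
        "unresolved": unresolved,
    }


# Placeholder entries, invented for illustration only.
example_log = [
    Finding(1, 1, "Example claim A", "Example source A",
            "Figure corrected to match source", resolved=True),
    Finding(2, 1, "Example claim B", "Example source B",
            "Claim softened; source was narrower than stated", resolved=True),
    Finding(3, 2, "Example claim C", "Example source C",
            "Awaiting a primary reference"),
]

if __name__ == "__main__":
    print(summarise(example_log))
```

The point of the structure is that every correction carries its own claim, source, and reason, so a reader can follow the chain without trusting the author's summary. A log is ready to publish when the `unresolved` list is empty.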

How to read and use it

The paper is around 13,000 words. It is meant to be returned to, annotated, and reused in strategy and governance work, not read in a single sitting.

Three patterns we see when people actually use it:

Strategy teams using the ten capability categories as a checklist when reviewing AI project proposals or vendor pitches
Risk and compliance functions using the “demo versus production” sections to pressure-test vendor claims and shape control design
Consultants excerpting specific sections (knowledge retrieval, document automation, clinical scribes, legal AI, vertical AI accuracy claims) into client education packs with the original references intact

You do not need to read it linearly. Many readers start with the methodology and the appendices, see what standard is being applied, and then read the capability sections most relevant to their current projects.

How it fits with our other work

The State of AI review sits alongside a separate literature review we published on local PHI masking and de-identification for clinical AI tools. That review synthesises two decades of clinical de-identification research into design principles that can be implemented in modern systems, including ClientJourney.

Together they sit in the Resources section of this site, alongside the technical artefacts behind other things we build.

Download the State of AI in Mid-2026 (PDF) →

Explore all technical resources →
