Work

Pioneering AI automation that improves tax compliance processes

The Opportunity

In every business there are mission-critical tasks that every employee dreads. Often these are the time-consuming, repetitive jobs that require a combination of expert judgement, an eye for detail and, above all, mental stamina.
Exploring whether modern AI techniques can take on such tasks without sacrificing accuracy and trust is often the first application of AI within an organisation. But while it is relatively easy to impress in a demo, showing robustness, scalability and real world value is another matter entirely.
For Tax Systems, who provide tax compliance software solutions to more than 40% of FTSE 100 companies, the annual categorisation of revenue and expenditure allocations for corporate tax returns is such a task. And with Tax Systems’ customers processing 200,000 tax computations each year, each one taking around five hours to populate, that adds up to a million hours of vital, painstaking manual effort.
Softwire partnered with Tax Systems to explore whether modern AI techniques could meaningfully support tax research and decision-making, starting with a tightly scoped proof of concept that would only graduate if it could be trusted and adopted by tax professionals.

Our Approach

Our first task was to see if the onerous job of mapping a company’s internal revenue and expenditure to HMRC’s standardised set of approved categories could be automated, since this would eliminate thousands of hours of manual checking and cross-referencing.
And vitally we needed to ensure that OpenAI’s LLM models could achieve high enough accuracy to give tax professionals faith that the AI could deliver results that were viable, useful and trustworthy. Without trust, any time savings achieved by use of AI would be meaningless.
The complexity and variety of data meant that 100% accuracy would never be feasible, but by working closely with Tax Systems’ specialists, refining the input category to standardised account category mapping process and careful, iterative prompt engineering, we were able to hit a 93% accuracy level, well in excess of the 85% level that had been determined to be the minimum trust threshold.
So far, so good, but to be truly useful we needed to see whether the LLM could not only categorise items of expenditure, but also determine whether they are allowable deductions for tax purposes, a task traditionally done by qualified professionals who look at the wider context of the business to make their assessments.
Training the LLM to make tax deduction decisions with the necessary accuracy needed a different approach and, again working closely with tax experts, we ‘fine-tuned’ the AI models by feeding them curated sample datasets that dramatically improved accuracy over a series of iterations. Once we had optimised the prototype’s performance, it was able to complete this final piece of tax analysis in a few seconds with the same accuracy achieved through hours of manual, human attention.

The Impact

Our working prototype allowed Tax Systems to demonstrate a real system using actual data to their customers, showing that a manual process that once took four to five hours, could now be accomplished in a matter of minutes.
Crucially we helped Tax Systems bridge the organisational gap between innovation and delivery, demonstrating enough value, reliability and alignment with real user needs to justify moving beyond experimentation.
Just think how much more high-value work these skilled people could do in the time they’ll save,” says Russell Gammon. 
 
In 2024 the production solution, now known as Alphamap, won Best Digital Innovation at the Tolley’s Taxation Awards, recognizing its pioneering use of generative AI in tax data processing.

“ Softwire helped us prove that LLMs could be used to solve a very real customer challenge. And what they produced at the end was significantly more impressive than we’d expected.

Russell Gammon, Chief Solutions Officer, Tax Systems