The OCR trap: why 24-hour invoice processing isn't automation
If your invoice software takes 24 hours to process a scan, it isn't AI. Here's what's actually happening, and why it matters for your GP.
HOPS Team
Product & Operations
Every hospitality finance product sold in the last five years has included some version of the same line: AI-powered invoice processing. Scan your supplier invoice, and the software reads it, extracts the data, and posts it to your accounts automatically.
It is a genuinely useful thing, if it actually works. The problem is that a significant number of these platforms are not doing what they claim.
The giveaway is the turnaround time. If your "AI" takes 24 hours to process an invoice, it is not AI. It is a person.
What OCR actually does
Optical character recognition is not a new technology. It has been extracting text from images for decades. When it is applied to invoices, it reads the document, identifies fields, and pulls out the data: supplier name, invoice number, line items, quantities, costs. The process takes seconds. Not minutes, and certainly not hours.
Real OCR, running on modern infrastructure, turns around an invoice in the time it takes you to pick up your phone and check it. The extraction happens, the operator sees the result, and the workflow continues.
A 24-hour turnaround does not describe an algorithm. It describes a human shift. Someone, somewhere, is opening your invoice, reading it manually, and keying the data. The software is a wrapper around a manual labour operation, dressed up with AI language.
Why this model is more common than you might think
The economics make it easy to understand, even if the marketing makes it hard to spot.
Building OCR that handles the chaos of real hospitality invoices is genuinely difficult. Supplier PDFs are inconsistent. Handwritten delivery notes exist. Layouts change between runs. Training a model to handle all of that reliably takes time and money. It is much cheaper to hire workers, usually offshore, to process invoices manually at scale. The per-unit cost is low enough that it looks like automation from a distance.
Some platforms have built genuine extraction technology. Others have built a ticketing system that routes your invoices to a data entry queue. The UI looks identical. The marketing language is identical. But what is happening behind the scenes is completely different.
The problems with manual invoice processing, even when it works
Set aside the misrepresentation for a moment. Even if you were comfortable with a person manually keying your invoice data, the model creates problems that compound over time. The comparison between manual and automated invoice processing covers these failure modes in more detail.
No operational context. The person entering your invoice data does not know your business. They do not know that you buy three different grades of salmon, that your chicken supplier changed their product codes six months ago, or that your head chef occasionally sources from a secondary supplier when the primary is out of stock. When they categorise a line item, they are guessing. That guess posts to your accounts.
No consistency. On a crowdsourced platform, a different person may key your invoices every time. There is no institutional memory. The same supplier gets categorised differently on different invoices. The same product sits in two cost centres depending on who processed it that week. By the time you spot the inconsistency, it has been running for months.
Errors that only surface when your GP looks wrong. A transposed quantity. A unit price copied from the wrong column. A delivery charge posted to food cost. Each one is small. None of them are immediately visible. But they accumulate quietly in your accounts until the GP number stops making sense and you spend a day trying to find out why.
Your financial data, handled by people you have never met. Supplier pricing, order volumes, cost structures: this is sensitive commercial information. A crowdsourced data entry model means an unknown number of unvetted workers have seen it. Most operators do not realise this is happening until they ask.
The timing problem and your GP
Invoice data is not just a bookkeeping matter. It feeds directly into your cost of goods sold, and therefore into your gross profit margin. For a full breakdown of how invoice data accuracy affects hospitality reporting, there is more detail worth reading.
If an invoice is processed with an error, that error lives in your GP calculation. A cost centre misallocation means your food GP looks better than it is, and your non-food costs look worse. A wrong quantity means your theoretical usage is off, which corrupts your stock reconciliation. A missed line item means you have under-reported costs, which makes decisions about pricing and menu engineering unreliable.
Operators working hard to understand and improve their margin are doing so on figures that depend on invoice data being right. If that data is being generated by a manual process with no quality control and no way to audit individual decisions, the figures you are working from are not trustworthy. You cannot manage margin on data you cannot verify.
What proper invoice automation looks like
The principle is straightforward. Extraction should be fast. Review should be easy. Approval should be deliberate.
When OCR works correctly, the invoice is processed in seconds. The extracted data, every line item, every code, every amount, is presented to the operator before anything posts. The person who understands the context, the head chef who knows that higher lobster price was agreed after a conversation, or the finance manager who knows this supplier charges for carriage separately, reviews the result. They approve it, correct it if needed, and move on. A record of what was changed, and by whom, is kept.
That sequence matters because it puts knowledge where it belongs. The software handles the mechanical extraction. The human with operational context makes the judgement call. If the extracted data is wrong, it gets corrected before it enters the accounts, not weeks later when the damage has propagated through GP reports and management accounts.
This is not a slower process. A 30-second review of extracted data is faster than manual data entry by anyone, offshore or in-house. What it gives you is accuracy, auditability, and the confidence to act on the figures you see.
“Since implementing Hops at Green & Fortune, we've seen a significant boost in profitability!”
Alan Morgan
Financial Director, Green & Fortune
How to tell what your platform is actually doing
The simplest test is timing. Send an invoice at 9pm and see when the data appears. For broader context on what active accounts payable management in hospitality should look like, that is worth reading alongside this. If it appears within minutes, you have extraction. If it appears the next morning, you have a night shift.
A second test: ask the provider directly whether the processing is automated or involves human review. A provider with genuine OCR will answer immediately and clearly. A provider running a manual operation will give you a longer answer, possibly involving phrases like "human-in-the-loop quality assurance" or "expert review to ensure accuracy." Those phrases are not wrong, exactly, but they are describing something different from what most operators think they bought.
A third test: check whether you can see and edit the extracted data before it posts. If invoices move directly to your accounts with no review step, you have no way of knowing what was captured or correcting it before it becomes a number in your reports. That is a problem regardless of whether extraction is automated or manual.
The review step is not a weakness
There is sometimes a perception that requiring an operator to review extracted invoices is a limitation, that true automation would post everything without a human in the loop.
That view underestimates how much operational knowledge lives with the people running the business, and how much damage a wrong posting can do.
Incoming data should be visible, vetable, and correctable before it becomes a number you make decisions on. That applies to invoices. The review step is not friction. It is the point at which your knowledge, not an algorithm's guess, enters the record.
Hops Finance presents extracted invoice data to the operator before anything posts. The person who knows the context reviews the result. What changes and why is recorded. The figures that feed into GP calculations come from data that a real person in your business has seen and signed off.
That is not a limitation on the automation. It is how the automation is designed to work.
If you are currently relying on a platform that promises AI-powered invoice processing and delivers data the next morning, it is worth asking what is actually happening, and what it might be costing you. Book a demo to see how Hops handles invoice processing.
Frequently asked questions
How do I know if my invoice software is using real OCR or manual data entry?
The most reliable test is turnaround time. Send an invoice at 9pm and check when the extracted data appears. If it is ready within minutes, you have genuine OCR. If it appears the next morning, a person processed it overnight. You can also ask the provider directly whether processing is automated or involves human review, and listen carefully to how they answer.
Why does 24-hour invoice processing matter for my GP?
Invoice data feeds directly into your cost of goods sold and therefore your gross profit margin. If that data is being generated by a manual process with errors and no operator review, the GP figures you are working from are not fully trustworthy. Errors in category coding or line item amounts flow through to management accounts and can persist for months before anyone notices.
Is it a problem if my invoice platform uses offshore workers to process invoices?
There are two main concerns. The first is accuracy: offshore processors do not know your business, so they cannot correctly categorise a line item that requires operational context. The second is data privacy: supplier pricing, order volumes, and cost structures are sensitive commercial information, and a crowdsourced data entry model means that data is seen by an unknown number of unvetted workers.
What should automated invoice processing look like in hospitality?
The invoice is processed by OCR in seconds. The extracted line items, quantities, and amounts are presented to the operator in a clear review screen before anything posts. The operator with operational knowledge reviews the result, corrects any extraction errors, and approves. What was changed and by whom is recorded. Hops Finance is built around this workflow. Book a demo at hopshq.com.
Does the operator review step slow down invoice processing?
Not meaningfully. A 30-second review of pre-extracted data is still faster than manual data entry by anyone. What the review step gives you is accuracy, auditability, and the confidence to act on the cost figures in your reports. An invoice that posts automatically without review may save a few seconds but can introduce errors that take an hour to trace at month end.
Tags
Built for operators
See how operators are actually using Hops.
We could tell you what Hops does. Instead, read what the people running their businesses on it have to say.