TL;DR
- AI QR code cost per conversation is the operational cost of each AI-handled interaction, not the price of the QR code itself.
- Platforms bill in one of three ways: per conversation, per resolution, or token based, and each carries a different risk profile.
- Your real number depends on conversation length, complexity, language, volume, and how often the AI resolves the question without escalating to a person.
- A well built deployment usually costs far less than human support per interaction, but a poorly trained one can cost more, because escalated conversations get paid for twice.
For years the budgeting question for QR codes was simple. How much does a QR code cost? That was a question about the platform, a monthly line item for creating and tracking codes. AI changes where the money goes. When you add a conversational AI layer like Cleo to a dynamic QR code, the meaningful cost moves out of the platform and into the conversations themselves. So the question that matters now is the AI QR code cost per conversation, and that is what this guide is built around. The aim is to give you a clear mental model, a way to estimate your own number, and an honest sense of when this pays back fast and when it does not.
Why is AI QR code cost per conversation the right question to ask?
Cost per conversation is the right question because that is where the variable cost lives once you deploy AI. A static platform fee is predictable. Conversations are not. Every scan that turns into a real exchange consumes processing, and that volume scales with how many people interact and how much they ask. Understanding this shift is the foundation for evaluating return on investment.
What changed when QR codes gained an AI layer?
The job a QR code does has expanded. A traditional QR code took someone from a printed surface to a destination url, and the cost ended there. The only ongoing expense was the platform that hosted and tracked the code. Now the code can carry a conversation, so the work continues after the scan. Each answer the AI produces has a marginal cost, which is why the budgeting model has to change with it.
This is also why the old framing of QR codes cost money, full stop, no longer captures the picture. The code still costs very little. The conversations are the part worth modelling, because they scale with your customers rather than with your printing.
What is the difference between the platform cost and the conversation cost?
The platform cost and the conversation cost are two separate layers that should be budgeted separately. The platform layer covers creating dynamic QR codes, hosting them, and tracking scan data, and it tends to be a stable subscription fee. The conversation layer covers the AI doing the actual work of answering people, and it moves with usage.
Keeping the two apart matters because they behave differently. The platform layer is fixed and easy to forecast. The conversation layer is variable and tied to demand. A reader evaluating AI QR codes is really asking the narrower question: if I put these codes in the world, what is each conversation going to cost me, and how does that compare to what I pay today?
What are the three billing models for AI QR conversations?
AI QR conversations are usually billed in one of three ways: per conversation, per resolution, or token based. Each one prices a different unit of value, and the right choice depends on how predictable your usage is and how much you trust your knowledge base to resolve questions on its own.
| Billing model | How it works | Best suited for | Main trade off |
|---|---|---|---|
| Per conversation | A flat cost for each AI handled conversation, no matter how long or complex | Teams that want predictable, easy to forecast costs | You pay the same for a one line answer and a long exchange |
| Per resolution | You are charged only when the AI resolves the question without escalating | Mature deployments with a strong knowledge base | Resolution can be defined loosely, so check how it is measured |
| Token based | You pay for the volume of text processed, where one token is roughly 0.75 words | High control setups where you want cost tied directly to usage | Hardest to predict, since long or multilingual chats raise the bill |
How does per conversation pricing work?
Per conversation pricing charges a flat amount each time the AI handles an exchange, regardless of how long or complex that exchange turns out to be. It is the easiest model to forecast because the unit is obvious. Multiply your expected conversation volume by the per conversation rate and you have your budget.
The trade off is that you pay the same for a one line answer about opening hours as you do for a detailed exchange about product specs. For deployments where conversations are short and similar in length, this averages out well. For deployments with wildly variable conversation lengths, you may overpay on the simple ones and underpay on the complex ones.
How does per resolution pricing work?
Per resolution pricing charges only when the AI successfully resolves a question without escalating to a human. On paper this is the most appealing model, because you pay for outcomes rather than attempts. If the AI cannot answer and the conversation goes to an agent, you are not billed for the AI side.
The catch is in the definition. What counts as a resolution can be measured generously or strictly, so it is worth understanding exactly how the platform decides. A loose definition can mark a conversation resolved when the customer simply gave up, which is not the same as a genuine answer. This model rewards a strong knowledge base and punishes a weak one in the most direct way.
How does token based pricing work?
Token based pricing charges for the volume of text processed, where one token corresponds to roughly 0.75 words, though this varies by AI model. You pay in proportion to how much the AI reads and writes, which ties your cost directly to actual usage.
At low volume this is often the cheapest option, since you only pay for what you use. The downside is predictability. A handful of long technical conversations, or a spike in a language that tokenizes less efficiently, can move the bill more than you expect. Token based pricing rewards concise source content and short, well aimed answers.
Which billing model gives you the best value?
The model that gives you the best value depends on the shape of your traffic, not on any single model being superior. Predictable, similar length conversations favor per conversation pricing. A mature deployment with a high resolution rate favors per resolution pricing. Low or variable volume where you want tight control favors token based pricing.
A practical approach is to estimate your numbers under each model using the same traffic assumptions, then pick the one that produces both the lowest expected cost and the level of predictability your finance team needs. Sometimes the cheapest model on paper is not the best value once you weight in how hard it is to forecast.
What drives the cost per conversation in practice?
The headline price of a billing model is only a starting point. What you actually pay per conversation is shaped by a handful of factors, some you can influence and a couple you cannot. The drivers below are the ones that move the number most.
How does conversation length affect cost?
Conversation length is the most direct driver, because longer exchanges consume more tokens and more processing. A quick question that resolves in one reply costs almost nothing. A back and forth that runs ten messages costs noticeably more, especially under token based pricing.
You can shape this. Clear, well structured source content tends to produce shorter, more decisive answers, which keeps conversations from sprawling. Vague content forces the AI to hedge and ask follow ups, which lengthens every exchange.
How does question complexity change the price?
Question complexity raises cost because technical or detailed questions generate longer, more involved answers. A request for a simple scan of opening hours is cheap. A multi part question about compatibility, pricing tiers, or technical specifications produces a longer response and often a longer conversation.
Complexity also affects your resolution rate. The more involved the question, the higher the chance the AI cannot fully answer it and the conversation escalates, which is where cost can quietly double. Matching the depth of your knowledge base to the complexity of the questions you actually receive is the lever here.
Why does language change the cost?
Language changes cost because some languages tokenize less efficiently than English, so the same conversation can consume more tokens simply because of how the text is broken down. A multilingual deployment may see its cost per conversation vary by language even when the questions are identical.
This matters most under token based pricing and least under per conversation pricing. If your audience is heavily multilingual and you bill by token, it is worth modelling each major language separately rather than assuming an English baseline holds across all of them.
How does volume lower your per unit price?
Volume tends to lower your per unit price because high volume deployments often negotiate better rates, and fixed setup costs get spread across more conversations. The cost per conversation usually falls as you scale, which is the opposite of how human support behaves, where each additional interaction costs roughly the same.
This is one reason AI QR codes suit high traffic use cases so well. The more conversations you run through a single well maintained knowledge base, the lower the effective cost of each one, and the better the case looks against staffing up.
Why does resolution rate matter most?
Resolution rate matters most because a poorly trained knowledge base produces failed conversations that escalate to a person, and those get paid for twice, once for the AI attempt and once for the human who finishes the job. A low resolution rate can turn a cheap per conversation rate into a net loss.
This is the single most controllable driver and the one teams underestimate most often. Investing in source content, reviewing what the AI gets wrong, and feeding those gaps back into the knowledge base raises resolution over time. A deployment that resolves 40 percent of conversations in month one can climb well beyond that as the content matures.
How does AI conversation cost compare to traditional support cost?
The comparison that decides most budgets is AI conversation cost versus the cost of handling the same question through a human channel. A typical live agent interaction carries a meaningfully higher cost than an AI handled one, because it ties up a paid person for minutes rather than seconds. Industry sources commonly cite reductions in the 30 to 90 percent range when AI absorbs routine inquiries.

What does a human support interaction actually cost?
A human support interaction costs far more than its visible wage line, because it includes the agent’s time, the tooling around them, training, and the opportunity cost of what that person could be doing instead. Even a short call or chat consumes minutes of a paid person’s attention, and that floor does not fall with volume.
This is the baseline against which AI conversation cost is measured. When you compare the two honestly, you are not comparing the AI rate to zero. You are comparing it to the loaded cost of a human handling the same routine question, which is consistently the higher of the two for deflectable inquiries.
How much can AI realistically deflect?
AI can realistically deflect the routine, repeatable layer of your support, which is often a large share of total volume but rarely all of it. The commonly cited 30 to 90 percent range is best treated as an industry observation, not a guarantee, because the realistic figure depends on your resolution rate and how many conversations genuinely deflect.
Resolve most routine questions cleanly and you land near the top of that range. Resolve a thin slice and bounce the rest to an agent and you land near the bottom, or worse, because of the double cost problem. The technology does not replace support. It absorbs the repetitive, deflectable layer so people can focus on the conversations that need them.
What hidden costs should you budget for?
The per conversation price is the visible cost. A few less obvious ones tend to surprise teams after launch, and budgeting for these hidden costs up front keeps your real cost per conversation honest.
Knowledge base setup and maintenance
The AI is only as good as the content it draws on, so knowledge base setup and maintenance is a real and ongoing cost. There is an initial build, where you assemble the descriptions, FAQs, pricing, and policies the AI will use, and there is continuous curation as those details change. Stale content produces wrong answers, and wrong answers drive escalations.
Budget for someone to own this. It does not require a developer, but it does require attention, and the quality of that attention shows up directly in your resolution rate and therefore your cost per conversation.
Failed conversations that escalate
Failed conversations that escalate to a human are the costliest hidden item, because they are paid for twice, once for the AI attempt and once for the agent who finishes the job. A deployment with a weak knowledge base can generate so many of these that it costs more than the manual process it replaced.
The fix is not to avoid escalation entirely, which is impossible, but to keep the escalation rate low through better content. Tracking which conversations fail and why is the most valuable feedback loop you have.
Multilingual content
Multilingual content is a hidden cost when your knowledge base lives in one language but your customers speak several. Cleo can respond in the language a visitor chooses, but the underlying source content still has to support that range well, or quality drops on the languages you have not properly covered.
If a large share of your audience speaks a language other than your source content, plan to invest in that content rather than relying on translation alone. The cost of getting this right is far lower than the cost of low quality answers driving escalations in your second and third languages.
Integration with backend systems
Integration with backend systems is a cost when you want the AI to answer questions that depend on live data, such as order status, availability, or customer accounts. Connecting the AI to that data through an API integration requires technical work, and the depth of integration you choose changes both the upfront cost and the range of questions you can resolve.
A static knowledge base is cheaper to run but answers fewer of the questions people actually have. Live integration costs more to set up but lifts your resolution rate on exactly the questions that would otherwise escalate. This is a genuine trade off, not a default.
What does it cost to not deploy AI QR codes at all?
The honest comparison is not only AI against other AI options. It is also AI against the cost of doing nothing. Standing still has a price that rarely shows up on an invoice, which is exactly why it gets ignored.
The opportunity cost of routine tickets
The opportunity cost of routine tickets is the most underrated line in the whole analysis. Every time an agent answers the same deflectable question, that is time not spent on the conversations that genuinely need a person. Your most expensive resource, human attention, is being spent on questions a well built knowledge base could have handled.
That cost compounds quietly. A team buried in repetitive tickets is a team that cannot give its harder cases the attention they deserve, which degrades the experience on exactly the interactions where humans add the most value.
How slow answers drive customer churn
Slow answers drive churn because a question left unanswered, especially outside working hours, is a moment where a customer can lose patience and leave. The cost of that churn does not appear as a support expense, but it is real, and it lands on revenue rather than the support budget.
A dynamic QR code with Cleo answering at any hour addresses this directly. The customer standing in front of a product or a sign gets an answer in the moment, rather than waiting for a reply that may arrive after they have already moved on.
How do you calculate your expected cost per conversation?
You can estimate your own number with a simple framework before you spend anything. The point is not precision to the cent. It is a defensible figure you can compare against your current cost per ticket. Work through these steps:
- Estimate your current support volume. Pull the number of inbound questions you handle in a typical month across the channels AI would touch.
- Identify what share is routine and deflectable. Be realistic. Hours, location, availability, and simple how to questions are deflectable. Bespoke or sensitive issues usually are not.
- Apply a realistic AI resolution rate. Start conservative. New deployments often land in the 40 to 60 percent range, and that figure grows as the knowledge base matures.
- Multiply by the per conversation cost of your chosen platform. This gives you the projected AI spend on the deflected volume.
- Compare to your current cost per ticket. The gap between the two is your expected saving.
A worked example for a mid sized deployment
A worked example makes the framework concrete. Imagine a business handling 4,000 inbound questions a month, of which 60 percent, or 2,400, are routine and deflectable. Apply a cautious early resolution rate of 50 percent, and the AI genuinely handles 1,200 conversations in month one. The remaining deflectable questions still reach a human, so they are not yet saving anything.
Now run it twice. With a mature resolution rate of 75 percent, the AI handles 1,800 of those 2,400 routine questions, and the per unit economics improve as volume rises. The spread between the cautious and mature scenarios is usually more instructive than either figure alone, because it tells you how much of your return depends on improving the knowledge base over time rather than on the AI rate you sign up for today.
When do AI QR codes pay for themselves, and when do they not?
AI QR codes pay back fastest when you have high volume routine support and a well built knowledge base, and they pay back slowly or not at all when volume is low and every issue is complex and bespoke. That is the honest framing, and it cuts both ways.
When does the investment pay back fast?
The investment pays back fast when most of your inbound questions are variations on a predictable theme and you are willing to invest in the source content. Each deflected conversation costs a fraction of the human alternative, and volume does the rest. High traffic use cases such as restaurant menus, product information, and event details tend to sit squarely in this zone.
In these cases the per conversation cost falls as volume rises, the resolution rate climbs as the knowledge base matures, and the savings compound month over month. The math turns positive quickly and stays positive.
When does it pay back slowly or not at all?
It pays back slowly or not at all when your support is low volume and genuinely case by case, where almost every conversation needs a person anyway. If the deflectable share is small, the resolution rate stays low and the savings never accumulate enough to justify the setup and maintenance.
There is no shame in landing in this category. A clear no is more valuable than an expensive maybe, and recognising it early saves you the cost of a deployment that was never going to pay for itself.
How does the platform layer fit into all of this?
The platform layer is the foundation that makes the conversation layer work at all. Dynamic QR codes are what let you update what the AI knows, track which products or pages generate the most questions, and iterate on the knowledge base over time.
Why do dynamic QR codes matter for AI?
Dynamic QR codes matter because they let you edit content behind a code that has already been printed, which is exactly what an evolving knowledge base needs. Static QR codes are free to create, but they are fixed at the moment they are printed, with no tracking, no way to change the destination, and no room for a conversational layer on top.
Every QR code from QRCodeKIT is dynamic, which is what allows Cleo to keep learning behind a code that never has to be reprinted. You can update the destination url, refresh the source content, and improve answers continuously, all without touching the printed materials already out in the world.
What do free and paid plans actually include?
Free and paid plans differ mainly in scale, scan data, and features rather than in whether the code is dynamic. A free plan is a sensible way to test the foundation, offering a small number of dynamic QR codes, basic analytics, and a capped number of scans, which is enough to validate the idea on personal projects or a single marketing campaign before committing.
Paid plans open up more QR codes, higher or unlimited scan counts, advanced analytics, and the features that larger operations need. Subscription tiers typically scale from a starter level suited to small teams up to plans built for enterprise teams, with the higher tiers adding capabilities like API access and team management. The free plan answers the QR code pricing question for a first test, and the paid plans answer it for production.
What customization and features come with a paid plan?
Customization and features on paid plans turn a basic code into a managed asset. Beyond more QR codes and higher scan limits, paid tiers commonly add a custom domain so links run on your own domain, password protection for sensitive destinations, branded QR codes and custom designs through design tools, and bulk operations for creating many codes at once across multiple campaigns.
Analytics deepen as well. Advanced analytics and richer scan analytics let you track scans by location and time, understand total scan counts, and see which assets drive the most engagement. Combined with lead capture inside Cleo conversations and the ability to route visitors to Google reviews, a digital business card, or any landing page you choose, the platform layer becomes campaign management rather than simple QR code generation. This is also the layer that gives you the data to improve the knowledge base, which closes the loop back to a lower cost per conversation.

Frequently asked questions
Do QR codes cost money to use with AI?
The short answer is that the QR code itself is inexpensive, but the AI conversations are the real variable cost. A dynamic QR code is a small, predictable platform cost, often a low monthly figure. The larger and more variable expense is the AI conversation layer, billed per conversation, per resolution, or by token volume depending on the platform.
Is a free QR code generator enough for AI conversations?
A free QR code generator is fine for a simple scan that points to a fixed link, but free generators cannot support an AI conversation. A free QR code is typically static, so there is no way to edit content, track scans, or add a conversational layer. AI QR codes need the dynamic foundation that a free plan with basic analytics begins to provide, and you grow from there.
How much can AI realistically reduce support costs?
Industry observations commonly cite reductions in the 30 to 90 percent range per deflected interaction, but the realistic figure for your team depends on your resolution rate and how many conversations genuinely deflect. Treat the high end as a ceiling you earn through a strong knowledge base, not a default you get on day one.
What makes one AI conversation cost more than another?
Length, complexity, and language are the main drivers. A short question in English costs little. A long technical exchange, or the same conversation in a language that tokenizes less efficiently, costs more. Volume works the other way, since larger deployments often secure lower per unit pricing as they scale.
Can I start small and scale up?
Yes. A free plan lets you test the dynamic foundation and basic analytics before committing, and you can move up through subscription plans as the case proves itself, adding custom QR code designs, lead capture, advanced features, and API integration along the way. Some providers also offer free trials of paid tiers, which can be a useful middle ground for validating advanced features before you commit to an annual plan. Starting small keeps your early cost per conversation low while you build the knowledge base that drives resolution rates up.
Do dynamic QR codes cost more than static ones?
Dynamic QR codes carry a small subscription fee that static codes do not, because they do more. Static codes are free but fixed, with no editing, no tracking data, and no conversational layer. The dynamic platform fee is what buys you the ability to edit content, track scans, and run an AI conversation, which is the entire point when cost per conversation is what you are trying to manage.
Can I add AI to an existing QR code?
If the existing QR code is dynamic, the content and destination behind it can be updated, which is what allows a conversational layer to be added without reprinting. If it is a static code, it cannot be changed after printing, so you would create a new dynamic code to carry the AI experience. This is one more reason the platform layer matters before you think about conversation costs at all.
All images and visual content in this article were created using RealityMAX.