Integrating LLMs into existing business software without a full rewrite

Every week someone asks us whether they need to rebuild their software to use AI. The answer is almost always no. What they need is to understand where an LLM adds genuine value in their existing workflow, and where it's a solution looking for a problem.

The wrong way to think about AI integration

The most common mistake we see is companies wanting to "add AI" as a feature — a chatbot in the corner, a summary button somewhere, an AI-generated report that nobody reads. These projects get built, demoed, and quietly switched off three months later because they didn't solve a real problem.

The right question isn't "where can we add AI?" It's "where in our workflow is a human doing something that takes significant cognitive effort, produces inconsistent results, and has a clear correct answer most of the time?" That's where LLMs earn their keep.

The highest-value LLM integrations we've built share one characteristic: they're invisible. The user doesn't interact with "AI" — they just notice that a task that used to take 20 minutes now takes 2.

Three integration patterns that actually work

Document processing and extraction: If your team spends time reading documents and pulling specific information out of them — contracts, invoices, emails, CVs — an LLM can do this faster and more consistently than a human. The output is structured data that goes straight into your existing system. No new UI needed.

Classification and routing: Support tickets, leads, compliance flags, content moderation — anywhere you have incoming items that need to be sorted and sent somewhere. LLMs handle the long tail of edge cases that rules-based systems miss. You keep your existing routing logic; you just replace the manual triage step.

Draft generation with human review: Proposal sections, compliance summaries, client update emails — tasks where the output needs to be consistent in format and tone but reviewed before sending. The LLM handles 80% of the cognitive work; the human does the final check. This is where the time savings are real and the risk is manageable.

What doesn't work

LLMs are poor at tasks that require precise numerical reasoning, tasks where being wrong has serious consequences without a human review step, and tasks where the "correct" answer is highly subjective or business-context-specific without significant prompt engineering investment.

We've also seen integrations fail because the underlying data was too messy. An LLM processing your customer records is only as good as those records. Garbage in, garbage out — just faster.

The technical side: it's simpler than you think

In most cases, integrating an LLM into existing software is an API call. You take some data from your system, send it to an LLM with a carefully constructed prompt, and parse the response back into your data model. The complexity is in the prompt engineering, the error handling, and building the confidence thresholds that determine when to route to human review.

You don't need a new database. You don't need to retrain a model. You don't need a team of ML engineers. You need a clear problem definition, a reliable API, and time to iterate on the prompt until the outputs are consistently good enough.

That last part — iterating until good enough — is where most DIY attempts fail. "Good enough" needs to be defined upfront, measured systematically, and improved methodically. It's engineering, not magic.