AI coding assistants shine for boilerplate code, simple functions, documentation, debugging, and other routine tasks. Donโt push them too far.
Just because you can use generative AI in software development doesnโt mean you should. AI coding assistants powered by large language models (LLMs) are a productivity dream in some cases but a debugging nightmare in others. So, where is that line?
โKnowing when and how to rely on AI code assistants is an important skill to learn,โ says Kevin Swiber, API strategist at Layered System. โItโs changing day by day as the technology advances. Itโs hard to keep up.โ
63% of professional developers currently use AI within their development process, according to Stack Overflowโs 2024 Developer Survey. AI coding assistants are proving to be an incredible time saver for boilerplate code, simple functions, documentation, and debugging.
However, AI-generated code is riddled with quality concerns, and a heavy reliance on it compounds technical debt. Experts view AI agents as less ideal for completely novel coding projects, highly complex architectures, long build cycles, or code reuse.
The short and skinny? AI works better in some situations than others. (Not to harsh your vibe, but vibe coding still requires human supervision.) Below, weโll consider when AI tools shine and when they donโt, and offer some takeaways for software engineering leaders.
Where AI coding assistants shine
AI performs exceptionally well with common coding patterns. Its sweet spot is generating new code with low complexity when your objectives are well-specified and youโre using popular libraries, says Swiber.
โWeb development, mobile development, and relatively boring back-end development are usually fairly straightforward,โ adds Charity Majors, co-founder and CTO of Honeycomb. The more common the code and the more online examples, the better AI models perform.
Quicker feedback cycles with AI tend to lead to a better experience. โTasks with quick feedback loops, like front-end development or writing unit tests, tend to work particularly well,โ says Majors. โIf it takes you two hours to deploy your back-end code, this will be more challenging.โ
Harry Wang, chief growth officer at Sonar, says AI excels at well-understood programming tasks like scaffolding microservices, generating REST APIs, or prototyping new ideas.
โAI coding assistants truly shine when they augment developers, taking on routine and repetitive tasks like generating boilerplate code or suggesting code snippets, functions, or even entire classes,โ Wang says. โThey accelerate rapid prototyping, exploratory design, and experimental coding, turning initial ideas into tangible code much faster.โ
Then, there are all the practical tasks AI can achieve for developers outside the actual code. Spencer Kimball, CEO of Cockroach Labs, describes how their engineers often use AI for design scaffolding, fixing tests, observability data, and blogging. 70% of the time, thatโs not direct coding, but itโs giving back more time to developers to program, he says.
Where AI coding assistants fall short
In other situations, you may struggle to get AI working. Generative AI tools can falter when engineering goals go beyond a one-off function, arenโt well-specified, involve large-scale refactoring, or span entirely novel projects with complex requirements.
โYou can waste a lot of time and moneyโand literally lose codeโif you just let it do its own thing,โ says Layered Systemโs Swiber. This risk grows if youโre not reviewing outputs regularly or using version control.
Honeycombโs Charity mostly agrees: โAI is much better at generating greenfield code than it is at modifying or extending an existing code base.โ Exceptions include large language models trained on that precise task, she adds.
While AI accelerates development, it creates a new burden to review and validate the resulting code. โIn a worst-case scenario, the time and effort required to debug and fix subtle issues in AI-generated code could even eclipse the time it would require to write the code from scratch,โ says Sonarโs Wang.
Quality and security can suffer from vague prompts or poor contextual understanding, especially in large, complex code bases. Transformer-based models also face limitations with token windows, making it harder to grasp projects with many parts or domain-specific constraints.
โWeโve seen cases where AI outputs are syntactically correct but contain logical errors or subtle bugs,โ Wang notes. These mistakes originate from a โblack boxโ process, he says, making AI risky for mission-critical enterprise applications that require strict governance.
โEarly-stage projects benefit from AIโs flexibility, while mature code bases demand caution due to risks of context loss and integration conflicts,โ says Wang. Part of this is a lack of access to the proper context and data for the use case at hand.
Although Cockroach Labsโ Kimball acknowledges that AI coding tools are improving, the complexity of Cockroachโs massive code base still poses challenges for AI assistants. โThereโs way too much context,โ he says. Instead of attempting to load everything, he explains how you can stay productive by narrowing your focus to local context and related interfaces. โYou want to understand the things that are attached to the one file youโre looking at, and black box some of those things.โ
By treating parts of the system as abstractions, developers can work iteratively within a smaller scopeโa mindset Kimball says helps developers stay productive, even in complex systems like Cockroachโs.
What engineering leaders should know
โItโs no accident everyoneโs interested in AI, because itโs a paradigm shift on the same level of electrification or computerization,โ adds Kimball, who recently experimented hands-on with vibe coding using Model Context Protocol (MCP) servers wrapped around Cockroachโs APIs.
โAs a CEO, it gives you a bit of perspective on whatโs possible,โ Kimball says. โIf you can get a 30% boost in productivity, itโs like hiring 30 people.โ Although overspending on AI is a valid concern, the cost pales in comparison to hiring additional engineers, he says.
In fact, AI can give companies an edge. โDonโt worry about spending in the short termโfigure out how to use this stuff,โ says Kimball. โItโs much better to be a 500-person company than a 5,000-person company.โ To his point, new research from DX found that mid-size companies had the highest revenue per engineer compared to other company sizes.
Executives are hot on AI at the moment. Shopifyโs CEOโs AI mandate is anticipated to usher in similar decrees and affect hiring. But while AI fervor mounts, the onus is on leaders to understand the limitations of AI and begin delineating boundaries.
Deploying AI willy-nilly can quickly lead to frustrating outcomesโlike a model getting itself stuck in a recursive loop of failed tests, says Swiber. โYou canโt just set these things off and let them go. You need to monitor what theyโre doing.โ
Leaders canโt afford to sit on their laurels, either. The fact is, developers will use generative AI regardless of whether they have approval yet. 64% of software developers who use generative AI began using the technology before they were officially granted licenses to do so, according to a 2024 report from BlueOptima.
Both developers and leadership should gain familiarity with AI coding assistants to understand their strengths and weaknesses. This awareness will be critical to rolling them out effectively.
The worst the models will ever be
The challenge is that, given the rapid pace of change, AI discussions often become irrelevant in a few short monthsโฆ or even weeks. โAI coding assistants are changing rapidly, so anything we say about them probably has a short shelf life,โ says Majors.
The future capabilities of AI are hard to forecast. But more and more developers are bullish on its role in their day-to-day workflows and big picture goals. Salesforceโs latest State of IT survey found that 92% of developers expect agentic AI to advance their careers.
For Kimball, agentic AI will open countless doors and pose new threat vectors. โWeโre gonna start going from billions to tens of billions to hundreds of billions, maybe even trillions of active things out there that are ultimately hitting APIs more than ever.โ
At the enterprise level, the industry must start considering data sovereignty, he adds, because regional data restrictions are rising and agentic AI will lower the threshold for data access. Ultimately, data providers will have to satisfy these regulations and learn how to appropriately secure their data.
Context window limitsโthe amount of text that a model can consider at onceโare whatโs really holding back LLMs, but theyโre constantly improving. What happens when context windows reach millions or hundreds of millions of tokens? Many of the issues surrounding AI in large code bases could evaporate.
As it stands now, issues still present themselves when working with LLMs for different coding tasks, requiring keen insight on when (and how) to use them wisely. Yet, as Kimball reminds us, AI coding tools are improving exponentially, and weโre only at the beginning.
โThe future of software is AI,โ he says. โThis is the worst the models are ever going to be.โ


