The AI in two pilots aimed at replacing junior roles is described as being โnot an AI model that chats, but one that contributes.โ
OpenAIโs decision to train its models on the everyday outputs of consultants and bankers is more than just a technical experiment, itโs a signal that AI is being repositioned from a generic tool to a domain-capable resource, Sanchit Vir Gogia, the chief analyst, founder and CEO of Greyhound Research said Thursday.
He was responding to reports of a project from OpenAI that is being developed for the management consulting sector. According to Bloomberg, upwards of 150 former consultants from McKinsey & Co., Bain & Co., and Boston Consulting have been contracted by a third-party to train the models on โhow to do the industryโs entry-level tasks,โ in a project code-named Argentum.
Last week, it also reported that OpenAI has more than 100 ex-investment bankers from organizations such as JP Morgan Chase, Morgan Stanley, and Goldman Sachs โhelping train its artificial intelligence on how to build financial models as it looks to replace the hours of grunt work performed by junior bankers across the industry,โ in a project code-named Mercury.
PoC purgatory common
Enterprises, said Gogia, โare no longer content with wide but shallow capability. They want depth. Real outputs. Structured thinking. Work that they can plug into core processes. Our CIO Pulse 2025 shows this shift in action, with 68% of global decision-makers seeing AI as a co-worker rather than a cost-cutter.โ
โWhat OpenAI is doing is meeting that expectation head-on by learning from those whoโve done the work at scale,โ he pointed out. โThe aim isnโt to mimic language, but to internalize practice. And for enterprise buyers, that matters. Whatโs being offered now is not an AI that chats, but one that contributes.โ
Forrester VP Principal Analyst Craig Le Clair, whose coverage areas include AI, automation, and the future of work, said, โtoo many AI agent implementations today are struggling to return investment. The problem stems from poor integration of agent models with business workflows. Forrester estimates that 60% of enterprises fall into a PoC [proof of concept] purgatory, with only 15% of firms generating positive and material impact.โ
For example, he said, โa company could launch 30 pilots and fail to generate any ROI because all the agents just produce dense reports or insights that few act on. Forrester calls this the โAction Gapโ [and it] is the measurable distance between an AI-driven insight (a prediction, recommendation, or analysis) and the resulting value-driving business action.โ
A large gap, he said, โmeans your agents are generating insights but not a return on investment. OpenAIโs announcement is a clear attempt to reduce this.โ
According to Le Clair, author of the book Random Acts of Automation, the workplace is set to be disrupted by model-driven AI agents. โThey are intellectual peers that are (at least in their defined domains) just as smart as us, with intelligence evolving at a faster clip. This transition to a hybrid workforce is not well understood and expensive,โ he said.
He added, โif you think the biggest hurdle in deploying an AI agent is development, the API calls, or cloud consumption costs, youโd be wrong. Service companies tell us that for every dollar spent on AI agent licensing, organizations are spending nearly five dollars on services to get it running at scale, and those dollars are going to organizational changes [to enable a] shift in skills.โ
Forrester, said Le Clair, โhas defined an Agent Experience (AX) program that focuses on five skill categories reflecting these changes: knowledge curation; change management; critical thinking; interaction skills; and agent oversight.โ
Structure is everything
Greyhoundโs Gogia pointed out that even apparently smoothly running AI projects could be problematic. He said that one of his firmโs banking clients recently ran a generative AI trial to streamline its credit memo process. โThe results were promising: draft documents that once took days now appeared in hours,โ he noted. โBut when compliance teams reviewed the outputs, they found gaps, statements made without clear source validation.โ
Consequently, the project โdidnโt fail, but it didnโt launch either. Instead, the institution added a review framework and put legal, risk, and tech leaders in the same room to manage AI output holistically,โ he said.
He cited another case in which a consulting firm integrated AI into its analyst workflow for market sizing. โThe early gains in productivity were tangible, but so were the cultural growing pains,โ he noted. โ[Junior staff] werenโt sure where their role ended and the machineโs began.โ The firmโs response was to adapt its training model to focus not on how to use AI, but on how to supervise it.
โAcross these cases, the lesson was clear: the success of AI has less to do with capability and everything to do with structure,โGogia said.
He added that Greyhound views OpenAIโs latest moves โas the clearest signal yet that the company is anchoring itself inside the workflows that underpin real business delivery. By sourcing training inputs directly from former consultants and investment bankers, itโs doing more than chasing accuracy. Itโs absorbing judgment.โ




