Don’t sleep on Cohere: Command A Reasoning, its first reasoning model, is built for enterprise customer service and more

by | Aug 21, 2025 | Technology

Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now

I was in more meetings than usual today so I just caught up to the fact that Cohere, the Canadian startup geared co-founded by former Transformer paper author Aidan Gomez toward making generative AI products work easily, powerfully, and securely for enterprises, has released its first reasoning large language model (LLM), Command A Reasoning.

It looks to be a strong release. Benchmarks, technical specs, and early tests suggest the model delivers on flexibility, efficiency, and raw reasoning power.

Customer service, market research, scheduling, data analysis are some of the tasks Cohere says it’s built to handle automatically at scale inside secure enterprise environments.

It is a text-only model, however, but it should be easy enough to hook up to multimodal models and tools. In fact, tool use is one of its primary selling points.

AI Scaling Hits Its Limits

Power caps, rising token costs, and inference delays are reshaping enterprise AI. Join our exclusive salon to discover how top teams are:

Turning energy into a strategic advantage

Architecting efficient inference for real throughput gains

Unlocking competitive ROI with sustainable AI systems

Secure your spot to stay ahead: https://bit.ly/4mwGngO

While it’s open for researchers to use for non-commercial purposes, enterprises will need to pay Cohere to get access and the company doesn’t publicly list its pricing because it says it makes bespoke customization and private deployment.

Cohere was valued at $6.8 billion when it announced its latest funding round of $500 million a week and a day ago.

Tuned for enterprises

Command A Reasoning is tuned for enterprises with sprawling document libraries, long email chains, and workflows that can’t afford hallucinations.

It supports up to 256,000 tokens on multi-GPU setups, a decent size and comparable to OpenAI’s GPT-5.

The research release weighs in at 111-billion parameters, trained with tool-use and multilingual performance in mind.

It supports 23 languages out of the box, including English, French, Spanish, Japanese, Arabic, and Hindi. That multilingual depth is key for global enterprises that need consistent agent quality across markets.

The model slots directly into North, Cohere’s new platform for deploying AI agents and automations on-premises.

That means enterprises can spin up custom agents that live entirely within their infrastructure, giving them control over data flows while still tapping into advanced reasoning.

Cohere looks like it’s thought cleverly to identify some of the recurring functions across enterprises — onboarding, market research and analysis, development — and trained its model to support its agentic workflows for handling these automatically.

Controlled thinking

As with many other recent reasoning releases including Nvidia’s new Nemotron-Nano-9B-v2, Command A Reasoning introduces a token budget feature to let users or developers specify how much reasoning to allocate to specific inputs and tasks. Less bu …

Article Attribution | Read More at Article Source