Tiny startup Arcee AI built a 400B open source LLM from scratch to best Meta’s Llama

Jan 28, 2026 | Technology

Many in the industry think the winners of the AI model market have already been decided: Big Tech (Google, Meta, Microsoft, and a bit of Amazon) will own it, along with its model makers of choice, chiefly OpenAI and Anthropic.

But tiny 30-person startup Arcee AI disagrees. The company just released a truly and permanently open (Apache-licensed), general-purpose foundation model called Trinity, and Arcee claims that at 400B parameters, it is among the largest open-source foundation models ever trained and released by a U.S. company.

Arcee says Trinity is comparable to Meta’s Llama 4 Maverick 400B and to Z.ai’s GLM-4.5, a high-performing open-source model from a Chinese company spun out of Tsinghua University, according to benchmark tests run on the base models (i.e., with very little post-training).

Arcee AI benchmarks for its Trinity large LLM (preview version, base model). Image Credits: Arcee

Like other state-of-the-art (SOTA) models, Trinity is geared toward coding and multi-step processes such as agents. Still, despite its size, it’s not a true SOTA competitor yet, because it currently supports only text.

More modalities are in the works: a vision model is currently in development, and a speech-to-text version is on the roadmap, CTO Lucas Atkins (pictured above, on the left) told TechCrunch. By comparison, Meta’s Llama 4 Maverick is already multimodal, supporting text and images.

But before adding more modalities to its roster, Arcee says, it wanted a base LLM that would impress its main target customers: developers and academics. The team particularly wants to woo U.S. companies of all sizes away from choosing open models from China.

“Ultimately, the winners of this game, and the only way to really win over the usage, is to have the best open-weight model,” Atkins said. “To win the hearts and minds of developers, you have to give them the best.”

The benchmarks show that the Trinity base model, currently in preview while more post-training takes place, is largely holding its own against Llama 4 Maverick, and in some cases slightly besting it, on tests of coding and math, common sense, knowledge, and reasoning.

The progress Arcee has made so far toward becoming a competitive AI lab is impressive. The large Trinity model follows two smaller models released in December: the 26B-parameter Trinity Mini, a fully post-trained reasoning model for tasks ranging from web apps to agents, and the 6B-parameter Trinity Nano, an experimental model designed to push the boundaries of models that are tiny yet chatty.

The kicker: Arcee trained them all in six months for $20 million total, using 2,048 Nvidia Blackwell B300 GPUs. That’s out of the roughly $50 million the company has raised so far, said founder and CEO Mark McQuade (pictured above, on the right).

That kind of cash was “a lot for us,” said Atkins, who led the model-building effort. Still, he acknowledged that it pales in comparison to what bigger labs are spending right now.

The six-month timeline “was very calculated,” said Atkins, whose career before LLMs involved building voice agents for cars. “We are a younger startup that’s extremely hungry. We have a tremendous amount of talent and bright young researchers who, when given the opportunity to spend this amount of money and train a model of this size, we trusted that they’d rise to the occasion. And they certainly did, with many sleepless nights, many long hours.” 

McQuade, previously an early employee at the open-source model marketplace Hugging Face, says Arcee didn’t start out wanting to become a new U.S. AI lab: The company was originally doing model customization for large enterprise clients like SK Telecom.

“We were only doing post-training. So we would take the great work of others: We would take a Llama model, we would take a Mistral model, we would take a Qwen model that was open source, and we would post-train it to make it better” for a company’s intended use, he said, including doing the reinforcement learning. 
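For readers unfamiliar with the jargon: “post-training” means taking an already-pretrained, open-weight base model and continuing to train it on task- or customer-specific data. The sketch below is not Arcee’s actual pipeline, just a minimal illustration of that workflow using the Hugging Face transformers library; the base model name and the client_corpus.jsonl dataset are placeholders assumed for the example.

```python
# Minimal post-training (supervised fine-tuning) sketch -- NOT Arcee's pipeline.
# Assumptions: any open-weight causal LM as the base, plus a hypothetical
# client_corpus.jsonl file with one {"text": ...} record per line.
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

BASE = "mistralai/Mistral-7B-v0.1"  # placeholder open-weight base model

tokenizer = AutoTokenizer.from_pretrained(BASE)
tokenizer.pad_token = tokenizer.pad_token or tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(BASE)

# Hypothetical customer data: the kind of domain corpus a client supplies.
dataset = load_dataset("json", data_files="client_corpus.jsonl", split="train")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

dataset = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="post_trained",
        per_device_train_batch_size=1,
        num_train_epochs=1,
    ),
    train_dataset=dataset,
    # mlm=False yields standard next-token (causal LM) labels.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
trainer.save_model("post_trained")
```

The reinforcement learning step McQuade mentions (for example, preference tuning) would come after a supervised pass like this one and is omitted here.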

But as the client list grew, Atkins said, a model of their own became a necessity, and McQuade was worried about relying on other companies. At the same time, many of the best open models were coming from China, which U.S. enterprises were leery of, or were outright barred from using.

It was a nerve-wracking decision. “I think there’s less than 20 companies in the world that …
