Microsoft’s Fara-7B is a computer-use AI agent that rivals GPT-4o and works directly on your PC

Nov 24, 2025 | Technology

Microsoft has introduced Fara-7B, a new 7-billion parameter model designed to act as a Computer Use Agent (CUA) capable of performing complex tasks directly on a user’s device. Fara-7B sets new state-of-the-art results for its size, providing a way to build AI agents that don’t rely on massive, cloud-dependent models and can run on compact systems with lower latency and enhanced privacy.

While the model is an experimental release, its architecture addresses a primary barrier to enterprise adoption: data security. Because Fara-7B is small enough to run locally, it allows users to automate sensitive workflows, such as managing internal accounts or processing sensitive company data, without that information ever leaving the device.

How Fara-7B sees the web

Fara-7B is designed to navigate user interfaces using the same tools a human does: a mouse and keyboard. The model operates by visually perceiving a web page through screenshots and predicting specific coordinates for actions like clicking, typing, and scrolling.

Crucially, Fara-7B does not rely on “accessibility trees,” the underlying code structure that browsers use to describe web pages to screen readers. Instead, it relies solely on pixel-level visual data. This approach allows the agent to interact with websites even when the underlying code is obfuscated or complex.

According to Yash Lara, Senior PM Lead at Microsoft Research, processing all visual input on-device creates true “pixel sovereignty,” since screenshots and the reasoning needed for automation remain on the user’s device. “This approach helps organizations meet strict requirements in regulated sectors, including HIPAA and GLBA,” he told VentureBeat in written comments.

In benchmarking tests, this …
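
For readers who want a concrete picture of the screenshot-in, action-out loop described under "How Fara-7B sees the web," here is a minimal sketch in Python. It is not Fara-7B's actual interface: every name in it (`Click`, `TypeText`, `Scroll`, `Done`, `FakeBrowser`, `run_agent`, `scripted_policy`) is hypothetical, the "browser" only records actions, and a scripted function stands in for the model. It shows only the general shape of an agent that sees pixels and emits mouse and keyboard actions.

```python
from dataclasses import dataclass
from typing import Callable, List, Union


# Hypothetical action types mirroring the article's "clicking, typing, and scrolling".
@dataclass
class Click:
    x: int  # pixel coordinates predicted from the screenshot
    y: int


@dataclass
class TypeText:
    text: str


@dataclass
class Scroll:
    dy: int  # vertical scroll distance in pixels


@dataclass
class Done:
    pass


Action = Union[Click, TypeText, Scroll, Done]
# Assumed policy signature: (task, screenshot bytes) -> next action.
Policy = Callable[[str, bytes], Action]


class FakeBrowser:
    """Stand-in for a real browser; records actions instead of performing them."""

    def __init__(self) -> None:
        self.log: List[Action] = []

    def screenshot(self) -> bytes:
        # A real agent would capture pixels here; this fake encodes the step count.
        return f"frame-{len(self.log)}".encode()

    def apply(self, action: Action) -> None:
        self.log.append(action)


def run_agent(task: str, policy: Policy, browser: FakeBrowser,
              max_steps: int = 10) -> List[Action]:
    """Observe a screenshot, ask the policy for one action, apply it, repeat."""
    for _ in range(max_steps):
        action = policy(task, browser.screenshot())
        if isinstance(action, Done):
            break
        browser.apply(action)
    return browser.log


def scripted_policy(task: str, screenshot: bytes) -> Action:
    """Toy policy that replays a fixed script keyed on the fake frame id."""
    script = {
        b"frame-0": Click(x=320, y=48),   # e.g. focus a search box
        b"frame-1": TypeText(text=task),  # type the query
        b"frame-2": Scroll(dy=400),       # scroll to see results
    }
    return script.get(screenshot, Done())
```

Calling `run_agent("wireless headphones", scripted_policy, FakeBrowser())` returns the three recorded actions in order. Note that the policy receives only the task text and image bytes, never a DOM or accessibility tree, which is the property the article highlights.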
