Rui Wang

X / Google Scholar / LinkedIn / Email

I'm a Research Lead at Yutori, where I work on computer-use models and autonomous AI agents.

Recently, I've been driving the development of Yutori Navigator n1.5 and n1.

Previously, I led post-training for Llama 3 Multimodal at Meta, and co-led Emu, Meta's text-to-image foundation model.

Research interests: large language models, multimodal learning, AI agents, and reinforcement learning.

The digital world is expanding faster than the human capacity to operate it.

At Yutori, we're building AI systems that can autonomously operate across heterogeneous digital environments — including GUIs on computers, browsers, and mobile devices, alongside APIs, CLIs, and code execution environments — dynamically using the most effective primitives available to accomplish a task.

Please feel free to reach out if you're excited about this direction.

Selected Projects

2026 MayIntroducing Navigator n1.5: The Most Capable Computer Use Model for the Web. [blog]
2026 JanIntroducing n1: Yutori's Browser-Use Model. [blog]
2025 NovIntroducing Navigator. [blog]
2024 JulThe Llama 3 Herd of Models. [paper]
2023 SepEmu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack. [paper]