Perplexity Releases Advanced Deep Research Upgrade, Open-Sources DRACO Benchmark

Perplexity has rolled out an advanced version of its Deep Research agent and claims it beats every other deep research tool on accuracy, usability, and reliability across all verticals.

The upgrade is available to Max subscribers first, who also get higher usage limits, and is rolling out gradually to Pro users. The company also open-sourced its DRACO (Deep Research Accuracy, Completeness, and Objectivity) benchmark, which evaluates real-world research capabilities across the finance, legal, medicine, technology, and science domains.

Announcing the rollout on X, CEO Aravind Srinivas said, “The advanced version of Perplexity Deep Research has achieved state-of-the-art performance on external and internal benchmarks, beating every other deep research tool on accuracy, usability, and reliability across all verticals.”

The update topped the Google DeepMind Deep Search QA leaderboard with 79.5% accuracy, he added.

“This update is available to all Max users right away, and is gradually rolling out to Pro users. To ensure a consistent experience, every Deep Research (Advanced) query will run using Opus 4.5 and the same agentic harness and toolkit. Max users get higher usage limits,” Srinivas said.

DRACO is an open-source benchmark designed to evaluate the performance of AI systems built for deep research. Unlike earlier academic or synthetic tests, DRACO focuses on practical user needs, measuring how well models handle complex, real-world research queries across domains.

At the core of the framework is an “LLM-as-judge” evaluation protocol, in which model responses are checked against verifiable, real data. Perplexity said this approach reduces subjectivity while improving consistency and factual grounding, addressing long-standing concerns about how research-oriented AI agents are assessed.
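
To illustrate the general idea of an “LLM-as-judge” protocol, here is a minimal Python sketch. It is not DRACO's actual implementation: the `Criterion` schema, the `score_response` function, and the sample rubric are all hypothetical, and the `keyword_judge` function is a simple string-matching stand-in for what would in practice be a call to a language model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Criterion:
    """One checkable statement a good answer should satisfy (hypothetical schema)."""
    description: str
    expected_fact: str  # reference data the judge checks the response against


# A judge takes (response, criterion) and returns True if the criterion is met.
Judge = Callable[[str, Criterion], bool]


def keyword_judge(response: str, criterion: Criterion) -> bool:
    """Stand-in for an LLM call: passes if the expected fact appears in the response."""
    return criterion.expected_fact.lower() in response.lower()


def score_response(response: str, rubric: List[Criterion], judge: Judge) -> float:
    """Return the fraction of rubric criteria the judge marks as satisfied."""
    if not rubric:
        raise ValueError("rubric must contain at least one criterion")
    passed = sum(1 for criterion in rubric if judge(response, criterion))
    return passed / len(rubric)


# Hypothetical rubric and answer, for illustration only.
rubric = [
    Criterion("States the leaderboard accuracy", "79.5%"),
    Criterion("Names the benchmark", "DRACO"),
]
answer = "Perplexity open-sourced DRACO and reported 79.5% accuracy."
score = score_response(answer, rubric, keyword_judge)
print(score)
```

Because the judge is passed in as a function, the same scoring loop could be reused with a model-backed judge, which is one way to keep such an evaluation model-agnostic.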

The benchmark is model-agnostic, allowing testing of any AI system with research capabilities. Early results suggest Perplexity’s own Deep Research product leads on both accuracy and speed, particularly in demanding areas such as legal analysis and personalised queries.

“Proud of the Perplexity team and the deep infrastructural improvements that are powering our Deep Research product. Excited for the year ahead: I’m sure there will be a lot of opportunities to continue to adapt our infrastructure and agents to the latest models,” Co-founder and CSO Johnny Ho posted on X.

By open-sourcing DRACO, Perplexity aims to push the industry towards more rigorous, production-grounded evaluation standards. The company hopes wider adoption of the benchmark will raise the bar for research agents and make performance claims across the AI sector easier to compare and verify, it said in a statement.
