BrowserBC

Scalable Behaviour Cloning for Browser Use via Skill Distillation.

BrowserBC distills successful human browser trajectories into reusable skills, helping web agents solve tasks with higher success rates and fewer interactions.

Core Idea

Human browsing trajectories contain more than clicks. They encode efficient paths, site-specific logic, and practical decisions that are hard for agents to infer from the current page alone.

BrowserBC turns these trajectories into reusable skills. The skills give agents prior knowledge for acting under incomplete information, so they move from merely operating websites to operating them efficiently.

Method

BrowserBC extracts task evidence from human runs, summarizes it into skills, and retrieves relevant skills during new browser tasks.

Figure: BrowserBC method overview, from raw trajectories through evidence extraction and skill distillation to skill graph management.
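
The retrieval step described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Skill` schema, the `retrieve_skills` helper, the example library, and the token-overlap (Jaccard) scoring are hypothetical and are not taken from BrowserBC, which may represent and match skills quite differently.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Skill:
    """A distilled piece of site knowledge (hypothetical schema, not BrowserBC's)."""
    site: str
    description: str
    steps: tuple[str, ...]


def tokenize(text: str) -> set[str]:
    """Lowercase alphanumeric tokens; a stand-in for a real text matcher."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve_skills(task: str, site: str, library: list[Skill], top_k: int = 2) -> list[Skill]:
    """Return up to top_k same-site skills ranked by Jaccard overlap with the task."""
    task_tokens = tokenize(task)
    scored = []
    for skill in library:
        if skill.site != site:
            continue  # only consider skills distilled for the current site
        skill_tokens = tokenize(skill.description)
        union = task_tokens | skill_tokens
        score = len(task_tokens & skill_tokens) / len(union) if union else 0.0
        if score > 0:
            scored.append((score, skill))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [skill for _, skill in scored[:top_k]]


# Illustrative library; these entries are invented for the example.
LIBRARY = [
    Skill("gitlab", "Find open merge requests assigned to a user",
          ("Open the Merge requests page", "Filter by assignee")),
    Skill("gitlab", "Create a new issue with a label",
          ("Open the Issues page", "Click New issue", "Pick the label")),
    Skill("shopping", "Find the cheapest product in a category",
          ("Open the category page", "Sort by price ascending")),
]

if __name__ == "__main__":
    for skill in retrieve_skills("List merge requests assigned to me", "gitlab", LIBRARY):
        print(skill.description, "->", "; ".join(skill.steps))
```

In this sketch the retrieved steps would be handed to the agent as extra context before it acts. A production system would more likely use embeddings or a skill graph lookup than raw token overlap.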

Results

BrowserBC improves both success rate and interaction efficiency across WebArena-Hard and ClawBench.

Benchmark                    Group            Skill-off   BrowserBC   Gain
WebArena-Hard (258 tasks)    Overall          60.5        81.4        +20.9
                             GitLab           64.9        86.0        +21.1
                             Shopping         60.7        89.3        +28.6
                             Shopping admin   56.4        70.9        +14.5
                             Reddit           78.6        85.7        +7.1
                             Multi-site       43.8        75.0        +31.2
ClawBench (152 tasks)        Overall          32.9        68.4        +35.5
                             Daily            24.6        64.9        +40.3
                             Finance          50.0        100.0       +50.0
                             Work             47.1        76.5        +29.4
                             Dev              33.3        66.7        +33.4
                             Academic         50.0        78.6        +28.6
                             Travel           38.5        76.9        +38.4
                             Social           25.0        56.2        +31.2
                             Pets             27.3        54.5        +27.2

Efficiency

On WebArena-Hard, mean tool calls drop from 31.2 to 22.7 and median tool calls drop from 24 to 16. Sonnet-distilled skills also lift Qwen from 53% to 77%.

Figure: BrowserBC improves interaction efficiency by reducing average and median steps across WebArena-Hard task groups.
Figure: Skill distillation transfers across executors, with Sonnet-distilled skills improving both Qwen and Sonnet agents.
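
To put the tool-call numbers in this section in relative terms, the short calculation below converts the reported before/after values into percentage reductions. The helper name `relative_reduction` is introduced here for illustration only; the inputs are the figures quoted in this section.

```python
def relative_reduction(before: float, after: float) -> float:
    """Percentage drop from `before` to `after`."""
    return (before - after) / before * 100.0


# Reported WebArena-Hard tool-call figures.
mean_cut = relative_reduction(31.2, 22.7)
median_cut = relative_reduction(24, 16)

# Qwen with Sonnet-distilled skills: absolute gain in percentage points.
qwen_gain_points = 77 - 53

if __name__ == "__main__":
    print(f"mean tool calls:   -{mean_cut:.1f}%")    # -27.2%
    print(f"median tool calls: -{median_cut:.1f}%")  # -33.3%
    print(f"Qwen gain: +{qwen_gain_points} points")  # +24 points
```

Under these figures, the mean drops by roughly 27% and the median by roughly a third.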

Live Case Demos

Watch skill-guided agents complete real browser tasks.