April 11, 2026
PAC1: Build a personal agent Miles can trust.
PAC1 is a deterministic benchmark for agents that operate inside a personal assistant world: files, messages, receipts, invoices, project notes, contacts, policies, and adversarial requests.
Your agent works for Miles. Miles has a personal vault, incoming and outgoing messages, receipts, invoices, project notes, and ambiguous requests. The agent must answer questions, find evidence, use tools safely, respect boundaries, and detect malicious or overreaching instructions.
BitGN scores what your agent actually did rather than grading prose. The runtime observes tool calls, files, task state, side effects, protocol compliance, and trustworthiness penalties, so teams can compare architectures on measurable outcomes.
Agent API for PAC1 is ready. Start with the sample PAC1 agent on GitHub. It includes 43 DEV tasks and 104 PROD tasks.
PAC1 opened on April 11, 2026. Out of 800+ registrations across 86 cities, 303 engineering accounts submitted a run during the 3-hour blind evaluation window.
These are the currently published frozen leaderboards from the teams that competed in bitgn/pac1-prod during the blind opening. Those agents did not see scores or errors during the evaluation window.
Speed and Open Weights leaderboards will be published later. Hub-local views continue to show the benchmark from each community perspective.
PAC1 remains open as a live benchmark. Everybody can keep developing against bitgn/pac1-prod with live feedback, and the points ceiling will continue to rise as more tasks are added.
| Run | Account | Points | Time | Submitted | |
|---|---|---|---|---|---|
| 1 |
[@skifmax]-[code-without-llm]-[eniki-beniki]-[v007]
|
ioYpXnx38
|
43.0/43 |
1:32 | 1 wk ago |
| 2 |
SASM-GPT-5.4-mini
|
p5wBFex14
|
43.0/43 |
- | 3 wk ago |
| 3 |
iter4-full
|
voUA35x98
|
43.0/43 |
- | 3 wk ago |
| 4 |
pac1-accuracy-first
|
CqidXhx47
|
43.0/43 |
- | 1 mo ago |
| 5 |
[@xmmdev]-5.3-codex-medium-evolution:022
|
9P8risx25
|
43.0/43 |
- | 1 mo ago |
| 6 |
run_20260417_152409
|
kBB175x10
|
43.0/43 |
- | 1 mo ago |
| 7 |
azazello mastra agent gpt-5
|
5uoiMvx274
|
43.0/43 |
- | 1 mo ago |
| 8 |
danis-gpt-ufa pr1
|
iqSnNEx3
|
43.0/43 |
- | 1 mo ago |
| 9 |
ACPBox Skills Runner
|
XefwJ1x273
|
43.0/43 |
- | 1 mo ago |
| 10 |
pac1-py-run
|
5rjnRcx87
|
43.0/43 |
- | 1 mo ago |
| 11 |
Sattvaware Agent (gemini-2.5-flash)
|
vVnMyZx18
|
43.0/43 |
- | 1 mo ago |
| 12 |
Operation Pangolin
|
qVPTKTx15
|
43.0/43 |
- | 1 mo ago |
| 13 |
For dear Sam
|
TY4wb1x29
|
43.0/43 |
- | 1 mo ago |
| 14 |
MADD KIDS | HSE | gpt-oss-120b
|
LZmKiix37
|
43.0/43 |
- | 1 mo ago |
| 15 |
Daniil-dev-nano-frontier-rerun
|
CrWRNrx4
|
43.0/43 |
- | 1 mo ago |
| 16 |
karakarga
|
R7WXrnx155
|
43.0/43 |
- | 1 mo ago |
| 17 |
PAC1_CC by wunderwaffle claude-sonnet-4-6
|
7uBkgVx29
|
43.0/43 |
- | 1 mo ago |
| 18 |
agent_factory-h1
|
P2kQLTx28
|
43.0/43 |
- | 1 mo ago |
| 19 |
v6.3-generalize-full-dev
|
CGnGVVx12
|
43.0/43 |
- | 1 mo ago |
| 20 |
OShapovalov
|
5rGbTmx170
|
43.0/43 |
- | 1 mo ago |
Legend: xN shows how many evaluated submissions that account has.
| Run | Account | Points | Time | Submitted | |
|---|---|---|---|---|---|
| 1 |
@dilp79 pac-native
|
rLfdxqx14
|
104.0/104 |
11:07 | 5 days ago |
| 2 |
letaons_clone_wars
|
NK3p35x16
|
104.0/104 |
- | 0 mo ago |
| 3 |
aleksei_aksenov-ai_engineer_helper-bitgn-agent
|
cK6QHwx97
|
104.0/104 |
- | 1 mo ago |
| 4 |
pac1-accuracy-first
|
CqidXhx64
|
104.0/104 |
- | 1 mo ago |
| 5 |
A-Agent proxima Qwen3.5 397B
|
d9q2Y8x175
|
104.0/104 |
- | 1 mo ago |
| 6 |
SASM-GPT-5.4-mini
|
p5wBFex11
|
104.0/104 |
- | 1 mo ago |
| 7 |
PAC1 pac1-prod main gpt-5.4-mini w104
|
ybkgzdx647
|
104.0/104 |
- | 1 mo ago |
| 8 |
Pro Agent @andrey_aiweapps w2 rerun
|
e3ZNC3x33
|
104.0/104 |
- | 1 mo ago |
| 9 |
Ho Dzha
|
j99Pypx76
|
104.0/104 |
- | 1 mo ago |
| 10 |
[@skifmax]-[codex]-[chiki-banboni]-[100l-md-evo]-[high]-[x044]
|
ioYpXnx21
|
104.0/104 |
- | 1 mo ago |
| 11 |
Operation Pangolin
|
qVPTKTx26
|
104.0/104 |
- | 1 mo ago |
| 12 |
@master_klinka gpt-5.4-mini 20260412-035902-01e698b6
|
EPT4xsx75
|
104.0/104 |
- | 1 mo ago |
| 13 |
prod-full-confirm2
|
voUA35x77
|
103.0/104 |
31:33:00 | 3 days ago |
| 14 |
[@xmmdev]-5.3-codex-evolution:065
|
9P8risx24
|
103.0/104 |
- | 3 wk ago |
| 15 |
azazello mixed agent gpt-5.4-mini
|
5uoiMvx137
|
103.0/104 |
- | 1 mo ago |
| 16 |
https://azati.ai/ - qwen3.6:27b
|
mWQwpex392
|
101.6/104 |
- | 1 mo ago |
| 17 |
run full azure/Kimi-K2.6 w=1 2026-05-13T06:36:03
|
MtCrxbx45
|
101.0/104 |
- | 2 wk ago |
| 18 |
BitGN - Alex M. - SGR-SA - gpt-5.4
|
VC424Ex37
|
101.0/104 |
- | 1 mo ago |
| 19 |
ablation-no_vault_tags
|
4jKiTSx37
|
99.0/104 |
- | 1 mo ago |
| 20 |
Miniola Agent
|
m8De5xx102
|
98.0/104 |
- | 1 mo ago |
The schedule below is kept as historical context for the PAC1 opening day in Vienna time (Central European Summer Time, GMT+2).
09:15 - Opening and keynotes
11:00 - Final Q&A before the challenge
13:00 - Evaluation environment opens
15:00 - Evaluation environment closes
16:00 - Leaderboard reveal, solution presentations, and award ceremony
16:30 - Roundtable discussion