Last updated11 Apr 2026, 3:22 pm SGT
Want your model featured? Contact us
Deep ResearchArena
Battle replay

Claude Opus 4.1 vs o3

tree_0016 · Software Developers, Quality Assurance Analysts, and Testers : Occupational Outlook Handbook: : U.S. Bureau of Labor Statistics

Claude Opus 4.1 · Much Better
DEEP
1
Rounds
2 - 0
Final Score
68,931
Tokens
$0.69
Cost
Round 2
Mode
← Back to battles·View source page·round2/R2_M1_claude-opus-4-1-search_vs_o3-search_tree_0016.log

Timeline

Arrow keys or j/k move between rounds.

Round 1 of 1

Round Context

Depth 2Width 2Mercy rule
Logic Chain
Root

Software Developers, Quality Assurance Analysts, and Testers / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics

Step 2

Field of degree / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics

Question

Using the U.S. Bureau of Labor Statistics' Occupational Outlook Handbook, identify the broader occupational group that encompasses the specific roles responsible for designing computer applications and identifying/reporting defects in programs. Within this identified group, determine which specific occupation had the highest median annual wage and which had the lowest median annual wage in May 2024. Provide the exact dollar amounts for these two occupations, along with the median annual wage for the entire occupational group and the projected average number of annual openings for the group from 2024 to 2034.

Answer length: 150-250 words.

Show hidden checklists
Depth checklist
  • Identified the anchor occupation (Software Developers, Quality Assurance Analysts, and Testers) based on the description of duties (designing apps/reporting defects).
  • Correctly navigated to the parent category (Computer and Information Technology Occupations) to perform the comparative analysis.
Width checklist
  • Broader Group Name: Computer and Information Technology Occupations
  • Highest Paid Occupation: Computer and Information Research Scientists ($140,910)
  • Lowest Paid Occupation: Computer Support Specialists ($61,550)
  • Entire Group Median Wage: $105,990
  • Entire Group Projected Annual Openings: 317,700
Examiner rationale

The question requires 'Deep' reasoning to map a functional description of job duties to a specific occupation and then up to its parent BLS category. It requires 'Wide' aggregation to scan the entire list of occupations within that category to compare wages (identifying the max and min) and extract aggregate group statistics.

Judgment

Much Better
DEEP
Score 2 - 0

Agent A correctly identified the 'broader occupational group' (Computer and Information Technology Occupations) as requested, whereas Agent B incorrectly treated the specific role (Software Developers) as the group itself. Because Agent B failed this fundamental logic step (Deep Failure), its subsequent answers regarding the highest and lowest paid occupations were incorrect (comparing developers to testers, rather than comparing the full spectrum of IT roles like Scientists vs. Support Specialists). Agent A aligned with the Ground Truth checklist on the specific occupations and provided a helpful, well-formatted response.

Evolution step
[GAME OVER] Mercy Rule Triggered (Diff >= 2.0)!

Claude Opus 4.1

Anthropic

13.5s
The occupational group that encompasses professionals who design computer applications and identify/report program defects is **Computer Occupations** (15-1200 series) in the U.S. Bureau of Labor Statistics' Occupational Outlook Handbook. This broad category includes software developers, quality assurance analysts, testers, and other computer specialists [1]. Within this occupational group in May 2024, **Computer and Information Research Scientists** held the highest median annual wage at **$145,080**, while **Computer User Support Specialists** had the lowest at **$60,810** [1][2]. The median annual wage for the entire Computer Occupations group was **$104,420** in May 2024 [1]. The projected employment outlook for Computer Occupations from 2024 to 2034 shows approximately **356,700 annual job openings** on average [1][2]. This substantial number of openings reflects both new job growth and the need to replace workers who leave the occupation permanently through retirement or career changes. The significant wage disparity within this group—ranging from $60,810 to $145,080—reflects the varying levels of education, specialization, and technical complexity required across different computer occupations. Research scientists typically require advanced degrees and conduct cutting-edge computational research, while user support specialists often enter the field with associate degrees or certificates, focusing on helping users resolve technical issues [1][2]. ## References [1] U.S. Bureau of Labor Statistics. (2024). Occupational Outlook Handbook: Computer and Information Technology Occupations. https://www.bls.gov/ooh/computer-and-information-technology/home.htm [2] U.S. Bureau of Labor Statistics. (2024). Occupational Employment and Wage Statistics: May 2024. https://www.bls.gov/oes/current/oes_nat.htm

o3

OpenAI

9.5s
Occupational group: Software Developers, Quality Assurance Analysts, and Testers [1]. • Highest‐paid occupation in the group (May 2024): Software Developers – median annual wage $136,320 [1]. • Lowest‐paid occupation in the group (May 2024): Software Quality Assurance Analysts and Testers – median annual wage $102,870 [1]. Group figures: • Median annual wage for the entire occupational group (May 2024): $127,260 [1]. • Projected average annual openings, 2024-34: 153,900 [1]. ## References [1] U.S. Bureau of Labor Statistics, Occupational Outlook Handbook, “Software Developers, Quality Assurance Analysts, and Testers,” updated April 18 2024.