Gemini 2.5 Pro vs Sonar Pro
tree_0016 · Software Developers, Quality Assurance Analysts, and Testers : Occupational Outlook Handbook: : U.S. Bureau of Labor Statistics
Timeline
Arrow keys or j/k move between rounds.
Round Context
Software Developers, Quality Assurance Analysts, and Testers / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics
Computer and Information Systems Managers / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics
According to the U.S. Bureau of Labor Statistics' Occupational Outlook Handbook (using 2024–34 projection data), identify the specific management occupation that reported a 2024 median annual wage of $171,200 and the software-focused occupation that reported a 2024 median annual wage of $131,450. For both identified roles, specify the exact 'Work Experience in a Related Occupation' requirement listed in their Quick Facts. Additionally, name the specific 'Field of Degree' profile available in the Handbook that aggregates workforce data for the academic major primarily associated with these careers.
Answer length: 200-300 words.
Show hidden checklists
- Management Entity: Computer and Information Systems Managers (Identified via $171,200 pay)
- Software Entity: Software Developers, Quality Assurance Analysts, and Testers (Identified via $131,450 pay)
- Field of Degree Logic: Matches the 'Computer and information technology' major/field linked to these roles
- Work Experience for Management Role: 5 years or more
- Work Experience for Software Role: None
- Field of Degree Profile Name: Computer and information technology
The question uses specific, unique numerical identifiers (2024 median wages) to mask the target entities, forcing the agent to perform 'Deep' reasoning to identify the correct OOH profiles. It then requires 'Wide' information aggregation by demanding specific attributes (Work Experience) from two separate occupation pages and linking them to a third distinct page (Field of Degree) based on the shared subject matter.
Judgment
Both agents correctly identified the core occupations based on the provided wage data. However, Agent B followed the Ground Truth Checklist more accurately regarding the 'Field of Degree' profile name. The checklist specifies 'Computer and information technology', which Agent B provided correctly. Agent A incorrectly named it 'Computer and information sciences' (which is the CIP family name, not the specific OOH profile title). Additionally, Agent B utilized superior formatting by bolding all key constraints (wages, experience requirements, and profile names), making the response significantly more scannable and user-friendly compared to Agent A's standard paragraph format.
Gemini 2.5 Pro
Sonar Pro
Perplexity