Gemini 2.5 Pro vs GPT 5.4
tree_0016 · Software Developers, Quality Assurance Analysts, and Testers : Occupational Outlook Handbook: : U.S. Bureau of Labor Statistics
Timeline
Arrow keys or j/k move between rounds.
Round Context
Software Developers, Quality Assurance Analysts, and Testers / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics
Computer and Information Technology Occupations / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics
Within the broad category of computer and information technology occupations in the United States, identify all occupations that (1) typically require a bachelor’s degree for entry and (2) had a 2024 median annual wage higher than the overall median wage for this occupational group (which was $105,990). For each qualifying occupation, provide its official occupation title, a brief summary of its primary job duties, and its 2024 median annual pay.
Answer length: 200-300 words.
Show hidden checklists
- Computer Network Architects – requires bachelor’s degree and median pay exceeds $105,990 group median
- Database Administrators and Architects – requires bachelor’s degree and median pay exceeds $105,990 group median
- Information Security Analysts – requires bachelor’s degree and median pay exceeds $105,990 group median
- Software Developers, Quality Assurance Analysts, and Testers – requires bachelor’s degree and median pay exceeds $105,990 group median
- Computer Network Architects – brief job summary included
- Computer Network Architects – 2024 median pay of $130,390
- Database Administrators and Architects – brief job summary included
- Database Administrators and Architects – 2024 median pay of $123,100
- Information Security Analysts – brief job summary included
- Information Security Analysts – 2024 median pay of $124,910
- Software Developers, Quality Assurance Analysts, and Testers – brief job summary included
- Software Developers, Quality Assurance Analysts, and Testers – 2024 median pay of $131,450
The question uses logical filtering (Deep) by requiring identification of occupations within a broader technology category that simultaneously meet two criteria: a specific education level and a wage threshold relative to the group median. This forces the agent to interpret comparative wage data and entry-level education requirements. It then requires aggregation (Wide) by asking for job summaries and median wages for every qualifying occupation, ensuring the responder compiles details across multiple distinct occupation profiles rather than relying on a single source entry.
Judgment
First, Deep Logic: Both agents fail. The correct qualifying occupations (per checklist) are: Computer Network Architects; Database Administrators and Architects; Information Security Analysts; and Software Developers, Quality Assurance Analysts, and Testers (with specific 2024 median wages). Agent A includes an incorrect occupation (Computer and Information Systems Managers), uses May 2023 data instead of 2024 medians, and provides incorrect wage figures. Agent B omits Database Administrators and Architects entirely and instead includes Computer and Information Research Scientists (not on the checklist), with multiple incorrect 2024 wage values. Second, Width/Completeness: Agent A at least mentions Database Administrators and Architects and all four core categories conceptually, but uses wrong wage figures and incorrect year. Agent B completely misses one required occupation and provides incorrect wage data for others. Both therefore fail the WIDTH checklist significantly. Finally, Presentation & UX: Both are well-formatted, scannable, and clearly structured with citations. However, since both contain major factual inaccuracies (wrong entities and wrong median pay values), accuracy failures outweigh formatting strengths. Because both responses substantially fail the Deep Logic and Wide detail requirements, this is a Low Quality Tie.
Gemini 2.5 Pro
GPT 5.4
OpenAI