GPT-5.1 vs Gemini 3.1 Pro
tree_0016 · Software Developers, Quality Assurance Analysts, and Testers : Occupational Outlook Handbook: : U.S. Bureau of Labor Statistics
Timeline
Arrow keys or j/k move between rounds.
Round Context
Software Developers, Quality Assurance Analysts, and Testers / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics
Field of degree / Occupational Outlook Handbook: / U.S. Bureau of Labor Statistics
An occupation is described as designing computer applications or programs, with overall employment projected to grow 15% from 2024 to 2034 and about 129,200 projected annual openings on average over the decade. This occupation typically requires a bachelor’s degree for entry. First, identify this specific occupation. Then, determine the broader occupational group that includes workers who create or support computer applications, systems, and networks. For both the specific occupation and its broader occupational group, report the 2024 median annual wage and the projected average annual openings (if available). Additionally, name the academic field of degree category that aligns with this occupation. Provide a comparative summary that clearly distinguishes between the specific occupation and the broader occupational group.
Answer length: 200-300 words.
Show hidden checklists
- Correct identification of Software Developers, Quality Assurance Analysts, and Testers based on description (design applications/programs, 15% growth, 129,200 annual openings, bachelor’s degree)
- Correct identification of the broader group as Computer and Information Technology Occupations (workers who create or support computer applications, systems, and networks)
- Correct identification of the aligned field of degree category as Computer and information technology
- 2024 median annual wage for Software Developers, Quality Assurance Analysts, and Testers ($131,450)
- Projected average annual openings for Software Developers, Quality Assurance Analysts, and Testers (about 129,200)
- Typical entry-level education for Software Developers, Quality Assurance Analysts, and Testers (Bachelor’s degree)
- 2024 median annual wage for Computer and Information Technology Occupations group ($105,990)
- Projected average annual openings for Computer and Information Technology Occupations group (about 317,700)
- Relevant field of degree category (Computer and information technology)
The question uses employment growth rate, job duties, and projected openings as logical filters to indirectly identify the specific occupation (Deep reasoning). It then requires aggregation of wage, openings, education, and degree-field data across both the specific occupation and its broader occupational group, plus the related academic field category (Wide aggregation). The entities are masked by description rather than named directly.
Judgment
Both agents correctly passed the DEEP logic check: they identified the specific occupation as Software Developers (within the detailed BLS category Software Developers, Quality Assurance Analysts, and Testers) and the broader group as Computer and Information Technology Occupations. They also correctly identified the aligned academic field. However, both failed the WIDTH/Completeness checklist by providing incorrect 2024 median wages and incorrect projected average annual openings for both the specific occupation and the broader group. These are core quantitative elements of the prompt, and both responses significantly deviate from the ground truth values. In terms of formatting and structure, both answers are well-organized, clearly separated, and user-friendly, with strong scannability and comparative summaries. Since both are logically correct but materially inaccurate on key statistics, this results in a Low Quality Tie.
GPT-5.1
OpenAI