Claude Opus 4.1 vs GPT-5.1
tree_0025 · Cosmetology
Timeline
Arrow keys or j/k move between rounds.
Round Context
Cosmetology
Specific Career Fields
Identify the educational institution in Washington State located at 1001 N. Broadway that houses the 'Cascade Learning Resource Center'. Locate the specific library subject guide for 'Cosmetology' maintained by Marianne Le at this institution. Based on this guide, list the titles and authors of the seven print books recommended in the 'Career Books' section, and enumerate the eleven other academic subjects for which Marianne Le is listed as the subject guide contact.
Answer length: 200-300 words.
Show hidden checklists
- Target Entity: Everett Community College (identified via address 1001 N. Broadway and Cascade Learning Resource Center)
- Target Entity: Marianne Le (identified as the specific librarian/subject guide contact)
- Book 1: Careers in Focus: Cosmetology by Ferguson Publishing Staff
- Book 2: Cosmetology Certification Exam by LearningExpress Staff
- Book 3: Guide to Good Hairdressing by Julie Handley
- Book 4: Opportunities in Beauty and Modeling Careers by Susan Wood Gearhart
- Book 5: Preparing for the Practical Exam by Deborah Beatty
- Book 6: Standard Cosmetology 2008 by Catherine M. Frangie / Milady Publishing Company Staff
- Book 7: Successful Salon Management by Edward J. Tezak
- Subject 1: Business
- Subject 2: Careers
- Subject 3: Education
- Subject 4: Human Development
- Subject 5: Nursing & Health Sciences
- Subject 6: Nutrition
- Subject 7: Philosophy
- Subject 8: Political Science
- Subject 9: Psychology
- Subject 10: Religion
- Subject 11: Sociology
The question uses 'Deep' reasoning by requiring the agent to identify the specific institution (Everett Community College) using only the address and the name of its resource center provided in the source text. It then applies 'Wide' aggregation by requiring the extraction of two distinct datasets: a list of 7 specific books (titles and authors) and a list of 11 specific academic subjects managed by the identified librarian, which are located in different sections of the source material.
Judgment
Agent B correctly identified the core entity (Everett Community College) based on the address and the name of the library (Cascade Learning Resource Center). Agent A failed this fundamental logic check, incorrectly claiming the institution was Spokane Falls Community College (which is located elsewhere). However, Agent B is capped at 'Better' rather than 'Much Better' because it failed the 'Wide' accuracy check; the specific books and subject lists provided by Agent B did not match the Ground Truth checklist, suggesting it hallucinated the specific content of the guide despite finding the correct library.
Claude Opus 4.1
Anthropic
GPT-5.1
OpenAI