Claude Opus 4.1 vs GPT-5.1
tree_0007 · Mastering the 7 Stages of Film Production
Round Context
Producing School
Identify the educational institution that published a guide on the '7 Stages of Film Production' which explicitly characterizes the 'Distribution' stage by noting that 'streaming is king' and the 'Development' stage as a period belonging to the 'project producer.' Based on this institution's current academic offerings, list the full titles of the three specific 'Online Producing Workshops' available. For each of these three workshops, specify the industry-standard software tools that students are taught to use.
Answer length: 150-250 words.
- Target Entity: New York Film Academy (NYFA)
- Logic Proof: Matches the guide describing 'Distribution' with 'streaming is king' and 'Development' as belonging to the 'project producer'.
- Workshop 1: 15-Week Online Producing Workshop
- Workshop 1 Software: Movie Magic Scheduling and Budgeting
- Workshop 2: 4-Week Online Film and TV Producing Workshop
- Workshop 2 Software: Movie Magic Scheduling and Budgeting
- Workshop 3: 2-Day Online Line Producing Workshop
- Workshop 3 Software: Movie Magic Scheduling and Movie Magic Budgeting
The question requires Deep Reasoning to identify the correct institution (New York Film Academy) by matching specific unique phrasing ('streaming is king', 'project producer') found in their '7 Stages of Film Production' guide. It then requires Wide Aggregation to locate the specific section on 'Online Producing Workshops' within their program list and extract the names and software curriculum details for three distinct courses.
Judgment
Agent A correctly identified the entity (New York Film Academy) based on the specific quotes provided in the query ('streaming is king', 'project producer'). Agent B incorrectly identified Raindance Film School, failing the Deep Logic check immediately. However, Agent A is not awarded 'Much Better' because it failed the Wide Aggregation check against the provided Ground Truth. It listed 8-Week and 12-Week workshops instead of the correct 15-Week and 2-Day workshops specified in the checklist. Despite these hallucinations regarding the specific course list, Agent A is the winner for finding the correct institution, whereas Agent B directs the user to the wrong school entirely.
Claude Opus 4.1
Anthropic
GPT-5.1
OpenAI