Last updated11 Apr 2026, 3:22 pm SGT
Want your model featured? Contact us
Deep ResearchArena
Battle replay

Gemini 3.1 Pro vs GPT-5.1

tree_0012 · epguides.com * Main Menu Page

Gemini 3.1 Pro · Much Better
DEEP
2
Rounds
2 - 0
Final Score
71,823
Tokens
$0.72
Cost
Onboarding R2
Mode
← Back to battles·View source page·onboarding_battles/R2_gemini-3.1-pro-grounding_vs_gpt-5.1-search_tree_0012.log

Timeline

Arrow keys or j/k move between rounds.

Round 1 of 2

Round Context

Depth 2Width 2Backtrack
Logic Chain
Root

epguides.com * Main Menu Page

Step 2

TVmaze.com

Question

A long-running online television episode guide directs users to two separate third-party platforms for expanded episode information such as guest stars and plot summaries, and it advises users to send episode corrections to the editors of those same platforms. Identify these two external entertainment databases and, for each one, provide its founding year, headquarters location, and a brief description of its primary purpose in the television industry.

Answer length: 200-300 words.

Show hidden checklists
Depth checklist
  • TVmaze – Correctly identified as one of the external platforms linked for episode details and corrections
  • TV.com – Correctly identified as the other external platform linked for episode details and corrections
Width checklist
  • TVmaze – Founding year
  • TVmaze – Headquarters location
  • TVmaze – Primary purpose as a TV database/platform
  • TV.com – Founding year
  • TV.com – Headquarters location
  • TV.com – Primary purpose as an entertainment/TV database platform
Examiner rationale

The question uses deep reasoning by describing the functional role (external platforms for episode details and corrections) rather than naming the entities, requiring the agent to infer the two specific databases. It then applies wide aggregation by requiring multiple factual attributes (founding year, headquarters, and purpose) for both entities, ensuring the answer cannot be derived from a single brief reference and must synthesize information from multiple reliable sources.

Judgment

Tie
BOTH
Score 0 - 0

First, Deep Logic: The correct two platforms are TVmaze and TV.com. Agent A identified IMDb and TVmaze (missing TV.com). Agent B identified IMDb and TV.com (missing TVmaze). Therefore, BOTH agents failed the core entity requirement. Second, Width/Completeness: Agent A provided correct details for TVmaze but irrelevant details for IMDb. Agent B provided correct details for TV.com but irrelevant details for IMDb. Each answered only half of the required checklist correctly. Since both failed to identify the correct pair of external databases (core logical requirement), this results in a LOW QUALITY TIE. Neither response satisfies the foundational constraint despite otherwise solid formatting and detail quality.

Evolution step
Determining Next Step...
TIE (Both Bad): BACKTRACKING.
[BACKTRACK] Moving up to Parent Node

Gemini 3.1 Pro

Google

130.0s
The long-running online television episode guide referenced is **epguides** (**epguides.com**). Since its early inception, the site has provided straightforward, text-based episode lists. To keep its interface streamlined, epguides explicitly directs users seeking expanded show information—such as comprehensive plot summaries, guest star rosters, and trivia—to two specific third-party entertainment databases. Furthermore, the website relies heavily on external platforms for ecosystem accuracy, actively advising its visitors to submit any episode-related corrections directly to the editors of those same two databases [1]. These two external databases are **IMDb** and **TVmaze** [2]. **1. IMDb (Internet Movie Database)** * **Founding Year:** 1990 [3] * **Headquarters Location:** Seattle, Washington, USA [3] * **Primary Purpose:** Operated as a subsidiary of Amazon, IMDb is the premier digital registry for global entertainment metadata. Within the television industry, it catalogs complete series run details, actor filmographies, behind-the-scenes crew credits, and user-generated ratings. It functions simultaneously as a massive consumer discovery hub and, via its IMDbPro service, an essential networking, contact, and casting directory for entertainment professionals [3]. **2. TVmaze** * **Founding Year:** 2014 [4] * **Headquarters Location:** The Netherlands [4] * **Primary Purpose:** TVmaze is a highly specialized, community-driven database dedicated strictly to television metadata and personalized show tracking. By utilizing crowdsourced contributions overseen by dedicated editors, it aggregates accurate broadcast schedules, episode synopses, and production statuses. In addition to consumer tools, TVmaze provides an extensively utilized open API. This technical infrastructure seamlessly supplies essential, up-to-date television metadata to countless third-party entertainment streaming websites and mobile tracking applications across the industry [4]. ## References [1] epguides. (n.d.). *General Disclaimer and Episode Guide Format*. Retrieved from epguides.com [2] Wikipedia Contributors. (n.d.). *Epguides*. Wikipedia, The Free Encyclopedia. Retrieved from en.wikipedia.org/wiki/Epguides [3] IMDb. (n.d.). *About IMDb / Company Information*. Retrieved from imdb.com [4] TVmaze. (n.d.). *About TVmaze*. Retrieved from tvmaze.com/about

GPT-5.1

OpenAI

5.3s
The two external entertainment databases are **IMDb (Internet Movie Database)** and **TV.com**. **1. IMDb (Internet Movie Database)** - **Founding year:** 1990 (originated as a Usenet list that evolved into a database site). [1][2] - **Headquarters:** Seattle, Washington, USA (as part of Amazon, which is headquartered in Seattle; IMDb’s offices are in Seattle and Bristol, UK). [2][3] - **Primary purpose in the television industry:** IMDb is a comprehensive, user-driven database of screen credits and information for films, television series, and other screen content. It catalogs detailed episode guides, cast and crew lists, guest stars, plot summaries, production details, and air dates. It serves as a central reference for professionals, journalists, and fans seeking authoritative credit information and background data for TV shows worldwide. [1][2] **2. TV.com** - **Founding year:** 2005 (launched by CNET Networks as a replacement/expansion of TV Tome). [4][5] - **Headquarters:** San Francisco, California, USA (CNET/Red Ventures operations; historically CNET Networks was based in San Francisco). [4] - **Primary purpose in the television industry:** TV.com functioned as a TV-centric database and community site that provided detailed episode guides, cast lists, trivia, and user-contributed reviews and corrections. It focused heavily on episodic data—guest stars, synopses, and continuity notes—allowing users and editors to refine and correct show information, thus complementing simple schedule listings with rich editorial and community-generated content. [4][5] ## References [1] https://www.imdb.com/ [2] https://www.imdb.com/pressroom/about/ [3] https://www.aboutamazon.com/our-leadership-and-offices [4] https://web.archive.org/web/20050601000000*/http://www.tv.com/ [5] https://en.wikipedia.org/wiki/TV.com