Sonar Pro vs GLM-4.7
tree_0012 · epguides.com * Main Menu Page
Timeline
Arrow keys or j/k move between rounds.
Round Context
epguides.com * Main Menu Page
TVmaze.com
Identify the television database website that was established in 1995 and uses the tagline 'Cataloging the opiate of the masses on the small screen.' Once identified, consult the site's main menu and informational text to determine the following: 1) To which specific external website do the episode titles typically link for detailed information like guest stars and plot summaries? 2) What two specific platforms are explicitly named as the destinations for sending corrections and updates to individual episodes? 3) In what specific file format is the 'all shows' list available for download?
Answer length: 150-250 words.
Show hidden checklists
- Target Entity: epguides.com (identified via 1995 origin and 'opiate of the masses' tagline)
- External Detail Site: TVmaze
- Correction Platform 1: TVmaze
- Correction Platform 2: TV.com
- Download File Format: .csv
The question uses 'Deep' logic by masking the website name behind its tagline and establishment date. It requires 'Wide' aggregation by asking for three distinct pieces of information (external link destination, correction protocols, and file formats) found in different sections of the site's main text.
Judgment
Both agents correctly identified the core entity (epguides.com). However, both agents failed significant portions of the specific information retrieval requested in the prompt, relying on outdated or assumed information (IMDb) rather than the specific Ground Truth targets (TVmaze/TV.com) for link destinations and correction platforms. Agent A wins because it correctly identified the download file format (.csv), whereas Agent B provided incorrect information (plain text). Additionally, Agent A utilized superior formatting (numbered lists, bold text) compared to Agent B's block paragraph.
Sonar Pro
Perplexity
GLM-4.7
Zhipu AI