Gemini 3.1 Pro vs Sonar Pro
tree_0004 · Sweepstake
Timeline
Arrow keys or j/k move between rounds.
Round Context
Sweepstake
Advertising mail
USPS defends junk mail
Researchers investigating the history of online sweepstakes promotions often rely on archived versions of promotional websites preserved by a well-known nonprofit digital library that operates the Wayback Machine. This archive has, since the mid-1990s, received daily web crawl data donations from a commercial web analytics company founded in 1996. Identify this company and explain its data donation relationship with the nonprofit archive, including when the donations began, how the data are incorporated into the Wayback Machine, and the name and current public accessibility status of one specific crawl collection associated with these donations.
Answer length: 220-320 words.
Show hidden checklists
- Alexa Internet + Identified as the web analytics company founded in 1996 that donates crawl data
- Internet Archive + Identified as the nonprofit digital library that operates the Wayback Machine and receives the crawl data
- States that Alexa Internet began donating crawl data starting in 1996
- Explains that the donated crawl data flow in daily and are added to the Wayback Machine after an embargo period
- Identifies the nonprofit recipient as the Internet Archive, operator of the Wayback Machine
- Names the specific collection '52_crawl' and notes that it is currently not publicly accessible
The question uses contextual logic (researching archived sweepstakes websites) to indirectly point to the Internet Archive and its data sources without naming them (Deep). It then requires the respondent to identify the donating company and aggregate multiple specific facts: the start year of donations, the daily flow and embargo process, and the specific '52_crawl' collection and its accessibility status (Wide).
Judgment
First, Deep Logic: Both agents correctly identify Alexa Internet (founded 1996) as the donating company and the Internet Archive as the nonprofit operating the Wayback Machine. So both pass the core entity check. Next, Width/Completeness: Both state donations began in 1996. Agent A explains the daily crawl flow, ARC/WARC formats, and specifically mentions an embargo period before incorporation into the Wayback Machine—this satisfies the checklist requirement about post-embargo integration. Agent B does not mention any embargo period, so it misses a required sub-point. Critically, both agents fail to name the required specific collection (“52_crawl”) and incorrectly describe accessibility (the checklist specifies it is not publicly accessible). Thus, both have factual gaps, but Agent B omits more required detail overall. Finally, Presentation & User Experience: Agent A is more comprehensive, better structured, and provides richer technical context about ingestion and indexing. Agent B is shorter and readable but less detailed and cites weak/irrelevant sources. Because both contain factual errors, the winner cannot receive MUCH_BETTER. However, Agent A provides stronger completeness and explanation, making it the better overall user experience. Therefore: A_BETTER, with Agent B failing on WIDE completeness.
Gemini 3.1 Pro
Sonar Pro
Perplexity