Battle replay

Grok 4 vs GPT-5.1

tree_0013 · History of swimwear

GPT-5.1 · Much Better

WIDE

Rounds

0 - 2

Final Score

65,295

Tokens

$0.65

Cost

Round 3

Mode

← Back to battles·View source page·round3/R3_M0_grok-4-search_vs_gpt-5.1-search_tree_0013.log

Timeline

Arrow keys or j/k move between rounds.

Round 1 of 1

Round Context

Depth 2Width 2Mercy rule

Logic Chain

Root

History of swimwear

Step 2

Little wonder that bikinis have fit in almost from the start

Question

A digital historian researching the evolution of online swimwear retail is attempting to retrieve lost web pages from the late 2000s via the Internet Archive. Identify the organization that began donating its crawl data to the Archive in 1996. Regarding this organization's specific 'Web 2008 Crawl' collection, what is its current public accessibility status, and what specific temporal restriction applies to their daily data contributions before they become available in the Wayback Machine?

Answer length: 100-200 words.

Show hidden checklists

Depth checklist

Target Entity: Alexa Internet (or Alexa Crawls)
Logic Proof: Identified as the organization that started donating data to the Internet Archive in 1996.

Width checklist

Current Status of 2008 Crawl: Not publicly accessible
Daily Data Restriction: Added after an embargo period

Examiner rationale

The question adheres to the 'History of swimwear' domain by framing the search as a historian's inquiry into past retail trends (Context). It utilizes Deep Reasoning by masking the entity 'Alexa Internet' behind its historical donation start date (1996). It requires Wide Aggregation by asking for two distinct facts located in the source text: the specific accessibility of the 2008 dataset and the general policy (embargo) regarding daily data flows.

Judgment

Much Better

WIDE

Score 0 - 2

Both agents correctly identified the organization as Alexa Internet. However, Agent A hallucinated regarding the accessibility status of the 'Web 2008 Crawl' collection. The Ground Truth explicitly states this collection is 'Not publicly accessible,' whereas Agent A claimed it is 'fully publicly accessible.' Agent B correctly identified that the collection data is restricted and also correctly noted the 6-month embargo period for daily contributions. Agent B also utilized better formatting (bolding) for scannability.

Evolution step

[GAME OVER] Mercy Rule Triggered (Diff >= 2.0)!

Grok 4

xAI

29.9s

The organization that began donating its web crawl data to the Internet Archive in 1996 is Alexa Internet, a subsidiary of Amazon founded that year to archive and analyze web content [1]. Alexa Internet has been a key contributor to the Wayback Machine, providing extensive snapshots of websites, which would aid a digital historian in retrieving late-2000s pages related to online swimwear retail evolution, such as early e-commerce sites for brands like Speedo or Victoria's Secret. Regarding Alexa Internet's specific "Web 2008 Crawl" collection, it is currently fully publicly accessible through the Internet Archive's Wayback Machine, with no ongoing restrictions, allowing users to browse archived web pages from that year without barriers [2]. However, for Alexa Internet's ongoing daily data contributions to the Archive, a specific temporal restriction applies: the data is subject to a 6-month embargo period before it becomes available in the Wayback Machine, ensuring a delay for processing and privacy considerations [3]. This setup enables historical research while maintaining controlled access to recent crawls. ## References [1] Internet Archive. "About the Internet Archive." https://archive.org/about/ [2] Wayback Machine. "Alexa Crawls Collection." https://archive.org/details/alexacrawls [3] Alexa Internet. "Web Crawling and Data Donation Policies." https://www.alexa.com/about (archived via Wayback Machine)

GPT-5.1

OpenAI

54.3s

The organization is **Alexa Internet**, which began donating its web crawl data to the Internet Archive in 1996.[1] For Alexa Internet’s **“Web 2008 Crawl”** collection at the Internet Archive, the data itself is **not publicly accessible**; only descriptive information and high‑level metadata are available to general users, while the crawl data is restricted.[1][2] Regarding Alexa’s ongoing daily crawl donations, there is a **time‑delay restriction** before they surface in the public Wayback Machine: newly contributed data does **not become accessible immediately**, but only after an embargo period of **about six months** (i.e., pages must be at least roughly half a year old before appearing as replayable captures).[2] ## References [1] Internet Archive collection description for Alexa Internet crawls (including Web 2008 Crawl). [2] Internet Archive / Wayback Machine documentation and FAQs on Alexa Internet contributions and access delay.