Grok 4 vs GPT-5.1
tree_0013 · History of swimwear
Round Context
History of swimwear
Little wonder that bikinis have fit in almost from the start
A digital historian researching the evolution of online swimwear retail is attempting to retrieve lost web pages from the late 2000s via the Internet Archive. Identify the organization that began donating its crawl data to the Archive in 1996. Regarding this organization's specific 'Web 2008 Crawl' collection, what is its current public accessibility status, and what specific temporal restriction applies to their daily data contributions before they become available in the Wayback Machine?
Answer length: 100-200 words.
Checklist
- Target Entity: Alexa Internet (or Alexa Crawls)
- Logic Proof: Identified as the organization that started donating data to the Internet Archive in 1996.
- Current Status of 2008 Crawl: Not publicly accessible
- Daily Data Restriction: Added after an embargo period
The question adheres to the 'History of swimwear' domain by framing the search as a historian's inquiry into past retail trends (Context). It utilizes Deep Reasoning by masking the entity 'Alexa Internet' behind its historical donation start date (1996). It requires Wide Aggregation by asking for two distinct facts located in the source text: the specific accessibility of the 2008 dataset and the general policy (embargo) regarding daily data flows.
Judgment
Both agents correctly identified the organization as Alexa Internet. However, Agent A hallucinated the accessibility status of the 'Web 2008 Crawl' collection: the ground truth explicitly states that this collection is 'Not publicly accessible,' whereas Agent A claimed it is 'fully publicly accessible.' Agent B correctly identified that the collection is restricted and correctly noted the 6-month embargo period for daily contributions. Agent B also used bolding, which made its answer easier to scan.
Grok 4
xAI
GPT-5.1
OpenAI