The Attribution Crisis in LLM Search Results Estimating Ecosystem Exploitation

Wednesday, September 10, 2025 2:00 pm - 3:00 pm EDT (GMT -04:00)
Seminar, Ilan Strauss, Sept. 10, 2025

Date: Wednesday, September 10, 2025

Time: 2:00 pm - 3:00 pm EDT

Location: Virtual via Zoom



Title: The Attribution Crisis in LLM Search Results Estimating Ecosystem Exploitation

Abstract: The internet is under attack from AI. Web-enabled LLMs frequently answer queries without crediting the web pages they consume, creating an “attribution gap” – the difference between relevant URLs read and those actually cited. Drawing on approximately 14,000 real-world LMArena conversation logs with search-enabled LLM systems, we document three exploitation patterns: 1) No Search: 34% of Google Gemini and 24% of OpenAI GPT-4o responses are generated without explicitly fetching any online content; 2) No citation: Gemini provides no clickable citation source in 92% of answers; 3) High-volume, low-credit: Perplexity’s Sonar visits approximately 10 relevant pages per query but cites only three to four. A negative binomial hurdle model shows that the average query answered by Gemini or Sonar leaves about 3 relevant websites uncited, whereas GPT-4o’s tiny uncited gap is best explained by its selective log disclosures rather than by better attribution. Citation efficiency – extra citations provided per additional relevant web page visited – varies widely across models, from 0.19 to 0.45 on identical queries, underscoring that retrieval design, not technical limits, shapes ecosystem impact. We recommend a transparent LLM search architecture based on standardized telemetry and full disclosure of search traces and citation logs.

Speaker Bio: Dr. Ilan Strauss is Co-Director of the AI Disclosures Project (SSRC) in Brooklyn, New York. He is an Honorary Senior Fellow at the UCL Institute for Innovation and Public Purpose (London) and Visiting Associate Professor at the University of Johannesburg. He holds a Ph.D in economics from the New School for Social research.