The Daily New York

New York news, every day

News

New York Is Drowning in Duplicate Images Online. Here's How the City Stacks Up Against London and Tokyo.

As AI-generated photo floods overwhelm city agency websites and local news archives, New York's response is being watched — and found wanting — by digital records managers worldwide.

By New York News Desk · Published 4 July 2026, 2:51 pm

3 min read

New York Is Drowning in Duplicate Images Online. Here's How the City Stacks Up Against London and Tokyo.
Photo: Photo by Julien R on Pexels

New York City's public-facing digital infrastructure contains tens of thousands of duplicate and near-duplicate photographs — clogging agency servers, slowing search results on official portals, and quietly degrading the integrity of the city's visual public record. That's the picture emerging from ongoing audits conducted by archivists and IT managers inside agencies including the Department of City Planning and the New York Public Library's digital collections division on Fifth Avenue, both of which have flagged the problem internally in recent months.

The timing matters. With the 2026 FIFA World Cup drawing hundreds of thousands of visitors to venues across the five boroughs this summer, city agencies have been racing to refresh and reorganize their digital-facing content. Tourist-facing portals managed through NYC.gov have seen traffic spikes not recorded since the 2021 reopening push after the COVID shutdown. Duplicate imagery — stock photos reused across dozens of subpages, neighborhood photos misidentified and uploaded multiple times under different filenames — makes those portals slower and harder to navigate at exactly the moment the city needs them to perform.

What Other Cities Are Doing

London's Government Digital Service, which manages digital standards for Transport for London and the Greater London Authority, adopted an automated deduplication protocol in late 2024. The system, built around perceptual hashing software, flags images that are more than 85 percent visually similar before they enter the content management system. The GLA reported in its 2025 annual digital services review that the protocol reduced redundant image uploads by roughly 40 percent within its first year of deployment. Tokyo's metropolitan government launched a comparable pilot through its Bureau of Digital Services in March 2025, applying deduplication tools specifically to the city's tourism and urban planning image libraries ahead of anticipated international visitors.

New York has no equivalent citywide protocol in place. The Department of Information Technology and Telecommunications — DoITT, now officially rebranded as the Office of Technology and Innovation — has not published a deduplication standard for city agencies. Individual agencies operate their own content management systems with inconsistent rules. The result is what archivists describe as a fragmented patchwork: the Parks Department uses one platform, the Department of Buildings another, and NYC311's visual assets sit in a third environment entirely.

The New York Public Library's digital collections, housed at its Stephen A. Schwarzman Building at 42nd Street and Fifth Avenue, have addressed the problem independently. The library's digital imaging team has used open-source tools, including the Python-based library called imagededup, to process parts of its Milstein Collection of New York City photographs. Librarians there estimate the effort identified duplicate or near-duplicate image pairs running into the thousands across just one historical collection.

The Cost of Doing Nothing

Storage is not free. Cloud hosting costs for city agencies are billed through consolidated contracts managed by the Office of Technology and Innovation. Industry-standard cloud storage pricing in mid-2026 runs between $20 and $23 per terabyte per month for enterprise-grade services. Image files, particularly high-resolution TIFFs and RAW formats used in official documentation, accumulate fast. Duplicate images don't just waste that money — they create legal and records management risk, since multiple versions of the same photo can carry different metadata, different rights clearances, and different contextual captions.

The problem is compounding as AI image generation adds synthetic photographs to the mix. Several city-contracted communications vendors have begun submitting AI-generated imagery alongside traditional photography for agency campaigns, raising questions about which version of a given image is authoritative, and whether near-identical AI variants should be classified as duplicates at all. No city policy currently addresses that question.

Residents and journalists who rely on city image archives — including the Municipal Archives on Chambers Street in Lower Manhattan, which maintains millions of historical photographs — may not feel the immediate effects. But records managers warn the downstream consequences, from broken search results to misfiled historical images, compound quietly over years. London's early investment in deduplication tooling offers a straightforward model. Whether New York's Office of Technology and Innovation moves to adopt something similar before the city's digital infrastructure buckles further under World Cup traffic is a question its leadership has not yet publicly answered.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.