New York City's official image repositories — sprawling collections managed by agencies ranging from the Department of City Planning to NYC Media, the municipal television arm headquartered at 253 Broadway — contain tens of thousands of duplicate photographs, a problem that has quietly compounded since the mid-2000s digitisation push and is now disrupting government communications, historical archiving, and the city's own World Cup promotional materials ahead of the FIFA 2026 matches hosted at MetLife Stadium this summer.
The duplication issue matters now for a specific reason: the Adams administration's Office of Technology and Innovation launched a citywide asset-management consolidation effort in 2024 designed to merge separate photo libraries maintained by more than 30 city agencies into a single platform. That merger surfaced, rather than solved, the problem. When archivists from the Department of Records and Information Services on Chambers Street began cross-referencing files last year, they found multiple agencies holding identical or near-identical images saved under different filenames, metadata tags, and licensing classifications — some marked public domain, others erroneously flagged as rights-restricted.
A Long Road to This Mess
The roots run back to 2003, when the Bloomberg administration's first major push to digitise city records sent individual agencies scrambling to scan their own print archives independently, with no shared taxonomy. The Manhattan Borough President's office, the Parks Department, and the Economic Development Corporation each built separate digital libraries using different software vendors. By the time the de Blasio administration attempted a partial unification around 2017 through the NYC Digital program, the agencies had accumulated roughly two decades of incompatible file structures.
The 2020 pandemic accelerated the chaos. Remote-work mandates pushed communications staff across agencies to pull images directly from Google searches or stock sites, sometimes uploading those files back into official city servers without clearing rights. The Brooklyn Public Library's local history division flagged the issue formally in a 2022 letter to the city archivist, noting that several images appearing in official Department of Housing Preservation and Development publications were watermarked versions pulled from third-party providers — meaning the city was publicly distributing images it did not own.
Parks Department facilities in Prospect Park and Flushing Meadows-Corona Park — both heavily photographed for promotional materials — appear in city databases under dozens of variant filenames. A single aerial image of the Unisphere, taken during the 2021 Queens Botanical Garden reopening, was found in at least six separate agency libraries, each with different caption text and at least two different listed photographers.
The Cost of Fixing It
Deduplication at municipal scale is not cheap. Commercial digital asset management platforms capable of handling the estimated 2.1 million files across city agencies — a figure the Office of Technology and Innovation cited internally during budget discussions reported by City Council staff this spring — typically run between $400,000 and $900,000 annually for enterprise licensing. The current fiscal year 2026 budget allocated $1.2 million to the broader asset-management consolidation project, though that funding covers software, staffing, and legal review of rights clearances, not deduplication alone.
For New Yorkers, the practical fallout shows up in unexpected places. Freedom of Information Law requests for city photographs have been delayed when archivists cannot determine which version of a duplicated image is the authoritative file. Journalists requesting photos of the High Line or Hudson Yards for stories tied to the FIFA World Cup visitor influx this month have received multiple conflicting images labeled identically. The Department of Cultural Affairs has separately flagged the issue as a concern for the city's permanent historical record.
The Office of Technology and Innovation plans to complete the first phase of deduplication — covering the 12 agencies with the largest image libraries — by the end of calendar year 2026. Agencies holding smaller collections will follow through 2027. Anyone submitting a FOIL image request to a city agency in the interim can ask that the responding office confirm which database version is being provided and request accompanying metadata, including the original capture date and photographer credit, to help verify which file is authoritative. That is a small protection, but right now it is the one available.