How New York's Public Records Got Flooded With Duplicate Images—and Why It's Proving So Hard to Fix
A quiet crisis in city data management has been building for years, and the 2026 World Cup deadline is forcing agencies to finally confront it.
A quiet crisis in city data management has been building for years, and the 2026 World Cup deadline is forcing agencies to finally confront it.

New York City government websites and public-facing digital databases are carrying tens of thousands of duplicate images—redundant photographs, scanned documents, and planning graphics uploaded multiple times across agency portals—a problem that predates the Adams administration but has accelerated sharply since 2020, when pandemic-era remote work pushed city employees toward ad hoc digital filing practices.
The issue matters right now for a specific reason: with FIFA World Cup matches scheduled at MetLife Stadium in East Rutherford through July, and with the city's official tourism and event portals under unprecedented international traffic, broken image links and duplicate assets are generating visible errors that reflect poorly on city operations. The Department of City Planning and NYC Tourism + Conventions—the city's official destination marketing organisation—have both been working through backlogs of duplicate and orphaned image files that clog their content management systems.
The roots of duplicate image proliferation in city systems go back further than any single administration. When the Bloomberg-era 311 digital platform launched in the early 2000s, each agency maintained its own separate file repository. That siloed structure was never fully consolidated. By the time the de Blasio administration pushed agencies onto the NYC.gov unified platform between 2015 and 2017, thousands of legacy image files migrated across without deduplication protocols in place.
The Housing Preservation and Development department, which manages public-facing listings for affordable housing lotteries through the NYC Housing Connect portal on West 125th Street in Harlem, is among the agencies where duplicate image bloat is most operationally significant. Applicants navigating the portal for affordable units—a process that has grown more urgent given the city's rental vacancy rate, which the 2023 New York City Housing and Vacancy Survey put at just 1.4 percent—sometimes encounter mismatched or repeated building photographs that make it harder to evaluate listings accurately.
The Metropolitan Transportation Authority's digital communications team, headquartered at 2 Broadway in Lower Manhattan, faced a parallel problem when it began updating its MTA.info station pages as part of the ongoing Capital Program renovation work. Station image libraries built up over multiple website redesigns contained the same photographs filed under different filenames, creating storage drag and occasional front-end display errors.
Fixing duplicate images is not a simple delete-and-move operation in government systems. City agencies operate under records retention rules governed by the New York State Arts and Cultural Affairs Law, which requires that certain digital assets—including photographs associated with public projects or legal proceedings—be preserved for defined periods. Deleting what looks like a duplicate can, in some cases, destroy a legally distinct record.
The city's Department of Records and Information Services, based at 31 Chambers Street in lower Manhattan, issued updated digital records guidance in late 2024 that for the first time addressed image deduplication specifically, directing agencies to use hash-based comparison tools to identify true duplicates versus files that merely share similar content. Implementation across agencies has been uneven.
Budget is a factor. The city's Fiscal Year 2026 executive budget, released by Mayor Adams in April, allocated roughly $214 million to technology and cyber-infrastructure across city agencies, but line items specifically targeting content management modernisation have been difficult to isolate. Advocacy groups focused on government transparency, including Reinvent Albany, have pushed for a centralised city digital asset management system for years without seeing a standalone procurement move forward.
For residents and organisations that rely on city data portals daily—whether tracking affordable housing applications through NYC Housing Connect, monitoring transit updates through the MTA app, or accessing planning documents through the Department of City Planning's ZoLa mapping tool—the practical fix may be incremental. Agencies are being encouraged to run deduplication audits before the next major platform refresh cycle, expected to begin in the third quarter of 2026. In the meantime, users encountering broken or repeated images on city portals can report them through the NYC.gov feedback tool, which routes complaints directly to agency webmasters for triage.
How does this story make you feel?
Spread the word
About this article
Published by The Daily New York
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News
