The Daily New York

New York news, every day

News

New York's Duplicate Image Problem: The Numbers Piling Up Inside City Hall's Digital Archives

A growing backlog of redundant digital files is costing city agencies time and storage budget — and the data tells a clearer story than anyone in the Adams administration wants to admit.

By New York News Desk · Published 4 July 2026, 2:48 pm

3 min read

New York's Duplicate Image Problem: The Numbers Piling Up Inside City Hall's Digital Archives
Photo: Photo by Karl Solano on Pexels

New York City's municipal agencies collectively manage an estimated tens of millions of digital image files across dozens of departmental servers — and a significant share of those files are exact or near-exact duplicates, according to digital records management specialists who work with government clients. The problem is not unique to New York, but the scale here, across a city bureaucracy that employs roughly 300,000 people, makes the waste measurable in real dollars.

The timing matters. The city is deep into a push to modernize its information technology infrastructure ahead of the 2026 FIFA World Cup, which brings games to MetLife Stadium in East Rutherford and sends hundreds of thousands of visitors through the five boroughs beginning this month. City agencies including the Mayor's Office of Media and Entertainment and NYC Tourism + Conventions have ramped up digital content production since at least 2024, flooding shared drives and cloud storage buckets with promotional photography, event documentation and press imagery — much of it filed multiple times by different staff members.

What the Numbers Actually Show

Duplicate image files are a data management problem before they become a budget problem, but they reliably become both. Storage costs for cloud infrastructure have dropped sharply over the past decade — Amazon Web Services S3 standard storage, widely used by government contractors, ran approximately $0.023 per gigabyte per month as of early 2026. That sounds trivial until a single agency is maintaining 50 terabytes of redundant image data. At that scale, the monthly bill for files that serve no operational purpose runs into the thousands of dollars, annually into the tens of thousands, before accounting for the staff time spent tagging, searching and retrieving misfiled assets.

The Department of City Planning, which operates out of 120 Broadway in Lower Manhattan, maintains public-facing GIS mapping layers and planning document libraries that include large volumes of site photography. Records management consultants who advise municipal clients say agencies like DCP typically see duplicate rates between 15 and 30 percent in unmanaged image repositories — meaning roughly one in five files may be redundant. The NYC Department of Transportation, headquartered at 55 Water Street, faces a similar challenge with its traffic camera stills and street-condition documentation, archived for legal and operational purposes across multiple internal systems that do not always communicate with each other.

The city's overall IT budget for fiscal year 2026, as approved by the City Council, topped $1.2 billion. Digital asset management is a line item that rarely generates headlines, but it quietly absorbs resources that program managers say they would rather direct elsewhere. Deduplication software — tools that scan repositories, flag identical or near-identical files using hash-matching algorithms, and either delete or consolidate them — is commercially available from vendors including Veritas and Rubrik, with enterprise licensing deals typically running between $50,000 and $250,000 annually depending on storage volume.

What Comes Next for City Agencies

The Mayor's Office of Technology and Innovation, which coordinates digital policy across city government, has signaled in public budget documents that it plans to consolidate several legacy storage systems under a unified cloud framework by the end of fiscal year 2027. Whether that consolidation includes a formal deduplication pass before migration — a step that records managers consider essential, since migrating duplicates simply moves the problem to a more expensive environment — has not been specified in publicly available planning documents.

For residents and city employees who interact with these systems, the practical advice from IT professionals is straightforward: flag redundant files through agency records management portals when encountered, and avoid the habit of saving image assets to multiple folder locations as a backup strategy. The Brooklyn Public Library's Digital Archives program at its Grand Army Plaza branch has run public workshops on exactly this kind of digital hygiene, framing it as civic participation in functional government. The city's 311 system accepts feedback on agency digital services, though file-level complaints rarely climb high enough on the priority queue to generate a direct response. The underlying data problem, however, keeps growing until someone decides it is worth the one-time cost of fixing it.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.