The Daily New York

New York news, every day

News

New York's Duplicate Image Problem: The Numbers That Show How Much It's Costing the City

From city agency websites to MTA signage databases, redundant digital image files are quietly draining storage budgets and slowing public-facing systems across the five boroughs.

By New York News Desk · Published 4 July 2026, 3:16 pm

3 min read

New York's Duplicate Image Problem: The Numbers That Show How Much It's Costing the City
Photo: Photo by Gabriel Almanzar on Pexels

New York City's sprawling network of public digital infrastructure is sitting on a problem that has a precise, measurable cost: duplicate image files, stored redundantly across dozens of agency servers, are consuming terabytes of paid cloud storage and slowing the web platforms that millions of residents use every day. A review of city contracting records and technology audit documents points to a pattern that IT managers across the five boroughs have struggled to address for years.

The timing matters. The city is deep into a technology modernization push ahead of its role as a 2026 FIFA World Cup host, with the NYC Mayor's Office of Technology and Innovation — operating out of 255 Greenwich Street in Lower Manhattan — overseeing a suite of platform upgrades meant to handle global visitor traffic. Redundant data is not an abstract inefficiency when a stadium-bound tourist is trying to load a transit map on a lagging NYC.gov page during a July heatwave that has already forced the cancellation of public events up and down the Eastern Seaboard.

The Scale of the Problem

Digital asset bloat is a well-documented phenomenon in large municipal IT environments. Research published by the Cloud Security Alliance has found that typical enterprise storage environments contain between 30 and 40 percent redundant or obsolete data, a figure that scales aggressively in government contexts where file retention policies are often governed by legal mandate rather than operational need. For a city the size of New York, where the Department of Information Technology and Telecommunications manages contracts worth hundreds of millions of dollars annually, that percentage represents serious money.

The MTA's public-facing digital infrastructure offers a concrete local example. The authority's New York City Transit division maintains image libraries for station maps, accessibility guides, and service change notices across 472 subway stations. Duplicate image assets — the same accessibility icon stored in three different resolutions, the same service alert graphic re-uploaded rather than relinked — compound across those stations into storage overhead that costs real dollars per gigabyte per month under the MTA's cloud service agreements. The authority has not published a specific dollar figure tied solely to duplicate image removal, but its broader IT modernization program, described in MTA capital documents, runs into the tens of millions of dollars.

The Parks Department faces a similar, if lower-stakes, version of the issue. Its digital archive of park photography, used across the NYC Parks website and in community board presentations at locations like the Arsenal Building in Central Park and the Pelham Bay Park visitor center in the Bronx, has grown without a centralized deduplication protocol. Staff at borough offices routinely upload new versions of images already in the system, according to the structure of city IT policy documents reviewed as part of this reporting.

What Deduplication Actually Fixes

Duplicate image replacement — the process of identifying redundant files through hash-matching algorithms and consolidating storage to a single canonical version — is not glamorous work. But the returns are measurable. Industry benchmarks from storage analytics firms suggest that aggressive deduplication in a large municipal environment can reduce active image storage needs by 25 to 35 percent. On a cloud storage contract priced at roughly $0.023 per gigabyte per month — a standard rate tier on Amazon Web Services, the platform used by several city agencies — even a 10-terabyte reduction translates to roughly $2,760 in annual savings. That number compounds fast across 80-plus city agencies.

The Mayor's Office of Technology and Innovation launched its Citywide Digital Services Review in early 2025, with a stated goal of reducing operational IT waste ahead of the World Cup window. Residents and watchdog groups tracking the initiative through the city's Open Data portal at data.cityofnewyork.us can monitor contract amendments and storage procurement changes as they are filed. For now, the bureaucratic machinery of deduplication is moving — slowly, methodically, and very much by the numbers.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.