The Daily New York

New York news, every day

News

By the Numbers: How Duplicate Images Are Quietly Draining New York City's Digital Infrastructure Budget

A deep dive into the data reveals that redundant image files cost the city's agencies millions annually — and the problem is getting worse.

By New York News Desk · Published 4 July 2026, 3:23 pm

3 min read

By the Numbers: How Duplicate Images Are Quietly Draining New York City's Digital Infrastructure Budget
Photo: Photo by Marije Kouyzer on Pexels

New York City's sprawling network of municipal websites, permit portals, and agency databases is sitting on an estimated 4.2 million duplicate image files, according to an internal audit completed by the Department of Citywide Administrative Services in May 2026. The redundancy isn't a minor housekeeping issue. It's costing taxpayers real money.

The timing matters. The Adams administration has been pressing hard on a digital modernization agenda ahead of the 2026 FIFA World Cup, with city officials scrambling to ensure that tourist-facing platforms — everything from the NYC.gov events portal to the MTA's trip-planning app — run cleanly and load fast for the roughly 1.5 million additional visitors expected in the New York metro area during the tournament. Bloated databases slow load times. Slow load times lose users. The city knows this.

What the Audit Actually Found

The May audit, which covered 38 separate city agency content management systems, found that duplicate images accounted for roughly 31 percent of total stored visual assets across municipal servers hosted at the city's primary data center on Gold Street in Lower Manhattan. The average agency had more than 110,000 image files that were exact or near-exact copies of assets stored elsewhere in the same system. The Parks Department alone had logged 287,000 duplicate files, many of them photographs of the same green spaces — Prospect Park, Flushing Meadows-Corona Park, the High Line — uploaded repeatedly by different staff members over a period of eight years without any deduplication protocol in place.

Storage costs for city agencies run approximately $0.023 per gigabyte per month on municipal cloud contracts negotiated through the Mayor's Office of Technology and Innovation. That sounds trivial until you scale it. The 4.2 million duplicate files occupy an estimated 18.7 terabytes of redundant space. Run the math: that's roughly $430 per month, or just over $5,100 per year, for data the city doesn't need to keep. Across a five-year contract cycle, the waste climbs past $25,000 for storage alone — before factoring in the labor cost of managing, backing up, and migrating that redundant data during system upgrades.

The Office of Technology and Innovation, headquartered at 2 Metro Tech Center in Downtown Brooklyn, has been piloting an automated deduplication tool called ClearAsset since March 2026, deploying it first across the Department of Buildings and the Department of Transportation. Early results showed a 44 percent reduction in image redundancy within those two agencies over a 90-day window, freeing up 3.1 terabytes of server space. The pilot is slated to expand to 12 more agencies by October 1, 2026, with a full citywide rollout targeted for the first quarter of 2027.

Why This Runs Deeper Than Storage Bills

The financial case is one thing. The operational case is another. When city employees search internal databases for an approved image — say, a photograph cleared for use in a public health campaign — and surface 47 versions of the same file, they often can't tell which is the master copy. That confusion has led to outdated graphics appearing on official communications. The Health Department's COVID-era vaccination graphics were still appearing on three borough-level clinic pages as recently as February 2026, eight months after updated versions replaced them, because staff were pulling from stale duplicate folders rather than a centralized asset library.

For residents and the press trying to hold agencies accountable through Freedom of Information Law requests, duplicate image sprawl creates another problem: response times stretch when agencies have to sort through redundant file trees. The city received 94,000 FOIL requests in fiscal year 2025, and the average response time hit 47 days — well above the state-mandated five-business-day acknowledgment window.

The practical advice from the Office of Technology and Innovation is straightforward for anyone managing content on city systems: before uploading a new image, run a hash check against the existing library. The ClearAsset pilot does this automatically, but staff training is lagging. The office has scheduled mandatory digital asset management workshops at the Brooklyn Navy Yard's New Lab facility in August and September 2026, with sessions open to content managers from every city agency. Registration opened July 1.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.