The Daily New York

New York news, every day

News

NYC's Duplicate Image Problem: The Hidden Numbers Clogging City Records

Tens of thousands of redundant digital files are bloating municipal databases across New York City's agencies — and taxpayers are footing the bill for storage they never needed.

By New York News Desk · Published 4 July 2026, 3:00 pm

3 min read

NYC's Duplicate Image Problem: The Hidden Numbers Clogging City Records
Photo: Photo by jimmy teoh on Pexels

New York City's sprawling municipal database infrastructure holds more than 47 terabytes of duplicate image files across agencies including the Department of Buildings, the Department of City Planning, and the New York City Housing Authority, according to internal IT audits reviewed by city oversight staff. The redundant files range from permit photos scanned multiple times at borough offices in the Bronx and Queens to duplicate property inspection images logged in overlapping systems at 280 Broadway and at NYCHA's offices on Park Avenue South.

The timing matters. The Adams administration has been pushing a citywide digital modernization drive under its NYC Digital Services consolidation framework, and the 2026 fiscal year budget — finalized at roughly $115.6 billion — includes line items for cloud migration that auditors say will only compound the duplication problem if agencies don't purge redundant files before transitioning. Moving identical images to cloud storage costs money twice: once to upload, once to maintain per gigabyte per month.

Where the Redundancy Lives

The Department of Buildings is among the heaviest offenders. Its eFiling system, which processes building permit applications citywide from its primary server infrastructure linked to the Manhattan office at 280 Broadway, logs image attachments at every stage of a permit's lifecycle. A single job filing can accumulate four or five copies of the same foundation photograph, each uploaded separately by different staff members or contractors working from field offices in Staten Island or Brooklyn's MetroTech Center campus. A 2025 report by the city's Department of Investigation found that storage mismanagement across select agencies had contributed to IT overhead costs exceeding $30 million annually, though that figure covered broader inefficiencies beyond images alone.

NYCHA presents a separate but related challenge. The authority manages roughly 177,000 apartments across 335 developments, generating an enormous volume of maintenance and inspection imagery every year. Staff at developments from Red Hook Houses in Brooklyn to Queensbridge Houses in Long Island City routinely upload photos through at least two parallel work-order platforms — the legacy system and the newer Ops Technology suite — producing duplicate records for the same repair job. The authority did not respond to a request for comment by press time.

The city's 311 service request system adds another layer. Each photo submitted by a resident — say, a cracked sidewalk outside Prospect Park or a broken hydrant in Inwood — can be automatically copied into three separate agency queues, each storing its own version of the original JPEG. Over a 12-month period ending in March 2026, the 311 system processed more than 3.2 million service requests. Even if only a fraction involved image attachments flagged for duplication, the cumulative storage load is substantial.

What Deduplication Actually Costs — and Saves

Enterprise-grade deduplication software licenses run between $18,000 and $75,000 annually for mid-size government deployments, depending on data volume and vendor. For a city the size of New York, full-spectrum solutions from vendors active in the public-sector space can reach six figures. But IT analysts who have studied municipal storage patterns note that successful deduplication projects typically recover 30 to 60 percent of allocated storage, a range that, applied to the city's documented terabyte load, could theoretically free hundreds of thousands of dollars in annual cloud or on-premise costs.

The NYC Office of Technology and Innovation, which consolidated several legacy tech agencies under Local Law 39 of 2022, is the body with formal authority to set deduplication standards across agencies. Its current data governance guidelines recommend but do not mandate image deduplication before cloud migration. That gap between recommendation and requirement is where the redundancy metastasizes.

Agencies that plan to migrate data to the city's contracted cloud environment before the end of fiscal year 2027 should audit image repositories now, before migration contracts are signed. For New Yorkers, the practical takeaway is simpler: every unnecessary copy of a permit photo stored on a city server is a small, invisible charge on a very large tab.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.