New York City government systems are carrying millions of redundant digital files — duplicate images embedded in permit applications, housing inspection reports, court filings, and agency databases — and the cleanup effort, years in the making, has finally forced a reckoning across multiple departments simultaneously. The problem is not new. The scale of it, in 2026, finally is.
The timing matters for reasons that go beyond housekeeping. The city is hosting World Cup matches at MetLife Stadium this summer, drawing scrutiny to everything from sidewalk cleanliness to the speed of credentialing systems that rely on the same digital infrastructure plagued by redundant data. Meanwhile, the Adams administration has staked a significant portion of its second-term pitch on modernizing city tech, a promise that sits uneasily alongside a records management system that, in some agencies, dates to the mid-2000s.
How the Duplication Problem Grew
The roots of the current mess trace back to a structural decision made during the Bloomberg-era consolidation of city IT services under the Department of Citywide Administrative Services, known as DCAS. When agencies migrated to shared servers between roughly 2008 and 2014, file-naming conventions were never standardized. Inspectors uploading photos from housing violations in the Bronx, for instance, often uploaded the same image multiple times — once from a field device, once from an office terminal — because the system offered no deduplication check at the point of entry. The Buildings Department and the Department of Housing Preservation and Development both inherited versions of this workflow.
The problem compounded as the city's 311 portal expanded. When residents began submitting complaints with photo attachments — a feature that became standard after a 2016 portal update — the volume of incoming image files increased sharply. Many of those images were automatically copied into multiple agency queues. By the early 2020s, internal audits at the Mayor's Office of Technology and Innovation, located at 253 Broadway in lower Manhattan, flagged the storage redundancy as a budget concern, but remediation funding was repeatedly deprioritized during the pandemic fiscal crunch.
The Department of Records and Information Services, which maintains the city's official archives at 31 Chambers Street, began tracking the issue formally in 2023 as part of a broader digital preservation review. That review found that duplicate image files were consuming a disproportionate share of cloud storage costs contracted through the city's vendor agreements. Storage costs for city agencies have risen substantially since 2020, driven by expanded digitization mandates passed by the City Council.
What Agencies Are Doing Now
The Office of Technology and Innovation launched a deduplication pilot in late 2025 covering the Buildings Department and the 311 system. The pilot uses hash-matching software to identify identical files and flag them for archival review before deletion — a safeguard required under city records retention law, which prohibits agencies from destroying files without sign-off from the Department of Records. As of this spring, the pilot had not yet expanded citywide.
The practical stakes extend to residents navigating the city's bureaucracy right now. Homeowners in neighborhoods like Jackson Heights and Crown Heights who have filed for renovation permits through the Buildings Department's DOB NOW portal have sometimes received processing delays traced, in part, to bloated file queues that slow document retrieval. Tenant advocates working with organizations including Housing Court Answers, which operates out of 111 Centre Street, have noted anecdotally that case workers spend extra time locating correct inspection photos when duplicates exist across related complaints.
The City Council's Technology Committee has scheduled an oversight hearing for later this year on city data infrastructure, which is expected to include testimony on the deduplication pilot's progress. For residents and applicants tangled in the system right now, the practical advice from agencies is straightforward: when submitting documents or images through any city portal, confirm the upload was successful before resubmitting — every redundant file submitted by the public feeds the backlog that is already costing the city time and money to untangle.