New York City's sprawling network of digital archives holds millions of scanned documents, permit photographs, and property records — and a growing share of them are duplicates. The city's Department of Buildings, which processes roughly 1.5 million permit applications annually, has acknowledged internally that redundant image files are inflating storage costs and slowing down the retrieval systems that inspectors, contractors, and homeowners rely on every day. The question now is who decides how to fix it, and when.
The timing matters because 2026 is not a quiet year for city infrastructure. FIFA World Cup matches at MetLife Stadium, just across the Hudson in East Rutherford, have pushed every city agency that touches public-facing services into a review of its digital front end. Visitors and contractors pulling permits near venues along the West Side Highway, around Hudson Yards, and in Midtown have encountered slowdowns on the Department of Buildings' Digital Vault portal — a system that stores tens of thousands of archived construction photographs. Duplicate files are a documented contributor to those slowdowns, according to city IT procurement records reviewed as part of the fiscal year 2026 budget process.
Where the Backlog Lives
The problem is concentrated in a handful of legacy databases. The Buildings Department's BIS portal — Building Information System — was originally launched in the late 1990s and has absorbed multiple data migrations since. Each migration, city technology officials have noted in budget testimony before the City Council's Technology Committee, introduced duplicate image artifacts that were never purged. The Department of City Planning's ZOLA database, which covers zoning applications across all five boroughs, faces a related issue: duplicate scans of Environmental Impact Statement exhibits filed between 2010 and 2019 are estimated to occupy several terabytes of redundant storage on servers maintained at the city's data center on Gold Street in Lower Manhattan.
The Department of Citywide Administrative Services put out a request for proposals in March 2026 for a deduplication and digital asset management contract. The RFP listed a ceiling value of $4.2 million over three years. Three vendors submitted bids before the April 30 deadline, according to public procurement records posted on the city's PASSPort vendor portal. A contract award has not been announced. The Adams administration's Office of Technology and Innovation did not respond to a request for comment by publication time.
For businesses working near the Brooklyn Navy Yard or filing for renovations in Jackson Heights, the practical consequence is a permit system that sometimes returns duplicate image results when searching property histories — requiring staff to manually sort through files that should have been consolidated years ago. The Municipal Art Society of New York flagged the issue in a March 2026 report on digital access to landmark records, noting that redundant image entries were creating confusion in applications reviewed by the Landmarks Preservation Commission on Vesey Street.
The Decisions That Cannot Wait
Three choices are now sitting on desks at City Hall. First, the DCAS contract needs to be awarded before the fiscal year 2027 budget cycle locks in October; any delay pushes implementation into 2027 at the earliest. Second, the city must decide whether to run a retroactive purge of existing duplicate files or simply stop new duplicates from entering the system going forward — a distinction that affects both cost and the integrity of historical records. Third, the City Council's Technology and Telecommunications Committee, which held a hearing on city data infrastructure in February, is expected to take up the question of mandatory deduplication standards for all agencies, not just Buildings and City Planning.
For anyone interacting with city systems in the coming months — a developer pulling archived plans for a Bushwick warehouse, a homeowner checking permit history on a Crown Heights brownstone — the practical advice is straightforward: flag duplicate entries using the feedback tool on BIS and ZOLA, because agencies have indicated that user-reported duplicates are being routed into the remediation queue. It is not fast. But until the contract is awarded and the purge begins, it is the only pipeline that exists.