The Daily New York

New York news, every day

News

How New York's Public Records Got Flooded With Duplicate Images — And Why City Hall Is Only Now Cleaning It Up

A decades-long backlog of redundant digital files has quietly clogged municipal databases, cost taxpayers millions in storage fees, and slowed the agencies residents depend on most.

By New York News Desk · Published 4 July 2026, 2:40 pm

3 min read

The problem did not arrive overnight. Somewhere between the city's late-1990s push to digitize paper archives and the post-pandemic scramble to migrate agency workflows to the cloud, New York City's municipal databases became saturated with duplicate image files — scanned permits, inspection photos, building survey documents — stored redundantly across dozens of siloed systems. Now, nearly three decades into that digitization effort, city agencies are confronting a backlog that information-management specialists say may run into the hundreds of millions of redundant files.

The issue matters right now for a straightforward reason: New York is hosting FIFA World Cup matches this summer, with MetLife Stadium in East Rutherford serving as a primary venue and Midtown Manhattan functioning as the fan hub. City agencies — from the Department of Buildings to the NYPD's Real Time Crime Center on West 35th Street — are under pressure to move faster on data requests than their current infrastructure easily allows. Duplicate image files slow retrieval, inflate storage costs, and create version-control confusion when inspectors or officers need to pull the correct, most recent record quickly.

How the Backlog Built Up

The roots go back to 2003, when the Bloomberg administration launched the City of New York's first comprehensive enterprise content management initiative, pushing agencies to scan physical records en masse. The mandate was volume-focused: get paper off the shelves. Quality controls around deduplication — the process of identifying and purging identical or near-identical image files — were minimal at best. Agencies uploaded the same scanned documents to multiple servers to guard against data loss, which was a reasonable redundancy strategy, but nobody systematically reconciled those copies afterward.

By 2015, the de Blasio administration's Office of Technology and Innovation — then called the Department of Information Technology and Telecommunications, or DoITT, headquartered at 2 Metrotech Center in Downtown Brooklyn — had flagged the duplication problem internally. A 2019 audit by the city's Department of Investigation examined document management practices across 14 city agencies and found that storage expenditures had grown substantially faster than the volume of genuinely new records being created, though the audit did not publish a single consolidated duplication figure. The audit recommended a citywide deduplication protocol. That protocol was never fully implemented before the pandemic disrupted operations in March 2020.

Remote work made things dramatically worse. Between March 2020 and December 2021, city employees uploaded files from home networks to agency cloud environments using consumer-grade tools — Microsoft OneDrive, Google Drive, personal email — then re-uploaded the same files into official systems once they returned to offices. The Department of Buildings alone, which operates out of 280 Broadway in Lower Manhattan, saw its cloud storage consumption more than double during that window, according to budget documents the Adams administration submitted to the City Council's Technology Committee in February 2025.

What the Adams Administration Is Doing — and What It Isn't

The current administration's Office of Technology and Innovation, now operating under a restructured mandate at 1 Centre Street, began a phased deduplication project in the fall of 2024. The first phase focused on the Department of Buildings and the Department of City Planning. Phase two, targeting NYPD and Fire Department image repositories, was scheduled to begin in January 2026 but has been delayed; the OTI has not publicly explained why.

City Council Member Gale Brewer, who represents the Upper West Side and has long pressed for government transparency on records management, has asked OTI for a progress report through the Council's oversight process. No report had been publicly released as of July 4, 2026.

The practical consequences for ordinary New Yorkers show up in places like the Brooklyn Housing Court at 141 Livingston Street, where attorneys representing tenants in repair cases have reported delays obtaining building-inspection photographs — delays that can stall proceedings by weeks when the correct image cannot be quickly verified against the one on file.

For residents, the most immediate step is straightforward: when filing requests through the city's FOIL portal — accessible at records.nyc.gov — specify the document date range and the agency of origin. Narrower requests get routed through smaller data pools and tend to return results faster while the broader cleanup is still underway.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.