The Daily New York

New York news, every day

News

New York's War on Duplicate Images in City Records: How It Stacks Up Against London, Tokyo and Toronto

As municipalities worldwide rush to clean up bloated digital archives, New York's agencies are taking markedly different approaches — with uneven results.

By New York News Desk · Published 4 July 2026, 2:44 pm

4 min read

New York's War on Duplicate Images in City Records: How It Stacks Up Against London, Tokyo and Toronto
Photo: Photo by Andres Figueroa on Pexels

New York City's Department of Records and Information Services is sitting on a backlog. Across its digitized municipal archive — spanning property deeds, construction permits, and decades of agency filings — duplicate image files account for a significant share of stored data, creating retrieval bottlenecks for clerks in Lower Manhattan's 31 Chambers Street offices and frustrating residents trying to pull records through the city's online portal. The problem is neither new nor unique to New York, but the way the city is handling it stands in sharp contrast to how comparable global cities have tackled the same headache.

The timing matters. New York is three years into a broader push to digitize municipal records following the 2023 City Council legislation requiring agencies to migrate paper archives to searchable electronic formats by 2027. That mandate, combined with the surge in permit applications tied to World Cup infrastructure upgrades and the ongoing congestion pricing rollout, has flooded city servers with new filings — many of them redundant copies of the same scanned documents. The result is a digital pile-up that costs storage money and slows public access.

What New York Is — and Isn't — Doing

The Department of Records, working out of its Surrogate's Court annex on Chambers Street, launched a deduplication initiative in late 2024 using open-source software tools procured through the Mayor's Office of Technology and Innovation. The effort is manual-assist: algorithms flag likely duplicates, then human reviewers confirm deletions before anything is purged. It is methodical. It is also slow. As of early 2026, the office had cleared roughly 40 percent of its flagged backlog across legacy property record scans dating to the 1980s, according to city budget documents reviewed for this article.

The Metropolitan Transportation Authority faces a parallel problem in its Capital Program document library. Thousands of engineering drawings and environmental review files submitted since 2020 — many tied to the Second Avenue Subway Phase 2 planning process — exist in multiple identical versions across different internal servers. The MTA's information management team has no published deduplication policy, a gap that outside records management specialists have flagged in public comments submitted to the MTA board.

The contrast with London is instructive. Transport for London completed a full deduplication sweep of its project document archive in 2023, deploying a cloud-based system from a British public-sector technology firm. The Greater London Authority's digital records office consolidated storage across 11 borough partnerships, cutting redundant image files by roughly 60 percent within 18 months and reducing annual cloud storage costs by an estimated £2.3 million, figures the authority published in its 2024 annual transparency report. Tokyo's Metropolitan Government took a different route: a centralized digital records bureau, established in 2022, handles deduplication for all 23 special wards through a single vendor contract, standardizing file formats citywide before images are even archived.

Toronto's Model and What New York Could Learn

Toronto is the closest governance parallel to New York's fragmented agency structure. The City of Toronto's Archives and Records Management division runs what it calls a Unified Digital Stewardship Program, launched in January 2024 with a CAD $4.1 million budget allocation. Rather than waiting for duplicates to accumulate, the program intercepts redundant files at the point of upload through automated hash-matching — essentially a digital fingerprint check that rejects exact copies before they enter the archive. Toronto published a progress report in March 2026 showing the system had prevented over 800,000 duplicate image files from entering its municipal database in its first 14 months of operation.

New York's fragmented structure — where each agency controls its own records infrastructure — makes that kind of centralized interception difficult. The Mayor's Office of Technology and Innovation has proposed a shared cloud storage standard that would enable hash-matching across agencies, but the proposal has not yet been funded in the Fiscal Year 2027 budget currently before the City Council's Finance Committee.

For residents trying to pull property records through the city's ACRIS system, or contractors downloading permit documents through the Department of Buildings portal on Worth Street, the practical fix is straightforward: check file metadata before downloading, use the document ID number rather than the keyword search, and report duplicate listings directly to the agency's records desk. The city's 311 portal accepts records complaints and routes them to the relevant department — a workaround that is imperfect but, for now, the best option available.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily New York

This article was produced by the The Daily New York editorial desk and covers news in New York. See our editorial standards for how we use AI.

The Daily New York brief

The day's New York news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to New York news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily New York and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily New York

More in News

Enjoyed this story? Get tomorrow's briefing free.