New York City's Duplicate Image Problem: The Numbers Buried Inside the Housing Database
A quiet crisis in the city's property records system is costing agencies time and money—and the data tells a striking story about how bad it has gotten.
A quiet crisis in the city's property records system is costing agencies time and money—and the data tells a striking story about how bad it has gotten.

Tens of thousands of duplicate property images are clogging New York City's municipal records databases, according to city agency workload figures reviewed this week, creating a backlog that slows housing inspections, permit processing, and tenant complaint resolutions at a moment when the city can least afford the drag. The problem is old, but the scale is newly measurable—and the numbers are worse than many inside City Hall have publicly acknowledged.
The timing matters. With the Adams administration under sustained pressure to accelerate affordable housing approvals and the Department of Buildings working through a permit queue that swelled during the post-pandemic construction surge, any redundancy inside the image-management infrastructure that supports those workflows carries a direct cost. Every duplicate photo attached to a building complaint file means an inspector or a data analyst spends additional seconds—sometimes minutes—loading, sorting, and discarding before reaching actionable information. Multiply that across hundreds of thousands of filings per year and the arithmetic becomes uncomfortable.
The Department of Buildings processed more than 400,000 complaints citywide in fiscal year 2025, based on figures the agency has published through its public performance dashboard. Internal IT assessments shared with City Council members during budget hearings earlier this year suggested that image duplication rates in the agency's legacy BIS—Building Information System—platform ran as high as 18 percent in certain borough-specific file clusters. Brooklyn's Kings County records, particularly filings tied to residential conversions in Bushwick and East New York, showed the highest density of repeated attachments, sources familiar with the matter said, though the agency has not released a formal audit.
The problem compounds at the Department of Housing Preservation and Development, which manages its own parallel inspection photo archive through the Automated City Register Information System interface. HPD closed roughly 215,000 housing maintenance code violations in fiscal year 2024. When the same building generates multiple complaints—a common pattern on the Grand Concourse in the Bronx, where aging pre-war stock draws repeat filings—photo attachments from earlier cases are frequently re-uploaded rather than linked, inflating storage usage and search time simultaneously.
Storage is not free. City contracts for cloud-based municipal data infrastructure, including image hosting tied to the 311 complaint ecosystem, have run to multi-million-dollar annual agreements with vendors including Microsoft and Amazon Web Services. Duplicate images do not just slow processing—they consume contracted storage allocation that the city pays for by the gigabyte tier. A conservative internal estimate circulated among IT staff put the wasted storage from image duplication across three major agencies at roughly 4.2 terabytes as of March 2026, though that figure has not been independently verified or officially published.
The Mayor's Office of Technology and Innovation has been piloting a deduplication algorithm through its NYC Digital Services program since late 2024. The tool uses perceptual hash matching—a technique that identifies visually identical or near-identical images without comparing files byte-by-byte—and has been tested on a subset of HPD's archive covering properties in Community Board 7 in Manhattan, roughly the Upper West Side corridor between West 59th and West 110th Streets. Early results from that pilot, presented to the City Council Technology Committee in April 2026, showed a 22 percent reduction in retrievable duplicate images within the test dataset.
Scaling that pilot citywide is the next challenge. The full DOB image archive stretches back to digitization efforts begun around 2003, meaning more than two decades of accumulated files need to be processed. A full rollout is not expected before the second quarter of 2027 at the earliest, based on the technology office's own project timeline documents.
For New Yorkers navigating housing complaints right now, the practical upshot is straightforward: when filing through 311 or directly through the HPD online portal, attaching fewer, clearly labeled images—rather than uploading every photo taken at a problem site—reduces the chance of your complaint being buried in a duplicate-heavy queue. The agencies are catching up. The data says they have a long way to go.
How does this story make you feel?
Spread the word
About this article
Published by The Daily New York
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News