New York City's sprawling network of municipal agencies is sitting on hundreds of thousands of duplicate digital images — redundant files stored across servers maintained by the Department of Information Technology and Telecommunications, known as DoITT — and the redundancy is costing taxpayers a measurable slice of annual IT expenditure, according to a review of city budget documents and infrastructure procurement records filed with the Office of Management and Budget.
The problem is not new. But the numbers have grown sharper, and the timing is worse. With the city having committed significant capital toward modernizing subway signage through the MTA's digital display network and preparing public-facing web infrastructure to handle an expected surge in visitors for the 2026 FIFA World Cup — matches at MetLife Stadium in East Rutherford began June 11 — the redundancy embedded in city content management systems has moved from a background annoyance to a front-burner operational issue.
What the Numbers Actually Show
City budget filings for fiscal year 2026, which closed June 30, showed DoITT's enterprise storage line item running at roughly $47 million annually across primary and backup infrastructure contracts. Independent audits of municipal content repositories in comparably sized cities — Chicago's Department of Assets, Information and Services published a 2024 internal review — have found duplicate image files accounting for between 18 and 23 percent of total unstructured data storage consumption. Apply that range conservatively to New York's footprint and the implied waste runs into the low eight figures each year.
The NYC Department of City Planning's public data portal, hosted at 120 Broadway, maintains layers of map tile imagery and zoning illustration files that staff at the agency have flagged internally as containing significant duplication — the same base map rendered at different resolutions and saved under different filenames rather than referenced from a single canonical source. The Housing Preservation and Development agency, which manages photo documentation for roughly 1,100 buildings in its Alternative Enforcement Program across the Bronx and Upper Manhattan, faces a parallel problem: site inspection photographs uploaded by field staff frequently duplicate images already in the system because the intake workflow lacks automatic hash-comparison at upload.
Hash-based deduplication — a standard technique that generates a unique fingerprint for each file and blocks identical copies from being stored twice — has been commercially available since the mid-2000s and is embedded in most enterprise storage platforms sold after 2015. The city's contracts with storage vendors, including an agreement with Dell Technologies renewed in fiscal year 2024, include provisions for deduplication tooling. Whether those tools are fully activated across all agency environments is a separate question, and one the city's independent budget office has not publicly answered.
World Cup Pressure and a Practical Window
The World Cup timeline is adding urgency. NYC Tourism + Conventions, headquartered on Sixth Avenue in Midtown, has been coordinating with city agencies to push image-heavy promotional content through municipal web channels. Load testing conducted ahead of major international events has historically exposed latency problems rooted in unoptimized asset libraries — a lesson the city's Office of Technology and Innovation, which absorbed some DoITT functions after a 2023 reorganization, has been working to absorb.
The practical fix is not complicated. Agencies running Microsoft SharePoint environments — which include at minimum the Department of Buildings and the Administration for Children's Services, both of which completed SharePoint migrations between 2021 and 2023 — can enable built-in storage sense features that flag duplicates for administrator review. For legacy environments, third-party deduplication tools from vendors including Varonis and Komprise have been piloted in at least two city agencies since 2022, though citywide rollout has not been publicly announced.
The fiscal year 2027 preliminary budget, due from the Mayor's Office of Management and Budget in January, will offer the next clear window to see whether DoITT and the Office of Technology and Innovation have carved out funding for a systematic deduplication initiative. Advocates for leaner municipal IT spending, including the Citizens Budget Commission based on Broad Street in Lower Manhattan, have previously identified unstructured data governance as an underexamined area of potential savings. The numbers, at least, are there to make the case.