New York City's Department of Information Technology and Telecommunications quietly acknowledged last month that its street-level imaging database—used by planners, emergency responders, and the MTA—contains an estimated 340,000 duplicate or near-duplicate photographs collected during successive Cyclomedia scanning contracts since 2019. The redundant files are clogging servers at the city's Brooklyn Navy Yard data center, inflating storage costs by roughly $2.1 million annually, and in at least a dozen documented cases, feeding outdated imagery into tools used by the Office of Emergency Management on Cadman Plaza West.
The timing matters. With tens of thousands of FIFA World Cup visitors expected at MetLife Stadium and at fan zones stretching from Flushing Meadows-Corona Park to Hudson Yards this summer, the city's street navigation and public safety infrastructure is under more strain than at any point in recent memory. Accurate, deduplicated visual data underpins everything from real-time bus rerouting on the B46 in Flatbush to the NYPD's domain awareness feeds in Midtown. Garbage in, garbage out—and right now, there is a lot of garbage.
What New York Is Actually Doing About It
The Adams administration awarded a $4.8 million contract in March to Palantir-adjacent geospatial firm Nearmap to run automated perceptual hashing across the full image archive, a process that compares visual fingerprints of photographs to flag near-identical frames for human review. The project is expected to finish its first pass by September 30. NYC Digital, the city's technology arm, is running a parallel pilot with the Urban Tech Hub at Cornell Tech on Roosevelt Island, testing open-source deduplication algorithms that the city could eventually own outright rather than license. That pilot covers roughly 18 square miles of the South Bronx and East Harlem.
The Metropolitan Transportation Authority has its own, separate imaging backlog. Its Subway Station Environment Survey, which photographs every inch of all 472 stations for maintenance planning, generated an internal audit last October showing that 22 percent of images captured at stations including Atlantic Avenue-Barclays Center and 125th Street on the A/C/E were either exact duplicates or differed by fewer than four pixels. MTA infrastructure staff have been manually flagging the worst offenders since January, a process a senior official described to this reporter as "whack-a-mole."
How London and Seoul Are Handling the Same Problem
New York is not alone, but it is notably behind two cities that have confronted identical problems at scale. Transport for London embedded automated deduplication directly into its StreetSpace imaging pipeline in 2023, eliminating redundant capture at the source rather than cleaning archives after the fact. The result: TfL reports its Euston Road corridor imagery archive runs 31 percent leaner than equivalent New York street segments despite comparable scanning frequency.
Seoul's Smart City Division took a different route, mandating in January 2025 that all contractors submit imagery metadata in a standardized schema that flags potential duplicates before files are ever uploaded to the city's Mapo District servers. The Korean capital spent approximately $1.3 million building that intake system—less than two-thirds of what New York spends each year simply storing its redundant files.
Amsterdam has gone further still, integrating deduplication into its open-source FIWARE urban data platform, meaning the city's imaging data is cleansed continuously rather than in periodic, expensive contract cycles. New York's technology officials have visited Amsterdam twice in the past 18 months to study the model, according to procurement records obtained through a Freedom of Information request.
For New Yorkers, the practical stakes are clearest at the neighborhood level. Residents in Jackson Heights who use the city's 311 portal to report cracked sidewalks or broken street lamps are unknowingly relying on street-image verification that, in some blocks, pulls from photographs taken in 2021. The Nearmap contract should update those feeds. But advocates at the Center for Urban Future on Park Avenue South argue that the city needs a permanent deduplication standard baked into every future imaging procurement—not a one-time cleanup followed by the same cycle of accumulation. That argument is already circulating among staffers on the City Council's Technology Committee ahead of budget hearings scheduled for September.