Berlin's Senate Department for Urban Development confirmed earlier this year that the city's mapping and digital services division had identified more than 340,000 duplicate or outdated images embedded across municipal databases, planning portals and public-facing transport apps — a backlog that had accumulated over nearly a decade of piecemeal digitisation. The cleanup, now roughly 60 percent complete, is the largest such exercise any German city has publicly undertaken.
The issue matters because bloated image libraries slow down the systems that hundreds of thousands of Berliners use daily. The BVG transit app, which logged more than 2.3 million active users in 2025 according to the operator's annual report, has historically suffered loading delays partly attributable to redundant asset files. The city's official geodata portal, hosted at the Berliner Stadtplan infrastructure and maintained by the Senate's geoinformation unit, faces similar drag. With Berlin pushing hard to position itself as a European tech and startup hub — the Mitte and Prenzlauer Berg corridors alone house dozens of civic-tech firms with public contracts — the pressure to clean house has become commercially as well as administratively urgent.
What Berlin Is Actually Doing
The deduplication effort is being coordinated through the Berlin Open Data platform, a project under the Senate Chancellery's digital services arm. Technicians there are using a hashing-based comparison tool to flag images that are visually identical or near-identical across different datasets. Files flagged for removal are held in a quarantine folder for 30 days before permanent deletion, a safeguard designed to prevent accidental loss of historically significant materials. The Landesarchiv Berlin, based on Eichborndamm in Reinickendorf, has been brought in to review any flagged assets predating 2005.
Friedrichshain-Kreuzberg's district administration has run a parallel, smaller-scale version of the same process for its local planning documents, finishing that round in March 2026. Officials there cut the district's image asset volume by 28 percent, which reduced average page-load times on the district's public consultation portal, according to figures the administration published in its quarterly digital services bulletin.
The Rotes Rathaus project management office, which oversees cross-departmental IT coordination, has set a completion deadline of Q1 2027 for the Senate-level cleanup.
How Berlin Compares to Amsterdam, Vienna and Seoul
Amsterdam's Gemeente digital team ran a comparable deduplication exercise in 2024, targeting its city photography archive and infrastructure inspection database. The Dutch capital finished the job in seven months, faster than Berlin's current pace, aided by a single unified content management system adopted citywide in 2021. Berlin still operates across several legacy platforms, a fragmentation that slows cross-database matching considerably.
Vienna took a different approach entirely. Rather than retrospective cleanup, the Austrian capital built automated deduplication into its upload pipeline for all new government imagery from January 2025 onward, through a system procured from a local GovTech supplier. New duplicates are rejected at the point of entry. The retrospective backlog in Vienna remains largely unaddressed, which means Berlin will likely have cleaner historical data than its southern rival once the 2027 deadline passes.
Seoul's Smart City Division, operating under a significantly larger budget, completed a full deduplication of its 4.7 million-image public asset library in late 2025, compressing it to just under 1.9 million files. The scale dwarfs anything Berlin is managing, but city IT administrators have noted that Seoul's centralised governance model makes that kind of speed possible in ways that Berlin's federal-state structure does not.
For residents and developers who rely on Berlin's open data resources — particularly the roughly 1,400 registered users of the city's geodata API — the practical advice is straightforward: before Q3 2026, expect intermittent unavailability of specific image layers in the Stadtplan portal as batches are cleared and indexes rebuilt. The Senate's digital services team has published a maintenance calendar on berlin.de listing specific downtime windows through October. Developers building on city data are advised to cache image assets locally rather than pulling them live during that period.
The fuller picture won't be visible until the Rotes Rathaus publishes a final audit report, currently expected alongside the Q1 2027 completion announcement. That document is expected to form the basis of a pan-German best-practice recommendation through the IT-Planungsrat, the federal body that coordinates digital standards across all sixteen Länder.