Berlin's municipal digital infrastructure has a redundancy crisis. Across city-run databases — from the Senatsverwaltung für Stadtentwicklung's planning portals to the BVG's internal asset management systems — duplicate images have accumulated into the hundreds of thousands, clogging servers and complicating archival accuracy. The problem has grown severe enough that city IT officials are now openly calling for a coordinated replacement and deduplication strategy.
The issue sounds mundane. It isn't. Berlin's public institutions collectively manage enormous volumes of digital imagery — construction site documentation, transit infrastructure records, housing stock photography used to assess rent-cap compliance under the Mietspiegel framework. When the same image exists in two, five, or a dozen versions across separate databases, staff waste hours reconciling records, storage costs balloon, and the legal defensibility of official documentation comes into question.
Why Now, and Who Is Saying It
The urgency is partly fiscal. Berlin's 2026 budget, passed by the SPD-led Senate coalition in late 2025, allocated roughly €340 million to digital infrastructure modernisation — but a significant share of that was earmarked for new data capacity rather than cleaning up existing systems. Critics on the Abgeordnetenhaus science and technology committee have since argued publicly that expanding storage without tackling duplication is the digital equivalent of building more shelves instead of throwing out old boxes.
The Technologiestiftung Berlin, which operates out of Grunewaldstraße in Schöneberg and advises the Senate on smart-city policy, has flagged duplicate image accumulation as a systemic issue in its annual digital infrastructure reviews. The foundation's position, reflected in its published documentation, is that deduplication should be treated as a precondition for any serious AI-readiness programme — not an afterthought. Berlin's tech sector, concentrated in the startup clusters around Prenzlauer Berg and along the Revaler Straße corridor in Friedrichshain, has echoed that view, with several firms specialising in computer-vision-based deduplication now pitching directly to Senate procurement offices.
Akademie der Künste, whose digital archive on Pariser Platz holds decades of cultural documentation, has separately acknowledged internal reviews of its image catalogues after identifying redundant files from digitisation campaigns run between 2018 and 2023. The scale of duplication in cultural institutions is harder to quantify than in administrative systems, but archivists have described it as a structural problem inherited from rushed scanning projects with no consistent metadata standards.
What a Fix Actually Looks Like
Experts broadly agree the solution has two layers. The first is technical: deploying perceptual hashing algorithms that identify visually identical or near-identical images regardless of file name or format, then flagging them for human review before deletion. The second is governance: establishing a city-wide image asset policy with mandatory unique identifiers assigned at point of upload, so the duplication pipeline is closed from the source.
The BVG, which maintains tens of thousands of infrastructure photographs across its U-Bahn and bus network documentation, confirmed in its 2025 annual report that it was piloting an internal asset-management overhaul — though the report did not specify deduplication as a named objective. IT procurement specialists in the city say any serious deduplication tender for a mid-sized Berlin department could run between €80,000 and €250,000 depending on database complexity and the volume of images requiring human arbitration.
The practical path forward, according to the Technologiestiftung's published framework, runs through three steps: a full audit of existing image assets department by department, adoption of the EU's DCAT metadata standard across all public portals, and a cross-departmental working group with authority to enforce deletion protocols. The Senate's digital office has not publicly committed to a timeline for any of these steps.
For Berliners, the stakes are not abstract. Housing inspectors relying on unduplicated, accurately catalogued property photographs to enforce rent-cap rules under the Mietspiegel need clean records. BVG maintenance teams cross-referencing platform safety images need to know they are looking at the right file. City planners mapping Friedrichshain's development density need photographs that appear once, correctly dated, correctly sourced. The city has the budget line and the expertise nearby. The question officials are now being pressed to answer is whether it also has the will to act before the 2027 budget cycle closes another window.