Berlin's public digital infrastructure has a clutter problem. Across municipal databases, neighbourhood website portals and the Senatsverwaltung's own document management systems, administrators estimate tens of thousands of duplicate or broken placeholder images have accumulated over the past decade — slowing load times, confusing residents and creating real headaches for archivists trying to modernise the city's records ahead of key 2027 digitisation deadlines.
The issue has quietly become a talking point among Berlin's tech and governance circles this summer. At a working session of the Berliner Beauftragter für Datenschutz und Informationsfreiheit in late June, participants flagged the problem as a symptom of rushed migration projects from the mid-2010s, when dozens of district-level websites were consolidated under a shared CMS platform without proper deduplication protocols in place.
Where the Problem Shows Up
The districts hit hardest, according to internal workflow discussions circulated among IT administrators, are Mitte and Friedrichshain-Kreuzberg — both of which ran parallel digitisation drives between 2017 and 2020 under different vendors. Bezirksamt Mitte alone manages more than 4,000 active public-facing web pages, and staff responsible for content maintenance say image libraries attached to those pages have never undergone systematic review. The Karl-Marx-Allee cultural corridor documentation project, launched in 2022 to archive the boulevard's postwar architecture for public access, produced what one content team described internally as a cascade of mislabelled and duplicated assets when files were batch-uploaded from three separate photography contractors.
The Technologiestiftung Berlin, the city's primary body for applied digital research, has been vocal within administrative circles about the need for what specialists call "image hygiene" — a structured process of hash-matching, metadata tagging and archive consolidation. The foundation, which is based in Grunewaldstraße in Schöneberg, has published methodology guides for small public bodies, though uptake among Bezirksämter has been uneven.
Zentraler IT-Dienstleister des Landes Berlin, known as ZIT-BB, is the agency responsible for shared infrastructure across Senate departments. Staff there have flagged the deduplication question in at least two quarterly planning cycles since late 2024, framing it as a prerequisite for the city's wider migration to the BundID and EfA service platform by the end of 2027 — a federal mandate that Berlin cannot ignore.
What Specialists Want Done Now
Digital archivists and open-data advocates working with projects like Berlin Open Data, hosted at daten.berlin.de, argue the fix is technically straightforward but organisationally messy. Automated deduplication software — tools that scan file hashes and flag identical or near-identical images across repositories — costs as little as €3,000 for a mid-size municipal deployment, according to procurement documents from comparable German cities. The challenge is not the software; it is the human review layer that follows, requiring trained staff to decide which version of a duplicated image is canonical before anything gets deleted from a public record.
Several Berlin-based startups, particularly those clustered around the Factory Berlin campus in Mitte and the tech incubators along Oberbaum City in Friedrichshain, have pitched AI-assisted image classification tools to city procurement offices. So far, no public contract has been awarded specifically for the deduplication use case, though the Senatsverwaltung für Inneres und Digitales opened a general AI procurement framework in March 2026 that could theoretically accommodate such a project.
The pressure is also political. Members of the SPD-led coalition have pushed for measurable digital efficiency gains as part of the 2026-2027 coalition agreement's modernisation chapter. Cleaning up ghost images may seem unglamorous, but administrators argue it is a necessary precondition before any larger investment in citizen-facing services — whether that is housing application portals in Neukölln or transport disruption alerts for BVG commuters on the U8 — can actually function reliably.
The practical next step, according to discussions at ZIT-BB and among open-data advocates, is a citywide image audit scoped and budgeted before the autumn Senate budget review in September 2026. Without that formal commission, the work will continue to fall to individual Bezirksamt staff members handling it piecemeal — a slow and inconsistent approach that experts say simply cannot keep pace with the volume of new digital content Berlin's public bodies produce every week.