Berlin's sprawling network of public institutions is sitting on a problem that has been quietly compounding for years: thousands of duplicate images clogging municipal databases, slowing down public-facing platforms, and inflating storage costs at a moment when the city can least afford the waste. The question now is not whether to act, but how fast, and who pays.
The issue surfaced publicly this spring when the Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen began auditing its digital asset management system ahead of planned renovations to its Württembergische Straße offices in Wilmersdorf. Internal reviews found that redundant image files — the same housing project photographs uploaded multiple times under different file names across different departments — had accumulated to a degree that was measurably slowing database queries for planning applications. That kind of administrative drag has real consequences in a city where the housing shortage means planning decisions are politically radioactive.
Berlin is not alone in facing this. Hamburg's Behörde für Stadtentwicklung and Wohnen restructured its digital asset workflow in 2024 after a similar audit. But Berlin's situation is more complex because the problem extends well beyond one senate department.
Where the Bottlenecks Are
The Stadtbibliothek Berlin and the Stadtmuseum Berlin, both of which manage large digitised collections accessible through the Deutsche Digitale Bibliothek, have each flagged duplicate-image issues to the Kulturtechnik working group that advises the Senatskanzlei. The Stadtmuseum alone holds digitised records spanning more than 900,000 objects, and librarians have described the deduplication backlog as one of the more stubborn technical challenges in preparing for a planned platform migration scheduled for late 2026.
BVG, the public transport operator, faces a parallel version of the problem in its infrastructure documentation systems. The agency photographs stations, tunnels, and rolling stock continuously for maintenance records. According to BVG's 2025 annual report, the operator processed more than 1.4 million infrastructure images last year. Without automated deduplication, engineers searching for the most recent inspection photograph of, say, a specific section of the U8 line between Hermannstraße and Leinestraße risk pulling an outdated file. That is not a trivial risk in a system carrying roughly 1.1 million passengers daily.
The federal dimension tightens the timeline. Under the Onlinezugangsgesetz reform package moving through the Bundestag, public digital services must meet updated interoperability standards by January 1, 2027. Institutions that cannot demonstrate clean, deduplicated asset libraries may find themselves ineligible for certain federal co-financing streams, including funds earmarked under the Digitalstrategie Deutschland for municipal infrastructure.
The Decisions That Cannot Wait
Three choices are converging on Berlin's institutions right now. First, whether to run deduplication processes in-house using existing IT staff — an approach that saves procurement costs but stretches teams already managing BVG's ongoing digital ticket infrastructure — or to contract specialist vendors, several of whom have already pitched the Senatsverwaltung für Inneres und Sport. Second, which metadata standard to adopt. The European standard IPTC and the Library of Congress's PREMIS framework are both in use across Berlin institutions, and they do not naturally speak to each other, which is part of why duplicates proliferate in the first place. A unified standard across Bezirk-level databases would require a political decision at senate level, not just a technical one. Third, who funds the transition. The SPD-led coalition is currently navigating a budget round under significant pressure, with the 2026 Haushalt already committing significant resources to BVG infrastructure and the Wohnungsbaugesellschaften.
Institutions that move early have some advantage. The Zentral- und Landesbibliothek Berlin on Blücherplatz in Kreuzberg has already piloted an AI-assisted deduplication tool on a subset of its 3.5 million digitised items, according to the ZLB's 2025 digital strategy document. Results from that pilot are expected to inform the broader senate working group recommendation due in September. That September deadline is effectively the last practical decision point before the federal 2027 clock becomes unforgiving. Whatever approach Berlin chooses, the window to choose it is closing.