Berlin's Digital Archives Are Full of Duplicate Images — and Officials Are Finally Talking About the Cost
From Mitte to Marzahn, city agencies and tech experts are debating how to fix a sprawling problem hidden inside Berlin's public databases.
From Mitte to Marzahn, city agencies and tech experts are debating how to fix a sprawling problem hidden inside Berlin's public databases.
Berlin's municipal digital infrastructure is carrying a weight it was never designed to hold. Across city departments — from the Senatsverwaltung für Stadtentwicklung to the Berliner Feuerwehr's internal document systems — redundant and duplicate image files have accumulated for years inside public databases, bloating storage costs and slowing down retrieval systems that civil servants rely on daily. Now a combination of budget pressure and a push toward AI-assisted document management is forcing the issue into the open.
The timing matters. The SPD-led Berlin Senate has been tightening discretionary IT spending in 2026 as it simultaneously tries to fund housing programmes and BVG infrastructure upgrades. Wasteful data storage — the kind generated when the same scanned permit photograph gets uploaded six times across three departments — has become harder to defend. Technology policy advisers within the Senate chancellery have been circulating internal assessments of the problem since at least early this year, according to documents reviewed as background for this report.
Specialists in public-sector data governance point to a structural flaw rather than individual error. Berlin built its digital filing systems piecemeal across the 2000s and 2010s, often department by department, without a centralised deduplication layer. The result is that a single image of, say, a Neukölln building facade submitted as part of a planning application might exist in the Bezirksamt Neukölln's local server, in the Senatsverwaltung's regional planning archive, and in a shared citywide GIS database — three times, or more.
Experts in records management and digital infrastructure who work with German public bodies describe this as a known and widespread problem among large municipal governments. Storage costs for unmanaged image libraries in city-scale organisations can run to tens of thousands of euros per year in unnecessary cloud or on-premises capacity. While The Daily Berlin is not attributing specific budget figures to Berlin's departments without verified public accounts, the pattern is well-documented in comparable German Länder contexts.
The Berlin-based civic technology organisation CityLAB Berlin, which operates out of Platz der Luftbrücke in Tempelhof and works directly with Senate departments on digital modernisation, has been involved in broader conversations about AI tooling for public-sector data. Deduplication — automatically identifying and flagging redundant files — is among the functions that machine learning tools can now handle at scale. Whether Berlin's agencies have the budget and political will to implement such systems is a separate question.
Inside the Rotes Rathaus, the pressure is coming from two directions simultaneously. On one side, the Senate's digital transformation agenda — formalised under the Berliner E-Government-Gesetz and updated priorities set out in 2024 — calls for leaner, more interoperable data systems by 2027. On the other, the sheer administrative complexity of Berlin's 12 Bezirke, each with its own IT procurement history, makes any centralised fix a political as well as a technical challenge.
Discussions at the Tagesspiegel Digital Forum held in Berlin in May 2026 touched on exactly this tension, with public-sector IT managers describing the difficulty of reconciling legacy systems inherited from the pre-2001 merger of East and West Berlin's bureaucracies with modern cloud-first approaches. The duplicate image problem is a symptom of that deeper incompatibility.
For residents and businesses who interact with Berlin's planning portals — particularly in fast-developing corridors like Lichtenberg and along the Tempelhofer Feld perimeter, where construction documentation is voluminous — the practical consequence is slower processing times and occasional errors when retrieval systems surface the wrong version of a file.
The most immediate step available to city agencies is a structured audit of image assets within the Berliner Landesdatennetz, the city's official data network, using existing deduplication software before any AI investment is made. Technology advisers recommend that Bezirke coordinate through the Berliner Beauftragter für Informationstechnik — the city's IT commissioner — rather than procuring separate solutions independently, which would only compound the fragmentation. A coordinated review, they argue, could identify quick wins within six months and lay the groundwork for a unified digital asset management standard before the 2027 legislative deadline arrives.
How does this story make you feel?
Spread the word
About this article
Published by The Daily Berlin
Daily brief
Free, in your inbox before 7am. Weekdays.
More in News