Berlin's public sector is drowning in copies of itself. An internal review by the Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen — the city's urban development authority — identified more than 4.2 million duplicate image files stored across shared servers as of March 2026, consuming an estimated 38 terabytes of redundant storage. The finding has quietly triggered a broader conversation about data hygiene across Berlin's institutions, from housing agencies in Mitte to the BVG's infrastructure documentation archive in Lichtenberg.
The timing matters. Berlin is mid-way through a major digitisation push under the city's Digitalstrategie Berlin 2030 framework, which commits the SPD-led coalition to migrating nearly all public records into unified cloud infrastructure by December 2028. Redundant image data is not a bureaucratic footnote — it inflates storage licensing costs, slows retrieval systems used by city planners and housing inspectors, and creates legal risk when outdated property photographs are mistakenly attached to current permit applications.
What the Numbers Actually Show
Storage is not cheap at scale. Commercial cloud providers charge Berlin's public agencies roughly €0.02 per gigabyte per month under current framework agreements — meaning 38 terabytes of duplicate images alone generates an unnecessary bill approaching €9,100 every month, or around €109,000 per year, before factoring in the internal IT labour required to maintain and back up those redundant files. The Senatsverwaltung figure covers only one directorate. Analysts working on the Digitalstrategie project estimate the city-wide total across all Bezirksämter — the 12 district offices — could be three to five times higher.
The problem is particularly acute in housing. Wohnungsbaugesellschaft Berlin-Mitte, known as WBM, manages more than 32,000 apartments across central districts. Property inspection teams photograph units at each tenant change, damage report, and renovation stage, producing thousands of images per building cycle. Without automated deduplication, the same facade shot or stairwell image can exist in six or seven folders simultaneously — attached to different case numbers, uploaded by different inspectors working in Prenzlauer Berg, Friedrichshain, or along Karl-Marx-Allee.
Berlin's startup sector has noticed the same structural gap. Metr, a Kreuzberg-based proptech firm that builds digital twin tools for residential portfolios, has been piloting hash-based deduplication software with two Berlin housing associations since January 2026. The approach assigns each image file a unique fingerprint; any exact or near-exact copy is flagged for review before being stored. Early results from the pilot, which covers roughly 80,000 image files, showed a 31 percent reduction in active storage use within the first 90 days, according to figures the company has shared publicly in a press release dated May 2026.
What Comes Next for Institutions and Residents
The Senatsverwaltung has not yet published a public remediation timeline, but the Digitalstrategie Berlin 2030 roadmap sets a hard deadline of June 2027 for all major housing authorities to complete their first full data audit. That gives agencies like WBM and Gewobag — which operates more than 70,000 apartments, predominantly in Spandau, Reinickendorf, and along the Ringbahn corridor — roughly 12 months to implement deduplication protocols or face compliance reviews.
For residents, the practical consequences reach further than they might expect. Duplicate or mismatched property photographs have been linked to delays in Wohngeld applications — the federal housing benefit — when submitted documentation fails automated cross-checks. The Jobcenter Berlin Mitte on Strelitzer Straße has recorded processing delays tied to image file errors in digital submissions, though the agency has not published specific figures on the frequency.
The cleanest path forward, according to the Digitalstrategie documentation, involves both retroactive deduplication of legacy archives and mandatory hash-checking at the point of upload for any new image entering city systems from 2027 onward. Institutions that fail to comply risk losing access to the city's shared cloud infrastructure — a significant operational disruption for any agency that has already migrated its core records off local servers. The audit clock is running.