Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Problem: The Numbers Driving a City-Wide Data Cleanup

Municipal archives, housing agencies, and startups across the capital are sitting on millions of redundant digital files — and the cost of ignoring them is finally becoming clear.

By Berlin News Desk · Published 4 July 2026, 9:06 pm

3 min read

Berlin's Duplicate Image Problem: The Numbers Driving a City-Wide Data Cleanup
Photo: Photo by Arlind D on Pexels
Wird übersetzt…

Berlin's public sector is drowning in copies of itself. An internal review by the Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen — the city's urban development authority — identified more than 4.2 million duplicate image files stored across shared servers as of March 2026, consuming an estimated 38 terabytes of redundant storage. The finding has quietly triggered a broader conversation about data hygiene across Berlin's institutions, from housing agencies in Mitte to the BVG's infrastructure documentation archive in Lichtenberg.

The timing matters. Berlin is mid-way through a major digitisation push under the city's Digitalstrategie Berlin 2030 framework, which commits the SPD-led coalition to migrating nearly all public records into unified cloud infrastructure by December 2028. Redundant image data is not a bureaucratic footnote — it inflates storage licensing costs, slows retrieval systems used by city planners and housing inspectors, and creates legal risk when outdated property photographs are mistakenly attached to current permit applications.

What the Numbers Actually Show

Storage is not cheap at scale. Commercial cloud providers charge Berlin's public agencies roughly €0.02 per gigabyte per month under current framework agreements — meaning 38 terabytes of duplicate images alone generates an unnecessary bill approaching €9,100 every month, or around €109,000 per year, before factoring in the internal IT labour required to maintain and back up those redundant files. The Senatsverwaltung figure covers only one directorate. Analysts working on the Digitalstrategie project estimate the city-wide total across all Bezirksämter — the 12 district offices — could be three to five times higher.

The problem is particularly acute in housing. Wohnungsbaugesellschaft Berlin-Mitte, known as WBM, manages more than 32,000 apartments across central districts. Property inspection teams photograph units at each tenant change, damage report, and renovation stage, producing thousands of images per building cycle. Without automated deduplication, the same facade shot or stairwell image can exist in six or seven folders simultaneously — attached to different case numbers, uploaded by different inspectors working in Prenzlauer Berg, Friedrichshain, or along Karl-Marx-Allee.

Berlin's startup sector has noticed the same structural gap. Metr, a Kreuzberg-based proptech firm that builds digital twin tools for residential portfolios, has been piloting hash-based deduplication software with two Berlin housing associations since January 2026. The approach assigns each image file a unique fingerprint; any exact or near-exact copy is flagged for review before being stored. Early results from the pilot, which covers roughly 80,000 image files, showed a 31 percent reduction in active storage use within the first 90 days, according to figures the company has shared publicly in a press release dated May 2026.

What Comes Next for Institutions and Residents

The Senatsverwaltung has not yet published a public remediation timeline, but the Digitalstrategie Berlin 2030 roadmap sets a hard deadline of June 2027 for all major housing authorities to complete their first full data audit. That gives agencies like WBM and Gewobag — which operates more than 70,000 apartments, predominantly in Spandau, Reinickendorf, and along the Ringbahn corridor — roughly 12 months to implement deduplication protocols or face compliance reviews.

For residents, the practical consequences reach further than they might expect. Duplicate or mismatched property photographs have been linked to delays in Wohngeld applications — the federal housing benefit — when submitted documentation fails automated cross-checks. The Jobcenter Berlin Mitte on Strelitzer Straße has recorded processing delays tied to image file errors in digital submissions, though the agency has not published specific figures on the frequency.

The cleanest path forward, according to the Digitalstrategie documentation, involves both retroactive deduplication of legacy archives and mandatory hash-checking at the point of upload for any new image entering city systems from 2027 onward. Institutions that fail to comply risk losing access to the city's shared cloud infrastructure — a significant operational disruption for any agency that has already migrated its core records off local servers. The audit clock is running.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.