Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Problem: The Numbers Driving a Digital Reckoning in City Administration

Thousands of redundant files are clogging the databases of Berlin's public sector, and the cost — in storage, staff time and taxpayer money — is finally being counted.

By Berlin News Desk · Published 4 July 2026, 8:44 pm

3 min read

Wird übersetzt…

Berlin's Senate Department for Urban Development holds more than 340,000 digital image files across its property and planning databases, according to internal IT assessments reviewed this year — and a significant share of them are exact or near-exact duplicates. The problem is not unique to one agency. Across the city's administration, from the Senatsverwaltung für Inneres to the district offices in Mitte and Friedrichshain-Kreuzberg, duplicate image data has quietly ballooned into a measurable fiscal burden.

The timing matters. Berlin is in the middle of a €1.2 billion digitisation push under its Berliner E-Government-Gesetz, the city's electronic government framework updated in 2023. As agencies migrate legacy paper records to digital systems, they are also, often inadvertently, importing old duplication habits into new infrastructure. A scan gets uploaded twice. A press photograph lands in three separate department folders. A planning document's attached site image exists in six versions across four servers. Multiply that by decades of accumulated files and you arrive at a storage and management problem that IT administrators across Alexanderplatz-adjacent government buildings have been flagging for at least three years.

What the Data Actually Shows

Industry benchmarks from enterprise data management firms operating in the German market suggest that between 20 and 30 percent of files in large public-sector digital archives are duplicates or near-duplicates. Applied to Berlin's context, that range implies tens of thousands of redundant image files sitting in active storage — not archived, not flagged, just consuming server capacity and confusing search results for civil servants trying to do their jobs.

Storage is not cheap. Commercial cloud storage contracts of the type used by several Berlin Bezirksämter run between €0.02 and €0.05 per gigabyte per month depending on tier and contract terms. High-resolution planning images and geospatial photographs — the kind produced in volume by the Berliner Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen — can run to several megabytes each. At scale, redundant image storage across twelve district administrations plus central agencies adds up to tens of thousands of euros annually in costs that deliver zero operational value.

The Zentraler IT-Dienstleister Berlin, known as ITDZ Berlin, which manages core digital infrastructure for the city government from its base in Mitte, has been piloting automated deduplication tools since late 2024 as part of its broader data hygiene programme. The organisation has not yet published outcome figures publicly, but IT procurement documents from early 2026 show continued contract extensions for deduplication software licences — a reliable signal that the tooling is being used at scale.

Practical Stakes for Housing and Planning

The duplication issue carries particular weight in Berlin's housing sector, where pressure is intense. The city's Wohnungsmarktbericht has consistently shown a shortfall of tens of thousands of units, and planning approvals — already slowed by bureaucratic process — can be further delayed when caseworkers in offices like those on Württembergische Straße in Wilmersdorf cannot quickly retrieve the correct, current version of a site photograph or building survey image. Duplicate files create version-control chaos. A planner pulling the wrong image from a cluttered database is not a hypothetical; it is the kind of error that adds days to approval timelines.

The Technologiestiftung Berlin, which tracks the city's digital transformation from its offices near Gendarmenmarkt, has argued in its annual digital index reports that data quality — not just data quantity — is the real measure of administrative modernisation. Deduplication is foundational to that quality.

For Berliners watching housing approvals crawl and city services strain, the mundane business of deleting redundant image files is, in fact, directly connected to how fast the city can function. Agencies that have completed deduplication audits this year are advised to establish automated hash-checking on upload — a technical standard that prevents duplicates from entering the system in the first place, rather than requiring expensive cleanup after the fact. The ITDZ Berlin's procurement cycle for updated data governance tools closes in September 2026, making the next ten weeks the practical window for departments to submit deduplication requirements and get them funded.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.