Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Problem: The Numbers Behind a Digital Filing Crisis Costing the City Millions

From Mitte to Marzahn, Berlin's public institutions are sitting on vast troves of redundant digital image files — and the bill for storing them keeps climbing.

By Berlin News Desk · Published 4 July 2026, 8:58 pm

3 min read

Berlin's Duplicate Image Problem: The Numbers Behind a Digital Filing Crisis Costing the City Millions
Photo: Photo by wal_ 172619 on Pexels
Wird übersetzt…

Berlin's public administration is managing an estimated 47 terabytes of duplicate image files spread across municipal servers, according to an internal digital asset audit completed by the Senate Department for Digital Transformation in June 2026. The finding, which covers everything from planning photographs held by Bezirksamt Friedrichshain-Kreuzberg to construction site imagery archived by Berliner Verkehrsbetriebe (BVG), underscores a problem that city IT managers have quietly flagged for years: when nobody is responsible for deleting a file, nobody deletes it.

The timing matters. Berlin's SPD-led coalition has committed €220 million over the current legislative period to modernising the city's digital infrastructure, a program running under the banner of the Berliner Digitalisierungsstrategie 2025–2030. But auditors and independent IT consultants working with the Senate Chancellery have noted that a disproportionate share of that storage budget is being consumed not by new data, but by exact or near-exact copies of images that were uploaded multiple times, converted into different formats, and never reconciled. Duplicate images alone account for roughly 18 percent of total municipal cloud storage costs, according to the June audit summary circulated within the Senate.

The practical consequences are visible at street level. The Stadtentwicklungsgesellschaft Berlin mbH, which manages housing development files for several inner-city districts, reportedly held four distinct copies of the same set of construction progress photographs for a Prenzlauer Berg residential block — each copy saved by a different department as staff turned over during the pandemic years. The BVG, investing heavily in documenting its new U-Bahn extension work along the U5 corridor near Alexanderplatz, flagged a similar issue internally after a 2025 tender for server capacity came in 34 percent above budget projections, partly because existing storage was already bloated with unmanaged image archives.

What the Numbers Actually Show

Storage is not cheap at municipal scale. Berlin's primary public cloud contract, held with a consortium including T-Systems and running through data centres in Frankfurt and Berlin-Marzahn, prices enterprise storage at approximately €0.023 per gigabyte per month. Forty-seven terabytes of duplicate files therefore costs the city roughly €13,000 a month — or around €156,000 annually — for data it does not need. Multiply that across a projected five-year accumulation curve and the figure approaches €800,000 before any indexing or retrieval overhead is counted.

Duplicate image replacement — the systematic process of identifying redundant files through hash-matching algorithms, designating a single canonical version, and scrubbing the rest — is not a new concept. The German federal government's Digitalrat recommended standardised deduplication protocols for all Bundesbehörden back in 2022. Several Länder, including Hamburg and Bavaria, adopted those guidelines within 18 months. Berlin has not yet formally adopted equivalent standards across all Bezirke, leaving individual departments to handle the problem piecemeal. The IT unit at Bezirksamt Mitte, for example, ran its own deduplication sweep in March 2026 and recovered 1.2 terabytes of space — equivalent to wiping clean roughly 400,000 high-resolution image files.

What Comes Next for Berlin's Servers

The Senate Department for Digital Transformation is expected to table a city-wide deduplication framework before the Abgeordnetenhaus by September 2026. Under the draft proposal, all departments would be required to run automated duplicate-detection software quarterly, using tools already licensed under Berlin's existing Microsoft and open-source software agreements. Departments that fail two consecutive compliance checks would face a freeze on new storage allocation requests — a mechanism designed to create financial pressure without imposing fines.

For Berliners, the practical upside is modest but real. Faster document retrieval, lower long-term IT costs, and a cleaner foundation for the AI-assisted planning tools the coalition wants to deploy by 2028 all depend on having archives that aren't padded with redundant data. The Bezirksamt Friedrichshain-Kreuzberg has already volunteered to serve as a pilot district for the new framework, with rollout scheduled to begin in October 2026 at its Yorkstraße administrative offices. Whether the remaining eleven Bezirke move with equal speed is, for now, the open question city IT planners are watching most closely.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.