Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Problem: The Numbers Driving a City-Wide Digital Clean-Up

Across Berlin's public sector databases, tens of thousands of redundant image files are clogging servers, inflating IT budgets, and slowing the digital services residents rely on every day.

By Berlin News Desk · Published 4 July 2026, 8:44 pm

3 min read

Berlin's Duplicate Image Problem: The Numbers Driving a City-Wide Digital Clean-Up
Photo: Photo by Max Kladitin on Pexels
Wird übersetzt…

At least 34,000 duplicate image files have been identified across the Berlin Senate's central digital asset repositories, according to an internal audit completed in June 2026 by the Senatsverwaltung für Inneres und Digitales. The redundant files — ranging from duplicated housing permit photographs stored on servers in Mitte to replicated infrastructure images within the BVG transit authority's maintenance database — are consuming an estimated 2.8 terabytes of storage that administrators say could be freed within weeks using existing software tools.

The finding lands at an uncomfortable moment. Berlin's IT modernisation budget for 2026 was set at €47 million under the SPD-led coalition's Haushalt, with a significant share earmarked for the consolidation of legacy systems across twelve Bezirke. Redundant data, particularly duplicate images, is a recognised drag on that consolidation. Every hour spent processing, backing up, and securing files that already exist elsewhere is an hour — and a euro — diverted from the infrastructure improvements the city is trying to make.

Where the Duplicates Are Piling Up

The problem is not evenly distributed. Two institutions account for a disproportionate share of the identified redundancy. The Berliner Immobilienmanagement GmbH, which manages the city's public housing stock from its offices near Alexanderplatz, reported that more than 9,000 property photographs had been uploaded in duplicate or triplicate between 2021 and 2025, largely because different departments were uploading from disconnected local drives without a unified content management protocol. Meanwhile, Geoportal Berlin — the city's publicly accessible mapping and urban data platform — was found to have replicated aerial survey image sets across at least three internal mirror archives, producing unnecessary redundancy in a database that already holds over 600,000 georeferenced files.

The BVG, Berlin's public transport operator, faces its own version of the issue. Engineering teams maintaining U-Bahn infrastructure along the U5 corridor between Hönow and Hauptbahnhof rely on image documentation of tunnel inspections, cable runs, and platform conditions. A BVG technology review in March 2026 found that standard operating procedure had led to the same inspection photographs being saved to both a central SharePoint environment and individual project folders, effectively doubling storage demand on roughly 18 months of inspection records. Storage costs for the BVG's IT division rose by 11 percent between 2024 and 2025, and file duplication was cited as a contributing factor in the review document.

What the Clean-Up Actually Involves

Removing duplicate images sounds simple. In practice, across a city bureaucracy of Berlin's scale, it is not. Automated deduplication software can identify pixel-identical files quickly, but many duplicates in public sector databases are near-identical — the same photograph saved at two different resolutions, or with slightly different metadata timestamps — and those require human review before deletion to ensure legal and archival obligations are met. Under Germany's Landesarchivgesetz, certain categories of official records, including images tied to building permits or planning decisions, carry retention requirements of up to 30 years. A mistaken deletion is not simply an IT problem; it can become a legal one.

The Senate's digitisation unit has piloted a phased approach starting with the Stadtentwicklung und Wohnen file systems, targeting the least legally sensitive image categories first. A contractor selection process for a city-wide deduplication tool was opened via the Vergabeplattform Berlin procurement portal on 17 June 2026, with tenders due by 25 July. The contract is valued at up to €380,000 for an initial 18-month engagement.

For residents, the practical payoff is faster load times on services like the Bürgeramt appointment portal and the Geoportal mapping interface, both of which have drawn complaints about sluggish performance during peak hours. For the Senate, the arithmetic is straightforward: clearing those 2.8 terabytes and preventing future duplication through a unified upload policy could reduce annual storage licensing costs by an estimated €60,000 per year across the affected departments. That is not a budget transformation, but in a city still arguing about rent caps and Energiewende infrastructure bills, a six-figure annual saving from a digital house-keeping exercise is the kind of unglamorous efficiency gain that IT administrators have been pushing for since before the pandemic.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.