Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Digital Archives Are Riddled With Duplicate Images — and Officials Are Finally Talking About It

From the Landesarchiv to municipal housing portals, administrators and technology specialists are raising alarms about the scale of redundant visual data clogging Berlin's public systems.

By Berlin News Desk · Published 4 July 2026, 9:06 pm

3 min read

Berlin's Digital Archives Are Riddled With Duplicate Images — and Officials Are Finally Talking About It
Photo: Photo by Zois Fotis on Pexels
Wird übersetzt…

Berlin's public digital infrastructure has a clutter problem. Across city-run platforms — from the Senatsverwaltung für Stadtentwicklung's housing database to the Landesarchiv Berlin on Eichborndamm in Reinickendorf — duplicate images have accumulated over years of poorly coordinated digitisation drives, consuming server capacity and complicating search functions that civil servants and ordinary residents depend on daily. Now, administrators and technology specialists are pushing the issue into the open.

The timing matters. Berlin's SPD-led coalition committed in early 2025 to accelerating the digitisation of public records under the city's Digitalstrategie 2030 framework, a program that channelled roughly €47 million toward modernising back-end government systems. That investment brought auditors and IT contractors face to face with data quality failures that had previously gone unexamined. Duplicate image files — the same photograph or scanned document stored multiple times under different filenames — turned out to be among the most common and most costly.

Officials at the Senatsverwaltung für Inneres und Digitales have acknowledged the scope of the problem in internal working group sessions, though no public statement has been issued quantifying the volume of affected records. A review of procurement documents published on the Berlin Senate's transparency portal shows that a framework contract for database deduplication services was tendered in March 2026, with bids accepted from three software vendors by mid-April. The contract, worth up to €2.1 million over two years, covers metadata normalisation and image-hash comparison — the standard technical method for identifying exact or near-identical visual duplicates without human review of each file.

What the Experts Are Saying

Specialists in digital records management point to a structural cause: Berlin's district administrations long operated their own separate IT systems before partial consolidation began under the ITDZ Berlin, the state-owned IT service provider based in Müllerstraße in Wedding. Migration from twelve distinct district platforms into shared infrastructure inevitably dragged along redundant files. Technology consultants working on comparable projects in Hamburg and Frankfurt have noted that cross-platform migrations typically generate duplicate rates of between 15 and 30 percent in image-heavy archives.

Researchers at the Zuse Institute Berlin on Takustraße in Dahlem, which advises public bodies on data infrastructure, have published guidance arguing that automated hash-based deduplication should precede any migration rather than follow it — a sequence Berlin's earlier digitisation rounds did not consistently follow. The institute's documentation, updated in February 2026, warns that retroactive deduplication on live systems carries risks of accidental deletion if reference checks are not built into the process first.

Housing campaigners add a practical dimension. Tenant advocacy groups in Neukölln and Mitte have complained that the Wohnlagenkarte — the city's official neighbourhood rent map — has on occasion displayed outdated or repeated property photographs, undermining confidence in a tool that directly influences rent cap calculations under Berlin's Mietspiegel system. The Mietspiegel 2025, published last June, already drew criticism for methodological inconsistencies; image data integrity is now part of a broader conversation about the reliability of the underlying database.

What Comes Next

The deduplication contract is expected to move into active implementation by September 2026, starting with the Landesarchiv's photographic holdings, which run to more than 1.2 million digitised images. ITDZ Berlin has indicated that a phased rollout will follow, covering the Stadtentwicklung portal and BVG's internal asset management system — the public transport operator holds tens of thousands of infrastructure photographs used for maintenance planning across its U-Bahn and S-Bahn networks.

For residents and civil servants who use these platforms, the practical advice from IT administrators is straightforward: flag duplicate or outdated images using the feedback forms now embedded in the main Senate portals, as manual reports are feeding a verification queue that will inform the automated review. The Senate's digital office says it aims to publish a progress report on data quality improvements before the end of the third quarter. That deadline, modest as it sounds, will be the first public benchmark against which Berlin's long-promised digital overhaul can actually be measured.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.