Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Crisis: The Key Decisions Ahead for the City's Digital Archive

Municipal agencies and cultural institutions face a reckoning over how to handle thousands of redundant digital images clogging public databases — and the clock is ticking on a budget deadline.

By Berlin News Desk · Published 4 July 2026, 8:43 pm

4 min read

Wird übersetzt…

Berlin's public digital infrastructure has a storage problem nobody wanted to talk about publicly, until now. Across the Senatsverwaltung für Kultur und Gesellschaftlichen Zusammenhalt, the city's main cultural administration on Brunnenstraße, and the Landesarchiv Berlin in Reinickendorf, technical audits completed in early June 2026 identified tens of thousands of duplicate image files sitting in shared municipal databases — redundant copies accumulated over more than a decade of uncoordinated digitisation drives. The question of what to do with them is now a live budget and policy issue, with a Senate vote on digital infrastructure spending pencilled in for September.

Why does it matter now? The SPD-led coalition has staked part of its administrative credibility on streamlining Berlin's notoriously fragmented public IT landscape. Duplicate files aren't just a tidiness problem. Each redundant image file consumes server capacity, inflates licensing costs for metadata tagging software, and creates legal exposure around image rights — particularly for photographs sourced from third-party photographers who hold copyright under German Urheberrecht. With the city's digital archive budget running under pressure after broader fiscal consolidation measures introduced in the 2025-2026 Doppelhaushalt, the margin for waste has narrowed sharply.

The problem is especially acute at two institutions. The Stadtmuseum Berlin, which manages collections across multiple sites including the Märkisches Museum near Köllnischer Park in Mitte, has been running a separate digitisation workflow from the Zentral- und Landesbibliothek Berlin on Breite Straße — and the two systems have not historically communicated. Staff at both institutions have independently uploaded versions of the same historical photographs, some dating to the Weimar Republic era, creating overlapping records that now need manual reconciliation or automated deduplication. The city's CityLAB Berlin, the public innovation lab based in the former Deutschlandhaus in Tempelhof, has been brought in to prototype an algorithmic matching tool, but that work is still at the testing stage.

What the Deduplication Decision Actually Involves

Choosing how to handle duplicate images sounds technical but carries real consequences. The three options on the table, according to documents circulating within the Senatsverwaltung für Digitalisierung und Verwaltungsmodernisierung, are: automated deletion of files flagged as duplicates, manual review by archivists before any deletion, or a hybrid approach that automates flagging but requires human sign-off. Each carries a different cost. Automated deletion is fastest but risks removing files that differ only slightly — a critical concern for archivists because two scans of the same historical photograph can have different colour profiles, resolutions, or metadata annotations that make each independently valuable. Manual review is safest but would require additional archivist hours at institutions already running lean. The hybrid model is the current frontrunner but requires procurement of new software, estimated by comparable rollouts in Hamburg's Kulturbehörde at somewhere between €80,000 and €150,000 depending on licensing terms.

There is a broader rights dimension too. Berlin's cultural institutions hold image collections under a patchwork of licensing arrangements. Some files were digitised under Creative Commons terms; others carry restrictions. When duplicates exist, it is not always clear which version is the legally authorised one for public use. The Gesellschaft für Datenschutz und Datensicherheit has flagged this as a compliance risk under current EU digital market rules, though no formal enforcement action has been taken against any Berlin institution.

The Decisions Ahead — and Who Makes Them

The September Senate vote is the hard deadline. Before that, the Senatsverwaltung für Digitalisierung must deliver a formal recommendation by August 15. CityLAB Berlin's prototype deduplication tool is scheduled for an internal demonstration in mid-July, giving decision-makers roughly four weeks to assess whether automation is ready to deploy or whether the city needs to go to external tender. If it goes to tender, a contract award before January 2027 would be optimistic given public procurement timelines under Vergaberecht.

For the Landesarchiv, the Stadtmuseum, and the Zentralbibliothek, the practical near-term step is agreeing on a shared metadata standard — something archivists have been pushing for since at least 2022 without resolution. That agreement needs to happen regardless of which deduplication method the Senate ultimately backs. Without it, the same duplication problem will regenerate itself within years. The August deadline, in that sense, is less about cleaning up the past and more about whether Berlin's digital institutions can finally agree on a common future.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.