Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

How Berlin's Public Archives Ended Up Flooded With Duplicate Images — and What It Will Take to Fix It

Years of uncoordinated digitisation across dozens of city departments left Berlin's visual records in a tangle of redundant files, and now a reckoning is underway.

By Berlin News Desk · Published 4 July 2026, 8:40 pm

3 min read

Wird übersetzt…

Berlin's Landesarchiv has spent much of 2026 quietly confronting a problem that accumulated over nearly two decades: somewhere between 30 and 40 percent of the digitised image holdings stored across city departments are estimated to be duplicates, according to internal working documents circulated among Senate administration staff earlier this year. The figure is embarrassing, and the cost of correcting it runs into seven figures.

The roots of the problem stretch back to the mid-2000s, when individual Bezirke — Berlin's twelve districts — began scanning historical photographs, planning maps and building permits on their own timelines, with their own software, and without any shared metadata standard. Mitte digitised independently of Friedrichshain-Kreuzberg. The Stadtentwicklungsamt in Tempelhof-Schöneberg built its own image database. Charlottenburg-Wilmersdorf contracted a separate vendor. Nobody talked to anybody else in any systematic way.

A Decade of Parallel Pipelines

The Senate's 2013 eGovernment Act was supposed to harmonise digital infrastructure across the administration, but enforcement was patchy and image archiving was treated as a low priority. When the Rote Rathaus pushed forward its Smart City Strategy Berlin in 2019, data interoperability for text records and financial systems took precedence. Visual archives were left largely untouched.

The Zentral- und Landesbibliothek Berlin on Breite Straße in Mitte, which holds substantial photographic collections of its own, ran a separate digitisation drive between 2017 and 2021 that processed approximately 180,000 images. A portion of those overlapped with material already scanned by the Landesarchiv on Eichborndamm in Reinickendorf, but no automated deduplication layer was ever built between the two institutions' systems. Staff at both organisations have acknowledged the overlap in public budget hearings before the Abgeordnetenhaus, though the precise volume of redundancy between just those two collections has not been publicly quantified.

The BVG, Berlin's public transport operator, added another layer to the problem. Its historical image library — containing construction photographs of U-Bahn lines, tram depot records and station renovation files dating to the 1950s — was digitised in partnership with a private contractor around 2015 and uploaded to a proprietary platform that uses different file-naming conventions than any Senate-managed system. When planners at the Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen began pulling archival images for the Tegel urban development project documentation in 2023, they discovered dozens of BVG images already existed in the Landesarchiv under different identifiers.

The Push Toward a Unified Standard

The current SPD-led coalition, which took shape after the February 2023 election, included a digital infrastructure consolidation pledge in its coalition agreement. The Senate's digital state secretary office has since commissioned a technical working group — reporting quarterly to the Hauptausschuss — to design a unified image metadata schema based on the IPTC standard widely used in European public broadcasters and national archives.

The project has a proposed budget line of roughly €4.2 million over three years, though the Abgeordnetenhaus has not yet approved the full allocation as of this writing. Pilot deduplication work is scheduled to begin in the fourth quarter of 2026, starting with the Mitte and Pankow district collections, which together represent the largest single block of potentially redundant urban-planning imagery.

For Berliners who actually use these archives — researchers at the Technische Universität Berlin's urban history institute on Straße des 17. Juni, journalists pulling historical photographs, architects working on listed buildings in Prenzlauer Berg — the practical consequence has been wasted time and inconsistent results. A search for pre-war images of Potsdamer Platz, for example, can return the same photograph under three different accession numbers depending on which portal a researcher uses.

The deduplication work is genuinely unglamorous, but the stakes are real. Berlin has one of the richest urban photographic records of any European city. Getting it into a coherent, searchable shape will determine whether that record remains accessible or slowly becomes an unusable redundancy pile. The pilot results, expected in early 2027, will tell the Senate whether the €4.2 million estimate is even close to adequate.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.