Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Crisis: The Key Decisions Ahead for the City's Digital Archive

A tangled backlog of duplicate photographs in Berlin's public records systems is forcing administrators, archivists, and tech contractors to make choices that will shape how the city manages its visual history for decades.

By Berlin News Desk · Published 4 July 2026, 9:00 pm

3 min read

Berlin's Duplicate Image Crisis: The Key Decisions Ahead for the City's Digital Archive
Photo: Photo by Pawel aparatem_go on Pexels
Wird übersetzt…

Berlin's city administration faces a concrete deadline. By the end of the third quarter of 2026, the Senatsverwaltung für Stadtentwicklung must decide which software framework will govern the deduplication and replacement of tens of thousands of redundant image files spread across at least four separate municipal databases — a problem that has compounded quietly since the city's partial digitisation drive began in 2019.

The issue matters now because Berlin is mid-way through a broader digital infrastructure overhaul linked to the Smart City Berlin strategy, a programme that channels funding through the Senate Chancellery and is intended to consolidate public-facing data portals before the end of 2027. Duplicate images are not a cosmetic nuisance. They inflate storage costs, return conflicting metadata in public planning searches, and have already caused at least two documented errors in the Bebauungsplan — the binding land-use map — when outdated aerial photographs of Tempelhof Feld and the Spandau waterfront were served alongside current records without clear versioning labels.

Who Decides and What the Options Are

Three institutional actors are now circling the decision. The Berliner Senatskanzlei — Digitales Berlin holds overall programme authority. The Zentralarchiv des Landes Berlin on Eichborndamm in Reinickendorf is pushing for a hash-based deduplication protocol that would flag and quarantine suspect files before any replacement occurs. Meanwhile, Citylab Berlin, the public innovation lab based in Tempelhof, has been piloting a machine-learning assisted review tool since January 2026 that cross-references visual content against geotag data and upload timestamps.

The two approaches are not mutually exclusive, but they require different budget allocations and staff training timelines. The hash-based method is faster to deploy — the Zentralarchiv estimates a six-to-eight week rollout window — but it can only catch exact or near-exact pixel duplicates. The Citylab tool catches more nuanced cases, such as photographs of the same Prenzlauer Berg building taken in different seasons that carry conflicting planning designations, but it demands a longer validation period and a higher licensing cost.

A third path — outsourcing the deduplication review to a specialist contractor — has been floated internally but has encountered resistance from data protection advocates who cite the Berlin State Data Protection Act, which places restrictions on transferring image metadata linked to identifiable addresses to third-party processors without explicit authorisation from the Berliner Beauftragter für Datenschutz und Informationsfreiheit.

What Happens on the Ground

For residents and businesses interacting with the city's planning portal — the FIS-Broker, which logged roughly 1.4 million queries in 2024 according to Senate budget documentation — the practical consequences of a bad decision are real. An architect submitting a Baugenehmigung application in Neukölln or Lichtenberg today may pull reference images that are two or three versions out of date, with no system flag indicating that a replacement file exists.

The Senate's own internal audit, circulated among departmental heads in April 2026, identified at least 47,000 image records across the FIS-Broker and the separate Umweltatlas portal that carry duplicate identifiers. Of those, around 12,000 are flagged as high-priority because they overlap with active development zones — including the Tegel urban development area and the Südkreuz corridor.

The next formal decision point is a working-group session scheduled for September 2026, at which the Senatsverwaltung für Stadtentwicklung und Wohnen will present a recommended framework to the Digital Senate. Whichever option advances, administrators say the replacement pipeline itself — swapping outdated images with verified current versions — cannot begin before early 2027 at the earliest, given procurement rules under the Vergabeverordnung that govern public IT contracts above €50,000.

Until then, archivists at Eichborndamm are working manually, tagging suspect files one dataset at a time. For a city that bills itself as a European tech hub, it is a slow, painstaking fix — but officials argue that getting the governance framework right before touching the files is more important than moving fast and compounding the errors already buried in the system.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.