Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Duplicate Image Problem: The Key Decisions That Will Shape the City's Digital Archives

Thousands of redundant photographs are clogging Berlin's public databases — and how officials handle the cleanup will determine what gets preserved and what disappears.

By Berlin News Desk · Published 4 July 2026, 9:06 pm

3 min read

Berlin's Duplicate Image Problem: The Key Decisions That Will Shape the City's Digital Archives
Photo: Photo by Viviana Ceballos on Pexels
Wird übersetzt…

Berlin's public digital infrastructure is sitting on a growing problem. Across municipal archives, housing authority portals, and cultural documentation systems, duplicate images — redundant digital files stored in parallel across multiple servers — are consuming terabytes of storage, inflating licensing costs, and making it harder for researchers and city planners to access reliable visual records. The question now is not whether to act, but who decides which version of an image survives.

The issue has gained urgency this summer as the SPD-led Senate prepares a broader review of Berlin's digital asset management, a process expected to accelerate following the city's commitment to consolidate IT infrastructure under the 2025 Digitalstrategie Berlin framework. Housing authorities, cultural institutions along Unter den Linden, and transit planners at BVG are all affected — each holding overlapping image libraries with no unified deduplication standard in place.

Why the Stakes Are Higher Than They Look

Duplicate image replacement sounds like a technical housekeeping task. It is not. Every decision to delete or overwrite a digital file carries archival consequences. At the Landesarchiv Berlin on Eichborndamm in Reinickendorf, archivists have flagged that automated deduplication tools — increasingly offered by commercial vendors — risk flagging near-identical but historically distinct images as redundant. A photograph of Alexanderplatz taken in 1989 and another taken in 1990 may share enough pixel-level similarity to fool an algorithm. The human cost of that error is permanent.

The city's housing authorities face a parallel version of the same dilemma. The Berliner Stadtwerke and Gewobag, the municipal housing company managing roughly 70,000 apartments across the city, both maintain property image databases used for tenant communications, maintenance records, and planning submissions. Internal reviews earlier this year identified significant image duplication across those systems — the same facade photographs appearing under different file names in different departments, creating version-control gaps that slow down renovation approvals in already overloaded districts like Marzahn-Hellersdorf.

BVG's infrastructure documentation library presents a third pressure point. As the transit authority pushes forward with new U-Bahn station refits — including ongoing works at Hermannplatz in Neukölln — engineering teams rely on up-to-date photographic records of tunnel segments, signage configurations, and platform layouts. Outdated or duplicated reference images create liability risk. BVG's internal IT procurement cycle runs on a 24-month review schedule, with the next major audit window opening in early 2027.

The Decision Framework Taking Shape

Three options are now on the table for Berlin's digital asset managers, according to the draft parameters circulating within the Senate Department for Urban Development and Housing. The first is full automated deduplication, using hash-matching software to identify and delete identical files. The second is a semi-automated approach, where algorithms flag duplicates but human archivists make final deletion calls. The third — and most expensive — is a managed migration to a unified content management platform, estimated in comparable European municipal projects to run between €2 million and €5 million for a city of Berlin's administrative scale.

The Zentralbibliothek der Humboldt-Universität has already piloted a semi-automated deduplication workflow across its digitised photographic collections, completing a first pass over approximately 400,000 images between January and May 2026. That experience is being closely watched by Senate officials as a proof-of-concept before any city-wide rollout is authorised.

The political dimension matters too. Under the current coalition agreement, digital infrastructure decisions above a €1 million threshold require Senate committee sign-off — a process that typically adds three to six months to any implementation timeline. That means a formal decision before the end of 2026 is possible but would require both a vendor selection and a committee vote before October.

For Berliners, the practical stakes are clearest in housing and transit services. If Gewobag's image libraries remain fragmented, apartment handover documentation stays slower and more error-prone. If BVG's photographic records stay unsynchronised, station refits run longer. The city has the tools and the precedent. What it needs now are the political commitments to match.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.