Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Digital Archives Are Riddled With Duplicate Images — and Officials Are Finally Talking About It

From the Stadtbibliothek to the Senate's urban planning database, experts and administrators are debating who bears responsibility for cleaning up years of redundant visual data.

By Berlin News Desk · Published 4 July 2026, 9:00 pm

3 min read

Berlin's Digital Archives Are Riddled With Duplicate Images — and Officials Are Finally Talking About It
Photo: Photo by cami on Pexels
Wird übersetzt…

Berlin's public digital infrastructure has a clutter problem. Across municipal databases, cultural archives, and city-administered housing registers, tens of thousands of duplicate images have accumulated over the past decade — redundant files that consume server space, slow down public-facing portals, and increasingly complicate the city's push toward a coherent digital administration. Now, officials, archivists, and tech specialists are weighing in on who should fix it and how.

The issue has gained traction inside the Senate Chancellery after a working group within the Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen flagged the problem in an internal review earlier this year. The review — details of which were described at a public digital governance forum held at the Urania Berlin conference centre on Kleiststraße in late June — found that duplicate image files in the housing register portal alone were creating measurable delays in processing applications. The forum brought together representatives from the Berlin Senate, the Technologiestiftung Berlin, and several Bezirksämter.

The timing matters. Berlin is mid-way through an ambitious digital-services rollout under the Onlinezugangsgesetz — the federal law requiring all government services to be available online by the end of 2026. Cluttered, poorly managed image databases directly undermine that deadline. If a resident uploads a floor plan or a building photograph through a Senate portal and the system struggles to distinguish it from a near-identical file already stored, processing times lengthen and errors multiply.

What Experts Are Saying

Specialists at the Technologiestiftung Berlin, based on Gürtelstraße in Friedrichshain, have been among the more vocal voices on the technical side. The foundation has previously estimated that a significant share of storage inefficiencies in Berlin's public digital systems trace back to poor deduplication protocols rather than raw data volume — though the Stiftung has not published a specific figure tied to image files alone. Archivists at the Zentral- und Landesbibliothek Berlin on Breite Straße in Mitte have separately raised concerns about the same problem in cultural digitisation projects, noting that scanning campaigns conducted between 2018 and 2023 often produced multiple near-identical versions of the same historical photograph, with no automated process in place to flag or merge them.

On the commercial side, Berlin-based startups working in document management and AI-assisted archiving — several of them operating out of co-working spaces along Torstraße in Mitte and the Factory Berlin campus in Prenzlauer Berg — have positioned themselves as potential solution providers. Representatives from at least two such firms attended the June forum. Their argument, broadly, is that AI-assisted deduplication tools already used in private-sector media management can be adapted for public archive use at relatively low cost, with some vendors citing implementation timelines of three to six months for mid-sized municipal databases.

Not everyone in the room agreed. Archivists and data-protection advocates raised concerns about feeding sensitive building and property photographs through third-party AI tools, given Berlin's strict data-protection obligations under both federal law and the Berlin Datenschutzgesetz. The Berliner Beauftragte für Datenschutz und Informationsfreiheit has not yet formally weighed in on the specific question of image deduplication, but the office's broader guidance on cloud-based data processing suggests any solution will need to run on servers located within the European Economic Area.

What Comes Next

The Senatsverwaltung für Inneres und Digitales is expected to circulate a draft protocol for image data management standards across Bezirksämter by September 2026. That timeline aligns with the final push to meet the Onlinezugangsgesetz deadline and would give individual district offices roughly two months to audit their own databases before year-end compliance reviews begin.

For residents and housing applicants dealing with the current system, the practical advice from administrators at the Bürgeramt network is straightforward: upload images in standard JPEG or PDF format, avoid resubmitting files already included in a previous application, and use the document reference numbers issued at first submission when following up. Small steps, but ones that reduce the duplicate-file burden at the point of entry — which, officials acknowledge, is where the problem most often starts.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.