Kostenlos abonnieren
The Daily Berlin

Berlin news, every day

News

Berlin's Digital Archives Tackle a Growing Crisis: Thousands of Duplicate Images Clogging City Records

A citywide audit this week exposed the scale of a mundane but costly problem eating into public storage budgets and slowing down municipal services.

By Berlin News Desk · Published 4 July 2026, 8:36 pm

3 min read

Berlin's Digital Archives Tackle a Growing Crisis: Thousands of Duplicate Images Clogging City Records
Photo: Kahn, Otto Hermann, 1867-1934 / Public domain (Wikimedia Commons)
Wird übersetzt…

Berlin's Senatsverwaltung für Inneres und Digitalisierung confirmed this week that a systematic sweep of the city's centralised document management systems had turned up more than 340,000 duplicate image files across departmental servers — redundant scans, duplicate passport photos, and copied permit attachments that have accumulated since the migration to a unified digital platform began in January 2024. The cleanup effort, still ongoing, has already freed roughly 4.2 terabytes of storage across the network used by borough offices from Marzahn-Hellersdorf to Charlottenburg-Wilmersdorf.

The timing matters. Berlin is mid-way through a €47 million digitisation programme covering its 44 Bürgerämter, the citizen registration offices where Berliners queue — sometimes for weeks — for appointments to register addresses, apply for identity documents, or file residency paperwork. Any bloat in the underlying systems slows processing times and pushes up infrastructure costs at precisely the moment the Senate is under pressure from SPD coalition partners to show the programme is delivering value for money.

How the Duplicates Piled Up

The problem is less glamorous than it sounds. When scanning equipment at offices on Müllerstraße in Wedding or at the main Bürgeramt on Rathaus Reinickendorf uploads a document, the system logs an image file. If a clerk opens, saves, or transfers that file without using the platform's native version-control tools — which were only rolled out to all offices in March 2026 — a duplicate is created silently in the background. Staff were often unaware. Over roughly 26 months, the volume compounded.

The Berliner Beauftragter für Datenschutz und Informationsfreiheit, the city's data protection office, flagged duplicate image retention as a compliance risk in a March 2026 guidance note, pointing out that holding redundant copies of identity documents and biometric photos beyond their lawful retention period could breach GDPR obligations under Article 5. That regulatory nudge appears to have accelerated the Senate's decision to run the audit now rather than waiting until the full digitisation rollout concludes in late 2027.

The technical work has fallen largely to the Berliner Datenzentrum, the city's central IT infrastructure unit based in Tempelhof, which is running automated deduplication scripts across departmental folders. Civic tech observers familiar with the project — speaking generally, not attributing specific internal figures — note that municipal deduplication programmes in cities of comparable size typically recover between three and eight percent of active storage capacity. Berlin's early returns appear to sit toward the upper end of that range.

What Comes Next for Residents and Offices

For ordinary Berliners, the immediate practical effect is indirect but real. Storage constraints have contributed to slower upload speeds at terminals inside some Bürgerämter, and the backlog of digitised legacy files — paper records from before 2020 that are still being scanned — has competed with active casework for server bandwidth. Freeing capacity now should ease those bottlenecks before the autumn surge in new registrations, which historically peaks in September when students and new arrivals flood the appointment system.

The Senate's digitisation team has also announced a mandatory refresher training programme for scanning staff across all 44 offices, starting the week of 13 July 2026. The sessions will cover correct use of the version-control features that prevent duplicate creation at source. Staff at the Bürgeramt Friedrichshain-Kreuzberg on Petersburger Straße are scheduled among the first cohorts.

Longer term, the episode is likely to sharpen the debate inside the coalition about procurement standards for the second phase of the digitisation programme. Opposition voices on the Abgeordnetenhaus's digital affairs committee have already questioned whether the original platform contract — awarded without a full interoperability review — adequately specified deduplication as a baseline requirement. The Senate has not yet responded publicly to those criticisms. The audit findings are expected to be presented formally to the committee before the summer recess ends in mid-August.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Berlin

This article was produced by the The Daily Berlin editorial desk and covers news in Berlin. See our editorial standards for how we use AI.

The Daily Berlin brief

The day's Berlin news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Berlin news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Berlin and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Berlin

More in News

Enjoyed this story? Get tomorrow's briefing free.