Berlin's public sector has a clutter problem no one wants to talk about. Across municipal databases — from the Senatsverwaltung für Stadtentwicklung to the Landesarchiv Berlin on Eichborndamm in Reinickendorf — administrators are grappling with sprawling collections of duplicate digital images that are inflating storage costs, complicating public records requests, and threatening the integrity of official documentation. The question now is who decides what gets deleted, and by what standard.
The issue has sharpened this year because Berlin's coalition government committed in its 2025 Digital Strategy update to completing a full audit of municipal digital assets by the end of Q3 2026. That deadline is roughly ten weeks away. Agencies that have not yet developed a duplicate-removal policy are now facing pressure to act — or explain why they haven't.
Why the Stakes Are Higher Than They Look
Duplicate image files are not merely a housekeeping nuisance. In planning departments and housing registries — both under significant political scrutiny given the ongoing rent cap debate — a single duplicated building photograph filed under two different reference numbers can create conflicting records in legal disputes. The Bezirksamt Mitte, which handles some of the highest-volume property documentation in the city, processes tens of thousands of image attachments per year linked to construction permits and zoning files. A misfiled or duplicated image in that system can delay permit approvals and, in the worst case, be cited in administrative court proceedings.
Berlin's data protection framework adds another layer of complexity. The Berliner Beauftragter für Datenschutz und Informationsfreiheit — the city's data protection authority — has previously flagged that automated bulk deletion of digital records without human review may conflict with retention obligations under the Landesarchivgesetz Berlin. That law sets mandatory minimum retention periods for certain categories of public documents, and image files attached to official proceedings often fall within scope. Deleting a duplicate without confirming it is not the sole surviving version of a record is legally risky.
The city's tech procurement office has been evaluating at least two software platforms capable of perceptual hash-based duplicate detection — a method that identifies visually identical or near-identical images even when file names and metadata differ. One pilot, run through the Kompetenzzentrum Digitale Verwaltung, was tested in late 2025 on a subset of files held by the Stadtbibliothek Berlin. Results from that pilot have not been formally published, but the evaluation phase was scheduled to conclude by June 30, 2026.
The Decision Points Coming This Summer
Three choices now sit on the desks of senior administrators. First, whether to mandate a city-wide deduplication standard or leave each Bezirk to develop its own approach. The decentralised option is cheaper to launch but risks creating incompatible systems across Berlin's 12 districts — a problem already visible in the patchwork of property databases between Friedrichshain-Kreuzberg and Spandau.
Second, agencies must decide how to handle images that are near-duplicates rather than exact copies — photographs taken seconds apart, or the same document scanned twice at slightly different resolutions. Perceptual hashing catches these, but the deletion of a near-duplicate still requires a human sign-off under the current Landesarchivgesetz interpretation. Staffing that review process across all agencies is expensive; the Senatsverwaltung für Inneres estimated in its 2025 budget annex that a full manual review of flagged files across three major departments would require approximately 1,200 person-hours.
Third, and most politically sensitive, is the question of public transparency. The Abgeordnetenhaus committee on digital affairs has signalled interest in requiring agencies to publish deduplication logs — showing what was deleted, when, and under what authority. That kind of audit trail would satisfy open-government advocates but adds administrative overhead that smaller Bezirksämter say they cannot absorb without additional funding.
The practical path forward, according to the Digital Strategy timeline, requires agencies to submit their deduplication frameworks to the Senatskanzlei by September 15, 2026. Those that miss the deadline face the prospect of a centrally imposed standard — which may be the nudge some administrators are quietly waiting for. For Berliners who rely on accurate public records, from tenants disputing renovation claims in Neukölln to architects pulling permit histories in Prenzlauer Berg, the quality of that decision will matter well beyond the server room.