Berlin's sprawling network of public digital archives is carrying a significant and growing weight: tens of thousands of duplicate images stored redundantly across municipal systems, driving up storage costs and making it harder for civil servants, researchers, and ordinary residents to find usable public records. The Senatsverwaltung für Stadtentwicklung, Bauen und Wohnen confirmed this year that deduplication has been added to the agenda for its digital infrastructure review, with a deadline for initial recommendations set for autumn 2026.
The timing matters. Berlin is in the middle of a broader push to modernise public-sector IT after years of fragmented investment. The SPD-led coalition's 2025 coalition agreement committed to a unified digital services platform — provisionally called BerlinServicePortal — that would consolidate dozens of departmental systems. But that project cannot move cleanly forward while legacy databases, many of them maintained by bodies like the Stadtarchiv Berlin on Breite Straße and the Landesarchiv Berlin in Reinickendorf, continue to ingest unfiltered image data from multiple sources simultaneously.
Why Duplicates Accumulate — and Why They're Expensive
The core problem is architectural. Departments including the Bezirksamt Mitte and the Senatsverwaltung für Kultur have historically uploaded photographs to their own siloed databases without a shared tagging or hash-checking protocol. A single image of, say, the renovation works on Karl-Marx-Allee might exist in four or five separate folders under different file names, each copy consuming server capacity. Cloud storage costs for Berlin's public sector are billed on a per-gigabyte basis, and the Rechnungshof Berlin — the state audit office — flagged redundant digital storage as a line item concern in its 2024 annual report. No consolidated public figure for the total cost of duplicate image storage has been released, but the audit office's report noted that IT storage expenditure across Berlin's Senate departments rose by a double-digit percentage between 2021 and 2023.
The problem is not purely financial. Researchers using the Landesarchiv's online portal to find photographic documentation of specific construction projects — crucial given the current housing-shortage debate — frequently pull up multiple identical or near-identical images, wasting time and eroding trust in the archive's search functionality. Journalists and urban planners working on the Mietendeckel rent-cap discussion have raised similar frustrations informally through public comment channels.
What Happens Next: The Decisions That Will Define the Outcome
The Senatsverwaltung für Digitalisierung und Verwaltungsmodernisierung is expected to release a procurement brief by September 2026 for a deduplication tool compatible with the BerlinServicePortal framework. At least three approaches are under internal discussion, according to publicly circulated agenda papers from a June 2026 working group at the Rotes Rathaus: a perceptual hashing system that identifies visually identical images regardless of file name; a metadata-driven tagging overhaul requiring staff retraining across all Bezirke; and a hybrid that combines both, at considerably higher initial outlay.
The hybrid option is the most technically sound but also the most politically complicated. Budget negotiations for fiscal year 2027 begin in earnest in October, and the coalition's finance senator will be balancing demands from housing, transport — including BVG's ongoing U-Bahn extension work on the U7 line toward Spandau — and energy-transition programs under Energiewende commitments. A mid-range deduplication and archive modernisation project of this kind typically costs between €800,000 and €2 million for a city-state of Berlin's scale, based on comparable projects in Hamburg and Vienna, though Berlin has not yet published its own cost estimate.
Civil society organisations and archive professionals have until 31 August 2026 to submit formal responses to the Senatsverwaltung's public consultation on digital archive standards. The Förderverein für die Bewahrung schriftlichen Kulturgutes in Berlin, which represents archive users and professionals across the city, has encouraged members to engage with that process. The next working group session at the Rotes Rathaus is scheduled for early September. Whatever technical solution the Senate endorses will set the standard not just for images but for the broader deduplication question across all public digital assets — a decision with consequences well beyond a few redundant JPEGs.