Berlin's public sector is sitting on a digital hoarding problem. Across the city's major administrative databases — from the Senatsverwaltung für Stadtentwicklung's housing portal to the Landesarchiv Berlin on Eichborndamm in Reinickendorf — duplicate image files now account for an estimated 30 to 40 percent of total stored visual assets, according to internal efficiency assessments circulated among IT procurement teams in early 2026. That figure, while not yet published as official policy, has become a quiet driver behind a broader digital infrastructure review expected to conclude by the end of Q3 this year.
The timing matters. Berlin's Senate is currently finalising its 2027 budget framework, and cloud storage and server infrastructure represent one of the fastest-growing line items in the city's digital administration spend. The Senate's Digitalstrategie Berlin, launched in 2023, set a target of reducing redundant data infrastructure costs by 15 percent before the end of this legislative period. Duplicate image removal — unglamorous, technical, but quantifiable — has emerged as one of the clearest routes to hitting that number.
Where the Redundancy Accumulates
The problem is most acute in two areas: housing administration and cultural digitisation projects. The Investitionsbank Berlin, which administers subsidised housing loan documentation on Bundesallee in Wilmersdorf, processes thousands of property images per year as part of its Wohnraumförderung programme. Staff working across different departments regularly upload the same floor-plan scans, facade photographs and inspection images without a centralised deduplication check. A pilot audit covering just one quarter of 2025 submissions reportedly found that roughly one in five uploaded images was an exact or near-exact duplicate of a file already in the system.
The Stadtmuseum Berlin network — which spans venues including the Ephraim-Palais in Mitte and the Märkisches Museum near Köllnischer Park — faces a structurally similar challenge. Its digitisation programme, part of a broader effort funded under the Kulturprojekte Berlin umbrella, has digitised more than 600,000 objects since 2019. Cross-departmental uploads, format conversions and successive scanning rounds have created a layered archive where automated tools now flag duplicate or near-duplicate entries at a rate that curators describe internally as unsustainable without dedicated software intervention.
Storage is not cheap. Commercial cloud pricing for the kind of high-resolution TIFF and RAW files used in archival work runs between €0.02 and €0.05 per gigabyte per month on standard enterprise contracts. A mid-sized municipal archive holding 50 terabytes of image data — a conservative estimate for an institution the size of the Landesarchiv — pays upwards of €12,000 a year in storage alone before factoring in bandwidth and retrieval costs. Eliminating a third of that footprint through deduplication translates directly into five-figure annual savings.
What Deduplication Actually Requires
The technical solutions exist and are not exotic. Perceptual hashing algorithms — software that generates a compact fingerprint for each image and flags matches even across different file formats or resolutions — have been in commercial use since the early 2010s. Tools built on open-source libraries are already deployed by German federal institutions including the Deutsche Nationalbibliothek in Frankfurt and Leipzig. Berlin's challenge is less about the technology than about coordination across siloed departments that operate on different content management systems, some of which were procured under contracts dating back to 2014 or earlier.
The Senate's IT coordination body, the Berliner Beauftragte für Informationstechnik, is expected to issue updated data governance guidelines in autumn 2026 that will, for the first time, include mandatory deduplication checks as part of the procurement criteria for any new digital asset management system. Institutions tendering for new systems after January 2027 will need to demonstrate compliance.
For anyone managing image libraries inside Berlin's public sector right now, the practical advice is straightforward: run an audit before the new guidelines land. Institutions that can demonstrate existing deduplication processes will be better positioned in the next procurement cycle — and will spend less on storage in the meantime. The numbers, for once, are on the side of good housekeeping.