Berlin's Senate Department for Urban Development and Housing confirmed this spring that a citywide audit of its digital asset libraries had uncovered tens of thousands of duplicate image files spread across municipal servers — redundant photographs, scanned planning documents, and recycled graphics that had accumulated over more than a decade of fragmented IT procurement. The cleanup, now formally underway, is the largest data-hygiene operation the city has attempted since its 2019 e-government modernisation push.
The stakes are practical. Bloated digital archives slow down public-facing portals. The Berlin.de city portal, which handles everything from Bürgeramt appointment bookings to BVG transit updates, has faced repeated performance complaints from residents in high-density districts like Neukölln and Marzahn-Hellersdorf, where digital access to city services is especially relied upon. Duplicate images are not the only drag on system speed, but they are among the most fixable.
What Berlin Is Actually Doing
The Senate's IT service provider, ITDZ Berlin, based on Berliner Straße in Charlottenburg, is leading the deduplication work. The agency is deploying automated hash-comparison tools to flag identical or near-identical image files across departmental silos — a method standard in private-sector content management but only recently applied systematically to Berlin's public data stack. The project is running in parallel with a broader digital infrastructure overhaul budgeted at roughly €35 million over the 2025–2028 planning period, according to the Senate's mid-term financial framework published last year.
The Stadtbibliothek Berlin, whose digital collections span the Zentral- und Landesbibliothek on Breite Straße in Mitte, has been running its own image deduplication programme since early 2024, working through a catalogue of more than 1.2 million digitised historical photographs. Librarians there have flagged the process as painstaking: automated tools catch exact duplicates cleanly, but near-duplicates — slightly different scans of the same original print — require human review. That human bottleneck is slowing throughput.
Berlin is not alone in wrestling with this. Amsterdam's municipal archive, the Stadsarchief, completed a comparable deduplication sweep of its digital image holdings in 2023 and reported freeing up significant server capacity in a published project summary. Vienna's Wiener Stadtwerke, which manages transit and utilities data, integrated automated deduplication into its content management system as part of a 2022 IT consolidation, reducing redundant visual assets by a reported 40 percent within 18 months. Both cities benefited from more centralised IT governance than Berlin, whose 12 politically semi-autonomous Bezirke have historically maintained separate digital systems — a structural headache that no single Senate directive has yet fully resolved.
The Governance Gap
That decentralisation is the core of Berlin's disadvantage. A district like Friedrichshain-Kreuzberg, home to a dense concentration of tech startups along the Oberbaum corridor, may maintain its own image libraries for planning portals and local news releases that share no deduplication logic with those used by Tempelhof-Schöneberg three kilometres south. The result is systematic redundancy baked into governance architecture, not just IT choices.
London's Government Digital Service ran into similar fragmentation problems across borough councils but addressed it through a shared-platform mandate issued in 2021, requiring all London boroughs to route public-facing content through a common CMS with built-in deduplication by 2024. Berlin has no equivalent mandate in force, though the SPD-led Senate coalition's 2026 digital strategy document — released in March — signals intent to push for greater cross-district standardisation by 2028.
For Berliners using city services day-to-day, the immediate payoff of a successful cleanup would be faster-loading pages and fewer broken image links on official portals — small irritants that compound across millions of interactions. ITDZ Berlin has indicated the first full audit report will be presented to the Senate's digital committee in the fourth quarter of 2026. Whether the findings translate into binding cross-district policy, or remain advisory, will determine whether Berlin closes the gap on its European peers or keeps patching the same structural fault.