Berlin's network of public archives, cultural institutions, and municipal offices is sitting on a problem years in the making: tens of thousands of duplicate, mislabelled, and legally ambiguous images scattered across databases that were never designed to talk to each other. The Senate Department for Culture and Social Cohesion confirmed earlier this year that a full audit of the city's digital visual holdings was underway, but the hard decisions about what gets deleted, what gets consolidated, and who pays for it are only now landing on the desks of administrators and procurement officers.
The timing matters. Berlin is mid-cycle on its Smart City Strategy 2030, a framework that commits the city to unified, interoperable data systems by the end of the decade. Images — of infrastructure, public housing blocks in Marzahn-Hellersdorf, street-level documentation of Friedrichstraße construction corridors — are not incidental. They are the raw evidence base for planning decisions worth hundreds of millions of euros. When the same photograph exists in three separate databases under different file names and different copyright attributions, it doesn't just waste server space. It creates legal exposure and erodes the reliability of the archive itself.
What the Audit Revealed — and What It Didn't
The Stadtmuseum Berlin, which holds one of the largest photographic collections in the German-speaking world, has been running its own internal deduplication project since early 2025. The challenge there is acute: analogue-to-digital migration work done between 2010 and 2020 produced significant overlap, with some collections digitised multiple times by different contractors using different metadata standards. The institution has not yet published a final count of affected records, but the scope of the problem is understood to run into six figures.
At the Landesarchiv Berlin on Eichborndamm in Reinickendorf, archivists have been working with perceptual hashing tools — software that identifies near-identical images even when file names or formats differ — since a pilot programme launched in March 2025. The technology can flag duplicates automatically, but a human archivist still has to decide which version of an image is the authoritative one, which metadata set is correct, and whether any of the copies carry distinct legal rights. That last question is not trivial in a city where photograph rights disputes have ended up before the Landgericht Berlin.
The costs are real. Commercial deduplication and digital asset management platforms licensed for institutional use in Germany typically run between €40,000 and €150,000 annually depending on collection size and user count — figures that land uncomfortably in a Senate budget already under pressure from the city's housing construction programme and BVG rolling stock upgrades. The Senate's 2026 cultural technology budget line has not been publicly broken down at the project level.
The Decisions Ahead — and Who Has to Make Them
Three questions are now driving the internal debate. First, which institution leads? The Stadtmuseum, the Landesarchiv, and the Zentral- und Landesbibliothek on Blücherplatz in Kreuzberg all have overlapping mandates for the city's documentary image heritage. No single body currently has authority to impose a unified metadata standard across all three.
Second, what happens to images whose rights cannot be resolved? Under German copyright law, works by photographers who died after 1955 remain protected until 70 years after death. Some images in the municipal holdings are attributed to names that cannot be traced. Simply deleting them risks destroying irreplaceable documentation. Keeping them in an unresolved state perpetuates the legal uncertainty.
Third, does Berlin build its own system or buy into a shared federal infrastructure? The Deutsche Digitale Bibliothek, based in Frankfurt, already aggregates cultural data from institutions across all 16 Länder. Berlin has been a contributor since 2012 but has never fully integrated its municipal image holdings into that pipeline. Doing so now would solve the interoperability problem but would require Berlin to accept data governance rules it does not fully control.
A working group under the Senate Department for Culture is expected to produce a formal recommendation by September 2026. Whatever it proposes will need Senate approval before the city's next budget cycle closes in November. Institutions that have already invested in local solutions will be watching closely to see whether that recommendation respects the work already done — or asks them to start again.