Istanbul's municipal archives are sitting on a problem decades in the making. Duplicate and near-duplicate images — scanned photographs, drone footage stills, and digitised glass-plate negatives — have accumulated across at least three separate municipal databases, creating a tangle that archivists, urban planners, and heritage bodies are now being forced to confront before a long-delayed digitisation deadline arrives in March 2027.
The issue has sharpened this summer because Istanbul Metropolitan Municipality's (IBB) digital infrastructure directorate is midway through a 14-month project to consolidate records from the Atatürk Library in Taksim, the Fatih Municipality archive in the historic peninsula, and the Istanbul Urban Database maintained under the city's planning secretariat. When contractors began cross-referencing collections earlier this year, they found that a significant share of catalogued images appeared in two or more systems under different file names, different dates, or different location tags — some flagging the same street corner in Balat as being in Fener, or misattributing Ottoman-era photographs of the Grand Bazaar to the Covered Market in Üsküdar.
Why the Decisions Ahead Actually Matter
This is not a bureaucratic nuisance. Istanbul carries an earthquake risk that concentrates minds on exactly these records. After the February 2023 Kahramanmaraş earthquakes killed more than 50,000 people across southeastern Turkey, city planners across the country were reminded that analogue architectural documentation can vanish in minutes. Digital records are only as good as their integrity. A duplicate image filed under the wrong address in a damage-assessment database could send inspection teams to the wrong building or allow a vulnerable structure to escape review altogether.
The IBB's deadline pressure is real: the European Union's Urban Innovative Actions framework, under which part of the digitisation project is co-funded, requires certified data delivery by the first quarter of 2027. Missing that window risks losing a tranche of funding that city officials have not publicly quantified but which is understood to be substantial enough to stall phase two of the project.
The Atatürk Library alone holds an estimated 280,000 visual items, a figure the library itself has published in its collection overview. Of those, archivists working on the consolidation project believe a meaningful proportion — the working internal estimate, according to a project summary document circulated to municipal committees in May 2026 — may involve duplicates created during three separate earlier scanning drives between 2009 and 2021, each of which used different metadata standards.
The Practical Choices on the Table
Three options are now being weighed. The first is algorithmic deduplication: running the entire merged database through image-matching software that flags near-identical files for human review. This is fast but requires a capital outlay for licensed software and introduces its own errors — Ottoman photographs of similar-looking hans or caravanserais along Divanyolu can fool automated systems trained on modern imagery.
The second approach is manual curatorial review, led by a specialist team. The Istanbul Research Institute on İstiklal Caddesi, which holds one of the city's strongest concentrations of Ottoman and early Republican visual expertise, has been informally consulted about whether its curatorial staff could play a role. That conversation is ongoing and no formal agreement exists yet.
The third path is a hybrid: machine flagging followed by expert sign-off, phased over 18 months. Most observers familiar with the project consider this the likeliest outcome, though it requires the IBB and the Culture Ministry's regional directorate to agree on shared protocols — a coordination challenge given that the two bodies have operated under different data standards since at least 2015.
The clock is running. By September 2026, the project's steering committee is expected to select a methodology and commit resources. Neighbourhood-level planners in Beyoğlu and Fatih, where earthquake-risk assessments depend partly on accurate building-age records cross-referenced with photographic evidence, will be watching closely. Getting the deduplication right is unglamorous work. Getting it wrong, in a city where the ground itself is a risk, carries consequences that go well beyond a cluttered hard drive.