Abonnement gratuit
The Daily Paris

Paris news, every day

News

How Paris Got Serious About Duplicate Images in Its Public Archive — and Why It Took So Long

A decade of fragmented digital records, post-Olympics infrastructure investment, and a push to modernise the city's visual heritage have finally forced the question of what to do about thousands of duplicate images clogging municipal databases.

By Paris News Desk · Published 4 July 2026, 9:45 pm

3 min read

How Paris Got Serious About Duplicate Images in Its Public Archive — and Why It Took So Long
Photo: Photo by Colin Piret on Pexels
Traduction en cours…

Paris city hall confirmed this spring that its central image archive, maintained by the Direction des Affaires Culturelles, contained an estimated 40,000 duplicate files — photographs, scanned heritage documents, and promotional visuals accumulated across more than a decade of disconnected digitisation drives. The duplication problem, long acknowledged internally but never formally addressed, has now become a practical obstacle as the city attempts to activate its post-2024 Olympics legacy programming across venues from the Stade de France in Saint-Denis to the refurbished berges of the Seine.

The timing matters. Paris is in the middle of the most ambitious municipal digitisation effort in its modern history. The Grand Paris Express project alone has generated hundreds of thousands of engineering drawings, site photographs, and public-consultation images since construction began in earnest in 2016. Many of those records passed through multiple city departments — the Société du Grand Paris, the Île-de-France regional authority, individual arrondissement planning offices — before landing in central servers, each transfer creating duplicate or near-duplicate copies. Add the Archives de Paris on the rue des Quatre-Fils in the 3rd arrondissement, which has been uploading pre-war photographic collections since 2019, and the scale of the duplication problem becomes clearer.

A Problem Built Slowly, Over Many Administrations

The roots of the issue trace back to 2013, when the city launched its first serious push to digitise neighbourhood planning records ahead of the Grand Paris metropolitan merger. Each of the then-20 arrondissement mairies operated semi-independently, uploading scanned documents to their own local servers before those records were eventually migrated — incompletely — to a centralised system. No deduplication protocol was in place. By the time Paris won the 2024 Olympic bid in September 2017, the archive had already developed what one internal audit later described as a structural redundancy problem, with some heritage photographs of sites like the Palais Royal and the Canal Saint-Martin appearing in the system under four or five separate file names.

The Olympics accelerated the intake without solving the underlying architecture. Between 2018 and 2024, the city's communications teams, the Paris 2024 organising committee, and external contractors all fed imagery into overlapping databases. The Bibliothèque historique de la Ville de Paris, situated on the rue de Rivoli, found itself holding duplicate scans of the same 19th-century boulevard photographs sourced from three separate digitisation grants, each funded under different European Union cultural heritage programmes. EU funding rules required independent documentation for each grant, which meant parallel uploads were not just common — they were, in a bureaucratic sense, mandatory.

The Push Toward a Unified System

The practical cost is real. Storage for the city's main digital archive runs to roughly €2.3 million annually, according to budget documents published by the Mairie de Paris for the 2025 fiscal year. Analysts working on the Seine-Saint-Denis urban regeneration corridor — a priority zone stretching from Aubervilliers to Bondy — have reported that search times on the municipal image database can run to several minutes when queries return hundreds of near-identical results. That is not a trivial inconvenience when planners are working against deadlines set by the post-Olympics legacy programme, which commits to delivering upgraded public spaces across 22 communes by the end of 2027.

The Direction des Systèmes et Technologies de l'Information, the city's IT directorate, began piloting automated deduplication software in February 2026 across a test batch of 8,000 images drawn from the Belleville and Ménilmontant neighbourhood archives. The pilot is scheduled to conclude in September 2026, with a city council vote on full deployment expected before the end of the year. If approved, the rollout would cover all municipal holdings by mid-2028 — a timeline that aligns, not coincidentally, with the next round of EU structural fund reporting requirements. For anyone who submits image requests to city departments for planning or research purposes, the practical advice for now is straightforward: request files directly from the Archives de Paris rather than through the general municipal portal, where duplicates are most densely concentrated.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Paris

This article was produced by the The Daily Paris editorial desk and covers news in Paris. See our editorial standards for how we use AI.

The Daily Paris brief

The day's Paris news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Paris and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Paris news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Paris and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Paris

More in News

Enjoyed this story? Get tomorrow's briefing free.