Abonnement gratuit
The Daily Paris

Paris news, every day

News

Paris Archives Are Drowning in Duplicate Images — and the Numbers Tell a Costly Story

From the Grand Paris Express to the banks of the Seine, municipal digitisation projects are losing thousands of euros and untold hours to the same photographs filed twice, three times, sometimes a dozen.

By Paris News Desk · Published 4 July 2026, 9:26 pm

3 min read

Paris Archives Are Drowning in Duplicate Images — and the Numbers Tell a Costly Story
Photo: Photo by Leica Palma on Pexels
Traduction en cours…

Paris city administrators have quietly flagged a growing administrative headache buried inside the capital's accelerating push to digitise its public record: duplicate images. Across municipal departments, urban planning bodies and cultural institutions, redundant digital files are piling up at a rate that is inflating storage costs, slowing archival retrieval and undermining the credibility of official visual records. The scale, based on internal audit frameworks reviewed by comparable European municipal systems, is substantial enough to command budget attention.

The timing matters because Paris is midway through an unprecedented volume of publicly funded documentation work. The post-2024 Olympics legacy activation programme, coordinating venues from the Stade de France in Saint-Denis to the Bercy Arena in the 12th arrondissement, has generated tens of thousands of photographic assets. The Grand Paris Express construction authority, Société du Grand Paris, is simultaneously commissioning progress documentation at more than 200 worksites spread across Île-de-France. When images move between contractors, subcontractors, press offices and archivists without a unified file-management protocol, duplication is not a possibility — it is a certainty.

The Numbers That Drive the Problem

Storage is cheap until it isn't. A single uncompressed RAW image from a professional camera can run to 45 megabytes. Multiply that by a municipal photography library running into the hundreds of thousands of files — a conservative figure for an institution the size of the City of Paris — and duplicate rates of even 15 percent translate into terabytes of redundant data. European municipal IT benchmarks published by the Joinup platform, the European Commission's digital government observatory, have recorded duplicate file rates of between 12 and 22 percent in large urban digitisation programmes. At those rates, a storage and licensing budget of, say, €400,000 annually could be carrying €50,000 to €90,000 in pure waste.

Paris's own digitisation ambitions are not modest. The Bibliothèque nationale de France, headquartered on the Quai François-Mauriac in the 13th arrondissement, has been expanding its Gallica platform. The Atelier Parisien d'Urbanisme, known as APUR, produces dense visual documentation of neighbourhood change across all 20 arrondissements. Neither institution is immune to the duplication problem — both operate across multiple internal teams with overlapping mandates and, critically, different cataloguing conventions inherited from analogue-era workflows.

The specific mechanism of duplication is rarely malicious. A photograph taken during a Seine riverbank regeneration site visit in, say, the 10th arrondissement gets uploaded by the project photographer, re-uploaded by the communications officer who reformatted it for print, and then again by the archivist who received it as an email attachment. Three copies. Three file names. No automatic deduplication layer in place. Aggregated across a single year-long urban project, this produces archives that look comprehensive but are in practice bloated and internally inconsistent — the wrong image occasionally tagged as the definitive record, the right one buried three folders deep.

What Comes Next for Municipal Image Governance

The solution is technical but it requires political appetite to fund. Perceptual hashing — an algorithmic process that generates a fingerprint for each image based on its visual content rather than its file name — can identify near-duplicate images even when they have been resized, recompressed or renamed. Tools built on this approach are already deployed in media organisations across Europe. The cost of implementing such a system at municipal scale typically runs to six figures in the first year, inclusive of migration and staff training, but modelling from comparable rollouts in cities including Amsterdam and Barcelona suggests ongoing savings outpace that outlay within 24 to 36 months.

For Paris, the practical next step lies with the Direction des Systèmes et Technologies de l'Information, the city's central IT directorate, which has the mandate to set cross-departmental data standards. Any programme brought forward would ideally sync with the Grand Paris Express's documentation contracts as they roll toward their 2030 completion milestones — a moment when hundreds of thousands of additional construction images will arrive in municipal repositories. Getting ahead of that wave, rather than cleaning up after it, is the arithmetic that should be concentrating minds at the Hôtel de Ville right now.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Paris

This article was produced by the The Daily Paris editorial desk and covers news in Paris. See our editorial standards for how we use AI.

The Daily Paris brief

The day's Paris news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Paris and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Paris news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Paris and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Paris

More in News

Enjoyed this story? Get tomorrow's briefing free.