Abonnement gratuit
The Daily Paris

Paris news, every day

News

Paris Agencies Race to Purge Duplicate Images from Grand Paris Express Digital Archive This Week

A cataloguing push across multiple municipal platforms has exposed how thousands of redundant photographs are clogging city infrastructure databases, slowing public-facing tools and costing storage budgets.

By Paris News Desk · Published 4 July 2026, 8:48 pm

3 min read

Paris Agencies Race to Purge Duplicate Images from Grand Paris Express Digital Archive This Week
Photo: Photo by Artūras Kokorevas on Pexels
Traduction en cours…

Paris city authorities and the Société du Grand Paris confirmed this week that a system-wide audit of their shared digital asset library had identified more than 14,000 duplicate image files across project databases, triggering an emergency deduplication sprint that began Monday and is expected to run through July 11. The problem, long flagged by archivists at the Pavillon de l'Arsenal on Boulevard Morland, had compounded steadily since 2024, when the post-Olympics documentation effort flooded municipal servers with overlapping photography from dozens of separate contractor teams.

The timing matters. The Grand Paris Express — the largest urban transit project in Europe — is entering a critical phase of public communication, with Line 15 South now operational between Fort d'Aubervilliers and Bry-Villiers-Champigny and new station openings scheduled for late 2026. Every delay in updating the project's public image galleries translates into outdated visuals on information boards, press kits and the dedicated SGP website used by roughly 80,000 unique monthly visitors, according to internal figures cited in documents obtained by The Daily Paris.

Where the Backlog Built Up

The duplication problem is concentrated in two specific archive nodes. The first sits within Apur — the Atelier Parisien d'Urbanisme, based on Île de la Cité — which manages urban planning photography dating back to the 2016 launch of the Grand Paris Express environmental impact studies. The second node is held by the communication directorate of the Établissement public territorial Est Ensemble, covering the Seine-Saint-Denis municipalities from Pantin to Noisy-le-Sec. Together, the two repositories account for an estimated 60 percent of the duplicate volume identified this week.

The practical consequences are not trivial. Storage costs for municipal cloud infrastructure in Paris have risen sharply since 2023, with the city's Direction des Systèmes et Technologies de l'Information reporting that unmanaged file proliferation contributed to a 22 percent overspend against its digital storage budget line for fiscal year 2025. A duplicate image replacement protocol — essentially a scripted tool that cross-references file hashes, flags redundant copies, and substitutes a single canonical version — had been piloted at the Hôtel de Ville IT division in late 2025, but rollout to partner organisations stalled over data-sharing agreements.

This week's sprint is the first coordinated attempt to apply that protocol across all partner institutions simultaneously. Teams from the Direction de l'Urbanisme, Apur and the SGP communication unit held a working session at the Cité de l'Architecture et du Patrimoine on Trocadéro on Wednesday to align on file-naming conventions before running the deduplication scripts. The session had been rescheduled twice since May.

What Comes Next for City Digital Infrastructure

The immediate goal is to reduce the archive from its current 47 terabytes to under 30 terabytes before August 1, freeing capacity ahead of a planned expansion of the Paris urban data portal — data.paris.fr — which is due to integrate live construction imagery from Grand Paris Express worksites in the 14th and 15th arrondissements. Contractors working on the Villejuif–IGR station corridor have been supplying raw site photography at a rate that archivists say is outpacing any manual review process.

For Parisians and the press offices that serve them, the practical upshot is more reliable imagery in the months ahead. Public relations staff at Est Ensemble have reportedly been working around the duplicate problem for months by maintaining their own shadow folders — an ad hoc solution that itself generates fresh redundancy. Once the canonical archive is stabilised, all partner communications teams are expected to draw from a single authenticated source by September 2026.

Longer term, Apur has proposed embedding automated hash-checking into the ingest pipeline for all new photography commissions, a measure that would prevent duplication at the point of upload rather than requiring periodic clean-up campaigns. A formal recommendation is due before the Direction de l'Urbanisme's quarterly review, currently scheduled for October 14. Whether the city adopts it will depend partly on budget negotiations for the 2027 municipal digital plan, which Conseil de Paris committees begin examining in September.

Topic:#News

How does this story make you feel?

Spread the word

See something wrong? Suggest a correction.

Have your say

Loading comments…

Sources

About this article

Published by The Daily Paris

This article was produced by the The Daily Paris editorial desk and covers news in Paris. See our editorial standards for how we use AI.

The Daily Paris brief

The day's Paris news in a 2-minute read, every weekday morning. Free.

By subscribing you agree to receive emails from The Daily Paris and accept our Privacy Policy. Unsubscribe anytime.

Daily brief

Enjoyed this? Wake up to Paris news every morning.

Free, in your inbox before 7am. Weekdays.

By subscribing you agree to receive emails from The Daily Paris and accept our Privacy Policy. Unsubscribe anytime.

More from The Daily Paris

More in News

Enjoyed this story? Get tomorrow's briefing free.