Paris's municipal digital infrastructure is carrying tens of thousands of duplicate image files across its public-facing platforms, a sprawl that accumulated quietly over roughly a decade of uncoordinated uploads, agency mergers, and emergency web migrations — and that city administrators are now working to systematically clear. The problem, long treated as a low-priority housekeeping issue, moved up the agenda this spring after the Direction de la Communication at the Hôtel de Ville flagged storage inefficiencies during a broader audit of the Grand Paris digital estate.
The timing matters. With the Paris 2024 Olympic legacy programme still generating fresh content — venue repurposing at the Stade de France corridor, before-and-after imagery of the Seine riverbanks, drone footage of the Saint-Denis aquatic centre's new community programming — the city's content management systems are absorbing material at an accelerated rate. Layering new assets onto an already bloated archive without first resolving duplication creates compounding retrieval failures, accessibility gaps, and unnecessary server costs.
How the Backlog Built Up
The duplication problem did not happen overnight. It traces back to at least 2015, when the Mairie de Paris began migrating content from a patchwork of arrondissement-level websites onto a centralised paris.fr platform. Each of the capital's 20 arrondissements had maintained its own image library, often with overlapping stock — photographs of the same monuments, markets, and public squares saved under different filenames and metadata conventions. When those libraries were folded into the central content management system without deduplication protocols, identical or near-identical files were simply stacked on top of one another.
The problem deepened between 2020 and 2022, when the city's communications teams accelerated digital publishing to handle pandemic-era public information campaigns. Departments including the Direction des Affaires Scolaires and the Délégation à la Politique de la Ville uploaded image batches independently, often without cross-referencing existing assets. By the time the Grand Paris Express communications push launched in earnest — promoting new stations along Line 15 South and the approaching openings on Line 16 — the archive had grown unwieldy enough that search returns on common queries were generating dozens of visually identical results.
A 2025 internal review by the city's Direction des Systèmes et Technologies de l'Information, covering platforms managed directly by the Ville de Paris, identified file redundancy as one of three primary contributors to slow content delivery on mobile networks in outer arrondissements, particularly in the 19th and 20th. Page-load performance on neighbourhood information portals in areas around the Parc de la Villette and the Belleville slope was measurably slower than in more central districts, partly because those pages drew from the most densely duplicated sections of the image library — community event photography accumulated since 2016.
The Deduplication Push
Since March 2026, the city has been running a phased duplicate-image replacement programme through its web content teams, prioritising high-traffic pages first. The process involves flagging duplicate files using automated hash-matching tools, then replacing multi-instance images with a single canonical version stored in a structured asset directory. Editors working on content related to the Seine-Saint-Denis regeneration projects and the Plaine Commune urban development zone have been among the first to work through updated upload protocols introduced this year.
The broader lesson being absorbed across municipal communications offices in Paris is straightforward: governance of digital assets requires the same rigour applied to physical infrastructure. Failing to establish shared naming conventions and deduplication checkpoints at the point of upload — rather than attempting retrospective clean-up years later — is what turned a manageable inconvenience into a significant administrative project. City officials have not publicly confirmed a completion timeline for the full archive audit, but the phased rollout is expected to continue through the end of 2026, with updated contributor guidelines posted to the internal intranet used by all Direction-level communications staff. For residents and journalists accessing city image resources through the open data portal data.paris.fr, the practical effect should be cleaner search returns and faster load times — eventually.