Workshop : Transforming Complex Web Apps into Static Sites

Long title
How to transform a complex project-specific web application into a large static site
Starts at
Sat, Oct 25, 2025, 14:00 GMT+2
Finishes at
Sat, Oct 25, 2025, 18:00 GMT+2
Venue
Aula Rubió

How to transform a complex project-specific web application into a large static site

A practical case study in making special-collections web apps sustainable. We show how ZBW re-platformed the 20th Century Press Archives (PM20) from a costly, expert-dependent stack to an open, static, Wikidata-integrated site. PM20 now serves 67k folder descriptions, 2M indexed pages, and 4M images via client-side IIIF and interlinked HTML/JSON—kept online with minimal ops, lower security exposure, and measured defenses against aggressive crawlers. We’ll share principles, lightweight workflows, and a regeneration pipeline that publishes enriched metadata as JSON-LD, enabling ongoing community contributions. Designed for maintainers and planners, the session distills patterns to reduce complexity, boost resilience, and plan for long-term stewardship, followed by peer consulting on participants’ own cases.
  • Joachim Neubert

    WikiProject 20th Century Press Archives (former affiliation: ZBW – Leibniz Information Center for Economics), Hamburg, Germany

    Joachim Neubert is an information scientist and software developer, formerly working at ZBW - Leibniz Information Centre for Economics in Hamburg, researching, writing software and publishing about knowledge organization systems in the web of Linked Data. For many years, he was co-organizing [SWIB](https://swib.org) Semantic Web in Libraries conference. Currently he is active in the Wikidata und Wikimedia context, specifically the [Wikiprojekt Pressearchiv](https://de.wikipedia.org/wiki/Wikipedia:Projekt_Pressearchiv).

The 20th Century Press Archives, digitized in 2004-2007 with funding from the German Research Foundation, faced a tough challenge: the owning institution, ZBW – Leibniz Information Centre for Economics, has maintained a complex and expensive software infrastructure and continued indexing the holdings. 10 years after the original project ended, the costs were not affordable any longer, and the responsible person - the last one from the original project team with a deep understanding of the application - was going to retire in the following year.

By far not an uncommon situation: Continued online availability may be at risk in numerous similar projects, originally funded to make special collections available on the web, using the latest technology. Additionally, as is now widely recognized within the international scientific community, public knowledge and research results are under threat of reckless or targeted eradication by authoritarian governments.

ZBW decided to put the metadata of the collection under an open license, to integrate it into Wikidata, and to transform the application into a static site (https://pm20.zbw.eu). The site includes data about 67,000 folder descriptions, two million indexed pages (mostly digitized newspaper clippings) and four million images from digitized roll films. It consists of interlinked static HTML, JSON and image files, which are delivered by a standard web server. IIIF image presentation works client-side-only. Wikidata links provide context, discoverability and even dynamic search support.

The simplicity of the technical solution allows ZBW keep up the service with minimal effort and standard web administration skills. It does not require constant vigilance on outdated libraries and recurring security threats, and facilitates the defense against “AI” crawlers. As an added bonus, since last year a Wikidata/Wikipedia community project has been working to enhance data indexing. ZBW supports this by periodically running an automated recreation of the website with the extended metadata, which is fully published as JSON-LD (https://doi.org/10.5281/zenodo.11472914).

The workshop will share experiences with the techniques and workflows employed in the PM20 transition project and highlight design decisions. It aims to encourage participants to think about their own applications and ways to make them more simple and sustainable, offering room to discuss open questions or concerns with other participants.

Format

Half-day workshop. After a presentation of experiences from the PM20 project, questions and open issues from the participants will be collected and discussed in a peer consulting session, probably in smaller teams, so that everybody can contribute.

Intended audience and prerequisites

  1. Everybody who maintains, or oversees maintenance of, complex web applications for special collections and wants to lighten the burden.
  2. Everybody who plans for web projects and considers reducing complexity from start on for easier maintenance and better long-term sustainability.

The presentation will not go into the details of specific tools (providing links to working and publicly accessible code in the slides, however). It should not require deeper programming skills.

Background reading