Best Practices
The programme is still being finalized and is subject to ongoing updates as sessions are scheduled. Please check back regularly for the latest changes.
AI-Assisted Cataloging in Practice: A Human-in-the-Loop Approach to Scalable Metadata Creation
Academic libraries face growing resource volumes and a shortage of experienced catalogers. To address this, the National University of Singapore Libraries initiated the AI-Powered Cataloging (AICAT) Project, exploring how large language models (LLMs) can augment bibliographic metadata creation while maintaining human oversight.
This case study shares a practical implementation that generates draft MARC21 records compliant with RDA, ISBD, LCSH, and LCC standards. Utilizing engineered prompts with strict operational constraints—such as hallucination prevention and metadata validation—the project evaluates multiple LLMs to assess accuracy and standards adherence.
The three-pronged methodology encompasses prompt engineering, comparative model benchmarking, and workflow automation testing. Results show that AI significantly reduces repetitive cataloging effort and improves scalability, while catalogers retain final editorial authority. Ultimately, effective AI deployment depends on aligning machine outputs with human values through standards-based validation and expert oversight.
-
Mi-kyeong Kam
National University of Singapore
Kam Mi-Kyeong is a Principal Librarian in the Collections Management & Preservation Cluster at NUS Libraries. She leads the Resource Organization Team, overseeing metadata services, e-resource management, and resource organization workflows. Her work focuses on library systems and process improvement, leveraging information technologies to optimize resource management in support of research, teaching, and learning. Her professional interests include metadata innovation, workflow automation, and the strategic application of AI to enhance library services and operational efficiency.
From Records to Entities: Open Metadata Infrastructure for Consortial E-Book Resource Sharing
Metadata is increasingly recognized not merely as descriptive information, but as critical infrastructure enabling open knowledge ecosystems. As a foundation for interoperability, discovery, and reuse across institutional and national boundaries, open metadata supports more connected and efficient access to information across systems. Yet traditional record-based systems remain constrained by fragmentation, local variation, and limited interoperability, resulting in persistent data silos that hinder effective resource sharing.
This presentation examines an ongoing, multi-phase pilot led by the Partnership for Academic Library Collaboration & Innovation (PALCI), a consortium of over 70 academic and research libraries across the Mid-Atlantic United States. The E-ILL Pilot is a proof-of-concept effort to understand how Linked Open Data (LOD) may support resource sharing within a consortial environment. This pilot, a collaboration with Share-VDE and ReShare, examines integrating LOD entities into the ReShare platform. It evaluates how entity models can improve discovery and secure inter-institutional e-book delivery.
Moving beyond string-based bibliographic matching, which often fragments representations of the same work, the pilot employs entity-based clustering using the Share-VDE BIBFRAME ontology, a resource description model that reconciles IFLA LRM and RDA. This approach reconciles metadata variations and aggregates holdings at the work level, a capability especially important in a digital context where resources can have multiple formats and versions. To support emerging “e-first” interlibrary loan models, the project examines whether open, interoperable, entity-based metadata can identify related instances and improve matching across manifestations of the same work to support fulfillment.
Framed within PALCI’s Digital Sharing Strategy, this work operationalizes metadata as infrastructure, emphasizing openness, reuse, and interoperability as essential to scalable, vendor-neutral systems. As an exploratory initiative, it advances understanding of how open metadata and LOD frameworks can inform interoperable, efficient, and scalable resource-sharing ecosystems across consortial and institutional boundaries.
-
Nina Servizzi
New York University
Nina Servizzi is Associate Dean for Knowledge Access and Resource Management Services at New York University Libraries. Her work focuses on how libraries and cultural heritage institutions develop and sustain information infrastructures that support research and scholarship. She examines organizational adaptation, systems interoperability, and data governance as essential components of resilient information environments. She serves on the HathiTrust Program Steering Committee, contributes to IFLA, and is past Chair of the Share-VDE Advisory Council.
Modeling, Linking, and Augmenting Video Game Archive: Practices from the RCGS Collection
Digital game archives require more than descriptive cataloging; they call for a metadata ecosystem that supports both conceptual rigor and practical reusability, underpinning computational use. This paper reports on metadata practices developed at the Ritsumeikan Center for Game Studies (RCGS), organized into three interrelated layers: modeling, linking, and augmenting. These practices are implemented in the RCGS’s online catalog service, the RCGS Collection.
At the modeling layer, we develop an extended model based on the IFLA Library Reference Model (LRM) to represent the complex relationships inherent in digital games, including works, variations, packages, and individual items. This approach enables consistent description across heterogeneous archival materials while maintaining compatibility with established bibliographic standards. At the same time, for entities that require subjective interpretation—such as works, genres, franchises, and characters—we design the model to incorporate external authority data. In addition, for the implementation of an online catalog, we employ a “dumb-down” strategy (flattening structured metadata) to improve accessibility for general users.
At the linking layer, we implement a Linked Open Data (LOD) approach that connects local entities to external authority sources such as Wikidata and the Media Arts Database (MADB). This alignment supports the persistence of identifiers, enhances interoperability, and enables data enrichment through the incorporation of collectively curated external knowledge.
At the augmenting layer, we develop a dataset construction pipeline based on text extraction and image embeddings, transforming digitized materials—including packages, manuals, and gameplay images—into research-ready corpora. Through multimodal feature extraction and automated text structuring, archival data can be repurposed as a resource for computational analysis and metadata generation.
These practices are implemented through a SPARQL-enabled LOD infrastructure and a discovery interface based on Omeka S, achieving a balance between a robust identifier infrastructure, machine-readability, and user accessibility. Rather than proposing a prescriptive model, this paper reflects on design decisions and implementation strategies to share insights for building sustainable metadata ecosystems in game archives and related domains.
-
Kazufumi Fukuda
College of Image Arts and Sciences, Ritsumeikan University
Kazufumi Fukuda, Ph.D., is an Associate Professor at Ritsumeikan University specializing in game studies, digital humanities, and knowledge graphs. His research explores the preservation and analysis of video game archives through metadata modeling, linked data, and data science. He is involved in developing game archive infrastructures and contributes to national projects on media arts databases, bridging academic research and practical applications.
Re-using the ParliamentSampo framework and Sampo-UI to publish assembly minutes of the League of Nations (1920-1946) and United Nations (1946-) for Digital Humanities research on the Semantic Web
This presentation shows how the domain-specific “Sampo framework”, here ParliamentSampo for parliamentary speeches and prosopography [1], based on the domain-agnostic Sampo model [2] and Sampo-UI interface model [3-4], can be adapted to new application cases. As an example, our focus here is to show how to publish as a FAIR Linked Open Data (LOD) service the assembly minutes of the League of Nations [5], the predecessor of the United Nations (UN), and to publish next the General Assembly minutes of the UN. As an application on top of the LOD service, the portal League of Nations (1920-1946) Sampo [5] was published on the Web in March 2026. It can be used to answer questions, such as: What have the representatives of Japan talked about? How did the territorial focus of the League of Nations (LoN) shift in time?
In what roles and capacity did Fridtjof Nansen, the Polar explorer, participate in LoN meetings?
More information about the underlying Minutes of Multilateralism initiative:
https://seco.cs.aalto.fi/projects/minutes/
[1] Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Rafael Leal, Matti La Mela, Jouni Tuominen, Henna Poikkimäki and Heikki Rantala: Publishing and Using Parliamentary Linked Data on the Semantic Web: ParliamentSampo System for Parliament of Finland. Semantic Web, vol. 16, no. 1, 2025. DOI: 10.3233/SW-243683.
https://journals.sagepub.com/doi/full/10.3233/SW-243683
[2] Eero Hyvönen: Digital Humanities on the Semantic Web: Sampo Model and Portal Series. Semantic Web – Interoperability, Usability, Applicability, vol. 14, no. 4, pp. 729-744, IOS Press, 2023.
https://journals.sagepub.com/doi/full/10.3233/SW-223034
[3] Heikki Rantala, Annastiina Ahola, Esko Ikkala and Eero Hyvönen: How to create easily a data analytic semantic portal on top of a SPARQL endpoint: introducing the configurable Sampo-UI framework. VOILA! 2023 Visualization and Interaction for Ontologies, Linked Data and Knowledge Graphs 2023, pp. 28-39, CEUR Workshop Proceedings, Vol. 3508, October, 2023.
https://ceur-ws.org/Vol-3508/paper3.pdf
[4] Esko Ikkala, Eero Hyvönen, Heikki Rantala and Mikko Koho: Sampo-UI: A Full Stack JavaScript Framework for Developing Semantic Portal User Interfaces. Semantic Web – Interoperability, Usability, Applicability, vol. 13, no. 1, pp. 69-84, January, 2022. https://journals.sagepub.com/doi/full/10.3233/SW-210428
[5] Petri Leskinen, Eero Hyvönen, Alexandre Lionnet, Blandine Blukacz-Louisfert, Pierre-Etienne Bourneuf, Davide Rodogno, Grégoire Mallard, and Florian Cafiero: A Linked Open Data Service and Semantic Portal to Study the Assembly Minutes and Prosopography of the League of Nations (1920–1946). The Semantic Web: ESWC 2026, Dubrovnik, Croatia, May 10 - 14, 2026, Proceedings, Springer-Verlag, 2026. In press.
https://seco.cs.aalto.fi/publications/2026/leskinen-et-al-lon-2026.pdf
-
Petri Leskinen
University of Helsinki
Petri Leskinen is a postdoctoral researcher at the University of Helsinki and the Geneva Graduate Institute. He completed his doctoral dissertation in 2024 on biographical data and data analysis. His research focuses on biographical data, cultural heritage knowledge graphs, prosopography, and linked open data infrastructures for digital humanities. He has been one of the main engineers in developing systems such as SampoSampo, LetterSampo, and ParliamentSampo to support data-intensive historical research.
Sustainable Linked Open Data Publishing through Static Site Generation: A Decade of Educational LOD Infrastructure
The long-term sustainability of Linked Open Data (LOD) services remains a significant challenge. Many projects rely on complex server-side infrastructures, SPARQL endpoints, or custom applications that become difficult to maintain once project funding or dedicated technical support ends. This presentation introduces a practical approach to sustainable LOD publishing based on ttl2html, a static site generator that transforms RDF/Turtle datasets into human-readable and machine-accessible web resources.
Using this approach, we have developed and continuously maintained an educational LOD infrastructure in Japan since its public launch in 2017. The platform interlinks educational resources such as textbooks, curriculum guidelines, instructional units, and large-scale educational assessments, providing persistent identifiers and Linked Data access through a lightweight and maintainable architecture.
We discuss the design principles behind this approach, including static content generation, reduced operational complexity, and compatibility with Linked Data best practices. We report on long-term maintenance experience and examine how minimizing technical dependencies has contributed to the sustainability of the infrastructure. We also demonstrate how the same framework has been reused in other contexts, including student-led initiatives and library-related LOD projects, illustrating its adaptability and low barrier to adoption.
By sharing lessons learned from a decade of operation, this presentation highlights how reducing technical complexity through static site generation can improve the sustainability of LOD publishing. Our experience further demonstrates that lightweight publishing infrastructures can be readily reused across projects and organizations, providing a practical foundation for the long-term development and preservation of open knowledge infrastructures.
-
Masao Takaku
Institute of Library, Information and Media Science, University of Tsukuba
Masao Takaku is an Associate Professor in the Institute of Library, Information and Media Science at the University of Tsukuba. His research focuses on digital libraries and information organization, with recent work on metadata and knowledge graph development for cultural heritage and educational use. He also teaches information organization and related subjects at both undergraduate and graduate levels and works on knowledge infrastructure design across diverse domains.