Keil, Jan Martin and Berndt, Niklas (2026) Continuous RDF and OWL Vocabulary Quality Checks with LintedData. Helmholtz Metadata Collaboration (HMC) Conference 2026, 2026-04-28 - 2026-04-30, Heidelberg. doi: 10.5281/zenodo.20024644.
|
PDF
177kB | |
|
PDF
1MB |
Abstract
RDF and OWL vocabularies are an important prerequisite for the FAIR representation of metadata. These vocabularies itself must adhere to certain quality standards to be useful. Over the last years, the Semantic Web community has come up with many recommendations as well as anti-patterns for the development of vocabularies. But, checks for compliance to these best practices can still only be automated partially and require considerable effort. However, learning from software development, quality checks must be performed regularly and completely automated to fully come to effect. To address this gap, we develop LintedData, a command line tool for automated quality checks of RDF and OWL-based vocabularies. Currently, LintedData is able to perform more than 60 different quality checks. It covers a large part of the best practices that have been broadly accepted in the Semantic Web community as listed in the Ontology Pitfall Catalog or the OBO Foundry Principles. Due to the command line interface and an available Docker image, LintedData can easily be used in context of Continuous Integration (CI) pipelines to be automatically executed each time changes are pushed to an ontology development repository. Using JUnit XML or Markdown files as output formats enables the direct result presentation in the interface of platforms like GitLab or GitHub. An optionally provided configuration file allows to parameterize individual checks and define which checks to execute for a particular pipeline. During the demonstration we showcase the automated use of LintedData in a CI pipeline of an ontology development repository. Visitors will be able to trigger changes to an ontology, introducing either improvements or new issues, and to experience the automated timely response of LintedData on these changes, pointing to contained problems. LintedData is publicly available under a permissive license on: https://gitlab.com/dlr-dw/linteddata/
| Item URL in elib: | https://elib.dlr.de/223803/ | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Document Type: | Conference or Workshop Item (Poster, Other) | ||||||||||||
| Title: | Continuous RDF and OWL Vocabulary Quality Checks with LintedData | ||||||||||||
| Authors: |
| ||||||||||||
| Date: | 2026 | ||||||||||||
| Refereed publication: | Yes | ||||||||||||
| Open Access: | Yes | ||||||||||||
| Gold Open Access: | No | ||||||||||||
| In SCOPUS: | No | ||||||||||||
| In ISI Web of Science: | No | ||||||||||||
| DOI: | 10.5281/zenodo.20024644 | ||||||||||||
| Status: | Accepted | ||||||||||||
| Keywords: | Ontology Quality Knowledge Graph Quality Ontology Engineering Continuous Integration | ||||||||||||
| Event Title: | Helmholtz Metadata Collaboration (HMC) Conference 2026 | ||||||||||||
| Event Location: | Heidelberg | ||||||||||||
| Event Type: | national Conference | ||||||||||||
| Event Start Date: | 28 April 2026 | ||||||||||||
| Event End Date: | 30 April 2026 | ||||||||||||
| Organizer: | Helmholtz Metadata Collaboration (HMC) | ||||||||||||
| HGF - Research field: | other | ||||||||||||
| HGF - Program: | other | ||||||||||||
| HGF - Program Themes: | other | ||||||||||||
| DLR - Research area: | Digitalisation | ||||||||||||
| DLR - Program: | D - no assignment | ||||||||||||
| DLR - Research theme (Project): | D - no assignment | ||||||||||||
| Location: | Jena | ||||||||||||
| Institutes and Institutions: | Institute of Data Science > Data Management and Enrichment | ||||||||||||
| Deposited By: | Keil, Jan Martin | ||||||||||||
| Deposited On: | 22 Apr 2026 16:47 | ||||||||||||
| Last Modified: | 04 May 2026 15:02 |
Repository Staff Only: item control page