Hamm, Andreas (2022) Strategy Comparison for Semantic Zero-Shot Taxonomy Filters. 4th International Open Search Symposium (OSSYM 2022), 2022-10-10 - 2022-10-12, Genf, Schweiz.
|
PDF
497kB |
Abstract
In information retrieval, categorised filtering based on subject-related taxonomies is a way of supporting users in formulating their information needs in an efficient way. Progress in machine learning classification algorithms has made it possible to automatize the task of tagging or category assignment in a generally acceptable manner, provided a sufficient number of labelled example documents from all categories is put into the training process. The latter requirement, however, is a serious obstacle for a flexible use over a broad range of domains and in areas with limited amount of training data available. This contribution shows the outcome of experiments with transformer-based zero-shot text classification methods which work without any specific training. Using taxonomy descriptions, sentence aggregation with saturation, and hierarchical consistency, this approach can be enhanced to perform nearly as well as more elaborate classifiers.
| Item URL in elib: | https://elib.dlr.de/190966/ | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Document Type: | Conference or Workshop Item (Speech) | ||||||||
| Title: | Strategy Comparison for Semantic Zero-Shot Taxonomy Filters | ||||||||
| Authors: |
| ||||||||
| Date: | October 2022 | ||||||||
| Refereed publication: | Yes | ||||||||
| Open Access: | Yes | ||||||||
| Gold Open Access: | No | ||||||||
| In SCOPUS: | No | ||||||||
| In ISI Web of Science: | No | ||||||||
| Status: | Published | ||||||||
| Keywords: | information retrieval; text classification; transformer-based language models; taxonomies | ||||||||
| Event Title: | 4th International Open Search Symposium (OSSYM 2022) | ||||||||
| Event Location: | Genf, Schweiz | ||||||||
| Event Type: | international Conference | ||||||||
| Event Start Date: | 10 October 2022 | ||||||||
| Event End Date: | 12 October 2022 | ||||||||
| Organizer: | Open Search Foundation; CERN | ||||||||
| HGF - Research field: | other | ||||||||
| HGF - Program: | other | ||||||||
| HGF - Program Themes: | other | ||||||||
| DLR - Research area: | Digitalisation | ||||||||
| DLR - Program: | D - no assignment | ||||||||
| DLR - Research theme (Project): | D - MeToDiO, D - OpenSearch@DLR | ||||||||
| Location: | Köln-Porz | ||||||||
| Institutes and Institutions: | Institute of Software Technology > Intelligent and Distributed Systems Institute of Software Technology | ||||||||
| Deposited By: | Hamm, Dr. Andreas | ||||||||
| Deposited On: | 29 Nov 2022 11:58 | ||||||||
| Last Modified: | 27 Feb 2025 15:04 |
Repository Staff Only: item control page