Hamm, Andreas (2022) Strategy Comparison for Semantic Zero-Shot Taxonomy Filters. 4th International Open Search Symposium (OSSYM 2022), 2022-10-10 - 2022-10-12, Genf, Schweiz.
PDF
497kB |
Kurzfassung
In information retrieval, categorised filtering based on subject-related taxonomies is a way of supporting users in formulating their information needs in an efficient way. Progress in machine learning classification algorithms has made it possible to automatize the task of tagging or category assignment in a generally acceptable manner, provided a sufficient number of labelled example documents from all categories is put into the training process. The latter requirement, however, is a serious obstacle for a flexible use over a broad range of domains and in areas with limited amount of training data available. This contribution shows the outcome of experiments with transformer-based zero-shot text classification methods which work without any specific training. Using taxonomy descriptions, sentence aggregation with saturation, and hierarchical consistency, this approach can be enhanced to perform nearly as well as more elaborate classifiers.
elib-URL des Eintrags: | https://elib.dlr.de/190966/ | ||||||||
---|---|---|---|---|---|---|---|---|---|
Dokumentart: | Konferenzbeitrag (Vortrag) | ||||||||
Titel: | Strategy Comparison for Semantic Zero-Shot Taxonomy Filters | ||||||||
Autoren: |
| ||||||||
Datum: | Oktober 2022 | ||||||||
Referierte Publikation: | Ja | ||||||||
Open Access: | Ja | ||||||||
Gold Open Access: | Nein | ||||||||
In SCOPUS: | Nein | ||||||||
In ISI Web of Science: | Nein | ||||||||
Status: | akzeptierter Beitrag | ||||||||
Stichwörter: | information retrieval; text classification; transformer-based language models; taxonomies | ||||||||
Veranstaltungstitel: | 4th International Open Search Symposium (OSSYM 2022) | ||||||||
Veranstaltungsort: | Genf, Schweiz | ||||||||
Veranstaltungsart: | internationale Konferenz | ||||||||
Veranstaltungsbeginn: | 10 Oktober 2022 | ||||||||
Veranstaltungsende: | 12 Oktober 2022 | ||||||||
Veranstalter : | Open Search Foundation; CERN | ||||||||
HGF - Forschungsbereich: | keine Zuordnung | ||||||||
HGF - Programm: | keine Zuordnung | ||||||||
HGF - Programmthema: | keine Zuordnung | ||||||||
DLR - Schwerpunkt: | Digitalisierung | ||||||||
DLR - Forschungsgebiet: | D - keine Zuordnung | ||||||||
DLR - Teilgebiet (Projekt, Vorhaben): | D - MeToDiO, D - OpenSearch@DLR | ||||||||
Standort: | Köln-Porz | ||||||||
Institute & Einrichtungen: | Institut für Softwaretechnologie > Intelligente und verteilte Systeme Institut für Softwaretechnologie | ||||||||
Hinterlegt von: | Hamm, Dr. Andreas | ||||||||
Hinterlegt am: | 29 Nov 2022 11:58 | ||||||||
Letzte Änderung: | 24 Apr 2024 20:52 |
Nur für Mitarbeiter des Archivs: Kontrollseite des Eintrags