elib
DLR-Header
DLR-Logo -> http://www.dlr.de
DLR Portal Home | Imprint | Privacy Policy | Accessibility | Contact | Deutsch
Fontsize: [-] Text [+]

Large language models can outperform humans in social situational judgments

Mittelstädt, Justin and Maier, Julia and Goerke, Panja and Zinn, Frank and Hermes, Michael (2024) Large language models can outperform humans in social situational judgments. Scientific Reports, 14, p. 27449. Nature Publishing Group. doi: 10.1038/s41598-024-79048-0. ISSN 2045-2322.

[img] PDF - Published version
1MB

Official URL: https://www.nature.com/articles/s41598-024-79048-0

Abstract

Large language models (LLM) have been a catalyst for the public interest in artificial intelligence (AI). These technologies perform some knowledge-based tasks better and faster than human beings. However, whether AIs can correctly assess social situations and devise socially appropriate behavior, is still unclear. We conducted an established Situational Judgment Test (SJT) with five different chatbots and compared their results with responses of human participants (N = 276). Claude, Copilot and you.com’s smart assistant performed significantly better than humans in proposing suitable behaviors in social situations. Moreover, their effectiveness rating of different behavior options aligned well with expert ratings. These results indicate that LLMs are capable of producing adept social judgments. While this constitutes an important requirement for the use as virtual social assistants, challenges and risks are still associated with their wide-spread use in social contexts.

Item URL in elib:https://elib.dlr.de/209240/
Document Type:Article
Title:Large language models can outperform humans in social situational judgments
Authors:
AuthorsInstitution or Email of AuthorsAuthor's ORCID iDORCID Put Code
Mittelstädt, Justinjustin.mittelstaedt (at) dlr.dehttps://orcid.org/0000-0002-8419-6842UNSPECIFIED
Maier, JuliaJulia.Maier (at) dlr.dehttps://orcid.org/0000-0003-1387-7939172500551
Goerke, PanjaPanja.Goerke (at) dlr.dehttps://orcid.org/0000-0002-0340-388XUNSPECIFIED
Zinn, FrankFrank.Zinn (at) dlr.dehttps://orcid.org/0000-0002-9999-3318UNSPECIFIED
Hermes, MichaelMichael.Hermes (at) dlr.dehttps://orcid.org/0000-0001-6565-6957UNSPECIFIED
Date:10 November 2024
Journal or Publication Title:Scientific Reports
Refereed publication:Yes
Open Access:Yes
Gold Open Access:Yes
In SCOPUS:Yes
In ISI Web of Science:Yes
Volume:14
DOI:10.1038/s41598-024-79048-0
Page Range:p. 27449
Publisher:Nature Publishing Group
ISSN:2045-2322
Status:Published
Keywords:artificial intelligence, social judgment, human-computer interaction
HGF - Research field:Aeronautics, Space and Transport
HGF - Program:Space
HGF - Program Themes:Earth Observation
DLR - Research area:Raumfahrt
DLR - Program:R EO - Earth Observation
DLR - Research theme (Project):R - Artificial Intelligence
Location: Hamburg
Institutes and Institutions:Institute of Aerospace Medicine > Aviation and Space Psychology
Deposited By: Mittelstädt, Dr. Justin
Deposited On:26 Nov 2024 07:48
Last Modified:28 Nov 2024 12:43

Repository Staff Only: item control page

Browse
Search
Help & Contact
Information
OpenAIRE Validator logo electronic library is running on EPrints 3.3.12
Website and database design: Copyright © German Aerospace Center (DLR). All rights reserved.