
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
Psychopathology, how we measure it and our conceptualization of its structure, is thought to be well reflected in natural language — an idea known as the Lexical Hypothesis. Recent advances in machine learning and artificial intelligence provide opportunities to explore this connection quantitatively. Using a Large Language Model, we extracted sentence embeddings for the items of three well validated measures of psychopathology measuring Externalizing (ESI), Internalizing (IDAS), and Personality Disorders (PID-5). We analyzed the semantic relationships between the items in these inventories in an attempt to predict patterns of association between self-report responses in a previously collected sample of participants responding to these measures. Our analysis revealed moderate correlations between the semantic relationships and item-pair response distributions for all three measures. Applying factor analysis to the semantic embeddings, we were able to extract the hierarchical structure of these measures: the inferred structures in some cases reflect similar ones based on psychometric analyses and also shed light on the ways that key dimensions of psychopathology are encoded differentially in modern language models. Our findings align well with the Lexical Hypothesis.
Authors:
Maria Martin Lopez: University of California, Berkeley; Keanan Joyner: University of California, Berkeley; Bill Thompson: University of California, Berkeley
