United States

How do language models assign responsibility and reward, and is it similar to how humans do it? We instructed three state-of-the-art large language models to assign responsibility (Experiment 1) and reward (Experiment 2) to agents in a collaborative task. We then compared the language models’ responses to seven existing cognitive models of responsibility and reward allocation. We found that language models mostly evaluated agents based on force (how much they actually did), in line with classical production-style accounts of causation. By contrast, humans valued actual and counterfactual effort (how much agents tried or could have tried). These results indicate a potential barrier to effective human-machine collaboration.

CogSci 2025

Language models assign responsibility based on actual rather than counterfactual contributions

social cognition

computational modeling

psychology

causal reasoning

artificial intelligence

poster

### Welcome to CogSci Conference 2025!

The 47th Annual Meeting of the Cognitive Science Society was a hybrid meeting held in San Francisco. 

<div style="position:relative;padding-top:0;width:900px;height:500px;"><iframe style="position:absolute;border:none;width:100%;height:100%;left:0;top:0;" src="https://online.fliphtml5.com/ebtyf/amvr/"  seamless="seamless" scrolling="no" frameborder="0" allowtransparency="true" allowfullscreen="true" ></iframe></div>

#### About

The Cognitive Science Society brings together researchers from around the world who hold a common goal: understanding the nature of the human mind. The mission of the Society is to promote Cognitive Science as a discipline, and to foster scientific interchange among researchers in various areas of study, including Artificial Intelligence, Linguistics, Anthropology, Psychology, Neuroscience, Philosophy, and Education.

The Society is a non-profit professional organization and its activities include sponsoring an annual conference and publishing the journals Cognitive Science and TopiCS.

#### Our History 

* **Society Creation**<br>
The Society was incorporated as a 501(c)(3) non-profit professional organization in Massachusetts in 1979. The organizing committee included Roger Schank, Allan Collins, Donald Norman, and a number of other scholars from psychology, linguistics, computer science, and philosophy. 
<br><br>
* **Conference Creation**<br>
The first conference on cognitive science was held at La Jolla, California in August, 1979, and has occurred annually since then. The proceedings of each conference are published, and those from most years are available through Lawrence Erlbaum Associates, Inc. The annual proceedings of the Cognitive Science Conference represent a major source of information on new work and new ideas in the scientific study of thinking. In 1990, the Society, with help from an anonymous donor, established the David Marr Prize for the best student paper at each annual meeting.
<br><br>
* **Journal Creation**<br>
The Journal, Cognitive Science, began publication in 1976, and is now published by Wiley-Blackwell. The Executive Editor is currently Richard P. Cooper of Birkbeck, University of London, and there are 18 Associate Editors and a 30-member editorial board. It serves as the premier outlet for research reports that intersect two or more disciplines. Copyrights for articles published in the journal are held by the Society. The Governing Board of the Cognitive Science Society voted in late 2006 to found a new journal, Topics in Cognitive Science (topiCS). The Editor in Chief is Wayne Gray, Cognitive Science Department, Rensselaer Polytechnic Institute. The journal seeks to fill a niche not occupied by Cognitive Science Journal or other cognitive science journals. Membership in the Society includes a subscription to Cognitive Science and TopiCS. Copyrights for articles published in the journal are held by the Society.
<br><br>

#### Code of Conduct

By attending the CogSci 2025 Conference, you are required to adhere to the society’s **[Code of Conduct](https://drive.google.com/file/d/1ChPuihLy6jE_BWqfO7J2KKgX35JW2zsM/view?usp=sharing)**.
<br><br>


You need to log in with the email address you registered with. 

Login credentials were sent to you from Underline -  subject line "Welcome to the CogSci 2025 Conference". Please be sure to check your spam/promotional inbox  if you do not see an email confirmation right away.





Please log in to join this event.

To access the site, please register [**here**](https://cognitivesciencesociety.org/registration/).

If you are registered and feel like you are seeing this message by mistake, please make sure you are logged in with the same email that you registered with. 

Please register!

The 47th Annual Meeting of the Cognitive Science Society presents the latest research across cognitive science and highlights the theme of Cognition in Context.

Pragmatic atypicality is widely considered to be a central characteristic of autism. This is often explained as a consequence of Theory of Mind deficits. However, this account is flawed and biased. In this paper, we revisit the Double Empathy Problem and provide an experience-first approach to autistic pragmatics. We start with proposing a mechanistic explanation of a link between experiential differences and intentionality understanding in linguistic contexts using the Interpretive Sensory Access theory. Then, we explain how theories of common ground in communication involve factors beyond intention recognition and even beyond cooperation, highlighting how the egocentric nature of communication is relevant to one’s attention and experiences. Taken together, we put forward an experience-based approach to understand autistic pragmatic atypicalities. This view is compatible with many other non-linguistic characteristics well-documented in autism, and prioritizes the experience of autistic people, instead of framing it as a communication disorder with a “mind-reading failure”.

An Experience-First Approach to Autistic Pragmatics 

People can adjust how fast they update task rules, depending on the volatility of their environment. We investigated whether this adaptivity is primarily driven by recently experienced volatility in task demands, or can also be shaped by learned, environment-specific associations with expected levels of volatility. To this end, we trained participants on a Wisconsin Card Sorting Task where different environments required different speeds of task rule updating. We demonstrate that, initially, participants updated strategies depending on the most recent experienced levels of volatility and feedback (Experiment 1). However, after extensive (four days) training (Experiment 2), participants also developed environment-specific associations. Our findings provide important insights in how people learn to regulate cognitive flexibility.

Learning task rule updating strategies requires extensive practice 

Word meanings are rarely transparent from their extralinguistic contexts. How children learn words from an input with “low-informative” (LI) events is of interest because even adults struggle to learn from LI events (Gleitman & Trueswell, 2020; Medina et al., 2011). This study revisited LI events’ contribution to learning by probing what can be gleaned from LI events even if they don’t yield exact meanings. Adults (N = 120) learned words (e.g., “modi”) that had English meanings (e.g., “apple”) from LI events. Participants then both guessed the word’s exact meaning and rated several candidate meanings. Although LI events failed to yield accurate mappings of meanings, they led to representations (derived via the ratings) that were semantically aligned with those of the true meanings. These results highlight the potential for LI events to get learning off the ground and the implications of viewing word learning as more than a mapping problem.

Beyond Word Meaning Mappings: The Role of Low-Informative Events in Conceptual Alignment

The current study investigates how pronominal ambiguity is resolved in real-time, focusing on the role of referent bias and task context. In two self-paced reading experiments, we tested whether ambiguity leads to processing benefits or costs modulated by the presence of a biased referent and the task manipulations. Experiment 1 showed that the ambiguity advantage emerges only when a biased referent is not selected, supporting reanalysis-based accounts such as the unrestricted race model (Van Gompel et al., 2000, 2001, 2005). Experiment 2, however, revealed a delayed ambiguity penalty, suggesting task-induced shifts in processing strategy that better fit a delayed interpretation account. These findings highlight that pronominal ambiguity resolution may involve two processing mechanisms shaped by the parser's evaluation space and the timing of selection.

Who and when gets the race? Two processing routes for the advantages and penalties of pronominal ambiguity resolution

People regularly make inferences about objects in the world that they cannot see by flexibly integrating information from multiple sources: auditory and visual cues, language, and our prior beliefs and knowledge about the scene. How are we able to so flexibly integrate many sources of information to make sense of the world around us, even if we have no direct knowledge? In this work, we propose a neurosymbolic model that uses neural networks to parse open-ended multimodal inputs and then applies a Bayesian model to integrate different sources of information to evaluate different hypotheses. We evaluate our model with a novel object guessing game called "What's in the Box?'' where humans and models watch a video clip of an experimenter shaking boxes and then try to guess the objects inside the boxes. Through a human experiment, we show that our model correlates strongly with human judgments, whereas unimodal ablated models and large multimodal neural model baselines showed poor correlation.

What’s in the Box? Reasoning about Unseen Objects from Multimodal Cues

Xenophobia and anti-immigrant sentiments have been increasing in Western democratic countries, and it is important to understand how messaging can improve attitudes towards immigrants. Past studies show prior attitudes are associated with how individuals evaluate related arguments. The present study (N = 349) explores if people’s prior attitudes influence how they evaluate the strength of arguments in the context of immigration. We also test whether the style of argument (i.e., narrative or statistical) influences argument evaluation. We measured participants’ attitudes towards immigrants before and after an argument evaluation task, where participants rated the quality of a narrative and statistical argument.  Participants with high pre-existing negative attitudes towards immigrants rated pro-immigrant arguments poorly and anti-immigrant arguments strongly, and we see the opposite relationship for participants with pre-existing positive attitudes towards immigrants.  Our findings demonstrate that people can evaluate the same arguments about immigrants very differently depending on their pre-existing attitudes and that argument style can affect argument evaluation.

Reducing Negative Attitudes Towards Immigrants – The Role of Prior Attitudes and Argument Style 

Generics, general statements about categories, are believed to transmit essentialist beliefs---the idea that things have a hidden true nature. Research suggests that people essentialize natural (biological and non-living) and social kinds, but not artifacts. Previous studies using small datasets found that generics are often used to describe animate beings in speech to children. Using a larger corpus of children's books and parent speech, we examined a wider range of kinds and generalizing statements (habituals and universals). Our results show that generics are more likely used for biological kinds than artifacts and that their use increases in parent speech as children age. However, generics weren't more likely used for non-living or social kinds than artifacts. Habituals, at least in speech, were more likely used for social kinds than artifacts. Generalizing statements were more likely used for about non-living natural kinds than artifacts. These findings inform the debate over whether generics transmit essentialist beliefs.

Generics revisited: Analyzing generalizations in children's books and caregivers' speech

Music-emotion recognition, the ability to perceive emotions in music, has emerged as a means of understanding emotion beyond verbal language, specifically for individuals with special educational needs (SEN). However, there has been little focus on delineating emotion through quantified music features for a systematic comparison between different SEN groups. This study identified specific musical features and examined the different music-emotion processing patterns in 3-to 10-year-old Chinese children with and without SEN. Participants completed a forced-choice task by identifying four emotions involving happiness, sadness, anger, and fear from Western classical music. Through integrating a biologically-inspired filterbank into music information retrieval analysis, the result revealed that musical features, such as spectral density, contributed to human emotional recognition. In addition, children with SEN exhibited distinct confusion patterns in some emotion pairs compared to their typically developing counterparts. These findings demonstrated a novel approach to investigating musical-emotional recognition across the developmental span.

From hearing to feeling: Quantifying music-emotion and examining the different processing patterns in children with special educational needs (SEN)

Semantic vectors derived from training on large text corpora (e.g., word2vec, BERT) are widely used as a methodological tool to model similarity of concepts. Recent work has demonstrated that a small amount of human training data can be used to fine-tune these vectors for modeling specific tasks. For example, human ratings of pairwise similarity can be used to estimate a set of dimensional weights, and these weights can improve estimates of human similarity ratings for held-out pairs. We applied this methodology to the semantic fluency task (listing items from a category) and find that category- specific weights can be used to identify the semantic category of a fluency list. The results have methodological implications for modeling retrieval in semantic fluency tasks, estimating semantic representations, and identifying semantic clusters and switches in fluency data.

Fine-tuning semantic vectors with semantic fluency data

How do people’s understandings of abstract concepts evolve through interacting with others? While prior research has focused on individual cognitive processes, how people reflect on and adapt knowledge in social contexts remains underexplored. This study examines how shared interactive experiences during a word-guessing game influence semantic representations of abstract words. Participants completed a spatial arrangement task (SpAM) before, immediately after, and two weeks after the game. Abstract words used as game targets underwent significant positional changes, indicating semantic reorganization. Semantic alignment between game partners was stronger than between non-partners, as measured by a property listing task (PLT), highlighting the role of shared interaction in driving semantic changes. Additionally, in-game semantic alignment measures predicted post-game performance in SpAM and PLT, suggesting that dyadic interaction quality influenced the magnitude of semantic change. These findings provide empirical evidence for the socially driven and dynamic nature of abstract concept representations in collaborative contexts.

Downloads

Next from CogSci 2025

An Experience-First Approach to Autistic Pragmatics

.css-70qvj9{display:-webkit-box;display:-webkit-flex;display:-ms-flexbox;display:flex;-webkit-align-items:center;-webkit-box-align:center;-ms-flex-align:center;align-items:center;}Downloads

Next from CogSci 2025

An Experience-First Approach to Autistic Pragmatics

Downloads