How Neural Networks Detect and Interpret Wordplay: New Insights from HSE Researchers

An international team including researchers from the HSE Faculty of Computer Science has presented KoWit-24, an annotated dataset of 2,700 Russian-language Kommersant news headlines containing wordplay. The dataset enables an assessment of how artificial intelligence detects and interprets wordplay. Experiments with five large language models show that even advanced systems still make mistakes, and that interpreting wordplay is more challenging for them than detecting it. The results were presented at the RANLP conference; the paper is available on Arxiv.org, and the dataset and the code for reproducing the experiments are available on GitHub.
Wordplay refers to deliberate use of language that violates linguistic norms in order to attract attention, entertain, or amuse the reader. It is common in Russian news headlines and can take various forms. For example, the headline ‘Osobo bumazhnye persony’ plays on the phrase ‘Osobo vazhnye persony’ (Russian for ‘very important persons’). The word vazhnye (‘important’) is replaced with bumazhnye (‘paper-related’), which rhymes with the original and shifts the meaning toward the topic of paper production. Another example is ‘Kod naklikal,’ the headline of an article about open-source code. It closely resembles ‘kot naplakal,’ an idiom meaning ‘very little,’ thereby creating a humorous ambiguity.
For human readers, such wordplay in headlines is immediately apparent and requires no explanation. However, large language models such as ChatGPT or GigaChat Max are often at a loss, struggling not only to detect the wordplay but even more so to explain the joke. One reason for this difficulty is the limited humour datasets on which LLMs are trained. In most cases, humour in these datasets is represented by canned internet jokes explicitly labelled as ‘jokes,’ which is insufficient for the models to learn why something is funny. In addition, such datasets contain almost no annotation—there are no machine- or human-readable layers of description indicating whether wordplay is present, what type of technique is used, what the headline refers to, and so on.
Researchers from the HSE Faculty of Computer Science, in collaboration with colleagues from IT:U—Interdisciplinary Transformation University Austria—and independent researchers, have created KoWit-24, a dataset dedicated to wordplay. It comprises 2,700 headlines from the Russian business daily Kommersant published between January 2021 and December 2023, along with contextual information: each headline is accompanied by a short description of the news story (the lead) and a summary. For each instance of wordplay, the authors manually annotated the type of technique, identified the anchors—the words that trigger the wordplay—and, where possible, linked the original expressions to relevant Wikipedia articles.
The authors adopted linguist Alan Scott Partington’s definition of wordplay, according to which wordplay occurs when the same expression can be interpreted in at least two ways and this effect is intentional. Wordplay can arise in several ways. One case involves ambiguity inherent in a word or its sound. For example, in the headline ‘Volgu ne mogut zastavit’ tech’ bystree,’ the word Volgu (Volga) refers both to the river and to a federal highway with the same name. Another case involves a slight modification of a well-known phrase or title, in which the author alters the wording while relying on the reader to recognise the original and complete the joke. For instance, ‘Missiya sokratima’ alludes to ‘Missiya nevypolnima,’ the Russian title of the film Mission: Impossible, while the headline itself suggests that a diplomatic mission can be downsized.
The researchers also distinguished ‘nonce words’—coined for a single occasion—and oxymorons, which combine two contradictory meanings. This approach not only allowed them to collect and describe examples but also to compare the performance of different language models.
After annotation, the authors tested the dataset on five LLMs: GPT-4o, YandexGPT-4, GigaChat Lite, GigaChat Max, and Mistral NeMo. Each model was provided with a headline and the corresponding news lead and asked to perform two tasks: first, to determine whether the headline contained wordplay, and second, to interpret it by identifying the original phrase or reference. The researchers compared the effects of two types of prompts: a simple prompt asking whether the headline contained wordplay, and an extended prompt providing a definition along with examples of different wordplay types. The extended prompt improved performance on the detection task for three of the five models, while GPT-4o demonstrated the strongest performance in both detection and interpretation. For all models, interpreting the source of the joke proved significantly more difficult than simply detecting the presence of wordplay.
Pavel Braslavski
‘KoWit-24 addresses two key limitations of earlier datasets: it provides context for each headline and includes multi-level annotation. This transforms a collection of examples into a full-fledged “testbed” for AI. It now allows for an objective comparison of models—whether a model can detect wordplay, identify the anchor, and correctly recall the original phrase or reference. Such verifiable metrics not only allow for a more accurate evaluation of current systems but also support their intentional improvement through selection of prompts, training examples, and fact-checking strategies. In the future, we plan to investigate whether this dataset can be used to enhance humour generation,’ says Pavel Braslavski, Associate Professor at the HSE Faculty of Computer Science and co-author of the paper.
In addition, the dataset establishes a common and transparent standard for evaluation, as researchers use the same data and experimental scripts. This reduces variability in the results and helps develop models that better understand natural language, rather than merely following the logical structure of the text.
See also:
HSE Economists Find That Auction Prices Depend on Artist’s Life Story
Researchers from the Centre for Big Data in Economics and Finance at the HSE Faculty of Economic Sciences have found that facts from an artist’s life are statistically significant in pricing a painting, alongside such traditional characteristics as the material, the size of the canvas, or the presence of the artist’s signature. This conclusion is based on an analysis of prices for 15,000 works by 158 artists sold since 1999 by the major auction houses Sotheby’s and Christie’s. The article has been published in the journal Empirical Studies of the Arts.
HSE Physicists Propose Unified Theory for Describing Electric Double Layer
To develop more efficient batteries and catalysts, it is essential to understand the processes occurring at the metal–solution interface in the electric double layer (EDL). Physicists at HSE MIEM have proposed a unified theoretical model of the EDL that simultaneously accounts for selective adsorption of ions on the surface and partial charge transfer between ions and the metal—phenomena that had previously been described separately. The model’s predictions are consistent with experimental data. In the future, it may be used in the development of batteries, supercapacitors, and catalysts. The study has been published in Electrochimica Acta.
HSE Researchers Experimentally Demonstrate Positive Effects of Urban Parks on the Brain
Scientists at HSE University have investigated the effect of parks on the cognitive and emotional resources of city dwellers. The researchers compared brain electrical activity in 30 participants while they watched videos of walks through parks and along busy highways. The results showed that green urban environments with trees produce a consistent effect across individuals, helping the brain calm down and relax. By contrast, walks along busy streets were found to be distracting. The findings have been published in Scientific Reports.
Fourth Robotics Festival to Take Place at HSE University
From April 1 to 3, 2026, the HSE Pokrovka Campus will host the Fourth Robotics Festival—one of the key events organised by the Faculty of Computer Science for anyone interested in robotics, programming, and engineering creativity. The festival will bring together robotics competitions, discussions, educational formats, and demonstrations of technological developments.
HSE University Scholars Uncover E-Learning Preferences of Top Students
HSE University experts have analysed students’ digital footprints and shown for the first time that final grades depend on one’s personal approach to an online course. Balanced students have proven to be more successful than those who follow a more traditional and practical approach. The findings from this study will help create a more adaptive and personalised educational system. This research has been published in the journal The Internet and Higher Education.
HSE Scientists Develop Method to Stabilise Iodine in Solar Cells
Scientists at HSE MIEM, in collaboration with colleagues from China, have developed a method to improve the durability of perovskite solar cells by addressing iodine loss from the material. The researchers introduced quaternary ammonium molecules into the perovskite structure; these molecules form strong electrostatic pairs with iodine ions, effectively anchoring them within the crystal lattice. As a result, the solar cells retain more than 92% of their power after a thousand hours of operation at 85°C. The study has been published in Advanced Energy Materials.
HSE Researchers Create Genome-Wide Map of Quadruplexes
An international team, including researchers from HSE University, has created the first comprehensive map of quadruplexes—unstable DNA structures involved in gene regulation. For the first time, scientists have shown that these structures function in pairs: one is located in a DNA region that initiates gene transcription, while the other lies in a nearby region that enhances this process. In healthy tissues, quadruplexes regulate tissue-specific genes, whereas in cancerous tissues they influence genes responsible for cell growth and division. These findings may contribute to the development of new anticancer drugs that target quadruplexes. The study has been published in Nucleic Acids Research.
Mathematician from HSE University–Nizhny Novgorod Solves Equation Considered Unsolvable in Quadratures Since 19th Century
Mathematician Ivan Remizov from HSE University–Nizhny Novgorod and the Institute for Information Transmission Problems of the Russian Academy of Sciences has made a conceptual breakthrough in the theory of differential equations. He has derived a universal formula for solving problems that had been considered unsolvable in quadratures for more than 190 years. This result fundamentally reshapes one of the oldest areas of mathematics and has potential to have important implications for fundamental physics and economics. The paper has been published in Vladikavkaz Mathematical Journal.
Scientists Reveal How Language Supports Complex Cognitive Processing in the Brain
Valeria Vinogradova, a researcher at HSE University, together with British colleagues, studied how language proficiency affects cognitive processing in deaf adults. The study showed that higher language proficiency—regardless of whether the language is signed or spoken—is associated with higher activity and stronger functional connectivity within the brain network responsible for cognitive task performance. The findings have been published in Cerebral Cortex.
HSE AI Research Centre Simplifies Particle Physics Experiments
Scientists at the HSE AI Research Centre have developed a novel approach to determining robustness in deep learning models. Their method works eight times faster than an exhaustive model search and significantly reduces the need for manual verification. It can be applied to particle physics problems using neural networks of various architectures. The study has been published in IEEE Access.


