Skip to main content
Språkbanken Text is a part of Språkbanken.

Research meetings

SpråkbankenText's research meetings during Spring 2025, will take place on Tuesdays Thursdays, from 10:15 to 11.30 (occasionally to 12:00), typically in room F314, unless otherwise announced. The meetings are normally open for all members of SpråkbankenText. On some occasions, we invite a broader audience, these public seminars are announced in SpråkbankenText's calendar.

Please update the list of conference deadlines.

The schedule for the Spring of 2025 is as follows:

Date Topic Comments
January
9
15 <extra> Conference: NEWSPAPERS, MAGAZINES & AI MODELS: TRAINING AND (RE-)USE IN THE DIGITAL HUMANITIES Deadline f 300w abstracts More information here: Submissions
16 Planning meeting (internal) as well as discussing (RJ) grant applications (ideas or drafts) All (researchers and PhD students)
22 <extra> Title: An overview of Burrows’ Delta for authorship attribution (online) 9-10 am CET - Register here
23 <OBS! Room: J236>
28 <extra> Riksbankens Jubileumsfond (RJ) project Closes at: 15:00
30 Discussion of The new ethics law (Sasha leads) "PM: En ny lag om forskningsetiska krav på och etikprövning av forskning som avser människor"
31 <extra> Symposium: Reading Concordances in the 21st Century; Deadline f 500w abstracts More information here: Submissions
February
6 (guest) AI Competence Center (AICC) at Sahlgrenska University Hospital (SU). Research Collaboration Meeting
6-7 <extra> Forum för Textforskning (FoT) 17 Göteborg, more info here
13 Sportlov Sportlov
15 <extra> ACL-2025: Submission Deadline (Main Conference): February 15 More information here: Submissions
20 Postoponed: Whisper - Speech recognition Herbert: Using Whisper - An In-depth introduction and hands-on experience
25 <extra> NLP in medicine, life and environmental science (Workshop) Time: 10:00 to 13:30. More information here: online for free - requires registration
27 SBX papers at the NODALIDA main conference or/and the NODALIDA satellite workshops Presentation of drafts (Maria x2; Anna, Emilie, Arianna)
March
04 <extra> Swedish Research Council (VR) project (humanities and social sciences) Closes at: <14:00> More information here.
6 Postponed: 1. From 'raw text' to a searchable format in Korp (Martin and Anne). 2. Preliminary: Initial discussion for the PhD candidates - deadline for applications: 3/3 (Dimitris). NB: the second topic [2] depend whether Dimitris will make it to have proper information before the 6th!
13 First meeting dedicated to the PhD candidates (10.30-12) and later (same date), Herbert: Whisper and KB-Whisper - Speech recognition Time: 13:00 to 15:00; Location: J233. Using Whisper - An In-depth introduction and hands-on experience.
20 Forskarutbildningsfrågor VP forskutb besöker - (Gerlof is the contact person)
24 <extra> Anna Lindahl final seminar Obs! Monday at J431
27 GU lawyers: Erica Schweder, Ellen Nordback, Alessandra Rodrigues Padilha (Elena is the contact person). Room J439
April
3 Information/initial discussion/initial ranking of the PhD candidates Elena, Shafqat and Niklas
10 Some preparations for the big talk about SBX and AI on May 5: "Ett HUM om: Göteborgare vs AI – vem kan språket bäst?" Sasha (in Swedish)
11 <extra> CLARIN-2025 Deadline f extended abstracts More information here: Submissions
17 Påsklov / Easter red days
23 <extra> Risks and Benefits with AI in Research - Ethical Perspectives 13.00 to 16.15: (free) registration required here: Submissions.
24 Discussion and Ranking of the PhD candidates Only senior researchers and the PhD representative
May
1 Public Holiday Public Holiday
5 <extra> SBX event: The Language Bank's Open House Humanisten, main entrance area.
5 <extra> Ett HUM om: Göteborgare vs AI – vem kan språket bäst? Trappscenen, Stadsbiblioteket: 18:00-19:00 (Sasha).
8 Revealing Semantic Variation In Swedish Using Computational Models Of Semantic Proximity Shafqat
14 <extra> Application deadline: RJ Infrastructure for Research 2025 Closes at: <15:00> More information here.
14-15 <extra> Pedagogical Application of Constructicography Workshop 2025 More information here.
15 From 'raw text' to a searchable format in Korp Martin and Anne
21 <extra> Challenges in sarcasm handling by language models (and humans) At 13:15-15:00; Hyewon Jang (University of Konstanz): Attend in person at room J411 or via Zoom - CLASP guest.
22
29 Public Holiday Public Holiday
June
4 <OBS! Wednesday>Korpi: the lightweight Korp version for the public Peter Ljunglöf
5 Open access to research data and automatic pseudonymization. Two years with Mormor Karl project.
This talk will be devoted to the challenges of working with data that contains personal information. I will describe a set of experiments with automatic pseudonymization that we have performed within Mormor Karl project. Among others, experiments with detection and labeling of personal categories using BERT models (Szawerna et al. 2024, 2025), attempts att using LLMs to "fill in the blanks" when substituting personal information with pseudonyms (yet unpublished) and a study on whether pseudonyms can provoke biased automated classifications (Muñoz Sánchez et al. 2024). The choice of models for our experiments is currently dictated by the sensitive nature of our data. To extend the choice from open source to proprietary models, we are currently collecting a "pseudo-corpus" with fictitious personal information that we will be able to share freely for future research (you are welcome to contribute to the pseudo-corpus collection as well). Finally, in this talk I will name several strategies to unify the research on automatic pseudonymization, and outline further challenges, needs for standardization and a proposal of a shared task.
Szawerna et al. 2025. The Devil’s in the Details: the Detailedness of Classes Influences Personal Information Detection and Labeling.
Szawerna et al. 2024. Detecting Personal Identifiable Information in Swedish Learner Essays.
Muñoz Sánchez et al. 2024. Did the Names I Used within My Essay Affect My Score? Diagnosing Name Biases in Automated Essay Scoring?.
Elena Volodina
12
Investigating Linguistic Abilities of LLMs for Native Language Identification.
Large language models (LLMs) often achieve high performance in native language identification (NLI) benchmarks by leveraging superficial contextual cues such as names, locations, and cultural stereotypes, rather than the underlying linguistic patterns indicative of native language (L1) influence. To improve robustness, previous work has instructed LLMs to disregard such clues. In this work, we demonstrate that this strategy is unreliable, and predictions can be easily altered by misleading hints. To address this problem, we introduce an agentic NLI pipeline inspired by forensic linguistics, where specialized agents accumulate and categorize diverse linguistic evidence before a final overall assessment.
Ahmet Yavuz Uluslu (University of Zürich, Switzerland)
<OBS! Room: J406>
18 The 50th anniversary of Språkbanken 🥇🍾🎂 Wallenberg conference centre, starting in the afternoon and include dinner 🥇🍾🎂🍴🍽
19
26
July
3 Summer ... Summer ...





The schedule for the Autumn of 2024 was as follows:
Date Topic Comments
September
10 <extra> SLTC-related meeting All (particularly those that are about to submit to the conference)
17 Planning meeting (internal) All (researchers and PhD students)
24 The HUMINFRA handbook Discussing/presenting chapters fr the group
26 <extra>Workshop on social network analysis Approaches to computer-assisted tools for social network analysis: 1) the exploratory network analysis software Gephi, and 2) an overview and test of other tools on how social network analysis can be integrated, combined, and explored concerning various research interests. Apply to the workshop to pernilla.severson@lnu.se
October
1 Maria's idea seminar This talk is hybrid (link). There is no text to read in advance. The presentation will be in English, but questions may be in Swedish - Room: C362. (Rescheduled from Higher Seminar)
8 General researcher meeting Talking about what we are working on
9 <extra>"Robustness in NLP: challenges and opportunities" (13:15-15:00) Rob van der Goot (IT Univ., Copenhagen) in-person (room J411) seminar or via
15 The CLASP annual conference, 14-15 Oct Title: "Multimodality and Interaction in Language Learning (MILLing)". Also note: Deadline for NODALIDA; currently the 21/10 extended to the 27/10.
21 13:15 – 15:00 in J444: Arianna Masciolini's 'middle seminar' Title: Cross-lingual approaches to computational Second Language Acquisition: the potential of Universal Dependencies.
22 10:15 – 11:30 in J303 SBX local IT infrastructure (Herbert) --- 13:00 – 15:00 in J406 Helpdesk (case management system) (Leif-Jöran) Servers and the upcoming helpdesk system at Språkbanken Text
23 < extra> CLASP seminar: 1:15 PM - 3:00 PM (room J411 or via zoom) Yufang Hou from IBM Research & UKP Lab-TU Darmstadt: The AI-Augmented Scientist: Building Computational Models for Effective Knowledge Synthesis and Dissemination”
29 No meeting - autumn holidays/Höstlov ...for some of group's members, so the meeting is cancelled.
November
5 See the blog post (SWETALY meeting)
12 Giorgia Albertin, PhD student. Department of Classical Philology and Italian Studies. University of Bologna Title: About cohesion and other discourse properties in cognitive impairment.
18 <extra> 13:15 – 15:00 in J431 or on Zoom: Ricardo Muñoz's 'middle seminar' Title: From Algorithms to Classrooms: NLP for Second Language Learning as a Case Study for Bias and Fairness in AI.
19 AITrust workshop at WASP-HS and AI for Humanity and Society WASP-HS@Linholmen Conference Centre
26 SBX papers at the SLTC main conference or/and the SLTC satellite workshops Dry run for the 10th Swedish Language Technology Conference (SLTC); 27–29 Nov., at Linköping Univ.
27 <extra> 10:15 – 12:00 on Zoom: Tom Södahl Title: Don't Mention the Norm - On Reporting Bias and Social Bias in Humans and Language Models
29 <extra> 12:00 – 13:00 on EA, EDIT-huset, Chalmers: Kenny Smith, U of Edinburgh Title: Language learning, language use, and the evolution of linguistic structure (Experiments based on AI)
December
3 Canceled: Discuss upcoming research grant proposals In the first case, RJ - Do we have any - send an email to dimitrios.kokkinakis@svenska.gu.se, a few days before and inform about your plans/drafts etc. NB: This event may be canceled on short notice due to the trial lectures during the afternoon!
5 <extra> 18:00 – 19:30, on Zoom, Shalom Lappin Title: NLP/AI Seminar: AI Dangers
10 Christmas Smorgasbord/Julbord Christmas Smorgasbord/Julbord, at 11.30-15.00
16 <extra> Mini workshop: Grammatical Error Correction Time: 13.15-15.00, Room: J415
24 Happy Holidays! Happy Holidays!



The schedule for the Spring of 2024 is here:
Date Topic Comments
January
16 Planning meeting (internal)
22 Application seminar SFS
23 Ethics for NLP (FLoV)
30 RJ project deadline
February
6 Herbert presents earlier work (internal)
13 School holidays
20 Discussing VR application drafts (internal) Also deadline VR Tvärvetensk forskningsmiljö
27
March
4-6 Research retreat SFS
5 Deadline VR HS
12 Discussion of Felix & Sasha's draft (internal)
19 Also: EACL
25 Felix final seminar (Monday!)
26 Discussion of Emilie's draft (internal)
April
2 School holidays
9 Handledarkollegium SBX (closed) Also: deadline VR NT
16
23 Mini-workshop "Approaches to corpus searches" NB! location J233,
https://www.gu.se/en/event/approaches-to-corpus-searches
30 St Walpurga's eve
May
7 (free)
14 CLARIN:EL Speaker: Kanella Pouli, fr Athena RC/ILSP, Greece. Abstract: CLARIN:EL is a Research infrastructure for Language Resources & Technologies (LRTs); it is the Greek part of the European CLARIN ERIC Infrastructure. It provides a multitude of assets related to Language Technology (LT) for and by Social Sciences & Humanities (and beyond), focusing mainly but not exclusively on Greek LRTs. The Central Inventory provides access to corpora, lexica, tools and language descriptions. Users can effortlessly locate desired resources using keywords and filters. Additionally, the infrastructure hosts a Workbench, i.e. a variety of services to process corpora from the Central Inventory or data submitted by registered users. For those eager to contribute their own data, a user-friendly interface enables the creation of metadata records for their resources. All the information regarding the infrastructure, the CLARIN network, and the NLP community in Greece can be found in the user guide, portal, and NLP:EL Knowledge Centre. Brief Bio: Computational Linguist, MSc., KP's work focuses on the design and improvement of repositories, the development and adaptation of metadata schemata for language resource documentation as well as the collection and processing of language resources.
21 No meeting bec. of LREC
28 (free)
June
4 SBX Retreat
10 Emilie idea seminar (Monday!)
11 Stina Johansson and Lars Kullman from UB About publication points
18 LREC/COLING summary