About
Language Disorders 2026 Online – DELAD Special Session Language Disorders Corpora
Language Disorders 2026 Online SPECIAL Session on LANGUAGE DISORDERs CORPORA
Co-organised by the International Linguistic Society and DELAD (Database Enterprise for Language and speech Disorders)
Language disorders affect individuals across all ages and languages. While many are congenital and emerge in early childhood, others develop later in life as a result of neurological conditions, illness, or injury. Addressing this global and multilingual reality requires robust, well-documented, and ethically managed language disorder corpora.
This special session focuses on the FAIR sharing of speech disorders corpora in advancing research, clinical practice, and interdisciplinary collaboration.
Audience participation in the session will be complimentary.
The session is scheduled to take place on 26 August, 14:00–17:00 (GMT, CEST −1).

Session Format
Panel-led introduction
Keynote lecture: Iris Nowenstein
Panel: Iris Nowenstein, Paul Trilsbeek, Alice Lee, Satu Saalasti
Abstract presentations

Topics of interest
Contributions and discussion topics may include, but are not limited to:
Design and annotation of language disorder corpora
Ethical, legal, and governance issues in clinical language data
National and international initiatives to collect and share speech disorders corpora
Reuse and interoperability of corpora
Clinical and research applications of disorder corpora
Linking corpora with neuroimaging, behavioural, and longitudinal data

Who should attend
This session is relevant to:
Researchers working on language and speech disorders
Clinicians and speech-language pathologists
Corpus linguists and phoneticians
Cognitive scientists and neuroscientists
Researchers involved in data infrastructure, archiving, and research ethics
Audience participation in the session will be complimentary.
The session is scheduled to take place on 26 August, 14:00–17:00 (GMT, CEST −1).
About DELAD
DELAD is an international initiative dedicated to the collection, organisation, archiving, and sharing of language and speech disorders corpora. DELAD supports the scientific community by:
- Providing access to high-quality datasets
- Promoting best practices in data management and ethical use
- Supporting the long-term preservation of sensitive research materials
DELAD facilitates collaboration between researchers, infrastructure providers, and practitioners working in clinical linguistics, speech-language pathology, phonetics, and related disciplines

In association with

Keynote Speaker
Language Disorders 2026 OnlineIris Edda Nowenstein
University of Iceland

Main areas of research
- First language acquisition and multilingualism
- The memory-language interface
- Speech and language disorders
- Clinical linguistics
- Clinical language technology
- Case marking and argument structure
- Language variation and change
Bio
Iris Nowenstein is an Assistant Professor in Icelandic Linguistics at the University of Iceland and a Speech-Language Pathologist at the National Universal Hospital of Iceland. She defended her PhD dissertation on the acquisition of Icelandic case marking at University of Iceland in 2023 and was a Fulbright Visiting Scholar at the University of Pennsylvania in 2024-2025, within the Integrated Language Science and Technology initiative. Her current work is within clinical linguistics and clinical language technology, focusing on the automatic extraction of language-specific and language-universal manifestations of diseases and disorders. She is involved in various efforts to ensure that people in smaller and lesser-resourced language communities have access to digital health resources which are based on language technology.
Keynote Lecture
Language and Speech Disorders Corpora as Necessary Infrastructure for Health Tech Accessibility
Recent clinical applications of language technology show considerable potential for people with speech and language symptoms and disorders. This has resulted in a variety of health tech tools becoming available to them, including novel communication aids such as personalized automatic speech recognition for disordered speech and novel instruments for the screening, diagnosis and monitoring of diseases and disorders via automatic language sample analysis. This kind of monitoring through speech might, for example, provide cost-effective, person-centered and non-invasive endpoints for treatment efficacy assessments, including in the context of precision medicine and drug trials.
But crucially, these speech- and language-based health tech tools are currently almost exclusively accessible to speakers of English and a few other high-resource languages. A major hurdle for small, lower-resourced language communities in this context is the creation of clinical corpora, including language and speech disorders corpora. These corpora often constitute the basis for the successful generalization of technology across languages, both in the context of diagnosis and monitoring (where manifestations of diseases and disorders can be language-specific) and communication aids, where accuracy gains can heavily rely on dataset size and quality.
In this talk, I will describe ongoing efforts to build the necessary infrastructure for clinical speech and language data collection in Iceland. This is done through the creation of the Icelandic Language Biobank, a resource that leverages collaboration with clinicians and robust linguistically-informed data collection against data scarcity.
In contrast with the most comprehensive data collection efforts in high-resource languages, the Icelandic Language Biobank will contain language samples for communication aid development as well as the diagnosis and monitoring of diseases and disorders. Data will primarily be collected through a web-based automatic linguistic analysis platform, ALDA (Automatic Linguistic Data Analysis), designed for and co-created with speech-language pathologists/therapists (SLPs). A platform which combines accessible clinician-centered tools and a data sharing infrastructure serves the purpose of facilitating technological transfer and creating conditions for sustainable clinician-led data collection. I argue that taking these steps brings us closer to the language and speech disorders corpora that are necessary to ensure access to speech- and language-based health tech tools in small, less-resourced language communities.
- Data stewards interested in sensitive language data
Submit today
Authors wishing to present in the special session should submit via the Language Disorders 2026 submission page and select the topic:
“DELAD Special Session: Language Disorder Corpora.”

