Workshop on Structured Linguistic Data and Evaluation (SLiDE)
A full-day workshop at LREC 2026, 11-16 May 2026, Palma, Mallorca (Spain)

In the last ten years, significant advances in deep learning models and the development of Large Language Models (LLMs) have revolutionized the fields of computational linguistics (CL) and natural language processing (NLP). In turn, this has led to a complete re-assessment of the language resources and evaluation practices necessary for training LLMs and analyzing their outputs. In particular, the availability of very large amounts of unstructured data for training foundational models has come into focus, while the value of high-quality structured linguistic data with rich annotations at various levels of linguistic analysis has been downplayed by comparison. However, as CL and NLP practitioners engage further with LLMs and debate their strengths and weaknesses, the importance of high-quality, structured linguistic data has been re-emphasized. The proposed workshop can be seen as related to the Treebanks and Linguistic Theories (TLT) conference series and the more recent SyntaxFest venue. Over the years, these venues have provided a central forum for high-quality research on treebanks, syntactic theory, syntax-semantics interface, structured meaning representations, and annotated linguistic resources. With record participation in recent years, they demonstrate the vitality and relevance of this line of work. The Workshop on Structured Linguistic Data is conceived as both a continuation of this tradition and an adaptation to the new realities of an LLM-dominated research landscape. The workshop will bring together researchers from these overlapping traditions to advance methods, resources, and practices for integrating structured linguistic data into the LLM era.

Topics of interest include but are not limited to:
Linguistic Data Analyses, Language Resources, and Evaluation

Grammar processing with NLP and LLM-based tools
Phonological and morphological analysis and LLM tokenization
Annotation strategies with LLM-empowered methodologies and tools
Design principles and annotation schemes for structured linguistic data
Multi-lingual and cross-lingual settings
Mapping of structured linguistic data to Linked Open Data resources
Evaluation informed by language typology
Language resources for under-resourced and endangered languages
The use of structured linguistic data for NLP applications
The use of structured linguistic data in acquiring linguistic knowledge
(Semi-)automatic methods for creating structured linguistic data

Spoken language Data

Speech-to-text applications
Speech Generation techniques
Speech data preparation, curation and evaluation

Multimodality and Situated Dialogue

Structured multimodal resources: gesture AMR (GAMR), gaze and posture annotation, multimodal dialogue corpora. Multimodal grounding: linking language with visual, gestural, and action representations Structured representations for co-attention and alignment in multiparty dialogue
Multimodal evaluation resources for LLMs

Pragmatics and Discourse

Structured data for discourse and dialogue: discourse relation annotation, coherence structures, dialogue acts Pragmatic annotation (speech acts, presupposition, implicature, politeness, stance)
Structured approaches to common ground tracking and Theory of Mind in LLMs

Semantics and Lexical Meaning

Dependency analysis and semantic parsing
Annotation beyond syntax: semantics, pragmatics and discourse
Structured data for lexical semantics: sense inventories, semantic frames, qualia structure, and type-theoretic resources Computational semantics resources: Abstract Meaning Representation (AMR), Universal Meaning Representation (UMR), Discourse, Representation Structures, Minimal Recursion Semantics (MRS), Type Theory with Records (TTR) Distributional and neural-symbolic representations of lexical meaning: (e.g., Holographic Reduced Representations (HRR), hyperdimensional computing) for structured LLM grounding Aligning vector-based meaning representations with symbolic/typed structures

We invite paper submissions in two distinct tracks:

regular papers on substantial and original research, including empirical evaluation results, where appropriate – 6 to 8 pages excluding references and potential ethics statements; short papers on smaller, focused contributions, work in progress, negative results, surveys, or opinion pieces – 4 to 6 pages excluding references and potential ethics statements.

Invited speakers
Naiara Perez (University of the Basque Country)
Shira Wein (Amherst College)

Paper Submission and Templates

Submission follows the LREC 2026 conference instructions, using the START conference management system. Submissions should follow the LREC stylesheet, available on the conference website on the Author’s kit page.

For more information about paper submission, please consult: https://www.slide-workshop.org/

Papers must be anonymized to support double-blind reviewing.

Important Dates
February 22, 2026: Paper submission deadline
March 15, 2026: Notification of acceptance
March 25, 2026: Camera-ready papers
May 2026: Workshop at LREC 2026
All deadlines are 11.59 pm UTC -12h (“anywhere on Earth”).

Workshop Organizers
Jan Hajič (Charles University, Czech Republic)
Erhard Hinrichs (Tübingen University, Germany)
Sandra Kübler (Indiana University, USA)
Joakim Nivre (Uppsala University, Sweden)
Petya Osenova (Sofia University and IICT-BAS, Bulgaria)
James Pustejovsky (Brandeis University, USA)
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

Reply via email to