WaC-13 Workshop
Deadline: 2026-08-07
WaC-13 workshop at EMNLP 2026, focusing on web data and its applications
The 13th Web-as-Corpus (WaC-13) workshop will be held at EMNLP 2026 in Budapest, Hungary, from 24 to 29 October 2026. It provides a multidisciplinary forum for research addressing the full lifecycle of web data.
Topics of interest include:
- Creation and evaluation of high-quality datasets for foundation models
- Use of web data in empirical linguistic research
- Analysis of web-scale corpora for quality, representativeness, and societal insights
- Ethical and legal aspects of collecting, sharing, and using web data
Papers can be submitted either through ARR commitment or through openreview.net.
Tags: Web-as-Corpus, WaC-13, EMNLP2026, NLP, linguistics, web data, multilingual data