Field Matters 2024 workshop program
All times in ICT (UTC+7)
August 16
09:00 - 09:30 Opening remarks
09:30 - 10:30 Invited talk: What role can ASR play in real-world language documentation? - Emily Prud’hommeaux
10:30 - 11:00 Coffee break
11:00 - 11:10 Introduction to special track
11:10 - 12:10 Special track
- Leveraging Deep Learning to Shed Light on Tones of an Endangered Language: A Case Study of Moklen -
Sireemas Maspong, Francesco Burroni, Teerawee Sukanchanon, Warunsiri Pornpottanamas and Pittayawat Pittayaporn
- Documenting Endangered Languages with LangDoc: A Wordlist-Based System and A Case Study on Moklen -
Piyapath T Spencer
- Zero-shot Cross-lingual POS Tagging for Filipino -
Jimson Paulo Layacan, Isaiah Edri W. Flores, Katrina Bernice M. Tan, Ma. Regina E. Estuar, Jann Railey Montalan and Marlene M. De Leon
12:10 - 12:20 Break
12:20 - 13:20 Invited talk: Insights from Language Resource Collection in Linguistically Diverse Southeast Asian Languages - Genta Winata and Alham Fikri Aji
13:20 - 14:30 Lunch break
14:30 - 16:30 Main Session
- The Parallel Corpus of Russian and Ruska Romani Languages -
Kirill Koncha, Abina Abina, Kazakova Tatiana and Gloria Rozovskaya
- ManWav: The First Manchu ASR Model -
Jean Seo, Minha Kang, SungJoo Byun and Sangah Lee
- User-Centered Design of Digital Tools for Sociolinguistic Studies in Under-Resourced Languages -
Jonas Adler, Carsten Scholle, Daniel Buschek, Nicolò Brandizzi and Muhadj Adnan
- A Comparative Analysis of Speaker Diarization Models: Creating a Dataset for German Dialectal Speech -
Lea Fischbach
- Noise Be Gone: Does Speech Enhancement Distort Linguistic Nuances? -
Iñigo Parra
- Comparing Kaldi-Based Pipeline Elpis and Whisper for Čakavian Transcription -
Austin Jones, Shulin Zhang, John T. Hale, Margaret Renwick, Zvjezdana Vrzic and Keith Langston
16:30 - 17:00 Coffee break
17:00 - 18:30 Plenary discussion