Machine Learning and Statistical Language Models
January 10 - January 14, 2022
All times US Eastern timeParticipants login here to enable Zoom and video links
Day 1 - January 10
Introduction to Machine Learning Janco, Lassner
Preparation: Intro. to ML
Skills and concepts for model training.
Practical Introduction to Model Training Janco, Dombrowski
Preparation: LitBank Notebook
Hands-on activity with model training.
Day 2 - January 11
Overview: From Toy Data to Your Data Tasovac, Budak
Preparation: None, slides available here
spaCy projects, data processing and model training with your project's data and requirements.
Demonstration: Training Models with INCEpTION Data Tasovac, Budak
Preparation: New Language Training
Workflow for data preparation and model training with project data.
Day 3 - January 12
Practical Session with Teams’ Data Tasovac
Work in individual teams to run project files and train models. Assess model performance against applied research tasks.
Practical Session with Project Data and Requirements Janco
Lightning talks to share with other groups. Continued project work
Day 4 - January 13
Embeddings, Do You Need Them? Janco, Lassner
Adding FastText vectors to your model. Shared embedding layers. Transformer pipeline component.
Optional Applied Session Janco, Lassner
Preparation: Applied embeddings
Training with embeddings. Assess utility for research tasks
Day 5 - January 14
Review of Key Topics and Discussion Janco, Ermolaev
Time for Team Meetings and Planning Work Lassner, Budak