Workshop-II
Machine Learning and Statistical Language Models
January 10 - January 14, 2022
All times US Eastern time
Participants login here to enable Zoom and video linksCourse Materials
Day 1 - January 10

Introduction to Machine Learning Janco, Lassner
Preparation: Intro. to ML
Skills and concepts for model training.


Practical Introduction to Model Training Janco, Dombrowski
Preparation: LitBank Notebook
Hands-on activity with model training.
Day 2 - January 11


Overview: From Toy Data to Your Data Tasovac, Budak
Preparation: None, slides available here
spaCy projects, data processing and model training with your project's data and requirements.


Demonstration: Training Models with INCEpTION Data Tasovac, Budak
Preparation: New Language Training
Workflow for data preparation and model training with project data.
Day 3 - January 12

Practical Session with Teams’ Data Tasovac
Work in individual teams to run project files and train models. Assess model performance against applied research tasks.

Practical Session with Project Data and Requirements Janco
Lightning talks to share with other groups. Continued project work
Day 4 - January 13

Embeddings, Do You Need Them? Janco, Lassner
Preparation: Embeddings
Adding FastText vectors to your model. Shared embedding layers. Transformer pipeline component.

Optional Applied Session Janco, Lassner
Preparation: Applied embeddings
Training with embeddings. Assess utility for research tasks
Day 5 - January 14


Review of Key Topics and Discussion Janco, Ermolaev
.

Time for Team Meetings and Planning Work Lassner, Budak
.