Slides, Code & Data used in class

Date | Topic | Slides | Code | Data
18/11 | Introduction to Text Analysis | Slides 1 | 01_rmarkdown.rmd, 01_textanalysis.rmd |
01/12 | Webscraping | Slides 2 | 02_scraping.rmd |
08/12 | Descriptive Analyses, Dictionaries | Slides 3 | 03_descriptive_analysis.rmd, 03_dictionaries.rmd | english.yml, us_election_2020_1st_presidential_debate.csv
15/12 | Supervised Learning Methods | Slides 4 | 04_classifyingparliament.Rmd, 04_classification.rmd | House of Commons Corpus from the ParlSpeech Dataset (caution: large file), sample_corp.RData
11/01 | Unsupervised Learning Methods | Slides 5 | 05_gadarian.rmd |
11/01 | Extra: Selenium | Slides 5plus | 05_selenium.rmd |



Exercises

The table provides exercise code and solutions. All exercises and their solutions are discussed in the Lab session.

Solutions should be uploaded by Friday 12am before the Lab session via Dropbox File Request.

Date | Topic | Exercises
18/11 | Introduction to Text Analysis | Survey, 01_forloops.rmd
01/12 | Webscraping | 02_scraping.rmd, 02_scraping_briefings.rmd, 02_singlefile.rmd
08/12 | Descriptive Analyses, Dictionaries | 03_transform_preproc.rmd, 03_descriptive_analysis.rmd, 03_dictionaries.rmd
15/12 | Supervised Learning Methods | 04_classifyingparliament.Rmd, 04_classification.rmd, 04_thesisabstracts.rmd (with eui.csv)
11/01 | Unsupervised Learning Methods | 05_brexitbill.rmd (use brexit.RData, stm_brexit1.RData and stm_brexit2.RData if needed)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA). Unless stated otherwise, images were created by the course creator.