Date | Topic | Slides | Code | Data |
---|---|---|---|---|
18/11 | Introduction to Text Analysis | Slides 1 | 01_rmarkdown.rmd, 01_textanalysis.rmd | |
01/12 | Webscraping | Slides 2 | 02_scraping.rmd | |
08/12 | Descriptive Analyses, Dictionaries | Slides 3 | 03_descriptive_analysis.rmd, 03_dictionaries.rmd | english.yml, us_election_2020_1st_presidential_debate.csv |
15/12 | Supervised Learning Methods | Slides 4 | 04_classifyingparliament.Rmd, 04_classification.rmd | House of Commons Corpus from the ParlSpeech Dataset (caution: large file), sample_corp.RData |
11/01 | Unsupervised Learning Methods | Slides 5 | 05_gadarian.rmd | |
11/01 | Extra: Selenium | Slides 5plus | 05_selenium.rmd | |

The table below lists the exercise code and solutions. All exercises and their solutions are discussed in the lab session.
Solutions should be uploaded via Dropbox File Request by Friday 12am before the lab session.

Date | Topic | Exercises |
---|---|---|
18/11 | Introduction to Text Analysis | Survey, 01_forloops.rmd |
01/12 | Webscraping | 02_scraping.rmd, 02_scraping_briefings.rmd, 02_singlefile.rmd |
08/12 | Descriptive Analyses, Dictionaries | 03_transform_preproc.rmd, 03_descriptive_analysis.rmd, 03_dictionaries.rmd |
15/12 | Supervised Learning Methods | 04_classifyingparliament.Rmd, 04_classification.rmd, 04_thesisabstracts.rmd with eui.csv |
11/01 | Unsupervised Learning Methods | 05_brexitbill.rmd - use brexit.RData, stm_brexit1.RData and stm_brexit2.RData if needed |
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA). If not stated otherwise, images are created by the Course Creator.