Partially offline for an update - new slide and code version to follow soon!
Date | Topic | Slides | Code | Data |
---|---|---|---|---|
Preparation | Survey |
01_rmarkdown.rmd / external resources 01_forloops.rmd |
||
10/05 | Introduction to Text Analysis |
Session 1 |
01_textanalysis.rmd | |
10/05 | Descriptive Analyses, Dictionaries |
Session 2 Session 2: Preprocessing |
02_descriptive_analysis.rmd 02_dictionaries.rmd 02_transform_preproc.rmd |
us_election_2020_1st_presidential_debate.csv theses.RData english.yml |
11/05 | Supervised Learning Methods | Slides 3 |
03_classifyingparliament.Rmd 03_classification.rmd 03_thesisabstracts.rmd |
House of Commons Corpus from the ParlSpeech Dataset - caution: large file sample_corp.RData |
11/05 | Unsupervised Learning Methods | Slides 4 |
04_gadarian.rmd 04_brexitbill.rmd |
brexit.RData |
12/05 | Webscraping |
Session 5 |
05_scraping.rmd 05_scraping_briefings.rmd 05_singlefile.rmd |
|
12/05 | Advanced Text Analysis Methods | Session 6 |
Solutions:
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA). If not stated otherwise, images are created by the Course Creator.