Place
Unicom Building
Room: 9.3120
Mary-Somerville-Straße 9
28359 Bremen
Time
2:00 p.m. - 5:30 p.m. (Thursday); 9:00 a.m.- 5:00 p.m. (Friday)
Contact Person
Dr. Mandi M. Larsen (Bremen International Graduate School of Social Sciences (BIGSSS))
Partic. Organisation
Bremen International Graduate School of Social Sciences (BIGSSS); SOCIUM Forschungszentrum Ungleichheit und Sozialpolitik, Universität Bremen; Sonderforschungsbereich 1342 "Globale Entwicklungsdynamiken von Sozialpolitik", Universität Bremen

This workshop with Stefan Müller from Trinity College Dublin offers a hands-on introduction to extracting data from text, and applying various methods to analyse the data. Topics included involve:

  • From Raw Text to Corpus – how to collect textual data and prepare it for analysis.
  • Classification techniques - the first steps in translating text to usable data; supervised and unsupervised learning; dictionary approaches and topic modelling.
  • Scaling – Supervised and Unsupervised techniques.
  • An overview of more advanced topics.


The applied elements of the workshop will make use of the programming language R. Therefore, a basic familiarity with R is a prerequisite for attending the course.

Registration: BIGSSS fellows register via CampusNet, SOCIUM and CRC 1342 members please send a short email to mlarsen@bigsss-bremen.de