Original corpus of articles relating to refugees scraped from Tennessee newspaper The Chattanoogan along with simple code for text-as-data word cloud.
-
Updated
Nov 11, 2019 - R
Original corpus of articles relating to refugees scraped from Tennessee newspaper The Chattanoogan along with simple code for text-as-data word cloud.
Material from my Machine Learning for the Social Sciences course
Replication script for the Webscrapping Transcripts of the Parliamentary Debates in the National Council of the Slovak Republic (2002-2023) and the ensuing sentiment analysis
Empirical framework applied to parliament discourses and Twitter data, with a Discourse Polarization Index.
Collection of text corpora for publicly available speeches from Mexican president Andres Manuel Lopez Obrador (AMLO) sourced from YouTube. The dataset includes his daily morning conferences (conferencias mañaneras) 😴🪿
The ABC of Computational Text Analysis. BA Seminar, Spring 2021, University of Lucerne
This repository uses text-as-data methods alongside traditional primary source reading to analyze early American state constitutions. The R scripts create a function to scrape and clean the constitutional text, run sentiment analysis, calculate tf-idf, and perform LDA. This is a work-in-progress.
From using xpdf, rvest, and quanteda on United Nations Digital Library search results to applying dictionaries to speeches in United Nations meeting records
A tutorial on using regular expressions in R
A small showcase for topic modeling with the tmtoolkit Python package. I use a corpus of articles from the German online news website Spiegel Online (SPON) to create a topic model for before and during the COVID-19 pandemic.
The ABC of Computational Text Analysis. BA Seminar, Spring 2022, University of Lucerne
'dictvectoR' measures the similarity between a concept dictionary and documents, using fastText word vectors. Implements the "Distributed-Dictionary-Representation" (Garten et al. 2018) method in R.
An Automation Webcrawler for Extracting Central Bankers' Speeches
LinkOrgs: An R package for linking linking records on organizations using half a billion open-collaborated records from LinkedIn
Summer 2017 Social Media Analytics Workshop Series
2018 Computational Text Analysis Notebooks, University of Mannheim
Code and models for 3 different tools to measure appeals to 8 discrete emotions in German political text
Literature 📄 and datasets 📚 on automatic populism detection
Add a description, image, and links to the text-as-data topic page so that developers can more easily learn about it.
To associate your repository with the text-as-data topic, visit your repo's landing page and select "manage topics."