Skip to content

Mack's term project for Data Science analyzing the text of LGBTQ+ laws in the USA.

License

Notifications You must be signed in to change notification settings

Data-Sci-2022/LGBTQ-Laws-Text-Analysis

Repository files navigation

LGBTQ-Laws-Text-Analysis

By Mack Campbell, [email protected] December 15, 2022

Project overview

This research project analyzes pro- and anti-LGBTQ+ legislation from the US to see what statistical connection there is between the language used in the legislation and measures of equality per state for its LGBTQ+ population.

Where the data come from

I looked at the equality index for each state as assigned by Equaldex and its overall LGBTQ+ Policy tally as calculated by the Movement Advancement Project. These measure different aspects of equality and legal protections. Freedom for All Americans has a list of Pro and Anti LGBTQ+ legislation that I used to guide what legislation I included in my analysis. The Freedom for All American's website had the legislation on their site as a widget from BillTrack50. I gained permission to scrape BillTrack50's website, otherwise all scraping is prohibited in their TOS. On BillTrack50's website each bill page links to the state's page that houses the PDF text of the bill. This is finally where I pulled in all the text to analyze.

Repo organization

  • Final Report explains my data, my analysis of it, and some of my methodology. The entire methodology can be seen in the analysis walkthrough (rmd) (md).
  • Data has all the data files used in the analysis.
  • State Bill csvs is separated into Anti and Pro.
  • BillTrack DF is a dataframe of BillTrack information, to avoid scraping.
  • State Text DF is a dataframe of bill texts, to avoid having to scrape and compile the data.
  • Data Visualization includes all graphs generated from my analysis walkthrough (rmd) (md).
  • Analysis Walkthrough (rmd) (md) goes through all the code I used to collect and manipulate my data.
  • License details how you may use and share the data and code in this repo.
  • Presentation is a copy of the presentation given of this project on December 8, 2022.
  • Progress Report details progress made on the project through the Fall semester of 2022.
  • Project Plan is the original project plan. Compare to the end of the final report to see changes made.

About

Mack's term project for Data Science analyzing the text of LGBTQ+ laws in the USA.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published