• Home
  • About
  • AI Projects
  • DE Projects
  • DA Projects
  • Interest
  • Contact

Student Debt Collection Case Analysis and Visualization




This project analyzed millions of student debt-related court cases to uncover key patterns and players in debt collection practices, focusing on Massachusetts. Using advanced querying, data standardization, and visualization techniques, I developed an interactive Looker Studio dashboard and enhanced database accuracy with NLP. These efforts provided actionable insights for investigative reporting on the student loan debt crisis.


Due to the confidentiality of the project, the final Github repo is kept private.
Read About Our Work in the Newspaper: A state agency that issues student loans has sued thousands of borrowers

My Contribution



  • Investigated large-scale database (5 tables, millions of rows) with SQL to identify student debt-related cases, applying advanced querying techniques to extract relevant data without clear indicators.
  • Developed an interactive Looker Studio dashboard, enabling a non-technical client to easily filter and navigate data by year, court type, and plaintiff party for better insights.
  • Led a comprehensive GitHub cleanup initiative, organizing and renaming files for both technical and non-technical users, ensuring data integrity while eliminating redundancies.


Final Presentation Slides

Final Report

Advanced Data Analysis

  • Improved ability to query and analyze large databases with millions of rows to extract meaningful insights.

Dashboard Development

  • Learned to create user-friendly dashboards in Looker Studio, making complex data accessible to non-technical users.

Data Cleaning with NLP

  • Developed skills in using NLP to standardize and clean data, ensuring accuracy and consistency for analysis and visualization.



Copyright © Vanessa Huang, modified by Caslow Chien, 2024.