End-to-end NLP system replacing manual labeling of correctional incident free text with automated multi-label prediction across facilities.
Role: Technical lead / product owner (BIOS program)
Partners: Recidiviz; Council of State Governments
Highlights:
MultiOutputClassifier with logistic regression; ongoing SVM experimentation (+~5 F1)
Outcomes: 75.6% micro-F1, 88.3% precision, 66.1% recall, 60.8% micro-Jaccard, 60.6% exact match; ~30x vs. random baseline
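A minimal sketch of the kind of setup named above: TF-IDF features feeding a scikit-learn MultiOutputClassifier that wraps logistic regression, scored with the same family of metrics as the reported outcomes. The incident texts, label names, and the helper multilabel_report are hypothetical and exist only for illustration; the project's actual data, preprocessing, and hyperparameters are not described here and are not reproduced.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import (
    accuracy_score,
    f1_score,
    jaccard_score,
    precision_score,
    recall_score,
)
from sklearn.multioutput import MultiOutputClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import MultiLabelBinarizer

# Hypothetical toy incidents and labels (assumption: not the project's data).
texts = [
    "inmate struck another inmate during recreation",
    "cell search found a concealed phone",
    "nurse called after a fall in the housing unit",
    "fight in dining hall, one person treated by medical staff",
    "contraband tobacco found during a pat search",
    "altercation followed discovery of a hidden phone",
]
labels = [
    ["assault"],
    ["contraband"],
    ["medical"],
    ["assault", "medical"],
    ["contraband"],
    ["assault", "contraband"],
]

# Turn the label lists into a binary indicator matrix, one column per label.
mlb = MultiLabelBinarizer()
Y = mlb.fit_transform(labels)

# One independent logistic regression per label column, on shared TF-IDF features.
model = make_pipeline(
    TfidfVectorizer(),
    MultiOutputClassifier(LogisticRegression(max_iter=1000)),
)
model.fit(texts, Y)
pred = model.predict(texts)  # in-sample predictions, only to show the output shape


def multilabel_report(y_true, y_pred):
    """Micro-averaged multi-label metrics plus exact-match (subset) accuracy."""
    return {
        "micro_f1": float(f1_score(y_true, y_pred, average="micro")),
        "precision": float(precision_score(y_true, y_pred, average="micro")),
        "recall": float(recall_score(y_true, y_pred, average="micro")),
        "micro_jaccard": float(jaccard_score(y_true, y_pred, average="micro")),
        # accuracy_score on indicator matrices counts a row only if every label matches.
        "exact_match": float(accuracy_score(y_true, y_pred)),
    }


# Hand-checkable example: 2 true positives, 0 false positives, 1 false negative.
demo_true = np.array([[1, 0, 1], [0, 1, 0]])
demo_pred = np.array([[1, 0, 0], [0, 1, 0]])
report = multilabel_report(demo_true, demo_pred)
print(report)
```

For the hand-checkable example, precision is 2/2, recall is 2/3, micro-F1 is 0.8, micro-Jaccard is 2/3, and exact match is 1/2. Micro-Jaccard and micro-F1 are tied by J = F1 / (2 - F1), which the reported figures also satisfy: 0.756 / 1.244 ≈ 0.608.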
Python, scikit-learn, pandas, TF-IDF, Oracle/SQL, NLP, production ML