Code for item classification is found in classification/prosody_classifier.py.
Results for different classifier types:
- Single-layer neural network (NN): 63% accuracy
- Naive Bayes classifier: 87% accuracy
- Support Vector Machine (SVM): 89% accuracy
Run prosody_classifier.py for SVM results. The trained NN is in models/prosody_classifier.pt.
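A comparison between classifier types like the one listed above might be set up roughly as follows. This is only a minimal sketch, assuming scikit-learn and TF-IDF bag-of-words features; the toy texts, labels, and the function name `compare_classifiers` are hypothetical and are not taken from prosody_classifier.py. Any accuracies it prints are unrelated to the results reported above.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.svm import LinearSVC

# Hypothetical toy data (not from the archive): 1 = about prosody, 0 = not.
TRAIN_TEXTS = [
    "a treatise on english metre and versification",
    "the rhythm and accent of iambic verse",
    "rules of rhyme and poetic feet",
    "scansion of the heroic couplet",
    "a history of the roman empire",
    "principles of chemistry and natural philosophy",
    "travels through the northern provinces",
    "a manual of arithmetic for schools",
]
TRAIN_LABELS = [1, 1, 1, 1, 0, 0, 0, 0]


def compare_classifiers(train_texts, train_labels, test_texts, test_labels):
    """Fit each classifier on TF-IDF features and return accuracy by name."""
    # Assumption: these two model choices stand in for the Naive Bayes and
    # SVM classifiers named in the results; the real settings are unknown.
    models = {
        "naive_bayes": MultinomialNB(),
        "svm": LinearSVC(),
    }
    scores = {}
    for name, model in models.items():
        # A fresh pipeline per model so each one fits its own vectorizer.
        pipeline = make_pipeline(TfidfVectorizer(), model)
        pipeline.fit(train_texts, train_labels)
        scores[name] = pipeline.score(test_texts, test_labels)
    return scores


if __name__ == "__main__":
    test_texts = ["the metre of english verse", "a history of chemistry"]
    test_labels = [1, 0]
    results = compare_classifiers(TRAIN_TEXTS, TRAIN_LABELS, test_texts, test_labels)
    for name, accuracy in results.items():
        print(f"{name}: {accuracy:.2f}")
```

Wrapping the vectorizer and the model in one pipeline keeps the feature extraction identical across classifiers, so any difference in accuracy comes from the model alone.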
Code for topic modeling is found under topic_modeling. Word clouds are zipped under topic_modeling/results.
Code for extracting IDs from text is found in id_extraction/id_extraction.py.
This project is the work of Eve Fleisig and was created for the Princeton Prosody Archive in 2019-2020.