Getting Stuck on the Kaggle Disaster Tweets Project (and How I’m Shipping V1 Anyway)

I’ve been working on the Kaggle Disaster Tweets classification project, and for a while progress felt good. I built a baseline model with TF-IDF and Logistic Regression and reached an F1 score of 0.82 without using a pipeline. Then I decided to “do things properly” and refactor everything into a scikit-learn pipeline, and …
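For context, here is a minimal sketch of what a TF-IDF + Logistic Regression baseline can look like once it is wrapped in a scikit-learn pipeline. It is an illustration, not the actual project code: the toy tweets, the labels, the step names ("tfidf", "clf"), and the hyperparameters are all assumptions made up for this example.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.pipeline import Pipeline

# Hypothetical toy data standing in for the competition's training set.
# 1 = tweet about a real disaster, 0 = not a disaster.
texts = [
    "Forest fire near the town, residents evacuated",
    "Massive earthquake hits the coast",
    "Flood waters rising fast downtown",
    "I love this sunny afternoon",
    "This new album is fire",
    "Just had the best pizza ever",
]
labels = [1, 1, 1, 0, 0, 0]

# Vectorizer and classifier chained into one estimator.
# The step names and settings are illustrative choices, not tuned values.
pipeline = Pipeline([
    ("tfidf", TfidfVectorizer(lowercase=True, ngram_range=(1, 2))),
    ("clf", LogisticRegression(max_iter=1000)),
])

# fit() learns the TF-IDF vocabulary and the classifier weights together;
# predict() applies the same fitted vocabulary to whatever text it is given.
pipeline.fit(texts, labels)
preds = pipeline.predict(texts)

# F1 on the training data itself: only a smoke test, not a real score.
train_f1 = f1_score(labels, preds)
print(f"Training F1 on toy data: {train_f1:.2f}")
```

The main reason to bother with the pipeline is that the vectorizer gets fitted only on the data passed to `fit`, so when the whole object is handed to something like `cross_val_score`, the TF-IDF vocabulary is rebuilt inside each training fold instead of leaking information from the validation fold.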