Last year, I entered the Kaggle March Madness machine learning competition for the first time. It threw me for a bit of a loop that a single competition has you submit predictions for both the men's and women's tournaments. It didn't seem safe to assume that the same features would predict both men's and women's games optimally, but due to time constraints, I just trained a model on men's data so I could also use it to fill out work and friends' bracket competitions, then reused the exact same model for women's predictions in the Kaggle competition.
I've started adjusting the model for 2026, and decided to test my intuition that men's and women's predictions would benefit from tailored models. I took the same feature set, trained two logistic regression model instances separately on men's and women's data, and compared the weights.
Here are the highlights: