request
new
DistilBERT body classifier as secondary signal
bill@transcom.net (Transcom IS) ·
2026-05-11 09:00
LightGBM gives us 0.998 AUC on the eFa corpus but misses some patterns (foreign-language marketing, cold B2B). A small DistilBERT model could catch what the gradient-booster misses, scored as a second signal added to the model_score.