Improving Sparse Reward Classifiers for Model-Based Reinforcement Learning in Low-Data Regimes