A Decision Tree Analysis on Drug Use and Health
This report analyzes factors associated with youth drug use by fitting decision tree models to survey data from the National Survey on Drug Use and Health. The dataset contains detailed information on many aspects of respondents’ lives, including demographics, youth experiences, and substance use. The study addresses three problem types: binary classification of marijuana use, multi-class classification of the frequency of marijuana use in the past year, and regression for first-ever marijuana use. Decision trees and several ensemble methods are used to build predictive models for each problem type. The results indicate how demographic and youth-experience variables relate to different forms of youth substance use. These findings have practical implications for public health interventions aimed at reducing youth drug use.