Show simple item record

dc.contributor.author: Chen, Junda
dc.description.abstract: In this paper, we analyze the classic US Adult Income Dataset using logistic regression and random forest to identify potential factors that contribute to income bias for the 50K income bracket (income ≥ 50K per year). Using the two methods, we train models on the dataset and obtain models that are stable under cross-validation. We also find that the two methods, although both show good accuracy, yield conflicting interpretations of which factors have the most influence on US adult income. [en_US]
dc.subject: machine learning [en_US]
dc.subject: random forest [en_US]
dc.subject: big data [en_US]
dc.subject: logistic regression [en_US]
dc.subject: neural network [en_US]
dc.subject: feature engineering [en_US]
dc.title: Feature Significance Analysis of the US Adult Income Dataset [en_US]
dc.type: Technical Report [en_US]

This item appears in the following Collection(s)

  • CS Technical Reports
    Technical Reports Archive for the Department of Computer Sciences at the University of Wisconsin-Madison
