Feature Significance Analysis of the US Adult Income Dataset
(2021-09-01)
In this paper, we analyze the classic US Adult Income Dataset using logistic regression and random forest to identify potential factors that contribute to income bias for the 50K income bracket (income ≥ 50K per year). Using ...
To Join or Not to Join? Thinking Twice about Joins before Feature Selection
(2015-11-27)
Closer integration of machine learning (ML) with data processing is a booming area in both the data management industry and academia. Almost all ML toolkits assume that the input is a single table, but many datasets are ...
A Survey of the Existing Landscape of ML Systems
(2015-11-27)
We survey the existing landscape of ML systems to identify gaps that motivate our vision of a unifying abstraction to support the iterative process of model selection and lay a principled foundation for model selection ...



