• Login
    View Item 
    •   MINDS@UW Home
    • MINDS@UW Milwaukee
    • UW Milwaukee Electronic Theses and Dissertations
    • View Item
    •   MINDS@UW Home
    • MINDS@UW Milwaukee
    • UW Milwaukee Electronic Theses and Dissertations
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    An Interactive Health Data Science Platform for Exploratory Analysis of Health Outcomes – a Case Study with Colon Cancer

    Thumbnail
    File(s)
    Main File (1.347Mb)
    Hemanth_Kumar_Alapati_Masterx2019s_thesis_presentation.pptx (1.084Mb)
    Date
    2021-12-01
    Author
    Alapati, Hemanth Kumar
    Department
    Computer Science
    Advisor(s)
    Jake Luo
    Mukul Goyal
    Metadata
    Show full item record
    Abstract
    Disease prediction is an important aspect of early disease detection and preventive care with wide range of applications in healthcare domain. Previous studies used image processing techniques, statistical and machine learning models to predict diseases. Prediction accuracies vary with data type and the target. Often the data is processed through models under different data conditions to identify what works best for a scenario. This results in tweaking the code, running multiple iterations making these methods usable only for people with technical skills. An interactive platform is developed that hides the technicalities and allows the users to change options like target disease for prognosis, feature selection method, sample size, ML algorithm. With this, multiple approaches can be tried and compared to find a combination of the options for an efficient outcome. Colon cancer is used to perform a case study to test this platform. 2 selection algorithms and 3 ML models are used. Although both selection methods identified identical features as significant for colon cancer prediction, the order of the features based on the scores is different. Hence, the machine learning algorithms performed similarly with both the selection methods. Random Forest, Logistic Regression, and Decision Tree had accuracies 87%, 86%, and 83% respectively.
    Subject
    data science
    Disease prediction
    disease prediction platform
    Disease prognosis
    machine learning application
    Permanent Link
    http://digital.library.wisc.edu/1793/92729
    Type
    thesis
    Part of
    • UW Milwaukee Electronic Theses and Dissertations

    Contact Us | Send Feedback
     

     

    Browse

    All of MINDS@UWCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Login

    Contact Us | Send Feedback