To contribute to this archive with a new technical report, see the instructions on how to submit to the CS Technical Reports Archive.

Recent Submissions

  • Compositional, Monotone, and Non-linear Program Analysis 

    Cyphert, John (2024)
    The presence of bugs in deployed software can lead to great economic and or human cost. One strategy for mitigating these losses is to prove the functional correctness of programs---or sometimes aspects of a program's ...
  • NSF Workshop on Emerging Research Opportunities at the Intersection of Statistics and Internet Measurement Final Report 

    Barford, Paul; Ng, Tony (2023-11-30)
    The Workshop on Emerging Research Opportunities at the Intersection of Statistics and Internet Measurement was held in January 2023 at Boston University. The goal of the workshop was to bring together Internet measurement ...
  • Learning from Code and Non-Code Artifacts 

    Henkel, Jordan (2022-08-17)
    Three things are fundamentally true about software: (i) every day that passes we, as a society, generate more software (more code, more documentation, and more software-related artifacts of all kinds), (ii) it is easier ...
  • Feature Significance Analysis of the US Adult Income Dataset 

    Chen, Junda (2021-09-01)
    In this paper, we analyze the classic US Adult Income Dataset using logistics regression and random forest to analyze potential factors that contribute to income bias for the 50Kincome bracket(income ≥ 50K per year). Using ...
  • CS532 Course Project Activity - Climate Data Fitting and Local Warming Justification 

    Chen, Junda; Zhao, Haoruo; Duan, Doris (2021-09-01)
    The project introduce a simplified model to justify whether global warming is truly an issuein the current society. Student will first intensify their knowledge aboutBasis Matrix– itsconstruction and its application to the ...
  • Explore Optimal Degree of Parallelism for Distributed XGBoost Training 

    Chen, Junda; Akash, Aditya Kumar; Suzuki, Yukiko (2021-09-01)
    The XGBoost has been an extremely popular and effective machine learning method which gained its fame throughwinning multiple Kaggle competitions. One of its strengths lies in parallel processing which makes the ...
  • Research review on AI and Machine learning related works 

    Zhao, Wei (2021-03)
    In this review, several innovative methods and the associated system were created for agriculture navigation, mapping, object detection, and related vehicles technologies. Based on the experiences with these applications, ...
  • Project Report for Image Quilting Reproduction 

    Qi, Bozhao (2021-01)
    The demand for high-quality images has been greater in computer graphics and computer vision. Image-based rendering is playing bigger roles in video games, for instance. Rather than creating the whole physical world from ...
  • Prioritize Winter Crash Severity Influencing Factors in US Midwestern for Autonomous Vehicle 

    Dai, Shenghong (2020-02-28)
    Adverse weather conditions in winter have significant impacts on crash occurrences and risks. Human drivers can adjust driving styles based on the context information of the surrounding traffic and environments. Similar ...
  • A New Semantic Approach on Yelp Review-star Rating Classification 

    Wu, Shuang; Wang, Xiaodong; Qi, Bozhao (2020-02-26)
    This paper introduces a new semantic approach for yelp review star rating prediction. Our approach extracts feature vectors from user reviews to develop star prediction models. User review text contains detailed information ...
  • Sensor-Based Risk Perception for Drivers Under Adverse Environment 

    Zhao, Wei (2018-08-18)
    Due to factors such as snow and ice impeding drivers’ vision, automobile accidents significantly rise during winter months. This study sets forth an automated evaluation network of the Risk Perceived Ability (RPA) for ...
  • Dynamic Query Re-Planning Using QOOP 

    Mahajan, Kshiteej; Chowdhury, Mosharaf; Akella, Aditya; Chawla, Shuchi (2018-09-27)
    Modern data processing clusters are highly dynamic – both in terms of the number of concurrently running jobs and their resource usage. To improve job performance, recent works have focused on optimizing the cluster scheduler ...
  • Exploration on Deep Drug Discovery: Representation and Learning 

    Liu, Shengchao (2018-09-20)
    Virtual (computational) high-throughput screening provides a strategy for prioritizing compounds for experimental screens, but the choice of virtual screening algorithm depends on the dataset and evaluation strategy. We ...
  • A Formula That Generates Hash Collisions 

    Brockmann, Andrew (2018-08-08)
    We present an explicit formula that produces hash collisions for the Merkle-Damgard construction. The formula works for arbitrary choice of message block and irrespective of the standardized constants used in hash functions, ...
  • Overreliance on Classical Computing in Quantum Factorization 

    Brockmann, Andrew (2018-08-08)
    A 2012 quantum experiment factored 143 after performing some simplifications classically. Further research demonstrated that that experiment arguably performed the quantum factorizations of other numbers too, such as 56153. ...
  • Deep Learning for Entity Matching: A Design Space Exploration 

    Mudgal Sunil Kumar, Sidharth (2018-05-15)
    Entity matching (EM) finds data instances that refer to the same real-world entity. In this thesis we examine applying deep learning (DL) to EM, to understand DL's benefits and limitations. We review many DL solutions that ...
  • Error Backprojection Algorithms for Non-Line-of-Sight Imaging 

    La Manna, Marco; Kine, Fiona; Breitbach, Eric; Jackson, Jonathan; Velten, Andreas (2017-10-18)
    Recent advances in computer vision and inverse light transport theory have resulted in several non-line-of-sight imaging techniques. These techniques use photon time-of-flight information encoded in light after multiple, ...
  • What Are Optimal Coding Functions for Time-of-Flight Imaging? 

    Gupta, Mohit; Velten, Andreas; Nayer, Shree; Breitbach, Eric (2017-06-26)
    The depth resolution achieved by a continuous wave time-of-flight (C-ToF) imaging system is determined by the coding (modulation and demodulation) functions that it uses. Almost all current C-ToF systems use sinusoid or ...
  • Quickstep: A Data Platform Based on the Scaling-In Approach 

    Jignesh M. Patel; Harshad Deshmukh; Jianqiao Zhu; Hakan Memisoglu; Navneet Potti; Saket Saurabh; Marc Spehlmann; Zuyu Zhang (2017-06-19)
    Modern servers pack enough storage and computing power that just a decade ago was spread across a modest- sized cluster. This paper presents a prototype system, called Quickstep, to exploit the large amount of paral- lelism ...
  • A New Abstraction Framework for Affine Transformers 

    Reps, Thomas; Sharma, Tushar (2017-05-16)
    Abstract. This paper addresses the problem of abstracting a set of affine transformers v' = v C + d, where v and v' represent the pre-state and post-state, respectively. We introduce a framework to harness any base abstract ...

View more