Red Hat Research Quarterly

Test case prioritization: towards high-reliability continuous integration

Ilya Kolchinsky

Ilya Kolchinsky is a research scientist with Red Hat Research, specializing in the various aspects of AI-based system optimization. He has a PhD and BS in Computer Science, both from Technion, Israel Institute of Technology. His past and present research interests include cloud optimization, ML-driven resource management in containerized deployments, pattern mining in streaming data, stream and complex event processing optimization, distributed systems, automatic software testing/debugging, anomaly detection, and more.

Related Projects

Test Case Prioritization: Towards Efficient and Reliable Continuous Integration

Article featured in

Red Hat Research Quarterly

November 2021

A new project seeks to make effective testing more compatible with expeditious releases

In August 2021, a team of graduate students from IDC Herzliya, a leading research college in Israel, began working on a test case prioritization (TCP) project under the guidance of senior engineers from Red Hat.

The goal of the TCP project is to create a novel tool based on machine learning (ML) that solves the test case prioritization problem in software regression testing. Automatic regression testing is a crucial step of any CI/CD pipeline. The primary purpose of this testing is to detect bugs and defects introduced by recent changes as early as possible while keeping verification costs low. The ability to perform regression testing efficiently and effectively (i.e., within a small time frame, yet catching the majority of bugs) would allow developers to rapidly deliver reliable software updates to users.

Unfortunately, the regression testing process in modern large-scale software products tends to be too complex and cumbersome to meet this objective. As a software product grows, its test suite grows with it and requires more time and resources to execute in full. In many cases, running the entire test suite can take three or four days, or even a week. Consequently, executing all available tests during the CI/CD regression testing procedure is highly impractical and, in many cases, completely infeasible.

TCP methods have emerged to address this issue. TCP aims to order a given set of test cases such that the earlier a test appears in the resulting order, the higher the probability that it will detect a bug or fault introduced by the given code change. Given such an ordering of the entire test suite, the regression testing procedure can iterate over it from the beginning and continue until a predefined limit on the maximal testing time (say, one minute) is reached. Because the test cases most likely to reveal a fault are executed first, the chances of early fault detection rise even when only a small fraction (say, 1%) of the entire test suite is executed.
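The procedure described above can be illustrated with a minimal Python sketch. It is not the project's implementation: the `CandidateTest` class, the `failure_probability` scores, the `duration_seconds` estimates, and the sample test names are all hypothetical and chosen only for illustration. The sketch assumes each test already has a score (for example, produced by some model) and a known approximate run time.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CandidateTest:
    name: str
    failure_probability: float  # hypothetical score: chance this test exposes a fault
    duration_seconds: float     # hypothetical estimate of the test's run time


def prioritize(tests: List[CandidateTest]) -> List[CandidateTest]:
    """Order tests so that the most likely fault detectors come first."""
    return sorted(tests, key=lambda t: t.failure_probability, reverse=True)


def select_within_budget(ordered: List[CandidateTest],
                         budget_seconds: float) -> List[CandidateTest]:
    """Walk the ordering from the start and stop once the time limit would be exceeded."""
    selected: List[CandidateTest] = []
    elapsed = 0.0
    for test in ordered:
        if elapsed + test.duration_seconds > budget_seconds:
            break
        selected.append(test)
        elapsed += test.duration_seconds
    return selected


# Made-up suite used only to exercise the functions above.
suite = [
    CandidateTest("test_login", 0.05, 20.0),
    CandidateTest("test_checkout", 0.60, 30.0),
    CandidateTest("test_search", 0.30, 25.0),
    CandidateTest("test_profile", 0.10, 15.0),
]

ordered = prioritize(suite)
chosen = select_within_budget(ordered, budget_seconds=60.0)
print([t.name for t in chosen])  # ['test_checkout', 'test_search']
```

With a 60-second budget, only the two highest-scored tests fit. The quality of the result depends entirely on how well the scores predict real failures, which is the part a TCP tool has to learn.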

In recent years, TCP solutions have been adopted by major industry players and have spawned a wave of academic research projects. However, the full potential of TCP in regression testing is yet to be explored. At present, the most widely used approaches are based mainly on heuristic search strategies and/or code coverage methods. In contrast, methods based on machine learning in general, and deep learning in particular, remain largely unexplored. Closing this gap and devising an ML-based TCP tool is the primary goal of this project.
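For readers unfamiliar with coverage-based prioritization, the following Python sketch shows one common heuristic of that family: a greedy ordering that repeatedly picks the test covering the most code units not yet covered by earlier picks. It is a simplified illustration, not a description of any specific tool mentioned in this article. The function name `greedy_coverage_order`, the coverage map, and the unit names are hypothetical.

```python
from typing import Dict, List, Set


def greedy_coverage_order(coverage: Dict[str, Set[str]]) -> List[str]:
    """Order tests by how much new coverage each one adds (greedy heuristic).

    `coverage` maps a test name to the set of code units (for example,
    functions) that the test exercises. Ties are broken alphabetically.
    """
    remaining = dict(coverage)
    covered: Set[str] = set()
    order: List[str] = []
    while remaining:
        # max() returns the first maximal element, so iterating over sorted
        # names makes the tie-breaking deterministic.
        best = max(sorted(remaining),
                   key=lambda name: len(remaining[name] - covered))
        order.append(best)
        covered |= remaining.pop(best)
    return order


# Made-up coverage data used only for illustration.
coverage_map = {
    "test_a": {"f1", "f2"},
    "test_b": {"f2", "f3", "f4"},
    "test_c": {"f1"},
    "test_d": {"f5"},
}

print(greedy_coverage_order(coverage_map))
# ['test_b', 'test_a', 'test_d', 'test_c']
```

A heuristic like this needs only coverage data and no history of past failures, which helps explain its popularity; an ML-based approach would instead aim to learn the ordering from data.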

The project team is now progressing towards the first major milestone: implementing an open source proof-of-concept prototype that replicates the state-of-the-art academic results in this area. This milestone is expected to be reached by November 2021. As an immediate next step, the team will be looking to improve the state of the art. To that end, a variety of machine learning and data mining methods will be considered.

Those interested in finding out more about the TCP project and/or looking for collaboration opportunities are kindly invited to contact Dr. Ilya Kolchinsky (ikolchin@redhat.com) or Gil Klein (gklein@redhat.com).
