RHRQ first looked at the Unikernel Linux (UKL) project—a joint effort involving professors, PhD students, and engineers at the Boston University-based Red Hat Collaboratory—almost two years ago (RHRQ 3:3, November 2021). This previous article covered the background of unikernels in detail, but in brief: an application links directly to a specialized kernel, a lightly modified version of Linux in this case, so that the resulting program can boot and run on its own. Unikernels have demonstrated significant advantages in boot time, security, resource utilization, and I/O performance. They enable those advantages by linking the application and kernel together in the same address space.
UKL’s focus to date has been on minimizing changes both to the Linux kernel and to applications. By reusing Linux, we gain the advantages of Linux for free, especially wide driver support. We also studied the performance and latency characteristics of the final unikernels to see if making small, targeted changes could provide benefits.
The significant progress made by this project was detailed at the Eighteenth European Conference on Computer Systems (EuroSys ’23), May 8–12, 2023, in Rome, Italy, and published in the conference’s proceedings. Here are some of the highlights.
The Unikernel Linux (UKL) project started as an effort to exploit Linux’s configurability to create a new unikernel in a fashion that would avoid forking the kernel. A unikernel taking this approach could support a wide range of Linux applications and hardware while becoming a standard part of the ongoing investment by the Linux community. Our experience has led us to a more general goal: creating a kernel that can be configured to span the spectrum between a general-purpose operating system, amenable to a large class of applications, and a highly optimized, possibly application- and hardware-specialized, unikernel.
Work to date has demonstrated that we can integrate unikernel techniques into a general-purpose operating system in a way that avoids forking it. It has also demonstrated performance gains. We think that most applications would run unchanged under these techniques at parity with, or slightly faster than, standard Linux. With relatively little effort, targeted changes to the kernel can achieve significant gains.
A spectrum of capabilities
If we enable a base model UKL configuration (requiring 550 lines of code changes to Linux) in the kernel, we're starting at the general-purpose end of the spectrum. This simplest configuration of UKL supports most applications, albeit with only modest (5%) performance advantages.
Like many unikernels, UKL is a single application that is statically linked with the kernel and executed in supervisor mode. However, the base model of UKL preserves most of the capabilities of Linux, including a separate pageable application portion of the address space and a pinned kernel portion, distinct execution modes for application and kernel code, and the ability to run multiple processes. The main changes are that system calls are replaced by function calls and application code is linked with kernel code and executes in kernel mode.
As a result, this base model provides an avenue toward supporting all hardware and applications of the original kernel and the entire Linux ecosystem of tools for deployment, debugging, and performance tuning—which has been very useful in the course of this research. It also allows a developer to run “perf” directly inside the unikernel to collect performance information and feed that back into changes they make to the application to improve performance.
For more effort but with potentially more gain, a developer can move along the spectrum toward a specialized unikernel. A larger set of configuration options (1,250 lines of code changes total) may improve performance but will not work for all applications. Once an application is running, a developer can easily explore a number of configuration options that, while not safe for all applications, may be safe and offer performance advantages for their application.
One configuration bypasses the entry/exit code, which usually executes whenever control transitions between application and kernel through system calls, interrupts, and exceptions. Running the entry/exit code can get expensive for applications making many small kernel requests. The developer can also select between two UKL configurations that avoid stack switches, each appropriate for a different class of applications.
Knowledgeable developers can also (or alternatively) improve performance by modifying the application to call internal kernel routines and violating, in a controlled fashion, the standard assumptions and invariants of kernel versus application code. For example, they may be able to assert that only one thread is accessing a file descriptor and avoid costly locking operations.
To understand the implications of UKL’s design for applications, we evaluated it with Redis, a widely used in-memory database. We saw two clear opportunities for performance improvement. First, we saw that we could shorten the execution path by bypassing the entry and exit code for read and write system calls and invoking the underlying functionality directly. We also observed that, for TCP sockets, read and write calls eventually translate into tcp_recvmsg and tcp_sendmsg, respectively. This led us to create a shortcut that enabled an application like Redis, which always uses TCP, to call the underlying routines directly. Only 10 lines of code were needed to implement this shortcut.
By taking advantage of these optimizations, we found that Redis throughput could be increased by up to 26% relative to standard Linux, whereas the UKL base model alone improved throughput by only 1.6%.
In addition to some cleanup work, such as rebasing to the latest kernel, glibc (which also requires code changes), and gcc, near-term work will focus on getting the project into the hands of more developers. The first step is adding the packages to the Fedora COPR service. The lengthy work of splitting up the Linux patches, authoring good commit messages, and checking that they pass Linux standards and tests is currently being done by Eric Munson at Boston University. After this is complete, we will submit them again to the Linux kernel community for comment.
The goal is, over time, to work with the community to add the changes to the Linux kernel as the current work is proven out and determined to be useful. In parallel with working with the kernel community, we need to demonstrate that the patches are useful in practice. To that end, we will work with other companies that have workloads requiring the highest performance and lowest latencies. We’re currently looking for additional partners, both companies and individuals, who would like to try out their applications with UKL. Most plain C/C++ applications with few dependencies that already work on Linux can be ported to UKL in an afternoon.
While we have been working on UKL since around 2018, other technologies occupying a similar space have come along, especially io_uring and eBPF.
io_uring is interesting because it amortizes syscall overhead. eBPF is interesting because it’s another way to run code in kernel space (albeit for a very limited definition of “code”). How do these approaches compare to UKL? We will be talking to developers who use these technologies to explore that question.