Red Hat Research Quarterly

Managing large-scale systems

Hugh Brock

Hugh Brock is the Research Director for Red Hat, coordinating Red Hat research and collaboration with universities, governments, and industry worldwide. A Red Hatter since 2002, Hugh brings intimate knowledge of the complex relationship between upstream projects and shippable products to the task of finding research to bring into the open source world.

Article featured in

Red Hat Research Quarterly

February 2021

I have been spending a lot of time lately thinking about all the hard problems involved in managing large-scale systems. Why? Well, it turns out to be a really important topic for Red Hat Research and for the Red Hat engineering community that we hope to serve. If we are correct that operating large-scale systems will necessarily be the domain of “expert systems” with AI, then we need to understand exactly what we mean by “operating,” at a minimum.

I tend to approach these kinds of issues from a typical engineering standpoint. How can I construct the “plumbing” that allows me to get decent data out of a system, in the form of logs, events, metering, and so on? And how can I then add the appropriate controls that let someone or something in possession of that decent data do something useful with it? Unfortunately, as hard as this problem is, it turns out to be just the tip of the iceberg. Sanjay Arora’s interview with computer vision expert Kate Saenko, our cover story for this issue, focuses on the difficulty of training models, neural networks, and the like so that they are generalizable and not “biased”—biased in the sense that they are unable to tell that an orange hanging from a tree in sunlight is the same object as one sitting in a bowl of fruit in candlelight. This lack of generality also affects the AI we train to control systems. It will be very interesting to see what needs to happen over time before we can really trust a robot to know which tuning knob to turn to keep a mission-critical compute cluster running.

A related problem with large-scale systems arises simply because of the quantity of data they generate and the expense of moving all those bits around. For any reasonably large system, some degree of processing will need to take place close to where the data is collected, so that a smaller amount can be sent on to a central processor. Red Hatter Rui Vieira’s article on using Bayesian inference on streaming data is a very deep look at the different methods available to approximate and reduce a very large data flow. I hope to see applications of his work soon.

In addition to training models, we spend a lot of time in this issue on the different ways we train human beings. Check out Tomáš Effenberger’s piece on using microworlds and puzzles to teach kids programming; it’s absolutely fascinating (and almost certainly more effective than the Fortran books I read at age 12). We don’t stop with kids, either. Petr Viktorin writes in this issue about establishing a Python training program for adult women. Through the program he developed, Petr helped a lot of people understand programming, and in return learned a lot from them about agency and motivation. Like children, adults have lots of different reasons to learn. Fortunately, both children and adults learn better than machines, for now at least.
