Red Hat Research Quarterly

Spring 2026

Highlights from this issue

Dan Alistarh standing among the racks in a datacenter. Photo credit: Johannes-Hloch

Finding the next breakthrough: AI efficiency research from one of the world’s top optimization labs

The Deep Partnership panel at the second annual NSF National AI Research Resource (NAIRR) meeting

Pushing the boundaries of AI development: a shared national AI research infrastructure

Where AI meets secure coding: inside SEMLA’s ambition for more resilient software

Finding the next breakthrough: AI efficiency research from one of the world’s top optimization labs

Nir Shavit interviews ISTA professor Dan Alistarh on quantization, sparsity, and the next frontiers in efficiency research.

A shared national AI research infrastructure may be coming to a galaxy not so far away

Where AI meets secure coding: inside SEMLA’s ambition for more resilient software using LLMs

Volume 7, Issue 4 • ISSN 2691-5278

Departments

Pushing the boundaries of AI development

RISC-V AI workshop with Red Hat, DeepComputing hosted in Boston

In this issue

Features

Where AI meets secure coding: inside SEMLA’s ambition for more resilient software

Finding the next breakthrough: AI efficiency research from one of the world’s top optimization labs

Enhanced observability makes optimizing LLM inference performance easier

Developing AI telemetry, digital twins, and other data-driven websites with SPINE Programming Theory

Inside this issue

News

RISC-V AI workshop with Red Hat, DeepComputing hosted in Boston

Jeffrey (Jefro) Osier-Mixon

RISC-V is an increasingly prevalent hardware architecture in embedded systems, and it’s beginning to serve as the base architecture for many new AI accelerators. It is an instruction set architecture (ISA) with roots similar to Arm and other RISC-based architectures, but with a key difference: RISC-V is developed by a community-based standards organization, RISC-V International, […]

Feature

Where AI meets secure coding: inside SEMLA’s ambition for more resilient software

Simone Ferlin-Reiter

The industry-academia collaboration aimed at using LLMs to help generate more secure code builds on its success to expand research into infrastructure. In an era when software underpins everything from critical communications and global financial systems to lifesaving medical devices, security and reliability can never be an afterthought. Yet traditional development practices often leave gaps: […]

From the Director

Pushing the boundaries of AI development

Heidi Dempsey

A shared national AI research infrastructure may be coming to a galaxy not so far away. Human time scales are slow—really slow. In the time it takes to type that sentence, one of the H100 GPUs powering a nearby academic datacenter has roughly 10 billion cycles to consider its place in the universe. Of course, […]

Interview

Finding the next breakthrough: AI efficiency research from one of the world’s top optimization labs

Nir Shavit

Dan Alistarh is a humble guy. The ISTA (Institute of Science and Technology Austria) professor and founding employee of Neural Magic (the startup acquired by Red Hat in 2025) isn’t one to brag, but fortunately we called in his former postdoc advisor, MIT professor and Neural Magic cofounder Nir Shavit, to really draw […]

Feature

Enhanced observability makes optimizing LLM inference performance easier

Isaiah Stapleton

More metrics and more dashboards mean more ways for researchers to identify actionable improvements. Optimizing the performance, stability, and resource utilization of large language model (LLM) deployments is a challenge for both users and cluster administrators. The Mass Open Cloud (MOC) now supports the ability to collect inference performance metrics for LLMs deployed in our […]

From the Editor

In this issue

Shaun Strohmer

Often in RHRQ, we look at the work of people discovering new frontiers in technology, but what I hope makes us different from other technology journals is that we’re also asking how open source and open practices can make these discoveries accessible to people in the real world. This issue’s interview spotlights Nir Shavit and […]

Feature

Developing AI telemetry, digital twins, and other data-driven websites with SPINE Programming Theory

Christopher Tate

Dewayne Branch

Denis Poussard

Developers using SPINE Programming have drastically cut manual coding time while maintaining full control over their data. SPINE Programming Theory (SPT) is a form of on-device, local AI code indexing and generation that accelerates software development while ensuring that users maintain full control over their data in their own environment. SPT allows developers to focus […]
