Red Hat associates from locations across North America collaborate with primarily North American-based researchers on many research projects. In addition to long-standing formal arrangements with Boston University, the Mass Open Cloud Alliance, and the University of Massachusetts, we support student and faculty research and open source development work for undergraduates, Master’s, and PhD students. We also teach classes, mentor students, deliver technology workshops, and support outreach programs that improve diversity in computer science and engineering.
If you are a student interested in a project opportunity, please contact us. If you are a Red Hatter interested in submitting a project, please copy this template and email your idea to Heidi Dempsey, Research and Innovation Director, North America.
The North America RIG typically meets the first Tuesday of each month from 3-4 PM EST. Details on the upcoming meeting and prior meetings are included below. And mark your calendar for future planned meetings through 2022.
- September 20 (off-cycle date)
- October 4
- November 1
- December 6
Catch recordings of meetings on the North America Research Interest Group YouTube Channel.
Date: September 20, 2022
Red Hat Collaboratory at Boston University Project – Temporal graph analytics on Apache Flink Stateful Functions
Title: Temporal graph analytics on Apache Flink Stateful Functions
Speaker: Vasia Kalavri, Boston University
Abstract: In this talk, I will present our progress towards building an open-source temporal graph analytics framework on top of Apache Flink Stateful Functions (AFSF). Our design aims to bridge the gap between (i) graph databases, that provide optimized storage and indexing to support efficient ad-hoc queries, and (ii) graph processing systems, that employ distributed computation to perform complex analytics on very large graphs. I will give an overview of AFSF, explain how we leverage its features for temporal graph analytics, and describe how we have used its actor-like APIs to implement common graph algorithms, like PageRank and connected components. Finally, I will share initial performance results, discuss our next steps, and present research opportunities in bringing graph analytics to serverless platforms.
Project Page: Serverless Streaming Graph Analytics
North America RIG Meetings Archive
Boston University (BU) has long been a big part of the Red Hat Research program and Red Hat’s academic collaboration generally. On April 27, the two organizations took that collaboration up a level.
Red Hat took part in a virtual panel hosted by MIT for students interested in doing research for high tech companies
MIT hosted a virtual panel for graduate students interested in doing research for high tech companies on January 27, 2021 as part of the Institute’s Independent Activities Period. The “PhD Careers in Tech” event, which covered many different fields of graduate study in both sciences and engineering, as well as general entrepreneurship, included panelists from diverse industries who shared key insights, spoke about skills required for success, how to navigate the interview process and how to build career paths in industry.
Title Summary Research Area Universities research_area_hfilter Understanding accuracy decay in online image retrieval systems within the context of open-set classification and unsupervised clustering Image retrieval systems are extremely useful to political scientists and human rights advocates attempting to understand the scope and spread of disinformation in massive datasets. However, in standard image retrieval tasks the corpus of images is unchanging as time moves forward. When considering online disinformation this is clearly not the case. Image retrieval in an online system can essentially be modeled as an open-set problem, where there is no guarantee that the classes of images seen before will have any correspondence to the classes of images seen at present or in the future. AI-ML, Cloud-DS University of Notre Dame ai-ml cloud-ds Automated detection of memory safety vulnerabilities in Rust In comparison to C, the Rust language provides significant memory safety guarantees through its concept of lifetimes and its borrow-checker. … Testing and Ops Columbia University testing-and-ops Tuning the Linux kernel The Linux kernel is a complicated piece of software with multiple components interacting with each other in complex ways. The … AI-ML, Hardware and the OS ai-ml hardware-and-the-os Disinformation Detection at Scale The increased prevalence of fake and manipulated visual media on the Internet has led to social and technical dilemmas in … AI-ML, Security, Privacy, Cryptography UNICAMP - Universidade Estadual de Campinas, University of Notre Dame ai-ml security-privacy-cryptography AI for Cloud Ops This project aims to address this gap in effective cloud management and operations with a concerted, systematic approach to building and integrating AI-driven software analytics into production systems. We aim to provide a rich selection of heavily-automated “ops” functionality as well as intuitive, easily-accessible analytics to users, developers, and administrators AI-ML, Cloud-DS, Hardware and the OS Boston University ai-ml cloud-ds hardware-and-the-os Creating a global open research platform to better understand social sustainability using data from a real-life smart village In this project, a team of BU faculty will team up with Red Hat researchers and with SmartaByar, an organization … AI-ML, Cloud-DS, Security, Privacy, Cryptography Boston University ai-ml cloud-ds security-privacy-cryptography DISL: A Dynamic Infrastructure Services Layer for Reconfigurable Hardware BU faculty member Martin Herbordt will work with Red Hat researchers Uli Drepper and Ahmed Sanaullah to create a generic … Cloud-DS, Hardware and the OS Boston University cloud-ds hardware-and-the-os Practical Programming of FPGAs with Open Source Tools This project has evolved from the Practical programming of FPGAs in the data center and on the edge project. Please see … Cloud-DS, Hardware and the OS Boston University cloud-ds hardware-and-the-os Near-Data Data Transformation BU faculty members Manos Athanassoulis and Renato Mancuso will work with Red Hat researchers Uli Drepper and Ahmed Sanaullah to create a hardware-software co-design paradigm for data systems that implements near-memory processing. Cloud-DS, Hardware and the OS Boston University cloud-ds hardware-and-the-os Towards high performance and energy efficiency in open-source stream processing. BU faculty members Vasiliki Kalavari and Jonathan Appavoo will work with Red Hat researcher Sanjay Arora to create an open-source … Hardware and the OS Boston University hardware-and-the-os OSMOSIS: Open-Source Multi-Organizational Collaborative Training for Societal-Scale AI Systems The goal of our project is to develop a novel framework and cloud-based implementation for facilitating collaboration among highly heterogeneous research, development, and educational settings. AI-ML, Cloud-DS Boston University ai-ml cloud-ds Privacy-Preserving Cloud Computing using Homomorphic Encryption In today’s data-driven world, a large amount of data is collected by billions of devices (cell phones, autonomous cars, handheld … Cloud-DS, Hardware and the OS, Security, Privacy, Cryptography Boston University cloud-ds hardware-and-the-os security-privacy-cryptography Serverless Streaming Graph Analytics In this project, we will focus on graph streams that can be used to model distributed systems, where workers are represented as nodes connected with edges that denote communication or dependencies. Cloud-DS, Testing and Ops Boston University cloud-ds testing-and-ops Enabling Intelligent In-Network Computing for Cloud Systems With the network infrastructure becoming highly programmable, it is time to rethink the role of networks in the cloud computing … Cloud-DS, Testing and Ops Boston University cloud-ds testing-and-ops Linux Computational Caching In this speculative work we are attempting to explore a biologically motivated conjecture on how memory of past computing can be stored and recalled to automatically improve a system’s behavior. AI-ML, Cloud-DS, Hardware and the OS Boston University ai-ml cloud-ds hardware-and-the-os Foundations in Open Source Education In this project we are developing an exemplar set of materials for an introductory computers systems class that exploits, Jupyter, Jupyter Books, OpenShift and the the Mass Open Cloud to develop and deliver a unique educational experience for learning about how computer systems work. Cloud-DS Boston University cloud-ds Symbiotes: A New step in Linux’s Evolution This work explores how a new kind of software entity, a symbiotie, might bridge this gap. By adding the ability for application software to shed the boundary that separates it from the OS kernel it is free to integrate, modify and evolve in to a hybrid that is both application and OS. Hardware and the OS, Security, Privacy, Cryptography Boston University hardware-and-the-os security-privacy-cryptography Intelligent Data Synchronization for Hybrid Clouds The goal of this project is to design configurable synchronization solutions on a common platform for a wide range of edge computing scenarios relevant to Red Hat. These solutions will be thoroughly validated on a state-of-the-art testbed capable of emulating realistic environments (e.g., smart cities). AI-ML, Cloud-DS, Testing and Ops Boston University ai-ml cloud-ds testing-and-ops Secure cross-site analytics on OpenShift logs The project aims to explore whether cryptographically secure Multi-Party Computation, or MPC for short, can be used to perform secure cross-site analytics on OpenShift logs with minimum client participation. Cloud-DS, Security, Privacy, Cryptography, Testing and Ops Boston University cloud-ds security-privacy-cryptography testing-and-ops Robust Data Systems Tuning BU faculty members Manos Athanassoulis and Evimaria Terzi will work on building a new robust tuning framework for LSM-based data … AI-ML, Hardware and the OS Boston University ai-ml hardware-and-the-os Robust LSM-Trees Under Workload Uncertainty We introduce a new robust tuning paradigm to aid in the design of data systems with uncertain assumptions by modeling the behavior of the system and then utilizing these models in conjunction with techniques in robust optimization. Our approach is demonstrated through tuning a popular log-structured merge-tree based storage engine, RocksDB Hardware and the OS Boston University hardware-and-the-os Does efficient, private, agnostic learning imply efficient, agnostic online learning? Users of online services today must trust platforms with their personal data. Platforms can choose to enable privacy by default … Boston University Are Adversarial Attacks a Viable Solution to Individual Privacy? Users of online services today must trust platforms with their personal data. Platforms can choose to enable privacy by default … Security, Privacy, Cryptography Boston University security-privacy-cryptography Workflow-Centric Tracing for Cloud Applications Workflow-centric tracing allows traces (i.e., graphs) of requests’ workflows to be constructed by stitching together trace points with the same request context. Three collaboratory projects focus on improving the observability and diagnosability of Red Hat products using this technique. Cloud-DS Boston University, Northeastern University cloud-ds Hybrid Cloud Caching A fundamental goal of the Hybrid Cloud Cache project is to allow simplified integration into existing data lakes, to enable caching to be transparently introduced into hybrid cloud computation, to support efficient caching of objects widely shared across clusters deployed by different organizations, and to avoid the complexity of managing a separate caching service on top of the data lake Boston University, Northeastern University Volume Storage Over Object Storage This project creates a hybrid storage system composed of a high-speed local device (e.g. Optane) to store short term data, along with a write-once object store (e.g, Ceph RGW) to store data blocks permanently. Cloud-DS Boston University, Northeastern University cloud-ds Kariz Cache Prefetching and Management Kariz is a caching system that works closely with analytic frameworks scheduler to find the best caching policy for the current running application. Cloud-DS Boston University, Northeastern University cloud-ds Elastic Secure Infrastructure This project encompasses work in several areas to design, build and evaluate secure bare-metal elastic infrastructure for data centers. Cloud-DS, Security, Privacy, Cryptography, Testing and Ops Boston University cloud-ds security-privacy-cryptography testing-and-ops Open Cloud Testbed The Open Cloud Testbed project will build and support a testbed for research and experimentation into new cloud platforms – the underlying software which provides cloud services to applications. Testbeds such as OCT are critical for enabling research into new cloud technologies – research that requires experiments which potentially change the operation of the cloud itself. AI-ML, Cloud-DS, Hardware and the OS, Security, Privacy, Cryptography, Testing and Ops Boston University, Northeastern University, UMass Amherst ai-ml cloud-ds hardware-and-the-os security-privacy-cryptography testing-and-ops Ceph Storage This research project is investigating how Ceph compression and erasure coded pools could optimize Prometheus tsdb storage. Cloud-DS UMass Lowell cloud-ds Implementing Secure Multi-Party Computing Secure Multiparty Computation (MPC) is a cryptographic primitive that allows several parties to jointly and privately compute desired functions over secret data. Building and deploying practical MPC applications faces several obstacles, including performance overhead, complicated deployment and setup procedures, and adoption of MPC protocols into modern software stacks. MPC applications expose trade-offs between efficiency and privacy that may be hard to reason about, formally characterize, and encode in a protocol design or implementation. Cloud-DS, Security, Privacy, Cryptography Boston University cloud-ds security-privacy-cryptography Outfitting QEMU/KVM with Partitioning Hypervisor Functionality This project extends the virtualization capabilities of QEMU and KVM by adding partitioning hypervisor functionality. With this implementation, hardware resources can be exclusively assigned to specific tasks and VMs. Current work supports KVM Isolation IOCTLs to query CPUs to find isolated CPUs. Hardware and the OS Boston University hardware-and-the-os An Optimizing Operating System: Accelerating Execution With Speculation To optimize performance, Automatically Scalable Computation (ASC), a Harvard/BU collaboration attempts to auto-parallelize single threaded workloads, reducing any new effort required from programmers to achieve wall clock speedup. SEUSS takes a different approach by splicing a custom operating system into the backend of a high throughput distributed serverless platform, Apache OpenWhisk. SEUSS uses an alternative isolation mechanism to containers, called Library Operating Systems (LibOSs). Cloud-DS, Hardware and the OS Boston University cloud-ds hardware-and-the-os Kernel Techniques to Optimize Memory Bandwidth with Predictable Latency Recent processors have started introducing the first mechanism to monitor and control memory bandwidth. Can we use these mechanisms to enable machines to be fully used while ensuring that primary workloads have deterministic performance? This project presents early results from using Intel’s Resource Director Technology and some insight into this new hardware support. The project also examines an algorithm using these tools to provide deterministic performance on different workloads. Hardware and the OS Boston University hardware-and-the-os Unikernel Linux This project aims to turn the Linux kernel into a unikernel with the following characteristics: 1) are easily compiled for any application, 2) use battle-tested, production Linux and glibc code, 3) allow the entire upstream Linux developer community to maintain and develop the code, and 4) provide applications normally running vanilla Linux to benefit from unikernel performance and security advantages. Hardware and the OS Boston University hardware-and-the-os Code2Vec: Learning code representations This project analyzed semantic similarities of learned code embeddings parsed from open source python libraries such as numpy, pandas and sklearn. Still in progress is another analysis that learns code embeddings in a supervised manner with the C++ codebase for performance measurement of program execution in CPU with performance counters (e.g. LLC misses to L1 requests, Cycles Per Instruction). AI-ML Boston University ai-ml Fuzzing Device Emulation in QEMU Hypervisors—the software that allows a computer to simulate multiple virtual computers—form the backbone of cloud computing. Because they are both ubiquitous and essential, they are security-critical applications that make attractive targets for potential attackers. Hardware and the OS, Security, Privacy, Cryptography, Testing and Ops Boston University hardware-and-the-os security-privacy-cryptography testing-and-ops D3N: A Multi-Layer Cache for Data Centers This project designs and develops D3N, a novel multi-layer cooperative caching architecture that mitigates network imbalances by caching data on the access side of each layer of hierarchical network topology. A prototype implementation, which incorporates a two-layer cache, is highly-performant (can read cached data at 5GB/s, the maximum speed of our SSDs) and significantly improves the performance of big-data jobs. Cloud-DS Boston University, Northeastern University cloud-ds Practical programming of FPGAs in the data center and on the edge FPGAs are now essential components in the data center and on the edge with millions deployed. FPGAs are found in a wide variety of system elements and provide such critical functions as SDN, encryption/decryption, and compression. Yet for nearly all system providers, much less system users, programming these FPGAs is impossible. Our overall goal is to enable FPGA application development by High Level Language (HLL) programmers, especially for the data center and the edge, and exclusively using existing open-source tools. AI-ML, Cloud-DS, Hardware and the OS Boston University ai-ml cloud-ds hardware-and-the-os Automatic Configuration of Complex Hardware In this project, we pursue three goals towards this understanding: 1) identify, via a set of microbenchmarks, application characteristics that will illuminate mappings between hardware register values and their corresponding microbenchmark performance impact, 2) use these mappings to frame NIC configuration as a set of learning problems such that an automated system can recommend hardware settings corresponding to each network application, and 3) introduce either new dynamic or application instrumented policy into the device driver in order to better attune dynamic hardware configuration to application runtime behavior. Hardware and the OS Boston University hardware-and-the-os Quest-V, a Partitioning Hypervisor for Latency-Sensitive Workloads Quest-V is a separation kernel that partitions services of different criticality levels across separate virtual machines, or sandboxes. Each sandbox encapsulates a subset of machine physical resources that it manages without requiring intervention from a hypervisor. In Quest-V, a hypervisor is only needed to bootstrap the system, recover from certain faults, and establish communication channels between sandboxes. Hardware and the OS Boston University hardware-and-the-os Performance Management for Serverless Computing Serverless computing provides developers the freedom to build and deploy applications without worrying about infrastructure. Resources (memory, cpu, location) specified … Cloud-DS Boston University cloud-ds