SAFARI Research Group at ETH Zurich and Carnegie Mellon University

115 repositories

Hermes
Public
A speculative mechanism to accelerate long-latency off-chip load requests by removing on-chip cache access latency from their critical path, as described in the MICRO 2022 paper by Bera et al. (https://arxiv.org/pdf/2209.00188.pdf).
machine-learning cache perceptron computer-architecture microarchitecture perceptron-learning-algorithm prefetching
C++
•
MIT License
•13•74•0•0•Updated Sep 10, 2025
RawBench
Public
Shell
•0•2•0•0•Updated Sep 9, 2025
Chronus
Public
C++
•0•3•0•0•Updated Sep 9, 2025
Virtuoso-Workshop-MICRO25
Public
HTML
•0•0•0•0•Updated Sep 7, 2025
MQSim
Public
MQSim is a fast & accurate simulator for modern multi-queue (MQ) and SATA SSDs. MQSim faithfully models new high-bandwidth protocol implementations, steady-state SSD conditions, and full end-to-end latency of requests in modern SSDs. Described in detail in the FAST 2018 paper: http://usenix.org/system/files/conference/fast18/fast18-tavakkol.pdf
C++
•
MIT License
•160•324•23•5•Updated Aug 25, 2025
RawHash
Public
RawHash can accurately and efficiently map raw nanopore signals to reference genomes of varying sizes (e.g., from viral to human genomes) in real time without basecalling. Described by Firtina et al. (published at https://academic.oup.com/bioinformatics/article/39/Supplement_1/i297/7210440).
bioinformatics nanopore seeding segmentation event-detection genome-analysis hash-tables contamination read-mapping relative-abundances
C
•
GNU General Public License v3.0
•8•57•3•1•Updated Aug 17, 2025
DRAM-Bender
Public
DRAM Bender is the first open-source DRAM testing infrastructure that can be used to easily and comprehensively test state-of-the-art HBM2 chips and DDR4 modules of different form factors. Six prototypes are available on different FPGA boards. Described in our preprint: https://arxiv.org/pdf/2211.05838.pdf
testing fpga dram rowhammer
VHDL
•
MIT License
•16•92•0•1•Updated Aug 10, 2025
ramulator2
Public
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and evaluation of new memory system designs (e.g., new DRAM standards, emerging RowHammer mitigation techniques). Described in our paper https://people.inf.ethz.ch/omutlu/pub/Ramulator2_arxiv23.pdf
simulation memory dram
C++
•
MIT License
•101•385•45•10•Updated Jul 24, 2025
EasyDRAM
Public
EasyDRAM is an FPGA-based framework for rapid and accurate end-to-end evaluation of DRAM techniques on real DRAM chips. Described in our DSN 2025 paper: https://arxiv.org/abs/2506.10441
Verilog
•0•4•0•0•Updated Jun 23, 2025
ReadDisturbanceVTS25
Public
Data and code for the VTS 2025 paper "Revisiting DRAM Read Disturbance: Identifying Inconsistencies Between Experimental Characterization and Device-Level Studies": https://www.arxiv.org/pdf/2503.16749
C++
•0•2•0•0•Updated May 9, 2025
Virtuoso
Public
Virtuoso is a fast, accurate, and versatile simulation framework designed for virtual memory research. Virtuoso uses a new simulation methodology for estimating OS overheads and models diverse VM designs, incorporating state-of-the-art TLB techniques, page table structures, etc. More details in our ASPLOS 2025 paper: https://arxiv.org/pdf/2403.04635
C++
•14•71•4•0•Updated May 8, 2025
PIM-TC
Public
PIM-TC implements a distributed Triangle Counting (TC) algorithm specifically designed for and evaluated on the UPMEM Processing-in-Memory (PIM) architecture. Described in our paper https://arxiv.org/abs/2505.04269.
C
•
MIT License
•0•3•0•0•Updated May 8, 2025
PyGim
Public
PyGim is the first runtime framework to efficiently execute Graph Neural Networks (GNNs) on real Processing-in-Memory systems. It provides a high-level Python interface, currently integrated with PyTorch, and supports various GNN models and real-world input graphs. Described in the SIGMETRICS'25 paper by Giannoula et al. (https://arxiv.org/pdf/2402.16731).
C
•1•29•0•0•Updated Apr 23, 2025
IMPACT
Public
IMPACT is a new framework that leverages Processing-in-Memory (PiM) to amplify data leakage in main memory-based timing attacks. More details: https://arxiv.org/abs/2404.11284
C++
•
MIT License
•0•2•0•0•Updated Apr 22, 2025
PIMDAL
Public
PIMDAL (PIM Data Analytics Library) is an implementation of DB operators and 5 TPC-H queries on the UPMEM PIM system. Additionally, we provide code to generate the TPC-H data and reference implementations on the CPU and GPU. Described in our arXiv paper: https://arxiv.org/abs/2504.01948
C++
•
MIT License
•0•3•1•0•Updated Mar 31, 2025
Pythia
Public
A customizable hardware prefetching framework using online reinforcement learning as described in the MICRO 2021 paper by Bera et al. (https://arxiv.org/pdf/2109.12021.pdf).
machine-learning reinforcement-learning computer-architecture prefetcher microarchitecture cache-replacement branch-predictor champsim-simulator champsim-tracer
C++
•
MIT License
•46•149•1•0•Updated Mar 25, 2025
PaCRAM
Public
PaCRAM is a technique that reduces the performance and energy overheads of existing RowHammer mitigation mechanisms by carefully reducing the latency of the preventive refreshes they issue, without compromising system security. Described in the HPCA 2025 paper: https://arxiv.org/abs/2502.11745
C++
•0•3•0•0•Updated Feb 26, 2025
Ariadne
Public
Ariadne is a new compressed swap scheme for mobile devices that reduces application relaunch latency and CPU usage while increasing the number of live applications for enhanced user experience. Described in the HPCA 2025 paper by Liang et al.: https://arxiv.org/pdf/2502.12826
C
•
MIT License
•1•6•0•0•Updated Feb 19, 2025
MIMDRAM
Public
Source code for the architectural simulator used for modeling the PUD system proposed in our HPCA 2024 paper "MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Processing". The paper is available at: https://arxiv.org/pdf/2402.19080.pdf
C++
•
Other
•8•26•3•0•Updated Jan 15, 2025
pim-ml
Public
PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-world processing-in-memory (PIM) architecture. Described in the ISPASS 2023 paper by Gomez-Luna et al. (https://arxiv.org/pdf/2207.07886.pdf).
C
•
MIT License
•6•24•0•0•Updated Jan 7, 2025
MegIS
Public
MegIS is the first in-storage processing system designed to significantly reduce the data movement overhead of the end-to-end metagenomic analysis pipeline. Described in the ISCA 2024 paper by Mansouri Ghiasi et al.: https://arxiv.org/pdf/2406.19113
ftl metagenomics ssd computer-architecture hardware-acceleration taxonomic-classification near-data-processing in-storage-processing
Python
•
GNU General Public License v3.0
•0•7•0•0•Updated Dec 1, 2024
BreakHammer
Public
BreakHammer is a technique that reduces the performance overhead of RowHammer mitigation mechanisms by carefully reducing the number of performed RowHammer-preventive actions without compromising system robustness. Described in the MICRO 2024 paper: https://arxiv.org/abs/2404.13477.
C++
•1•7•1•0•Updated Nov 25, 2024
PIM-Opt
Public
Source code & scripts for distributed machine learning training workloads on a real-world Processing-In-Memory system (i.e., UPMEM). Described in our PACT'24 paper by Rhyner et al. at https://arxiv.org/pdf/2404.07164v2
C
•
MIT License
•1•5•0•0•Updated Oct 5, 2024
Genome-on-Diet
Public
Genome-on-Diet is a fast and memory-frugal framework for exemplifying sparsified genomics for read mapping, containment search, and metagenomic profiling. It is much faster & more memory-efficient than minimap2 for Illumina, HiFi, and ONT reads. Described by Alser et al. (preliminary version: https://arxiv.org/abs/2211.08157).
metagenomics variant-calling containment-search genome-analysis large-scale sequence-alignment read-mapping metagenomic-analysis microbiome-analysis minimap2
Roff
•
MIT License
•3•14•1•0•Updated Sep 4, 2024
Sectored-DRAM
Public
A new DRAM substrate that mitigates, at low cost, the excessive energy consumption from both (i) transmitting unused data on the memory channel and (ii) activating a disproportionately large number of DRAM cells. Described in our paper: https://arxiv.org/pdf/2207.13795
C++
•0•12•0•0•Updated Aug 23, 2024
rawasm
Public
Rawasm is a patch to the popular miniasm tool. It enables the construction of genome assemblies from raw nanopore signals.
C
•
GNU General Public License v3.0
•0•4•0•0•Updated Jul 8, 2024
Load-Inspector
Public
A binary instrumentation tool to analyze load instructions in any off-the-shelf x86(-64) program. Described by Bera et al. in https://arxiv.org/pdf/2406.18786
compiler x86-64 emulation x86 microarchitecture intel-sde binary-instrumentation
C++
•
MIT License
•3•21•0•0•Updated Jun 30, 2024
AirLift
Public
AirLift is a tool that updates mapped reads from one reference genome to another. Unlike existing tools, it accounts for regions not shared between the two reference genomes and enables remapping across all parts of the references. Described by Kim et al. (preliminary version at http://arxiv.org/abs/1912.08735).
C
•4•27•5•1•Updated May 23, 2024
SiMRA-DRAM
Public
Source code & scripts for experimental characterization and demonstration of 1) simultaneous many-row activation, 2) up to nine-input majority operations, and 3) copying one row's content to up to 31 rows in real DDR4 DRAM chips. Described in our DSN'24 paper by Yuksel et al. at https://arxiv.org/abs/2405.06081
VHDL
•
Other
•2•11•0•0•Updated May 17, 2024
HBM-Read-Disturbance
Public
Detailed read disturbance (RowHammer and RowPress) characterization of six real HBM2 DRAM chips yielding 23 new observations and 8 new takeaways, as described in the DSN'24 paper https://arxiv.org/pdf/2310.14665.pdf
Jupyter Notebook
•
MIT License
•0•9•0•0•Updated May 3, 2024