2026

PathSteiner: Improving PathFinder with Quasi-Optimal Steiner-Tree Initialization

S. Shrivastava; L. Kurešević; A. Poupakis; C. Ravishankar; D. Gaitonde et al.

2026.

Out with LSQs: Custom Circuits for Memory Access Reordering in Dynamic HLS

R. Pirayadi; A. Elakhras; M. Stojilovic; P. Ienne

2026. 34th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, Monterey, CA, USA, 2026-02-22 - 2026-02-24. p. 92 - 102. DOI : 10.1145/3748173.3779204.

Detailed record

2025

Out with LSQs: Custom Circuits for Memory Access Reordering in Dynamic HLS

R. Pirayadi; A. Elakhras; P. Ienne; M. Stojilovic

2025.

Detailed record

FRESCO: Efficient Subgraph Enumeration for Scalable Clustering in Heterogeneous CGRAs

L. Coulon; A. Ragab; J. Anderson; M. Stojilovic; P. Ienne

2025. 2025 IEEE/ACM International Conference on Computer Aided Design, Munich, Germany, 2025-10-26 - 2025-10-30.

Detailed record

A Low-latency On-chip Cache Hierarchy for Load-to-use Stall Reduction in GPUs

N. (. (Nematollahi zadeh) Mahani; H. Falahati; S. Darabi; A. Javadi-Nezhad; Y. Oh et al.

ACM Transactions on Architecture and Code Optimization. 2025. DOI : 10.1145/3760782.

Detailed record

Single-Address-Space FaaS with Jord

Y. Li; A. Bhattacharyya; M. Kumar; A. Bhattacharjee; Yoav Etsion et al.

2025. The 52nd Annual International Symposium on Computer Architecture, Tokyo, Japan, 2025-06-21 - 2025-06-25. p. 694 - 707. DOI : 10.1145/3695053.3731108.

Detailed record

QFlex 3.0: Fast and Accurate ARM Server Simulation

S. Lin; A. Ansari; A. Chakraborty; B. Eryilmaz; Y. Li et al.

2025. ARM-based General-Purpose Computing: Software-Hardware Co-Optimization for Performance Acceleration, Tokyo, Japan, 2025-06-21.

Detailed record

Avant-Garde: Empowering GPUs with Scaled Numeric Formats

M. Gil; D. Ha; S. B. Harma; M. K. Yoon; B. Falsafi et al.

2025. The 52nd Annual International Symposium on Computer Architecture, Tokyo, Japan, 2025-06-21 - 2025-06-25. p. 153 - 165. DOI : 10.1145/3695053.3731100.

Detailed record

Constrained bit allocation for neural networks

S. Boudouh; S. B. Harma; A. Mahmoud; B. Falsafi

2025. Machine Learning for Computer Architecture and Systems 2025, Tokyo, Japan, 2025-06-21.

Detailed record

Guaranteed Yet Hard to Find: Uncovering FPGA Routing Convergence Paradox

S. Shrivastava; S. Tanaka; S. Nikolic; C. Ravishankar; D. Gaitonde et al.

2025.

Detailed record

ROBoost: A Study of FPGA Logic-Based Power-Wasting Primitives

D. G. A. S. Mahmoud; S. Andreani; V. Lenders; M. Stojilovic

2025. The 21st International Symposium on Applied Reconfigurable Computing, Sevilla, Spain, 2025-04-09 - 2025-04-11.

Detailed record

Rethinking IOMMU for Future IO Devices

M. Kumar; Y. Li; Y. Etsion; A. Bhattacharjee; A. Basu et al.

2025. 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Rotterdam, The Netherlands, 2025-03-30 - 2025-04-03.

Detailed record

FRIDA: Reconfigurable Arrays for Dynamically Scheduled High-Level Synthesis

L. Coulon; L. Ramirez; J. Anderson; M. Stojilovic; P. Ienne

2025. ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA 2025), Monterey, California, USA, 2025-02-27 - 2025-03-01. p. 147 - 158. DOI : 10.1145/3706628.3708880.

Detailed record

Effective Interplay Between Sparsity and Quantization: from Theory to Practice

S. B. Harma; A. Chakraborty; E. Kostenok; D. Mishin; D. Ha et al.

2025. The Thirteenth International Conference on Learning Representations, Singapore, 2025-04-24 - 2025-04-28.

Detailed record

ROBoost: A Study of FPGA Logic-Based Power-Wasting Primitives. Artifacts

D. G. A. S. Mahmoud; S. Andreani; V. Lenders; M. Stojilovic

2025.

Detailed record

Guaranteed Yet Hard to Find: Uncovering FPGA Routing Convergence Paradox

S. Shrivastava; S. Nicolic; S. Tanaka; C. Ravishankar; D. Gaitonde et al.

2025. 33rd IEEE International Symposium on Field-Programmable Custom Computing Machines, Fayetteville, Arkansas, USA, 2025-05-04 - 2025-05-07. p. 143 - 151. DOI : 10.1109/FCCM62733.2025.00060.

Detailed record

2024

Parallel FPGA Routing with On-the-Fly Net Decomposition

F. Kos; M. Stojilovic; V. Betz

2024. The 23rd International Conference on Field-Programmable Technology, Sydney, Australia, 2024-12-10 - 2024-12-12. p. 1 - 9. DOI : 10.1109/ICFPT64416.2024.11113423.

Detailed record

MultiQueue-Based FPGA Routing: Relaxed A* Priority Ordering for Improved Parallelism

A. Singer; H. Yan; G. Zhang; M. Jeffrey; M. Stojilovic et al.

2024. The 23rd International Conference on Field-Programmable Technology, Sydney, Australia, 2024-12-10 - 2024-12-12. p. 1 - 9. DOI : 10.1109/ICFPT64416.2024.11113444.

Detailed record

UrbanTwin: An urban digital twin for climate action

D.-A. Constantinescu; V. Kartsch; Y. Nakatsuka; P. Wiese; P. Orbanovik et al.

EcoCloud Annual Event on IT Sustainability 2024, Lausanne, Switzerland, 2024-10-08.

Detailed record

Silicon Efficiency in Post-Moore Servers

A. Ansari; S. Lin; A. Chakraborty; M. Alian; B. Eryilmaz et al.

2024. Workshop on Hot Topics in Ethical Computer Systems, San Diego, California, USA, 2024-04-28.

Detailed record

X-Attack 2.0: The Risk of Power Wasters and Satisfiability Don’t-Care Hardware Trojans to Shared Cloud FPGAs

D. G. Mahmoud; B. Shokry; V. Lenders; W. Hu; M. Stojilović

IEEE Access. 2024. p. 1 - 1. DOI : 10.1109/ACCESS.2024.3353134.

Detailed record

Server Architecture from Enterprise to Post-Moore

B. Falsafi; M. Ferdman; B. Grot

IEEE Micro. 2024. Vol. 44, num. 5, p. 65 - 73. DOI : 10.1109/MM.2024.3418975.

Detailed record

Electrical-Level Fault-Injection Attacks on FPGA-Based Systems

D. G. A. S. Mahmoud / B. Falsafi; M. Stojilovic (Dir.)

Lausanne, EPFL, 2024. p. 227. DOI : 10.5075/epfl-thesis-10315.

Detailed record

2023

Practical Implementations of Remote Power Side-Channel and Fault-Injection Attacks on Multitenant FPGAs

D. G. Mahmoud; O. Glamočanin; F. Regazzoni; M. Stojilović

Security of FPGA-Accelerated Cloud Computing Environments; Springer, Cham, 2023. p. 101 - 135. - 978-3-031-45394-6.

DOI : 10.1007/978-3-031-45395-3_5.

Detailed record

GRAMM: Fast CGRA Application Mapping Based on A Heuristic for Finding Graph Minors

G. Zhou; M. Stojilovic; J. H. Anderson

2023. 33rd International Conference on Field-Programmable Logic and Applications (FPL), Gothenburg, SWEDEN, SEP 04-08, 2023. p. 305 - 310. DOI : 10.1109/FPL60245.2023.00052.

Detailed record

IIBLAST: Speeding Up Commercial FPGA Routing by Decoupling and Mitigating the Intra-CLB Bottleneck

S. Shrivastava; S. Nikolic; C. Ravishankar; D. Gaitonde; M. Stojilovic

2023. IEEE/ACM International Conference on Computer-Aided Design (IEEE/ACM ICCAD 2023), San Francisco, CA, USA, 2023-10-29 - 2023-11-02. DOI : 10.1109/ICCAD57390.2023.10323897.

Detailed record

What's Missing in Agile Hardware Design? Verification!

B. Falsafi

Journal Of Computer Science And Technology. 2023. Vol. 38, num. 4, p. 735 - 736. DOI : 10.1007/s11390-023-0005-3.

Detailed record

Scale-out Systolic Arrays

A. C. Yuzuguler; C. Sonmez; M. Drumond; Y. Oh; B. Falsafi et al.

Acm Transactions On Architecture And Code Optimization. 2023. Vol. 20, num. 2, p. 27. DOI : 10.1145/3572917.

Detailed record

Temperature Impact on Remote Power Side-Channel Attacks on Shared FPGAs

O. Glamocanin; H. Bazaz; M. Payer; M. Stojilovic

2023. Design, Automation and Test in Europe Conference DATE 2023, Antwerp, Belgium, April 17-19, 2023. DOI : 10.23919/DATE56975.2023.10136979.

Detailed record

The Side-channel Metrics Cheat Sheet

K. Papagiannopoulos; O. Glamočanin; M. Azouaoui; D. Ros; F. Regazzoni et al.

ACM Computing Surveys. 2023. Vol. 55, num. 10, p. 1 - 38. DOI : 10.1145/3565571.

Detailed record

A Visionary Look at the Security of Reconfigurable Cloud Computing

M. Stojilović; K. Rasmussen; F. Regazzoni; M. B. Tahoori; R. Tessier

Proceedings of the IEEE. 2023. p. 1 - 24. DOI : 10.1109/JPROC.2023.3330729.

Detailed record

AstriFlash: A Flash-Based System for Online Services

S. Gupta; Y. Oh; L. Yan; M. J. Sutherland; A. Bhattacharjee et al.

2023. The 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, QC, Canada, Feb 25 – March 01, 2023. DOI : 10.1109/HPCA56546.2023.10070955.

Detailed record

Evaluating, Exploiting, and Hiding Power Side-Channel Leakage of Remote FPGAs

O. Glamocanin / B. Falsafi; M. Stojilovic (Dir.)

Lausanne, EPFL, 2023. p. 249. DOI : 10.5075/epfl-thesis-9918.

Detailed record

Imprecise Store Exceptions

S. Gupta; Y. Li; Q. Kang; A. Bhattacharjee; B. Falsafi et al.

2023. The 50th Annual International Symposium on Computer Architecture (ISCA ’23), Orlando, FL, USA, June 17–21, 2023. DOI : 10.1145/3579371.3589087.

Detailed record

RDS: FPGA Routing Delay Sensors for Effective Remote Power Analysis Attacks

D. Spielmann; O. Glamočanin; M. Stojilović

IACR Transactions on Cryptographic Hardware and Embedded Systems. 2023. Vol. 2023, num. 2, p. 543 - 567. DOI : 10.46586/tches.v2023.i2.543-567.

Detailed record

Active Wire Fences for Multitenant FPGAs

O. Glamocanin; A. Kostic; S. Kostic; M. Stojilovic

2023. 26th International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS), Tallinn, Estonia, May 3-5, 2023. p. 13 - 20. DOI : 10.1109/DDECS57882.2023.10138941.

Detailed record

IIBLAST: Speeding Up Commercial FPGA Routing by Decoupling and Mitigating the Intra-CLB Bottleneck

S. Shrivastava; S. Nikolic; C. Ravishankar; D. Gaitonde; M. Stojilovic

2023.

Detailed record

SecureCells: A Secure Compartmentalized Architecture

A. Bhattacharyya; F. Hofhammer; Y. Li; S. Gupta; A. Sánchez Marín et al.

2023. 44th IEEE Symposium on Security and Privacy, San Francisco, USA, May 22-24, 2023. p. 2921 - 2939. DOI : 10.1109/SP46215.2023.00125.

Detailed record

Cooperative Concurrency Control for Write-Intensive Key-Value Workloads

M. J. Sutherland; B. Falsafi; A. Daglis

2023. The 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'23), Vancouver, BC, Canada, March 25–29, 2023. p. 30 - 46. DOI : 10.1145/3567955.3567957.

Detailed record

Instruction-Level Power Side-Channel Leakage Evaluation of Soft-Core CPUs on Shared FPGAs

O. Glamočanin; S. Shrivastava; J. Yao; N. Ardo; M. Payer et al.

Journal of Hardware and Systems Security. 2023. DOI : 10.1007/s41635-023-00135-1.

Detailed record

Rebooting Virtual Memory with Midgard

S. Gupta / B. Falsafi; A. Bhattacharjee (Dir.)

Lausanne, EPFL, 2023. p. 178. DOI : 10.5075/epfl-thesis-8864.

Detailed record

2022

DFAulted: Analyzing and Exploiting CPU Software Faults Caused by FPGA-Driven Undervolting Attacks

D. G. A. S. Mahmoud; D. Dervishi; S. Hussein; V. Lenders; M. Stojilovic

IEEE Access. 2022. Vol. 10, p. 134199 - 134216. DOI : 10.1109/ACCESS.2022.3231753.

Detailed record

A Deep-Learning Approach to Side-Channel Based CPU Disassembly at Design Time

H. Fendri; M. Macchetti; J. Perrine; M. Stojilovic

2022. 25th Design, Automation and Test in Europe Conference and Exhibition (DATE), Antwerp, Belgium [Virtual], March 14-23, 2022. p. 670 - 675. DOI : 10.23919/DATE54114.2022.9774531.

Detailed record

FPGA-to-CPU Undervolting Attacks

D. G. A. S. Mahmoud; S. Hussein; V. Lenders; M. Stojilovic

2022. 25th Design, Automation and Test in Europe, Antwerp, Belgium [Virtual], March 14-23, 2022. p. 999 - 1004. DOI : 10.23919/DATE54114.2022.9774663.

Detailed record

Electrical-Level Attacks on CPUs, FPGAs, and GPUs: Survey and Implications in the Heterogeneous Era

D. G. Mahmoud; V. Lenders; M. Stojilović

ACM Computing Surveys. 2022. Vol. 55, num. 3, p. 1 - 40. DOI : 10.1145/3498337.

Detailed record

Deep Learning Detection of GPS Spoofing

O. Jullian; B. Otero; M. Stojilović; J. J. Costa; J. Verdú et al.

2022. 7th International Conference Machine Learning, Optimization, and Data Science (LOD 2021), Grasmere, UK, October 4-8, 2021. p. 527 - 540. DOI : 10.1007/978-3-030-95467-3_38.

Detailed record

Hardware and Software Support for RPC-Centric Server Architecture

M. J. Sutherland / B. Falsafi; A. Daglis (Dir.)

Lausanne, EPFL, 2022. p. 256. DOI : 10.5075/epfl-thesis-8017.

Detailed record

2021

Cerebros: Evading the RPC Tax in Datacenters

A. Pourhabibi Zarandi; M. J. Sutherland; A. Daglis; B. Falsafi

2021. MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture, Virtual Event, Greece, October 18–22, 2021. p. 407 - 420. DOI : 10.1145/3466752.3480055.

Detailed record

Equinox: Training (for Free) on a Custom Inference Accelerator

M. P. Drumond Lages De Oliveira; L. Coulon; A. Pourhabibi Zarandi; A. C. Yüzügüler; B. Falsafi et al.

2021. 54th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO’21), Virtual Event, Greece, October 18–22, 2021. DOI : 10.1145/3466752.3480057.

Detailed record

Runtime Replacement of Machine Learning Modules in Fpga-based Systems

D. G. Mahmoud; B. Shokry; A. ElRefaey; H. H. Amer; I. Adly

2021. 10th Mediterranean Conference on Embedded Computing, Budva, Montenegro, 2021-06-07 - 2021-06-10. p. 340 - 343. DOI : 10.1109/MECO52532.2021.9460192.

Detailed record

NetCracker: A Peek into the Routing Architecture of Xilinx 7-Series FPGAs

M. B. Petersen; S. Nikolic; M. Stojilovic

2021. International Symposium on Field-Programmable Gate Arrays, Virtual Conference, February 28 - March 2, 2021. DOI : 10.1145/3431920.3439285.

Detailed record

Shared FPGAs and the Holy Grail: Protections against Side-Channel and Fault Attacks

O. Glamocanin; D. Mahmoud; F. Regazzoni; M. Stojilovic

2021. DATE 2021 Design, Automation and Test in Europe, Virtual, February 1-5, 2021. p. 1645 - 1650. DOI : 10.23919/DATE51398.2021.9473947.

Detailed record

Rebooting Virtual Memory with Midgard

S. Gupta; A. Bhattacharyya; Y. Oh; A. Bhattacharjee; B. Falsafi et al.

2021. ISCA 2021 48th International Symposium on Computer Architecture, Online conference, June 14-19, 2021. DOI : 10.1109/ISCA52012.2021.00047.

Detailed record

Improving First-Order Threshold Implementations of SKINNY

A. F. Caforio; D. P. Collins; S. Banik; O. Glamocanin

2021. 22nd International Conference on Cryptology in India (INDOCRYPT21), Remote, December 12-15, 2021. p. 246 - 267. DOI : 10.1007/978-3-030-92518-5_1.

Detailed record

Data transformer apparatus

A. Pourhabibi Zarandi; S. Gupta; H. Kassir; M. Sutherland; Z. Tian et al.

US11748254 ; US2022327048 ; WO2021037341 . 2021.

Detailed record

Hardware-Software Co-Design of an RPC Processor

A. Pourhabibi Zarandi / B. Falsafi (Dir.)

Lausanne, EPFL, 2021. p. 146. DOI : 10.5075/epfl-thesis-7217.

Detailed record

Shrinking FPGA Static Power via Machine Learning-Based Power Gating and Enhanced Routing

Z. Seifoori; H. Asadi; M. Stojilovic

IEEE Access. 2021. Vol. 9, p. 115599 - 115619. DOI : 10.1109/ACCESS.2021.3085005.

Detailed record

2020

Nonintrusive and Adaptive Monitoring for Locating Voltage Attacks in Virtualized FPGAs

S. S. Mirzargar; G. Renault; A. Guerrieri; M. Stojilovic

2020. 19th International Conference on Field-Programmable Technology (ICFPT), Maui, HI, USA (Virtual conference), December 7-11, 2020. p. 288 - 289. DOI : 10.1109/ICFPT51103.2020.00050.

Detailed record

Exploiting Errors for Efficiency: A Survey from Circuits to Applications

P. Stanley-Marbell; A. Alaghi; M. Carbin; E. Darulova; L. Dolecek et al.

ACM Computing Surveys. 2020. Vol. 53, num. 3, p. 51. DOI : 10.1145/3394898.

Detailed record

Are Cloud FPGAs Really Vulnerable to Power Analysis Attacks?

O. Glamocanin; L. Coulon; F. Regazzoni; M. Stojilovic

2020. Design, Automation and Test in Europe (DATE), Grenoble, France, March 9-13, 2020. p. 1007 - 1010. DOI : 10.23919/DATE48585.2020.9116481.

Detailed record

Built-in Self-Evaluation of First-Order Power Side-Channel Leakage for FPGAs

O. Glamocanin; L. Coulon; F. Regazzoni; M. Stojilovic

2020. 28th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA 2020), Seaside, California, USA, February 23-25, 2020. DOI : 10.1145/3373087.3375318.

Detailed record

Closing Leaks: Routing Against Crosstalk Side-Channel Attacks

Z. Seifoori; S. S. Mirzargar; M. Stojilovic

2020. 28th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA 2020), Seaside, California, USA, February 23-25, 2020. DOI : 10.1145/3373087.3375319.

Detailed record

A Shared-Memory Parallel Implementation of the RePlAce Global Cell Placer

F. Gessler; P. Brisk; M. Stojilovic

2020. 33rd International Conference on VLSI Design and 19th International Conference on Embedded Systems (VLSID), Bangalore, India, January 4-8, 2020. DOI : 10.1109/VLSID49098.2020.00031.

Detailed record

Optimus Prime: Accelerating Data Transformation in Servers

A. Pourhabibi Zarandi; S. Gupta; H. Kassir; M. J. Sutherland; Z. Tian et al.

2020. Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, March 16–20, 2020. p. 1203 - 1216. DOI : 10.1145/3373376.3378501.

Detailed record

SPARTA: A Divide and Conquer Approach to Address Translation for Accelerators

J. Picorel; S. A. S. Kohroudi; Z. Yan; A. Bhattacharjee; B. Falsafi et al.

2020

Detailed record

The NEBULA RPC-Optimized Architecture

M. Sutherland; S. Gupta; B. Falsafi; V. Marathe; D. Pnevmatikatos et al.

2020. 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA), Valencia, Spain, May, 30th - June, 3rd 2020. p. 199 - 212. DOI : 10.1109/ISCA45697.2020.00027.

Detailed record

X-Attack: Remote Activation of Satisfiability Don’t-Care Hardware Trojans on Shared FPGAs

D. Mahmoud; W. Hu; M. Stojilovic

2020. 30th International Conference on Field-Programmable Logic and Applications (FPL), ELECTR NETWORK, August 31 - September 4, 2020. p. 185 - 192. DOI : 10.1109/FPL50879.2020.00039.

Detailed record

ColTraIn: Co-located DNN training and inference

M. P. Drumond Lages De Oliveira / B. Falsafi; M. Jaggi (Dir.)

Lausanne, EPFL, 2020. p. 115. DOI : 10.5075/epfl-thesis-10265.

Detailed record

2019

A machine learning approach for power gating the FPGA routing network

S. Zeinab; H. Asadi; M. Stojilovic

2019. 2019 International Conference on Field-Programmable Technology (ICFPT), Tianjin, China, December 9-13, 2019. p. 10 - 18. DOI : 10.1109/ICFPT47387.2019.00010.

Detailed record

Distributed Logless Atomic Durability with Persistent Memory

S. Gupta; A. Daglis; B. Falsafi

2019. The 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-52), Columbus, OH, USA, October 12–16, 2019. DOI : 10.1145/3352460.3358321.

Detailed record

Physical Side-Channel Attacks and Covert Communication on FPGAs: A Survey

S. S. Mirzargar; M. Stojilovic

2019. 29th International Conference on Field Programmable Logic and Applications (FPL), Barcelona, Spain, September 9 - 13, 2019. DOI : 10.1109/FPL.2019.00039.

Detailed record

FPGA-Assisted Deterministic Routing for FPGAs

D. Korolija; M. Stojilovic

2019. 2019 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Rio de Janeiro, Brasil, May 20-24, 2019. p. 155 - 162. DOI : 10.1109/IPDPSW.2019.00034.

Detailed record

RPCValet: NI-Driven Tail-Aware Balancing of µs-Scale RPCs

A. Daglis; M. Sutherland; B. Falsafi

2019. Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS '19, Providence, Rhode Island, USA, April 13-17, 2019. p. 35 - 48. DOI : 10.1145/3297858.3304070.

Detailed record

Mitigating Load Imbalance in Distributed Data Serving with Rack-Scale Memory Pooling

S. Novakovic; A. Daglis; D. Ustiugov; E. Bugnion; B. Falsafi et al.

ACM Transactions on Computer Systems. 2019. Vol. 36, num. 2, p. 1 - 37. DOI : 10.1145/3309986.

Detailed record

Timing Violation Induced Faults in Multi-Tenant FPGAs

D. Mahmoud; M. Stojilovic

2019. Design, Automation & Test in Europe Conference & Exhibition (DATE), Florence, ITALY, Mar 25-29, 2019. p. 1745 - 1750. DOI : 10.23919/DATE.2019.8715263.

Detailed record

Linebacker: Preserving Victim Cache Lines in Idle Register Files of GPUs

Y. Oh; G. Koo; M. Annavaram; W. W. Ro

2019. 46th International Symposium on Computer Architecture (ISCA), Phoenix, AZ, Jun 22-26, 2019. p. 183 - 196. DOI : 10.1145/3307650.3322222.

Detailed record

Analog Neural Networks with Deep-submicron Nonlinear Synapses

A. C. Yüzügüler; F. Çelik; M. P. Drumond Lages De Oliveira; B. Falsafi; P. Frossard

IEEE Micro. 2019. Vol. 39, num. 5, p. 55 - 63. DOI : 10.1109/MM.2019.2931182.

Detailed record

SMoTherSpectre: Exploiting Speculative Execution through Port Contention

A. Bhattacharyya; A. Sandulescu; M. Neugschwandtner; A. Sorniotti; B. Falsafi et al.

2019. The 26th ACM Conference on Computer and Communications Security - ACM CSS 2019, London, UK, November 11-15, 2019. p. 785 - 800. DOI : 10.1145/3319535.3363194.

Detailed record

Stretch: Balancing QoS and Throughput for Colocated Server Workloads on SMT Cores

A. Margaritov; S. Gupta; R. Gonzalez-Alberquilla; B. Grot

2019. 25th IEEE International Symposium on High Performance Computer Architecture (HPCA), Washington, DC, Feb 16-20, 2019. p. 15 - 27. DOI : 10.1109/HPCA.2019.00024.

Detailed record

2018

Design Guidelines for High-Performance SCM Hierarchies

D. Ustiugov; A. Daglis; J. Picorel Obando; M. J. Sutherland; E. Bugnion et al.

2018. 4th International Symposium on Memory Systems (MEMSYS), Old Town Alexandria, VA, USA, October 1-4, 2018. DOI : 10.1145/3240302.3240310.

Detailed record

Deterministic Parallel Routing for FPGAs based on Galois Parallel Execution Model

Y. Moctar; M. Stojilovic; P. Brisk

2018. 28th International Conference on Field Programmable Logic and Applications (FPL), Dublin, IRELAND, Aug 26-31, 2018. p. 21 - 25. DOI : 10.1109/FPL.2018.00011.

Detailed record

Towards Commoditizing Simulations of System Models Using Recurrent Neural Networks

A. C. Yuzuguler; A. Moga; C. Franke

2018. IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Aalborg, DENMARK, Oct 29-31, 2018. DOI : 10.1109/SmartGridComm.2018.8587599.

Detailed record

Atomic object reads for in-memory rack-scale computing

A. Daglis; B. R. Grot; B. Falsafi

US10929174 ; US2018173673 . 2018.

Detailed record

Training DNNs with Hybrid Block Floating Point

M. Drumond; T. Lin; M. Jaggi; B. Falsafi

2018. NeurIPS 2018 - 32nd Conference on Neural Information Processing Systems, Montreal, CANADA, Dec 02-08, 2018.

Detailed record

LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching

M. Sadrosadati; A. Mirhosseini; S. B. Ehsani; H. Sarbazi-Azad; M. Drumond et al.

2018. Twenty-Third International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS '18, Williamsburg, VA, USA, March 24th – March 28th, 2018. p. 489 - 502. DOI : 10.1145/3173162.3173211.

Detailed record

Network-Compute Co-Design for Distributed In-Memory Computing

A. Daglis / B. Falsafi; E. Bugnion (Dir.)

Lausanne, EPFL, 2018. p. 230. DOI : 10.5075/epfl-thesis-8749.

Detailed record

2017

Near-Memory Address Translation

J. Picorel; D. Jevdjic; B. Falsafi

2017. 26th International Conference on Parallel Architectures and Compilation Techniques (PACT), Portland, OR, SEP 09-13, 2017. p. 303 - 317. DOI : 10.1109/Pact.2017.56.

Detailed record

Near-Memory Address Translation

J. Picorel Obando / B. Falsafi (Dir.)

Lausanne, EPFL, 2017. p. 134. DOI : 10.5075/epfl-thesis-7875.

Detailed record

Unified prefetching into instruction cache and branch target buffer

B. Falsafi; I. C. Kaynak; B. R. Grot

US9996358 ; US2017090935 . 2017.

Detailed record

The Mondrian Data Engine

M. P. Drumond Lages De Oliveira; A. Daglis; N. Mirzadeh; D. Ustiugov; J. Picorel Obando et al.

2017. The 44th International Symposium on Computer Architecture, Toronto, ON, Canada, June 24-28, 2017. DOI : 10.1145/3079856.3080233.

Detailed record

Parallel FPGA routing: Survey and challenges

M. Stojilovic

2017. 2017 27th International Conference on Field Programmable Logic and Applications (FPL), Ghent, Belgium, September 4-8, 2017. p. 1 - 8. DOI : 10.23919/FPL.2017.8056782.

Detailed record

Fat Caches For Scale-Out Servers

S. Volos; D. Jevdjic; B. Falsafi; B. Grot

Ieee Micro. 2017. Vol. 37, num. 2, p. 90 - 103. DOI : 10.1109/MM.2017.32.

Detailed record

FPGAs versus GPUs in Data centers

B. Falsafi; B. Dally; D. Singh; D. Chiou; J. J. Yi et al.

IEEE Micro. 2017. Vol. 37, num. 1, p. 60 - 72. DOI : 10.1109/MM.2017.19.

Detailed record

2016

Unlocking Energy

B. Falsafi; R. Guerraoui; J. Picorel Obando; V. Trigonakis

2016. 2016 USENIX Annual Technical Conference, Denver, Colorado, USA, June 22-24, 2016. p. 393 - 406.

Detailed record

SABRes: Atomic Object Reads for In-Memory Rack-Scale Computing

A. Daglis; D. Ustiugov; S. Novakovic; E. Bugnion; B. Falsafi et al.

2016. 49th Annual IEEE/ACM International Symposium on Microarchitecture, Taipei, Taiwan, October 15-19, 2016. DOI : 10.1109/MICRO.2016.7783709.

Detailed record

A Cache-Assisted Scratchpad Memory for Multiple-Bit-Error Correction

H. Farbeh; N. S. Mirzadeh; N. F. Ghalaty; S.-G. Miremadi; M. Fazeli et al.

IEEE Transactions on Very Large Scale Integration (VLSI) Systems. 2016. Vol. 24, num. 11, p. 3296 - 3309. DOI : 10.1109/TVLSI.2016.2544811.

Detailed record

Towards Near-Threshold Server Processors

A. Pahlevan; J. Picorel Obando; A. Pourhabibi Zarandi; D. Rossi; M. Zapater Sancho et al.

2016. Design, Automation and Test in Europe Conference (DATE '16), Dresden, Germany, March 14-18, 2016. p. 7 - 12.

Detailed record

An Analysis of Load Imbalance in Scale-out Data Serving

S. Novakovic; A. Daglis; E. Bugnion; B. Falsafi; B. Grot

2016. ACM SIGMETRICS, Antibes Juan-Les-Pins, France, June 14-18, 2016. p. 367 - 368. DOI : 10.1145/2896377.2901501.

Detailed record

The Case for RackOut: Scalable Data Serving Using Rack-Scale Systems

S. Novakovic; A. Daglis; E. Bugnion; B. Falsafi; B. Grot

2016. ACM Symposium on Cloud Computing, Santa Clara, USA, October 05-07, 2016. DOI : 10.1145/2987550.2987577.

Detailed record

Near-Memory Data Services

B. Falsafi; M. Stan; K. Skadron; N. Jayasena; Y. Chen et al.

IEEE Micro. 2016. Vol. 36, num. 1, p. 6 - 13. DOI : 10.1109/MM.2016.9.

Detailed record