


default search action
23rd CGO 2026: Sydney, Australia
- Stephen M. Blackburn, Albert Cohen, Timothy M. Jones:

IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2026, Sydney, Australia, January 31 - Feb. 4, 2026. IEEE 2026, ISBN 979-8-3315-9288-2 - Prasanth Chatarasi, Alex Gatea, Wei Wang, Chris Bowler, Shubham Jain, Masoud Ataei Jaliseh, Nicole Khoun, Alberto Mannari, Bardia Mahjour, Viji Srinivasan, Swagath Venkataramani:

Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization. 1-13 - Damitha Lenadora, Vimarsh Sathia, Gerasimos Gerogiannis, Serif Yesil, Josep Torrellas, Charith Mendis:

GRANII: Selection and Ordering of Primitives in GRAph Neural Networks using Input Inspection. 14-27 - Bobby Yan, Alexander J. Root, Trevor Gale, David Broman, Fredrik Kjolstad:

Fast Autoscheduling for Sparse ML Frameworks. 28-43 - Prasanth Chatarasi, Alex Gatea, Bardia Mahjour, Jintao Zhang, Alberto Mannari, Chris Bowler, Shubham Jain, Masoud Ataei Jaliseh, Nicole Khoun, Kamlesh Kumar, Viji Srinivasan, Swagath Venkataramani:

Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators. 44-56 - Yuechen Mu, Guangli Li, Shiping Chen, Jingling Xue:

PriTran: Privacy-Preserving Inference for Transformer-Based Language Models under Fully Homomorphic Encryption. 57-69 - Tianxiang Sui, Jianxin Lai, Long Li, Peng Yuan, Yan Liu, Qing Zhu, Xiaojing Zhang, Linjie Xiao, Mingzhe Zhang, Jingling Xue:

FHEFusion: Enabling Operator Fusion in FHE Compilers for Depth-Efficient DNN Inference. 70-83 - Giacomo Priamo, Daniele Cono D'Elia, Mathias Payer, Leonardo Querzoni:

Towards Path-Aware Coverage-Guided Fuzzing. 84-97 - François de Ferrière, Yves Janin, Sirine Mechmech:

SecSwift, a Compiler-Based Framework for Software Countermeasures in Cybersecurity. 98-108 - Florian Huemer, Aleksandar Prokopec, David Leopoldseder, Raphael Mosaner, Hanspeter Mössenböck

:
Partial-Evaluation Templates: Accelerating Partial Evaluation with Pre-compiled Templates. 109-122 - Bolei Tong, Yongyan Fang, Chaorui Wang, Qingan Li, Jingling Xue, Mengting Yuan:

Pyls: Enabling Python Hardware Synthesis with Dynamic Polymorphism via LCRS Encoding. 123-135 - Jonathan Van der Cruysse, Tzung-Han Juang, Shakiba Bolbolian Khah, Christophe Dubach:

SkeleShare: Algorithmic Skeletons and Equality Saturation for Hardware Resource Sharing. 136-149 - Marco Siracusa, Olivia Hsu, Víctor Soria Pardos, Joshua Randall, Arnaud Grasset, Eric Biscondi, Douglas J. Joseph, Randy Allen, Fredrik Kjolstad, Miquel Moretó Planas, Adrià Armejach:

Ember: A Compiler for Embedding Operations on Decoupled Access-Execute Architectures. 150-163 - Yeonoh Jeong, Taehyeong Park, Yongjun Park:

Flow-Graph-Aware Tiling and Rescheduling for Memory-Efficient On-Device Inference. 164-175 - Arjun H. Kumar, Bhavya Hirani, Hang Shao, Tobi Ajila, Vijay Sundaresan, Daryl Maier, Manas Thakur:

VFlatten: Selective Value-Object Flattening using Hybrid Static and Dynamic Analysis. 176-187 - Lingqi Zhang, Tengfei Wang, Jiajun Huang, Chen Zhuang, Ivan R. Ivanov, Peng Chen, Toshio Endo, Mohamed Wahib:

FRUGAL: Pushing GPU Applications beyond Memory Limits. 188-201 - Tommy McMichen, Simone Campanoni:

Automatic Data Enumeration for Fast Collections. 202-215 - Yoonho Choi, Kyoungtae Lee, Minji Kim, Hyungsoo Jung, Hyojin Sung:

FORTE: Online DataFrame Query Optimizer. 216-227 - Amir Mohammad Tavakkoli, Cosmin E. Oancea, Mary Hall:

LEGO: A Layout Expression Language for Code Generation of Hierarchical Mapping. 228-241 - Yihong Zhang, Derek K. Gerstmann, Andrew Adams, Maaz Bin Safeer Ahmad:

Pushing Tensor Accelerators beyond MatMul in a User-Schedulable Language. 242-254 - Hongzheng Chen, Bin Fan, Alexander Collins, Bastian Hagedorn, Evghenii Gaburov, Masahiro Masuda, Matthew Brookhart, Chris Sullivan, Jason Knight, Zhiru Zhang, Vinod Grover:

Tawa: Automatic Warp Specialization for Modern GPUs with Asynchronous References. 255-267 - Marouane Benbetka, Merwan Bekkar, Riyadh Baghdadi, Martin Kong:

Dependence-Driven, Scalable Quantum Circuit Mapping with Affine Abstractions. 268-280 - Sanaa Sharma, Prakash Murali:

Space-Time Optimisations for Early Fault-Tolerant Quantum Computation. 281-294 - Ed Younis:

OpenQudit: Extensible and Accelerated Numerical Quantum Compilation via a JIT-Compiled DSL. 295-305 - Sungwoo Yun, Seonyoung Cheon, Dongkwan Kim, Heelim Choi, Kunmo Jeong, Chan Lee, Yongwoo Lee, Hanjun Kim:

Selene: Cross-Level Barrier-Free Pipelining for Irregular Nested Loops in High-Level Synthesis. 306-318 - Shreya Alladi, Alberto Ros, Alexandra Jimborean:

Enabling Automatic Compiler-Driven Vectorization of Transformers. 319-333 - César Piñeiro, Juan Carlos Pichel:

Unlocking Python Multithreading Capabilities using OpenMP-Based Programming with OMP4Py. 334-347 - Yian Su, Brian Homerding, Haocheng Gao, Federico Sossai, Yebin Chon, David I. August, Simone Campanoni:

The Parallel-Semantics Program Dependence Graph for Parallel Optimization. 348-361 - Shuaijiang Li, Jiacheng Zhao, Ying Liu, Shuoming Zhang, Lei Chen, Yijin Li, Yangyu Zhang, Zhicheng Li, Runyu Zhou, Xiyu Shi, Chunwei Xia, Yuan Wen, Xiaobing Feng, Huimin Cui:

From Threads to Tiles: T2T, a Compiler for CUDA-to-NPU Translation via 2D Vectorization. 362-374 - Andrei Rimsa, Anderson Faustino da Silva, Camilo Santana, Fernando Magno Quintão Pereira:

Binary Diffing via Library Signatures. 375-389 - Puzhuo Liu, Peng Di, Jingling Xue, Yu Jiang:

BIT: Empowering Binary Analysis through the LLVM Toolchain. 390-402 - Yue Tang, Mianzhi Wu, Yufeng Li, Haoyu Liao, Jianmei Guo, Bo Huang:

Dr.avx: A Dynamic Compilation System for Seamlessly Executing Hardware-Unsupported Vectorization Instructions. 403-415 - Nahuel Palumbo, Guillermo Polito, Stéphane Ducasse, Pablo Tesone:

Practical: Are Abstract-Interpreter Baseline JITs Worth It? An Empirical Evaluation through Metacompilation. 416-426 - Tobias Schwarz, Tobias Kamm, Alexis Engelke:

TPDE: A Fast Adaptable Compiler Back-End Framework. 427-439 - Florian Drescher, Alexis Engelke:

Synthesizing Instruction Selection Back-Ends from ISA Specifications Made Practical. 440-452 - Ruifeng Zhang, Xiangwei Wang, Ang Li, Xipeng Shen:

SparseX: Synergizing GPU Libraries for Sparse Matrix Multiplication on Heterogeneous Processors. 453-465 - Francisco López, Lars Karlsson, Paolo Bientinesi:

Compilation of Generalized Matrix Chains with Symbolic Sizes. 466-478 - Haide He, Pengfei Su:

TRACE4J: A Lightweight, Flexible, and Insightful Performance Tracing Tool for Java. 479-492 - Keren Zhou, Tianle Zhong, Hao Wu, Jihyeong Lee, Yue Guan, Yufei Ding, Corbin Robeck, Yuanwei Fang, Jeff Niu, Philippe Tillet:

Proton: Towards Multi-level, Adaptive Profiling for Triton. 493-506 - Anderson Faustino Da Silva, Marcelo Borges Nogueira, Sérgio Queiroz de Medeiros, Jerónimo Castrillón, Fernando Magno Quintão Pereira:

On the Precision of Dynamic Program Fingerprints Based on Performance Counters. 507-519 - Mao Lin, Hyeran Jeon, Keren Zhou:

PASTA: A Modular Program Analysis Tool Framework for Accelerators. 520-534 - Håvard Rognebakke Krogstie, Helge Bahmann, Magnus Själander, Nico Reissmann:

PIP: Making Andersen's Points-to Analysis Sound and Practical for Incomplete C Programs. 535-547 - Siyuan Brant Qian, Vimarsh Sathia, Ivan R. Ivanov, Jan Hückelheim, Paul Hovland, William S. Moses:

Thinking Fast and Correct: Automated Rewriting of Numerical Code through Compiler Augmentation. 548-562 - Nilesh Rajendra Shah, M. V. V. S. Manoj Kumar, Dhairya Baxi

, Ramakrishna Upadrasta:
PolyUFC: Polyhedral Compilation Meets Roofline Analysis for Uncore Frequency Capping. 563-576 - Hongtao Wu, Yu Chen, Mengfei Xie, Futeng Yang, Jun Yan, Jiang Ma, Jianming Fu, Chun Jason Xue, Qingan Li:

Accelerating App Recompilation across Android System Updates by Code Reusing. 577-588 - Tommaso Pegolotti, Dan Alistarh, Markus Püschel:

QIGen: A Kernel Generator for Inference on Nonuniformly Quantized Large Language Models. 589-602 - Hao Qian, Guangli Li, Qiuchu Yu, Xueying Wang, Jingling Xue:

DyPARS: Dynamic-Shape DNN Optimization via Pareto-Aware MCTS for Graph Variants. 603-616 - Hyunho Kwon, Sanggyu Shin, Ju Min Lee, Hoyun Youm, Seungbin Song, Seongho Kim, Hanwoong Jung, Seungwon Lee, Hanjun Kim:

Compiler-Runtime Co-operative Chain of Verification for LLM-Based Code Optimization. 617-629 - Xiao Zhang, Yaoyao Ding, Bolin Sun, Yang Hu, Tatiana Shpeisman, Gennady Pekhimenko:

Hexcute: A Compiler Framework for Automating Layout Synthesis in GPU Programs. 630-643 - Kaio Henrique Andrade Ananias, Danila Seliayeu, José Nelson Amaral, Fernando Magno Quintão Pereira:

Multidirectional Propagation of Sparsity Information across Tensor Slices. 644-656 - Hamza Javed, Christophe Dubach:

Synthesizing Specialized Sparse Tensor Accelerators for FPGAs via High-Level Functional Abstractions. 657-669 - Fan Luo, Guangli Li, Zhaoyang Hao, Xueying Wang, Xiaobing Feng, Huimin Cui, Jingling Xue:

Progressive Low-Precision Approximation of Tensor Operators on GPUs: Enabling Greater Trade-Offs between Performance and Accuracy. 670-682 - Alexander Brauckmann, Aarsh Chaube, José Wesley de S. Magalhães, Elizabeth Polgreen, Michael F. P. O'Boyle:

Tensor Program Superoptimization through Cost-Guided Symbolic Program Synthesis. 683-695 - Mohammed Tirichine, Nassim Ameur, Nazim Bendib, Iheb Nassim Aouadj, Djad Bouchama, Rafik Bouloudene, Riyadh Baghdadi:

A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler. 696-710 - Cristian Assaiante

, Simone Di Biasio, Snehasish Kumar, Giuseppe Antonio Di Luna, Daniele Cono D'Elia, Leonardo Querzoni:
Towards Threading the Needle of Debuggable Optimized Binaries. 711-725 - Ravikiran Ravindranath Reddy, Sawan Singh, Arthur Perais, Alberto Ros, Alexandra Jimborean:

Compiler-Assisted Instruction Fusion. 726-739 - Xiangxin Fang, Jiaqin Kang, Rodrigo Rocha, Sam Ainsworth, Lev Mukhanov:

LLM-VeriOpt: Verification-Guided Reinforcement Learning for LLM-Based Compiler Optimization. 740-755

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














