Sowoong Kim, Youngsam Shin, YeonGon Cho and Woongki Baek,
"
Harmonia: QoS-Aware and High-Throughput Generative Inference with a Single GPU
,"
in the Proceedings of the 32nd International European Conference on Parallel and Distributed Computing (
Euro-Par), Aug. 2026 (to appear)
Sowoong Kim, Eunyeong Sim, Youngsam Shin, YeonGon Cho and Woongki Baek,
"
Activation Sequence Caching: High-Throughput and Memory-Efficient Generative Inference with a Single GPU
,"
in the Proceedings of the 33rd International Conference on Parallel Architectures and Compilation Techniques (
PACT), Jun. 2024 (
PDF)
Sowoong Kim, Myeonggyun Han and Woongki Baek,
"
MARF: A Memory-Aware CLFLUSH-Based Intra- and Inter-CPU Side-Channel Attack
,"
in the Proceedings of the 28th European Symposium on Research in Computer Security (
ESORICS), Sep. 2023 (
PDF)
Sowoong Kim, Myeonggyun Han and Woongki Baek,
"
DPrime+DAbort: A High-Precision and Timer-Free Directory-Based Side-Channel Attack in Non-Inclusive Cache Hierarchies using Intel TSX
,"
in the Proceedings of the 28th IEEE International Symposium on High-Performance Computer Architecture (
HPCA), Apr. 2022 (
PDF)