A malicious pattern detection engine for embedded security systems in the Internet of Things D Oh, D Kim, WW Ro Sensors 14 (12), 24188-24211, 2014 | 170 | 2014 |
Warped-compression: Enabling power efficient GPUs through register compression S Lee, K Kim, G Koo, H Jeon, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 43 (3S), 502-514, 2015 | 149 | 2015 |
Warped-slicer: Efficient intra-SM slicing through dynamic resource partitioning for GPU multiprogramming Q Xu, H Jeon, K Kim, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 44 (3), 230-242, 2016 | 139 | 2016 |
Access pattern-aware cache management for improving data utilization in GPU G Koo, Y Oh, WW Ro, M Annavaram Proceedings of the 44th annual international symposium on computer …, 2017 | 84 | 2017 |
Fast CU depth decision for HEVC using neural networks K Kim, WW Ro IEEE Transactions on Circuits and Systems for Video Technology 29 (5), 1462-1473, 2018 | 77 | 2018 |
Virtual thread: Maximizing thread-level parallelism beyond GPU scheduling limit MK Yoon, K Kim, S Lee, WW Ro, M Annavaram ACM SIGARCH Computer Architecture News 44 (3), 609-621, 2016 | 67 | 2016 |
Warped-preexecution: A GPU pre-execution approach for improving latency hiding K Kim, S Lee, MK Yoon, G Koo, WW Ro, M Annavaram 2016 IEEE International Symposium on High Performance Computer Architecture …, 2016 | 59 | 2016 |
Xsd: Accelerating mapreduce by harnessing the gpu inside an ssd BY Cho, WS Jeong, D Oh, WW Ro | 59 | 2013 |
Efficient peer-to-peer file sharing using network coding in MANET U Lee, JS Park, SH Lee, WW Ro, G Pau, M Gerla Journal of Communications and Networks 10 (4), 422-429, 2008 | 49 | 2008 |
APRES: Improving cache efficiency by exploiting load characteristics on GPUs Y Oh, K Kim, MK Yoon, JH Park, Y Park, WW Ro, M Annavaram ACM SIGARCH computer architecture news 44 (3), 191-203, 2016 | 43 | 2016 |
Boosting CUDA applications with CPU–GPU hybrid computing C Lee, WW Ro, JL Gaudiot International Journal of Parallel Programming 42 (2), 384-404, 2014 | 40 | 2014 |
Space: locality-aware processing in heterogeneous memory for personalized recommendations H Kal, S Lee, G Ko, WW Ro 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021 | 36 | 2021 |
Parallel GPU architecture simulation framework exploiting work allocation unit parallelism S Lee, WW Ro 2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013 | 34 | 2013 |
Deep learning with GPUs W Jeon, G Ko, J Lee, H Lee, D Ha, WW Ro Advances in Computers 122, 167-215, 2021 | 32 | 2021 |
On improving parallelized network coding with dynamic partitioning K Park, JS Park, WW Ro IEEE Transactions on Parallel and Distributed Systems 21 (11), 1547-1560, 2010 | 32 | 2010 |
Improving energy efficiency of GPUs through data compression and compressed execution S Lee, K Kim, G Koo, H Jeon, M Annavaram, WW Ro IEEE Transactions on Computers 66 (5), 834-847, 2016 | 29 | 2016 |
Mgmr: Multi-gpu based mapreduce Y Chen, Z Qiao, H Jiang, KC Li, WW Ro Grid and Pervasive Computing: 8th International Conference, GPC 2013 and …, 2013 | 27 | 2013 |
Cooperative heterogeneous computing for parallel processing on CPU/GPU hybrids C Lee, WW Ro, JL Gaudiot 2012 16th Workshop on Interaction between Compilers and Computer …, 2012 | 26 | 2012 |
Duplo: Lifting redundant memory accesses of deep neural networks for GPU tensor cores H Kim, S Ahn, Y Oh, B Kim, WW Ro, WJ Song 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 25 | 2020 |
WIR: Warp instruction reuse to minimize repeated computations in GPUs K Kim, WW Ro 2018 IEEE International Symposium on High Performance Computer Architecture …, 2018 | 25 | 2018 |