Publications

Note: Some links are only provided through paid subscription services, which may limit access. Consult your institution’s library for assistance in obtaining these documents.

2017
A. Li, Zhao, W., and Song, S. Leon, BVF: Enabling Significant On-chip Power Savings via Bit-value-favor for Throughput Processors, in Proceedings of the 50th Annual IEEE/ACM International Symposium on Microarchitecture, New York, NY, USA, 2017.
J. Qiu, Zhao, Z., Wu, B., Vishnu, A., and Song, S. Leon, Enabling scalability-sensitive speculative parallelization for FSM computations, in Proceedings of the International Conference on Supercomputing, ICS 2017, Chicago, IL, USA, June 14-16, 2017.
R. D. Friese, Tallent, N. R., Vishnu, A., Kerbyson, D. J., and Hoisie, A., Generating Performance Models for Irregular Applications, in 2017 IEEE International Parallel and Distributed Processing Symposium, IPDPS 2017, Orlando, FL, USA, May 29-June 2, 2017.
A. Li, Song, S. Leon, Liu, W., Liu, X., Kumar, A., and Corporaal, H., Locality-Aware CTA Clustering for Modern GPUs, in Proceedings of the Twenty-Second International Conference on Architectural Support for Programming Languages and Operating Systems, New York, NY, USA, 2017.
C. Xie, Song, S. L., Wang, J., Zhang, W., and Fu, X., Processing-in-Memory Enabled Graphics Processors for 3D Rendering, in 23rd IEEE International Symposium on High-Performance Computer Architecture (HPCA-23), Austin, Texas, 2017.
N. A. Gawande, Landwehr, J. B., Daily, J. A., Tallent, N. R., Vishnu, A., and Kerbyson, D. J., Scaling Deep Learning Workloads: NVIDIA DGX-1/Pascal and Intel Knights Landing, in 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2017, Orlando/Buena Vista, FL, USA, May 29-June 2, 2017.
2016
N. R. Tallent, Barker, K. J., Gioiosa, R., Marquez, A., Kestor, G., Song, L., Tumeo, A., Kerbyson, D. J., and Hoisie, A., Assessing Advanced Technology in CENATE, 2016 IEEE International Conference on Networking, Architecture and Storage (NAS), pp. 1-2, 2016.
J. Tan, Song, S. Leon, Yan, K., Fu, X., Marquez, A., and Kerbyson, D., Combating the Reliability Challenge of GPU Register File at Low Supply Voltage, in Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, New York, NY, USA, 2016.
A. Li, Song, S. Leon, Kumar, A., Zhang, E. Z., Chavarría-Miranda, D. G., and Corporaal, H., Critical points based register-concurrency autotuning for GPUs, in 2016 Design, Automation & Test in Europe Conference & Exhibition, DATE 2016, Dresden, Germany, March 14-18, 2016.
A. B. Hayes, Li, L., Chavarría-Miranda, D., Song, S. Leon, and Zhang, E. Z., Orion: A Framework for GPU Occupancy Tuning, in Proceedings of the 17th International Middleware Conference, New York, NY, USA, 2016.
A. Li, Song, S. Leon, Wijtvliet, M., Kumar, A., and Corporaal, H., SFU-Driven Transparent Approximation Acceleration on GPUs, in Proceedings of the 2016 International Conference on Supercomputing, New York, NY, USA, 2016.
P. Roy, Liu, X., and Song, S. Leon, SMT-Aware Instantaneous Footprint Optimization, in Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, New York, NY, USA, 2016.
L. Li, Hayes, A. B., Song, S. Leon, and Zhang, E. Z., Tag-Split Cache for Efficient GPGPU Cache Utilization, in Proceedings of the 2016 International Conference on Supercomputing, New York, NY, USA, 2016.
A. Li, Song, L. Shuaiwen, Brugel, E., Kumar, A., Chavarria, D., and Corporaal, H., X: A Comprehensive Analytic Model for Parallel Machines, in 30th International Parallel and Distributed Processing Symposium (IPDPS), 2016.
2015
L. Tan, Chen, Z., and Song, S. Leon, Scalable Energy Efficiency with Resilience for High Performance Computing Systems: A Quantitative Methodology, ACM Transactions on Architecture and Code Optimization, vol. 12, pp. 35:1–35:27, 2015.