Publications

(2024). Synergy: Collaborating Centralized and Local Scheduling for Serverless Functions. 30th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2024, Belgrade, Serbia, October 10-14, 2024.

Cite DOI URL

ASPLOS 2024 (2024). FUYAO: DPU-enabled Direct Data Transfer for Serverless Computing. Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, ASPLOS 2024, La Jolla, CA, USA, 27 April 2024- 1 May 2024. CCF-A

Cite DOI URL

(2024). Accelerating Cold Start of Thread-level Sandbox Using Snapshot and tfork. 30th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2024, Belgrade, Serbia, October 10-14, 2024.

Cite DOI URL

(2023). Rethinking Deployment for Serverless Functions: A Performance-First Perspective. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2023, Denver, CO, USA, November 12-17, 2023.

Cite DOI URL

(2023). High-throughput Sampling, Communicating and Training for Reinforcement Learning Systems. 31st IEEE/ACM International Symposium on Quality of Service, IWQoS 2023, Orlando, FL, USA, June 19-21, 2023.

Cite DOI URL

(2023). Flame: A Centralized Cache Controller for Serverless Computing. Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 4, ASPLOS 2023, Vancouver, BC, Canada, March 25-29, 2023.

Cite DOI URL

(2022). Tetris: Memory-efficient Serverless Inference through Tensor Sharing. Proceedings of the 2022 USENIX Annual Technical Conference, USENIX ATC 2022, Carlsbad, CA, USA, July 11-13, 2022.

Cite URL

(2022). TailCmp - A Tail Latency Evaluation Solution of Public Cloud and Labeled von Neumann Architecture based Cloud Prototype. IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, ISPA/BDCloud/SocialCom/SustainCom 2022, Melbourne, Australia, December 17-19, 2022.

Cite DOI URL

(2022). Multi-Resource Scheduling for Multiple Service Function Chains with Deep Reinforcement Learning. 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022, Nanjing, China, January 10-12, 2023.

Cite DOI URL

(2022). Maxwell's Demon in Tail-tolerant, Resource-efficient Serverless Computing. 28th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2022, Nanjing, China, January 10-12, 2023.

Cite DOI URL

(2022). INFless: a native serverless system for low-latency, high-throughput inference. ASPLOS ‘22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022 - 4 March 2022.

Cite DOI URL

(2022). A Trigonometric Function Instruction Set Extension Method Based on RISC-V. 22nd IEEE/ACIS International Conference on Computer and Information Science, ICIS 2022, Zhuhai, China, June 26-28, 2022.

Cite DOI URL

(2021). Understanding, predicting and scheduling serverless workloads under partial interference. International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2021, St. Louis, Missouri, USA, November 14-19, 2021.

Cite DOI URL

(2021). RLTree: Website Fingerprinting Through Resource Loading Tree. Network and System Security - 15th International Conference, NSS 2021, Tianjin, China, October 23, 2021, Proceedings.

Cite DOI URL

(2021). RAEF: Energy-efficient Resource Allocation through Energy Fungibility in Serverless. 27th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2021, Beijing, China, December 14-16, 2021.

Cite DOI URL

(2021). $PSeer$: Performance Prediction for Partially Co-located Distributed Deep Learning. 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys), Haikou, Hainan, China, December 20-22, 2021.

Cite DOI URL

(2020). XShot: Light-weight Link Failure Localization using Crossed Probing Cycles in SDN. ICPP 2020: 49th International Conference on Parallel Processing, Edmonton, AB, Canada, August 17-20, 2020.

Cite DOI URL

(2020). Rhythm: component-distinguishable workload deployment in datacenters. EuroSys ‘20: Fifteenth EuroSys Conference 2020, Heraklion, Greece, April 27-30, 2020.

Cite DOI URL

(2020). NetCruiser: Localize Network Failures by Learning from Latency Data. 2020 IEEE International Conference on Smart Internet of Things, SmartIoT 2020, Beijing, China, August 14-16, 2020.

Cite DOI URL

(2020). Glad: Global And Local Anomaly Detection. IEEE International Conference on Multimedia and Expo, ICME 2020, London, UK, July 6-10, 2020.

Cite DOI URL

(2020). A Novel Blockchain Network Structure Based on Logical Nodes. Wireless Algorithms, Systems, and Applications - 15th International Conference, WASA 2020, Qingdao, China, September 13-15, 2020, Proceedings, Part I.

Cite DOI URL

(2019). QIMS: QoE-Centric Information-Agnostic Mix-Flows Scheduling in SD-WAN. 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019, Zhangjiajie, China, August 10-12, 2019.

Cite DOI URL

(2019). ElaX: Provisioning Resource Elastically for Containerized Online Cloud Services. 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, HPCC/SmartCity/DSS 2019, Zhangjiajie, China, August 10-12, 2019.

Cite DOI URL

(2019). Effective Straggler Mitigation with Cross-Layer Interference-Aware Optimization. 25th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2019, Tianjin, China, December 4-6, 2019.

Cite DOI URL

(2019). Distributed Traffic Engineering for Multi-Domain Software Defined Networks. 39th IEEE International Conference on Distributed Computing Systems, ICDCS 2019, Dallas, TX, USA, July 7-10, 2019.

Cite DOI URL

(2019). Deeplive: QoE Optimization for Live Video Streaming through Deep Reinforcement Learning. 25th IEEE International Conference on Parallel and Distributed Systems, ICPADS 2019, Tianjin, China, December 4-6, 2019.

Cite DOI URL

(2019). Cloud Resource Provision of Competitive Content Providers: Models and Analysis. 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking, ISPA/BDCloud/SocialCom/SustainCom 2019, Xiamen, China, December 16-18, 2019.

Cite DOI URL

(2018). Less Provisioning: A Fine-grained Resource Scaling Engine for Long-running Services with Tail Latency Guarantees. Proceedings of the 47th International Conference on Parallel Processing, ICPP 2018, Eugene, OR, USA, August 13-16, 2018.

Cite DOI URL

(2018). Experience-Availability Analysis of Online Cloud Services using Stochastic Models. 2018 IFIP Networking Conference and Workshops, Networking 2018, Zurich, Switzerland, May 14-16, 2018.

PDF Cite DOI

(2018). A Stochastic Model for Analyzing Tail Latency of Multi-Tier Online Cloud Services. 9th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2018, Taipei, Taiwan, December 26-28, 2018.

Cite DOI URL

(2017). Slow or Down?: Seem to Be the Same for Cloud Users. Proceedings of the first Workshop on Emerging Technologies for software-defined and reconfigurable hardware-accelerated Cloud Datacenters, ETCD@ASPLOS 2017, Xi’an, China, April 8, 2017.

Cite DOI URL

(2016). Fast Big Data Analysis in Geo-Distributed Cloud. 2016 IEEE International Conference on Cluster Computing, CLUSTER 2016, Taipei, Taiwan, September 12-16, 2016.

Cite DOI URL

(2015). Traffic engineering in hierarchical SDN control plane. 23rd IEEE International Symposium on Quality of Service, IWQoS 2015, Portland, OR, USA, June 15-16, 2015.

Cite DOI URL

(2015). Joint Scheduling of Data and Computation in Geo-Distributed Cloud Systems. 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid 2015, Shenzhen, China, May 4-7, 2015.

Cite DOI URL

(2012). On Revenue Driven Server Management in Cloud. CLOSER 2012 - Proceedings of the 2nd International Conference on Cloud Computing and Services Science, Porto, Portugal, 18 - 21 April, 2012.

Cite

(2012). Improving Cost-Efficiency through Failure-Aware Server Management and Scheduling in Cloud. Cloud Computing and Services Science - Second International Conference, CLOSER 2012, Porto, Portugal, April 18-21, 2012. Revised Selected Papers.

Cite DOI URL

(2011). A Resource Minimizing Scheduling Algorithm with Ensuring the Deadline and Reliability in Heterogeneous Systems. 25th IEEE International Conference on Advanced Information Networking and Applications, AINA 2011, Biopolis, Singapore, March 22-25, 2011.

Cite DOI URL

(2010). SPSE: A flexible QoS-based service scheduling algorithm for service-oriented Grid. 24th IEEE International Symposium on Parallel and Distributed Processing, IPDPS 2010, Atlanta, Georgia, USA, 19-23 April 2010 - Workshop Proceedings.

Cite DOI URL

(2010). Fault tolerant scheduling with dynamic number of replicas in heterogeneous system. 12th IEEE International Conference on High Performance Computing and Communications, HPCC 2010, 1-3 September 2010, Melbourne, Australia.

Cite DOI URL