My Photo

Tiannuo Yang (杨天诺)

Master Student
Nankai University
yangtn@nbjl.nankai.edu.cn

           

Abstract

Hi there! I am an incoming PhD student at Univeristy of Southern California.

I work on System4ML and ML4System. My work aims to harness hardware resources, automate the operation of complex systems, and enhance the efficiency of ML (e.g., LLM and RAG). Parts of it have been deployed in industry. Click to see My CV.

Recently, I've been exploring the Retrieval-Augmented Reasoning efficiency, from the perspective of ANNS algorithm and LLM inference mechanisms. New AI paradigms are emerging, and what we are demystifying on AI effiency is veryyy fascinating!

News

  • [Mar. 2025] Thrilled to be pursuing my PhD at USC on AI Systems!
  • [Jan. 2025] One paper on Performance Tuning was accepted by WWW 2025 as an Oral Presentation. A wonderful collaboration with my colleagues at Ant Group!
  • [Oct. 2024] Served as a reviewer for WWW 2025.
  • [March 2024] One paper on Performance Tuning was accepted by ICDE 2024. Meet me in the Netherlands!
  • [Sep. 2023] Got the first citation (by Cai et al., 2023) in my career!
  • [Sep. 2023] Released a collaborative Survey on Green Computing.
  • [June 2023] Started academic internship at Ant Group (Beijing).
  • [June 2023] One paper on Performance Tuning was accepted by ICPP 2023.
  • [June 2023] Undergraduate thesis on Operations Research was accepted by SEPS.
  • [Dec. 2022] Won the 3rd prize in Massive Storage Competition (organizer: Huawei).

Publications

Performance Tuning

Towards SLO-Optimized LLM Serving via Automatic Inference Engine Tuning
K Cheng, Z Wang, W Hu, T Yang, J Li , S Zhang
The Web Conference (WWW), 2025
PDF | Code | BibTex    (Oral Presentation)

VDTuner: Automated Performance Tuning for Vector Data Management Systems
T Yang, W Hu, W Peng, Y Li , J Li , X Liu, G Wang
International Conference on Data Engineering (ICDE), 2024
PDF | Code | BibTex    (Deployed on Ant Group's CodeFuse)

CoTuner: A Hierarchical Learning Framework for Coordinately Optimizing Resource Partitioning and Parameter Tuning
T Yang, R Chen, Y Li , X Liu, G Wang
International Conference on Parallel Processing (ICPP), 2023
PDF | BibTex

Research Overview

On the Opportunities of Green Computing: A Survey
Y Zhou, …, T Yang (Co-First Author), … and X Zeng
arXiv preprint, 2023
PDF | BibTex

Operations Research (Undergraduate Thesis)

Feasibility on the Integration of Passenger and Freight Transportation in Rural Areas: A Service Mode and an Optimization Model
T Yang , Z Chu, B Wang
Socio-Economic Planning Sciences (SCI/SSCI, JCR Q1), 2023
PDF | Data | BibTex

Education

Internship

  • [June 2023 - Jan. 2024] Academic Internship, Ant Group, Beijing, China.

Honors & Awards

Besides Research

Take a look at my Life Outside of Research.