CV
Education
- B.S. in Cyber Security, Sichuan University, 2024
- Ph.D. in Electrical and Computer Engineering, University of Maryland, College Park, 2029 (expected)
Work experience
- Summer 2025: Research Intern
  - Snowflake
  - Duties included: Reinforcement Learning, Text-to-SQL, Reasoning Model, Agentic Workflow, Verl, Ray
  - Supervisors: Zhewei Yao & Yuxiong He
- Fall 2024: Research Intern
  - LLM360 Team/MBZUAI
  - Duties included: Scaling law, Megatron
  - Supervisors: Hongyi Wang & Hector Liu
Skills
- Languages: Python, C#, C++, C, Java, JavaScript
- Toolkits: Git, Bash, Docker, MySQL
- Frameworks/Others: PyTorch, TensorFlow, LaTeX, Unity, scikit-learn, Blender
Publications
- G. Sun, Z. Wang, B. Tian, M. Liu, Z. Shen, S. He, Y. He, W. Ye, Y. Wang, A. Li. “CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs.” arXiv:2505.13778, 2025. [Link]
- Y. Wang*, G. Sun*, W. Ye, G. Qu, A. Li. “VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation.” arXiv:2505.11849, 2025. [Link]
- G. Sun*, Z. Wang*, X. Zhao, B. Tian, Z. Shen, Y. He, J. Xing, A. Li. “Invisible Tokens, Visible Bills: The Urgent Need to Audit Hidden Operations in Opaque LLM Services.” arXiv:2505.18471, 2025. [Link]
- Z. Yao, G. Sun, L. Borchmann, Z. Shen, M. Deng, B. Zhai, H. Zhang, A. Li, Y. He. “Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL.” arXiv:2505.20315, 2025. [Link] [Code]
- S. He*, G. Sun*, Z. Shen, A. Li. “What Matters in Transformers? Not All Attention is Needed.” arXiv:2406.15786, 2024. [Link] [Code]
- X. Zhao*, G. Sun*, R. Cai*, Y. Zhou*, P. Li*, P. Wang, B. Tan, Y. He, L. Chen, Y. Liang, B. Chen, B. Yuan, H. Wang†, A. Li†, Z. Wang†, T. Chen†. “Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild.” NeurIPS 2024 Datasets & Benchmarks. [Link]
- S. He, T. Ge, G. Sun, et al. “Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers.” EMNLP 2025 (Main). [Link]
- Z. Wang, Z. Shen, Y. He, G. Sun, et al. “Flora: Federated Fine-tuning Large Language Models with Heterogeneous Low-Rank Adaptations.” NeurIPS 2024. [Link]
- Y. He, Z. Wang, Z. Shen, G. Sun, et al. “SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning.” NeurIPS 2024. [Link]
Awards
- 2025 Qualcomm Innovation Fellowship, Qualcomm
- 2024 Dean’s Fellowship, University of Maryland, College Park
- 2023 Outstanding Undergraduate Graduate, Sichuan University
- 2022 National Scholarship (the highest-honor scholarship in China)
- 2022 & 2023 First-class University Annual Scholarship, Sichuan University
- 2021 & 2022 Outstanding Student, Sichuan University
- 2022 National 1st Prize, China International College Students “Internet+” Innovation & Entrepreneurship Competition
- 2022 National 3rd Prize, “China Software Cup” Software Design Competition