【ANSO Science Lecture Series】Lecture 5 of ScienceOne
| 09 06, 2026

"Reinforcement Learning and Large Language Models" is the fifth lecture of ScienceOne.



Details of the Lecture

In this session, Prof. ZHANG Qichao will introduce the concepts of reinforcement learning and large language model (LLM) reasoning. The talk first presents the preliminaries of reinforcement learning, then demonstrates common techniques for LLM reasoning. It also covers the ScienceOne post-training methodology and its application to astronomy tasks.


Speaker Profile

ZHANG Qichao is a professor at the Institute of Automation, Chinese Academy of Sciences. His research focuses on reinforcement learning and large language model reasoning. He is a recipient of the Dean's Award of the Chinese Academy of Sciences. He has published over 80 papers in CCF-A/B conferences and IEEE Transactions journals, with over 3,800 citations. As a core member, he developed reinforcement learning post-training algorithms for the ScienceOne Foundation Model. He has also established collaborations with multiple companies, including Meituan and Baidu.


Date & Time

15:00–16:00 (Beijing Time, GMT+8)

June 16, 2026


Organizers

● Alliance of National and International Science Organizations for the Belt and Road Regions (ANSO)

● Institute of Automation, Chinese Academy of Sciences (CASIA)


Online Access (Microsoft Teams)

Meeting Link: https://teams.live.com/meet/9356063392996?p=qL5Qvqq5TyF7pHo5zd

Meeting ID: 935 606 339 299 6

Passcode: fV3we7