
Zhihan Liu - Google Scholar
Northwestern University - Cited by 256 - large language models - reinforcement learning - offline learning - online learning
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is ...
2024年5月26日 · View a PDF of the paper titled Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer, by Zhihan Liu and 7 other authors
Zhihan Liu - OpenReview
Promoting openness in scientific communication and the peer-review process
Zhihan LIU | Professor (Associate) | PhD | Central South University ...
Zhihan LIU, Professor (Associate) | Cited by 22 | of Central South University, Changsha (CSU) | Read 13 publications | Contact Zhihan LIU
Provably Efficient Generative Adversarial Imitation Learning for …
2021年8月19日 · View a PDF of the paper titled Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation, by Zhihan Liu and 4 other authors
Zhihan Liu - Google Scholar
Empirical Evidence from Patient Satisfaction.
Zhihan Liu - Meta | LinkedIn
Experience: Meta · Education: Columbia University in the City of New York · Location: San Francisco Bay Area · 354 connections on LinkedIn. View Zhihan Liu’s profile on LinkedIn, a professional...
Zhihan Lyu - Google 学术搜索
ACM Transactions on Multimedia Computing, Communications, and Applications …
About me - Zihan Liu
I am a Senior research scientist at Nvidia Applied Deep Learning Research (ADLR) Team. I received Bachelor Degree from Zhejiang University, and Ph.D. Degree from the Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology under the supervision of Prof. Pascale Fung in Center of AI Research.
Zhihan LIU | Beijing University of Posts and Telecommunications ...
Zhihan LIU | Cited by 1,446 | of Beijing University of Posts and Telecommunications, Beijing (BUPT) | Read 55 publications | Contact Zhihan LIU
- 某些结果已被删除