Skip to content
View Battam1111's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing
  • Hong Kong Polytechnic University
  • Hong Kong
  • 17:40 (UTC +08:00)

Highlights

  • Pro

Organizations

@polyunlp

Block or report Battam1111

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Battam1111/README.md

Hi there, I'm Yanjun Chen (้™ˆๅฝฆ็ญ ) ๐Ÿ‘‹

I am an INTJ ๐Ÿง  and a PhD student at Hong Kong Polytechnic University & EIT ๐ŸŽ“, specializing in Reinforcement Learning with Human Feedback (RLHF) and EMBODYAI ๐Ÿค–.

About Me

  • ๐ŸŒ From: China ๐Ÿ‡จ๐Ÿ‡ณ, currently living in Hong Kong ๐Ÿ‡ญ๐Ÿ‡ฐ
  • ๐ŸŽฏ MBTI: INTJ (The Architect), always seeking knowledge and optimization.
  • ๐Ÿซ Education: PhD student at Hong Kong Polytechnic University
  • ๐Ÿ“š Research Interests: Reinforcement Learning, AI Embodiment

Skills & Tools

  • Programming Languages: Python ๐Ÿ, C/C++ โš™๏ธ
  • Spoken Languages: Chinese ๐Ÿ‡จ๐Ÿ‡ณ, English ๐Ÿ‡ฌ๐Ÿ‡ง, Japanese ๐Ÿ‡ฏ๐Ÿ‡ต
  • Favourite Fields: Computer Science, Artificial Intelligence, Mathematics

Hobbies & Interests

  • ๐Ÿ“ Playing table tennis
  • ๐ŸŽฎ Video games
  • ๐ŸŽค KTV enthusiast
  • ๐Ÿ“– Learning new things, especially science and tech topics

Contact Me

Pinned Loading

  1. AccuracyParadox-RLHF AccuracyParadox-RLHF Public

    [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models".

    Python 8

  2. YJ-SACR YJ-SACR Public

    Jupyter Notebook

  3. YJ-MADDPG YJ-MADDPG Public

    Python 1

  4. MCTSV MCTSV Public

    Python 3