English
Share
Sign In
The true power of chatGPT ๐Ÿ’ช โ€œReinforcement learning through human feedbackโ€
Haebom
๐Ÿ‘
There are two reasons why ChatGPT has attracted so much attention from people.
The first one gave users the experience of directly inputting data and receiving results right away.
The second is that this process can be developed as if you were having a conversation through a prompt interface.
The technology used here is RLHF (Reinforcement Learning from Human Feedback). This technology has infinite scalability, but has not received much attention from the public.
RLHF stands for integrating reinforcement learning and human feedback into NLP.
RLHF is an impressive large-scale application of reinforcement learning, which has been used in games and simulation environments, to a new domain.
RLHF consists of three phases, each of which includes a goal, an intuition about the need, and technical details.
RLHF will be used more and more in the future and will become an important part not only in the artificial intelligence industry but also in all industries where artificial intelligence is applied.
Subscribe to 'haebom'
๐Ÿ“š Welcome to Haebom's archives.
---
I post articles related to IT ๐Ÿ’ป, economy ๐Ÿ’ฐ, and humanities ๐ŸŽญ.
If you are curious about my thoughts, perspectives or interests, please subscribe.
Would you like to be notified when new articles are posted? ๐Ÿ”” Yes, that means subscribe.
haebom@kakao.com
Subscribe
๐Ÿ‘