HuatuoGPT, Towards Taming Language Models To Be a Doctor
HuatuoGPT, Towards Taming Language Models To Be a Doctor
Hongbo Zhang,Junying Chen,69 Authors,https
18 Citations
TLDR
Experimental results demonstrate that HuatuoGPT achieves state-of-the-art results in performing medical consulta-tion and introduces RLMF (Reinforcement Learning from Mixed Feedback) where a reward model is trained to align the language model with the merits that both sources (ChatGPT and doctors) bring.
