UPDF AI

HuatuoGPT, Towards Taming Language Models To Be a Doctor

Hongbo Zhang,Junying Chen,69 Authors,https

18 Citations

TLDR

Experimental results demonstrate that HuatuoGPT achieves state-of-the-art results in performing medical consulta-tion and introduces RLMF (Reinforcement Learning from Mixed Feedback) where a reward model is trained to align the language model with the merits that both sources (ChatGPT and doctors) bring.