Max Anfilofyev’s Post

Chief CareBot | Scaling Patient Care 8x with AI | Chief Product Officer @ DR | Connect to scale with AI

Study published in Nature doesn't conclude that LLMs are not ready for clinical decision-making: https://lnkd.in/ekRZGyjf

The study highlights the limitations of some open-source large language models (LLMs) in clinical decision-making. It’s not surprising that they struggle, since these models underperform humans on medical benchmarks. Exciting times ahead: proprietary models like GPT-4, which outperform humans on medical information understanding benchmarks, should be tested next!

Chart: MedQA (USMLE) accuracy. The top evaluated model, Meditron, significantly underperforms human experts, whereas GPT-4 outperforms them: https://lnkd.in/edM8dB5H
