{"id":66700,"date":"2023-12-11T11:25:05","date_gmt":"2023-12-11T15:25:05","guid":{"rendered":"https:\/\/coinscreed.com\/staging\/?p=66700"},"modified":"2023-12-11T11:25:07","modified_gmt":"2023-12-11T15:25:07","slug":"chatgpt-4-0-passes-clinical-neurology-exam","status":"publish","type":"post","link":"https:\/\/coinscreed.com\/staging\/chatgpt-4-0-passes-clinical-neurology-exam\/","title":{"rendered":"ChatGPT 4.0 Passes Clinical Neurology Exam"},"content":{"rendered":"\n<p>ChatGPT 4.0, the most recent iteration of OpenAI's large language model (LLM), scored 85% correctly on a clinical neurology exam of the American Board of Psychiatry and Neurology during a <a href=\"https:\/\/coinscreed.com\/staging\/sri-lankas-central-bank-completes-proof-of-concept-kyc-platform.html\" target=\"_blank\" rel=\"noreferrer noopener\">proof-of-concept<\/a> study.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img fetchpriority=\"high\" decoding=\"async\" width=\"963\" height=\"524\" src=\"https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35.png\" alt=\"ChatGPT 4.0 Passes Clinical Neurology Exam\" class=\"wp-image-66709\" srcset=\"https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35.png 963w, https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35-300x163.png 300w, https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35-768x418.png 768w, https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35-18x10.png 18w, https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35-750x408.png 750w\" sizes=\"(max-width: 963px) 100vw, 963px\" \/><figcaption class=\"wp-element-caption\">ChatGPT 4.0 Passes Clinical Neurology Exam<\/figcaption><\/figure>\n\n\n\n<p>A group of researchers from the German Cancer Research Center in Heidelberg and University Hospital Heidelberg published the experiment results on December 7. Two LLMs were examined on May 31: ChatGPT 3.5 and its subsequent iteration, ChatGPT 4.0.<\/p>\n\n\n\n<p>The researchers supplemented a subset of the European Board of Neurology questions with those from the <a href=\"https:\/\/abpn.org\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">American Board of Psychiatry and Neurology<span class=\"wpil-link-icon\" title=\"Link goes to external site.\" style=\"margin: 0 0 0 5px;\"><svg width=\"24\" height=\"24\" style=\"height:16px; width:16px; fill:#000000; stroke:#000000; display:inline-block;\" viewBox=\"0 0 24 24\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:svg=\"http:\/\/www.w3.org\/2000\/svg\"><g id=\"wpil-svg-outbound-7-icon-path\" fill=\"none\" clip-path=\"url(#clip0_31_188)\">\r\n                            <path d=\"M9.16724 14.8891L20.1672 3.88908\" stroke-linecap=\"round\"\/>\r\n                            <path d=\"M13.4497 3.53554L20.5208 3.53554L20.5208 10.6066\" stroke-linecap=\"round\" stroke-linejoin=\"round\"\/>\r\n                            <path d=\"M17.5 13.5L17.5 16.26C17.5 17.4179 17.5 17.9968 17.2675 18.4359C17.0799 18.7902 16.7902 19.0799 16.4359 19.2675C15.9968 19.5 15.4179 19.5 14.26 19.5L7.74 19.5C6.58213 19.5 6.0032 19.5 5.56414 19.2675C5.20983 19.0799 4.92007 18.7902 4.73247 18.4359C4.5 17.9968 4.5 17.4179 4.5 16.26L4.5 9.74C4.5 8.58213 4.5 8.0032 4.73247 7.56414C4.92007 7.20983 5.20982 6.92007 5.56414 6.73247C6.0032 6.5 6.58213 6.5 7.74 6.5L11 6.5\" stroke-linecap=\"round\"\/>\r\n                        <\/g>\r\n                        <defs>\r\n                            <clipPath id=\"clip0_31_188\">\r\n                                <rect fill=\"white\" height=\"24\" width=\"24\"\/>\r\n                            <\/clipPath>\r\n                        <\/defs><\/svg><\/span><\/a>&#8216;s neurology exam question bank.<\/p>\n\n\n\n<p>The accuracy rate of the older ChatGPT model was 66.8%, or 1306 correct responses out of 1956, whereas the more recent ChatGPT 4.0 improved to 85% with 1662 correct answers. Humans achieved an average score of 73.8%. <\/p>\n\n\n\n<p>ChatGPT 4.0 demonstrated superior performance compared to human users on psychological, cognitive, and behavioral-related questions, effectively &#8220;passing&#8221; the neurology exam with a score of 70%. A passing grade is typically equivalent to 70% accurate responses in academic institutions.<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><em>\u201cThese findings suggest that with further refinements, <a href=\"https:\/\/coinscreed.com\/staging\/eu-reportedly-calls-for-additional-restrictions-for-large-ai-models.html\" target=\"_blank\" rel=\"noreferrer noopener\">large language models <\/a>could have significant applications in clinical neurology.\u201d<\/em><\/p>\n<\/blockquote>\n\n\n\n<p>As per the experiment's conducting group, the following modifications to the LLMs should be considered for clinical neurology application:<\/p>\n\n\n\n<p>The researchers note that several reservations remain. Although the documentation and decision-making support systems offer a distinct opportunity to implement LLMs, neurologists should exercise prudence in practice due to their continued limitations regarding high-order cognitive tasks. One of the authors of the study, Dr. Varun Venkataramani, stated:<\/p>\n\n\n\n<blockquote class=\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\">\n<p><em>We see our study more as a proof of concept for the capabilities of LLMs. There is still development needed and probably even specific fine-tuning of LLMs to make them properly applicable for clinical neurology.\u00a0<\/em><\/p>\n<\/blockquote>\n\n\n\n<p>More precisely, our research is a demonstration of the potential of LLMs. Further development and precise refinement are required to ensure that LLMs are applicable in clinical neurology.<\/p>\n\n\n\n<p>AI is already tackling significant healthcare challenges, including combating antibiotic overprescribing in Hong Kong and discovering a remedy for cancer on behalf of <a href=\"https:\/\/coinscreed.com\/staging\/astrazeneca-ai-company-absci-partner-to-find-cancer-cure.html\" target=\"_blank\" rel=\"noreferrer noopener\">AstraZeneca<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>ChatGPT 4.0, the most recent iteration of OpenAI&#8217;s large language model (LLM), scored 85% correctly on a clinical neurology exam of the American Board of Psychiatry and Neurology during a proof-of-concept study. A group of researchers from the German Cancer Research Center in Heidelberg and University Hospital Heidelberg published the experiment results on December 7. [&hellip;]<\/p>\n","protected":false},"author":12,"featured_media":66709,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[9],"tags":[17547,17548,15168,14081],"class_list":["post-66700","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech","tag-chatgpt-4","tag-clinical-neurology-exam","tag-llm","tag-openai"],"jetpack_featured_media_url":"https:\/\/coinscreed.com\/staging\/wp-content\/uploads\/2023\/12\/image-35.png","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/posts\/66700","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/users\/12"}],"replies":[{"embeddable":true,"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/comments?post=66700"}],"version-history":[{"count":0,"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/posts\/66700\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/media\/66709"}],"wp:attachment":[{"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/media?parent=66700"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/categories?post=66700"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/coinscreed.com\/staging\/wp-json\/wp\/v2\/tags?post=66700"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}