RecruitingNCT07378358

Evaluation of AI Large Models for Diagnosis and Treatment in Real-World Cases: Multicenter Retrospective Study


Sponsor

First Affiliated Hospital of Fujian Medical University

Enrollment

800 participants

Start Date

Jan 1, 2026

Study Type

OBSERVATIONAL

Conditions

Summary

This multicenter retrospective study aims to evaluate the diagnostic and therapeutic performance of three large language models-ChatGPT, Gemini and Deepseek-using 800 archived inpatient medical records from urology departments across four tertiary hospitals. The study will focus on the accuracy and applicability of these models in disease recognition, preliminary diagnosis and treatment recommendation generation, in order to explore their potential value and limitations in supporting clinical decision-making in real-world settings.


Eligibility

Min Age: 18 Years

Plain Language Summary

Simplified for easier understanding

This clinical trial is studying Large Language Model Assessment (ChatGPT, Gemini, DeepSeek) for people with urologic diseases. The study is currently recruiting participants at 1 location. People eligible for this study include aged 18 Years and older.

This summary was AI-generated to explain the trial in plain language. It is not medical advice. Always discuss eligibility with your doctor before enrolling in a clinical trial.

Interested in this trial?

Get notified about updates and connect with the research team.

Interventions

OTHERLarge Language Model Assessment (ChatGPT, Gemini, DeepSeek)

De-identified inpatient medical records were retrospectively collected from the urology departments of four tertiary hospitals (200 cases per site, 800 in total). Each case included standardized clinical information such as demographics, chief complaint, history of present illness, past medical history, physical examination, laboratory and imaging findings, discharge diagnosis and treatment plan. To simulate the role of an AI system in a "first-visit physician" scenario, all diagnostic conclusions, differential diagnoses and treatment plans were removed before being input into the models. Three large language models (ChatGPT, Gemini and DeepSeek) were prompted with a standardized instruction: "Based on the above clinical information, provide your preliminary diagnosis, differential diagnoses and treatment recommendations." Each model generated outputs including (i) primary and secondary diagnoses, (ii) differential diagnosis lists with reasoning and (iii) preliminary treatment suggesti


Locations(1)

The First Affiliated Hospital of Fujian Medical University

Fuzhou, China

View Full Details on ClinicalTrials.gov

For the most up-to-date information, visit the official listing.

Visit

NCT07378358


Related Trials