Performance of Large Language Models for Structured Recognition and Refractive Prediction
Head-to-Head Evaluation of ChatGPT 4o, GPT-5, and DeepSeek for Structured Extraction, Toric IOL Recommendation, and Refractive Prediction
Jin Yang
100 participants
Aug 1, 2025
OBSERVATIONAL
Conditions
Summary
We conducted a single-center, retrospective observational study to evaluate large language models (ChatGPT 4o, GPT-5, DeepSeek) for automated interpretation of de-identified IOLMaster 700 reports provided as raster images. Models produced structured biometric extraction, toric IOL recommendation, and refractive predictions (sphere, cylinder, axis). Primary outcomes included parameter-level agreement and refractive error metrics; secondary outcomes included decision-support performance for toric IOL selection and agreement on ordered T-codes. No clinical intervention was performed.
Eligibility
Plain Language Summary
Simplified for easier understanding
This summary was AI-generated to explain the trial in plain language. It is not medical advice. Always discuss eligibility with your doctor before enrolling in a clinical trial.
Interested in this trial?
Get notified about updates and connect with the research team.
Locations(1)
View Full Details on ClinicalTrials.gov
For the most up-to-date information, visit the official listing.
NCT07183891