NRS Approved Features

Artificial Intelligence for Rosacea Diagnosis Shows Both Promise and Shortcomings

April 2026

According to OpenAI, over 230 million people ask ChatGPT about health problems each week.1 Several recent studies have investigated whether publicly available large language models (LLMs), including ChatGPT, can aid in the detection and diagnosis of dermatologic conditions such as rosacea, with varying results. The studies prompted artificial intelligence (AI) models for a diagnosis in a variety of ways, including with photographs pulled from online dermatologic databases, patient photographs taken in a clinical setting, and text descriptions copied from online forums.

Two recent studies prompted AI models to diagnose rosacea in photographs of patients. The first pulled photographs from a dermatologic database and prompted the models to diagnose 4 dermatologic conditions, including rosacea.2 The authors found large variation in accuracy across AI models. The most accurate model, GPT-4o, correctly diagnosed rosacea in 67.92% of photographs, while the least accurate, Meta’s Llama 3.2 11B, did so in only 18.87%. Of the 53 photographs depicting rosacea, 23 (43.4%) were misclassified by at least 5 models, and 10 (18.9%) were misclassified by all models. The authors found that misclassifications most often occurred when photographs showed ambiguous or visually overlapping features.

The second study used 43 photographs taken in a clinical setting from patients diagnosed with acne or rosacea.3 In this study, GPT-4o correctly diagnosed all 10 of the rosacea cases included in the photoset but was less successful at identifying the subtype. The authors stated, “These findings highlight the potential and current limitations of LLMs in dermatologic diagnosis and suggest that dermatologists must prepare to see patients who may have consulted ‘Dr LLM’ before visiting their offices.” 

Two other studies looked primarily at text-based prompts. One study prompted LLMs with the text “Identify the top 20 rosacea websites” and compared the results with traditional search engines, using the search term “rosacea.”4 The authors found that a traditional Google search delivered the most reliable information with the highest readability. However, none of the evaluated platforms met the American Medical Association’s or National Institutes of Health’s recommended 6th to 8th grade reading level for public health materials. 

The other text-based study used descriptions posted on Reddit to prompt models for a diagnosis, treatment, and prognosis for 5 dermatologic conditions, including rosacea, and found ChatGPT to be the most accurate model.5 ChatGPT results were accurate enough for a “patient-facing platform” 95% of the time, but the answers were sufficient for a “clinical setting” just 55% of the time. This study also found that the results did not meet readability standards. Additionally, the researchers noted that even with search engines such as Google, “most users are likely to engage with the top-of-page AI-generated content before exploring traditional links.”

Overall, the authors of these studies suggested that the use of AI as a diagnostic tool in clinical settings may present both opportunities and challenges. Some AI models show potential for accurately diagnosing rosacea and other skin conditions, particularly if the symptoms are clear and visually distinct. However, the authors cautioned that the use of AI as a diagnostic tool raises ethical questions about personal privacy, skin color bias, and a lack of transparency about the source materials used to train the models. In addition, some authors recommended long-term and behavioral studies to address the consequences of patients consulting LLMs as their primary sources of health information.

References
1. Silberling A. OpenAI unveils ChatGPT Health, says 230 million users ask about health each week. TechCrunch. January 7, 2026. Accessed March 5, 2026. https://techcrunch.com/2026/01/07/openai-unveils-chatgpt-health-says-230-million-users-ask-about-health-each-week

2. Cirkel L, Lechner F, Henk LA, et al. Large language models for dermatological image interpretation—a comparative study. Diagnosis (Berl). 2025;13(1):75-81. doi:10.1515/dx-2025-0014 

3. Boostani M, Bánvölgyi A, Goldust M, et al. Diagnostic performance of GPT-4o and Gemini Flash 2.0 in acne and rosacea. Int J Dermatol. 2025;64(10):1881-1882. doi:10.1111/ijd.17729 

4. Nelson HC, Beauchamp MT, Pace AA. The reliability gap: how traditional search engines outperform artificial intelligence (AI) chatbots in rosacea public health information quality. Cureus. 2025;17(6):e86543. doi:10.7759/cureus.86543 

5. Chau CA, Feng H, Cobos G, Park J. The comparative sufficiency of ChatGPT, Google Bard, and Bing AI in answering diagnosis, treatment, and prognosis questions about common dermatological diagnoses. JMIR Dermatol. 2025;8:e60827. doi:10.2196/6082

 

Eden Robins is a medical writer with the National Rosacea Society. 
Disclosure: The author reports no relevant financial relationships. 
© 2026 HMP Global. All Rights Reserved.
Any views and opinions expressed are those of the author(s) and/or participants and do not necessarily reflect the views, policy, or position of the Dermatology Learning Network or HMP Global, their employees, and affiliates.