Diagens Launches DoctorBench, Setting a New Global Benchmark for 'Real-World Clinical Performance' in Medical Foundation Models
Distributed: 2026-04-30 16:57:00
HONG KONG, Apr 30, 2026 - (JCN Newswire / ACN Newswire) - Hangzhou Diagens Biotechnology Co., Ltd. (2526.HK, “Diagens”) today officially launched DoctorBench, a medical AI evaluation platform, and unveiled its inaugural global medical foundation model leaderboard in Hong Kong. WiseDiag Technology’s WiseDiag-v2, Google’s Gemini-3.1-Pro-Preview, and OpenAI’s GPT-5.4 secured the top three positions.
For the first time, the evaluation framework places “real-world clinical performance” at the center, constructing a multi-dimensional benchmarking system that closely mirrors authentic diagnostic and treatment scenarios.
As medical foundation models accelerate their transition from laboratory research to clinical application worldwide, the industry has long lacked a metric that genuinely measures a model’s “clinical competence.” Existing evaluations predominantly focus on medical knowledge recall, failing to capture a model’s comprehensive performance in complex clinical contexts. This gap between benchmarking and clinical reality has become a global obstacle hindering the deployment of medical AI.
OpenAI previously launched HealthBench, signaling that leading players are beginning to take this challenge seriously. However, medicine is inherently localized — diagnostic and treatment guidelines, language conventions, and patient populations vary significantly across countries and regions, rendering any single evaluation system insufficient for universal applicability.
Diagens developed the DoctorBench platform in response to this global challenge. The platform is rooted in nearly a decade of collaboration by a cross-disciplinary team: Diagens brought together experts in basic medicine, clinical medicine, artificial intelligence, and the healthcare industry, integrating rigorous clinical logic with deep learning algorithms. This lets the team account for both the limits of AI technology and the intricate demands of clinical practice when constructing the evaluation framework.
The core philosophy of DoctorBench is to assess a model’s clinical communication and decision-making ability, its capacity to “think like a doctor,” rather than to test its “knowledge base.” The platform features three leaderboard tracks: the Medical Leaderboard (LLM), the Multimodal Leaderboard (VLM), and the Agent Leaderboard. These evaluate, respectively, textual diagnostic ability, multimodal understanding, and multi-turn decision-making with tool use inside a simulated clinical environment.
On the evaluation mechanism, DoctorBench pioneers a multi-dimensional architecture combining “2 Core Dimensions (Safety and Accuracy) + 3 General Dimensions (Interaction Quality, Information Prioritization, Proactive Inquiry) + 5 Specialized Modules (Evidence & Citation, Explainable Reasoning, Actionability, Personalized Adaptation, Emotional Support).” It is equipped with “Scenario-Adaptive Weighting,” dynamically adjusting the weight of each dimension according to the risk level of different clinical scenarios, making the scoring logic closely aligned with real-world diagnostic decision-making.
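The weighting idea described above can be illustrated with a short sketch. This is not DoctorBench's scoring code, which the release does not publish: the dimension names follow the release, but the risk levels ("low", "high"), every weight value, and the function name `weighted_score` are hypothetical assumptions. The five specialized modules are omitted for brevity.

```python
# Hypothetical sketch of scenario-adaptive weighting.
# All weight values and risk levels below are invented for illustration.
WEIGHTS = {
    # Lower-risk scenario: all dimensions weighted equally (assumed).
    "low": {
        "safety": 0.2,
        "accuracy": 0.2,
        "interaction_quality": 0.2,
        "information_prioritization": 0.2,
        "proactive_inquiry": 0.2,
    },
    # Higher-risk scenario: the two core dimensions dominate (assumed).
    "high": {
        "safety": 0.4,
        "accuracy": 0.3,
        "interaction_quality": 0.1,
        "information_prioritization": 0.1,
        "proactive_inquiry": 0.1,
    },
}


def weighted_score(scores, risk_level):
    """Return the weighted mean of per-dimension scores (each in [0, 1])."""
    weights = WEIGHTS[risk_level]
    total = sum(weights.values())
    return sum(weights[dim] * scores[dim] for dim in weights) / total
```

Under these assumed weights, the same weak safety score lowers the overall result more in a high-risk scenario than in a low-risk one, which is the behavior the release attributes to the weighting scheme.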
Crucially, the platform designates “Medical Factual Accuracy” and “Safety and Risk Control” as inviolable red lines with a “one-vote veto” power. Any model that exhibits critical deviations on issues affecting patient safety will be unable to achieve a high score, regardless of outstanding performance in other dimensions. This design stems from the team’s deep understanding of the essence of medicine: in a field where lives are at stake, safety is always the paramount principle and leaves no room for compromise.
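One way to picture such a veto is as a cap applied after the ordinary score is computed. The sketch below is an assumption-laden illustration, not the platform's published rule: the release says only that a model with critical deviations cannot achieve a high score, so the function name `apply_veto`, the `threshold` of 0.5, and the `cap` of 0.3 are all hypothetical.

```python
# Hypothetical sketch of a "one-vote veto" on the two red-line dimensions.
# The threshold and cap values are invented; the release does not state them.
def apply_veto(base_score, safety, accuracy, threshold=0.5, cap=0.3):
    """Cap the final score if either red-line dimension falls below threshold.

    base_score, safety, and accuracy are assumed to lie in [0, 1].
    """
    if safety < threshold or accuracy < threshold:
        # A red-line failure limits the score no matter how high base_score is.
        return min(base_score, cap)
    return base_score
```

With these assumed values, a model scoring 0.95 overall but 0.2 on safety would be held to 0.3, while a model that clears both red lines keeps its score unchanged.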
“The advancement of medical AI is a long-distance race concerning the health and well-being of all humanity. It demands not only disruptive technological innovation and deep cross-disciplinary, cross-regional collaboration, but also an absolute reverence for and unwavering commitment to life and health,” said Dr. Song Ning, Founder of Diagens. He expressed the hope of joining hands with more global research institutions, clinical centers, and industry partners, so that truly capable technologies can be recognized, trusted, and ultimately used to benefit every patient.
