ID | First author (Year) | Country/Region | Journal | Imaging/Modality | Primary Task | Dataset size (N) | AI Method | Reference Standard | Key Performance (primary metric) | External Validation | Citation (short) |
|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Cui Z (2022) | Multi-center (China, 15 sites) | Nat Commun | CBCT (3D) | Tooth & alveolar bone segmentation | 4,938 scans | Multi-stage 3D CNN (nnU-Net style pipeline) | Expert manual segmentations | Dice: teeth ≈ 0.915; bone ≈ 0.930; time ↓96.7% | Yes | Cui 2022 Nat Commun |
2 | Noeldeke B (2024) | Germany | Head Face Med | Intraoral photographs (2D) | Crossbite detection (binary & type) | 676 images (311 pts) | CNNs (DenseNet/ResNet variants) | Orthodontist labels | Accuracy (binary): 98.57% | No (single-center) | Noeldeke 2024 Head Face Med |
3 | Ryu S-M (2023) | Korea | Sci Rep | Intraoral photographs (2D) | Extraction decision recommendation | 3,136 images | CNN classifier + landmark regressor | Board-certified orthodontist decision | AUC 0.961; Accuracy 0.922; Mean error 0.84 mm | No | Ryu 2023 Sci Rep |
4 | Sahlsten T (2024) | Finland | PLoS ONE | CBCT (3D) | 3D cephalometric landmark detection (33 LM) | 309 scans | Deep learning landmarking | Expert annotations | Mean 3D distance: 1.99 mm (overall), 1.96 mm (skeletal) | Unclear | Sahlsten 2024 PLoS ONE |
5 | Shin J-H (2021) | Korea | BMC Oral Health | Clinical photos + ceph | Necessity of orthognathic surgery (Class III) | 140 pts | CNN | Panel consensus (surgery vs non-surgery) | Accuracy 0.954; Sens 0.889; Spec 0.971; AUC 0.948 | No | Shin 2021 BMC Oral Health |
6 | Volovic J (2023) | USA | Diagnostics (MDPI) | Structured records | Treatment duration prediction | 478 pts | Random Forest, Lasso, Elastic Net | Actual duration vs prediction | MAE 7.27 months | No | Volovic 2023 Diagnostics |
7 | Elnagar M (2022) | USA | Diagnostics (MDPI) | Structured records | Treatment duration prediction | 518 pts | Multiple ML models (DT, RF, etc.) | Actual duration vs prediction | Best models within clinically acceptable error | No | Elnagar 2022 Diagnostics |
8 | Wolf D (2024) | Germany (EU dataset) | J Clin Med | EMR + app data | Clear aligner refinement risk prediction | 9,942 CAT pts | L1-logistic, XGBoost, SVC-RBF (+ SHAP) | Clinician-recorded outcomes | AUC ≈ 0.67; well-calibrated (Brier ≈ 0.22) | Yes (held-out cohort) | Wolf 2024 J Clin Med |
9 | Etemad L (2024) | USA; France (2 sites) | Bioengineering (MDPI) | Structured records | Extraction vs non-extraction decision | 1,135 pts (2 universities) | Random Forest | Clinician decision | Acc 85%; Sens 50%; Spec 97% (combined model) | Cross-site tests | Etemad 2024 Bioengineering |
10 | Leavitt L (2023) | USA | Orthod Craniofac Res | Structured records | Predict specific extraction patterns | 366 pts (extraction cases) | RF, LR, SVM | Clinician treatment plan | Best class accuracy 81.6% (U/L4s patterns) | Stratified hold-out | Leavitt 2023 OCR |
11 | Mason T (2023) | USA | Int Orthod | Structured records | Extraction vs non-extraction | 393 pts | LR, RF, SVM, NN | Clinician decision | ROC-AUC reported; high accuracy (see paper) | Hold-out | Mason 2023 Int Orthod |
12 | Huang J (2024) | China | Front Bioeng Biotechnol | Structured records | Extraction decision | Institutional cohort | DT, RF, SVM, MLP; feature importance | Senior specialist plans | Good accuracy across models; RF/MLP leading | No | Huang 2024 Front Bioeng |
13 | Arik SÖ (2017) | USA | J Med Imaging | Lateral ceph (2D) | Landmark detection (15 LM) | 400 images | CNN (early DL) | Expert annotation | SDR@2mm: 72.3%; rising to 86.8%@4mm | No | Arik 2017 J Med Imaging (via 2025 PMC summary) |
14 | Gilmour R (2019) | — | — | Lateral ceph (2D) | Landmark detection (15 LM) | — | — | Expert annotation | MRE 1.14 mm; SDR@2mm 83.8% | — | Gilmour 2019 (via 2025 PMC summary) |
15 | Li P (2019) | China | Med Image Anal?/Sci Rep | Lateral ceph (2D) | Landmark detection (15 LM) | — | — | Expert annotation | MRE 1.20 mm; SDR@2mm 83.7% | — | Li 2019 (via 2025 PMC summary) |
16 | Kwon (2019) | Korea | — | Lateral ceph (2D) | Landmark detection (15 LM) | — | — | Expert annotation | MRE 1.24 mm; SDR@2mm 83.0% | — | Kwon 2019 (via 2025 PMC summary) |
17 | Oh (2019) | Korea | — | Lateral ceph (2D) | Landmark detection (15 LM) | — | — | Expert annotation | MRE 1.29 mm; SDR@2mm 82.1% | — | Oh 2019 (via 2025 PMC summary) |
18 | Kim (2019/2020) | Korea | — | Lateral ceph (2D) | Landmark detection (15 LM) | 860 images | — | Expert annotation | MRE 1.03 mm; SDR@2mm 87.1% | — | Kim 2020 (via 2025 PMC summary) |
19 | Kim (2020) | Korea | — | Lateral ceph (2D) | Landmark detection (23 LM) | 2,075 images | — | Expert annotation | MRE 1.37 mm; SDR@2mm 82.9% | — | Kim 2020 (via 2025 PMC summary) |
20 | Takahashi (2020) | Japan | — | Lateral face photographs (2D) | Ceph LM from photos (23 LM) | 2,000 images | HRNetV2 + MLP (2-stage) | Ceph-photo superimposition | MRE 0.61 mm; SDR@2mm 98.2% | — | Takahashi 2020 (via 2025 PMC summary) |
21 | Takahashi (2025) | Japan | — | Lateral face photos (2D) | Ceph LM from photos (Class II/III) | 2,320 images | HRNetV2 + MLP (2-stage) | Ceph-photo superimposition | MRE 0.42–0.46 mm; ceph error < 0.5° | — | Takahashi 2025 (PMC 2025 article) |
22 | Park J-H (2019) | Korea | Angle Orthod | Lateral ceph (2D) | Compare YOLOv3 vs SSD (80 LM) | Train:1028, Test:283 | YOLOv3 vs SSD | Expert labels | YOLOv3 faster & more accurate; real-time inference | — | Park 2019 Angle Orthod (Part 1) |
23 | Hwang H-W (2020) | Korea | Angle Orthod | Lateral ceph (2D) | AI vs human (80 LM) | — | YOLOv3-based pipeline | Human experts | AI as accurate as experts; perfect repeatability | — | Hwang 2020 Angle Orthod (Part 2) |
24 | Yoon H-J (2022) | Korea | Eur J Orthod | Lateral ceph (2D) | Airway-focused LM detection | — | Deep CNN pipeline | Expert annotation | High SDR comparable to state-of-art | — | Yoon 2022 EJO |
25 | Atici S.F. (2022) | UK/Turkey | PLoS ONE | Lateral ceph (2D) | Fully automated CVM stage classification | — | Custom CNN (directional filters) | Expert labels | High accuracy across CVM stages | — | Atici 2022 PLoS ONE |
26 | Atici S.F. (2023) | UK/Turkey | — | Lateral ceph (2D) | AggregateNet CVM classifier | — | Parallel structured CNN | Expert labels | Improved CVM classification over baseline | — | Atici 2023 (AggregateNet) |
27 | Gaudot I (2024) | Multi-center (EU) | Med Eng Phys | CBCT/CT (3D) | DentalSegmentator (5-class segmentation) | 470 train; 256 test | nnU-Net (3D Slicer extension) | Expert annotation | Robust multiclass segmentation across centers | Yes (external CBCT set) | Gaudot 2024 Med Eng Phys |
28 | Wang C (2024) | China | Biomed Signal Process Control | CBCT (3D) | Transformer-based tooth segmentation (Trans-VNet) | — | Transformer CNN hybrid | Expert annotation | Dice ≈ high (see paper) | — | Wang 2024 (Trans-VNet) |
29 | Kartbak SBA (2025) | Turkey | BMC Oral Health | Lateral ceph + intraoral photos (2D) | Intraoral classification via ceph-informed DL | 990 pts | DL classifier trained on ceph-derived labels | Cephalometric measurements | Reported improved classification vs baselines | — | Kartbak 2025 BMC Oral Health |
30 | Milani O-H (2024) | USA | — | Panoramic (2D) | Third molar development stage classification | — | DL classifier | Expert staging | High stage classification accuracy | — | Milani 2024 |
31 | JOMOS team (2025) | China | J Oral Med Oral Surg | Panoramic (2D) | Impacted mandibular third molar detection & class | 2,000 PRs | DL detector/classifier | Radiologist labels | Strong accuracy across classes | — | JOMOS 2025 |
32 | Kim S (2024) | Korea | BMC Oral Health | Panoramic (2D) | Indication for extraction (cracked tooth) | — | Multiple DL models | Clinician decision | Predictive performance significant (AUC reported) | — | BMC OH 2024 cracked tooth |
33 | Suh HY (2019) | Korea/USA | Angle Orthod | Structured + ceph | Soft tissue change prediction after surgery | — | Sparse partial least squares (ML) | Post-op measurements | Improved prediction vs baselines | — | Suh 2019 Angle Orthod |
34 | Lee YS (2014) | Korea/USA | AJODO | Structured + ceph | Soft tissue prediction (Class III) | — | Statistical/ML model | Post-op measurements | Higher accuracy than prior methods | — | Lee 2014 AJODO |
35 | Wang C-W (2015) | Taiwan | IEEE TMI | Lateral ceph (2D) | Grand challenge benchmark (evaluation) | — | Multiple methods compared | Expert GT (challenge) | Baseline SDR metrics provided | External (multi-team) | Wang 2015 IEEE TMI |
36 | Wang C-W (2016) | Taiwan | Med Image Anal | Dental radiographs (2D) | Benchmark for analysis algorithms | — | Benchmarking | Expert GT | Performance ranges reported | External (multi-team) | Wang 2016 MedIA |
37 | Xie X (2010) | China | Angle Orthod | Structured records | Extraction vs non-extraction | 200 pts | ANN | Clinician decision | Accuracy ~ 80% (reported) | — | Xie 2010 Angle |
38 | Jung S-K (2016) | Korea | AJODO | Structured records | Extraction vs non-extraction | 156 pts | 3-layer ANN | Single clinician decisions | Accuracy ~ 93% (reported) | — | Jung 2016 AJODO |
39 | Li P (2019) | China | Sci Rep | Structured records | Orthodontic treatment planning (broad) | — | ANN | Expert plan | Model feasible; high accuracy metrics reported | — | Li 2019 Sci Rep |
40 | Castillo J-C (2019) | Canada/USA | Angle Orthod | 3D photogrammetry | 3D facial-cheph relationships | — | Statistical + ML links | Manual measurements | Good correlations (diagnostic adjunct) | — | Castillo 2019 Angle |
41 | Schmidt S (2022) | Germany | Dentomaxillofac Radiol | Panoramic (2D) | Restoration segmentation | 1,781 PRs | U-Net variants | Pixelwise GT | F1 up to 0.95 (tiled) | — | Schmidt 2022 DMFR |
42 | Kim H (2022) | Korea | Dentomaxillofac Radiol | Panoramic (2D) | Detect restorations & implants | — | Object detection (DL) | Expert labels | Strong detection metrics (see paper) | — | Kim 2022 DMFR |
43 | Craniofacial Growth ML (2025) | USA | Orthod Craniofac Res | Structured records | Long-term growth change prediction | — | ML regression ensemble | Ceph serial records | MAE/metrics reported (see paper) | — | Myers 2025 OCR |
44 | Prasad J (2022) | India | Dent J (MDPI) | Structured records | Clinical decision support (diagnosis & plan) | — | XGBoost/RF (multilabel) | Clinician plan | High macro-F1 across labels | — | Prasad 2022 Dent J |
45 | Del Real A (2022) | Korea | Korean J Orthod | Structured records | Predict need for extraction | — | XGBoost/RF | Orthodontist decision | Good accuracy (see paper) | — | Del Real 2022 KJO |