Back to Benchmarks

PathoROB
pathology
robustness
H&E

Robustness benchmark evaluating pathology foundation models across domain shift scenarios including TCGA 2x2 splits, Camelyon, and Tolkach ESCA datasets.

23 models evaluated
3 tasks
Organs:
multi-organ
breast
esophagus

Detailed Results

Showing 3 domain shift scenarios. Robustness Index values - higher values indicate better robustness to distribution shifts.

Model
Average
rank
Average
metric
Camelyon
Rob. Index
TCGA 2x2
Rob. Index
Tolkach ESCA
Rob. Index
1.000.9280.9400.8790.964
2.330.8880.8650.8380.960
4.330.8610.8060.8220.955
4.330.8520.7740.8320.951
5.330.8500.7850.8260.938
8.670.8150.7510.7610.932
8.670.8150.7180.7940.932
6.330.8140.6450.8530.944
6.670.8120.6620.8240.951
9.330.8120.7050.8120.918
10.670.7570.5440.8030.923
12.670.7070.4670.7270.928
17.000.6630.6490.6140.726
16.000.6300.3990.7380.754
14.000.6020.1470.7630.896
16UNI
14.670.5980.1450.7470.902
17.000.5960.3180.5930.878
17.670.5430.1060.6520.872
18.000.5120.0430.6610.832
20.000.4890.1840.5870.695
19.670.4760.0110.6230.795
20.000.4690.0190.6190.768
21.670.4460.1350.5110.693