Add OoD detection to benchmark
An additional evaluation metric to include in the benchmark would be to analyze the behavior of the models when faced with Out-of-distribution data.
An additional evaluation metric to include in the benchmark would be to analyze the behavior of the models when faced with Out-of-distribution data.