Benchmark Results
0 runs stored. Best method per issue is highlighted.
No benchmark results yet. To generate them, run:
python manage.py run_benchmark --auto_eval_annotated --evaluators surya,docling