LLM Benchmark Table

Compare AI Model Performance Across Multiple Metrics

Model TOTAL Pass Refine Fail Refusal $ mToK Reason STEM Utility Code Censor
Model Alpha1008010550.58590757060
Model Beta957015820.78085806550
Model Gamma110908750.48892787555
Model Delta8865121010.97580706045

Performance Overview Chart (Placeholder for Visual Data Representation)

Frequently Asked Questions

What does TOTAL represent?
TOTAL represents the overall score of the model across all tested metrics.
How is Pass rate calculated?
Pass rate is the percentage of tasks the model completed successfully without needing refinement.
What does $ mToK mean?
$ mToK refers to the cost in millions per thousand tokens processed by the model.