LLM Benchmark Table

Model TOTAL Pass Refine Fail Refusal $ mToK Reason STEM Utility Code Censor

FAQ

This table ranks various Large Language Models (LLMs) based on their performance across multiple categories to help users compare capabilities and utility.

Click the column headers to sort the table by that category. Click again to reverse the order.

"$ mToK" stands for the cost per million tokens, indicating the pricing efficiency of using a model.

This is a static example table. In a real application, updates would occur as new performance data becomes available.