LLM Benchmark Table

Model	TOTAL	Pass	Refine	Fail	Refusal	$ mToK	Reason	STEM	Utility	Code	Censor
Model A	100	80	10	5	5	1000	Reason A	Yes	High	Yes	No
Model B	120	90	15	10	5	1200	Reason B	No	Medium	No	Yes

Frequently Asked Questions

What is LLM Benchmark Table?

This is a website comparing AI performance across different models.

How to use the table?

Simply browse through the table to compare different models. You can also toggle dark mode for better readability.