oobabooga benchmark

| Score | Model | Size | Loader | Additional info |
|---|---|---|---|---|
| 34/48 | platypus-yi-34b.Q8_0 | 34B | llamacpp_HF | |
| 34/48 | Meta-Llama-3-70B-Instruct-Q4_K_S | 70B | llamacpp_HF | [link] |
| 34/48 | LoneStriker_OpenBioLLM-Llama3-70B-6.0bpw-h6-exl2 | 70B | ExLlamav2_HF | |
| 33/48 | turboderp_Llama-3-70B-Instruct-exl2_6.0bpw | 70B | ExLlamav2_HF | |
| 33/48 | turboderp_Llama-3-70B-Instruct-exl2_5.0bpw | 70B | ExLlamav2_HF | |
| 33/48 | turboderp_Cat-Llama-3-70B-instruct-exl2_5.0bpw | 70B | ExLlamav2_HF | |
| 33/48 | Undi95_Meta-Llama-3-70B-Instruct-hf | 70B | Transformers | --load-in-4bit |
| 33/48 | Meta-Llama-3-70B-Instruct.Q8_0 | 70B | llamacpp_HF | |
| 33/48 | Meta-Llama-3-70B-Instruct.Q4_K_M | 70B | llamacpp_HF | |
| 33/48 | Meta-Llama-3-70B-Instruct-Q3_K_S | 70B | llamacpp_HF | [link] |
| 33/48 | Meta-Llama-3-70B-Instruct-IQ3_XXS | 70B | llamacpp_HF | [link] |
| 33/48 | Meta-Llama-3-70B-Instruct-IQ3_XS | 70B | llamacpp_HF | [link] |
| 33/48 | LoneStriker_dolphin-2.9-llama3-70b-6.0bpw-h6-exl2 | 70B | ExLlamav2_HF | |
| 33/48 | Llama3-TenyxChat-70B.i1-Q4_K_S | 70B | llamacpp_HF | |
| 33/48 | ISTA-DASLab_Meta-Llama-3-70B-Instruct-AQLM-2Bit-1x16 | 70B | Transformers | |
| 32/48 | Meta-Llama-3-70B-Instruct-Q3_K_M | 70B | llamacpp_HF | [link] |
| 32/48 | Meta-Llama-3-70B-Instruct-Q3_K_L | 70B | llamacpp_HF | [link] |
| 32/48 | Meta-Llama-3-70B-Instruct-IQ4_XS | 70B | llamacpp_HF | [link] |
| 32/48 | Meta-Llama-3-70B-Instruct-IQ4_NL | 70B | llamacpp_HF | [link] |
| 32/48 | Meta-Llama-3-70B-Instruct-IQ3_S | 70B | llamacpp_HF | [link] |
| 32/48 | Meta-Llama-3-70B-Instruct-IQ2_S | 70B | llamacpp_HF | [link] |
| 32/48 | Meta-Llama-3-70B-Instruct-IQ2_M | 70B | llamacpp_HF | [link] |
| 32/48 | Llama-3-Giraffe-70B-Instruct.i1-Q4_K_S | 70B | llamacpp_HF | [link] |
| 31/48 | oobabooga_miqu-1-70b-sf-EXL2-6.000b | 70B | ExLlamav2_HF | |
| 31/48 | miqu-1-70b.q5_K_M | 70B | llamacpp_HF | |
| 31/48 | miqu-1-70b.q4_k_m | 70B | llamacpp_HF | |
| 31/48 | cloudyu_Phoenix_DPO_60B | 60B | Transformers | --load-in-8bit |
| 31/48 | Meta-Llama-3-70B-Instruct-Q2_K | 70B | llamacpp_HF | [link] |
| 31/48 | Meta-Llama-3-70B-Instruct-IQ3_M | 70B | llamacpp_HF | [link] |
| 31/48 | Meta-Llama-3-70B-Instruct-IQ2_XS | 70B | llamacpp_HF | [link] |
| 31/48 | Meta-Llama-3-70B-Instruct-IQ2_XS | 70B | llamacpp_HF | [link] |
| 31/48 | Meta-Llama-3-120B-Instruct.Q4_K_M | 120B | llamacpp_HF | [link] |
| 30/48 | turboderp_Mixtral-8x22B-Instruct-v0.1-exl2_4.0bpw | 8x22B | ExLlamav2_HF | |
| 30/48 | qwen1_5-72b-chat-q4_k_m | 72B | llamacpp_HF | |
| 30/48 | miquliz-120b-v2.0.Q4_K_M | 120B | llamacpp_HF | |
| 30/48 | falcon-180b-chat.Q4_K_M | 180B | llamacpp_HF | |
| 30/48 | Senku-70B-Full-Q4_K_M | 70B | llamacpp_HF | |
| 30/48 | Rhea-72b-v0.5-Q4_K_M | 72B | llamacpp_HF | |
| 30/48 | Dracones_WizardLM-2-8x22B_exl2_4.0bpw | 8x22B | ExLlamav2_HF | |
| 29/48 | turboderp_command-r-plus-103B-exl2_3.0bpw | 104B | ExLlamav2_HF | |
| 29/48 | turboderp_Llama-3-70B-exl2_5.0bpw | 70B | ExLlamav2_HF | |
| 29/48 | daybreak-miqu-1-70b-v1.0-q5_k_m | 70B | llamacpp_HF | |
| 29/48 | command-r-plus-104b-iq4_xs | 104B | llamacpp_HF | |
| 29/48 | bartowski_Qwen1.5-32B-Chat-exl2_5_0 | 32B | ExLlamav2_HF | |
| 29/48 | Qwen1.5-110B-Chat-Q4_K_M | 110B | llamacpp_HF | [link] |
| 29/48 | LoneStriker_Yi-34B-Chat-8.0bpw-h8-exl2 | 34B | ExLlamav2_HF | |
| 29/48 | Llama3-ChatQA-1.5-70B.Q4_K_M | 70B | llamacpp_HF | Alpaca template. |
| 29/48 | Dracones_Llama-3-Lumimaid-70B-v0.1_exl2_4.5bpw | 70B | ExLlamav2_HF | |
| 29/48 | 34b-beta.Q8_0 | 34B | llamacpp_HF | |
| 28/48 | turboderp_command-r-plus-103B-exl2_4.5bpw | 104B | ExLlamav2_HF | |
| 28/48 | turboderp_command-r-plus-103B-exl2_3.5bpw | 104B | ExLlamav2_HF | |
| 28/48 | dolphin-2.7-mixtral-8x7b.Q8_0 | 8x7B | llamacpp_HF | |
| 28/48 | command-r-plus-Q4_K_M | 104B | llamacpp_HF | |
| 28/48 | LoneStriker_Smaug-72B-v0.1-6.0bpw-h6-exl2 | 72B | ExLlamav2_HF | |
| 27/48 | mixtral-8x7b-instruct-v0.1.Q8_0 | 8x7B | llamacpp_HF | |
| 27/48 | miqu-1-70b.q2_K | 70B | llamacpp_HF | |
| 27/48 | TheBloke_Helion-4x34B-GPTQ | 4x34B | ExLlamav2_HF | |
| 27/48 | Platypus2-70B.i1-Q4_K_M | 70B | llamacpp_HF | [link] |
| 27/48 | Midnight-Miqu-70B-v1.0.Q4_K_M | 70B | llamacpp_HF | |
| 27/48 | Llama3-ChatQA-1.5-70B.Q4_K_M | 70B | llamacpp_HF | NVIDIA-ChatQA template. |
| 27/48 | ISTA-DASLab_c4ai-command-r-plus-AQLM-2Bit-1x16 | 104B | Transformers | |
| 26/48 | nous-hermes-2-mixtral-8x7b-dpo.Q8_0 | 8x7B | llamacpp_HF | |
| 26/48 | lzlv_70b_fp16_hf.Q5_K_M | 70B | llamacpp_HF | |
| 26/48 | Qwen_Qwen1.5-14B-Chat | 14B | Transformers | |
| 26/48 | Mixtral-8x22B-Instruct-v0.1.Q4_K_M | 8x22B | llamacpp_HF | |
| 26/48 | Meta-Llama-3-70B-Instruct-IQ2_XXS | 70B | llamacpp_HF | |
| 26/48 | LoneStriker_dolphin-2.2-yi-34b-200k-8.0bpw-h8-exl2 | 34B | ExLlamav2_HF | |
| 26/48 | LoneStriker_Yi-34B-200K-8.0bpw-h8-exl2 | 34B | ExLlamav2_HF | |
| 26/48 | CausalLM-RP-34B.q8_0 | 34B | llamacpp_HF | |
| 25/48 | turboderp_Llama-3-70B-Instruct-exl2_2.4bpw | 70B | ExLlamav2_HF | |
| 25/48 | goliath-120b.Q4_K_M | 120B | llamacpp_HF | |
| 25/48 | NousResearch_Nous-Hermes-2-SOLAR-10.7B | 10.7B | Transformers | |
| 25/48 | LoneStriker_Llama-3-70B-Instruct-Gradient-524k-6.0bpw-h6-exl2 | 70B | ExLlamav2_HF | |
| 24/48 | upstage_SOLAR-10.7B-Instruct-v1.0 | 10.7B | Transformers | |
| 24/48 | maid-yuzu-v8-alter.Q8_0 | 8x7B | llamacpp_HF | |
| 24/48 | MultiVerse_70B.Q4_K_M | 70B | llamacpp_HF | |
| 24/48 | LoneStriker_Llama-3-70B-Instruct-Gradient-262k-6.0bpw-h6-exl2 | 70B | ExLlamav2_HF | |
| 23/48 | xwin-lm-70b-v0.1.Q4_K_M | 70B | llamacpp_HF | |
| 23/48 | microsoft_Phi-3-mini-4k-instruct | 3.8B | Transformers | |
| 23/48 | bhenrym14_airoboros-3_1-yi-34b-200k | 34B | Transformers | --load-in-8bit |
| 22/48 | wizardlm-70b-v1.0.Q4_K_M | 70B | llamacpp_HF | |
| 22/48 | turboderp_command-r-v01-35B-exl2_6.0bpw | 35B | ExLlamav2_HF | |
| 22/48 | turboderp_command-r-plus-103B-exl2_2.5bpw | 104B | ExLlamav2_HF | |
| 22/48 | tulu-2-dpo-70b.Q4_K_M | 70B | llamacpp_HF | |
| 22/48 | meraGPT_mera-mix-4x7B | 4x7B | Transformers | |
| 22/48 | liuhaotian_llava-v1.5-13b | 13B | Transformers | |
| 22/48 | Qwen_Qwen1.5-7B-Chat | 7B | Transformers | |
| 22/48 | MoMo-72B-lora-1.8.6-DPO-Q4_K_M | 72B | llamacpp_HF | |
| 22/48 | Meta-Llama-3-8B-Instruct-Q4_K_S | 8B | llamacpp_HF | |
| 21/48 | internlm_internlm2-chat-20b | 20B | Transformers | |
| 21/48 | falcon-180b.Q4_K_M | 180B | llamacpp_HF | |
| 21/48 | c4ai-command-r-v01-Q8_0 | 35B | llamacpp_HF | |
| 21/48 | Undi95_Meta-Llama-3-8B-Instruct-hf | 8B | Transformers | |
| 21/48 | NurtureAI_Meta-Llama-3-8B-Instruct-64k | 8B | Transformers | |
| 21/48 | Meta-Llama-3-8B-Instruct-fp16 | 8B | llamacpp_HF | |
| 21/48 | Meta-Llama-3-8B-Instruct-Q8_0 | 8B | llamacpp_HF | |
| 20/48 | openchat_openchat_3.5 | 7B | Transformers | |
| 20/48 | llama-2-70b-chat.Q4_K_M | 70B | llamacpp_HF | |
| 20/48 | Weyaxi_Einstein-v6.1-Llama3-8B | 8B | Transformers | |
| 20/48 | TheBloke_llava-v1.5-13B-GPTQ | 13B | ExLlamav2_HF | |
| 20/48 | Meta-Llama-3-8B-Instruct-Q6_K | 8B | llamacpp_HF | |
| 20/48 | Ein-72B-v0.1-full.Q4_K_M | 72B | llamacpp_HF | |
| 20/48 | BAAI_Bunny-Llama-3-8B-V | 8B | Transformers | |
| 19/48 | zephyr-orpo-141b-A35b-v0.1.Q4_K_M | 141B | llamacpp_HF | |
| 19/48 | mistral-7b-instruct-v0.2.Q4_K_S | 7B | llamacpp_HF | |
| 19/48 | microsoft_Phi-3-mini-128k-instruct | 3.8B | Transformers | |
| 19/48 | llama-2-70b.Q5_K_M | 70B | llamacpp_HF | |
| 19/48 | lightblue_suzume-llama-3-8B-multilingual | 8B | Transformers | |
| 19/48 | internlm_internlm2-chat-7b | 7B | Transformers | |
| 19/48 | internlm_internlm2-chat-20b-sft | 20B | Transformers | |
| 19/48 | ai21labs_Jamba-v0.1 | 52B | Transformers | --load-in-4bit |
| 19/48 | NousResearch_Hermes-2-Pro-Mistral-7B | 7B | Transformers | |
| 19/48 | Nexusflow_Starling-LM-7B-beta | 7B | Transformers | |
| 19/48 | Meta-Llama-3-8B-Instruct-Q5_K_S | 8B | llamacpp_HF | |
| 19/48 | Meta-Llama-3-8B-Instruct-Q5_K_M | 8B | llamacpp_HF | |
| 19/48 | Meta-Llama-3-8B-Instruct-Q4_K_M | 8B | llamacpp_HF | |
| 19/48 | Meta-Llama-3-8B-Instruct-IQ4_XS | 8B | llamacpp_HF | |
| 19/48 | Meta-Llama-3-8B-Instruct-IQ4_NL | 8B | llamacpp_HF | |
| 19/48 | Meta-Llama-3-8B-Instruct-IQ3_S | 8B | llamacpp_HF | [link] |
| 18/48 | mistralai_Mistral-7B-Instruct-v0.2 | 7B | Transformers | |
| 18/48 | microsoft_Phi-3-mini-128k-instruct | 3.8B | Transformers | --load-in-8bit |
| 18/48 | jieliu_Storm-7B | 7B | Transformers | |
| 18/48 | failspy_kappa-3-phi-abliterated | 3.8B | Transformers | |
| 18/48 | TheProfessor-155b.i1-IQ3_XS | 155B | llamacpp_HF | [link] |
| 18/48 | Qwen_Qwen1.5-MoE-A2.7B-Chat | 14.3B | Transformers | |
| 18/48 | Orenguteng_Lexi-Llama-3-8B-Uncensored | 8B | Transformers | |
| 18/48 | Meta-Llama-3-8B-Instruct-Q3_K_M | 8B | llamacpp_HF | |
| 18/48 | Meta-Llama-3-8B-Instruct-Q3_K_L | 8B | llamacpp_HF | |
| 18/48 | Meta-Llama-3-70B-Instruct-IQ1_M | 70B | llamacpp_HF | |
| 18/48 | LoneStriker_Nous-Capybara-34B-4.65bpw-h6-exl2 | 34B | ExLlamav2_HF | |
| 17/48 | turboderp_Phi-3-mini-128k-instruct-exl2_6.0bpw | 3.8B | ExLlamav2_HF | |
| 17/48 | turboderp_Phi-3-mini-128k-instruct-exl2_5.0bpw | 3.8B | ExLlamav2_HF | |
| 17/48 | mzbac_llama-3-8B-Instruct-function-calling | 8B | Transformers | |
| 17/48 | microsoft_Phi-3-mini-128k-instruct | 3.8B | Transformers | --load-in-4bit |
| 17/48 | internlm_internlm2-chat-7b-sft | 7B | Transformers | |
| 17/48 | grok-1-IQ2_XS | 314B | llamacpp_HF | [link] |
| 17/48 | ggml-alpaca-dragon-72b-v1-q4_k_m | 72B | llamacpp_HF | |
| 17/48 | amazingvince_Not-WizardLM-2-7B | 7B | Transformers | |
| 17/48 | Undi95_Toppy-M-7B | 7B | Transformers | |
| 16/48 | mixtral-8x7b-instruct-v0.1.Q2_K | 8x7B | llamacpp_HF | |
| 16/48 | TheBloke_Mistral-7B-Instruct-v0.2-GPTQ | 7B | ExLlamav2_HF | |
| 16/48 | Phi-3-mini-128k-instruct-Q6_K | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 16/48 | Phi-3-mini-128k-instruct-Q5_0 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 16/48 | Phi-3-mini-128k-instruct-Q4_K_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 16/48 | Phi-3-mini-128k-instruct-Q4_K_M | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 16/48 | Phi-3-mini-128k-instruct-Q4_K | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 16/48 | Meta-Llama-3-8B-Instruct-IQ3_M | 8B | llamacpp_HF | |
| 15/48 | xtuner_llava-llama-3-8b-v1_1 | 8B | Transformers | |
| 15/48 | hjhj3168_Llama-3-8b-Orthogonalized-exl2 | 8B | ExLlamav2_HF | |
| 15/48 | cognitivecomputations_dolphin-2.9-llama3-8b | 8B | Transformers | |
| 15/48 | Phi-3-mini-128k-instruct-Q5_0 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 15/48 | Phi-3-mini-128k-instruct-Q4_0 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 15/48 | Phi-3-mini-128k-instruct-IQ4_XS | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 15/48 | Phi-3-mini-128k-instruct-IQ4_NL | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 15/48 | NousResearch_Hermes-2-Pro-Llama-3-8B | 8B | Transformers | |
| 15/48 | Meta-Llama-3-8B-Instruct-IQ3_XS | 8B | llamacpp_HF | |
| 15/48 | CohereForAI_c4ai-command-r-v01-4bit | 35B | Transformers | |
| 14/48 | turboderp_Phi-3-mini-128k-instruct-exl2_4.0bpw | 3.8B | ExLlamav2_HF | |
| 14/48 | nvidia_ChatQA-1.5-8B | 8B | Transformers | Alpaca template. |
| 14/48 | microsoft_Orca-2-13b | 13B | Transformers | |
| 14/48 | mattshumer_Llama-3-8B-16K | 8B | Transformers | |
| 14/48 | Undi95_ReMM-SLERP-L2-13B | 13B | Transformers | |
| 14/48 | Phi-3-mini-128k-instruct-Q5_K_S | 3.8B | llamacpp_HF | Created without --imatrix. |
| 14/48 | Phi-3-mini-128k-instruct-Q5_K_M | 3.8B | llamacpp_HF | Created without --imatrix. |
| 14/48 | Phi-3-mini-128k-instruct-Q5_K | 3.8B | llamacpp_HF | Created without --imatrix. |
| 14/48 | Phi-3-mini-128k-instruct-Q5_1 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 14/48 | Phi-3-mini-128k-instruct-Q4_K_M | 3.8B | llamacpp_HF | Created without --imatrix. |
| 14/48 | Phi-3-mini-128k-instruct-Q4_K | 3.8B | llamacpp_HF | Created without --imatrix. |
| 14/48 | Phi-3-mini-128k-instruct-Q4_1 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 14/48 | Phi-3-mini-128k-instruct-F16 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 14/48 | Phi-3-mini-128k-instruct-F16 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 14/48 | Meta-Llama-3-8B-Instruct-Q3_K_S | 8B | llamacpp_HF | |
| 14/48 | Meta-Llama-3-8B-Instruct-IQ3_XXS | 8B | llamacpp_HF | |
| 14/48 | Gryphe_MythoMax-L2-13b | 13B | Transformers | |
| 13/48 | turboderp_dbrx-instruct-exl2_3.75bpw | 132B | ExLlamav2_HF | Without the "You are DBRX..." system prompt. |
| 13/48 | nvidia_ChatQA-1.5-8B | 8B | Transformers | NVIDIA-ChatQA template. |
| 13/48 | gradientai_Llama-3-8B-Instruct-Gradient-1048k | 8B | Transformers | |
| 13/48 | gradientai_Llama-3-8B-Instruct-262k | 8B | Transformers | |
| 13/48 | alpindale_gemma-7b-it | 7B | Transformers | |
| 13/48 | Phi-3-mini-128k-instruct-Q8_0 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 13/48 | Phi-3-mini-128k-instruct-Q8_0 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 13/48 | Phi-3-mini-128k-instruct-Q6_K | 3.8B | llamacpp_HF | Created without --imatrix. |
| 13/48 | Phi-3-mini-128k-instruct-Q5_K_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 13/48 | Phi-3-mini-128k-instruct-Q5_K_M | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 13/48 | Phi-3-mini-128k-instruct-Q5_K | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 13/48 | Phi-3-mini-128k-instruct-Q5_1 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 13/48 | Phi-3-mini-128k-instruct-Q4_0 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 13/48 | Phi-3-mini-128k-instruct-F32 | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 13/48 | Phi-3-mini-128k-instruct-F32 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 13/48 | GeorgiaTechResearchInstitute_galactica-30b-evol-instruct-70k | 30B | Transformers | GALACTICA template. |
| 12/48 | llama-65b.Q5_K_M | 65B | llamacpp_HF | |
| 12/48 | Phi-3-mini-128k-instruct-Q3_K_M | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 12/48 | Phi-3-mini-128k-instruct-Q3_K_L | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 12/48 | Phi-3-mini-128k-instruct-Q3_K | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 12/48 | Phi-3-mini-128k-instruct-IQ3_M | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 12/48 | NousResearch_Llama-2-13b-chat-hf | 13B | Transformers | |
| 12/48 | Meta-Llama-3-70B-Instruct-IQ1_S | 70B | llamacpp_HF | |
| 12/48 | ISTA-DASLab_Meta-Llama-3-8B-Instruct-AQLM-2Bit-1x16 | 8B | Transformers | |
| 12/48 | HuggingFaceH4_zephyr-7b-beta | 7B | Transformers | |
| 11/48 | mlabonne_phixtral-2x2_8 | 2x2.8B | Transformers | |
| 11/48 | Phi-3-mini-128k-instruct-Q4_K_S | 3.8B | llamacpp_HF | Created without --imatrix. |
| 11/48 | Phi-3-mini-128k-instruct-Q4_1 | 3.8B | llamacpp_HF | Created without --imatrix. |
| 11/48 | Phi-3-mini-128k-instruct-Q3_K_M | 3.8B | llamacpp_HF | Created without --imatrix. |
| 11/48 | Phi-3-mini-128k-instruct-Q3_K | 3.8B | llamacpp_HF | Created without --imatrix. |
| 11/48 | Phi-3-mini-128k-instruct-IQ3_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 11/48 | Meta-Llama-3-8B-Instruct-Q2_K | 8B | llamacpp_HF | |
| 11/48 | Meta-Llama-3-8B-Instruct-IQ2_M | 8B | llamacpp_HF | |
| 10/48 | facebook_galactica-30b | 30B | Transformers | |
| 10/48 | Phi-3-mini-128k-instruct-Q3_K_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 10/48 | Phi-3-mini-128k-instruct-Q3_K_L | 3.8B | llamacpp_HF | Created without --imatrix. |
| 10/48 | ISTA-DASLab_c4ai-command-r-v01-AQLM-2Bit-1x16 | 35B | Transformers | |
| 9/48 | microsoft_phi-2 | 2.7B | Transformers | |
| 9/48 | TheBloke_vicuna-33B-GPTQ | 33B | ExLlamav2_HF | |
| 8/48 | mistralai_Mistral-7B-Instruct-v0.1 | 7B | Transformers | |
| 8/48 | gradientai_Llama-3-8B-Instruct-Gradient-1048k | 8B | Transformers | Revision of 2024/05/04. |
| 8/48 | Phi-3-mini-128k-instruct-Q3_K_S | 3.8B | llamacpp_HF | Created without --imatrix. |
| 8/48 | Phi-3-mini-128k-instruct-IQ3_XXS | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 8/48 | Phi-3-mini-128k-instruct-IQ3_XS | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 8/48 | NousResearch_Nous-Capybara-7B-V1.9 | 7B | Transformers | |
| 8/48 | NousResearch_Llama-2-7b-chat-hf | 7B | Transformers | |
| 8/48 | Meta-Llama-3-8B-Instruct-IQ2_S | 8B | llamacpp_HF | |
| 8/48 | ISTA-DASLab_Meta-Llama-3-8B-Instruct-AQLM-2Bit-1x16 | 8B | Transformers | |
| 7/48 | GeorgiaTechResearchInstitute_galactica-30b-evol-instruct-70k | 30B | Transformers | Alpaca template. |
| 6/48 | tiiuae_falcon-40b-instruct | 40B | Transformers | --load-in-8bit; falcon-180B-chat instruction template. |
| 5/48 | unsloth_llama-3-70b-bnb-4bit | 70B | Transformers | |
| 5/48 | internlm_internlm2-chat-1_8b-sft | 1.8B | Transformers | |
| 5/48 | TheBloke_Llama-2-13B-GPTQ | 13B | ExLlamav2_HF | |
| 5/48 | Phi-3-mini-128k-instruct-Q2_K | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 5/48 | NousResearch_Llama-2-13b-hf | 13B | Transformers | |
| 5/48 | Meta-Llama-3-8B-Instruct-IQ2_XS | 8B | llamacpp_HF | |
| 5/48 | LoneStriker_deepseek-coder-33b-instruct-6.0bpw-h6-exl2 | 33B | ExLlamav2_HF | |
| 4/48 | turboderp_Phi-3-mini-128k-instruct-exl2_3.0bpw | 3.8B | ExLlamav2_HF | |
| 4/48 | internlm_internlm2-chat-1_8b | 1.8B | Transformers | |
| 4/48 | TheBloke_deepseek-coder-33B-instruct-AWQ | 33B | AutoAWQ | |
| 3/48 | turboderp_dbrx-instruct-exl2_3.75bpw | 132B | ExLlamav2_HF | |
| 3/48 | TheBloke_Llama-2-7B-GPTQ | 7B | ExLlamav2_HF | |
| 2/48 | facebook_galactica-6.7b | 6.7B | Transformers | |
| 2/48 | Meta-Llama-3-8B-Instruct-IQ2_XXS | 8B | llamacpp_HF | |
| 1/48 | turboderp_Phi-3-mini-128k-instruct-exl2_2.5bpw | 3.8B | ExLlamav2_HF | |
| 1/48 | bartowski_CodeQwen1.5-7B-Chat-exl2_8_0 | 7B | ExLlamav2_HF | |
| 1/48 | Qwen_CodeQwen1.5-7B-Chat | 7B | Transformers | |
| 1/48 | Phi-3-mini-128k-instruct-Q2_K_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 1/48 | Phi-3-mini-128k-instruct-IQ2_M | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 1/48 | NousResearch_Llama-2-7b-hf | 7B | Transformers | |
| 0/48 | openai-community_gpt2-xl | 1.5B | Transformers | |
| 0/48 | openai-community_gpt2-medium | 0.355B | Transformers | |
| 0/48 | openai-community_gpt2-large | 0.774B | Transformers | |
| 0/48 | openai-community_gpt2 | 0.124B | Transformers | |
| 0/48 | gpt4chan_model_float16 | 6B | Transformers | |
| 0/48 | facebook_opt-6.7b | 6.7B | Transformers | |
| 0/48 | facebook_opt-30b | 30B | Transformers | |
| 0/48 | facebook_opt-13b | 13B | Transformers | |
| 0/48 | facebook_galactica-125m | 0.125B | Transformers | |
| 0/48 | facebook_galactica-1.3b | 1.3B | Transformers | |
| 0/48 | TinyLlama_TinyLlama-1.1B-Chat-v1.0 | 1.1B | Transformers | |
| 0/48 | Phi-3-mini-128k-instruct-Q2_K | 3.8B | llamacpp_HF | Created without --imatrix. |
| 0/48 | Phi-3-mini-128k-instruct-IQ2_XXS | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 0/48 | Phi-3-mini-128k-instruct-IQ2_XS | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 0/48 | Phi-3-mini-128k-instruct-IQ2_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 0/48 | Phi-3-mini-128k-instruct-IQ1_S | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 0/48 | Phi-3-mini-128k-instruct-IQ1_M | 3.8B | llamacpp_HF | Created with groups_merged.txt for calibration. |
| 0/48 | Meta-Llama-3-8B-Instruct-IQ1_S | 8B | llamacpp_HF | |
| 0/48 | Meta-Llama-3-8B-Instruct-IQ1_M | 8B | llamacpp_HF | |
| 0/48 | ISTA-DASLab_Llama-2-7b-AQLM-2Bit-1x16-hf | 7B | Transformers | |
| 0/48 | EleutherAI_gpt-neox-20b | 20B | Transformers | |
| 0/48 | EleutherAI_gpt-neo-2.7B | 2.7B | Transformers | |
| 0/48 | EleutherAI_gpt-neo-1.3B | 1.3B | Transformers | |
| 0/48 | EleutherAI_gpt-j-6b | 6B | Transformers | |

Updates

2024/05/10

2024/05/07

2024/05/06

2024/05/05

2024/05/04

2024/05/03

2024/04/28

2024/04/27

2024/04/26

2024/04/25

2024/04/24

2024/04/23

About

This test consists of 48 manually written multiple-choice questions. It evaluates a combination of academic knowledge and logical reasoning.

Compared to MMLU, it has the advantage of not being in any training dataset and the disadvantage of being much smaller. Compared to the LMSYS Chatbot Arena, it is harsher on small models like Starling-LM-7B-beta that write nicely formatted replies but don't have much knowledge.
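
To give a rough sense of what "much smaller" means for reading the scores, here is a minimal sketch that estimates the sampling uncertainty of a score out of 48. It assumes the questions behave like independent trials (a simple binomial model), which is an assumption of this example and not something the benchmark states. The function name `score_standard_error` is hypothetical.

```python
import math


def score_standard_error(correct: int, total: int = 48) -> float:
    """Standard error of the fraction correct under a binomial model.

    Assumption (not from the benchmark): each question is an independent
    trial with the same success probability.
    """
    p = correct / total
    return math.sqrt(p * (1.0 - p) / total)


if __name__ == "__main__":
    se = score_standard_error(34)
    print(f"34/48 = {34 / 48:.1%}, standard error ~ {se:.1%} "
          f"(~{se * 48:.1f} questions)")
```

Under this model, a top score of 34/48 carries a standard error of roughly three questions, so gaps of one or two points between neighbouring rows are probably not meaningful on their own.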

The correct Jinja2 instruction template is used for each model, as autodetected by text-generation-webui from the model's metadata. For base models without a template, Alpaca is used. The questions are evaluated using the /v1/internal/logits endpoint in the project's API.
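
As an illustration of logits-based multiple-choice scoring in general, the sketch below picks the answer letter with the highest next-token score. It is not the benchmark's actual code: the questions and scoring script are private, and the shape of the logits dictionary (token string mapped to a score), the letter set, and the name `pick_answer` are all assumptions made for this example. No request to the API is made here.

```python
from typing import Mapping, Optional, Sequence


def pick_answer(
    token_scores: Mapping[str, float],
    letters: Sequence[str] = ("A", "B", "C", "D"),
) -> Optional[str]:
    """Return the answer letter with the highest next-token score.

    token_scores is assumed to map candidate next tokens to scores
    (logits or probabilities). A letter may appear with or without a
    leading space depending on the tokenizer, so both forms are checked.
    Returns None if no letter appears among the candidate tokens.
    """
    best_letter = None
    best_score = float("-inf")
    for letter in letters:
        for token in (letter, " " + letter):
            score = token_scores.get(token)
            if score is not None and score > best_score:
                best_letter = letter
                best_score = score
    return best_letter


if __name__ == "__main__":
    # Hypothetical scores for the token that follows the prompt.
    example = {"A": 1.0, " B": 3.5, "C": 2.0, "The": 0.5}
    print(pick_answer(example))
```

Comparing scores only across the answer letters avoids parsing free-form text, which fits the point above that nicely formatted replies earn no credit by themselves.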

The questions are private.

Limitations

This benchmark does not evaluate code generation, non-English languages, role-playing, RAG, or long-context understanding. Performance in those areas may correlate weakly, or not at all, with what is measured here.