T
Model

Average ⬆️
ARC
HellaSwag
MMLU
TruthfulQA
Winogrande
GSM8K
🟢
cloudyu/Yi-34Bx2-MoE-60B 📑

76.72
71.08
85.23
77.47
66.19
84.85
75.51
🟢
cloudyu/Mixtral_34Bx2_MoE_60B 📑

76.66
71.33
85.25
77.34
66.59
84.85
74.6
🟢
cloudyu/Mixtral_34Bx2_MoE_60B 📑

76.63
71.25
85.36
77.28
66.61
84.69
74.6
🟦
moreh/MoMo-70B-lora-1.8.4-DPO 📑

76.23
69.62
85.35
77.33
64.64
84.14
76.27
🔶
cloudyu/Yi-34Bx3-MoE-90B 📑

76.18
70.9
85.33
77.41
66.31
84.29
72.86
🟦
moreh/MoMo-70B-lora-1.8.5-DPO 📑

76.14
69.54
85.6
77.49
65.79
84.14
74.3
🔶
TomGrc/FusionNet_7Bx2_MoE_14B 📑

75.91
73.55
88.84
64.68
69.6
88.16
70.66
🔶
one-man-army/UNA-34Beagles-32K-bf16-v1 📑

75.41
73.55
85.93
76.45
73.55
82.95
60.05
🔶
jondurbin/nontoxic-bagel-34b-v0.2 📑

74.69
72.44
85.64
76.41
72.7
82.48
58.45
⭕
jondurbin/bagel-dpo-34b-v0.2 📑

74.69
71.93
85.25
76.58
70.05
83.35
60.96
🔶
moreh/MoMo-70B-LoRA-V1.4 📑

74.67
69.2
85.07
77.12
62.66
83.74
70.2
🟦
udkai/Turdus 📑

74.66
73.38
88.56
64.52
67.11
86.66
67.7
🔶
jondurbin/bagel-dpo-34b-v0.2 📑

74.5
72.01
85.24
76.58
70.16
83.03
59.97
🔶
kodonho/Solar-OrcaDPO-Solar-Instruct-SLERP 📑

74.35
70.99
88.22
66.22
71.95
83.43
65.28
🔶
kodonho/SolarM-SakuraSolar-SLERP 📑

74.29
71.16
88.47
66.24
72.1
83.11
64.67
⭕
bhavinjawade/SOLAR-10B-OrcaDPO-Jawade 📑

74.27
71.16
88.27
66.12
71.57
83.66
64.82
🔶
VAGOsolutions/SauerkrautLM-SOLAR-Instruct 📑

74.21
70.82
88.63
66.2
71.95
83.5
64.14
🟦
upstage/SOLAR-10.7B-Instruct-v1.0 📑

74.2
71.08
88.16
66.21
71.43
83.58
64.75
🔶
fblgit/UNA-SOLAR-10.7B-Instruct-v1.0 📑

74.2
70.56
88.18
66.08
72.05
83.66
64.67
🟦
bhavinjawade/SOLAR-10B-Nector-DPO-Jawade 📑

74.19
71.33
88.62
66.22
70.92
83.43
64.59
🟦
dhanushreddy29/BrokenKeyboard 📑

74.08
71.25
88.34
66.04
71.36
83.19
64.29
🟦
fblgit/UNA-SOLAR-10.7B-Instruct-v1.0 📑

74.07
70.73
88.32
66.1
72.52
83.35
63.38
🔶
fblgit/UNA-POLAR-10.7B-InstructMath-v2 📑

74.07
70.73
88.2
66.03
71.73
82.95
64.75
🔶
yhyu13/LMCocktail-10.7B-v1 📑

74.06
70.65
88.13
66.21
71.03
83.35
64.97
🔶
rishiraj/meow 📑

73.94
70.48
88.08
66.25
70.49
83.43
64.9
🟦
fblgit/UNA-TheBeagle-7b-v1 📑

73.87
73.04
88
63.48
69.85
82.16
66.72
🔶
fblgit/UNAversal-8x7B-v1beta 📑

73.78
69.8
86.9
70.39
71.97
82
61.64
🔶
NousResearch/Nous-Hermes-2-Yi-34B 📑

73.74
66.89
85.49
76.7
60.37
82.95
70.05
🟦
argilla/distilabeled-Marcoro14-7B-slerp 📑

73.63
70.73
87.47
65.22
65.1
82.08
71.19
🟢
Qwen/Qwen-72B 📑

73.6
65.19
85.94
77.37
60.19
82.48
70.43
🟦
mlabonne/NeuralMarcoro14-7B 📑

73.57
71.42
87.59
64.84
65.64
81.22
70.74
🔶
abideen/NexoNimbus-7B 📑

73.5
70.82
87.86
64.69
62.43
84.85
70.36
🟦
Neuronovo/neuronovo-7B-v0.2 📑

73.44
73.04
88.32
65.15
71.02
80.66
62.47
🟢
cloudyu/Mixtral_7Bx2_MoE 📑

73.43
71.25
87.45
64.98
67.23
81.22
68.46
🟦
argilla/distilabeled-Marcoro14-7B-slerp-full 📑

73.4
70.65
87.55
65.33
64.21
82
70.66
🟦
CultriX/MistralTrix-v1 📑

73.39
72.27
88.33
65.24
70.73
80.98
62.77
🔶
cloudyu/Mixtral_7Bx5_MoE_30B 📑

73.39
69.97
86.82
64.42
65.97
80.98
72.18
🟢
macadeliccc/SOLAR-math-2x10.7b 📑

73.37
68.43
86.31
66.9
64.21
83.35
71.04
🟦
ryandt/MusingCaterpillar 📑

73.33
72.53
88.34
65.26
70.93
80.66
62.24
🟢
cloudyu/Mixtral_7Bx6_MoE_35B 📑

73.32
70.14
86.77
64.74
65.79
81.06
71.42
🔶
cloudyu/Mixtral_7Bx6_MoE_35B 📑

73.31
69.97
86.82
64.91
65.77
81.14
71.27
🟦
Neuronovo/neuronovo-7B-v0.3 📑

73.29
72.7
88.26
65.1
71.35
80.9
61.41
⭕
SUSTech/SUS-Chat-34B 📑

73.22
66.3
83.91
76.41
57.04
83.5
72.18
🔶
Sao10K/SOLAR-10.7B-NahIdWin 📑

73.21
64.51
85.67
64.17
76.73
80.51
67.7
🟦
argilla/notus-8x7b-experiment 📑

73.18
70.99
87.73
71.33
65.79
81.61
61.64
🟦
CultriX/MistralTrixTest 📑

73.17
72.53
88.4
65.22
70.77
81.37
60.73
🟢
macadeliccc/Orca-SOLAR-4x10.7b 📑

73.17
68.52
86.78
67.03
64.54
83.9
68.23
🔶
samir-fama/SamirGPT-v1 📑

73.11
69.54
87.04
65.3
63.37
81.69
71.72
🔶
SanjiWatsuki/Lelantos-DPO-7B 📑

73.09
71.08
87.22
64
67.77
80.03
68.46
🟦
argilla/notux-8x7b-v1-epoch-2 📑

73.05
70.65
87.8
71.43
65.97
82.08
60.35
🟦
CultriX/MistralTrixTest 📑

73.17
72.53
88.4
65.22
70.77
81.37
60.73
🟢
macadeliccc/Orca-SOLAR-4x10.7b 📑

73.17
68.52
86.78
67.03
64.54
83.9
68.23
🔶
samir-fama/SamirGPT-v1 📑

73.11
69.54
87.04
65.3
63.37
81.69
71.72
🔶
SanjiWatsuki/Lelantos-DPO-7B 📑

73.09
71.08
87.22
64
67.77
80.03
68.46
🟦
argilla/notux-8x7b-v1-epoch-2 📑

73.05
70.65
87.8
71.43
65.97
82.08
60.35
🔶
shadowml/Marcoro14-7B-ties 📑

73.01
69.8
87.13
65.11
63.54
81.61
70.89
🔶
argilla/notux-8x7b-v1 📑

72.97
70.65
87.72
71.39
66.21
80.74
61.11
🔶
AA051611/whattest 📑

72.96
66.81
84.43
76.59
58.04
82.48
69.45
🟦
bardsai/jaskier-7b-dpo 📑

72.91
70.82
87.02
64.67
64.41
80.19
70.36
🔶
VAGOsolutions/SauerkrautLM-Mixtral-8x7B-Instruct 📑

72.89
70.48
87.75
71.37
65.71
81.22
60.8
🔶
samir-fama/FernandoGPT-v1 📑

72.87
69.45
86.94
65.19
61.18
81.14
73.31
🔶
PSanni/MPOMixtral-8x7B-Instruct-v0.1 📑

72.8
70.99
87.95
70.26
66.52
82.56
58.53
🔶
cookinai/OpenCM-14 📑

72.75
69.28
86.89
65.01
61.07
81.29
72.93
🔶
VAGOsolutions/SauerkrautLM-Mixtral-8x7B-Instruct 📑

72.73
70.56
87.74
71.08
65.72
81.45
59.82
🔶
mistralai/Mixtral-8x7B-Instruct-v0.1 📑

72.7
70.14
87.55
71.4
64.98
81.06
61.11
🔶
senseable/garten2-7b 📑

72.65
69.37
87.54
65.44
59.5
84.69
69.37
⭕
mistralai/Mixtral-8x7B-Instruct-v0.1 📑

72.62
70.22
87.63
71.16
64.58
81.37
60.73
🔶
AIDC-ai-business/Marcoroni-7B-v3 📑

72.53
69.45
86.78
65
60.4
81.45
72.1
🟦
bardsai/jaskier-7b-dpo-v2 📑

72.53
69.28
86.8
64.92
61.64
80.74
71.8
🔶
Toten5/Marcoroni-v3-neural-chat-v3-3-Slerp 📑

72.51
68.77
86.55
64.51
62.7
80.74
71.8
🔶
jondurbin/bagel-dpo-8x7b-v0.2 📑

72.49
72.1
86.41
70.27
72.83
83.27
50.04
🔶
Brillibits/Instruct_Mixtral-8x7B-v0.1_Dolly15K 📑

72.44
69.28
87.59
70.96
64.83
82.56
59.44
🔶
SanjiWatsuki/Kunoichi-DPO-v2-7B 📑

72.4
69.37
87.42
64.83
66
80.74
66.03
🔶
mindy-labs/mindy-7b 📑

72.34
69.11
86.57
64.69
60.89
81.06
71.72
🔶
janhq/supermario-v2 📑

72.34
68.52
86.51
64.88
60.58
81.37
72.18
🔶
OpenBuddy/openbuddy-deepseek-67b-v15.2 📑

72.33
68.6
86.37
71.5
56.2
84.45
66.87
🔶
shadowml/Beyonder-4x7B-v2 📑

72.33
68.77
86.8
65.1
60.68
80.9
71.72
🔶
janhq/supermario-slerp 📑

72.32
68.94
86.58
64.93
60.11
81.29
72.1
⭕
mncai/yi-34B-v3 📑

72.26
67.06
85.11
75.8
57.54
83.5
64.52
🔶
Sao10K/Fimbulvetr-10.7B-v1 📑

72.25
68.94
87.27
66.59
60.54
83.5
66.64
🔶
SanjiWatsuki/Kunoichi-DPO-7B 📑

72.24
69.62
87.14
64.79
67.31
80.58
63.99
🟦
rwitz2/grindin 📑

72.18
69.88
87.02
64.98
59.34
80.9
70.96
🔶
SanjiWatsuki/Kunoichi-7B 📑

72.13
68.69
87.1
64.9
64.04
81.06
67.02
⭕
mncai/yi-34B-v2 📑

72.12
66.13
85
75.64
57.34
83.66
64.97
🔶
CausalLM/72B-preview 📑

72.12
65.19
83.23
77.14
52.58
82.48
72.1
🔶
mindy-labs/mindy-7b-v2 📑

72.11
68.69
86.59
65.18
60.16
81.06
70.96
🔶
CausalLM/72B-preview 📑

72.06
64.85
83.28
77.21
52.51
82.48
72.02
🔶
rwitz/dec10 📑

72.05
69.11
86.46
64.98
60.42
80.74
70.58
🔶
rwitz/dec10 📑

72.01
69.2
86.48
64.91
60.52
80.43
70.51
🔶
cookinai/Valkyrie-V1 📑

71.92
67.24
86.27
64.82
60.4
81.45
71.34
🔶
AA051611/A0110 📑

71.89
66.38
84.73
74.48
58.6
82.32
64.82
⭕
DopeorNope/COKAL-v1-70B 📑

71.87
87.46
83.29
68.13
72.79
80.27
39.27
🟦
bn22/Nous-Hermes-2-SOLAR-10.7B-MISALIGNED 📑

71.83
68.26
86.11
66.26
57.79
83.43
69.14
🔶
AA051611/A0109 📑

71.83
66.55
84.7
74.44
58.75
82.16
64.37
⭕
deepseek-ai/deepseek-llm-67b-chat 📑

71.79
67.75
86.82
72.42
55.85
84.21
63.68
🔶
OpenBuddy/openbuddy-deepseek-67b-v15.1 📑

71.76
67.66
86.49
70.3
54.42
84.77
66.94
🔶
migtissera/Tess-M-Creative-v1.0 📑

71.73
66.81
85.14
75.54
57.68
83.11
62.09
🟦
VitalContribution/Evangelion-7B 📑

71.71
68.94
86.45
63.97
64.01
79.95
66.94
⭕
bhenrym14/platypus-yi-34b 📑

71.69
68.43
85.21
78.13
54.48
84.06
59.82
🟦
RatanRohith/NeuralPizza-7B-V0.1 📑

71.53
70.48
87.3
64.42
67.22
80.35
59.44
