One DeepHermes-3 user reported a processing speed of 28.98 tokens per second on a MacBook Pro M4 Max consumer hardware.
OpenThinker-32B achieved benchmark-beating results using just 14% of the data its Chinese competitor needed, marking a win ...