MoE Super-Low-Cost, Production-Capable LLM Workhorses and Assistants. LLMMaxxing on Mini-Bucks < $500 - $1500. We follow up on current trends in getting capable, productive LLMs running on a limited budget.
Qwythos 9B Powerhouse? We look at Qwythos-9B-Claude-Mythos-5-1M-GGUF. 70-95 T/s on a 4080. An LLM Booster's Dream Build. We take a look at Qwythos and were definitely impressed!
Ornith MTP FrakenModel 1.0 (Try 01) w/MTP. Slower than its Original? We explore whether MTP is slowing down Ornith 1.0. We were not able to get any significant breakthroughs. Again, configurations matter!