ZAYA1-74B-Preview: Scaling Pretraining on AMD

· r/LocalLLaMA ·

Zyphra releases ZAYA1-74B-Preview, a 74B model trained on AMD hardware with findings on large-scale pretraining scaling, marking a notable entry from a well-funded AI lab.

Categories: Model Releases, Research

Excerpt

r/LocalLLaMA · 100 points · 35 comments · zyphra.com

Discussions