Making LLMs more accurate by using all of their layers
Google proposes using signals across all transformer layers to improve LLM accuracy beyond final-layer decoding.
Algorithms & Theory
Read at source: https://research.google/blog/making-llms-more-accurate-by-using-all-of-their-layers/