Knowledge Distillation of Black-Box Large Language Models (2024)

· HN · ArXiv ·

The paper surveys methods for distilling proprietary LLM behavior into smaller models without direct access to weights or training data.

Categories: Research

Excerpt

HN · 122 points · 23 comments

Discussions