Embarrassingly simple self-distillation improves code generation

· HN · ArXiv ·

Researchers demonstrate that embarrassingly simple self-distillation improves code generation models, showing a lightweight technique to boost performance without additional data or compute.

Categories: Research

Excerpt

HN · 658 points · 201 comments

Discussions