Reviving PapersWithCode (by Hugging Face) [P]

· r/MachineLearning ·

Hugging Face is rebuilding PapersWithCode with agent-generated paper parsing, leaderboards, and task categorization for current ML research.

Categories: Products to Try

Excerpt

Hi, Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode. Sadly, that website is no longer maintained after its acquisition by Meta. Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale and automatically generate leaderboards (for now I'm the one verifying results). So far, I've only parsed high-impact papers for which I know they're SOTA, like Qwen 3.5 and 3.6, RF-DETR for object detection, DINOv3, SOTA embedding models from the MTEB leaderboard, the Open ASR Leaderboard for automatic speech recognition models, etc. For now, it includes the following: * trending papers by default based on Github star velocity * categorization by domain, e.g., [OCR](https://paperswithcode.co/tasks/ocr) * [methods](https://paperswithcode.co/methods), which PwC used to have, e.g., [RLVR](https://paperswithcode.co/methods/rlvr) * eval results for high-impact papers, see e.g., [Qwen 3.5](https://paperswithcode.co/paper/83017) at the bottom * leaderboards for each domain, e.g., [MMTEB](https://paperswithcode.co/benchmark/mmteb) or [COCO val 2017](https://paperswithcode.co/benchmark/coco-val2017) * support for [citation counts](https://paperswithcode.co/?order_by=citation_count) (you can also see the most cited papers by domain!) * automated linked Github, project page URLs, and artifacts (+ multiple repos are supported on a paper page) * support for external papers beyond Arxiv, see e.g., [DeepSeek v4]

Discussions