ai-for-less-suffering.com

← all sources

source · paper

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

src_deepseek_r1_paper

https://arxiv.org/abs/2501.12948

reliability
0.85

authors: DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Peiyi Wang, Wenfeng Liang

published: 2025-01-22

accessed: 2026-04-19

Notes

Nudged above paper prior (0.82) because the work was peer-reviewed and published in Nature (vol 645, pp 633-638, 2025) in addition to the arXiv preprint.

Intake provenance

method
httpx
tool
afls-ingest/0.0.1
git sha
4d098737f648
at
2026-04-19T20:47:57.761518Z
sha256
57a5dc3bd995…

Evidence from this source (4)