Hacker News

LLaDA2.0: Scaling Up Diffusion Language Models to 100B [pdf]