AI News Hugging Face Releases TRL v1.0: A Unified Post-Training Stack for SFT, Reward Modeling, DPO, and GRPO Workflows April 1, 2026 0 73 FacebookXPinterestWhatsAppLinkedinReddItEmailPrintTumblrTelegramMix