AI News Archives

Curiosity-Driven Reinforcement Learning from Human Feedback CD-RLHF: An AI Framework that Mitigates the Diversity...

January 31, 2025

Large Language Models (LLMs) have become increasingly reliant on Reinforcement Learning from...

Memorization vs. Generalization: How Supervised Fine-Tuning SFT and Reinforcement Learning RL Shape Foundation Model...

January 31, 2025

Modern AI systems rely heavily on post-training techniques like supervised fine-tuning (SFT)...

Meta AI Proposes EvalPlanner: A Preference Optimization Algorithm for Thinking-LLM-as-a-Judge

January 31, 2025

The rapid advancement of Large Language Models (LLMs) has significantly improved their...

Agentic AI: The Foundations Based on Perception Layer, Knowledge Representation and Memory Systems

January 31, 2025

Agentic AI stands at the intersection of autonomy, intelligence, and adaptability, offering...

From Deep Knowledge Tracing to DKT2: A Leap Forward in Educational AI

January 31, 2025

Knowledge Tracing (KT) plays a crucial role in Intelligent Tutoring Systems (ITS) by modeling students’ knowledge states and predicting their future performance. Traditional KT...

Baidu Research Introduces EICopilot: An Intelligent Agent-based Chatbot to Retrieve and Interpret Enterprise Information...

January 31, 2025

Knowledge graphs have been used tremendously in the field of enterprise lately,...

Open Thoughts: An Open Source Initiative Advancing AI Reasoning with High-Quality Datasets and Models...

January 30, 2025

The critical issue of restricted access to high-quality reasoning datasets has limited...

Decoupling Tokenization: How Over-Tokenized Transformers Redefine Vocabulary Scaling in Language Models

January 30, 2025

Tokenization plays a fundamental role in the performance and scalability of Large...

Quantization Space Utilization Rate (QSUR): A Novel Post-Training Quantization Method Designed to Enhance the...

January 30, 2025

Post-training quantization (PTQ) focuses on reducing the size and improving the speed...

YuE: An Open-Source Music Generation AI Model Family Capable of Creating Full-Length Songs with...

January 30, 2025

Significant progress has been made in short-form instrumental compositions in AI and...

AI News

MiniMax Sparse Attention (MSA): a Two-Branch Block-Sparse Attention Trained on a 109B-Parameter MoE With a 3T-Token Budget

OpenAI’s Deployment Simulation Extends Pre-Deployment Risk Assessment to Agentic Coding Through Simulated Tool Calls

Curiosity-Driven Reinforcement Learning from Human Feedback CD-RLHF: An AI Framework that Mitigates the Diversity...

Memorization vs. Generalization: How Supervised Fine-Tuning SFT and Reinforcement Learning RL Shape Foundation Model...

Meta AI Proposes EvalPlanner: A Preference Optimization Algorithm for Thinking-LLM-as-a-Judge

Agentic AI: The Foundations Based on Perception Layer, Knowledge Representation and Memory Systems

From Deep Knowledge Tracing to DKT2: A Leap Forward in Educational AI

Baidu Research Introduces EICopilot: An Intelligent Agent-based Chatbot to Retrieve and Interpret Enterprise Information...

Open Thoughts: An Open Source Initiative Advancing AI Reasoning with High-Quality Datasets and Models...

Decoupling Tokenization: How Over-Tokenized Transformers Redefine Vocabulary Scaling in Language Models

Quantization Space Utilization Rate (QSUR): A Novel Post-Training Quantization Method Designed to Enhance the...

YuE: An Open-Source Music Generation AI Model Family Capable of Creating Full-Length Songs with...

Recommended

The SpaceX IPO Is Here. What Does It Mean for Crypto?

Bitcoin Bottom Not Here Yet? This Indicator Remains In Transition Phase

Meet ‘North Mini Code’: Cohere’s 30B Open-Weight Mixture-of-Experts Model With 3B...

Coinbase Council Warns 7 Million BTC May Face Quantum Risk

Securitize Expands STAC Tokenized AAA CLO Fund to Solana

EDITOR PICKS

ASTER Price Surges 20% as Volume Explodes 200%—Can the Token Reach...

UK Sanctions HTX Over Alleged $1.5 Billion Russia-Linked Crypto Flows

Altcoin Spot Sell Pressure Hits Five-Year Extreme, CryptoQuant Data Shows

POPULAR POSTS

A New AI Research from Italy Introduces a Diffusion-Based Generative Model...

Sorare 2023-24: New Gameplay Formats & Experiences

What Does it Mean to Deploy a Machine Learning Model?

POPULAR CATEGORY