Microsoft AI Researchers Release LLaVA-Rad: A Lightweight Open-Source Foundation Model for Advanced Clinical Radiology...

Large foundation models have demonstrated remarkable potential in biomedical applications, offering promising...

Kyutai Releases Hibiki: A 2.7B Real-Time Speech-to-Speech and Speech-to-Text Translation with Near-Human Quality and...

Real-time speech translation presents a complex challenge, requiring seamless integration of speech...

This AI Paper Introduces MAETok: A Masked Autoencoder-Based Tokenizer for Efficient Diffusion Models

Diffusion models generate images by progressively refining noise into structured representations. However,...

ChunkKV: Optimizing KV Cache Compression for Efficient Long-Context Inference in LLMs

Efficient long-context inference with LLMs requires managing substantial GPU memory due to...

Meta AI Introduces ParetoQ: A Unified Machine Learning Framework for Sub-4-Bit Quantization in Large...

As deep learning models continue to grow, the quantization of machine learning...

Sundial: A New Era for Time Series Foundation Models with Generative AI

Time series forecasting presents a fundamental challenge due to its intrinsic non-determinism,...

Meet ZebraLogic: A Comprehensive AI Evaluation Framework for Assessing LLM Reasoning Performance on Logic...

Logical reasoning remains a crucial area where AI systems struggle despite advances...

IBM AI Releases Granite-Vision-3.1-2B: A Small Vision Language Model with Super Impressive Performance on...

The integration of visual and textual data in artificial intelligence presents a...

Singapore University of Technology and Design (SUTD) Explores Advancements and Challenges in Multimodal Reasoning...

After the success of large language models (LLMs), the current research extends...

Process Reinforcement through Implicit Rewards (PRIME): A Scalable Machine Learning Framework for Enhancing Reasoning...

Reinforcement learning (RL) for large language models (LLMs) has traditionally relied on...

Recommended