Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 20 hours ago

Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min · about 21 hours ago

All Content

Llms

[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context

This article evaluates the divergent thinking capabilities of Large Language Models (LLMs) for scientific idea generation using minimal c...

arXiv - AI · 4 min · about 1 month ago

Llms

[2411.11707] Federated Co-tuning Framework for Large and Small Language Models

The paper presents FedCoLLM, a federated co-tuning framework that enhances the performance of both Large Language Models (LLMs) and Small...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2511.09731] FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching

FlowCast introduces a novel probabilistic model for precipitation nowcasting using Conditional Flow Matching, improving accuracy and effi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.04934] Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

The paper discusses the limitations of current unlearning methods in large language models (LLMs), revealing that they fail to effectivel...

arXiv - Machine Learning · 4 min · about 1 month ago

Generative Ai

[2510.26376] Efficient Generative AI Boosts Probabilistic Forecasting of Sudden Stratospheric Warmings

This article presents a novel generative AI model, FM-Cast, which enhances the probabilistic forecasting of Sudden Stratospheric Warmings...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.11390] Medical Interpretability and Knowledge Maps of Large Language Models

This article presents a systematic study of medical interpretability in Large Language Models (LLMs), exploring how these models process ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.08233] Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization

This paper presents Distribution Matching Policy Optimization (DMPO), a novel reinforcement learning method aimed at enhancing reasoning ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.11298] Voxtral Realtime

Voxtral Realtime presents a novel streaming automatic speech recognition model achieving offline transcription quality with sub-second la...

arXiv - AI · 5 min · about 1 month ago

Llms

[2510.00553] On Predictability of Reinforcement Learning Dynamics for Large Language Models

This article explores the predictability of reinforcement learning dynamics in large language models (LLMs), highlighting key properties ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.00502] Diffusion Alignment as Variational Expectation-Maximization

The paper introduces Diffusion Alignment as Variational Expectation-Maximization (DAV), a novel framework that optimizes diffusion models...

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Safety

[2602.07754] Humanizing AI Grading: Student-Centered Insights on Fairness, Trust, Consistency and Transparency

This study explores student perceptions of AI grading systems, focusing on fairness, trust, consistency, and transparency in an undergrad...

arXiv - AI · 3 min · about 1 month ago

Llms

[2509.22387] SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly

SpinGPT introduces a novel approach using large language models to enhance poker strategies, particularly in the Spin & Go format, achiev...

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.22295] Aurora: Towards Universal Generative Multimodal Time Series Forecasting

Aurora introduces a Multimodal Time Series Foundation Model that enhances cross-domain generalization in time series forecasting by integ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.21655] DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models

The paper presents DriftLite, a lightweight approach for inference-time scaling of diffusion models, enhancing adaptation to new distribu...

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Agents

[2512.17898] Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on Engagement and Trust Globally

This study explores how humanlike AI design influences user engagement and trust across different cultures, revealing that anthropomorphi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2509.13648] Sequential Data Augmentation for Generative Recommendation

This article introduces GenPAS, a novel framework for data augmentation in generative recommendation systems, emphasizing its impact on m...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2510.12066] AI Agents as Universal Task Solvers

The paper discusses AI agents as stochastic dynamical systems, emphasizing their ability to learn and reason through transductive inferen...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2508.05612] Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

The paper presents Shuffle-R1, a novel reinforcement learning framework designed to enhance the efficiency of multimodal large language m...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.00523] VIRTUE: Visual-Interactive Text-Image Universal Embedder

The paper presents VIRTUE, a novel Visual-Interactive Text-Image Universal Embedder that enhances multimodal representation learning by i...

arXiv - AI · 4 min · about 1 month ago

Llms

[2508.02066] MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs

The paper presents MolReasoner, a two-stage framework designed to enhance molecular reasoning in large language models (LLMs), addressing...

arXiv - AI · 4 min · about 1 month ago

Previous Page 54 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

AI video generation seems fundamentally more expensive than text, not just less optimized

Accelerating science with AI and simulations

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

All Content

[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context

[2411.11707] Federated Co-tuning Framework for Large and Small Language Models

[2511.09731] FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching

[2511.04934] Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

[2510.26376] Efficient Generative AI Boosts Probabilistic Forecasting of Sudden Stratospheric Warmings

[2510.11390] Medical Interpretability and Knowledge Maps of Large Language Models

[2510.08233] Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization

[2602.11298] Voxtral Realtime

[2510.00553] On Predictability of Reinforcement Learning Dynamics for Large Language Models

[2510.00502] Diffusion Alignment as Variational Expectation-Maximization

[2602.07754] Humanizing AI Grading: Student-Centered Insights on Fairness, Trust, Consistency and Transparency

[2509.22387] SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly

[2509.22295] Aurora: Towards Universal Generative Multimodal Time Series Forecasting

[2509.21655] DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models

[2512.17898] Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on Engagement and Trust Globally

[2509.13648] Sequential Data Augmentation for Generative Recommendation

[2510.12066] AI Agents as Universal Task Solvers

[2508.05612] Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

[2510.00523] VIRTUE: Visual-Interactive Text-Image Universal Embedder

[2508.02066] MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs

Related Topics

Stay updated with AI News