Generative AI

Image, video, audio, and text generation

Top This Week

Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min ·
Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion
Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min ·

All Content

[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context
Llms

[2412.17596] Evaluating LLMs' Divergent Thinking Capabilities for Scientific Idea Generation with Minimal Context

This article evaluates the divergent thinking capabilities of Large Language Models (LLMs) for scientific idea generation using minimal c...

arXiv - AI · 4 min ·
[2411.11707] Federated Co-tuning Framework for Large and Small Language Models
Llms

[2411.11707] Federated Co-tuning Framework for Large and Small Language Models

The paper presents FedCoLLM, a federated co-tuning framework that enhances the performance of both Large Language Models (LLMs) and Small...

arXiv - AI · 4 min ·
[2511.09731] FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching
Machine Learning

[2511.09731] FlowCast: Advancing Precipitation Nowcasting with Conditional Flow Matching

FlowCast introduces a novel probabilistic model for precipitation nowcasting using Conditional Flow Matching, improving accuracy and effi...

arXiv - Machine Learning · 4 min ·
[2511.04934] Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding
Llms

[2511.04934] Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

The paper discusses the limitations of current unlearning methods in large language models (LLMs), revealing that they fail to effectivel...

arXiv - Machine Learning · 4 min ·
[2510.26376] Efficient Generative AI Boosts Probabilistic Forecasting of Sudden Stratospheric Warmings
Generative Ai

[2510.26376] Efficient Generative AI Boosts Probabilistic Forecasting of Sudden Stratospheric Warmings

This article presents a novel generative AI model, FM-Cast, which enhances the probabilistic forecasting of Sudden Stratospheric Warmings...

arXiv - Machine Learning · 4 min ·
[2510.11390] Medical Interpretability and Knowledge Maps of Large Language Models
Llms

[2510.11390] Medical Interpretability and Knowledge Maps of Large Language Models

This article presents a systematic study of medical interpretability in Large Language Models (LLMs), exploring how these models process ...

arXiv - AI · 4 min ·
[2510.08233] Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization
Llms

[2510.08233] Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization

This paper presents Distribution Matching Policy Optimization (DMPO), a novel reinforcement learning method aimed at enhancing reasoning ...

arXiv - Machine Learning · 4 min ·
[2602.11298] Voxtral Realtime
Machine Learning

[2602.11298] Voxtral Realtime

Voxtral Realtime presents a novel streaming automatic speech recognition model achieving offline transcription quality with sub-second la...

arXiv - AI · 5 min ·
[2510.00553] On Predictability of Reinforcement Learning Dynamics for Large Language Models
Llms

[2510.00553] On Predictability of Reinforcement Learning Dynamics for Large Language Models

This article explores the predictability of reinforcement learning dynamics in large language models (LLMs), highlighting key properties ...

arXiv - AI · 4 min ·
[2510.00502] Diffusion Alignment as Variational Expectation-Maximization
Machine Learning

[2510.00502] Diffusion Alignment as Variational Expectation-Maximization

The paper introduces Diffusion Alignment as Variational Expectation-Maximization (DAV), a novel framework that optimizes diffusion models...

arXiv - Machine Learning · 3 min ·
[2602.07754] Humanizing AI Grading: Student-Centered Insights on Fairness, Trust, Consistency and Transparency
Ai Safety

[2602.07754] Humanizing AI Grading: Student-Centered Insights on Fairness, Trust, Consistency and Transparency

This study explores student perceptions of AI grading systems, focusing on fairness, trust, consistency, and transparency in an undergrad...

arXiv - AI · 3 min ·
[2509.22387] SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly
Llms

[2509.22387] SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly

SpinGPT introduces a novel approach using large language models to enhance poker strategies, particularly in the Spin & Go format, achiev...

arXiv - AI · 4 min ·
[2509.22295] Aurora: Towards Universal Generative Multimodal Time Series Forecasting
Llms

[2509.22295] Aurora: Towards Universal Generative Multimodal Time Series Forecasting

Aurora introduces a Multimodal Time Series Foundation Model that enhances cross-domain generalization in time series forecasting by integ...

arXiv - Machine Learning · 4 min ·
[2509.21655] DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models
Machine Learning

[2509.21655] DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models

The paper presents DriftLite, a lightweight approach for inference-time scaling of diffusion models, enhancing adaptation to new distribu...

arXiv - Machine Learning · 3 min ·
[2512.17898] Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on Engagement and Trust Globally
Ai Agents

[2512.17898] Humanlike AI Design Increases Anthropomorphism but Yields Divergent Outcomes on Engagement and Trust Globally

This study explores how humanlike AI design influences user engagement and trust across different cultures, revealing that anthropomorphi...

arXiv - AI · 4 min ·
[2509.13648] Sequential Data Augmentation for Generative Recommendation
Machine Learning

[2509.13648] Sequential Data Augmentation for Generative Recommendation

This article introduces GenPAS, a novel framework for data augmentation in generative recommendation systems, emphasizing its impact on m...

arXiv - Machine Learning · 4 min ·
[2510.12066] AI Agents as Universal Task Solvers
Machine Learning

[2510.12066] AI Agents as Universal Task Solvers

The paper discusses AI agents as stochastic dynamical systems, emphasizing their ability to learn and reason through transductive inferen...

arXiv - Machine Learning · 4 min ·
[2508.05612] Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle
Llms

[2508.05612] Shuffle-R1: Efficient RL framework for Multimodal Large Language Models via Data-centric Dynamic Shuffle

The paper presents Shuffle-R1, a novel reinforcement learning framework designed to enhance the efficiency of multimodal large language m...

arXiv - AI · 4 min ·
[2510.00523] VIRTUE: Visual-Interactive Text-Image Universal Embedder
Llms

[2510.00523] VIRTUE: Visual-Interactive Text-Image Universal Embedder

The paper presents VIRTUE, a novel Visual-Interactive Text-Image Universal Embedder that enhances multimodal representation learning by i...

arXiv - AI · 4 min ·
[2508.02066] MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs
Llms

[2508.02066] MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs

The paper presents MolReasoner, a two-stage framework designed to enhance molecular reasoning in large language models (LLMs), addressing...

arXiv - AI · 4 min ·
Previous Page 54 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime