Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 11 hours ago

Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min · about 12 hours ago

Llms

[2602.00388] Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

Abstract page for arXiv paper 2602.00388: Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

arXiv - Machine Learning · 4 min · about 12 hours ago

All Content

Machine Learning

[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

This paper presents a multimodal denoising diffusion model for synthesizing discrete-continuous quantum circuits, improving efficiency in...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning

HoloLLM introduces a Multimodal Large Language Model that enhances human sensing and reasoning by integrating diverse sensory inputs, out...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2504.18310] How much does context affect the accuracy of AI health advice?

This article examines how linguistic and contextual factors influence the accuracy of AI-generated health advice, revealing significant d...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2504.12007] Diffusion Generative Recommendation with Continuous Tokens

The paper presents ContRec, a novel framework that integrates continuous tokens into LLM-based recommender systems, enhancing user prefer...

arXiv - AI · 4 min · about 1 month ago

Llms

[2502.05310] Oracular Programming: A Modular Foundation for Building LLM-Enabled Software

The paper introduces 'oracular programming,' a paradigm that integrates traditional computations with LLMs to enhance software reliabilit...

arXiv - AI · 4 min · about 1 month ago

Llms

[2405.10385] Augmenting Lateral Thinking in Language Models with Humor and Riddle Data for the BRAINTEASER Task

This paper explores enhancing language models' lateral thinking abilities by integrating humor and riddle datasets for the BRAINTEASER ta...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?

This article explores the potential of large language models (LLMs) to act as mediators in online conflicts, moving beyond moderation to ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.26784] LLMs Process Lists With General Filter Heads

This paper explores how large language models (LLMs) process list-based tasks using filter heads, revealing their ability to encode gener...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

This article evaluates biases in Large Language Models (LLMs) used as judges in communication systems, assessing their reliability and pr...

arXiv - AI · 4 min · about 1 month ago

Llms

[2508.01012] AutoEDA: Enabling EDA Flow Automation through Microservice-Based LLM Agents

The article presents AutoEDA, a framework that utilizes microservice-based LLM agents to automate Electronic Design Automation (EDA) proc...

arXiv - AI · 4 min · about 1 month ago

Llms

[2506.18777] Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs

The paper introduces Programming by Backprop (PBB), a novel training method for large language models (LLMs) that allows them to learn pr...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation

This paper presents STPR, a framework that utilizes large language models to convert complex natural language constraints into executable...

arXiv - AI · 4 min · about 1 month ago

Llms

[2503.12434] A Survey on the Optimization of Large Language Model-based Agents

This survey reviews optimization techniques for Large Language Model (LLM)-based agents, categorizing methods into parameter-driven and p...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20981] Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models

This paper presents MMHNet, a novel multimodal hierarchical network that enhances video-to-audio generation by enabling models to general...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20951] See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

This paper presents ArtiAgent, a novel approach to automate the creation of artifact-annotated datasets for training visual language mode...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2602.20946] Some Simple Economics of AGI

This article explores the economic implications of Artificial General Intelligence (AGI), focusing on the transition from human cognition...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20752] OrthoDiffusion: A Generalizable Multi-Task Diffusion Foundation Model for Musculoskeletal MRI Interpretation

OrthoDiffusion is a novel diffusion-based model designed for multi-task interpretation of musculoskeletal MRI scans, improving diagnostic...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20735] RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition

The paper presents RMIT-ADM+S, an award-winning system for the Text-to-Text track at the NeurIPS 2025 Competition, featuring a novel retr...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

The paper presents AdapTools, a novel framework for adaptive indirect prompt injection attacks on agentic large language models (LLMs), h...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer

The paper presents TrajGPT-R, a framework for generating urban mobility trajectories using a reinforcement learning-enhanced generative t...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 48 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Accelerating science with AI and simulations

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

[2602.00388] Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

All Content

[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models

[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning

[2504.18310] How much does context affect the accuracy of AI health advice?

[2504.12007] Diffusion Generative Recommendation with Continuous Tokens

[2502.05310] Oracular Programming: A Modular Foundation for Building LLM-Enabled Software

[2405.10385] Augmenting Lateral Thinking in Language Models with Humor and Riddle Data for the BRAINTEASER Task

[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?

[2510.26784] LLMs Process Lists With General Filter Heads

[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

[2508.01012] AutoEDA: Enabling EDA Flow Automation through Microservice-Based LLM Agents

[2506.18777] Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs

[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation

[2503.12434] A Survey on the Optimization of Large Language Model-based Agents

[2602.20981] Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models

[2602.20951] See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

[2602.20946] Some Simple Economics of AGI

[2602.20752] OrthoDiffusion: A Generalizable Multi-Task Diffusion Foundation Model for Musculoskeletal MRI Interpretation

[2602.20735] RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition

[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer

Related Topics

Stay updated with AI News