Generative AI

Image, video, audio, and text generation

Top This Week

Accelerating science with AI and simulations
Machine Learning

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion
Machine Learning

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min ·
[2602.00388] Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode
LLMs

Abstract page for arXiv paper 2602.00388: Safer by Diffusion, Broken by Context: Diffusion LLM's Safety Blessing and Its Failure Mode

arXiv - Machine Learning · 4 min ·

All Content

[2506.01666] Synthesis of discrete-continuous quantum circuits with multimodal diffusion models
Machine Learning

This paper presents a multimodal denoising diffusion model for synthesizing discrete-continuous quantum circuits, improving efficiency in...

arXiv - Machine Learning · 4 min ·
[2505.17645] HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning
LLMs

HoloLLM introduces a Multimodal Large Language Model that enhances human sensing and reasoning by integrating diverse sensory inputs, out...

arXiv - Machine Learning · 4 min ·
[2504.18310] How much does context affect the accuracy of AI health advice?
LLMs

This article examines how linguistic and contextual factors influence the accuracy of AI-generated health advice, revealing significant d...

arXiv - Machine Learning · 4 min ·
[2504.12007] Diffusion Generative Recommendation with Continuous Tokens
LLMs

The paper presents ContRec, a novel framework that integrates continuous tokens into LLM-based recommender systems, enhancing user prefer...

arXiv - AI · 4 min ·
[2502.05310] Oracular Programming: A Modular Foundation for Building LLM-Enabled Software
LLMs

The paper introduces 'oracular programming,' a paradigm that integrates traditional computations with LLMs to enhance software reliabilit...

arXiv - AI · 4 min ·
[2405.10385] Augmenting Lateral Thinking in Language Models with Humor and Riddle Data for the BRAINTEASER Task
LLMs

This paper explores enhancing language models' lateral thinking abilities by integrating humor and riddle datasets for the BRAINTEASER ta...

arXiv - Machine Learning · 4 min ·
[2512.03005] From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?
LLMs

This article explores the potential of large language models (LLMs) to act as mediators in online conflicts, moving beyond moderation to ...

arXiv - AI · 4 min ·
[2510.26784] LLMs Process Lists With General Filter Heads
LLMs

This paper explores how large language models (LLMs) process list-based tasks using filter heads, revealing their ability to encode gener...

arXiv - AI · 4 min ·
[2510.12462] Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems
LLMs

This article evaluates biases in Large Language Models (LLMs) used as judges in communication systems, assessing their reliability and pr...

arXiv - AI · 4 min ·
[2508.01012] AutoEDA: Enabling EDA Flow Automation through Microservice-Based LLM Agents
LLMs

The article presents AutoEDA, a framework that utilizes microservice-based LLM agents to automate Electronic Design Automation (EDA) proc...

arXiv - AI · 4 min ·
[2506.18777] Programming by Backprop: An Instruction is Worth 100 Examples When Finetuning LLMs
LLMs

The paper introduces Programming by Backprop (PBB), a novel training method for large language models (LLMs) that allows them to learn pr...

arXiv - Machine Learning · 4 min ·
[2506.04500] "Don't Do That!": Guiding Embodied Systems through Large Language Model-based Constraint Generation
LLMs

This paper presents STPR, a framework that utilizes large language models to convert complex natural language constraints into executable...

arXiv - AI · 4 min ·
[2503.12434] A Survey on the Optimization of Large Language Model-based Agents
LLMs

This survey reviews optimization techniques for Large Language Model (LLM)-based agents, categorizing methods into parameter-driven and p...

arXiv - AI · 4 min ·
[2602.20981] Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models
Machine Learning

This paper presents MMHNet, a novel multimodal hierarchical network that enhances video-to-audio generation by enabling models to general...

arXiv - AI · 4 min ·
[2602.20951] See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis
Machine Learning

This paper presents ArtiAgent, a novel approach to automate the creation of artifact-annotated datasets for training visual language mode...

arXiv - AI · 4 min ·
[2602.20946] Some Simple Economics of AGI
AI Agents

This article explores the economic implications of Artificial General Intelligence (AGI), focusing on the transition from human cognition...

arXiv - Machine Learning · 4 min ·
[2602.20752] OrthoDiffusion: A Generalizable Multi-Task Diffusion Foundation Model for Musculoskeletal MRI Interpretation
LLMs

OrthoDiffusion is a novel diffusion-based model designed for multi-task interpretation of musculoskeletal MRI scans, improving diagnostic...

arXiv - AI · 4 min ·
[2602.20735] RMIT-ADM+S at the MMU-RAG NeurIPS 2025 Competition
LLMs

The paper presents RMIT-ADM+S, an award-winning system for the Text-to-Text track at the NeurIPS 2025 Competition, featuring a novel retr...

arXiv - AI · 3 min ·
[2602.20720] AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs
LLMs

The paper presents AdapTools, a novel framework for adaptive indirect prompt injection attacks on agentic large language models (LLMs), h...

arXiv - AI · 4 min ·
[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer
LLMs

The paper presents TrajGPT-R, a framework for generating urban mobility trajectories using a reinforcement learning-enhanced generative t...

arXiv - Machine Learning · 4 min ·