Generative AI

Image, video, audio, and text generation

Top This Week

Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min ·
Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion
Machine Learning

[2603.10202] Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Approach with Jump-Diffusion

Abstract page for arXiv paper 2603.10202: Hybrid Hidden Markov Model for Modeling Equity Excess Growth Rate Dynamics: A Discrete-State Ap...

arXiv - Machine Learning · 4 min ·

All Content

[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer
Llms

[2602.20643] TrajGPT-R: Generating Urban Mobility Trajectory with Reinforcement Learning-Enhanced Generative Pre-trained Transformer

The paper presents TrajGPT-R, a framework for generating urban mobility trajectories using a reinforcement learning-enhanced generative t...

arXiv - Machine Learning · 4 min ·
[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI
Machine Learning

[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI

This article explores the factors influencing students' adoption of AI chatbots for learning, utilizing the Technology Acceptance Model t...

arXiv - AI · 4 min ·
[2602.20520] How Do Inpainting Artifacts Propagate to Language?
Llms

[2602.20520] How Do Inpainting Artifacts Propagate to Language?

This paper investigates how visual artifacts from diffusion-based inpainting affect language generation in vision-language models, reveal...

arXiv - AI · 3 min ·
[2602.20497] LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration
Machine Learning

[2602.20497] LESA: Learnable Stage-Aware Predictors for Diffusion Model Acceleration

The paper introduces LESA, a framework for accelerating diffusion models using learnable stage-aware predictors, achieving significant sp...

arXiv - AI · 4 min ·
[2602.20492] Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA
Llms

[2602.20492] Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA

This paper presents a novel approach to decentralized federated learning for multi-task large language model fine-tuning, addressing key ...

arXiv - Machine Learning · 4 min ·
[2602.20480] VINA: Variational Invertible Neural Architectures
Machine Learning

[2602.20480] VINA: Variational Invertible Neural Architectures

The paper presents VINA, a framework for Variational Invertible Neural Architectures, addressing theoretical gaps in normalizing flows an...

arXiv - Machine Learning · 4 min ·
[2602.20408] Examining and Addressing Barriers to Diversity in LLM-Generated Ideas
Llms

[2602.20408] Examining and Addressing Barriers to Diversity in LLM-Generated Ideas

This article explores the limitations of diversity in ideas generated by large language models (LLMs) compared to human creativity, ident...

arXiv - AI · 4 min ·
[2602.20400] Three Concrete Challenges and Two Hopes for the Safety of Unsupervised Elicitation
Llms

[2602.20400] Three Concrete Challenges and Two Hopes for the Safety of Unsupervised Elicitation

This article discusses three significant challenges and two potential solutions for improving the safety of unsupervised elicitation in l...

arXiv - Machine Learning · 4 min ·
[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems
Llms

[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems

The paper presents a case-aware evaluation framework for enterprise-scale Retrieval-Augmented Generation (RAG) systems, addressing the li...

arXiv - AI · 3 min ·
[2602.20300] What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance
Llms

[2602.20300] What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

This article examines how specific linguistic features of queries impact the performance of Large Language Models (LLMs), particularly in...

arXiv - AI · 3 min ·
[2602.20294] InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation
Llms

[2602.20294] InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation

The paper presents InterviewSim, a framework for simulating personalities using large language models grounded in real interview data, en...

arXiv - AI · 4 min ·
[2602.20217] KnapSpec: Self-Speculative Decoding via Adaptive Layer Selection as a Knapsack Problem
Llms

[2602.20217] KnapSpec: Self-Speculative Decoding via Adaptive Layer Selection as a Knapsack Problem

The paper introduces KnapSpec, a framework for self-speculative decoding that optimizes layer selection in LLMs as a knapsack problem, en...

arXiv - Machine Learning · 4 min ·
[2602.20210] Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling
Machine Learning

[2602.20210] Multimodal Crystal Flow: Any-to-Any Modality Generation for Unified Crystal Modeling

The paper presents Multimodal Crystal Flow (MCFlow), a unified model for crystal generation tasks that enhances performance by integratin...

arXiv - Machine Learning · 3 min ·
[2602.20206] Mitigating "Epistemic Debt" in Generative AI-Scaffolded Novice Programming using Metacognitive Scripts
Llms

[2602.20206] Mitigating "Epistemic Debt" in Generative AI-Scaffolded Novice Programming using Metacognitive Scripts

This paper explores the concept of 'Epistemic Debt' in novice programming using generative AI, proposing metacognitive scripts to enhance...

arXiv - AI · 4 min ·
[2602.20193] When Backdoors Go Beyond Triggers: Semantic Drift in Diffusion Models Under Encoder Attacks
Machine Learning

[2602.20193] When Backdoors Go Beyond Triggers: Semantic Drift in Diffusion Models Under Encoder Attacks

This paper investigates the impact of encoder-side poisoning on text-to-image models, revealing that traditional evaluations of backdoor ...

arXiv - AI · 3 min ·
[2602.20181] Closing the Expertise Gap in Residential Building Energy Retrofits: A Domain-Specific LLM for Informed Decision-Making
Llms

[2602.20181] Closing the Expertise Gap in Residential Building Energy Retrofits: A Domain-Specific LLM for Informed Decision-Making

This article presents a domain-specific large language model (LLM) designed to assist homeowners in making informed decisions about resid...

arXiv - AI · 3 min ·
[2602.20170] CAGE: A Framework for Culturally Adaptive Red-Teaming Benchmark Generation
Llms

[2602.20170] CAGE: A Framework for Culturally Adaptive Red-Teaming Benchmark Generation

The paper introduces CAGE, a framework for culturally adaptive red-teaming benchmark generation, addressing the limitations of existing b...

arXiv - AI · 3 min ·
[2602.20162] Talking to Yourself: Defying Forgetting in Large Language Models
Llms

[2602.20162] Talking to Yourself: Defying Forgetting in Large Language Models

The paper introduces SA-SFT, a self-augmentation method for fine-tuning large language models (LLMs) that mitigates catastrophic forgetti...

arXiv - AI · 3 min ·
[2602.21201] Aletheia tackles FirstProof autonomously
Llms

[2602.21201] Aletheia tackles FirstProof autonomously

The paper presents Aletheia, an autonomous mathematics research agent that successfully solved 6 out of 10 problems in the FirstProof cha...

arXiv - Machine Learning · 3 min ·
[2602.21143] A Benchmark for Deep Information Synthesis
Llms

[2602.21143] A Benchmark for Deep Information Synthesis

The paper introduces DEEPSYNTH, a benchmark for evaluating large language models on complex tasks requiring deep information synthesis an...

arXiv - Machine Learning · 4 min ·
Previous Page 49 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime