Generative AI

Image, video, audio, and text generation

Top This Week

Really, you made this without AI? Prove it | The Verge
Generative Ai

Really, you made this without AI? Prove it | The Verge

Creatives want to start labeling human-made text, images, audio, and video with AI-free logos. Now they just have to pick one.

The Verge - AI · 10 min ·
Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min ·
Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·

All Content

[2505.20674] PonderLM: Pretraining Language Models to Ponder in Continuous Space
Llms

[2505.20674] PonderLM: Pretraining Language Models to Ponder in Continuous Space

PonderLM introduces a novel approach to language model training by incorporating a 'pondering' phase, enhancing cognitive processing duri...

arXiv - AI · 4 min ·
[2505.17064] Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models
Machine Learning

[2505.17064] Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models

This article evaluates how Text-to-Image diffusion models represent historical contexts, introducing a benchmark to assess their accuracy...

arXiv - Machine Learning · 4 min ·
[2505.12664] Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design
Machine Learning

[2505.12664] Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design

This paper presents a novel framework for high-precision target sensing using multi-view wireless channel state information (CSI) through...

arXiv - Machine Learning · 4 min ·
[2505.18150] Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning
Machine Learning

[2505.18150] Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning

The paper introduces Generative Distribution Embeddings (GDE), a novel framework that enhances autoencoders for multiscale representation...

arXiv - Machine Learning · 4 min ·
[2505.11409] Visual Planning: Let's Think Only with Images
Llms

[2505.11409] Visual Planning: Let's Think Only with Images

The paper introduces 'Visual Planning', a new paradigm that utilizes images for reasoning in spatial tasks, enhancing planning capabiliti...

arXiv - Machine Learning · 4 min ·
[2601.06500] The AI Pyramid A Conceptual Framework for Workforce Capability in the Age of AI
Generative Ai

[2601.06500] The AI Pyramid A Conceptual Framework for Workforce Capability in the Age of AI

The article presents the AI Pyramid, a framework for understanding workforce capabilities in an AI-driven economy, emphasizing the need f...

arXiv - AI · 4 min ·
[1803.09319] SUNLayer: Stable denoising with generative networks
Machine Learning

[1803.09319] SUNLayer: Stable denoising with generative networks

The paper introduces SUNLayer, a theoretical framework for stable denoising using generative networks, focusing on activation functions a...

arXiv - Machine Learning · 3 min ·
[2510.25860] Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters
Llms

[2510.25860] Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters

This article discusses a framework that enhances the reliability of large language model (LLM) raters by inferring thinking traces from l...

arXiv - AI · 4 min ·
[2403.08802] Governance of Generative Artificial Intelligence for Companies
Llms

[2403.08802] Governance of Generative Artificial Intelligence for Companies

This article reviews governance frameworks for Generative AI, focusing on how companies can effectively manage the integration of large l...

arXiv - Machine Learning · 4 min ·
[2602.18372] "How Do I ...?": Procedural Questions Predominate Student-LLM Chatbot Conversations
Llms

[2602.18372] "How Do I ...?": Procedural Questions Predominate Student-LLM Chatbot Conversations

This paper investigates the predominance of procedural questions in student interactions with LLM chatbots, analyzing data from various l...

arXiv - AI · 4 min ·
[2602.18262] Simplifying Outcomes of Language Model Component Analyses with ELIA
Llms

[2602.18262] Simplifying Outcomes of Language Model Component Analyses with ELIA

The paper presents ELIA, an interactive web application designed to simplify the analysis of Large Language Models (LLMs) for non-experts...

arXiv - Machine Learning · 4 min ·
[2602.18171] Click it or Leave it: Detecting and Spoiling Clickbait with Informativeness Measures and Large Language Models
Llms

[2602.18171] Click it or Leave it: Detecting and Spoiling Clickbait with Informativeness Measures and Large Language Models

This paper presents a hybrid approach to detecting clickbait using large language models and informativeness measures, achieving a high F...

arXiv - AI · 3 min ·
[2602.18104] MeanVoiceFlow: One-step Nonparallel Voice Conversion with Mean Flows
Machine Learning

[2602.18104] MeanVoiceFlow: One-step Nonparallel Voice Conversion with Mean Flows

MeanVoiceFlow introduces a one-step nonparallel voice conversion model that enhances speech quality and speaker similarity while reducing...

arXiv - Machine Learning · 4 min ·
[2602.18092] Perceived Political Bias in LLMs Reduces Persuasive Abilities
Llms

[2602.18092] Perceived Political Bias in LLMs Reduces Persuasive Abilities

This article explores how perceived political bias in large language models (LLMs) can diminish their effectiveness in persuasion, reveal...

arXiv - AI · 3 min ·
[2602.17830] Drift Estimation for Stochastic Differential Equations with Denoising Diffusion Models
Machine Learning

[2602.17830] Drift Estimation for Stochastic Differential Equations with Denoising Diffusion Models

This paper explores drift estimation in multivariate stochastic differential equations using denoising diffusion models, proposing a new ...

arXiv - Machine Learning · 3 min ·
[2602.17787] Market Games for Generative Models: Equilibria, Welfare, and Strategic Entry
Machine Learning

[2602.17787] Market Games for Generative Models: Equilibria, Welfare, and Strategic Entry

This paper explores market dynamics in generative model ecosystems, focusing on equilibria, welfare implications, and strategic entry by ...

arXiv - Machine Learning · 3 min ·
[2602.18022] Dual-Channel Attention Guidance for Training-Free Image Editing Control in Diffusion Transformers
Machine Learning

[2602.18022] Dual-Channel Attention Guidance for Training-Free Image Editing Control in Diffusion Transformers

This paper introduces Dual-Channel Attention Guidance (DCAG), a novel training-free method for enhancing image editing control in Diffusi...

arXiv - AI · 4 min ·
[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds
Machine Learning

[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds

The paper presents a novel generative modeling framework for synthesizing physically feasible two-dimensional incompressible flows, addre...

arXiv - Machine Learning · 3 min ·
[2602.17770] CLUTCH: Contextualized Language model for Unlocking Text-Conditioned Hand motion modelling in the wild
Llms

[2602.17770] CLUTCH: Contextualized Language model for Unlocking Text-Conditioned Hand motion modelling in the wild

The paper introduces CLUTCH, a novel model for generating hand motions from text, leveraging a new dataset and advanced techniques to imp...

arXiv - Machine Learning · 4 min ·
[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications
Machine Learning

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications

CUICurate introduces a GraphRAG framework for automated curation of clinical concepts in NLP, enhancing efficiency and accuracy in clinic...

arXiv - AI · 4 min ·
Previous Page 64 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime