Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Generative Ai

Really, you made this without AI? Prove it | The Verge

Creatives want to start labeling human-made text, images, audio, and video with AI-free logos. Now they just have to pick one.

The Verge - AI · 10 min · about 8 hours ago

Machine Learning

AI video generation seems fundamentally more expensive than text, not just less optimized

There’s been a lot of discussion recently about how expensive AI video generation is compared to text, and it feels like this is more tha...

Reddit - Artificial Intelligence · 1 min · 1 day ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · 1 day ago

All Content

Llms

[2505.20674] PonderLM: Pretraining Language Models to Ponder in Continuous Space

PonderLM introduces a novel approach to language model training by incorporating a 'pondering' phase, enhancing cognitive processing duri...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2505.17064] Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models

This article evaluates how Text-to-Image diffusion models represent historical contexts, introducing a benchmark to assess their accuracy...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2505.12664] Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design

This paper presents a novel framework for high-precision target sensing using multi-view wireless channel state information (CSI) through...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2505.18150] Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning

The paper introduces Generative Distribution Embeddings (GDE), a novel framework that enhances autoencoders for multiscale representation...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2505.11409] Visual Planning: Let's Think Only with Images

The paper introduces 'Visual Planning', a new paradigm that utilizes images for reasoning in spatial tasks, enhancing planning capabiliti...

arXiv - Machine Learning · 4 min · about 1 month ago

Generative Ai

[2601.06500] The AI Pyramid A Conceptual Framework for Workforce Capability in the Age of AI

The article presents the AI Pyramid, a framework for understanding workforce capabilities in an AI-driven economy, emphasizing the need f...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[1803.09319] SUNLayer: Stable denoising with generative networks

The paper introduces SUNLayer, a theoretical framework for stable denoising using generative networks, focusing on activation functions a...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2510.25860] Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters

This article discusses a framework that enhances the reliability of large language model (LLM) raters by inferring thinking traces from l...

arXiv - AI · 4 min · about 1 month ago

Llms

[2403.08802] Governance of Generative Artificial Intelligence for Companies

This article reviews governance frameworks for Generative AI, focusing on how companies can effectively manage the integration of large l...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18372] "How Do I ...?": Procedural Questions Predominate Student-LLM Chatbot Conversations

This paper investigates the predominance of procedural questions in student interactions with LLM chatbots, analyzing data from various l...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18262] Simplifying Outcomes of Language Model Component Analyses with ELIA

The paper presents ELIA, an interactive web application designed to simplify the analysis of Large Language Models (LLMs) for non-experts...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18171] Click it or Leave it: Detecting and Spoiling Clickbait with Informativeness Measures and Large Language Models

This paper presents a hybrid approach to detecting clickbait using large language models and informativeness measures, achieving a high F...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18104] MeanVoiceFlow: One-step Nonparallel Voice Conversion with Mean Flows

MeanVoiceFlow introduces a one-step nonparallel voice conversion model that enhances speech quality and speaker similarity while reducing...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18092] Perceived Political Bias in LLMs Reduces Persuasive Abilities

This article explores how perceived political bias in large language models (LLMs) can diminish their effectiveness in persuasion, reveal...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.17830] Drift Estimation for Stochastic Differential Equations with Denoising Diffusion Models

This paper explores drift estimation in multivariate stochastic differential equations using denoising diffusion models, proposing a new ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.17787] Market Games for Generative Models: Equilibria, Welfare, and Strategic Entry

This paper explores market dynamics in generative model ecosystems, focusing on equilibria, welfare implications, and strategic entry by ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18022] Dual-Channel Attention Guidance for Training-Free Image Editing Control in Diffusion Transformers

This paper introduces Dual-Channel Attention Guidance (DCAG), a novel training-free method for enhancing image editing control in Diffusi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds

The paper presents a novel generative modeling framework for synthesizing physically feasible two-dimensional incompressible flows, addre...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.17770] CLUTCH: Contextualized Language model for Unlocking Text-Conditioned Hand motion modelling in the wild

The paper introduces CLUTCH, a novel model for generating hand motions from text, leveraging a new dataset and advanced techniques to imp...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications

CUICurate introduces a GraphRAG framework for automated curation of clinical concepts in NLP, enhancing efficiency and accuracy in clinic...

arXiv - AI · 4 min · about 1 month ago

Previous Page 64 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Really, you made this without AI? Prove it | The Verge

AI video generation seems fundamentally more expensive than text, not just less optimized

Accelerating science with AI and simulations

All Content

[2505.20674] PonderLM: Pretraining Language Models to Ponder in Continuous Space

[2505.17064] Synthetic History: Evaluating Visual Representations of the Past in Diffusion Models

[2505.12664] Multi-View Wireless Sensing via Conditional Generative Learning: Framework and Model Design

[2505.18150] Generative Distribution Embeddings: Lifting autoencoders to the space of distributions for multiscale representation learning

[2505.11409] Visual Planning: Let's Think Only with Images

[2601.06500] The AI Pyramid A Conceptual Framework for Workforce Capability in the Age of AI

[1803.09319] SUNLayer: Stable denoising with generative networks

[2510.25860] Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters

[2403.08802] Governance of Generative Artificial Intelligence for Companies

[2602.18372] "How Do I ...?": Procedural Questions Predominate Student-LLM Chatbot Conversations

[2602.18262] Simplifying Outcomes of Language Model Component Analyses with ELIA

[2602.18171] Click it or Leave it: Detecting and Spoiling Clickbait with Informativeness Measures and Large Language Models

[2602.18104] MeanVoiceFlow: One-step Nonparallel Voice Conversion with Mean Flows

[2602.18092] Perceived Political Bias in LLMs Reduces Persuasive Abilities

[2602.17830] Drift Estimation for Stochastic Differential Equations with Denoising Diffusion Models

[2602.17787] Market Games for Generative Models: Equilibria, Welfare, and Strategic Entry

[2602.18022] Dual-Channel Attention Guidance for Training-Free Image Editing Control in Diffusion Transformers

[2602.17773] Learning Flow Distributions via Projection-Constrained Diffusion on Manifolds

[2602.17770] CLUTCH: Contextualized Language model for Unlocking Text-Conditioned Hand motion modelling in the wild

[2602.17949] CUICurate: A GraphRAG-based Framework for Automated Clinical Concept Curation for NLP applications

Related Topics

Stay updated with AI News