Generative AI

Image, video, audio, and text generation

Top This Week

Generative Ai

Will Generative AI apps remain a revenue powerhouse in 2026?

AI Tools & Products · 1 min ·
Machine Learning

[D] USQL Joins Were Cool, But Now I Want to Join the GenAI Party

Hi Experts, I have 1.5 years of experience in Data Engineering, and now I want to start learning AI, ML, and Generative AI. I already hav...

Reddit - Machine Learning · 1 min ·
Report says Minnesota workers face highest generative AI exposure in the Midwest
Generative Ai

Report says Minnesota workers face highest generative AI exposure in the Midwest

A report from North Star Policy Action says Minnesota workers have the highest generative AI exposure in the Midwest and the 10th-highest...

AI Tools & Products · 6 min ·

All Content

[2512.04552] RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS
Llms

[2512.04552] RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS

The paper presents Robust Reward Policy Optimization (RRPO), a novel framework designed to enhance emotional text-to-speech (TTS) systems...

arXiv - AI · 4 min ·
[2509.20928] Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting
Machine Learning

[2509.20928] Conditionally Whitened Generative Models for Probabilistic Time Series Forecasting

The paper introduces Conditionally Whitened Generative Models (CW-Gen) for probabilistic time series forecasting, addressing challenges l...

arXiv - Machine Learning · 4 min ·
[2510.22876] Batch Speculative Decoding Done Right
Nlp

[2510.22876] Batch Speculative Decoding Done Right

The paper presents a novel framework for batch speculative decoding, addressing critical failures in existing methods and achieving signi...

arXiv - AI · 4 min ·
[2507.07139] Image Can Bring Your Memory Back: A Novel Multi-Modal Guided Attack against Image Generation Model Unlearning
Machine Learning

[2507.07139] Image Can Bring Your Memory Back: A Novel Multi-Modal Guided Attack against Image Generation Model Unlearning

The paper presents Recall, a novel adversarial framework that targets the robustness of image generation model unlearning, revealing vuln...

arXiv - Machine Learning · 4 min ·
[2510.04398] SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations
Llms

[2510.04398] SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

The paper presents SECA, a method for eliciting hallucinations in large language models (LLMs) through semantically equivalent and cohere...

arXiv - Machine Learning · 4 min ·
[2510.02356] Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark
Llms

[2510.02356] Measuring Physical-World Privacy Awareness of Large Language Models: An Evaluation Benchmark

This article presents EAPrivacy, a benchmark for evaluating the physical-world privacy awareness of large language models (LLMs), reveali...

arXiv - AI · 4 min ·
[2510.00232] BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses
Llms

[2510.00232] BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

The paper introduces BiasFreeBench, a benchmark designed to evaluate bias mitigation techniques in large language models (LLMs) by provid...

arXiv - Machine Learning · 4 min ·
[2509.18776] AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
Llms

[2509.18776] AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field

The paper introduces AECBench, a benchmark for evaluating large language models (LLMs) in the Architecture, Engineering, and Construction...

arXiv - Machine Learning · 4 min ·
[2503.10522] AudioX: A Unified Framework for Anything-to-Audio Generation
Machine Learning

[2503.10522] AudioX: A Unified Framework for Anything-to-Audio Generation

AudioX presents a unified framework for generating audio from various multimodal inputs, enhancing the quality and flexibility of audio g...

arXiv - Machine Learning · 4 min ·
[2411.01629] Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity
Machine Learning

[2411.01629] Denoising Diffusions with Optimal Transport: Localization, Curvature, and Multi-Scale Complexity

This paper explores denoising diffusions using optimal transport, focusing on localization, curvature, and multi-scale complexity in gene...

arXiv - Machine Learning · 4 min ·
[2508.21285] A Financial Brain Scan of the LLM
Llms

[2508.21285] A Financial Brain Scan of the LLM

This article presents a novel approach to analyzing large language models (LLMs) in finance, enabling researchers to identify and manipul...

arXiv - AI · 3 min ·
[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation
Generative Ai

[2508.18210] Why Synthetic Isn't Real Yet: A Diagnostic Framework for Contact Center Dialogue Generation

This article presents a diagnostic framework for evaluating synthetic dialogue generation in contact centers, highlighting the limitation...

arXiv - AI · 4 min ·
[2307.14397] A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot
Machine Learning

[2307.14397] A Survey on Generative Modeling with Limited Data, Few Shots, and Zero Shot

This survey explores generative modeling under constraints of limited data, few shots, and zero shots, presenting challenges and methodol...

arXiv - Machine Learning · 4 min ·
[2507.04704] SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes
Machine Learning

[2507.04704] SPATIA: Multimodal Generation and Prediction of Spatial Cell Phenotypes

The paper introduces SPATIA, a novel multimodal model for predicting spatial cell phenotypes by integrating cellular morphology, gene exp...

arXiv - AI · 4 min ·
[2602.06801] On the Non-Identifiability of Steering Vectors in Large Language Models
Llms

[2602.06801] On the Non-Identifiability of Steering Vectors in Large Language Models

This paper explores the non-identifiability of steering vectors in large language models (LLMs), revealing that these vectors cannot be u...

arXiv - AI · 3 min ·
[2506.04051] High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning
Llms

[2506.04051] High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

The paper presents HALT, a method for finetuning large language models (LLMs) to enhance reliability by generating responses only when co...

arXiv - AI · 4 min ·
[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective
Machine Learning

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective

This paper introduces Accelerated Sequential Flow Matching, a Bayesian filtering framework that enhances real-time inference in stochasti...

arXiv - Machine Learning · 4 min ·
[2506.03407] Multi-Spectral Gaussian Splatting with Neural Color Representation
Machine Learning

[2506.03407] Multi-Spectral Gaussian Splatting with Neural Color Representation

The paper presents MS-Splatting, a novel multi-spectral 3D Gaussian Splatting framework that generates consistent views from images captu...

arXiv - Machine Learning · 4 min ·
[2602.00628] From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs
Llms

[2602.00628] From Associations to Activations: Comparing Behavioral and Hidden-State Semantic Geometry in LLMs

This paper examines the relationship between behavioral and hidden-state semantic geometry in large language models (LLMs) through psycho...

arXiv - AI · 3 min ·
[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation
Llms

[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation

The paper presents Caprese, a low-rank distillation method designed to enhance reasoning capabilities in large language models (LLMs) whi...

arXiv - Machine Learning · 3 min ·
Previous Page 90 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime