Generative AI

Image, video, audio, and text generation

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Generative Ai

Midjourney has a new offer on the cancel page there is 20 off for 2 months

submitted by /u/RainDragonfly826 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 13 hours ago

Generative Ai

The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup

AI Tools & Products · 3 min · 1 day ago

Generative Ai

Really, you made this without AI? Prove it | The Verge

Creatives want to start labeling human-made text, images, audio, and video with AI-free logos. Now they just have to pick one.

The Verge - AI · 10 min · 1 day ago

All Content

Machine Learning

[2602.16839] Training Large Reasoning Models Efficiently via Progressive Thought Encoding

This paper presents Progressive Thought Encoding, a novel method for training large reasoning models (LRMs) that enhances efficiency and ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.16796] Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning

This article presents Tail-aware Flow Fine-Tuning (TFFT), a novel algorithm that optimizes generative models by controlling tail behavior...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.16784] Omitted Variable Bias in Language Models Under Distribution Shift

This paper explores omitted variable bias in language models under distribution shifts, proposing a framework to evaluate and optimize pe...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.09437] Diffusion-Guided Pretraining for Brain Graph Foundation Models

The paper presents a diffusion-guided pretraining framework for brain graph models, addressing limitations in existing methods for learni...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.06355] Di3PO - Diptych Diffusion DPO for Targeted Improvements in Image Generation

The paper presents Di3PO, a novel method for improving image generation in text-to-image diffusion models by efficiently creating targete...

arXiv - AI · 3 min · about 1 month ago

Generative Ai

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students

This study audits the collaboration between online graduate CS students and AI, exploring preferences for automation in academic tasks an...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

This paper presents Contrastive Object-centric Diffusion Alignment (CODA), an enhancement to object-centric learning that reduces slot en...

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.18696] Empathetic Cascading Networks: A Multi-Stage Prompting Technique for Reducing Social Biases in Large Language Models

The paper presents Empathetic Cascading Networks (ECN), a multi-stage prompting technique aimed at enhancing the empathetic responses of ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2511.07989] State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

This article evaluates the performance of language models in text classification tasks for South Slavic languages, comparing fine-tuned B...

arXiv - AI · 4 min · about 1 month ago

Generative Ai

[2510.24983] LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies

LRT-Diffusion introduces a risk-aware sampling method for diffusion policies in offline reinforcement learning, enhancing decision-making...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.15297] VERA-MH Concept Paper

The VERA-MH Concept Paper outlines an innovative framework for evaluating AI chatbots in mental health contexts, focusing on suicide risk...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

The paper presents pi-Flow, a novel approach to few-step generation in machine learning that utilizes imitation distillation to enhance m...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.09201] Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

This article introduces the concept of multimodal prompt optimization for Multimodal Large Language Models (MLLMs), proposing a new frame...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2510.03352] Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction

This article presents a novel inference-time search algorithm that enhances diffusion-based image reconstruction by utilizing side inform...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2509.24368] Watermarking Diffusion Language Models

This article presents a novel watermarking technique specifically designed for diffusion language models (DLMs), addressing challenges in...

arXiv - AI · 3 min · about 1 month ago

Generative Ai

[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration

CareerPooler introduces an AI-driven metaphorical simulation for career exploration, enhancing user engagement and decision-making throug...

arXiv - AI · 3 min · about 1 month ago

Llms

[2506.02529] Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs

This paper presents an automated system for generating end-to-end test cases for web applications using large language models and screen ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.19771] Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents

The paper presents PROBE, a new framework for measuring proactive problem-solving capabilities in LLM agents, highlighting their limitati...

arXiv - AI · 4 min · about 1 month ago

Llms

[2503.23339] A Scalable Framework for Evaluating Health Language Models

This paper presents a scalable framework for evaluating health language models, introducing Adaptive Precise Boolean rubrics to enhance e...

arXiv - AI · 4 min · about 1 month ago

Llms

[2410.13957] Goal Inference from Open-Ended Dialog

The paper discusses a method for embodied AI agents to infer user goals from open-ended dialogues using Large Language Models (LLMs), emp...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 71 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Generative AI

Top This Week

Midjourney has a new offer on the cancel page there is 20 off for 2 months

The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup

Really, you made this without AI? Prove it | The Verge

All Content

[2602.16839] Training Large Reasoning Models Efficiently via Progressive Thought Encoding

[2602.16796] Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning

[2602.16784] Omitted Variable Bias in Language Models Under Distribution Shift

[2602.09437] Diffusion-Guided Pretraining for Brain Graph Foundation Models

[2602.06355] Di3PO - Diptych Diffusion DPO for Targeted Improvements in Image Generation

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students

[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

[2511.18696] Empathetic Cascading Networks: A Multi-Stage Prompting Technique for Reducing Social Biases in Large Language Models

[2511.07989] State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

[2510.24983] LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies

[2510.15297] VERA-MH Concept Paper

[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

[2510.09201] Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

[2510.03352] Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction

[2509.24368] Watermarking Diffusion Language Models

[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration

[2506.02529] Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs

[2510.19771] Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents

[2503.23339] A Scalable Framework for Evaluating Health Language Models

[2410.13957] Goal Inference from Open-Ended Dialog

Related Topics

Stay updated with AI News