Generative AI

Image, video, audio, and text generation

Top This Week

Generative Ai

Midjourney has a new offer on the cancel page there is 20 off for 2 months

submitted by /u/RainDragonfly826 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup
Generative Ai

The Real Reason OpenAI Shut Sora Down Is a Warning to Every AI Startup

AI Tools & Products · 3 min ·
Really, you made this without AI? Prove it | The Verge
Generative Ai

Really, you made this without AI? Prove it | The Verge

Creatives want to start labeling human-made text, images, audio, and video with AI-free logos. Now they just have to pick one.

The Verge - AI · 10 min ·

All Content

[2602.16839] Training Large Reasoning Models Efficiently via Progressive Thought Encoding
Machine Learning

[2602.16839] Training Large Reasoning Models Efficiently via Progressive Thought Encoding

This paper presents Progressive Thought Encoding, a novel method for training large reasoning models (LRMs) that enhances efficiency and ...

arXiv - Machine Learning · 4 min ·
[2602.16796] Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning
Machine Learning

[2602.16796] Efficient Tail-Aware Generative Optimization via Flow Model Fine-Tuning

This article presents Tail-aware Flow Fine-Tuning (TFFT), a novel algorithm that optimizes generative models by controlling tail behavior...

arXiv - Machine Learning · 4 min ·
[2602.16784] Omitted Variable Bias in Language Models Under Distribution Shift
Llms

[2602.16784] Omitted Variable Bias in Language Models Under Distribution Shift

This paper explores omitted variable bias in language models under distribution shifts, proposing a framework to evaluate and optimize pe...

arXiv - Machine Learning · 3 min ·
[2602.09437] Diffusion-Guided Pretraining for Brain Graph Foundation Models
Llms

[2602.09437] Diffusion-Guided Pretraining for Brain Graph Foundation Models

The paper presents a diffusion-guided pretraining framework for brain graph models, addressing limitations in existing methods for learni...

arXiv - AI · 4 min ·
[2602.06355] Di3PO - Diptych Diffusion DPO for Targeted Improvements in Image Generation
Machine Learning

[2602.06355] Di3PO - Diptych Diffusion DPO for Targeted Improvements in Image Generation

The paper presents Di3PO, a novel method for improving image generation in text-to-image diffusion models by efficiently creating targete...

arXiv - AI · 3 min ·
[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students
Generative Ai

[2601.08697] Auditing Student-AI Collaboration: A Case Study of Online Graduate CS Students

This study audits the collaboration between online graduate CS students and AI, exploring preferences for automation in academic tasks an...

arXiv - AI · 3 min ·
[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment
Machine Learning

[2601.01224] Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

This paper presents Contrastive Object-centric Diffusion Alignment (CODA), an enhancement to object-centric learning that reduces slot en...

arXiv - AI · 4 min ·
[2511.18696] Empathetic Cascading Networks: A Multi-Stage Prompting Technique for Reducing Social Biases in Large Language Models
Llms

[2511.18696] Empathetic Cascading Networks: A Multi-Stage Prompting Technique for Reducing Social Biases in Large Language Models

The paper presents Empathetic Cascading Networks (ECN), a multi-stage prompting technique aimed at enhancing the empathetic responses of ...

arXiv - AI · 3 min ·
[2511.07989] State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?
Llms

[2511.07989] State of the Art in Text Classification for South Slavic Languages: Fine-Tuning or Prompting?

This article evaluates the performance of language models in text classification tasks for South Slavic languages, comparing fine-tuned B...

arXiv - AI · 4 min ·
[2510.24983] LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies
Generative Ai

[2510.24983] LRT-Diffusion: Calibrated Risk-Aware Guidance for Diffusion Policies

LRT-Diffusion introduces a risk-aware sampling method for diffusion policies in offline reinforcement learning, enhancing decision-making...

arXiv - AI · 4 min ·
[2510.15297] VERA-MH Concept Paper
Machine Learning

[2510.15297] VERA-MH Concept Paper

The VERA-MH Concept Paper outlines an innovative framework for evaluating AI chatbots in mental health contexts, focusing on suicide risk...

arXiv - AI · 4 min ·
[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation
Machine Learning

[2510.14974] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation

The paper presents pi-Flow, a novel approach to few-step generation in machine learning that utilizes imitation distillation to enhance m...

arXiv - AI · 4 min ·
[2510.09201] Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs
Llms

[2510.09201] Multimodal Prompt Optimization: Why Not Leverage Multiple Modalities for MLLMs

This article introduces the concept of multimodal prompt optimization for Multimodal Large Language Models (MLLMs), proposing a new frame...

arXiv - AI · 4 min ·
[2510.03352] Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction
Machine Learning

[2510.03352] Inference-Time Search Using Side Information for Diffusion-Based Image Reconstruction

This article presents a novel inference-time search algorithm that enhances diffusion-based image reconstruction by utilizing side inform...

arXiv - Machine Learning · 4 min ·
[2509.24368] Watermarking Diffusion Language Models
Llms

[2509.24368] Watermarking Diffusion Language Models

This article presents a novel watermarking technique specifically designed for diffusion language models (DLMs), addressing challenges in...

arXiv - AI · 3 min ·
[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration
Generative Ai

[2509.11461] CareerPooler: AI-Powered Metaphorical Pool Simulation Improves Experience and Outcomes in Career Exploration

CareerPooler introduces an AI-driven metaphorical simulation for career exploration, enhancing user engagement and decision-making throug...

arXiv - AI · 3 min ·
[2506.02529] Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs
Llms

[2506.02529] Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs

This paper presents an automated system for generating end-to-end test cases for web applications using large language models and screen ...

arXiv - AI · 4 min ·
[2510.19771] Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents
Llms

[2510.19771] Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents

The paper presents PROBE, a new framework for measuring proactive problem-solving capabilities in LLM agents, highlighting their limitati...

arXiv - AI · 4 min ·
[2503.23339] A Scalable Framework for Evaluating Health Language Models
Llms

[2503.23339] A Scalable Framework for Evaluating Health Language Models

This paper presents a scalable framework for evaluating health language models, introducing Adaptive Precise Boolean rubrics to enhance e...

arXiv - AI · 4 min ·
[2410.13957] Goal Inference from Open-Ended Dialog
Llms

[2410.13957] Goal Inference from Open-Ended Dialog

The paper discusses a method for embodied AI agents to infer user goals from open-ended dialogues using Large Language Models (LLMs), emp...

arXiv - Machine Learning · 4 min ·
Previous Page 71 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime