Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO written from scratch in PyTorch - updates! [P]

So, yesterday's run was a success, and I did get an average rollout length of about 64 tokens, as shown in the attached image! This was with quality_re...

Reddit - Machine Learning · 1 min ·
Machine Learning

Thoughts and experience on ML journals [D]

Recently I’ve been thinking about shifting from conferences to journals due to a few bad experiences with ML conferences reviewing proces...

Reddit - Machine Learning · 1 min ·
LLMs

Gemini Robotics-ER 1.6: Powering real-world robotics tasks through enhanced embodied reasoning

Gemini Robotics-ER 1.6 is a significant upgrade to the reasoning-first model that enables robots to understand their environments with un...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.26680] AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment
LLMs

Abstract page for arXiv paper 2603.26680: AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Align...

arXiv - AI · 4 min ·
[2603.26678] Power Couple? AI Growth and Renewable Energy Investment
Machine Learning

Abstract page for arXiv paper 2603.26678: Power Couple? AI Growth and Renewable Energy Investment

arXiv - AI · 4 min ·
[2603.26676] Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift
Machine Learning

Abstract page for arXiv paper 2603.26676: Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

arXiv - AI · 3 min ·
[2603.26673] Can AI be a Teaching Partner? Evaluating ChatGPT, Gemini, and DeepSeek across Three Teaching Strategies
LLMs

Abstract page for arXiv paper 2603.26673: Can AI be a Teaching Partner? Evaluating ChatGPT, Gemini, and DeepSeek across Three Teaching St...

arXiv - AI · 4 min ·
[2603.26668] Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter
LLMs

Abstract page for arXiv paper 2603.26668: Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm With Cuckoo ...

arXiv - AI · 3 min ·
[2603.26667] M-RAG: Making RAG Faster, Stronger, and More Efficient
LLMs

Abstract page for arXiv paper 2603.26667: M-RAG: Making RAG Faster, Stronger, and More Efficient

arXiv - AI · 4 min ·
[2603.25240] Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells
LLMs

Abstract page for arXiv paper 2603.25240: Lingshu-Cell: A generative cellular world model for transcriptome modeling toward virtual cells

arXiv - AI · 4 min ·
[2506.12433] Exploring Cultural Variations in Moral Judgments with Large Language Models
LLMs

Abstract page for arXiv paper 2506.12433: Exploring Cultural Variations in Moral Judgments with Large Language Models

arXiv - AI · 4 min ·
[2603.28651] Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning
LLMs

Abstract page for arXiv paper 2603.28651: Not Search, But Scan: Benchmarking MLLMs on Scan-Oriented Academic Paper Reasoning

arXiv - AI · 4 min ·
[2603.28643] The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGENIE from its Bottle
LLMs

Abstract page for arXiv paper 2603.28643: The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AI...

arXiv - AI · 4 min ·
[2603.28618] Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning
LLMs

Abstract page for arXiv paper 2603.28618: Seeing with You: Perception-Reasoning Coevolution for Multimodal Reasoning

arXiv - AI · 3 min ·
[2603.28590] MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models
LLMs

Abstract page for arXiv paper 2603.28590: MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language M...

arXiv - AI · 4 min ·
[2603.28558] T-Norm Operators for EU AI Act Compliance Classification: An Empirical Comparison of Lukasiewicz, Product, and Gödel Semantics in a Neuro-Symbolic Reasoning System
Machine Learning

Abstract page for arXiv paper 2603.28558: T-Norm Operators for EU AI Act Compliance Classification: An Empirical Comparison of Lukasiewic...

arXiv - AI · 4 min ·
[2603.28444] Entropic Claim Resolution: Uncertainty-Driven Evidence Selection for RAG
Machine Learning

Abstract page for arXiv paper 2603.28444: Entropic Claim Resolution: Uncertainty-Driven Evidence Selection for RAG

arXiv - AI · 3 min ·
[2603.28386] COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments via Two-Player Zero-Sum Game
LLMs

Abstract page for arXiv paper 2603.28386: COvolve: Adversarial Co-Evolution of Large-Language-Model-Generated Policies and Environments v...

arXiv - AI · 4 min ·
[2603.28361] Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science
LLMs

Abstract page for arXiv paper 2603.28361: Deep Research of Deep Research: From Transformer to Agent, From AI to AI for Science

arXiv - AI · 4 min ·
[2603.28360] CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems
LLMs

Abstract page for arXiv paper 2603.28360: CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems

arXiv - AI · 4 min ·
[2603.28295] Evaluating LLMs for Answering Student Questions in Introductory Programming Courses
LLMs

Abstract page for arXiv paper 2603.28295: Evaluating LLMs for Answering Student Questions in Introductory Programming Courses

arXiv - AI · 4 min ·
[2603.28248] Reasoning as Energy Minimization over Structured Latent Trajectories
Machine Learning

Abstract page for arXiv paper 2603.28248: Reasoning as Energy Minimization over Structured Latent Trajectories

arXiv - AI · 4 min ·
[2603.28197] EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling
LLMs

Abstract page for arXiv paper 2603.28197: EpiPersona: Persona Projection and Episode Coupling for Pluralistic Preference Modeling

arXiv - AI · 3 min ·