Machine Learning

ML algorithms, training, and inference

Top This Week

LLMs

Arc Gate: LLM proxy that hits P=1.00 R=1.00 F1=1.00 on indirect/roleplay prompt injection (beats OpenAI Moderation and LlamaGuard)

Benchmarked on 40 out-of-distribution prompts: indirect requests, roleplay framings, hypothetical scenarios, technical phrasings. The stu...

Reddit - Artificial Intelligence · 1 min
Machine Learning

Visualizing Loss Landscapes of Neural Networks [P]

Hey r/MachineLearning, Visualizing the loss landscape of a neural network is notoriously tricky since we can't naturally comprehend milli...

Reddit - Machine Learning · 1 min
LLMs

The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]

TL;DR - I've written two novel functions that shape the training signal for LLMs. Early tests show people prefer responses from models tr...

Reddit - Machine Learning · 1 min

All Content

[2604.03376] VERT: Reliable LLM Judges for Radiology Report Evaluation
LLMs

Abstract page for arXiv paper 2604.03376: VERT: Reliable LLM Judges for Radiology Report Evaluation

arXiv - AI · 4 min
[2604.03356] Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing
LLMs

Abstract page for arXiv paper 2604.03356: Evaluating Artificial Intelligence Through a Christian Understanding of Human Flourishing

arXiv - AI · 3 min
[2604.03286] Toward Full Autonomous Laboratory Instrumentation Control with Large Language Models
LLMs

Abstract page for arXiv paper 2604.03286: Toward Full Autonomous Laboratory Instrumentation Control with Large Language Models

arXiv - AI · 3 min
[2604.03232] IC3-Evolve: Proof-/Witness-Gated Offline LLM-Driven Heuristic Evolution for IC3 Hardware Model Checking
LLMs

Abstract page for arXiv paper 2604.03232: IC3-Evolve: Proof-/Witness-Gated Offline LLM-Driven Heuristic Evolution for IC3 Hardware Model ...

arXiv - AI · 4 min
[2603.29171] Segmentation of Gray Matters and White Matters from Brain MRI data
LLMs

Abstract page for arXiv paper 2603.29171: Segmentation of Gray Matters and White Matters from Brain MRI data

arXiv - Machine Learning · 4 min
[2603.18104] Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI
Machine Learning

Abstract page for arXiv paper 2603.18104: Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometri...

arXiv - Machine Learning · 4 min
[2602.09924] LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations
LLMs

Abstract page for arXiv paper 2602.09924: LLMs Encode Their Failures: Predicting Success from Pre-Generation Activations

arXiv - AI · 3 min
[2602.09580] SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and Normalizing Flows
Machine Learning

Abstract page for arXiv paper 2602.09580: SERNF: Sample-Efficient Real-World Dexterous Policy Fine-Tuning via Action-Chunked Critics and ...

arXiv - Machine Learning · 4 min
[2602.01528] Making Bias Non-Predictive: Training Robust LLM Reasoning via Reinforcement Learning
LLMs

Abstract page for arXiv paper 2602.01528: Making Bias Non-Predictive: Training Robust LLM Reasoning via Reinforcement Learning

arXiv - Machine Learning · 4 min
[2601.22783] Compact Hypercube Embeddings for Fast Text-based Wildlife Observation Retrieval
LLMs

Abstract page for arXiv paper 2601.22783: Compact Hypercube Embeddings for Fast Text-based Wildlife Observation Retrieval

arXiv - Machine Learning · 4 min
[2601.22264] Predicting Intermittent Job Failure Categories for Diagnosis Using Few-Shot Fine-Tuned Language Models
LLMs

Abstract page for arXiv paper 2601.22264: Predicting Intermittent Job Failure Categories for Diagnosis Using Few-Shot Fine-Tuned Language...

arXiv - AI · 4 min
[2601.21670] Improving Multimodal Learning with Dispersive and Anchoring Regularization
Machine Learning

Abstract page for arXiv paper 2601.21670: Improving Multimodal Learning with Dispersive and Anchoring Regularization

arXiv - Machine Learning · 3 min
[2601.21343] Self-Improving Pretraining: using post-trained models to pretrain better models
LLMs

Abstract page for arXiv paper 2601.21343: Self-Improving Pretraining: using post-trained models to pretrain better models

arXiv - AI · 3 min
[2601.19376] Teaching Machine Learning Fundamentals with LEGO Robotics
Machine Learning

Abstract page for arXiv paper 2601.19376: Teaching Machine Learning Fundamentals with LEGO Robotics

arXiv - AI · 3 min
[2601.08950] ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors
LLMs

Abstract page for arXiv paper 2601.08950: ConvoLearn: A Dataset for Fine-Tuning Dialogic AI Tutors

arXiv - AI · 3 min
[2601.06338] Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers
Machine Learning

Abstract page for arXiv paper 2601.06338: Circuit Mechanisms for Spatial Relation Generation in Diffusion Transformers

arXiv - AI · 4 min
[2512.23850] The Drill-Down and Fabricate Test (DDFT): A Protocol for Measuring Epistemic Robustness in Language Models
LLMs

Abstract page for arXiv paper 2512.23850: The Drill-Down and Fabricate Test (DDFT): A Protocol for Measuring Epistemic Robustness in Lang...

arXiv - AI · 4 min
[2512.18503] NASTaR: NovaSAR Automated Ship Target Recognition Dataset
Machine Learning

Abstract page for arXiv paper 2512.18503: NASTaR: NovaSAR Automated Ship Target Recognition Dataset

arXiv - Machine Learning · 4 min
[2601.04854] Projected Autoregression: Autoregressive Language Generation in Continuous State Space
LLMs

Abstract page for arXiv paper 2601.04854: Projected Autoregression: Autoregressive Language Generation in Continuous State Space

arXiv - AI · 4 min
[2512.22227] Geometric Organization of Cognitive States in Transformer Embedding Spaces
LLMs

Abstract page for arXiv paper 2512.22227: Geometric Organization of Cognitive States in Transformer Embedding Spaces

arXiv - Machine Learning · 3 min