Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

[R] Architecture Determines Optimization: Deriving Weight Updates from Network Topology (seeking arXiv endorsement - cs.LG)

Abstract: We derive neural network weight updates from first principles without assuming gradient descent or a specific loss function. St...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

Hey all, I recently built an end-to-end fraud detection project using a large banking dataset: Trained an XGBoost model Used Databricks f...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] The memory chip market lost tens of billions over a paper this community would have understood in 10 minutes

TurboQuant was teased recently and tens of billions gone from memory chip market in 48 hours but anyone in this community who read the pa...

Reddit - Machine Learning · 1 min ·

All Content

[2508.16915] Reinforcement-Guided Hyper-Heuristic Hyperparameter Optimization for Fair and Explainable Spiking Neural Network-Based Financial Fraud Detection
Machine Learning

[2508.16915] Reinforcement-Guided Hyper-Heuristic Hyperparameter Optimization for Fair and Explainable Spiking Neural Network-Based Financial Fraud Detection

Abstract page for arXiv paper 2508.16915: Reinforcement-Guided Hyper-Heuristic Hyperparameter Optimization for Fair and Explainable Spiki...

arXiv - Machine Learning · 4 min ·
[2603.23342] Edge Radar Material Classification Under Geometry Shifts
Machine Learning

[2603.23342] Edge Radar Material Classification Under Geometry Shifts

Abstract page for arXiv paper 2603.23342: Edge Radar Material Classification Under Geometry Shifts

arXiv - AI · 3 min ·
[2507.00026] RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models
Llms

[2507.00026] RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models

Abstract page for arXiv paper 2507.00026: RedTopic: Toward Topic-Diverse Red Teaming of Large Language Models

arXiv - AI · 4 min ·
[2603.23319] WISTERIA: Weak Implicit Signal-based Temporal Relation Extraction with Attention
Machine Learning

[2603.23319] WISTERIA: Weak Implicit Signal-based Temporal Relation Extraction with Attention

Abstract page for arXiv paper 2603.23319: WISTERIA: Weak Implicit Signal-based Temporal Relation Extraction with Attention

arXiv - AI · 3 min ·
[2506.22039] UniCA: Unified Covariate Adaptation for Time Series Foundation Model
Llms

[2506.22039] UniCA: Unified Covariate Adaptation for Time Series Foundation Model

Abstract page for arXiv paper 2506.22039: UniCA: Unified Covariate Adaptation for Time Series Foundation Model

arXiv - AI · 4 min ·
[2603.23308] Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression
Llms

[2603.23308] Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression

Abstract page for arXiv paper 2603.23308: Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constraine...

arXiv - AI · 4 min ·
[2506.08916] Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL)
Machine Learning

[2506.08916] Enhancing generalizability of model discovery across parameter space with multi-experiment equation learning (ME-EQL)

Abstract page for arXiv paper 2506.08916: Enhancing generalizability of model discovery across parameter space with multi-experiment equa...

arXiv - Machine Learning · 4 min ·
[2603.23300] Designing Agentic AI-Based Screening for Portfolio Investment
Llms

[2603.23300] Designing Agentic AI-Based Screening for Portfolio Investment

Abstract page for arXiv paper 2603.23300: Designing Agentic AI-Based Screening for Portfolio Investment

arXiv - AI · 3 min ·
[2505.20881] Generalizable Heuristic Generation Through LLMs with Meta-Optimization
Llms

[2505.20881] Generalizable Heuristic Generation Through LLMs with Meta-Optimization

Abstract page for arXiv paper 2505.20881: Generalizable Heuristic Generation Through LLMs with Meta-Optimization

arXiv - AI · 4 min ·
[2505.18179] GAIA: A Foundation Model for Operational Atmospheric Dynamics
Llms

[2505.18179] GAIA: A Foundation Model for Operational Atmospheric Dynamics

Abstract page for arXiv paper 2505.18179: GAIA: A Foundation Model for Operational Atmospheric Dynamics

arXiv - AI · 4 min ·
[2603.23279] Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook
Llms

[2603.23279] Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook

Abstract page for arXiv paper 2603.23279: Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook

arXiv - AI · 4 min ·
[2505.00333] Two Stage Wireless Federated LoRA Fine-Tuning with Sparsified Orthogonal Updates
Llms

[2505.00333] Two Stage Wireless Federated LoRA Fine-Tuning with Sparsified Orthogonal Updates

Abstract page for arXiv paper 2505.00333: Two Stage Wireless Federated LoRA Fine-Tuning with Sparsified Orthogonal Updates

arXiv - Machine Learning · 4 min ·
[2504.14094] Leakage and Interpretability in Concept-Based Models
Machine Learning

[2504.14094] Leakage and Interpretability in Concept-Based Models

Abstract page for arXiv paper 2504.14094: Leakage and Interpretability in Concept-Based Models

arXiv - AI · 3 min ·
[2503.10404] Architecture-Aware Minimization (A$^2$M): How to Find Flat Minima in Neural Architecture Search
Machine Learning

[2503.10404] Architecture-Aware Minimization (A$^2$M): How to Find Flat Minima in Neural Architecture Search

Abstract page for arXiv paper 2503.10404: Architecture-Aware Minimization (A$^2$M): How to Find Flat Minima in Neural Architecture Search

arXiv - Machine Learning · 4 min ·
[2603.23252] AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN
Machine Learning

[2603.23252] AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN

Abstract page for arXiv paper 2603.23252: AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN

arXiv - AI · 3 min ·
[2502.07861] Streaming Attention Approximation via Discrepancy Theory
Llms

[2502.07861] Streaming Attention Approximation via Discrepancy Theory

Abstract page for arXiv paper 2502.07861: Streaming Attention Approximation via Discrepancy Theory

arXiv - AI · 3 min ·
[2501.02949] MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification
Machine Learning

[2501.02949] MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

Abstract page for arXiv paper 2501.02949: MSA-CNN: A Lightweight Multi-Scale CNN with Attention for Sleep Stage Classification

arXiv - Machine Learning · 4 min ·
[2603.23184] ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment
Llms

[2603.23184] ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Abstract page for arXiv paper 2603.23184: ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

arXiv - AI · 4 min ·
[2412.05430] DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA
Llms

[2412.05430] DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA

Abstract page for arXiv paper 2412.05430: DART-Eval: A Comprehensive DNA Language Model Evaluation Benchmark on Regulatory DNA

arXiv - Machine Learning · 4 min ·
[2409.17517] Dataset Distillation-based Hybrid Federated Learning on Non-IID Data
Machine Learning

[2409.17517] Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

Abstract page for arXiv paper 2409.17517: Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

arXiv - AI · 4 min ·
Previous Page 112 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime