Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

The problem If you work with Italian text and local models, you know the pain. Every open-source LLM out there treats Italian as an after...

Reddit - Machine Learning · 1 min · 21 minutes ago

Machine Learning

[R] Architecture Determines Optimization: Deriving Weight Updates from Network Topology (seeking arXiv endorsement - cs.LG)

Abstract: We derive neural network weight updates from first principles without assuming gradient descent or a specific loss function. St...

Reddit - Machine Learning · 1 min · about 3 hours ago

Machine Learning

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

Hey all, I recently built an end-to-end fraud detection project using a large banking dataset: Trained an XGBoost model Used Databricks f...

Reddit - Machine Learning · 1 min · about 4 hours ago

All Content

Machine Learning

[2409.17517] Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

Abstract page for arXiv paper 2409.17517: Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

arXiv - AI · 4 min · 12 days ago

Llms

[2603.23146] Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy

Abstract page for arXiv paper 2603.23146: Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2406.10538] Addressing Large Action Spaces in 3D Floorplanning via Spatial Generalization

Abstract page for arXiv paper 2406.10538: Addressing Large Action Spaces in 3D Floorplanning via Spatial Generalization

arXiv - Machine Learning · 4 min · 12 days ago

Llms

[2603.23073] Can an LLM Detect Instances of Microservice Infrastructure Patterns?

Abstract page for arXiv paper 2603.23073: Can an LLM Detect Instances of Microservice Infrastructure Patterns?

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.23069] AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing

Abstract page for arXiv paper 2603.23069: AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing

arXiv - AI · 3 min · 12 days ago

Machine Learning

[2406.07598] Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency

Abstract page for arXiv paper 2406.07598: Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency

arXiv - Machine Learning · 3 min · 12 days ago

Machine Learning

[2406.01825] Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

Abstract page for arXiv paper 2406.01825: Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2402.12149] MLFEF: Machine Learning Fusion Model with Empirical Formula to Explore the Momentum in Competitive Sports

Abstract page for arXiv paper 2402.12149: MLFEF: Machine Learning Fusion Model with Empirical Formula to Explore the Momentum in Competit...

arXiv - Machine Learning · 4 min · 12 days ago

Machine Learning

[2603.23048] MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates

Abstract page for arXiv paper 2603.23048: MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates

arXiv - AI · 3 min · 12 days ago

Llms

[2603.23495] VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions

Abstract page for arXiv paper 2603.23495: VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language ...

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.23047] Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation

Abstract page for arXiv paper 2603.23047: Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.23481] VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

Abstract page for arXiv paper 2603.23481: VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.23356] Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors

Abstract page for arXiv paper 2603.23356: Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors

arXiv - AI · 4 min · 12 days ago

Machine Learning

[2603.23030] Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation

Abstract page for arXiv paper 2603.23030: Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic...

arXiv - AI · 4 min · 12 days ago

Llms

[2603.23311] ARGENT: Adaptive Hierarchical Image-Text Representations

Abstract page for arXiv paper 2603.23311: ARGENT: Adaptive Hierarchical Image-Text Representations

arXiv - Machine Learning · 3 min · 12 days ago

Machine Learning

[2603.23020] Concept-based explanations of Segmentation and Detection models in Natural Disaster Management

Abstract page for arXiv paper 2603.23020: Concept-based explanations of Segmentation and Detection models in Natural Disaster Management

arXiv - AI · 4 min · 12 days ago

Llms

[2603.23219] Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

Abstract page for arXiv paper 2603.23219: Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

arXiv - Machine Learning · 4 min · 12 days ago

Machine Learning

[2603.23007] AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

Abstract page for arXiv paper 2603.23007: AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screensho...

arXiv - AI · 4 min · 12 days ago

Llms

[2603.23269] Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

Abstract page for arXiv paper 2603.23269: Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

arXiv - AI · 4 min · 12 days ago

Llms

[2603.23251] Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguistic Style in English and Arabic with Six Large Language Models

Abstract page for arXiv paper 2603.23251: Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguist...

arXiv - Machine Learning · 4 min · 12 days ago

Previous Page 113 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

[R] Architecture Determines Optimization: Deriving Weight Updates from Network Topology (seeking arXiv endorsement - cs.LG)

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

All Content

[2409.17517] Dataset Distillation-based Hybrid Federated Learning on Non-IID Data

[2603.23146] Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy

[2406.10538] Addressing Large Action Spaces in 3D Floorplanning via Spatial Generalization

[2603.23073] Can an LLM Detect Instances of Microservice Infrastructure Patterns?

[2603.23069] AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing

[2406.07598] Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency

[2406.01825] Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

[2402.12149] MLFEF: Machine Learning Fusion Model with Empirical Formula to Explore the Momentum in Competitive Sports

[2603.23048] MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates

[2603.23495] VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions

[2603.23047] Parametric Knowledge and Retrieval Behavior in RAG Fine-Tuning for Electronic Design Automation

[2603.23481] VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

[2603.23356] Contrastive Metric Learning for Point Cloud Segmentation in Highly Granular Detectors

[2603.23030] Looking Beyond the Window: Global-Local Aligned CLIP for Training-free Open-Vocabulary Semantic Segmentation

[2603.23311] ARGENT: Adaptive Hierarchical Image-Text Representations

[2603.23020] Concept-based explanations of Segmentation and Detection models in Natural Disaster Management

[2603.23219] Decoding AI Authorship: Can LLMs Truly Mimic Human Style Across Literature and Politics?

[2603.23007] AgentRAE: Remote Action Execution through Notification-based Visual Backdoors against Screenshots-based Mobile GUI Agents

[2603.23269] Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs

[2603.23251] Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguistic Style in English and Arabic with Six Large Language Models

Related Topics

Stay updated with AI News