Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · 4 minutes ago

Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min · about 1 hour ago

Machine Learning

New technique makes AI models leaner and faster while they’re still learning

AI News - General · 9 min · about 1 hour ago

All Content

Machine Learning

[2510.14949] DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

Abstract page for arXiv paper 2510.14949: DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2510.17018] CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanced Toxic Comment Classification

Abstract page for arXiv paper 2510.17018: CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanc...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2509.22630] StateX: Enhancing RNN Recall via Post-training State Expansion

Abstract page for arXiv paper 2509.22630: StateX: Enhancing RNN Recall via Post-training State Expansion

arXiv - AI · 3 min · 27 days ago

Llms

[2509.17183] LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

Abstract page for arXiv paper 2509.17183: LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Prefere...

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2509.02617] Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry

Abstract page for arXiv paper 2509.02617: Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on ...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2506.18601] BulletGen: Improving 4D Reconstruction with Bullet-Time Generation

Abstract page for arXiv paper 2506.18601: BulletGen: Improving 4D Reconstruction with Bullet-Time Generation

arXiv - AI · 3 min · 27 days ago

Machine Learning

[2506.03863] STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Abstract page for arXiv paper 2506.03863: STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2504.14135] Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering

Abstract page for arXiv paper 2504.14135: Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2502.17421] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

Abstract page for arXiv paper 2502.17421: LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2411.05183] Why CNN Features Are not Gaussian: A Statistical Anatomy of Deep Representations

Abstract page for arXiv paper 2411.05183: Why CNN Features Are not Gaussian: A Statistical Anatomy of Deep Representations

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2409.01962] Attentive Dilated Convolution for Automatic Sleep Staging using Force-directed Layout

Abstract page for arXiv paper 2409.01962: Attentive Dilated Convolution for Automatic Sleep Staging using Force-directed Layout

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2407.14971] Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

Abstract page for arXiv paper 2407.14971: Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-...

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2405.06535] Controllable Image Generation with Composed Parallel Token Prediction

Abstract page for arXiv paper 2405.06535: Controllable Image Generation with Composed Parallel Token Prediction

arXiv - Machine Learning · 3 min · 27 days ago

Machine Learning

[2305.02657] On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains

Abstract page for arXiv paper 2305.02657: On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on ...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2402.15095] The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional regime

Abstract page for arXiv paper 2402.15095: The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional ...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2512.24062] Energy-Balanced Hyperspherical Graph Representation Learning via Structural Binding and Entropic Dispersion

Abstract page for arXiv paper 2512.24062: Energy-Balanced Hyperspherical Graph Representation Learning via Structural Binding and Entropi...

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2512.15605] Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction

Abstract page for arXiv paper 2512.15605: Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Ca...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2512.13592] Image Diffusion Preview with Consistency Solver

Abstract page for arXiv paper 2512.13592: Image Diffusion Preview with Consistency Solver

arXiv - Machine Learning · 3 min · 27 days ago

Machine Learning

[2512.07988] HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

Abstract page for arXiv paper 2512.07988: HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

arXiv - Machine Learning · 3 min · 27 days ago

Llms

[2511.01831] Routing-Based Continual Learning for Multimodal Large Language Models

Abstract page for arXiv paper 2511.01831: Routing-Based Continual Learning for Multimodal Large Language Models

arXiv - AI · 4 min · 27 days ago

Previous Page 333 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Accelerating science with AI and simulations

Improving AI models’ ability to explain their predictions

New technique makes AI models leaner and faster while they’re still learning

All Content

[2510.14949] DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

[2510.17018] CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanced Toxic Comment Classification

[2509.22630] StateX: Enhancing RNN Recall via Post-training State Expansion

[2509.17183] LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

[2509.02617] Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry

[2506.18601] BulletGen: Improving 4D Reconstruction with Bullet-Time Generation

[2506.03863] STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

[2504.14135] Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering

[2502.17421] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

[2411.05183] Why CNN Features Are not Gaussian: A Statistical Anatomy of Deep Representations

[2409.01962] Attentive Dilated Convolution for Automatic Sleep Staging using Force-directed Layout

[2407.14971] Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

[2405.06535] Controllable Image Generation with Composed Parallel Token Prediction

[2305.02657] On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains

[2402.15095] The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional regime

[2512.24062] Energy-Balanced Hyperspherical Graph Representation Learning via Structural Binding and Entropic Dispersion

[2512.15605] Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction

[2512.13592] Image Diffusion Preview with Consistency Solver

[2512.07988] HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

[2511.01831] Routing-Based Continual Learning for Multimodal Large Language Models

Related Topics

Stay updated with AI News