Machine Learning

ML algorithms, training, and inference

Top This Week

Accelerating science with AI and simulations
Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min ·
Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
New technique makes AI models leaner and faster while they’re still learning
Machine Learning

New technique makes AI models leaner and faster while they’re still learning

AI News - General · 9 min ·

All Content

[2510.14949] DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation
Machine Learning

[2510.14949] DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

Abstract page for arXiv paper 2510.14949: DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

arXiv - Machine Learning · 4 min ·
[2510.17018] CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanced Toxic Comment Classification
Machine Learning

[2510.17018] CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanced Toxic Comment Classification

Abstract page for arXiv paper 2510.17018: CoGate-LSTM: Prototype-Guided Feature-Space Gating for Mitigating Gradient Dilution in Imbalanc...

arXiv - Machine Learning · 4 min ·
[2509.22630] StateX: Enhancing RNN Recall via Post-training State Expansion
Machine Learning

[2509.22630] StateX: Enhancing RNN Recall via Post-training State Expansion

Abstract page for arXiv paper 2509.22630: StateX: Enhancing RNN Recall via Post-training State Expansion

arXiv - AI · 3 min ·
[2509.17183] LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization
Llms

[2509.17183] LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Preference Optimization

Abstract page for arXiv paper 2509.17183: LifeAlign: Lifelong Alignment for Large Language Models with Memory-Augmented Focalized Prefere...

arXiv - AI · 4 min ·
[2509.02617] Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry
Machine Learning

[2509.02617] Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on irregular geometry

Abstract page for arXiv paper 2509.02617: Gaussian process surrogate with physical law-corrected prior for multi-coupled PDEs defined on ...

arXiv - Machine Learning · 4 min ·
[2506.18601] BulletGen: Improving 4D Reconstruction with Bullet-Time Generation
Machine Learning

[2506.18601] BulletGen: Improving 4D Reconstruction with Bullet-Time Generation

Abstract page for arXiv paper 2506.18601: BulletGen: Improving 4D Reconstruction with Bullet-Time Generation

arXiv - AI · 3 min ·
[2506.03863] STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Machine Learning

[2506.03863] STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Abstract page for arXiv paper 2506.03863: STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

arXiv - Machine Learning · 4 min ·
[2504.14135] Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering
Machine Learning

[2504.14135] Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering

Abstract page for arXiv paper 2504.14135: Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering

arXiv - Machine Learning · 4 min ·
[2502.17421] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
Llms

[2502.17421] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

Abstract page for arXiv paper 2502.17421: LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification

arXiv - AI · 4 min ·
[2411.05183] Why CNN Features Are not Gaussian: A Statistical Anatomy of Deep Representations
Machine Learning

[2411.05183] Why CNN Features Are not Gaussian: A Statistical Anatomy of Deep Representations

Abstract page for arXiv paper 2411.05183: Why CNN Features Are not Gaussian: A Statistical Anatomy of Deep Representations

arXiv - Machine Learning · 4 min ·
[2409.01962] Attentive Dilated Convolution for Automatic Sleep Staging using Force-directed Layout
Machine Learning

[2409.01962] Attentive Dilated Convolution for Automatic Sleep Staging using Force-directed Layout

Abstract page for arXiv paper 2409.01962: Attentive Dilated Convolution for Automatic Sleep Staging using Force-directed Layout

arXiv - Machine Learning · 4 min ·
[2407.14971] Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models
Llms

[2407.14971] Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-Language Models

Abstract page for arXiv paper 2407.14971: Sim-CLIP: Unsupervised Siamese Adversarial Fine-Tuning for Robust and Semantically-Rich Vision-...

arXiv - AI · 4 min ·
[2405.06535] Controllable Image Generation with Composed Parallel Token Prediction
Machine Learning

[2405.06535] Controllable Image Generation with Composed Parallel Token Prediction

Abstract page for arXiv paper 2405.06535: Controllable Image Generation with Composed Parallel Token Prediction

arXiv - Machine Learning · 3 min ·
[2305.02657] On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains
Machine Learning

[2305.02657] On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on General Domains

Abstract page for arXiv paper 2305.02657: On the Eigenvalue Decay Rates of a Class of Neural-Network Related Kernel Functions Defined on ...

arXiv - Machine Learning · 4 min ·
[2402.15095] The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional regime
Machine Learning

[2402.15095] The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional regime

Abstract page for arXiv paper 2402.15095: The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional ...

arXiv - Machine Learning · 4 min ·
[2512.24062] Energy-Balanced Hyperspherical Graph Representation Learning via Structural Binding and Entropic Dispersion
Machine Learning

[2512.24062] Energy-Balanced Hyperspherical Graph Representation Learning via Structural Binding and Entropic Dispersion

Abstract page for arXiv paper 2512.24062: Energy-Balanced Hyperspherical Graph Representation Learning via Structural Binding and Entropi...

arXiv - Machine Learning · 4 min ·
[2512.15605] Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction
Llms

[2512.15605] Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction

Abstract page for arXiv paper 2512.15605: Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Ca...

arXiv - Machine Learning · 4 min ·
[2512.13592] Image Diffusion Preview with Consistency Solver
Machine Learning

[2512.13592] Image Diffusion Preview with Consistency Solver

Abstract page for arXiv paper 2512.13592: Image Diffusion Preview with Consistency Solver

arXiv - Machine Learning · 3 min ·
[2512.07988] HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability
Machine Learning

[2512.07988] HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

Abstract page for arXiv paper 2512.07988: HOLE: Homological Observation of Latent Embeddings for Neural Network Interpretability

arXiv - Machine Learning · 3 min ·
[2511.01831] Routing-Based Continual Learning for Multimodal Large Language Models
Llms

[2511.01831] Routing-Based Continual Learning for Multimodal Large Language Models

Abstract page for arXiv paper 2511.01831: Routing-Based Continual Learning for Multimodal Large Language Models

arXiv - AI · 4 min ·
Previous Page 333 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime