Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Diffusion for generating/editing ASTs? [D]

I’m not a machine learning expert or anything, but I do enjoy learning about how it all works. I’ve noticed that one of the main limitati...

Reddit - Machine Learning · 1 min ·
ChatGPT’s ‘Trusted Contact’ will alert loved ones of safety concerns | The Verge
Llms

ChatGPT’s ‘Trusted Contact’ will alert loved ones of safety concerns | The Verge

OpenAI is launching an optional safety feature for ChatGPT that allows adult users to assign an emergency contact for mental health and s...

The Verge - AI · 4 min ·
Llms

AI is helpful but still not “there” yet

what I mean is that every time I use Claude, or Grok or any of the AI platforms and tools, I realize how far this technology is from repl...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2510.15863] PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction
Llms

[2510.15863] PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

Abstract page for arXiv paper 2510.15863: PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction

arXiv - AI · 4 min ·
[2510.20264] Optimistic Task Inference for Behavior Foundation Models
Llms

[2510.20264] Optimistic Task Inference for Behavior Foundation Models

Abstract page for arXiv paper 2510.20264: Optimistic Task Inference for Behavior Foundation Models

arXiv - Machine Learning · 4 min ·
[2510.21314] A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization
Llms

[2510.21314] A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization

Abstract page for arXiv paper 2510.21314: A Convergence Analysis of Adaptive Optimizers under Floating-point Quantization

arXiv - Machine Learning · 4 min ·
[2510.13888] Reliable Fine-Grained Evaluation of Natural Language Math Proofs
Llms

[2510.13888] Reliable Fine-Grained Evaluation of Natural Language Math Proofs

Abstract page for arXiv paper 2510.13888: Reliable Fine-Grained Evaluation of Natural Language Math Proofs

arXiv - AI · 4 min ·
[2510.17206] Soft-Masked Diffusion Language Models
Llms

[2510.17206] Soft-Masked Diffusion Language Models

Abstract page for arXiv paper 2510.17206: Soft-Masked Diffusion Language Models

arXiv - Machine Learning · 4 min ·
[2510.18245] Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
Llms

[2510.18245] Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

Abstract page for arXiv paper 2510.18245: Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

arXiv - Machine Learning · 4 min ·
[2510.10066] OBsmith: LLM-Powered JavaScript Obfuscator Testing
Llms

[2510.10066] OBsmith: LLM-Powered JavaScript Obfuscator Testing

Abstract page for arXiv paper 2510.10066: OBsmith: LLM-Powered JavaScript Obfuscator Testing

arXiv - AI · 4 min ·
[2510.09462] Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Llms

[2510.09462] Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Abstract page for arXiv paper 2510.09462: Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

arXiv - Machine Learning · 4 min ·
[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models
Llms

[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models

Abstract page for arXiv paper 2510.13117: On the Reasoning Abilities of Masked Diffusion Language Models

arXiv - AI · 3 min ·
[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation
Llms

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Abstract page for arXiv paper 2510.07940: TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

arXiv - Machine Learning · 4 min ·
[2510.06377] Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
Llms

[2510.06377] Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data

Abstract page for arXiv paper 2510.06377: Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data

arXiv - Machine Learning · 4 min ·
[2510.06292] ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations
Llms

[2510.06292] ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations

Abstract page for arXiv paper 2510.06292: ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations

arXiv - AI · 4 min ·
[2510.05064] Boomerang Distillation Enables Zero-Shot Model Size Interpolation
Llms

[2510.05064] Boomerang Distillation Enables Zero-Shot Model Size Interpolation

Abstract page for arXiv paper 2510.05064: Boomerang Distillation Enables Zero-Shot Model Size Interpolation

arXiv - Machine Learning · 4 min ·
[2510.05174] Emergent Coordination in Multi-Agent Language Models
Llms

[2510.05174] Emergent Coordination in Multi-Agent Language Models

Abstract page for arXiv paper 2510.05174: Emergent Coordination in Multi-Agent Language Models

arXiv - AI · 4 min ·
[2510.05132] Training Large Language Models To Reason In Parallel With Global Forking Tokens
Llms

[2510.05132] Training Large Language Models To Reason In Parallel With Global Forking Tokens

Abstract page for arXiv paper 2510.05132: Training Large Language Models To Reason In Parallel With Global Forking Tokens

arXiv - Machine Learning · 4 min ·
[2510.05069] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs
Llms

[2510.05069] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

Abstract page for arXiv paper 2510.05069: SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

arXiv - AI · 4 min ·
[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices
Llms

[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

Abstract page for arXiv paper 2510.05109: Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on B...

arXiv - AI · 4 min ·
[2510.04682] TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA
Llms

[2510.04682] TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

Abstract page for arXiv paper 2510.04682: TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

arXiv - AI · 4 min ·
[2510.04067] What Scales in Cross-Entropy Scaling Law?
Llms

[2510.04067] What Scales in Cross-Entropy Scaling Law?

Abstract page for arXiv paper 2510.04067: What Scales in Cross-Entropy Scaling Law?

arXiv - Machine Learning · 4 min ·
[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Llms

[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Abstract page for arXiv paper 2510.02209: StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

arXiv - Machine Learning · 4 min ·
Previous Page 337 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime