Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

LLMs

Sentient OS: a custom on-device vision LLM that understands your entire digital life (every screenshot, note, file, email...), while your device charges overnight. Talk to your data, get proactive reminders, and explore knowledge graphs!

99% of "AI" apps are just GPT wrappers that pipe your data to cloud LLMs and call it a product. No one's ever created an intelligence lay...

Reddit - Artificial Intelligence · 1 min ·

What to build while we still have access to cheap AI?

AI companies are subsidizing access the same way Uber subsidized rides and AWS subsidized compute in the early days - burning cash to gra...

Reddit - Artificial Intelligence · 1 min ·

OpenAI starts laying foundations for ChatGPT ads in EU

Submitted by /u/ThereWas

Reddit - Artificial Intelligence · 1 min ·

All Content

[2510.17206] Soft-Masked Diffusion Language Models

arXiv - Machine Learning · 4 min ·

[2510.18245] Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs

arXiv - Machine Learning · 4 min ·

[2510.10066] OBsmith: LLM-Powered JavaScript Obfuscator Testing

arXiv - AI · 4 min ·

[2510.09462] Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

arXiv - Machine Learning · 4 min ·

[2510.13117] On the Reasoning Abilities of Masked Diffusion Language Models

arXiv - AI · 3 min ·

[2510.07940] TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

arXiv - Machine Learning · 4 min ·

[2510.06377] Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data

arXiv - Machine Learning · 4 min ·

[2510.06292] ChainMPQ: Interleaved Text-Image Reasoning Chains for Mitigating Relation Hallucinations

arXiv - AI · 4 min ·

[2510.05064] Boomerang Distillation Enables Zero-Shot Model Size Interpolation

arXiv - Machine Learning · 4 min ·

[2510.05174] Emergent Coordination in Multi-Agent Language Models

arXiv - AI · 4 min ·

[2510.05132] Training Large Language Models To Reason In Parallel With Global Forking Tokens

arXiv - Machine Learning · 4 min ·

[2510.05069] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs

arXiv - AI · 4 min ·

[2510.05109] Tiny but Mighty: A Software-Hardware Co-Design Approach for Efficient Multimodal Inference on Battery-Powered Small Devices

arXiv - AI · 4 min ·

[2510.04682] TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

arXiv - AI · 4 min ·

[2510.04067] What Scales in Cross-Entropy Scaling Law?

arXiv - Machine Learning · 4 min ·

[2510.02209] StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

arXiv - Machine Learning · 4 min ·

[2510.03253] Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

arXiv - Machine Learning · 4 min ·

[2510.02999] Untargeted Jailbreak Attack

arXiv - AI · 4 min ·

[2510.02245] ExGRPO: Learning to Reason from Experience

arXiv - Machine Learning · 4 min ·

[2510.01051] GEM: A Gym for Agentic LLMs

arXiv - Machine Learning · 4 min ·