Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[R] Looking for arXiv cs.LG endorser, inference monitoring using information geometry

Hi r/MachineLearning, I’m looking for an arXiv endorser in cs.LG for a paper on inference-time distribution shift detection for deployed ...

Reddit - Machine Learning · 1 min · 28 minutes ago

Llms

How LLM sycophancy got the US into the Iran quagmire

submitted by /u/sow_oats [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

Llms

Kept hitting ChatGPT and Claude limits during real work. This is the free setup I ended up using

I do a lot of writing and random problem solving for work. Mostly long drafts, edits, and breaking down ideas. Around Jan I kept hitting ...

Reddit - Artificial Intelligence · 1 min · about 5 hours ago

All Content

Llms

Xiaomi's MiMo models are making the AI pricing conversation uncomfortable

MiMo-V2-Flash is open source, scores 73.4% on SWE-Bench (#1 among open source models), and costs $0.10 per million input tokens. That's c...

Reddit - Artificial Intelligence · 1 min · 13 days ago

Llms

Everyone is looking for friend here, just curious do you guys talk you chatgpt or claude like they are your friend or it's just me ?

Im 24 m,and I really can't carry the conversation in real, so I find myself talking to chatgpt or claude I even tried to make myself ai c...

Reddit - Artificial Intelligence · 1 min · 13 days ago

Llms

[2603.14579] Medical Image Spatial Grounding with Semantic Sampling

Abstract page for arXiv paper 2603.14579: Medical Image Spatial Grounding with Semantic Sampling

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2511.17885] FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning

Abstract page for arXiv paper 2511.17885: FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and R...

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2503.03773] A Phylogenetic Approach to Genomic Language Modeling

Abstract page for arXiv paper 2503.03773: A Phylogenetic Approach to Genomic Language Modeling

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.17246] On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings

Abstract page for arXiv paper 2603.17246: On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2602.10014] A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula

Abstract page for arXiv paper 2602.10014: A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2511.09833] ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking

Abstract page for arXiv paper 2511.09833: ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2507.18014] Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

Abstract page for arXiv paper 2507.18014: Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.19862] IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment

Abstract page for arXiv paper 2603.19862: IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.19545] Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-Jacobi-Bellman Equations

Abstract page for arXiv paper 2603.19545: Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-...

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.19532] EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

Abstract page for arXiv paper 2603.19532: EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.19473] Reinforcement-guided generative protein language models enable de novo design of highly diverse AAV capsids

Abstract page for arXiv paper 2603.19473: Reinforcement-guided generative protein language models enable de novo design of highly diverse...

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.19517] ReXInTheWild: A Unified Benchmark for Medical Photograph Understanding

Abstract page for arXiv paper 2603.19517: ReXInTheWild: A Unified Benchmark for Medical Photograph Understanding

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.19375] Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

Abstract page for arXiv paper 2603.19375: Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.19347] Exploring the Agentic Frontier of Verilog Code Generation

Abstract page for arXiv paper 2603.19347: Exploring the Agentic Frontier of Verilog Code Generation

arXiv - Machine Learning · 4 min · 13 days ago

Llms

[2603.19261] Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword Merging

Abstract page for arXiv paper 2603.19261: Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword ...

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.20132] Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents

Abstract page for arXiv paper 2603.20132: Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual St...

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.19935] Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

Abstract page for arXiv paper 2603.19935: Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

arXiv - Machine Learning · 3 min · 13 days ago

Llms

[2603.19835] FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Abstract page for arXiv paper 2603.19835: FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

arXiv - Machine Learning · 4 min · 13 days ago

Previous Page 75 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

[R] Looking for arXiv cs.LG endorser, inference monitoring using information geometry

How LLM sycophancy got the US into the Iran quagmire

Kept hitting ChatGPT and Claude limits during real work. This is the free setup I ended up using

All Content

Xiaomi's MiMo models are making the AI pricing conversation uncomfortable

Everyone is looking for friend here, just curious do you guys talk you chatgpt or claude like they are your friend or it's just me ?

[2603.14579] Medical Image Spatial Grounding with Semantic Sampling

[2511.17885] FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning

[2503.03773] A Phylogenetic Approach to Genomic Language Modeling

[2603.17246] On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings

[2602.10014] A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula

[2511.09833] ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking

[2507.18014] Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

[2603.19862] IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment

[2603.19545] Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-Jacobi-Bellman Equations

[2603.19532] EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

[2603.19473] Reinforcement-guided generative protein language models enable de novo design of highly diverse AAV capsids

[2603.19517] ReXInTheWild: A Unified Benchmark for Medical Photograph Understanding

[2603.19375] Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

[2603.19347] Exploring the Agentic Frontier of Verilog Code Generation

[2603.19261] Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword Merging

[2603.20132] Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents

[2603.19935] Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

[2603.19835] FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Related Topics

Stay updated with AI News