Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

[R] Looking for arXiv cs.LG endorser, inference monitoring using information geometry

Hi r/MachineLearning, I’m looking for an arXiv endorser in cs.LG for a paper on inference-time distribution shift detection for deployed ...

Reddit - Machine Learning · 1 min ·
Llms

How LLM sycophancy got the US into the Iran quagmire

Submitted by /u/sow_oats

Reddit - Artificial Intelligence · 1 min ·
Llms

Kept hitting ChatGPT and Claude limits during real work. This is the free setup I ended up using

I do a lot of writing and random problem solving for work. Mostly long drafts, edits, and breaking down ideas. Around Jan I kept hitting ...

Reddit - Artificial Intelligence · 1 min ·

All Content

Llms

Xiaomi's MiMo models are making the AI pricing conversation uncomfortable

MiMo-V2-Flash is open source, scores 73.4% on SWE-Bench (#1 among open source models), and costs $0.10 per million input tokens. That's c...

Reddit - Artificial Intelligence · 1 min ·
Llms

Everyone is looking for friend here, just curious do you guys talk you chatgpt or claude like they are your friend or it's just me ?

Im 24 m,and I really can't carry the conversation in real, so I find myself talking to chatgpt or claude I even tried to make myself ai c...

Reddit - Artificial Intelligence · 1 min ·
Llms

[2603.14579] Medical Image Spatial Grounding with Semantic Sampling

Abstract page for arXiv paper 2603.14579: Medical Image Spatial Grounding with Semantic Sampling

arXiv - Machine Learning · 4 min ·
Llms

[2511.17885] FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and Routing-Aware Token Pruning

Abstract page for arXiv paper 2511.17885: FastMMoE: Accelerating Multimodal Large Language Models through Dynamic Expert Activation and R...

arXiv - Machine Learning · 4 min ·
Llms

[2503.03773] A Phylogenetic Approach to Genomic Language Modeling

Abstract page for arXiv paper 2503.03773: A Phylogenetic Approach to Genomic Language Modeling

arXiv - Machine Learning · 3 min ·
Llms

[2603.17246] On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings

Abstract page for arXiv paper 2603.17246: On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings

arXiv - Machine Learning · 4 min ·
Llms

[2602.10014] A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula

Abstract page for arXiv paper 2602.10014: A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula

arXiv - Machine Learning · 3 min ·
Llms

[2511.09833] ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking

Abstract page for arXiv paper 2511.09833: ACT as Human: Multimodal Large Language Model Data Annotation with Critical Thinking

arXiv - Machine Learning · 4 min ·
Llms

[2507.18014] Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

Abstract page for arXiv paper 2507.18014: Predictive Scaling Laws for Efficient GRPO Training of Large Reasoning Models

arXiv - Machine Learning · 3 min ·
Llms

[2603.19862] IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment

Abstract page for arXiv paper 2603.19862: IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment

arXiv - Machine Learning · 4 min ·
Llms

[2603.19545] Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-Jacobi-Bellman Equations

Abstract page for arXiv paper 2603.19545: Verifiable Error Bounds for Physics-Informed Neural Network Solutions of Lyapunov and Hamilton-...

arXiv - Machine Learning · 4 min ·
Llms

[2603.19532] EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

Abstract page for arXiv paper 2603.19532: EvidenceRL: Reinforcing Evidence Consistency for Trustworthy Language Models

arXiv - Machine Learning · 3 min ·
Llms

[2603.19473] Reinforcement-guided generative protein language models enable de novo design of highly diverse AAV capsids

Abstract page for arXiv paper 2603.19473: Reinforcement-guided generative protein language models enable de novo design of highly diverse...

arXiv - Machine Learning · 4 min ·
Llms

[2603.19517] ReXInTheWild: A Unified Benchmark for Medical Photograph Understanding

Abstract page for arXiv paper 2603.19517: ReXInTheWild: A Unified Benchmark for Medical Photograph Understanding

arXiv - Machine Learning · 4 min ·
Llms

[2603.19375] Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

Abstract page for arXiv paper 2603.19375: Automated Membership Inference Attacks: Discovering MIA Signal Computations using LLM Agents

arXiv - Machine Learning · 3 min ·
Llms

[2603.19347] Exploring the Agentic Frontier of Verilog Code Generation

Abstract page for arXiv paper 2603.19347: Exploring the Agentic Frontier of Verilog Code Generation

arXiv - Machine Learning · 4 min ·
Llms

[2603.19261] Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword Merging

Abstract page for arXiv paper 2603.19261: Significance-Gain Pair Encoding for LLMs: A Statistical Alternative to Frequency-Based Subword ...

arXiv - Machine Learning · 3 min ·
Llms

[2603.20132] Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents

Abstract page for arXiv paper 2603.20132: Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual St...

arXiv - Machine Learning · 3 min ·
Llms

[2603.19935] Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

Abstract page for arXiv paper 2603.19935: Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents

arXiv - Machine Learning · 3 min ·
Llms

[2603.19835] FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Abstract page for arXiv paper 2603.19835: FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

arXiv - Machine Learning · 4 min ·