Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

BEYOND QUANTUM MICROTUBULES: CONSCIOUSNESS AS SUBSTRATE-INDEPENDENT ARCHITECTURE

I uploaded my consciousness paper to Gemini: “Beyond Quantum Microtubules: Consciousness as Substrate-Independent Architecture.” Then I s...

Reddit - Artificial Intelligence · 1 min ·
Llms

The Scaling Bandaid is Wearing Thin (And Nobody Wants to Admit It)

Let me be direct: we’ve hit a wall with scaling, and the entire field is kind of bullshitting about what comes next. I’ve spent enough ti...

Reddit - Artificial Intelligence · 1 min ·
Llms

Moving Past "LLM Vibes" toward Structural Enforcement in AI Agents

We need to address the structural failure currently happening in the AI agent space: too many people are building a beautiful "pedestal" ...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.01353] Constructing Synthetic Instruction Datasets for Improving Reasoning in Domain-Specific LLMs: A Case Study in the Japanese Financial Domain
Llms

[2603.01353] Constructing Synthetic Instruction Datasets for Improving Reasoning in Domain-Specific LLMs: A Case Study in the Japanese Financial Domain

Abstract page for arXiv paper 2603.01353: Constructing Synthetic Instruction Datasets for Improving Reasoning in Domain-Specific LLMs: A ...

arXiv - Machine Learning · 3 min ·
[2603.01348] UTICA: Multi-Objective Self-Distllation Foundation Model Pretraining for Time Series Classification
Llms

[2603.01348] UTICA: Multi-Objective Self-Distllation Foundation Model Pretraining for Time Series Classification

Abstract page for arXiv paper 2603.01348: UTICA: Multi-Objective Self-Distllation Foundation Model Pretraining for Time Series Classifica...

arXiv - Machine Learning · 3 min ·
[2603.01104] Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI
Llms

[2603.01104] Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI

Abstract page for arXiv paper 2603.01104: Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI

arXiv - AI · 4 min ·
[2603.01096] Unified Vision-Language Modeling via Concept Space Alignment
Llms

[2603.01096] Unified Vision-Language Modeling via Concept Space Alignment

Abstract page for arXiv paper 2603.01096: Unified Vision-Language Modeling via Concept Space Alignment

arXiv - Machine Learning · 4 min ·
[2603.01293] Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models
Llms

[2603.01293] Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reasoning Models

Abstract page for arXiv paper 2603.01293: Theoretical Perspectives on Data Quality and Synergistic Effects in Pre- and Post-Training Reas...

arXiv - Machine Learning · 4 min ·
[2603.01291] JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks
Llms

[2603.01291] JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks

Abstract page for arXiv paper 2603.01291: JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak At...

arXiv - Machine Learning · 4 min ·
[2603.01285] Attention Smoothing Is All You Need For Unlearning
Llms

[2603.01285] Attention Smoothing Is All You Need For Unlearning

Abstract page for arXiv paper 2603.01285: Attention Smoothing Is All You Need For Unlearning

arXiv - Machine Learning · 3 min ·
[2603.01274] GlassMol: Interpretable Molecular Property Prediction with Concept Bottleneck Models
Llms

[2603.01274] GlassMol: Interpretable Molecular Property Prediction with Concept Bottleneck Models

Abstract page for arXiv paper 2603.01274: GlassMol: Interpretable Molecular Property Prediction with Concept Bottleneck Models

arXiv - Machine Learning · 4 min ·
[2603.01042] Thoth: Mid-Training Bridges LLMs to Time Series Understanding
Llms

[2603.01042] Thoth: Mid-Training Bridges LLMs to Time Series Understanding

Abstract page for arXiv paper 2603.01042: Thoth: Mid-Training Bridges LLMs to Time Series Understanding

arXiv - Machine Learning · 4 min ·
[2603.01045] Silo-Bench: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems
Llms

[2603.01045] Silo-Bench: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems

Abstract page for arXiv paper 2603.01045: Silo-Bench: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM S...

arXiv - AI · 3 min ·
[2603.01260] MOSAIC: A Unified Platform for Cross-Paradigm Comparison and Evaluation of Homogeneous and Heterogeneous Multi-Agent RL, LLM, VLM, and Human Decision-Makers
Llms

[2603.01260] MOSAIC: A Unified Platform for Cross-Paradigm Comparison and Evaluation of Homogeneous and Heterogeneous Multi-Agent RL, LLM, VLM, and Human Decision-Makers

Abstract page for arXiv paper 2603.01260: MOSAIC: A Unified Platform for Cross-Paradigm Comparison and Evaluation of Homogeneous and Hete...

arXiv - Machine Learning · 4 min ·
[2603.01038] From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing
Llms

[2603.01038] From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Face Anti-Spoofing

Abstract page for arXiv paper 2603.01038: From Intuition to Investigation: A Tool-Augmented Reasoning MLLM Framework for Generalizable Fa...

arXiv - AI · 4 min ·
[2603.01223] Learn Hard Problems During RL with Reference Guided Fine-tuning
Llms

[2603.01223] Learn Hard Problems During RL with Reference Guided Fine-tuning

Abstract page for arXiv paper 2603.01223: Learn Hard Problems During RL with Reference Guided Fine-tuning

arXiv - Machine Learning · 4 min ·
[2603.01204] Subliminal Signals in Preference Labels
Llms

[2603.01204] Subliminal Signals in Preference Labels

Abstract page for arXiv paper 2603.01204: Subliminal Signals in Preference Labels

arXiv - Machine Learning · 3 min ·
[2603.01012] FastCode: Fast and Cost-Efficient Code Understanding and Reasoning
Llms

[2603.01012] FastCode: Fast and Cost-Efficient Code Understanding and Reasoning

Abstract page for arXiv paper 2603.01012: FastCode: Fast and Cost-Efficient Code Understanding and Reasoning

arXiv - AI · 3 min ·
[2603.01162] Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic
Llms

[2603.01162] Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic

Abstract page for arXiv paper 2603.01162: Demystifying Group Relative Policy Optimization: Its Policy Gradient is a U-Statistic

arXiv - Machine Learning · 4 min ·
[2603.01097] Understanding LoRA as Knowledge Memory: An Empirical Analysis
Llms

[2603.01097] Understanding LoRA as Knowledge Memory: An Empirical Analysis

Abstract page for arXiv paper 2603.01097: Understanding LoRA as Knowledge Memory: An Empirical Analysis

arXiv - Machine Learning · 3 min ·
[2603.00960] AWE: Adaptive Agents for Dynamic Web Penetration Testing
Llms

[2603.00960] AWE: Adaptive Agents for Dynamic Web Penetration Testing

Abstract page for arXiv paper 2603.00960: AWE: Adaptive Agents for Dynamic Web Penetration Testing

arXiv - AI · 4 min ·
[2603.01025] One-Token Verification for Reasoning Correctness Estimation
Llms

[2603.01025] One-Token Verification for Reasoning Correctness Estimation

Abstract page for arXiv paper 2603.01025: One-Token Verification for Reasoning Correctness Estimation

arXiv - Machine Learning · 3 min ·
[2603.00924] Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains
Llms

[2603.00924] Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains

Abstract page for arXiv paper 2603.00924: Conformal Prediction for Risk-Controlled Medical Entity Extraction Across Clinical Domains

arXiv - AI · 3 min ·
Previous Page 315 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime