Large Language Models

GPT, Claude, Gemini, and other LLMs

Top This Week

Llms

Artificial intelligence will always depends on human otherwise it will be obsolete.

I was looking for a tool for my specific need. There was not any. So i started to write the program in python, just basic structure. Then...

Reddit - Artificial Intelligence · 1 min ·
Llms

My AI spent last night modifying its own codebase

I've been working on a local AI system called Apis that runs completely offline through Ollama. During a background run, Apis identified ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Fake users generated by AI can't simulate humans — review of 182 research papers. Your thoughts?

https://www.researchsquare.com/article/rs-9057643/v1 There’s a massive trend right now where tech companies, businesses, even researchers...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2603.22303] Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models
Llms

[2603.22303] Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models

Abstract page for arXiv paper 2603.22303: Sample Transform Cost-Based Training-Free Hallucination Detector for Large Language Models

arXiv - AI · 4 min ·
[2603.22301] Latent Semantic Manifolds in Large Language Models
Llms

[2603.22301] Latent Semantic Manifolds in Large Language Models

Abstract page for arXiv paper 2603.22301: Latent Semantic Manifolds in Large Language Models

arXiv - AI · 3 min ·
[2603.22299] Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores
Llms

[2603.22299] Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores

Abstract page for arXiv paper 2603.22299: Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Infor...

arXiv - AI · 3 min ·
[2603.22294] Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks
Llms

[2603.22294] Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks

Abstract page for arXiv paper 2603.22294: Efficient Embedding-based Synthetic Data Generation for Complex Reasoning Tasks

arXiv - AI · 3 min ·
Llms

[P] Cold Validation: Open-source system where one AI agent audits another with zero shared context

We released an open-source architecture for independent AI agent verification. The core idea: the agent that built something should never...

Reddit - Machine Learning · 1 min ·
Llms

FREE HUMANIZER SERVICES WITH GPT HUMAN!!!

Hey, I know how much it sucks to deal with AI detectors at school right now, so I wanted to help out. I recently paid for an unlimited me...

Reddit - Artificial Intelligence · 1 min ·
Llms

Open-source AI system on a $500 GPU outperforms Claude Sonnet on coding benchmarks

What if building more and more datacenters was not the only option? If we are able to get similar levels of performance for top models at...

Reddit - Artificial Intelligence · 1 min ·
Anthropic says Claude can now use your computer to finish tasks for you in AI agent push
Llms

Anthropic says Claude can now use your computer to finish tasks for you in AI agent push

Anthropic and its rivals are trying to ramp up capabilities of AI agents after OpenClaw went viral earlier this year.

AI Tools & Products · 3 min ·
Llms

Alright I'm just going to crash out a bit about LLMs rn downvote me upvote me up to you

Hello everyone hope you're having a nice day I'm just ugh I'm so tired and confused and frustrated. I'm desperately trying to map/figure ...

Reddit - Artificial Intelligence · 1 min ·
Pentagon’s ‘Attempt to Cripple’ Anthropic Is Troubling, Judge Says | WIRED
Llms

Pentagon’s ‘Attempt to Cripple’ Anthropic Is Troubling, Judge Says | WIRED

During a hearing Tuesday, a district court judge questioned the Department of Defense’s motivations for labeling the Claude AI developer ...

Wired - AI · 6 min ·
Llms

I used an app to analyze 3 years of my Claude conversations. It identified a behavioral pattern I'd never named.

Exported everything. Normalized it. Ran cross-source analysis against my journal entries, calendar, and sleep data. The output I couldn't...

Reddit - Artificial Intelligence · 1 min ·
Llms

I tested ChatGPT vs Claude vs Gemini for coding ...here's what I found

So ive been going back and forth between these three for actual work (not just asking it to write fizzbuzz) and wanted to share what I fo...

Reddit - Artificial Intelligence · 1 min ·
OpenAI's plans to make ChatGPT more like Amazon aren't going so well | TechCrunch
Llms

OpenAI's plans to make ChatGPT more like Amazon aren't going so well | TechCrunch

OpenAI says its moving away from Instant Checkout, which allowed users to buy items directly through the ChatGPT interface.

TechCrunch - AI · 4 min ·
Google TV's new Gemini features keep fans updated on sports teams and more | TechCrunch
Llms

Google TV's new Gemini features keep fans updated on sports teams and more | TechCrunch

Three Gemini-powered features are coming to your Google TV. This includes visual responses, deep dives, and sports briefs.

TechCrunch - AI · 4 min ·
Llms

[For Hire] Full-Stack AI/ML Engineer | Agentic AI · RAG · Computer Vision · Voice AI · LangGraph · FastAPI | Remote

Hey everyone, I'm a Full-Stack AI/ML Engineer with 3+ years of production experience across Agentic AI, RAG pipelines, Computer Vision, a...

Reddit - ML Jobs · 1 min ·
Llms

I mapped how Reddit actually talks about AI safety: 6,374 posts, 23 clusters, some surprising patterns

I collected Reddit posts between Jan 29 - Mar 1, 2026 using 40 keyword-based search terms ("AI safety", "AI alignment", "EU AI Act", "AI ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Three companies shipped "AI agent on your desktop" in the same two weeks. That's not a coincidence.

Something interesting happened this month. March 11: Perplexity announced Personal Computer. An always-on Mac Mini running their AI agent...

Reddit - Artificial Intelligence · 1 min ·
Llms

[R] Evaluating MLLMs with Child-Inspired Cognitive Tasks

Hey there, we’re sharing KidGym, an interactive 2D grid-based benchmark for evaluating MLLMs in continuous, trajectory-based interaction,...

Reddit - Machine Learning · 1 min ·
Anthropic’s Claude Code and Cowork can control your computer | The Verge
Llms

Anthropic’s Claude Code and Cowork can control your computer | The Verge

Anthropic has updated Claude to perform tasks in its Code and Cowork AI tools autonomously by using your computer for you.

The Verge - AI · 4 min ·
Agile Robots becomes the latest robotics company to partner with Google DeepMind | TechCrunch
Llms

Agile Robots becomes the latest robotics company to partner with Google DeepMind | TechCrunch

Agile Robots will incorporate Google DeepMind's robotics foundation models into its bots while collecting data for the AI research lab.

TechCrunch - AI · 4 min ·
Previous Page 40 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime