Large Language Models

GPT, Claude, Gemini, and other LLMs

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Llms

To teach in the time of ChatGPT is to know pain

LLM use is the most demoralizing problem I’ve faced as a college instructor.

AI Tools & Products · 12 min · 40 minutes ago

Llms

Bluefish Raises $43M to Help Brands Show Up in ChatGPT, Rufus, and More

Series B brings total funding to $68 million as brands rethink AI visibility.

AI Tools & Products · 2 min · 40 minutes ago

Llms

Is Your Small Business Invisible to ChatGPT and Google’s AI Answers? Here’s How to Get Back on the Map

Discover why small businesses need a GEO strategy to stay relevant and competitive as AI platforms transform the search landscape.

AI Tools & Products · 7 min · 40 minutes ago

All Content

Llms

[2509.10625] No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

Abstract page for arXiv paper 2509.10625: No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.05425] No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

Abstract page for arXiv paper 2509.05425: No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

arXiv - AI · 3 min · about 1 month ago

Llms

[2511.10833] SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

Abstract page for arXiv paper 2511.10833: SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.08939] TransactionGPT

Abstract page for arXiv paper 2511.08939: TransactionGPT

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2507.05890] Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

Abstract page for arXiv paper 2507.05890: Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

arXiv - AI · 4 min · about 1 month ago

Llms

[2507.01335] LEDOM: Reverse Language Model

Abstract page for arXiv paper 2507.01335: LEDOM: Reverse Language Model

arXiv - AI · 3 min · about 1 month ago

Llms

[2510.15165] Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach

Abstract page for arXiv paper 2510.15165: Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation App...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.10902] Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness

Abstract page for arXiv paper 2510.10902: Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.04573] LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

Abstract page for arXiv paper 2510.04573: LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.08646] Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

Abstract page for arXiv paper 2510.08646: Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2506.11103] You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models

Abstract page for arXiv paper 2506.11103: You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models

arXiv - AI · 4 min · about 1 month ago

Llms

[2509.23202] Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Abstract page for arXiv paper 2509.23202: Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

arXiv - Machine Learning · 4 min · about 1 month ago

$[2503.01804] $\texttt{SEM-CTRL}$: Semantically Controlled Decoding$

Llms

[2503.01804] $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

Abstract page for arXiv paper 2503.01804: $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2509.07430] The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Abstract page for arXiv paper 2509.07430: The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Lea...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2503.03170] AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding

Abstract page for arXiv paper 2503.03170: AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding

arXiv - AI · 4 min · about 1 month ago

Llms

[2502.08666] Hallucination, Monofacts, and Miscalibration: An Empirical Investigation

Abstract page for arXiv paper 2502.08666: Hallucination, Monofacts, and Miscalibration: An Empirical Investigation

arXiv - AI · 4 min · about 1 month ago

Llms

[2508.01077] The Lattice Geometry of Neural Network Quantization -- A Short Equivalence Proof of GPTQ and Babai's Algorithm

Abstract page for arXiv paper 2508.01077: The Lattice Geometry of Neural Network Quantization -- A Short Equivalence Proof of GPTQ and Ba...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2410.04949] Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law

Abstract page for arXiv paper 2410.04949: Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2407.16893] The Price of Prompting: Profiling Energy Use in Large Language Models Inference

Abstract page for arXiv paper 2407.16893: The Price of Prompting: Profiling Energy Use in Large Language Models Inference

arXiv - AI · 4 min · about 1 month ago

Previous Page 176 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Large Language Models

Top This Week

To teach in the time of ChatGPT is to know pain

Bluefish Raises $43M to Help Brands Show Up in ChatGPT, Rufus, and More

Is Your Small Business Invisible to ChatGPT and Google’s AI Answers? Here’s How to Get Back on the Map

All Content

[2509.10625] No Answer Needed: Predicting LLM Answer Accuracy from Question-Only Linear Probes

[2509.05425] No Text Needed: Forecasting MT Quality and Inequity from Fertility and Metadata

[2511.10833] SURFACEBENCH: A Geometry-Aware Benchmark for Symbolic Surface Discovery

[2511.08939] TransactionGPT

[2507.05890] Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators

[2507.01335] LEDOM: Reverse Language Model

[2510.15165] Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach

[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

[2510.10902] Auditing Information Disclosure During LLM-Scale Gradient Descent Using Gradient Uniqueness

[2510.04573] LaDiR: Latent Diffusion Enhances LLMs for Text Reasoning

[2510.08646] Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

[2506.11103] You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models

[2509.23202] Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

[2503.01804] $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

[2509.07430] The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

[2503.03170] AttackSeqBench: Benchmarking the Capabilities of LLMs for Attack Sequences Understanding

[2502.08666] Hallucination, Monofacts, and Miscalibration: An Empirical Investigation

[2508.01077] The Lattice Geometry of Neural Network Quantization -- A Short Equivalence Proof of GPTQ and Babai's Algorithm

[2410.04949] Leverage Knowledge Graph and Large Language Model for Law Article Recommendation: A Case Study of Chinese Criminal Law

[2407.16893] The Price of Prompting: Profiling Energy Use in Large Language Models Inference

Related Topics

Stay updated with AI News