Natural Language Processing

Text understanding and language tasks

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Llms

[R] Is autoresearch really better than classic hyperparameter tuning?

We did experiments comparing Optuna & autoresearch. Autoresearch converges faster, is more cost-efficient, and even generalizes bette...

Reddit - Machine Learning · 1 min · about 8 hours ago

Nlp

Automate IOS devices through XCUITest with droidrun.

Automate iOS apps with XCUITest and Droidrun using just natural language. You send the command to Droidrun, and the agent starts the task...

Reddit - Artificial Intelligence · 1 min · about 13 hours ago

Machine Learning

[P] Trained a small BERT on 276K Kubernetes YAMLs using tree positional encoding instead of sequential

I trained a BERT-style transformer on 276K Kubernetes YAML files, replacing standard positional encoding with learned tree coordinates (d...

Reddit - Machine Learning · 1 min · about 15 hours ago

All Content

Machine Learning

[2602.20709] Onboard-Targeted Segmentation of Straylight in Space Camera Sensors

This paper presents an AI-driven methodology for segmenting straylight effects in space camera sensors, enhancing image analysis in resou...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20677] UrbanFM: Scaling Urban Spatio-Temporal Foundation Models

The paper presents UrbanFM, a novel framework for scaling urban spatio-temporal foundation models, addressing challenges in generalizabil...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20676] PRECTR-V2:Unified Relevance-CTR Framework with Cross-User Preference Mining, Exposure Bias Correction, and LLM-Distilled Encoder Optimization

The paper presents PRECTR-V2, an advanced framework for improving search relevance and click-through rate (CTR) prediction by addressing ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20684] Agile V: A Compliance-Ready Framework for AI-Augmented Engineering -- From Concept to Audit-Ready Delivery

The paper presents Agile V, a framework integrating AI in engineering workflows to ensure compliance and verification at machine-speed de...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20650] Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression

The paper presents Dataset Color Quantization (DCQ), a framework designed to compress large-scale image datasets by reducing color-space ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20634] Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches

This article evaluates various machine learning models for hate speech detection on social media, comparing traditional and advanced tech...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI

This article explores the factors influencing students' adoption of AI chatbots for learning, utilizing the Technology Acceptance Model t...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20520] How Do Inpainting Artifacts Propagate to Language?

This paper investigates how visual artifacts from diffusion-based inpainting affect language generation in vision-language models, reveal...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20449] Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

This article explores the differences between protein language models (PLMs) and natural language models, highlighting how these distinct...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems

The paper presents a case-aware evaluation framework for enterprise-scale Retrieval-Augmented Generation (RAG) systems, addressing the li...

arXiv - AI · 3 min · about 1 month ago

Nlp

[2602.20344] Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

This article presents GraSPNet, a novel hierarchical self-supervised learning framework for molecular representation that enhances graph ...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.20300] What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

This article examines how specific linguistic features of queries impact the performance of Large Language Models (LLMs), particularly in...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20224] Exploring Anti-Aging Literature via ConvexTopics and Large Language Models

This article presents a novel clustering algorithm for analyzing anti-aging literature, improving topic modeling through convex optimizat...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction

This article presents a novel multimodal framework for human-robot interaction that integrates video and speech processing with large lan...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20213] CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

CodeHacker is an automated framework designed to generate test cases that identify vulnerabilities in competitive programming solutions, ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2310.15741] Interpretable Medical Image Classification using Prototype Learning and Privileged Information

This article presents a novel approach to medical image classification using prototype learning and privileged information, enhancing int...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.21143] A Benchmark for Deep Information Synthesis

The paper introduces DEEPSYNTH, a benchmark for evaluating large language models on complex tasks requiring deep information synthesis an...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.21044] LogicGraph : Benchmarking Multi-Path Logical Reasoning via Neuro-Symbolic Generation and Verification

LogicGraph introduces a benchmark for evaluating multi-path logical reasoning in large language models, highlighting their limitations in...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

The paper introduces AgentOS, a conceptual framework that transitions Large Language Models from static inference engines to dynamic cogn...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.20918] Predicting Sentence Acceptability Judgments in Multimodal Contexts

This paper explores how visual context influences sentence acceptability judgments in humans and large language models (LLMs), revealing ...

arXiv - AI · 4 min · about 1 month ago

Previous Page 81 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Natural Language Processing

Top This Week

[R] Is autoresearch really better than classic hyperparameter tuning?

Automate IOS devices through XCUITest with droidrun.

[P] Trained a small BERT on 276K Kubernetes YAMLs using tree positional encoding instead of sequential

All Content

[2602.20709] Onboard-Targeted Segmentation of Straylight in Space Camera Sensors

[2602.20677] UrbanFM: Scaling Urban Spatio-Temporal Foundation Models

[2602.20676] PRECTR-V2:Unified Relevance-CTR Framework with Cross-User Preference Mining, Exposure Bias Correction, and LLM-Distilled Encoder Optimization

[2602.20684] Agile V: A Compliance-Ready Framework for AI-Augmented Engineering -- From Concept to Audit-Ready Delivery

[2602.20650] Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression

[2602.20634] Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches

[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI

[2602.20520] How Do Inpainting Artifacts Propagate to Language?

[2602.20449] Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference

[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems

[2602.20344] Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction

[2602.20300] What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance

[2602.20224] Exploring Anti-Aging Literature via ConvexTopics and Large Language Models

[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction

[2602.20213] CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions

[2310.15741] Interpretable Medical Image Classification using Prototype Learning and Privileged Information

[2602.21143] A Benchmark for Deep Information Synthesis

[2602.21044] LogicGraph : Benchmarking Multi-Path Logical Reasoning via Neuro-Symbolic Generation and Verification

[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

[2602.20918] Predicting Sentence Acceptability Judgments in Multimodal Contexts

Related Topics

Stay updated with AI News