Natural Language Processing

Text understanding and language tasks

Top This Week

LLMs

[R] Is autoresearch really better than classic hyperparameter tuning?

We did experiments comparing Optuna & autoresearch. Autoresearch converges faster, is more cost-efficient, and even generalizes bette...

Reddit - Machine Learning · 1 min ·
NLP

Automate iOS devices through XCUITest with Droidrun.

Automate iOS apps with XCUITest and Droidrun using just natural language. You send the command to Droidrun, and the agent starts the task...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[P] Trained a small BERT on 276K Kubernetes YAMLs using tree positional encoding instead of sequential

I trained a BERT-style transformer on 276K Kubernetes YAML files, replacing standard positional encoding with learned tree coordinates (d...

Reddit - Machine Learning · 1 min ·

All Content

[2602.20709] Onboard-Targeted Segmentation of Straylight in Space Camera Sensors
Machine Learning

This paper presents an AI-driven methodology for segmenting straylight effects in space camera sensors, enhancing image analysis in resou...

arXiv - AI · 3 min ·
[2602.20677] UrbanFM: Scaling Urban Spatio-Temporal Foundation Models
LLMs

The paper presents UrbanFM, a novel framework for scaling urban spatio-temporal foundation models, addressing challenges in generalizabil...

arXiv - Machine Learning · 4 min ·
[2602.20676] PRECTR-V2: Unified Relevance-CTR Framework with Cross-User Preference Mining, Exposure Bias Correction, and LLM-Distilled Encoder Optimization
LLMs

The paper presents PRECTR-V2, an advanced framework for improving search relevance and click-through rate (CTR) prediction by addressing ...

arXiv - AI · 4 min ·
[2602.20684] Agile V: A Compliance-Ready Framework for AI-Augmented Engineering -- From Concept to Audit-Ready Delivery
Machine Learning

The paper presents Agile V, a framework integrating AI in engineering workflows to ensure compliance and verification at machine-speed de...

arXiv - AI · 4 min ·
[2602.20650] Dataset Color Quantization: A Training-Oriented Framework for Dataset-Level Compression
Machine Learning

The paper presents Dataset Color Quantization (DCQ), a framework designed to compress large-scale image datasets by reducing color-space ...

arXiv - AI · 3 min ·
[2602.20634] Enhancing Hate Speech Detection on Social Media: A Comparative Analysis of Machine Learning Models and Text Transformation Approaches
Machine Learning

This article evaluates various machine learning models for hate speech detection on social media, comparing traditional and advanced tech...

arXiv - AI · 3 min ·
[2602.20547] What Drives Students' Use of AI Chatbots? Technology Acceptance in Conversational AI
Machine Learning

This article explores the factors influencing students' adoption of AI chatbots for learning, utilizing the Technology Acceptance Model t...

arXiv - AI · 4 min ·
[2602.20520] How Do Inpainting Artifacts Propagate to Language?
LLMs

This paper investigates how visual artifacts from diffusion-based inpainting affect language generation in vision-language models, reveal...

arXiv - AI · 3 min ·
[2602.20449] Protein Language Models Diverge from Natural Language: Comparative Analysis and Improved Inference
LLMs

This article explores the differences between protein language models (PLMs) and natural language models, highlighting how these distinct...

arXiv - Machine Learning · 4 min ·
[2602.20379] Case-Aware LLM-as-a-Judge Evaluation for Enterprise-Scale RAG Systems
LLMs

The paper presents a case-aware evaluation framework for enterprise-scale Retrieval-Augmented Generation (RAG) systems, addressing the li...

arXiv - AI · 3 min ·
[2602.20344] Hierarchical Molecular Representation Learning via Fragment-Based Self-Supervised Embedding Prediction
NLP

This article presents GraSPNet, a novel hierarchical self-supervised learning framework for molecular representation that enhances graph ...

arXiv - Machine Learning · 3 min ·
[2602.20300] What Makes a Good Query? Measuring the Impact of Human-Confusing Linguistic Features on LLM Performance
LLMs

This article examines how specific linguistic features of queries impact the performance of Large Language Models (LLMs), particularly in...

arXiv - AI · 3 min ·
[2602.20224] Exploring Anti-Aging Literature via ConvexTopics and Large Language Models
LLMs

This article presents a novel clustering algorithm for analyzing anti-aging literature, improving topic modeling through convex optimizat...

arXiv - Machine Learning · 3 min ·
[2602.20219] An Approach to Combining Video and Speech with Large Language Models in Human-Robot Interaction
LLMs

This article presents a novel multimodal framework for human-robot interaction that integrates video and speech processing with large lan...

arXiv - AI · 3 min ·
[2602.20213] CodeHacker: Automated Test Case Generation for Detecting Vulnerabilities in Competitive Programming Solutions
LLMs

CodeHacker is an automated framework designed to generate test cases that identify vulnerabilities in competitive programming solutions, ...

arXiv - AI · 3 min ·
[2310.15741] Interpretable Medical Image Classification using Prototype Learning and Privileged Information
Machine Learning

This article presents a novel approach to medical image classification using prototype learning and privileged information, enhancing int...

arXiv - AI · 3 min ·
[2602.21143] A Benchmark for Deep Information Synthesis
LLMs

The paper introduces DEEPSYNTH, a benchmark for evaluating large language models on complex tasks requiring deep information synthesis an...

arXiv - Machine Learning · 4 min ·
[2602.21044] LogicGraph: Benchmarking Multi-Path Logical Reasoning via Neuro-Symbolic Generation and Verification
LLMs

LogicGraph introduces a benchmark for evaluating multi-path logical reasoning in large language models, highlighting their limitations in...

arXiv - AI · 4 min ·
[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence
LLMs

The paper introduces AgentOS, a conceptual framework that transitions Large Language Models from static inference engines to dynamic cogn...

arXiv - AI · 3 min ·
[2602.20918] Predicting Sentence Acceptability Judgments in Multimodal Contexts
LLMs

This paper explores how visual context influences sentence acceptability judgments in humans and large language models (LLMs), revealing ...

arXiv - AI · 4 min ·