Updated on 2025.03.26

LLM Reasoning

Publish Date Title Authors PDF Code
2025-03-24 Training-Free Personalization via Retrieval and Reasoning on Fingerprints Deepayan Das et.al. 2503.18623 null
2025-03-23 Mind with Eyes: from Language Reasoning to Multimodal Reasoning Zhiyu Lin et.al. 2503.18071 null
2025-03-23 MedPlan:A Two-Stage RAG-Based System for Personalized Medical Plan Generation Hsin-Ling Hsu et.al. 2503.17900 null
2025-03-22 A Modular Dataset to Demonstrate LLM Abstraction Capability Adam Atanas et.al. 2503.17645 null
2025-03-22 ConSol: Sequential Probability Ratio Testing to Find Consistent LLM Reasoning Paths Efficiently Jaeyeon Lee et.al. 2503.17587 link
2025-03-21 LEMMA: Learning from Errors for MatheMatical Advancement in LLMs Zhuoshi Pan et.al. 2503.17439 null
2025-03-21 V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms Javier J. Poveda Rodrigo et.al. 2503.17422 null
2025-03-21 Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique Yansi Li et.al. 2503.17363 null
2025-03-21 OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement Yihe Deng et.al. 2503.17352 null
2025-03-21 LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language Kun Chu et.al. 2503.17309 null
2025-03-21 Does Chain-of-Thought Reasoning Help Mobile GUI Agent? An Empirical Study Li Zhang et.al. 2503.16788 null
2025-03-20 Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models Chengkai Huang et.al. 2503.16734 null
2025-03-21 MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering Feiyang Li et.al. 2503.16131 null
2025-03-20 Entropy-based Exploration Conduction for Multi-step Reasoning Jinghan Zhang et.al. 2503.15848 null
2025-03-19 LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Federico Cocchi et.al. 2503.15621 link
2025-03-19 EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models Yinan Liang et.al. 2503.15369 null
2025-03-19 Envisioning an AI-Enhanced Mental Health Ecosystem Kellie Yu Hui Sim et.al. 2503.14883 null
2025-03-19 Think Like Human Developers: Harnessing Community Knowledge for Structured Code Reasoning Chengran Yang et.al. 2503.14838 null
2025-03-18 Temporal Consistency for LLM Reasoning Process Error Identification Jiacheng Guo et.al. 2503.14495 link
2025-03-21 Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment Wei Chen et.al. 2503.13879 null
2025-03-18 Empowering GraphRAG with Knowledge Filtering and Integration Kai Guo et.al. 2503.13804 null
2025-03-15 Cognitive Activation and Chaotic Dynamics in Large Language Models: A Quasi-Lyapunov Analysis of Reasoning Mechanisms Xiaojian Li et.al. 2503.13530 null
2025-03-14 RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration Hong Qing Yu et.al. 2503.13514 null
2025-03-17 A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives Weiqiang Jin et.al. 2503.13415 null
2025-03-17 MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research James Burgess et.al. 2503.13399 link
2025-03-17 Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning Hai-Long Sun et.al. 2503.13360 null
2025-03-17 Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning Junming Liu et.al. 2503.12972 null
2025-03-17 R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Jingyi Zhang et.al. 2503.12937 null
2025-03-17 Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation Songjun Tu et.al. 2503.12854 null
2025-03-18 DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Xinyu Ma et.al. 2503.12797 link
2025-03-16 MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification Zhaopan Xu et.al. 2503.12505 null
2025-03-21 Towards Self-Improving Systematic Cognition for Next-Generation Foundation MLLMs Xiaoying Zhang et.al. 2503.12303 link
2025-03-20 Applications of Large Language Model Reasoning in Feature Generation Dharani Chandra et.al. 2503.11989 null
2025-03-14 Neutralizing Bias in LLM Reasoning using Entailment Graphs Liang Cheng et.al. 2503.11614 link
2025-03-14 VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity Jing Bi et.al. 2503.11557 null
2025-03-14 RESPONSE: Benchmarking the Ability of Language Models to Undertake Commonsense Reasoning in Crisis Situation Aissatou Diallo et.al. 2503.11348 null
2025-03-13 Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data Paul Quinlan et.al. 2503.10883 null
2025-03-18 R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization Yi Yang et.al. 2503.10615 link
2025-03-15 VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Yiming Jia et.al. 2503.10582 null
2025-03-13 VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Weiyun Wang et.al. 2503.10291 null
2025-03-18 “Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding Hyunbin Jin et.al. 2503.10167 null
2025-03-13 How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game Ziyue Wang et.al. 2503.10042 link
2025-03-12 MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions Zhe Xu et.al. 2503.09499 link
2025-03-12 A Survey on Enhancing Causal Reasoning Ability of Large Language Models Xin Li et.al. 2503.09326 null
2025-03-11 Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework Zhuo Zhi et.al. 2503.08308 null
2025-03-11 FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback Kangan Qian et.al. 2503.08162 null
2025-03-05 An Optimization Algorithm for Multimodal Data Alignment Wei Zhang et.al. 2503.07636 null
2025-03-11 LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL Yingzhe Peng et.al. 2503.07536 null
2025-03-10 MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning Fanqing Meng et.al. 2503.07365 link
2025-03-10 Dynamic Path Navigation for Motion Agents with LLM Reasoning Yubo Zhao et.al. 2503.07323 null
2025-03-11 Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models Wenxuan Huang et.al. 2503.06749 link
2025-03-09 Graph Retrieval-Augmented LLM for Conversational Recommendation Systems Zhangchi Qiu et.al. 2503.06430 null
2025-03-08 Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models? Kun Xiang et.al. 2503.06252 link
2025-03-15 Integrating Chain-of-Thought for Multimodal Alignment: A Study on 3D Vision-Language Learning Yanjun Chen et.al. 2503.06232 null
2025-03-08 KnowLogic: A Benchmark for Commonsense Reasoning via Knowledge-Driven Data Synthesis Weidong Zhan et.al. 2503.06218 link
2025-03-07 Extracting and Emulsifying Cultural Explanation to Improve Multilingual Capability of LLMs Hamin Koo et.al. 2503.05846 null
2025-03-07 Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning Mufan Xu et.al. 2503.05193 null
2025-03-07 Rewarding Curse: Analyze and Mitigate Reward Modeling Issues for LLM Reasoning Jiachun Li et.al. 2503.05188 null
2025-03-07 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching Simon A. Aytes et.al. 2503.05179 link
2025-03-10 R1-Zero’s “Aha Moment” in Visual Reasoning on a 2B Non-SFT Model Hengguang Zhou et.al. 2503.05132 link
2025-03-04 Learning from Failures in Multi-Attempt Reinforcement Learning Stephen Chung et.al. 2503.04808 null
2025-03-15 Can LLMs Reason About Program Semantics? A Comprehensive Evaluation of LLMs on Formal Specification Inference Thanh Le-Cong et.al. 2503.04779 null
2025-03-06 Better Process Supervision with Bi-directional Rewarding Signals Wenxiang Chen et.al. 2503.04618 null
2025-03-06 SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning Chen Li et.al. 2503.04530 null
2025-03-07 Question-Aware Gaussian Experts for Audio-Visual Question Answering Hongyeob Kim et.al. 2503.04459 link
2025-03-06 Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English Runtao Zhou et.al. 2503.04099 null
2025-03-06 ReasonGraph: Visualisation of Reasoning Paths Zongqian Li et.al. 2503.03979 link
2025-03-05 Process-based Self-Rewarding Language Models Shimao Zhang et.al. 2503.03746 null
2025-03-05 COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence Wentao Li et.al. 2503.03215 null
2025-03-04 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models Ke Ji et.al. 2503.02875 null
2025-03-04 Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models Zhifei Xie et.al. 2503.02318 null
2025-03-04 LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation Yunbo Long et.al. 2503.02161 null
2025-03-03 CorrA: Leveraging Large Language Models for Dynamic Obstacle Avoidance of Autonomous Vehicles Shanting Wang et.al. 2503.02076 null
2025-03-03 Graph-Augmented Reasoning: Evolving Step-by-Step Knowledge Graph Retrieval for LLM Reasoning Wenjie Wu et.al. 2503.01642 null
2025-03-03 Pragmatic Inference Chain (PIC) Improving LLMs’ Reasoning of Authentic Implicit Toxic Language Xi Chen et.al. 2503.01539 null
2025-03-03 CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive Task Solving and Reasoning in UAVs Artem Lykov et.al. 2503.01378 null
2025-03-06 SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph Teng Lin et.al. 2503.01346 null
2025-03-03 MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation Yi Wang et.al. 2503.01298 null
2025-02-28 Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations Zhongqi Yang et.al. 2503.00134 null
2025-02-28 Contextualizing biological perturbation experiments through language Menghua Wu et.al. 2502.21290 link
2025-02-28 Rectifying Belief Space via Unlearning to Harness LLMs’ Reasoning Ayana Niwa et.al. 2502.20620 null
2025-02-27 FINEREASON: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving Guizhen Chen et.al. 2502.20238 link
2025-02-27 Collaborative Stance Detection via Small-Large Language Model Consistency Verification Yu Yan et.al. 2502.19954 link
2025-02-27 Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models Yuan Sui et.al. 2502.19918 null
2025-02-27 Order Doesn’t Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation Qianxi He et.al. 2502.19907 null
2025-03-21 Towards Multimodal Large-Language Models for Parent-Child Interaction: A Focus on Joint Attention Weiyan Shi et.al. 2502.19877 null
2025-03-05 Weaker LLMs’ Opinions Also Matter: Mixture of Opinions Enhances LLM’s Mathematical Reasoning Yanan Chen et.al. 2502.19622 null
2025-02-26 General Reasoning Requires Learning to Reason from the Get-go Seungwook Han et.al. 2502.19402 null
2025-02-26 BIG-Bench Extra Hard Mehran Kazemi et.al. 2502.19187 link
2025-02-25 Scalable Best-of-N Selection for Large Language Models via Self-Certainty Zhewei Kang et.al. 2502.18581 null
2025-02-25 SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Yuxiang Wei et.al. 2502.18449 null
2025-02-25 Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning Wenkai Yang et.al. 2502.18080 null
2025-02-21 Improving Value-based Process Verifier via Structural Prior Injection Zetian Sun et.al. 2502.17498 null
2025-02-24 Making LLMs Reason? The Intermediate Language Problem in Neurosymbolic Approaches Alexander Beiser et.al. 2502.17216 null
2025-02-24 Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI Syed Abdul Gaffar Shakhadri et.al. 2502.17092 null
2025-02-24 Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology Longchao Da et.al. 2502.17026 null
2025-02-24 All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark Davide Testa et.al. 2502.16989 null
2025-02-24 AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models Qin Zhu et.al. 2502.16906 link
2025-02-24 The Blessing of Reasoning: LLM-Based Contrastive Explanations in Black-Box Recommender Systems Yuyan Wang et.al. 2502.16759 null
2025-02-23 Reasoning about Affordances: Causal and Compositional Reasoning in LLMs Magnus F. Gjerde et.al. 2502.16606 null
2025-02-22 ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning Shulin Huang et.al. 2502.16268 null
2025-02-27 Dynamic Parallel Tree Search for Efficient LLM Reasoning Yifu Ding et.al. 2502.16235 null
2025-02-22 Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations Chunyang Li et.al. 2502.16169 link
2025-03-04 Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models Qianqi Yan et.al. 2502.16033 null
2025-02-21 MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use Zaid Khan et.al. 2502.15872 null
2025-02-21 Do Multilingual LLMs Think In English? Lisa Schut et.al. 2502.15603 null
2025-02-21 Evaluating Social Biases in LLM Reasoning Xuyang Wu et.al. 2502.15361 null
2025-02-21 Stepwise Informativeness Search for Improving LLM Reasoning Siyuan Wang et.al. 2502.15335 null
2025-02-21 Latent Factor Models Meets Instructions:Goal-conditioned Latent Factor Discovery without Task Supervision Zhouhang Xie et.al. 2502.15147 null
2025-02-19 SIFT: Grounding LLM Reasoning in Contexts via Stickers Zihao Zeng et.al. 2502.14922 link
2025-02-18 Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence Bhavik Agarwal et.al. 2502.14905 null
2025-03-04 Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison Aiswarya Baby et.al. 2502.14827 null
2025-02-20 Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning Tian Xie et.al. 2502.14768 link
2025-02-19 Enhancing LLM-Based Recommendations Through Personalized Reasoning Jiahao Liu et.al. 2502.13845 null
2025-02-19 MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering Guanming Xiong et.al. 2502.13428 null
2025-02-19 MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification Linzhuang Sun et.al. 2502.13383 link
2025-02-22 Grounding LLM Reasoning with Knowledge Graphs Alfonso Amayuelas et.al. 2502.13247 null
2025-02-18 Theorem Prover as a Judge for Synthetic Data Generation Joshua Ong Jun Leang et.al. 2502.13137 null
2025-02-18 Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options Lakshmi Nair et.al. 2502.12929 link
2025-02-18 S $^2$ R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning Ruotian Ma et.al. 2502.12853 link
2025-02-18 CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base Cong-Duy Nguyen et.al. 2502.12591 null
2025-02-18 Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights Shubham Parashar et.al. 2502.12521 null
2025-02-18 HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation Hao Liu et.al. 2502.12442 null
2025-02-17 Evaluating Step-by-step Reasoning Traces: A Survey Jinu Lee et.al. 2502.12289 null
2025-02-17 SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs Yige Xu et.al. 2502.12134 null
2025-02-17 TokenSkip: Controllable Chain-of-Thought Compression in LLMs Heming Xia et.al. 2502.12067 link
2025-02-17 Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models Hyunwoo Kim et.al. 2502.11881 null
2025-02-17 Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities Hanbin Wang et.al. 2502.11829 link
2025-02-17 Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning Yuqi Pang et.al. 2502.11751 link
2025-02-17 DeFiScope: Detecting Various DeFi Price Manipulations with LLM Reasoning Juantao Zhong et.al. 2502.11521 null
2025-02-16 Don’t Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls Ante Wang et.al. 2502.11183 null
2025-02-16 LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning Tianshi Zheng et.al. 2502.11176 null
2025-02-15 A Tutorial on LLM Reasoning: Relevant Methods behind ChatGPT o1 Jun Wang et.al. 2502.10867 null
2025-02-28 USER-VLM 360: Personalized Vision Language Models with User-aware Tuning for Social Human-Robot Interactions Hamed Rahimi et.al. 2502.10636 null
2025-02-14 Do Large Language Models Reason Causally Like Us? Even Better? Hanna M. Dettki et.al. 2502.10215 null
2025-02-14 MathConstruct: Challenging LLM Reasoning with Constructive Proofs Mislav Balunović et.al. 2502.10197 null
2025-02-13 MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Dongzhi Jiang et.al. 2502.09621 null
2025-02-14 EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges Clinton J. Wang et.al. 2502.08859 null
2025-02-11 CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs Lejla Skelic et.al. 2502.07980 null
2025-02-05 Reasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language Models Through Logic Unit Alignment Cheryl Li et.al. 2502.07803 null
2025-02-17 Bag of Tricks for Inference-time Computation of LLM Reasoning Fan Liu et.al. 2502.07191 link
2025-02-15 Self-Supervised Prompt Optimization Jinyu Xiang et.al. 2502.06855 link
2025-02-06 Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation Namhee Kim et.al. 2502.06843 null
2025-02-04 Policy Guided Tree Search for Enhanced LLM Reasoning Yang Li et.al. 2502.06813 null
2025-03-11 ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates Ling Yang et.al. 2502.06772 link
2025-02-10 Resurrecting saturated LLM benchmarks with adversarial encoding Igor Ivanov et.al. 2502.06738 null
2025-02-13 LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM Zhi Zhou et.al. 2502.06572 link
2025-02-09 A Generative Framework for Bidirectional Image-Report Understanding in Chest Radiography Nicholas Evans et.al. 2502.05926 null
2025-02-08 Evaluating Vision-Language Models for Emotion Recognition Sree Bhattacharyya et.al. 2502.05660 null
2025-02-07 GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity? Yang Zhou et.al. 2502.05252 link
2025-02-07 Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures Tushar Pandey et.al. 2502.05078 link
2025-02-07 Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research Junde Wu et.al. 2502.04644 link
2025-02-05 Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications Bo Wen et.al. 2502.04384 link
2025-02-05 Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning Jonathan Kim et.al. 2502.04381 null
2025-02-04 Investigating the Robustness of Deductive Reasoning with Large Language Models Fabian Hoppe et.al. 2502.04352 null
2025-02-04 Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Maohao Shen et.al. 2502.02508 null
2025-02-04 CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning Jianfeng Pan et.al. 2502.02390 null
2025-02-08 Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking Jinyang Wu et.al. 2502.02339 null
2025-02-04 Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration Younan Zhu et.al. 2502.01969 null
2025-01-31 Improving Rule-based Reasoning in LLMs via Neurosymbolic Representations Varun Dhanraj et.al. 2502.01657 null
2025-02-03 Position: Empowering Time Series Reasoning with Multimodal LLMs Yaxuan Kong et.al. 2502.01477 null
2025-02-03 ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning Bill Yuchen Lin et.al. 2502.01100 null
2025-02-16 Learning Autonomous Code Integration for Math Language Models Haozhe Wang et.al. 2502.00691 null
2025-02-13 Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning Zhi Zhou et.al. 2502.00511 null
2025-02-14 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 Efficient Reasoning with Hidden Thinking Xuan Shen et.al. 2501.19201 link
2025-01-31 BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning Han Zhong et.al. 2501.18858 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-27 Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations Pablo Valenzuela-Toledo et.al. 2501.16495 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-26 TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Yuxuan Gu et.al. 2501.15674 null
2025-01-28 Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning Zeyu Gan et.al. 2501.15602 link
2025-01-26 Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework Yuhong Sun et.al. 2501.15581 null
2025-02-15 Option-ID Based Elimination For Multiple Choice Questions Zhenhao Zhu et.al. 2501.15175 null
2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null
2025-02-12 GraphSOS: Graph Sampling and Order Selection to Help LLMs Understand Graphs Better Xu Chu et.al. 2501.14427 null
2025-01-23 Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Chang Gong et.al. 2501.13731 null
2025-02-10 Cognitive Paradigms for Evaluating VLMs on Visual Reasoning Task Mohit Vaishnav et.al. 2501.13620 null
2025-01-22 EvidenceMap: Unleashing the Power of Small Language Models with Evidence Analysis for Biomedical Question Answering Chang Zong et.al. 2501.12746 null
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-22 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-17 Evolving Deeper LLM Thinking Kuang-Huei Lee et.al. 2501.09891 null
2025-01-23 Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Fengli Xu et.al. 2501.09686 null
2025-01-15 Multimodal LLMs Can Reason about Aesthetics in Zero-Shot Ruixiang Jiang et.al. 2501.09012 link
2025-02-10 Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data Jiaxing Qiu et.al. 2501.08413 link
2025-01-14 Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning Haoyu Han et.al. 2501.07845 null
2025-01-09 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Yunzhuo Hao et.al. 2501.05444 link
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 null
2025-01-08 DRIVINGVQA: Analyzing Visual Chain-of-Thought Reasoning of Vision Language Models in Real-World Scenarios with Driving Theory Tests Charles Corbière et.al. 2501.04671 null
2025-01-08 Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting Dong-Hai Zhu et.al. 2501.04341 link
2025-01-07 Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation Alireza Salemi et.al. 2501.04167 null
2025-01-07 Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild Wanpeng Hu et.al. 2501.02964 link
2025-01-06 KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models Zaiyi Zheng et.al. 2501.02711 null
2025-01-04 Table as Thought: Exploring Structured Thoughts in LLM Reasoning Zhenjie Sun et.al. 2501.02152 null
2025-01-03 Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models Kaleem Ullah Qasim et.al. 2501.02026 null
2025-01-02 Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search Shuangtao Li et.al. 2501.01478 null
2025-01-02 HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation Runsong Jia et.al. 2501.01203 null
2025-01-03 Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents Chengbo He et.al. 2501.00430 null
2024-12-31 EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta Raymond Bernard et.al. 2501.00257 null
2024-12-30 Efficiently Serving LLM Reasoning Programs with Certaindex Yichao Fu et.al. 2412.20993 null
2024-12-28 LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning Shuguang Chen et.al. 2412.20227 null
2025-02-17 Token-Budget-Aware LLM Reasoning Tingxu Han et.al. 2412.18547 link
2024-12-23 StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs Hailin Chen et.al. 2412.18011 null
2025-02-09 Evaluating LLM Reasoning in the Operations Research Domain with ORQA Mahdi Mostajabdaveh et.al. 2412.17874 link
2024-12-23 Diving into Self-Evolving Training for Multimodal Reasoning Wei Liu et.al. 2412.17451 null
2024-12-21 SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization Tan-Hanh Pham et.al. 2412.16771 null
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-19 Eliciting Causal Abilities in Large Language Models for Reasoning Tasks Yajing Wang et.al. 2412.15314 link
2024-12-19 Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying Federico Castagna et.al. 2412.15177 link
2024-12-19 Progressive Multimodal Reasoning via Active Retrieval Guanting Dong et.al. 2412.14835 null
2024-12-19 FiVL: A Framework for Improved Vision-Language Alignment Estelle Aflalo et.al. 2412.14672 null
2024-12-19 FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis Abdullah Khan et.al. 2412.14492 link
2024-12-18 Cognition Chain for Explainable Psychological Stress Detection on Social Media Xin Wang et.al. 2412.14009 null
2024-12-27 Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence Jinghan He et.al. 2412.13949 null
2025-02-16 Do Language Models Understand Time? Xi Ding et.al. 2412.13845 link
2024-12-18 Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games Wenye Lin et.al. 2412.13602 null
2024-12-17 ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models Yuxi Sun et.al. 2412.12848 null
2024-12-12 A NotSo Simple Way to Beat Simple Bench Soham Sane et.al. 2412.12173 null
2024-12-11 What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis Jiayu Liu et.al. 2412.12157 null
2025-02-18 A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges Yibo Yan et.al. 2412.11936 null
2024-12-24 Stepwise Reasoning Error Disruption Attack of LLMs Jingyu Peng et.al. 2412.11934 null
2024-12-16 Leveraging Retrieval-Augmented Tags for Large Vision-Language Understanding in Complex Scenes Antonio Carlos Rivera et.al. 2412.11396 null
2024-12-15 SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation Hang Zhang et.al. 2412.11026 null
2024-12-15 Entropy-Regularized Process Reward Model Hanning Zhang et.al. 2412.11006 link
2024-12-14 Optimizing Vision-Language Interactions Through Decoder-Only Models Kaito Tanaka et.al. 2412.10758 null
2024-12-14 Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation Sukai Huang et.al. 2412.10675 null
2024-12-14 Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data Xue Wu et.al. 2412.10654 null
2024-12-13 EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing Umar Khalid et.al. 2412.10566 null
2024-12-13 Atomic Learning Objectives Labeling: A High-Resolution Approach for Physics Education Naiming Liu et.al. 2412.09914 null
2025-01-18 Neptune: The Long Orbit to Benchmarking Long Video Understanding Arsha Nagrani et.al. 2412.09582 link
2025-02-14 Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning Zhenni Bi et.al. 2412.09078 link
2024-12-11 Training Large Language Models to Reason in a Continuous Latent Space Shibo Hao et.al. 2412.06769 link
2025-01-23 GameArena: Evaluating LLM Reasoning through Live Computer Games Lanxiang Hu et.al. 2412.06394 null
2024-12-08 Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt Damien de Mijolla et.al. 2412.05967 null
2024-12-06 MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale Jarvis Guo et.al. 2412.05237 null
2024-12-05 Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Yiheng Xu et.al. 2412.04454 null
2024-12-05 SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactions Bufang Yang et.al. 2412.04036 null
2024-12-04 DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation Qingdong He et.al. 2412.03255 null
2024-12-03 Explainable CTR Prediction via LLM Reasoning Xiaohan Yu et.al. 2412.02588 null
2025-02-12 NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers Angel Yahir Loredo Lopez et.al. 2412.01621 null
2025-01-13 Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability Zicheng Lin et.al. 2411.19943 link
2024-11-29 TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension Zipeng Qiu et.al. 2411.19504 link
2024-11-29 COLD: Causal reasOning in cLosed Daily activities Abhinav Joshi et.al. 2411.19500 link
2024-12-16 Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Di Zhang et.al. 2411.18203 null
2024-11-26 NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects? Jiaxuan Li et.al. 2411.17794 null
2024-11-25 Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision Zhiheng Xi et.al. 2411.16579 null
2024-11-22 On the Impact of Fine-Tuning on Chain-of-Thought Reasoning Elita Lobo et.al. 2411.15382 null
2024-11-21 Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Yuhao Dong et.al. 2411.14432 link
2024-11-20 BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games Davide Paglieri et.al. 2411.13543 null
2024-11-20 Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving Hao Zhou et.al. 2411.13076 null
2024-11-15 Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination Haojie Zheng et.al. 2411.12591 link
2024-12-23 Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus Terufumi Morishita et.al. 2411.12498 link
2024-11-18 Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation Mingchao Qi et.al. 2411.11714 link
2024-12-31 Enhancing LLM Reasoning with Reward-guided Tree Search Jinhao Jiang et.al. 2411.11694 null
2024-12-15 A dataset of questions on decision-theoretic reasoning in Newcomb-like problems Caspar Oesterheld et.al. 2411.10588 link
2024-11-15 Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization Weiyun Wang et.al. 2411.10442 null
2025-01-09 LLaVA-CoT: Let Vision Language Models Reason Step-by-Step Guowei Xu et.al. 2411.10440 link
2024-11-15 Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level Andong Deng et.al. 2411.09921 null
2024-11-14 Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering Nghia Trung Ngo et.al. 2411.09213 null
2024-11-13 Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding Deyi Ji et.al. 2411.08516 null
2024-11-18 What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? Katie Kang et.al. 2411.07681 link
2024-11-27 Self-Training Meets Consistency: Improving LLMs’ Reasoning With Consistency-Driven Rationale Evaluation Jaehyeok Lee et.al. 2411.06387 link
2024-11-09 A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization Haoxin Liu et.al. 2411.06018 null
2024-11-11 LLMs as Method Actors: A Model for Prompt Engineering and Architecture Colin Doyle et.al. 2411.05778 link
2024-11-12 Kwai-STaR: Transform LLMs into State-Transition Reasoners Xingyu Lu et.al. 2411.04799 null
2024-11-21 Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Haolin Chen et.al. 2411.04282 link
2024-11-05 CrowdGenUI: Enhancing LLM-Based UI Widget Generation with a Crowdsourced Preference Library Yimeng Liu et.al. 2411.03477 null
2025-01-27 MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs Manar Abdelatty et.al. 2411.03471 link
2024-11-04 RuAG: Learned-rule-augmented Generation for Large Language Models Yudi Zhang et.al. 2411.03349 null
2024-10-30 Vision-Language Models Can Self-Improve Reasoning via Reflection Kanzhi Cheng et.al. 2411.00855 null
2024-11-01 Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling Yiwen Ding et.al. 2411.00750 link
2024-11-01 STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing Jiaru Zou et.al. 2411.00387 null
2024-11-08 GRS-QA – Graph Reasoning-Structured Question Answering Dataset Anish Pahilajani et.al. 2411.00369 null
2024-10-31 Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning Jinghan Zhang et.al. 2410.24155 null
2024-10-31 RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner Fu-Chieh Chang et.al. 2410.23912 null
2024-10-31 OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models Junda Wu et.al. 2410.23703 null
2024-10-30 ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning Millennium Bismay et.al. 2410.23180 link
2024-10-30 On Memorization of Large Language Models in Logical Reasoning Chulin Xie et.al. 2410.23123 null
2024-10-28 Causal Interventions on Causal Paths: Mapping GPT-2’s Reasoning From Syntax to Semantics Isabelle Lee et.al. 2410.21353 null
2024-10-28 Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments Sangmim Song et.al. 2410.20666 null
2024-10-25 Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models Danqing Wang et.al. 2410.20007 null
2024-10-25 Can Stories Help LLMs Reason? Curating Information Space Through Narrative Vahid Sadiri Javadi et.al. 2410.19221 null
2024-10-18 Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning Pengfei He et.al. 2410.19000 link
2024-10-25 CLR-Bench: Evaluating Large Language Models in College-level Reasoning Junnan Dong et.al. 2410.17558 null
2024-10-28 Non-myopic Generation of Language Models for Reasoning and Planning Chang Ma et.al. 2410.17195 link
2024-11-06 Improving Causal Reasoning in Large Language Models: A Survey Longxuan Yu et.al. 2410.16676 link
2024-10-22 A Statistical Analysis of LLMs’ Self-Evaluation Using Proverbs Ryosuke Sonoda et.al. 2410.16640 null
2024-10-21 Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models’ Reasoning with Formal Logic Jason Chan et.al. 2410.16502 null
2024-11-27 On Designing Effective RL Reward at Training Time for LLM Reasoning Jiaxuan Gao et.al. 2410.15115 null
2025-01-28 Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning Xingyu Tan et.al. 2410.14211 null
2024-10-21 Unconstrained Model Merging for Enhanced LLM Reasoning Yiming Zhang et.al. 2410.13699 null
2024-10-16 Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models Linhao Luo et.al. 2410.13080 link
2024-10-16 KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs Yongqin Xu et.al. 2410.12480 null
2024-10-17 Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning Qian Wang et.al. 2410.12464 link
2024-10-16 Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up Jiahao Yuan et.al. 2410.12323 link
2024-10-16 Exploiting LLMs’ Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval Hai-Long Nguyen et.al. 2410.12154 null
2024-10-15 Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming Yilun Hao et.al. 2410.12112 null
2024-10-12 OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models Jun Wang et.al. 2410.09671 null
2024-10-11 P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains Simeng Han et.al. 2410.09207 null
2024-10-11 Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning Yunpeng Gao et.al. 2410.08500 null
2024-10-10 SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation Hang Yin et.al. 2410.08189 null
2024-10-10 Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning Amrith Setlur et.al. 2410.08146 null
2024-10-10 Automatic Curriculum Expert Iteration for Reliable LLM Reasoning Zirui Zhao et.al. 2410.07627 null
2024-10-09 Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis Ahmed Abdullah et.al. 2410.06841 null
2024-10-09 Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning Xiyao Wang et.al. 2410.06508 null
2025-01-02 Filtering Discomforting Recommendations with Large Language Models Jiahao Liu et.al. 2410.05411 null
2024-10-05 Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification Zhenwen Liang et.al. 2410.05318 null
2024-10-06 Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval Pengcheng Jiang et.al. 2410.04585 link
2024-10-03 The Role of Deductive and Inductive Reasoning in Large Language Models Chengkun Cai et.al. 2410.02892 null
2024-10-02 Not All LLM Reasoners Are Created Equal Arian Hosseini et.al. 2410.01748 null
2024-12-25 Interpretable Contrastive Monte Carlo Tree Search Reasoning Zitian Gao et.al. 2410.01707 link
2024-10-02 VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment Amirhossein Kazemnejad et.al. 2410.01679 link
2024-10-02 AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses Xiaotian Lu et.al. 2410.01246 null
2024-10-01 Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness Xiao Peng et.al. 2410.00359 null
2024-10-01 Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis Chun-Hsiao Yeh et.al. 2410.00292 null
2024-10-08 GUNDAM: Aligning Large Language Models with Graph Understanding Sheng Ouyang et.al. 2409.20053 null
2024-09-27 Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs Yanyuan Qiao et.al. 2409.18794 null
2024-10-23 Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning Debargha Ganguly et.al. 2409.17270 null
2024-09-20 CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency Kangsheng Wang et.al. 2409.17174 null
2024-09-20 Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM Zheng Wei Lim et.al. 2409.13949 null
2024-09-19 SituationAdapt: Contextual UI Optimization in Mixed Reality with Situation Awareness via LLM Reasoning Zhipeng Li et.al. 2409.12836 null
2024-10-04 Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning Jiaxin Wen et.al. 2409.12452 link
2024-12-16 Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data Jiaming Zhou et.al. 2409.12437 link
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 link
2024-11-05 Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent Fatemeh Haji et.al. 2409.11527 link
2024-09-16 Enhancing RL Safety with Counterfactual LLM Reasoning Dennis Gross et.al. 2409.10188 link
2024-09-11 Think Together and Work Better: Combining Humans’ and LLMs’ Think-Aloud Outcomes for Effective Text Evaluation SeongYeub Chu et.al. 2409.07355 link

LLM Evaluation

Publish Date Title Authors PDF Code
2025-03-23 Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes Kelly O. Marshall et.al. 2503.18155 null
2025-03-22 GPBench: A Comprehensive and Fine-Grained Benchmark for Evaluating Large Language Models as General Practitioners Zheqing Li et.al. 2503.17599 null
2025-03-20 The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination Yifan Sun et.al. 2503.16402 link
2025-03-20 Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation Shangqing Zhao et.al. 2503.15837 null
2025-03-19 Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering Francesco Maria Molfese et.al. 2503.14996 null
2025-03-13 It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education Shrutika Singh et.al. 2503.13508 null
2025-03-17 REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities Alexander Pugachev et.al. 2503.13102 null
2025-03-14 V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Zixu Cheng et.al. 2503.11495 null
2025-03-13 OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses Angela Lopez-Cardona et.al. 2503.10927 link
2025-03-13 Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data Paul Quinlan et.al. 2503.10883 null
2025-03-13 Commenting Higher-level Code Unit: Full Code, Reduced Code, or Hierarchical Code Summarization Weisong Sun et.al. 2503.10737 null
2025-03-12 Medical Large Language Model Benchmarks Should Prioritize Construct Validity Ahmed Alaa et.al. 2503.10694 null
2025-03-10 ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model Competition Hisham A. Alyahya et.al. 2503.10673 link
2025-03-08 RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs Zhongzhan Huang et.al. 2503.10657 link
2025-03-12 Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts Hongyu Chen et.al. 2503.09347 null
2025-03-08 SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant? Xudong Lu et.al. 2503.06029 null
2025-03-07 SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs Samir Abdaljalil et.al. 2503.05980 null
2025-03-07 RocketEval: Efficient Automated LLM Evaluation via Grading Checklist Tianjun Wei et.al. 2503.05142 null
2025-02-09 Peeking Behind Closed Doors: Risks of LLM Evaluation by Private Data Curators Hritik Bansal et.al. 2503.04756 null
2025-03-07 Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm Hyeonjun Kim et.al. 2503.03796 null
2025-03-04 SAGE: Steering and Refining Dialog Generation with State-Action Augmentation Yizhe Zhang et.al. 2503.03040 link
2025-03-04 Position: Don’t use the CLT in LLM evals with fewer than a few hundred datapoints Sam Bowyer et.al. 2503.01747 null
2025-03-04 DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation Eliya Habba et.al. 2503.01622 null
2025-03-03 None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering Zhi Rui Tam et.al. 2503.01550 null
2025-03-03 SwiLTra-Bench: The Swiss Legal Translation Benchmark Joel Niklaus et.al. 2503.01372 null
2025-03-03 LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains Ling Xiao et.al. 2503.01236 null
2025-03-02 FunBench: Benchmarking Fundus Reading Skills of MLLMs Qijie Wei et.al. 2503.00901 null
2025-03-02 Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks Umar Ali Khan et.al. 2503.00781 null
2025-03-02 Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity Yupu Hao et.al. 2503.00771 link
2025-03-01 U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack Yunfan Gao et.al. 2503.00353 link
2025-02-28 Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking Samar M. Magdy et.al. 2503.00231 null
2025-02-28 Consistency Evaluation of News Article Summaries Generated by Large (and Small) Language Models Colleen Gilhuly et.al. 2502.20647 null
2025-02-26 Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation Yuxiang Wang et.al. 2502.18771 link
2025-02-23 Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation Simin Chen et.al. 2502.17521 link
2025-02-24 Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective Chengyin Xu et.al. 2502.17262 null
2025-02-24 Detecting Benchmark Contamination Through Watermarking Tom Sander et.al. 2502.17259 null
2025-02-24 Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation Jaskaran Singh Walia et.al. 2502.17011 null
2025-02-24 AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha Decay Ziyi Tang et.al. 2502.16789 null
2025-01-30 Retrieval Augmented Generation Based LLM Evaluation For Protocol State Machine Inference With Chain-of-Thought Reasoning Youssef Maklad et.al. 2502.15727 null
2025-03-10 Prompt-to-Leaderboard Evan Frick et.al. 2502.14855 link
2025-03-05 SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines M-A-P Team et.al. 2502.14739 null
2025-02-20 SEA-HELM: Southeast Asian Holistic Evaluation of Language Models Yosephine Susanto et.al. 2502.14301 null
2025-02-20 Transfer-Prompting: Enhancing Cross-Task Adaptation in Large Language Models via Dual-Stage Prompts Optimization Yupeng Chang et.al. 2502.14211 link
2025-02-19 Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above Nishant Balepur et.al. 2502.14127 null
2025-02-19 STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models Narun Raman et.al. 2502.13119 null
2025-02-18 HPSS: Heuristic Prompting Strategy Search for LLM Evaluators Bosi Wen et.al. 2502.13031 null
2025-03-19 None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks Eva Sánchez Salido et.al. 2502.12896 null
2025-02-18 Safe at the Margins: A General Approach to Safety Alignment in Low-Resource English Languages – A Singlish Case Study Isaac Lim et.al. 2502.12485 null
2025-02-17 Deviation Ratings: A General, Clone-Invariant Rating Method Luke Marris et.al. 2502.11645 null
2025-02-21 TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking Shahriar Kabir Nahin et.al. 2502.11187 null
2025-02-15 Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents Mauricio Tec et.al. 2502.10732 null
2025-03-02 An Empirical Analysis of Uncertainty in Large Language Model Evaluations Qiujie Xie et.al. 2502.10709 link
2025-02-25 Accelerating Unbiased LLM Evaluation via Synthetic Feedback Zhaoyi Zhou et.al. 2502.10563 link
2025-02-14 MathConstruct: Challenging LLM Reasoning with Constructive Proofs Mislav Balunović et.al. 2502.10197 null
2025-02-13 Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization Amit Levi et.al. 2502.09755 null
2025-02-13 NestQuant: Nested Lattice Quantization for Matrix Products and LLMs Semyon Savkin et.al. 2502.09720 null
2025-02-12 The Science of Evaluating Foundation Models Jiayi Yuan et.al. 2502.09670 null
2025-02-13 Copilot Arena: A Platform for Code LLM Evaluation in the Wild Wayne Chi et.al. 2502.09328 null
2025-02-12 Revisiting 3D LLM Benchmarks: Are We Really Testing 3D Capabilities? Jiahe Jin et.al. 2502.08503 link
2025-02-11 Forget What You Know about LLMs Evaluations – LLMs are Like a Chameleon Nurit Cohen-Inger et.al. 2502.07445 link
2025-02-10 Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring Alex Heyman et.al. 2502.07087 link
2025-02-10 Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models Lujain Ibrahim et.al. 2502.07077 null
2025-02-07 LLM-Supported Natural Language to Bash Translation Finnian Westenfelder et.al. 2502.06858 link
2025-02-15 Self-Supervised Prompt Optimization Jinyu Xiang et.al. 2502.06855 link
2025-02-10 Resurrecting saturated LLM benchmarks with adversarial encoding Igor Ivanov et.al. 2502.06738 null
2025-02-10 Automatic Evaluation of Healthcare LLMs Beyond Question-Answering Anna Arias-Duart et.al. 2502.06666 null
2025-02-10 Unbiased Evaluation of Large Language Models from a Causal Perspective Meilin Chen et.al. 2502.06655 null
2025-02-10 LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks Xin Zhou et.al. 2502.06215 null
2025-02-05 Aero-LLM: A Distributed Framework for Secure UAV Communication and Intelligent Decision-Making Balakrishnan Dharmalingam et.al. 2502.05220 null
2025-02-06 TruthFlow: Truthful LLM Generation via Representation Flow Correction Hanyu Wang et.al. 2502.04556 null
2025-02-05 How do Humans and Language Models Reason About Creativity? A Comparative Analysis Antonio Laverghetta Jr. et.al. 2502.03253 null
2025-03-22 On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation Nghiem T. Diep et.al. 2502.03029 null
2025-02-02 LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient Peiwen Yuan et.al. 2502.01683 link
2025-02-02 HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs Mehdi Makni et.al. 2502.00899 null
2025-02-01 DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks Zhiliang Chen et.al. 2502.00270 null
2025-01-30 Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation Muhammed Yusuf Kocyigit et.al. 2501.18771 null
2025-01-31 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation Minghua He et.al. 2501.18460 null
2025-02-01 LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering Beiming Liu et.al. 2501.17183 null
2025-03-18 An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue Koji Inoue et.al. 2501.16643 null
2025-01-26 HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI Tidor-Vlad Pricope et.al. 2501.15627 null
2025-01-23 Question Answering on Patient Medical Records with Private Fine-Tuned LLMs Sara Kothari et.al. 2501.13687 null
2025-01-10 CodEv: An Automated Grading Framework Leveraging Large Language Models for Consistent and Constructive Feedback En-Qi Tseng et.al. 2501.10421 null
2025-01-15 Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History Yevhen Kostiuk et.al. 2501.09154 null
2025-01-13 Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles Samia Touileb et.al. 2501.07718 null
2025-01-03 FLAME: Financial Large-Language Model Assessment and Metrics Evaluation Jiayu Guo et.al. 2501.06211 link
2025-01-07 MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems Yannis Katsis et.al. 2501.03468 link
2025-01-05 Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm Ljubisa Bojic et.al. 2501.02532 null
2025-01-04 LLMzSzŁ: a comprehensive LLM benchmark for Polish Krzysztof Jassem et.al. 2501.02266 null
2025-03-25 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2025-01-04 Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation M. Ali Bayram et.al. 2501.00593 null
2024-12-31 Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs Weijia Xu et.al. 2501.00273 null
2024-12-30 EVOLVE: Emotion and Visual Output Learning via LLM Evaluation Jordan Sinclair et.al. 2412.20632 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli et.al. 2412.18407 link
2025-01-25 DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation Junyi Lu et.al. 2412.18291 null
2024-12-23 CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models Ruibo Tu et.al. 2412.17970 link
2025-01-02 Baichuan4-Finance Technical Report Hanyu Zhang et.al. 2412.15270 null
2024-12-19 ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects Qihang Cao et.al. 2412.14837 null
2024-12-18 AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge Xiaobao Wu et.al. 2412.13670 link
2025-02-16 Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning Eitan Wagner et.al. 2412.13631 null
2025-02-17 OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Shuting Wang et.al. 2412.13018 link
2024-12-10 How to Choose a Threshold for an Evaluation Metric for Large Language Models Bhaskarjit Sarmah et.al. 2412.12148 null
2024-12-15 Dual Traits in Probabilistic Reasoning of Large Language Models Shenxiong Li et.al. 2412.11009 link
2024-12-30 LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation Eunsu Kim et.al. 2412.10424 null
2024-12-13 Cultural Evolution of Cooperation among LLM Agents Aron Vallinder et.al. 2412.10270 null
2024-12-12 Towards Understanding the Robustness of LLM-based Evaluations under Perturbations Manav Chaudhary et.al. 2412.09269 null
2024-12-10 BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities Sahal Shaji Mullappilly et.al. 2412.07769 link
2025-02-28 PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models Qian Zhang et.al. 2412.06287 link
2024-12-02 AI Benchmarks and Datasets for LLM Evaluation Todor Ivanov et.al. 2412.01020 null
2024-11-30 Evaluating the Consistency of LLM Evaluators Noah Lee et.al. 2412.00543 null
2024-11-29 MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks John Francis et.al. 2411.19689 null
2024-11-29 Beyond Surface Structure: A Causal Assessment of LLMs’ Comprehension Ability Yujin Han et.al. 2411.19456 link
2024-11-27 Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator Frederic Kirstein et.al. 2411.18444 null
2025-01-17 CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity Zhengmin Yu et.al. 2411.16239 link
2024-11-25 SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text Reshmi Ghosh et.al. 2411.16077 null
2024-11-26 Do LLMs Agree on the Creativity Evaluation of Alternative Uses? Abdullah Al Rabeyah et.al. 2411.15560 null
2025-02-17 Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat Roland Daynauth et.al. 2411.14483 link
2024-11-21 Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models Lovish Madaan et.al. 2411.14103 null
2024-11-21 An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture Boming Xia et.al. 2411.13768 null
2024-11-21 A Framework for Evaluating LLMs Under Task Indeterminacy Luke Guerdan et.al. 2411.13760 null
2024-11-12 Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning Linyang He et.al. 2411.07533 null
2024-11-13 Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models Yancheng He et.al. 2411.07140 null
2024-11-09 Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Xiaojun Wu et.al. 2411.06272 link
2025-02-09 ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding Israel Abebe Azime et.al. 2411.05049 null
2024-11-07 Bayesian Calibration of Win Rate Estimation with LLM Evaluators Yicheng Gao et.al. 2411.04424 link
2024-11-05 Enhancing LLM Evaluations: The Garbling Trick William F. Bradley et.al. 2411.01533 null
2025-02-19 Varco Arena: A Tournament Approach to Reference-Free Benchmarking Large Language Models Seonil Son et.al. 2411.01281 null
2025-02-07 Mastering the Craft of Data Synthesis for CodeLLMs Meng Chen et.al. 2411.00005 link
2024-10-28 Project MPG: towards a generalized performance benchmark for LLM capabilities Lucas Spangher et.al. 2410.22368 null
2024-10-29 Self-Preference Bias in LLM-as-a-Judge Koki Wataoka et.al. 2410.21819 null
2024-10-28 Unveiling Context-Aware Criteria in Self-Assessing LLMs Taneesh Gupta et.al. 2410.21545 null
2024-10-27 LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization Jui-Nan Yen et.al. 2410.20625 null
2024-10-26 Limitations of the LLM-as-a-Judge Approach for Evaluating LLM Outputs in Expert Knowledge Tasks Annalisa Szymanski et.al. 2410.20266 null
2024-10-23 MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning Jingfan Zhang et.al. 2410.18035 null
2025-02-21 Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements Isamu Isozaki et.al. 2410.17141 link
2024-10-21 CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Maosong Cao et.al. 2410.16256 link
2025-01-26 mHumanEval – A Multilingual Benchmark to Evaluate Large Language Models for Code Generation Nishat Raihan et.al. 2410.15037 link
2024-10-19 CAP: Data Contamination Detection via Consistency Amplification Yi Zhao et.al. 2410.15005 null
2024-10-18 Enabling Scalable Evaluation of Bias Patterns in Medical LLMs Hamed Fayyaz et.al. 2410.14763 link
2024-11-06 Diverging Preferences: When do Annotators Disagree and do Models Know? Michael JQ Zhang et.al. 2410.14632 null
2024-10-18 Combining Entropy and Matrix Nuclear Norm for Enhanced Evaluation of Language Models James Vo et.al. 2410.14480 null
2024-10-21 BenTo: Benchmark Task Reduction with In-Context Transferability Hongyu Zhao et.al. 2410.13804 link
2024-10-16 BenchmarkCards: Large Language Model and Risk Reporting Anna Sokol et.al. 2410.12974 null
2025-02-01 Language Model Preference Evaluation with Multiple Weak Evaluators Zhengyu Hu et.al. 2410.12869 link
2024-10-11 Enterprise Benchmarks for Large Language Model Evaluation Bing Zhang et.al. 2410.12857 link
2024-10-16 An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation Junjie Chen et.al. 2410.12265 null
2024-10-15 Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers Lorenzo Pacchiardi et.al. 2410.11672 link
2024-10-15 Black-box Uncertainty Quantification Method for LLM-as-a-Judge Nico Wagner et.al. 2410.11594 null
2024-10-14 Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting Yifan Luo et.al. 2410.10150 null
2024-12-13 HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics Jingxuan Fan et.al. 2410.09988 link
2024-10-15 LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models Han Qiu et.al. 2410.09962 link
2024-10-17 Towards Multilingual LLM Evaluation for European Languages Klaudia Thellmann et.al. 2410.08928 null
2024-10-11 Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example Marcus Kessel et.al. 2410.08911 null
2024-10-10 Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks Mathis Pink et.al. 2410.08133 null
2025-02-03 COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act Philipp Guldimann et.al. 2410.07959 link
2024-11-06 News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News Tarun Jain et.al. 2410.07520 null
2024-10-09 Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates Xiaosen Zheng et.al. 2410.07137 link
2024-10-09 ReIFE: Re-evaluating Instruction-Following Evaluation Yixin Liu et.al. 2410.07069 link
2024-10-08 Active Evaluation Acquisition for Efficient LLM Benchmarking Yang Li et.al. 2410.05952 null
2024-10-07 TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Qingchen Yu et.al. 2410.05262 link
2024-10-01 Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model Aidan Gilson et.al. 2410.03740 null
2024-10-04 TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation Jonathan Cook et.al. 2410.03608 null
2024-10-04 Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores Robert E. Blackwell et.al. 2410.03492 null
2024-10-29 AIME: AI System Optimization via Multiple LLM Evaluators Bhrij Patel et.al. 2410.03131 null
2024-10-02 Comparing Criteria Development Across Domain Experts, Lay Users, and Models in Large Language Model Evaluation Annalisa Szymanski et.al. 2410.02054 null
2024-10-02 Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models Joseph Lee et.al. 2410.01795 link
2024-10-03 Extending Context Window of Large Language Models from a Distributional Perspective Yingsheng Wu et.al. 2410.01490 link
2024-10-02 ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving Yifan Qiao et.al. 2410.01228 null
2024-10-01 ViDAS: Vision-based Danger Assessment and Scoring Pranav Gupta et.al. 2410.00477 null
2024-10-01 PclGPT: A Large Language Model for Patronizing and Condescending Language Detection Hongbo Wang et.al. 2410.00361 link
2024-11-26 LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models Haitao Li et.al. 2409.20288 link
2024-09-29 Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems Xuyang Wu et.al. 2409.19804 null
2024-10-19 Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models Xin Li et.al. 2409.19667 link
2024-10-05 IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation Fan Lin et.al. 2409.18892 link
2024-12-13 A Character-Centric Creative Story Generation via Imagination Kyeongman Park et.al. 2409.16667 null
2024-09-25 Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models Sungjune Park et.al. 2409.16635 null
2024-12-18 Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino Jann Railey Montalan et.al. 2409.15380 link
2024-12-16 MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators Qingyu Lu et.al. 2409.14335 link
2024-09-21 ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models Yuqing Huang et.al. 2409.13989 link
2024-12-17 AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs Basel Mousi et.al. 2409.11404 null
2024-10-02 LLM-as-a-Judge & Reward Model: What They Can and Cannot Do Guijin Son et.al. 2409.11239 null
2024-12-08 Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges Vinay Samuel et.al. 2409.09927 link
2024-09-13 Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia Fajri Koto et.al. 2409.08564 null
2024-09-09 Assessing SPARQL capabilities of Large Language Models Lars-Peter Meyer et.al. 2409.05925 link
2024-10-08 LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs Yuhao Wu et.al. 2409.02076 link
2024-10-14 Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation Jasper Dekoninck et.al. 2409.00696 null
2024-08-26 Evaluating ChatGPT on Nuclear Domain-Specific Data Muhammad Anwar et.al. 2409.00090 null
2024-08-28 LLMSecCode: Evaluating Large Language Models for Secure Coding Anton Rydén et.al. 2408.16100 link
2024-08-26 LLM-3D Print: Large Language Models To Monitor and Control 3D Printing Yayati Jadhav et.al. 2408.14307 null
2024-08-26 Epidemic Information Extraction for Event-Based Surveillance using Large Language Models Sergio Consoli et.al. 2408.14277 null
2024-10-04 MobileQuant: Mobile-friendly Quantization for On-device Language Models Fuwen Tan et.al. 2408.13933 link
2024-08-23 LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models Chongyan Sun et.al. 2408.13338 null
2024-08-23 Open Llama2 Model for the Lithuanian Language Artūras Nakvosas et.al. 2408.12963 null
2024-08-23 LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction Songwei Li et.al. 2408.12832 link
2024-12-20 Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts Jiaqing Liu et.al. 2408.09688 null
2024-08-20 Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge Ravi Raju et.al. 2408.08808 null
2024-10-16 The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation Samee Arif et.al. 2408.08688 link
2024-10-19 Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks Junseok Kim et.al. 2408.08631 null

LLM MLLM

Publish Date Title Authors PDF Code
2025-03-24 Equivariant Image Modeling Ruixiao Dong et.al. 2503.18948 link
2025-03-24 Aether: Geometric-Aware Unified World Modeling Aether Team et.al. 2503.18945 null
2025-03-24 DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation Karim Abou Zeid et.al. 2503.18944 link
2025-03-24 SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding Mingze Xu et.al. 2503.18943 null
2025-03-24 Video-T1: Test-Time Scaling for Video Generation Fangfu Liu et.al. 2503.18942 null
2025-03-24 Exploring Training and Inference Scaling Laws in Generative Retrieval Hongru Cai et.al. 2503.18941 null
2025-03-24 CoMP: Continual Multimodal Pre-training for Vision Foundation Models Yitong Chen et.al. 2503.18931 link
2025-03-24 Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training Brian R. Bartoldson et.al. 2503.18929 null
2025-03-24 FFN Fusion: Rethinking Sequential Computation in Large Language Models Akhiad Bercovich et.al. 2503.18908 null
2025-03-24 xKV: Cross-Layer SVD for KV-Cache Compression Chi-Chih Chang et.al. 2503.18893 null
2025-03-24 AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration Zhexuan Wang et.al. 2503.18891 null
2025-03-24 Toward building next-generation Geocoding systems: a systematic review Zhengcong Yin et.al. 2503.18888 null
2025-03-24 I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Andrey Galichin et.al. 2503.18878 null
2025-03-24 Efficient Self-Supervised Adaptation for Medical Image Analysis Moein Sorkhei et.al. 2503.18873 null
2025-03-24 Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design Rui Xie et.al. 2503.18869 null
2025-03-24 Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations Junlan Chen et.al. 2503.18865 null
2025-03-24 3DSwapping: Texture Swapping For 3D Object From Single Reference Image Xiao Cao et.al. 2503.18853 null
2025-03-24 Defeating Prompt Injections by Design Edoardo Debenedetti et.al. 2503.18813 null
2025-03-24 Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code Augusto B. Corrêa et.al. 2503.18809 null
2025-03-24 REALM: A Dataset of Real-World LLM Use Cases Jingwen Cheng et.al. 2503.18792 null
2025-03-24 BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache Dayou Du et.al. 2503.18773 null
2025-03-24 AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning Alan Dao et.al. 2503.18769 null
2025-03-24 RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation Chengbo Yuan et.al. 2503.18738 null
2025-03-24 Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving Hongkuan Zhou et.al. 2503.18730 null
2025-03-24 LLaVAction: evaluating and training multi-modal large language models for action recognition Shaokai Ye et.al. 2503.18712 null
2025-03-24 Revisiting Automatic Data Curation for Vision Foundation Models in Digital Pathology Boqi Chen et.al. 2503.18709 null
2025-03-24 OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad Luyao Tang et.al. 2503.18695 null
2025-03-25 Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models Yazhou Zhang et.al. 2503.18681 null
2025-03-24 NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping Tianyi Wang et.al. 2503.18678 null
2025-03-24 Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark Bingchen Miao et.al. 2503.18665 null
2025-03-24 From Fragment to One Piece: A Survey on AI-Driven Graphic Design Xingxing Zou et.al. 2503.18641 null
2025-03-24 Adaptive Machine Learning for Resource-Constrained Environments Sebastián A. Cajas Ordóñez et.al. 2503.18634 null
2025-03-24 Generative Dataset Distillation using Min-Max Diffusion Model Junqiao Fan et.al. 2503.18626 null
2025-03-24 Scaling Laws for Emulation of Stellar Spectra Tomasz Różański et.al. 2503.18617 null
2025-03-24 LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment Jong Myoung Kim et.al. 2503.18603 null
2025-03-24 Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization Minsu Kim et.al. 2503.18599 null
2025-03-24 A Universal Model Combining Differential Equations and Neural Networks for Ball Trajectory Prediction Zhiwei Shi et.al. 2503.18584 null
2025-03-24 Anchor-based oversampling for imbalanced tabular data via contrastive and adversarial learning Hadi Mohammadi et.al. 2503.18569 null
2025-03-24 Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures Abdoul Majid O. Thiombiano et.al. 2503.18565 null
2025-03-24 Power-fractional distributions and branching processes Gerold Alsmeyer et.al. 2503.18563 null
2025-03-24 Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models Nariman Naderi et.al. 2503.18562 null
2025-03-25 AMD-Hummingbird: Towards an Efficient Text-to-Video Model Takashi Isobe et.al. 2503.18559 null
2025-03-24 HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications Guneet Mutreja et.al. 2503.18540 null
2025-03-24 SciClaims: An End-to-End Generative System for Biomedical Claim Analysis Raúl Ortega et.al. 2503.18526 null
2025-03-24 P3Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction Yufeng Zhong et.al. 2503.18525 null
2025-03-24 Can Text-to-Video Generation help Video-Language Alignment? Luca Zanella et.al. 2503.18507 null
2025-03-24 Autoregressive Language Models for Knowledge Base Population: A case study in the space mission domain Andrés García-Silva et.al. 2503.18502 null
2025-03-24 Verbal Process Supervision Elicits Better Coding Agents Hao-Yuan Chen et.al. 2503.18494 null
2025-03-24 Safeguarding Mobile GUI Agent via Logic-based Action Verification Jungjae Lee et.al. 2503.18492 null
2025-03-24 Large Language Models powered Network Attack Detection: Architecture, Opportunities and Case Study Xinggong Zhang et.al. 2503.18487 null
2025-03-24 Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification Zequn Zeng et.al. 2503.18483 null
2025-03-24 Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding Xiangrui Liu et.al. 2503.18478 null
2025-03-24 PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models Tadeusz Dziarmaga et.al. 2503.18462 null
2025-03-24 MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing Lingting Zhu et.al. 2503.18461 null
2025-03-24 ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation Jiahui Xiang et.al. 2503.18460 null
2025-03-24 SEAlign: Alignment Training for Software Engineering Agent Kechi Zhang et.al. 2503.18455 null
2025-03-24 InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment Yunhong Lu et.al. 2503.18454 null
2025-03-24 ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation Guosheng Zhao et.al. 2503.18438 null
2025-03-24 A Simple yet Effective Layout Token in Large Language Models for Document Understanding Zhaoqing Zhu et.al. 2503.18434 null
2025-03-24 Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning Junsong Li et.al. 2503.18432 null
2025-03-24 Breaking the Encoder Barrier for Seamless Video-Language Understanding Handong Li et.al. 2503.18422 null
2025-03-25 Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning Sherry X. Chen et.al. 2503.18406 null
2025-03-24 Solving Situation Puzzles with Large Language Model and External Reformulation Kun Li et.al. 2503.18394 null
2025-03-24 Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors Joshua Krook et.al. 2503.18387 null
2025-03-24 Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance Sicong Feng et.al. 2503.18386 null
2025-03-24 Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs Chang Gao et.al. 2503.18377 null
2025-03-24 J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain Yiran Hu et.al. 2503.18360 null
2025-03-24 Mitigating Cache Noise in Test-Time Adaptation for Large Vision-Language Models Haotian Zhai et.al. 2503.18334 null
2025-03-24 Optimizing Influence Campaigns: Nudging under Bounded Confidence Yen-Shao Chen et.al. 2503.18331 null
2025-03-24 Towards Training-free Anomaly Detection with Vision and Language Foundation Models Jinjin Zhang et.al. 2503.18325 null
2025-03-24 Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions Dong Jing et.al. 2503.18320 null
2025-03-24 Knowledge Transfer from LLMs to Provenance Analysis: A Semantic-Augmented Method for APT Detection Fei Zuo et.al. 2503.18316 null
2025-03-24 DeepFund: Will LLM be Professional at Fund Investment? A Live Arena Perspective Changlun Li et.al. 2503.18313 null
2025-03-24 Enhancing LLM-based Code Translation in Repository Context via Triple Knowledge-Augmented Guangsheng Ou et.al. 2503.18305 null
2025-03-24 How to Capture and Study Conversations Between Research Participants and ChatGPT: GPT for Researchers (g4r.org) Jin Kim et.al. 2503.18303 null
2025-03-24 Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module Yishen Liu et.al. 2503.18297 null
2025-03-24 Surgical Action Planning with Large Language Models Mengya Xu et.al. 2503.18296 null
2025-03-24 Fact-checking AI-generated news reports: Can LLMs catch their own lies? Jiayi Yao et.al. 2503.18293 null
2025-03-24 Jenga: Effective Memory Management for Serving LLM with Heterogeneity Chen Zhang et.al. 2503.18292 null
2025-03-24 Sun-Shine: A Large Language Model for Tibetan Culture Cheng Huang et.al. 2503.18288 null
2025-03-24 CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI Siyuan Cheng et.al. 2503.18286 null
2025-03-24 Analyzing Islamophobic Discourse Using Semi-Coded Terms and LLMs Raza Ul Mustafa et.al. 2503.18273 null
2025-03-24 Efficient Inference for Covariate-adjusted Bradley-Terry Model with Covariate Shift Xiudi Li et.al. 2503.18256 null
2025-03-24 Surface-Aware Distilled 3D Semantic Features Lukas Uzolas et.al. 2503.18254 null
2025-03-24 Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages Tadesse Destaw Belay et.al. 2503.18253 null
2025-03-23 CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation Jungsoo Lee et.al. 2503.18244 null
2025-03-23 ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices Aneesh Vathul et.al. 2503.18242 null
2025-03-23 Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters Roberto Garcia et.al. 2503.18216 null
2025-03-23 LakotaBERT: A Transformer-based Model for Low Resource Lakota Language Kanishka Parankusham et.al. 2503.18212 null
2025-03-23 The Power of Small LLMs in Geometry Generation for Physical Simulations Ossama Shafiq et.al. 2503.18178 null
2025-03-23 Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering Zixin Chen et.al. 2503.18172 null
2025-03-23 Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes Kelly O. Marshall et.al. 2503.18155 null
2025-03-23 LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space Zhangyu Wang et.al. 2503.18142 null
2025-03-23 AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs Diwei Wang et.al. 2503.18141 null
2025-03-23 MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation Jiaxin Huang et.al. 2503.18135 null
2025-03-23 MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection Yibo Yan et.al. 2503.18132 null
2025-03-23 Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization Juntao Dai et.al. 2503.18130 null
2025-03-23 GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks Varvara Krechetova et.al. 2503.18129 null
2025-03-23 $D^2LoRA$ : Data-Driven LoRA Initialization for Low Resource Tasks Javad SeraJ et.al. 2503.18089 null
2025-03-23 Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms Nachuan Ma et.al. 2503.18082 null
2025-03-21 Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique Yansi Li et.al. 2503.17363 null
2025-03-21 Position: Interactive Generative Video as Next-Generation Game Engine Jiwen Yu et.al. 2503.17359 null
2025-03-21 HCAST: Human-Calibrated Autonomy Software Tasks David Rein et.al. 2503.17354 null
2025-03-21 NdLinear Is All You Need for Representation Learning Alex Reneau et.al. 2503.17353 null
2025-03-21 OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement Yihe Deng et.al. 2503.17352 null
2025-03-21 Capturing Individual Human Preferences with Reward Features André Barreto et.al. 2503.17338 null
2025-03-21 Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs Reem Gody et.al. 2503.17336 null
2025-03-21 CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities Yuxuan Zhu et.al. 2503.17332 link
2025-03-21 LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language Kun Chu et.al. 2503.17309 null
2025-03-21 Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests John Naulty et.al. 2503.17302 null
2025-03-21 Offline Model-Based Optimization: Comprehensive Review Minsu Kim et.al. 2503.17286 null
2025-03-21 CASE – Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement Gaifan Zhang et.al. 2503.17279 null
2025-03-21 Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras Shuang Guo et.al. 2503.17262 null
2025-03-21 SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging Aladin Djuhera et.al. 2503.17239 null
2025-03-21 FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs Albert Sawczyn et.al. 2503.17229 null
2025-03-21 Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation Giacomo Savazzi et.al. 2503.17224 null
2025-03-21 Automating Adjudication of Cardiovascular Events Using Large Language Models Sonish Sivarajkumar et.al. 2503.17222 null
2025-03-21 TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning Sheng Wang et.al. 2503.17195 null
2025-03-21 LLMs Love Python: A Study of LLMs’ Bias for Programming Languages and Libraries Lukas Twist et.al. 2503.17181 null
2025-03-21 D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens Panpan Wang et.al. 2503.17155 null
2025-03-21 Modifying Large Language Model Post-Training for Diverse Creative Writing John Joon Young Chung et.al. 2503.17126 null
2025-03-21 Large Language Model Compression via the Nested Activation-Aware Decomposition Jun Lu et.al. 2503.17101 null
2025-03-21 Deterministic AI Agent Personality Expression through Standard Psychological Diagnostics J. M. Diederik Kruijssen et.al. 2503.17085 null
2025-03-21 A Study into Investigating Temporal Robustness of LLMs Jonas Wallat et.al. 2503.17073 null
2025-03-21 PVChat: Personalized Video Chat with One-Shot Learning Yufei Shi et.al. 2503.17069 null
2025-03-21 Problem Framing in the AI era: a new model Matteo Tuveri et.al. 2503.17040 null
2025-03-21 AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process Junjie Hu et.al. 2503.17029 null
2025-03-21 RiboFlow: Conditional De Novo RNA Sequence-Structure Co-Design via Synergistic Flow Matching Runze Ma et.al. 2503.17007 null
2025-03-21 Text2Model: Generating dynamic chemical reactor models using large language models (LLMs) Sophia Rupprecht et.al. 2503.17004 null
2025-03-21 A Survey on Personalized Alignment – The Missing Piece for Large Language Models in Real-World Applications Jian Guan et.al. 2503.17003 null
2025-03-21 Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation Qinghe Ma et.al. 2503.16997 null
2025-03-21 TRACE: Time SeRies PArameter EffiCient FinE-tuning Yuze Li et.al. 2503.16991 null
2025-03-21 Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models Haichao Zhang et.al. 2503.16980 null
2025-03-21 Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks Julian Junyan Wang et.al. 2503.16974 null
2025-03-21 Distilling Monocular Foundation Model for Fine-grained Depth Completion Yingping Liang et.al. 2503.16970 null
2025-03-21 HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis Mengtian Li et.al. 2503.16944 null
2025-03-21 TEMPO: Temporal Preference Optimization of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment Shicheng Li et.al. 2503.16929 null
2025-03-21 RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation Linxi Liang et.al. 2503.16922 null
2025-03-21 Malliavin-Bismut Score-based Diffusion Models Ehsan Mirafzali et.al. 2503.16917 null
2025-03-21 FAIT: Fault-Aware Fine-Tuning for Better Code Generation Lishui Fan et.al. 2503.16913 null
2025-03-21 Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation Jingzhi Fang et.al. 2503.16893 null
2025-03-21 Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation Jiangcheng Qin et.al. 2503.16875 null
2025-03-21 MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization Jian Zhang et.al. 2503.16874 null
2025-03-21 Lie Detector: Unified Backdoor Detection via Cross-Examination Framework Xuan Wang et.al. 2503.16872 null
2025-03-21 Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs Anshumann et.al. 2503.16870 null
2025-03-21 Nonparametric Factor Analysis and Beyond Yujia Zheng et.al. 2503.16865 null
2025-03-21 MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering Jialin Chen et.al. 2503.16858 null
2025-03-21 Generative Compositor for Few-Shot Visual Information Extraction Zhibo Yang et.al. 2503.16854 null
2025-03-21 Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models Suho Yoo et.al. 2503.16853 null
2025-03-21 Towards LLM Guardrails via Sparse Representation Steering Zeqing He et.al. 2503.16851 null
2025-03-21 LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models Jian Liang et.al. 2503.16843 null
2025-03-21 Downstream Analysis of Foundational Medical Vision Models for Disease Progression Basar Demir et.al. 2503.16842 null
2025-03-21 When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts Jun Seong Kim et.al. 2503.16826 null
2025-03-21 When Debate Fails: Bias Reinforcement in Large Language Models Jihwan Oh et.al. 2503.16814 null
2025-03-21 Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models Mengsong Wu et.al. 2503.16779 null
2025-03-21 Current and Future Use of Large Language Models for Knowledge Work Michelle Brachman et.al. 2503.16774 null
2025-03-21 On Explaining (Large) Language Models For Code Using Global Code-Based Explanations David N. Palacio et.al. 2503.16771 null
2025-03-20 Automated Harmfulness Testing for Code Large Language Models Honghao Tan et.al. 2503.16740 null
2025-03-20 Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models Chengkai Huang et.al. 2503.16734 null
2025-03-20 Natural Language Generation Emiel van Miltenburg et.al. 2503.16728 null
2025-03-20 Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding Jinlong Li et.al. 2503.16707 null
2025-03-20 APPA : Agentic Preformulation Pathway Assistant Julius Lange et.al. 2503.16698 null
2025-03-20 GAIR: Improving Multimodal Geo-Foundation Model with Geo-Aligned Implicit Representations Zeping Liu et.al. 2503.16683 null
2025-03-20 Echoes of Power: Investigating Geopolitical Bias in US and China Large Language Models Andre G. C. Pacheco et.al. 2503.16679 null
2025-03-20 Accelerating Transformer Inference and Training with 2:4 Activation Sparsity Daniel Haziza et.al. 2503.16672 null
2025-03-20 Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms Niki van Stein et.al. 2503.16668 null
2025-03-20 A preliminary data fusion study to assess the feasibility of Foundation Process-Property Models in Laser Powder Bed Fusion Oriol Vendrell-Gallart et.al. 2503.16667 null
2025-03-20 Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs Maxime Delmas et.al. 2503.16655 null
2025-03-20 Leveraging Large Language Models for Explainable Activity Recognition in Smart Homes: A Critical Evaluation Michele Fiori et.al. 2503.16622 null
2025-03-20 A Recipe for Generating 3D Worlds From a Single Image Katja Schwarz et.al. 2503.16611 null
2025-03-20 Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions Hadi Amini et.al. 2503.16585 null
2025-03-22 Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Yuqing Wang et.al. 2503.16430 null
2025-03-20 DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding Keyan Chen et.al. 2503.16426 link
2025-03-20 SynCity: Training-Free Generation of 3D Worlds Paul Engstler et.al. 2503.16420 null
2025-03-20 Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Yang Sui et.al. 2503.16419 link
2025-03-20 M3: 3D-Spatial MultiModal Memory Xueyan Zou et.al. 2503.16413 link
2025-03-20 DreamTexture: Shape from Virtual Texture with Analysis by Augmentation Ananta R. Bhattarai et.al. 2503.16412 null
2025-03-20 VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness SeungJu Cha et.al. 2503.16406 link
2025-03-20 The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination Yifan Sun et.al. 2503.16402 link
2025-03-20 Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them Guanyu Chen et.al. 2503.16401 null
2025-03-20 Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation Yijia Luo et.al. 2503.16385 link
2025-03-20 LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images Leyang Wang et.al. 2503.16376 null
2025-03-20 JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Muyao Li et.al. 2503.16365 null
2025-03-20 CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners Yunzhi Yao et.al. 2503.16356 link
2025-03-20 Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences Krithik Ramesh et.al. 2503.16351 null
2025-03-20 LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates Ying Shen et.al. 2503.16334 null
2025-03-20 OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence Long Yuan et.al. 2503.16326 null
2025-03-20 Issue2Test: Generating Reproducing Test Cases from Issue Reports Noor Nashid et.al. 2503.16320 null
2025-03-21 Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 Peiran Gu et.al. 2503.16304 null
2025-03-20 SceneMI: Motion In-betweening for Modeling Human-Scene Interactions Inwoo Hwang et.al. 2503.16289 null
2025-03-21 Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens Shuqi Lu et.al. 2503.16278 link
2025-03-20 Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Zijian Li et.al. 2503.16260 null
2025-03-20 Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Keda Tao et.al. 2503.16257 null
2025-03-21 Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Zhaowei Liu et.al. 2503.16252 null
2025-03-20 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t Quy-Anh Dang et.al. 2503.16219 link
2025-03-20 MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion Qizhi Pei et.al. 2503.16212 link
2025-03-20 VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis Chia-Yi Hsu et.al. 2503.16195 null
2025-03-21 Affective Polarization Amongst Swedish Politicians François t’Serstevens et.al. 2503.16193 link
2025-03-20 Large Language Models for Water Distribution Systems Modeling and Decision-Making Yinon Goldshtein et.al. 2503.16191 null
2025-03-20 CLS-RL: Image Classification with Rule-Based Reinforcement Learning Ming Li et.al. 2503.16188 null
2025-03-20 Narrowing Class-Wise Robustness Gaps in Adversarial Training Fatemeh Amerehi et.al. 2503.16179 null
2025-03-20 CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models Hong Yi Lin et.al. 2503.16167 null
2025-03-20 SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs Shibo Jie et.al. 2503.16163 null
2025-03-20 Towards Lighter and Robust Evaluation for Retrieval Augmented Generation Alex-Razvan Ispas et.al. 2503.16161 null
2025-03-20 Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems Shenbin Qian et.al. 2503.16158 null
2025-03-20 Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models Mats Faulborn et.al. 2503.16148 null
2025-03-20 Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs Djamel Eddine Khelladi et.al. 2503.16144 null
2025-03-21 MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering Feiyang Li et.al. 2503.16131 null
2025-03-20 The Impact of Revealing Large Language Model Stochasticity on Trust, Reliability, and Anthropomorphization Chelse Swoopes et.al. 2503.16114 null
2025-03-20 OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP Mohamad Hassan N C et.al. 2503.16106 null
2025-03-20 Cultural Alignment in Large Language Models Using Soft Prompt Tuning Reem I. Masoud et.al. 2503.16094 null
2025-03-20 Quantum Chebyshev Probabilistic Models for Fragmentation Functions Jorge J. Martínez de Lejarza et.al. 2503.16073 null
2025-03-20 Tuning LLMs by RAG Principles: Towards LLM-native Memory Jiale Wei et.al. 2503.16071 null
2025-03-20 SALT: Singular Value Adaptation with Low-Rank Transformation Abdelrahman Elsayed et.al. 2503.16055 null
2025-03-20 Meta-Learning Neural Mechanisms rather than Bayesian Priors Michael Goodale et.al. 2503.16048 null
2025-03-20 Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation Zhiyu Cao et.al. 2503.16043 null
2025-03-20 GreenIQ: A Deep Search Platform for Comprehensive Carbon Market Analysis and Automated Report Generation Bisola Faith Kayode et.al. 2503.16041 null
2025-03-20 Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond Yaoyao Yu et.al. 2503.16040 null
2025-03-20 Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models Zhihang Liu et.al. 2503.16036 null
2025-03-20 The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement Ruihan Yang et.al. 2503.16024 null
2025-03-20 BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models Zenghui Yuan et.al. 2503.16023 null
2025-03-20 Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models Mario Sanz-Guerrero et.al. 2503.16022 null
2025-03-21 Autonomous AI imitators increase diversity in homogeneous information ecosystems Emil Bakkensen Johansen et.al. 2503.16021 null
2025-03-20 GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions Xiaomeng Chu et.al. 2503.16013 null
2025-03-20 “This could save us months of work” – Use Cases of AI and Automation Support in Investigative Journalism Besjon Cifliku et.al. 2503.16011 null
2025-03-20 ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph Langming Liu et.al. 2503.15990 null
2025-03-20 A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli Pengyu Liu et.al. 2503.15978 null
2025-03-20 Stability of Schrödinger bridges and Sinkhorn semigroups for log-concave models Pierre Del Moral et.al. 2503.15963 null
2025-03-20 GAN-enhanced Simulation-driven DNN Testing in Absence of Ground Truth Mohammed Attaoui et.al. 2503.15953 null
2025-03-20 From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models Jinyi Liu et.al. 2503.15944 null
2025-03-21 Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment Gaole Dai et.al. 2503.15937 null
2025-03-20 Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning Peiyi Lin et.al. 2503.15924 null
2025-03-20 SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models Fahao Chen et.al. 2503.15921 null
2025-03-20 Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras Beilei Cui et.al. 2503.15917 null
2025-03-20 From Structured Prompts to Open Narratives: Measuring Gender Bias in LLMs Through Open-Ended Storytelling Evan Chen et.al. 2503.15904 null
2025-03-20 Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models Baolong Bi et.al. 2503.15888 null
2025-03-21 Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance Hui Liu et.al. 2503.15886 null
2025-03-20 DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System Kai Chen et.al. 2503.15876 null
2025-03-20 MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations Kyungho Bae et.al. 2503.15871 null
2025-03-20 TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data Rohit Kundu et.al. 2503.15867 null
2025-03-20 DroidTTP: Mapping Android Applications with TTP for Cyber Threat Intelligence Dincy R Arikkat et.al. 2503.15866 null
2025-03-20 VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling Hyojun Go et.al. 2503.15855 null
2025-03-20 Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey Xiaoou Liu et.al. 2503.15850 null
2025-03-20 Entropy-based Exploration Conduction for Multi-step Reasoning Jinghan Zhang et.al. 2503.15848 null
2025-03-20 Automatic Generation of Safety-compliant Linear Temporal Logic via Large Language Model: A Self-supervised Framework Junle Li et.al. 2503.15840 null
2025-03-20 Enhancing LLM Code Generation with Ensembles: A Similarity-Based Selection Approach Tarek Mahmud et.al. 2503.15838 null
2025-03-20 Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation Shangqing Zhao et.al. 2503.15837 null
2025-03-20 Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection Haotian Ma et.al. 2503.15818 null
2025-03-20 A Vision Centric Remote Sensing Benchmark Abduljaleel Adejumo et.al. 2503.15816 null
2025-03-20 Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing Vishnu Asutosh Dasu et.al. 2503.15815 null
2025-03-20 ChatGPT and U(X): A Rapid Review on Measuring the User Experience Katie Seaborn et.al. 2503.15808 null
2025-03-20 Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture Cheng Li et.al. 2503.15807 null
2025-03-20 DNA Bench: When Silence is Smarter – Benchmarking Over-Reasoning in Reasoning LLMs Masoud Hashemi et.al. 2503.15793 null
2025-03-20 RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models Parham Saremi et.al. 2503.15784 null
2025-03-20 Grammar and Gameplay-aligned RL for Game Description Generation with LLMs Tsunehiko Tanaka et.al. 2503.15783 null
2025-03-20 AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models Boshra Khalili et.al. 2503.15778 null
2025-03-20 Detecting LLM-Written Peer Reviews Vishisht Rao et.al. 2503.15772 null
2025-03-20 Towards Agentic AI Networking in 6G: A Generative Foundation Model-as-Agent Approach Yong Xiao et.al. 2503.15764 null
2025-03-20 Dialogic Learning in Child-Robot Interaction: A Hybrid Approach to Personalized Educational Content Generation Elena Malnatsky et.al. 2503.15762 null
2025-03-20 GraPLUS: Graph-based Placement Using Semantics for Image Composition Mir Mohammad Khaleghi et.al. 2503.15761 null
2025-03-20 AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration Andy Zhou et.al. 2503.15754 null
2025-03-20 Using Language Models to Decipher the Motivation Behind Human Behaviors Yutong Xie et.al. 2503.15752 null
2025-03-19 Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat Joseph Emmanuel DL Dayo et.al. 2503.15726 null
2025-03-21 Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication Sin-Yu Huang et.al. 2503.15722 null
2025-03-19 Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View Mathilde Aguiar et.al. 2503.15718 null
2025-03-19 Safety Aware Task Planning via Large Language Models in Robotics Azal Ahmad Khan et.al. 2503.15707 null
2025-03-19 GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving William Ljungbergh et.al. 2503.15672 null
2025-03-19 Enhancing Pancreatic Cancer Staging with Large Language Models: The Role of Retrieval-Augmented Generation Hisashi Johno et.al. 2503.15664 null
2025-03-19 R $^2$ : A LLM Based Novel-to-Screenplay Generation Framework with Causal Plot Graphs Zefeng Lin et.al. 2503.15655 null
2025-03-19 LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Federico Cocchi et.al. 2503.15621 link
2025-03-19 Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings Austin Xu et.al. 2503.15620 null
2025-03-19 SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Yifei Zhou et.al. 2503.15478 null
2025-03-19 Cube: A Roblox View of 3D Intelligence Foundation AI Team et.al. 2503.15475 null
2025-03-19 EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining Boshen Xu et.al. 2503.15470 null
2025-03-19 From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment Jia-Nan Li et.al. 2503.15463 null
2025-03-19 Di $\mathtt{[M]}$ O: Distilling Masked Diffusion Models into One-step Generator Yuanzhi Zhu et.al. 2503.15457 null
2025-03-19 SkyLadder: Better and Faster Pretraining via Context Window Scheduling Tongyao Zhu et.al. 2503.15450 null
2025-03-19 Visual Position Prompt for MLLM based Visual Grounding Wei Tang et.al. 2503.15426 null
2025-03-19 Probing the topology of the space of tokens with structured prompts Michael Robinson et.al. 2503.15421 null
2025-03-19 LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding Amirhossein Kazerouni et.al. 2503.15420 null
2025-03-19 Temporal Regularization Makes Your Video Generator Stronger Harold Haodong Chen et.al. 2503.15417 null
2025-03-19 Visual Persona: Foundation Model for Full-Body Human Customization Jisu Nam et.al. 2503.15406 null
2025-03-19 FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation Yumin Zhang et.al. 2503.15390 null
2025-03-19 Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers Corentin Vazia et.al. 2503.15383 null
2025-03-19 EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models Yinan Liang et.al. 2503.15369 null
2025-03-19 SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation Thomas Pickard et.al. 2503.15358 null
2025-03-19 SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models I-Fan Lin et.al. 2503.15351 null
2025-03-19 TruthLens:A Training-Free Paradigm for DeepFake Detection Ritabrata Chakraborty et.al. 2503.15342 null
2025-03-19 Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs Yuqi Zhu et.al. 2503.15341 null
2025-03-19 Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context Junyi Ao et.al. 2503.15338 null
2025-03-19 Euclid Quick Data Release (Q1) Exploring galaxy properties with a multi-modal foundation model Euclid Collaboration et.al. 2503.15312 null
2025-03-19 Euclid Quick Data Release (Q1): First visual morphology catalogue Euclid Collaboration et.al. 2503.15310 null
2025-03-19 aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion Jia Li et.al. 2503.15301 null
2025-03-19 Inside-Out: Hidden Factual Knowledge in LLMs Zorik Gekhman et.al. 2503.15299 null
2025-03-19 SENAI: Towards Software Engineering Native Generative Artificial Intelligence Mootez Saad et.al. 2503.15282 null
2025-03-19 MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration David Wan et.al. 2503.15272 null
2025-03-19 Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning? Roberto Araya et.al. 2503.15268 null
2025-03-19 LEGION: Learning to Ground and Explain for Synthetic Image Detection Hengrui Kang et.al. 2503.15264 null
2025-03-19 Efficient allocation of image recognition and LLM tasks on multi-GPU system Marcin Lawenda et.al. 2503.15252 null
2025-03-19 Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study Jomar Thomas Almonte et.al. 2503.15248 null
2025-03-19 Exploring Large Language Models for Word Games:Who is the Spy? Chentian Wei et.al. 2503.15235 null
2025-03-19 When LLMs Meet API Documentation: Can Retrieval Augmentation Aid Code Generation Just as It Helps Developers? Jingyi Chen et.al. 2503.15231 null
2025-03-19 A Personalized Data-Driven Generative Model of Human Motion Angelo Di Porzio et.al. 2503.15225 null
2025-03-19 A Foundation Model for Patient Behavior Monitoring and Suicide Detection Rodrigo Oliver et.al. 2503.15221 null
2025-03-19 Context-Aware Vision Language Foundation Models for Ocular Disease Screening in Retinal Images Lucie Berger et.al. 2503.15212 null
2025-03-19 DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation Jiazhe Guo et.al. 2503.15208 null
2025-03-19 Benchmarking Large Language Models for Handwritten Text Recognition Giorgia Crosilla et.al. 2503.15195 null
2025-03-19 Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems Sejong Kim et.al. 2503.15191 null
2025-03-19 Foundation models may exhibit staged progression in novel CBRN threat disclosure Kevin M Esvelt et.al. 2503.15182 null
2025-03-19 A Review on Large Language Models for Visual Analytics Navya Sonal Agarwal et.al. 2503.15176 null
2025-03-19 Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks Yuting Guo et.al. 2503.15169 null
2025-03-19 Object-Centric Pretraining via Target Encoder Bootstrapping Nikola Đukić et.al. 2503.15141 null
2025-03-19 VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention Mingzhe Zheng et.al. 2503.15138 null
2025-03-19 Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models Man Fai Wong et.al. 2503.15129 null
2025-03-19 Text-Derived Relational Graph-Enhanced Network for Skeleton-Based Action Segmentation Haoyu Ji et.al. 2503.15126 null
2025-03-19 Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification Shichen Li et.al. 2503.15117 null
2025-03-19 DeCaFlow: A Deconfounding Causal Generative Model Alejandro Almodóvar et.al. 2503.15114 null
2025-03-19 Reasoning Effort and Problem Complexity: A Scaling Analysis in LLMs Benjamin Estermann et.al. 2503.15113 null
2025-03-19 OpenLLM-RTL: Open Dataset and Benchmark for LLM-Aided Design RTL Generation Shang Liu et.al. 2503.15112 null
2025-03-19 VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making Mohamed Salim Aissi et.al. 2503.15108 null
2025-03-19 Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings Zonghao Ying et.al. 2503.15092 null
2025-03-19 Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs Yao Cheng et.al. 2503.15091 null
2025-03-19 LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents Ke Zhang et.al. 2503.15079 null
2025-03-19 Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis Imanol G. Estepa et.al. 2503.15060 null
2025-03-19 ELTEX: A Framework for Domain-Driven Synthetic Data Generation Arina Razmyslovich et.al. 2503.15055 link
2025-03-19 Studying and Understanding the Effectiveness and Failures of Conversational LLM-Based Repair Aolin Chen et.al. 2503.15050 null
2025-03-19 SPADE: Systematic Prompt Framework for Automated Dialogue Expansion in Machine-Generated Text Detection Haoyi Li et.al. 2503.15044 null
2025-03-19 DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling Jianbo Zhao et.al. 2503.15029 null
2025-03-19 Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene Shengqiong Wu et.al. 2503.15019 null
2025-03-19 LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones? Amr Keleg et.al. 2503.15003 null
2025-03-19 Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering Francesco Maria Molfese et.al. 2503.14996 null
2025-03-19 ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents Hao Liang et.al. 2503.14948 null
2025-03-19 Generating Multimodal Driving Scenes via Next-Scene Prediction Yanhao Wu et.al. 2503.14945 null
2025-03-19 UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation Qihui Zhang et.al. 2503.14941 null
2025-03-19 VisNumBench: Evaluating Number Sense of Multimodal Large Language Models Tengjin Weng et.al. 2503.14939 null
2025-03-19 Proceedings of the 3rd Italian Conference on Big Data and Data Science (ITADATA2024) Nicola Bena et.al. 2503.14937 null
2025-03-19 FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding Chongjun Tu et.al. 2503.14935 null
2025-03-19 Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices Ziyao Wang et.al. 2503.14932 null
2025-03-19 GenM $^3$ : Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation Junyu Shi et.al. 2503.14919 null
2025-03-19 MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models Jiazheng Li et.al. 2503.14917 null
2025-03-19 Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology Siyuan Yan et.al. 2503.14911 null
2025-03-19 POSTA: A Go-to Framework for Customized Artistic Poster Generation Haoyu Chen et.al. 2503.14908 null
2025-03-19 Deep Contrastive Unlearning for Language Models Estrid He et.al. 2503.14900 null
2025-03-19 When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach Vaibhav Rathore et.al. 2503.14897 null
2025-03-19 Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations Shuo Li et.al. 2503.14895 null
2025-03-19 MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer Honglin Lin et.al. 2503.14891 null
2025-03-19 Pseudo-Relevance Feedback Can Improve Zero-Shot LLM-Based Dense Retrieval Hang Li et.al. 2503.14887 null
2025-03-19 Envisioning an AI-Enhanced Mental Health Ecosystem Kellie Yu Hui Sim et.al. 2503.14883 null
2025-03-19 Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks Kai Zhang et.al. 2503.14882 null
2025-03-19 Chemical Foundation Model Guided Design of High Ionic Conductivity Electrolyte Formulations Murtaza Zohair et.al. 2503.14878 null
2025-03-19 Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection Peipeng Yu et.al. 2503.14853 null
2025-03-19 LogLLaMA: Transformer-based log anomaly detection with LLaMA Zhuoyi Yang et.al. 2503.14849 null
2025-03-19 Think Like Human Developers: Harnessing Community Knowledge for Structured Code Reasoning Chengran Yang et.al. 2503.14838 null
2025-03-19 Robust Transmission of Punctured Text with Large Language Model-based Recovery Sojeong Park et.al. 2503.14831 null
2025-03-19 MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models Chejian Xu et.al. 2503.14827 null
2025-03-18 Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection Matt Franchi et.al. 2503.14754 null
2025-03-18 Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence Sophia Hager et.al. 2503.14749 null
2025-03-18 GR00T N1: An Open Foundation Model for Generalist Humanoid Robots NVIDIA et.al. 2503.14734 null
2025-03-18 CodingGenie: A Proactive LLM-Powered Programming Assistant Sebastian Zhao et.al. 2503.14724 null
2025-03-18 Generating Medically-Informed Explanations for Depression Detection using LLMs Xiangyong Chen et.al. 2503.14671 null
2025-03-18 RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving Wenqi Jiang et.al. 2503.14649 null
2025-03-18 Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache Hanchen Li et.al. 2503.14647 null
2025-03-18 Reinforcement learning-based motion imitation for physiologically plausible musculoskeletal motor control Merkourios Simos et.al. 2503.14637 null
2025-03-18 Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving Priscylla Silva et.al. 2503.14630 null
2025-03-18 Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives Sara Sarto et.al. 2503.14604 null
2025-03-18 Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM Yazeed Alnumay et.al. 2503.14603 null
2025-03-18 Aligning Multimodal LLM with Human Preference: A Survey Tao Yu et.al. 2503.14504 null
2025-03-18 Deeply Supervised Flow-Based Generative Models Inkyu Shin et.al. 2503.14494 null
2025-03-18 Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control NVIDIA et.al. 2503.14492 link
2025-03-18 Engineering Scientific Assistants using Interactive Structured Induction of Programs Shraddha Surana et.al. 2503.14488 null
2025-03-18 Gricean Norms as a Basis for Effective Collaboration Fardin Saad et.al. 2503.14484 link
2025-03-18 ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing Yulin Pan et.al. 2503.14482 null
2025-03-18 Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Xinyu Fang et.al. 2503.14478 link
2025-03-18 The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models Erminia Calabrese et.al. 2503.14454 null
2025-03-18 Bolt3D: Generating 3D Scenes in Seconds Stanislaw Szymanowicz et.al. 2503.14445 null
2025-03-18 EnvBench: A Benchmark for Automated Environment Setup Aleksandra Eliseeva et.al. 2503.14443 link
2025-03-18 LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Nikhil Abhyankar et.al. 2503.14434 link
2025-03-18 PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play Wei Fang et.al. 2503.14432 null
2025-03-18 Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models Siwei Zhang et.al. 2503.14411 null
2025-03-18 Large Language Models for Virtual Human Gesture Selection Parisa Ghanad Torshizi et.al. 2503.14408 null
2025-03-18 DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers Mert Bulent Sariyildiz et.al. 2503.14405 null
2025-03-18 Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance Lisha Li et.al. 2503.14402 null
2025-03-18 From “Hallucination” to “Suture”: Insights from Language Philosophy to Enhance Large Language Models Qiantong Wang et.al. 2503.14392 null
2025-03-18 How much do LLMs learn from negative examples? Shadi Hamdan et.al. 2503.14391 null
2025-03-18 Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation Rikuto Tsuchida et.al. 2503.14382 null
2025-03-18 On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller? Pouria Sarhadi et.al. 2503.14379 link
2025-03-18 Impossible Videos Zechen Bai et.al. 2503.14378 null
2025-03-18 RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment Chao Wang et.al. 2503.14358 null
2025-03-18 MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts Runqi Meng et.al. 2503.14355 null
2025-03-18 MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration Yisen Xu et.al. 2503.14340 null
2025-03-18 DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies Wei Song et.al. 2503.14324 link
2025-03-18 COPA: Comparing the Incomparable to Explore the Pareto Front Adrián Javaloy et.al. 2503.14321 null
2025-03-18 RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT Yuheng Li et.al. 2503.14304 null
2025-03-18 Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs Nicolas Le Roux et.al. 2503.14286 null
2025-03-18 DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal Vaibhav Aggarwal et.al. 2503.14269 link
2025-03-18 Quantization-Free Autoregressive Action Transformer Ziyad Sheebaelhamd et.al. 2503.14259 link
2025-03-18 InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being Guang Dai et.al. 2503.14257 null
2025-03-18 Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search Yu Feng et.al. 2503.14251 null
2025-03-19 KG-IRAG: A Knowledge Graph-Based Iterative Retrieval-Augmented Generation Framework for Temporal Reasoning Ruiyi Yang et.al. 2503.14234 null
2025-03-18 CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models Yuyang Xue et.al. 2503.14232 null
2025-03-18 Decision Tree Induction Through LLMs via Semantically-Aware Evolution Tennison Liu et.al. 2503.14217 null
2025-03-18 Inferring Event Descriptions from Time Series with Language Models Mingtian Tan et.al. 2503.14190 link
2025-03-18 Towards Harmless Multimodal Assistants with Blind Preference Optimization Yongqi Li et.al. 2503.14189 null
2025-03-18 Can LLMs Enable Verification in Mainstream Programming? Aleksandr Shefer et.al. 2503.14183 null
2025-03-18 EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models Zongyun Zhang et.al. 2503.14162 null
2025-03-18 Speculative Decoding for Verilog: Speed and Quality, All in One Changran Xu et.al. 2503.14153 null
2025-03-18 Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding Zining Wang et.al. 2503.14140 null
2025-03-18 CARE: A QLoRA-Fine Tuned Multi-Domain Chatbot With Fast Learning On Minimal Hardware Ankit Dutta et.al. 2503.14136 null
2025-03-18 Inference-Time Intervention in Large Language Models for Reliable Requirement Verification Paul Darm et.al. 2503.14130 null
2025-03-18 SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models Subhadeep Koley et.al. 2503.14129 null
2025-03-18 PET-MAD, a universal interatomic potential for advanced materials modeling Arslan Mazitov et.al. 2503.14118 link
2025-03-18 DangerMaps: Personalized Safety Advice for Travel in Urban Environments using a Retrieval-Augmented Language Model Jonas Oppenlaender et.al. 2503.14103 null
2025-03-18 Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency Jiangxuan Long et.al. 2503.14076 null
2025-03-18 Fast Autoregressive Video Generation with Diagonal Decoding Yang Ye et.al. 2503.14070 null
2025-03-18 AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark Xinhao Xiang et.al. 2503.14064 link
2025-03-18 Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach Tianshu Wu et.al. 2503.14051 null
2025-03-18 Learning on LLM Output Signatures for gray-box LLM Behavior Analysis Guy Bar-Shalom et.al. 2503.14043 link
2025-03-18 Intra and Inter Parser-Prompted Transformers for Effective Image Restoration Cong Wang et.al. 2503.14037 link
2025-03-18 Synthetic Data Generation Using Large Language Models: Advances in Text and Code Mihai Nadas et.al. 2503.14023 null
2025-03-18 MP-GUI: Modality Perception with MLLMs for GUI Understanding Ziwei Wang et.al. 2503.14021 link
2025-03-18 Predicting Human Choice Between Textually Described Lotteries Eyal Marantz et.al. 2503.14004 null
2025-03-18 MeshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative Modeling Damian Boborzi et.al. 2503.14002 link
2025-03-18 The KoLMogorov Test: Compression by Code Generation Ori Yoran et.al. 2503.13992 null
2025-03-18 Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks Mykyta Syromiatnikov et.al. 2503.13988 link
2025-03-18 DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection Jaewoo Song et.al. 2503.13985 null
2025-03-18 SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability Jiankang Wang et.al. 2503.13983 null
2025-03-18 Empowering LLMs in Decision Games through Algorithmic Data Synthesis Haolin Wang et.al. 2503.13980 null
2025-03-18 FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks Siqi Zhang et.al. 2503.13966 null
2025-03-18 MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding Siwei Han et.al. 2503.13964 link
2025-03-18 Survey of Adversarial Robustness in Multimodal Large Language Models Chengze Jiang et.al. 2503.13962 null
2025-03-18 Improving LLM Video Understanding with 16 Frames Per Second Yixuan Li et.al. 2503.13956 null
2025-03-18 ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models Alexey Karev et.al. 2503.13923 null
2025-03-18 MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments Zhengsheng Guo et.al. 2503.13882 null
2025-03-18 MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation Donggon Jang et.al. 2503.13881 link
2025-03-18 Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment Wei Chen et.al. 2503.13879 null
2025-03-18 Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations Rui Yang et.al. 2503.13857 null
2025-03-18 MDTeamGPT: A Self-Evolving LLM-based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation Kai Chen et.al. 2503.13856 null
2025-03-18 Causal Discovery from Data Assisted by Large Language Models Kamyar Barakati et.al. 2503.13833 null
2025-03-18 Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection Chunlei Li et.al. 2503.13828 link
2025-03-18 LLM-Empowered IoT for 6G Networks: Architecture, Challenges, and Solutions Xiaopei Chen et.al. 2503.13819 null
2025-03-18 Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models Mingming Peng et.al. 2503.13813 null
2025-03-18 The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations Suyash Fulay et.al. 2503.13812 null
2025-03-18 Empowering GraphRAG with Knowledge Filtering and Integration Kai Guo et.al. 2503.13804 null
2025-03-18 LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation Yang Zhou et.al. 2503.13794 null
2025-03-18 Mapping the Trust Terrain: LLMs in Software Engineering – Insights and Perspectives Dipin Khati et.al. 2503.13793 null
2025-03-17 Mitigating KV Cache Competition to Enhance User Experience in LLM Inference Haiying Shen et.al. 2503.13773 null
2025-03-17 Do Large Language Models Understand Performance Optimization? Bowen Cui et.al. 2503.13772 null
2025-03-17 Continual Unlearning for Foundational Text-to-Image Models without Generalization Erosion Kartik Thakral et.al. 2503.13769 null
2025-03-17 AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications Haiying Shen et.al. 2503.13737 null
2025-03-17 CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings Daniil Orel et.al. 2503.13733 null
2025-03-17 FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models Minghan Li et.al. 2503.13684 null
2025-03-17 Pensez: Less Data, Better Reasoning – Rethinking French LLM Huy Hoang Ha et.al. 2503.13661 null
2025-03-17 INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations Qian Meng et.al. 2503.13660 null
2025-03-17 SOSecure: Safer Code Generation with RAG and StackOverflow Discussions Manisha Mukherjee et.al. 2503.13654 null
2025-03-17 Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos Chiara Plizzari et.al. 2503.13646 link
2025-03-17 Plasmon-Plasmon Interaction in Nanoparticle Assemblies: Role of the Dipole-Quadrupole Coupling Olivier Masset et.al. 2503.13645 null
2025-03-17 Evaluating Programming Language Confusion Micheline Bénédicte Moumoula et.al. 2503.13620 null
2025-03-17 MetaScale: Test-Time Scaling with Evolving Meta-Thoughts Qin Liu et.al. 2503.13447 null
2025-03-17 MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation Zhenyu Wu et.al. 2503.13446 null
2025-03-17 Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance Noah Y. Siegel et.al. 2503.13445 null
2025-03-17 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Ye Liu et.al. 2503.13444 null
2025-03-17 Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images Tianhao Wu et.al. 2503.13439 null
2025-03-17 xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference Maximilian Beck et.al. 2503.13427 link
2025-03-17 Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation Xinyu Lian et.al. 2503.13424 null
2025-03-17 A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives Weiqiang Jin et.al. 2503.13415 null
2025-03-18 DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective Dengyun Peng et.al. 2503.13413 link
2025-03-17 Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis Alexander Ku et.al. 2503.13401 null
2025-03-17 MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research James Burgess et.al. 2503.13399 link
2025-03-17 Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning Mengyao Lyu et.al. 2503.13383 null
2025-03-17 Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning Hai-Long Sun et.al. 2503.13360 null
2025-03-17 Agents Play Thousands of 3D Video Games Zhongwen Xu et.al. 2503.13356 null
2025-03-17 Valid Text-to-SQL Generation with Unification-based DeepStochLog Ying Jiao et.al. 2503.13342 link
2025-03-17 LearnMate: Enhancing Online Education with LLM-Powered Personalized Learning Plans and Support Xinyu Jessica Wang et.al. 2503.13340 null
2025-03-17 LEAVS: An LLM-based Labeler for Abdominal CT Supervision Ricardo Bigolin Lanfredi et.al. 2503.13330 link
2025-03-17 Edit Transfer: Learning Image Editing via Vision In-Context Relations Lan Chen et.al. 2503.13327 null
2025-03-17 Computation Mechanism Behind LLM Position Generalization Chi Han et.al. 2503.13305 null
2025-03-17 LIMCA: LLM for Automating Analog In-Memory Computing Architecture Design Exploration Deepak Vungarala et.al. 2503.13301 null
2025-03-17 A Survey on Transformer Context Extension: Approaches and Evaluation Yijun Liu et.al. 2503.13299 null
2025-03-17 LLM-Match: An Open-Sourced Patient Matching Model Based on Large Language Models and Retrieval-Augmented Generation Xiaodi Li et.al. 2503.13281 null
2025-03-17 Knowledge-Aware Iterative Retrieval for Multi-Agent Systems Seyoung Song et.al. 2503.13275 null
2025-03-17 Graph Generative Models Evaluation with Masked Autoencoder Chengen Wang et.al. 2503.13271 null
2025-03-17 TablePilot; Recommending Human-Preferred Tabular Data Analysis with Large Language Models Deyin Yi et.al. 2503.13262 null
2025-03-17 MindEye-OmniAssist: A Gaze-Driven LLM-Enhanced Assistive Robot System for Implicit Intention Recognition and Task Execution Zejia Zhang et.al. 2503.13250 null
2025-03-17 Can Language Models Follow Multiple Turns of Entangled Instructions? Chi Han et.al. 2503.13222 null
2025-03-17 Dense Policy: Bidirectional Autoregressive Learning of Actions Yue Su et.al. 2503.13217 null
2025-03-17 MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis Marvin Seyfarth et.al. 2503.13211 null
2025-03-17 Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach Sinan Fan et.al. 2503.13208 null
2025-03-17 MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways Zhen Chen et.al. 2503.13205 null
2025-03-17 3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o Dingning Liu et.al. 2503.13185 null
2025-03-17 Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs Jasmin Wachter et.al. 2503.13149 null
2025-03-17 Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images Yaxi Chen et.al. 2503.13131 null
2025-03-17 3D Human Interaction Generation: A Survey Siyuan Fan et.al. 2503.13120 null
2025-03-17 VeriLeaky: Navigating IP Protection vs Utility in Fine-Tuning for LLM-Driven Verilog Coding Zeng Wang et.al. 2503.13116 null
2025-03-17 MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs Erik Daxberger et.al. 2503.13111 null
2025-03-17 DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry Jing Li et.al. 2503.13110 null
2025-03-17 Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences Kedi Chen et.al. 2503.13109 null
2025-03-17 Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference Hao Yin et.al. 2503.13108 link
2025-03-17 ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models Hao Yin et.al. 2503.13107 link
2025-03-17 Managing Hybrid Solid-State Drives Using Large Language Models Qian Wei et.al. 2503.13105 null
2025-03-17 REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities Alexander Pugachev et.al. 2503.13102 null
2025-03-17 Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa Babangida Sani et.al. 2503.13101 null
2025-03-17 ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning Baohao Liao et.al. 2503.13089 null
2025-03-17 A Framework to Assess Multilingual Vulnerabilities of LLMs Likai Tang et.al. 2503.13081 null
2025-03-17 Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation Yihong Luo et.al. 2503.13070 null
2025-03-17 Do Vision Models Develop Human-Like Progressive Difficulty Understanding? Zeyi Huang et.al. 2503.13058 null
2025-03-17 MaskSDM with Shapley values to improve flexibility, robustness, and explainability in species distribution modeling Robin Zbinden et.al. 2503.13057 null
2025-03-17 Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided, Self-Consistent MLLMs for Food Preparation Task Planning Yu-Hong Shen et.al. 2503.13055 null
2025-03-17 InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving Ruiqi Song et.al. 2503.13047 null
2025-03-17 Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task Junjie Chen et.al. 2503.13038 null
2025-03-17 How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark Roba Al Majzoub et.al. 2503.12990 link
2025-03-17 A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models Palakorn Achananuparp et.al. 2503.12989 null
2025-03-17 ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM Wenqiang Wang et.al. 2503.12988 null
2025-03-17 Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning Junming Liu et.al. 2503.12972 null
2025-03-17 Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity Eliot Beyler et.al. 2503.12966 null
2025-03-17 Training Video Foundation Models with NVIDIA NeMo Zeeshan Patel et.al. 2503.12964 null
2025-03-17 HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding Jiahe Zhao et.al. 2503.12955 null
2025-03-17 HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model Haiyang Guo et.al. 2503.12941 null
2025-03-17 R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization Jingyi Zhang et.al. 2503.12937 null
2025-03-17 Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs Wei Hung et.al. 2503.12932 null
2025-03-17 MirrorGuard: Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting Rui Pu et.al. 2503.12931 null
2025-03-17 Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models Zhiyi Huang et.al. 2503.12923 null
2025-03-17 ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs Pengcheng Wen et.al. 2503.12918 null
2025-03-17 HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models Xinyan Jiang et.al. 2503.12908 null
2025-03-17 Optimizing Ansatz Design in Quantum Generative Adversarial Networks Using Large Language Models Kento Ueda et.al. 2503.12884 null
2025-03-17 nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity Tianqi Luo et.al. 2503.12880 null
2025-03-17 An interpretable approach to automating the assessment of biofouling in video footage Evelyn J. Mannix et.al. 2503.12875 null
2025-03-17 UniReg: Foundation Model for Controllable Medical Image Registration Zi Li et.al. 2503.12868 null
2025-03-17 Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English Duke Nguyen et.al. 2503.12858 null
2025-03-17 Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation Songjun Tu et.al. 2503.12854 null
2025-03-17 ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing Aditi Tiwari et.al. 2503.12852 null
2025-03-17 GuideDog: A Real-World Egocentric Multimodal Dataset for Blind and Low-Vision Accessibility-Aware Guidance Junhyeok Kim et.al. 2503.12844 null
2025-03-18 Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data Haozhe Si et.al. 2503.12843 null
2025-03-17 A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules Kairong Luo et.al. 2503.12811 null
2025-03-17 Grounded Chain-of-Thought for Multimodal Large Language Models Qiong Wu et.al. 2503.12799 null
2025-03-18 DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Xinyu Ma et.al. 2503.12797 link
2025-03-17 Quantum-Enhanced LLM Efficient Fine Tuning Xiaofei Kong et.al. 2503.12790 null
2025-03-17 SAM2 for Image and Video Segmentation: A Comprehensive Survey Zhang Jiaxing et.al. 2503.12781 null
2025-03-17 NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models Sung-Yeon Park et.al. 2503.12772 null
2025-03-17 A Survey on Human Interaction Motion Generation Kewei Sui et.al. 2503.12763 link
2025-03-17 RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning Jerry Huang et.al. 2503.12759 null
2025-03-17 VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis Zhifeng Wang et.al. 2503.12758 null
2025-03-17 MAP: Multi-user Personalization with Collaborative LLM-powered Agents Christine Lee et.al. 2503.12757 link
2025-03-17 Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering Kenneth J. K. Ong et.al. 2503.12722 null
2025-03-17 Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective Luca Collini et.al. 2503.12721 null
2025-03-16 AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration Javier Tirado-Garín et.al. 2503.12701 null
2025-03-16 A Continual Learning-driven Model for Accurate and Generalizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT Dazhou Guo et.al. 2503.12698 null
2025-03-16 AI Agents: Evolution, Architecture, and Real-World Applications Naveen Krishnan et.al. 2503.12687 null
2025-03-16 ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory Liangyu Wang et.al. 2503.12668 link
2025-03-16 Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility Jacob Chmura et.al. 2503.12667 null
2025-03-16 Quantum Chemistry Driven Molecular Inverse Design with Data-free Reinforcement Learning Francesco Calcagno et.al. 2503.12653 null
2025-03-16 UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing Tsu-Jui Fu et.al. 2503.12652 null
2025-03-16 VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures Yoo Yeon Sung et.al. 2503.12651 null
2025-03-16 FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization Hao Mark Chen et.al. 2503.12649 null
2025-03-16 LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization Alessio Spagnoletti et.al. 2503.12615 null
2025-03-16 VISO-Grasp: Vision-Language Informed Spatial Object-centric 6-DoF Active View Planning and Grasping in Clutter and Invisibility Yitian Shi et.al. 2503.12609 null
2025-03-16 Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey Yaoting Wang et.al. 2503.12605 link
2025-03-16 SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language Models Kunyang Sun et.al. 2503.12602 null
2025-03-14 From few to many maps: A fast map-level emulator for extreme augmentation of CMB systematics datasets P. Campeti et.al. 2503.11643 link
2025-03-14 Gradient-bridged Posterior: Bayesian Inference for Models with Implicit Functions Cheng Zeng et.al. 2503.11637 null
2025-03-14 ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning Xinyi Wang et.al. 2503.11617 link
2025-03-14 Pathology Image Compression with Pre-trained Autoencoders Srikar Yellapragada et.al. 2503.11591 null
2025-03-14 Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space Zhiliang Chen et.al. 2503.11586 link
2025-03-14 SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Ahmed Nassar et.al. 2503.11576 null
2025-03-14 Synthesizing Access Control Policies using Large Language Models Adarsh Vatsa et.al. 2503.11573 null
2025-03-14 Implicit Bias-Like Patterns in Reasoning Models Messi H. J. Lee et.al. 2503.11572 null
2025-03-14 VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity Jing Bi et.al. 2503.11557 null
2025-03-14 AugGen: Synthetic Augmentation Can Improve Discriminative Models Parsa Rahimi et.al. 2503.11544 null
2025-03-14 Potential of large language model-powered nudges for promoting daily water and energy conservation Zonghan Li et.al. 2503.11531 null
2025-03-14 Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models Hao Cheng et.al. 2503.11519 null
2025-03-14 HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Ziqin Zhou et.al. 2503.11513 null
2025-03-14 Perfect Stabilization of Biomolecular Adhesions under Load Anton F. Burnet et.al. 2503.11510 null
2025-03-14 V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Zixu Cheng et.al. 2503.11495 null
2025-03-14 A Review of DeepSeek Models’ Key Innovative Techniques Chengen Wang et.al. 2503.11486 null
2025-03-14 Exponential Quantum Advantage for Simulating Open Classical Systems Agi Villanyi et.al. 2503.11483 null
2025-03-14 T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation Seyed Mohammad Hadi Hosseini et.al. 2503.11481 null
2025-03-14 Integrating LLMs in Gamified Systems Carlos J. Costa et.al. 2503.11458 null
2025-03-14 D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning Jia Zhang et.al. 2503.11441 null
2025-03-14 Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models Xu Liu et.al. 2503.11411 null
2025-03-14 A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving Tin Stribor Sohn et.al. 2503.11400 null
2025-03-14 Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages Jiyeong Kim et.al. 2503.11384 null
2025-03-14 Modeling Subjectivity in Cognitive Appraisal with Language Models Yuxiang Zhou et.al. 2503.11381 null
2025-03-14 Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches Panggih Kusuma Ningrum et.al. 2503.11376 null
2025-03-14 Cornstarch: Distributed Multimodal Training Must Be Multimodality-Aware Insu Jang et.al. 2503.11367 link
2025-03-14 PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models Mayank Nautiyal et.al. 2503.11360 null
2025-03-14 Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data Analysis Zhenyi Zhang et.al. 2503.11347 null
2025-03-14 AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation Fengyu Li et.al. 2503.11346 link
2025-03-14 Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models Aissatou Diallo et.al. 2503.11336 null
2025-03-14 Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking Ziyi Wang et.al. 2503.11324 null
2025-03-14 MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens Jeong Hun Yeo et.al. 2503.11315 link
2025-03-14 Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering Xinyu Tang et.al. 2503.11314 link
2025-03-14 Are formal and functional linguistic mechanisms dissociated? Michael Hanna et.al. 2503.11302 link
2025-03-14 GNNs as Predictors of Agentic Workflow Performances Yuanshuo Zhang et.al. 2503.11301 link
2025-03-14 BriLLM: Brain-inspired Large Language Model Hai Zhao et.al. 2503.11299 null
2025-03-14 High-Dimensional Interlingual Representations of Large Language Models Bryan Wilie et.al. 2503.11280 null
2025-03-14 When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective Alireza Mousavi-Hosseini et.al. 2503.11272 link
2025-03-14 CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy Jonas Utz et.al. 2503.11266 null
2025-03-14 Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model Haoyang Huang et.al. 2503.11251 link
2025-03-14 Reasoning-Grounded Natural Language Explanations for Language Models Vojtech Cahlik et.al. 2503.11248 link
2025-03-14 LLMPerf: GPU Performance Modeling meets Large Language Models Khoi N. M. Nguyen et.al. 2503.11244 link
2025-03-14 PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders Ahmed Frikha et.al. 2503.11232 null
2025-03-14 Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment Ke Wang et.al. 2503.11229 null
2025-03-14 GKG-LLM: A Unified Framework for Generalized Knowledge Graph Construction Jian Zhang et.al. 2503.11227 null
2025-03-14 Heterogeneously structured compartmental models of epidemiological systems: from individual-level processes to population-scale dynamics Emanuele Bernardi et.al. 2503.11225 null
2025-03-14 Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? Giacomo Camposampiero et.al. 2503.11207 link
2025-03-14 LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs Leqi Shen et.al. 2503.11205 null
2025-03-14 Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering Gang Li et.al. 2503.11197 link
2025-03-14 FastVID: Dynamic Density Pruning for Fast Video Large Language Models Leqi Shen et.al. 2503.11187 link
2025-03-14 Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification Yingjie Zhang et.al. 2503.11185 null
2025-03-14 Palette of Language Models: A Solver for Controlled Text Generation Zhe Yang et.al. 2503.11182 null
2025-03-14 Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity Chi Xu et.al. 2503.11164 null
2025-03-14 Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models Shaotian Yan et.al. 2503.11154 null
2025-03-14 SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets Hao Liu et.al. 2503.11133 null
2025-03-14 Don’t Forget It! Conditional Sparse Autoencoder Clamping Works for Unlearning Matthew Khoriaty et.al. 2503.11127 null
2025-03-14 Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers Yifang Chen et.al. 2503.11108 null
2025-03-14 Quantifying Interpretability in CLIP Models with Concept Consistency Avinash Madasu et.al. 2503.11103 null
2025-03-14 Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space Weichen Zhan et.al. 2503.11094 link
2025-03-14 OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning Yuan Liu et.al. 2503.11093 null
2025-03-14 EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks Yi Zhang et.al. 2503.11089 null
2025-03-14 A Survey of Cross-domain Graph Learning: Progress and Future Directions Haihong Zhao et.al. 2503.11086 link
2025-03-14 Prompt Alchemy: Automatic Prompt Refinement for Enhancing Code Generation Sixiang Ye et.al. 2503.11085 link
2025-03-14 LLMs are Bug Replicators: An Empirical Study on LLMs’ Capability in Completing Bug-prone Code Liwei Guo et.al. 2503.11082 link
2025-03-14 Understanding Flatness in Generative Models: Its Role and Benefits Taehwan Lee et.al. 2503.11078 null
2025-03-14 Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities Xueyang Zhou et.al. 2503.11074 null
2025-03-14 Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models Hongyang Wei et.al. 2503.11073 link
2025-03-14 Falcon: A Remote Sensing Vision-Language Foundation Model Kelu Yao et.al. 2503.11070 link
2025-03-14 API Agents vs. GUI Agents: Divergence and Convergence Chaoyun Zhang et.al. 2503.11069 null
2025-03-14 DeepSeek Powered Solid Dosage Formulation Design and Development Leqi Lin et.al. 2503.11068 null
2025-03-14 Generative Modelling for Mathematical Discovery Jordan S. Ellenberg et.al. 2503.11061 link
2025-03-14 BannerAgency: Advertising Banner Design with Multimodal LLM Agents Heng Wang et.al. 2503.11060 null
2025-03-14 Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization Kyle Sargent et.al. 2503.11056 null
2025-03-14 Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning Jieyi Tan et.al. 2503.11051 null
2025-03-14 PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing Hasan Iqbal et.al. 2503.11044 null
2025-03-14 Beyond A Single AI Cluster: A Survey of Decentralized LLM Training Haotian Dong et.al. 2503.11023 null
2025-03-14 An LLM’s Attempts to Adapt to Diverse Software Engineers’ Problem-Solving Styles: More Inclusive & Equitable? Andrew Anderson et.al. 2503.11018 null
2025-03-14 RONA: Pragmatically Diverse Image Captioning with Coherence Relations Aashish Anantha Ramakrishnan et.al. 2503.10997 link
2025-03-14 TigerLLM – A Family of Bangla Large Language Models Nishat Raihan et.al. 2503.10995 null
2025-03-14 Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium Kaizhao Liu et.al. 2503.10990 link
2025-03-14 From Dionysius Emerges Apollo – Learning Patterns and Abstractions from Perceptual Sequences Shuchen Wu et.al. 2503.10973 null
2025-03-14 Combinatorial Optimization for All: Using LLMs to Aid Non-Experts in Improving Optimization Algorithms Camilo Chacón Sartori et.al. 2503.10968 null
2025-03-13 Empirical Computation Eric Tang et.al. 2503.10954 null
2025-03-13 Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations Piyush Gupta et.al. 2503.10941 null
2025-03-13 ChatGPT Encounters Morphing Attack Detection: Zero-Shot MAD with Multi-Modal Large Language Models and General Vision Models Haoyu Zhang et.al. 2503.10937 null
2025-03-13 OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses Angela Lopez-Cardona et.al. 2503.10927 link
2025-03-13 Learning to Inference Adaptively for Multimodal Large Language Models Zhuoyan Xu et.al. 2503.10905 null
2025-03-13 Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification Nathaniel Lesperance et.al. 2503.10886 null
2025-03-13 Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data Paul Quinlan et.al. 2503.10883 null
2025-03-13 SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable Jiaxin Zhang et.al. 2503.10881 null
2025-03-13 Teamwork makes the dream work: LLMs-Based Agents for GitHub README.MD Summarization Duc S. H. Nguyen et.al. 2503.10876 null
2025-03-13 Panopticon: Advancing Any-Sensor Foundation Models for Earth Observation Leonard Waldmann et.al. 2503.10845 link
2025-03-13 Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs? So Young Lee et.al. 2503.10838 null
2025-03-13 Exploiting Concavity Information in Gaussian Process Contextual Bandit Optimization Kevin Li et.al. 2503.10836 null
2025-03-13 Thinking Machines: A Survey of LLM based Reasoning Strategies Dibyanayan Bandyopadhyay et.al. 2503.10814 null
2025-03-13 HALURust: Exploiting Hallucinations of Large Language Models to Detect Vulnerabilities in Rust Yu Luo et.al. 2503.10793 null
2025-03-13 Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview Norbert Tihanyi et.al. 2503.10784 null
2025-03-13 Large-scale Pre-training for Grounded Video Caption Generation Evangelos Kazakos et.al. 2503.10781 link
2025-03-13 GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Rongyao Fang et.al. 2503.10639 link
2025-03-13 HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model Jiaming Liu et.al. 2503.10631 null
2025-03-13 UniGoal: Towards Universal Zero-shot Goal-oriented Navigation Hang Yin et.al. 2503.10630 null
2025-03-13 From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM Kshitij Ambilduke et.al. 2503.10620 link
2025-03-13 Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search Andy Zhou et.al. 2503.10619 null
2025-03-13 Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models Andy Zhou et.al. 2503.10617 null
2025-03-13 R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization Yi Yang et.al. 2503.10615 link
2025-03-13 CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing Advait Gupta et.al. 2503.10613 link
2025-03-13 TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention Jinhao Duan et.al. 2503.10602 link
2025-03-13 CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models Hao He et.al. 2503.10592 null
2025-03-13 Unlock the Power of Unlabeled Data in Language Driving Model Chaoqun Wang et.al. 2503.10586 null
2025-03-13 Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures Nina Vesseron et.al. 2503.10576 null
2025-03-13 Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models Afrar Jahin et.al. 2503.10573 null
2025-03-13 ASIDE: Architectural Separation of Instructions and Data in Language Models Egor Zverev et.al. 2503.10566 null
2025-03-13 Short-term AI literacy intervention does not reduce over-reliance on incorrect ChatGPT recommendations Brett Puppart et.al. 2503.10556 null
2025-03-13 KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation Zixian Liu et.al. 2503.10546 null
2025-03-13 DP-GPL: Differentially Private Graph Prompt Learning Jing Xu et.al. 2503.10544 null
2025-03-13 Foundation Models for Atomistic Simulation of Chemistry and Materials Eric C. -Y. Yuan et.al. 2503.10538 null
2025-03-13 PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models Zilu Guo et.al. 2503.10529 null
2025-03-13 Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set Florian Eichin et.al. 2503.10515 link
2025-03-13 Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression Hooman Shahrokhi et.al. 2503.10512 null
2025-03-13 SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models Sahar Admoni et.al. 2503.10509 null
2025-03-13 TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models Xudong Tan et.al. 2503.10501 link
2025-03-13 MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation Weihao Xuan et.al. 2503.10497 null
2025-03-13 Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents Hanxu Hu et.al. 2503.10494 link
2025-03-13 Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion Evgeniia Vu et.al. 2503.10488 null
2025-03-13 LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions Gaurav Kumar Gupta et.al. 2503.10486 null
2025-03-13 Siamese Foundation Models for Crystal Structure Prediction Liming Wu et.al. 2503.10471 null
2025-03-13 DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation Wenhao Hu et.al. 2503.10452 null
2025-03-13 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Wanhua Li et.al. 2503.10437 null
2025-03-13 Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback Derun Li et.al. 2503.10434 null
2025-03-13 BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models Can Zheng et.al. 2503.10432 null
2025-03-13 Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning Jonathan Shaki et.al. 2503.10408 null
2025-03-13 RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models Yijing Lin et.al. 2503.10406 null
2025-03-13 RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing Fengxiang Wang et.al. 2503.10392 link
2025-03-13 CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance Yufan Deng et.al. 2503.10391 null
2025-03-13 SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading Qiaoling Chen et.al. 2503.10377 null
2025-03-13 Probabilistic Forecasting via Autoregressive Flow Matching Ahmed El-Gazzar et.al. 2503.10375 null
2025-03-13 G-Boost: Boosting Private SLMs with General LLMs Yijiang Fan et.al. 2503.10367 null
2025-03-13 Piece it Together: Part-Based Concepting with IP-Priors Elad Richardson et.al. 2503.10365 null
2025-03-13 BioSerenity-E1: a self-supervised EEG model for medical applications Ruggero G. Bettinardi et.al. 2503.10362 null
2025-03-13 Collaborative Speculative Inference for Efficient LLM Inference Serving Luyao Gao et.al. 2503.10325 null
2025-03-13 IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification Yuhao Wang et.al. 2503.10324 null
2025-03-13 Towards Fast, Memory-based and Data-Efficient Vision-Language Policy Haoxuan Li et.al. 2503.10322 null
2025-03-13 Capturing Semantic Flow of ML-based Systems Shin Yoo et.al. 2503.10310 null
2025-03-13 Test Amplification for REST APIs Using “Out-of-the-box” Large Language Models Tolgahan Bardakci et.al. 2503.10306 null
2025-03-13 CoDiPhy: A General Framework for Applying Denoising Diffusion Models to the Physical Layer of Wireless Communication Systems Peyman Neshaastegaran et.al. 2503.10297 null
2025-03-13 VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Weiyun Wang et.al. 2503.10291 null
2025-03-13 MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment Hao Zhou et.al. 2503.10287 null
2025-03-13 An Expanded Massive Multilingual Dataset for High-Performance Language Technologies Laurie Burchell et.al. 2503.10267 null
2025-03-13 Numerical Error Analysis of Large Language Models Stanislav Budzinskiy et.al. 2503.10251 null
2025-03-13 LLM Agents Display Human Biases but Exhibit Distinct Learning Patterns Idan Horowitz et.al. 2503.10248 null
2025-03-13 MinorBench: A hand-built benchmark for content-based risks for children Shaun Khoo et.al. 2503.10242 null
2025-03-13 SCOOP: A Framework for Proactive Collaboration and Social Continual Learning through Natural Language Interaction andCausal Reasoning Dimitri Ognibene et.al. 2503.10241 null
2025-03-13 Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA Zhixuan Li et.al. 2503.10225 null
2025-03-13 Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout Shilong Wang et.al. 2503.10217 null
2025-03-13 Adaptive Preference Aggregation Benjamin Heymann et.al. 2503.10215 null
2025-03-13 Singular Value Fine-tuning for Few-Shot Class-Incremental Learning Zhiwu Wang et.al. 2503.10214 null
2025-03-13 Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation Henglyu Liu et.al. 2503.10211 null
2025-03-13 LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents Boyu Chen et.al. 2503.10200 null
2025-03-13 Robustness Tokens: Towards Adversarial Robustness of Transformers Brian Pulfer et.al. 2503.10191 link
2025-03-13 “Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding Hyunbin Jin et.al. 2503.10167 null
2025-03-13 Retrieval-Augmented Generation with Hierarchical Knowledge Haoyu Huang et.al. 2503.10150 link
2025-03-13 Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding Jinze Li et.al. 2503.10135 null
2025-03-13 PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models Runze He et.al. 2503.10127 null
2025-03-13 Hybrid Agents for Image Restoration Bingchen Li et.al. 2503.10120 null
2025-03-13 StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error Shu-Xun Yang et.al. 2503.10105 link
2025-03-13 AgentDAO: Synthesis of Proposal Transactions Via Abstract DAO Semantics Lin Ao et.al. 2503.10099 null
2025-03-13 Semantic Latent Motion for Portrait Video Generation Qiyuan Zhang et.al. 2503.10096 null
2025-03-13 Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text Avinash Patil et.al. 2503.10095 null
2025-03-13 Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model Qiyuan Deng et.al. 2503.10093 null
2025-03-13 Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset Xintong Dong et.al. 2503.10092 null
2025-03-13 Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective Xiang Zhang et.al. 2503.10084 null
2025-03-13 AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption Joonsung Jeon et.al. 2503.10081 link
2025-03-13 Information Density Principle for MLLM Benchmarks Chunyi Li et.al. 2503.10079 link
2025-03-13 VMBench: A Benchmark for Perception-Aligned Video Motion Generation Xinrang Ling et.al. 2503.10076 link
2025-03-13 SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation Xiangyu Shi et.al. 2503.10069 null
2025-03-13 Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations Ho Hin Lee et.al. 2503.10057 link
2025-03-13 Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy Ziqi Jia et.al. 2503.10049 null
2025-03-13 How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game Ziyue Wang et.al. 2503.10042 link
2025-03-13 NumScout: Unveiling Numerical Defects in Smart Contracts using LLM-Pruning Symbolic Execution Jiachi Chen et.al. 2503.10041 link
2025-03-13 OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model Bowen Zhang et.al. 2503.10009 link
2025-03-13 TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs Yunxiao Wang et.al. 2503.09994 null
2025-03-13 Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes JunYong Choi et.al. 2503.09993 null
2025-03-13 From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs Rohan Bhatnagar et.al. 2503.09986 null
2025-03-13 ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content Bhavik Chandna et.al. 2503.09964 null
2025-03-13 Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification Jiayu Jiang et.al. 2503.09962 link
2025-03-13 RMG: Real-Time Expressive Motion Generation with Self-collision Avoidance for 6-DOF Companion Robotic Arms Jiansheng Li et.al. 2503.09959 null
2025-03-13 Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey Yu Qiao et.al. 2503.09956 null
2025-03-13 UVE: Are MLLMs Unified Evaluators for AI-Generated Videos? Yuanxin Liu et.al. 2503.09949 link
2025-03-13 PluralLLM: Pluralistic Alignment in LLMs via Federated Learning Mahmoud Srewa et.al. 2503.09925 null
2025-03-13 Inter-environmental world modeling for continuous and compositional dynamics Kohei Hayashi et.al. 2503.09911 null
2025-03-12 Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets Zahra Abbasiantaeb et.al. 2503.09902 link
2025-03-12 Improving the Reusability of Conversational Search Test Collections Zahra Abbasiantaeb et.al. 2503.09899 link
2025-03-12 What’s In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models Abhipsha Das et.al. 2503.09894 link
2025-03-12 On the contraction properties of Sinkhorn semigroups O. Deniz Akyildiz et.al. 2503.09887 null
2025-03-12 CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation Hariprasath Govindarajan et.al. 2503.09878 null
2025-03-12 LuciBot: Automated Robot Policy Learning from Generated Videos Xiaowen Qiu et.al. 2503.09871 null
2025-03-12 Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis Nahid Ul Islam et.al. 2503.09860 null
2025-03-12 Media and responsible AI governance: a game-theoretic and LLM analysis Nataliya Balabanova et.al. 2503.09858 null
2025-03-12 MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System Jihao Zhao et.al. 2503.09600 link
2025-03-12 How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation Ruohao Guo et.al. 2503.09598 link
2025-03-12 PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop Chenyu Li et.al. 2503.09595 link
2025-03-12 SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment Katrin Renz et.al. 2503.09594 null
2025-03-12 BIMBA: Selective-Scan Compression for Long-Range Video Question Answering Md Mohaiminul Islam et.al. 2503.09590 link
2025-03-12 Minimax Optimality of the Probability Flow ODE for Diffusion Models Changxiao Cai et.al. 2503.09583 null
2025-03-12 Cost-Optimal Grouped-Query Attention for Long-Context LLMs Yingfa Chen et.al. 2503.09579 link
2025-03-12 Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks Lutfi Eren Erdogan et.al. 2503.09572 null
2025-03-13 Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models Qiguang Chen et.al. 2503.09567 null
2025-03-12 GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals Shuokang Huang et.al. 2503.09537 null
2025-03-13 Large Language Models for Multi-Facility Location Mechanism Design Nguyen Thach et.al. 2503.09533 null
2025-03-12 Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Bowen Jin et.al. 2503.09516 link
2025-03-12 ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning Ziyu Wan et.al. 2503.09501 null
2025-03-12 Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection Romain Thoreau et.al. 2503.09493 null
2025-03-12 DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction Junjie Zhou et.al. 2503.09491 null
2025-03-12 Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness Beier Zhu et.al. 2503.09487 null
2025-03-12 BAMBI: Developing Baby Language Models for Italian Alice Suozzi et.al. 2503.09481 null
2025-03-12 Explicit Learning and the LLM in Machine Translation Malik Marmonier et.al. 2503.09454 link
2025-03-12 How Well Does Your Tabular Generator Learn the Structure of Tabular Data? Xiangjian Jiang et.al. 2503.09453 link
2025-03-12 Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models Julian Spravil et.al. 2503.09443 null
2025-03-12 CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection Richard A. Dubniczky et.al. 2503.09433 null
2025-03-12 Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter Kechun Xu et.al. 2503.09423 null
2025-03-12 VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Kevin Qinghong Lin et.al. 2503.09402 link
2025-03-12 ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation Tobias Christian Nauen et.al. 2503.09399 link
2025-03-12 Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training Jiatong Xia et.al. 2503.09396 null
2025-03-12 Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with LLMs Jiani Huang et.al. 2503.09382 link
2025-03-12 Towards Graph Foundation Models: A Transferability Perspective Yuxiang Wang et.al. 2503.09363 null
2025-03-12 Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X Katharina Prasse et.al. 2503.09361 null
2025-03-12 RetSTA: An LLM-Based Approach for Standardizing Clinical Fundus Image Reports Jiushen Cai et.al. 2503.09358 null
2025-03-12 Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts Hongyu Chen et.al. 2503.09347 null
2025-03-12 NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model Yuzhi Lai et.al. 2503.09335 null
2025-03-12 CyberLLMInstruct: A New Dataset for Analysing Safety of Fine-Tuned LLMs Using Cyber Security Data Adel ElZemity et.al. 2503.09334 null
2025-03-12 A Survey on Enhancing Causal Reasoning Ability of Large Language Models Xin Li et.al. 2503.09326 null
2025-03-12 Revealing the Implicit Noise-based Imprint of Generative Models Xinghan Li et.al. 2503.09314 null
2025-03-12 xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation Elio Musacchio et.al. 2503.09313 null
2025-03-12 Adaptive political surveys and GPT-4: Tackling the cold start problem with simulated user interactions Fynn Bachmann et.al. 2503.09311 link
2025-03-12 Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference Mohammad Siavashi et.al. 2503.09304 null
2025-03-12 Prompt Inference Attack on Distributed Large Language Model Inference Frameworks Xinjian Luo et.al. 2503.09291 null
2025-03-12 Crowdsourced Homophily Ties Based Graph Annotation Via Large Language Model Yu Bu et.al. 2503.09281 null
2025-03-12 Fine-Tuning Large Language Models for Educational Support: Leveraging Gagne’s Nine Events of Instruction for Lesson Planning Linzhao Jia et.al. 2503.09276 null
2025-03-12 COLA: A Scalable Multi-Agent Framework For Windows UI Task Automation Di Zhao et.al. 2503.09263 link
2025-03-13 DeepInnovation AI: A Global Dataset Mapping the AI innovation from Academic Research to Industrial Patents Haixing Gong et.al. 2503.09257 null
2025-03-12 City Models: Past, Present and Future Prospects Helge Ritter et.al. 2503.09237 null
2025-03-12 LREF: A Novel LLM-based Relevance Framework for E-commerce Tian Tang et.al. 2503.09223 null
2025-03-12 Rethinking Prompt-based Debiasing in Large Language Models Xinyi Yang et.al. 2503.09219 null
2025-03-12 Why LLMs Cannot Think and How to Fix It Marius Jahrens et.al. 2503.09211 null
2025-03-12 Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model Ali Vosoughi et.al. 2503.09205 null
2025-03-12 Token Weighting for Long-Range Language Modeling Falko Helm et.al. 2503.09202 link
2025-03-12 WonderVerse: Extendable 3D Scene Generation with Video Generative Models Hao Feng et.al. 2503.09160 null
2025-03-12 FaVChat: Unlocking Fine-Grained Facail Video Understanding with Multimodal Large Language Models Fufangchen Zhao et.al. 2503.09158 null
2025-03-12 AdaptAI: A Personalized Solution to Sense Your Stress, Fix Your Mess, and Boost Productivity Rushiraj Gadhvi et.al. 2503.09150 link
2025-03-12 Generative Frame Sampler for Long Video Understanding Linli Yao et.al. 2503.09146 null
2025-03-12 Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding Haoyu Zhang et.al. 2503.09143 null
2025-03-12 AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks Jin Li et.al. 2503.09124 null
2025-03-12 Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training? Yuechen Xie et.al. 2503.09122 link
2025-03-12 GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models Yue Wang et.al. 2503.09117 null
2025-03-12 VaxGuard: A Multi-Generator, Multi-Type, and Multi-Role Dataset for Detecting LLM-Generated Vaccine Misinformation Syed Talal Ahmad et.al. 2503.09103 null
2025-03-12 Multi-Modal Foundation Models for Computational Pathology: A Survey Dong Li et.al. 2503.09091 null
2025-03-12 Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows Chengyue Gong et.al. 2503.09069 null
2025-03-12 Probing Latent Subspaces in LLM for AI Security: Identifying and Manipulating Adversarial States Xin Wei Chia et.al. 2503.09066 null
2025-03-12 Discovering Influential Neuron Path in Vision Transformers Yifan Wang et.al. 2503.09046 null
2025-03-12 ManeuverGPT Agentic Control for Safe Autonomous Stunt Maneuvers Shawn Azdam et.al. 2503.09035 link
2025-03-12 Teaching LLMs How to Learn with Contextual Fine-Tuning Younwoo Choi et.al. 2503.09032 null
2025-03-12 DAST: Difficulty-Aware Self-Training on Large Language Models Boyang Xue et.al. 2503.09029 link
2025-03-12 Aligning to What? Limits to RLHF Based Alignment Logan Barnhart et.al. 2503.09025 null
2025-03-13 Prompt Inversion Attack against Collaborative Inference of Large Language Models Wenjie Qu et.al. 2503.09022 null
2025-03-12 Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning Yuan Jiang et.al. 2503.09020 link
2025-03-12 Natural Humanoid Robot Locomotion with Generative Motion Prior Haodong Zhang et.al. 2503.09015 null
2025-03-12 Leveraging Retrieval Augmented Generative LLMs For Automated Metadata Description Generation to Enhance Data Catalogs Mayank Singh et.al. 2503.09003 null
2025-03-12 KNighter: Transforming Static Analysis with LLM-Synthesized Checkers Chenyuan Yang et.al. 2503.09002 link
2025-03-12 JBFuzz: Jailbreaking LLMs Efficiently and Effectively Using Fuzzing Vasudev Gohil et.al. 2503.08990 null
2025-03-12 I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data? Yuhang Liu et.al. 2503.08980 null
2025-03-12 Large Language Models-Aided Program Debloating Bo Lin et.al. 2503.08969 null
2025-03-11 Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation Yu Wang et.al. 2503.08963 null
2025-03-11 FP3: A 3D Foundation Policy for Robotic Manipulation Rujia Yang et.al. 2503.08950 null
2025-03-11 Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model Zilong Deng et.al. 2503.08934 null
2025-03-11 ARCHED: A Human-Centered Framework for Transparent, Responsible, and Collaborative AI-Assisted Instructional Design Hongming Li et.al. 2503.08931 null
2025-03-11 Enhancing Large Language Models for Hardware Verification: A Novel SystemVerilog Assertion Dataset Anand Menon et.al. 2503.08923 null
2025-03-11 Backtracking for Safety Bilgehan Sel et.al. 2503.08919 null
2025-03-11 Multilevel Generative Samplers for Investigating Critical Phenomena Ankur Singha et.al. 2503.08918 link
2025-03-11 Reconstruct Anything Model: a lightweight foundation model for computational imaging Matthieu Terris et.al. 2503.08915 null
2025-03-11 Interpreting the Repeated Token Phenomenon in Large Language Models Itay Yona et.al. 2503.08908 link
2025-03-11 A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation Forough Fazeliasl et.al. 2503.08902 null
2025-03-11 Seeing What’s Not There: Spurious Correlation in Multimodal LLMs Parsa Hosseini et.al. 2503.08884 null
2025-03-11 LLMs Know What to Drop: Self-Attention Guided KV Cache Eviction for Efficient Long-Context Inference Guangtao Wang et.al. 2503.08879 null
2025-03-11 Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs Rafael Carranza et.al. 2503.08857 null
2025-03-11 Contrastive Speaker-Aware Learning for Multi-party Dialogue Generation with LLMs Tianyu Sun et.al. 2503.08842 null
2025-03-11 ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness Ce Guo et.al. 2503.08823 null
2025-03-11 Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations Danielle Villa et.al. 2503.08815 null
2025-03-11 Robust Multi-Objective Controlled Decoding of Large Language Models Seongho Son et.al. 2503.08796 link
2025-03-11 Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs Ariba Khan et.al. 2503.08688 null
2025-03-11 OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models Jialv Zou et.al. 2503.08686 link
2025-03-11 Self-Taught Self-Correction for Small Language Models Viktor Moskvoretskii et.al. 2503.08681 null
2025-03-11 GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing Yuanhao Wang et.al. 2503.08678 null
2025-03-12 OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting Yongsheng Yu et.al. 2503.08677 null
2025-03-11 Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields Tobias Kreiman et.al. 2503.08674 null
2025-03-11 REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder Yitian Zhang et.al. 2503.08665 null
2025-03-11 Generating Robot Constitutions & Benchmarks for Semantic Safety Pierre Sermanet et.al. 2503.08663 null
2025-03-11 Exploring the Word Sense Disambiguation Capabilities of Large Language Models Pierpaolo Basile et.al. 2503.08662 null
2025-03-11 YuE: Scaling Open Foundation Models for Long-Form Music Generation Ruibin Yuan et.al. 2503.08638 link
2025-03-11 LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization Xianfeng Wu et.al. 2503.08619 link
2025-03-11 EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments Dongping Li et.al. 2503.08604 null
2025-03-11 NSF-SciFy: Mining the NSF Awards Database for Scientific Claims Delip Rao et.al. 2503.08600 null
2025-03-11 3D Point Cloud Generation via Autoregressive Up-sampling Ziqiao Meng et.al. 2503.08594 null
2025-03-11 Proc4Gem: Foundation models for physical agency through procedural generation Yixin Lin et.al. 2503.08593 null
2025-03-11 HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding Shehreen Azad et.al. 2503.08585 null
2025-03-11 RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding Xichen Tan et.al. 2503.08576 null
2025-03-11 DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process Minjun Zhu et.al. 2503.08569 null
2025-03-11 Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies Chen Xu et.al. 2503.08558 null
2025-03-11 Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs Wanyong Feng et.al. 2503.08551 null
2025-03-11 Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling Craig Messner et.al. 2503.08550 null
2025-03-11 Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation Xian Gao et.al. 2503.08549 null
2025-03-11 DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering Sher Badshah et.al. 2503.08542 null
2025-03-11 Mellow: a small audio language model for reasoning Soham Deshmukh et.al. 2503.08540 link
2025-03-11 Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation Andres M Bran et.al. 2503.08537 link
2025-03-11 ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems Siddhant Arora et.al. 2503.08533 null
2025-03-11 GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training Tong Wei et.al. 2503.08525 null
2025-03-11 Position-Aware Depth Decay Decoding ( $D^3$ ): Boosting Large Language Model Inference Efficiency Siqi Fan et.al. 2503.08524 null
2025-03-11 High-Quality 3D Head Reconstruction from Any Single Portrait Image Jianfu Zhang et.al. 2503.08516 null
2025-03-11 LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning Weijie Zhou et.al. 2503.08508 link
2025-03-11 Referring to Any Person Qing Jiang et.al. 2503.08507 link
2025-03-11 ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews Xian Gao et.al. 2503.08506 null
2025-03-11 Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models Han Cao et.al. 2503.08495 null
2025-03-11 TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting Fengyi Zhang et.al. 2503.08485 null
2025-03-11 Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum Shengpeng Xiao et.al. 2503.08484 null
2025-03-11 PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability Weijie Zhou et.al. 2503.08481 link
2025-03-11 FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework Jianian Zhu et.al. 2503.08461 null
2025-03-11 KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents Hsin-Ling Hsu et.al. 2503.08452 null
2025-03-11 LLM-Pack: Intuitive Grocery Handling for Logistics Applications Yannik Blei et.al. 2503.08445 null
2025-03-11 TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems Feiyang Wu et.al. 2503.08415 null
2025-03-11 Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information Elizaveta Kuznetsova et.al. 2503.08404 null
2025-03-11 OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning Jiawei Zhou et.al. 2503.08398 null
2025-03-11 Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens Qingsong Xie et.al. 2503.08377 null
2025-03-11 nnInteractive: Redefining 3D Promptable Segmentation Fabian Isensee et.al. 2503.08373 link
2025-03-11 MetaFold: Language-Guided Multi-Category Garment Folding Framework via Trajectory Generation and Foundation Model Haonan Chen et.al. 2503.08372 null
2025-03-11 Robust Latent Matters: Boosting Image Generation with Sampling Error Kai Qiu et.al. 2503.08354 link
2025-03-12 Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs Chongjun Tu et.al. 2503.08342 null
2025-03-11 Trinity: A Modular Humanoid Robot AI System Jingkai Sun et.al. 2503.08338 null
2025-03-11 Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos Soumya Shamarao Jahagirdar et.al. 2503.08335 null
2025-03-11 KiteRunner: Language-Driven Cooperative Local-Global Navigation Policy with UAV Mapping in Outdoor Environments Shibo Huang et.al. 2503.08330 null
2025-03-11 Towards Scalable and Cross-Lingual Specialist Language Models for Oncology Morteza Rohanian et.al. 2503.08323 null
2025-03-11 Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference Pol G. Recasens et.al. 2503.08311 null
2025-03-11 Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework Zhuo Zhi et.al. 2503.08308 null
2025-03-11 General-Purpose Aerial Intelligent Agents Empowered by Large Language Models Ji Zhao et.al. 2503.08302 null
2025-03-12 Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study Xian-Rong Zhang et.al. 2503.08301 null
2025-03-11 Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges Xiaoxiao Liu et.al. 2503.08292 null
2025-03-11 PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net Jun Yin et.al. 2503.08276 null
2025-03-11 LangTime: A Language-Guided Unified Model for Time Series Forecasting with Proximal Policy Optimization Wenzhe Niu et.al. 2503.08271 null
2025-03-11 DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness Yiming Zhong et.al. 2503.08257 link
2025-03-11 Aligning Text to Image in Diffusion Models is Easier Than You Think Jaa-Yeon Lee et.al. 2503.08250 null
2025-03-11 Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices Tao Shen et.al. 2503.08223 null
2025-03-11 EgoBlind: Towards Egocentric Visual Assistance for the Blind People Junbin Xiao et.al. 2503.08221 null
2025-03-11 S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction Guangting Zheng et.al. 2503.08217 null
2025-03-11 To Use or Not to Use a Universal Force Field Denan Li et.al. 2503.08207 null
2025-03-11 Route Sparse Autoencoder to Interpret Large Language Models Wei Shi et.al. 2503.08200 null
2025-03-11 A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models Miao Zhang et.al. 2503.08199 null
2025-03-11 Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation Wenlong Meng et.al. 2503.08195 link
2025-03-11 Automating Violence Detection and Categorization from Ancient Texts Alhassan Abdelhalim et.al. 2503.08192 null
2025-03-11 RigoChat 2: an adapted language model to Spanish using a bounded dataset and reduced hardware Gonzalo Santamaría Gómez et.al. 2503.08188 null
2025-03-11 Mutation Testing via Iterative Large Language Model-Driven Scientific Debugging Philipp Straubinger et.al. 2503.08182 null
2025-03-12 ProtTeX: Structure-In-Context Reasoning and Editing of Proteins with Large Language Models Zicheng Ma et.al. 2503.08179 null
2025-03-11 Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study Veronica Bot et.al. 2503.08174 null
2025-03-11 Towards All-in-One Medical Image Re-Identification Yuan Tian et.al. 2503.08173 link
2025-03-11 TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement Miao Zhang et.al. 2503.08168 null
2025-03-11 FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback Kangan Qian et.al. 2503.08162 null
2025-03-12 OASIS: Order-Augmented Strategy for Improved Code Search Zuchen Gao et.al. 2503.08161 null
2025-03-11 Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model Yufan Chen et.al. 2503.08156 null
2025-03-11 WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation Jing Wang et.al. 2503.08153 null
2025-03-11 Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features Hanbyul Lee et.al. 2503.08148 null
2025-03-11 FilmComposer: LLM-Driven Music Production for Silent Film Clips Zhifeng Xie et.al. 2503.08147 null
2025-03-11 Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method Fei Wang et.al. 2503.08144 null
2025-03-11 FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems Jeongsol Kim et.al. 2503.08136 null
2025-03-11 Large Scale Multi-Task Bayesian Optimization with Large Language Models Yimeng Zeng et.al. 2503.08131 null
2025-03-11 LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence Renxuan Tan et.al. 2503.08123 null
2025-03-11 Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments Soonwoo Kwon et.al. 2503.08122 null
2025-03-11 Uni $\textbf{F}^2$ ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Junzhe Li et.al. 2503.08120 null
2025-03-11 Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models Weiguo Gao et.al. 2503.08117 null
2025-03-11 AI-native Memory 2.0: Second Me Jiale Wei et.al. 2503.08102 null
2025-03-12 PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models Kyeongkook Seo et.al. 2503.08085 link
2025-03-11 Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation Fangyuan Wang et.al. 2503.08084 null
2025-03-11 Seeing Beyond Haze: Generative Nighttime Image Dehazing Beibei Lin et.al. 2503.08073 null
2025-03-11 Flow Matching for Discrete Systems: Efficient Free Energy Sampling Across Lattice Sizes and Temperatures Ping Tuo et.al. 2503.08063 null
2025-03-11 Odysseus Navigates the Sirens’ Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation Wen Luo et.al. 2503.08057 null
2025-03-11 Counterfactual Language Reasoning for Explainable Recommendation Systems Guanrong Li et.al. 2503.08051 null
2025-03-11 SphOR: A Representation Learning Perspective on Open-set Recognition for Identifying Unknown Classes in Deep Learning Models Nadarasar Bahavan et.al. 2503.08049 null
2025-03-11 LongProLIP: A Probabilistic Vision-Language Model with Long Context Text Sanghyuk Chun et.al. 2503.08048 link
2025-03-11 Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection Ying Fu Lim et.al. 2503.08045 null
2025-03-11 ObjectMover: Generative Object Movement with Video Prior Xin Yu et.al. 2503.08037 null
2025-03-11 Learning to Search Effective Example Sequences for In-Context Learning Xiang Gao et.al. 2503.08030 null
2025-03-11 In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents Zhen Tan et.al. 2503.08026 null
2025-03-10 V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation Guiwei Zhang et.al. 2503.07493 link
2025-03-10 LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? Bangyan Li et.al. 2503.07487 null
2025-03-10 Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction Zongzheng Zhang et.al. 2503.07485 link
2025-03-10 GenAIReading: Augmenting Human Cognition with Interactive Digital Textbooks Using Large Language Models and Image Generation Models Ryugo Morita et.al. 2503.07463 null
2025-03-10 MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Xiangru Tang et.al. 2503.07459 link
2025-03-10 LLMs syntactically adapt their language use to their conversational partner Florian Kandra et.al. 2503.07457 null
2025-03-10 Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration Dylan J. Foster et.al. 2503.07453 null
2025-03-10 From Idea to Implementation: Evaluating the Influence of Large Language Models in Software Development – An Opinion Paper Sargam Yadav et.al. 2503.07450 null
2025-03-10 From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics Jaewook Lee et.al. 2503.07429 null
2025-03-10 RePO: ReLU-based Preference Optimization Junkang Wu et.al. 2503.07426 link
2025-03-10 REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Yan Tai et.al. 2503.07413 link
2025-03-10 Towards Safe Robot Foundation Models Maximilian Tölle et.al. 2503.07404 null
2025-03-10 Keeping Representation Similarity in Finetuning for Medical Image Analysis Wenqiang Zu et.al. 2503.07399 null
2025-03-10 Revisiting Noise in Natural Language Processing for Computational Social Science Nadav Borenstein et.al. 2503.07395 null
2025-03-10 Process-Supervised LLM Recommenders via Flow-guided Tuning Chongming Gao et.al. 2503.07377 link
2025-03-10 Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future Yannick Oswald et.al. 2503.07364 null
2025-03-10 RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing Yiqing Xie et.al. 2503.07358 link
2025-03-10 Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment Xing Xie et.al. 2503.07334 null
2025-03-10 Assessing the Macro and Micro Effects of Random Seeds on Fine-Tuning Large Language Models Hao Zhou et.al. 2503.07329 null
2025-03-10 Dynamic Path Navigation for Motion Agents with LLM Reasoning Yubo Zhao et.al. 2503.07323 null
2025-03-10 Experimental Exploration: Investigating Cooperative Interaction Behavior Between Humans and Large Language Model Agents Guanxuan Jiang et.al. 2503.07320 null
2025-03-10 Self-Corrective Task Planning by Inverse Prompting with Large Language Models Jiho Lee et.al. 2503.07317 null
2025-03-10 Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies Luyi Jiang et.al. 2503.07306 null
2025-03-10 A Graph-based Verification Framework for Fact-Checking Yani Huang et.al. 2503.07282 null
2025-03-10 COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition Baiyu Chen et.al. 2503.07259 link
2025-03-10 CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting Haicheng Liao et.al. 2503.07234 null
2025-03-10 Control Flow-Augmented Decompiler based on Large Language Model Peipei Liu et.al. 2503.07215 null
2025-03-10 Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion Mona Sheikh Zeinoddin et.al. 2503.07204 null
2025-03-10 A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding Bingchen Liu et.al. 2503.07202 null
2025-03-10 Effective and Efficient Masked Image Generation Models Zebin You et.al. 2503.07197 link
2025-03-10 Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems Lia Shahnazaryan et.al. 2503.07195 null
2025-03-10 Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms Jiaming Song et.al. 2503.07154 null
2025-03-10 MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark Shengkun Ma et.al. 2503.07144 link
2025-03-10 Application of Multiple Chain-of-Thought in Contrastive Reasoning for Implicit Sentiment Analysis Liwei Yang et.al. 2503.07140 null
2025-03-10 VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation Hanzhi Chen et.al. 2503.07135 null
2025-03-10 Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation Sihao Lin et.al. 2503.07125 null
2025-03-10 Quantizing Large Language Models for Code Generation: A Differentiated Replication Alessandro Giagnorio et.al. 2503.07103 null
2025-03-10 A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images Xiaoyi Liang et.al. 2503.07094 null
2025-03-10 Linguistic Knowledge Transfer Learning for Speech Enhancement Kuo-Hsuan Hung et.al. 2503.07078 null
2025-03-10 DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs Jongwoo Ko et.al. 2503.07067 null
2025-03-10 Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning Huilin Deng et.al. 2503.07065 link
2025-03-10 TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation Victor Shea-Jay Huang et.al. 2503.07050 null
2025-03-10 Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion Yongle Zhang et.al. 2503.07047 null
2025-03-10 Conditional Generative Modeling for Amorphous Multi-Element Materials Honglin Li et.al. 2503.07043 link
2025-03-10 TCM-3CEval: A Triaxial Benchmark for Assessing Responses from Large Language Models in Traditional Chinese Medicine Tianai Huang et.al. 2503.07041 null
2025-03-10 Bot Wars Evolved: Orchestrating Competing LLMs in a Counterstrike Against Phone Scams Nardine Basta et.al. 2503.07036 null
2025-03-10 Multimodal Human-AI Synergy for Medical Imaging Quality Control: A Hybrid Intelligence Framework with Adaptive Dataset Curation and Closed-Loop Evaluation Zhi Qin et.al. 2503.07032 null
2025-03-10 Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense Yuting Hu et.al. 2503.07020 null
2025-03-10 Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning Xintong Li et.al. 2503.07018 null
2025-03-10 HELM: Human-Preferred Exploration with Language Models Shuhao Liao et.al. 2503.07006 null
2025-03-10 Large Language Models Often Say One Thing and Do Another Ruoxi Xu et.al. 2503.07003 link
2025-03-10 Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning Jiazheng Liu et.al. 2503.07002 null
2025-03-10 Utilizing Jailbreak Probability to Attack and Safeguard Multimodal LLMs Wenzhuo Xu et.al. 2503.06989 null
2025-03-10 Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations Jiho Jin et.al. 2503.06987 null
2025-03-10 Learning Decision Trees as Amortized Structure Inference Mohammed Mahfoud et.al. 2503.06985 link
2025-03-10 Exploring Multimodal Perception in Large Language Models Through Perceptual Strength Ratings Jonghyun Lee et.al. 2503.06980 null
2025-03-10 Lightweight Multimodal Artificial Intelligence Framework for Maritime Multi-Scene Recognition Xinyu Xi et.al. 2503.06978 null
2025-03-10 Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation Pengchen Liang et.al. 2503.06976 null
2025-03-10 ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA Zhao Xinjie et.al. 2503.06951 null
2025-03-10 CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation Runqi Sui et.al. 2503.06950 null
2025-03-11 LexPro-1.0 Technical Report Haotian Chen et.al. 2503.06949 link
2025-03-10 Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection Wentao Wu et.al. 2503.06948 null
2025-03-10 Handle Object Navigation as Weighted Traveling Repairman Problem Ruimeng Liu et.al. 2503.06937 link
2025-03-10 Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping Ning Ding et.al. 2503.06930 null
2025-03-10 Effect of Selection Format on LLM Performance Yuchen Han et.al. 2503.06926 null
2025-03-10 Combinatorial Optimization via LLM-driven Iterated Fine-tuning Pranjal Awasthi et.al. 2503.06917 null
2025-03-10 Beyond Code Generation: LLM-supported Exploration of the Program Design Space J. D. Zamfirescu-Pereira et.al. 2503.06911 null
2025-03-10 A Query Optimization Method Utilizing Large Language Models Zhiming Yao et.al. 2503.06902 null
2025-03-10 DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation Xiaoliang Ju et.al. 2503.06900 null
2025-03-10 SafePlan: Leveraging Formal Logic and Chain-of-Thought Reasoning for Enhanced Safety in LLM-based Robotic Task Planning Ike Obi et.al. 2503.06892 null
2025-03-10 ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks Yan Yang et.al. 2503.06885 null
2025-03-10 Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help Yuefan Cao et.al. 2503.06884 null
2025-03-10 ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration Mengting Ai et.al. 2503.06881 link
2025-03-10 Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception Wanjing Huang et.al. 2503.06866 link
2025-03-10 FIGLUT: An Energy-Efficient Accelerator Design for FP-INT GEMM Using Look-Up Tables Gunho Park et.al. 2503.06862 null
2025-03-10 Enhanced Multi-Tuple Extraction for Alloys: Integrating Pointer Networks and Augmented Attention Mengzhe Hei et.al. 2503.06861 null
2025-03-10 MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification Xiangyan Qu et.al. 2503.06847 null
2025-03-10 GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-Thought Sungsik Kim et.al. 2503.06832 null
2025-03-10 Towards a Multimodal MRI-Based Foundation Model for Multi-Level Feature Exploration in Segmentation, Molecular Subtyping, and Grading of Glioma Somayeh Farahani et.al. 2503.06828 null
2025-03-10 eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference Suraiya Tairin et.al. 2503.06823 null
2025-03-10 HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors Siyu Li et.al. 2503.06821 link
2025-03-10 Towards Fine-Grained Video Question Answering Wei Dai et.al. 2503.06820 null
2025-03-09 Privacy Auditing of Large Language Models Ashwinee Panda et.al. 2503.06808 null
2025-03-09 VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation Hritik Bansal et.al. 2503.06800 null
2025-03-09 Multimodal AI-driven Biomarker for Early Detection of Cancer Cachexia Sabeen Ahmed et.al. 2503.06797 null
2025-03-09 RoboDesign1M: A Large-scale Dataset for Robot Design Understanding Tri Le et.al. 2503.06796 null
2025-03-09 AutoMisty: A Multi-Agent LLM Framework for Automated Code Generation in the Misty Social Robot Xiao Wang et.al. 2503.06791 null
2025-03-09 Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models Tianyi Zhang et.al. 2503.06784 null
2025-03-09 Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting Yufei Li et.al. 2503.06781 null
2025-03-09 Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators Feng Gu et.al. 2503.06778 null
2025-03-09 Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints Max Buckley et.al. 2503.06751 null
2025-03-09 Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models Wenxuan Huang et.al. 2503.06749 link
2025-03-09 CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving Rui Song et.al. 2503.06744 null
2025-03-09 Delusions of Large Language Models Hongshen Xu et.al. 2503.06709 null
2025-03-09 Alignment for Efficient Tool Calling of Large Language Models Hongshen Xu et.al. 2503.06708 null
2025-03-09 PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts Ming Zhang et.al. 2503.06706 link
2025-03-09 InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models Yuchen Yan et.al. 2503.06692 null
2025-03-09 DependEval: Benchmarking LLMs for Repository Dependency Understanding Junjia Du et.al. 2503.06689 link
2025-03-09 UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion Gongbo Zhang et.al. 2503.06687 null
2025-03-09 FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation Wei Li et.al. 2503.06680 null
2025-03-09 Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets Tommaso Bendinelli et.al. 2503.06664 null
2025-03-07 Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints Parameswaran Kamalaruban et.al. 2503.05684 null
2025-03-07 Understanding the Limits of Lifelong Knowledge Editing in LLMs Lukas Thede et.al. 2503.05683 null
2025-03-07 AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data Zengqun Zhao et.al. 2503.05665 link
2025-03-07 A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval Yu Zhang et.al. 2503.05659 link
2025-03-07 A functional approach for curve alignment and shape analysis Issam-Ali Moindjié et.al. 2503.05632 null
2025-03-07 Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings Xuanqing Liu et.al. 2503.05620 null
2025-03-07 A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models Dong Shu et.al. 2503.05613 null
2025-03-07 From Theory to Application: A Practical Introduction to Neural Operators in Scientific Computing Prashant K. Jha et.al. 2503.05598 link
2025-03-07 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Huatong Song et.al. 2503.05592 null
2025-03-07 Evaluating open-source Large Language Models for automated fact-checking Nicolo’ Fontana et.al. 2503.05565 null
2025-03-07 Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance Bryan Etzine et.al. 2503.05551 null
2025-03-07 Leveraging Approximate Caching for Faster Retrieval-Augmented Generation Shai Bergman et.al. 2503.05530 null
2025-03-07 PoSSUM: A Protocol for Surveying Social-media Users with Multimodal LLMs Roberto Cerina et.al. 2503.05529 null
2025-03-07 Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations Eren Erogullari et.al. 2503.05522 link
2025-03-07 Cognitive Bias Detection Using Advanced Prompt Engineering Frederic Lemieux et.al. 2503.05516 null
2025-03-07 Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering Yusong Ke et.al. 2503.05505 null
2025-03-07 Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders Qijiong Liu et.al. 2503.05493 null
2025-03-07 Statistical Deficiency for Task Inclusion Estimation Loïc Fosse et.al. 2503.05491 null
2025-03-07 Maximum Hallucination Standards for Domain-Specific Large Language Models Tingmingke Lu et.al. 2503.05481 null
2025-03-07 The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence Noah Mamie et.al. 2503.05473 null
2025-03-07 De Novo Design of Protein-Binding Peptides by Quantum Computing Lars Meuser et.al. 2503.05458 null
2025-03-07 LLM-based Iterative Approach to Metamodeling in Automotive Nenad Petrovic et.al. 2503.05449 null
2025-03-07 Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts Weigao Sun et.al. 2503.05447 link
2025-03-07 Are Your LLM-based Text-to-SQL Models Secure? Exploring SQL Injection via Backdoor Attacks Meiyu Lin et.al. 2503.05445 null
2025-03-07 Static Program Analysis Guided LLM Based Unit Test Generation Sujoy Roychowdhury et.al. 2503.05394 null
2025-03-07 Ontology Generation using Large Language Models Anna Sofia Lippolis et.al. 2503.05388 link
2025-03-07 VLMs Play StarCraft II: A Benchmark and Multimodal Decision Method Weiyu Ma et.al. 2503.05383 link
2025-03-07 R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning Jiaxing Zhao et.al. 2503.05379 null
2025-03-07 Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs Zara Siddique et.al. 2503.05371 null
2025-03-07 Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter Weixiang Zhao et.al. 2503.05362 null
2025-03-07 GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation Zhenxuan Zhang et.al. 2503.05347 link
2025-03-07 AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications Leming Shen et.al. 2503.05346 link
2025-03-07 PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations? Martin Spitznagel et.al. 2503.05333 null
2025-03-07 Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models Anar Yeginbergen et.al. 2503.05328 null
2025-03-07 Routing for Large ML Models Ofir Cohen et.al. 2503.05324 link
2025-03-07 Riemannian Metric Learning: Closer to You than You Imagine Samuel Gruffaz et.al. 2503.05321 null
2025-03-07 Disentangling Task Interference within Neurons: Model Merging in Alignment with Neuronal Mechanisms Zitao Fang et.al. 2503.05320 null
2025-03-07 Escaping Plato’s Cave: Towards the Alignment of 3D and Text Latent Spaces Souhail Hadgi et.al. 2503.05283 null
2025-03-07 Similarity-Based Domain Adaptation with LLMs Jie He et.al. 2503.05281 null
2025-03-07 Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching Bowen Pang et.al. 2503.05248 link
2025-03-07 L-FUSION: Laplacian Fetal Ultrasound Segmentation & Uncertainty Estimation Johanna P. Müller et.al. 2503.05245 null
2025-03-07 WritingBench: A Comprehensive Benchmark for Generative Writing Yuning Wu et.al. 2503.05244 link
2025-03-07 MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio Xuenan Xu et.al. 2503.05242 link
2025-03-07 Unveiling Biases in AI: ChatGPT’s Political Economy Perspectives and Human Comparisons Leonardo Becchetti et.al. 2503.05234 null
2025-03-07 Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction Shuo Jiang et.al. 2503.05231 null
2025-03-07 ARbiter: Generating Dialogue Options and Communication Support in Augmented Reality Julián Méndez et.al. 2503.05220 null
2025-03-07 Knowledge Updating? No More Model Editing! Just Selective Contextual Reasoning Guoxiu He et.al. 2503.05212 null
2025-03-07 Path Pooling: Train-Free Structure Enhancement for Efficient Knowledge Graph Retrieval-Augmented Generation Hairu Wang et.al. 2503.05203 null
2025-03-07 ORANSight-2.0: Foundational LLMs for O-RAN Pranshav Gajjar et.al. 2503.05200 null
2025-03-07 Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning Mufan Xu et.al. 2503.05193 null
2025-03-07 Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions Chan hur et.al. 2503.05186 null
2025-03-07 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching Simon A. Aytes et.al. 2503.05179 null
2025-03-07 Development and Enhancement of Text-to-Image Diffusion Models Rajdeep Roshan Sahu et.al. 2503.05149 null
2025-03-07 RocketEval: Efficient Automated LLM Evaluation via Grading Checklist Tianjun Wei et.al. 2503.05142 null
2025-03-07 Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs Ling Team et.al. 2503.05139 null
2025-03-07 R1-Zero’s “Aha Moment” in Visual Reasoning on a 2B Non-SFT Model Hengguang Zhou et.al. 2503.05132 link
2025-03-07 Dilu: Enabling GPU Resourcing-on-Demand for Serverless DL Serving via Introspective Elasticity Cunchi Lv et.al. 2503.05130 null
2025-03-07 Can Large Language Models Grasp Concepts in Visual Content? A Case Study on YouTube Shorts about Depression Jiaying “Lizzy” Liu et.al. 2503.05109 null
2025-03-07 AutoTestForge: A Multidimensional Automated Testing Framework for Natural Language Processing Models Hengrui Xing et.al. 2503.05102 null
2025-03-07 SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding Kaiyu Huang et.al. 2503.05096 null
2025-03-07 S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information Feng Jiang et.al. 2503.05085 null
2025-03-07 On a Connection Between Imitation Learning and RLHF Teng Xiao et.al. 2503.05079 null
2025-03-07 PromptPex: Automatic Test Generation for Language Model Prompts Reshabh K Sharma et.al. 2503.05070 link
2025-03-07 Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts Shwai He et.al. 2503.05066 null
2025-03-07 No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding Michael Krumdick et.al. 2503.05061 null
2025-03-06 Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets Preetam Prabhu Srikar Dammu et.al. 2503.05049 null
2025-03-06 Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference Grace Proebsting et.al. 2503.05047 null
2025-03-06 Continual Pre-training of MoEs: How robust is your router? Benjamin Thérien et.al. 2503.05029 null
2025-03-06 ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids Hannes Stark et.al. 2503.05025 link
2025-03-06 Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety Yuyou Zhang et.al. 2503.05021 null
2025-03-06 LLMs’ Reshaping of People, Processes, Products, and Society in Software Development: A Comprehensive Exploration with Early Adopters Benyamin Tabarsi et.al. 2503.05012 null
2025-03-06 Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation Bryan Li et.al. 2503.05010 null
2025-03-06 Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models Benyamin Jamialahmadi et.al. 2503.05005 link
2025-03-06 Wanda++: Pruning Large Language Models via Regional Gradients Yifan Yang et.al. 2503.04992 null
2025-03-06 DP-GTR: Differentially Private Prompt Protection via Group Text Rewriting Mingchen Li et.al. 2503.04990 null
2025-03-06 Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review Boris Malashenko et.al. 2503.04983 null
2025-03-06 LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression Souvik Kundu et.al. 2503.04982 null
2025-03-06 Quantifying the Relevance of Youth Research Cited in the US Policy Documents Miftahul Jannat Mokarrama et.al. 2503.04977 link
2025-03-06 Energy-Weighted Flow Matching for Offline Reinforcement Learning Shiyuan Zhang et.al. 2503.04975 null
2025-03-06 Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning Giulio Corallo et.al. 2503.04973 null
2025-03-06 Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge Songyuan Li et.al. 2503.04971 null
2025-03-06 DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL Haoyuan Ma et.al. 2503.04959 null
2025-03-06 Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems Jooyoung Lee et.al. 2503.04945 null
2025-03-06 HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models Yao Ge et.al. 2503.04930 null
2025-03-06 Metadata-free Georegistration of Ground and Airborne Imagery Adam Bredvik et.al. 2503.04927 null
2025-03-06 FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement Ian Huang et.al. 2503.04919 null
2025-03-06 L $^2$ M: Mutual Information Scaling Law for Long-Context Language Modeling Zhuo Chen et.al. 2503.04725 link
2025-03-07 Shifting Long-Context LLMs Research from Input to Output Yuhao Wu et.al. 2503.04723 null
2025-03-06 Enough Coin Flips Can Make LLMs Act Bayesian Ritwik Gupta et.al. 2503.04722 null
2025-03-06 Predictable Scale: Part I – Optimal Hyperparameter Scaling Law in Large Language Model Pretraining Houyi Li et.al. 2503.04715 null
2025-03-07 Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size Alireza Behtash et.al. 2503.04704 null
2025-03-06 UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets Wenyu Wang et.al. 2503.04693 null
2025-03-06 Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases Pengcheng Qiu et.al. 2503.04691 null
2025-03-06 LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue Sangyeop Kim et.al. 2503.04675 null
2025-03-06 What Are You Doing? A Closer Look at Controllable Human Video Generation Emanuele Bugliarello et.al. 2503.04666 null
2025-03-06 CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models Shengzhuang Chen et.al. 2503.04655 null
2025-03-06 Transferable Foundation Models for Geometric Tasks on Point Cloud Representations: Geometric Neural Operators Blaine Quackenbush et.al. 2503.04649 link
2025-03-06 Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment Wen Yang et.al. 2503.04647 null
2025-03-06 Simulating the Real World: A Unified Survey of Multimodal Generative Models Yuqi Hu et.al. 2503.04641 link
2025-03-06 Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation Aishik Konwer et.al. 2503.04639 null
2025-03-06 Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking Yijie Xu et.al. 2503.04636 null
2025-03-06 3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs Artin Saberpour Abadian et.al. 2503.04635 null
2025-03-06 Better Process Supervision with Bi-directional Rewarding Signals Wenxiang Chen et.al. 2503.04618 null
2025-03-06 Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning Mohammad Amin Ghanizadeh et.al. 2503.04611 null
2025-03-06 HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization Zhijian Zhuo et.al. 2503.04598 null
2025-03-06 The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy Xinyi Hou et.al. 2503.04596 null
2025-03-06 Learning Generalizable Language-Conditioned Cloth Manipulation from Long Demonstrations Hanyi Zhao et.al. 2503.04557 null
2025-03-06 Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation Armel Zebaze et.al. 2503.04554 null
2025-03-06 Benchmarking Reasoning Robustness in Large Language Models Tong Yu et.al. 2503.04550 null
2025-03-06 Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model Wenke Huang et.al. 2503.04543 null
2025-03-06 SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning Chen Li et.al. 2503.04530 null
2025-03-06 Multi-modal Summarization in Model-Based Engineering: Automotive Software Development Case Study Nenad Petrovic et.al. 2503.04506 null
2025-03-06 Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training Adrian Chang et.al. 2503.04496 null
2025-03-06 Large Language Models in Bioinformatics: A Survey Zhenyu Wang et.al. 2503.04490 null
2025-03-06 InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference Tianyu Cui et.al. 2503.04483 null
2025-03-06 ToolFuzz – Automated Agent Tool Testing Ivan Milev et.al. 2503.04479 null
2025-03-06 Semantic Alignment of Unimodal Medical Text and Vision Representations Maxime Di Folco et.al. 2503.04478 null
2025-03-06 Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges Francisco Eiras et.al. 2503.04474 null
2025-03-06 Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification Van Bach Nguyen et.al. 2503.04463 null
2025-03-06 TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction Chao Wang et.al. 2503.04457 null
2025-03-06 Activation Space Interventions Can Be Transferred Between Large Language Models Narmeen Oozeer et.al. 2503.04429 null
2025-03-06 AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services Xiaoqi Wang et.al. 2503.04418 null
2025-03-06 Can Large Language Models Predict Antimicrobial Resistance Gene? Hyunwoo Yoo et.al. 2503.04413 null
2025-03-06 Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search Kou Misaki et.al. 2503.04412 null
2025-03-06 Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling Yan Li et.al. 2503.04398 null
2025-03-06 TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models Xinyi He et.al. 2503.04396 null
2025-03-06 Shaping Shared Languages: Human and Large Language Models’ Inductive Biases in Emergent Communication Tom Kouwenhoven et.al. 2503.04395 null
2025-03-06 AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management Junyuan Mao et.al. 2503.04392 null
2025-03-06 TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge Cheng-Han Chiang et.al. 2503.04381 null
2025-03-06 Lost in Literalism: How Supervised Training Shapes Translationese in LLMs Yafu Li et.al. 2503.04369 null
2025-03-06 A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery Yiheng Zhu et.al. 2503.04362 null
2025-03-06 LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding Jia Li et.al. 2503.04359 null
2025-03-06 scDD: Latent Codes Based scRNA-seq Dataset Distillation with Foundation Model Knowledge Zhen Yu et.al. 2503.04357 null
2025-03-06 Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling Zhenghua Wang et.al. 2503.04355 null
2025-03-06 Large Language Models for Zero-shot Inference of Causal Structures in Biology Izzy Newsham et.al. 2503.04347 null
2025-03-06 TRANSIT your events into a new mass: Fast background interpolation for weakly-supervised anomaly searches Ivan Oleksiyuk et.al. 2503.04342 link
2025-03-06 In-depth Analysis of Graph-based RAG in a Unified Framework Yingli Zhou et.al. 2503.04338 null
2025-03-06 The Challenge of Identifying the Origin of Black-Box Large Language Models Ziqing Yang et.al. 2503.04332 null
2025-03-06 Solving Word-Sense Disambiguation and Word-Sense Induction with Dictionary Examples Tadej Škvorc et.al. 2503.04328 null
2025-03-06 Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation Christian Rondanini et.al. 2503.04302 null
2025-03-06 Mapping AI Benchmark Data to Quantitative Risk Estimates Through Expert Elicitation Malcolm Murray et.al. 2503.04299 null
2025-03-06 MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs Tianyang Zhang et.al. 2503.04291 null
2025-03-06 How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale Jeanette Falk et.al. 2503.04290 null
2025-03-06 Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models Niccolò Turcato et.al. 2503.04280 null
2025-03-06 VirtualXAI: A User-Centric Framework for Explainability Assessment Leveraging GPT-Generated Personas Georgios Makridis et.al. 2503.04261 null
2025-03-06 Knowledge Retention for Continual Model-Based Reinforcement Learning Yixiang Sun et.al. 2503.04256 null
2025-03-06 ADOR: A Design Exploration Framework for LLM Serving with Enhanced Latency and Throughput Junsoo Kim et.al. 2503.04253 null
2025-03-06 An Egocentric Vision-Language Model based Portable Real-time Smart Assistant Yifei Huang et.al. 2503.04250 link
2025-03-06 How to Mitigate Overfitting in Weak-to-strong Generalization? Junhao Shi et.al. 2503.04249 null
2025-03-06 ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions Julian Aron Prenner et.al. 2503.04241 link
2025-03-06 DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models Ruizhe Chen et.al. 2503.04240 null
2025-03-06 SemaSK: Answering Semantics-aware Spatial Keyword Queries with Large Language Models Zesong Zhang et.al. 2503.04234 null
2025-03-06 FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion Ziyi Yang et.al. 2503.04222 null
2025-03-06 Knowledge-Decoupled Synergetic Learning: An MLLM based Collaborative Approach to Few-shot Multimodal Dialogue Intention Recognition Bin Chen et.al. 2503.04201 null
2025-03-06 MASTER: Multimodal Segmentation with Text Prompts Fuyang Liu et.al. 2503.04199 null
2025-03-06 Measuring temporal effects of agent knowledge by date-controlled tool use R. Patrick Xian et.al. 2503.04188 null
2025-03-06 TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records Hejie Cui et.al. 2503.04176 null
2025-03-06 DuCos: Duality Constrained Depth Super-Resolution via Foundation Model Zhiqiang Yan et.al. 2503.04171 null
2025-03-06 CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation Yuki Tanaka et.al. 2503.04164 null
2025-03-06 VLA Model-Expert Collaboration for Bi-directional Manipulation Learning Tian-Yu Xiang et.al. 2503.04163 null
2025-03-06 Semantic Retrieval Augmented Contrastive Learning for Sequential Recommendation Ziqiang Cui et.al. 2503.04162 null
2025-03-06 KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease Yongchao Long et.al. 2503.04153 link
2025-03-06 Ticktack : Long Span Temporal Alignment of Large Language Models Leveraging Sexagenary Cycle Time Expression Xue Han et.al. 2503.04150 null
2025-03-06 Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination Simin Chen et.al. 2503.04149 null
2025-03-06 Biological Sequence with Language Model Prompting: A Survey Jiyue Jiang et.al. 2503.04135 null
2025-03-06 Token-Efficient Long Video Understanding for Multimodal LLMs Jindong Jiang et.al. 2503.04130 null
2025-03-06 TimeFound: A Foundation Model for Time Series Forecasting Congxi Xiao et.al. 2503.04118 null
2025-03-06 InterChat: Enhancing Generative Visual Analytics using Multimodal Interactions Juntong Chen et.al. 2503.04110 null
2025-03-06 WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining Haoran Wang et.al. 2503.04106 link
2025-03-06 LLMs Can Generate a Better Answer by Aggregating Their Own Responses Zichong Li et.al. 2503.04104 null
2025-03-06 Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English Runtao Zhou et.al. 2503.04099 null
2025-03-07 Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts Xiangnan Chen et.al. 2503.04095 null
2025-03-06 PokéChamp: an Expert-level Minimax Language Agent Seth Karten et.al. 2503.04094 null
2025-03-06 Beyond Memorization: Evaluating the True Type Inference Capabilities of LLMs for Java Code Snippets Yiwen Dong et.al. 2503.04076 null
2025-03-06 PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks Feng Ni et.al. 2503.04065 link
2025-03-06 Uncovering inequalities in new knowledge learning by large language models across different languages Chenglong Wang et.al. 2503.04064 link
2025-03-06 EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models Haiyang Yu et.al. 2503.04058 null
2025-03-06 Insights from Rights and Wrongs: A Large Language Model for Solving Assertion Failures in RTL Design Jie Zhou et.al. 2503.04057 link
2025-03-06 GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding Xihan Wang et.al. 2503.04034 null
2025-03-06 Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting Jiyue Jiang et.al. 2503.04013 null
2025-03-06 DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation Amin Karimi et.al. 2503.04006 null
2025-03-06 Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows Xiangxin Zhou et.al. 2503.03989 null
2025-03-06 RetinalGPT: A Retinal Clinical Preference Conversational Assistant Powered by Large Vision-Language Models Wenhui Zhu et.al. 2503.03987 null
2025-03-06 ReasonGraph: Visualisation of Reasoning Paths Zongqian Li et.al. 2503.03979 link
2025-03-05 Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge Fanwen Wang et.al. 2503.03971 link
2025-03-05 Model Behavior Specification by Leveraging LLM Self-Playing and Self-Improving Soya Park et.al. 2503.03967 null
2025-03-05 The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems Richard Ren et.al. 2503.03750 null
2025-03-05 Process-based Self-Rewarding Language Models Shimao Zhang et.al. 2503.03746 null
2025-03-05 Towards Understanding Distilled Reasoning Models: A Representational Approach David D. Baek et.al. 2503.03730 null
2025-03-05 Improving LLM Safety Alignment with Dual-Objective Optimization Xuandong Zhao et.al. 2503.03710 link
2025-03-05 Rethinking Video Tokenization: A Conditioned Diffusion-based Approach Nianzu Yang et.al. 2503.03708 null
2025-03-05 Effective LLM Knowledge Learning via Model Generalization Mingkang Zhu et.al. 2503.03705 null
2025-03-05 A Practical Memory Injection Attack against LLM Agents Shen Dong et.al. 2503.03704 null
2025-03-05 Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models Jiyue Jiang et.al. 2503.03702 null
2025-03-05 Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks Zihao Zhao et.al. 2503.03687 link
2025-03-05 Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models Bar Karov et.al. 2503.03669 link
2025-03-05 Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction Gustaw Opiełka et.al. 2503.03666 link
2025-03-05 A Generative Approach to High Fidelity 3D Reconstruction from Text Data Venkat Kumar R et.al. 2503.03664 null
2025-03-05 Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset Jessica Hoffmann et.al. 2503.03654 null
2025-03-05 Token-Level Privacy in Large Language Models Re’em Harel et.al. 2503.03652 null
2025-03-05 DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles Rui Zhao et.al. 2503.03651 link
2025-03-05 Psy-Copilot: Visual Chain of Thought for Counseling Keqi Chen et.al. 2503.03645 null
2025-03-05 Large language models in finance: estimating financial sentiment for stock prediction Kemal Kirtac et.al. 2503.03612 null
2025-03-05 Enhancing the Accuracy and Comprehensibility in Architectural Tactics Detection via Small Model-Augmented Prompt Engineering Lingli Cao et.al. 2503.03609 link
2025-03-05 Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling Keqi Chen et.al. 2503.03607 null
2025-03-05 Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Kristian Kuznetsov et.al. 2503.03601 null
2025-03-05 PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention Lida Chen et.al. 2503.03588 null
2025-03-05 “You don’t need a university degree to comprehend data protection this way”: LLM-Powered Interactive Privacy Policy Assessment Vincent Freiberger et.al. 2503.03587 null
2025-03-05 Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories Alperen Yildiz et.al. 2503.03586 null
2025-03-05 Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection Wenqiao Li et.al. 2503.03562 null
2025-03-05 Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation Xiaomeng Zhu et.al. 2503.03556 null
2025-03-05 Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems Yaoru Li et.al. 2503.03505 link
2025-03-05 Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization Jiajun Yu et.al. 2503.03503 link
2025-03-05 CURVALID: Geometrically-guided Adversarial Prompt Detection Canaan Yung et.al. 2503.03502 link
2025-03-05 TEDDY: A Family Of Foundation Models For Understanding Single Cell Biology Alexis Chevalier et.al. 2503.03485 null
2025-03-05 Generative Artificial Intelligence in Robotic Manipulation: A Survey Kun Zhang et.al. 2503.03464 null
2025-03-05 Open-Source Large Language Models as Multilingual Crowdworkers: Synthesizing Open-Domain Dialogues in Several Languages With No Examples in Targets and No Machine Translation Ahmed Njifenjou et.al. 2503.03462 null
2025-03-05 Visualising Policy-Reward Interplay to Inform Zeroth-Order Preference Optimisation of Large Language Models Alessio Galatolo et.al. 2503.03460 link
2025-03-05 Unified Mind Model: Reimagining Autonomous Agents in the LLM Era Pengbo Hu et.al. 2503.03459 null
2025-03-05 Taxation Perspectives from Large Language Models: A Case Study on Additional Tax Penalties Eunkyung Choi et.al. 2503.03444 null
2025-03-05 RASD: Retrieval-Augmented Speculative Decoding Guofeng Quan et.al. 2503.03434 null
2025-03-05 Video Super-Resolution: All You Need is a Video Diffusion Model Zhihao Zhan et.al. 2503.03355 null
2025-03-05 Leveraging Large Language Models to Develop Heuristics for Emerging Optimization Problems Thomas Bömer et.al. 2503.03350 null
2025-03-05 EnigmaToM: Improve LLMs’ Theory-of-Mind Reasoning Capabilities with Neural Knowledge Base of Entity States Hainiu Xu et.al. 2503.03340 link
2025-03-05 LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models Xi Zhu et.al. 2503.03313 null
2025-03-05 SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection Yi-Fan Lu et.al. 2503.03303 null
2025-03-05 Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters Julia Hindel et.al. 2503.03299 null
2025-03-05 A 262 TOPS Hyperdimensional Photonic AI Accelerator powered by a Si3N4 microcomb laser Christos Pappas et.al. 2503.03263 null
2025-03-05 Can Frontier LLMs Replace Annotators in Biomedical Text Mining? Analyzing Challenges and Exploring Solutions Yichong Zhao et.al. 2503.03261 null
2025-03-05 Exploring the Potential of Large Language Models as Predictors in Dynamic Text-Attributed Graphs Runlin Lei et.al. 2503.03258 null
2025-03-05 PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary Algorithms Shady Ali et.al. 2503.03239 link
2025-03-05 FANS – Formal Answer Selection for Natural Language Math Reasoning Using Lean4 Jiarui Yao et.al. 2503.03238 null
2025-03-05 Targeted Distillation for Sentiment Analysis Yice Zhang et.al. 2503.03225 null
2025-03-05 Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture Zhumei Wang et.al. 2503.03222 null
2025-03-05 COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence Wentao Li et.al. 2503.03215 null
2025-03-05 PolyVer: A Compositional Approach for Polyglot System Modeling and Verification Pei-Wei Chen et.al. 2503.03207 null
2025-03-05 An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models Binxu Wang et.al. 2503.03206 null
2025-03-05 MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving Ruida Wang et.al. 2503.03205 link
2025-03-05 Find Matching Faces Based On Face Parameters Setu A. Bhatt et.al. 2503.03204 null
2025-03-05 Towards Robust Universal Information Extraction: Benchmark, Evaluation, and Solution Jizhao Zhu et.al. 2503.03201 null
2025-03-05 Structured Outputs Enable General-Purpose LLMs to be Medical Experts Guangfu Guo et.al. 2503.03194 null
2025-03-05 Enhancing Memory Efficiency in Large Language Model Training Through Chronos-aware Pipeline Parallelism Xinyuan Lin et.al. 2503.03182 null
2025-03-05 Enhancing Cybersecurity in Critical Infrastructure with LLM-Assisted Explainable IoT Systems Ashutosh Ghimire et.al. 2503.03180 null
2025-03-05 AttackSeqBench: Benchmarking Large Language Models’ Understanding of Sequential Patterns in Cyber Attacks Javier Yong et.al. 2503.03170 link
2025-03-05 Dango: A Mixed-Initiative Data Wrangling System using Large Language Model Wei-Hao Chen et.al. 2503.03154 null
2025-03-05 Position: Model Collapse Does Not Mean What You Think Rylan Schaeffer et.al. 2503.03150 null
2025-03-05 DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models YiQiu Guo et.al. 2503.03149 null
2025-03-05 PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Function Secret Sharing Zhichao You et.al. 2503.03146 null
2025-03-05 A Survey of Foundation Models for Environmental Science Runlong Yu et.al. 2503.03142 null
2025-03-05 StarFlow: Leveraging Normalizing Flows for Stellar Age Estimation in SDSS-V DR19 Alexander Stone-Martinez et.al. 2503.03138 null
2025-03-05 Bridging Molecular Graphs and Large Language Models Runze Wang et.al. 2503.03135 link
2025-03-05 Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability Chenhui Xu et.al. 2503.03128 null
2025-03-05 The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models Zichao Li et.al. 2503.03122 null
2025-03-05 PromAssistant: Leveraging Large Language Models for Text-to-PromQL Chenxi Zhang et.al. 2503.03114 null
2025-03-05 SoK: Knowledge is All You Need: Last Mile Delivery for Automated Provenance-based Intrusion Detection with LLMs Wenrui Cheng et.al. 2503.03108 null
2025-03-05 Monitoring Decoding: Mitigating Hallucination via Evaluating the Factuality of Partial Response during Generation Yurui Chang et.al. 2503.03106 null
2025-03-05 BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving Katharina Winter et.al. 2503.03074 link
2025-03-04 Unification of Stochastic and Quantum Thermodynamics in Scalar Field Theory via a Model with Brownian Thermostat T. Koide et.al. 2503.03059 null
2025-03-04 SAGE: Steering and Refining Dialog Generation with State-Action Augmentation Yizhe Zhang et.al. 2503.03040 link
2025-03-04 SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs Samir Abdaljalil et.al. 2503.03032 null
2025-03-04 Generative Active Adaptation for Drifting and Imbalanced Network Intrusion Detection Ragini Gupta et.al. 2503.03022 null
2025-03-04 Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems? Evan Scope Crafts et.al. 2503.03007 link
2025-03-04 Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment Matthew DosSantos DiSorbo et.al. 2503.02976 null
2025-03-04 LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation Jude Khouja et.al. 2503.02972 null
2025-03-04 Multilingual Relative Clause Attachment Ambiguity Resolution in Large Language Models So Young Lee et.al. 2503.02971 link
2025-03-04 InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model Siqi Ouyang et.al. 2503.02969 link
2025-03-04 Privacy-Preserving Fair Synthetic Tabular Data Fatima J. Sarmin et.al. 2503.02968 null
2025-03-04 KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Zhangchen Xu et.al. 2503.02951 link
2025-03-04 Train on classical, deploy on quantum: scaling generative quantum machine learning to a thousand qubits Erik Recio-Armengol et.al. 2503.02934 link
2025-03-04 Optimizing open-domain question answering with graph-based retrieval augmented generation Joyce Cahoon et.al. 2503.02922 null
2025-03-04 ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models Qinyu Zhao et.al. 2503.02883 link
2025-03-04 Wikipedia in the Era of LLMs: Evolution and Risks Siming Huang et.al. 2503.02879 link
2025-03-04 SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models Dmitry Nechaev et.al. 2503.02876 link
2025-03-04 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models Ke Ji et.al. 2503.02875 null
2025-03-04 Prompting Generative AI with Interaction-Augmented Instructions Leixian Shen et.al. 2503.02874 null
2025-03-05 FairSense-AI: Responsible AI Meets Sustainability Shaina Raza et.al. 2503.02865 null
2025-03-04 Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework Ziang Zhou et.al. 2503.02863 null
2025-03-04 Privacy and Accuracy-Aware AI/ML Model Deduplication Hong Guan et.al. 2503.02862 null
2025-03-04 Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers Zicong He et.al. 2503.02851 link
2025-03-04 Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs Yuzhe Gu et.al. 2503.02846 link
2025-03-04 SeqFusion: Sequential Fusion of Pre-Trained Models for Zero-Shot Time-Series Forecasting Ting-Ji Huang et.al. 2503.02836 link
2025-03-04 AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation Songming Zhang et.al. 2503.02832 null
2025-03-04 Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging Yujin Oh et.al. 2503.02824 null
2025-03-04 A Multimodal Symphony: Integrating Taste and Sound through Generative AI Matteo Spanio et.al. 2503.02823 null
2025-03-04 Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts Marta Skreta et.al. 2503.02819 link
2025-03-04 RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration Alicia Russell-Gilbert et.al. 2503.02800 null
2025-03-04 Multimodal AI predicts clinical outcomes of drug combinations from preclinical data Yepeng Huang et.al. 2503.02781 link
2025-03-04 Implicit Bias in LLMs: A Survey Xinru Lin et.al. 2503.02776 null
2025-03-04 InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training Dingdong Wang et.al. 2503.02769 null
2025-03-04 BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression Daniil Larionov et.al. 2503.02756 null
2025-03-04 Large Language Models for Multilingual Previously Fact-Checked Claim Detection Ivan Vykopal et.al. 2503.02737 link
2025-03-04 RedChronos: A Large Language Model-Based Log Analysis System for Insider Threat Detection in Enterprises Chenyu Li et.al. 2503.02702 null
2025-03-04 MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality Shuaike Li et.al. 2503.02701 link
2025-03-04 Zero-Shot Complex Question-Answering on Long Scientific Documents Wanting Wang et.al. 2503.02695 link
2025-03-04 FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting Congluo Xu et.al. 2503.02692 null
2025-03-04 Generative Modeling of Microweather Wind Velocities for Urban Air Mobility Tristan A. Shah et.al. 2503.02690 link
2025-03-04 MPO: Boosting LLM Agents with Meta Plan Optimization Weimin Xiong et.al. 2503.02682 link
2025-03-04 Multidimensional Consistency Improves Reasoning in Language Models Huiyuan Lai et.al. 2503.02670 null
2025-03-04 LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models Pengwei Tang et.al. 2503.02659 null
2025-03-04 The Effectiveness of Large Language Models in Transforming Unstructured Text to Standardized Formats William Brach et.al. 2503.02650 link
2025-03-04 YARE-GAN: Yet Another Resting State EEG-GAN Yeganeh Farahzadi et.al. 2503.02636 link
2025-03-04 Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective Haotian Li et.al. 2503.02631 null
2025-03-04 Towards Event Extraction with Massive Types: LLM-based Collaborative Annotation and Partitioning Extraction Wenxuan Liu et.al. 2503.02628 null
2025-03-04 Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models Paul Stangel et.al. 2503.02623 null
2025-03-04 OkraLong: A Flexible Retrieval-Augmented Framework for Long-Text Query Processing Yulong Hui et.al. 2503.02603 null
2025-03-04 Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs Wei-Yao Wang et.al. 2503.02597 link
2025-03-04 StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts Zhaoxing Gan et.al. 2503.02595 null
2025-03-04 MciteBench: A Benchmark for Multimodal Citation Text Generation in MLLMs Caiyu Hu et.al. 2503.02589 link
2025-03-04 Playing games with Large language models: Randomness and strategy Alicia Vidler et.al. 2503.02582 null
2025-03-04 LLM-Safety Evaluations Lack Robustness Tim Beyer et.al. 2503.02574 null
2025-03-04 SpecInF: Exploiting Idle GPU Resources in Distributed DL Training via Speculative Inference Filling Cunchi Lv et.al. 2503.02550 null
2025-03-04 PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks Sheng Shang et.al. 2503.02547 null
2025-03-04 SAGE-Amine: Generative Amine Design with Multi-Property Optimization for Efficient CO2 Capture Hocheol Lim et.al. 2503.02534 null
2025-03-04 Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development Dimitri Ognibene et.al. 2503.02532 null
2025-03-04 Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent Xingzuo Li et.al. 2503.02519 link
2025-03-04 Deepfake Detection via Knowledge Injection Tonghui Li et.al. 2503.02503 null
2025-03-04 LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs Jianghao Chen et.al. 2503.02502 null
2025-03-04 PennyLang: Pioneering LLM-Based Quantum Code Generation with a Novel PennyLane-Centric Dataset Haider Asif et.al. 2503.02497 null
2025-03-04 BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA Zhengyang Ji et.al. 2503.02476 link
2025-03-04 It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation Sohan Patnaik et.al. 2503.02463 null
2025-03-04 Don’t Get Too Excited – Eliciting Emotions in LLMs Gino Franco Fazzi et.al. 2503.02457 null
2025-03-04 Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations Yuhao Yang et.al. 2503.02453 null
2025-03-04 Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization Yilun Qiu et.al. 2503.02450 link
2025-03-04 AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking Iraklis Premptis et.al. 2503.02443 null
2025-03-04 AILS-NTUA at SemEval-2025 Task 3: Leveraging Large Language Models and Translation Strategies for Multilingual Hallucination Detection Dimitra Karkani et.al. 2503.02442 null
2025-03-04 Artificial Intelligence in Reactor Physics: Current Status and Future Prospects Ruizhi Zhang et.al. 2503.02440 null
2025-03-04 Beyond the Leland strategies Emmanuel Lepinette et.al. 2503.02419 null
2025-03-04 Building 3D In-Context Learning Universal Model in Neuroimaging Jiesi Hu et.al. 2503.02410 link
2025-03-04 Wyckoff Transformer: Generation of Symmetric Crystals Nikita Kazeev et.al. 2503.02407 link
2025-03-04 Hierarchical Re-ranker Retriever (HRR) Ashish Singh et.al. 2503.02401 null
2025-03-04 Promptware Engineering: Software Engineering for LLM Prompt Development Zhenpeng Chen et.al. 2503.02400 null
2025-03-04 PersonaX: A Recommendation Agent Oriented User Modeling Framework for Long Behavior Sequence Yunxiao Shi et.al. 2503.02398 link
2025-03-04 ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks Heng Zhou et.al. 2503.02390 link
2025-03-04 RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking Yifeng Xu et.al. 2503.02387 null
2025-03-04 An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning Wei Sun et.al. 2503.02382 null
2025-03-04 Teaching Metric Distance to Autoregressive Multimodal Foundational Models Jiwan Chung et.al. 2503.02379 null
2025-03-04 MedEthicEval: Evaluating Large Language Models Based on Chinese Medical Ethics Haoan Jin et.al. 2503.02374 null
2025-03-04 EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports Lama Moukheiber et.al. 2503.02365 null
2025-03-04 Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm Zhuo Li et.al. 2503.02359 null
2025-03-04 Efficient Long Context Fine-tuning with Chunk Flow Xiulong Yuan et.al. 2503.02356 null
2025-03-04 CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory Jiashun Suo et.al. 2503.02354 null
2025-03-04 DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability Yunzhen He et.al. 2503.02343 link
2025-03-04 GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning Zhun Mou et.al. 2503.02341 null
2025-03-04 Limited Effectiveness of LLM-based Data Augmentation for COVID-19 Misinformation Stance Detection Eun Cheol Choi et.al. 2503.02328 null
2025-03-04 PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models Xueliang Zhao et.al. 2503.02324 link
2025-03-04 Generative Model-Assisted Demosaicing for Cross-multispectral Cameras Jiahui Luo et.al. 2503.02322 null
2025-03-04 Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration Pengchen Liang et.al. 2503.02321 null
2025-03-04 A Token-level Text Image Foundation Model for Document Understanding Tongkun Guan et.al. 2503.02304 null
2025-03-04 Towards Large Language Model Guided Kernel Direct Fuzzing Xie Li et.al. 2503.02301 null
2025-03-04 Towards Explainable Doctor Recommendation with Large Language Models Ziyang Zeng et.al. 2503.02298 null
2025-03-04 Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions Wentao Chen et.al. 2503.02296 null
2025-03-04 spike: A tool to drizzle HST, JWST, and Roman PSFs for improved analyses Ava Polzin et.al. 2503.02288 link
2025-03-04 AppAgentX: Evolving GUI Agents as Proficient Smartphone Users Wenjia Jiang et.al. 2503.02268 null
2025-03-04 Large Language Models as Natural Selector for Embodied Soft Robot Design Changhe Chen et.al. 2503.02249 null
2025-03-04 Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts Tong Liang et.al. 2503.02248 null
2025-03-04 From Code to Courtroom: LLMs as the New Software Judges Junda He et.al. 2503.02246 null
2025-03-04 OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale Haoyang Li et.al. 2503.02240 link
2025-03-04 V2X-LLM: Enhancing V2X Integration and Understanding in Connected Vehicle Corridors Keshu Wu et.al. 2503.02239 null
2025-03-04 Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions Zirui Wu et.al. 2503.02238 link
2025-03-04 Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling Hang Zheng et.al. 2503.02233 null
2025-03-04 ATLaS: Agent Tuning via Learning Critical Steps Zhixun Chen et.al. 2503.02197 null
2025-03-04 DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models Saeed Ranjbar Alvar et.al. 2503.02175 link
2025-03-04 Leveraging Large Language Models for Enhanced Digital Twin Modeling: Trends, Methods, and Challenges Linyao Yang et.al. 2503.02167 null
2025-03-04 X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning Jianzhong You et.al. 2503.02162 null
2025-03-04 LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation Yunbo Long et.al. 2503.02161 null
2025-03-04 Tabby: Tabular Data Synthesis with Language Models Sonia Cromp et.al. 2503.02152 null
2025-03-04 Malware Classification from Memory Dumps Using Machine Learning, Transformers, and Large Language Models Areej Dweib et.al. 2503.02144 null
2025-03-04 Measuring Intrinsic Dimension of Token Embeddings Takuya Kataiwa et.al. 2503.02142 null
2025-03-04 Network Traffic Classification Using Machine Learning, Transformer, and Large Language Models Ahmad Antari et.al. 2503.02141 null
2025-03-03 TMIQ: Quantifying Test and Measurement Domain Intelligence in Large Language Models Emmanuel A. Olowe et.al. 2503.02123 null
2025-02-28 LLM Post-Training: A Deep Dive into Reasoning Large Language Models Komal Kumar et.al. 2502.21321 null
2025-02-28 How far can we go with ImageNet for Text-to-Image generation? L. Degeorge et.al. 2502.21318 null
2025-02-28 FANformer: Improving Large Language Models Through Effective Periodicity Modeling Yihong Dong et.al. 2502.21309 null
2025-02-28 Contextualizing biological perturbation experiments through language Menghua Wu et.al. 2502.21290 link
2025-02-28 Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion Kulin Shah et.al. 2502.21278 null
2025-02-28 Adaptive Keyframe Sampling for Long Video Understanding Xi Tang et.al. 2502.21271 null
2025-03-03 Foundation Models – A Panacea for Artificial Intelligence in Pathology? Nita Mulliqi et.al. 2502.21264 null
2025-02-28 Modeling Human Beliefs about AI Behavior for Scalable Oversight Leon Lang et.al. 2502.21262 null
2025-02-28 RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete Yuheng Ji et.al. 2502.21257 null
2025-02-28 TimesBERT: A BERT-Style Foundation Model for Time Series Understanding Haoran Zhang et.al. 2502.21245 null
2025-03-04 Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs Xiaomin Li et.al. 2502.21239 null
2025-02-28 Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication Daniil Filienko et.al. 2502.21236 null
2025-02-28 ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs Hao Ge et.al. 2502.21231 null
2025-03-03 ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer Omer Goldman et.al. 2502.21228 null
2025-02-28 Dynamic Markov Blanket Detection for Macroscopic Physics Discovery Jeff Beck et.al. 2502.21217 link
2025-02-28 Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought Jianhao Huang et.al. 2502.21212 null
2025-02-28 Chronologically Consistent Large Language Models Songrun He et.al. 2502.21206 null
2025-03-04 SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training Fakrul Islam Tushar et.al. 2502.21187 null
2025-02-28 $Δ$ -model correction of Foundation Model based on the models own understanding Mads-Peter Verner Christiansen et.al. 2502.21179 null
2025-03-03 Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models Ruta Binkyte et.al. 2502.21123 null
2025-02-28 Optimizing Large Language Models for ESG Activity Detection in Financial Texts Mattia Birti et.al. 2502.21112 link
2025-02-28 Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure? Charles Dawson et.al. 2502.21110 null
2025-02-28 Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization Lie Meng Pang et.al. 2502.21108 null
2025-02-28 Generating patient cohorts from electronic health records using two-step retrieval-augmented text-to-SQL generation Angelo Ziletti et.al. 2502.21107 null
2025-02-28 A Non-contrast Head CT Foundation Model for Comprehensive Neuro-Trauma Triage Youngjin Yoo et.al. 2502.21106 null
2025-02-28 Re-evaluating Theory of Mind evaluation in large language models Jennifer Hu et.al. 2502.21098 null
2025-02-28 An LLM-based Delphi Study to Predict GenAI Evolution Francesco Bertolotti et.al. 2502.21092 null
2025-02-28 PASemiQA: Plan-Assisted Agent for Question Answering on Semi-Structured Data with Text and Relational Information Hansi Yang et.al. 2502.21087 null
2025-02-28 Are foundation models useful feature extractors for electroencephalography analysis? Özgün Turgut et.al. 2502.21086 null
2025-02-28 Spatial Reasoning with Denoising Models Christopher Wewer et.al. 2502.21075 null
2025-02-28 CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation Zhenyi Shen et.al. 2502.21074 null
2025-02-28 GUIDE: LLM-Driven GUI Generation Decomposition for Automated Prototyping Kristian Kolthoff et.al. 2502.21068 null
2025-02-28 Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport Jingru Fu et.al. 2502.21049 link
2025-02-28 Incorporating Long-Range Interactions via the Multipole Expansion into Ground and Excited-State Molecular Simulations Rhyan Barrett et.al. 2502.21045 null
2025-02-28 The amplifier effect of artificial agents in social contagion Eric Hitz et.al. 2502.21037 null
2025-02-28 Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs José I. Orlicki et.al. 2502.21030 null
2025-02-28 Measuring and identifying factors of individuals’ trust in Large Language Models Edoardo Sebastiano De Duro et.al. 2502.21028 null
2025-02-28 PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues Fangxu Yu et.al. 2502.21017 null
2025-02-28 Explainable Biomedical Claim Verification with Large Language Models Siting Liang et.al. 2502.21014 null
2025-02-28 Merging Clinical Knowledge into Large Language Models for Medical Research and Applications: A Survey Qiyuan Li et.al. 2502.20988 null
2025-02-28 UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation Thanet Markchom et.al. 2502.20984 null
2025-02-28 Set-Theoretic Compositionality of Sentence Embeddings Naman Bansal et.al. 2502.20975 null
2025-02-28 TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval Chien-Yu Lin et.al. 2502.20969 null
2025-02-28 Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs Weixiang Zhao et.al. 2502.20968 null
2025-02-28 Fine-Grained Retrieval-Augmented Generation for Visual Question Answering Zhengxuan Zhang et.al. 2502.20964 null
2025-02-28 Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content Hongyuan Shen et.al. 2502.20952 null
2025-02-28 Generative Uncertainty in Diffusion Models Metod Jazbec et.al. 2502.20946 null
2025-02-28 A Deep User Interface for Exploring LLaMa Divya Perumal et.al. 2502.20938 null
2025-02-28 Large Language Models Are Innate Crystal Structure Generators Jingru Gan et.al. 2502.20933 null
2025-02-28 Automated Evaluation of Meter and Rhyme in Russian Generative and Human-Authored Poetry Ilya Koziev et.al. 2502.20931 null
2025-02-28 A database to support the evaluation of gender biases in GPT-4o output Luise Mehner et.al. 2502.20898 null
2025-02-28 Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals’ Subjective Text Perceptions Matthias Orlikowski et.al. 2502.20897 null
2025-02-28 PathVG: A New Benchmark and Dataset for Pathology Visual Grounding Chunlin Zhong et.al. 2502.20869 null
2025-02-28 ProBench: Benchmarking Large Language Models in Competitive Programming Lei Yang et.al. 2502.20868 null
2025-02-28 The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents Yifan Duan et.al. 2502.20859 null
2025-02-28 Learning to Substitute Components for Compositional Generalization Zhaoyi Li et.al. 2502.20834 null
2025-02-28 CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval Zelong Sun et.al. 2502.20826 null
2025-02-28 Can We Simplify Slide-level Fine-tuning of Pathology Foundation Models? Jiawen Li et.al. 2502.20823 null
2025-02-28 Towards Reliable Vector Database Management Systems: A Software Testing Roadmap for 2030 Shenao Wang et.al. 2502.20812 null
2025-02-28 HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models Xiao Wang et.al. 2502.20811 null
2025-02-28 PFD: Automatically Generating Machine Learning Force Fields from Universal Models Ruoyu Wang et.al. 2502.20809 link
2025-03-03 MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts Peijie Wang et.al. 2502.20808 null
2025-02-28 Digital Player: Evaluating Large Language Models based Human-like Agent in Games Jiawei Wang et.al. 2502.20807 link
2025-02-28 Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation Kuang-Da Wang et.al. 2502.20795 null
2025-02-28 Cyber Defense Reinvented: Large Language Models as Threat Intelligence Copilots Xiaoqun Liu et.al. 2502.20791 null
2025-02-28 Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision Dawei Zhu et.al. 2502.20790 null
2025-02-28 Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective Yuko Nakagi et.al. 2502.20779 null
2025-02-28 FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference Xunhao Lai et.al. 2502.20766 link
2025-02-28 Collective Reasoning Among LLMs A Framework for Answer Validation Without Ground Truth Seyed Pouyan Mousavi Davoudi et.al. 2502.20758 null
2025-02-28 The Rise of Darkness: Safety-Utility Trade-Offs in Role-Playing Dialogue Agents Yihong Tang et.al. 2502.20757 null
2025-02-28 SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models Yichi Zhang et.al. 2502.20749 link
2025-02-28 Teach-to-Reason with Scoring: Self-Explainable Rationale-Driven Multi-Trait Essay Scoring Heejin Do et.al. 2502.20748 null
2025-02-28 Measuring Determinism in Large Language Models for Software Code Review Eugene Klishevich et.al. 2502.20747 null
2025-02-28 CADDreamer: CAD object Generation from Single-view Images Yuan Li et.al. 2502.20732 null
2025-02-28 SPD: Sync-Point Drop for efficient tensor parallelism of Large Language Models Han-Byul Kim et.al. 2502.20727 null
2025-02-28 Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition Yifei Duan et.al. 2502.20726 link
2025-02-28 Generating Clinically Realistic EHR Data via a Hierarchy- and Semantics-Guided Transformer Guanglin Zhou et.al. 2502.20719 null
2025-02-28 Why Trust in AI May Be Inevitable Nghi Truong et.al. 2502.20701 null
2025-02-28 Towards General Visual-Linguistic Face Forgery Detection(V2) Ke Sun et.al. 2502.20698 link
2025-02-28 WorldModelBench: Judging Video Generation Models As World Models Dacheng Li et.al. 2502.20694 null
2025-02-28 Unleashing the Potential of Two-Tower Models: Diffusion-Based Cross-Interaction for Large-Scale Matching Yihan Wang et.al. 2502.20687 null
2025-02-28 JAM: Controllable and Responsible Text Generation via Causal Reasoning and Latent Vector Manipulation Yingbing Huang et.al. 2502.20684 null
2025-02-28 STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding Aaryan Garg et.al. 2502.20678 null
2025-02-28 SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition Shanshan Wan et.al. 2502.20676 null
2025-02-28 Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA Ojonugwa Oluwafemi Ejiga Peter et.al. 2502.20667 null
2025-02-28 Consistency Evaluation of News Article Summaries Generated by Large (and Small) Language Models Colleen Gilhuly et.al. 2502.20647 null
2025-02-28 LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation Haitao Li et.al. 2502.20640 null
2025-02-28 Can LLM Assist in the Evaluation of the Quality of Machine Learning Explanations? Bo Wang et.al. 2502.20635 null
2025-02-28 Are LLMs Ready for Practical Adoption for Assertion Generation? Vaishnavi Pulavarthi et.al. 2502.20633 null
2025-02-28 Rectifying Belief Space via Unlearning to Harness LLMs’ Reasoning Ayana Niwa et.al. 2502.20620 null
2025-02-28 Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems Jędrzej Warczyński et.al. 2502.20609 null
2025-02-28 NutriGen: Personalized Meal Plan Generator Leveraging Large Language Models to Enhance Dietary and Nutritional Adherence Saman Khamesian et.al. 2502.20601 link
2025-02-27 Few-Shot, No Problem: Descriptive Continual Relation Extraction Nguyen Xuan Thanh et.al. 2502.20596 null
2025-02-27 Multi $^2$ : Multi-Agent Test-Time Scalable Framework for Multi-Document Processing Juntai Cao et.al. 2502.20592 null
2025-02-27 LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis Saeif Alhazbi et.al. 2502.20589 null
2025-03-04 InstaFace: Identity-Preserving Facial Editing with Single Image Inference MD Wahiduzzaman Khan et.al. 2502.20577 null
2025-02-27 ECCOS: Efficient Capability and Cost Coordinated Scheduling for Multi-LLM Serving Kai Mei et.al. 2502.20576 link
2025-02-27 Visual Reasoning at Urban Intersections: FineTuning GPT-4o for Traffic Conflict Detection Sari Masri et.al. 2502.20573 null
2025-02-27 Stochastic Rounding for LLM Training: Theory and Practice Kaan Ozkara et.al. 2502.20566 null
2025-02-27 LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks Joana C. Costa et.al. 2502.20562 null
2025-02-27 R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts Zhongyang Li et.al. 2502.20395 link
2025-02-27 InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions Sirui Xu et.al. 2502.20390 null
2025-02-27 Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Sucheng Ren et.al. 2502.20388 link
2025-02-27 Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis Jeffrey Yang Fan Chiang et.al. 2502.20383 null
2025-02-27 Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers Shalev Lifshitz et.al. 2502.20379 null
2025-02-27 PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation Albert Gong et.al. 2502.20377 link
2025-02-27 Constrained Generative Modeling with Manually Bridged Diffusion Models Saeid Naderiparizi et.al. 2502.20371 null
2025-02-27 Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan C. Barron et.al. 2502.20364 null
2025-02-27 Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs Kuan Lok Zhou et.al. 2502.20356 null
2025-02-27 KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model Kai Zhang et.al. 2502.20350 null
2025-02-27 Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models Yi Jing et.al. 2502.20344 null
2025-02-27 Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners Daniele Paliotta et.al. 2502.20339 null
2025-02-27 Expertise Is What We Want Alan Ashworth et.al. 2502.20335 null
2025-02-27 Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models Yukang Yang et.al. 2502.20332 null
2025-02-27 Long-Context Inference with Retrieval-Augmented Speculative Decoding Guanzheng Chen et.al. 2502.20330 link
2025-02-27 EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants Franck Cappello et.al. 2502.20309 null
2025-02-27 M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging Jinghao Feng et.al. 2502.20301 null
2025-02-27 An exploration of features to improve the generalisability of fake news detection models Nathaniel Hoy et.al. 2502.20299 null
2025-02-27 Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription Benjamin Gutteridge et.al. 2502.20295 link
2025-02-27 Conformal Tail Risk Control for Large Language Model Alignment Catherine Yu-Chi Chen et.al. 2502.20285 null
2025-02-27 Evaluating Human Trust in LLM-Based Planners: A Preliminary Study Shenghui Chen et.al. 2502.20284 null
2025-02-27 Large Language Models as Attribution Regularizers for Efficient Model Training Davor Vukadin et.al. 2502.20268 link
2025-02-27 Vector-Quantized Vision Foundation Models for Object-Centric Learning Rongzhen Zhao et.al. 2502.20263 null
2025-02-27 LLM as a Broken Telephone: Iterative Generation Distorts Information Amr Mohamed et.al. 2502.20258 link
2025-02-27 Do computer vision foundation models learn the low-level characteristics of the human visual system? Yancheng Cai et.al. 2502.20256 null
2025-02-27 Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets Chichien Tsai et.al. 2502.20246 null
2025-02-27 From Retrieval to Generation: Comparing Different Approaches Abdelrahman Abdallah et.al. 2502.20245 null
2025-02-27 FINEREASON: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving Guizhen Chen et.al. 2502.20238 link
2025-02-27 AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions Clare Grogan et.al. 2502.20231 link
2025-02-27 Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars Tobias Kirschstein et.al. 2502.20220 null
2025-02-27 ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models Haibin Chen et.al. 2502.20196 link
2025-02-27 Model Checking Linear Temporal Logic with Standpoint Modalities Rajab Aghamov et.al. 2502.20193 null
2025-02-27 Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge Yan-Lun Chen et.al. 2502.20186 null
2025-02-27 DGFM: Full Body Dance Generation Driven by Music Foundation Models Xinran Liu et.al. 2502.20176 null
2025-02-27 An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs Kaustubh Vyas et.al. 2502.20175 null
2025-02-27 Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think Liang Chen et.al. 2502.20172 link
2025-02-27 Re-evaluating Open-ended Evaluation of Large Language Models Siqi Liu et.al. 2502.20170 null
2025-02-27 Adaptive H&E-IHC information fusion staining framework based on feature extra Yifan Jia et.al. 2502.20156 link
2025-02-27 Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale Max M. Lang et.al. 2502.20140 null
2025-02-27 Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking Yifan Zhang et.al. 2502.20129 null
2025-02-27 Self-Training Elicits Concise Reasoning in Large Language Models Tergel Munkhbat et.al. 2502.20122 link
2025-02-27 LongRoPE2: Near-Lossless LLM Context Window Scaling Ning Shang et.al. 2502.20082 link
2025-02-27 Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents Haochen Sun et.al. 2502.20073 link
2025-02-27 A Generative Model Enhanced Multi-Agent Reinforcement Learning Method for Electric Vehicle Charging Navigation Tianyang Qi et.al. 2502.20068 null
2025-02-27 Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish Marta Lango et.al. 2502.20046 null
2025-02-27 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds Hengshuo Chu et.al. 2502.20041 null
2025-02-27 AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs Xuyang Wei et.al. 2502.20035 link
2025-02-27 Erasing Without Remembering: Safeguarding Knowledge Forgetting in Large Language Models Huazheng Wang et.al. 2502.19982 link
2025-02-27 The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs Tanja Baeumel et.al. 2502.19981 null
2025-02-27 Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios Chao Wang et.al. 2502.19973 null
2025-02-27 Deterministic or probabilistic? The psychology of LLMs as random number generators Javier Coronado-Blázquez et.al. 2502.19965 null
2025-02-27 SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model Xinghao Wang et.al. 2502.19960 link
2025-02-27 Collaborative Stance Detection via Small-Large Language Model Consistency Verification Yu Yan et.al. 2502.19954 link
2025-02-27 GeoEdit: Geometric Knowledge Editing for Large Language Models Yujie Feng et.al. 2502.19953 null
2025-02-27 Algebraic Machine Learning: Learning as computing an algebraic decomposition of a task Fernando Martin-Maroto et.al. 2502.19944 link
2025-02-27 Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation Xiang Geng et.al. 2502.19941 null
2025-02-27 Playing Pokémon Red via Deep Reinforcement Learning Marco Pleines et.al. 2502.19920 link
2025-02-27 Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models Yuan Sui et.al. 2502.19918 null
2025-02-27 Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents Zhenyu Liu et.al. 2502.19917 link
2025-02-27 LLM-driven Effective Knowledge Tracing by Integrating Dual-channel Difficulty Jiahui Cen et.al. 2502.19915 null
2025-02-27 SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous Networks Nikolay Blagoev et.al. 2502.19913 link
2025-02-27 Order Doesn’t Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation Qianxi He et.al. 2502.19907 null
2025-02-27 Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy Zaijing Li et.al. 2502.19902 null
2025-02-27 GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors An Li et.al. 2502.19896 null
2025-02-27 Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models Sibo Yi et.al. 2502.19883 null
2025-02-27 Towards Multimodal Large-Language Models for Parent-Child Interaction: A Focus on Joint Attention Weiyan Shi et.al. 2502.19877 null
2025-02-27 MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge Yuntao Du et.al. 2502.19870 link
2025-02-27 MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue Yujia Chen et.al. 2502.19860 null
2025-02-27 ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments Hojae Han et.al. 2502.19852 null
2025-02-27 One-for-More: Continual Diffusion Model for Anomaly Detection Xiaofan Li et.al. 2502.19848 link
2025-02-27 ProAPO: Progressively Automatic Prompt Optimization for Visual Classification Xiangyan Qu et.al. 2502.19844 link
2025-02-27 Shared Stochastic Gaussian Process Latent Variable Models: A Multi-modal Generative Model for Quasar Spectra Vidhi Lalchand et.al. 2502.19824 link
2025-02-27 Foot-In-The-Door: A Multi-turn Jailbreak for LLMs Zixuan Weng et.al. 2502.19820 link
2025-02-27 Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts Shulai Zhang et.al. 2502.19811 link
2025-02-27 Implicit Search via Discrete Diffusion: A Study on Chess Jiacheng Ye et.al. 2502.19805 link
2025-02-27 Developmental Support Approach to AI’s Autonomous Growth: Toward the Realization of a Mutually Beneficial Stage Through Experiential Learning Taichiro Endo et.al. 2502.19798 null
2025-02-27 ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model Chuanliu Fan et.al. 2502.19794 null
2025-02-27 Mixtera: A Data Plane for Foundation Model Training Maximilian Böther et.al. 2502.19790 link
2025-02-27 Advancements in Natural Language Processing for Automatic Text Summarization Nevidu Jayatilleke et.al. 2502.19773 null
2025-02-27 Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models Heeseung Kim et.al. 2502.19759 null
2025-02-27 PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation Nathan Roll et.al. 2502.19756 null
2025-02-27 Beneath the Surface: How Large Language Models Reflect Hidden Bias Jinhao Pan et.al. 2502.19749 link
2025-02-27 HaLoRA: Hardware-aware Low-Rank Adaptation for Large Language Models Based on Hybrid Compute-in-Memory Architecture Taiqiang Wu et.al. 2502.19747 null
2025-02-27 R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Minggui He et.al. 2502.19735 null
2025-02-27 Preference Learning Unlocks LLMs’ Psycho-Counseling Skills Mian Zhang et.al. 2502.19731 null
2025-02-27 Do Expressions Change Decisions? Exploring the Impact of AI’s Explanation Tone on Decision-Making Ayano Okoso et.al. 2502.19730 null
2025-02-27 Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training Toan Tran et.al. 2502.19726 null
2025-02-27 Few-Shot Multilingual Open-Domain QA from 5 Examples Fan Jiang et.al. 2502.19722 link
2025-02-27 Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs Hannah Cyberey et.al. 2502.19721 link
2025-02-27 Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data Augmentation Manveer Singh Tamber et.al. 2502.19712 null
2025-02-27 AoECR: AI-ization of Elderly Care Robot Linkun Zhou et.al. 2502.19706 null
2025-02-27 You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving Guangfeng Jiang et.al. 2502.19698 null
2025-02-27 M-LLM Based Video Frame Selection for Efficient Video Understanding Kai Hu et.al. 2502.19680 null
2025-02-27 Old Experience Helps: Leveraging Survey Methodology to Improve AI Text Annotation Reliability in Social Sciences Linzhuo li et.al. 2502.19679 null
2025-02-27 Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack Chenhe Gu et.al. 2502.19672 null
2025-02-27 SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning Mingsheng Cai et.al. 2502.19668 null
2025-02-27 Taxonomy, Opportunities, and Challenges of Representation Engineering for Large Language Models Jan Wehner et.al. 2502.19649 null
2025-02-27 cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning Micha Livne et.al. 2502.19642 null
2025-02-26 Agentic Mixture-of-Workflows for Multi-Modal Chemical Search Tiffany J. Callahan et.al. 2502.19629 null
2025-02-26 Treatment Non-Adherence Bias in Clinical Machine Learning: A Real-World Study on Hypertension Medication Zhongyuan Liang et.al. 2502.19625 null
2025-02-26 Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing Akshat Gupta et.al. 2502.19416 null
2025-02-26 Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs Dayu Yang et.al. 2502.19411 link
2025-02-26 Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices Xinru Wang et.al. 2502.19410 null
2025-02-26 ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models Danae Sánchez Villegas et.al. 2502.19409 null
2025-02-26 Learning Code-Edit Embedding to Model Student Debugging Behavior Hasnain Heickal et.al. 2502.19407 null
2025-02-26 General Reasoning Requires Learning to Reason from the Get-go Seungwook Han et.al. 2502.19402 null
2025-02-26 TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Max Ku et.al. 2502.19400 null
2025-02-26 Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis Minjoo Lim et.al. 2502.19390 null
2025-02-26 LiDAR Registration with Visual Foundation Models Niclas Vödisch et.al. 2502.19374 null
2025-02-26 Deep Learning For Time Series Analysis With Application On Human Motion Ali Ismail-Fawaz et.al. 2502.19364 null
2025-02-26 DataMan: Data Manager for Pre-training Large Language Models Ru Peng et.al. 2502.19363 null
2025-02-26 Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? Yancheng He et.al. 2502.19361 link
2025-02-26 Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets Tohida Rehman et.al. 2502.19339 null
2025-02-26 Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Hao Peng et.al. 2502.19328 link
2025-02-26 Shh, don’t say that! Domain Certification in LLMs Cornelius Emde et.al. 2502.19320 null
2025-02-26 Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond Qizhou Wang et.al. 2502.19301 null
2025-02-26 Agent-centric Information Access Evangelos Kanoulas et.al. 2502.19298 null
2025-02-26 Complex LLM Planning via Automated Heuristics Discovery Hongyi Ling et.al. 2502.19295 null
2025-02-26 Efficient Federated Search for Retrieval-Augmented Generation Rachid Guerraoui et.al. 2502.19280 null
2025-02-26 ArtInsight: Enabling AI-Powered Artwork Engagement for Mixed Visual-Ability Families Arnavi Chheda-Kothary et.al. 2502.19263 null
2025-02-26 AI-Powered Bayesian Inference Veronika Ročková et.al. 2502.19231 null
2025-02-26 Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time Jiazheng Li et.al. 2502.19230 null
2025-02-26 A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images Nikita Shvetsov et.al. 2502.19217 null
2025-02-26 A Hybrid Transformer Architecture with a Quantized Self-Attention Mechanism Applied to Molecular Generation Anthony M. Smaldone et.al. 2502.19214 link
2025-02-26 Negation-Induced Forgetting in LLMs Francesca Capuano et.al. 2502.19211 null
2025-02-26 Bi’an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation Zhouyu Jiang et.al. 2502.19209 null
2025-02-26 Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms Jinyu Cai et.al. 2502.19193 null
2025-02-26 BIG-Bench Extra Hard Mehran Kazemi et.al. 2502.19187 link
2025-02-26 INFO-SEDD: Continuous Time Markov Chains as Scalable Information Metrics Estimators Alberto Foresti et.al. 2502.19183 null
2025-02-26 UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering Langming Liu et.al. 2502.19178 null
2025-02-26 MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis Daniel Rose et.al. 2502.19175 null
2025-02-26 A Model-Centric Review of Deep Learning for Protein Design Gregory W. Kyro et.al. 2502.19173 null
2025-02-26 CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation Kaiwen Yan et.al. 2502.19166 link
2025-02-26 TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency Henry Peng Zou et.al. 2502.19163 null
2025-02-26 Detecting Linguistic Indicators for Stereotype Assessment with Large Language Models Rebekka Görge et.al. 2502.19160 null
2025-02-26 A Sliding Layer Merging Method for Efficient Depth-Wise Pruning in LLMs Xuan Ding et.al. 2502.19159 null
2025-02-26 When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning Yijiang River Dong et.al. 2502.19158 null
2025-02-26 Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval Jiarong Wu et.al. 2502.19149 null
2025-02-26 Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs Zhaowei Zhang et.al. 2502.19148 null
2025-02-26 Identification Under the Semantic Effective Secrecy Constraint Abdalla Ibrahim et.al. 2502.19142 null
2025-02-26 A Temporal Planning Framework for Multi-Agent Systems via LLM-Aided Knowledge Base Management Enrico Saccon et.al. 2502.19135 null
2025-02-26 Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement Siyuan Zhang et.al. 2502.19127 null
2025-02-26 A Survey on Foundation-Model-Based Industrial Defect Detection Tianle Yang et.al. 2502.19106 null
2025-02-26 Evaluating Gender Bias in German Machine Translation Michelle Kappl et.al. 2502.19104 link
2025-02-26 LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm Siwei Wu et.al. 2502.19103 null
2025-02-26 Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation Humza Sami et.al. 2502.19091 link
2025-02-26 EndoMamba: An Efficient Foundation Model for Endoscopic Videos Qingyao Tian et.al. 2502.19090 null
2025-02-26 Sparse Brains are Also Adaptive Brains: Cognitive-Load-Aware Dynamic Activation for LLMs Yiheng Yang et.al. 2502.19078 null
2025-02-26 IndicEval-XL: Bridging Linguistic Diversity in Code Generation Across Indic Languages Ujjwal Singh et.al. 2502.19067 link
2025-02-26 Can Large Language Models Outperform Non-Experts in Poetry Evaluation? A Comparative Study Using the Consensual Assessment Technique Piotr Sawicki et.al. 2502.19064 null
2025-02-26 MathClean: A Benchmark for Synthetic Mathematical Data Cleaning Hao Liang et.al. 2502.19058 null
2025-02-26 Beyond Surface-Level Patterns: An Essence-Driven Defense Framework Against Jailbreak Attacks in LLMs Shiyu Xiang et.al. 2502.19041 null
2025-02-26 FungalZSL: Zero-Shot Fungal Classification with Image Captioning Using a Synthetic Data Approach Anju Rani et.al. 2502.19038 null
2025-02-26 InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model Fengbin Guan et.al. 2502.19026 null
2025-02-26 Binary Neural Networks for Large Language Model: A Survey Liangdong Liu et.al. 2502.19008 null
2025-02-26 The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training Jinbo Wang et.al. 2502.19002 null
2025-02-26 MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering Teng Lin et.al. 2502.18993 null
2025-02-26 OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models Hui Feng et.al. 2502.18992 null
2025-02-26 GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation Jie He et.al. 2502.18990 null
2025-02-26 PEToolLLM: Towards Personalized Tool Learning in Large Language Models Qiancheng Xu et.al. 2502.18980 null
2025-02-26 Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning Hongyi Cal et.al. 2502.18978 null
2025-02-26 (Mis)Fitting: A Survey of Scaling Laws Margaret Li et.al. 2502.18969 link
2025-02-26 Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles Kuang Wang et.al. 2502.18968 link
2025-02-26 OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment Jiaxin Deng et.al. 2502.18965 null
2025-02-26 DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model Lei Zhao et.al. 2502.18952 null
2025-02-26 Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models Yu He et.al. 2502.18943 null
2025-02-26 JailBench: A Comprehensive Chinese Security Assessment Benchmark for Large Language Models Shuyi Liu et.al. 2502.18935 null
2025-02-26 Talking like Piping and Instrumentation Diagrams (P&IDs) Achmad Anggawirya Alimin et.al. 2502.18928 null
2025-02-26 ClassInvGen: Class Invariant Synthesis using Large Language Models Chuyue Sun et.al. 2502.18917 null
2025-02-26 END: Early Noise Dropping for Efficient and Effective Context Denoising Hongye Jin et.al. 2502.18915 null
2025-02-26 CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning Ping Zhang et.al. 2502.18910 null
2025-02-26 An Empirical Study on Commit Message Generation using LLMs via In-Context Learning Yifan Wu et.al. 2502.18904 link
2025-02-26 From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens Tong Wu et.al. 2502.18890 link
2025-02-26 Letters from Future Self: Augmenting the Letter-Exchange Exercise with LLM-based Agents to Enhance Young Adults’ Career Exploration Hayeon Jeon et.al. 2502.18881 null
2025-02-26 Learning to Generate Structured Output with Schema Reinforcement Learning Yaxi Lu et.al. 2502.18878 null
2025-02-26 Learning to Align Multi-Faceted Evaluation: A Unified and Robust Framework Kaishuai Xu et.al. 2502.18874 null
2025-02-26 Multi-LLM Collaborative Search for Complex Problem Solving Sen Yang et.al. 2502.18873 null
2025-02-26 A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops Shi Fu et.al. 2502.18865 null
2025-02-26 Sherlock: Towards Multi-scene Video Abnormal Event Extraction and Localization via a Global-local Spatial-sensitive LLM Junxiao Ma et.al. 2502.18863 null
2025-02-26 A Causal Lens for Evaluating Faithfulness Metrics Kerem Zaman et.al. 2502.18848 null
2025-02-26 Sliding Window Attention Training for Efficient Large Language Models Zichuan Fu et.al. 2502.18845 null
2025-02-26 Evidence-Driven Marker Extraction for Social Media Suicide Risk Detection Carter Adams et.al. 2502.18823 null
2025-02-26 Data-Efficient Multi-Agent Spatial Planning with LLMs Huangyuan Su et.al. 2502.18822 null
2025-02-26 CAMEx: Curvature-aware Merging of Experts Dung V. Nguyen et.al. 2502.18821 link
2025-02-26 Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models Shuliang Liu et.al. 2502.18817 null
2025-02-26 Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal Weipeng Jiang et.al. 2502.18810 null
2025-02-26 Optimal Stochastic Trace Estimation in Generative Modeling Xinyang Liu et.al. 2502.18808 null
2025-02-26 SolEval: Benchmarking Large Language Models for Repository-level Solidity Code Generation Zhiyuan Peng et.al. 2502.18793 null
2025-02-26 Active Few-Shot Learning for Text Classification Saeed Ahmadnia et.al. 2502.18782 null
2025-02-26 Towards Optimal Multi-draft Speculative Decoding Zhengmian Hu et.al. 2502.18779 null
2025-02-26 M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance Qingpei Guo et.al. 2502.18778 null
2025-02-26 Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance Xueqing Peng et.al. 2502.18772 null
2025-02-26 Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation Yuxiang Wang et.al. 2502.18771 link
2025-02-26 Reward Shaping to Mitigate Reward Hacking in RLHF Jiayi Fu et.al. 2502.18770 link
2025-02-26 CommGPT: A Graph and Retrieval-Augmented Multimodal Communication Foundation Model Feibo Jiang et.al. 2502.18763 null
2025-02-26 Training Large Recommendation Models via Graph-Language Token Alignment Mingdai Yang et.al. 2502.18757 null
2025-02-26 M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type Weiming Hu et.al. 2502.18755 null
2025-02-26 AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms Yuwei Yan et.al. 2502.18754 link
2025-02-26 Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking Shaheer Mohamed et.al. 2502.18748 null
2025-02-26 Automatic Prompt Optimization via Heuristic Search: A Survey Wendi Cui et.al. 2502.18746 null
2025-02-25 DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Xueguang Ma et.al. 2502.18460 link
2025-02-25 LLM-Based Design Pattern Detection Christian Schindler et.al. 2502.18458 null
2025-02-25 FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response Mollie Shichman et.al. 2502.18452 null
2025-02-25 SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Yuxiang Wei et.al. 2502.18449 null
2025-02-25 MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning Chanwoo Park et.al. 2502.18439 null
2025-02-25 TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning Frederikus Hudi et.al. 2502.18431 link
2025-02-25 OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference Xiangyu Zhao et.al. 2502.18411 link
2025-02-25 Enhancing DNA Foundation Models to Address Masking Inefficiencies Monireh Safari et.al. 2502.18405 null
2025-02-25 Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods Nicola Cecere et.al. 2502.18389 null
2025-02-25 How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities Minhua Lin et.al. 2502.18387 null
2025-02-25 MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning Sepehr Asgarian et.al. 2502.18371 null
2025-02-25 Sparse Bayesian Generative Modeling for Joint Parameter and Channel Estimation Benedikt Böck et.al. 2502.18369 null
2025-02-25 ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation Yifan Pu et.al. 2502.18364 null
2025-02-25 Responsible AI Agents Deven R. Desai et.al. 2502.18359 null
2025-02-25 Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation Jessica He et.al. 2502.18357 null
2025-02-25 BRIDO: Bringing Democratic Order to Abstractive Summarization Junhyun Lee et.al. 2502.18342 null
2025-02-25 Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology Romy Beauté et.al. 2502.18318 null
2025-02-25 GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music Xinran Liu et.al. 2502.18309 null
2025-02-25 RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction Jianhao Yan et.al. 2502.18308 null
2025-02-25 LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation Pengzhi Li et.al. 2502.18302 null
2025-02-25 Bayesian Computation in Deep Learning Wenlong Chen et.al. 2502.18300 null
2025-02-25 DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis Zeju Li et.al. 2502.18297 null
2025-02-25 AMPO: Active Multi-Preference Optimization Taneesh Gupta et.al. 2502.18293 null
2025-02-25 Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases Shanshan Xu et.al. 2502.18282 null
2025-02-25 Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support Guoxin Wang et.al. 2502.18274 link
2025-02-25 Imperfect Knowledge Management (IKM) in GEFRED (GENeralized model for Fuzzy RElational Databases) Leoncio Jimenez et.al. 2502.18255 null
2025-02-25 Iterative Counterfactual Data Augmentation Mitchell Plyler et.al. 2502.18249 link
2025-02-25 Unveiling and Causalizing CoT: A Causal Pespective Jiarun Fu et.al. 2502.18239 null
2025-02-25 Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints Mihaela Cătălina Stoian et.al. 2502.18237 link
2025-02-25 Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent Xiaofeng Wang et.al. 2502.18228 null
2025-02-25 From ChatGPT to DeepSeek: Can LLMs Simulate Humanity? Qian Wang et.al. 2502.18210 null
2025-02-25 LAG: LLM agents for Leaderboard Auto Generation on Demanding Jian Wu et.al. 2502.18209 null
2025-02-25 Grandes modelos de lenguaje: de la predicción de palabras a la comprensión? Carlos Gómez-Rodríguez et.al. 2502.18205 null
2025-02-25 Intersubjective Model of AI-mediated Communication: Augmenting Human-Human Text Chat through LLM-based Adaptive Agent Pair Shutaro Aoyama et.al. 2502.18201 null
2025-02-25 Task-Agnostic Semantic Communication with Multimodal Foundation Models Jiangjing Hu et.al. 2502.18200 null
2025-02-25 Agnostic calculation of atomic free energies with the descriptor density of states Thomas D Swinburne et.al. 2502.18191 link
2025-02-25 ChatMotion: A Multimodal Multi-Agent for Human Motion Analysis Li Lei et.al. 2502.18180 null
2025-02-25 Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs Gaye Colakoglu et.al. 2502.18179 link
2025-02-25 CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification Mingkun Zhang et.al. 2502.18176 link
2025-02-25 SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models Zhang Yuxuan et.al. 2502.18168 null
2025-02-25 Can LLMs Explain Themselves Counterfactually? Zahra Dehghanighobadi et.al. 2502.18156 null
2025-02-25 Carbon and Silicon, Coexist or Compete? A Survey on Human-AI Interactions in Agent-based Modeling and Simulation Ziyue Lin et.al. 2502.18145 null
2025-02-25 LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers Zhuocheng Zhang et.al. 2502.18139 link
2025-02-25 Large Language Model Driven Agents for Simulating Echo Chamber Formation Chenhao Gu et.al. 2502.18138 null
2025-02-25 Inverse Materials Design by Large Language Model-Assisted Generative Framework Yun Hao et.al. 2502.18127 link
2025-02-25 HyperG: Hypergraph-Enhanced LLMs for Structured Knowledge Sirui Huang et.al. 2502.18125 null
2025-02-25 Bayesian Optimization for Controlled Image Editing via LLMs Chengkun Cai et.al. 2502.18116 null
2025-02-25 PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching Han Nie et.al. 2502.18104 link
2025-02-25 Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models Cao Yuxuan et.al. 2502.18101 link
2025-02-25 Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning Wenkai Yang et.al. 2502.18080 null
2025-02-25 Examining the Threat Landscape: Foundation Models and Model Stealing Ankita Raj et.al. 2502.18077 null
2025-02-25 MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration Yishuai Cai et.al. 2502.18072 link
2025-02-25 Golden Ratio Mixing of Real and Synthetic Data for Stabilizing Generative Model Training Hengzhi He et.al. 2502.18049 null
2025-02-25 AutoCas: Autoregressive Cascade Predictor in Social Networks via Large Language Models Yuhao Zheng et.al. 2502.18040 null
2025-02-25 Harnessing Multiple Large Language Models: A Survey on LLM Ensemble Zhijun Chen et.al. 2502.18036 link
2025-02-25 Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference Zhuo Chen et.al. 2502.18023 null
2025-02-25 AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages Joshua Sakthivel Raju et.al. 2502.18020 null
2025-02-25 NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms Yashan Wang et.al. 2502.18008 null
2025-02-25 Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning Xinghao Chen et.al. 2502.18001 link
2025-02-25 Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation Guang Lin et.al. 2502.17972 null
2025-02-25 LLM Knows Geometry Better than Algebra: Numerical Understanding of LLM-Based Agents in A Trading Arena Tianmi Ma et.al. 2502.17967 link
2025-02-25 Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments Patomporn Payoungkhamdee et.al. 2502.17956 null
2025-02-25 DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning Pusheng Xu et.al. 2502.17947 null
2025-02-25 Assessing Large Language Models in Agentic Multilingual National Bias Qianying Liu et.al. 2502.17945 null
2025-02-25 CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation Haitao Li et.al. 2502.17943 link
2025-02-25 Advantage-Guided Distillation for Preference Alignment in Small Language Models Shiping Gao et.al. 2502.17927 link
2025-02-25 LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction Suozhi Huang et.al. 2502.17925 null
2025-02-25 FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models Hongzhan Lin et.al. 2502.17924 link
2025-02-25 Towards Sustainable Web Agents: A Plea for Transparency and Dedicated Metrics for Energy Consumption Lars Krupp et.al. 2502.17903 null
2025-02-25 Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs Che Liu et.al. 2502.17900 null
2025-02-25 Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation Tong Li et.al. 2502.17899 null
2025-02-25 FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real Weiheng Liu et.al. 2502.17894 null
2025-02-25 RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts Mingyan Wu et.al. 2502.17888 link
2025-02-25 Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers Hannah Calzi Kleidermacher et.al. 2502.17882 null
2025-02-25 EEGM2: An Efficient Mamba-2-Based Self-Supervised Framework for Long-Sequence EEG Modeling Jiazhen Hong et.al. 2502.17873 link
2025-02-25 ASurvey: Spatiotemporal Consistency in Video Generation Zhiyu Yin et.al. 2502.17863 null
2025-02-25 HRR: Hierarchical Retrospection Refinement for Generated Image Detection Peipei Yuan et.al. 2502.17862 null
2025-02-25 LR ${}^{2}$ Bench: Evaluating Long-chain Reflective Reasoning Capabilities of Large Language Models via Constraint Satisfaction Problems Jianghao Chen et.al. 2502.17848 null
2025-02-25 Quantifying interdisciplinary synergy in higher STEM education Gahyoun Gim et.al. 2502.17841 null
2025-02-25 A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation Beibei Xiong et.al. 2502.17840 null
2025-02-25 TagGAN: A Generative Model for Data Tagging Muhammad Nawaz et.al. 2502.17836 null
2025-02-25 MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks Hyeonjeong Ha et.al. 2502.17832 link
2025-02-25 A General Framework to Enhance Fine-tuning-based LLM Unlearning Jie Ren et.al. 2502.17823 link
2025-02-25 An Overview of Large Language Models for Statisticians Wenlong Ji et.al. 2502.17814 null
2025-02-25 Can Multimodal LLMs Perform Time Series Anomaly Detection? Xiongxiao Xu et.al. 2502.17812 link
2025-02-25 URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models Ruiqi Yan et.al. 2502.17810 null
2025-02-25 DocPuzzle: A Process-Aware Benchmark for Evaluating Realistic Long-Context Reasoning Capabilities Tianyi Zhuang et.al. 2502.17807 null
2025-02-25 Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training Yihang Yao et.al. 2502.17800 null
2025-02-25 AIR: Complex Instruction Generation via Automatic Iterative Refinement Wei Liu et.al. 2502.17787 link
2025-02-25 Exploring the Potential of Large Language Models for Estimating the Reading Comprehension Question Difficulty Yoshee Jain et.al. 2502.17785 null
2025-02-25 Tip of the Tongue Query Elicitation for Simulated Evaluation Yifan He et.al. 2502.17776 link
2025-02-25 FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks Tanawan Premsri et.al. 2502.17775 link
2025-02-25 Uncertainty Quantification for LLM-Based Survey Simulations Chengpiao Huang et.al. 2502.17773 null
2025-02-25 DeepSeek vs. ChatGPT: A Comparative Study for Scientific Computing and Scientific Machine Learning Tasks Qile Jiang et.al. 2502.17764 null
2025-02-25 Design and implementation of a distributed security threat detection system integrating federated learning and multimodal LLM Yuqing Wang et.al. 2502.17763 null
2025-02-25 Detection of LLM-Paraphrased Code and Identification of the Responsible LLM Using Coding Style Features Shinwoo Park et.al. 2502.17749 null
2025-02-24 LLM Inference Acceleration via Efficient Operation Fusion Mahsa Salmani et.al. 2502.17728 null
2025-02-24 Can Score-Based Generative Modeling Effectively Handle Medical Image Classification? Sushmita Sarker et.al. 2502.17727 link
2025-02-24 Spontaneous Giving and Calculated Greed in Language Models Yuxuan Li et.al. 2502.17720 null
2025-02-24 Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures Akhila Yerukola et.al. 2502.17710 link
2025-02-24 Fractal Generative Models Tianhong Li et.al. 2502.17437 link
2025-02-24 Introducing Visual Perception Token into Multimodal Large Language Model Runpeng Yu et.al. 2502.17425 link
2025-02-24 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Jiarui Zhang et.al. 2502.17422 link
2025-02-24 LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification Penghui Yang et.al. 2502.17421 link
2025-02-24 The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence Tom Wollschläger et.al. 2502.17420 null
2025-02-24 From System 1 to System 2: A Survey of Reasoning Large Language Models Zhong-Zhi Li et.al. 2502.17419 link
2025-02-24 Reasoning with Latent Thoughts: On the Power of Looped Transformers Nikunj Saunshi et.al. 2502.17416 null
2025-02-24 COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs Liming Liu et.al. 2502.17410 link
2025-02-24 Large Language Models are Powerful EHR Encoders Stefan Hegselmann et.al. 2502.17403 link
2025-02-24 What is a Good Question? Utility Estimation with LLM-based Simulations Dong-Ho Lee et.al. 2502.17383 null
2025-02-24 KV-Edit: Training-Free Image Editing for Precise Background Preservation Tianrui Zhu et.al. 2502.17363 link
2025-02-24 A Closer Look at TabPFN v2: Strength, Limitation, and Extension Han-Jia Ye et.al. 2502.17361 null
2025-02-24 RELICT: A Replica Detection Framework for Medical Image Generation Orhun Utku Aydin et.al. 2502.17360 link
2025-02-24 On Relation-Specific Neurons in Large Language Models Yihong Liu et.al. 2502.17355 link
2025-02-24 How Scientists Use Large Language Models to Program Gabrielle O’Brien et.al. 2502.17348 null
2025-02-24 Time series forecasting based on optimized LLM for fault prediction in distribution power grid insulators João Pedro Matos-Carvalho et.al. 2502.17341 null
2025-02-24 HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization Zhenghao Liu et.al. 2502.17315 link
2025-02-24 Delta Decompression for MoE-based LLMs Compression Hao Gu et.al. 2502.17298 link
2025-02-24 Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts Zhenghao Liu et.al. 2502.17297 link
2025-02-24 Integrating protein sequence embeddings with structure via graph-based deep learning for the prediction of single-residue properties Kevin Michalewicz et.al. 2502.17294 link
2025-02-24 Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing Yi-Kai Zhang et.al. 2502.17282 link
2025-02-24 MonoTODia: Translating Monologue Requests to Task-Oriented Dialogues Sebastian Steindl et.al. 2502.17268 null
2025-02-24 Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective Chengyin Xu et.al. 2502.17262 null
2025-02-24 Detecting Benchmark Contamination Through Watermarking Tom Sander et.al. 2502.17259 null
2025-02-24 REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective Simon Geisler et.al. 2502.17254 null
2025-02-24 Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search Boyan Li et.al. 2502.17248 null
2025-02-24 Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction Tianpeng Li et.al. 2502.17239 link
2025-02-24 Making LLMs Reason? The Intermediate Language Problem in Neurosymbolic Approaches Alexander Beiser et.al. 2502.17216 null
2025-02-24 CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought Boxuan Zhang et.al. 2502.17214 link
2025-02-24 Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following Jie Zeng et.al. 2502.17204 link
2025-02-24 IGDA: Interactive Graph Discovery through Large Language Model Agents Alex Havrilla et.al. 2502.17189 null
2025-02-24 Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks Andrei Chernov et.al. 2502.17187 null
2025-02-24 Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric Yuming Yang et.al. 2502.17184 link
2025-02-24 Unsupervised Accelerated MRI Reconstruction via Ground-Truth-Free Flow Matching Xinzhe Luo et.al. 2502.17174 null
2025-02-24 Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch Xueru Wen et.al. 2502.17173 null
2025-02-24 Logic Haystacks: Probing LLMs Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding) Damien Sileo et.al. 2502.17169 null
2025-02-24 JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning Huanghai Liu et.al. 2502.17166 link
2025-02-24 MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation María Andrea Cruz Blandón et.al. 2502.17163 null
2025-02-24 Real-time Monitoring of Economic Shocks using Company Websites Michael Koenig et.al. 2502.17161 null
2025-02-24 A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis Yuli Wu et.al. 2502.17160 null
2025-02-24 Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation Fanhu Zeng et.al. 2502.17159 null
2025-02-24 CodeSwift: Accelerating LLM Inference for Efficient Code Generation Qianhui Zhao et.al. 2502.17139 null
2025-02-24 Evaluating the Effectiveness of Large Language Models in Automated News Article Summarization Lionel Richy Panlap Houamegni et.al. 2502.17136 null
2025-02-24 Applications of Large Models in Medicine YunHe Su et.al. 2502.17132 null
2025-02-24 Thus Spake Long-Context Large Language Model Xiaoran Liu et.al. 2502.17129 null
2025-02-24 Adversarial Training for Defense Against Label Poisoning Attacks Melis Ilayda Bal et.al. 2502.17121 link
2025-02-24 Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions Zhong Li et.al. 2502.17119 link
2025-02-24 SFLD: Reducing the content bias for AI-generated Image Detection Seoyeon Gye et.al. 2502.17105 null
2025-02-24 Generative Models in Decision Making: A Survey Yinchuan Li et.al. 2502.17100 null
2025-02-24 Improved Diffusion-based Generative Model with Better Adversarial Robustness Zekun Wang et.al. 2502.17099 link
2025-02-24 Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies Julieth Katherine Riveros et.al. 2502.17087 null
2025-02-24 Automatically Evaluating the Paper Reviewing Capability of Large Language Models Hyungyu Shin et.al. 2502.17086 null
2025-02-24 Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence Bolin Chen et.al. 2502.17085 null
2025-02-24 Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability Ashhadul Islam et.al. 2502.17071 null
2025-02-24 LLM-QE: Improving Query Expansion by Aligning Large Language Models with Ranking Preferences Sijia Yao et.al. 2502.17057 link
2025-02-24 PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance Haoran Li et.al. 2502.17041 link
2025-02-24 Evolution 6.0: Evolving Robotic Capabilities Through Generative Design Muhammad Haris Khan et.al. 2502.17034 null
2025-02-24 Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology Longchao Da et.al. 2502.17026 null
2025-02-24 Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization Zixuan Gong et.al. 2502.17024 null
2025-02-24 Quantifying Logical Consistency in Transformers via Query-Key Alignment Eduard Tulchinskii et.al. 2502.17017 null
2025-02-24 Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation Jaskaran Singh Walia et.al. 2502.17011 null
2025-02-24 Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators Shixin Zhao et.al. 2502.17006 null
2025-02-24 An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT Shreya Singh et.al. 2502.17000 null
2025-02-24 Active Learning for Conditional Inverse Design with Crystal Generation and Foundation Atomic Models Zhuoyuan Li et.al. 2502.16984 null
2025-02-24 LongSafety: Evaluating Long-Context Safety of Large Language Models Yida Lu et.al. 2502.16971 link
2025-02-24 Autoregressive Image Generation Guided by Chains of Thought Miaomiao Cai et.al. 2502.16965 null
2025-02-24 Make LLM Inference Affordable to Everyone: Augmenting GPU Memory with NDP-DIMM Lian Liu et.al. 2502.16963 null
2025-02-24 UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings Layba Fiaz et.al. 2502.16961 null
2025-02-24 Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance Chenghua Huang et.al. 2502.16944 null
2025-02-24 Reasoning Does Not Necessarily Improve Role-Playing Ability Xiachong Feng et.al. 2502.16940 null
2025-02-24 BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference Zewen Jin et.al. 2502.16927 null
2025-02-24 FilterLLM: Text-To-Distribution LLM for Billion-Scale Cold-Start Recommendation Ruochen Liu et.al. 2502.16924 null
2025-02-24 A Systematic Survey of Automatic Prompt Optimization Techniques Kiran Ramnath et.al. 2502.16923 null
2025-02-24 Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties Zhenglin Wang et.al. 2502.16922 link
2025-02-24 SS-MPC: A Sequence-Structured Multi-Party Conversation System Yoonjin Jang et.al. 2502.16920 null
2025-02-24 Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model Kang Fu et.al. 2502.16915 null
2025-02-24 SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models Kevin Miller et.al. 2502.16911 null
2025-02-24 AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models Qin Zhu et.al. 2502.16906 link
2025-02-24 GuidedBench: Equipping Jailbreak Evaluation with Guidelines Ruixuan Huang et.al. 2502.16903 null
2025-02-24 Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinment Suchae Jeong et.al. 2502.16902 null
2025-02-24 Char-mander Use mBackdoor! A Study of Cross-lingual Backdoor Attacks in Multilingual LLMs Himanshu Beniwal et.al. 2502.16901 link
2025-02-24 Zero-shot Load Forecasting for Integrated Energy Systems: A Large Language Model-based Framework with Multi-task Learning Jiaheng Li et.al. 2502.16896 null
2025-02-24 Unlocking Scientific Concepts: How Effective Are LLM-Generated Analogies for Student Understanding and Classroom Practice? Zekai Shao et.al. 2502.16895 null
2025-02-24 Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Chenghao Fan et.al. 2502.16894 null
2025-02-24 Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data Yejian Zhang et.al. 2502.16892 null
2025-02-24 Unveiling Institution-Specific Bias in Pathology Foundation Models: Detriments, Causes, and Potential Solutions Weiping Lin et.al. 2502.16889 null
2025-02-24 DBudgetKV: Dynamic Budget in KV Cache Compression for Ensuring Optimal Performance Xuanfan Ni et.al. 2502.16886 null
2025-02-24 CORAL: Learning Consistent Representations across Multi-step Training with Lighter Speculative Drafter Yepeng Weng et.al. 2502.16880 null
2025-02-24 A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis Yuzhi Hao et.al. 2502.16879 null
2025-02-24 Graphy’our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data Longbin Lai et.al. 2502.16868 null
2025-02-24 Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment Kartik Nagpal et.al. 2502.16863 null
2025-02-24 LongAttn: Selecting Long-context Training Data via Token-level Attention Longyun Wu et.al. 2502.16860 link
2025-02-24 Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models Avinash Trivedi et.al. 2502.16857 null
2025-02-24 Improving LLM General Preference Alignment via Optimistic Online Mirror Descent Yuheng Zhang et.al. 2502.16852 null
2025-02-24 Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models Yaqi Sun et.al. 2502.16842 null
2025-02-24 Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives Dilermando Queiroz et.al. 2502.16841 null
2025-02-24 In-context learning of evolving data streams with tabular foundational models Afonso Lourenço et.al. 2502.16840 null
2025-02-24 “Actionable Help” in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts Rabindra Lamsal et.al. 2502.16839 null
2025-02-24 REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction Omar Sharif et.al. 2502.16838 null
2025-02-24 Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization Yao Xiao et.al. 2502.16825 null
2025-02-21 ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval Guanqi Zhan et.al. 2502.15682 null
2025-02-21 Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training Jaydeep Borkar et.al. 2502.15680 link
2025-02-21 FLEKE: Federated Locate-then-Edit Knowledge Editing Zongkai Zhao et.al. 2502.15677 link
2025-02-21 AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind Zhining Zhang et.al. 2502.15676 link
2025-02-21 VaViM and VaVAM: Autonomous Driving through Video Generative Modeling Florent Bartoccioni et.al. 2502.15672 link
2025-02-21 Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing Shoumik Saha et.al. 2502.15666 link
2025-02-21 Machine-generated text detection prevents language model collapse George Drayson et.al. 2502.15654 null
2025-02-21 Empowering LLMs with Logical Reasoning: A Comprehensive Survey Fengxiang Cheng et.al. 2502.15652 null
2025-02-21 Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models Anirudh Sundar et.al. 2502.15639 null
2025-02-21 Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification Vasilii Feofanov et.al. 2502.15637 link
2025-02-21 The Relationship Between Reasoning and Performance in Large Language Models – o3 (mini) Thinks Harder, Not Longer Marthe Ballon et.al. 2502.15631 link
2025-02-21 Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing Qi Le et.al. 2502.15618 link
2025-02-21 On the Robustness of Transformers against Context Hijacking for Linear Classification Tianle Li et.al. 2502.15609 null
2025-02-21 Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance Akos Nagy et.al. 2502.15604 null
2025-02-21 Do Multilingual LLMs Think In English? Lisa Schut et.al. 2502.15603 null
2025-02-21 WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents Xinhang Liu et.al. 2502.15601 null
2025-02-21 SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention Jiaqi Wu et.al. 2502.15594 null
2025-02-21 Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning Wenhao Zhu et.al. 2502.15592 link
2025-02-21 LightThinker: Thinking Step-by-Step Compression Jintian Zhang et.al. 2502.15589 null
2025-02-21 Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid Yunfeng Li et.al. 2502.15583 null
2025-02-21 Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning Mariia Radova et.al. 2502.15582 null
2025-02-21 Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders Xuansheng Wu et.al. 2502.15576 null
2025-02-21 DReSD: Dense Retrieval for Speculative Decoding Milan Gritta et.al. 2502.15572 link
2025-02-21 A Cautionary Tale About “Neutrally” Informative AI Tools Ahead of the 2025 Federal Elections in Germany Ina Dormuth et.al. 2502.15568 null
2025-02-21 PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning Pengcheng Huang et.al. 2502.15543 link
2025-02-21 Accurate and efficient machine learning interatomic potentials for finite temperature modeling of molecular crystals Flaviano Della Pia et.al. 2502.15530 null
2025-02-21 Scaling Sparse and Dense Retrieval in Decoder-Only LLMs Hansi Zeng et.al. 2502.15526 link
2025-02-21 Towards Swift Serverless LLM Cold Starts with ParaServe Chiheng Lou et.al. 2502.15524 null
2025-02-21 Activation Steering in Neural Theorem Provers Shashank Kirtania et.al. 2502.15507 null
2025-02-21 Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing Masaya Kobayashi et.al. 2502.15506 null
2025-02-21 Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models Ya Wang et.al. 2502.15499 link
2025-02-21 Programmers Aren’t Obsolete Yet: A Syllabus for Teaching CS Students to Responsibly Use Large Language Models for Code Generation Bruno Pereira Cipriano et.al. 2502.15493 null
2025-02-21 ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models Martina Miliani et.al. 2502.15487 null
2025-02-21 Enhancing RWKV-based Language Models for Long-Sequence Text Generation Xinghan Pan et.al. 2502.15485 link
2025-02-21 FaultGPT: Industrial Fault Diagnosis Question Answering System by Vision Language Models Jiao Chen et.al. 2502.15481 null
2025-02-21 PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System Yintao He et.al. 2502.15470 null
2025-02-21 Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation Wenxuan Wang et.al. 2502.15466 null
2025-02-21 Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs Gengyuan Zhang et.al. 2502.15457 null
2025-02-21 R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning Jinda Liu et.al. 2502.15455 link
2025-02-21 A fast convergence algorithm based on binary integer programming for expert load balancing in MoE LLMs Yuan Sun et.al. 2502.15451 null
2025-02-21 When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models Weilan Wang et.al. 2502.15443 null
2025-02-21 On the Effectiveness of Large Language Models in Writing Alloy Formulas Yang Hong et.al. 2502.15441 null
2025-02-21 Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning Raghav Singhal et.al. 2502.15436 link
2025-02-21 Single-pass Detection of Jailbreaking Input in Large Language Models Leyla Naz Candogan et.al. 2502.15435 null
2025-02-21 Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation Yue Zhou et.al. 2502.15434 link
2025-02-21 Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations Lihu Chen et.al. 2502.15429 link
2025-02-21 Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs Giulio Zizzo et.al. 2502.15427 link
2025-02-21 Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking Yi-Ling Chung et.al. 2502.15419 link
2025-02-21 MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models Suraj Racha et.al. 2502.15418 link
2025-02-21 HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings Rasmus Aavang et.al. 2502.15411 link
2025-02-21 Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning Xuetao Ma et.al. 2502.15401 null
2025-02-21 Beyond Tools: Understanding How Heavy Users Integrate LLMs into Everyday Tasks and Decision-Making Eunhye Kim et.al. 2502.15395 null
2025-02-21 Chitrarth: Bridging Vision and Language for a Billion People Shaharukh Khan et.al. 2502.15392 null
2025-02-21 MOVE: A Mixture-of-Vision-Encoders Approach for Domain-Focused Vision-Language Processing Matvey Skripkin et.al. 2502.15381 null
2025-02-21 Weakly Supervised Video Scene Graph Generation via Natural Language Supervision Kibum Kim et.al. 2502.15370 link
2025-02-21 Identifying Features that Shape Perceived Consciousness in Large Language Model-based AI: A Quantitative Study of Human Responses Kang Bongsu et.al. 2502.15365 null
2025-02-21 Evaluating Social Biases in LLM Reasoning Xuyang Wu et.al. 2502.15361 null
2025-02-21 ARS: Automatic Routing Solver with Large Language Models Kai Li et.al. 2502.15359 link
2025-02-21 AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms Feiyang Chen et.al. 2502.15349 link
2025-02-21 Constructing a Norm for Children’s Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models Yi Zhang et.al. 2502.15348 null
2025-02-21 Efficiently Solving Discounted MDPs with Predictions on Transition Matrices Lixing Lyu et.al. 2502.15345 null
2025-02-21 Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions Shoubin Chen et.al. 2502.15336 null
2025-02-21 Stepwise Informativeness Search for Improving LLM Reasoning Siyuan Wang et.al. 2502.15335 null
2025-02-21 Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment Pedram Zaree et.al. 2502.15334 null
2025-02-21 Detecting Future-related Contexts of Entity Mentions Puneet Prashar et.al. 2502.15332 null
2025-02-21 DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation Luzhou Ge et.al. 2502.15309 link
2025-02-21 SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention Hong Yankun et.al. 2502.15304 null
2025-02-21 Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference Yaohua Tang et.al. 2502.15294 null
2025-02-21 Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models Jianming Chang et.al. 2502.15292 null
2025-02-21 BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization Tonghan Wang et.al. 2502.15283 null
2025-02-21 A Training-free LLM-based Approach to General Chinese Character Error Correction Houquan Zhou et.al. 2502.15266 link
2025-02-21 Retrieval-Augmented Speech Recognition Approach for Domain Challenges Peng Shen et.al. 2502.15264 null
2025-02-21 LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design Renjie Wei et.al. 2502.15260 null
2025-02-21 An approach for API synthesis using large language models Hua Zhong et.al. 2502.15246 null
2025-02-21 Comparative Analysis of Large Language Models for Context-Aware Code Completion using SAFIM Framework Hang Zhang et.al. 2502.15243 null
2025-02-21 From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants Manisha Mukherjee et.al. 2502.15237 null
2025-02-21 A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation Shilong Hou et.al. 2502.15233 link
2025-02-21 User Experience with LLM-powered Conversational Recommendation Systems: A Case of Music Recommendation Sojeong Yun et.al. 2502.15229 null
2025-02-21 Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews Mengqiao Liu et.al. 2502.15226 null
2025-02-21 Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs Tingting Chen et.al. 2502.15224 null
2025-02-21 FormalSpecCpp: A Dataset of C++ Formal Specifications created using LLMs Madhurima Chakraborty et.al. 2502.15217 link
2025-02-21 The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning Sheila Schoepp et.al. 2502.15214 null
2025-02-21 Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing Zhilin Wang et.al. 2502.15208 null
2025-02-21 Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis Yifan Jiang et.al. 2502.15204 link
2025-02-21 TETRIS: Optimal Draft Token Selection for Batch Speculative Decoding Zhaoxuan Wu et.al. 2502.15197 null
2025-02-21 LEDD: Large Language Model-Empowered Data Discovery in Data Lakes Qi An et.al. 2502.15182 null
2025-02-21 Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders Weiqiao Shan et.al. 2502.15178 null
2025-02-21 Methods and Trends in Detecting Generated Images: A Comprehensive Review Arpan Mahara et.al. 2502.15176 null
2025-02-21 M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment Chuan Cui et.al. 2502.15167 null
2025-02-21 Extreme Speech Classification in the Era of LLMs: Exploring Open-Source and Proprietary Models Sarthak Mahajan et.al. 2502.15155 null
2025-02-21 Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems Tianjie Ju et.al. 2502.15153 link
2025-02-21 Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between Language Models and Human Error Patterns Naiming Liu et.al. 2502.15140 null
2025-02-21 Chain-of-Rank: Enhancing Large Language Models for Domain-Specific RAG in Edge Device Juntae Lee et.al. 2502.15134 null
2025-02-21 TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Xiuwei Chen et.al. 2502.15130 null
2025-02-20 LUME: LLM Unlearning with Multitask Evaluations Anil Ramakrishna et.al. 2502.15097 null
2025-02-20 Detecting Student Intent for Chat-Based Intelligent Tutoring Systems Ella Cutler et.al. 2502.15096 null
2025-02-20 Judging It, Washing It: Scoring and Greenwashing Corporate Climate Disclosures using Large Language Models Marianne Chuang et.al. 2502.15094 null
2025-02-20 Optimizing Singular Spectrum for Large Language Model Compression Dengjie Li et.al. 2502.15092 null
2025-02-20 Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans Masha Fedzechkina et.al. 2502.15090 null
2025-02-20 Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models Yeonjun In et.al. 2502.15086 link
2025-02-20 LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Shang Yang et.al. 2502.14866 link
2025-02-20 Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning Shuyue Stella Li et.al. 2502.14860 link
2025-02-20 FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling Weilin Zhao et.al. 2502.14856 null
2025-02-20 Prompt-to-Leaderboard Evan Frick et.al. 2502.14855 link
2025-02-20 GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks Jianwen Luo et.al. 2502.14848 link
2025-02-20 Red-Teaming LLM Multi-Agent Systems via Communication Attacks Pengfei He et.al. 2502.14847 null
2025-02-20 Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Yue Yang et.al. 2502.14846 null
2025-02-20 Revealing and Mitigating Over-Attention in Knowledge Editing Pinzheng Wang et.al. 2502.14838 link
2025-02-20 Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs Danni Liu et.al. 2502.14830 link
2025-02-20 A Survey of Model Architectures in Information Retrieval Zhichao Xu et.al. 2502.14822 null
2025-02-20 eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables Luis Antonio Gutiérrez Guanilo et.al. 2502.14820 null
2025-02-20 Dynamic Low-Rank Sparse Adaptation for Large Language Models Weizhong Huang et.al. 2502.14816 link
2025-02-20 FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis Fadillah Maani et.al. 2502.14807 link
2025-02-20 From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Bernal Jiménez Gutiérrez et.al. 2502.14802 link
2025-02-20 A Multi-Agent Perspective on Modern Information Retrieval Haya Nachimovsky et.al. 2502.14796 null
2025-02-20 Rapid Word Learning Through Meta In-Context Learning Wentao Wang et.al. 2502.14791 null
2025-02-20 DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models Hongji Yang et.al. 2502.14779 null
2025-02-20 SurveyX: Academic Survey Automation via Large Language Models Xun Liang et.al. 2502.14776 null
2025-02-20 Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective Weizhong Huang et.al. 2502.14770 null
2025-02-20 Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis Priyanka Kargupta et.al. 2502.14767 link
2025-02-20 EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations Haotian Zhai et.al. 2502.14760 link
2025-02-20 On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems Juraj Vladika et.al. 2502.14759 link
2025-02-20 TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators Jianling Li et.al. 2502.14752 link
2025-02-20 Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs Zongxia Li et.al. 2502.14748 null
2025-02-20 Multi-Agent Coordination across Diverse Applications: A Survey Lijun Sun et.al. 2502.14743 null
2025-02-20 SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines M-A-P Team et.al. 2502.14739 null
2025-02-20 EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration Minjie Hong et.al. 2502.14735 null
2025-02-20 WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models Yifu Chen et.al. 2502.14727 null
2025-02-20 I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search Zujie Liang et.al. 2502.14693 null
2025-02-20 Bridging the Gap: Transforming Natural Language Questions into SQL Queries via Abstract Query Pattern and Contextual Schema Markup Yonghui Kong et.al. 2502.14682 null
2025-02-20 How to Get Your LLM to Generate Challenging Problems for Evaluation Arkil Patel et.al. 2502.14678 link
2025-02-20 Data-Constrained Synthesis of Training Data for De-Identification Thomas Vakili et.al. 2502.14677 null
2025-02-20 Explanations of Deep Language Models Explain Language Representations in the Brain Maryam Rahimi et.al. 2502.14671 null
2025-02-20 AlphaMaze: Enhancing Large Language Models’ Spatial Intelligence via GRPO Alan Dao et.al. 2502.14669 link
2025-02-20 Beyond the Surface: Uncovering Implicit Locations with LLMs for Personalized Local News Gali Katz et.al. 2502.14660 null
2025-02-20 Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs Yuchen Wu et.al. 2502.14645 null
2025-02-20 LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning Yansheng Mao et.al. 2502.14644 null
2025-02-20 Length-Controlled Margin-Based Preference Optimization without Reference Model Gengxu Li et.al. 2502.14643 link
2025-02-20 ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation Angxiao Yue et.al. 2502.14637 link
2025-02-20 CER: Confidence Enhanced Reasoning in LLMs Ali Razghandi et.al. 2502.14634 link
2025-02-20 Augmenting Coaching with GenAI: Insights into Use, Effectiveness, and Future Potential Jennifer Haase et.al. 2502.14632 null
2025-02-20 Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery Minh-Quyet Ha et.al. 2502.14631 null
2025-02-20 PEARL: Towards Permutation-Resilient LLMs Liang Chen et.al. 2502.14628 link
2025-02-20 Reward Models Identify Consistency, Not Causality Yuhui Xu et.al. 2502.14619 null
2025-02-20 Serving Models, Fast and Slow:Optimizing Heterogeneous LLM Inferencing Workloads at Scale Shashwat Jaiswal et.al. 2502.14617 null
2025-02-20 FIND: Fine-grained Information Density Guided Adaptive Retrieval-Augmented Generation for Disease Diagnosis Mingyi Jia et.al. 2502.14614 null
2025-02-20 Behavioral Analysis of Information Salience in Large Language Models Jan Trienes et.al. 2502.14613 link
2025-02-20 “Don’t Forget the Teachers”: Towards an Educator-Centered Understanding of Harms from Large Language Models in Education Emma Harvey et.al. 2502.14592 null
2025-02-20 Vision Foundation Models in Medical Image Analysis: Advances and Challenges Pengchen Liang et.al. 2502.14584 null
2025-02-20 A Theory for Conditional Generative Modeling on Multiple Data Sources Rongzhen Wang et.al. 2502.14583 link
2025-02-20 ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification Hyunseok Lee et.al. 2502.14565 null
2025-02-20 Plan-over-Graph: Towards Parallelable LLM Agent Schedule Shiqi Zhang et.al. 2502.14563 link
2025-02-20 Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs Paris Koloveas et.al. 2502.14561 link
2025-02-20 Less is More: Improving LLM Alignment via Preference Data Selection Xun Deng et.al. 2502.14560 null
2025-02-20 Multiscale Byte Language Models – A Hierarchical Architecture for Causal Million-Length Sequence Modeling Eric Egli et.al. 2502.14553 link
2025-02-20 Position: Graph Learning Will Lose Relevance Due To Poor Benchmarks Maya Bechler-Speicher et.al. 2502.14546 null
2025-02-20 LLM-based User Profile Management for Recommender System Seunghwan Bang et.al. 2502.14541 null
2025-02-20 LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization Yupeng Chang et.al. 2502.14538 link
2025-02-20 CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models Zhenhong Zhou et.al. 2502.14529 link
2025-02-20 Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation Austin A. Barr et.al. 2502.14523 link
2025-02-20 Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases Rena Gao et.al. 2502.14507 link
2025-02-20 How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Sergey Pletenev et.al. 2502.14502 link
2025-02-20 MLGym: A New Framework and Benchmark for Advancing AI Research Agents Deepak Nathani et.al. 2502.14499 null
2025-02-20 StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following Jinnan Li et.al. 2502.14494 link
2025-02-20 How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation Zhuohang Long et.al. 2502.14486 null
2025-02-20 NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models Chenlu Guo et.al. 2502.14482 link
2025-02-20 Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression Haoyu Wang et.al. 2502.14477 null
2025-02-20 Argument-Based Comparative Question Answering Evaluation Benchmark Irina Nikishina et.al. 2502.14476 null
2025-02-20 Enhancing Smart Environments with Context-Aware Chatbots using Large Language Models Aurora Polo-Rodríguez et.al. 2502.14469 null
2025-02-20 Narrative-Driven Travel Planning: Geoculturally-Grounded Script Generation with Evolutionary Itinerary Optimization Ran Ding et.al. 2502.14456 link
2025-02-20 Optimal word order for non-causal text generation with Large Language Models: the Spanish case Andrea Busto-Castiñeira et.al. 2502.14451 null
2025-02-20 LLM4FaaS: No-Code Application Development using LLMs and FaaS Minghe Wang et.al. 2502.14450 null
2025-02-20 PredictaBoard: Benchmarking LLM Score Predictability Lorenzo Pacchiardi et.al. 2502.14445 link
2025-02-20 Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models Artem Vazhentsev et.al. 2502.14427 link
2025-02-20 A Survey on Data Contamination for Large Language Models Yuxing Cheng et.al. 2502.14425 link
2025-02-20 ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model Zhongyi Zhou et.al. 2502.14420 null
2025-02-20 Towards Efficient Automatic Self-Pruning of Large Language Models Weizhong Huang et.al. 2502.14413 null
2025-02-20 Evaluating Precise Geolocation Inference Capabilities of Vision Language Models Neel Jay et.al. 2502.14412 link
2025-02-20 Unstructured Evidence Attribution for Long Context Query Focused Summarization Dustin Wright et.al. 2502.14409 link
2025-02-20 HPS: Hard Preference Sampling for Human Preference Alignment Xiandong Zou et.al. 2502.14400 null
2025-02-20 Enhancing Portuguese Variety Identification with Cross-Domain Approaches Hugo Sousa et.al. 2502.14394 null
2025-02-20 Leveraging Small LLMs for Argument Mining in Education: Argument Component Identification, Classification, and Assessment Lucile Favero et.al. 2502.14389 null
2025-02-20 S*: Test Time Scaling for Code Generation Dacheng Li et.al. 2502.14382 link
2025-02-20 PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization Xinpeng Shou et.al. 2502.14370 null
2025-02-20 Entropy-UID: A Method for Optimizing Information Density Xinpeng Shou et.al. 2502.14366 null
2025-02-20 Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning Jiachen Zhu et.al. 2502.14361 null
2025-02-20 SR-LLM: Rethinking the Structured Representation in Large Language Model Jiahuan Zhang et.al. 2502.14352 null
2025-02-20 SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images Yichi Zhang et.al. 2502.14351 link
2025-02-20 FlowAgent: Achieving Compliance and Flexibility for Workflow Agents Yuchen Shi et.al. 2502.14345 link
2025-02-20 Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective Ruichen Shao et.al. 2502.14340 null
2025-02-20 A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics Ting-Ruen Wei et.al. 2502.14333 null
2025-02-20 SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation Junjie Sheng et.al. 2502.14328 null
2025-02-20 ChemHTS: Hierarchical Tool Stacking for Enhancing Chemical Agents Zhucong Li et.al. 2502.14327 link
2025-02-20 Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems Bingyu Yan et.al. 2502.14321 null
2025-02-20 Line Goes Up? Inherent Limitations of Benchmarks for Evaluating Large Language Models James Fodor et.al. 2502.14318 null
2025-02-20 ParallelComp: Parallel Long-Context Compressor for Length Extrapolation Jing Xiong et.al. 2502.14317 null
2025-02-20 Unveiling Cultural Blind Spots: Analyzing the Limitations of mLLMs in Procedural Text Comprehension Amir Hossein Yari et.al. 2502.14315 null
2025-02-20 Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications Kayhan Behdin et.al. 2502.14305 null
2025-02-20 MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models Shrey Pandit et.al. 2502.14302 null
2025-02-20 SEA-HELM: Southeast Asian Holistic Evaluation of Language Models Yosephine Susanto et.al. 2502.14301 null
2025-02-19 Where’s the Bug? Attention Probing for Scalable Fault Localization Adam Stein et.al. 2502.13966 null
2025-02-19 Autellix: An Efficient Serving Engine for LLM Agents as General Programs Michael Luo et.al. 2502.13965 null
2025-02-19 MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads Weihao Liu et.al. 2502.13963 link
2025-02-19 Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering William Jurayj et.al. 2502.13962 null
2025-02-19 LIDDIA: Language-based Intelligent Drug Discovery Agent Reza Averly et.al. 2502.13959 null
2025-02-19 Neurosymbolic artificial intelligence via large language models and coherence-driven inference Steve Huntsman et.al. 2502.13953 null
2025-02-19 Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region Chak Tou Leong et.al. 2502.13946 null
2025-02-19 Image compositing is all you need for data augmentation Ang Jia Ning Shermaine et.al. 2502.13936 null
2025-02-19 LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization Guanzheng Chen et.al. 2502.13922 link
2025-02-19 Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis Jiahao Gai et.al. 2502.13921 null
2025-02-19 Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health Xingbo Wang et.al. 2502.13920 null
2025-02-19 How Do LLMs Perform Two-Hop Reasoning in Context? Tianyu Guo et.al. 2502.13913 null
2025-02-19 Lost in Sequence: Do Large Language Models Understand Sequential Recommendation? Sein Kim et.al. 2502.13909 link
2025-02-19 Judging the Judges: A Collection of LLM-Generated Relevance Judgements Hossein A. Rahmani et.al. 2502.13908 link
2025-02-19 DataSciBench: An LLM Agent Benchmark for Data Science Dan Zhang et.al. 2502.13897 link
2025-02-19 NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants Yiran Qin et.al. 2502.13894 null
2025-02-19 Refining embeddings with fill-tuning: data-efficient generalised performance improvements for materials foundation models Matthew P. Wilson et.al. 2502.13886 link
2025-02-19 SPEX: Scaling Feature Interaction Explanations for LLMs Justin Singh Kang et.al. 2502.13870 link
2025-02-19 MagicGeo: Training-Free Text-Guided Geometric Diagram Generation Junxiao Wang et.al. 2502.13855 null
2025-02-19 Enhancing LLM-Based Recommendations Through Personalized Reasoning Jiahao Liu et.al. 2502.13845 null
2025-02-19 Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents Jiahao Liu et.al. 2502.13843 null
2025-02-19 Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking Yilong Chen et.al. 2502.13842 null
2025-02-19 Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models Peter Carragher et.al. 2502.13836 null
2025-02-19 Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning Zenan Li et.al. 2502.13834 link
2025-02-19 ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities Chanjin Zheng et.al. 2502.13832 link
2025-02-19 LESA: Learnable LLM Layer Scaling-Up Yifei Yang et.al. 2502.13794 link
2025-02-19 From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions Nathanaël Carraz Rakotonirina et.al. 2502.13791 link
2025-02-19 From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education Yi-Fan Zhang et.al. 2502.13789 null
2025-02-19 Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics Matthew Wood et.al. 2502.13785 link
2025-02-19 Generative Large Recommendation Models: Emerging Trends in LLMs for Recommendation Hao Wang et.al. 2502.13783 null
2025-02-19 Translation in the Hands of Many:Centering Lay Users in Machine Translation Interactions Beatrice Savoldi et.al. 2502.13780 null
2025-02-19 VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare Anudeex Shetty et.al. 2502.13775 null
2025-02-19 AI Software Engineer: Programming with Trust Abhik Roychoudhury et.al. 2502.13767 null
2025-02-19 SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning Renxi Wang et.al. 2502.13753 link
2025-02-19 Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions Xinwei Shen et.al. 2502.13747 null
2025-02-19 Enhancing Input-Label Mapping in In-Context Learning with Contrastive Decoding Keqin Peng et.al. 2502.13738 null
2025-02-19 CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models Nikolaos Dionelis et.al. 2502.13734 null
2025-02-19 Adapting Large Language Models for Time Series Modeling via a Novel Parameter-efficient Adaptation Method Juyuan Zhang et.al. 2502.13725 null
2025-02-19 Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values Hongbo Zhang et.al. 2502.13723 null
2025-02-19 TALKPLAY: Multimodal Music Recommendation with Large Language Models Seungheon Doh et.al. 2502.13713 null
2025-02-19 Is This Collection Worth My LLM’s Time? Automatically Measuring Information Potential in Text Corpora Tristan Karch et.al. 2502.13691 null
2025-02-19 An LLM-based Agent for Reliable Docker Environment Configuration Ruida Hu et.al. 2502.13681 link
2025-02-19 SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation Song Duong et.al. 2502.13674 null
2025-02-19 Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models Liyang He et.al. 2502.13656 link
2025-02-19 C2T: A Classifier-Based Tree Construction Method in Speculative Decoding Feiye Huo et.al. 2502.13652 null
2025-02-19 Reliability Across Parametric and External Knowledge: Understanding Knowledge Handling in LLMs Youna Kim et.al. 2502.13648 null
2025-02-19 D.Va: Validate Your Demonstration First Before You Use It Qi Zhang et.al. 2502.13646 null
2025-02-19 Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts Maiya Goloburda et.al. 2502.13640 null
2025-02-19 Concept Layers: Enhancing Interpretability and Intervenability via LLM Conceptualization Or Raphael Bidusa et.al. 2502.13632 null
2025-02-19 AI-Empowered Catalyst Discovery: A Survey from Classical Machine Learning Approaches to Large Language Models Yuanyuan Xu et.al. 2502.13626 null
2025-02-19 REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models DongGeon Lee et.al. 2502.13622 null
2025-02-19 Complex Ontology Matching with Large Language Model Embeddings Guilherme Sousa et.al. 2502.13619 null
2025-02-19 LaVCa: LLM-assisted Visual Cortex Captioning Takuya Matsuyama et.al. 2502.13606 null
2025-02-19 BeamLoRA: Beam-Constraint Low-Rank Adaptation Naibin Gu et.al. 2502.13604 null
2025-02-19 MMTEB: Massive Multilingual Text Embedding Benchmark Kenneth Enevoldsen et.al. 2502.13595 null
2025-02-19 Don’t Stop the Multi-Party! On Generating Synthetic Multi-Party Conversations with Constraints Nicolò Penzo et.al. 2502.13592 link
2025-02-19 Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts Xin Li et.al. 2502.13577 null
2025-02-19 LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation Xin Li et.al. 2502.13568 null
2025-02-19 Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs Joonatan Laato et.al. 2502.13566 null
2025-02-19 PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models Guangwei Li et.al. 2502.13564 link
2025-02-19 Are Large Language Models In-Context Graph Learners? Jintang Li et.al. 2502.13562 null
2025-02-19 Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs Yushi Feng et.al. 2502.13555 link
2025-02-19 STaR-SQL: Self-Taught Reasoner for Text-to-SQL Mingqian He et.al. 2502.13550 null
2025-02-19 Detecting Linguistic Bias in Government Documents Using Large language Models Milena de Swart et.al. 2502.13548 null
2025-02-19 From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN Peiwen Yuan et.al. 2502.13544 null
2025-02-19 Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference Qingfa Xiao et.al. 2502.13542 null
2025-02-19 Bursting Filter Bubble: Enhancing Serendipity Recommendations with Aligned Large Language Models Yunjia Xi et.al. 2502.13539 null
2025-02-19 Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models Jun Zhang et.al. 2502.13533 link
2025-02-19 Exploiting Prefix-Tree in Structured Output Interfaces for Enhancing Jailbreak Attacking Yanzeng Li et.al. 2502.13527 link
2025-02-19 SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin Hao Yi et.al. 2502.13516 null
2025-02-19 Unlocking Multimodal Integration in EHRs: A Prompt Learning Framework for Language and Time Series Fusion Shuai Niu et.al. 2502.13509 null
2025-02-19 Reproducing NevIR: Negation in Neural Information Retrieval Coen van Elsen et.al. 2502.13506 link
2025-02-19 PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference Burc Gokden et.al. 2502.13502 link
2025-02-19 Towards Geo-Culturally Grounded LLM Generations Piyawat Lertvittayakumjorn et.al. 2502.13497 null
2025-02-19 What are Models Thinking about? Understanding Large Language Model Hallucinations “Psychology” through Model Inner State Analysis Peiran Wang et.al. 2502.13490 null
2025-02-19 LLM4Tag: Automatic Tagging System for Information Retrieval via Large Language Models Ruiming Tang et.al. 2502.13481 null
2025-02-19 Integration of Agentic AI with 6G Networks for Mission-Critical Applications: Use-case and Challenges Sunder Ali Khowaja et.al. 2502.13476 null
2025-02-19 LLM should think and action as a human Haun Leung et.al. 2502.13475 null
2025-02-19 Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models Chenyu Zhu et.al. 2502.13474 null
2025-02-19 ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails Xiaofei Wen et.al. 2502.13458 link
2025-02-19 Interleaved Gibbs Diffusion for Constrained Generation Gautham Govind Anil et.al. 2502.13450 null
2025-02-19 Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning Yang Yan et.al. 2502.13447 null
2025-02-19 TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation Jialin Ouyang et.al. 2502.13442 link
2025-02-19 The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding? Yutao Sun et.al. 2502.13441 null
2025-02-19 MATS: An Audio Language Model under Text-only Supervision Wen Wang et.al. 2502.13433 null
2025-02-19 Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning Hao Ma et.al. 2502.13430 null
2025-02-19 MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering Guanming Xiong et.al. 2502.13428 null
2025-02-19 TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition Yuxiang Wang et.al. 2502.13422 null
2025-02-19 RLTHF: Targeted Human Feedback for LLM Alignment Yifei Xu et.al. 2502.13417 null
2025-02-19 Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning Ningke Li et.al. 2502.13416 null
2025-02-19 Explore-Construct-Filter: An Automated Framework for Rich and Reliable API Knowledge Graph Construction Yanbang Sun et.al. 2502.13412 null
2025-02-19 Generative Predictive Control: Flow Matching Policies for Dynamic and Difficult-to-Demonstrate Tasks Vince Kurtz et.al. 2502.13406 null
2025-02-19 $\mathtt{GeLLM^3O}$ : Generalizing Large Language Models for Multi-property Molecule Optimization Vishal Dey et.al. 2502.13398 link
2025-02-19 Prompting a Weighting Mechanism into LLM-as-a-Judge in Two-Step: A Case Study Wenwen Xie et.al. 2502.13396 null
2025-02-19 Flow-based generative models as iterative algorithms in probability space Yao Xie et.al. 2502.13394 null
2025-02-19 Reasoning with Reinforced Functional Token Tuning Kongcheng Zhang et.al. 2502.13389 link
2025-02-19 Reflection of Episodes: Learning to Play Game from Expert and Self Experiences Xiaojie Xu et.al. 2502.13388 null
2025-02-19 MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification Linzhuang Sun et.al. 2502.13383 link
2025-02-19 AutoTEE: Automated Migration and Protection of Programs in Trusted Execution Environments Ruidong Han et.al. 2502.13379 link
2025-02-19 Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor Barys Liskavets et.al. 2502.13374 null
2025-02-18 Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Shuo Xing et.al. 2502.13146 link
2025-02-18 Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Bencheng Liao et.al. 2502.13145 link
2025-02-18 Pre-training Auto-regressive Robotic Models with 4D Representations Dantong Niu et.al. 2502.13142 null
2025-02-18 UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models Huawei Lin et.al. 2502.13141 link
2025-02-18 AIDE: AI-Driven Exploration in the Space of Code Zhengyao Jiang et.al. 2502.13138 link
2025-02-18 Theorem Prover as a Judge for Synthetic Data Generation Joshua Ong Jun Leang et.al. 2502.13137 null
2025-02-18 AV-Flow: Transforming Text to Audio-Visual Human-like Interactions Aggelina Chatziagapi et.al. 2502.13133 null
2025-02-18 Learning to Defer for Causal Discovery with Imperfect Experts Oscar Clivio et.al. 2502.13132 null
2025-02-18 Rethinking Diverse Human Preference Learning through Principal Component Analysis Feng Luo et.al. 2502.13131 null
2025-02-18 Magma: A Foundation Model for Multimodal AI Agents Jianwei Yang et.al. 2502.13130 link
2025-02-18 Is Noise Conditioning Necessary for Denoising Generative Models? Qiao Sun et.al. 2502.13129 null
2025-02-18 Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning Jingyang Lin et.al. 2502.13127 null
2025-02-18 RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises Zenan Zhai et.al. 2502.13125 link
2025-02-18 Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context Marion Bartl et.al. 2502.13120 null
2025-02-18 STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models Narun Raman et.al. 2502.13119 null
2025-02-18 Performance Evaluation of Large Language Models in Statistical Programming Xinyi Song et.al. 2502.13117 link
2025-02-18 MatterChat: A Multi-Modal LLM for Material Science Yingheng Tang et.al. 2502.13107 null
2025-02-18 Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Mengkang Hu et.al. 2502.13092 null
2025-02-18 A Neural Difference-of-Entropies Estimator for Mutual Information Haoran Ni et.al. 2502.13085 null
2025-02-18 Personalized Image Generation with Deep Generative Models: A Decade Survey Yuxiang Wei et.al. 2502.13081 link
2025-02-18 SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models Xianfu Cheng et.al. 2502.13059 null
2025-02-18 LAMD: Context-driven Android Malware Detection and Classification with LLMs Xingzhi Qian et.al. 2502.13055 null
2025-02-18 Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction Nils Constantin Hellwig et.al. 2502.13044 null
2025-02-18 HPSS: Heuristic Prompting Strategy Search for LLM Evaluators Bosi Wen et.al. 2502.13031 null
2025-02-18 A deep learning framework for efficient pathology image analysis Peter Neidlinger et.al. 2502.13027 null
2025-02-18 Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks Markus J. Buehler et.al. 2502.13025 link
2025-02-18 Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation Sha Li et.al. 2502.13019 null
2025-02-18 Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents Chaoran Chen et.al. 2502.13012 null
2025-02-18 Adaptive Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge Mohammad Reza Rezaei et.al. 2502.13010 null
2025-02-18 You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations Frederic Kirstein et.al. 2502.13001 null
2025-02-18 Personalized Top-k Set Queries Over Predicted Scores Sohrab Namazi Nia et.al. 2502.12998 null
2025-02-18 Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs Zixiao Wang et.al. 2502.12988 null
2025-02-18 Towards Variational Flow Matching on General Geometries Olga Zaghen et.al. 2502.12981 null
2025-02-18 Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search Yifan Ji et.al. 2502.12974 link
2025-02-18 Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking Junda Zhu et.al. 2502.12970 link
2025-02-18 Trust Me, I’m Wrong: High-Certainty Hallucinations in LLMs Adi Simhi et.al. 2502.12964 null
2025-02-18 Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing Xiaoju Ye et.al. 2502.12962 null
2025-02-18 Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger Wenjun Li et.al. 2502.12961 null
2025-02-18 Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression Jaemoon Lee et.al. 2502.12951 null
2025-02-18 Fake It Till You Make It: Using Synthetic Data and Domain Knowledge for Improved Text-Based Learning for LGE Detection Athira J Jacob et.al. 2502.12948 null
2025-02-18 Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models Gyeongman Kim et.al. 2502.12947 null
2025-02-18 LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation Junchen Fu et.al. 2502.12945 null
2025-02-18 Performance of Zero-Shot Time Series Foundation Models on Cloud Data William Toner et.al. 2502.12944 null
2025-02-18 Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options Lakshmi Nair et.al. 2502.12929 link
2025-02-18 Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts Leiyu Pan et.al. 2502.12928 null
2025-02-18 SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems Mike Zhang et.al. 2502.12927 link
2025-02-18 Towards more Contextual Agents: An extractor-Generator Optimization Framework Mourad Aouini et.al. 2502.12926 null
2025-02-18 Keep what you need : extracting efficient subnetworks from large audio representation models David Genova et.al. 2502.12925 link
2025-02-18 Conditioning LLMs to Generate Code-Switched Text: A Methodology Grounded in Naturally Occurring Data Maite Heredia et.al. 2502.12924 link
2025-02-18 On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation Rune Birkmose et.al. 2502.12923 link
2025-02-18 Q-STRUM Debate: Query-Driven Contrastive Summarization for Recommendation Comparison George-Kirollos Saad et.al. 2502.12921 null
2025-02-18 Lightweight Online Adaption for Time Series Foundation Model Forecasts Thomas L. Lee et.al. 2502.12920 null
2025-02-18 GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning Sifan Zhou et.al. 2502.12913 null
2025-02-18 Probabilistic neural operators for functional uncertainty quantification Christopher Bülte et.al. 2502.12902 link
2025-02-18 Soundwave: Less is More for Speech-Text Alignment in LLMs Yuhao Zhang et.al. 2502.12900 link
2025-02-18 Multilingual European Language Models: Benchmarking Approaches and Challenges Fabio Barth et.al. 2502.12895 null
2025-02-18 CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image Kaixin Yao et.al. 2502.12894 null
2025-02-18 Are Multilingual Language Models an Off-ramp for Under-resourced Languages? Will we arrive at Digital Language Equality in Europe in 2030? Georg Rehm et.al. 2502.12886 null
2025-02-18 How desirable is alignment between LLMs and linguistically diverse human users? Pia Knoeferle et.al. 2502.12884 null
2025-02-18 Continuous Learning Conversational AI: A Personalized Agent Framework via A2C Reinforcement Learning Nandakishor M et.al. 2502.12876 null
2025-02-18 RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution Emmanuel K. Raptis et.al. 2502.12862 link
2025-02-18 PAFT: Prompt-Agnostic Fine-Tuning Chenxing Wei et.al. 2502.12859 null
2025-02-18 Rejected Dialects: Biases Against African American Language in Reward Models Joel Mire et.al. 2502.12858 link
2025-02-18 MeMo: Towards Language Models with Associative Memory Mechanisms Fabio Massimo Zanzotto et.al. 2502.12851 null
2025-02-18 MOLLM: Multi-Objective Large Language Model for Molecular Design – Optimizing with Experts Nian Ran et.al. 2502.12845 null
2025-02-18 Towards Adaptive Feedback with AI: Comparing the Feedback Quality of LLMs and Teachers on Experimentation Protocols Kathrin Seßler et.al. 2502.12842 null
2025-02-18 Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing Berk Yilmaz et.al. 2502.12838 null
2025-02-18 An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation Mohammad Feli et.al. 2502.12836 null
2025-02-18 KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan Mukhammed Togmanov et.al. 2502.12829 null
2025-02-18 Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models Rubing Lu et.al. 2502.12825 null
2025-02-18 Pitfalls of Scale: Investigating the Inverse Task of Redefinition in Large Language Models Elena Stringli et.al. 2502.12821 null
2025-02-18 Simulating User Diversity in Task-Oriented Dialogue Systems using Large Language Models Adnan Ahmad et.al. 2502.12813 null
2025-02-18 Towards Text-Image Interleaved Retrieval Xin Zhang et.al. 2502.12799 link
2025-02-18 RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models Tanqiu Jiang et.al. 2502.12794 link
2025-02-18 Commonsense Reasoning in Arab Culture Abdelrahman Sadallah et.al. 2502.12788 null
2025-02-18 Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models Daiki Chijiwa et.al. 2502.12776 null
2025-02-18 How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild Saad Obaid ul Islam et.al. 2502.12769 link
2025-02-18 R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs Sumin Jo et.al. 2502.12767 null
2025-02-18 One-bit Compressed Sensing using Generative Models Swatantra Kafle et.al. 2502.12762 null
2025-02-18 Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models Kamer Ali Yuksel et.al. 2502.12755 link
2025-02-18 Architect of the Bits World: Masked Autoregressive Modeling for Circuit Generation Guided by Truth Table Haoyuan Wu et.al. 2502.12751 null
2025-02-18 Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation Yong Zhang et.al. 2502.12744 null
2025-02-18 “I know myself better, but not really greatly”: Using LLMs to Detect and Explain LLM-Generated Texts Jiazhou Ji et.al. 2502.12743 null
2025-02-18 Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment Haoyuan Wu et.al. 2502.12732 null
2025-02-18 TREND: A Whitespace Replacement Information Hiding Method Malte Hellmeier et.al. 2502.12710 null
2025-02-18 Multi-Novelty: Improve the Diversity and Novelty of Contents Generated by Large Language Models via inference-time Multi-Views Brainstorming Arash Lagzian et.al. 2502.12700 null
2025-02-18 Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees Yongtao Wu et.al. 2502.12678 null
2025-02-18 Baichuan-M1: Pushing the Medical Capability of Large Language Models Bingning Wang et.al. 2502.12671 null
2025-02-18 Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research Xiang Liu et.al. 2502.12669 null
2025-02-18 Evaluation of Best-of-N Sampling Strategies for Language Model Alignment Yuki Ichihara et.al. 2502.12668 null
2025-02-18 A $^2$ ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization Junhui He et.al. 2502.12665 null
2025-02-18 Demystifying Multilingual Chain-of-Thought in Process Reward Modeling Weixuan Wang et.al. 2502.12663 null
2025-02-18 The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1 Kaiwen Zhou et.al. 2502.12659 null
2025-02-18 R.R.: Unveiling LLM Training Privacy through Recollection and Ranking Wenlong Meng et.al. 2502.12658 link
2025-02-18 NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation Zhiyuan Liu et.al. 2502.12638 link
2025-02-18 Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning Yunhao Gou et.al. 2502.12635 null
2025-02-18 \textit{One Size doesn’t Fit All}: A Personalized Conversational Tutoring Agent for Mathematics Instruction Ben Liu et.al. 2502.12633 null
2025-02-18 Automating Prompt Leakage Attacks on Large Language Models Using Agentic Approach Tvrtko Sternak et.al. 2502.12630 link
2025-02-18 DeepResonance: Enhancing Multimodal Music Understanding via Music-centric Multi-way Instruction Tuning Zhuoyuan Mao et.al. 2502.12623 null
2025-02-18 Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions Leonardo Ranaldi et.al. 2502.12616 null
2025-02-17 Idiosyncrasies in Large Language Models Mingjie Sun et.al. 2502.12150 link
2025-02-17 HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation Ling Yang et.al. 2502.12148 link
2025-02-17 Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control Jinyan Su et.al. 2502.12145 link
2025-02-17 Small Models Struggle to Learn from Strong Reasoners Yuetai Li et.al. 2502.12143 null
2025-02-17 SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs Yige Xu et.al. 2502.12134 null
2025-02-17 Transformer Dynamics: A neuroscientific approach to interpretability of large language models Jesseba Fernando et.al. 2502.12131 null
2025-02-17 Scaling Autonomous Agents via Automatic Reward Modeling And Planning Zhenfang Chen et.al. 2502.12130 null
2025-02-17 LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities Florian Sestak et.al. 2502.12128 link
2025-02-17 Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA Patryk Marszałek et.al. 2502.12122 link
2025-02-17 LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws Prasanna Mayilvahanan et.al. 2502.12120 null
2025-02-17 PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection Jinhe Bi et.al. 2502.12119 null
2025-02-17 A-MEM: Agentic Memory for LLM Agents Wujiang Xu et.al. 2502.12110 link
2025-02-17 Personality Structured Interview for Large Language Model Simulation in Personality Research Pengda Wang et.al. 2502.12109 null
2025-02-17 Relational Norms for Human-AI Cooperation Brian D. Earp et.al. 2502.12102 null
2025-02-17 Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications Li Qiao et.al. 2502.12096 null
2025-02-17 How compositional generalization and creativity improve as diffusion models are trained Alessandro Favero et.al. 2502.12089 null
2025-02-17 Meta-Statistical Learning: Supervised Learning of Statistical Inference Maxime Peyrard et.al. 2502.12088 null
2025-02-17 APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs Yuxiang Huang et.al. 2502.12085 link
2025-02-17 Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation Zhongyi Qiu et.al. 2502.12073 null
2025-02-17 TokenSkip: Controllable Chain-of-Thought Compression in LLMs Heming Xia et.al. 2502.12067 link
2025-02-17 CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models Yifan Zhang et.al. 2502.12066 null
2025-02-17 AI-generated Text Detection with a GLTR-based Approach Lucía Yan Wu et.al. 2502.12064 null
2025-02-17 Designing Role Vectors to Improve LLM Inference Behaviour Daniele Potertì et.al. 2502.12055 null
2025-02-17 PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning Xinyu Zhang et.al. 2502.12054 null
2025-02-17 A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond Shreya Shukla et.al. 2502.12048 null
2025-02-17 KnowPath: Knowledge-enhanced Reasoning via LLM-generated Inference Paths over Knowledge Graphs Qi Zhao et.al. 2502.12029 null
2025-02-17 SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities Fengqing Jiang et.al. 2502.12025 null
2025-02-17 Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving Xin Xu et.al. 2502.12022 null
2025-02-17 Atom of Thoughts for Markov LLM Test-Time Scaling Fengwei Teng et.al. 2502.12018 link
2025-02-17 Unsupervised Structural-Counterfactual Generation under Domain Shift Krishn Vishwas Kher et.al. 2502.12013 null
2025-02-17 Design Considerations Based on Stability for a Class of TCP Algorithms Sreekanth Prabhakar et.al. 2502.11983 null
2025-02-17 Image Inversion: A Survey from GANs to Diffusion and Beyond Yinan Chen et.al. 2502.11974 link
2025-02-17 Generating Text from Uniform Meaning Representation Emma Markle et.al. 2502.11973 link
2025-02-17 A MIMO Wireless Channel Foundation Model via CIR-CSI Consistency Jun Jiang et.al. 2502.11965 null
2025-02-17 Navigating the Helpfulness-Truthfulness Trade-Off with Uncertainty-Aware Instruction Fine-Tuning Tianyi Wu et.al. 2502.11962 null
2025-02-17 On Representational Dissociation of Language and Arithmetic in Large Language Models Riku Kisako et.al. 2502.11932 null
2025-02-17 GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs Yi Fang et.al. 2502.11925 null
2025-02-17 From Text to Trust: Empowering AI-assisted Decision Making with Adaptive LLM-powered Analysis Zhuoyan Li et.al. 2502.11919 null
2025-02-17 EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models Jiamin Su et.al. 2502.11916 null
2025-02-17 Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives Leo Schwinn et.al. 2502.11910 null
2025-02-17 MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation Haochen Xue et.al. 2502.11903 null
2025-02-17 DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation Zhihang Yuan et.al. 2502.11897 link
2025-02-17 CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning Yanxiao Zhao et.al. 2502.11896 null
2025-02-17 Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? Jacob Nielsen et.al. 2502.11895 null
2025-02-17 Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration Shao Zhang et.al. 2502.11882 link
2025-02-17 Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models Hyunwoo Kim et.al. 2502.11881 null
2025-02-17 Bitnet.cpp: Efficient Edge Inference for Ternary LLMs Jinheng Wang et.al. 2502.11880 link
2025-02-17 JoLT: Joint Probabilistic Predictions on Tabular Data Using LLMs Aliaksandra Shysheya et.al. 2502.11877 link
2025-02-17 FedEAT: A Robustness Optimization Framework for Federated LLMs Yahao Pang et.al. 2502.11863 null
2025-02-17 Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu Renhao Pei et.al. 2502.11862 link
2025-02-17 Exploring Large Language Models in Healthcare: Insights into Corpora Sources, Customization Strategies, and Evaluation Metrics Shuqi Yang et.al. 2502.11861 null
2025-02-17 StructTransform: A Scalable Attack Surface for Safety-Aligned Large Language Models Shehel Yoosuf et.al. 2502.11853 link
2025-02-17 BaxBench: Can LLMs Generate Correct and Secure Backends? Mark Vero et.al. 2502.11844 null
2025-02-17 Can LLM Agents Maintain a Persona in Discourse? Pranav Bhandari et.al. 2502.11843 null
2025-02-17 Model Generalization on Text Attribute Graphs: Principles with Large Language Models Haoyu Wang et.al. 2502.11836 link
2025-02-17 HAAN: A Holistic Approach for Accelerating Normalization Operations in Large Language Models Tianfan Peng et.al. 2502.11832 null
2025-02-17 Intuitive physics understanding emerges from self-supervised pretraining on natural videos Quentin Garrido et.al. 2502.11831 link
2025-02-17 Text Classification in the LLM Era - Where do we stand? Sowmya Vajjala et.al. 2502.11830 null
2025-02-17 Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities Hanbin Wang et.al. 2502.11829 link
2025-02-17 M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis Chengyan Wu et.al. 2502.11824 link
2025-02-17 Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis Xu Wang et.al. 2502.11812 null
2025-02-17 FineFilter: A Fine-grained Noise Filtering Mechanism for Retrieval-Augmented Large Language Models Qianchi Zhang et.al. 2502.11811 null
2025-02-17 Exploring Translation Mechanism of Large Language Models Hongbin Zhang et.al. 2502.11806 null
2025-02-17 Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning Peiying Yu et.al. 2502.11799 null
2025-02-17 Personality Editing for Language Models through Relevant Knowledge Editing Seojin Hwang et.al. 2502.11789 null
2025-02-17 Efficient Response Generation Method Selection for Fine-Tuning Large Language Models Xuan Ren et.al. 2502.11779 null
2025-02-17 video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model Guangzhi Sun et.al. 2502.11775 null
2025-02-17 The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It Leonardo Bertolazzi et.al. 2502.11771 link
2025-02-17 Cognitive-Aligned Document Selection for Retrieval-augmented Generation Bingyu Wan et.al. 2502.11770 null
2025-02-17 From Selection to Generation: A Survey of LLM-based Active Learning Yu Xia et.al. 2502.11767 null
2025-02-17 Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation Zengkui Sun et.al. 2502.11766 link
2025-02-17 HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims Michiel van der Meer et.al. 2502.11753 null
2025-02-17 Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning Yuqi Pang et.al. 2502.11751 link
2025-02-17 ILIAS: Instance-Level Image retrieval At Scale Giorgos Kordopatis-Zilos et.al. 2502.11748 null
2025-02-17 SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL Shuai Lyu et.al. 2502.11741 link
2025-02-17 ReviewEval: An Evaluation Framework for AI-Generated Reviews Chavvi Kirtani et.al. 2502.11736 null
2025-02-17 Plant in Cupboard, Orange on Table, Book on Shelf. Benchmarking Practical Reasoning and Situation Modelling in a Text-Simulated Situated Environment Jonathan Jordan et.al. 2502.11733 null
2025-02-17 Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption Alireza Nik et.al. 2502.11723 null
2025-02-17 Enhancing Recommendation Explanations through User-Centric Refinement Jingsen Zhang et.al. 2502.11721 null
2025-02-17 Can you pass that tool?: Implications of Indirect Speech in Physical Human-Robot Collaboration Yan Zhang et.al. 2502.11720 null
2025-02-17 Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection Xuan Tong et.al. 2502.11712 null
2025-02-17 Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models Sherzod Hakimov et.al. 2502.11707 null
2025-02-17 LLM Agents Making Agent Tools Georg Wölflein et.al. 2502.11705 null
2025-02-17 CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation Guangya Yu et.al. 2502.11703 null
2025-02-17 MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow Hanzhuo Huang et.al. 2502.11697 null
2025-02-17 Improve LLM-as-a-Judge Ability as a General Ability Jiachen Yu et.al. 2502.11689 null
2025-02-17 MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task Yuchen Yan et.al. 2502.11684 null
2025-02-17 RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars Yuncheng Hua et.al. 2502.11681 link
2025-02-17 Exploring LLM-based Student Simulation for Metacognitive Cultivation Haoxuan Li et.al. 2502.11678 null
2025-02-17 Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception Shiyu Ni et.al. 2502.11677 null
2025-02-17 Diversity-Oriented Data Augmentation with Large Language Models Zaitian Wang et.al. 2502.11671 null
2025-02-17 VRoPE: Rotary Position Embedding for Video Large Language Models Zikang Liu et.al. 2502.11664 link
2025-02-17 An Innovative Brain-Computer Interface Interaction System Based on the Large Language Model Jing Jina et.al. 2502.11659 null
2025-02-17 Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation Amin Qasmi et.al. 2502.11649 null
2025-02-17 DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing Yi Wang et.al. 2502.11647 null
2025-02-17 Hyperspherical Energy Transformer with Recurrent Depth Yunzhe Hu et.al. 2502.11646 null
2025-02-17 Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI Yuxia Wang et.al. 2502.11614 null
2025-02-17 Maximum Entropy Reinforcement Learning with Diffusion Policy Xiaoyi Dong et.al. 2502.11612 link
2025-02-17 Accuracy Assessment of OpenAlex and Clarivate Scholar ID with an LLM-Assisted Benchmark Renyu Zhao et.al. 2502.11610 null
2025-02-17 GraphThought: Graph Combinatorial Optimization with Thought Generation Zixiao Huang et.al. 2502.11607 null
2025-02-14 MM-RLHF: The Next Step Forward in Multimodal LLM Alignment Yi-Fan Zhang et.al. 2502.10391 null
2025-02-14 Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction WonJin Yoon et.al. 2502.10388 null
2025-02-14 Robustness tests for biomedical foundation models should tailor to specification R. Patrick Xian et.al. 2502.10374 link
2025-02-14 AffinityFlow: Guided Flows for Antibody Affinity Maturation Can Chen et.al. 2502.10365 null
2025-02-14 Enhancing Multilingual LLM Pretraining with Model-Based Data Selection Bettina Messmer et.al. 2502.10361 null
2025-02-14 Dimension-free Score Matching and Time Bootstrapping for Diffusion Models Syamantak Kumar et.al. 2502.10354 null
2025-02-14 Organize the Web: Constructing Domains Enhances Pre-Training Data Curation Alexander Wettig et.al. 2502.10341 null
2025-02-14 Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering Nick Ferguson et.al. 2502.10338 null
2025-02-14 Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions Leo Zhang et.al. 2502.10328 null
2025-02-14 LLM-Powered Preference Elicitation in Combinatorial Assignment Ermis Soumalias et.al. 2502.10308 null
2025-02-14 SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models Aditya Mishra et.al. 2502.10307 null
2025-02-14 Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2 Saem Hasan et.al. 2502.10299 null
2025-02-14 Probabilistic Super-Resolution for High-Fidelity Physical System Simulations with Uncertainty Quantification Pengyu Zhang et.al. 2502.10280 null
2025-02-14 Are Large Language Models the future crowd workers of Linguistics? Iris Ferrazzo et.al. 2502.10266 null
2025-02-14 Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers Aivin V. Solatorio et.al. 2502.10263 null
2025-02-14 VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models Gokul Karthik Kumar et.al. 2502.10250 null
2025-02-14 Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Guoqing Ma et.al. 2502.10248 link
2025-02-14 Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices Mohamed Aboelenien Ahmed et.al. 2502.10239 null
2025-02-14 Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control Thomas Jiralerspong et.al. 2502.10236 null
2025-02-14 AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting Abdelhakim Benechehab et.al. 2502.10235 link
2025-02-14 Do Large Language Models Reason Causally Like Us? Even Better? Hanna M. Dettki et.al. 2502.10215 null
2025-02-14 Can Post-Training Quantization Benefit from an Additional QLoRA Integration? Xiliang Zhu et.al. 2502.10202 null
2025-02-14 Prediction hubs are context-informed frequent tokens in LLMs Beatrix M. G. Nielsen et.al. 2502.10201 null
2025-02-14 MathConstruct: Challenging LLM Reasoning with Constructive Proofs Mislav Balunović et.al. 2502.10197 null
2025-02-14 Translating Common Security Assertions Across Processor Designs: A RISC-V Case Study Sharjeel Imtiaz et.al. 2502.10194 null
2025-02-14 VideoDiff: Human-AI Video Co-Creation with Alternatives Mina Huh et.al. 2502.10190 null
2025-02-14 Modeling biases in binary decision-making within the generalized nonlinear q-voter model Maciej Doniec et.al. 2502.10172 null
2025-02-14 Video Soundtrack Generation by Aligning Emotions and Temporal Boundaries Serkan Sulun et.al. 2502.10154 null
2025-02-14 Semantica: Decentralized Search using a LLM-Guided Semantic Tree Overlay Petru Neague et.al. 2502.10151 link
2025-02-14 Cooperative Multi-Agent Planning with Adaptive Skill Synthesis Zhiyuan Li et.al. 2502.10148 null
2025-02-14 Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages Daniil Gurgurov et.al. 2502.10140 null
2025-02-14 Physics-Informed Generative Modeling of Wireless Channels Benedikt Böck et.al. 2502.10137 null
2025-02-14 ScamFerret: Detecting Scam Websites Autonomously with Large Language Models Hiroki Nakano et.al. 2502.10110 link
2025-02-14 NeuroXVocal: Detection and Explanation of Alzheimer’s Disease through Non-invasive Analysis of Picture-prompted Speech Nikolaos Ntampakis et.al. 2502.10108 null
2025-02-14 A novel approach to data generation in generative model JaeHong Kim et.al. 2502.10092 null
2025-02-14 Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations Tianyu Song et.al. 2502.10088 link
2025-02-14 DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery Utkarsh Mall et.al. 2502.10060 null
2025-02-14 A Generalized Modeling Approach to Liquid-driven Ballooning Membranes Mirroyal Ismayilov et.al. 2502.10057 null
2025-02-14 ORI: O Routing Intelligence Ahmad Shadid et.al. 2502.10051 null
2025-02-14 A Survey on LLM-powered Agents for Recommender Systems Qiyao Peng et.al. 2502.10050 null
2025-02-14 ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in Arbitrary Virtual Environments Juyeong Hwang et.al. 2502.10046 null
2025-02-14 POI-Enhancer: An LLM-based Semantic Enhancement Framework for POI Representation Learning Jiawei Cheng et.al. 2502.10038 null
2025-02-14 Probabilistic Lexical Manifold Construction in Large Language Models via Hierarchical Vector Field Interpolation Clive Pendleton et.al. 2502.10013 null
2025-02-14 ChatGPT and Deepseek: Can They Predict the Stock Market and Macroeconomy? Jian Chen et.al. 2502.10008 null
2025-02-14 EmbBERT-Q: Breaking Memory Barriers in Embedded NLP Riccardo Bravin et.al. 2502.10001 null
2025-02-14 Decision Information Meets Large Language Models: The Future of Explainable Operations Research Yansen Zhang et.al. 2502.09994 null
2025-02-14 Large Language Diffusion Models Shen Nie et.al. 2502.09992 null
2025-02-14 V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models Hsu-kuang Chiu et.al. 2502.09980 null
2025-02-14 LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing Kuan Li et.al. 2502.09977 null
2025-02-14 Has My System Prompt Been Used? Large Language Model Prompt Membership Inference Roman Levin et.al. 2502.09974 null
2025-02-14 KGGen: Extracting Knowledge Graphs from Plain Text with Language Models Belinda Mo et.al. 2502.09956 null
2025-02-14 A Preliminary Exploration with GPT-4o Voice Mode Yu-Xiang Lin et.al. 2502.09940 null
2025-02-14 Precise Parameter Localization for Textual Generation in Diffusion Models Łukasz Staniszewski et.al. 2502.09935 null
2025-02-14 MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning Kai Yan et.al. 2502.09933 null
2025-02-14 Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence Granite Vision Team et.al. 2502.09927 null
2025-02-14 λScale: Enabling Fast Scaling for Serverless Large Language Model Inference Minchen Yu et.al. 2502.09922 null
2025-02-14 INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing Hongsun Jang et.al. 2502.09921 null
2025-02-14 AutoS $^2$ earch: Unlocking the Reasoning Potential of Large Models for Web-based Source Search Zhengqiu Zhu et.al. 2502.09913 null
2025-02-14 Insect-Foundation: A Foundation Model and Large Multimodal Dataset for Vision-Language Insect Understanding Thanh-Dat Truong et.al. 2502.09906 null
2025-02-14 The Ann Arbor Architecture for Agent-Oriented Programming Wei Dong et.al. 2502.09903 link
2025-02-14 Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond Kehan Guo et.al. 2502.09897 null
2025-02-14 ChatIoT: Large Language Model-based Security Assistant for Internet of Things with Retrieval-Augmented Generation Ye Dong et.al. 2502.09896 null
2025-02-14 ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation Shu Wang et.al. 2502.09891 null
2025-02-14 Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos Weirui Ye et.al. 2502.09886 null
2025-02-14 Solvable Dynamics of Self-Supervised Word Embeddings and the Emergence of Analogical Reasoning Dhruva Karkada et.al. 2502.09863 null
2025-02-14 Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge Naoyuki Kamo et.al. 2502.09859 null
2025-02-14 Automated Hypothesis Validation with Agentic Sequential Falsifications Kexin Huang et.al. 2502.09858 link
2025-02-14 Port-LLM: A Port Prediction Method for Fluid Antenna based on Large Language Models Yali Zhang et.al. 2502.09857 null
2025-02-14 Efficient Multitask Learning in Small Language Models Through Upside-Down Reinforcement Learning Yu-Chen Lin et.al. 2502.09854 null
2025-02-14 HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation Tianwei Lin et.al. 2502.09838 link
2025-02-13 A Solver-Aided Hierarchical Language for LLM-Driven CAD Design Benjamin T. Jones et.al. 2502.09819 null
2025-02-13 Statistical Coherence Alignment for Large Language Model Representation Learning Through Tensor Field Convergence Jonathan Gale et.al. 2502.09815 null
2025-02-13 INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages Hao Yu et.al. 2502.09814 null
2025-02-13 AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration Jizhou Chen et.al. 2502.09809 null
2025-02-13 Unit Testing Past vs. Present: Examining LLMs’ Impact on Defect Detection and Efficiency Rudolf Ramler et.al. 2502.09801 null
2025-02-13 Co-designing Large Language Model Tools for Project-Based Learning with K12 Educators Prerna Ravi et.al. 2502.09799 null
2025-02-13 A Survey on LLM-based News Recommender Systems Rongyao Wang et.al. 2502.09797 null
2025-02-13 TableTalk: Scaffolding Spreadsheet Development with a Language Agent Jenny T. Liang et.al. 2502.09787 null
2025-02-13 Improving Acoustic Side-Channel Attacks on Keyboards Using Transformers and Large Language Models Jin Hyun Park et.al. 2502.09782 null
2025-02-13 CellFlow: Simulating Cellular Morphology Changes via Flow Matching Yuhui Zhang et.al. 2502.09775 null
2025-02-13 Non-Markovian Discrete Diffusion with Causal Language Models Yangtian Zhang et.al. 2502.09767 null
2025-02-13 LLM-Generated Microservice Implementations from RESTful API Definitions Saurabh Chauhan et.al. 2502.09766 link
2025-02-13 Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization Amit Levi et.al. 2502.09755 null
2025-02-13 Vote-Tree-Planner: Optimizing Execution Order in LLM-based Task Planning Pipeline via Voting Chaoyuan Zhang et.al. 2502.09749 null
2025-02-13 The Widespread Adoption of Large Language Model-Assisted Writing Across Society Weixin Liang et.al. 2502.09747 null
2025-02-13 Fine-Tuning Foundation Models with Federated Learning for Privacy Preserving Medical Time Series Forecasting Mahad Ali et.al. 2502.09744 null
2025-02-13 FoNE: Precise Single-Token Number Embeddings via Fourier Features Tianyi Zhou et.al. 2502.09741 null
2025-02-13 Making Them a Malicious Database: Exploiting Query Code to Jailbreak Aligned Large Language Models Qingsong Zou et.al. 2502.09723 link
2025-02-13 NestQuant: Nested Lattice Quantization for Matrix Products and LLMs Semyon Savkin et.al. 2502.09720 null
2025-02-13 Genetic Data Governance in Crisis: Policy Recommendations for Safeguarding Privacy and Preventing Discrimination Vivek Ramanan et.al. 2502.09716 null
2025-02-13 MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Dongzhi Jiang et.al. 2502.09621 null
2025-02-13 Exploring the Potential of Encoder-free Architectures in 3D LMMs Yiwen Tang et.al. 2502.09620 link
2025-02-13 Designing a Conditional Prior Distribution for Flow-Based Generative Models Noam Issachar et.al. 2502.09611 null
2025-02-14 Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions Tejas Jayashankar et.al. 2502.09609 null
2025-02-13 Human-LLM Coevolution: Evidence from Academic Writing Mingmeng Geng et.al. 2502.09606 null
2025-02-13 SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Yung-Sung Chuang et.al. 2502.09604 link
2025-02-13 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao et.al. 2502.09597 link
2025-02-13 KIMAs: A Configurable Knowledge Integrated Multi-Agent System Zitao Li et.al. 2502.09596 null
2025-02-13 Logical forms complement probability in understanding language model (and human) performance Yixuan Wang et.al. 2502.09589 null
2025-02-13 Rolling Ahead Diffusion for Traffic Scene Simulation Yunpeng Liu et.al. 2502.09587 null
2025-02-13 Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks Qian Wan et.al. 2502.09577 null
2025-02-13 Zero-shot generation of synthetic neurosurgical data with large language models Austin A. Barr et.al. 2502.09566 link
2025-02-13 MDCrow: Automating Molecular Dynamics Workflows with Large Language Models Quintina Campbell et.al. 2502.09565 link
2025-02-13 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Rui Yang et.al. 2502.09560 null
2025-02-13 Explainable AI-assisted Optimization for Feynman Integral Reduction Zhuo-Yang Song et.al. 2502.09544 null
2025-02-13 Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages Shreyan Biswas et.al. 2502.09532 null
2025-02-13 SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Francesco Pezone et.al. 2502.09520 link
2025-02-13 Diffusion Models for Molecules: A Survey of Methods and Tasks Liang Wang et.al. 2502.09511 link
2025-02-13 EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling Theodoros Kouzelis et.al. 2502.09509 null
2025-02-13 Improve LLM-based Automatic Essay Scoring with Linguistic Features Zhaoyi Joey Hou et.al. 2502.09497 null
2025-02-13 Foundation Neural-Network Quantum States Riccardo Rende et.al. 2502.09488 null
2025-02-13 Objective quantification of mood states using large language models Jakub Onysk et.al. 2502.09487 null
2025-02-13 DiffRenderGAN: Addressing Training Data Scarcity in Deep Segmentation Networks for Quantitative Nanomaterial Analysis through Differentiable Rendering and Generative Modelling Dennis Possart et.al. 2502.09477 null
2025-02-13 Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction Ziyi Chen et.al. 2502.09423 null
2025-02-13 ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation Rotem Shalev-Arkushin et.al. 2502.09411 null
2025-02-13 SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Daniel Fleischer et.al. 2502.09390 link
2025-02-13 Truth Knows No Language: Evaluating Truthfulness Beyond English Blanca Calvo Figueras et.al. 2502.09387 null
2025-02-13 APT-LLM: Embedding-Based Anomaly Detection of Cyber Advanced Persistent Threats Using Large Language Models Sidahmed Benabderrahmane et.al. 2502.09385 null
2025-02-13 LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won’t Fail) Junsu Kim et.al. 2502.09376 null
2025-02-13 Inverse problems with experiment-guided AlphaFold Advaith Maddipatla et.al. 2502.09372 null
2025-02-13 Language Agents as Digital Representatives in Collective Decision-Making Daniel Jarrett et.al. 2502.09369 null
2025-02-13 Machine learning for modelling unstructured grid data in computational physics: a review Sibo Cheng et.al. 2502.09346 null
2025-02-13 ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments Youhe Jiang et.al. 2502.09334 null
2025-02-13 Beyond English: The Impact of Prompt Translation Strategies across Languages and Tasks in Multilingual LLMs Itai Mondshine et.al. 2502.09331 null
2025-02-13 Copilot Arena: A Platform for Code LLM Evaluation in the Wild Wayne Chi et.al. 2502.09328 null
2025-02-13 A Benchmark for Crime Surveillance Video Analysis with Large Models Haoran Chen et.al. 2502.09325 null
2025-02-13 A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis Kentaro Imajo et.al. 2502.09316 link
2025-02-13 When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models Samuel Joseph Amouyal et.al. 2502.09307 null
2025-02-13 Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling Paula Cordero-Encinar et.al. 2502.09306 null
2025-02-13 KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG Yiqian Huang et.al. 2502.09304 link
2025-02-13 When do neural networks learn world models? Tianren Zhang et.al. 2502.09297 null
2025-02-13 SparQLe: Speech Queries to Text Translation Through LLMs Amirbek Djanibekov et.al. 2502.09284 link
2025-02-13 GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation Hongyin Zhang et.al. 2502.09268 null
2025-02-13 AnomalyGFM: Graph Foundation Model for Zero/Few-shot Anomaly Detection Hezhe Qiao et.al. 2502.09254 null
2025-02-13 From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine Lukas Buess et.al. 2502.09242 null
2025-02-13 OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart Logistics Junhui Wang et.al. 2502.09238 null
2025-02-13 Reliable Conversational Agents under ASP Control that Understand Natural Language Yankai Zeng et.al. 2502.09237 null
2025-02-13 Data2Concept2Text: An Explainable Multilingual Framework for Data Analysis Narration Flavio Bertini et.al. 2502.09218 null
2025-02-13 LP-LM: No Hallucinations in Question Answering with Logic Programming Katherine Wu et.al. 2502.09212 link
2025-02-13 Visual Graph Question Answering with ASP and LLMs for Language Parsing Jakob Johannes Bauer et.al. 2502.09211 null
2025-02-13 On LLM-generated Logic Programs and their Inference Execution Methods Paul Tarau et.al. 2502.09209 null
2025-02-13 Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York Sanskar Sehgal et.al. 2502.09204 null
2025-02-13 XAInomaly: Explainable and Interpretable Deep Contractive Autoencoder for O-RAN Traffic Anomaly Detection Osman Tugay Basaran et.al. 2502.09194 null
2025-02-13 Thinking beyond the anthropomorphic paradigm benefits LLM research Lujain Ibrahim et.al. 2502.09192 null
2025-02-13 Matina: A Large-Scale 73B Token Persian Text Corpus Sara Bourbour Hosseinbeigi et.al. 2502.09188 null
2025-02-13 RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation Changzhi Zhou et.al. 2502.09183 null
2025-02-13 FLAME: Flexible LLM-Assisted Moderation Engine Ivan Bakulin et.al. 2502.09175 null
2025-02-13 Two-Stage Representation Learning for Analyzing Movement Behavior Dynamics in People Living with Dementia Jin Cui et.al. 2502.09173 null
2025-02-13 Improving TCM Question Answering through Tree-Organized Self-Reflective Retrieval with LLMs Chang Liu et.al. 2502.09156 null
2025-02-13 Finite-Time Analysis of Discrete-Time Stochastic Interpolants Yuhao Liu et.al. 2502.09130 null
2025-02-13 One-shot Federated Learning Methods: A Practical Guide Xiang Liu et.al. 2502.09104 null
2025-02-13 Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation Zongyu Chang et.al. 2502.09101 null
2025-02-13 Logical Reasoning in Large Language Models: A Survey Hanmeng Liu et.al. 2502.09100 null
2025-02-13 Show Me the Work: Fact-Checkers’ Requirements for Explainable Automated Fact-Checking Greta Warren et.al. 2502.09083 null
2025-02-13 CoSER: Coordinating LLM-Based Persona Simulation of Established Roles Xintao Wang et.al. 2502.09082 link
2025-02-13 Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables Xuzhao Geng et.al. 2502.09073 null
2025-02-13 Unleashing the Power of Large Language Model for Denoising Recommendation Shuyao Wang et.al. 2502.09058 null
2025-02-13 An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Kunat Pipatanakul et.al. 2502.09056 null
2025-02-13 Game Theory Meets Large Language Models: A Systematic Survey Haoran Sun et.al. 2502.09053 null
2025-02-13 Typhoon T1: An Open Thai Reasoning Model Pittawat Taveekitworachai et.al. 2502.09042 null
2025-02-13 Implementation of a Fuzzy Relational Database. Case Study: Chilean Cardboard Industry in the Maule Region Leoncio Jimenez et.al. 2502.09035 null
2025-02-13 MTDP: Modulated Transformer Diffusion Policy Model Qianhao Wang et.al. 2502.09029 null
2025-02-13 EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition Xiao Wang et.al. 2502.09020 link
2025-02-13 Diversity Enhances an LLM’s Performance in RAG and Long-context Task Zhchao Wang et.al. 2502.09017 null
2025-02-13 Hope vs. Hate: Understanding User Interactions with LGBTQ+ News Content in Mainstream US News Media through the Lens of Hope Speech Jonathan Pofcher et.al. 2502.09004 null
2025-02-13 RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models Quan Wei et.al. 2502.09003 null
2025-02-13 End-to-End triplet loss based fine-tuning for network embedding in effective PII detection Rishika Kohli et.al. 2502.09002 null
2025-02-13 Task Generalization With AutoRegressive Compositional Structure: Can Learning From $\d$ Tasks Generalize to $\d^{T}$ Tasks? Amirhesam Abedsoltan et.al. 2502.08991 null
2025-02-13 Prophet Inequalities for Bandits, Cabinets, and DAGs Robin Bowers et.al. 2502.08976 null
2025-02-13 Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning Leon Nissen et.al. 2502.08954 link
2025-02-13 Structured Convergence in Large Language Model Representations via Hierarchical Latent Space Folding Fenella Harcourt et.al. 2502.08947 null
2025-02-13 Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis Wenbo Zhang et.al. 2502.08943 null
2025-02-13 Escaping Collapse: The Strength of Weak Data for Large Language Model Training Kareem Amin et.al. 2502.08924 null
2025-02-13 Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models Xin Zhou et.al. 2502.08922 null
2025-02-13 Detecting Malicious Concepts Without Image Generation in AIGC Kun Xu et.al. 2502.08921 null
2025-02-13 InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Heejun Lee et.al. 2502.08910 null
2025-02-13 Towards Automated Fact-Checking of Real-World Claims: Exploring Task Formulation and Assessment with LLMs Premtim Sahitaj et.al. 2502.08909 null
2025-02-13 Reinforced Large Language Model is a formal theorem prover Zhiling Luo et.al. 2502.08908 link
2025-02-13 DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation Tangyu Jiang et.al. 2502.08905 null
2025-02-13 MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training Xinxin You et.al. 2502.08904 null
2025-02-13 3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning Guoqin Tang et.al. 2502.08903 null
2025-02-13 Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication Weicheng Ma et.al. 2502.08896 null
2025-02-13 ShapeLib: designing a library of procedural 3D shape abstractions with Large Language Models R. Kenny Jones et.al. 2502.08884 null
2025-02-13 Utilizing Pre-trained and Large Language Models for 10-K Items Segmentation Hsin-Min Lu et.al. 2502.08875 null
2025-02-13 Harnessing Vision Models for Time Series Analysis: A Survey Jingchao Ni et.al. 2502.08869 link
2025-02-13 A Systematic Evaluation of Generative Models on Tabular Transportation Data Chengen Wang et.al. 2502.08856 link
2025-02-12 Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation Mohammad Mahdi Abootorabi et.al. 2502.08826 link
2025-02-12 DejAIvu: Identifying and Explaining AI Art on the Web in Real-Time with Saliency Maps Jocelyn Dzuong et.al. 2502.08821 link
2025-02-12 Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model Emre Can Acikgoz et.al. 2502.08820 null
2025-02-12 Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation Koinis Vassilis et.al. 2502.08818 null
2025-02-12 Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples Andrianos Michail et.al. 2502.08638 null
2025-02-12 Ensemble based approach to quantifying uncertainty of LLM based classifications Srijith Rajamohan et.al. 2502.08631 null
2025-02-12 Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model Saurabh Kataria et.al. 2502.08612 null
2025-02-12 Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Vishwanath Pratap Singh et.al. 2502.08587 null
2025-02-12 Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks Ang Li et.al. 2502.08586 null
2025-02-12 Statistically validated projection of bipartite signed networks Anna Gallo et.al. 2502.08567 null
2025-02-12 QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval Wonduk Seo et.al. 2502.08557 null
2025-02-12 Human-Centric Foundation Models: Perception, Generation and Agentic Modeling Shixiang Tang et.al. 2502.08556 link
2025-02-12 Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies Sunnie S. Y. Kim et.al. 2502.08554 null
2025-02-12 LLMs can implicitly learn from mistakes in-context Lisa Alazraki et.al. 2502.08550 null
2025-02-12 LLM Pretraining with Continuous Concepts Jihoon Tack et.al. 2502.08524 null
2025-02-12 FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices Dezhong Yao et.al. 2502.08518 link
2025-02-12 The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data Evgenii Evstafev et.al. 2502.08515 null
2025-02-12 Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation Mahnaz Koupaee et.al. 2502.08514 link
2025-02-12 Measuring Diversity in Synthetic Datasets Yuchang Zhu et.al. 2502.08512 link
2025-02-12 Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction Wei Li et.al. 2502.08507 link
2025-02-12 Salamandra Technical Report Aitor Gonzalez-Agirre et.al. 2502.08489 link
2025-02-12 One-Shot Federated Learning with Classifier-Free Diffusion Models Obaidullah Zaland et.al. 2502.08488 null
2025-02-12 Computed fingertip touch for the instrumental control of musical sound with an excursion on the computed retinal afterimage Staas de Jong et.al. 2502.08471 null
2025-02-12 mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data Haonan Chen et.al. 2502.08468 link
2025-02-12 From Haystack to Needle: Label Space Reduction for Zero-shot Classification Nathan Vandemoortele et.al. 2502.08436 null
2025-02-12 IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance Paul Röttger et.al. 2502.08395 null
2025-02-12 ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification Jiangbo Shi et.al. 2502.08391 link
2025-02-12 Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding Konstantin Berestizshevsky et.al. 2502.08363 link
2025-02-12 Systematic Knowledge Injection into Large Language Models via Diverse Augmentation for Domain-Specific RAG Kushagra Bhushan et.al. 2502.08356 null
2025-02-12 Trustworthy GNNs with LLMs: A Systematic Review and Taxonomy Ruizhan Xue et.al. 2502.08353 null
2025-02-12 Graph Foundation Models for Recommendation: A Comprehensive Survey Bin Wu et.al. 2502.08346 null
2025-02-12 Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact Mohsin Bilal et.al. 2502.08333 null
2025-02-12 Modification and Generated-Text Detection: Achieving Dual Detection Capabilities for the Outputs of LLM by Watermark Yuhang Cai et.al. 2502.08332 null
2025-02-12 Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning Barnaby Schmitt et.al. 2502.08323 null
2025-02-12 MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection Lubna Al-Henaki et.al. 2502.08319 null
2025-02-12 Word Synchronization Challenge: A Benchmark for Word Association Responses for LLMs Tanguy Cazalets et.al. 2502.08312 null
2025-02-12 Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model Bencheng Yan et.al. 2502.08309 null
2025-02-12 HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting Shibo Feng et.al. 2502.08302 link
2025-02-12 Compromising Honesty and Harmlessness in Language Models via Deception Attacks Laurène Vaugrante et.al. 2502.08301 null
2025-02-12 Improving Existing Optimization Algorithms with LLMs Camilo Chacón Sartori et.al. 2502.08298 null
2025-02-12 Redefining Simplicity: Benchmarking Large Language Models from Lexical to Document Simplification Jipeng Qiang et.al. 2502.08281 null
2025-02-12 MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation Min Hou et.al. 2502.08271 null
2025-02-12 Exploring the Potential of Large Language Models to Simulate Personality Maria Molchanova et.al. 2502.08265 link
2025-02-12 GenIAS: Generator for Instantiating Anomalies in time Series Zahra Zamanzadeh Darban et.al. 2502.08262 null
2025-02-12 FixDrive: Automatically Repairing Autonomous Vehicle Driving Behaviour for $0.08 per Violation Yang Sun et.al. 2502.08260 link
2025-02-12 Learning Human Skill Generators at Key-Step Levels Yilu Wu et.al. 2502.08234 null
2025-02-12 Flow-of-Action: SOP Enhanced LLM-Based Multi-Agent System for Root Cause Analysis Changhua Pei et.al. 2502.08224 null
2025-02-12 Memory Offloading for Large Language Model Inference with Latency SLO Guarantees Chenxiang Ma et.al. 2502.08182 null
2025-02-12 Enhancing LLM Character-Level Manipulation via Divide and Conquer Zhen Xiong et.al. 2502.08180 null
2025-02-12 ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation Ruobing Yao et.al. 2502.08178 null
2025-02-12 SycEval: Evaluating LLM Sycophancy Aaron Fanous et.al. 2502.08177 null
2025-02-12 Intention is All You Need: Refining Your Code from Your Intention Qi Guo et.al. 2502.08172 null
2025-02-12 Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling Yang Cao et.al. 2502.08150 null
2025-02-12 ACCESS : A Benchmark for Abstract Causal Event Discovery and Reasoning Vy Vo et.al. 2502.08148 null
2025-02-12 Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers Siddharth Singh et.al. 2502.08145 null
2025-02-12 Bridging the Safety Gap: A Guardrail Pipeline for Trustworthy LLM Inferences Shanshan Han et.al. 2502.08142 null
2025-02-12 LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits Zikai Zhou et.al. 2502.08141 null
2025-02-12 Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models Sonam Gupta et.al. 2502.08130 null
2025-02-12 Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Lingfei Qian et.al. 2502.08127 link
2025-02-12 HuDEx: Integrating Hallucination Detection and Explainability for Enhancing the Reliability of LLM responses Sujeong Lee et.al. 2502.08109 null
2025-02-12 Large language models perpetuate bias in palliative care: development and analysis of the Palliative Care Adversarial Dataset (PCAD) Naomi Akhras et.al. 2502.08073 null
2025-02-12 On Mechanistic Circuits for Extractive Question-Answering Samyadeep Basu et.al. 2502.08059 null
2025-02-12 Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs Mohsinul Kabir et.al. 2502.08045 null
2025-02-12 Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery Fan Jiang et.al. 2502.08037 null
2025-02-12 Stochastic Kinetics of Transcription: Analysis and Computation Yuntao Lu et.al. 2502.08028 null
2025-02-12 Contextual Subspace Manifold Projection for Structural Refinement of Large Language Model Representations Alistair Wren et.al. 2502.08026 null
2025-02-11 Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding Ziyao Wang et.al. 2502.08020 null
2025-02-11 The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models Artem Kirsanov et.al. 2502.08009 null
2025-02-11 An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models Kasra Ahmadi et.al. 2502.08008 link
2025-02-11 Towards Training One-Step Diffusion Models Without Distillation Mingtian Zhang et.al. 2502.08005 null
2025-02-11 Universal Adversarial Attack on Aligned Multimodal LLMs Temurbek Rahmatullaev et.al. 2502.07987 null
2025-02-11 Deep Semantic Graph Learning via LLM based Node Enhancement Chuanqi Shi et.al. 2502.07982 null
2025-02-11 CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs Lejla Skelic et.al. 2502.07980 null
2025-02-11 From Hazard Identification to Controller Design: Proactive and LLM-Supported Safety Engineering for ML-Powered Systems Yining Hong et.al. 2502.07974 null
2025-02-11 Caught in the Web of Words: Do LLMs Fall for Spin in Medical Literature? Hye Sun Yun et.al. 2502.07963 null
2025-02-11 Accelerating Scientific Research Through a Multi-LLM Framework Joaquin Ramirez-Medina et.al. 2502.07960 null
2025-02-11 Bridging HCI and AI Research for the Evaluation of Conversational SE Assistants Jonan Richards et.al. 2502.07956 null
2025-02-11 Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs Ruichen Zhang et.al. 2502.07942 null
2025-02-11 Discrete Markov Probabilistic Models Le-Tuyet-Nhi Pham et.al. 2502.07939 null
2025-02-11 Distributed Approach to Haskell Based Applications Refactoring with LLMs Based Multi-Agent Systems Shahbaz Siddeeq et.al. 2502.07928 null
2025-02-11 Sign Operator for Coping with Heavy-Tailed Noise: High Probability Convergence Bounds with Extensions to Distributed Optimization and Comparison Oracle Nikita Kornilov et.al. 2502.07923 null
2025-02-11 Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning Rujing Yao et.al. 2502.07912 link
2025-02-11 DeepSeek on a Trip: Inducing Targeted Visual Hallucinations via Representation Vulnerabilities Chashi Mahiul Islam et.al. 2502.07905 null
2025-02-11 Intelligent Legal Assistant: An Interactive Clarification System for Legal Question Answering Rujing Yao et.al. 2502.07904 null
2025-02-11 HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment Youhe Jiang et.al. 2502.07903 null
2025-02-11 TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation Alex Jinpeng Wang et.al. 2502.07870 link
2025-02-11 TransMLA: Multi-head Latent Attention Is All You Need Fanxu Meng et.al. 2502.07864 link
2025-02-11 BalanceKV: KV Cache Compression through Discrepancy Theory Insu Han et.al. 2502.07861 null
2025-02-11 Pippo: High-Resolution Multi-View Humans from a Single Image Yash Kant et.al. 2502.07785 null
2025-02-11 DarwinLM: Evolutionary Structured Pruning of Large Language Models Shengkun Tang et.al. 2502.07780 link
2025-02-11 Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection Anirudh Sundara Rajan et.al. 2502.07778 null
2025-02-11 Auditing Prompt Caching in Language Model APIs Chenchen Gu et.al. 2502.07776 link
2025-02-11 Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming Azizjon Kobilov et.al. 2502.07772 null
2025-02-11 Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers Italo Santos et.al. 2502.07763 null
2025-02-11 Scalable Fingerprinting of Large Language Models Anshul Nasery et.al. 2502.07760 null
2025-02-11 Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension Wenbo Gong et.al. 2502.07752 null
2025-02-11 WHODUNIT: Evaluation benchmark for culprit detection in mystery stories Kshitij Gupta et.al. 2502.07747 link
2025-02-11 The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing Dirk Bergemann et.al. 2502.07736 null
2025-02-11 Revisiting Non-Acyclic GFlowNets in Discrete Environments Nikita Morozov et.al. 2502.07735 link
2025-02-11 Economics of Sourcing Human Data Sebastin Santy et.al. 2502.07732 null
2025-02-11 Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK Marcos Cramer et.al. 2502.07728 null
2025-02-11 Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning Aya Kayal et.al. 2502.07715 null
2025-02-11 Magic 1-For-1: Generating One Minute Video Clips within One Minute Hongwei Yi et.al. 2502.07701 link
2025-02-11 A Framework for LLM-powered Design Assistants Swaroop Panda et.al. 2502.07698 null
2025-02-11 Large Language Models as Proxies for Theories of Human Linguistic Cognition Imry Ziv et.al. 2502.07687 null
2025-02-11 Steering Protein Family Design through Profile Bayesian Flow Jingjing Gong et.al. 2502.07671 null
2025-02-11 Guiding Time-Varying Generative Models with Natural Gradients on Exponential Family Manifold Song Liu et.al. 2502.07650 null
2025-02-11 SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models Shihao Xia et.al. 2502.07644 null
2025-02-11 FoQA: A Faroese Question-Answering Dataset Annika Simonsen et.al. 2502.07642 null
2025-02-11 Distributional Instrumental Variable Method Anastasiia Holovchak et.al. 2502.07641 link
2025-02-11 Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving Yong Lin et.al. 2502.07640 link
2025-02-11 Consistency Training with Physical Constraints Che-Chia Chang et.al. 2502.07636 null
2025-02-11 Exploring Mobile Touch Interaction with Large Language Models Tim Zindulka et.al. 2502.07629 null
2025-02-11 Tractable Transformers for Flexible Conditional Generation Anji Liu et.al. 2502.07616 null
2025-02-11 Beyond Prompting: Time2Lang – Bridging Time-Series Foundation Models and Large Language Models for Health Sensing Arvind Pillai et.al. 2502.07608 null
2025-02-11 Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models Jiacong Xu et.al. 2502.07601 null
2025-02-11 Towards spatial computing: recent advances in multimodal natural interaction for XR headsets Zhimin Wang et.al. 2502.07598 null
2025-02-11 SEMU: Singular Value Decomposition for Efficient Machine Unlearning Marcin Sendera et.al. 2502.07587 null
2025-02-11 Generative Modeling with Bayesian Sample Inference Marten Lienen et.al. 2502.07580 link
2025-02-11 PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference Yufeng Gu et.al. 2502.07578 link
2025-02-11 Automated Capability Discovery via Model Self-Exploration Cong Lu et.al. 2502.07577 link
2025-02-11 JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation Shenyi Zhang et.al. 2502.07557 link
2025-02-11 O1 Embedder: Let Retrievers Think Before Action Ruin Yan et.al. 2502.07555 null
2025-02-11 Grammar Control in Dialogue Response Generation for Language Learning Chatbots Dominik Glandorf et.al. 2502.07544 link
2025-02-11 NatureLM: Deciphering the Language of Nature for Scientific Discovery Yingce Xia et.al. 2502.07527 null
2025-02-11 The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation Raman Dutt et.al. 2502.07516 link
2025-02-11 Enhance-A-Video: Better Generated Video for Free Yang Luo et.al. 2502.07508 link
2025-02-11 Towards THz-based Obstacle Sensing: A Generative Radio Environment Awareness Framework Tianyu Hu et.al. 2502.07504 null
2025-02-11 Unified Graph Networks (UGN): A Deep Neural Framework for Solving Graph Problems Rudrajit Dawn et.al. 2502.07500 null
2025-02-11 LLM-Sketch: Enhancing Network Sketches with LLM Yuanpeng Li et.al. 2502.07495 link
2025-02-11 Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More Xialie Zhuang et.al. 2502.07490 link
2025-02-11 Improving Adaptive Moment Optimization via Preconditioner Diagonalization Son Nguyen et.al. 2502.07488 null
2025-02-11 ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model Xiaochen Liu et.al. 2502.07474 null
2025-02-11 JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata Abhinaba Roy et.al. 2502.07461 link
2025-02-11 Logarithmic Regret for Online KL-Regularized Reinforcement Learning Heyang Zhao et.al. 2502.07460 null
2025-02-11 PerCul: A Story-Driven Cultural Evaluation of LLMs in Persian Erfan Moosavi Monazzah et.al. 2502.07459 null
2025-02-11 RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation Viacheslav Vasilev et.al. 2502.07455 link
2025-02-11 Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon Nurit Cohen-Inger et.al. 2502.07445 link
2025-02-11 Towards a Foundation Model for Physics-Informed Neural Networks: Multi-PDE Learning with Active Sampling Keon Vin Park et.al. 2502.07425 null
2025-02-11 RomanLens: Latent Romanization and its role in Multilinguality in LLMs Alan Saji et.al. 2502.07424 null
2025-02-11 Entity Linking using LLMs for Automated Product Carbon Footprint Estimation Steffen Castle et.al. 2502.07418 null
2025-02-11 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering Sheng Zhou et.al. 2502.07411 link
2025-02-11 MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification Anh-Tien Nguyen et.al. 2502.07409 link
2025-02-11 On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o Rundong Liu et.al. 2502.07399 link
2025-02-11 FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents Mostapha Benhenda et.al. 2502.07393 link
2025-02-11 LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Dacheng Li et.al. 2502.07374 link
2025-02-11 EvoFlow: Evolving Diverse Agentic Workflows On The Fly Guibin Zhang et.al. 2502.07373 null
2025-02-11 LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation Zican Dong et.al. 2502.07365 null
2025-02-11 Bridging the Evaluation Gap: Leveraging Large Language Models for Topic Model Evaluation Zhiyin Tan et.al. 2502.07352 link
2025-02-11 KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems Jusheng Zhang et.al. 2502.07350 null
2025-02-11 BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Xu Huang et.al. 2502.07346 link
2025-02-11 Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering Shuzheng Si et.al. 2502.07340 link
2025-02-11 Music for All: Exploring Multicultural Representations in Music Generation Models (Camera Ready) Atharva Mehta et.al. 2502.07328 link
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327 null
2025-02-11 MEMIT-Merge: Addressing MEMIT’s Key-Value Conflicts in Same-Subject Batch Editing for LLMs Zilu Dong et.al. 2502.07322 null
2025-02-11 CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Junlong Li et.al. 2502.07316 link
2025-02-11 Prompt-Based Document Modifications In Ranking Competitions Niv Bardas et.al. 2502.07315 null
2025-02-11 CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry Xiaopeng Ye et.al. 2502.07307 link
2025-02-11 TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation Navid Rajabi et.al. 2502.07306 null
2025-02-11 Flow Matching for Collaborative Filtering Chengkai Liu et.al. 2502.07303 link
2025-02-11 Generation of Drug-Induced Cardiac Reactions towards Virtual Clinical Trials Qian Shao et.al. 2502.07297 null
2025-02-11 Small Language Model Makes an Effective Long Text Extractor Yelin Chen et.al. 2502.07286 link
2025-02-11 Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization Aditya Vora et.al. 2502.07278 null
2025-02-11 Cost-Efficient Continual Learning with Sufficient Exemplar Memory Dongkyu Cho et.al. 2502.07274 null
2025-02-11 GENERator: A Long-Context Generative Genomic Foundation Model Wei Wu et.al. 2502.07272 link
2025-02-11 When More is Less: Understanding Chain-of-Thought Length in LLMs Yuyang Wu et.al. 2502.07266 null
2025-02-11 DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization Xuefeng Liu et.al. 2502.07237 null
2025-02-11 A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models Yiming Chen et.al. 2502.07222 null
2025-02-11 MLLM4PUE: Toward Universal Embeddings in Computational Pathology through Multimodal LLMs Qifeng Zhou et.al. 2502.07221 null
2025-02-11 LUNAR: LLM Unlearning via Neural Activation Redirection William F. Shen et.al. 2502.07218 null
2025-02-11 Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion Xingpei Ma et.al. 2502.07203 null
2025-02-11 Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits Long-Fei Li et.al. 2502.07193 link
2025-02-11 Bag of Tricks for Inference-time Computation of LLM Reasoning Fan Liu et.al. 2502.07191 link
2025-02-11 A Large-Scale Benchmark for Vietnamese Sentence Paraphrases Sang Quang Nguyen et.al. 2502.07188 link
2025-02-11 Refine Knowledge of Large Language Models via Adaptive Contrastive Learning Yinghui Li et.al. 2502.07184 null
2025-02-11 Does Training on Synthetic Data Make Models Less Robust? Lingze Zhang et.al. 2502.07164 null
2025-02-11 Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning Feng Chen et.al. 2502.07154 link
2025-02-11 Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning Jiayuan Zhu et.al. 2502.07143 null
2025-02-11 Language-TPP: Integrating Temporal Point Processes with Language Models for Event Analysis Quyu Kong et.al. 2502.07139 null
2025-02-10 Cardiverse: Harnessing LLMs for Novel Card Game Prototyping Danrui Li et.al. 2502.07128 null
2025-02-10 Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation Denis Bakushev et.al. 2502.07124 null
2025-02-10 Online Scheduling for LLM Inference with KV Cache Constraints Patrick Jaillet et.al. 2502.07115 null
2025-02-10 Generative Distribution Prediction: A Unified Approach to Multimodal Learning Xinyu Tian et.al. 2502.07090 null
2025-02-10 Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring Alex Heyman et.al. 2502.07087 link
2025-02-10 MPFBench: A Large Scale Dataset for SciML of Multi-Phase-Flows: Droplet and Bubble Dynamics Mehdi Shadkhah et.al. 2502.07080 null
2025-02-10 Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models Lujain Ibrahim et.al. 2502.07077 null
2025-02-10 IRepair: An Intent-Aware Approach to Repair Data-Driven Errors in Large Language Models Sayem Mohammad Imtiaz et.al. 2502.07072 null
2025-02-10 Specializing Large Language Models to Simulate Survey Response Distributions for Global Populations Yong Cao et.al. 2502.07068 link
2025-02-10 Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT Dongyang Liu et.al. 2502.06782 null
2025-02-10 Enhancing Performance of Explainable AI Models with Constrained Concept Refinement Geyu Liang et.al. 2502.06775 null
2025-02-10 Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions Jaeyeon Kim et.al. 2502.06768 null
2025-02-10 Rationalization Models for Text-to-SQL Gaetano Rossiello et.al. 2502.06759 null
2025-02-10 Accelerating Data Processing and Benchmarking of AI Models for Pathology Andrew Zhang et.al. 2502.06750 link
2025-02-10 Gradient Multi-Normalization for Stateless and Scalable LLM Training Meyer Scetbon et.al. 2502.06742 null
2025-02-10 VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data Thomas Zeng et.al. 2502.06737 null
2025-02-10 Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists Bojia Zi et.al. 2502.06734 null
2025-02-10 Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining Daouda Sow et.al. 2502.06733 null
2025-02-10 Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Runze Liu et.al. 2502.06703 link
2025-02-10 No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers Jiajun He et.al. 2502.06685 null
2025-02-10 EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks Michael Arbel et.al. 2502.06684 null
2025-02-10 Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations Rui Chen et.al. 2502.06669 null
2025-02-10 Automatic Evaluation of Healthcare LLMs Beyond Question-Answering Anna Arias-Duart et.al. 2502.06666 null
2025-02-10 Evaluation of Deep Audio Representations for Hearables Fabian Gröger et.al. 2502.06664 null
2025-02-10 EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models Xingrun Xing et.al. 2502.06663 null
2025-02-10 Unbiased Evaluation of Large Language Models from a Causal Perspective Meilin Chen et.al. 2502.06655 null
2025-02-10 In-Context Learning (and Unlearning) of Length Biases Stephanie Schoch et.al. 2502.06653 null
2025-02-10 Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A Anna Leschanowsky et.al. 2502.06652 null
2025-02-10 Automatic Annotation Augmentation Boosts Translation between Molecules and Natural Language Zhiqiang Zhong et.al. 2502.06634 null
2025-02-10 Combining Large Language Models with Static Analyzers for Code Review Generation Imen Jaoua et.al. 2502.06633 null
2025-02-10 Multi-Scale Feature Fusion with Image-Driven Spatial Integration for Left Atrium Segmentation from Cardiac MRI Images Bipasha Kundu et.al. 2502.06615 null
2025-02-10 A Large-scale AI-generated Image Inpainting Benchmark Paschalis Giakoumoglou et.al. 2502.06593 null
2025-02-10 Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training Yuchen Zhuang et.al. 2502.06589 null
2025-02-10 A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems Linxiao Gong et.al. 2502.06581 null
2025-02-10 LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM Zhi Zhou et.al. 2502.06572 link
2025-02-10 Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation Chengwen Qi et.al. 2502.06563 link
2025-02-10 Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data? Marika Swanberg et.al. 2502.06555 null
2025-02-10 Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments Marc Felix Brinner et.al. 2502.06551 null
2025-02-10 Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning Jean Vassoyan et.al. 2502.06533 null
2025-02-10 Properties of Wasserstein Gradient Flows for the Sliced-Wasserstein Distance Christophe Vauthier et.al. 2502.06525 null
2025-02-10 GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing Jinhao Duan et.al. 2502.06494 null
2025-02-10 Recent Advances in Discrete Speech Tokens: A Review Yiwei Guo et.al. 2502.06490 null
2025-02-10 Adaptive Prompting: Ad-hoc Prompt Composition for Social Bias Detection Maximilian Spliethöver et.al. 2502.06487 null
2025-02-10 WyckoffDiff - A Generative Diffusion Model for Crystal Symmetry Filip Ekström Kelvinius et.al. 2502.06485 null
2025-02-10 UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths Weijia Mao et.al. 2502.06474 null
2025-02-10 KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment Yuxing Lu et.al. 2502.06472 link
2025-02-10 A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks Hieu Minh “Jord” Nguyen et.al. 2502.06470 null
2025-02-10 MATH-Perturb: Benchmarking LLMs’ Math Reasoning Abilities against Hard Perturbations Kaixuan Huang et.al. 2502.06453 null
2025-02-10 FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model Anna Tegon et.al. 2502.06438 null
2025-02-10 Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising Huaqiu Li et.al. 2502.06432 link
2025-02-10 CoS: Chain-of-Shot Prompting for Long Video Understanding Jian Hu et.al. 2502.06428 null
2025-02-10 Generating Privacy-Preserving Personalized Advice with Zero-Knowledge Proofs and LLMs Hiroki Watanabe et.al. 2502.06425 null
2025-02-10 Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models Tianshuo Xu et.al. 2502.06419 null
2025-02-10 Systematic Outliers in Large Language Models Yongqi An et.al. 2502.06415 link
2025-02-10 AppVLM: A Lightweight Vision Language Model for Online App Control Georgios Papoudakis et.al. 2502.06395 null
2025-02-10 How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators Shang Liu et.al. 2502.06387 null
2025-02-10 Simulation as Reality? The Effectiveness of LLM-Generated Data in Open-ended Question Assessment Long Zhang et.al. 2502.06371 null
2025-02-10 Calibrating LLMs with Information-Theoretic Evidential Deep Learning Yawei Li et.al. 2502.06351 link
2025-02-10 Can AI Examine Novelty of Patents?: Novelty Evaluation Based on the Correspondence between Patent Claim and Prior Art Hayato Ikoma et.al. 2502.06316 null
2025-02-10 Latent Convergence Modulation in Large Language Models: A Novel Approach to Iterative Contextual Realignment Patricia Porretta et.al. 2502.06302 null
2025-02-10 SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia Chaoqun Liu et.al. 2502.06298 null
2025-02-10 Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases? Qingshan Hou et.al. 2502.06289 null
2025-02-10 Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE Haiduo Huang et.al. 2502.06282 link
2025-02-10 DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models Utkarsh Tiwari et.al. 2502.06279 null
2025-02-10 Emergent Response Planning in LLM Zhichen Dong et.al. 2502.06258 null
2025-02-10 K-ON: Stacking Knowledge On the Head Layer of Large Language Model Lingbing Guo et.al. 2502.06257 null
2025-02-10 Find Central Dogma Again Wang Liang et.al. 2502.06253 null
2025-02-10 Amplifying Minority Voices: AI-Mediated Devil’s Advocate System for Inclusive Group Decision-Making Soohwan Lee et.al. 2502.06251 null
2025-02-10 PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts Zeman Li et.al. 2502.06244 null
2025-02-10 Fully Exploiting Vision Foundation Model’s Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing Sicen Guo et.al. 2502.06219 null
2025-02-10 LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks Xin Zhou et.al. 2502.06215 null
2025-02-10 Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement Junyu Lu et.al. 2502.06207 null
2025-02-10 C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation Guoxin Chen et.al. 2502.06205 null
2025-02-10 Non-literal Understanding of Number Words by Language Models Polina Tsvilodub et.al. 2502.06204 null
2025-02-10 Timing Matters: How Using LLMs at Different Timings Influences Writers’ Perceptions and Ideation Outcomes in AI-Assisted Ideation Peinuan Qin et.al. 2502.06197 null
2025-02-10 Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering Ruiqi Wang et.al. 2502.06193 null
2025-02-10 Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis Sanket Jantre et.al. 2502.06173 null
2025-02-10 A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation Wenhui Lei et.al. 2502.06171 null
2025-02-10 Universal Approximation of Visual Autoregressive Transformers Yifang Chen et.al. 2502.06167 null
2025-02-10 Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy Kamyar Kazari et.al. 2502.06150 null
2025-02-10 Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection Yan Weng et.al. 2502.06148 null
2025-02-10 LegalViz: Legal Text Visualization by Text To Diagram Generation Eri Onami et.al. 2502.06147 null
2025-02-10 LCIRC: A Recurrent Compression Approach for Efficient Long-form Context and Query Dependent Modeling in LLMs Sumin An et.al. 2502.06139 null
2025-02-10 Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models Ce Zhang et.al. 2502.06130 null
2025-02-10 Foundation Model of Electronic Medical Records for Adaptive Risk Estimation Pawel Renc et.al. 2502.06124 link
2025-02-10 Task-driven Layerwise Additive Activation Intervention Hieu Trung Nguyen et.al. 2502.06115 null
2025-02-10 CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories Yijia Xiao et.al. 2502.06111 null
2025-02-10 RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning Jian Xu et.al. 2502.06101 link
2025-02-10 ConMeC: A Dataset for Metonymy Resolution with Common Nouns Saptarshi Ghosh et.al. 2502.06087 link
2025-02-10 Physics-Guided Foundation Model for Scientific Discovery: An Application to Aquatic Science Runlong Yu et.al. 2502.06084 link
2025-02-10 Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo Cheuk Kit Lee et.al. 2502.06079 null
2025-02-09 Deconstructing Depression Stigma: Integrating AI-driven Data Collection and Analysis with Causal Knowledge Graphs Han Meng et.al. 2502.06075 null
2025-02-09 Allegro-FM: Towards Equivariant Foundation Model for Exascale Molecular Dynamics Simulations Ken-ichi Nomura et.al. 2502.06073 null
2025-02-09 Benchmarking Prompt Sensitivity in Large Language Models Amirhossein Razavi et.al. 2502.06065 null
2025-02-09 Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization Jiajun Fan et.al. 2502.06061 null
2025-02-09 Benchmarking Prompt Engineering Techniques for Secure Code Generation with GPT Models Marc Bruni et.al. 2502.06039 null
2025-02-09 Investigating Compositional Reasoning in Time Series Foundation Models Willa Potosnak et.al. 2502.06037 link
2025-02-09 A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions Elisa Negrini et.al. 2502.06026 link
2025-02-09 Dual Caption Preference Optimization for Diffusion Models Amir Saeidi et.al. 2502.06023 null
2025-02-09 Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding Xingjian Diao et.al. 2502.06020 link
2025-02-09 Media Bias Detector: Designing and Implementing a Tool for Real-Time Selection and Framing Bias Analysis in News Coverage Jenny S Wang et.al. 2502.06009 null
2025-02-09 Analysis of LLM as a grammatical feature tagger for African American English Rahul Porwal et.al. 2502.06004 null
2025-02-09 HamRaz: A Culture-Based Persian Conversation Dataset for Person-Centered Therapy Using LLM Agents Mohammad Amin Abbasi et.al. 2502.05982 null
2025-02-09 $μ$ nit Scaling: Simple and Scalable FP8 LLM Training Saaketh Narayan et.al. 2502.05967 null
2025-02-09 Redefining Robot Generalization Through Interactive Intelligence Sharmita Dey et.al. 2502.05963 null
2025-02-09 MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents Jiabin Tang et.al. 2502.05957 null
2025-02-09 Cyri: A Conversational AI-based Assistant for Supporting the Human User in Detecting and Responding to Phishing Attacks Antonio La Torre et.al. 2502.05951 null
2025-02-09 Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention Zhendong Zhang et.al. 2502.05947 null
2025-02-09 “Let the AI conspiracy begin…” Language Model coordination is just one inference-intervention away Paul Darm et.al. 2502.05945 null
2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray Yunhang Shen et.al. 2502.05177 link
2025-02-07 Fillerbuster: Multi-View Scene Completion for Casual Captures Ethan Weber et.al. 2502.05175 null
2025-02-07 NoLiMa: Long-Context Evaluation Beyond Literal Matching Ali Modarressi et.al. 2502.05167 link
2025-02-07 Multitwine: Multi-Object Compositing with Text and Layout Control Gemma Canet Tarrés et.al. 2502.05165 null
2025-02-07 DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails Yihe Deng et.al. 2502.05163 link
2025-02-07 A Lightweight Method to Disrupt Memorized Sequences in LLM Parjanya Prajakta Prashant et.al. 2502.05159 null
2025-02-07 Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation Steffen Eger et.al. 2502.05151 link
2025-02-07 CodeSCM: Causal Analysis for Multi-Modal Code Generation Mukur Gupta et.al. 2502.05150 link
2025-02-07 An Annotated Reading of ‘The Singer of Tales’ in the LLM Era Kush R. Varshney et.al. 2502.05148 null
2025-02-07 Chest X-ray Foundation Model with Global and Local Representations Integration Zefan Yang et.al. 2502.05142 link
2025-02-07 Latent Swap Joint Diffusion for Long-Form Audio Generation Yusheng Dai et.al. 2502.05130 null
2025-02-07 Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning Matt von Hippel et.al. 2502.05121 null
2025-02-07 Flexible and Efficient Grammar-Constrained Decoding Kanghee Park et.al. 2502.05111 null
2025-02-07 Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs Rohit Saxena et.al. 2502.05092 null
2025-02-07 Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs Thierry Bossy et.al. 2502.05087 link
2025-02-07 Causality can systematically address the monsters under the bench(marks) Felix Leeb et.al. 2502.05085 null
2025-02-07 ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework Xiaoyu Deng et.al. 2502.05084 null
2025-02-07 Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures Tushar Pandey et.al. 2502.05078 link
2025-02-07 Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images Aditya Kumar et.al. 2502.05066 link
2025-02-07 nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow Geliang Ouyang et.al. 2502.05036 link
2025-02-07 Prospects for detecting generic fast-time features in the neutrino lightcurve of nearby supernovae in neutrino telescopes Jakob Beise et.al. 2502.05024 null
2025-02-07 QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Andrei Panferov et.al. 2502.05003 link
2025-02-07 Aligning Black-box Language Models with Human Judgments Gerrit J. J. van den Burg et.al. 2502.04997 null
2025-02-07 C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features Chenxing Sun et.al. 2502.04991 null
2025-02-07 MoGraphGPT: Creating Interactive Scenes Using Modular LLM and Graphical Control Hui Ye et.al. 2502.04983 null
2025-02-07 Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits Finn Rietz et.al. 2502.04979 null
2025-02-07 Towards Multimodal Empathetic Response Generation: A Rich Text-Speech-Vision Avatar-based Benchmark Han Zhang et.al. 2502.04976 null
2025-02-07 CoCoA: A Generalized Approach to Uncertainty Quantification by Integrating Confidence and Consistency of LLM Outputs Roman Vashurin et.al. 2502.04964 null
2025-02-07 The Rising Threat to Emerging AI-Powered Search Engines Zeren Luo et.al. 2502.04951 null
2025-02-07 Mobile Network-specialized Large Language Models for 6G: Architectures, Innovations, Challenges, and Future Trends Abdelaali Chaoub et.al. 2502.04933 null
2025-02-07 Generative-enhanced optimization for knapsack problems: an industry-relevant study Yelyzaveta Vodovozova et.al. 2502.04928 null
2025-02-07 Classification or Prompting: A Case Study on Legal Requirements Traceability Romina Etezadi et.al. 2502.04916 null
2025-02-07 Goku: Flow Based Video Generative Foundation Models Shoufa Chen et.al. 2502.04896 null
2025-02-07 A Foundational Brain Dynamics Model via Stochastic Optimal Control Joonhyeong Park et.al. 2502.04892 null
2025-02-07 Training-free Task-oriented Grasp Generation Jiaming Wang et.al. 2502.04873 null
2025-02-07 Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration Yifeng Yu et.al. 2502.04849 null
2025-02-07 Developmentally-plausible Working Memory Shapes a Critical Period for Language Acquisition Masato Mita et.al. 2502.04795 null
2025-02-07 S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency Yuting Zeng et.al. 2502.04790 null
2025-02-07 Probing Internal Representations of Multi-Word Verbs in Large Language Models Hassane Kissane et.al. 2502.04789 null
2025-02-07 Enhancing SQL Injection Detection and Prevention Using Generative Models Naga Sai Dasari et.al. 2502.04786 null
2025-02-07 SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning Wanjia Zhao et.al. 2502.04780 link
2025-02-07 SeDi-Instruct: Enhancing Alignment of Language Models through Self-Directed Instruction Generation Jungwoo Kim et.al. 2502.04774 null
2025-02-07 Enhancing Phishing Email Identification with Large Language Models Catherine Lee et.al. 2502.04759 null
2025-02-07 Concept Navigation and Classification via Open Source Large Language Model Processing Maël Kubli et.al. 2502.04756 null
2025-02-07 Every Software as an Agent: Blueprint and Case Study Mengwei Xu et.al. 2502.04747 null
2025-02-07 PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational Autoencoders Tianyu Xie et.al. 2502.04730 link
2025-02-07 Generating Symbolic World Models via Test-time Scaling of Large Language Models Zhouliang Yu et.al. 2502.04728 link
2025-02-07 Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics? Sourabrata Mukherjee et.al. 2502.04718 null
2025-02-07 Enhancing Impression Change Prediction in Speed Dating Simulations Based on Speakers’ Personalities Kazuya Matsuo et.al. 2502.04706 null
2025-02-07 STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion Zhenwei Wu et.al. 2502.04692 null
2025-02-07 ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning Yuwei Yin et.al. 2502.04689 link
2025-02-07 M-IFEval: Multilingual Instruction-Following Evaluation Antoine Dussolle et.al. 2502.04688 link
2025-02-07 Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization Zelai Xu et.al. 2502.04686 null
2025-02-07 G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models Mengdi Liu et.al. 2502.04684 null
2025-02-07 CALF-SBM: A Covariate-Assisted Latent Factor Stochastic Block Model Sydney Louit et.al. 2502.04681 null
2025-02-07 LLM Query Scheduling with Prefix Reuse and Latency Constraints Gregory Dexter et.al. 2502.04677 null
2025-02-07 AdParaphrase: Paraphrase Dataset for Analyzing Linguistic Features toward Generating Attractive Ad Texts Soichiro Murakami et.al. 2502.04674 link
2025-02-07 Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization Xinhao Yao et.al. 2502.04667 link
2025-02-07 Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy Rishabh Uapadhyay et.al. 2502.04666 null
2025-02-07 Importance Sampling via Score-based Generative Models Heasung Kim et.al. 2502.04646 null
2025-02-07 Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research Junde Wu et.al. 2502.04644 link
2025-02-07 Confidence Elicitation: A New Attack Vector for Large Language Models Brian Formento et.al. 2502.04643 link
2025-02-07 Contrastive Learning-Enhanced Large Language Models for Monolith-to-Microservice Decomposition Khaled Sellami et.al. 2502.04604 null
2025-02-07 Extracting and Understanding the Superficial Knowledge in Alignment Runjin Chen et.al. 2502.04602 link
2025-02-07 The $α$ -Alternator: Dynamic Adaptation To Varying Noise Levels In Sequences Using The Vendi Score For Improved Robustness and Performance Mohammad Reza Rezaei et.al. 2502.04593 null
2025-02-07 Position-aware Automatic Circuit Discovery Tal Haklay et.al. 2502.04577 link
2025-02-06 My LLM might Mimic AAE – But When Should it? Sandra C. Sandoval et.al. 2502.04564 link
2025-02-06 Speeding up Speculative Decoding via Approximate Verification Meiyu Zhong et.al. 2502.04557 null
2025-02-06 TruthFlow: Truthful LLM Generation via Representation Flow Correction Hanyu Wang et.al. 2502.04556 null
2025-02-06 Contextual Gradient Flow Modeling for Large Language Model Generalization in Multi-Scale Feature Spaces Daphne Quillington et.al. 2502.04548 null
2025-02-06 Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection Minseok Jung et.al. 2502.04528 null
2025-02-06 Safety is Essential for Responsible Open-Ended Systems Ivaxi Sheth et.al. 2502.04512 null
2025-02-06 ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization Zijun Wu et.al. 2502.04501 null
2025-02-06 Verifiable Format Control for Large Language Model Generations Zhaoyang Wang et.al. 2502.04498 null
2025-02-06 Multi-Agent Reinforcement Learning with Focal Diversity Optimization Selim Furkan Tekin et.al. 2502.04492 link
2025-02-06 Building A Unified AI-centric Language System: analysis, framework and future work Edward Hong Wang et.al. 2502.04488 null
2025-02-06 Active Task Disambiguation with LLMs Katarzyna Kobalczyk et.al. 2502.04485 link
2025-02-06 The ML Supply Chain in the Era of Software 2.0: Lessons Learned from Hugging Face Trevor Stalnaker et.al. 2502.04484 null
2025-02-06 Near-Optimal Sample Complexity for MDPs via Anchoring Jongmin Lee et.al. 2502.04477 null
2025-02-06 ADIFF: Explaining audio difference using natural language Soham Deshmukh et.al. 2502.04476 link
2025-02-06 Augmented Conditioning Is Enough For Effective Training Image Generation Jiahui Chen et.al. 2502.04475 null
2025-02-06 Iterative Importance Fine-tuning of Diffusion Models Alexander Denker et.al. 2502.04468 null
2025-02-06 FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks Luca Della Libera et.al. 2502.04465 null
2025-02-06 Training Language Models to Reason Efficiently Daman Arora et.al. 2502.04463 link
2025-02-06 Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization Yu-Neng Chuang et.al. 2502.04428 null
2025-02-06 Decoding AI Judgment: How LLMs Assess News Credibility and Bias Edoardo Loru et.al. 2502.04426 null
2025-02-06 EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models He Hu et.al. 2502.04424 null
2025-02-06 Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Zuyan Liu et.al. 2502.04328 link
2025-02-06 Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness Karolina Rudnicka et.al. 2502.04324 null
2025-02-06 Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions Yik Siu Chan et.al. 2502.04322 link
2025-02-06 ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features Alec Helbling et.al. 2502.04320 link
2025-02-06 sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views Eyvaz Najafli et.al. 2502.04318 null
2025-02-06 ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters Kamer Ali Yuksel et.al. 2502.04315 link
2025-02-06 ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization Yinjie Wang et.al. 2502.04306 link
2025-02-06 MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Jinbo Xing et.al. 2502.04299 null
2025-02-06 Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression Lirui Wang et.al. 2502.04296 null
2025-02-06 Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Yuanye Liu et.al. 2502.04295 link
2025-02-06 PILAF: Optimal Human Preference Sampling for Reward Modeling Yunzhen Feng et.al. 2502.04270 null
2025-02-06 Efficient Randomized Experiments Using Foundation Models Piersilvio De Bartolomeis et.al. 2502.04262 link
2025-02-06 Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention Ayush K. Varshney et.al. 2502.04260 null
2025-02-06 MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Xintong Hao et.al. 2502.04235 null
2025-02-06 Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks Andreas Happe et.al. 2502.04227 null
2025-02-06 Keep It Light! Simplifying Image Clustering Via Text-Free Adapters Yicen Li et.al. 2502.04226 null
2025-02-06 Éclair – Extracting Content and Layout with Integrated Reading Order for Documents Ilia Karmanov et.al. 2502.04223 null
2025-02-06 Sports and Women’s Sports: Gender Bias in Text Generation with Olympic Data Laura Biester et.al. 2502.04218 null
2025-02-06 Algorithmic causal structure emerging through compression Liang Wendong et.al. 2502.04210 null
2025-02-06 “Short-length” Adversarial Training Helps LLMs Defend “Long-length” Jailbreak Attacks: Theoretical and Empirical Evidence Shaopeng Fu et.al. 2502.04204 link
2025-02-06 The Best Instruction-Tuning Data are Those That Fit Dylan Zhang et.al. 2502.04194 null
2025-02-06 PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models? Mennatullah Siam et.al. 2502.04192 link
2025-02-06 Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models Carlos Eduardo Duarte et.al. 2502.04188 null
2025-02-06 Multi-agent Architecture Search via Agentic Supernet Guibin Zhang et.al. 2502.04180 link
2025-02-06 MRAMG-Bench: A BeyondText Benchmark for Multimodal Retrieval-Augmented Multimodal Generation Qinhan Yu et.al. 2502.04176 null
2025-02-06 Diffusion-based mass map reconstruction from weak lensing data Supranta S. Boruah et.al. 2502.04158 null
2025-02-06 UltraIF: Advancing Instruction Following from the Wild Kaikai An et.al. 2502.04153 link
2025-02-06 The Order Effect: Investigating Prompt Sensitivity in Closed-Source LLMs Bryan Guan et.al. 2502.04134 null
2025-02-06 Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis Zhen Ye et.al. 2502.04128 null
2025-02-06 Generative Adversarial Networks Bridging Art and Machine Intelligence Junhao Song et.al. 2502.04116 null
2025-02-06 VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output Eason Chen et.al. 2502.04103 null
2025-02-06 LLMs to Support a Domain Specific Knowledge Assistant Maria-Flavia Lovin et.al. 2502.04095 null
2025-02-06 AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference Qingyue Yang et.al. 2502.04077 link
2025-02-06 Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency Shangkun Sun et.al. 2502.04076 link
2025-02-06 Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training Changhao Jiang et.al. 2502.04066 null
2025-02-06 TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers Younghye Hwang et.al. 2502.04056 null
2025-02-06 Exploring Imbalanced Annotations for Effective In-Context Learning Hongfu Gao et.al. 2502.04037 null
2025-02-06 Fine, I’ll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging Guinan Su et.al. 2502.04030 null
2025-02-06 Echo-Teddy: Preliminary Design and Development of Large Language Model-based Social Robot for Autistic Students Unggi Lee et.al. 2502.04029 null
2025-02-06 Quantification of Biodiversity from Historical Survey Text with LLM-based Best-Worst Scaling Thomas Haider et.al. 2502.04022 null
2025-02-06 Automating a Complete Software Test Process Using LLMs: An Automotive Case Study Shuai Wang et.al. 2502.04008 null
2025-02-06 CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing Yu Yuan et.al. 2502.03997 null
2025-02-06 Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering Longquan Jiang et.al. 2502.03992 link
2025-02-06 Tight Bounds on Jensen’s Gap: Novel Approach with Applications in Generative Modeling Marcin Mazur et.al. 2502.03988 null
2025-02-06 MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation YoonJe Kang et.al. 2502.03966 null
2025-02-06 MAQInstruct: Instruction-based Unified Event Relation Extraction Jun Xu et.al. 2502.03954 null
2025-02-06 LR0.FM: Low-Resolution Zero-shot Classification Benchmark For Foundation Models Priyank Pathak et.al. 2502.03950 link
2025-02-06 Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond Mardhiyah Sanni et.al. 2502.03945 null
2025-02-06 Unravelling Causal Genetic Biomarkers of Alzheimer’s Disease via Neuron to Gene-token Backtracking in Neural Architecture: A Groundbreaking Reverse-Gene-Finder Approach Victor OK Li et.al. 2502.03938 null
2025-02-06 Quantifying Correlations of Machine Learning Models Yuanyuan Li et.al. 2502.03937 link
2025-02-06 HEP-JEPA: A foundation model for collider physics using joint embedding predictive architecture Jai Bardhan et.al. 2502.03933 null
2025-02-06 Experiments with Large Language Models on Retrieval-Augmented Generation for Closed-Source Simulation Software Andreas Baumann et.al. 2502.03916 null
2025-02-06 No Free Lunch in Annotation either: An objective evaluation of foundation models for streamlining annotation in animal tracking Emil Mededovic et.al. 2502.03907 link
2025-02-06 LeAP: Consistent multi-domain 3D labeling using Foundation Models Simon Gebraad et.al. 2502.03901 null
2025-02-06 InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers Chenchen Shou et.al. 2502.03885 null
2025-02-06 Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning Peizhuang Cong et.al. 2502.03884 null
2025-02-06 BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation Bo Pang et.al. 2502.03860 null
2025-02-06 PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication Zhuohui Zhang et.al. 2502.03845 null
2025-02-06 Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis Lin Yuan et.al. 2502.03843 null
2025-02-06 FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing Jinya Sakurai et.al. 2502.03826 null
2025-02-06 Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation Tianhao Li et.al. 2502.03825 null
2025-02-06 PsyPlay: Personality-Infused Role-Playing Conversational Agents Tao Yang et.al. 2502.03821 null
2025-02-06 Large Language Models for Multi-Robot Systems: A Survey Peihan Li et.al. 2502.03814 null
2025-02-06 Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective Yuan Feng et.al. 2502.03805 link
2025-02-06 Understanding and Supporting Formal Email Exchange by Answering AI-Generated Questions Yusuke Miura et.al. 2502.03804 null
2025-02-06 Enhancing Hallucination Detection through Noise Injection Litian Liu et.al. 2502.03799 null
2025-02-06 Distribution learning via neural differential equations: minimal energy regularization and approximation theory Youssef Marzouk et.al. 2502.03795 null
2025-02-06 It’s All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers Benjamin Clavié et.al. 2502.03793 null
2025-02-06 Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence Jacob Fein-Ashley et.al. 2502.03787 null
2025-02-06 GistVis: Automatic Generation of Word-scale Visualizations from Data-rich Documents Ruishi Zou et.al. 2502.03784 link
2025-02-06 Adaptive Semantic Prompt Caching with VectorQ Luis Gaspar Schroeder et.al. 2502.03771 null
2025-02-06 Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models Meiquan Dong et.al. 2502.03766 null
2025-02-06 Rethinking the Residual Distribution of Locate-then-Editing Methods in Model Editing Xiaopeng Li et.al. 2502.03748 null
2025-02-06 Speaking the Language of Teamwork: LLM-Guided Credit Assignment in Multi-Agent Reinforcement Learning Muhan Lin et.al. 2502.03723 null
2025-02-06 Boosting Knowledge Graph-based Recommendations through Confidence-Aware Augmentation with Large Language Models Rui Cai et.al. 2502.03715 null
2025-02-06 MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers Nicole Cho et.al. 2502.03711 null
2025-02-06 Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers Daniel Beaglehole et.al. 2502.03708 null
2025-02-06 LLM Alignment as Retriever Optimization: An Information Retrieval Perspective Bowen Jin et.al. 2502.03699 null
2025-02-06 A Comparison of DeepSeek and Other LLMs Tianchen Gao et.al. 2502.03688 null
2025-02-06 Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free Gian Mario Favero et.al. 2502.03687 null
2025-02-06 Controlled LLM Decoding via Discrete Auto-regressive Biasing Patrick Pynadath et.al. 2502.03685 null
2025-02-05 Reflection-Window Decoding: Text Generation with Selective Refinement Zeyu Tang et.al. 2502.03678 null
2025-02-05 Advancing Reasoning in Large Language Models: Promising Methods and Approaches Avinash Patil et.al. 2502.03671 null
2025-02-05 Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set Yikai Wu et.al. 2502.03669 null
2025-02-05 Privacy-Preserving Generative Models: A Comprehensive Survey Debalina Padariya et.al. 2502.03668 null
2025-02-05 Context-Preserving Gradient Modulation for Large Language Models: A Novel Approach to Semantic Consistency in Long-Form Text Generation Nirola Kobanov et.al. 2502.03643 null
2025-02-05 SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models Daniel Levy et.al. 2502.03638 link
2025-02-05 AdaPhish: AI-Powered Adaptive Defense and Education Resource Against Deceptive Emails Rei Meguro et.al. 2502.03622 null
2025-02-05 Bilevel ZOFO: Bridging Parameter-Efficient and Zeroth-Order Techniques for Efficient LLM Fine-Tuning and Meta-Training Reza Shirkavand et.al. 2502.03604 null
2025-02-05 HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference Zeyu Zhang et.al. 2502.03589 null
2025-02-05 A Mixed-Methods Evaluation of LLM-Based Chatbots for Menopause Roshini Deva et.al. 2502.03579 null
2025-02-05 Code Simulation as a Proxy for High-order Tasks in Large Language Models Emanuele La Malfa et.al. 2502.03568 null
2025-02-05 Kronecker Mask and Interpretive Prompts are Language-Action Video Learners Jingyi Yang et.al. 2502.03549 link
2025-02-05 YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment Amitava Das et.al. 2502.03512 null
2025-02-05 Do Large Language Model Benchmarks Test Reliability? Joshua Vendrow et.al. 2502.03461 link
2025-02-05 Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training Boyao Wang et.al. 2502.03460 null
2025-02-05 A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) Yiye Chen et.al. 2502.03450 null
2025-02-05 Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics Xuan Li et.al. 2502.03449 null
2025-02-05 BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving Ran Xin et.al. 2502.03438 null
2025-02-05 Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization Yu-Han Wu et.al. 2502.03435 null
2025-02-05 On Fairness of Unified Multimodal Large Language Model for Image Generation Ming Liu et.al. 2502.03429 null
2025-02-05 Harnessing Large Language Models for Curated Code Reviews Oussama Ben Sghaier et.al. 2502.03425 link
2025-02-05 Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation Alexey A. Novikov et.al. 2502.03420 null
2025-02-05 Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts Nikta Gohari Sadr et.al. 2502.03418 null
2025-02-05 SPRI: Aligning Large Language Models with Context-Situated Principles Hongli Zhan et.al. 2502.03397 null
2025-02-05 Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications Issar Arab et.al. 2502.03395 null
2025-02-05 LIMO: Less is More for Reasoning Yixin Ye et.al. 2502.03387 link
2025-02-05 Transformers and Their Roles as Time Series Foundation Models Dennis Wu et.al. 2502.03383 null
2025-02-05 Demystifying Long Chain-of-Thought Reasoning in LLMs Edward Yeo et.al. 2502.03373 link
2025-02-05 PalimpChat: Declarative and Interactive AI analytics Chunwei Liu et.al. 2502.03368 null
2025-02-05 RadVLM: A Multitask Conversational Vision-Language Model for Radiology Nicolas Deperrois et.al. 2502.03333 null
2025-02-05 ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model Qiguang Chen et.al. 2502.03325 null
2025-02-05 Out-of-Distribution Detection using Synthetic Data Generation Momin Abbas et.al. 2502.03323 null
2025-02-05 Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques Sangjun Han et.al. 2502.03321 null
2025-02-05 Intent Representation Learning with Large Language Model for Recommendation Yu Wang et.al. 2502.03307 link
2025-02-05 Harmony in Divergence: Towards Fast, Accurate, and Memory-efficient Zeroth-order LLM Fine-tuning Qitao Tan et.al. 2502.03304 null
2025-02-05 MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters Amin Dada et.al. 2502.03298 null
2025-02-05 SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs Ben Liu et.al. 2502.03283 null
2025-02-05 Posterior SBC: Simulation-Based Calibration Checking Conditional on Data Teemu Säilynoja et.al. 2502.03279 link
2025-02-05 Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning DiJia Su et.al. 2502.03275 null
2025-02-05 ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models Ying Zhang et.al. 2502.03266 link
2025-02-05 General Time-series Model for Universal Knowledge Representation of Multivariate Time-Series data Cheng He et.al. 2502.03264 null
2025-02-05 CARROT: A Cost Aware Rate Optimal Router Seamus Somerstep et.al. 2502.03261 null
2025-02-05 RiemannGFM: Learning a Graph Foundation Model from Riemannian Geometry Li Sun et.al. 2502.03251 null
2025-02-05 Exploring the Security Threats of Knowledge Base Poisoning in Retrieval-Augmented Code Generation Bo Lin et.al. 2502.03233 null
2025-02-05 Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models Jialiang Wu et.al. 2502.03199 null
2025-02-05 MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding Pengyi Li et.al. 2502.03183 null
2025-02-05 PICBench: Benchmarking LLMs for Photonic Integrated Circuits Design Yuchao Wu et.al. 2502.03159 null
2025-02-05 Strategizing with AI: Insights from a Beauty Contest Experiment Iuliia Alekseenko et.al. 2502.03158 null
2025-02-05 Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models Xumeng Wen et.al. 2502.03147 null
2025-02-05 Symmetry-Aware Bayesian Flow Networks for Crystal Generation Laura Ruple et.al. 2502.03146 null
2025-02-05 Teaching Large Language Models Number-Focused Headline Generation With Key Element Rationales Zhen Qian et.al. 2502.03129 null
2025-02-05 Metis: A Foundation Speech Generation Model with Masked Generative Pre-training Yuancheng Wang et.al. 2502.03128 link
2025-02-05 Structured Token Retention and Computational Memory Paths in Large Language Models Jonathan Delena et.al. 2502.03102 null
2025-02-05 Reveal the Mystery of DPO: The Connection between DPO and RL Algorithms Xuerui Su et.al. 2502.03095 null
2025-02-05 Implementing Large Quantum Boltzmann Machines as Generative AI Models for Dataset Balancing Salvatore Sinno et.al. 2502.03086 null
2025-02-05 IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates Aissatou Diallo et.al. 2502.03080 null
2025-02-05 Poisson Flow Joint Model for Multiphase contrast-enhanced CT Rongjun Ge et.al. 2502.03079 null
2025-02-05 Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation Nina Freise et.al. 2502.03078 null
2025-02-05 Optimizing Electric Vehicles Charging using Large Language Models and Graph Neural Networks Stavros Orfanoudakis et.al. 2502.03067 null
2025-02-05 Understanding and Enhancing the Transferability of Jailbreaking Attacks Runqi Lin et.al. 2502.03052 link
2025-02-05 RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts Tuan Truong et.al. 2502.03044 null
2025-02-05 Large Language Models Are Universal Recommendation Learners Junguang Jiang et.al. 2502.03041 null
2025-02-05 FuXi- $α$ : Scaling Recommendation Model with Feature Interaction Enhanced Transformer Yufei Ye et.al. 2502.03036 null
2025-02-05 Knowledge Distillation from Large Language Models for Household Energy Modeling Mohannad Takrouri et.al. 2502.03034 null
2025-02-05 Analyze Feature Flow to Enhance Interpretation and Steering in Language Models Daniil Laptev et.al. 2502.03032 null
2025-02-05 Scaling Laws for Upcycling Mixture-of-Experts Language Models Seng Pei Liew et.al. 2502.03009 null
2025-02-05 MedBioLM: Optimizing Medical and Biological QA with Fine-Tuned Large Language Models and Retrieval-Augmented Generation Seonok Kim et.al. 2502.03004 null
2025-02-05 Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons Renjun Hu et.al. 2502.02988 null
2025-02-05 Membership Inference Attack Should Move On to Distributional Statistics for Distilled Generative Models Muxing Li et.al. 2502.02970 null
2025-02-05 The Labeled Coupon Collector Problem with Random Sample Sizes and Partial Recovery Shoham Shimon Berrebi et.al. 2502.02968 null
2025-02-05 Large Language Model Adversarial Landscape Through the Lens of Attack Objectives Nan Wang et.al. 2502.02960 null
2025-02-05 Position: Editing Large Language Models Poses Serious Safety Risks Paul Youssef et.al. 2502.02958 null
2025-02-05 Control Search Rankings, Control the World: What is a Good Search Engine? Simon Coghlan et.al. 2502.02957 null
2025-02-05 LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction Ziwei Wang et.al. 2502.02945 null
2025-02-05 Large Language Model Guided Self-Debugging Code Generation Muntasir Adnan et.al. 2502.02928 null
2025-02-05 SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs Dinithi Jayasuriya et.al. 2502.02909 null
2025-02-05 AI-driven materials design: a mini-review Mouyang Cheng et.al. 2502.02905 null
2025-02-05 A Benchmark for the Detection of Metalinguistic Disagreements between LLMs and Knowledge Graphs Bradley P. Allen et.al. 2502.02896 null
2025-02-05 Lowering the Barrier of Machine Learning: Achieving Zero Manual Labeling in Review Classification Using LLMs Yejian Zhang et.al. 2502.02893 null
2025-02-05 Expertized Caption Auto-Enhancement for Video-Text Retrieval Junxiang Chen et.al. 2502.02885 null
2025-02-05 SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions Xiaofan Yu et.al. 2502.02883 null
2025-02-05 Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning Yibo Yan et.al. 2502.02871 null
2025-02-05 A Systematic Approach for Assessing Large Language Models’ Test Case Generation Capability Hung-Fu Chang et.al. 2502.02866 null
2025-02-05 OceanChat: The Effect of Virtual Conversational AI Agents on Sustainable Attitude and Behavior Change Pat Pataranutaporn et.al. 2502.02863 null
2025-02-05 A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges Lei Ding et.al. 2502.02835 null
2025-02-05 COFFE: A Code Efficiency Benchmark for Code Generation Yun Peng et.al. 2502.02827 link
2025-02-05 Accessible and Portable LLM Inference by Compiling Computational Graphs into SQL Wenbo Sun et.al. 2502.02818 null
2025-02-05 Mol-LLM: Generalist Molecular LLM with Improved Graph Utilization Chanhui Lee et.al. 2502.02810 null
2025-02-05 CAMI: A Counselor Agent Supporting Motivational Interviewing through State Inference and Topic Exploration Yizhe Yang et.al. 2502.02807 null
2025-02-05 Leveraging the true depth of LLMs Ramón Calvo González et.al. 2502.02790 null
2025-02-05 Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation Jingyu Liu et.al. 2502.02789 link
2025-02-05 SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models Amirhossein Dabiriaghdam et.al. 2502.02787 link
2025-02-04 Classroom Simulacra: Building Contextual Student Generative Agents in Online Education for Learning Behavioral Simulation Songlin Xu et.al. 2502.02780 link
2025-02-04 3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography Weicheng Zhu et.al. 2502.02779 null
2025-02-04 Twilight: Adaptive Attention Sparsity with Hierarchical Top- $p$ Pruning Chaofan Lin et.al. 2502.02770 null
2025-02-04 LLM-USO: Large Language Model-based Universal Sizing Optimizer Karthik Somayaji N. S et.al. 2502.02764 null
2025-02-04 Rethinking Vision Transformer for Object Centric Foundation Models Manuel Traub et.al. 2502.02763 null
2025-02-04 Too Noisy To Learn: Enhancing Data Quality for Code Review C Chunhua Liu et.al. 2502.02757 null
2025-02-04 PatchPilot: A Stable and Cost-Efficient Agentic Patching Framework Hongwei Li et.al. 2502.02747 null
2025-02-04 LLM Bandit: Cost-Efficient LLM Generation via Preference-Conditioned Dynamic Routing Yang Li et.al. 2502.02743 null
2025-02-04 RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2 Bin Xie et.al. 2502.02741 null
2025-02-04 SmolLM2: When Smol Goes Big – Data-Centric Training of a Small Language Model Loubna Ben Allal et.al. 2502.02737 null
2025-02-04 Peri-LN: Revisiting Layer Normalization in the Transformer Architecture Jeonghoon Kim et.al. 2502.02732 null
2025-02-04 Cross-Lingual Transfer for Low-Resource Natural Language Processing Iker García-Ferrero et.al. 2502.02722 null
2025-02-04 Astromer 2 Cristobal Donoso-Oliva et.al. 2502.02717 null
2025-02-04 A Unified Understanding and Evaluation of Steering Methods Shawn Im et.al. 2502.02716 null
2025-02-04 An Analysis of LLM Fine-Tuning and Few-Shot Learning for Flaky Test Detection and Classification Riddhi More et.al. 2502.02715 null
2025-02-04 Exploring LLMs Impact on Student-Created User Stories and Acceptance Testing in Software Development Allan Brockenbrough et.al. 2502.02675 null
2025-02-04 MedRAX: Medical Reasoning Agent for Chest X-ray Adibvafa Fallahpour et.al. 2502.02673 link
2025-02-04 Transformers Boost the Performance of Decision Trees on Tabular Data across Sample Sizes Mayuka Jayawardhana et.al. 2502.02672 null
2025-02-04 Machine-learning approaches to accelerating lattice simulations Scott Lawrence et.al. 2502.02670 null
2025-02-04 A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI) Yan Li et.al. 2502.02659 link
2025-02-04 Introducing the Rhea simulations of Milky-Way-like galaxies I: Effect of gravitational potential on morphology and star formation Junia Göller et.al. 2502.02646 null
2025-02-04 COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Xueqing Deng et.al. 2502.02589 null
2025-02-04 Open Materials Generation with Stochastic Interpolants Philipp Hoellmer et.al. 2502.02582 null
2025-02-04 A comparison of translation performance between DeepL and Supertext Alex Flückiger et.al. 2502.02577 link
2025-02-04 Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement Soheil Abbasloo et.al. 2502.02573 null
2025-02-04 Learning the RoPEs: Better 2D and 3D Position Encodings with STRING Connor Schenck et.al. 2502.02562 null
2025-02-04 Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation Junha Lee et.al. 2502.02548 null
2025-02-04 LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World Shrikara Arun et.al. 2502.02539 null
2025-02-04 Adaptive Self-improvement LLM Agentic System for ML Library Development Genghan Zhang et.al. 2502.02534 link
2025-02-04 Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Han Zhou et.al. 2502.02533 null
2025-02-04 Generative Modeling on Lie Groups via Euclidean Generalized Score Matching Marco Bertolini et.al. 2502.02513 null
2025-02-04 Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Maohao Shen et.al. 2502.02508 null
2025-02-04 Learning to generate physical ocean states: Towards hybrid climate modeling Etienne Meunier et.al. 2502.02499 null
2025-02-04 EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization Yize Wu et.al. 2502.02493 null
2025-02-04 Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Menglong Cui et.al. 2502.02481 null
2025-02-04 Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography Emir Ahmed et.al. 2502.02475 null
2025-02-04 Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification Valentina Vadori et.al. 2502.02471 link
2025-02-04 SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency Qianhao Yuan et.al. 2502.02458 null
2025-02-04 Personalization Toolkit: Training Free Personalization of Large Vision Language Models Soroush Seifi et.al. 2502.02452 null
2025-02-04 Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study Calvin Yixiang Cheng et.al. 2502.02451 link
2025-02-04 Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models Haoran Ye et.al. 2502.02444 null
2025-02-04 LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models Jiangong Chen et.al. 2502.02441 link
2025-02-04 Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment Yaling Shen et.al. 2502.02438 null
2025-02-04 TransformDAS: Mapping Φ-OTDR Signals to Riemannian Manifold for Robust Classification Jiaju Kang et.al. 2502.02428 null
2025-02-04 Activation-Informed Merging of Large Language Models Amin Heyrani Nobari et.al. 2502.02421 link
2025-02-04 Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling Markus Krimmel et.al. 2502.02415 link
2025-02-04 AI-Powered, But Power-Hungry? Energy Efficiency of LLM-Generated Code Lola Solovyeva et.al. 2502.02412 null
2025-02-04 Avoiding spurious sharpness minimization broadens applicability of SAM Sidak Pal Singh et.al. 2502.02407 null
2025-02-04 LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models Tzu-Tao Chang et.al. 2502.02406 null
2025-02-04 CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning Jianfeng Pan et.al. 2502.02390 null
2025-02-04 Hypergraph Link Prediction via Hyperedge Copying Xie He et.al. 2502.02386 link
2025-02-04 STAIR: Improving Safety Alignment with Introspective Reasoning Yichi Zhang et.al. 2502.02384 link
2025-02-04 Evaluating the Effectiveness of LLMs in Fixing Maintainability Issues in Real-World Projects Henrique Nunes et.al. 2502.02368 null
2025-02-04 Field Matching: an Electrostatic Paradigm to Generate and Transfer Data Alexander Kolesov et.al. 2502.02367 null
2025-02-04 Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs Sagnik Mukherjee et.al. 2502.02362 null
2025-02-04 SHIELD: APT Detection and Intelligent Explanation Using LLM Parth Atulbhai Gandhi et.al. 2502.02342 null
2025-02-04 Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking Jinyang Wu et.al. 2502.02339 null
2025-02-04 ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs Yuan Tian et.al. 2502.02329 null
2025-02-04 Information-Theoretic Proofs for Diffusion Sampling Galen Reeves et.al. 2502.02305 null
2025-02-04 Density Ratio Estimation with Conditional Probability Paths Hanlin Yu et.al. 2502.02300 null
2025-02-04 Evalita-LLM: Benchmarking Large Language Models on Italian Bernardo Magnini et.al. 2502.02289 null
2025-02-04 Adaptive Resource Allocation Optimization Using Large Language Models in Dynamic Wireless Environments Hyeonho Noh et.al. 2502.02287 null
2025-02-04 Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation Atharva Mangeshkumar Agrawal et.al. 2502.02249 null
2025-02-04 Flatten Graphs as Sequences: Transformers are Scalable Graph Generators Dexiong Chen et.al. 2502.02216 null
2025-02-04 When Dimensionality Hurts: The Role of LLM Embedding Compression for Noisy Regression Tasks Felix Drinkall et.al. 2502.02199 link
2025-02-04 Large language models in climate and sustainability policy: limits and opportunities Francesca Larosa et.al. 2502.02191 null
2025-02-04 ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion Nissim Maruani et.al. 2502.02187 null
2025-02-04 Generative Kernel Spectral Clustering David Winant et.al. 2502.02185 null
2025-02-04 Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge Daniel Tamayo et.al. 2502.02173 link
2025-02-04 EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues Rohit Girmaji et.al. 2502.02172 null
2025-02-04 Risk-Aware Driving Scenario Analysis with Large Language Models Yuan Gao et.al. 2502.02145 link
2025-02-04 IPO: Iterative Preference Optimization for Text-to-Video Generation Xiaomeng Yang et.al. 2502.02088 null
2025-02-04 Position Paper: Building Trust in Synthetic Data for Clinical AI Krishan Agyakari Raja Babu et.al. 2502.02076 null
2025-02-04 Rethinking stance detection: A theoretically-informed research agenda for user-level inference using language models Prasanta Bhattacharya et.al. 2502.02074 null
2025-02-04 ASCenD-BDS: Adaptable, Stochastic and Context-aware framework for Detection of Bias, Discrimination and Stereotyping Rajiv Bahl et.al. 2502.02072 null
2025-02-04 Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign Ruisi Zhang et.al. 2502.02068 null
2025-02-04 AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement Shivam Singh et.al. 2502.02067 link
2025-02-04 Anticipate & Act : Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments Raghav Arora et.al. 2502.02066 null
2025-02-04 CASIM: Composite Aware Semantic Injection for Text to Motion Generation Che-Jui Chang et.al. 2502.02063 null
2025-02-04 Large Language Models for Recommendation with Deliberative User Preference Alignment Yi Fang et.al. 2502.02061 null
2025-02-04 Efficient Domain Adaptation of Multimodal Embeddings using Constrastive Learning Georgios Margaritis et.al. 2502.02048 null
2025-02-04 Contextual Memory Reweaving in Large Language Models Using Layered Latent State Reconstruction Frederick Dillon et.al. 2502.02046 null
2025-02-04 M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference Nikhil Bhendawade et.al. 2502.02040 null
2025-02-04 ContinuouSP: Generative Model for Crystal Structure Prediction with Invariance and Continuity Yuji Tone et.al. 2502.02026 link
2025-02-04 From Accidents to Insights: Leveraging Multimodal Data for Scenario-Driven ADS Testing Siwei Luo et.al. 2502.02025 null
2025-02-04 ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling Yi-Chiao Wu et.al. 2502.02019 null
2025-02-04 Multi-Domain Graph Foundation Models: Robust Knowledge Transfer via Topology Alignment Shuo Wang et.al. 2502.02017 null
2025-02-04 A Periodic Bayesian Flow for Material Generation Hanlin Wu et.al. 2502.02016 link
2025-02-04 Layer by Layer: Uncovering Hidden Representations in Language Models Oscar Skean et.al. 2502.02013 null
2025-02-04 LLMSecConfig: An LLM-Based Approach for Fixing Software Container Misconfigurations Ziyang Ye et.al. 2502.02009 null
2025-02-04 Reasoning Bias of Next Token Prediction Training Pengxiao Lin et.al. 2502.02007 null
2025-02-04 FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024 Arnav Grover et.al. 2502.01992 null
2025-02-04 Can LLMs Assist Annotators in Identifying Morality Frames? – Case Study on Vaccination Debate on Social Media Tunazzina Islam et.al. 2502.01991 null
2025-02-04 Generative Data Mining with Longtail-Guided Diffusion David S. Hayden et.al. 2502.01980 null
2025-02-04 Gradient-Regularized Latent Space Modulation in Large Language Models for Structured Contextual Synthesis Derek Yotheringhay et.al. 2502.01979 null
2025-02-04 AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs Hongxin Li et.al. 2502.01977 null
2025-02-04 CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing Wenhao Zheng et.al. 2502.01976 null
2025-02-04 Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning Jinlong Pang et.al. 2502.01968 null
2025-02-04 MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving Shiju Zhao et.al. 2502.01960 null
2025-02-04 Local minima of the empirical risk in high dimension: General theorems and convex examples Kiana Asgari et.al. 2502.01953 null
2025-02-04 DAMO: Data- and Model-aware Alignment of Multi-modal LLMs Jinda Lu et.al. 2502.01943 null
2025-02-04 Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Xiang Liu et.al. 2502.01941 null
2025-02-04 Toward a Low-Cost Perception System in Autonomous Vehicles: A Spectrum Learning Approach Mohammed Alsakabi et.al. 2502.01940 null
2025-02-04 Distributionally Robust Direct Preference Optimization Zaiyan Xu et.al. 2502.01930 null
2025-02-04 PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling Avery Ma et.al. 2502.01925 null
2025-02-04 LAST SToP For Modeling Asynchronous Time Series Shubham Gupta et.al. 2502.01922 null
2025-02-04 Anomaly Detection via Autoencoder Composite Features and NCE Yalin Liao et.al. 2502.01920 null
2025-02-04 Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales Arian Eamaz et.al. 2502.01908 null
2025-02-04 Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models Chia-Wen Kuo et.al. 2502.01906 null
2025-02-04 Conceptual Metaphor Theory as a Prompting Paradigm for Large Language Models Oliver Kramer et.al. 2502.01901 null
2025-02-03 Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement Ziad Shaker et.al. 2502.01882 null
2025-02-03 SE Arena: Benchmarking Software Engineering Chatbots with Iterative Interactions Zhimin Zhao et.al. 2502.01860 null
2025-02-03 Security and Quality in LLM-Generated Code: A Multi-Language, Multi-Model Analysis Mohammed Kharma et.al. 2502.01853 null
2025-02-03 Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting Keyi Zhu et.al. 2502.01850 link
2025-02-03 Relatively-Secure LLM-Based Steganography via Constrained Markov Decision Processes Yu-Shin Huang et.al. 2502.01827 link
2025-02-03 Agentic Bug Reproduction for Effective Automated Program Repair at Google Runxiang Cheng et.al. 2502.01821 null
2025-02-03 Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning Hanyang Zhao et.al. 2502.01819 null
2025-02-03 SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models Diyana Muhammed et.al. 2502.01812 null
2025-02-03 Toward Neurosymbolic Program Comprehension Alejandro Velasco et.al. 2502.01806 null
2025-02-03 Discovering Chunks in Neural Embeddings for Interpretability Shuchen Wu et.al. 2502.01803 null
2025-02-03 Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale Elisa Tsai et.al. 2502.01798 link
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 link
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 link
2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 link
2025-01-31 Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models Wenzhi Fang et.al. 2501.19389 link
2025-02-03 SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions Dominik Wagner et.al. 2501.19377 null
2025-01-31 Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions Sören Christensen et.al. 2501.19373 null
2025-01-31 We’re Different, We’re the Same: Creative Homogeneity Across LLMs Emily Wenger et.al. 2501.19361 null
2025-01-31 Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies Brandon P. Chelstrom et.al. 2501.19359 null
2025-01-31 The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking Yuchun Miao et.al. 2501.19358 null
2025-01-31 Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters Adrián Juan-Delgado et.al. 2501.19356 null
2025-01-31 Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 Ting-Yao E. Hsu et.al. 2501.19353 null
2025-01-31 Towards Adaptive Self-Improvement for Smarter Energy Systems Alexander Sommer et.al. 2501.19340 null
2025-01-31 PixelWorld: Towards Perceiving Everything as Pixels Zhiheng Lyu et.al. 2501.19339 null
2025-01-31 Homogeneity Bias as Differential Sampling Uncertainty in Language Models Messi H. J. Lee et.al. 2501.19337 null
2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 LLM-based Affective Text Generation Quality Based on Different Quantization Values Yarik Menchaca Resendiz et.al. 2501.19317 null
2025-01-31 Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment Gregor Bachmann et.al. 2501.19309 null
2025-02-03 SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling Jiefeng Chen et.al. 2501.19306 null
2025-01-31 Beyond checkmate: exploring the creative chokepoints in AI text Nafis Irtiza Tripto et.al. 2501.19301 link
2025-01-31 Offline Learning for Combinatorial Multi-armed Bandits Xutong Liu et.al. 2501.19300 null
2025-01-31 Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes Zhiyao Xu et.al. 2501.19298 null
2025-01-31 Analysis of LLMs vs Human Experts in Requirements Engineering Cory Hymel et.al. 2501.19297 null
2025-01-31 Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators Kunpeng Zhang et.al. 2501.19282 null
2025-01-31 Pheromone-based Learning of Optimal Reasoning Paths Anirudh Chari et.al. 2501.19278 null
2025-01-31 From Assistance to Autonomy – A Researcher Study on the Potential of AI Support for Qualitative Data Analysis Elisabeth Kirsten et.al. 2501.19275 null
2025-01-31 Jackpot! Alignment as a Maximal Lottery Roberto-Rafael Maura-Rivero et.al. 2501.19266 null
2025-01-31 Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge Amogh Joshi et.al. 2501.19259 null
2025-01-31 A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation Yunzhe Li et.al. 2501.19232 null
2025-01-31 Autonomous Legacy Web Application Upgrades Using a Multi-Agent System Valtteri Ala-Salmi et.al. 2501.19204 link
2025-02-03 Improving the Robustness of Representation Misdirection for Large Language Model Unlearning Dang Huu-Tien et.al. 2501.19202 link
2025-01-31 Efficient Reasoning with Hidden Thinking Xuan Shen et.al. 2501.19201 link
2025-01-31 Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning Xianglin Yang et.al. 2501.19180 null
2025-01-31 No Foundations without Foundations – Why semi-mechanistic models are essential for regulatory biology Luka Kovačević et.al. 2501.19178 null
2025-01-31 Position: Contextual Integrity Washing for Language Models Yan Shvartzshnaider et.al. 2501.19173 null
2025-01-31 Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs Kejia Zhang et.al. 2501.19164 null
2025-01-31 A theoretical framework for overfitting in energy-based modeling Giovanni Catania et.al. 2501.19158 null
2025-01-31 A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator Sixiao Huang et.al. 2501.19135 null
2025-01-31 Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations Sihwan Park et.al. 2501.19099 null
2025-01-31 Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data Xichen Xu et.al. 2501.19094 null
2025-01-31 Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models Jialin Zhao et.al. 2501.19090 null
2025-01-31 Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification Xiangyu Sun et.al. 2501.19086 null
2025-01-31 Enhancing Code Generation for Low-Resource Languages: No Silver Bullet Alessandro Giagnorio et.al. 2501.19085 null
2025-01-31 Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations Dahye Kim et.al. 2501.19066 link
2025-01-31 TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs Yan Sun et.al. 2501.19057 null
2025-01-31 Enabling Autonomic Microservice Management through Self-Learning Agents Fenglin Yu et.al. 2501.19056 null
2025-01-31 Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models Ruiyu Wang et.al. 2501.19054 null
2025-01-31 Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors Simon Idoko et.al. 2501.19042 link
2025-01-31 Towards the Worst-case Robustness of Large Language Models Huanran Chen et.al. 2501.19040 null
2025-01-31 Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs Hongliang Li et.al. 2501.19036 null
2025-01-31 XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses Bo Lan et.al. 2501.19034 link
2025-01-31 Multilayer Networks in Neuroimaging Vesna Vuksanovic et.al. 2501.19024 null
2025-01-31 Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation Bin Zhu et.al. 2501.19017 null
2025-01-31 Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities Arjun Krishna et.al. 2501.19012 null
2025-01-31 Visual Autoregressive Modeling for Image Super-Resolution Yunpeng Qu et.al. 2501.18993 link
2025-01-31 Symmetric Pruning of Large Language Models Kai Yi et.al. 2501.18980 null
2025-01-31 BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics Yuxuan Liu et.al. 2501.18972 null
2025-01-31 Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping Pu Yang et.al. 2501.18962 link
2025-01-31 Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow Alfred Bexley et.al. 2501.18957 null
2025-01-31 LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models Shenghao Fu et.al. 2501.18954 link
2025-01-31 TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment Zi-Jian Cheng et.al. 2501.18935 link
2025-01-31 Language Games as the Pathway to Artificial Superhuman Intelligence Ying Wen et.al. 2501.18924 null
2025-01-31 KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search Haoran Luo et.al. 2501.18922 link
2025-01-31 LLM Program Optimization via Retrieval Augmented Search Sagnik Anupam et.al. 2501.18916 null
2025-01-31 Scaling Laws for Differentially Private Language Models Ryan McKenna et.al. 2501.18914 null
2025-01-31 Streamlining Security Vulnerability Triage with Large Language Models Mohammad Jalili Torkamani et.al. 2501.18908 null
2025-01-31 Trustworthy Evaluation of Generative AI Models Zijun Gao et.al. 2501.18897 null
2025-01-31 Can We Predict the Effect of Prompts? Jae Yong Lee et.al. 2501.18883 null
2025-01-31 Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models Jiaqi Tang et.al. 2501.18863 null
2025-01-31 BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning Han Zhong et.al. 2501.18858 null
2025-01-31 Equivariant Hypergraph Diffusion for Crystal Structure Prediction Yang Liu et.al. 2501.18850 null
2025-01-31 Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities Yaping Chai et.al. 2501.18845 null
2025-01-31 Trading Inference-Time Compute for Adversarial Robustness Wojciech Zaremba et.al. 2501.18841 null
2025-01-31 Partially Rewriting a Transformer in Natural Language Gonçalo Paulo et.al. 2501.18838 link
2025-01-31 Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Mrinank Sharma et.al. 2501.18837 null
2025-01-31 Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential Chenyu Gao et.al. 2501.18834 null
2025-01-31 Structural Embedding Projection for Contextual Large Language Model Inference Vincent Enoasmo et.al. 2501.18826 null
2025-01-31 Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies Andrey Borro et.al. 2501.18817 link
2025-01-31 Large Language Models as Common-Sense Heuristics Andrey Borro et.al. 2501.18816 null
2025-01-30 Compositional Generalization Requires More Than Disentangled Representations Qiyao Liang et.al. 2501.18797 null
2025-01-30 Rope to Nope and Back Again: A New Hybrid Attention Strategy Bowen Yang et.al. 2501.18795 null
2025-01-30 Survey and Improvement Strategies for Gene Prioritization with Large Language Models Matthew Neeley et.al. 2501.18794 null
2025-01-30 LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore? Alexander Tuisov et.al. 2501.18784 null
2025-01-30 Navigating the Fragrance space Via Graph Generative Models And Predicting Odors Mrityunjay Sharma et.al. 2501.18777 link
2025-01-30 Probabilistic Joint Recovery Method for CO $_2$ Plume Monitoring Zijun Deng et.al. 2501.18761 null
2025-01-30 Synthetic Data Generation for Augmenting Small Samples Dan Liu et.al. 2501.18741 null
2025-01-30 Examining the Robustness of Large Language Models across Language Complexity Jiayi Zhang et.al. 2501.18738 null
2025-01-30 Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks Mohd. Farhan Israk Soumik et.al. 2501.18727 null
2025-01-30 Strong and Controllable 3D Motion Generation Canxuan Gang et.al. 2501.18726 null
2025-01-30 Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning Maya Kruse et.al. 2501.18724 null
2025-02-03 Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps Devansh Bhardwaj et.al. 2501.18712 null
2025-01-30 Regularized second-order optimization of tensor-network Born machines Matan Ben-Dov et.al. 2501.18691 null
2025-01-30 Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting Yansong Qu et.al. 2501.18672 null
2025-01-30 Foundational Models for 3D Point Clouds: A Survey and Outlook Vishal Thengane et.al. 2501.18594 null
2025-01-30 Diffusion Autoencoders are Scalable Image Tokenizers Yinbo Chen et.al. 2501.18593 null
2025-02-03 Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Hao Dong et.al. 2501.18592 link
2025-01-30 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Yue Wang et.al. 2501.18585 null
2025-01-30 Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH Evgenii Evstafev et.al. 2501.18576 null
2025-01-30 BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos Lehao Lin et.al. 2501.18565 null
2025-01-30 SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Haoquan Fang et.al. 2501.18564 link
2025-01-30 Semantic Web and Creative AI – A Technical Report from ISWS 2023 Raia Abu Ahmad et.al. 2501.18542 null
2025-01-30 Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges Manveer Singh Tamber et.al. 2501.18536 link
2025-01-30 Differentially Private Steering for Large Language Model Alignment Anmol Goel et.al. 2501.18532 link
2025-01-30 Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models Guanqun Cao et.al. 2501.18516 null
2025-01-30 Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Arthur Douillard et.al. 2501.18512 null
2025-01-30 WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Benjamin Feuer et.al. 2501.18511 link
2025-01-30 CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction Peter J. Bentley et.al. 2501.18504 null
2025-01-30 Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline Shivani Kapania et.al. 2501.18493 null
2025-01-30 A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models Changshu Liu et.al. 2501.18482 null
2025-01-30 CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization Yanxia Deng et.al. 2501.18475 null
2025-01-30 Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations Chengxi Zeng et.al. 2501.18474 null
2025-01-30 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation Minghua He et.al. 2501.18460 null
2025-01-30 CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering Yumeng Wang et.al. 2501.18457 null
2025-01-30 GENIE: Generative Note Information Extraction model for structuring EHR data Huaiyuan Ying et.al. 2501.18435 null
2025-01-30 Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation Youngjoon Lee et.al. 2501.18416 null
2025-01-30 RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects Yiteng Tu et.al. 2501.18365 link
2025-01-30 A Video-grounded Dialogue Dataset and Metric for Event-driven Activities Wiradee Imrattanatrai et.al. 2501.18324 link
2025-01-30 Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach Tianpeng Pan et.al. 2501.18320 null
2025-01-30 Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models Jennifer D’Souza et.al. 2501.18287 null
2025-01-30 Jailbreaking LLMs’ Safeguard with Universal Magic Words for Text Embedding Models Haoyu Liang et.al. 2501.18280 null
2025-01-30 Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence Kevin Roitero et.al. 2501.18265 null
2025-01-30 How to Select Datapoints for Efficient Human Evaluation of NLG Models? Vilém Zouhar et.al. 2501.18251 link
2025-01-30 Statistical multi-metric evaluation and visualization of LLM system predictive performance Samuel Ackerman et.al. 2501.18243 null
2025-01-30 Contextually Structured Token Dependency Encoding for Large Language Models James Blades et.al. 2501.18205 null
2025-01-30 Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents ShuiDe Wen et.al. 2501.18190 null
2025-01-30 Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation Teddy Lazebnik et.al. 2501.18177 null
2025-01-30 Continually Evolved Multimodal Foundation Models for Cancer Prognosis Jie Peng et.al. 2501.18170 null
2025-01-30 RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing Jinyao Guo et.al. 2501.18160 null
2025-01-30 Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study Yuchen Lei et.al. 2501.18158 null
2025-01-30 Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models Wanlong Liu et.al. 2501.18154 null
2025-01-30 Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Qika Lin et.al. 2501.18119 null
2025-01-30 Scaling Inference-Efficient Language Models Song Bian et.al. 2501.18107 null
2025-01-30 Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation Yibo Wang et.al. 2501.18100 link
2025-01-30 AlphaAdam:Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates Da Chang et.al. 2501.18094 null
2025-01-30 Normative Evaluation of Large Language Models with Everyday Moral Dilemmas Pratik S. Sachdeva et.al. 2501.18081 null
2025-01-30 FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models Spencer Mateega et.al. 2501.18062 null
2025-01-29 RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems Duy A. Nguyen et.al. 2501.18056 null
2025-01-29 Current Pathology Foundation Models are unrobust to Medical Center Differences Edwin D. de Jong et.al. 2501.18055 null
2025-01-29 A Proximal Operator for Inducing 2:4-Sparsity Jonas M Kübler et.al. 2501.18015 null
2025-01-29 Large Language Models Think Too Fast To Explore Effectively Lan Pan et.al. 2501.18009 null
2025-01-29 Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces Neetha Jambigi et.al. 2501.18005 null
2025-01-29 InnerThoughts: Disentangling Representations and Predictions in Large Language Models Didier Chételat et.al. 2501.17994 null
2025-01-29 Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study Marwah Alaofi et.al. 2501.17981 link
2025-01-29 Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization Zishun Yu et.al. 2501.17974 null
2025-01-29 “I Would Never Trust Anything Western”: Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools Manas Mhasakar et.al. 2501.17942 null
2025-01-29 DReSS: Data-driven Regularized Structured Streamlining for Large Language Models Mingkuan Feng et.al. 2501.17905 null
2025-01-29 Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning? Pouya Pezeshkpour et.al. 2501.17840 link
2025-01-29 Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology Sobhan Hemati et.al. 2501.17822 null
2025-01-30 Leveraging Multimodal LLM for Inspirational User Interface Search Seokhyeon Park et.al. 2501.17799 link
2025-01-29 BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights Chan-Jan Hsu et.al. 2501.17790 null
2025-01-29 AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing Peter Pak et.al. 2501.17784 null
2025-01-29 2SSP: A Two-Stage Framework for Structured Pruning of LLMs Fabrizio Sandri et.al. 2501.17771 link
2025-01-29 Generative Unordered Flow for Set-Structured Data Generation Yangming Li et.al. 2501.17770 null
2025-01-29 Hybrid Graphs for Table-and-Text based Question Answering using LLMs Ankush Agarwal et.al. 2501.17767 null
2025-01-29 On the Partitioning of GPU Power among Multi-Instances Tirth Vamja et.al. 2501.17752 null
2025-01-29 Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation Aitor Arrieta et.al. 2501.17749 null
2025-01-29 A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches Ana R. Baião et.al. 2501.17729 null
2025-01-29 Using Code Generation to Solve Open Instances of Combinatorial Design Problems Christopher D. Rosin et.al. 2501.17725 link
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 link
2025-01-29 Source-Channel Separation Theorems for Distortion Perception Coding Chao Tian et.al. 2501.17706 null
2025-01-29 Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching Xuzhe Dang et.al. 2501.17665 null
2025-01-30 In-Context Meta LoRA Generation Yihua Shao et.al. 2501.17635 null
2025-01-29 Uncertainty Quantification and Decomposition for LLM-based Recommendation Wonbin Kweon et.al. 2501.17630 link
2025-01-29 The Imitation Game According To Turing Sharon Temtsin et.al. 2501.17629 null
2025-01-29 Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment Jonathan Teel et.al. 2501.17617 null
2025-01-29 Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis Kunrong Li et.al. 2501.17598 null
2025-01-30 Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models Behraj Khan et.al. 2501.17595 null
2025-01-29 GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback Mohamed Abdelaal et.al. 2501.17584 null
2025-01-29 CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs Amey Hengle et.al. 2501.17581 null
2025-01-29 Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding Marco Pasini et.al. 2501.17578 null
2025-01-29 Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models Wooyoung Kim et.al. 2501.17549 null
2025-01-29 Towards Training-Free Open-World Classification with 3D Generative Models Xinzhe Xia et.al. 2501.17547 null
2025-01-29 Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant Gaole He et.al. 2501.17546 link
2025-01-29 Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison Martin Nizon-Deladoeuille et.al. 2501.17539 null
2025-01-29 Neural Spelling: A Spell-Based BCI System for Language Neural Decoding Xiaowei Jiang et.al. 2501.17489 null
2025-01-29 DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance Seffi Cohen et.al. 2501.17479 link
2025-01-29 AugmenTest: Enhancing Tests with LLM-Driven Oracles Shaker Mahmud Khandaker et.al. 2501.17461 null
2025-01-29 Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction Kaiwei Luo et.al. 2501.17459 null
2025-01-29 Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Tiansheng Huang et.al. 2501.17433 link
2025-01-29 Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models Yuxuan Li et.al. 2501.17420 null
2025-01-29 MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs Ved Sirdeshmukh et.al. 2501.17399 link
2025-01-29 Learning Free Token Reduction for Multi-Modal LLM Zihui Zhao et.al. 2501.17391 null
2025-01-29 Context-Aware Semantic Recomposition Mechanism for Large Language Models Richard Katrix et.al. 2501.17386 null
2025-01-28 Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations Md Tauhidul Islam et.al. 2501.17347 null
2025-01-28 Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction Mingyu Derek Ma et.al. 2501.17326 null
2025-01-28 CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data Lee Carlin et.al. 2501.17324 null
2025-01-30 Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding Yun-Shiuan Chuang et.al. 2501.17310 null
2025-01-28 “Ownership, Not Just Happy Talk”: Co-Designing a Participatory Large Language Model for Journalism Emily Tseng et.al. 2501.17299 null
2025-01-28 Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization Zilu Tang et.al. 2501.17295 null
2025-01-28 Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology Peilong Wang et.al. 2501.17286 null
2025-01-30 From Natural Language to Extensive-Form Game Representations Shilong Deng et.al. 2501.17282 link
2025-01-28 Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models Abdalaziz Al-Maeeni et.al. 2501.17279 null
2025-01-28 Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics Jasper Timm et.al. 2501.17273 link
2025-01-28 Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care Fengpei Yuan et.al. 2501.17206 null
2025-01-28 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Tianzhe Chu et.al. 2501.17161 null
2025-01-28 FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data Deren Lei et.al. 2501.17144 link
2025-01-28 ASTRAL: Automated Safety Testing of Large Language Models Miriam Ugarte et.al. 2501.17132 null
2025-01-28 Optimizing Large Language Model Training Using FP4 Quantization Ruizhe Wang et.al. 2501.17116 null
2025-01-28 Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction Carl-Leander Henneking et.al. 2501.17112 null
2025-01-28 Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics Guillaume Le Mailloux et.al. 2501.17107 link
2025-01-28 Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving Evgenii Evstafev et.al. 2501.17084 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models Minghan Li et.al. 2501.17039 null
2025-01-28 Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Manojkumar Parmar et.al. 2501.17030 null
2025-01-28 Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs Alessandro Midolo et.al. 2501.17024 link
2025-01-28 Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement Kei Katsumata et.al. 2501.17022 link
2025-01-28 MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition Philippe Pasquier et.al. 2501.17011 null
2025-01-28 Large Language Models for Code Generation: The Practitioners Perspective Zeeshan Rasheed et.al. 2501.16998 link
2025-01-28 Artificial Intelligence Clones Annie Liang et.al. 2501.16996 null
2025-01-28 FedEFM: Federated Endovascular Foundation Model with Unseen Data Tuong Do et.al. 2501.16992 null
2025-01-28 Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver Shunya Minami et.al. 2501.16986 null
2025-01-28 Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Hongzhi Huang et.al. 2501.16975 null
2025-01-28 Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers Mohammad Raza et.al. 2501.16961 null
2025-01-28 Multiple Abstraction Level Retrieve Augment Generation Zheng Zheng et.al. 2501.16952 null
2025-01-29 TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Makoto Shing et.al. 2501.16937 null
2025-01-28 Detecting harassment and defamation in cyberbullying with emotion-adaptive training Peiling Yi et.al. 2501.16925 link
2025-01-28 RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains Shady Nasrat et.al. 2501.16899 link
2025-01-28 Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model Eoghan Cronin et.al. 2501.16893 link
2025-01-28 Irony Detection, Reasoning and Understanding in Zero-shot Learning Peiling Yi et.al. 2501.16884 null
2025-01-28 Comparing Human and LLM Generated Code: The Jury is Still Out! Sherlock A. Licorish et.al. 2501.16857 null
2025-01-28 Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis Tiao Tan et.al. 2501.16842 null
2025-01-28 Misspellings in Natural Language Processing: A survey Gianluca Sperduti et.al. 2501.16836 null
2025-01-28 DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model Josua Spisak et.al. 2501.16800 null
2025-01-28 Algorithm for Automatic Legislative Text Consolidation Matias Etcheverry et.al. 2501.16794 null
2025-01-28 Exponential Family Attention Kevin Christian Wibisono et.al. 2501.16790 link
2025-01-28 Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding Yun Li et.al. 2501.16786 null
2025-01-28 TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network Yumingzhi Pan et.al. 2501.16784 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-29 Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models Muhammad Atta ur Rahman et.al. 2501.16769 null
2025-01-28 DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Chenguo Lin et.al. 2501.16764 null
2025-01-28 HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns Xinyue Shen et.al. 2501.16750 link
2025-01-28 Through the Prism of Culture: Evaluating LLMs’ Understanding of Indian Subcultures and Traditions Garima Chhikara et.al. 2501.16748 null
2025-01-28 LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience Nimesh Jha et.al. 2501.16744 null
2025-01-28 Distilling Large Language Models for Network Active Queue Management Deol Satish et.al. 2501.16734 null
2025-01-28 xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking Sunbowen Lee et.al. 2501.16727 link
2025-01-28 One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning Chunpeng Zhou et.al. 2501.16720 null
2025-01-28 Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection Hengzhuang Li et.al. 2501.16718 link
2025-01-28 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Yueen Ma et.al. 2501.16698 null
2025-01-28 MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark Dongyi Yi et.al. 2501.16688 null
2025-01-28 Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting Li Yin et.al. 2501.16673 link
2025-01-28 VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records Philip Chung et.al. 2501.16672 link
2025-01-28 Contextual Reinforcement in Multimodal Token Compression for Large Language Models Naderdel Piero et.al. 2501.16658 null
2025-01-28 Large Language Model Critics for Execution-Free Evaluation of Code Changes Aashish Yadavally et.al. 2501.16655 link
2025-01-28 Molecular-driven Foundation Model for Oncologic Pathology Anurag Vaidya et.al. 2501.16652 link
2025-01-28 DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models Zeping Min et.al. 2501.16650 null
2025-01-28 An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue Koji Inoue et.al. 2501.16643 null
2025-01-28 CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs Jinlan Fu et.al. 2501.16629 link
2025-01-28 Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems Baraa Hikal et.al. 2501.16616 null
2025-01-28 Sparse Autoencoders Trained on the Same Data Learn Different Features Gonçalo Paulo et.al. 2501.16615 null
2025-01-28 Fine-Tuned Language Models as Space Systems Controllers Enrico M. Zucchelli et.al. 2501.16588 null
2025-01-27 AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models Zheng Lian et.al. 2501.16566 null
2025-01-27 LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation Farzad Farhadzadeh et.al. 2501.16559 null
2025-01-27 Distributional Information Embedding: A Framework for Multi-bit Watermarking Haiyun He et.al. 2501.16558 null
2025-01-27 PackDiT: Joint Human Motion and Text Generation via Mutual Prompting Zhongyu Jiang et.al. 2501.16551 null
2025-01-27 PhysAnimator: Physics-Guided Generative Cartoon Animation Tianyi Xie et.al. 2501.16550 null
2025-01-27 Sample-Efficient Behavior Cloning Using General Domain Knowledge Feiyu Zhu et.al. 2501.16546 null
2025-01-27 Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees Piyush Gupta et.al. 2501.16539 null
2025-01-27 Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs Jean-Charles Noirot Ferrand et.al. 2501.16534 null
2025-01-27 A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain Jorge del Pozo Lérida et.al. 2501.16533 null
2025-01-27 Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction Atharva Naik et.al. 2501.16524 null
2025-01-27 How well can LLMs Grade Essays in Arabic? Rayed Ghazawi et.al. 2501.16516 null
2025-01-27 Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models Sudarshan Kamath Barkur et.al. 2501.16513 null
2025-01-27 Smoothed Embeddings for Robust Language Models Ryo Hase et.al. 2501.16497 null
2025-01-27 Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations Pablo Valenzuela-Toledo et.al. 2501.16495 null
2025-01-27 Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM Payal Kamboj et.al. 2501.16481 link
2025-01-27 Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Philip Hughes et.al. 2501.16467 null
2025-01-27 CoCoNUT: Structural Code Understanding does not fall out of a tree Claas Beger et.al. 2501.16456 link
2025-01-27 Detecting Zero-Day Attacks in Digital Substations via In-Context Learning Faizan Manzoor et.al. 2501.16453 null
2025-01-27 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation Hamed Firooz et.al. 2501.16450 null
2025-01-27 DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Han Sun et.al. 2501.16410 null
2025-01-27 Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology Meiyun Cao et.al. 2501.16309 null
2025-01-27 RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval Long Nguyen et.al. 2501.16303 null
2025-01-27 Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width Zheng Liu et.al. 2501.16302 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-27 FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers Renshan Zhang et.al. 2501.16297 null
2025-01-27 Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models Jing Zhang et.al. 2501.16282 null
2025-01-27 Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation Jiayi Hong et.al. 2501.16277 link
2025-01-27 URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT Long Nguyen et.al. 2501.16276 null
2025-01-27 A foundation model for human-AI collaboration in medical literature mining Zifeng Wang et.al. 2501.16255 null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 null
2025-01-27 Zero-Shot Decision Tree Construction via Large Language Models Lucas Carrasco et.al. 2501.16247 null
2025-01-27 CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation Xiaochuan Ma et.al. 2501.16246 null
2025-01-27 Phase Transitions in Large Language Models and the $O(N)$ Model Youran Sun et.al. 2501.16241 null
2025-01-27 AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses Runze Cai et.al. 2501.16240 link
2025-01-28 Distilling foundation models for robust and efficient models in digital pathology Alexandre Filiot et.al. 2501.16239 null
2025-01-27 Language-Based Bayesian Optimization Research Assistant (BORA) Abdoulatif Cissé et.al. 2501.16224 null
2025-01-27 Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models Huayu Li et.al. 2501.16215 link
2025-01-27 Provence: efficient and robust context pruning for retrieval-augmented generation Nadezhda Chirkova et.al. 2501.16214 null
2025-01-27 Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs Antony Bartlett et.al. 2501.16191 null
2025-01-27 SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting Wenxuan Xie et.al. 2501.16178 link
2025-01-27 BAG: Body-Aligned 3D Wearable Asset Generation Zhongjin Luo et.al. 2501.16177 null
2025-01-27 Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma Richard Willis et.al. 2501.16173 link
2025-01-27 MetaDecorator: Generating Immersive Virtual Tours through Multimodality Shuang Xie et.al. 2501.16164 null
2025-01-27 CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge Yuwei Zhang et.al. 2501.16155 null
2025-01-27 AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought Xin Huang et.al. 2501.16154 null
2025-01-27 AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants Pascal J. Sager et.al. 2501.16150 null
2025-01-27 PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing Yuwei Zhang et.al. 2501.16149 null
2025-01-27 SampleLLM: Optimizing Tabular Data Synthesis in Recommendations Jingtong Gao et.al. 2501.16125 null
2025-01-27 Using Generative Models to Produce Realistic Populations of UK Windstorms Yee Chun Tsoi et.al. 2501.16110 null
2025-01-27 Integration of LLM Quality Assurance into an NLG System Ching-Yi Chen et.al. 2501.16078 null
2025-01-27 PISCO: Pretty Simple Compression for Retrieval-Augmented Generation Maxime Louis et.al. 2501.16075 null
2025-01-27 A generative material transformer using Wyckoff representation Pierre-Paul De Breuck et.al. 2501.16051 null
2025-01-27 Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation Xing Zhang et.al. 2501.16050 null
2025-01-27 PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment Vincent Freiberger et.al. 2501.16033 null
2025-01-27 FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments Zhiyuan Fu et.al. 2501.16029 null
2025-01-27 Transformability reveals the interplay of dynamics across different network orders Ming Xie et.al. 2501.16016 null
2025-01-27 TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference Jack Min Ong et.al. 2501.16007 null
2025-01-27 EDSep: An Effective Diffusion-Based Method for Speech Source Separation Jinwei Dong et.al. 2501.15965 null
2025-01-27 Rethinking the Bias of Foundation Model under Long-tailed Distribution Jiahao Chen et.al. 2501.15955 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-27 TimeHF: Billion-Scale Time Series Models Guided by Human Feedback Yongzhi Qi et.al. 2501.15942 null
2025-01-27 SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub Benjamin C. Carter et.al. 2501.15922 null
2025-01-27 Parametric Retrieval Augmented Generation Weihang Su et.al. 2501.15915 link
2025-01-27 Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation Muhammad Taha Tariq et.al. 2501.15901 null
2025-01-27 Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects Victor Deng et.al. 2501.15900 null
2025-01-27 Adaptive Width Neural Networks Federico Errica et.al. 2501.15889 null
2025-01-27 LCTG Bench: LLM Controlled Text Generation Benchmark Kentaro Kurihara et.al. 2501.15875 link
2025-01-27 LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models Yuewen Mei et.al. 2501.15850 null
2025-01-27 SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model Delin Qu et.al. 2501.15830 null
2025-01-27 Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference Tharindu B. Hewage et.al. 2501.15829 link
2025-01-27 MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer Qi Chen et.al. 2501.15826 null
2025-01-27 LemmaHead: RAG Assisted Proof Generation Using Large Language Models Tianbo Yang et.al. 2501.15797 null
2025-01-27 Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? Zhiling Chen et.al. 2501.15795 null
2025-01-27 Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs Yu Li et.al. 2501.15791 link
2025-01-27 Memorization and Regularization in Generative Diffusion Models Ricardo Baptista et.al. 2501.15785 link
2025-01-27 Large Language Models to Diffusion Finetuning Edoardo Cetin et.al. 2501.15781 null
2025-01-27 Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages Ivory Yang et.al. 2501.15773 link
2025-01-27 GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design Yuanfu Sun et.al. 2501.15755 null
2025-01-27 IndicMMLU-Pro: Benchmarking the Indic Large Language Models Sankalp KJ et.al. 2501.15747 null
2025-01-27 Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning Michael Xieyang Liu et.al. 2501.15727 null
2025-01-27 A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks Dong Li et.al. 2501.15724 null
2025-01-27 On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models Tadahiro Taniguchi et.al. 2501.15721 null
2025-01-26 Adapting Biomedical Abstracts into Plain language using Large Language Models Haritha Gangavarapu et.al. 2501.15700 null
2025-01-26 TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Yuxuan Gu et.al. 2501.15674 null
2025-01-26 Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting Yuxin Zhang et.al. 2501.15641 null
2025-01-26 BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation Ali Khodabandeh Yalabadi et.al. 2501.15631 link
2025-01-26 Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets Eduard Barbu et.al. 2501.15624 null
2025-01-26 Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning Zeyu Gan et.al. 2501.15602 link
2025-01-26 Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals Yinzhou Wang et.al. 2501.15599 null
2025-01-26 Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images Sichen Zhu et.al. 2501.15598 link
2025-01-26 SedarEval: Automated Evaluation using Self-Adaptive Rubrics Zhiyuan Fan et.al. 2501.15595 link
2025-01-26 SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain Dakuan Lu et.al. 2501.15587 link
2025-01-26 Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework Yuhong Sun et.al. 2501.15581 null
2025-01-26 Instruction Tuning for Story Understanding and Generation with Weak Supervision Yangshu Yuan et.al. 2501.15574 null
2025-01-26 Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models Spencer Ramsey et.al. 2501.15571 null
2025-01-26 ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer Lin Yueyu et.al. 2501.15570 link
2025-01-26 Ocean-OCR: Towards General OCR Application via a Vision-Language Model Song Chen et.al. 2501.15558 link
2025-01-26 Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles Hanwen Zhang et.al. 2501.15544 null
2025-01-26 Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths Yueyang Wang et.al. 2501.15522 null
2025-01-26 Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification Dan Song et.al. 2501.15503 null
2025-01-26 Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning Xiaohan Yu et.al. 2501.15470 null
2025-01-26 Data-adaptive Safety Rules for Training Reward Models Xiaomin Li et.al. 2501.15453 null
2025-01-26 OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas Xiaoyang Wang et.al. 2501.15427 null
2025-01-26 Visual Generation Without Guidance Huayu Chen et.al. 2501.15420 link
2025-01-26 AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement Junan Zhang et.al. 2501.15417 null
2025-01-26 The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation Raha Aghaei et.al. 2501.15411 null
2025-01-26 Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency Irin Kabakum et.al. 2501.15405 null
2025-01-26 How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning Tohida Rehman et.al. 2501.15398 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-26 How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback Manzong Huang et.al. 2501.15378 null
2025-01-26 Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models Melkamu Abay Mersha et.al. 2501.15374 null
2025-01-26 Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis Robinson Umeike et.al. 2501.15370 null
2025-01-26 Decentralized Low-Rank Fine-Tuning of Large Language Models Sajjad Ghiasvand et.al. 2501.15361 null
2025-01-26 Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection Bo Yang et.al. 2501.15355 null
2025-01-25 Fairness in LLM-Generated Surveys Andrés Abeliuk et.al. 2501.15351 null
2025-01-25 Between Puppet and Actor: Reframing Authorship in this Age of AI Agents Yuqian Sun et.al. 2501.15346 null
2025-01-25 Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data Jiajie Li et.al. 2501.15326 null
2025-01-25 ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning Shangqian Gao et.al. 2501.15316 null
2025-01-25 The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? Ayo Adedeji et.al. 2501.15310 null
2025-01-25 You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning Ayan Sengupta et.al. 2501.15296 null
2025-01-24 HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Xin Zhou et.al. 2501.14729 link
2025-01-24 Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? Ipek Baris Schlicht et.al. 2501.14719 null
2025-01-24 Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models Naihao Deng et.al. 2501.14717 null
2025-01-24 FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing James Seale Smith et.al. 2501.14713 null
2025-01-24 The Karp Dataset Mason DiCicco et.al. 2501.14705 null
2025-01-24 Rethinking Table Instruction Tuning Naihao Deng et.al. 2501.14693 null
2025-01-24 Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST Fuping Wu et.al. 2501.14685 null
2025-01-24 An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations Shabnam Hassani et.al. 2501.14683 null
2025-01-24 Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning Jisi Zhang et.al. 2501.14680 null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 link
2025-01-24 Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion Ziyao Xu et.al. 2501.14649 link
2025-01-24 Towards Scalable Topological Regularizers Hiu-Tung Wong et.al. 2501.14641 null
2025-01-24 Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics Renato Ghisellini et.al. 2501.14634 null
2025-01-24 Extracting Problem Structure with LLMs for Optimized SAT Local Search André Schilder et.al. 2501.14630 null
2025-01-24 Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data Jordi Abante et.al. 2501.14615 null
2025-01-24 ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Tianming Liang et.al. 2501.14607 null
2025-01-24 Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research Hamid Sarmadi et.al. 2501.14546 null
2025-01-24 VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning Benjamin Callewaert et.al. 2501.14540 null
2025-01-24 Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models Zhenguang Zhong et.al. 2501.14530 link
2025-01-24 Scene Understanding Enabled Semantic Communication with Open Channel Coding Zhe Xiang et.al. 2501.14520 null
2025-01-24 Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel Zhuoran Liu et.al. 2501.14512 null
2025-01-24 Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course Pavlin G. Poličar et.al. 2501.14499 null
2025-01-24 Evaluating and Improving Graph to Text Generation with Large Language Models Jie He et.al. 2501.14497 link
2025-01-24 RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Zhengyang Tang et.al. 2501.14492 link
2025-01-24 Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design Taehan Kim et.al. 2501.14469 null
2025-01-24 Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis Xiujing Guo et.al. 2501.14465 null
2025-01-24 Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing Zeping Yu et.al. 2501.14457 null
2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null
2025-01-24 GraphBC: Improving LLMs for Better Graph Data Processing Xu Chu et.al. 2501.14427 null
2025-01-24 CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios Michael Fuest et.al. 2501.14426 null
2025-01-24 DeepFlow: Serverless Large Language Model Serving at Scale Junhao Hu et.al. 2501.14417 null
2025-01-24 SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation Shengjie Wang et.al. 2501.14400 null
2025-01-24 ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer Yoni Schirris et.al. 2501.14379 link
2025-01-24 DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing Xinyu Ma et.al. 2501.14371 link
2025-01-24 Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches Ziad Sakr et.al. 2501.14366 null
2025-01-24 FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration Kai-Tuo Xu et.al. 2501.14350 link
2025-01-24 Chain-of-Retrieval Augmented Generation Liang Wang et.al. 2501.14342 null
2025-01-24 Exploring the sustainable scaling of AI dilemma: A projective study of corporations’ AI environmental impacts Clément Desroches et.al. 2501.14334 null
2025-01-24 Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models Ridhi Jain et.al. 2501.14326 null
2025-01-24 PAID: A Framework of Product-Centric Advertising Image Design Hongyu Chen et.al. 2501.14316 null
2025-01-24 Locality-aware Fair Scheduling in LLM Serving Shiyi Cao et.al. 2501.14312 null
2025-01-24 A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education Calvin Yeung et.al. 2501.14305 link
2025-01-24 MASTER: A Multi-Agent System with LLM Specialized MCTS Bingzheng Gan et.al. 2501.14304 null
2025-01-24 Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph Xujian Liang et.al. 2501.14300 link
2025-01-24 Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment Julian A. Schnabel et.al. 2501.14296 null
2025-01-24 Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes Sullam Jeoung et.al. 2501.14294 link
2025-01-24 Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches Feng Zhou et.al. 2501.14291 null
2025-01-24 Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation Sadegh Mahdavi et.al. 2501.14275 link
2025-01-24 Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors Yi Zhao et.al. 2501.14250 link
2025-01-24 Humanity’s Last Exam Long Phan et.al. 2501.14249 null
2025-01-24 Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game Rong Ye et.al. 2501.14225 null
2025-01-24 Top Ten Challenges Towards Agentic Neural Graph Databases Jiaxin Bai et.al. 2501.14224 null
2025-01-24 TFG-Flow: Training-free Guidance in Multimodal Generative Flow Haowei Lin et.al. 2501.14216 link
2025-01-24 Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading Minrui Xu et.al. 2501.14205 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-24 Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models Saaduddin Mahmud et.al. 2501.14189 null
2025-01-24 GeoSim.AI: AI assistants for numerical simulations in geomechanics Yared W. Bekele et.al. 2501.14186 null
2025-01-24 AI Chatbots as Professional Service Agents: Developing a Professional Identity Wenwen Li et.al. 2501.14179 null
2025-01-24 Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models Yile Gu et.al. 2501.14170 null
2025-01-24 Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction Dongming Sheng et.al. 2501.14144 null
2025-01-23 Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation Derek Yotheringhay et.al. 2501.14119 null
2025-01-23 Domain-Factored Untrained Deep Prior for Spectrum Cartography Subash Timilsina et.al. 2501.14116 null
2025-01-23 MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning Joshua Davis et.al. 2501.14105 link
2025-01-23 StreamingRAG: Real-time Contextual Retrieval and Generation Framework Murugan Sankaradas et.al. 2501.14101 null
2025-01-23 Enhancing Biomedical Relation Extraction with Directionality Po-Ting Lai et.al. 2501.14079 link
2025-01-23 LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language Yubin Ge et.al. 2501.14073 null
2025-01-23 Efficient 2D CT Foundation Model for Contrast Phase Classification Benjamin Hou et.al. 2501.14066 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-23 LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps Andrey Palaev et.al. 2501.14046 link
2025-01-23 Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions Jianfeng Zhu et.al. 2501.14037 null
2025-01-23 CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation Guofeng Cui et.al. 2501.13927 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 Binary Diffusion Probabilistic Model Vitaliy Kinakh et.al. 2501.13915 null
2025-01-23 Analysis of Indic Language Capabilities in LLMs Aatman Vaidya et.al. 2501.13912 null
2025-01-23 Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models Linh Tran et.al. 2501.13904 null
2025-01-23 Exploring Finetuned Audio-LLM on Heart Murmur Features Adrian Florea et.al. 2501.13884 null
2025-01-23 The machine learning platform for developers of large systems Alexey Naikov et.al. 2501.13881 null
2025-01-23 A RAG-Based Institutional Assistant Gustavo Kuratomi et.al. 2501.13880 null
2025-01-23 On the Reasoning Capacity of AI Models and How to Quantify It Santosh Kumar Radha et.al. 2501.13833 null
2025-01-23 Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing Hao Zhang et.al. 2501.13831 null
2025-01-23 Hallucinations Can Improve Large Language Models in Drug Discovery Shuzhou Yuan et.al. 2501.13824 null
2025-01-23 Large Language Model driven Policy Exploration for Recommender Systems Jie Wang et.al. 2501.13816 null
2025-01-23 Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change Mowafak Allaham et.al. 2501.13802 null
2025-01-23 Parameter-Efficient Fine-Tuning for Foundation Models Dan Zhang et.al. 2501.13787 link
2025-01-23 Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling Tanya Rodchenko et.al. 2501.13779 null
2025-01-23 Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework Yoonsang Kim et.al. 2501.13778 link
2025-01-23 Do Large Language Models Truly Understand Geometric Structures? Xiaofeng Wang et.al. 2501.13773 link
2025-01-23 Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak Erjia Xiao et.al. 2501.13772 null
2025-01-23 UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models Xin Xu et.al. 2501.13766 null
2025-01-23 EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents Yuhui Yun et.al. 2501.13746 null
2025-01-23 GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification Te Pei et.al. 2501.13743 null
2025-01-23 An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities Zezhou Yang et.al. 2501.13742 link
2025-01-23 Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Chang Gong et.al. 2501.13731 null
2025-01-23 RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation Shi-Qi Yan et.al. 2501.13726 null
2025-01-23 Musical ethnocentrism in Large Language Models Anna Kruspe et.al. 2501.13720 null
2025-01-23 A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation Dario Serez et.al. 2501.13718 null
2025-01-23 EventVL: Understand Event Streams via Multimodal Large Language Model Pengteng Li et.al. 2501.13707 null
2025-01-23 DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale Linghao Zhang et.al. 2501.13699 null
2025-01-23 Question Answering on Patient Medical Records with Private Fine-Tuned LLMs Sara Kothari et.al. 2501.13687 null
2025-01-23 HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor Zihui Wu et.al. 2501.13677 link
2025-01-23 How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization Shezheng Song et.al. 2501.13669 null
2025-01-23 LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models Yizheng Sun et.al. 2501.13652 null
2025-01-23 Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Zhenghao Lin et.al. 2501.13629 null
2025-01-23 Text-to-SQL based on Large Language Models and Database Keyword Search Eduardo R. Nascimento et.al. 2501.13594 null
2025-01-23 Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization Lei Huang et.al. 2501.13573 null
2025-01-23 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt Tao Liu et.al. 2501.13554 link
2025-01-23 LLMs Can Plan Only If We Tell Them Bilgehan Sel et.al. 2501.13545 null
2025-01-23 ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Jianxin Liang et.al. 2501.13536 null
2025-01-23 RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles Munachiso Nwadike et.al. 2501.13491 link
2025-01-23 Adaptive Testing for LLM-Based Applications: A Diversity-based Approach Juyeon Yoon et.al. 2501.13480 null
2025-01-23 LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation JiaXin Chen et.al. 2501.13475 null
2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link
2025-01-23 Spurious Forgetting in Continual Learning of Language Models Junhao Zheng et.al. 2501.13453 link
2025-01-23 Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models Bo Gao et.al. 2501.13428 null
2025-01-23 Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling Tomek Jaroslawski et.al. 2501.13415 null
2025-01-23 VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework He Kong et.al. 2501.13411 link
2025-01-23 Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures Minglong Dong et.al. 2501.13396 null
2025-01-23 Can Large Language Models Understand Preferences in Personalized Recommendation? Zhaoxuan Tan et.al. 2501.13391 link
2025-01-23 Do as We Do, Not as You Think: the Conformity of Large Language Models Zhiyuan Weng et.al. 2501.13381 link
2025-01-23 Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility Gabrielle Hoyer et.al. 2501.13376 link
2025-01-23 Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement Jae-Sung Bae et.al. 2501.13372 null
2025-01-23 Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification Yuzhuo Li et.al. 2501.13368 null
2025-01-23 50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications Zewei Shi et.al. 2501.13351 link
2025-01-23 MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize Haohang Xu et.al. 2501.13349 null
2025-01-23 Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation Rong Shan et.al. 2501.13344 link
2025-01-23 Multi-aspect Knowledge Distillation with Large Language Model Taegyeong Lee et.al. 2501.13341 link
2025-01-23 Generative Multi-Form Bayesian Optimization Zhendong Guo et.al. 2501.13337 null
2025-01-23 SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network Songge Zhang et.al. 2501.13318 null
2025-01-23 Representing Visualization Insights as a Dense Insight Network Jane Hoffswell et.al. 2501.13309 null
2025-01-23 OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia Xuelong Geng et.al. 2501.13306 link
2025-01-23 Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers Akshit Achara et.al. 2501.13302 link
2025-01-23 Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents Shrinidhi Kumbhar et.al. 2501.13299 null
2025-01-23 RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering Yang Bai et.al. 2501.13297 link
2025-01-23 Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols John Joon Young Chung et.al. 2501.13284 null
2025-01-22 MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis Daeun Jung et.al. 2501.13277 link
2025-01-22 RAG-Reward: Optimizing RAG with Reward Modeling and RLHF Hanning Zhang et.al. 2501.13264 null
2025-01-22 Exploring GPT’s Ability as a Judge in Music Understanding Kun Fang et.al. 2501.13261 link
2025-01-22 Bypassing Array Canaries via Autonomous Function Call Resolution Nathaniel Oh et.al. 2501.13256 link
2025-01-22 S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning Yichen Wu et.al. 2501.13198 null
2025-01-22 Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century Axel Loewe et.al. 2501.13142 null
2025-01-23 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-22 Robust Representation Consistency Model via Contrastive Denoising Jiachen Lei et.al. 2501.13094 link
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null
2025-01-22 Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning Bohao Yang et.al. 2501.13042 link
2025-01-22 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Yantao Liu et.al. 2501.13007 link
2025-01-22 Neural network enhanced cross entropy benchmark for monitored circuits Yangrui Hu et.al. 2501.13005 null
2025-01-22 Large Language Model-Based Semantic Communication System for Image Transmission Soheyb Ribouh et.al. 2501.12988 null
2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null
2025-01-22 Low-dimensional adaptation of diffusion models: Convergence in total variation Jiadong Liang et.al. 2501.12982 null
2025-01-22 OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models Chongren Sun et.al. 2501.12975 link
2025-01-22 Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs Jan Corazza et.al. 2501.12972 null
2025-01-22 It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act Kristof Meding et.al. 2501.12962 null
2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 null
2025-01-22 GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models Pengxiang Zhao et.al. 2501.12956 null
2025-01-22 3D Object Manipulation in a Single Image using Generative Models Ruisi Zhao et.al. 2501.12935 null
2025-01-22 Correctness Assessment of Code Generated by Large Language Models Using Internal Representations Tuan-Dung Bui et.al. 2501.12934 link
2025-01-22 DynamicEarth: How Far are We from Open-Vocabulary Change Detection? Kaiyu Li et.al. 2501.12931 null
2025-01-22 A Functional Software Reference Architecture for LLM-Integrated Systems Alessio Bucaioni et.al. 2501.12904 null
2025-01-22 Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration Offa Kingsleigh et.al. 2501.12901 null
2025-01-22 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Yafu Li et.al. 2501.12895 link
2025-01-23 Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program Carlton Shepherd et.al. 2501.12883 null
2025-01-22 WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge Jingyuan Chen et.al. 2501.12877 null
2025-01-22 ACEBench: Who Wins the Match Point in Tool Learning? Chen Chen et.al. 2501.12851 null
2025-01-22 AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation Aghiles Kebaili et.al. 2501.12840 null
2025-01-22 Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home Viktor Moskvoretskii et.al. 2501.12835 null
2025-01-22 Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek John Pavlopoulos et.al. 2501.12826 link
2025-01-22 Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks Alessio Quercia et.al. 2501.12824 link
2025-01-22 Certified Guidance for Planning with Deep Generative Models Francesco Giacomarra et.al. 2501.12815 null
2025-01-22 Revisit Self-Debugging with Self-Generated Tests for Code Generation Xiancai Chen et.al. 2501.12793 null
2025-01-22 LLMs as Repositories of Factual Knowledge: Limitations and Solutions Seyed Mahed Mousavi et.al. 2501.12774 null
2025-01-22 NExtLong: Toward Effective Long-Context Training without Long Documents Chaochen Gao et.al. 2501.12766 link
2025-01-22 Online Preference Alignment for Language Models via Count-based Exploration Chenjia Bai et.al. 2501.12735 link
2025-01-22 Paradigm-Based Automatic HDL Code Generation Using LLMs Wenhao Sun et.al. 2501.12702 null
2025-01-22 Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression Kai Yoshida et.al. 2501.12698 null
2025-01-22 Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering Qian Tao et.al. 2501.12697 null
2025-01-22 SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling Shengshi Yao et.al. 2501.12696 null
2025-01-22 EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation Yifan Yu et.al. 2501.12689 null
2025-01-22 Distillation Quantification for Large Language Models Sunbowen Lee et.al. 2501.12619 link
2025-01-22 Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We? Taiming Wang et.al. 2501.12617 null
2025-01-22 Kimi k1.5: Scaling Reinforcement Learning with LLMs Kimi Team et.al. 2501.12599 null
2025-01-22 Leveraging LLMs to Create a Haptic Devices’ Recommendation System Yang Liu et.al. 2501.12573 null
2025-01-22 Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review Rock Yuren Pang et.al. 2501.12557 link
2025-01-21 Human-like conceptual representations emerge from language prediction Ningyu Xu et.al. 2501.12547 null
2025-01-21 How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models? Mirali Purohit et.al. 2501.12535 null
2025-01-21 An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts Dhia Elhaq Rzig et.al. 2501.12521 null
2025-01-21 A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data Minh Tran et.al. 2501.12501 null
2025-01-21 The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws Tian Jin et.al. 2501.12486 null
2025-01-21 An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models Xiaoyu Chu et.al. 2501.12469 link
2025-01-21 Adaptive PII Mitigation Framework for Large Language Models Shubhi Asthana et.al. 2501.12465 null
2025-01-21 Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management Arthur Vitui et.al. 2501.12461 link
2025-01-21 Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications Shubhi Asthana et.al. 2501.12456 null
2025-01-21 Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation Dongsheng Zhu et.al. 2501.12432 null
2025-01-21 FREYR: A Framework for Recognizing and Executing Your Requests Roberto Gallotta et.al. 2501.12423 link
2025-01-21 CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning Eunjee Choi et.al. 2501.12422 null
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks Greg Olmschenk et.al. 2501.12383 null
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 link
2025-01-21 Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 null
2025-01-21 Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration Thomas Walshe et.al. 2501.12332 null
2025-01-21 Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops Mohamed Harmanani et.al. 2501.12331 link
2025-01-21 VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Xianwei Zhuang et.al. 2501.12327 link
2025-01-21 LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations Hasan Abu-Rasheed et.al. 2501.12300 null
2025-01-21 MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks Qishen Zhou et.al. 2501.12281 link
2025-01-21 Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Maosong Cao et.al. 2501.12273 link
2025-01-21 FOCUS: First Order Concentrated Updating Scheme Yizhou Liu et.al. 2501.12243 null
2025-01-21 InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models Pha Nguyen et.al. 2501.12231 null
2025-01-21 CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning Yuanheng Fang et.al. 2501.12226 null
2025-01-21 Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces Allard Oelen et.al. 2501.12221 null
2025-01-21 You Can’t Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense Wuyuao Mai et.al. 2501.12210 null
2025-01-21 Explainability for Vision Foundation Models: A Survey Rémi Kazmierczak et.al. 2501.12203 null
2025-01-22 Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Zibo Zhao et.al. 2501.12202 link
2025-01-21 BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks Zhuang Li et.al. 2501.12174 null
2025-01-21 Contextualizing Recommendation Explanations with LLMs: A User Study Yuanjun Feng et.al. 2501.12152 null
2025-01-21 Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities Qirun Dai et.al. 2501.12147 null
2025-01-21 Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot Daniele Bifolco et.al. 2501.12134 null
2025-01-21 Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions Tim Schreiter et.al. 2501.12128 null
2025-01-21 Can open source large language models be used for tumor documentation in Germany? – An evaluation on urological doctors’ notes Stefan Lenz et.al. 2501.12106 link
2025-01-21 Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis Weile Luo et.al. 2501.12084 null
2025-01-21 Phishing Awareness via Game-Based Learning Argianto Rahartomo et.al. 2501.12077 link
2025-01-21 PINNsAgent: Automated PDE Surrogation with Large Language Models Qingpo Wuwu et.al. 2501.12053 null
2025-01-21 Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation Chen Griner et.al. 2501.12033 null
2025-01-21 Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing’s Syndrome Diagnosis in Facial Analysis Hongjun Liu et.al. 2501.12023 null
2025-01-21 Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection? Samantha Min Er Yew et.al. 2501.12016 null
2025-01-21 Rate-Aware Learned Speech Compression Jun Xu et.al. 2501.11999 null
2025-01-21 Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models Rupesh Raj Karn et.al. 2501.11979 null
2025-01-21 Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues Maya Medjad et.al. 2501.11977 link
2025-01-21 Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization Jie Zhao et.al. 2501.11968 null
2025-01-21 A Hybrid Attention Framework for Fake News Detection with Large Language Models Xiaochuan Xu et.al. 2501.11967 null
2025-01-21 TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection Yang Cao et.al. 2501.11960 null
2025-01-21 Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model Minghan Wang et.al. 2501.11953 null
2025-01-21 ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation Peter Devine et.al. 2501.11929 link
2025-01-21 Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model He Chang et.al. 2501.11911 null
2025-01-21 Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation Junhong Lian et.al. 2501.11900 link
2025-01-22 Med-R $^2$ : Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine Keer Lu et.al. 2501.11885 link
2025-01-21 From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning Yafu Li et.al. 2501.11877 link
2025-01-21 LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems Venkata Sai Aswath Duvvuru et.al. 2501.11864 null
2025-01-21 EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Zhili Cheng et.al. 2501.11858 link
2025-01-21 Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance Nikos Kanakaris et.al. 2501.11849 link
2025-01-21 A Survey on Memory-Efficient Large-Scale Model Training in AI for Science Kaiyuan Tian et.al. 2501.11847 null
2025-01-21 Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction Noah L. Schroeder et.al. 2501.11840 null
2025-01-21 PXGen: A Post-hoc Explainable Method for Generative Models Yen-Lung Huang et.al. 2501.11827 null
2025-01-21 CogMorph: Cognitive Morphing Attacks for Text-to-Image Models Zonglei Jing et.al. 2501.11815 null
2025-01-20 Benchmarking Large Language Models via Random Variables Zijin Hong et.al. 2501.11790 null
2025-01-20 Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection Ali Naseh et.al. 2501.11786 null
2025-01-20 Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference Pouya Hamadanian et.al. 2501.11779 link
2025-01-20 The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers Alina Starovolsky-Shitrit et.al. 2501.11770 null
2025-01-20 Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems Fatemeh Nazary et.al. 2501.11759 link
2025-01-20 A generalizable 3D framework and model for self-supervised learning in medical imaging Tony Xu et.al. 2501.11755 link
2025-01-20 Are generative models fair? A study of racial bias in dermatological image generation Miguel López-Pérez et.al. 2501.11752 null
2025-01-20 Optimizing Pretraining Data Mixtures with LLM-Estimated Utility William Held et.al. 2501.11747 null
2025-01-20 MedicoSAM: Towards foundation models for medical image segmentation Anwai Archit et.al. 2501.11734 link
2025-01-20 Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Zhenhailong Wang et.al. 2501.11733 null
2025-01-20 Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy Saeid Asgari Taghanaki et.al. 2501.11721 link
2025-01-20 YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners’ Perspectives Nong Ming et.al. 2501.11712 link
2025-01-20 Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution Ramtin Ehsani et.al. 2501.11709 link
2025-01-20 Trustformer: A Trusted Federated Transformer Ali Abbasi Tadi et.al. 2501.11706 null
2025-01-20 Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) Brian E. Perron et.al. 2501.11705 null
2025-01-20 Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling Zhenyu Hou et.al. 2501.11651 link
2025-01-20 Trojan Detection Through Pattern Recognition for Large Language Models Vedant Bhasin et.al. 2501.11621 null
2025-01-20 Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems Giorgio Robino et.al. 2501.11613 null
2025-01-20 SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks Wentao Wan et.al. 2501.11599 link
2025-01-20 Recurrent Diffusion for Large-Scale Parameter Generation Kai Wang et.al. 2501.11587 link
2025-01-20 Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models Mahyar Habibi et.al. 2501.11581 null
2025-01-20 Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution Zhiyuan You et.al. 2501.11561 null
2025-01-20 PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation Jinyu Wang et.al. 2501.11551 link
2025-01-20 UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion Zixuan Chen et.al. 2501.11515 null
2025-01-20 Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges Vincent Koc et.al. 2501.11496 null
2025-01-20 Graph-defined Language Learning with LLMs Huachi Zhou et.al. 2501.11478 null
2025-01-20 Curiosity-Driven Reinforcement Learning from Human Feedback Haoran Sun et.al. 2501.11463 link
2025-01-20 Ontology Matching with Large Language Models and Prioritized Depth-First Search Maria Taboada et.al. 2501.11441 null
2025-01-20 One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor Zhikun Wu et.al. 2501.11433 null
2025-01-20 A Survey on Diffusion Models for Anomaly Detection Jing Liu et.al. 2501.11430 link
2025-01-20 Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Siyu Yuan et.al. 2501.11425 link
2025-01-20 Neural Contextual Reinforcement Framework for Logical Structure Language Generation Marcus Irvin et.al. 2501.11417 null
2025-01-20 Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing Kevin Sim et.al. 2501.11411 null
2025-01-20 Revisiting Language Models in Neural News Recommender Systems Yuyue Zhao et.al. 2501.11391 link
2025-01-20 Towards Advancing Code Generation with Large Language Models: A Research Roadmap Haolin Jin et.al. 2501.11354 null
2025-01-20 EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery Guankun Wang et.al. 2501.11347 link
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 Few-shot Policy (de)composition in Conversational Question Answering Kyle Erwin et.al. 2501.11335 null
2025-01-20 Nested Annealed Training Scheme for Generative Adversarial Networks Chang Wan et.al. 2501.11318 null
2025-01-20 Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning Zhongtian Hu et.al. 2501.11292 null
2025-01-20 Large Language Model Agents for Radio Map Generation and Wireless Network Planning Hongye Quan et.al. 2501.11283 null
2025-01-20 Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries Yi-Hui Lee et.al. 2501.11273 null
2025-01-20 Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios Zhongtian Hu et.al. 2501.11269 null
2025-01-20 Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian Wannita Takerngsaksiri et.al. 2501.11264 link
2025-01-20 Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models Zhuangzhuang Yan et.al. 2501.11247 null
2025-01-20 Irony in Emojis: A Comparative Study of Human and LLM Interpretation Yawen Zheng et.al. 2501.11241 null
2025-01-20 KPL: Training-Free Medical Knowledge Mining of Vision-Language Models Jiaxiang Liu et.al. 2501.11231 link
2025-01-20 Reasoning Language Models: A Blueprint Maciej Besta et.al. 2501.11223 link
2025-01-20 Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation Ivan Lopez et.al. 2501.11199 null
2025-01-19 Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests Kristin Blesch et.al. 2501.11178 link
2025-01-17 FaceXBench: Evaluating Multimodal LLMs on Face Understanding Kartik Narayan et.al. 2501.10360 link
2025-01-17 Zero-Shot Monocular Scene Flow Estimation in the Wild Yiqing Liang et.al. 2501.10357 null
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 link
2025-01-17 Large language models for automated scholarly paper review: A survey Zhenzhen Zhuang et.al. 2501.10326 null
2025-01-17 HiMix: Reducing Computational Complexity in Large Vision-Language Models Xuange Zhang et.al. 2501.10318 null
2025-01-17 Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs Claudio Di Sipio et.al. 2501.10313 null
2025-01-17 Computational Protein Science in the Era of Large Language Models (LLMs) Wenqi Fan et.al. 2501.10282 null
2025-01-17 Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation Azat Abdullin et.al. 2501.10200 null
2025-01-17 Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education William Hersh et.al. 2501.10186 null
2025-01-17 Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval Vera Pavlova et.al. 2501.10175 null
2025-01-17 Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis Abhishek Kaushik et.al. 2501.10134 null
2025-01-17 ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Lucen Zhong et.al. 2501.10132 link
2025-01-17 PaSa: An LLM Agent for Comprehensive Academic Paper Search Yichen He et.al. 2501.10120 link
2025-01-17 AI-Generated Music Detection and its Challenges Darius Afchar et.al. 2501.10111 link
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-17 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng et.al. 2501.10105 link
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-17 FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization Zhaopeng Gu et.al. 2501.10067 link
2025-01-17 Accelerating Large Language Models through Partially Linear Feed-Forward Network Gansen Hu et.al. 2501.10054 null
2025-01-17 AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search Wenfeng Feng et.al. 2501.10053 null
2025-01-17 Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists Alyssia Chen et.al. 2501.10037 null
2025-01-17 Mapping scientific communities at scale Victor Barbier et.al. 2501.10035 link
2025-01-17 Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions Zhijie Tan et.al. 2501.10011 null
2025-01-17 Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models Qiang Liu et.al. 2501.09997 null
2025-01-17 Agent-as-Judge for Factual Summarization of Long Narratives Yeonseok Jeong et.al. 2501.09993 link
2025-01-17 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Yuefan Cao et.al. 2501.09982 null
2025-01-17 GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions Heda Zuo et.al. 2501.09972 null
2025-01-17 Explainable artificial intelligence (XAI): from inherent explainability to large language models Fuseini Mumuni et.al. 2501.09967 null
2025-01-17 A Survey on Multi-Turn Interaction Capabilities of Large Language Models Chen Zhang et.al. 2501.09959 null
2025-01-17 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-17 AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations Jamin Seo et.al. 2501.09954 link
2025-01-17 Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt Qingcheng Zeng et.al. 2501.09950 null
2025-01-17 MultiPruner: Balanced Structure Removal in Foundation Models J. Pablo Muñoz et.al. 2501.09949 link
2025-01-17 Steering Large Language Models with Feature Guided Activation Additions Samuel Soo et.al. 2501.09929 null
2025-01-17 Towards A Litmus Test for Common Sense Hugo Latapie et.al. 2501.09913 null
2025-01-17 Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project’s Talent Knowledge Graph Jiawei Xu et.al. 2501.09909 null
2025-01-17 Position: Open and Closed Large Language Models in Healthcare Jiawei Xu et.al. 2501.09906 null
2025-01-17 FoundationStereo: Zero-Shot Stereo Matching Bowen Wen et.al. 2501.09898 link
2025-01-17 Evolving Deeper LLM Thinking Kuang-Huei Lee et.al. 2501.09891 null
2025-01-17 Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment Mohammad Sadegh Sheikhaei et.al. 2501.09888 link
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-16 ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction Izzeddin Teeti et.al. 2501.09878 null
2025-01-16 Geometry-Preserving Encoder/Decoder in Latent Generative Models Wonjun Lee et.al. 2501.09876 null
2025-01-16 An LLM-Guided Tutoring System for Social Skills Training Michael Guevarra et.al. 2501.09870 null
2025-01-16 Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing Wenhan Wang et.al. 2501.09866 null
2025-01-16 Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message Jiawei Li et.al. 2501.09861 null
2025-01-16 PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery Shristi Das Biswas et.al. 2501.09826 link
2025-01-16 Bridging Language Barriers in Healthcare: A Study on Arabic LLMs Nada Saadi et.al. 2501.09825 null
2025-01-16 BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling Daniele Castellana et.al. 2501.09821 link
2025-01-16 Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems Soham Roy et.al. 2501.09801 null
2025-01-16 Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API Andrey Labunets et.al. 2501.09798 null
2025-01-16 GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation Weiliang Tang et.al. 2501.09783 null
2025-01-16 SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation Wanqi Yin et.al. 2501.09782 link
2025-01-16 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Zhongwei Ren et.al. 2501.09781 null
2025-01-16 Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Tairan Fu et.al. 2501.09775 null
2025-01-16 Distilling Multi-modal Large Language Models for Autonomous Driving Deepti Hegde et.al. 2501.09757 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755 null
2025-01-16 Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues Youngjoon Jang et.al. 2501.09754 null
2025-01-16 OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Zekun Xi et.al. 2501.09751 link
2025-01-16 Enhancing Lexicon-Based Text Embeddings with Large Language Models Yibin Lei et.al. 2501.09749 null
2025-01-16 Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models Bihui Jin et.al. 2501.09745 null
2025-01-16 KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports Hajung Kim et.al. 2501.09744 null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text Jihed Ncib et.al. 2501.09719 null
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 link
2025-01-16 Domain Adaptation of Foundation LLMs for e-Commerce Christian Herold et.al. 2501.09706 null
2025-01-16 Cueless EEG imagined speech for subject identification: dataset and benchmarks Ali Derakhshesh et.al. 2501.09700 link
2025-01-16 Simulated Interactive Debugging Yannic Noller et.al. 2501.09694 null
2025-01-17 Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities Fengli Xu et.al. 2501.09686 null
2025-01-16 Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review Masatoshi Uehara et.al. 2501.09685 null
2025-01-16 Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark Alexis Roger et.al. 2501.09672 null
2025-01-16 A Survey of Research in Large Language Models for Electronic Design Automation Jingyu Pan et.al. 2501.09655 null
2025-01-16 The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Jonathan Katzy et.al. 2501.09653 null
2025-01-16 CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding Johannes Kirmayr et.al. 2501.09645 link
2025-01-17 LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading Kuan-Ming Liu et.al. 2501.09636 null
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631 null
2025-01-16 Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment Chaoqi Wang et.al. 2501.09620 link
2025-01-16 From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs Hrithik Majumdar Shibu et.al. 2501.09604 link
2025-01-16 Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures Pratyush Dhingra et.al. 2501.09588 null
2025-01-16 Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis Tingxuan Chen et.al. 2501.09555 link
2025-01-16 AI in Support of Diversity and Inclusion Çiçek Güven et.al. 2501.09534 null
2025-01-16 Confidence Estimation for Error Detection in Text-to-SQL Systems Oleg Somov et.al. 2501.09527 link
2025-01-16 Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data Omar Mena et.al. 2501.09521 null
2025-01-16 AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Junjie He et.al. 2501.09503 null
2025-01-16 Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis Qize Yang et.al. 2501.09502 null
2025-01-16 Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework Nuo Chen et.al. 2501.09493 null
2025-01-16 Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators Zhaocheng Liu et.al. 2501.09484 link
2025-01-16 Guided Debugging of Auto-Translated Code Using Differential Testing Shengnan Wu et.al. 2501.09475 null
2025-01-16 DEFOM-Stereo: Depth Foundation Model Based Stereo Matching Hualie Jiang et.al. 2501.09466 link
2025-01-16 Pruning for Sparse Diffusion Models based on Gradient Flow Ben Wan et.al. 2501.09464 null
2025-01-16 “A Great Start, But…”: Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design Tianhao He et.al. 2501.09457 null
2025-01-16 Solving the unsolvable: Translating case law in Hong Kong King-kui Sin et.al. 2501.09444 null
2025-01-16 Scaling up self-supervised learning for improved surgical foundation models Tim J. M. Jaspers et.al. 2501.09436 link
2025-01-16 CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Hwan Heo et.al. 2501.09433 link
2025-01-16 A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy Huandong Wang et.al. 2501.09431 null
2025-01-16 AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring Xinyi Wang et.al. 2501.09428 null
2025-01-16 AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling Ancheng Xu et.al. 2501.09426 null
2025-01-16 FASP: Fast and Accurate Structured Pruning of Large Language Models Hanyu Hu et.al. 2501.09412 null
2025-01-16 MoE $^2$ : Optimizing Collaborative Inference for Edge Large Language Models Lyudong Jin et.al. 2501.09410 null
2025-01-16 Adaptive Contextual Caching for Mobile Edge Large Language Model Service Guangyuan Liu et.al. 2501.09383 null
2025-01-16 Aligning Instruction Tuning with Pre-training Yiming Liang et.al. 2501.09368 null
2025-01-16 PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks Huiyou Zhan et.al. 2501.09367 null
2025-01-16 YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks Saptarashmi Bandyopadhyay et.al. 2501.09355 null
2025-01-16 UVRM: A Scalable 3D Reconstruction Model from Unposed Videos Shiu-hong Kao et.al. 2501.09347 null
2025-01-16 Rational Tuning of LLM Cascades via Probabilistic Modeling Michael J. Zellinger et.al. 2501.09345 null
2025-01-16 SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs Anbang Ye et.al. 2501.09316 null
2025-01-16 A Study of In-Context-Learning-Based Text-to-SQL Errors Jiawei Shen et.al. 2501.09310 link
2025-01-16 To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation Kaustubh D. Dhole et.al. 2501.09292 null
2025-01-16 LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport Kyeongha Rho et.al. 2501.09291 link
2025-01-16 Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding Kohei Torimi et.al. 2501.09278 null
2025-01-16 Large Language Model is Secretly a Protein Sequence Optimizer Yinkai Wang et.al. 2501.09274 null
2025-01-16 Perspective Transition of Large Language Models for Solving Subjective Tasks Xiaolong Wang et.al. 2501.09265 null
2025-01-16 Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition Takaaki Hori et.al. 2501.09258 null
2025-01-16 Clone-Robust AI Alignment Ariel D. Procaccia et.al. 2501.09254 null
2025-01-16 Split Fine-Tuning for Large Language Models in Wireless Networks Songge Zhang et.al. 2501.09237 null
2025-01-16 Foundations of Large Language Models Tong Xiao et.al. 2501.09223 link
2025-01-16 Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs Sanchit Sinha et.al. 2501.09221 null
2025-01-16 A Simple Graph Contrastive Learning Framework for Short Text Classification Yonghao Liu et.al. 2501.09219 link
2025-01-16 Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics Yuanyuan Wei et.al. 2501.09218 null
2025-01-16 Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning Yonghao Liu et.al. 2501.09214 link
2025-01-16 FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training Hongzhou Yu et.al. 2501.09213 link
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 Towards Semantics Lifting for Scientific Computing: A Case Study on FFT Naifeng Zhang et.al. 2501.09201 null
2025-01-15 Guiding Retrieval using LLM-based Listwise Rankers Mandeep Rathee et.al. 2501.09186 link
2025-01-15 The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching Yevhen Kostiuk et.al. 2501.09164 null
2025-01-15 Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability Stephanie L. Day et.al. 2501.09158 null
2025-01-15 Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History Yevhen Kostiuk et.al. 2501.09154 null
2025-01-15 Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation Xingxin He et.al. 2501.09138 null
2025-01-15 Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Aditi Singh et.al. 2501.09136 link
2025-01-15 HAFix: History-Augmented Large Language Models for Bug Fixing Yu Shi et.al. 2501.09135 link
2025-01-15 Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing Eshaan Tanwar et.al. 2501.09127 link
2025-01-15 Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment Conrad Borchers et.al. 2501.09126 null
2025-01-15 Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach Alireza Ghaffari et.al. 2501.09107 null
2025-01-15 Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites Hans W. A. Hanley et.al. 2501.09102 link
2025-01-15 Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative Yuqian Sun et.al. 2501.09099 null
2025-01-15 SteLLA: A Structured Grading System Using LLMs with RAG Hefei Qiu et.al. 2501.09092 null
2025-01-15 Generative diffusion model with inverse renormalization group flows Kanta Masuki et.al. 2501.09064 link
2025-01-15 Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition Sneheel Sarangi et.al. 2501.09056 link
2025-01-15 How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias Tosin Fadahunsi et.al. 2501.09014 link
2025-01-15 Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians Ishan Amin et.al. 2501.09009 link
2025-01-15 Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails Shaona Ghosh et.al. 2501.09004 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 link
2025-01-15 CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks Krit Tangsongcharoen et.al. 2501.08998 link
2025-01-15 VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science Youssef Abdalla et.al. 2501.08995 link
2025-01-15 CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities Haozhe Xie et.al. 2501.08983 link
2025-01-15 Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models Emma Croxford et.al. 2501.08977 null
2025-01-15 Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models Karukriti Kaushik Ghosh et.al. 2501.08974 null
2025-01-15 Analyzing the Ethical Logic of Six Large Language Models W. Russell Neuman et.al. 2501.08951 null
2025-01-15 Applying General Turn-taking Models to Conversational Human-Robot Interaction Gabriel Skantze et.al. 2501.08946 null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 null
2025-01-15 GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge Liam Dugan et.al. 2501.08913 link
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 link
2025-01-15 Connecting SPDE to SGMs Junsu Seo et.al. 2501.08877 null
2025-01-15 Exploring Task-Level Optimal Prompts for Visual In-Context Learning Yan Zhu et.al. 2501.08841 null
2025-01-15 How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering Christoph Treude et.al. 2501.08774 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-15 Enhanced Large Language Models for Effective Screening of Depression and Anxiety June M. Liu et.al. 2501.08769 null
2025-01-15 Few-Shot Learner Generalizes Across AI-Generated Image Detection Shiyu Wu et.al. 2501.08763 null
2025-01-15 Leveraging LLM Agents for Translating Network Configurations Yunze Wei et.al. 2501.08760 null
2025-01-15 The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities Irina Bigoulaeva et.al. 2501.08716 link
2025-01-15 Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching Chuangtao Ma et.al. 2501.08686 link
2025-01-15 RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency Siqi Li et.al. 2501.08682 null
2025-01-15 Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery Zeqin Liao et.al. 2501.08670 null
2025-01-15 MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities Savya Khosla et.al. 2501.08648 null
2025-01-15 Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations Kaiyuan Zheng et.al. 2501.08641 null
2025-01-15 SWSC: Shared Weight for Similar Channel in LLM Binrui Zeng et.al. 2501.08631 null
2025-01-15 Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models Aruna Sankaranarayanan et.al. 2501.08618 link
2025-01-15 RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Kaiqu Liang et.al. 2501.08617 null
2025-01-15 Assessing the Alignment of FOL Closeness Metrics with Human Judgement Ramya Keerthy Thatikonda et.al. 2501.08613 link
2025-01-15 Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design Zhi Zheng et.al. 2501.08603 link
2025-01-15 AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL Tyler Stennett et.al. 2501.08600 null
2025-01-15 LlamaRestTest: Effective REST API Testing with Small Language Models Myeongsoo Kim et.al. 2501.08598 null
2025-01-15 Sound Scene Synthesis at the DCASE 2024 Challenge Mathieu Lagrange et.al. 2501.08587 null
2025-01-15 LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model Yuxuan Hu et.al. 2501.08582 null
2025-01-15 Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation Jiaqi Huang et.al. 2501.08580 link
2025-01-15 Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms Kewei Li et.al. 2501.08570 link
2025-01-15 Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications Jin Chen et.al. 2501.08563 link
2025-01-15 LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation Yiran Tao et.al. 2501.08558 null
2025-01-15 The Devil is in Temporal Token: High Quality Video Reasoning Segmentation Sitong Gong et.al. 2501.08549 link
2025-01-15 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-15 Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation Jiaxin Guo et.al. 2501.08523 null
2025-01-14 Quantifying the Importance of Data Alignment in Downstream Model Performance Krrish Chawla et.al. 2501.08496 null
2025-01-14 Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition Md Meem Hossain et.al. 2501.08471 null
2025-01-14 Selective Attention Merging for low resource tasks: A case study of Child ASR Natarajan Balaji Shankar et.al. 2501.08468 link
2025-01-14 Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin Joao Carmo de Almeida Neto et.al. 2501.08464 null
2025-01-14 Large Language Models For Text Classification: Case Study And Comprehensive Review Arina Kostina et.al. 2501.08457 null
2025-01-14 Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack Sagiv Antebi et.al. 2501.08454 null
2025-01-14 Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies Ajwad Abrar et.al. 2501.08441 link
2025-01-14 SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models Anurag Kumar et.al. 2501.08421 null
2025-01-14 Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation Seyed Mehdi Rakhtala et.al. 2501.08420 null
2025-01-14 Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data Jiaxing Qiu et.al. 2501.08413 link
2025-01-14 OptiChat: Bridging Optimization Models and Practitioners with Large Language Models Hao Chen et.al. 2501.08406 link
2025-01-14 Towards Best Practices for Open Datasets for LLM Training Stefan Baack et.al. 2501.08365 null
2025-01-14 Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Ryan Burgert et.al. 2501.08331 link
2025-01-14 PokerBench: Training Large Language Models to become Professional Poker Players Richard Zhuang et.al. 2501.08328 link
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 null
2025-01-14 Exploring Robustness of Multilingual LLMs on Real-World Noisy Data Amirhossein Aliakbarzadeh et.al. 2501.08322 link
2025-01-14 Enhancing Automated Interpretability with Output-Centric Feature Descriptions Yoav Gur-Arieh et.al. 2501.08319 link
2025-01-14 MiniMax-01: Scaling Foundation Models with Lightning Attention MiniMax et.al. 2501.08313 null
2025-01-14 HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Abhilasha Ravichander et.al. 2501.08292 null
2025-01-14 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding Hongyu Li et.al. 2501.08282 link
2025-01-14 Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing Pulkit Arora et.al. 2501.08276 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 null
2025-01-14 Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Yifu Qiu et.al. 2501.08248 null
2025-01-14 Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints Jonathan Nöther et.al. 2501.08246 null
2025-01-14 CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset Jiawei Du et.al. 2501.08238 null
2025-01-14 Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings Paul Joe Maliakel et.al. 2501.08219 null
2025-01-14 ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems Mohita Chowdhury et.al. 2501.08208 null
2025-01-14 ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving Zain Ul Abedin et.al. 2501.08203 null
2025-01-14 CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Jinjun Peng et.al. 2501.08200 link
2025-01-14 OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Yijiong Yu et.al. 2501.08197 link
2025-01-14 PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving Ahmet Caner Yüzügüler et.al. 2501.08192 null
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-15 A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following Yin Fang et.al. 2501.08187 link
2025-01-14 Potential and Perils of Large Language Models as Judges of Unstructured Textual Data Rewina Bedemariam et.al. 2501.08167 null
2025-01-14 I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution Soohyeon Choi et.al. 2501.08165 null
2025-01-14 Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data Phai Vu Dinh et.al. 2501.08149 null
2025-01-14 Refusal Behavior in Large Language Models: A Nonlinear Perspective Fabian Hildebrandt et.al. 2501.08145 link
2025-01-14 Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying Jonathan Lyhs et.al. 2501.08142 null
2025-01-14 Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 Seamie Hayes et.al. 2501.08118 null
2025-01-15 Consistency of Responses and Continuations Generated by Large Language Models on Social Media Wenlu Fan et.al. 2501.08102 null
2025-01-14 Hierarchical Autoscaling for Large Language Model Serving with Chiron Archit Patke et.al. 2501.08090 null
2025-01-14 Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving Nert Keser et.al. 2501.08083 null
2025-01-14 CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning Guoliang He et.al. 2501.08071 link
2025-01-14 A Roadmap to Guide the Integration of LLMs in Hierarchical Planning Israel Puerta-Merino et.al. 2501.08068 null
2025-01-14 Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT Awritrojit Banerjee et.al. 2501.08053 null
2025-01-14 TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Yao Liang et.al. 2501.08008 null
2025-01-14 LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS Muhammad Ashfaq et.al. 2501.07992 null
2025-01-14 Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jiaxing Zhao et.al. 2501.07978 link
2025-01-14 Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models Yifang Xu et.al. 2501.07972 null
2025-01-14 Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning Jiaqi Hua et.al. 2501.07959 link
2025-01-14 AI Guide Dog: Egocentric Path Prediction on Smartphone Aishwarya Jadhav et.al. 2501.07957 null
2025-01-14 Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations Waqar Hussain et.al. 2501.07931 null
2025-01-14 Gandalf the Red: Adaptive Security for LLMs Niklas Pfister et.al. 2501.07927 link
2025-01-14 VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models Hui Kuurila-Zhang et.al. 2501.07922 link
2025-01-14 Large Language Model Interface for Home Energy Management Systems François Michelon et.al. 2501.07919 null
2025-01-14 Bridge-SR: Schrödinger Bridge for Efficient SR Chang Li et.al. 2501.07897 null
2025-01-14 Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs Shuai Wang et.al. 2501.07892 null
2025-01-14 ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding Zhongxiang Sun et.al. 2501.07861 null
2025-01-14 Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques Shobhit Ratan et.al. 2501.07853 null
2025-01-14 Unveiling Provider Bias in Large Language Models for Code Generation Xiaoyu Zhang et.al. 2501.07849 null
2025-01-14 Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning Haoyu Han et.al. 2501.07845 null
2025-01-14 A Driver Advisory System Based on Large Language Model for High-speed Train Y. C. Luo et.al. 2501.07837 null
2025-01-14 Flow: A Modular Approach to Automated Agentic Workflow Generation Boye Niu et.al. 2501.07834 link
2025-01-14 Real-time Verification and Refinement of Language Model Text Generation Joonho Ko et.al. 2501.07824 null
2025-01-14 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding Haomiao Xiong et.al. 2501.07819 link
2025-01-14 A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models Kaustubh D. Dhole et.al. 2501.07818 null
2025-01-14 Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models Dhruv Dhamani et.al. 2501.07815 null
2025-01-14 Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering Feijie Wu et.al. 2501.07813 null
2025-01-14 CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation Ruwei Pan et.al. 2501.07811 null
2025-01-14 Visual Language Models as Operator Agents in the Space Domain Alejandro Carrasco et.al. 2501.07802 null
2025-01-14 Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Zhaokai Wang et.al. 2501.07783 link
2025-01-14 Symmetry-Aware Generative Modeling through Learned Canonicalization Kusha Sareen et.al. 2501.07773 null
2025-01-14 Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey Bingchen Liu et.al. 2501.07766 null
2025-01-14 On the Statistical Capacity of Deep Generative Models Edric Tam et.al. 2501.07763 link
2025-01-13 Advancing Student Writing Through Automated Syntax Feedback Kamyar Zeinalipour et.al. 2501.07740 null
2025-01-13 Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens Dongwon Kim et.al. 2501.07730 null
2025-01-13 LLMic: Romanian Foundation Language Model Vlad-Andrei Bădoiu et.al. 2501.07721 null
2025-01-13 CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory Haokun Zhao et.al. 2501.07674 null
2025-01-13 Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning Karishma Thakrar et.al. 2501.07663 null
2025-01-13 Large Language Models for Interpretable Mental Health Diagnosis Brian Hyeongseok Kim et.al. 2501.07653 null
2025-01-13 BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Weixi Feng et.al. 2501.07647 null
2025-01-13 GPT as a Monte Carlo Language Tree: A Probabilistic Perspective Kun-Peng Ning et.al. 2501.07641 null
2025-01-13 SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models Fabien Bernier et.al. 2501.07639 null
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-13 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Chengzu Li et.al. 2501.07542 null
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 null
2025-01-13 Investigating Large Language Models in Inferring Personality Traits from User Conversations Jianfeng Zhu et.al. 2501.07532 null
2025-01-13 RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment Difei Gu et.al. 2501.07525 link
2025-01-13 Parallel Key-Value Cache Fusion for Position Invariant RAG Philhoon Oh et.al. 2501.07523 null
2025-01-13 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards Yangsibo Huang et.al. 2501.07493 null
2025-01-13 TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models Thales Sales Almeida et.al. 2501.07482 null
2025-01-13 A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities Yihao Liu et.al. 2501.07468 null
2025-01-13 Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI Rolf Pfister et.al. 2501.07458 null
2025-01-13 Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection Xin Yin et.al. 2501.07425 null
2025-01-13 Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion Lala Shakti Swarup Ray et.al. 2501.07408 null
2025-01-13 OCORD: Open-Campus Object Removal Dataset Shuo Zhang et.al. 2501.07397 null
2025-01-13 Simulating the Hubbard Model with Equivariant Normalizing Flows Dominic Schuh et.al. 2501.07371 null
2025-01-13 Emergent effects of scaling on the functional hierarchies within large language models Paul C. Bogdan et.al. 2501.07359 null
2025-01-13 Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring Buse Sibel Korkmaz et.al. 2501.07324 link
2025-01-13 FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering Erik Henriksson et.al. 2501.07314 link
2025-01-13 The Lessons of Developing Process Reward Models in Mathematical Reasoning Zhenru Zhang et.al. 2501.07301 null
2025-01-13 GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction Oleg Kobzarev et.al. 2501.07295 null
2025-01-13 LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks Zan-Kai Chong et.al. 2501.07288 null
2025-01-13 Lifelong Learning of Large Language Model based Agents: A Roadmap Junhao Zheng et.al. 2501.07278 link
2025-01-13 Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation Amir Sartipi et.al. 2501.07276 null
2025-01-13 Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics Wonduk Seo et.al. 2501.07267 null
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-13 EdgeTAM: On-Device Track Anything Model Chong Zhou et.al. 2501.07256 null
2025-01-13 Large Language Models: New Opportunities for Access to Science Jutta Schnabel et.al. 2501.07250 null
2025-01-13 Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training Ziqing Wen et.al. 2501.07237 link
2025-01-13 Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction Qiaoqiao Ren et.al. 2501.07224 link
2025-01-13 Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing Laifa Tao et.al. 2501.07191 null
2025-01-13 Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study Huashan Chen et.al. 2501.07165 null
2025-01-13 AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model Bangchen Yin et.al. 2501.07155 link
2025-01-13 LLM360 K2: Scaling Up 360-Open-Source Large Language Models Zhengzhong Liu et.al. 2501.07124 null
2025-01-13 How GPT learns layer by layer Jason Du et.al. 2501.07108 link
2025-01-13 ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training Jiayang Wu et.al. 2501.07078 link
2025-01-13 D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation Zhejun Zhang et.al. 2501.07077 link
2025-01-13 Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values Jing Yao et.al. 2501.07071 null
2025-01-13 Enhancing Image Generation Fidelity via Progressive Prompts Zhen Xiong et.al. 2501.07070 link
2025-01-13 Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities ZeKe Xiao et.al. 2501.07058 null
2025-01-13 SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation Yee-Fan Tan et.al. 2501.07055 null
2025-01-13 PoAct: Policy and Action Dual-Control Agent for Generalized Applications Guozhi Yuan et.al. 2501.07054 null
2025-01-13 ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction Yan Zhang et.al. 2501.07051 link
2025-01-13 Unveiling the Potential of Text in High-Dimensional Time Series Forecasting Xin Zhou et.al. 2501.07048 link
2025-01-13 Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis Luwei Zeng et.al. 2501.07034 null
2025-01-13 A Proposed Large Language Model-Based Smart Search for Archive System Ha Dung Nguyen et.al. 2501.07024 null
2025-01-13 Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps Henry Li et.al. 2501.06999 link
2025-01-13 LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models Mozhgan Nasr Azadani et.al. 2501.06986 link
2025-01-13 Combining LLM decision and RL action selection to improve RL policy for adaptive interventions Karine Karine et.al. 2501.06980 null
2025-01-12 How is Google using AI for internal code migrations? Stoyan Nikolov et.al. 2501.06972 null
2025-01-12 Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives Xinyao Ma et.al. 2501.06964 null
2025-01-12 Comparison of Autoencoders for tokenization of ASL datasets Vouk Praun-Petrovic et.al. 2501.06942 null
2025-01-12 Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy Evgeny Ugolkov et.al. 2501.06939 link
2025-01-12 Harnessing Large Language Models for Disaster Management: A Survey Zhenyu Lei et.al. 2501.06932 null
2025-01-12 Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories Faaiq Waqar et.al. 2501.06921 null
2025-01-12 Risk-Averse Finetuning of Large Language Models Sapana Chaudhary et.al. 2501.06911 link
2025-01-12 Deep Learning and Foundation Models for Weather Prediction: A Survey Jimeng Shi et.al. 2501.06907 null
2025-01-12 A Foundational Generative Model for Breast Ultrasound Image Analysis Haojun Yu et.al. 2501.06869 null
2025-01-12 Transfer Learning of Tabular Data by Finetuning Large Language Models Shourav B. Rabbani et.al. 2501.06863 null
2025-01-12 A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context Noureldin Zahran et.al. 2501.06859 null
2025-01-12 SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training Tianjin Huang et.al. 2501.06842 link
2025-01-12 An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering Zaber Al Hassan Ayon et.al. 2501.06837 null
2025-01-12 X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding Wenqi Zhou et.al. 2501.06835 null
2025-01-12 LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents Augusto Gonzalez-Bonorino et.al. 2501.06834 link
2025-01-12 GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing Ruizhe Ou et.al. 2501.06828 null
2025-01-12 Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification Shijing Chen et.al. 2501.06827 null
2025-01-12 Event Argument Extraction with Enriched Prompts Chen Liang et.al. 2501.06825 link
2025-01-12 A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT Yizhou Zhou et.al. 2501.06819 null
2025-01-12 RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models Keyan Chen et.al. 2501.06809 link
2025-01-12 Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting Yongshuo Zhu et.al. 2501.06808 null
2025-01-12 MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference Wenxuan Zeng et.al. 2501.06807 null
2025-01-12 Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences Liu Yu et.al. 2501.06795 null
2025-01-12 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes Mahmoud Ahmed et.al. 2501.06785 link
2025-01-12 Cost-Effective Robotic Handwriting System with AI Integration Tianyi Huang et.al. 2501.06783 null
2025-01-12 Eliza: A Web3 friendly AI Agent Operating System Shaw Walters et.al. 2501.06781 link
2025-01-12 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Ji Soo Lee et.al. 2501.06761 link
2025-01-12 Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation Shunfan Zheng et.al. 2501.06741 null
2025-01-12 ZOQO: Zero-Order Quantized Optimization Noga Bar et.al. 2501.06736 null
2025-01-12 Better Prompt Compression Without Multi-Layer Perceptrons Edouardo Honig et.al. 2501.06730 null
2025-01-12 Measuring the Robustness of Reference-Free Dialogue Evaluation Systems Justin Vasselli et.al. 2501.06728 link
2025-01-12 Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G Zhiyan Liu et.al. 2501.06726 null
2025-01-12 DRDT3: Diffusion-Refined Decision Test-Time Training Model Xingshuai Huang et.al. 2501.06718 null
2025-01-12 ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian Mykyta Syromiatnikov et.al. 2501.06715 link
2025-01-12 Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management Liu Qianli et.al. 2501.06709 null
2025-01-12 Evaluating Sample Utility for Data Selection by Mimicking Model Weights Tzu-Heng Huang et.al. 2501.06708 null
2025-01-12 AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds Yinfang Chen et.al. 2501.06706 null
2025-01-12 Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese Jie Yang et.al. 2501.06704 null
2025-01-12 Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users’ Questions Aidan Hogan et.al. 2501.06699 null
2025-01-12 DVM: Towards Controllable LLM Agents in Social Deduction Games Zheng Zhang et.al. 2501.06695 null
2025-01-12 TAPO: Task-Referenced Adaptation for Prompt Optimization Wenxin Luo et.al. 2501.06689 link
2025-01-12 Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning Xiangen Hu et.al. 2501.06682 null
2025-01-12 Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving Haoxiang Gao et.al. 2501.06680 null
2025-01-11 Challenging reaction prediction models to generalize to novel chemistry John Bradshaw et.al. 2501.06669 link
2025-01-11 Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training Sanjit Kakarla et.al. 2501.06658 link
2025-01-11 FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings Tong Liu et.al. 2501.06645 null
2025-01-11 Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models Veronika Smilga et.al. 2501.06638 link
2025-01-11 Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach Mohammed Maree et.al. 2501.06628 null
2025-01-11 Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks Amr Almorsi et.al. 2501.06625 null
2025-01-11 Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks Xuanhao Luo et.al. 2501.06604 null
2025-01-11 ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation Xuanle Zhao et.al. 2501.06598 link
2025-01-11 ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning Xiangru Tang et.al. 2501.06590 link
2025-01-11 Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping Muru Zhang et.al. 2501.06589 link
2025-01-10 LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Omkar Thawakar et.al. 2501.06186 link
2025-01-10 PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Yangyu Huang et.al. 2501.06184 null
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-10 GenMol: A Drug Discovery Generalist with Discrete Diffusion Seul Lee et.al. 2501.06158 null
2025-01-10 Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories Gerd Kortemeyer et.al. 2501.06143 null
2025-01-10 Supervision policies can shape long-term risk management in general-purpose AI models Manuel Cebrian et.al. 2501.06137 link
2025-01-10 Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI Yuya Asano et.al. 2501.06129 null
2025-01-10 Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding Fabian David Schmidt et.al. 2501.06117 link
2025-01-10 From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy Elham Aghakhani et.al. 2501.06101 null
2025-01-10 Photokinetics of Photothermal Reactions Mounir Maafi et.al. 2501.06057 null
2025-01-10 AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery Johann Wenckstern et.al. 2501.06039 link
2025-01-10 Addressing speaker gender bias in large scale speech translation systems Shubham Bansal et.al. 2501.05989 null
2025-01-10 Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing Eklavya Sarkar et.al. 2501.05987 link
2025-01-10 Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys Divya Mani Adhikari et.al. 2501.05985 null
2025-01-10 Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea Eunjung Cho et.al. 2501.05981 null
2025-01-10 Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory Yunmeng Shu et.al. 2501.05965 null
2025-01-10 Effective faking of verbal deception detection with target-aligned adversarial attacks Bennett Kleinberg et.al. 2501.05962 null
2025-01-10 Reusable specimen-level inference in computational pathology Jakub R. Kaczmarzyk et.al. 2501.05945 link
2025-01-10 DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information Yongfan Lai et.al. 2501.05932 link
2025-01-10 LLMs Reproduce Stereotypes of Sexual and Gender Minorities Ruby Ostrow et.al. 2501.05926 null
2025-01-10 Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction Petraq Nako et.al. 2501.05925 null
2025-01-10 Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Ziheng Wu et.al. 2501.05901 link
2025-01-10 Prompt engineering and its implications on the energy consumption of Large Language Models Riccardo Rubei et.al. 2501.05899 link
2025-01-10 Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs Bianca Raimondi et.al. 2501.05891 link
2025-01-10 Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs Dabing Cheng et.al. 2501.05884 null
2025-01-10 VideoRAG: Retrieval-Augmented Generation over Video Corpus Soyeong Jeong et.al. 2501.05874 null
2025-01-10 ConSim: Measuring Concept-Based Explanations’ Effectiveness with Automated Simulatability Antonin Poché et.al. 2501.05855 link
2025-01-10 Understanding Impact of Human Feedback via Influence Functions Taywon Min et.al. 2501.05790 link
2025-01-10 Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models You Li et.al. 2501.05767 null
2025-01-10 Controlling Large Language Models Through Concept Activation Vectors Hanyu Zhang et.al. 2501.05764 null
2025-01-10 StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation Shangjin Zhai et.al. 2501.05763 null
2025-01-10 CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech Madhurananda Pahar et.al. 2501.05755 null
2025-01-10 Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models Sungjae Lee et.al. 2501.05752 null
2025-01-10 TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos Korawat Charoenpitaks et.al. 2501.05733 link
2025-01-10 Enabling Scalable Oversight via Self-Evolving Critic Zhengyang Tang et.al. 2501.05727 null
2025-01-10 I Can’t Share Code, but I need Translation – An Empirical Study on Code Translation through Federated LLM Jahnavi Kumar et.al. 2501.05724 null
2025-01-10 How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond Chen Huang et.al. 2501.05714 null
2025-01-10 Multi-Step Reasoning in Korean and the Emergent Mirage Guijin Son et.al. 2501.05712 null
2025-01-10 EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model Yi He et.al. 2501.05710 null
2025-01-10 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Vighnesh Subramaniam et.al. 2501.05707 null
2025-01-10 Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness Audrey Salmon et.al. 2501.05706 null
2025-01-10 Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection Feiyi Chen et.al. 2501.05675 null
2025-01-10 Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration Zuyuan Zhang et.al. 2501.05673 null
2025-01-10 Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models Zheqi Lv et.al. 2501.05662 null
2025-01-10 Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation Zheqi Lv et.al. 2501.05647 null
2025-01-10 Iconicity in Large Language Models Anna Marklová et.al. 2501.05643 null
2025-01-10 HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection Anant Mehta et.al. 2501.05631 link
2025-01-10 The Impact of Model Scaling on Seen and Unseen Language Performance Rhitabrat Pokharel et.al. 2501.05629 null
2025-01-09 Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study Zhenyu Qi et.al. 2501.05625 null
2025-01-09 Exploring Large Language Models for Translating Romanian Computational Problems into English Adrian Marius Dumitran et.al. 2501.05601 null
2025-01-09 Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics Gert Aarts et.al. 2501.05580 null
2025-01-09 Exploring Large Language Models (LLMs) through interactive Python activities Eugenio Tufino et.al. 2501.05577 link
2025-01-09 LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts Yuri Facanha Bezerra et.al. 2501.05554 link
2025-01-09 The dynamics of meaning through time: Assessment of Large Language Models Mohamed Taher Alrefaie et.al. 2501.05552 null
2025-01-09 Infecting Generative AI With Viruses David Noever et.al. 2501.05542 null
2025-01-09 NSChat: A Chatbot System To Rule Them All Zenon Lamprou et.al. 2501.05541 null
2025-01-09 ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Xingyu Fu et.al. 2501.05452 null
2025-01-09 Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Yifan Yu et.al. 2501.05446 link
2025-01-09 Consistent Flow Distillation for Text-to-3D Generation Runjie Yan et.al. 2501.05445 null
2025-01-09 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Yunzhuo Hao et.al. 2501.05444 link
2025-01-09 A survey of textual cyber abuse detection using cutting-edge language models and large language models Jose A. Diaz-Garcia et.al. 2501.05443 null
2025-01-09 Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation Xuyi Meng et.al. 2501.05427 null
2025-01-09 Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers Jerry Chongyi Hu et.al. 2501.05423 null
2025-01-09 Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation Darius Petermann et.al. 2501.05413 null
2025-01-10 Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics Maximilian Alber et.al. 2501.05409 null
2025-01-09 TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts Yu-Hao Huang et.al. 2501.05403 link
2025-01-09 Mechanistic understanding and validation of large AI models with SemanticLens Maximilian Dreyer et.al. 2501.05398 null
2025-01-09 FairCode: Evaluating Social Bias of LLMs in Code Generation Yongkang Du et.al. 2501.05396 link
2025-01-09 Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models Kristian G. Barman et.al. 2501.05382 null
2025-01-09 Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance Dimitrios Gerogiannis et.al. 2501.05379 null
2025-01-09 Accelerated Diffusion Models via Speculative Sampling Valentin De Bortoli et.al. 2501.05370 null
2025-01-09 Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction Hantao Lou et.al. 2501.05336 link
2025-01-09 “What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles Xuewen Luo et.al. 2501.05322 null
2025-01-09 Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning Nora Gourmelon et.al. 2501.05281 link
2025-01-09 CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models Fabian Hörst et.al. 2501.05269 link
2025-01-09 Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal Wanli Ma et.al. 2501.05265 null
2025-01-09 CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models Yewei Song et.al. 2501.05255 null
2025-01-09 From Scientific Texts to Verifiable Code: Automating the Process with Transformers Changjie Wang et.al. 2501.05252 null
2025-01-09 RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models Peizhuo Lv et.al. 2501.05249 null
2025-01-09 Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning Laura Puccioni et.al. 2501.05248 null
2025-01-09 Online Prompt and Solver Selection for Program Synthesis Yixuan Li et.al. 2501.05247 null
2025-01-09 Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs Artem Fedorchenko et.al. 2501.05234 null
2025-01-09 Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection Pei-Kang Lee et.al. 2501.05228 null
2025-01-09 Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Ludwic Leonard et.al. 2501.05226 null
2025-01-09 Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond Tomas Goldsack et.al. 2501.05224 null
2025-01-09 A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education Ziqing Li et.al. 2501.05220 null
2025-01-09 Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration Xuyang Liu et.al. 2501.05179 link
2025-01-09 Emergence of human-like polarization among large language model agents Jinghua Piao et.al. 2501.05171 null
2025-01-09 Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering Matteo Esposito et.al. 2501.05165 null
2025-01-09 Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier Yufei Shang et.al. 2501.05155 null
2025-01-09 DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving Xuran Zheng et.al. 2501.05081 null
2025-01-09 Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization Harshith Manjunath et.al. 2501.05079 null
2025-01-09 Analyzing Memorization in Large Language Models through the Lens of Model Attribution Tarun Ram Menta et.al. 2501.05078 link
2025-01-09 A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model Shuo Tong et.al. 2501.05075 null
2025-01-09 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Huabin Liu et.al. 2501.05069 null
2025-01-09 LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding Jiaxing Zhao et.al. 2501.05067 null
2025-01-09 Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators Leonard Lupin-Jimenez et.al. 2501.05058 null
2025-01-09 LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models Zengqi Peng et.al. 2501.05057 null
2025-01-09 On the Generalizability of Transformer Models to Code Completions of Different Lengths Nathan Cooper et.al. 2501.05051 null
2025-01-09 SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution Chengxing Xie et.al. 2501.05040 link
2025-01-09 Enhancing Human-Like Responses in Large Language Models Ethem Yağız Çalık et.al. 2501.05032 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-09 A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications Ofir Marom et.al. 2501.05030 null
2025-01-09 TreeKV: Smooth Key-Value Cache Compression with Tree Structures Ziwei He et.al. 2501.04987 null
2025-01-09 SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs Muhammad Salman et.al. 2501.04985 null
2025-01-09 V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer Hangzhou He et.al. 2501.04975 link
2025-01-09 Demystifying Domain-adaptive Post-training for Financial LLMs Zixuan Ke et.al. 2501.04961 link
2025-01-09 Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments Yifan Xu et.al. 2501.04947 null
2025-01-09 Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models Qingyu Ren et.al. 2501.04945 link
2025-01-09 Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency Shiji Zhao et.al. 2501.04931 null
2025-01-09 Investigating Numerical Translation with Large Language Models Wei Tang et.al. 2501.04927 null
2025-01-09 FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching Jun-Hak Yun et.al. 2501.04926 link
2025-01-09 HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers Yiyao Yang et.al. 2501.04908 link
2025-01-09 JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis Jun-Hyeok Cha et.al. 2501.04904 null
2025-01-09 ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries Keke Huang et.al. 2501.04901 null
2025-01-09 SUGAR: Leveraging Contextual Confidence for Smarter Retrieval Hanna Zubkova et.al. 2501.04899 null
2025-01-08 Leveraging Log Probabilities in Language Models to Forecast Future Events Tommaso Soru et.al. 2501.04880 null
2025-01-08 Real-Time Textless Dialogue Generation Long Mai et.al. 2501.04877 link
2025-01-08 Modelling complex proton transport phenomena – Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields Malte Grunert et.al. 2501.04876 null
2025-01-08 Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware Brandon J Walton et.al. 2501.04848 null
2025-01-08 Do Code LLMs Understand Design Patterns? Zhenyu Pan et.al. 2501.04835 null
2025-01-08 On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability Andreas Vogelsang et.al. 2501.04810 null
2025-01-08 IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX Erik Recio-Armengol et.al. 2501.04776 link
2025-01-08 Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations Kirandeep Kaur et.al. 2501.04762 null
2025-01-08 Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch Phillip Richter et.al. 2501.04755 null
2025-01-08 EditAR: Unified Conditional Generation with Autoregressive Models Jiteng Mu et.al. 2501.04699 null
2025-01-08 Re-ranking the Context for Multimodal Retrieval Augmented Generation Matin Mortaheb et.al. 2501.04695 null
2025-01-08 SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Zixuan Huang et.al. 2501.04689 null
2025-01-08 URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Ruilin Luo et.al. 2501.04686 link
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 null
2025-01-08 Assessing Language Comprehension in Large Language Models Using Construction Grammar Wesley Scivetti et.al. 2501.04661 null
2025-01-08 Multi-task retriever fine-tuning for domain-specific and efficient RAG Patrice Béchard et.al. 2501.04652 null
2025-01-08 FlairGPT: Repurposing LLMs for Interior Designs Gabrielle Littlefair et.al. 2501.04648 null
2025-01-08 Knowledge Retrieval Based on Generative AI Te-Lun Yang et.al. 2501.04635 null
2025-01-08 “Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era Giulio Antonio Abbo et.al. 2501.04633 null
2025-01-09 MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation Daniele Molino et.al. 2501.04614 null
2025-01-08 Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning Ivan Kankeu et.al. 2501.04591 link
2025-01-08 Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models Miaoyang He et.al. 2501.04582 null
2025-01-08 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Yuhang Liu et.al. 2501.04575 link
2025-01-09 OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis Run Luo et.al. 2501.04561 link
2025-01-08 The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? Christopher Lazik et.al. 2501.04543 null
2025-01-08 Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time Uri Berger et.al. 2501.04513 null
2025-01-08 CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection Ruijun Feng et.al. 2501.04510 null
2025-01-08 Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction Guofeng Yang et.al. 2501.04487 null
2025-01-08 When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages Archchana Sindhujan et.al. 2501.04473 null
2025-01-08 Hidden Entity Detection from GitHub Leveraging Large Language Models Lu Gan et.al. 2501.04455 link
2025-01-08 Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions Doaa Mahmud et.al. 2501.04437 null
2025-01-08 Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions Na Yan et.al. 2501.04436 null
2025-01-08 End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach H. M. Shadman Tabib et.al. 2501.04425 null
2025-01-08 SEO: Stochastic Experience Optimization for Large Language Models Jitao Xu et.al. 2501.04393 null
2025-01-08 iFADIT: Invertible Face Anonymization via Disentangled Identity Transform Lin Yuan et.al. 2501.04390 null
2025-01-08 DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications Feng Liu et.al. 2501.04366 link
2025-01-08 Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting Dong-Hai Zhu et.al. 2501.04341 link
2025-01-09 Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models Haonan Shi et.al. 2501.04323 null
2025-01-08 Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts Preethi Seshadri et.al. 2501.04316 link
2025-01-08 RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation Jun Liu et.al. 2501.04315 null
2025-01-08 Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models Kunpeng Zhang et.al. 2501.04312 null
2025-01-08 LLM4SR: A Survey on Large Language Models for Scientific Research Ziming Luo et.al. 2501.04306 link
2025-01-08 Multimodal Graph Constrastive Learning and Prompt for ChartQA Yue Dai et.al. 2501.04303 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 An Analysis of Model Robustness across Concurrent Distribution Shifts Myeongho Jeon et.al. 2501.04288 null
2025-01-08 Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models Bahman Torkamandi et.al. 2501.04286 link
2025-01-08 Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking Tianqi Ren et.al. 2501.04285 null
2025-01-08 OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments Yujie Tang et.al. 2501.04279 null
2025-01-08 Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering Christophe Bajan et.al. 2501.04277 link
2025-01-08 Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation Senwei Xie et.al. 2501.04268 null
2025-01-08 Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning Lang Xu et.al. 2501.04266 null
2025-01-08 IOLBENCH: Benchmarking LLMs on Linguistic Reasoning Satyam Goyal et.al. 2501.04249 link
2025-01-08 TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events Jian-Hua Fang et.al. 2501.04247 null
2025-01-08 Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks Rachel Longjohn et.al. 2501.04234 null
2025-01-07 Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation Alireza Salemi et.al. 2501.04167 null
2025-01-07 AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots Shaoyue Wen et.al. 2501.04156 link
2025-01-07 Multilingual Open QA on the MIA Shared Task Navya Yarrabelly et.al. 2501.04153 null
2025-01-07 The angular momentum spiral of the Milky Way disc in Gaia Rashid Yaaqib et.al. 2501.04095 null
2025-01-07 More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives Xiaoqing Zhang et.al. 2501.04070 link
2025-01-07 ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono Jingquan Wang et.al. 2501.04062 null
2025-01-07 LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Lingdong Kong et.al. 2501.04005 null
2025-01-07 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Haobo Yuan et.al. 2501.04001 link
2025-01-07 RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance Matin Mortaheb et.al. 2501.03995 null
2025-01-07 Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance Adil Rengim Cetingoz et.al. 2501.03993 null
2025-01-07 Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles Yuxi Xia et.al. 2501.03991 null
2025-01-07 (De)-Indexing and the Right to be Forgotten Salvatore Vilella et.al. 2501.03989 null
2025-01-07 VLM-driven Behavior Tree for Context-aware Task Planning Naoki Wake et.al. 2501.03968 link
2025-01-07 Vision Language Models as Values Detectors Giulio Antonio Abbo et.al. 2501.03957 null
2025-01-07 Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States Jurgita Kapočiūtė-Dzikienė et.al. 2501.03952 null
2025-01-07 Synthetic Data Privacy Metrics Amy Steier et.al. 2501.03941 null
2025-01-07 Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection Pablo Miralles-González et.al. 2501.03940 null
2025-01-07 A precise asymptotic analysis of learning diffusion models: theory and insights Hugo Cui et.al. 2501.03937 link
2025-01-07 Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study Ramya Jonnala et.al. 2501.03904 null
2025-01-07 LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Shaolei Zhang et.al. 2501.03895 link
2025-01-07 AlphaPO – Reward shape matters for LLM alignment Aman Gupta et.al. 2501.03884 null
2025-01-07 CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds Keonwoo Kim et.al. 2501.03879 null
2025-01-07 Progressive Document-level Text Simplification via Large Language Models Dengzhao Fang et.al. 2501.03857 null
2025-01-07 MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention Aadya Arora et.al. 2501.03839 null
2025-01-07 Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging Simon W. Penninga et.al. 2501.03825 null
2025-01-08 MADation: Face Morphing Attack Detection with Foundation Models Eduarda Caldeira et.al. 2501.03800 link
2025-01-07 KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration Chengyuan Li et.al. 2501.03786 null
2025-01-07 Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series Yuxiao Hu et.al. 2501.03747 null
2025-01-07 Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein Xiaotong Guo et.al. 2501.03722 null
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-07 SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment Yuchun Fan et.al. 2501.03681 link
2025-01-07 Effective and Efficient Mixed Precision Quantization of Speech Foundation Models Haoning Xu et.al. 2501.03643 null
2025-01-07 CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems Zhaonan Wu et.al. 2501.03626 link
2025-01-07 LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment Gaoussou Youssouf Kebe et.al. 2501.03624 null
2025-01-07 Cosmos World Foundation Model Platform for Physical AI NVIDIA et.al. 2501.03575 link
2025-01-07 From Code to Compliance: Assessing ChatGPT’s Utility in Designing an Accessible Webpage – A Case Study Ammar Ahmed et.al. 2501.03572 null
2025-01-07 What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs Muneera Bano et.al. 2501.03569 null
2025-01-07 Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities Benedikt Reitemeyer et.al. 2501.03566 null
2025-01-07 Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis Haoran Lai et.al. 2501.03565 null
2025-01-07 PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models Lingzhi Yuan et.al. 2501.03544 null
2025-01-07 Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions Weijieying Ren et.al. 2501.03540 null
2025-01-07 Deep Learning for Pathological Speech: A Survey Shakeel A. Sheikh et.al. 2501.03536 null
2025-01-08 SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving Xuewen Luo et.al. 2501.03535 null
2025-01-07 A generative approach for lensless imaging in low-light conditions Ziyang Liu et.al. 2501.03511 null
2025-01-07 A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models Shuyang Wang et.al. 2501.03508 null
2025-01-07 Textualize Visual Prompt for Image Editing via Diffusion Bridge Pengcheng Xu et.al. 2501.03495 null
2025-01-07 Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment Prashant Trivedi et.al. 2501.03486 null
2025-01-07 Reading with Intent – Neutralizing Intent Benjamin Reichman et.al. 2501.03475 null
2025-01-07 Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning Chuang Niu et.al. 2501.03469 link
2025-01-07 MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems Yannis Katsis et.al. 2501.03468 link
2025-01-07 ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation Yu-Cheng Liu et.al. 2501.03462 null
2025-01-07 Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation Xiao Wang et.al. 2501.03458 link
2025-01-07 CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering Jialiang Chen et.al. 2501.03447 null
2025-01-07 LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models Mohamad Fakih et.al. 2501.03446 null
2025-01-07 Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology Sarah E. Finch et.al. 2501.03441 link
2025-01-06 SALT: Sales Autocompletion Linked Business Tables Dataset Tassilo Klein et.al. 2501.03413 link
2025-01-06 BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations Simone Giovannini et.al. 2501.03403 null
2025-01-06 DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes Xuyang Wang et.al. 2501.03397 link
2025-01-06 Evolved Quantum Boltzmann Machines Michele Minervini et.al. 2501.03367 null
2025-01-06 CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets Tanay Agrawal et.al. 2501.03332 null
2025-01-06 LiLMaps: Learnable Implicit Language Maps Evgenii Kruzhkov et.al. 2501.03304 null
2025-01-06 A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval Shuo Tong et.al. 2501.03295 null
2025-01-06 Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model Naibo Wang et.al. 2501.03292 null
2025-01-06 ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning Pengwei Tang et.al. 2501.03291 link
2025-01-06 CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models Zhenyu Xu et.al. 2501.03288 null
2025-01-06 BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Beichen Zhang et.al. 2501.03226 link
2025-01-06 Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text Ayat Najjar et.al. 2501.03212 null
2025-01-06 Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity Ayat A. Najjar et.al. 2501.03203 null
2025-01-06 CLIX: Cross-Lingual Explanations of Idiomatic Expressions Aaron Gluck et.al. 2501.03191 null
2025-01-06 Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text Ali Al-Lawati et.al. 2501.03166 link
2025-01-06 Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy Risha Goel et.al. 2501.03153 link
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 null
2025-01-06 VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity Yerong Li et.al. 2501.03139 null
2025-01-07 PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Mingyang Song et.al. 2501.03124 link
2025-01-06 CAT: Content-Adaptive Image Tokenization Junhong Shen et.al. 2501.03120 null
2025-01-06 LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases Dylan Bouchard et.al. 2501.03112 link
2025-01-06 Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling Aseem Srivastava et.al. 2501.03088 null
2025-01-06 Retrieval-Augmented TLAPS Proof Generation with Large Language Models Yuhao Zhou et.al. 2501.03073 null
2025-01-06 ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events Duygu Sezen Islakoglu et.al. 2501.03040 null
2025-01-06 Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning Zhen Li et.al. 2501.03035 null
2025-01-06 TransPixar: Advancing Text-to-Video Generation with Transparency Luozhou Wang et.al. 2501.03006 link
2025-01-06 CALM: Curiosity-Driven Auditing for Large Language Models Xiang Zheng et.al. 2501.02997 link
2025-01-06 Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation Zhi Qu et.al. 2501.02979 link
2025-01-06 FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2501.02968 null
2025-01-07 Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild Wanpeng Hu et.al. 2501.02964 link
2025-01-07 SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild Jiawei Liu et.al. 2501.02962 null
2025-01-06 The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features Shi Bin Hoo et.al. 2501.02945 link
2025-01-07 Inhibition of bacterial growth by antibiotics Barnabe Ledoux et.al. 2501.02944 null
2025-01-06 Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions Jianhua Pei et.al. 2501.02928 null
2025-01-06 DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model Hao Yu et.al. 2501.02901 link
2025-01-06 FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection Guray Ozgur et.al. 2501.02892 link
2025-01-06 MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs Hui Sun et.al. 2501.02885 null
2025-01-06 IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment Yiming Zhang et.al. 2501.02869 null
2025-01-06 Large Language Models for Video Surveillance Applications Ulindu De Silva et.al. 2501.02850 null
2025-01-06 Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification Yubo Wang et.al. 2501.02844 null
2025-01-06 Foundations of GenIR Qingyao Ai et.al. 2501.02842 null
2025-01-06 An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models Dezhi Ran et.al. 2501.02829 null
2025-01-06 InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion Zhaoyi Yan et.al. 2501.02795 null
2025-01-06 CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation Yuanhong Chen et.al. 2501.02786 null
2025-01-06 GeAR: Generation Augmented Retrieval Haoyu Liu et.al. 2501.02772 null
2025-01-06 Visual Large Language Models for Generalized and Specialized Applications Yifan Li et.al. 2501.02765 link
2025-01-06 Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? Hongyi Miao et.al. 2501.02751 null
2025-01-06 Artificial Intelligence in Creative Industries: Advances Prior to 2025 Nantheera Anantrasirichai et.al. 2501.02725 null
2025-01-06 KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models Zaiyi Zheng et.al. 2501.02711 null
2025-01-06 QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance Binita Saha et.al. 2501.02702 null
2025-01-06 EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models Andrés Villa et.al. 2501.02699 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-05 Decoding specialised feature neurons in LLMs with the final projection layer Harry J Davies et.al. 2501.02688 null
2025-01-05 From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering Wen-ran Li et.al. 2501.02680 null
2025-01-05 A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model Shivaram Kalyanakrishnan et.al. 2501.02652 null
2025-01-05 Representation Learning of Lab Values via Masked AutoEncoder David Restrepo et.al. 2501.02648 link
2025-01-05 Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense Yang Ouyang et.al. 2501.02629 link
2025-01-05 Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets Mahmoud Jahanshahi et.al. 2501.02628 null
2025-01-05 HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Saleh Ashkboos et.al. 2501.02625 link
2025-01-05 LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment Yifei Liu et.al. 2501.02621 null
2025-01-05 TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms Jovan Stojkovic et.al. 2501.02600 null
2025-01-05 LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations Jiaping Wang et.al. 2501.02573 link
2025-01-05 Multi-LLM Collaborative Caption Generation in Scientific Documents Jaeyoung Kim et.al. 2501.02552 link
2025-01-05 Transformers Simulate MLE for Sequence Generation in Bayesian Networks Yuan Cao et.al. 2501.02547 null
2025-01-05 Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm Ljubisa Bojic et.al. 2501.02532 null
2025-01-05 Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI Ljubisa Bojic et.al. 2501.02531 null
2025-01-05 Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks Leo Franklin et.al. 2501.02527 null
2025-01-05 Unified Guidance for Geometry-Conditioned Molecular Generation Sirine Ayadi et.al. 2501.02526 null
2025-01-05 Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors Minglin Chen et.al. 2501.02519 null
2025-01-05 CHAIR-Classifier of Hallucination as Improver Ao Sun et.al. 2501.02518 link
2025-01-05 ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Junjie Ye et.al. 2501.02506 null
2025-01-05 Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons Sebastian Morel-Balbi et.al. 2501.02505 null
2025-01-05 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null
2025-01-05 LLMPC: Large Language Model Predictive Control Gabriel Maher et.al. 2501.02486 link
2025-01-05 Decoding News Bias: Multi Bias Detection in News Articles Bhushan Santosh Shah et.al. 2501.02482 null
2025-01-05 Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine Yishen Liu et.al. 2501.02471 null
2025-01-05 Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera Yuliang Guo et.al. 2501.02464 link
2025-01-05 Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications Zhe Chen et.al. 2501.02460 null
2025-01-05 Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap Hyunwoo Ko et.al. 2501.02448 null
2025-01-05 RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework Kun Wang et.al. 2501.02446 null
2025-01-05 A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models Yinpeng Cai et.al. 2501.02441 null
2025-01-05 Efficient Deployment of Large Language Models on Resource-constrained Devices Zhiwei Yao et.al. 2501.02438 null
2025-01-05 FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance Haicheng Wang et.al. 2501.02430 link
2025-01-05 GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems Mehmet Deniz Türkmen et.al. 2501.02408 null
2025-01-04 Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities Tara Radvand et.al. 2501.02406 null
2025-01-04 Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Markus J. Buehler et.al. 2501.02393 link
2025-01-04 Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations Kangyu Zhu et.al. 2501.02385 null
2025-01-04 Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison Tsz Kin Lam et.al. 2501.02370 null
2025-01-04 Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving Sanghyun Park et.al. 2501.02348 null
2025-01-04 Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support Florian Putz et.al. 2501.02346 null
2025-01-04 UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility Yonglin Tian et.al. 2501.02341 link
2025-01-04 AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference Zhuomin He et.al. 2501.02336 link
2025-01-04 Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications Jodi M. Casabianca et.al. 2501.02334 null
2025-01-04 Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance Marta Gentiloni-Silveri et.al. 2501.02298 null
2025-01-04 Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection Yachao Zhao et.al. 2501.02295 null
2025-01-04 Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control Yongjeong Oh et.al. 2501.02273 null
2025-01-04 What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph Yutao Jiang et.al. 2501.02268 link
2025-01-04 Unsupervised Class Generation to Expand Semantic Segmentation Datasets Javier Montalvo et.al. 2501.02264 null
2025-01-04 Financial Named Entity Recognition: How Far Can LLM Go? Yi-Te Lu et.al. 2501.02237 link
2025-01-04 Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends Camille Barboule et.al. 2501.02235 null
2025-01-04 Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection S M Mostaq Hossain et.al. 2501.02229 null
2025-01-04 Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation Shijie Wang et.al. 2501.02226 null
2025-01-04 Can ChatGPT implement finite element models for geotechnical engineering applications? Taegu Kim et.al. 2501.02199 null
2025-01-04 EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks Shixuan Liu et.al. 2501.02192 null
2025-01-04 On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing Jianwei Wang et.al. 2501.02191 link
2025-01-04 The Application of Large Language Models in Recommendation Systems Peiyang Yu et.al. 2501.02178 null
2025-01-04 The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit Huixue Zhou et.al. 2501.02173 null
2025-01-04 Personalized Graph-Based Retrieval for Large Language Models Steven Au et.al. 2501.02157 link
2025-01-04 Table as Thought: Exploring Structured Thoughts in LLM Reasoning Zhenjie Sun et.al. 2501.02152 null
2025-01-04 Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN Yanxi Chen et.al. 2501.02146 null
2025-01-03 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Chaoyou Fu et.al. 2501.01957 link
2025-01-03 Metadata Conditioning Accelerates Language Model Pre-training Tianyu Gao et.al. 2501.01956 link
2025-01-03 MADGEN – Mass-Spec attends to De Novo Molecular generation Yinkai Wang et.al. 2501.01950 null
2025-01-03 Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap Weizhi Zhang et.al. 2501.01945 link
2025-01-03 Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models Manh Duong Nguyen et.al. 2501.01932 link
2025-01-03 Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Yifan Du et.al. 2501.01904 link
2025-01-03 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Siyuan Huang et.al. 2501.01895 null
2025-01-03 Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions Rachneet Sachdeva et.al. 2501.01872 link
2025-01-03 Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification Xiangxiang Dai et.al. 2501.01849 link
2025-01-03 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 null
2025-01-03 Time Series Language Model for Descriptive Caption Generation Mohamed Trabelsi et.al. 2501.01832 null
2025-01-03 Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models Yanjiang Liu et.al. 2501.01830 null
2025-01-03 SDPO: Segment-Level Direct Preference Optimization for Social Agents Aobo Kong et.al. 2501.01821 link
2025-01-03 BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction Ferhat Ozgur Catak et.al. 2501.01802 link
2025-01-03 Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation Mohammad Khalil et.al. 2501.01793 link
2025-01-03 Efficient LLM Inference with Activation Checkpointing and Hybrid Caching Sanghyeon Lee et.al. 2501.01792 null
2025-01-03 Nonparametric estimation of a factorizable density using diffusion models Hyeok Kyu Kwon et.al. 2501.01783 null
2025-01-03 SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation Mingjie Li et.al. 2501.01765 null
2025-01-03 Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models Andrea Matteazzi et.al. 2501.01761 null
2025-01-03 MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling Simon Rouard et.al. 2501.01757 null
2025-01-03 Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation Kangcheng Luo et.al. 2501.01743 null
2025-01-03 How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models Simone Corbo et.al. 2501.01741 null
2025-01-03 AR4D: Autoregressive 4D Generation from Monocular Videos Hanxin Zhu et.al. 2501.01722 null
2025-01-03 Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models Guosheng Zhang et.al. 2501.01720 null
2025-01-03 LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries Michal Kuk et.al. 2501.01711 null
2025-01-03 MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders Jiajun Cao et.al. 2501.01709 null
2025-01-03 AgentRefine: Enhancing Agent Generalization through Refinement Tuning Dayuan Fu et.al. 2501.01702 null
2025-01-03 Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models Lei Tang et.al. 2501.01679 null
2025-01-03 Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption Zhang Ruoyan et.al. 2501.01672 null
2025-01-03 BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction Alaeddine Diaf et.al. 2501.01664 null
2025-01-03 Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning Danni Peng et.al. 2501.01653 null
2025-01-03 MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments Cai Yin et.al. 2501.01652 link
2025-01-03 HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding Heqing Zou et.al. 2501.01645 null
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-03 Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation Rini Smita Thakur et.al. 2501.01640 null
2025-01-03 A non-ergodic framework for understanding emergent capabilities in Large Language Models Javier Marin et.al. 2501.01638 null
2025-01-03 Revisiting Data Analysis with Pre-trained Foundation Models Chen Liang et.al. 2501.01631 null
2025-01-03 ICPC: In-context Prompt Compression with Faster Inference Ziyang Yu et.al. 2501.01625 null
2025-01-03 PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents Jingoo Lee et.al. 2501.01594 null
2025-01-03 (WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges Mohamed Hisham Abdellatif et.al. 2501.01588 null
2025-01-02 Predicting the Performance of Black-box LLMs through Self-Queries Dylan Sam et.al. 2501.01558 link
2025-01-02 Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs) Jumana Almahmoud et.al. 2501.01545 null
2025-01-02 Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information Rasul Tutnov et.al. 2501.01544 null
2025-01-02 Denoising Diffused Embeddings: a Generative Approach for Hypergraphs Shihao Wu et.al. 2501.01541 null
2025-01-02 BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery Kanishk Gandhi et.al. 2501.01540 link
2025-01-02 SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers Bhavna Gopal et.al. 2501.01529 null
2025-01-02 Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search Shuangtao Li et.al. 2501.01478 null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 link
2025-01-02 Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Jingfeng Yao et.al. 2501.01423 link
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers Seunghyun Lee et.al. 2501.01414 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios Xize Cheng et.al. 2501.01384 null
2025-01-02 ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI Neda Tavakoli et.al. 2501.01372 link
2025-01-02 Aligning Large Language Models for Faithful Integrity Against Opposing Argument Yong Zhao et.al. 2501.01336 link
2025-01-02 CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models Johan Wahréus et.al. 2501.01335 link
2025-01-02 Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension Yanbo Fang et.al. 2501.01332 null
2025-01-02 The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation Shuzheng Gao et.al. 2501.01329 null
2025-01-03 Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking Xiaoxue Cheng et.al. 2501.01306 null
2025-01-02 Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments – The Depression and Anxiety Case Kaushik Roy et.al. 2501.01305 null
2025-01-02 Does a Large Language Model Really Speak in Human-Like Language? Mose Park et.al. 2501.01273 null
2025-01-02 ProgCo: Program Helps Self-Correction of Large Language Models Xiaoshuai Song et.al. 2501.01264 null
2025-01-03 CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Shanghaoran Quan et.al. 2501.01257 null
2025-01-02 Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers? Manuel Weber et.al. 2501.01256 null
2025-01-02 Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion Qiyuan He et.al. 2501.01246 null
2025-01-02 SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization Yongle Huang et.al. 2501.01245 link
2025-01-02 Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants Lixiong Qin et.al. 2501.01243 null
2025-01-02 Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction Alexander Brinkmann et.al. 2501.01237 link
2025-01-03 TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer Jiayu Li et.al. 2501.01216 null
2025-01-02 Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects Abdullah Mushtaq et.al. 2501.01205 null
2025-01-02 HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation Runsong Jia et.al. 2501.01203 null
2025-01-02 LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge Kyoungkook Kang et.al. 2501.01197 null
2025-01-02 Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education Annika Bush et.al. 2501.01192 null
2025-01-02 Towards Interactive Deepfake Analysis Lixiong Qin et.al. 2501.01164 link
2025-01-02 TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions Vriksha Srihari et.al. 2501.01156 null
2025-01-02 A3: Android Agent Arena for Mobile GUI Agents Yuxiang Chai et.al. 2501.01149 null
2025-01-03 BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference Wonsuk Jang et.al. 2501.01144 link
2025-01-02 Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method Ruichen Zhang et.al. 2501.01141 null
2025-01-02 Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning Shuo Yu et.al. 2501.01124 null
2025-01-02 MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification Jimin Park et.al. 2501.01110 link
2025-01-03 MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Haina Zhu et.al. 2501.01108 link
2025-01-02 Graph Generative Pre-trained Transformer Xiaohui Chen et.al. 2501.01073 null
2025-01-02 Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models Yanwen Huang et.al. 2501.01059 null
2025-01-02 Risks of Cultural Erasure in Large Language Models Rida Qadri et.al. 2501.01056 null
2025-01-02 Dynamic Scaling of Unit Tests for Code Reward Modeling Zeyao Ma et.al. 2501.01054 null
2025-01-02 Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs Linhao Huang et.al. 2501.01042 null
2025-01-02 Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Bin Wang et.al. 2501.01034 link
2025-01-02 ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning Wonduk Seo et.al. 2501.01031 null
2025-01-03 KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Xinshuo Hu et.al. 2501.01028 link
2025-01-02 MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model Chengze Zhang et.al. 2501.01014 null
2025-01-02 FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving Zihao Ye et.al. 2501.01005 link
2025-01-02 Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory Zhou Yang et.al. 2501.00999 null
2025-01-02 Optimizing Noise Schedules of Generative Models in High Dimensionss Santiago Aranguri et.al. 2501.00988 null
2025-01-02 Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice Federico Ravenda et.al. 2501.00982 link
2025-01-01 IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs Junfeng Jiao et.al. 2501.00959 null
2025-01-01 Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors Junfeng Jiao et.al. 2501.00957 null
2025-01-01 Incremental Dialogue Management: Survey, Discussion, and Implications for HRI Casey Kennington et.al. 2501.00953 null
2025-01-01 SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering Shihab Ahmed et.al. 2501.00940 null
2025-01-01 Diffusion Policies for Generative Modeling of Spacecraft Trajectories Julia Briden et.al. 2501.00915 null
2025-01-01 Aligning LLMs with Domain Invariant Reward Models David Wu et.al. 2501.00911 link
2025-01-01 Population Aware Diffusion for Time Series Generation Yang Li et.al. 2501.00910 link
2025-01-01 Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things Talha Zeeshan et.al. 2501.00906 null
2025-01-01 Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model Chenyang Liu et.al. 2501.00895 null
2025-01-01 Evaluating Time Series Foundation Models on Noisy Periodic Time Series Syamantak Datta Gupta et.al. 2501.00889 null
2025-01-01 Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization Weiqi Wu et.al. 2501.00888 link
2025-01-01 Representation in large language models Cameron C. Yetman et.al. 2501.00885 null
2025-01-01 Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents Fouad Bousetouane et.al. 2501.00881 null
2025-01-01 Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction Teng Hu et.al. 2501.00880 null
2025-01-01 TrustRAG: Enhancing Robustness and Trustworthiness in RAG Huichi Zhou et.al. 2501.00879 link
2025-01-01 LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models Hieu Man et.al. 2501.00874 link
2025-01-01 Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation Mingjia Li et.al. 2501.00873 link
2025-01-01 Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation Shoutao Guo et.al. 2501.00868 link
2025-01-01 Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era Mihnea C. Moldoveanu et.al. 2501.00867 null
2025-01-01 Alzheimer’s disease detection based on large language model prompt engineering Tian Zheng et.al. 2501.00861 null
2025-01-01 LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions Adam Ishay et.al. 2501.00830 null
2025-01-01 An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems Haoxiang Tian et.al. 2501.00829 null
2025-01-01 LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management Yichen Luo et.al. 2501.00826 null
2025-01-01 Multimodal Large Models Are Effective Action Anticipators Binglu Wang et.al. 2501.00795 link
2025-01-01 Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models Minhao Bai et.al. 2501.00786 null
2025-01-01 NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model Yuzhi Lai et.al. 2501.00785 null
2025-01-01 REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization Huyen Nguyen et.al. 2501.00779 null
2025-01-01 FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation Qianli Wang et.al. 2501.00777 null
2025-01-01 Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis Jie Gao et.al. 2501.00775 null
2025-01-01 An AI-powered Bayesian generative modeling approach for causal inference in observational studies Qiao Liu et.al. 2501.00755 null
2025-01-01 Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform Cheonsu Jeong et.al. 2501.00750 null
2025-01-01 DIVE: Diversified Iterative Self-Improvement Yiwei Qin et.al. 2501.00747 link
2025-01-01 Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines Xiyang Hu et.al. 2501.00745 null
2025-01-01 A Distributional Evaluation of Generative Image Models Edric Tam et.al. 2501.00744 null
2025-01-01 New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory Madiha Ajmal et.al. 2501.00721 null
2025-01-01 Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection Hao Wang et.al. 2501.00700 null
2025-01-01 Adjoint sharding for very long context training of state space models Xingzi Xu et.al. 2501.00692 null
2025-01-01 Labels Generated by Large Language Model Helps Measuring People’s Empathy in Vitro Md Rakibul Hasan et.al. 2501.00691 null
2025-01-01 IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently Florian Dietz et.al. 2501.00684 null
2024-12-31 Grade Inflation in Generative Models Phuc Nguyen et.al. 2501.00664 null
2024-12-31 Finding Missed Code Size Optimizations in Compilers using LLMs Davide Italiano et.al. 2501.00655 null
2024-12-31 Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models Suttisak Wizadwongsa et.al. 2501.00651 null
2024-12-31 Efficient Standardization of Clinical Notes using Large Language Models Daniel B. Hier et.al. 2501.00644 null
2024-12-31 Enabling New HDLs with Agents Mark Zakharov et.al. 2501.00642 null
2024-12-31 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2024-12-31 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2024-12-31 Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation M. Ali Bayram et.al. 2501.00593 null
2024-12-31 Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Zhenpeng Huang et.al. 2501.00584 null
2024-12-31 Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders Yipeng Kang et.al. 2501.00581 null
2024-12-31 AI and Quantum Computing in Binary Photocatalytic Hydrogen Production Dennis Delali Kwesi Wayo et.al. 2501.00575 null
2024-12-31 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Xinhao Li et.al. 2501.00574 link
2024-12-31 Probing Visual Language Priors in VLMs Tiange Luo et.al. 2501.00569 null
2024-12-31 Robust and Adaptive Optimization under a Large Language Model Lens Dimitris Bertsimas et.al. 2501.00568 null
2024-12-30 Distributed Mixture-of-Agents for Edge Inference with Large Language Models Purbesh Mitra et.al. 2412.21200 link
2024-12-31 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Zhaojian Yu et.al. 2412.21199 link
2024-12-30 The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick Jonathan Berkheim et.al. 2412.21186 null
2024-12-30 Facilitating large language model Russian adaptation with Learned Embedding Propagation Mikhail Tikhomirov et.al. 2412.21140 link
2024-12-30 ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation Ruixuan Liu et.al. 2412.21123 null
2025-01-02 Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation Yuanbo Yang et.al. 2412.21117 null
2024-12-30 Varformer: Adapting VAR’s Generative Prior for Image Restoration Siyang Wang et.al. 2412.21063 link
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense Yuyang Zhou et.al. 2412.21051 link
2024-12-30 E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models Zhiyu Tan et.al. 2412.21044 null
2024-12-30 Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration Wanglong Lu et.al. 2412.21042 link
2024-12-30 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Chia-Yu Hung et.al. 2412.21037 link
2024-12-30 GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models Shangyu Xing et.al. 2412.21036 null
2024-12-30 MapQaTor: A System for Efficient Annotation of Map Query Datasets Mahir Labib Dihan et.al. 2412.21015 link
2024-12-31 Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria Joonwon Jang et.al. 2412.21006 null
2024-12-30 Plug-and-Play Training Framework for Preference Optimization Jingyuan Ma et.al. 2412.20996 null
2024-12-30 KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation Siyuan Fang et.al. 2412.20995 null
2024-12-30 Efficiently Serving LLM Reasoning Programs with Certaindex Yichao Fu et.al. 2412.20993 null
2024-12-30 QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing Shlomo Kashani et.al. 2412.20956 null
2024-12-30 AGON: Automated Design Framework for Customizing Processors from ISA Documents Chongxiao Li et.al. 2412.20954 null
2024-12-30 Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema Xiaohan Feng et.al. 2412.20942 null
2024-12-30 Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering Junxiao Xue et.al. 2412.20927 null
2024-12-30 ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation Ting Zhang et.al. 2412.20901 null
2024-12-30 Towards Compatible Fine-tuning for Vision-Language Model Updates Zhengbo Wang et.al. 2412.20895 null
2024-12-30 DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models Xiaolin Hu et.al. 2412.20891 null
2024-12-30 Enhancing Annotated Bibliography Generation with LLM Ensembles Sergio Bermejo et.al. 2412.20864 null
2024-12-30 Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs’ Memory Xingjian Tao et.al. 2412.20846 null
2024-12-30 Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment Jianfei Zhang et.al. 2412.20834 link
2024-12-30 Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model Runtao Ren et.al. 2412.20820 null
2024-12-30 TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting Huanyu Zhang et.al. 2412.20810 null
2024-12-30 Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves Chayan Chatterjee et.al. 2412.20789 null
2024-12-31 SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity Pengfei Jing et.al. 2412.20787 null
2024-12-30 Large Language Model Enabled Multi-Task Physical Layer Network Tianyue Zheng et.al. 2412.20772 null
2024-12-30 Attributing Culture-Conditioned Generations to Pretraining Corpora Huihan Li et.al. 2412.20760 link
2024-12-30 M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs Bei Yan et.al. 2412.20718 link
2024-12-30 HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images Sungik Choi et.al. 2412.20704 null
2024-12-30 UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design Zijie Chen et.al. 2412.20694 null
2024-12-30 Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks Yuhe Ding et.al. 2412.20682 null
2024-12-30 Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA Qingyun Jin et.al. 2412.20677 null
2024-12-30 Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner Yitong Zhou et.al. 2412.20662 link
2024-12-30 Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis Yousef Yeganeh et.al. 2412.20651 null
2024-12-30 SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy Md Mahadi Hasan Nahid et.al. 2412.20641 null
2024-12-30 Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble Yongchang Li et.al. 2412.20637 null
2024-12-30 EVOLVE: Emotion and Visual Output Learning via LLM Evaluation Jordan Sinclair et.al. 2412.20632 null
2024-12-29 Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study Yulin Fei et.al. 2412.20613 link
2024-12-29 NLP-based Regulatory Compliance – Using GPT 4.0 to Decode Regulatory Documents Bimal Kumar et.al. 2412.20602 null
2024-12-29 MATEY: multiscale adaptive foundation models for spatiotemporal physical systems Pei Zhang et.al. 2412.20601 null
2024-12-29 Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection Dmitri Roussinov et.al. 2412.20595 link
2024-12-29 Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches Madhavendra Thakur et.al. 2412.20584 null
2024-12-29 Counterfactual Samples Constructing and Training for Commonsense Statements Estimation Chong Liu et.al. 2412.20563 null
2024-12-29 Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces Linglingzhi Zhu et.al. 2412.20556 null
2024-12-29 The Impact of Prompt Programming on Function-Level Code Generation Ranim Khojah et.al. 2412.20545 link
2024-12-29 Goal-Conditioned Data Augmentation for Offline Reinforcement Learning Xingshuai Huang et.al. 2412.20519 null
2024-12-29 Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning Hang Ni et.al. 2412.20505 null
2024-12-29 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link
2024-12-29 TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication Zongwu Wang et.al. 2412.20501 link
2024-12-29 Multimodal Variational Autoencoder: a Barycentric View Peijie Qiu et.al. 2412.20487 null
2024-12-29 JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling Haorui Ji et.al. 2412.20470 null
2024-12-29 Improving Vision-Language-Action Models via Chain-of-Affordance Jinming Li et.al. 2412.20451 null
2024-12-29 Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs Pratik Rakesh Singh et.al. 2412.20440 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-29 Unlocking adaptive digital pathology through dynamic feature learning Jiawen Li et.al. 2412.20430 null
2024-12-29 AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models Mansi et.al. 2412.20427 null
2024-12-29 Bringing Objects to Life: 4D generation from 3D objects Ohad Rahamim et.al. 2412.20422 null
2024-12-29 Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection Kalin Kopanov et.al. 2412.20414 null
2024-12-29 Multi-Objective Large Language Model Unlearning Zibin Pan et.al. 2412.20412 link
2024-12-29 Open-Sora: Democratizing Efficient Video Production for All Zangwei Zheng et.al. 2412.20404 link
2024-12-29 Natural Language Fine-Tuning Jia Liu et.al. 2412.20382 link
2024-12-29 Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs) Jia Wei Sii et.al. 2412.20381 null
2024-12-29 FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation Yan Luo et.al. 2412.20374 link
2024-12-29 LLM2: Let Large Language Models Harness System 2 Reasoning Cheng Yang et.al. 2412.20372 link
2025-01-02 Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey Junqiao Wang et.al. 2412.20367 null
2024-12-29 HindiLLM: Large Language Model for Hindi Sanjay Chouhan et.al. 2412.20357 null
2024-12-29 Distilling Desired Comments for Enhanced Code Review with Large Language Models Yongda Yu et.al. 2412.20340 null
2024-12-29 Mind the Data Gap: Bridging LLMs to Enterprise Data Integration Moe Kayali et.al. 2412.20331 null
2024-12-29 GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions Tianyao Shi et.al. 2412.20322 null
2024-12-29 Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain Shintaro Ozaki et.al. 2412.20309 null
2024-12-28 FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration Jia Liu et.al. 2412.20297 null
2024-12-28 Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games Guan-Horng Liu et.al. 2412.20279 null
2024-12-28 Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues Henry J. Xie et.al. 2412.20264 link
2024-12-28 Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception Athanasios Karagounis et.al. 2412.20230 null
2024-12-28 LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning Shuguang Chen et.al. 2412.20227 null
2024-12-28 Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation Yeonhong Park et.al. 2412.20185 null
2024-12-28 LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System Hyucksung Kwon et.al. 2412.20166 null
2024-12-28 StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN Andrzej Bedychaj et.al. 2412.20164 null
2024-12-28 Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems Minhye Jeon et.al. 2412.20163 null
2024-12-28 Multi-Modality Driven LoRA for Adverse Condition Depth Estimation Guanglei Yang et.al. 2412.20162 null
2024-12-28 Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses Xinru Wen et.al. 2412.20154 null
2024-12-28 Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering Wei Zhou et.al. 2412.20145 null
2024-12-28 TradingAgents: Multi-Agents LLM Financial Trading Framework Yijia Xiao et.al. 2412.20138 null
2024-12-28 M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation Zhaopeng Feng et.al. 2412.20127 link
2024-12-28 Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers Tuomas Hakoniemi et.al. 2412.20114 null
2024-12-28 ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming Jiedong Zhuang et.al. 2412.20105 null
2024-12-28 On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs Atmane Ayoub Mansour Bahar et.al. 2412.20087 null
2024-12-31 Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset Chongjian Yue et.al. 2412.20072 null
2024-12-28 On the Compositional Generalization of Multimodal LLMs for Medical Imaging Zhenyang Cai et.al. 2412.20070 link
2024-12-28 VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition Lan Chen et.al. 2412.20064 link
2024-12-28 MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion Zechao Zhan et.al. 2412.20062 null
2024-12-28 Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts Yanxin Shen et.al. 2412.20061 null
2024-12-28 “My life is miserable, have to sign 500 autographs everyday”: Exposing Humblebragging, the Brags in Disguise Sharath Naganna et.al. 2412.20057 null
2024-12-27 Enhancing Whisper’s Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization Kumud Tripathi et.al. 2412.19785 null
2024-12-27 Can AI Help with Your Personal Finances? Oudom Hean et.al. 2412.19784 null
2024-12-27 Tensor Network Estimation of Distribution Algorithms John Gardiner et.al. 2412.19780 null
2024-12-27 Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration Le Chen et.al. 2412.19770 link
2024-12-27 Generative Video Propagation Shaoteng Liu et.al. 2412.19761 null
2024-12-27 On dual-projectively equivalent connections associated to second order superintegrable systems Andreas Vollmer et.al. 2412.19739 null
2024-12-27 Can Large Language Models Adapt to Other Agents In-Context? Matthew Riemer et.al. 2412.19726 null
2024-12-27 From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Jiawei Lin et.al. 2412.19712 null
2024-12-27 Toward Adaptive Reasoning in Large Language Models with Thought Rollback Sijia Chen et.al. 2412.19707 link
2024-12-27 A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization Jingchun Lian et.al. 2412.19685 null
2024-12-27 Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework Jiang Liu et.al. 2412.19684 null
2024-12-27 CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs Siyu Wang et.al. 2412.19663 null
2024-12-27 Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis Jiaqi Wang et.al. 2412.19654 link
2024-12-27 FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios Kaiyi Pang et.al. 2412.19652 null
2024-12-27 Xmodel-2 Technical Report Wang Qun et.al. 2412.19638 null
2024-12-27 IMTP: Search-based Code Generation for In-memory Tensor Programs Yongwon Shin et.al. 2412.19630 null
2024-12-27 Signatures of prediction during natural listening in MEG data? Sahel Azizpour et.al. 2412.19622 null
2024-12-27 Gradient Weight-normalized Low-rank Projection for Efficient LLM Training Jia-Hong Huang et.al. 2412.19616 link
2024-12-27 SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms Shashank Rao Marpally et.al. 2412.19595 null
2024-12-27 Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following Yuxiao Yang et.al. 2412.19562 null
2024-12-27 Diverse Rare Sample Generation with Pretrained GANs Subeen Lee et.al. 2412.19543 link
2024-12-27 Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations Yuanfei Huang et.al. 2412.19520 null
2024-12-27 Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model Hyunwoo Cho et.al. 2412.19517 null
2024-12-27 Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs Zhe Yang et.al. 2412.19513 link
2024-12-27 Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging Hua Farn et.al. 2412.19512 null
2024-12-27 Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion Koustav Ghosal et.al. 2412.19510 null
2024-12-27 MBQ: Modality-Balanced Quantization for Large Vision-Language Models Shiyao Li et.al. 2412.19509 link
2024-12-27 DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-27 Casevo: A Cognitive Agents and Social Evolution Simulator Zexun Jiang et.al. 2412.19498 link
2024-12-27 Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation Chengyang Ye et.al. 2412.19492 link
2024-12-27 Focusing Image Generation to Mitigate Spurious Correlations Xuewei Li et.al. 2412.19457 null
2024-12-27 Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models Hyeonseok Moon et.al. 2412.19450 link
2024-12-27 Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models Shuo Wang et.al. 2412.19449 null
2024-12-27 A Survey on Large Language Model Acceleration based on KV Cache Management Haoyang Li et.al. 2412.19442 link
2024-12-27 Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback Seong Jin Lee et.al. 2412.19436 null
2024-12-27 Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints Alberto Maté et.al. 2412.19424 null
2024-12-27 Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning Chen Li et.al. 2412.19422 link
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412 link
2024-12-27 MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios Jiaqi Fan et.al. 2412.19406 link
2024-12-27 An Engorgio Prompt Makes Large Language Model Babble on Jianshuo Dong et.al. 2412.19394 link
2024-12-26 Large Language Models for Market Research: A Data-augmentation Approach Mengxin Wang et.al. 2412.19363 null
2024-12-26 Dynamic Skill Adaptation for Large Language Models Jiaao Chen et.al. 2412.19361 null
2024-12-26 Identifying Split Vacancies with Foundation Models and Electrostatics Seán R. Kavanagh et.al. 2412.19330 null
2024-12-26 Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Ziang Yan et.al. 2412.19326 link
2024-12-26 Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami et.al. 2412.19325 null
2024-12-26 From Interets to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries Hugh Van Deventer et.al. 2412.19312 link
2024-12-26 Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso et.al. 2412.19304 null
2024-12-26 RecLM: Recommendation Instruction Tuning Yangqin Jiang et.al. 2412.19302 link
2024-12-26 RAG with Differential Privacy Nicolas Grislain et.al. 2412.19291 link
2024-12-26 Time Series Foundational Models: Their Role in Anomaly Detection and Prediction Chathurangi Shyalika et.al. 2412.19286 link
2024-12-26 PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing Michael Bezick et.al. 2412.19284 null
2024-12-26 MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes Asma Ben Abacha et.al. 2412.19260 link
2024-12-26 VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis Jaemin Jung et.al. 2412.19259 null
2024-12-26 Sentiment trading with large language models Kemal Kirtac et.al. 2412.19245 null
2024-12-26 SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model Xuyang Li et.al. 2412.19237 null
2024-12-26 Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining Yuxin You et.al. 2412.19211 null
2024-12-26 Multi-Attribute Constraint Satisfaction via Language Model Rewriting Ashutosh Baheti et.al. 2412.19198 null
2024-12-26 Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models Haonan He et.al. 2412.19191 null
2024-12-26 Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue Shuzhi Xu et.al. 2412.19154 null
2024-12-26 AskChart: Universal Chart Understanding through Textual Enhancement Xudong Yang et.al. 2412.19146 link
2024-12-26 SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis Senbin Zhu et.al. 2412.19140 link
2024-12-26 PlanLLM: Video Procedure Planning with Refinable Large Language Models Dejie Yang et.al. 2412.19139 link
2024-12-26 Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing Inpyo Hong et.al. 2412.19125 link
2024-12-26 Discrete vs. Continuous Trade-offs for Generative Models Jathin Korrapati et.al. 2412.19114 null
2024-12-26 SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values Yunfan Zhang et.al. 2412.19113 null
2024-12-26 Stochastic normalizing flows for Effective String Theory Michele Caselle et.al. 2412.19109 null
2024-12-26 “I’ve Heard of You!”: Generate Spoken Named Entity Recognition Data for Unseen Entities Jiawei Yu et.al. 2412.19102 null
2024-12-26 Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security Vasileios Alevizos et.al. 2412.19088 null
2024-12-26 Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation Haotian Qian et.al. 2412.19080 null
2024-12-26 CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers Jingyi Zheng et.al. 2412.19037 link
2024-12-26 Repository Structure-Aware Training Makes SLMs Better Issue Resolver Zexiong Ma et.al. 2412.19031 null
2024-12-26 Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation Yixin Chen et.al. 2412.19026 link
2024-12-26 Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication Xiqiang Qu et.al. 2412.19025 null
2024-12-26 Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation Tao Liu et.al. 2412.19021 null
2024-12-26 Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability Ruixi Lin et.al. 2412.19018 null
2024-12-25 How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study Alejandro Velasco et.al. 2412.18989 null
2024-12-25 ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement Zhefan Rao et.al. 2412.18966 null
2024-12-25 Musings About the Future of Search: A Return to the Past? Jimmy Lin et.al. 2412.18956 null
2024-12-25 A Power-Efficient Hardware Implementation of L-Mul Ruiqi Chen et.al. 2412.18948 null
2024-12-25 MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models Kaiwen Zuo et.al. 2412.18947 null
2024-12-25 Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations Yewon Kim et.al. 2412.18940 null
2024-12-25 Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference Libo Zhang et.al. 2412.18934 null
2024-12-25 UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation Lunhao Duan et.al. 2412.18928 null
2024-12-25 Exemplar-condensed Federated Class-incremental Learning Rui Sun et.al. 2412.18926 null
2024-12-25 Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model Yi-Chia Chen et.al. 2412.18917 link
2024-12-25 AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures Situo Zhang et.al. 2412.18910 null
2024-12-25 CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models Ping Guo et.al. 2412.18890 link
2024-12-25 MotionMap: Representing Multimodality in Human Pose Forecasting Reyhaneh Hosseininejad et.al. 2412.18883 null
2024-12-25 Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models Meltem Aksoy et.al. 2412.18863 null
2024-12-25 Improving the Readability of Automatically Generated Tests using Large Language Models Matteo Biagiola et.al. 2412.18843 null
2024-12-25 LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements Hao Zhang et.al. 2412.18835 null
2024-12-25 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Shujie Hu et.al. 2412.18832 null
2024-12-25 RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting Yilei Jiang et.al. 2412.18826 null
2024-12-25 CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection Wenbin Li et.al. 2412.18820 link
2024-12-25 LLM-assisted vector similarity search Md Riyadh et.al. 2412.18819 null
2024-12-25 DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search Lei Yang et.al. 2412.18811 link
2024-12-25 Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation Xinkai Du et.al. 2412.18800 null
2024-12-25 Torque-Aware Momentum Pranshu Malviya et.al. 2412.18790 null
2024-12-25 Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models Yu-An Liu et.al. 2412.18770 link
2024-12-25 The Impact of Input Order Bias on Large Language Models for Software Fault Localization Md Nakhla Rafi et.al. 2412.18750 null
2024-12-24 Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Zehan Wang et.al. 2412.18605 link
2024-12-24 Long-Form Speech Generation with Spoken Language Models Se Jin Park et.al. 2412.18603 link
2024-12-24 Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems Fernando Jia et.al. 2412.18601 link
2024-12-24 ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation Hongjie Li et.al. 2412.18600 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-24 A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs OpenMind et.al. 2412.18588 null
2024-12-24 Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control Sergey Sedov et.al. 2412.18582 null
2024-12-24 Zero-resource Speech Translation and Recognition with LLMs Karel Mundnich et.al. 2412.18566 null
2024-12-24 Distilling Fine-grained Sentiment Understanding from Large Language Models Yice Zhang et.al. 2412.18552 link
2024-12-24 Token-Budget-Aware LLM Reasoning Tingxu Han et.al. 2412.18547 link
2024-12-24 PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction Xingjian Xu et.al. 2412.18541 null
2024-12-24 Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation Derong Xu Xinhang Li et.al. 2412.18537 link
2024-12-24 Automated Code Review In Practice Umut Cihan et.al. 2412.18531 null
2024-12-24 Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving Hao Pang et.al. 2412.18511 null
2024-12-24 Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization Yi-Fu Fu et.al. 2412.18497 null
2024-12-24 GeFL: Model-Agnostic Federated Learning with Generative Models Honggu Kang et.al. 2412.18460 null
2024-12-24 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Tatiana Zemskova et.al. 2412.18450 link
2024-12-24 Is Large Language Model Good at Triple Set Prediction? An Empirical Study Yuan Yuan et.al. 2412.18443 null
2024-12-24 Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm O. Deniz Akyildiz et.al. 2412.18432 null
2024-12-24 GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent Kangjia Zhao et.al. 2412.18426 null
2024-12-24 Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models Zihan Zhou et.al. 2412.18419 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English Avinash Anand et.al. 2412.18415 link
2024-12-24 Discovery of 2D Materials via Symmetry-Constrained Diffusion Model Shihang Xu et.al. 2412.18414 null
2024-12-24 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli et.al. 2412.18407 link
2024-12-24 Extract Free Dense Misalignment from CLIP JeongYeon Nam et.al. 2412.18404 link
2024-12-24 RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction Wu Xiaoping et.al. 2412.18390 null
2024-12-24 MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs Qiuyi Gu et.al. 2412.18381 link
2024-12-24 Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents Kaiwen Ning et.al. 2412.18371 link
2024-12-24 Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.18351 null
2024-12-24 M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models Jiaxin Guo et.al. 2412.18299 null
2024-12-24 Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight Xi Ding et.al. 2412.18298 link
2024-12-24 Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases Christian Di Maio et.al. 2412.18295 null
2024-12-24 DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation Junyi Lu et.al. 2412.18291 null
2024-12-24 Improved Feature Generating Framework for Transductive Zero-shot Learning Zihan Ye et.al. 2412.18282 null
2024-12-24 GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications Zhenzhou Jin et.al. 2412.18281 null
2024-12-24 Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Jiacai Liu et.al. 2412.18279 null
2024-12-24 GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge Shammur Absar Chowdhury et.al. 2412.18274 null
2024-12-24 Annotating References to Mythological Entities in French Literature Thierry Poibeau et.al. 2412.18270 null
2024-12-24 Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study Xuefeng Jiang et.al. 2412.18260 link
2024-12-24 AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction Pufan Zou et.al. 2412.18255 null
2024-12-24 An Automatic Graph Construction Framework based on Large Language Models for Recommendation Rong Shan et.al. 2412.18241 link
2024-12-24 Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection Jango Zhang et.al. 2412.18225 null
2024-12-24 Expand VSR Benchmark for VLLM to Expertize in Spatial Rules Peijin Xie et.al. 2412.18224 link
2024-12-24 ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation Mengyang Wu et.al. 2412.18216 link
2024-12-24 Adapting Large Language Models for Improving TCP Fairness over WiFi Shyam Kumar Shrestha et.al. 2412.18200 null
2024-12-24 Robustness-aware Automatic Prompt Optimization Zeru Shi et.al. 2412.18196 link
2024-12-24 VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Shiduo Zhang et.al. 2412.18194 null
2024-12-24 TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization Yucong Luo et.al. 2412.18185 null
2024-12-24 Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation Yucong Luo et.al. 2412.18176 null
2024-12-24 INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Haohang Li et.al. 2412.18174 null
2024-12-24 Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models Xiaomeng Hu et.al. 2412.18171 null
2024-12-24 KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management Rongxin Cheng et.al. 2412.18169 null
2024-12-24 Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence Yinbin Han et.al. 2412.18164 null
2024-12-24 VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities Shray Mathur et.al. 2412.18161 null
2024-12-24 Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task Jinming Liu et.al. 2412.18158 null
2024-12-24 Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance Yaoyun Zhang et.al. 2412.18157 null
2024-12-24 scReader: Prompting Large Language Models to Interpret scRNA-seq Data Cong Li et.al. 2412.18156 null
2024-12-24 GeneSUM: Large Language Model-based Gene Summary Extraction Zhijian Chen et.al. 2412.18154 null
2024-12-24 CoAM: Corpus of All-Type Multiword Expressions Yusuke Ide et.al. 2412.18151 null
2024-12-24 EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation Shuhao Han et.al. 2412.18150 link
2024-12-24 Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction Xiao Guo et.al. 2412.18149 null
2024-12-24 Ensuring Consistency for In-Image Translation Chengpeng Fu et.al. 2412.18139 null
2024-12-24 LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment Binrui Zeng et.al. 2412.18135 null
2024-12-24 VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection Zhaohui Jin et.al. 2412.18124 null
2024-12-24 AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation Hao Wen et.al. 2412.18116 null
2024-12-24 AIGT: AI Generative Table Based on Prompt Mingming Zhang et.al. 2412.18111 null
2024-12-24 SlimGPT: Layer-wise Structured Pruning for Large Language Models Gui Ling et.al. 2412.18110 null
2024-12-24 Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach Jing Bi et.al. 2412.18108 null
2024-12-24 Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels Mingcong Song et.al. 2412.18106 null
2024-12-24 EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent Suyuan Wang et.al. 2412.18100 null
2024-12-24 Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) – a Large Language Model Chatbot for Perioperative Medicine Yu He Ke et.al. 2412.18096 null
2024-12-24 Molly: Making Large Language Model Agents Solve Python Problem More Logically Rui Xiao et.al. 2412.18093 null
2024-12-24 Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner Aizierjiang Aiersilan et.al. 2412.18086 link
2024-12-24 Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models Xuan Lin et.al. 2412.18084 link
2024-12-24 Improving Factuality with Explicit Working Memory Mingda Chen et.al. 2412.18069 null
2024-12-24 LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR Osama Hosam Abdellaif et.al. 2412.18063 link
2024-12-24 Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction Hyunbae Jeon et.al. 2412.18061 null
2024-12-24 An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM Wen Wen et.al. 2412.18060 null
2024-12-23 Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations Maya Patel et.al. 2412.18051 null
2024-12-23 AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data Mirko Zaffaroni et.al. 2412.18038 link
2024-12-23 Generating refactored code accurately using reinforcement learning Indranil Palit et.al. 2412.18035 null
2024-12-23 More than Chit-Chat: Developing Robots for Small-Talk Interactions Rebecca Ramnauth et.al. 2412.18023 null
2024-12-23 Trustworthy and Efficient LLMs Meet Databases Kyoungmin Kim et.al. 2412.18022 null
2024-12-23 StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs Hailin Chen et.al. 2412.18011 null
2024-12-23 CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models Ruibo Tu et.al. 2412.17970 link
2024-12-23 LMV-RPA: Large Model Voting-based Robotic Process Automation Osama Abdellatif et.al. 2412.17965 link
2024-12-23 Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models Antony Seabra et.al. 2412.17964 null
2024-12-23 Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models Ge Zhang et.al. 2412.17963 null
2024-12-23 Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents Antony Seabra et.al. 2412.17942 null
2024-12-23 BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism Martin Fajcik et.al. 2412.17933 null
2024-12-23 Causal Composition Diffusion Model for Closed-loop Traffic Generation Haohong Lin et.al. 2412.17920 null
2024-12-23 Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning Orson Mengara et.al. 2412.17908 null
2024-12-23 LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses Sara Riazi et.al. 2412.17892 null
2024-12-23 ChatGarment: Garment Estimation, Generation and Editing via Large Language Models Siyuan Bian et.al. 2412.17811 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 null
2024-12-23 Automating the Search for Artificial Life with Foundation Models Akarsh Kumar et.al. 2412.17799 link
2024-12-23 ResearchTown: Simulator of Human Research Community Haofei Yu et.al. 2412.17767 link
2024-12-23 ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback Wei Zhang et.al. 2412.17754 null
2024-12-23 Deliberation in Latent Space via Differentiable Cache Augmentation Luyang Liu et.al. 2412.17747 null
2024-12-23 YuLan-Mini: An Open Data-efficient Language Model Yiwen Hu et.al. 2412.17743 link
2024-12-23 **Reasoning to Attend: Try to Understand How Token Works** Rui Qian et.al. 2412.17741 link
2024-12-23 Knowledge Editing through Chain-of-Thought Changyue Wang et.al. 2412.17727 link
2024-12-23 Understanding the Logic of Direct Preference Alignment through Logic Kyle Richardson et.al. 2412.17696 null
2024-12-23 Large Language Model Safety: A Holistic Survey Dan Shi et.al. 2412.17686 link
2024-12-23 A Bias-Free Training Paradigm for More General AI-generated Image Detection Fabrizio Guillaro et.al. 2412.17671 null
2024-12-23 Generating Completions for Fragmented Broca’s Aphasic Sentences Using Large Language Models Sijbren van Vaals et.al. 2412.17669 link
2024-12-23 Detecting anxiety and depression in dialogues: a multi-label and explainable approach Francisco de Arriba-Pérez et.al. 2412.17651 null
2024-12-23 SCBench: A Sports Commentary Benchmark for Video LLMs Kuangzhi Ge et.al. 2412.17637 null
2024-12-23 ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance Renyang Liu et.al. 2412.17632 link
2024-12-23 Tracking the Feature Dynamics in LLM Training: A Mechanistic Study Yang Xu et.al. 2412.17626 null
2024-12-23 Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models Parham Rezaei et.al. 2412.17622 link
2024-12-23 Emerging Security Challenges of Large Language Models Herve Debar et.al. 2412.17614 null
2024-12-23 Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs Fabrizio Frasca et.al. 2412.17609 null
2024-12-23 EasyTime: Time Series Forecasting Made Easy Xiangfei Qiu et.al. 2412.17603 null
2024-12-23 LiveIdeaBench: Evaluating LLMs’ Scientific Creativity and Idea Generation with Minimal Context Kai Ruan et.al. 2412.17596 link
2024-12-23 Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation Chengbing Wang et.al. 2412.17593 null
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-23 S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field Zixi Liang et.al. 2412.17561 link
2024-12-23 GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Chao Zeng et.al. 2412.17560 null
2024-12-23 A Survey of Query Optimization in Large Language Models Mingyang Song et.al. 2412.17558 null
2024-12-23 Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing Prakash Aryan et.al. 2412.17548 link
2024-12-23 Retention Score: Quantifying Jailbreak Risks for Vision Language Models Zaitang Li et.al. 2412.17544 null
2024-12-23 Constructing Fair Latent Space for Intersection of Fairness and Explainability Hyungjun Joo et.al. 2412.17523 null
2024-12-23 DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak Hao Wang et.al. 2412.17522 null
2024-12-23 Improving the Noise Estimation of Latent Neural Stochastic Differential Equations Linus Heck et.al. 2412.17499 null
2024-12-23 Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings Jérémie Sublime et.al. 2412.17486 null
2024-12-23 Power- and Fragmentation-aware Online Scheduling for GPU Datacenters Francesco Lettich et.al. 2412.17484 link
2024-12-23 A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Chenlong Deng et.al. 2412.17483 null
2024-12-23 A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers Shuaihang Chen et.al. 2412.17481 link
2024-12-23 CALLIC: Content Adaptive Learning for Lossless Image Compression Daxin Li et.al. 2412.17464 null
2024-12-23 Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning Xiaodan Chen et.al. 2412.17456 null
2024-12-23 Applying LLM and Topic Modelling in Psychotherapeutic Contexts Alexander Vanin et.al. 2412.17449 null
2024-12-23 Measuring Contextual Informativeness in Child-Directed Text Maria Valentini et.al. 2412.17427 link
2024-12-23 Multimodal Preference Data Synthetic Alignment with Reward Model Robert Wijaya et.al. 2412.17417 link
2024-12-23 VidCtx: Context-aware Video Question Answering with Image Models Andreas Goulas et.al. 2412.17415 null
2024-12-23 Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance Muhammad Reza Qorib et.al. 2412.17408 link
2024-12-23 Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning Huchen Jiang et.al. 2412.17397 null
2024-12-23 WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models Huawen Feng et.al. 2412.17395 null
2024-12-23 Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement Hyeonjin Kim et.al. 2412.17387 link
2024-12-23 Interweaving Memories of a Siamese Large Language Model Xin Song et.al. 2412.17383 link
2024-12-23 MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models Beibei Yu et.al. 2412.17339 null
2024-12-23 A Dual-Perspective Metaphor Detection Framework Using Large Language Models Yujie Lin et.al. 2412.17332 link
2024-12-23 Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance Nicolas Devatine et.al. 2412.17321 null
2024-12-23 CodeV: Issue Resolving with Visual Data Linhao Zhang et.al. 2412.17315 link
2024-12-23 Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories Mahan Tafreshipour et.al. 2412.17298 null
2024-12-23 Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples Taewoong Kim et.al. 2412.17288 link
2024-12-23 LLM4AD: A Platform for Algorithm Design with Large Language Model Fei Liu et.al. 2412.17287 link
2024-12-23 Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning Rui Liang et.al. 2412.17285 null
2024-12-23 Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach Rafid Ishrak Jahan et.al. 2412.17255 link
2024-12-23 SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval Xiaopeng Li et.al. 2412.17250 null
2024-12-23 EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling Zichen Song et.al. 2412.17249 null
2024-12-23 On the Generalization Ability of Machine-Generated Text Detectors Yule Liu et.al. 2412.17242 link
2024-12-23 Brain-to-Text Benchmark ‘24: Lessons Learned Francis R. Willett et.al. 2412.17227 link
2024-12-23 CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder Lichen Ma et.al. 2412.17225 null
2024-12-22 Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension Jio Oh et.al. 2412.17189 null
2024-12-22 Foundation Model for Lossy Compression of Spatiotemporal Scientific Data Xiao Li et.al. 2412.17184 null
2024-12-22 Enhancing Item Tokenization for Generative Recommendation through Self-Improvement Runjin Chen et.al. 2412.17171 null
2024-12-22 Generative Diffusion Modeling: A Practical Handbook Zihan Ding et.al. 2412.17162 null
2024-12-22 LLM-based relevance assessment still can’t replace human relevance assessment Charles L. A. Clarke et.al. 2412.17156 null
2024-12-22 LLM Agent for Fire Dynamics Simulations Leidong Xu et.al. 2412.17146 null
2024-12-22 Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs Rushendra Sidibomma et.al. 2412.17131 link
2024-12-22 Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models Cameron R. Jones et.al. 2412.17128 null
2024-12-22 Learning to Adapt to Low-Resource Paraphrase Generation Zhigen Li et.al. 2412.17111 null
2024-12-22 DreamOmni: Unified Image Generation and Editing Bin Xia et.al. 2412.17098 null
2024-12-22 Analysis on LLMs Performance for Code Summarization Md. Ahnaf Akib et.al. 2412.17094 null
2024-12-22 SAIL: Sample-Centric In-Context Learning for Document Information Extraction Jinyu Zhang et.al. 2412.17092 link
2024-12-22 SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults Jinzhi Wang et.al. 2412.17077 null
2024-12-22 The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM’s Internal States Fabian Ridder et.al. 2412.17056 link
2024-12-22 DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately Huiwen Wu et.al. 2412.17053 null
2024-12-22 ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues Shaina Raza et.al. 2412.17052 link
2024-12-22 Modular Conversational Agents for Surveys and Interviews Jiangbo Yu et.al. 2412.17049 null
2024-12-22 Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective Hankun Wang et.al. 2412.17048 null
2024-12-22 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-22 HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories Eric Hedlin et.al. 2412.17040 null
2024-12-22 Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model Kevin Shen et.al. 2412.17039 null
2024-12-22 Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models Lang Gao et.al. 2412.17034 null
2024-12-22 MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge Jie He et.al. 2412.17032 link
2024-12-22 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos Zhengqian Wu et.al. 2412.17022 link
2024-12-22 GAS: Generative Auto-bidding with Post-training Search Yewen Li et.al. 2412.17018 null
2024-12-22 Robustness of Large Language Models Against Adversarial Attacks Yiyi Tao et.al. 2412.17011 null
2024-12-22 InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions Ronghui Li et.al. 2412.16982 null
2024-12-22 On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora Tzu-Chieh Chen et.al. 2412.16976 null
2024-12-22 Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs Alexander von Recum et.al. 2412.16974 null
2024-12-22 Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach Chunxu Zhang et.al. 2412.16969 link
2024-12-22 System-2 Mathematical Reasoning via Enriched Instruction Tuning Huanqia Cai et.al. 2412.16964 null
2024-12-22 Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework Jundong Xu et.al. 2412.16953 null
2024-12-22 A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation Ekai Hashimoto et.al. 2412.16943 null
2024-12-22 Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.16936 null
2024-12-22 Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models Kai Zheng et.al. 2412.16933 null
2024-12-22 Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs Bohan Jin et.al. 2412.16922 null
2024-12-22 Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection Yuhang Gan et.al. 2412.16918 null
2024-12-22 Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation Quan Dao et.al. 2412.16906 null
2024-12-22 Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model Songjun Tu et.al. 2412.16878 link
2024-12-20 HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding Chenxin Tao et.al. 2412.16158 null
2024-12-20 Can Generative Video Models Help Pose Estimation? Ruojin Cai et.al. 2412.16155 null
2024-12-20 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 link
2024-12-20 Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation Seyedreza Mohseni et.al. 2412.16135 null
2024-12-20 Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information Dirk Bergemann et.al. 2412.16132 null
2024-12-20 PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics Daniil Larionov et.al. 2412.16120 null
2024-12-20 Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts Muhammad Abdullah Sohail et.al. 2412.16119 link
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-20 The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse Mahyar Habibi et.al. 2412.16114 null
2024-12-20 Logical Consistency of Large Language Models in Fact-checking Bishwamittra Ghosh et.al. 2412.16100 null
2024-12-20 The Evolution of LLM Adoption in Industry Data Curation Practices Crystal Qian et.al. 2412.16089 null
2024-12-20 Efficient MedSAMs: Segment Anything in Medical Images on Laptop Jun Ma et.al. 2412.16085 link
2024-12-20 Formal Mathematical Reasoning: A New Frontier in AI Kaiyu Yang et.al. 2412.16075 null
2024-12-20 The Only Way is Ethics: A Guide to Ethical Research with Large Language Models Eddie L. Ungless et.al. 2412.16022 link
2024-12-20 Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support Qijiong Liu et.al. 2412.15973 link
2024-12-20 From General to Specific: Tailoring Large Language Models for Personalized Healthcare Ruize Shi et.al. 2412.15957 null
2024-12-20 Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring Markus Borg et.al. 2412.15948 null
2024-12-20 Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation Gautier Evennou et.al. 2412.15939 link
2024-12-20 Large Language Model assisted Hybrid Fuzzing Ruijie Meng et.al. 2412.15931 null
2024-12-20 MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection Andrea Moglia et.al. 2412.15925 link
2024-12-20 RiTTA: Modeling Event Relations in Text-to-Audio Generation Yuhang He et.al. 2412.15922 link
2024-12-20 Less is More: Towards Green Code Large Language Models via Unified Structural Pruning Guang Yang et.al. 2412.15921 null
2024-12-20 Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model Yosuke Yamagishi et.al. 2412.15907 null
2024-12-20 Evaluation of Reliability Criteria for News Publishers with Large Language Models Manuel Pratelli et.al. 2412.15896 null
2024-12-20 TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain Camille Barboule et.al. 2412.15891 null
2024-12-20 AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI Katja Bühler et.al. 2412.15876 null
2024-12-20 Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback Jiaming Ji et.al. 2412.15838 link
2024-12-20 WebLLM: A High-Performance In-Browser LLM Inference Engine Charlie F. Ruan et.al. 2412.15803 link
2024-12-20 Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Sungjin Park et.al. 2412.15797 null
2024-12-20 GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning Heming Zhang et.al. 2412.15790 link
2024-12-20 Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech Jonathan Heitz et.al. 2412.15772 link
2024-12-20 Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference Jorge García-Carrasco et.al. 2412.15750 link
2024-12-20 Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models Shamus Sim et.al. 2412.15748 null
2024-12-20 VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models Dexter Neo et.al. 2412.15739 null
2024-12-20 AutoLife: Automatic Life Journaling with Smartphones and LLMs Huatao Xu et.al. 2412.15714 null
2024-12-20 Contrastive Learning for Task-Independent SpeechLLM-Pretraining Maike Züfle et.al. 2412.15712 link
2024-12-20 Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback Niklas Ippisch et.al. 2412.15702 null
2024-12-20 Code Review Automation Via Multi-task Federated LLM – An Empirical Study Jahnavi Kumar et.al. 2412.15676 null
2024-12-20 Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline Guancheng Zeng et.al. 2412.15660 null
2024-12-20 Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class Annie D’souza et.al. 2412.15657 link
2024-12-20 MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Sieun Hyeon et.al. 2412.15655 link
2024-12-20 Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution Wentao Tan et.al. 2412.15650 link
2024-12-20 Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model Xin Du et.al. 2412.15634 link
2024-12-20 Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning? Mengyu Ye et.al. 2412.15628 null
2024-12-20 JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs Hongyi Li et.al. 2412.15623 null
2024-12-20 Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Zhi Gao et.al. 2412.15606 null
2024-12-20 Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan et.al. 2412.15605 link
2024-12-20 Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification Gyutae Park et.al. 2412.15603 null
2024-12-20 Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation Xiaoqiang Kang et.al. 2412.15594 link
2024-12-20 NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization Danial Kamali et.al. 2412.15588 link
2024-12-20 To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models Jessica Y. Bo et.al. 2412.15584 null
2024-12-20 A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation Ryien Hosseini et.al. 2412.15582 link
2024-12-20 Score-based Generative Diffusion Models for Social Recommendations Chengyi Liu et.al. 2412.15579 link
2024-12-20 QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning Xinyang Tong et.al. 2412.15576 null
2024-12-20 J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM Takero Yoshida et.al. 2412.15574 null
2024-12-20 Continual Learning Using a Kernel-Based Method Over Foundation Models Saleh Momeni et.al. 2412.15571 link
2024-12-20 DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation Yichun Tai et.al. 2412.15570 link
2024-12-20 In-context Continual Learning Assisted by an External Continual Learner Saleh Momeni et.al. 2412.15563 null
2024-12-20 NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning Zheyuan Zhang et.al. 2412.15547 null
2024-12-20 MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering Zhang Siyue et.al. 2412.15540 null
2024-12-20 XRAG: eXamining the Core – Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation Qianren Mao et.al. 2412.15529 link
2024-12-20 HREF: Human Response-Guided Evaluation of Instruction Following in Language Models Xinxi Lyu et.al. 2412.15524 link
2024-12-20 PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time Alireza Pourali et.al. 2412.15519 link
2024-12-20 Stylish and Functional: Guided Interpolation Subject to Physical Constraints Yan-Ying Chen et.al. 2412.15507 null
2024-12-20 Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework Zhenjie Xu et.al. 2412.15504 link
2024-12-20 Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models Zhisheng Tang et.al. 2412.15501 null
2024-12-20 TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use Junjie Ye et.al. 2412.15495 link
2024-12-20 PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search Jiaxin Wu et.al. 2412.15494 null
2024-12-20 GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators Hengjia Li et.al. 2412.15491 null
2024-12-20 Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Saehyung Lee et.al. 2412.15484 null
2024-12-20 Continual Learning Using Only Large Language Model Prompting Jiabao Qiu et.al. 2412.15479 null
2024-12-19 TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models Ammar N. Abbas et.al. 2412.15462 null
2024-12-19 Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization Sahil Wadhwa et.al. 2412.15453 null
2024-12-19 AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals Angela Mastrianni et.al. 2412.15444 null
2024-12-19 SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval Aakash Mahalingam et.al. 2412.15443 null
2024-12-19 Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models Tianchen Zhang et.al. 2412.15431 null
2024-12-19 MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale Swapnil Gandhi et.al. 2412.15411 null
2024-12-19 Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification Edoardo Allegrini et.al. 2412.15410 null
2024-12-19 Systematic Evaluation of Long-Context LLMs on Financial Concepts Lavanya Gupta et.al. 2412.15386 null
2024-12-19 Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation Joanne Boisson et.al. 2412.15375 link
2024-12-19 Automated Root Cause Analysis System for Complex Data Products Mathieu Demarne et.al. 2412.15374 null
2024-12-19 Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs Liam Seymour et.al. 2412.15352 link
2024-12-19 Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models Reza Shirkavand et.al. 2412.15341 link
2024-12-19 Complete background cosmology of parity-even quadratic metric-affine gravity Thomas Dyer et.al. 2412.15329 null
2024-12-19 OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving Shuo Xing et.al. 2412.15208 link
2024-12-19 MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark Qihao Zhao et.al. 2412.15194 link
2024-12-19 LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation Weijia Shi et.al. 2412.15188 null
2024-12-19 Tiled Diffusion Or Madar et.al. 2412.15185 null
2024-12-19 Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning Simon Frieder et.al. 2412.15184 null
2024-12-19 STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning Marius Memmel et.al. 2412.15182 null
2024-12-19 HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages Aman Chaturvedi et.al. 2412.15178 null
2024-12-19 Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying Federico Castagna et.al. 2412.15177 link
2024-12-19 Rethinking Uncertainty Estimation in Natural Language Generation Lukas Aichberger et.al. 2412.15176 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Language Models as Continuous Self-Evolving Data Engineers Peidong Wang et.al. 2412.15151 null
2024-12-19 Jet: A Modern Transformer-Based Normalizing Flow Alexander Kolesnikov et.al. 2412.15129 null
2024-12-19 Adaptive Pruning for Large Language Models with Structural Importance Awareness Haotian Zheng et.al. 2412.15127 null
2024-12-19 Outcome-Refining Process Supervision for Code Generation Zhuohao Yu et.al. 2412.15118 link
2024-12-19 Qwen2.5 Technical Report Qwen et.al. 2412.15115 link
2024-12-19 Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture Thomas F Burns et.al. 2412.15113 link
2024-12-19 Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation Yang Tian et.al. 2412.15109 link
2024-12-19 Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability Xiangsen Chen et.al. 2412.15101 null
2024-12-19 Nano-ESG: Extracting Corporate Sustainability Information from News Articles Fabian Billert et.al. 2412.15093 link
2024-12-19 Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation Haoran Liu et.al. 2412.15086 null
2024-12-19 ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots Bhupendra Acharya et.al. 2412.15072 null
2024-12-19 ConfliBERT: A Language Model for Political Conflict Patrick T. Brandt et.al. 2412.15060 link
2024-12-19 LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Felix Friedrich et.al. 2412.15035 null
2024-12-19 DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space Mang Ning et.al. 2412.15032 link
2024-12-19 Large Language Models and Code Security: A Systematic Literature Review Enna Basic et.al. 2412.15004 null
2024-12-19 HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs Pham Vu Tuan Dat et.al. 2412.14995 link
2024-12-19 RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning Raphael Memmesheimer et.al. 2412.14989 null
2024-12-19 Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts Ioana Buhnila et.al. 2412.14986 null
2024-12-19 AI and Cultural Context: An Empirical Investigation of Large Language Models’ Performance on Chinese Social Work Professional Standards Zia Qi et.al. 2412.14971 null
2024-12-19 Movie2Story: A framework for understanding videos and telling stories in the form of novel text Kangning Li et.al. 2412.14965 null
2024-12-19 Knowledge Injection via Prompt Distillation Kalle Kujanpää et.al. 2412.14964 null
2024-12-19 Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities Daniil Medyakov et.al. 2412.14935 null
2024-12-19 RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Junyu Luo et.al. 2412.14922 link
2024-12-19 Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation Zexiong Ma et.al. 2412.14905 null
2024-12-19 Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering Peize Li et.al. 2412.14880 null
2024-12-19 Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering Imed Keraghel et.al. 2412.14867 null
2024-12-19 Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling Junyi Li et.al. 2412.14860 null
2024-12-19 DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis Hongling Xu et.al. 2412.14849 link
2024-12-19 Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas Pietro Bernardelle et.al. 2412.14843 null
2024-12-19 Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis Greta Dolcetti et.al. 2412.14841 null
2024-12-19 Progressive Multimodal Reasoning via Active Retrieval Guanting Dong et.al. 2412.14835 null
2024-12-19 Answer Set Networks: Casting Answer Set Programming into Deep Learning Arseny Skryagin et.al. 2412.14814 link
2024-12-19 ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis Zeao Tu et.al. 2412.14809 link
2024-12-19 Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning Ziang Ye et.al. 2412.14780 null
2024-12-19 ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine Rabee Qasem et.al. 2412.14771 null
2024-12-19 PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children Yiqun Zhang et.al. 2412.14769 link
2024-12-19 CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering Ruida Hu et.al. 2412.14764 link
2024-12-19 Query pipeline optimization for cancer patient question answering systems Maolin He et.al. 2412.14751 null
2024-12-19 Active Inference and Human–Computer Interaction Roderick Murray-Smith et.al. 2412.14741 null
2024-12-19 On Verbalized Confidence Scores for LLMs Daniel Yang et.al. 2412.14737 link
2024-12-19 Creation of AI-driven Smart Spaces for Enhanced Indoor Environments – A Survey Aygün Varol et.al. 2412.14708 null
2024-12-19 LLMs as mediators: Can they diagnose conflicts accurately? Özgecan Koçak et.al. 2412.14675 null
2024-12-19 Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT Hassane Kissane et.al. 2412.14670 null
2024-12-19 IOHunter: Graph Foundation Model to Uncover Online Information Operations Marco Minici et.al. 2412.14663 link
2024-12-19 Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models Zijun Chen et.al. 2412.14660 link
2024-12-19 Length Controlled Generation for Black-box LLMs Yuxuan Gu et.al. 2412.14656 null
2024-12-19 Learning to Generate Research Idea with Dynamic Control Ruochen Li et.al. 2412.14626 null
2024-12-19 How good is GPT at writing political speeches for the White House? Jacques Savoy et.al. 2412.14617 null
2024-12-19 Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning Kepu Zhang et.al. 2412.14588 null
2024-12-19 HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning Minkuk Kim et.al. 2412.14585 null
2024-12-19 Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues Tao He et.al. 2412.14584 null
2024-12-19 CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation Youngwon Lee et.al. 2412.14581 null
2024-12-19 DiffSim: Taming Diffusion Models for Evaluating Visual Similarity Yiren Song et.al. 2412.14580 link
2024-12-19 Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models Wenhan Liu et.al. 2412.14574 link
2024-12-19 ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model Shunlin Lu et.al. 2412.14559 null
2024-12-19 The Current Challenges of Software Engineering in the Era of Large Language Models Cuiyun Gao et.al. 2412.14554 null
2024-12-19 Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models Xiao Cui et.al. 2412.14528 link
2024-12-19 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment Teng Xiao et.al. 2412.14516 link
2024-12-19 Relational Programming with Foundation Models Ziyang Li et.al. 2412.14515 null
2024-12-19 PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization Jiayi Wu et.al. 2412.14510 link
2024-12-19 Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs Yuzuki Arai et.al. 2412.14501 null
2024-12-19 Guided Diffusion Model for Sensor Data Obfuscation Xin Yang et.al. 2412.14499 null
2024-12-19 FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis Abdullah Khan et.al. 2412.14492 link
2024-12-19 Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities Amandeep Kaur et.al. 2412.14486 null
2024-12-19 DirectorLLM for Human-Centric Video Generation Kunpeng Song et.al. 2412.14484 null
2024-12-19 Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs Koshiro Saito et.al. 2412.14471 null
2024-12-19 Agent-SafetyBench: Evaluating the Safety of LLM Agents Zhexin Zhang et.al. 2412.14470 link
2024-12-19 From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research Xiang Cheng et.al. 2412.14461 null
2024-12-19 LEDiff: Latent Exposure Diffusion for HDR Generation Chao Wang et.al. 2412.14456 null
2024-12-19 Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems Genki Kusano et.al. 2412.14454 null
2024-12-19 Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation Shengqi Liu et.al. 2412.14453 null
2024-12-19 ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study Eric Modesitt et.al. 2412.14436 link
2024-12-19 All-in-One Tuning and Structural Pruning for Domain-Specific LLMs Lei Lu et.al. 2412.14426 null
2024-12-19 FedPIA – Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning Pramit Saha et.al. 2412.14424 null
2024-12-19 Enhancing Diffusion Models for High-Quality Image Generation Jaineet Shah et.al. 2412.14422 null
2024-12-18 ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers Haowei Liu et.al. 2412.14405 null
2024-12-18 Clinical Trials Ontology Engineering with Large Language Models Berkan Çakır et.al. 2412.14387 null
2024-12-18 ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling William Han et.al. 2412.14373 link
2024-12-18 Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models’ Character Understanding Evaluation Yuxuan Jiang et.al. 2412.14368 null
2024-12-18 Surrealistic-like Image Generation with Vision-Language Models Elif Ayten et.al. 2412.14366 link
2024-12-18 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena et.al. 2412.14363 link
2024-12-18 A Unifying Information-theoretic Perspective on Evaluating Generative Models Alexis Fox et.al. 2412.14340 null
2024-12-18 Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation Benjamin Steenhoek et.al. 2412.14308 null
2024-12-18 Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs David Restrepo et.al. 2412.14304 null
2024-12-18 Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data haina Raza et.al. 2412.14276 link
2024-12-18 Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Jihan Yang et.al. 2412.14171 link
2024-12-18 MetaMorph: Multimodal Understanding and Generation via Instruction Tuning Shengbang Tong et.al. 2412.14164 null
2024-12-18 TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Frank F. Xu et.al. 2412.14161 link
2024-12-18 Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models Atin Sakkeer Hussain et.al. 2412.14146 null
2024-12-18 LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research Tianyang Gu et.al. 2412.14141 null

Video Understanding

Publish Date Title Authors PDF Code
2025-03-24 Aether: Geometric-Aware Unified World Modeling Aether Team et.al. 2503.18945 null
2025-03-24 SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding Mingze Xu et.al. 2503.18943 null
2025-03-24 Video-T1: Test-Time Scaling for Video Generation Fangfu Liu et.al. 2503.18942 null
2025-03-24 Training-free Diffusion Acceleration with Bottleneck Sampling Ye Tian et.al. 2503.18940 null
2025-03-24 CRCL: Causal Representation Consistency Learning for Anomaly Detection in Surveillance Videos Yang Liu et.al. 2503.18808 null
2025-03-24 Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks Nina Shvetsova et.al. 2503.18637 null
2025-03-25 AMD-Hummingbird: Towards an Efficient Text-to-Video Model Takashi Isobe et.al. 2503.18559 null
2025-03-24 EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation Qiang Qu et.al. 2503.18552 null
2025-03-24 Can Text-to-Video Generation help Video-Language Alignment? Luca Zanella et.al. 2503.18507 null
2025-03-24 Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding Xiangrui Liu et.al. 2503.18478 null
2025-03-24 Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation Dingcheng Zhen et.al. 2503.18429 null
2025-03-24 Breaking the Encoder Barrier for Seamless Video-Language Understanding Handong Li et.al. 2503.18422 null
2025-03-25 VTD-CLIP: Video-to-Text Discretization via Prompting CLIP Wencheng Zhu et.al. 2503.18407 null
2025-03-24 Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance Sicong Feng et.al. 2503.18386 null
2025-03-23 MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps Valentin Gabeff et.al. 2503.18223 null
2025-03-23 LongDiff: Training-Free Long Video Generation in One Go Zhuoling Li et.al. 2503.18150 null
2025-03-23 TransAnimate: Taming Layer Diffusion to Generate RGBA Video Xuewei Chen et.al. 2503.17934 null
2025-03-22 4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding Wenxuan Zhu et.al. 2503.17827 null
2025-03-22 V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction Yiming Zhao et.al. 2503.17736 null
2025-03-22 RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation Zhiqiang Yuan et.al. 2503.17735 null
2025-03-22 Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization Zhuo Tao et.al. 2503.17651 null
2025-03-21 Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks Bhishma Dedhia et.al. 2503.17539 null
2025-03-21 Enhancing Subsequent Video Retrieval via Vision-Language Models (VLMs) Yicheng Duan et.al. 2503.17415 null
2025-03-21 Position: Interactive Generative Video as Next-Generation Game Engine Jiwen Yu et.al. 2503.17359 null
2025-03-21 PVChat: Personalized Video Chat with One-Shot Learning Yufei Shi et.al. 2503.17069 null
2025-03-21 AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process Junjie Hu et.al. 2503.17029 null
2025-03-21 Enabling Versatile Controls for Video Diffusion Models Xu Zhang et.al. 2503.16983 null
2025-03-25 Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model Yingying Fan et.al. 2503.16942 null
2025-03-21 Temporal Action Detection Model Compression by Progressive Block Drop Xiaoyong Chen et.al. 2503.16916 null
2025-03-17 Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow Yongjia Ma et.al. 2503.16522 null
2025-03-20 XAttention: Block Sparse Attention with Antidiagonal Scoring Ruyi Xu et.al. 2503.16428 link
2025-03-20 MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance Quanhao Li et.al. 2503.16421 null
2025-03-20 ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos Haolin Yang et.al. 2503.16400 null
2025-03-20 PoseTraj: Pose-Aware Trajectory Control in Video Diffusion Longbin Ji et.al. 2503.16068 null
2025-03-20 Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models Zhihang Liu et.al. 2503.16036 null
2025-03-20 Agentic Keyframe Search for Video Question Answering Sunqi Fan et.al. 2503.16032 link
2025-03-20 Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models Marc Benedí San Millán et.al. 2503.15996 null
2025-03-25 STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding Zichen Liu et.al. 2503.15973 null
2025-03-20 DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering Haochen Wang et.al. 2503.15887 null
2025-03-20 MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving Haiguang Wang et.al. 2503.15875 link
2025-03-20 MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations Kyungho Bae et.al. 2503.15871 null
2025-03-20 VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling Hyojun Go et.al. 2503.15855 null
2025-03-20 What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation? Xuanming Cui et.al. 2503.15846 null
2025-03-19 Temporal Regularization Makes Your Video Generator Stronger Harold Haodong Chen et.al. 2503.15417 null
2025-03-20 VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention Mingzhe Zheng et.al. 2503.15138 null
2025-03-19 Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering Thanh-Son Nguyen et.al. 2503.14957 null
2025-03-19 FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding Chongjun Tu et.al. 2503.14935 null
2025-03-18 MusicInfuser: Making Video Diffusion Listen and Dance Susung Hong et.al. 2503.14505 null
2025-03-18 MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation Hongyu Zhang et.al. 2503.14428 null
2025-03-18 Impossible Videos Zechen Bai et.al. 2503.14378 null
2025-03-18 LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models Yu Cheng et.al. 2503.14325 link
2025-03-18 Concat-ID: Towards Universal Identity-Preserving Video Synthesis Yong Zhong et.al. 2503.14151 null
2025-03-18 Fast Autoregressive Video Generation with Diagonal Decoding Yang Ye et.al. 2503.14070 null
2025-03-18 AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark Xinhao Xiang et.al. 2503.14064 link
2025-03-18 SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability Jiankang Wang et.al. 2503.13983 null
2025-03-18 Improving LLM Video Understanding with 16 Frames Per Second Yixuan Li et.al. 2503.13956 null
2025-03-17 Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition Shristi Das Biswas et.al. 2503.13724 null
2025-03-17 Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory Saket Gurukar et.al. 2503.13707 null
2025-03-17 Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos Chiara Plizzari et.al. 2503.13646 link
2025-03-17 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Ye Liu et.al. 2503.13444 null
2025-03-17 Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding Weiyu Guo et.al. 2503.13139 null
2025-03-17 Efficient Motion-Aware Video MLLM Zijia Zhao et.al. 2503.13016 null
2025-03-17 Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction Zheyuan Liu et.al. 2503.12953 null
2025-03-17 VITED: Video Temporal Evidence Distillation Yujie Lu et.al. 2503.12855 null
2025-03-17 AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations Quang Trung Truong et.al. 2503.12828 null
2025-03-17 ViSpeak: Visual Instruction Feedback in Streaming Videos Shenghao Fu et.al. 2503.12769 null
2025-03-16 AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding Xiao Wang et.al. 2503.12559 link
2025-03-16 SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs Guibiao Liao et.al. 2503.12535 null
2025-03-16 Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma? Tianyuan Qu et.al. 2503.12496 null
2025-03-16 Causality Model for Semantic Understanding on Videos Li Yicong et.al. 2503.12447 null
2025-03-16 VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining Yunze Liu et.al. 2503.12332 null
2025-03-15 A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI Paula Andrea Pérez-Toro et.al. 2503.12102 null
2025-03-15 SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering Byeongjun Park et.al. 2503.12024 link
2025-03-14 ReCamMaster: Camera-Controlled Generative Rendering from A Single Video Jianhong Bai et.al. 2503.11647 null
2025-03-14 Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers Weiming Ren et.al. 2503.11579 null
2025-03-14 HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Ziqin Zhou et.al. 2503.11513 null
2025-03-14 V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Zixu Cheng et.al. 2503.11495 null
2025-03-14 TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation Hongxiang Zhao et.al. 2503.11423 null
2025-03-14 Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding David Gastager et.al. 2503.11392 null
2025-03-14 Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model Haoyang Huang et.al. 2503.11251 link
2025-03-14 LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs Leqi Shen et.al. 2503.11205 null
2025-03-14 Cross-Modal Learning for Music-to-Music-Video Description Generation Zhuoyuan Mao et.al. 2503.11190 null
2025-03-24 Large-scale Pre-training for Grounded Video Caption Generation Evangelos Kazakos et.al. 2503.10781 link
2025-03-13 Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing Yudong Liu et.al. 2503.10742 link
2025-03-12 Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework Jing Wang et.al. 2503.10704 null
2025-03-12 Neighboring Autoregressive Modeling for Efficient Visual Generation Yefei He et.al. 2503.10696 link
2025-03-12 Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation Qiji Zhou et.al. 2503.10691 null
2025-03-11 VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion Lehan Yang et.al. 2503.10678 link
2025-03-13 CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models Hao He et.al. 2503.10592 null
2025-03-13 Long Context Tuning for Video Generation Yuwei Guo et.al. 2503.10589 null
2025-03-13 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Wanhua Li et.al. 2503.10437 null
2025-03-13 CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance Yufan Deng et.al. 2503.10391 null
2025-03-13 LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents Boyu Chen et.al. 2503.10200 null
2025-03-13 StableFusion: Continual Video Retrieval via Frame Adaptation Zecheng Zhao et.al. 2503.10111 link
2025-03-13 Semantic Latent Motion for Portrait Video Generation Qiyuan Zhang et.al. 2503.10096 null
2025-03-16 VMBench: A Benchmark for Perception-Aligned Video Motion Generation Xinran Ling et.al. 2503.10076 link
2025-03-13 TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs Yunxiao Wang et.al. 2503.09994 null
2025-03-21 UVE: Are MLLMs Unified Evaluators for AI-Generated Videos? Yuanxin Liu et.al. 2503.09949 link
2025-03-13 VideoMerge: Towards Training-free Long Video Generation Siyang Zhang et.al. 2503.09926 null
2025-03-12 LuciBot: Automated Robot Policy Learning from Generated Videos Xiaowen Qiu et.al. 2503.09871 null
2025-03-14 On the Limitations of Vision-Language Models in Understanding Image Transforms Ahmad Mustafa Anis et.al. 2503.09837 null
2025-03-12 I2V3D: Controllable image-to-video generation with 3D guidance Zhiyuan Zhang et.al. 2503.09733 null
2025-03-12 Accelerating Diffusion Sampling via Exploiting Local Transition Coherence Shangwen Zhu et.al. 2503.09675 null
2025-03-12 Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k Xiangyu Peng et.al. 2503.09642 link
2025-03-11 V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video Jianqi Chen et.al. 2503.09631 null
2025-03-12 PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop Chenyu Li et.al. 2503.09595 link
2025-03-13 BIMBA: Selective-Scan Compression for Long-Range Video Question Answering Md Mohaiminul Islam et.al. 2503.09590 link
2025-03-12 VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Kevin Qinghong Lin et.al. 2503.09402 link
2025-03-17 VideoScan: Enabling Efficient Streaming Video Understanding via Frame-level Semantic Carriers Ruanjun Li et.al. 2503.09387 null
2025-03-12 Unified Dense Prediction of Video Diffusion Lehan Yang et.al. 2503.09344 null
2025-03-12 Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption Luozheng Qin et.al. 2503.09279 null
2025-03-17 Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space Jian Zhu et.al. 2503.09215 null
2025-03-13 Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model Ali Vosoughi et.al. 2503.09205 null
2025-03-15 WonderVerse: Extendable 3D Scene Generation with Video Generative Models Hao Feng et.al. 2503.09160 null
2025-03-13 FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models Fufangchen Zhao et.al. 2503.09158 null
2025-03-17 Reangle-A-Video: 4D Video Generation as Video-to-Video Translation Hyeonho Jeong et.al. 2503.09151 null
2025-03-12 Memory-enhanced Retrieval Augmentation for Long Video Understanding Huaying Yuan et.al. 2503.09149 null
2025-03-12 Generative Frame Sampler for Long Video Understanding Linli Yao et.al. 2503.09146 null
2025-03-12 Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding Haoyu Zhang et.al. 2503.09143 null
2025-03-12 Everything Can Be Described in Words: A Simple Unified Multi-Modal Framework with Semantic and Temporal Alignment Xiaowei Bi et.al. 2503.09081 null
2025-03-12 Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization Zongshang Pang et.al. 2503.09027 null
2025-03-11 QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension Yongdong Luo et.al. 2503.08689 link
2025-03-11 REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder Yitian Zhang et.al. 2503.08665 null
2025-03-11 Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling Subin Kim et.al. 2503.08605 null
2025-03-11 HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding Shehreen Azad et.al. 2503.08585 null
2025-03-11 RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding Xichen Tan et.al. 2503.08576 null
2025-03-11 Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos Soumya Shamarao Jahagirdar et.al. 2503.08335 null
2025-03-12 $^R$ FLAV: Rolling Flow matching for infinite Audio Video generation Alex Ergasti et.al. 2503.08307 link
2025-03-11 WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation Jing Wang et.al. 2503.08153 null
2025-03-11 ObjectMover: Generative Object Movement with Video Prior Xin Yu et.al. 2503.08037 null
2025-03-11 How Can Video Generative AI Transform K-12 Education? Examining Teachers’ Perspectives through TPACK and TAM Unggi Lee et.al. 2503.08003 null
2025-03-10 BEARCUBS: A benchmark for computer-using web agents Yixiao Song et.al. 2503.07919 null
2025-03-10 DreamRelation: Relation-Centric Video Customization Yujie Wei et.al. 2503.07602 null
2025-03-11 VACE: All-in-One Video Creation and Editing Zeyinzi Jiang et.al. 2503.07598 null
2025-03-10 AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion Mingzhen Sun et.al. 2503.07418 null
2025-03-10 Automated Movie Generation via Multi-Agent CoT Planning Weijia Wu et.al. 2503.07314 link
2025-03-10 ALLVB: All-in-One Long Video Understanding Benchmark Xichen Tan et.al. 2503.07298 null
2025-03-10 Towards Fine-Grained Video Question Answering Wei Dai et.al. 2503.06820 null
2025-03-09 VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation Hritik Bansal et.al. 2503.06800 null
2025-03-09 TR-DQ: Time-Rotation Diffusion Quantization Yihua Shao et.al. 2503.06564 null
2025-03-09 QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation Junyi Wu et.al. 2503.06545 link
2025-03-09 TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos Chen-Lin Zhang et.al. 2503.06526 link
2025-03-11 LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation Quanjian Song et.al. 2503.06508 null
2025-03-09 Generative Video Bi-flow Chen Liu et.al. 2503.06364 null
2025-03-08 Text2Story: Advancing Video Storytelling with Text Guidance Taewon Kang et.al. 2503.06310 null
2025-03-08 Get In Video: Add Anything You Want to the Video Shaobin Zhuang et.al. 2503.06268 null
2025-03-12 Object-Centric World Model for Language-Guided Manipulation Youngjoon Jeong et.al. 2503.06170 null
2025-03-08 VACT: A Video Automatic Causal Testing System and a Benchmark Haotong Yang et.al. 2503.06163 null
2025-03-08 GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation Ye Tao et.al. 2503.06136 null
2025-03-08 DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation Runze Zhang et.al. 2503.06053 null
2025-03-07 MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice Hongwei Yi et.al. 2503.05978 null
2025-03-07 MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio Xuenan Xu et.al. 2503.05242 link
2025-03-07 Unified Reward Model for Multimodal Understanding and Generation Yibin Wang et.al. 2503.05236 null
2025-03-13 Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions Chan Hur et.al. 2503.05186 null
2025-03-06 Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation Alexey Buzovkin et.al. 2503.04871 link
2025-03-05 ProReflow: Progressive Reflow with Decomposed Velocity Lei Ke et.al. 2503.04824 null
2025-03-04 LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Zhibin Lan et.al. 2503.04812 null
2025-03-06 FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video Yue Gao et.al. 2503.04720 null
2025-03-06 What Are You Doing? A Closer Look at Controllable Human Video Generation Emanuele Bugliarello et.al. 2503.04666 null
2025-03-08 The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation Aoxiong Yin et.al. 2503.04606 null
2025-03-06 Token-Efficient Long Video Understanding for Multimodal LLMs Jindong Jiang et.al. 2503.04130 null
2025-03-06 EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models Haiyang Yu et.al. 2503.04058 null
2025-03-05 EgoLife: Towards Egocentric Life Assistant Jingkang Yang et.al. 2503.03803 null
2025-03-05 GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Xuanchi Ren et.al. 2503.03751 link
2025-03-08 Rethinking Video Tokenization: A Conditioned Diffusion-based Approach Nianzu Yang et.al. 2503.03708 null
2025-03-05 DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance Zhao Yang et.al. 2503.03689 link
2025-03-06 Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection Wenqiao Li et.al. 2503.03562 null
2025-03-05 High-Quality Virtual Single-Viewpoint Surgical Video: Geometric Autocalibration of Multiple Cameras in Surgical Lights Yuna Kato et.al. 2503.03558 link
2025-03-13 Video Super-Resolution: All You Need is a Video Diffusion Model Zhihao Zhan et.al. 2503.03355 null
2025-03-04 GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning Zhun Mou et.al. 2503.02341 null
2025-02-26 Online Pseudo-average Shifting Attention(PASA) for Robust Low-precision LLM Inference: Algorithms and Numerical Analysis Long Cheng et.al. 2503.01873 null
2025-03-03 VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation Wenhao Wang et.al. 2503.01739 null
2025-03-03 Learning to Generate Long-term Future Narrations Describing Activities of Daily Living Ramanathan Rajendiran et.al. 2503.01416 null
2025-03-03 Composed Multi-modal Retrieval: A Survey of Approaches and Applications Kun Zhang et.al. 2503.01334 link
2025-03-03 Parameter-free Video Segmentation for Vision and Language Understanding Louis Mahon et.al. 2503.01201 null
2025-03-03 VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors Juil Koo et.al. 2503.01107 null
2025-03-02 Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning Baoqi Pei et.al. 2503.00986 link
2025-03-02 Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think Jie Tian et.al. 2503.00948 link
2025-03-04 An Efficient 3D Convolutional Neural Network with Channel-wise, Spatial-grouped, and Temporal Convolutions Zhe Wang et.al. 2503.00796 null
2025-03-01 Streaming Video Question-Answering with In-context Video KV-Cache Retrieval Shangzhe Di et.al. 2503.00540 link
2025-03-10 Learning to Animate Images from A Few Videos to Portray Delicate Human Actions Haoxin Li et.al. 2503.00276 null
2025-03-04 Unified Video Action Model Shuang Li et.al. 2503.00200 null
2025-02-28 PreMind: Multi-Agent Video Understanding for Advanced Indexing of Presentation-style Videos Kangda Wei et.al. 2503.00162 null
2025-02-26 Glad: A Streaming Scene Generator for Autonomous Driving Bin Xie et.al. 2503.00045 null
2025-02-25 An Analysis of Segment Anything 2 Clayton Bromley et.al. 2503.00042 null
2025-03-07 Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos Zhiyu Tan et.al. 2502.21314 null
2025-02-28 Adaptive Keyframe Sampling for Long Video Understanding Xi Tang et.al. 2502.21271 null
2025-02-28 Training-free and Adaptive Sparse Attention for Efficient Long Video Generation Yifei Xia et.al. 2502.21079 null
2025-02-28 HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models Xiao Wang et.al. 2502.20811 null
2025-02-28 WorldModelBench: Judging Video Generation Models As World Models Dacheng Li et.al. 2502.20694 null
2025-02-27 OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection Shuming Liu et.al. 2502.20361 link
2025-02-27 Mobius: Text to Seamless Looping Video Generation via Latent Shift Xiuli Bi et.al. 2502.20307 link
2025-02-27 FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Sotiris Anagnostidis et.al. 2502.20126 null
2025-02-27 C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation Yuhao Li et.al. 2502.19868 link
2025-02-27 M-LLM Based Video Frame Selection for Efficient Video Understanding Kai Hu et.al. 2502.19680 null
2025-02-26 FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion mode Lingzhou Mu et.al. 2502.19455 null
2025-03-03 TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis Menghao Li et.al. 2502.19454 null
2025-02-26 InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model Fengbin Guan et.al. 2502.19026 null
2025-02-25 SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference Jintao Zhang et.al. 2502.18137 link
2025-02-25 ASurvey: Spatiotemporal Consistency in Video Generation Zhiyu Yin et.al. 2502.17863 null
2025-02-26 Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric Videos Luigi Seminara et.al. 2502.17753 link
2025-02-24 X-Dancer: Expressive Music to Human Dance Video Generation Zeyuan Chen et.al. 2502.17414 null
2025-02-24 VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Xiangpeng Yang et.al. 2502.17258 null
2025-02-24 Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions Zhong Li et.al. 2502.17119 link
2025-02-23 Fine-Grained Video Captioning through Scene Graph Consolidation Sanghyeok Chu et.al. 2502.16427 null
2025-02-21 RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers Min Zhao et.al. 2502.15894 null
2025-02-21 VaViM and VaVAM: Autonomous Driving through Video Generative Modeling Florent Bartoccioni et.al. 2502.15672 link
2025-03-01 LongCaptioning: Unlocking the Power of Long Video Caption Generation in Large Multimodal Models Hongchen Wei et.al. 2502.15393 null
2025-02-21 Weakly Supervised Video Scene Graph Generation via Natural Language Supervision Kibum Kim et.al. 2502.15370 link
2025-02-21 TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Xiuwei Chen et.al. 2502.15130 null
2025-02-20 Can Hallucination Correction Improve Video-Language Alignment? Lingjun Zhao et.al. 2502.15079 null
2025-02-20 Hardware-Friendly Static Quantization Method for Video Diffusion Transformers Sanghyun Yi et.al. 2502.15077 null
2025-02-20 LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection Qingyuan Liu et.al. 2502.14994 null
2025-02-20 Improving the Diffusability of Autoencoders Ivan Skorokhodov et.al. 2502.14831 null
2025-03-04 AVD2: Accident Video Diffusion for Accident Video Description Cheng Li et.al. 2502.14801 null
2025-02-28 RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers Ke Cao et.al. 2502.14377 null
2025-02-20 Designing Parameter and Compute Efficient Diffusion Transformers using Distillation Vignesh Sundaresha et.al. 2502.14226 null
2025-02-19 FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation Yunpeng Zhang et.al. 2502.13995 link
2025-02-19 Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning Caihua Liu et.al. 2502.13754 null
2025-02-19 Pretrained Image-Text Models are Secretly Video Captioners Chunhui Zhang et.al. 2502.13363 link
2025-02-19 LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation Junchen Fu et.al. 2502.12945 null
2025-02-18 VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation Xinlong Chen et.al. 2502.12782 null
2025-02-18 MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation Sihyun Yu et.al. 2502.12632 null
2025-03-10 MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos Huaying Yuan et.al. 2502.12558 null
2025-02-21 LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities Florian Sestak et.al. 2502.12128 link
2025-02-17 DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation Zhihang Yuan et.al. 2502.11897 link
2025-02-17 video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model Guangzhi Sun et.al. 2502.11775 null
2025-02-18 Open-Ended and Knowledge-Intensive Video Question Answering Md Zarif Ul Alam et.al. 2502.11747 null
2025-02-17 VRoPE: Rotary Position Embedding for Video Large Language Models Zikang Liu et.al. 2502.11664 link
2025-02-17 Object-Centric Image to Video Generation with Language Guidance Angel Villar-Corrales et.al. 2502.11655 null
2025-02-18 iMOVE: Instance-Motion-Aware Video Understanding Jiaze Li et.al. 2502.11594 null
2025-02-16 MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation Michael Fuest et.al. 2502.11234 null
2025-02-16 Phantom: Subject-consistent video generation via cross-modal alignment Lijie Liu et.al. 2502.11079 null
2025-02-15 SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding Zhenyu Yang et.al. 2502.10810 null
2025-02-15 Semantics-aware Test-time Adaptation for 3D Human Pose Estimation Qiuxia Lin et.al. 2502.10724 null
2025-02-24 Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Guoqing Ma et.al. 2502.10248 link
2025-02-14 RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control Teng Li et.al. 2502.10059 null
2025-02-14 Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering Mark Beliaev et.al. 2502.09573 null
2025-02-14 GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation Hongyin Zhang et.al. 2502.09268 null
2025-02-12 CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation Qinghe Wang et.al. 2502.08639 null
2025-02-12 FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis Wonjoon Jin et.al. 2502.08244 null
2025-02-12 Learning Human Skill Generators at Key-Step Levels Yilu Wu et.al. 2502.08234 null
2025-02-12 AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance Zhao Wang et.al. 2502.08189 null
2025-02-10 Pre-Trained Video Generative Models as World Simulators Haoran He et.al. 2502.07825 null
2025-02-12 Next Block Prediction: Video Generation via Semi-Autoregressive Modeling Shuhuai Ren et.al. 2502.07737 null
2025-02-17 Magic 1-For-1: Generating One Minute Video Clips within One Minute Hongwei Yi et.al. 2502.07701 link
2025-02-12 VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation Sixiao Zheng et.al. 2502.07531 null
2025-02-27 Enhance-A-Video: Better Generated Video for Free Yang Luo et.al. 2502.07508 link
2025-02-11 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering Sheng Zhou et.al. 2502.07411 link
2025-02-11 Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos Haowen Gao et.al. 2502.07327 null
2025-02-11 Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization Aditya Vora et.al. 2502.07278 null
2025-02-11 Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis Amir Hosein Fadaei et.al. 2502.07277 null
2025-02-11 Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation Pinxin Liu et.al. 2502.07239 null
2025-02-11 A Survey on Mamba Architecture for Vision Applications Fady Ibrahim et.al. 2502.07161 null
2025-02-10 Lotus: Creating Short Videos From Long Videos With Abstractive and Extractive Summarization Aadit Barua et.al. 2502.07096 null
2025-02-19 Conditional diffusion model with spatial attention and latent embedding for medical image segmentation Behzad Hejrati et.al. 2502.06997 link
2025-02-06 TorchResist: Open-Source Differentiable Resist Simulator Zixiao Wang et.al. 2502.06838 link
2025-02-17 Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models Shuting Wang et.al. 2502.06812 null
2025-02-12 Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT Dongyang Liu et.al. 2502.06782 null
2025-02-10 History-Guided Video Diffusion Kiwhan Song et.al. 2502.06764 null
2025-02-10 Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists Bojia Zi et.al. 2502.06734 null
2025-02-27 TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Yangguang Li et.al. 2502.06608 null
2025-02-27 A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems Linxiao Gong et.al. 2502.06581 null
2025-02-20 CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers D. She et.al. 2502.06527 null
2025-02-11 CoS: Chain-of-Shot Prompting for Long Video Understanding Jian Hu et.al. 2502.06428 null
2025-02-17 Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile Hangliang Ding et.al. 2502.06155 null
2025-02-09 Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding Xingjian Diao et.al. 2502.06020 link
2025-02-09 Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search Hengzhu Tang et.al. 2502.05924 null
2025-02-08 Towards AI-driven Sign Language Generation with Non-manual Markers Han Zhang et.al. 2502.05661 null
2025-02-08 Training-Free Constrained Generation With Stable Diffusion Models Stefano Zampini et.al. 2502.05625 null
2025-02-18 A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction Yongfan Chen et.al. 2502.05503 link
2025-02-08 Content-based Video Retrieval in Traffic Videos using Latent Dirichlet Allocation Topic Model Mohammad Kianpisheh et.al. 2502.05457 null
2025-02-28 FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation Shilong Zhang et.al. 2502.05179 link
2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray Yunhang Shen et.al. 2502.05177 link
2025-02-07 VideoRoPE: What Makes for Good Video Rotary Position Embedding? Xilin Wei et.al. 2502.05173 link
2025-02-10 Goku: Flow Based Video Generative Foundation Models Shoufa Chen et.al. 2502.04896 null
2025-02-10 HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation Qijun Gan et.al. 2502.04847 null
2025-02-06 Fast Video Generation with Sliding Tile Attention Peiyuan Zhang et.al. 2502.04507 null
2025-02-06 UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation Wenzhang Sun et.al. 2502.04393 null
2025-02-05 On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices Bosung Kim et.al. 2502.04363 link
2025-02-06 WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs Jack Hong et.al. 2502.04326 null
2025-02-06 MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation Jinbo Xing et.al. 2502.04299 null
2025-02-06 Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression Lirui Wang et.al. 2502.04296 null
2025-02-06 Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency Shangkun Sun et.al. 2502.04076 link
2025-02-08 UniForm: A Unified Diffusion Transformer for Audio-Video Generation Lei Zhao et.al. 2502.03897 null
2025-02-05 Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Yunuo Chen et.al. 2502.03639 null
2025-02-19 FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise Yunlong Yuan et.al. 2502.03496 null
2025-02-05 SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living Arkaprava Sinha et.al. 2502.03459 null
2025-02-05 MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent Xinyao Liao et.al. 2502.03207 null
2025-02-27 MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding Pengyi Li et.al. 2502.03183 null
2025-02-05 A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions Hao Yin et.al. 2502.02817 null
2025-02-04 Controllable Video Generation with Provable Disentanglement Yifan Shen et.al. 2502.02690 null
2025-02-03 Secure & Personalized Music-to-Video Generation via CHARCHA Mehul Agarwal et.al. 2502.02610 null
2025-02-04 VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Hila Chefer et.al. 2502.02492 null
2025-02-04 Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task Perspectives Simone Alberto Peirone et.al. 2502.02487 link
2025-02-04 TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes Xingcheng Zhou et.al. 2502.02449 null
2025-02-06 LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models Tzu-Tao Chang et.al. 2502.02406 null
2025-02-05 IPO: Iterative Preference Optimization for Text-to-Video Generation Xiaomeng Yang et.al. 2502.02088 null
2025-02-03 VILP: Imitation Learning with Latent Video Planning Zhengtong Xu et.al. 2502.01784 link
2025-02-03 Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity Haocheng Xi et.al. 2502.01776 null
2025-02-07 MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation Haibo Tong et.al. 2502.01719 null
2025-02-02 HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment Lifan Jiang et.al. 2502.01690 null
2025-02-03 VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos Xubin Ren et.al. 2502.01549 link
2025-02-03 Improved Training Technique for Latent Consistency Models Quan Dao et.al. 2502.01441 null
2025-02-17 VidSketch: Hand-drawn Sketch-Driven Video Generation with Diffusion Control Lifan Jiang et.al. 2502.01101 link
2025-02-13 OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Gaojie Lin et.al. 2502.01061 null
2025-02-03 Pushing the Boundaries of State Space Models for Image and Video Generation Yicong Hong et.al. 2502.00972 null
2025-02-02 Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer Tao Ren et.al. 2502.00639 null
2025-02-04 Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation Yang Cao et.al. 2502.00500 null
2025-02-01 Masked Generative Nested Transformers with Decode Time Scaling Sahil Goyal et.al. 2502.00382 null
2025-02-01 Shape from Semantics: 3D Shape Generation from Multi-View Semantics Liangchen Li et.al. 2502.00360 null
2025-02-04 AIN: The Arabic INclusive Large Multimodal Model Ahmed Heakl et.al. 2502.00094 link
2025-01-31 Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search Yuta Oshima et.al. 2501.19252 null
2025-01-31 $\infty$ -Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Saul Santos et.al. 2501.19098 link
2025-01-30 Every Image Listens, Every Image Dances: Music-Driven Image Animation Zhikang Dong et.al. 2501.18801 null
2025-01-30 MAMS: Model-Agnostic Module Selection Framework for Video Captioning Sangho Lee et.al. 2501.18269 null
2025-01-28 Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding Yun Li et.al. 2501.16786 null
2025-01-28 CascadeV: An Implementation of Wurstchen Architecture for Video Generation Wenfeng Lin et.al. 2501.16612 link
2025-01-27 AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models Zheng Lian et.al. 2501.16566 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-26 TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Xingjian Zhang et.al. 2501.15513 link
2025-01-26 “See What I Imagine, Imagine What I See”: Human-AI Co-Creation System for 360 $^\circ$ Panoramic Video Generation in VR Yunge Wen et.al. 2501.15456 null
2025-01-25 HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding Jiaxing Zhao et.al. 2501.15111 null
2025-01-25 VideoPure: Diffusion-based Adversarial Purification for Video Recognition Kaixun Jiang et.al. 2501.14999 link
2025-01-11 HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators Le Chen et.al. 2501.14794 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-24 ENTER: Event Based Interpretable Reasoning for VideoQA Hammad Ayyubi et.al. 2501.14194 null
2025-01-30 Temporal Preference Optimization for Long-Form Video Understanding Rui Li et.al. 2501.13919 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Jianxin Liang et.al. 2501.13536 null
2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link
2025-01-23 EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion Jiangchuan Wei et.al. 2501.13452 null
2025-01-28 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-21 Taming Teacher Forcing for Masked Autoregressive Video Generation Deyu Zhou et.al. 2501.12389 null
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Yuhang Zang et.al. 2501.12368 link
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Zheng Chong et.al. 2501.11325 link
2025-02-03 HFGCN:Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition Pengcheng Dong et.al. 2501.11007 null
2025-01-18 EMO2: End-Effector Guided Audio-Driven Avatar Video Generation Linrui Tian et.al. 2501.10687 null
2025-01-17 DiffuEraser: A Diffusion Model for Video Inpainting Xiaowen Li et.al. 2501.10018 link
2025-02-02 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Yuefan Cao et.al. 2501.09982 null
2025-01-16 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Zhongwei Ren et.al. 2501.09781 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755 null
2025-02-10 Do generative video models learn physical principles from watching videos? Saman Motamed et.al. 2501.09038 link
2025-01-15 Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion Jingyuan Chen et.al. 2501.09019 null
2025-01-15 RepVideo: Rethinking Cross-Layer Representation for Video Generation Chenyang Si et.al. 2501.08994 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-31 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-14 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Weichen Fan et.al. 2501.08453 null
2025-01-14 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering Meenakshi Krishnan et.al. 2501.08370 null
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 GameFactory: Creating New Games with Generative Interactive Videos Jiwen Yu et.al. 2501.08325 null
2025-01-14 Diffusion Adversarial Post-Training for One-Step Video Generation Shanchuan Lin et.al. 2501.08316 null
2025-01-17 LayerAnimate: Layer-specific Control for Animation Yuxue Yang et.al. 2501.08295 null
2025-01-14 FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Yabo Zhang et.al. 2501.08225 link
2025-01-14 Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jiaxing Zhao et.al. 2501.07978 link
2025-01-24 Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Liping Yuan et.al. 2501.07888 link
2025-01-14 AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation Sitong Gong et.al. 2501.07810 link
2025-01-13 BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Weixi Feng et.al. 2501.07647 null
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-17 MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning Tieyuan Chen et.al. 2501.07227 null
2025-01-13 TimeLogic: A Temporal Logic Benchmark for Video QA Sirnam Swetha et.al. 2501.07214 null
2025-01-13 Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling Jiebin Yan et.al. 2501.07087 null
2025-01-12 X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding Wenqi Zhou et.al. 2501.06835 null
2025-01-12 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Ji Soo Lee et.al. 2501.06761 link
2025-01-11 Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning Maomao Li et.al. 2501.06438 null
2025-01-10 MEt3R: Measuring Multi-View Consistency in Generated Images Mohammad Asim et.al. 2501.06336 null
2025-01-10 Multi-subject Open-set Personalization in Video Generation Tsai-Shien Chen et.al. 2501.06187 null
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-13 Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Ziheng Wu et.al. 2501.05901 link
2025-01-10 Zero-shot Shark Tracking and Biometrics from Aerial Imagery Chinmay K Lalgudi et.al. 2501.05717 null
2025-01-10 From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities Dominick Reilly et.al. 2501.05711 link
2025-01-09 OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Yifei Li et.al. 2501.05510 link
2025-01-08 Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion Yongjia Ma et.al. 2501.05484 null
2025-01-09 Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces Aniruddha Mahapatra et.al. 2501.05442 null
2025-01-09 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Huabin Liu et.al. 2501.05069 null
2025-01-09 LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding Jiaxing Zhao et.al. 2501.05067 null
2025-01-09 LongViTU: Instruction Tuning for Long-Form Video Understanding Rujie Wu et.al. 2501.05037 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-08 ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Yuzhou Huang et.al. 2501.04698 null
2025-01-08 Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs Zeyi Huang et.al. 2501.04336 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition Bowen Hao et.al. 2501.04204 null
2024-12-18 FlexCache: Flexible Approximate Cache System for Video Diffusion Desen Sun et.al. 2501.04012 null
2025-01-07 Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Yuechen Zhang et.al. 2501.03931 link
2025-01-09 Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Zekai Gu et.al. 2501.03847 link
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-06 License Plate Images Generation with Diffusion Models Mariia Shpir et.al. 2501.03374 null
2025-01-03 Classifier-Guided Captioning Across Modalities Ariel Shaulov et.al. 2501.03183 null
2025-01-06 Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Guy Yariv et.al. 2501.03059 null
2025-01-20 TransPixeler: Advancing Text-to-Video Generation with Transparency Luozhou Wang et.al. 2501.03006 link
2025-01-06 MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Wenyi Hong et.al. 2501.02955 null
2025-01-06 Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising Yunlong Yuan et.al. 2501.02741 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-29 Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey Zongxia Li et.al. 2501.02189 link
2025-01-10 Gender Bias in Text-to-Video Generation Models: A case study of Sora Mohammad Nadeem et.al. 2501.01987 null
2024-12-30 FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models Tianyu Fu et.al. 2501.01986 link
2025-01-03 JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing Qili Wang et.al. 2501.01798 link
2025-01-03 HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding Heqing Zou et.al. 2501.01645 null
2025-01-07 VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Yuanpeng Tu et.al. 2501.01427 null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 link
2025-01-03 Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions Xincheng Shuai et.al. 2501.01425 null
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-29 Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform Cheonsu Jeong et.al. 2501.00750 null
2025-01-03 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2025-01-08 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2024-12-31 Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Zhenpeng Huang et.al. 2501.00584 null
2024-12-31 Fine-grained Video-Text Retrieval: A New Benchmark and Method Yifan Xu et.al. 2501.00513 null
2024-12-31 OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models Lala Shakti Swarup Ray et.al. 2501.00432 null
2025-01-09 Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding Yue Fan et.al. 2501.00358 null
2024-12-30 Detection-Fusion for Knowledge Graph Extraction from Videos Taniya Das et.al. 2501.00136 link
2024-12-30 LTX-Video: Realtime Video Latent Diffusion Yoav HaCohen et.al. 2501.00103 link
2024-12-30 Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model Yifei Huang et.al. 2412.21080 link
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 Hierarchical Banzhaf Interaction for General Video-Language Representation Learning Peng Jin et.al. 2412.20964 link
2024-12-30 ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation Ting Zhang et.al. 2412.20901 null
2024-12-30 Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling Min Zhang et.al. 2412.20725 null
2025-01-05 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link
2024-12-29 Open-Sora: Democratizing Efficient Video Production for All Zangwei Zheng et.al. 2412.20404 link
2024-12-28 DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments Xijun Wang et.al. 2412.20042 null
2025-01-17 MVTamperBench: Evaluating Robustness of Vision-Language Models Amit Agarwal et.al. 2412.19794 null
2024-12-27 Generative Video Propagation Shaoteng Liu et.al. 2412.19761 null
2024-12-30 VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models Tao Wu et.al. 2412.19645 null
2024-12-30 DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-26 Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso et.al. 2412.19304 null
2024-12-25 Accelerating Diffusion Transformers with Dual Feature Caching Chang Zou et.al. 2412.18911 link
2024-12-24 Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation Faraz Waseem et.al. 2412.18688 null
2024-12-24 Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models Jinhui Yi et.al. 2412.18609 link
2024-12-24 DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers Yuntao Chen et.al. 2412.18607 null
2024-12-24 ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation Hongjie Li et.al. 2412.18600 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-23 Large Motion Video Autoencoding with Cross-modal Video VAE Yazhou Xing et.al. 2412.17805 null
2024-12-23 VidTwin: Video VAE with Decoupled Structure and Dynamics Yuchi Wang et.al. 2412.17726 link
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-23 VidCtx: Context-aware Video Question Answering with Image Models Andreas Goulas et.al. 2412.17415 null
2024-12-23 FFA Sora, video generation as fundus fluorescein angiography simulator Xinyuan Wu et.al. 2412.17346 null
2024-12-23 Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory Xingyao Li et.al. 2412.17254 null
2024-12-22 SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults Jinzhi Wang et.al. 2412.17077 null
2025-01-08 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-22 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos Zhengqian Wu et.al. 2412.17022 link
2024-12-22 Video Domain Incremental Learning for Human Action Recognition in Home Environments Yuanda Hu et.al. 2412.16946 null
2024-12-21 GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space Souhaib Attaiki et.al. 2412.16717 null
2024-12-21 TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models Haocheng Huang et.al. 2412.16700 null
2024-12-21 VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation Chi Zhang et.al. 2412.16677 null
2024-12-25 Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance Beiyuan Zhang et.al. 2412.16495 null
2024-12-18 ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping Youxin Pang et.al. 2412.16212 null
2024-12-17 Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation Yiping Wang et.al. 2412.16211 null
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-20 DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization Zihan Ding et.al. 2412.15689 null
2024-12-23 CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training Xiuli Bi et.al. 2412.15646 link
2024-12-20 PolySmart @ TRECVid 2024 Medical Video Question Answering Jiaxin Wu et.al. 2412.15514 null
2024-12-19 AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation Moayed Haji-Ali et.al. 2412.15191 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Parallelized Autoregressive Visual Generation Yuqing Wang et.al. 2412.15119 null
2024-12-19 Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations Yucheng Hu et.al. 2412.14803 null
2024-12-19 HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning Minkuk Kim et.al. 2412.14585 null
2024-12-19 Consistent Human Image and Video Generation with Spatially Conditioned Diffusion Mingdeng Cao et.al. 2412.14531 link
2024-12-19 DirectorLLM for Human-Centric Video Generation Kunpeng Song et.al. 2412.14484 null
2024-12-18 Learning from Massive Human Videos for Universal Humanoid Pose Control Jiageng Mao et.al. 2412.14172 null
2024-12-18 Autoregressive Video Generation without Vector Quantization Haoge Deng et.al. 2412.14169 link
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167 null
2024-12-29 AKiRa: Augmentation Kit on Rays for optical video generation Xi Wang et.al. 2412.14158 null
2024-12-18 SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation Tong Chen et.al. 2412.14018 null
2024-12-18 InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models Cong Wei et.al. 2412.14006 link
2024-12-18 Do Language Models Understand Time? Xi Ding et.al. 2412.13845 link
2024-12-19 G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o Tony Cheng Tong et.al. 2412.13647 link
2024-12-18 Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning Yunbin Tu et.al. 2412.13543 null
2024-12-18 Real-time One-Step Diffusion-based Expressive Portrait Videos Generation Hanzhong Guo et.al. 2412.13479 link
2024-12-18 SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation Kazuki Shimada et.al. 2412.13462 null
2024-12-17 CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices Andrei Znobishchev et.al. 2412.13273 null
2025-01-07 MotionBridge: Dynamic Video Inbetweening with Flexible Controls Maham Tanveer et.al. 2412.13190 null
2024-12-17 VidTok: A Versatile and Open-Source Video Tokenizer Anni Tang et.al. 2412.13061 link
2024-12-17 FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering Zheng Cheng et.al. 2412.12833 null
2024-12-17 Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Shiping Ge et.al. 2412.12791 link
2024-12-17 ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries Wangyu Xue et.al. 2412.12675 null
2024-12-16 Can video generation replace cinematographers? Research on the cinematic language of generated video Xiaozhe Li et.al. 2412.12223 null
2024-12-16 CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding Guo Chen et.al. 2412.12075 null
2024-12-16 InterDyn: Controllable Interactive Dynamics with Video Diffusion Models Rick Akkerman et.al. 2412.11785 null
2024-12-16 Generative Inbetweening through Frame-wise Conditions-Driven Video Generation Tianyi Zhu et.al. 2412.11755 link
2024-12-16 VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Muhammet Furkan Ilaslan et.al. 2412.11621 link
2024-12-16 Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning Zhuyang Xie et.al. 2412.11467 null
2024-12-15 Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition Yulin Wang et.al. 2412.11228 link
2024-12-15 GenLit: Reformulating Single-Image Relighting as Video Generation Shrisha Bharadwaj et.al. 2412.11224 null
2024-12-15 DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes Jinxiu Liu et.al. 2412.11100 null
2024-12-15 Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track Deepak Gupta et.al. 2412.11056 null
2024-12-20 Video Diffusion Transformers are In-Context Learners Zhengcong Fei et.al. 2412.10783 link
2024-12-14 Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives Ji-jun Park et.al. 2412.10720 null
2024-12-13 SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device Yushu Wu et.al. 2412.10494 null
2024-12-12 VCA: Video Curious Agent for Long Video Understanding Zeyuan Yang et.al. 2412.10471 null
2024-12-17 SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization Zhentao Tan et.al. 2412.10443 null
2024-12-11 COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework Xin Dong et.al. 2412.10435 null
2024-12-13 Apollo: An Exploration of Video Understanding in Large Multimodal Models Orr Zohar et.al. 2412.10360 null
2024-12-16 TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation Xingrui Wang et.al. 2412.10275 null
2024-12-19 AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era Yudong Jiang et.al. 2412.10255 link
2024-12-13 B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens Zhuqiang Lu et.al. 2412.09919 link
2024-12-16 IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs Sosuke Yamao et.al. 2412.09907 null
2024-12-13 LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Hongjie Wang et.al. 2412.09856 null
2024-12-13 MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion Xunnong Xu et.al. 2412.09828 null
2024-12-17 ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Ali Athar et.al. 2412.09754 null
2024-12-11 Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model Junqi You et.al. 2412.09647 null
2024-12-16 Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Fan Zhang et.al. 2412.09645 link
2024-12-12 Doe-1: Closed-Loop Autonomous Driving with Large World Model Wenzhao Zheng et.al. 2412.09627 link
2024-12-12 OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation Weiqi Li et.al. 2412.09623 null
2024-12-12 PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models Chenyu Yang et.al. 2412.09613 null
2024-12-12 Owl-1: Omni World Model for Consistent Long Video Generation Yuanhui Huang et.al. 2412.09600 link
2024-12-12 LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors Yabo Chen et.al. 2412.09597 null
2024-12-12 Neptune: The Long Orbit to Benchmarking Long Video Understanding Arsha Nagrani et.al. 2412.09582 link
2024-12-12 Video Creation by Demonstration Yihong Sun et.al. 2412.09551 null
2024-12-12 Agent-based Video Trimming Lingfeng Yang et.al. 2412.09513 null
2024-12-12 UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer Delong Liu et.al. 2412.09389 link
2024-12-12 T-SVG: Text-Driven Stereoscopic Video Generation Qiao Jin et.al. 2412.09323 null
2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null
2024-12-12 Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering Sai Bhargav Rongali et.al. 2412.09230 null
2024-12-12 LVMark: Robust Watermark for latent video diffusion models MinHyuk Jang et.al. 2412.09122 null
2024-12-12 Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation Lianrui Mu et.al. 2412.08976 null
2024-12-12 Mojito: Motion Trajectory and Intensity Control for Video Generation Xuehai He et.al. 2412.08948 null
2024-12-11 Generative Semantic Communication: Architectures, Technologies, and Applications Jinke Ren et.al. 2412.08642 null
2024-12-13 Physical Informed Driving World Model Zhuoran Yang et.al. 2412.08410 null
2024-12-11 FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks Chongkai Gao et.al. 2412.08261 null
2024-12-11 VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation Zhiqiang Yuan et.al. 2412.08259 null
2024-12-10 3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark Wufei Ma et.al. 2412.07825 null
2024-12-11 UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Xi Chen et.al. 2412.07774 null
2024-12-10 From Slow Bidirectional to Fast Causal Video Generators Tianwei Yin et.al. 2412.07772 null
2024-12-10 SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Jianhong Bai et.al. 2412.07760 link
2024-12-10 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Xiao Fu et.al. 2412.07759 null
2024-12-10 Multi-Shot Character Consistency for Text-to-Video Generation Yuval Atzmon et.al. 2412.07750 null
2024-12-10 StyleMaster: Stylize Your Video with Artistic Generation and Translation Zixuan Ye et.al. 2412.07744 null
2024-12-10 STIV: Scalable Text and Image Conditioned Video Generation Zongyu Lin et.al. 2412.07730 null
2024-12-10 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu et.al. 2412.07720 link
2024-12-10 GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning Yicheng Wang et.al. 2412.07704 null
2024-12-10 Multimodal Contextualized Support for Enhancing Video Retrieval System Quoc-Bao Nguyen-Le et.al. 2412.07584 null
2024-12-19 Multi-Scale Contrastive Learning for Video Temporal Grounding Thong Thanh Nguyen et.al. 2412.07157 null
2024-12-09 SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations Zhaorun Chen et.al. 2412.06878 null
2024-12-09 VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features Sifei Li et.al. 2412.06296 null
2024-12-11 Towards Long Video Understanding via Fine-detailed Video Story Generation Zeng You et.al. 2412.06182 null
2024-12-08 Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training Zhenghong Zhou et.al. 2412.06029 null
2024-12-08 FlexDiT: Dynamic Token Density Control for Diffusion Transformer Shuning Chang et.al. 2412.06028 null
2024-12-10 Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation Hyeonho Jeong et.al. 2412.06016 null
2024-12-08 Accelerating Video Diffusion Models via Distribution Matching Yuanzhi Zhu et.al. 2412.05899 null
2024-12-08 MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation Shuwei Shi et.al. 2412.05848 null
2024-12-08 Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval Shanti Stewart et.al. 2412.05831 null
2024-12-08 Self-Guidance: Boosting Flow and Diffusion Generation on Their Own Tiancheng Li et.al. 2412.05827 null
2024-12-07 Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation Leonardo Pina et.al. 2412.05694 null
2024-12-11 Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model Lening Wang et.al. 2412.05280 link
2024-12-17 Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Zhe Chen et.al. 2412.05271 link
2024-12-06 Mind the Time: Temporally-Controlled Multi-Event Video Generation Ziyi Wu et.al. 2412.05263 null
2024-12-11 LinVT: Empower Your Image-level Large Language Model to Understand Videos Lishuai Gao et.al. 2412.05185 link
2024-12-06 Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection Khurram Azeem Hashmi et.al. 2412.04915 null
2024-12-06 UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving Rui Chen et.al. 2412.04842 link
2024-12-12 Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model Keunwoo Peter Yu et.al. 2412.04729 null
2024-12-05 Using Diffusion Priors for Video Amodal Segmentation Kaihua Chen et.al. 2412.04623 null