Updated on 2025.02.04

LLM Reasoning

Publish Date Title Authors PDF Code
2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning Han Zhong et.al. 2501.18858 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-27 Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations Pablo Valenzuela-Toledo et.al. 2501.16495 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-26 TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Yuxuan Gu et.al. 2501.15674 null
2025-01-28 Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning Zeyu Gan et.al. 2501.15602 link
2025-01-26 Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework Yuhong Sun et.al. 2501.15581 null
2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null
2025-02-02 GraphSOS: Graph Sampling and Order Selection to Help LLMs Understand Graphs Better Xu Chu et.al. 2501.14427 null
2025-01-23 Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Chang Gong et.al. 2501.13731 null
2025-01-22 EvidenceMap: Unleashing the Power of Small Language Models with Evidence Analysis for Biomedical Question Answering Chang Zong et.al. 2501.12746 null
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-22 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-17 Evolving Deeper LLM Thinking Kuang-Huei Lee et.al. 2501.09891 null
2025-01-23 Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Fengli Xu et.al. 2501.09686 null
2025-01-14 Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data Jiaxing Qiu et.al. 2501.08413 link
2025-01-14 Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning Haoyu Han et.al. 2501.07845 null
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 null
2025-01-08 Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting Dong-Hai Zhu et.al. 2501.04341 link
2025-01-07 Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation Alireza Salemi et.al. 2501.04167 null
2025-01-06 KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models Zaiyi Zheng et.al. 2501.02711 null
2025-01-04 Table as Thought: Exploring Structured Thoughts in LLM Reasoning Zhenjie Sun et.al. 2501.02152 null
2025-01-03 Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models Kaleem Ullah Qasim et.al. 2501.02026 null
2025-01-02 Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search Shuangtao Li et.al. 2501.01478 null
2025-01-02 HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation Runsong Jia et.al. 2501.01203 null
2025-01-03 Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents Chengbo He et.al. 2501.00430 null
2024-12-31 EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta Raymond Bernard et.al. 2501.00257 null
2024-12-30 Efficiently Serving LLM Reasoning Programs with Certaindex Yichao Fu et.al. 2412.20993 null
2024-12-28 LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning Shuguang Chen et.al. 2412.20227 null
2024-12-31 Token-Budget-Aware LLM Reasoning Tingxu Han et.al. 2412.18547 link
2024-12-23 StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs Hailin Chen et.al. 2412.18011 null
2024-12-22 Evaluating LLM Reasoning in the Operations Research Domain with ORQA Mahdi Mostajabdaveh et.al. 2412.17874 link
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-19 Eliciting Causal Abilities in Large Language Models for Reasoning Tasks Yajing Wang et.al. 2412.15314 link
2024-12-19 Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying Federico Castagna et.al. 2412.15177 link
2024-12-19 FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis Abdullah Khan et.al. 2412.14492 link
2024-12-18 Cognition Chain for Explainable Psychological Stress Detection on Social Media Xin Wang et.al. 2412.14009 null
2024-12-18 Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games Wenye Lin et.al. 2412.13602 null
2024-12-17 ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models Yuxi Sun et.al. 2412.12848 null
2024-12-12 A NotSo Simple Way to Beat Simple Bench Soham Sane et.al. 2412.12173 null
2024-12-11 What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis Jiayu Liu et.al. 2412.12157 null
2024-12-24 Stepwise Reasoning Error Disruption Attack of LLMs Jingyu Peng et.al. 2412.11934 null
2024-12-15 SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation Hang Zhang et.al. 2412.11026 null
2024-12-15 Entropy-Regularized Process Reward Model Hanning Zhang et.al. 2412.11006 link
2024-12-14 Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation Sukai Huang et.al. 2412.10675 null
2024-12-14 Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data Xue Wu et.al. 2412.10654 null
2024-12-13 Atomic Learning Objectives Labeling: A High-Resolution Approach for Physics Education Naiming Liu et.al. 2412.09914 null
2024-12-12 Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning Zhenni Bi et.al. 2412.09078 null
2024-12-11 Training Large Language Models to Reason in a Continuous Latent Space Shibo Hao et.al. 2412.06769 null
2025-01-23 GameArena: Evaluating LLM Reasoning through Live Computer Games Lanxiang Hu et.al. 2412.06394 null
2024-12-08 Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt Damien de Mijolla et.al. 2412.05967 null
2024-12-05 SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactions Bufang Yang et.al. 2412.04036 null
2024-12-03 Explainable CTR Prediction via LLM Reasoning Xiaohan Yu et.al. 2412.02588 null
2024-12-02 NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers Angel Yahir Loredo Lopez et.al. 2412.01621 null
2025-01-13 Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability Zicheng Lin et.al. 2411.19943 null
2024-11-29 TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension Zipeng Qiu et.al. 2411.19504 link
2024-11-29 COLD: Causal reasOning in cLosed Daily activities Abhinav Joshi et.al. 2411.19500 link
2024-11-25 Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision Zhiheng Xi et.al. 2411.16579 null
2024-11-22 On the Impact of Fine-Tuning on Chain-of-Thought Reasoning Elita Lobo et.al. 2411.15382 null
2024-11-21 Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Yuhao Dong et.al. 2411.14432 link
2024-11-15 Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination Haojie Zheng et.al. 2411.12591 link
2024-12-23 Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus Terufumi Morishita et.al. 2411.12498 link
2024-11-18 Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation Mingchao Qi et.al. 2411.11714 link
2024-12-31 Enhancing LLM Reasoning with Reward-guided Tree Search Jinhao Jiang et.al. 2411.11694 null
2024-12-15 A dataset of questions on decision-theoretic reasoning in Newcomb-like problems Caspar Oesterheld et.al. 2411.10588 link
2024-11-14 Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering Nghia Trung Ngo et.al. 2411.09213 null
2024-11-13 Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding Deyi Ji et.al. 2411.08516 null
2024-11-18 What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? Katie Kang et.al. 2411.07681 link
2024-11-27 Self-Training Meets Consistency: Improving LLMs’ Reasoning With Consistency-Driven Rationale Evaluation Jaehyeok Lee et.al. 2411.06387 link
2024-11-09 A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization Haoxin Liu et.al. 2411.06018 null
2024-11-11 LLMs as Method Actors: A Model for Prompt Engineering and Architecture Colin Doyle et.al. 2411.05778 link
2024-11-12 Kwai-STaR: Transform LLMs into State-Transition Reasoners Xingyu Lu et.al. 2411.04799 null
2024-11-21 Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Haolin Chen et.al. 2411.04282 link
2024-11-05 CrowdGenUI: Enhancing LLM-Based UI Widget Generation with a Crowdsourced Preference Library Yimeng Liu et.al. 2411.03477 null
2025-01-27 MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs Manar Abdelatty et.al. 2411.03471 link
2024-11-04 RuAG: Learned-rule-augmented Generation for Large Language Models Yudi Zhang et.al. 2411.03349 null
2024-10-30 Vision-Language Models Can Self-Improve Reasoning via Reflection Kanzhi Cheng et.al. 2411.00855 null
2024-11-01 Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling Yiwen Ding et.al. 2411.00750 link
2024-11-01 STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing Jiaru Zou et.al. 2411.00387 null
2024-11-08 GRS-QA – Graph Reasoning-Structured Question Answering Dataset Anish Pahilajani et.al. 2411.00369 null
2024-10-31 Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning Jinghan Zhang et.al. 2410.24155 null
2024-10-31 RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner Fu-Chieh Chang et.al. 2410.23912 null
2024-10-31 OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models Junda Wu et.al. 2410.23703 null
2024-10-30 ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning Millennium Bismay et.al. 2410.23180 link
2024-10-30 On Memorization of Large Language Models in Logical Reasoning Chulin Xie et.al. 2410.23123 null
2024-10-28 Causal Interventions on Causal Paths: Mapping GPT-2’s Reasoning From Syntax to Semantics Isabelle Lee et.al. 2410.21353 null
2024-10-28 Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments Sangmim Song et.al. 2410.20666 null
2024-10-25 Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models Danqing Wang et.al. 2410.20007 null
2024-10-25 Can Stories Help LLMs Reason? Curating Information Space Through Narrative Vahid Sadiri Javadi et.al. 2410.19221 null
2024-10-18 Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning Pengfei He et.al. 2410.19000 link
2024-10-25 CLR-Bench: Evaluating Large Language Models in College-level Reasoning Junnan Dong et.al. 2410.17558 null
2024-10-28 Non-myopic Generation of Language Models for Reasoning and Planning Chang Ma et.al. 2410.17195 link
2024-11-06 Improving Causal Reasoning in Large Language Models: A Survey Longxuan Yu et.al. 2410.16676 link
2024-10-22 A Statistical Analysis of LLMs’ Self-Evaluation Using Proverbs Ryosuke Sonoda et.al. 2410.16640 null
2024-10-21 Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models’ Reasoning with Formal Logic Jason Chan et.al. 2410.16502 null
2024-11-27 On Designing Effective RL Reward at Training Time for LLM Reasoning Jiaxuan Gao et.al. 2410.15115 null
2025-01-28 Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning Xingyu Tan et.al. 2410.14211 null
2024-10-21 Unconstrained Model Merging for Enhanced LLM Reasoning Yiming Zhang et.al. 2410.13699 null
2024-10-16 Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models Linhao Luo et.al. 2410.13080 link
2024-10-16 KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs Yongqin Xu et.al. 2410.12480 null
2024-10-17 Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning Qian Wang et.al. 2410.12464 null
2024-10-16 Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up Jiahao Yuan et.al. 2410.12323 link
2024-10-16 Exploiting LLMs’ Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval Hai-Long Nguyen et.al. 2410.12154 null
2024-10-15 Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming Yilun Hao et.al. 2410.12112 null
2024-10-12 OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models Jun Wang et.al. 2410.09671 null
2024-10-11 P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains Simeng Han et.al. 2410.09207 null
2024-10-11 Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning Yunpeng Gao et.al. 2410.08500 null
2024-10-10 SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation Hang Yin et.al. 2410.08189 null
2024-10-10 Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning Amrith Setlur et.al. 2410.08146 null
2024-10-10 Automatic Curriculum Expert Iteration for Reliable LLM Reasoning Zirui Zhao et.al. 2410.07627 null
2024-10-09 Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis Ahmed Abdullah et.al. 2410.06841 null
2024-10-09 Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning Xiyao Wang et.al. 2410.06508 null
2025-01-02 Filtering Discomforting Recommendations with Large Language Models Jiahao Liu et.al. 2410.05411 null
2024-10-05 Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification Zhenwen Liang et.al. 2410.05318 null
2024-10-06 Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval Pengcheng Jiang et.al. 2410.04585 link
2024-10-03 The Role of Deductive and Inductive Reasoning in Large Language Models Chengkun Cai et.al. 2410.02892 null
2024-10-02 Not All LLM Reasoners Are Created Equal Arian Hosseini et.al. 2410.01748 null
2024-12-25 Interpretable Contrastive Monte Carlo Tree Search Reasoning Zitian Gao et.al. 2410.01707 link
2024-10-02 VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment Amirhossein Kazemnejad et.al. 2410.01679 link
2024-10-02 AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses Xiaotian Lu et.al. 2410.01246 null
2024-10-01 Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness Xiao Peng et.al. 2410.00359 null
2024-10-01 Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis Chun-Hsiao Yeh et.al. 2410.00292 null
2024-10-08 GUNDAM: Aligning Large Language Models with Graph Understanding Sheng Ouyang et.al. 2409.20053 null
2024-09-27 Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs Yanyuan Qiao et.al. 2409.18794 null
2024-10-23 Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning Debargha Ganguly et.al. 2409.17270 null
2024-09-20 CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency Kangsheng Wang et.al. 2409.17174 null
2024-09-20 Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM Zheng Wei Lim et.al. 2409.13949 null
2024-09-19 SituationAdapt: Contextual UI Optimization in Mixed Reality with Situation Awareness via LLM Reasoning Zhipeng Li et.al. 2409.12836 null
2024-10-04 Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning Jiaxin Wen et.al. 2409.12452 link
2024-12-16 Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data Jiaming Zhou et.al. 2409.12437 link
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 link
2024-11-05 Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent Fatemeh Haji et.al. 2409.11527 link
2024-09-16 Enhancing RL Safety with Counterfactual LLM Reasoning Dennis Gross et.al. 2409.10188 link
2024-09-11 Think Together and Work Better: Combining Humans’ and LLMs’ Think-Aloud Outcomes for Effective Text Evaluation SeongYeub Chu et.al. 2409.07355 link

LLM Evaluation

Publish Date Title Authors PDF Code
2025-01-30 Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation Muhammed Yusuf Kocyigit et.al. 2501.18771 null
2025-01-31 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation Minghua He et.al. 2501.18460 null
2025-02-01 LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering Beiming Liu et.al. 2501.17183 null
2025-01-28 An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue Koji Inoue et.al. 2501.16643 null
2025-01-26 HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI Tidor-Vlad Pricope et.al. 2501.15627 null
2025-01-23 Question Answering on Patient Medical Records with Private Fine-Tuned LLMs Sara Kothari et.al. 2501.13687 null
2025-01-10 CodEv: An Automated Grading Framework Leveraging Large Language Models for Consistent and Constructive Feedback En-Qi Tseng et.al. 2501.10421 null
2025-01-15 Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History Yevhen Kostiuk et.al. 2501.09154 null
2025-01-13 Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles Samia Touileb et.al. 2501.07718 null
2025-01-03 FLAME: Financial Large-Language Model Assessment and Metrics Evaluation Jiayu Guo et.al. 2501.06211 link
2025-01-07 MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems Yannis Katsis et.al. 2501.03468 link
2025-01-05 Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm Ljubisa Bojic et.al. 2501.02532 null
2025-01-04 LLMzSzŁ: a comprehensive LLM benchmark for Polish Krzysztof Jassem et.al. 2501.02266 null
2025-01-08 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2025-01-04 Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation M. Ali Bayram et.al. 2501.00593 null
2024-12-31 Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs Weijia Xu et.al. 2501.00273 null
2024-12-30 EVOLVE: Emotion and Visual Output Learning via LLM Evaluation Jordan Sinclair et.al. 2412.20632 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli et.al. 2412.18407 link
2025-01-25 DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation Junyi Lu et.al. 2412.18291 null
2024-12-23 CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models Ruibo Tu et.al. 2412.17970 link
2025-01-02 Baichuan4-Finance Technical Report Hanyu Zhang et.al. 2412.15270 null
2024-12-19 ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects Qihang Cao et.al. 2412.14837 null
2024-12-18 AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge Xiaobao Wu et.al. 2412.13670 link
2024-12-18 Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning Eitan Wagner et.al. 2412.13631 null
2024-12-17 OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Shuting Wang et.al. 2412.13018 link
2024-12-10 How to Choose a Threshold for an Evaluation Metric for Large Language Models Bhaskarjit Sarmah et.al. 2412.12148 null
2024-12-15 Dual Traits in Probabilistic Reasoning of Large Language Models Shenxiong Li et.al. 2412.11009 link
2024-12-30 LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation Eunsu Kim et.al. 2412.10424 null
2024-12-13 Cultural Evolution of Cooperation among LLM Agents Aron Vallinder et.al. 2412.10270 null
2024-12-12 Towards Understanding the Robustness of LLM-based Evaluations under Perturbations Manav Chaudhary et.al. 2412.09269 null
2024-12-10 BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities Sahal Shaji Mullappilly et.al. 2412.07769 link
2024-12-12 PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models Qian Zhang et.al. 2412.06287 link
2024-12-02 AI Benchmarks and Datasets for LLM Evaluation Todor Ivanov et.al. 2412.01020 null
2024-11-30 Evaluating the Consistency of LLM Evaluators Noah Lee et.al. 2412.00543 null
2024-11-29 MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks John Francis et.al. 2411.19689 null
2024-11-29 Beyond Surface Structure: A Causal Assessment of LLMs’ Comprehension Ability Yujin Han et.al. 2411.19456 link
2024-11-27 Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator Frederic Kirstein et.al. 2411.18444 null
2025-01-17 CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity Zhengmin Yu et.al. 2411.16239 link
2024-11-25 SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text Reshmi Ghosh et.al. 2411.16077 null
2024-11-26 Do LLMs Agree on the Creativity Evaluation of Alternative Uses? Abdullah Al Rabeyah et.al. 2411.15560 null
2024-11-19 Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat Roland Daynauth et.al. 2411.14483 link
2024-11-21 Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models Lovish Madaan et.al. 2411.14103 null
2024-11-21 An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture Boming Xia et.al. 2411.13768 null
2024-11-21 A Framework for Evaluating LLMs Under Task Indeterminacy Luke Guerdan et.al. 2411.13760 null
2024-11-12 Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning Linyang He et.al. 2411.07533 null
2024-11-13 Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models Yancheng He et.al. 2411.07140 null
2024-11-09 Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Xiaojun Wu et.al. 2411.06272 link
2024-11-16 ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding Israel Abebe Azime et.al. 2411.05049 null
2024-11-07 Bayesian Calibration of Win Rate Estimation with LLM Evaluators Yicheng Gao et.al. 2411.04424 link
2024-11-05 Enhancing LLM Evaluations: The Garbling Trick William F. Bradley et.al. 2411.01533 null
2025-01-31 Mastering the Craft of Data Synthesis for CodeLLMs Meng Chen et.al. 2411.00005 null
2024-10-28 Project MPG: towards a generalized performance benchmark for LLM capabilities Lucas Spangher et.al. 2410.22368 null
2024-10-29 Self-Preference Bias in LLM-as-a-Judge Koki Wataoka et.al. 2410.21819 null
2024-10-28 Unveiling Context-Aware Criteria in Self-Assessing LLMs Taneesh Gupta et.al. 2410.21545 null
2024-10-27 LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization Jui-Nan Yen et.al. 2410.20625 null
2024-10-26 Limitations of the LLM-as-a-Judge Approach for Evaluating LLM Outputs in Expert Knowledge Tasks Annalisa Szymanski et.al. 2410.20266 null
2024-10-23 MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning Jingfan Zhang et.al. 2410.18035 null
2025-01-30 Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements Isamu Isozaki et.al. 2410.17141 link
2024-10-21 CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Maosong Cao et.al. 2410.16256 link
2025-01-26 mHumanEval – A Multilingual Benchmark to Evaluate Large Language Models for Code Generation Nishat Raihan et.al. 2410.15037 link
2024-10-19 CAP: Data Contamination Detection via Consistency Amplification Yi Zhao et.al. 2410.15005 null
2024-10-18 Enabling Scalable Evaluation of Bias Patterns in Medical LLMs Hamed Fayyaz et.al. 2410.14763 link
2024-11-06 Diverging Preferences: When do Annotators Disagree and do Models Know? Michael JQ Zhang et.al. 2410.14632 null
2024-10-18 Combining Entropy and Matrix Nuclear Norm for Enhanced Evaluation of Language Models James Vo et.al. 2410.14480 null
2024-10-21 BenTo: Benchmark Task Reduction with In-Context Transferability Hongyu Zhao et.al. 2410.13804 link
2024-10-16 BenchmarkCards: Large Language Model and Risk Reporting Anna Sokol et.al. 2410.12974 null
2025-02-01 Language Model Preference Evaluation with Multiple Weak Evaluators Zhengyu Hu et.al. 2410.12869 link
2024-10-11 Enterprise Benchmarks for Large Language Model Evaluation Bing Zhang et.al. 2410.12857 link
2024-10-16 An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation Junjie Chen et.al. 2410.12265 null
2024-10-15 Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers Lorenzo Pacchiardi et.al. 2410.11672 link
2024-10-15 Black-box Uncertainty Quantification Method for LLM-as-a-Judge Nico Wagner et.al. 2410.11594 null
2024-10-14 Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting Yifan Luo et.al. 2410.10150 null
2024-12-13 HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics Jingxuan Fan et.al. 2410.09988 link
2024-10-15 LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models Han Qiu et.al. 2410.09962 link
2024-10-17 Towards Multilingual LLM Evaluation for European Languages Klaudia Thellmann et.al. 2410.08928 null
2024-10-11 Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example Marcus Kessel et.al. 2410.08911 null
2024-10-10 Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks Mathis Pink et.al. 2410.08133 null
2025-02-03 COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act Philipp Guldimann et.al. 2410.07959 link
2024-11-06 News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News Tarun Jain et.al. 2410.07520 null
2024-10-09 Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates Xiaosen Zheng et.al. 2410.07137 link
2024-10-09 ReIFE: Re-evaluating Instruction-Following Evaluation Yixin Liu et.al. 2410.07069 link
2024-10-08 Active Evaluation Acquisition for Efficient LLM Benchmarking Yang Li et.al. 2410.05952 null
2024-10-07 TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Qingchen Yu et.al. 2410.05262 link
2024-10-01 Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model Aidan Gilson et.al. 2410.03740 null
2024-10-04 TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation Jonathan Cook et.al. 2410.03608 null
2024-10-04 Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores Robert E. Blackwell et.al. 2410.03492 null
2024-10-29 AIME: AI System Optimization via Multiple LLM Evaluators Bhrij Patel et.al. 2410.03131 null
2024-10-02 Comparing Criteria Development Across Domain Experts, Lay Users, and Models in Large Language Model Evaluation Annalisa Szymanski et.al. 2410.02054 null
2024-10-02 Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models Joseph Lee et.al. 2410.01795 link
2024-10-03 Extending Context Window of Large Language Models from a Distributional Perspective Yingsheng Wu et.al. 2410.01490 null
2024-10-02 ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving Yifan Qiao et.al. 2410.01228 null
2024-10-01 ViDAS: Vision-based Danger Assessment and Scoring Pranav Gupta et.al. 2410.00477 null
2024-10-01 PclGPT: A Large Language Model for Patronizing and Condescending Language Detection Hongbo Wang et.al. 2410.00361 link
2024-11-26 LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models Haitao Li et.al. 2409.20288 link
2024-09-29 Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems Xuyang Wu et.al. 2409.19804 null
2024-10-19 Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models Xin Li et.al. 2409.19667 link
2024-10-05 IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation Fan Lin et.al. 2409.18892 link
2024-12-13 A Character-Centric Creative Story Generation via Imagination Kyeongman Park et.al. 2409.16667 null
2024-09-25 Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models Sungjune Park et.al. 2409.16635 null
2024-12-18 Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino Jann Railey Montalan et.al. 2409.15380 link
2024-12-16 MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators Qingyu Lu et.al. 2409.14335 link
2024-09-21 ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models Yuqing Huang et.al. 2409.13989 link
2024-12-17 AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs Basel Mousi et.al. 2409.11404 null
2024-10-02 LLM-as-a-Judge & Reward Model: What They Can and Cannot Do Guijin Son et.al. 2409.11239 null
2024-12-08 Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges Vinay Samuel et.al. 2409.09927 link
2024-09-13 Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia Fajri Koto et.al. 2409.08564 null
2024-09-09 Assessing SPARQL capabilities of Large Language Models Lars-Peter Meyer et.al. 2409.05925 link
2024-10-08 LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs Yuhao Wu et.al. 2409.02076 link
2024-10-14 Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation Jasper Dekoninck et.al. 2409.00696 null
2024-08-26 Evaluating ChatGPT on Nuclear Domain-Specific Data Muhammad Anwar et.al. 2409.00090 null
2024-08-28 LLMSecCode: Evaluating Large Language Models for Secure Coding Anton Rydén et.al. 2408.16100 link
2024-08-26 LLM-3D Print: Large Language Models To Monitor and Control 3D Printing Yayati Jadhav et.al. 2408.14307 null
2024-08-26 Epidemic Information Extraction for Event-Based Surveillance using Large Language Models Sergio Consoli et.al. 2408.14277 null
2024-10-04 MobileQuant: Mobile-friendly Quantization for On-device Language Models Fuwen Tan et.al. 2408.13933 link
2024-08-23 LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models Chongyan Sun et.al. 2408.13338 null
2024-08-23 Open Llama2 Model for the Lithuanian Language Artūras Nakvosas et.al. 2408.12963 null
2024-08-23 LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction Songwei Li et.al. 2408.12832 link
2024-12-20 Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts Jiaqing Liu et.al. 2408.09688 null
2024-08-20 Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge Ravi Raju et.al. 2408.08808 null
2024-10-16 The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation Samee Arif et.al. 2408.08688 link
2024-10-19 Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks Junseok Kim et.al. 2408.08631 null

LLM MLLM

Publish Date Title Authors PDF Code
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 link
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 null
2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 null
2025-01-31 Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models Wenzhi Fang et.al. 2501.19389 null
2025-02-03 SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions Dominik Wagner et.al. 2501.19377 null
2025-01-31 Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions Sören Christensen et.al. 2501.19373 null
2025-01-31 We’re Different, We’re the Same: Creative Homogeneity Across LLMs Emily Wenger et.al. 2501.19361 null
2025-01-31 Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies Brandon P. Chelstrom et.al. 2501.19359 null
2025-01-31 The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking Yuchun Miao et.al. 2501.19358 null
2025-01-31 Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters Adrián Juan-Delgado et.al. 2501.19356 null
2025-01-31 Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 Ting-Yao E. Hsu et.al. 2501.19353 null
2025-01-31 Towards Adaptive Self-Improvement for Smarter Energy Systems Alexander Sommer et.al. 2501.19340 null
2025-01-31 PixelWorld: Towards Perceiving Everything as Pixels Zhiheng Lyu et.al. 2501.19339 null
2025-01-31 Homogeneity Bias as Differential Sampling Uncertainty in Language Models Messi H. J. Lee et.al. 2501.19337 null
2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 LLM-based Affective Text Generation Quality Based on Different Quantization Values Yarik Menchaca Resendiz et.al. 2501.19317 null
2025-01-31 Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment Gregor Bachmann et.al. 2501.19309 null
2025-02-03 SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling Jiefeng Chen et.al. 2501.19306 null
2025-01-31 Beyond checkmate: exploring the creative chokepoints in AI text Nafis Irtiza Tripto et.al. 2501.19301 link
2025-01-31 Offline Learning for Combinatorial Multi-armed Bandits Xutong Liu et.al. 2501.19300 null
2025-01-31 Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes Zhiyao Xu et.al. 2501.19298 null
2025-01-31 Analysis of LLMs vs Human Experts in Requirements Engineering Cory Hymel et.al. 2501.19297 null
2025-01-31 Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators Kunpeng Zhang et.al. 2501.19282 null
2025-01-31 Pheromone-based Learning of Optimal Reasoning Paths Anirudh Chari et.al. 2501.19278 null
2025-01-31 From Assistance to Autonomy – A Researcher Study on the Potential of AI Support for Qualitative Data Analysis Elisabeth Kirsten et.al. 2501.19275 null
2025-01-31 Jackpot! Alignment as a Maximal Lottery Roberto-Rafael Maura-Rivero et.al. 2501.19266 null
2025-01-31 Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge Amogh Joshi et.al. 2501.19259 null
2025-01-31 A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation Yunzhe Li et.al. 2501.19232 null
2025-01-31 Autonomous Legacy Web Application Upgrades Using a Multi-Agent System Valtteri Ala-Salmi et.al. 2501.19204 null
2025-02-03 Improving the Robustness of Representation Misdirection for Large Language Model Unlearning Dang Huu-Tien et.al. 2501.19202 link
2025-01-31 Efficient Reasoning with Hidden Thinking Xuan Shen et.al. 2501.19201 link
2025-01-31 Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning Xianglin Yang et.al. 2501.19180 null
2025-01-31 No Foundations without Foundations – Why semi-mechanistic models are essential for regulatory biology Luka Kovačević et.al. 2501.19178 null
2025-01-31 Position: Contextual Integrity Washing for Language Models Yan Shvartzshnaider et.al. 2501.19173 null
2025-01-31 Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs Kejia Zhang et.al. 2501.19164 null
2025-01-31 A theoretical framework for overfitting in energy-based modeling Giovanni Catania et.al. 2501.19158 null
2025-01-31 A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator Sixiao Huang et.al. 2501.19135 null
2025-01-31 Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations Sihwan Park et.al. 2501.19099 null
2025-01-31 Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data Xichen Xu et.al. 2501.19094 null
2025-01-31 Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models Jialin Zhao et.al. 2501.19090 null
2025-01-31 Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification Xiangyu Sun et.al. 2501.19086 null
2025-01-31 Enhancing Code Generation for Low-Resource Languages: No Silver Bullet Alessandro Giagnorio et.al. 2501.19085 null
2025-01-31 Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations Dahye Kim et.al. 2501.19066 link
2025-01-31 TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs Yan Sun et.al. 2501.19057 null
2025-01-31 Enabling Autonomic Microservice Management through Self-Learning Agents Fenglin Yu et.al. 2501.19056 null
2025-01-31 Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models Ruiyu Wang et.al. 2501.19054 null
2025-01-31 Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors Simon Idoko et.al. 2501.19042 link
2025-01-31 Towards the Worst-case Robustness of Large Language Models Huanran Chen et.al. 2501.19040 null
2025-01-31 Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs Hongliang Li et.al. 2501.19036 null
2025-01-31 XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses Bo Lan et.al. 2501.19034 link
2025-01-31 Multilayer Networks in Neuroimaging Vesna Vuksanovic et.al. 2501.19024 null
2025-01-31 Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation Bin Zhu et.al. 2501.19017 null
2025-01-31 Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities Arjun Krishna et.al. 2501.19012 null
2025-01-31 Visual Autoregressive Modeling for Image Super-Resolution Yunpeng Qu et.al. 2501.18993 null
2025-01-31 Symmetric Pruning of Large Language Models Kai Yi et.al. 2501.18980 null
2025-01-31 BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics Yuxuan Liu et.al. 2501.18972 null
2025-01-31 Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping Pu Yang et.al. 2501.18962 null
2025-01-31 Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow Alfred Bexley et.al. 2501.18957 null
2025-01-31 LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models Shenghao Fu et.al. 2501.18954 link
2025-01-31 TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment Zi-Jian Cheng et.al. 2501.18935 link
2025-01-31 Language Games as the Pathway to Artificial Superhuman Intelligence Ying Wen et.al. 2501.18924 null
2025-01-31 KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search Haoran Luo et.al. 2501.18922 link
2025-01-31 LLM Program Optimization via Retrieval Augmented Search Sagnik Anupam et.al. 2501.18916 null
2025-01-31 Scaling Laws for Differentially Private Language Models Ryan McKenna et.al. 2501.18914 null
2025-01-31 Streamlining Security Vulnerability Triage with Large Language Models Mohammad Jalili Torkamani et.al. 2501.18908 null
2025-01-31 Trustworthy Evaluation of Generative AI Models Zijun Gao et.al. 2501.18897 null
2025-01-31 Can We Predict the Effect of Prompts? Jae Yong Lee et.al. 2501.18883 null
2025-01-31 Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models Jiaqi Tang et.al. 2501.18863 null
2025-01-31 BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning Han Zhong et.al. 2501.18858 null
2025-01-31 Equivariant Hypergraph Diffusion for Crystal Structure Prediction Yang Liu et.al. 2501.18850 null
2025-01-31 Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities Yaping Chai et.al. 2501.18845 null
2025-01-31 Trading Inference-Time Compute for Adversarial Robustness Wojciech Zaremba et.al. 2501.18841 null
2025-01-31 Partially Rewriting a Transformer in Natural Language Gonçalo Paulo et.al. 2501.18838 null
2025-01-31 Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Mrinank Sharma et.al. 2501.18837 null
2025-01-31 Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential Chenyu Gao et.al. 2501.18834 null
2025-01-31 Structural Embedding Projection for Contextual Large Language Model Inference Vincent Enoasmo et.al. 2501.18826 null
2025-01-31 Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies Andrey Borro et.al. 2501.18817 link
2025-01-31 Large Language Models as Common-Sense Heuristics Andrey Borro et.al. 2501.18816 null
2025-01-30 Compositional Generalization Requires More Than Disentangled Representations Qiyao Liang et.al. 2501.18797 null
2025-01-30 Rope to Nope and Back Again: A New Hybrid Attention Strategy Bowen Yang et.al. 2501.18795 null
2025-01-30 Survey and Improvement Strategies for Gene Prioritization with Large Language Models Matthew Neeley et.al. 2501.18794 null
2025-01-30 LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore? Alexander Tuisov et.al. 2501.18784 null
2025-01-30 Navigating the Fragrance space Via Graph Generative Models And Predicting Odors Mrityunjay Sharma et.al. 2501.18777 link
2025-01-30 Probabilistic Joint Recovery Method for CO $_2$ Plume Monitoring Zijun Deng et.al. 2501.18761 null
2025-01-30 Synthetic Data Generation for Augmenting Small Samples Dan Liu et.al. 2501.18741 null
2025-01-30 Examining the Robustness of Large Language Models across Language Complexity Jiayi Zhang et.al. 2501.18738 null
2025-01-30 Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks Mohd. Farhan Israk Soumik et.al. 2501.18727 null
2025-01-30 Strong and Controllable 3D Motion Generation Canxuan Gang et.al. 2501.18726 null
2025-01-30 Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning Maya Kruse et.al. 2501.18724 null
2025-02-03 Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps Devansh Bhardwaj et.al. 2501.18712 null
2025-01-30 Regularized second-order optimization of tensor-network Born machines Matan Ben-Dov et.al. 2501.18691 null
2025-01-30 Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting Yansong Qu et.al. 2501.18672 null
2025-01-30 Foundational Models for 3D Point Clouds: A Survey and Outlook Vishal Thengane et.al. 2501.18594 null
2025-01-30 Diffusion Autoencoders are Scalable Image Tokenizers Yinbo Chen et.al. 2501.18593 null
2025-02-03 Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Hao Dong et.al. 2501.18592 link
2025-01-30 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Yue Wang et.al. 2501.18585 null
2025-01-30 Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH Evgenii Evstafev et.al. 2501.18576 null
2025-01-30 BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos Lehao Lin et.al. 2501.18565 null
2025-01-30 SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Haoquan Fang et.al. 2501.18564 null
2025-01-30 Semantic Web and Creative AI – A Technical Report from ISWS 2023 Raia Abu Ahmad et.al. 2501.18542 null
2025-01-30 Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges Manveer Singh Tamber et.al. 2501.18536 link
2025-01-30 Differentially Private Steering for Large Language Model Alignment Anmol Goel et.al. 2501.18532 link
2025-01-30 Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models Guanqun Cao et.al. 2501.18516 null
2025-01-30 Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Arthur Douillard et.al. 2501.18512 null
2025-01-30 WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Benjamin Feuer et.al. 2501.18511 link
2025-01-30 CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction Peter J. Bentley et.al. 2501.18504 null
2025-01-30 Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline Shivani Kapania et.al. 2501.18493 null
2025-01-30 A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models Changshu Liu et.al. 2501.18482 null
2025-01-30 CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization Yanxia Deng et.al. 2501.18475 null
2025-01-30 Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations Chengxi Zeng et.al. 2501.18474 null
2025-01-30 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation Minghua He et.al. 2501.18460 null
2025-01-30 CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering Yumeng Wang et.al. 2501.18457 null
2025-01-30 GENIE: Generative Note Information Extraction model for structuring EHR data Huaiyuan Ying et.al. 2501.18435 null
2025-01-30 Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation Youngjoon Lee et.al. 2501.18416 null
2025-01-30 RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects Yiteng Tu et.al. 2501.18365 link
2025-01-30 A Video-grounded Dialogue Dataset and Metric for Event-driven Activities Wiradee Imrattanatrai et.al. 2501.18324 link
2025-01-30 Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach Tianpeng Pan et.al. 2501.18320 null
2025-01-30 Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models Jennifer D’Souza et.al. 2501.18287 null
2025-01-30 Jailbreaking LLMs’ Safeguard with Universal Magic Words for Text Embedding Models Haoyu Liang et.al. 2501.18280 null
2025-01-30 Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence Kevin Roitero et.al. 2501.18265 null
2025-01-30 How to Select Datapoints for Efficient Human Evaluation of NLG Models? Vilém Zouhar et.al. 2501.18251 link
2025-01-30 Statistical multi-metric evaluation and visualization of LLM system predictive performance Samuel Ackerman et.al. 2501.18243 null
2025-01-30 Contextually Structured Token Dependency Encoding for Large Language Models James Blades et.al. 2501.18205 null
2025-01-30 Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents ShuiDe Wen et.al. 2501.18190 null
2025-01-30 Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation Teddy Lazebnik et.al. 2501.18177 null
2025-01-30 Continually Evolved Multimodal Foundation Models for Cancer Prognosis Jie Peng et.al. 2501.18170 null
2025-01-30 RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing Jinyao Guo et.al. 2501.18160 null
2025-01-30 Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study Yuchen Lei et.al. 2501.18158 null
2025-01-30 Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models Wanlong Liu et.al. 2501.18154 null
2025-01-30 Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Qika Lin et.al. 2501.18119 null
2025-01-30 Scaling Inference-Efficient Language Models Song Bian et.al. 2501.18107 null
2025-01-30 Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation Yibo Wang et.al. 2501.18100 link
2025-01-30 AlphaAdam:Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates Da Chang et.al. 2501.18094 null
2025-01-30 Normative Evaluation of Large Language Models with Everyday Moral Dilemmas Pratik S. Sachdeva et.al. 2501.18081 null
2025-01-30 FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models Spencer Mateega et.al. 2501.18062 null
2025-01-29 RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems Duy A. Nguyen et.al. 2501.18056 null
2025-01-29 Current Pathology Foundation Models are unrobust to Medical Center Differences Edwin D. de Jong et.al. 2501.18055 null
2025-01-29 A Proximal Operator for Inducing 2:4-Sparsity Jonas M Kübler et.al. 2501.18015 null
2025-01-29 Large Language Models Think Too Fast To Explore Effectively Lan Pan et.al. 2501.18009 null
2025-01-29 Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces Neetha Jambigi et.al. 2501.18005 null
2025-01-29 InnerThoughts: Disentangling Representations and Predictions in Large Language Models Didier Chételat et.al. 2501.17994 null
2025-01-29 Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study Marwah Alaofi et.al. 2501.17981 link
2025-01-29 Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization Zishun Yu et.al. 2501.17974 null
2025-01-29 “I Would Never Trust Anything Western”: Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools Manas Mhasakar et.al. 2501.17942 null
2025-01-29 DReSS: Data-driven Regularized Structured Streamlining for Large Language Models Mingkuan Feng et.al. 2501.17905 null
2025-01-29 Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning? Pouya Pezeshkpour et.al. 2501.17840 link
2025-01-29 Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology Sobhan Hemati et.al. 2501.17822 null
2025-01-30 Leveraging Multimodal LLM for Inspirational User Interface Search Seokhyeon Park et.al. 2501.17799 link
2025-01-29 BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights Chan-Jan Hsu et.al. 2501.17790 null
2025-01-29 AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing Peter Pak et.al. 2501.17784 null
2025-01-29 2SSP: A Two-Stage Framework for Structured Pruning of LLMs Fabrizio Sandri et.al. 2501.17771 link
2025-01-29 Generative Unordered Flow for Set-Structured Data Generation Yangming Li et.al. 2501.17770 null
2025-01-29 Hybrid Graphs for Table-and-Text based Question Answering using LLMs Ankush Agarwal et.al. 2501.17767 null
2025-01-29 On the Partitioning of GPU Power among Multi-Instances Tirth Vamja et.al. 2501.17752 null
2025-01-29 Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation Aitor Arrieta et.al. 2501.17749 null
2025-01-29 A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches Ana R. Baião et.al. 2501.17729 null
2025-01-29 Using Code Generation to Solve Open Instances of Combinatorial Design Problems Christopher D. Rosin et.al. 2501.17725 link
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 link
2025-01-29 Source-Channel Separation Theorems for Distortion Perception Coding Chao Tian et.al. 2501.17706 null
2025-01-29 Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching Xuzhe Dang et.al. 2501.17665 null
2025-01-30 In-Context Meta LoRA Generation Yihua Shao et.al. 2501.17635 null
2025-01-29 Uncertainty Quantification and Decomposition for LLM-based Recommendation Wonbin Kweon et.al. 2501.17630 link
2025-01-29 The Imitation Game According To Turing Sharon Temtsin et.al. 2501.17629 null
2025-01-29 Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment Jonathan Teel et.al. 2501.17617 null
2025-01-29 Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis Kunrong Li et.al. 2501.17598 null
2025-01-30 Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models Behraj Khan et.al. 2501.17595 null
2025-01-29 GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback Mohamed Abdelaal et.al. 2501.17584 null
2025-01-29 CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs Amey Hengle et.al. 2501.17581 null
2025-01-29 Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding Marco Pasini et.al. 2501.17578 null
2025-01-29 Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models Wooyoung Kim et.al. 2501.17549 null
2025-01-29 Towards Training-Free Open-World Classification with 3D Generative Models Xinzhe Xia et.al. 2501.17547 null
2025-01-29 Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant Gaole He et.al. 2501.17546 link
2025-01-29 Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison Martin Nizon-Deladoeuille et.al. 2501.17539 null
2025-01-29 Neural Spelling: A Spell-Based BCI System for Language Neural Decoding Xiaowei Jiang et.al. 2501.17489 null
2025-01-29 DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance Seffi Cohen et.al. 2501.17479 link
2025-01-29 AugmenTest: Enhancing Tests with LLM-Driven Oracles Shaker Mahmud Khandaker et.al. 2501.17461 null
2025-01-29 Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction Kaiwei Luo et.al. 2501.17459 null
2025-01-29 Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Tiansheng Huang et.al. 2501.17433 link
2025-01-29 Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models Yuxuan Li et.al. 2501.17420 null
2025-01-29 MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs Ved Sirdeshmukh et.al. 2501.17399 link
2025-01-29 Learning Free Token Reduction for Multi-Modal LLM Zihui Zhao et.al. 2501.17391 null
2025-01-29 Context-Aware Semantic Recomposition Mechanism for Large Language Models Richard Katrix et.al. 2501.17386 null
2025-01-28 Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations Md Tauhidul Islam et.al. 2501.17347 null
2025-01-28 Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction Mingyu Derek Ma et.al. 2501.17326 null
2025-01-28 CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data Lee Carlin et.al. 2501.17324 null
2025-01-30 Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding Yun-Shiuan Chuang et.al. 2501.17310 null
2025-01-28 “Ownership, Not Just Happy Talk”: Co-Designing a Participatory Large Language Model for Journalism Emily Tseng et.al. 2501.17299 null
2025-01-28 Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization Zilu Tang et.al. 2501.17295 null
2025-01-28 Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology Peilong Wang et.al. 2501.17286 null
2025-01-30 From Natural Language to Extensive-Form Game Representations Shilong Deng et.al. 2501.17282 link
2025-01-28 Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models Abdalaziz Al-Maeeni et.al. 2501.17279 null
2025-01-28 Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics Jasper Timm et.al. 2501.17273 link
2025-01-28 Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care Fengpei Yuan et.al. 2501.17206 null
2025-01-28 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Tianzhe Chu et.al. 2501.17161 null
2025-01-28 FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data Deren Lei et.al. 2501.17144 link
2025-01-28 ASTRAL: Automated Safety Testing of Large Language Models Miriam Ugarte et.al. 2501.17132 null
2025-01-28 Optimizing Large Language Model Training Using FP4 Quantization Ruizhe Wang et.al. 2501.17116 null
2025-01-28 Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction Carl-Leander Henneking et.al. 2501.17112 null
2025-01-28 Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics Guillaume Le Mailloux et.al. 2501.17107 link
2025-01-28 Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving Evgenii Evstafev et.al. 2501.17084 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models Minghan Li et.al. 2501.17039 null
2025-01-28 Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Manojkumar Parmar et.al. 2501.17030 null
2025-01-28 Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs Alessandro Midolo et.al. 2501.17024 link
2025-01-28 Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement Kei Katsumata et.al. 2501.17022 link
2025-01-28 MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition Philippe Pasquier et.al. 2501.17011 null
2025-01-28 Large Language Models for Code Generation: The Practitioners Perspective Zeeshan Rasheed et.al. 2501.16998 link
2025-01-28 Artificial Intelligence Clones Annie Liang et.al. 2501.16996 null
2025-01-28 FedEFM: Federated Endovascular Foundation Model with Unseen Data Tuong Do et.al. 2501.16992 null
2025-01-28 Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver Shunya Minami et.al. 2501.16986 null
2025-01-28 Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Hongzhi Huang et.al. 2501.16975 null
2025-01-28 Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers Mohammad Raza et.al. 2501.16961 null
2025-01-28 Multiple Abstraction Level Retrieve Augment Generation Zheng Zheng et.al. 2501.16952 null
2025-01-29 TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Makoto Shing et.al. 2501.16937 null
2025-01-28 Detecting harassment and defamation in cyberbullying with emotion-adaptive training Peiling Yi et.al. 2501.16925 link
2025-01-28 RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains Shady Nasrat et.al. 2501.16899 link
2025-01-28 Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model Eoghan Cronin et.al. 2501.16893 null
2025-01-28 Irony Detection, Reasoning and Understanding in Zero-shot Learning Peiling Yi et.al. 2501.16884 null
2025-01-28 Comparing Human and LLM Generated Code: The Jury is Still Out! Sherlock A. Licorish et.al. 2501.16857 null
2025-01-28 Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis Tiao Tan et.al. 2501.16842 null
2025-01-28 Misspellings in Natural Language Processing: A survey Gianluca Sperduti et.al. 2501.16836 null
2025-01-28 DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model Josua Spisak et.al. 2501.16800 null
2025-01-28 Algorithm for Automatic Legislative Text Consolidation Matias Etcheverry et.al. 2501.16794 null
2025-01-28 Exponential Family Attention Kevin Christian Wibisono et.al. 2501.16790 link
2025-01-28 Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding Yun Li et.al. 2501.16786 null
2025-01-28 TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network Yumingzhi Pan et.al. 2501.16784 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-29 Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models Muhammad Atta ur Rahman et.al. 2501.16769 null
2025-01-28 DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Chenguo Lin et.al. 2501.16764 null
2025-01-28 HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns Xinyue Shen et.al. 2501.16750 link
2025-01-28 Through the Prism of Culture: Evaluating LLMs’ Understanding of Indian Subcultures and Traditions Garima Chhikara et.al. 2501.16748 null
2025-01-28 LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience Nimesh Jha et.al. 2501.16744 null
2025-01-28 Distilling Large Language Models for Network Active Queue Management Deol Satish et.al. 2501.16734 null
2025-01-28 xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking Sunbowen Lee et.al. 2501.16727 link
2025-01-28 One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning Chunpeng Zhou et.al. 2501.16720 null
2025-01-28 Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection Hengzhuang Li et.al. 2501.16718 link
2025-01-28 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Yueen Ma et.al. 2501.16698 null
2025-01-28 MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark Dongyi Yi et.al. 2501.16688 null
2025-01-28 Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting Li Yin et.al. 2501.16673 link
2025-01-28 VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records Philip Chung et.al. 2501.16672 link
2025-01-28 Contextual Reinforcement in Multimodal Token Compression for Large Language Models Naderdel Piero et.al. 2501.16658 null
2025-01-28 Large Language Model Critics for Execution-Free Evaluation of Code Changes Aashish Yadavally et.al. 2501.16655 link
2025-01-28 Molecular-driven Foundation Model for Oncologic Pathology Anurag Vaidya et.al. 2501.16652 null
2025-01-28 DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models Zeping Min et.al. 2501.16650 null
2025-01-28 An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue Koji Inoue et.al. 2501.16643 null
2025-01-28 CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs Jinlan Fu et.al. 2501.16629 link
2025-01-28 Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems Baraa Hikal et.al. 2501.16616 null
2025-01-28 Sparse Autoencoders Trained on the Same Data Learn Different Features Gonçalo Paulo et.al. 2501.16615 null
2025-01-28 Fine-Tuned Language Models as Space Systems Controllers Enrico M. Zucchelli et.al. 2501.16588 null
2025-01-27 AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models Zheng Lian et.al. 2501.16566 null
2025-01-27 LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation Farzad Farhadzadeh et.al. 2501.16559 null
2025-01-27 Distributional Information Embedding: A Framework for Multi-bit Watermarking Haiyun He et.al. 2501.16558 null
2025-01-27 PackDiT: Joint Human Motion and Text Generation via Mutual Prompting Zhongyu Jiang et.al. 2501.16551 null
2025-01-27 PhysAnimator: Physics-Guided Generative Cartoon Animation Tianyi Xie et.al. 2501.16550 null
2025-01-27 Sample-Efficient Behavior Cloning Using General Domain Knowledge Feiyu Zhu et.al. 2501.16546 null
2025-01-27 Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees Piyush Gupta et.al. 2501.16539 null
2025-01-27 Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs Jean-Charles Noirot Ferrand et.al. 2501.16534 null
2025-01-27 A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain Jorge del Pozo Lérida et.al. 2501.16533 null
2025-01-27 Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction Atharva Naik et.al. 2501.16524 null
2025-01-27 How well can LLMs Grade Essays in Arabic? Rayed Ghazawi et.al. 2501.16516 null
2025-01-27 Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models Sudarshan Kamath Barkur et.al. 2501.16513 null
2025-01-27 Smoothed Embeddings for Robust Language Models Ryo Hase et.al. 2501.16497 null
2025-01-27 Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations Pablo Valenzuela-Toledo et.al. 2501.16495 null
2025-01-27 Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM Payal Kamboj et.al. 2501.16481 link
2025-01-27 Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Philip Hughes et.al. 2501.16467 null
2025-01-27 CoCoNUT: Structural Code Understanding does not fall out of a tree Claas Beger et.al. 2501.16456 link
2025-01-27 Detecting Zero-Day Attacks in Digital Substations via In-Context Learning Faizan Manzoor et.al. 2501.16453 null
2025-01-27 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation Hamed Firooz et.al. 2501.16450 null
2025-01-27 DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Han Sun et.al. 2501.16410 null
2025-01-27 Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology Meiyun Cao et.al. 2501.16309 null
2025-01-27 RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval Long Nguyen et.al. 2501.16303 null
2025-01-27 Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width Zheng Liu et.al. 2501.16302 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-27 FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers Renshan Zhang et.al. 2501.16297 null
2025-01-27 Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models Jing Zhang et.al. 2501.16282 null
2025-01-27 Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation Jiayi Hong et.al. 2501.16277 link
2025-01-27 URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT Long Nguyen et.al. 2501.16276 null
2025-01-27 A foundation model for human-AI collaboration in medical literature mining Zifeng Wang et.al. 2501.16255 null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 null
2025-01-27 Zero-Shot Decision Tree Construction via Large Language Models Lucas Carrasco et.al. 2501.16247 null
2025-01-27 CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation Xiaochuan Ma et.al. 2501.16246 null
2025-01-27 Phase Transitions in Large Language Models and the $O(N)$ Model Youran Sun et.al. 2501.16241 null
2025-01-27 AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses Runze Cai et.al. 2501.16240 null
2025-01-28 Distilling foundation models for robust and efficient models in digital pathology Alexandre Filiot et.al. 2501.16239 null
2025-01-27 Language-Based Bayesian Optimization Research Assistant (BORA) Abdoulatif Cissé et.al. 2501.16224 null
2025-01-27 Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models Huayu Li et.al. 2501.16215 link
2025-01-27 Provence: efficient and robust context pruning for retrieval-augmented generation Nadezhda Chirkova et.al. 2501.16214 null
2025-01-27 Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs Antony Bartlett et.al. 2501.16191 null
2025-01-27 SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting Wenxuan Xie et.al. 2501.16178 link
2025-01-27 BAG: Body-Aligned 3D Wearable Asset Generation Zhongjin Luo et.al. 2501.16177 null
2025-01-27 Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma Richard Willis et.al. 2501.16173 link
2025-01-27 MetaDecorator: Generating Immersive Virtual Tours through Multimodality Shuang Xie et.al. 2501.16164 null
2025-01-27 CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge Yuwei Zhang et.al. 2501.16155 null
2025-01-27 AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought Xin Huang et.al. 2501.16154 null
2025-01-27 AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants Pascal J. Sager et.al. 2501.16150 null
2025-01-27 PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing Yuwei Zhang et.al. 2501.16149 null
2025-01-27 SampleLLM: Optimizing Tabular Data Synthesis in Recommendations Jingtong Gao et.al. 2501.16125 null
2025-01-27 Using Generative Models to Produce Realistic Populations of UK Windstorms Yee Chun Tsoi et.al. 2501.16110 null
2025-01-27 Integration of LLM Quality Assurance into an NLG System Ching-Yi Chen et.al. 2501.16078 null
2025-01-27 PISCO: Pretty Simple Compression for Retrieval-Augmented Generation Maxime Louis et.al. 2501.16075 null
2025-01-27 A generative material transformer using Wyckoff representation Pierre-Paul De Breuck et.al. 2501.16051 null
2025-01-27 Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation Xing Zhang et.al. 2501.16050 null
2025-01-27 PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment Vincent Freiberger et.al. 2501.16033 null
2025-01-27 FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments Zhiyuan Fu et.al. 2501.16029 null
2025-01-27 Transformability reveals the interplay of dynamics across different network orders Ming Xie et.al. 2501.16016 null
2025-01-27 TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference Jack Min Ong et.al. 2501.16007 null
2025-01-27 EDSep: An Effective Diffusion-Based Method for Speech Source Separation Jinwei Dong et.al. 2501.15965 null
2025-01-27 Rethinking the Bias of Foundation Model under Long-tailed Distribution Jiahao Chen et.al. 2501.15955 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-27 TimeHF: Billion-Scale Time Series Models Guided by Human Feedback Yongzhi Qi et.al. 2501.15942 null
2025-01-27 SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub Benjamin C. Carter et.al. 2501.15922 null
2025-01-27 Parametric Retrieval Augmented Generation Weihang Su et.al. 2501.15915 link
2025-01-27 Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation Muhammad Taha Tariq et.al. 2501.15901 null
2025-01-27 Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects Victor Deng et.al. 2501.15900 null
2025-01-27 Adaptive Width Neural Networks Federico Errica et.al. 2501.15889 null
2025-01-27 LCTG Bench: LLM Controlled Text Generation Benchmark Kentaro Kurihara et.al. 2501.15875 link
2025-01-27 LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models Yuewen Mei et.al. 2501.15850 null
2025-01-27 SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model Delin Qu et.al. 2501.15830 null
2025-01-27 Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference Tharindu B. Hewage et.al. 2501.15829 link
2025-01-27 MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer Qi Chen et.al. 2501.15826 null
2025-01-27 LemmaHead: RAG Assisted Proof Generation Using Large Language Models Tianbo Yang et.al. 2501.15797 null
2025-01-27 Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? Zhiling Chen et.al. 2501.15795 null
2025-01-27 Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs Yu Li et.al. 2501.15791 link
2025-01-27 Memorization and Regularization in Generative Diffusion Models Ricardo Baptista et.al. 2501.15785 link
2025-01-27 Large Language Models to Diffusion Finetuning Edoardo Cetin et.al. 2501.15781 null
2025-01-27 Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages Ivory Yang et.al. 2501.15773 link
2025-01-27 GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design Yuanfu Sun et.al. 2501.15755 null
2025-01-27 IndicMMLU-Pro: Benchmarking the Indic Large Language Models Sankalp KJ et.al. 2501.15747 null
2025-01-27 Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning Michael Xieyang Liu et.al. 2501.15727 null
2025-01-27 A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks Dong Li et.al. 2501.15724 null
2025-01-27 On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models Tadahiro Taniguchi et.al. 2501.15721 null
2025-01-26 Adapting Biomedical Abstracts into Plain language using Large Language Models Haritha Gangavarapu et.al. 2501.15700 null
2025-01-26 TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Yuxuan Gu et.al. 2501.15674 null
2025-01-26 Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting Yuxin Zhang et.al. 2501.15641 null
2025-01-26 BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation Ali Khodabandeh Yalabadi et.al. 2501.15631 link
2025-01-26 Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets Eduard Barbu et.al. 2501.15624 null
2025-01-26 Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning Zeyu Gan et.al. 2501.15602 link
2025-01-26 Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals Yinzhou Wang et.al. 2501.15599 null
2025-01-26 Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images Sichen Zhu et.al. 2501.15598 link
2025-01-26 SedarEval: Automated Evaluation using Self-Adaptive Rubrics Zhiyuan Fan et.al. 2501.15595 link
2025-01-26 SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain Dakuan Lu et.al. 2501.15587 link
2025-01-26 Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework Yuhong Sun et.al. 2501.15581 null
2025-01-26 Instruction Tuning for Story Understanding and Generation with Weak Supervision Yangshu Yuan et.al. 2501.15574 null
2025-01-26 Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models Spencer Ramsey et.al. 2501.15571 null
2025-01-26 ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer Lin Yueyu et.al. 2501.15570 link
2025-01-26 Ocean-OCR: Towards General OCR Application via a Vision-Language Model Song Chen et.al. 2501.15558 null
2025-01-26 Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles Hanwen Zhang et.al. 2501.15544 null
2025-01-26 Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths Yueyang Wang et.al. 2501.15522 null
2025-01-26 Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification Dan Song et.al. 2501.15503 null
2025-01-26 Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning Xiaohan Yu et.al. 2501.15470 null
2025-01-26 Data-adaptive Safety Rules for Training Reward Models Xiaomin Li et.al. 2501.15453 null
2025-01-26 OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas Xiaoyang Wang et.al. 2501.15427 null
2025-01-26 Visual Generation Without Guidance Huayu Chen et.al. 2501.15420 link
2025-01-26 AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement Junan Zhang et.al. 2501.15417 null
2025-01-26 The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation Raha Aghaei et.al. 2501.15411 null
2025-01-26 Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency Irin Kabakum et.al. 2501.15405 null
2025-01-26 How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning Tohida Rehman et.al. 2501.15398 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-26 How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback Manzong Huang et.al. 2501.15378 null
2025-01-26 Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models Melkamu Abay Mersha et.al. 2501.15374 null
2025-01-26 Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis Robinson Umeike et.al. 2501.15370 null
2025-01-26 Decentralized Low-Rank Fine-Tuning of Large Language Models Sajjad Ghiasvand et.al. 2501.15361 null
2025-01-26 Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection Bo Yang et.al. 2501.15355 null
2025-01-25 Fairness in LLM-Generated Surveys Andrés Abeliuk et.al. 2501.15351 null
2025-01-25 Between Puppet and Actor: Reframing Authorship in this Age of AI Agents Yuqian Sun et.al. 2501.15346 null
2025-01-25 Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data Jiajie Li et.al. 2501.15326 null
2025-01-25 ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning Shangqian Gao et.al. 2501.15316 null
2025-01-25 The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? Ayo Adedeji et.al. 2501.15310 null
2025-01-25 You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning Ayan Sengupta et.al. 2501.15296 null
2025-01-24 HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Xin Zhou et.al. 2501.14729 link
2025-01-24 Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? Ipek Baris Schlicht et.al. 2501.14719 null
2025-01-24 Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models Naihao Deng et.al. 2501.14717 null
2025-01-24 FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing James Seale Smith et.al. 2501.14713 null
2025-01-24 The Karp Dataset Mason DiCicco et.al. 2501.14705 null
2025-01-24 Rethinking Table Instruction Tuning Naihao Deng et.al. 2501.14693 null
2025-01-24 Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST Fuping Wu et.al. 2501.14685 null
2025-01-24 An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations Shabnam Hassani et.al. 2501.14683 null
2025-01-24 Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning Jisi Zhang et.al. 2501.14680 null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 link
2025-01-24 Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion Ziyao Xu et.al. 2501.14649 link
2025-01-24 Towards Scalable Topological Regularizers Hiu-Tung Wong et.al. 2501.14641 null
2025-01-24 Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics Renato Ghisellini et.al. 2501.14634 null
2025-01-24 Extracting Problem Structure with LLMs for Optimized SAT Local Search André Schilder et.al. 2501.14630 null
2025-01-24 Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data Jordi Abante et.al. 2501.14615 null
2025-01-24 ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Tianming Liang et.al. 2501.14607 null
2025-01-24 Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research Hamid Sarmadi et.al. 2501.14546 null
2025-01-24 VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning Benjamin Callewaert et.al. 2501.14540 null
2025-01-24 Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models Zhenguang Zhong et.al. 2501.14530 link
2025-01-24 Scene Understanding Enabled Semantic Communication with Open Channel Coding Zhe Xiang et.al. 2501.14520 null
2025-01-24 Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel Zhuoran Liu et.al. 2501.14512 null
2025-01-24 Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course Pavlin G. Poličar et.al. 2501.14499 null
2025-01-24 Evaluating and Improving Graph to Text Generation with Large Language Models Jie He et.al. 2501.14497 link
2025-01-24 RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Zhengyang Tang et.al. 2501.14492 link
2025-01-24 Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design Taehan Kim et.al. 2501.14469 null
2025-01-24 Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis Xiujing Guo et.al. 2501.14465 null
2025-01-24 Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing Zeping Yu et.al. 2501.14457 null
2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null
2025-01-24 GraphBC: Improving LLMs for Better Graph Data Processing Xu Chu et.al. 2501.14427 null
2025-01-24 CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios Michael Fuest et.al. 2501.14426 null
2025-01-24 DeepFlow: Serverless Large Language Model Serving at Scale Junhao Hu et.al. 2501.14417 null
2025-01-24 SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation Shengjie Wang et.al. 2501.14400 null
2025-01-24 ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer Yoni Schirris et.al. 2501.14379 link
2025-01-24 DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing Xinyu Ma et.al. 2501.14371 link
2025-01-24 Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches Ziad Sakr et.al. 2501.14366 null
2025-01-24 FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration Kai-Tuo Xu et.al. 2501.14350 link
2025-01-24 Chain-of-Retrieval Augmented Generation Liang Wang et.al. 2501.14342 null
2025-01-24 Exploring the sustainable scaling of AI dilemma: A projective study of corporations’ AI environmental impacts Clément Desroches et.al. 2501.14334 null
2025-01-24 Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models Ridhi Jain et.al. 2501.14326 null
2025-01-24 PAID: A Framework of Product-Centric Advertising Image Design Hongyu Chen et.al. 2501.14316 null
2025-01-24 Locality-aware Fair Scheduling in LLM Serving Shiyi Cao et.al. 2501.14312 null
2025-01-24 A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education Calvin Yeung et.al. 2501.14305 link
2025-01-24 MASTER: A Multi-Agent System with LLM Specialized MCTS Bingzheng Gan et.al. 2501.14304 null
2025-01-24 Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph Xujian Liang et.al. 2501.14300 link
2025-01-24 Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment Julian A. Schnabel et.al. 2501.14296 null
2025-01-24 Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes Sullam Jeoung et.al. 2501.14294 link
2025-01-24 Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches Feng Zhou et.al. 2501.14291 null
2025-01-24 Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation Sadegh Mahdavi et.al. 2501.14275 link
2025-01-24 Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors Yi Zhao et.al. 2501.14250 link
2025-01-24 Humanity’s Last Exam Long Phan et.al. 2501.14249 null
2025-01-24 Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game Rong Ye et.al. 2501.14225 null
2025-01-24 Top Ten Challenges Towards Agentic Neural Graph Databases Jiaxin Bai et.al. 2501.14224 null
2025-01-24 TFG-Flow: Training-free Guidance in Multimodal Generative Flow Haowei Lin et.al. 2501.14216 null
2025-01-24 Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading Minrui Xu et.al. 2501.14205 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-24 Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models Saaduddin Mahmud et.al. 2501.14189 null
2025-01-24 GeoSim.AI: AI assistants for numerical simulations in geomechanics Yared W. Bekele et.al. 2501.14186 null
2025-01-24 AI Chatbots as Professional Service Agents: Developing a Professional Identity Wenwen Li et.al. 2501.14179 null
2025-01-24 Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models Yile Gu et.al. 2501.14170 null
2025-01-24 Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction Dongming Sheng et.al. 2501.14144 null
2025-01-23 Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation Derek Yotheringhay et.al. 2501.14119 null
2025-01-23 Domain-Factored Untrained Deep Prior for Spectrum Cartography Subash Timilsina et.al. 2501.14116 null
2025-01-23 MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning Joshua Davis et.al. 2501.14105 link
2025-01-23 StreamingRAG: Real-time Contextual Retrieval and Generation Framework Murugan Sankaradas et.al. 2501.14101 null
2025-01-23 Enhancing Biomedical Relation Extraction with Directionality Po-Ting Lai et.al. 2501.14079 link
2025-01-23 LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language Yubin Ge et.al. 2501.14073 null
2025-01-23 Efficient 2D CT Foundation Model for Contrast Phase Classification Benjamin Hou et.al. 2501.14066 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-23 LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps Andrey Palaev et.al. 2501.14046 link
2025-01-23 Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions Jianfeng Zhu et.al. 2501.14037 null
2025-01-23 CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation Guofeng Cui et.al. 2501.13927 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 Binary Diffusion Probabilistic Model Vitaliy Kinakh et.al. 2501.13915 null
2025-01-23 Analysis of Indic Language Capabilities in LLMs Aatman Vaidya et.al. 2501.13912 null
2025-01-23 Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models Linh Tran et.al. 2501.13904 null
2025-01-23 Exploring Finetuned Audio-LLM on Heart Murmur Features Adrian Florea et.al. 2501.13884 null
2025-01-23 The machine learning platform for developers of large systems Alexey Naikov et.al. 2501.13881 null
2025-01-23 A RAG-Based Institutional Assistant Gustavo Kuratomi et.al. 2501.13880 null
2025-01-23 On the Reasoning Capacity of AI Models and How to Quantify It Santosh Kumar Radha et.al. 2501.13833 null
2025-01-23 Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing Hao Zhang et.al. 2501.13831 null
2025-01-23 Hallucinations Can Improve Large Language Models in Drug Discovery Shuzhou Yuan et.al. 2501.13824 null
2025-01-23 Large Language Model driven Policy Exploration for Recommender Systems Jie Wang et.al. 2501.13816 null
2025-01-23 Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change Mowafak Allaham et.al. 2501.13802 null
2025-01-23 Parameter-Efficient Fine-Tuning for Foundation Models Dan Zhang et.al. 2501.13787 link
2025-01-23 Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling Tanya Rodchenko et.al. 2501.13779 null
2025-01-23 Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework Yoonsang Kim et.al. 2501.13778 link
2025-01-23 Do Large Language Models Truly Understand Geometric Structures? Xiaofeng Wang et.al. 2501.13773 link
2025-01-23 Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak Erjia Xiao et.al. 2501.13772 null
2025-01-23 UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models Xin Xu et.al. 2501.13766 null
2025-01-23 EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents Yuhui Yun et.al. 2501.13746 null
2025-01-23 GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification Te Pei et.al. 2501.13743 null
2025-01-23 An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities Zezhou Yang et.al. 2501.13742 link
2025-01-23 Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Chang Gong et.al. 2501.13731 null
2025-01-23 RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation Shi-Qi Yan et.al. 2501.13726 null
2025-01-23 Musical ethnocentrism in Large Language Models Anna Kruspe et.al. 2501.13720 null
2025-01-23 A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation Dario Serez et.al. 2501.13718 null
2025-01-23 EventVL: Understand Event Streams via Multimodal Large Language Model Pengteng Li et.al. 2501.13707 null
2025-01-23 DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale Linghao Zhang et.al. 2501.13699 null
2025-01-23 Question Answering on Patient Medical Records with Private Fine-Tuned LLMs Sara Kothari et.al. 2501.13687 null
2025-01-23 HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor Zihui Wu et.al. 2501.13677 link
2025-01-23 How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization Shezheng Song et.al. 2501.13669 null
2025-01-23 LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models Yizheng Sun et.al. 2501.13652 null
2025-01-23 Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Zhenghao Lin et.al. 2501.13629 null
2025-01-23 Text-to-SQL based on Large Language Models and Database Keyword Search Eduardo R. Nascimento et.al. 2501.13594 null
2025-01-23 Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization Lei Huang et.al. 2501.13573 null
2025-01-23 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt Tao Liu et.al. 2501.13554 link
2025-01-23 LLMs Can Plan Only If We Tell Them Bilgehan Sel et.al. 2501.13545 null
2025-01-23 ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Jianxin Liang et.al. 2501.13536 null
2025-01-23 RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles Munachiso Nwadike et.al. 2501.13491 null
2025-01-23 Adaptive Testing for LLM-Based Applications: A Diversity-based Approach Juyeon Yoon et.al. 2501.13480 null
2025-01-23 LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation JiaXin Chen et.al. 2501.13475 null
2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link
2025-01-23 Spurious Forgetting in Continual Learning of Language Models Junhao Zheng et.al. 2501.13453 link
2025-01-23 Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models Bo Gao et.al. 2501.13428 null
2025-01-23 Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling Tomek Jaroslawski et.al. 2501.13415 null
2025-01-23 VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework He Kong et.al. 2501.13411 link
2025-01-23 Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures Minglong Dong et.al. 2501.13396 null
2025-01-23 Can Large Language Models Understand Preferences in Personalized Recommendation? Zhaoxuan Tan et.al. 2501.13391 link
2025-01-23 Do as We Do, Not as You Think: the Conformity of Large Language Models Zhiyuan Weng et.al. 2501.13381 link
2025-01-23 Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility Gabrielle Hoyer et.al. 2501.13376 null
2025-01-23 Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement Jae-Sung Bae et.al. 2501.13372 null
2025-01-23 Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification Yuzhuo Li et.al. 2501.13368 null
2025-01-23 50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications Zewei Shi et.al. 2501.13351 null
2025-01-23 MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize Haohang Xu et.al. 2501.13349 null
2025-01-23 Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation Rong Shan et.al. 2501.13344 null
2025-01-23 Multi-aspect Knowledge Distillation with Large Language Model Taegyeong Lee et.al. 2501.13341 link
2025-01-23 Generative Multi-Form Bayesian Optimization Zhendong Guo et.al. 2501.13337 null
2025-01-23 SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network Songge Zhang et.al. 2501.13318 null
2025-01-23 Representing Visualization Insights as a Dense Insight Network Jane Hoffswell et.al. 2501.13309 null
2025-01-23 OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia Xuelong Geng et.al. 2501.13306 link
2025-01-23 Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers Akshit Achara et.al. 2501.13302 link
2025-01-23 Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents Shrinidhi Kumbhar et.al. 2501.13299 null
2025-01-23 RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering Yang Bai et.al. 2501.13297 link
2025-01-23 Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols John Joon Young Chung et.al. 2501.13284 null
2025-01-22 MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis Daeun Jung et.al. 2501.13277 link
2025-01-22 RAG-Reward: Optimizing RAG with Reward Modeling and RLHF Hanning Zhang et.al. 2501.13264 null
2025-01-22 Exploring GPT’s Ability as a Judge in Music Understanding Kun Fang et.al. 2501.13261 link
2025-01-22 Bypassing Array Canaries via Autonomous Function Call Resolution Nathaniel Oh et.al. 2501.13256 link
2025-01-22 S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning Yichen Wu et.al. 2501.13198 null
2025-01-22 Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century Axel Loewe et.al. 2501.13142 null
2025-01-23 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-22 Robust Representation Consistency Model via Contrastive Denoising Jiachen Lei et.al. 2501.13094 link
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null
2025-01-22 Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning Bohao Yang et.al. 2501.13042 link
2025-01-22 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Yantao Liu et.al. 2501.13007 link
2025-01-22 Neural network enhanced cross entropy benchmark for monitored circuits Yangrui Hu et.al. 2501.13005 null
2025-01-22 Large Language Model-Based Semantic Communication System for Image Transmission Soheyb Ribouh et.al. 2501.12988 null
2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null
2025-01-22 Low-dimensional adaptation of diffusion models: Convergence in total variation Jiadong Liang et.al. 2501.12982 null
2025-01-22 OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models Chongren Sun et.al. 2501.12975 link
2025-01-22 Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs Jan Corazza et.al. 2501.12972 null
2025-01-22 It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act Kristof Meding et.al. 2501.12962 null
2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 null
2025-01-22 GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models Pengxiang Zhao et.al. 2501.12956 null
2025-01-22 3D Object Manipulation in a Single Image using Generative Models Ruisi Zhao et.al. 2501.12935 null
2025-01-22 Correctness Assessment of Code Generated by Large Language Models Using Internal Representations Tuan-Dung Bui et.al. 2501.12934 null
2025-01-22 DynamicEarth: How Far are We from Open-Vocabulary Change Detection? Kaiyu Li et.al. 2501.12931 null
2025-01-22 A Functional Software Reference Architecture for LLM-Integrated Systems Alessio Bucaioni et.al. 2501.12904 null
2025-01-22 Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration Offa Kingsleigh et.al. 2501.12901 null
2025-01-22 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Yafu Li et.al. 2501.12895 link
2025-01-23 Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program Carlton Shepherd et.al. 2501.12883 null
2025-01-22 WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge Jingyuan Chen et.al. 2501.12877 null
2025-01-22 ACEBench: Who Wins the Match Point in Tool Learning? Chen Chen et.al. 2501.12851 null
2025-01-22 AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation Aghiles Kebaili et.al. 2501.12840 null
2025-01-22 Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home Viktor Moskvoretskii et.al. 2501.12835 null
2025-01-22 Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek John Pavlopoulos et.al. 2501.12826 link
2025-01-22 Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks Alessio Quercia et.al. 2501.12824 null
2025-01-22 Certified Guidance for Planning with Deep Generative Models Francesco Giacomarra et.al. 2501.12815 null
2025-01-22 Revisit Self-Debugging with Self-Generated Tests for Code Generation Xiancai Chen et.al. 2501.12793 null
2025-01-22 LLMs as Repositories of Factual Knowledge: Limitations and Solutions Seyed Mahed Mousavi et.al. 2501.12774 null
2025-01-22 NExtLong: Toward Effective Long-Context Training without Long Documents Chaochen Gao et.al. 2501.12766 link
2025-01-22 Online Preference Alignment for Language Models via Count-based Exploration Chenjia Bai et.al. 2501.12735 link
2025-01-22 Paradigm-Based Automatic HDL Code Generation Using LLMs Wenhao Sun et.al. 2501.12702 null
2025-01-22 Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression Kai Yoshida et.al. 2501.12698 null
2025-01-22 Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering Qian Tao et.al. 2501.12697 null
2025-01-22 SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling Shengshi Yao et.al. 2501.12696 null
2025-01-22 EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation Yifan Yu et.al. 2501.12689 null
2025-01-22 Distillation Quantification for Large Language Models Sunbowen Lee et.al. 2501.12619 link
2025-01-22 Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We? Taiming Wang et.al. 2501.12617 null
2025-01-22 Kimi k1.5: Scaling Reinforcement Learning with LLMs Kimi Team et.al. 2501.12599 null
2025-01-22 Leveraging LLMs to Create a Haptic Devices’ Recommendation System Yang Liu et.al. 2501.12573 null
2025-01-22 Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review Rock Yuren Pang et.al. 2501.12557 link
2025-01-21 Human-like conceptual representations emerge from language prediction Ningyu Xu et.al. 2501.12547 null
2025-01-21 How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models? Mirali Purohit et.al. 2501.12535 null
2025-01-21 An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts Dhia Elhaq Rzig et.al. 2501.12521 null
2025-01-21 A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data Minh Tran et.al. 2501.12501 null
2025-01-21 The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws Tian Jin et.al. 2501.12486 null
2025-01-21 An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models Xiaoyu Chu et.al. 2501.12469 link
2025-01-21 Adaptive PII Mitigation Framework for Large Language Models Shubhi Asthana et.al. 2501.12465 null
2025-01-21 Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management Arthur Vitui et.al. 2501.12461 link
2025-01-21 Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications Shubhi Asthana et.al. 2501.12456 null
2025-01-21 Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation Dongsheng Zhu et.al. 2501.12432 null
2025-01-21 FREYR: A Framework for Recognizing and Executing Your Requests Roberto Gallotta et.al. 2501.12423 link
2025-01-21 CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning Eunjee Choi et.al. 2501.12422 null
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks Greg Olmschenk et.al. 2501.12383 null
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 link
2025-01-21 Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 null
2025-01-21 Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration Thomas Walshe et.al. 2501.12332 null
2025-01-21 Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops Mohamed Harmanani et.al. 2501.12331 link
2025-01-21 VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Xianwei Zhuang et.al. 2501.12327 link
2025-01-21 LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations Hasan Abu-Rasheed et.al. 2501.12300 null
2025-01-21 MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks Qishen Zhou et.al. 2501.12281 link
2025-01-21 Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Maosong Cao et.al. 2501.12273 link
2025-01-21 FOCUS: First Order Concentrated Updating Scheme Yizhou Liu et.al. 2501.12243 null
2025-01-21 InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models Pha Nguyen et.al. 2501.12231 null
2025-01-21 CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning Yuanheng Fang et.al. 2501.12226 null
2025-01-21 Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces Allard Oelen et.al. 2501.12221 null
2025-01-21 You Can’t Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense Wuyuao Mai et.al. 2501.12210 null
2025-01-21 Explainability for Vision Foundation Models: A Survey Rémi Kazmierczak et.al. 2501.12203 null
2025-01-22 Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Zibo Zhao et.al. 2501.12202 link
2025-01-21 BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks Zhuang Li et.al. 2501.12174 null
2025-01-21 Contextualizing Recommendation Explanations with LLMs: A User Study Yuanjun Feng et.al. 2501.12152 null
2025-01-21 Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities Qirun Dai et.al. 2501.12147 null
2025-01-21 Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot Daniele Bifolco et.al. 2501.12134 null
2025-01-21 Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions Tim Schreiter et.al. 2501.12128 null
2025-01-21 Can open source large language models be used for tumor documentation in Germany? – An evaluation on urological doctors’ notes Stefan Lenz et.al. 2501.12106 link
2025-01-21 Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis Weile Luo et.al. 2501.12084 null
2025-01-21 Phishing Awareness via Game-Based Learning Argianto Rahartomo et.al. 2501.12077 link
2025-01-21 PINNsAgent: Automated PDE Surrogation with Large Language Models Qingpo Wuwu et.al. 2501.12053 null
2025-01-21 Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation Chen Griner et.al. 2501.12033 null
2025-01-21 Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing’s Syndrome Diagnosis in Facial Analysis Hongjun Liu et.al. 2501.12023 null
2025-01-21 Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection? Samantha Min Er Yew et.al. 2501.12016 null
2025-01-21 Rate-Aware Learned Speech Compression Jun Xu et.al. 2501.11999 null
2025-01-21 Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models Rupesh Raj Karn et.al. 2501.11979 null
2025-01-21 Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues Maya Medjad et.al. 2501.11977 link
2025-01-21 Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization Jie Zhao et.al. 2501.11968 null
2025-01-21 A Hybrid Attention Framework for Fake News Detection with Large Language Models Xiaochuan Xu et.al. 2501.11967 null
2025-01-21 TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection Yang Cao et.al. 2501.11960 null
2025-01-21 Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model Minghan Wang et.al. 2501.11953 null
2025-01-21 ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation Peter Devine et.al. 2501.11929 link
2025-01-21 Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model He Chang et.al. 2501.11911 null
2025-01-21 Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation Junhong Lian et.al. 2501.11900 link
2025-01-22 Med-R $^2$ : Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine Keer Lu et.al. 2501.11885 null
2025-01-21 From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning Yafu Li et.al. 2501.11877 link
2025-01-21 LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems Venkata Sai Aswath Duvvuru et.al. 2501.11864 null
2025-01-21 EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Zhili Cheng et.al. 2501.11858 link
2025-01-21 Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance Nikos Kanakaris et.al. 2501.11849 link
2025-01-21 A Survey on Memory-Efficient Large-Scale Model Training in AI for Science Kaiyuan Tian et.al. 2501.11847 null
2025-01-21 Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction Noah L. Schroeder et.al. 2501.11840 null
2025-01-21 PXGen: A Post-hoc Explainable Method for Generative Models Yen-Lung Huang et.al. 2501.11827 null
2025-01-21 CogMorph: Cognitive Morphing Attacks for Text-to-Image Models Zonglei Jing et.al. 2501.11815 null
2025-01-20 Benchmarking Large Language Models via Random Variables Zijin Hong et.al. 2501.11790 null
2025-01-20 Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection Ali Naseh et.al. 2501.11786 null
2025-01-20 Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference Pouya Hamadanian et.al. 2501.11779 link
2025-01-20 The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers Alina Starovolsky-Shitrit et.al. 2501.11770 null
2025-01-20 Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems Fatemeh Nazary et.al. 2501.11759 link
2025-01-20 A generalizable 3D framework and model for self-supervised learning in medical imaging Tony Xu et.al. 2501.11755 null
2025-01-20 Are generative models fair? A study of racial bias in dermatological image generation Miguel López-Pérez et.al. 2501.11752 null
2025-01-20 Optimizing Pretraining Data Mixtures with LLM-Estimated Utility William Held et.al. 2501.11747 null
2025-01-20 MedicoSAM: Towards foundation models for medical image segmentation Anwai Archit et.al. 2501.11734 link
2025-01-20 Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Zhenhailong Wang et.al. 2501.11733 null
2025-01-20 Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy Saeid Asgari Taghanaki et.al. 2501.11721 link
2025-01-20 YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners’ Perspectives Nong Ming et.al. 2501.11712 link
2025-01-20 Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution Ramtin Ehsani et.al. 2501.11709 null
2025-01-20 Trustformer: A Trusted Federated Transformer Ali Abbasi Tadi et.al. 2501.11706 null
2025-01-20 Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) Brian E. Perron et.al. 2501.11705 null
2025-01-20 Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling Zhenyu Hou et.al. 2501.11651 link
2025-01-20 Trojan Detection Through Pattern Recognition for Large Language Models Vedant Bhasin et.al. 2501.11621 null
2025-01-20 Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems Giorgio Robino et.al. 2501.11613 null
2025-01-20 SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks Wentao Wan et.al. 2501.11599 link
2025-01-20 Recurrent Diffusion for Large-Scale Parameter Generation Kai Wang et.al. 2501.11587 link
2025-01-20 Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models Mahyar Habibi et.al. 2501.11581 null
2025-01-20 Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution Zhiyuan You et.al. 2501.11561 null
2025-01-20 PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation Jinyu Wang et.al. 2501.11551 link
2025-01-20 UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion Zixuan Chen et.al. 2501.11515 null
2025-01-20 Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges Vincent Koc et.al. 2501.11496 null
2025-01-20 Graph-defined Language Learning with LLMs Huachi Zhou et.al. 2501.11478 null
2025-01-20 Curiosity-Driven Reinforcement Learning from Human Feedback Haoran Sun et.al. 2501.11463 link
2025-01-20 Ontology Matching with Large Language Models and Prioritized Depth-First Search Maria Taboada et.al. 2501.11441 null
2025-01-20 One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor Zhikun Wu et.al. 2501.11433 null
2025-01-20 A Survey on Diffusion Models for Anomaly Detection Jing Liu et.al. 2501.11430 link
2025-01-20 Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Siyu Yuan et.al. 2501.11425 link
2025-01-20 Neural Contextual Reinforcement Framework for Logical Structure Language Generation Marcus Irvin et.al. 2501.11417 null
2025-01-20 Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing Kevin Sim et.al. 2501.11411 null
2025-01-20 Revisiting Language Models in Neural News Recommender Systems Yuyue Zhao et.al. 2501.11391 link
2025-01-20 Towards Advancing Code Generation with Large Language Models: A Research Roadmap Haolin Jin et.al. 2501.11354 null
2025-01-20 EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery Guankun Wang et.al. 2501.11347 link
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 Few-shot Policy (de)composition in Conversational Question Answering Kyle Erwin et.al. 2501.11335 null
2025-01-20 Nested Annealed Training Scheme for Generative Adversarial Networks Chang Wan et.al. 2501.11318 null
2025-01-20 Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning Zhongtian Hu et.al. 2501.11292 null
2025-01-20 Large Language Model Agents for Radio Map Generation and Wireless Network Planning Hongye Quan et.al. 2501.11283 null
2025-01-20 Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries Yi-Hui Lee et.al. 2501.11273 null
2025-01-20 Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios Zhongtian Hu et.al. 2501.11269 null
2025-01-20 Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian Wannita Takerngsaksiri et.al. 2501.11264 link
2025-01-20 Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models Zhuangzhuang Yan et.al. 2501.11247 null
2025-01-20 Irony in Emojis: A Comparative Study of Human and LLM Interpretation Yawen Zheng et.al. 2501.11241 null
2025-01-20 KPL: Training-Free Medical Knowledge Mining of Vision-Language Models Jiaxiang Liu et.al. 2501.11231 link
2025-01-20 Reasoning Language Models: A Blueprint Maciej Besta et.al. 2501.11223 link
2025-01-20 Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation Ivan Lopez et.al. 2501.11199 null
2025-01-19 Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests Kristin Blesch et.al. 2501.11178 link
2025-01-17 FaceXBench: Evaluating Multimodal LLMs on Face Understanding Kartik Narayan et.al. 2501.10360 link
2025-01-17 Zero-Shot Monocular Scene Flow Estimation in the Wild Yiqing Liang et.al. 2501.10357 null
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 null
2025-01-17 Large language models for automated scholarly paper review: A survey Zhenzhen Zhuang et.al. 2501.10326 null
2025-01-17 HiMix: Reducing Computational Complexity in Large Vision-Language Models Xuange Zhang et.al. 2501.10318 null
2025-01-17 Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs Claudio Di Sipio et.al. 2501.10313 null
2025-01-17 Computational Protein Science in the Era of Large Language Models (LLMs) Wenqi Fan et.al. 2501.10282 null
2025-01-17 Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation Azat Abdullin et.al. 2501.10200 null
2025-01-17 Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education William Hersh et.al. 2501.10186 null
2025-01-17 Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval Vera Pavlova et.al. 2501.10175 null
2025-01-17 Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis Abhishek Kaushik et.al. 2501.10134 null
2025-01-17 ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Lucen Zhong et.al. 2501.10132 link
2025-01-17 PaSa: An LLM Agent for Comprehensive Academic Paper Search Yichen He et.al. 2501.10120 link
2025-01-17 AI-Generated Music Detection and its Challenges Darius Afchar et.al. 2501.10111 link
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-17 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng et.al. 2501.10105 link
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-17 FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization Zhaopeng Gu et.al. 2501.10067 link
2025-01-17 Accelerating Large Language Models through Partially Linear Feed-Forward Network Gansen Hu et.al. 2501.10054 null
2025-01-17 AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search Wenfeng Feng et.al. 2501.10053 null
2025-01-17 Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists Alyssia Chen et.al. 2501.10037 null
2025-01-17 Mapping scientific communities at scale Victor Barbier et.al. 2501.10035 link
2025-01-17 Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions Zhijie Tan et.al. 2501.10011 null
2025-01-17 Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models Qiang Liu et.al. 2501.09997 null
2025-01-17 Agent-as-Judge for Factual Summarization of Long Narratives Yeonseok Jeong et.al. 2501.09993 link
2025-01-17 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Yuefan Cao et.al. 2501.09982 null
2025-01-17 GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions Heda Zuo et.al. 2501.09972 null
2025-01-17 Explainable artificial intelligence (XAI): from inherent explainability to large language models Fuseini Mumuni et.al. 2501.09967 null
2025-01-17 A Survey on Multi-Turn Interaction Capabilities of Large Language Models Chen Zhang et.al. 2501.09959 null
2025-01-17 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-17 AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations Jamin Seo et.al. 2501.09954 link
2025-01-17 Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt Qingcheng Zeng et.al. 2501.09950 null
2025-01-17 MultiPruner: Balanced Structure Removal in Foundation Models J. Pablo Muñoz et.al. 2501.09949 link
2025-01-17 Steering Large Language Models with Feature Guided Activation Additions Samuel Soo et.al. 2501.09929 null
2025-01-17 Towards A Litmus Test for Common Sense Hugo Latapie et.al. 2501.09913 null
2025-01-17 Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project’s Talent Knowledge Graph Jiawei Xu et.al. 2501.09909 null
2025-01-17 Position: Open and Closed Large Language Models in Healthcare Jiawei Xu et.al. 2501.09906 null
2025-01-17 FoundationStereo: Zero-Shot Stereo Matching Bowen Wen et.al. 2501.09898 null
2025-01-17 Evolving Deeper LLM Thinking Kuang-Huei Lee et.al. 2501.09891 null
2025-01-17 Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment Mohammad Sadegh Sheikhaei et.al. 2501.09888 link
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-16 ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction Izzeddin Teeti et.al. 2501.09878 null
2025-01-16 Geometry-Preserving Encoder/Decoder in Latent Generative Models Wonjun Lee et.al. 2501.09876 null
2025-01-16 An LLM-Guided Tutoring System for Social Skills Training Michael Guevarra et.al. 2501.09870 null
2025-01-16 Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing Wenhan Wang et.al. 2501.09866 null
2025-01-16 Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message Jiawei Li et.al. 2501.09861 null
2025-01-16 PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery Shristi Das Biswas et.al. 2501.09826 link
2025-01-16 Bridging Language Barriers in Healthcare: A Study on Arabic LLMs Nada Saadi et.al. 2501.09825 null
2025-01-16 BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling Daniele Castellana et.al. 2501.09821 link
2025-01-16 Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems Soham Roy et.al. 2501.09801 null
2025-01-16 Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API Andrey Labunets et.al. 2501.09798 null
2025-01-16 GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation Weiliang Tang et.al. 2501.09783 null
2025-01-16 SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation Wanqi Yin et.al. 2501.09782 link
2025-01-16 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Zhongwei Ren et.al. 2501.09781 null
2025-01-16 Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Tairan Fu et.al. 2501.09775 null
2025-01-16 Distilling Multi-modal Large Language Models for Autonomous Driving Deepti Hegde et.al. 2501.09757 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755 null
2025-01-16 Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues Youngjoon Jang et.al. 2501.09754 null
2025-01-16 OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Zekun Xi et.al. 2501.09751 null
2025-01-16 Enhancing Lexicon-Based Text Embeddings with Large Language Models Yibin Lei et.al. 2501.09749 null
2025-01-16 Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models Bihui Jin et.al. 2501.09745 null
2025-01-16 KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports Hajung Kim et.al. 2501.09744 null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text Jihed Ncib et.al. 2501.09719 null
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 link
2025-01-16 Domain Adaptation of Foundation LLMs for e-Commerce Christian Herold et.al. 2501.09706 null
2025-01-16 Cueless EEG imagined speech for subject identification: dataset and benchmarks Ali Derakhshesh et.al. 2501.09700 link
2025-01-16 Simulated Interactive Debugging Yannic Noller et.al. 2501.09694 null
2025-01-17 Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities Fengli Xu et.al. 2501.09686 null
2025-01-16 Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review Masatoshi Uehara et.al. 2501.09685 null
2025-01-16 Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark Alexis Roger et.al. 2501.09672 null
2025-01-16 A Survey of Research in Large Language Models for Electronic Design Automation Jingyu Pan et.al. 2501.09655 null
2025-01-16 The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Jonathan Katzy et.al. 2501.09653 null
2025-01-16 CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding Johannes Kirmayr et.al. 2501.09645 link
2025-01-17 LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading Kuan-Ming Liu et.al. 2501.09636 null
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631 null
2025-01-16 Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment Chaoqi Wang et.al. 2501.09620 link
2025-01-16 From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs Hrithik Majumdar Shibu et.al. 2501.09604 link
2025-01-16 Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures Pratyush Dhingra et.al. 2501.09588 null
2025-01-16 Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis Tingxuan Chen et.al. 2501.09555 null
2025-01-16 AI in Support of Diversity and Inclusion Çiçek Güven et.al. 2501.09534 null
2025-01-16 Confidence Estimation for Error Detection in Text-to-SQL Systems Oleg Somov et.al. 2501.09527 null
2025-01-16 Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data Omar Mena et.al. 2501.09521 null
2025-01-16 AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Junjie He et.al. 2501.09503 null
2025-01-16 Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis Qize Yang et.al. 2501.09502 null
2025-01-16 Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework Nuo Chen et.al. 2501.09493 null
2025-01-16 Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators Zhaocheng Liu et.al. 2501.09484 link
2025-01-16 Guided Debugging of Auto-Translated Code Using Differential Testing Shengnan Wu et.al. 2501.09475 null
2025-01-16 DEFOM-Stereo: Depth Foundation Model Based Stereo Matching Hualie Jiang et.al. 2501.09466 link
2025-01-16 Pruning for Sparse Diffusion Models based on Gradient Flow Ben Wan et.al. 2501.09464 null
2025-01-16 “A Great Start, But…”: Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design Tianhao He et.al. 2501.09457 null
2025-01-16 Solving the unsolvable: Translating case law in Hong Kong King-kui Sin et.al. 2501.09444 null
2025-01-16 Scaling up self-supervised learning for improved surgical foundation models Tim J. M. Jaspers et.al. 2501.09436 link
2025-01-16 CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Hwan Heo et.al. 2501.09433 link
2025-01-16 A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy Huandong Wang et.al. 2501.09431 null
2025-01-16 AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring Xinyi Wang et.al. 2501.09428 null
2025-01-16 AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling Ancheng Xu et.al. 2501.09426 null
2025-01-16 FASP: Fast and Accurate Structured Pruning of Large Language Models Hanyu Hu et.al. 2501.09412 null
2025-01-16 MoE $^2$ : Optimizing Collaborative Inference for Edge Large Language Models Lyudong Jin et.al. 2501.09410 null
2025-01-16 Adaptive Contextual Caching for Mobile Edge Large Language Model Service Guangyuan Liu et.al. 2501.09383 null
2025-01-16 Aligning Instruction Tuning with Pre-training Yiming Liang et.al. 2501.09368 null
2025-01-16 PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks Huiyou Zhan et.al. 2501.09367 null
2025-01-16 YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks Saptarashmi Bandyopadhyay et.al. 2501.09355 null
2025-01-16 UVRM: A Scalable 3D Reconstruction Model from Unposed Videos Shiu-hong Kao et.al. 2501.09347 null
2025-01-16 Rational Tuning of LLM Cascades via Probabilistic Modeling Michael J. Zellinger et.al. 2501.09345 null
2025-01-16 SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs Anbang Ye et.al. 2501.09316 null
2025-01-16 A Study of In-Context-Learning-Based Text-to-SQL Errors Jiawei Shen et.al. 2501.09310 link
2025-01-16 To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation Kaustubh D. Dhole et.al. 2501.09292 null
2025-01-16 LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport Kyeongha Rho et.al. 2501.09291 link
2025-01-16 Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding Kohei Torimi et.al. 2501.09278 null
2025-01-16 Large Language Model is Secretly a Protein Sequence Optimizer Yinkai Wang et.al. 2501.09274 null
2025-01-16 Perspective Transition of Large Language Models for Solving Subjective Tasks Xiaolong Wang et.al. 2501.09265 null
2025-01-16 Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition Takaaki Hori et.al. 2501.09258 null
2025-01-16 Clone-Robust AI Alignment Ariel D. Procaccia et.al. 2501.09254 null
2025-01-16 Split Fine-Tuning for Large Language Models in Wireless Networks Songge Zhang et.al. 2501.09237 null
2025-01-16 Foundations of Large Language Models Tong Xiao et.al. 2501.09223 null
2025-01-16 Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs Sanchit Sinha et.al. 2501.09221 null
2025-01-16 A Simple Graph Contrastive Learning Framework for Short Text Classification Yonghao Liu et.al. 2501.09219 link
2025-01-16 Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics Yuanyuan Wei et.al. 2501.09218 null
2025-01-16 Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning Yonghao Liu et.al. 2501.09214 link
2025-01-16 FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training Hongzhou Yu et.al. 2501.09213 link
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 Towards Semantics Lifting for Scientific Computing: A Case Study on FFT Naifeng Zhang et.al. 2501.09201 null
2025-01-15 Guiding Retrieval using LLM-based Listwise Rankers Mandeep Rathee et.al. 2501.09186 link
2025-01-15 The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching Yevhen Kostiuk et.al. 2501.09164 null
2025-01-15 Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability Stephanie L. Day et.al. 2501.09158 null
2025-01-15 Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History Yevhen Kostiuk et.al. 2501.09154 null
2025-01-15 Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation Xingxin He et.al. 2501.09138 null
2025-01-15 Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Aditi Singh et.al. 2501.09136 link
2025-01-15 HAFix: History-Augmented Large Language Models for Bug Fixing Yu Shi et.al. 2501.09135 link
2025-01-15 Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing Eshaan Tanwar et.al. 2501.09127 link
2025-01-15 Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment Conrad Borchers et.al. 2501.09126 null
2025-01-15 Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach Alireza Ghaffari et.al. 2501.09107 null
2025-01-15 Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites Hans W. A. Hanley et.al. 2501.09102 link
2025-01-15 Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative Yuqian Sun et.al. 2501.09099 null
2025-01-15 SteLLA: A Structured Grading System Using LLMs with RAG Hefei Qiu et.al. 2501.09092 null
2025-01-15 Generative diffusion model with inverse renormalization group flows Kanta Masuki et.al. 2501.09064 link
2025-01-15 Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition Sneheel Sarangi et.al. 2501.09056 link
2025-01-15 How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias Tosin Fadahunsi et.al. 2501.09014 link
2025-01-15 Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians Ishan Amin et.al. 2501.09009 link
2025-01-15 Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails Shaona Ghosh et.al. 2501.09004 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 null
2025-01-15 CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks Krit Tangsongcharoen et.al. 2501.08998 link
2025-01-15 VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science Youssef Abdalla et.al. 2501.08995 link
2025-01-15 CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities Haozhe Xie et.al. 2501.08983 link
2025-01-15 Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models Emma Croxford et.al. 2501.08977 null
2025-01-15 Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models Karukriti Kaushik Ghosh et.al. 2501.08974 null
2025-01-15 Analyzing the Ethical Logic of Six Large Language Models W. Russell Neuman et.al. 2501.08951 null
2025-01-15 Applying General Turn-taking Models to Conversational Human-Robot Interaction Gabriel Skantze et.al. 2501.08946 null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 null
2025-01-15 GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge Liam Dugan et.al. 2501.08913 link
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 link
2025-01-15 Connecting SPDE to SGMs Junsu Seo et.al. 2501.08877 null
2025-01-15 Exploring Task-Level Optimal Prompts for Visual In-Context Learning Yan Zhu et.al. 2501.08841 null
2025-01-15 How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering Christoph Treude et.al. 2501.08774 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-15 Enhanced Large Language Models for Effective Screening of Depression and Anxiety June M. Liu et.al. 2501.08769 null
2025-01-15 Few-Shot Learner Generalizes Across AI-Generated Image Detection Shiyu Wu et.al. 2501.08763 null
2025-01-15 Leveraging LLM Agents for Translating Network Configurations Yunze Wei et.al. 2501.08760 null
2025-01-15 The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities Irina Bigoulaeva et.al. 2501.08716 link
2025-01-15 Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching Chuangtao Ma et.al. 2501.08686 link
2025-01-15 RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency Siqi Li et.al. 2501.08682 null
2025-01-15 Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery Zeqin Liao et.al. 2501.08670 null
2025-01-15 MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities Savya Khosla et.al. 2501.08648 null
2025-01-15 Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations Kaiyuan Zheng et.al. 2501.08641 null
2025-01-15 SWSC: Shared Weight for Similar Channel in LLM Binrui Zeng et.al. 2501.08631 null
2025-01-15 Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models Aruna Sankaranarayanan et.al. 2501.08618 link
2025-01-15 RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Kaiqu Liang et.al. 2501.08617 null
2025-01-15 Assessing the Alignment of FOL Closeness Metrics with Human Judgement Ramya Keerthy Thatikonda et.al. 2501.08613 link
2025-01-15 Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design Zhi Zheng et.al. 2501.08603 link
2025-01-15 AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL Tyler Stennett et.al. 2501.08600 null
2025-01-15 LlamaRestTest: Effective REST API Testing with Small Language Models Myeongsoo Kim et.al. 2501.08598 null
2025-01-15 Sound Scene Synthesis at the DCASE 2024 Challenge Mathieu Lagrange et.al. 2501.08587 null
2025-01-15 LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model Yuxuan Hu et.al. 2501.08582 null
2025-01-15 Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation Jiaqi Huang et.al. 2501.08580 link
2025-01-15 Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms Kewei Li et.al. 2501.08570 link
2025-01-15 Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications Jin Chen et.al. 2501.08563 link
2025-01-15 LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation Yiran Tao et.al. 2501.08558 null
2025-01-15 The Devil is in Temporal Token: High Quality Video Reasoning Segmentation Sitong Gong et.al. 2501.08549 null
2025-01-15 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-15 Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation Jiaxin Guo et.al. 2501.08523 null
2025-01-14 Quantifying the Importance of Data Alignment in Downstream Model Performance Krrish Chawla et.al. 2501.08496 null
2025-01-14 Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition Md Meem Hossain et.al. 2501.08471 null
2025-01-14 Selective Attention Merging for low resource tasks: A case study of Child ASR Natarajan Balaji Shankar et.al. 2501.08468 link
2025-01-14 Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin Joao Carmo de Almeida Neto et.al. 2501.08464 null
2025-01-14 Large Language Models For Text Classification: Case Study And Comprehensive Review Arina Kostina et.al. 2501.08457 null
2025-01-14 Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack Sagiv Antebi et.al. 2501.08454 null
2025-01-14 Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies Ajwad Abrar et.al. 2501.08441 null
2025-01-14 SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models Anurag Kumar et.al. 2501.08421 null
2025-01-14 Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation Seyed Mehdi Rakhtala et.al. 2501.08420 null
2025-01-14 Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data Jiaxing Qiu et.al. 2501.08413 link
2025-01-14 OptiChat: Bridging Optimization Models and Practitioners with Large Language Models Hao Chen et.al. 2501.08406 link
2025-01-14 Towards Best Practices for Open Datasets for LLM Training Stefan Baack et.al. 2501.08365 null
2025-01-14 Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Ryan Burgert et.al. 2501.08331 link
2025-01-14 PokerBench: Training Large Language Models to become Professional Poker Players Richard Zhuang et.al. 2501.08328 link
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 null
2025-01-14 Exploring Robustness of Multilingual LLMs on Real-World Noisy Data Amirhossein Aliakbarzadeh et.al. 2501.08322 link
2025-01-14 Enhancing Automated Interpretability with Output-Centric Feature Descriptions Yoav Gur-Arieh et.al. 2501.08319 link
2025-01-14 MiniMax-01: Scaling Foundation Models with Lightning Attention MiniMax et.al. 2501.08313 null
2025-01-14 HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Abhilasha Ravichander et.al. 2501.08292 null
2025-01-14 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding Hongyu Li et.al. 2501.08282 link
2025-01-14 Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing Pulkit Arora et.al. 2501.08276 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 null
2025-01-14 Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Yifu Qiu et.al. 2501.08248 null
2025-01-14 Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints Jonathan Nöther et.al. 2501.08246 null
2025-01-14 CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset Jiawei Du et.al. 2501.08238 null
2025-01-14 Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings Paul Joe Maliakel et.al. 2501.08219 null
2025-01-14 ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems Mohita Chowdhury et.al. 2501.08208 null
2025-01-14 ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving Zain Ul Abedin et.al. 2501.08203 null
2025-01-14 CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Jinjun Peng et.al. 2501.08200 link
2025-01-14 OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Yijiong Yu et.al. 2501.08197 link
2025-01-14 PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving Ahmet Caner Yüzügüler et.al. 2501.08192 null
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-15 A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following Yin Fang et.al. 2501.08187 link
2025-01-14 Potential and Perils of Large Language Models as Judges of Unstructured Textual Data Rewina Bedemariam et.al. 2501.08167 null
2025-01-14 I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution Soohyeon Choi et.al. 2501.08165 null
2025-01-14 Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data Phai Vu Dinh et.al. 2501.08149 null
2025-01-14 Refusal Behavior in Large Language Models: A Nonlinear Perspective Fabian Hildebrandt et.al. 2501.08145 link
2025-01-14 Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying Jonathan Lyhs et.al. 2501.08142 null
2025-01-14 Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 Seamie Hayes et.al. 2501.08118 null
2025-01-15 Consistency of Responses and Continuations Generated by Large Language Models on Social Media Wenlu Fan et.al. 2501.08102 null
2025-01-14 Hierarchical Autoscaling for Large Language Model Serving with Chiron Archit Patke et.al. 2501.08090 null
2025-01-14 Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving Nert Keser et.al. 2501.08083 null
2025-01-14 CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning Guoliang He et.al. 2501.08071 link
2025-01-14 A Roadmap to Guide the Integration of LLMs in Hierarchical Planning Israel Puerta-Merino et.al. 2501.08068 null
2025-01-14 Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT Awritrojit Banerjee et.al. 2501.08053 null
2025-01-14 TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Yao Liang et.al. 2501.08008 null
2025-01-14 LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS Muhammad Ashfaq et.al. 2501.07992 null
2025-01-14 Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jiaxing Zhao et.al. 2501.07978 null
2025-01-14 Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models Yifang Xu et.al. 2501.07972 null
2025-01-14 Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning Jiaqi Hua et.al. 2501.07959 link
2025-01-14 AI Guide Dog: Egocentric Path Prediction on Smartphone Aishwarya Jadhav et.al. 2501.07957 null
2025-01-14 Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations Waqar Hussain et.al. 2501.07931 null
2025-01-14 Gandalf the Red: Adaptive Security for LLMs Niklas Pfister et.al. 2501.07927 link
2025-01-14 VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models Hui Kuurila-Zhang et.al. 2501.07922 link
2025-01-14 Large Language Model Interface for Home Energy Management Systems François Michelon et.al. 2501.07919 null
2025-01-14 Bridge-SR: Schrödinger Bridge for Efficient SR Chang Li et.al. 2501.07897 null
2025-01-14 Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs Shuai Wang et.al. 2501.07892 null
2025-01-14 ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding Zhongxiang Sun et.al. 2501.07861 null
2025-01-14 Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques Shobhit Ratan et.al. 2501.07853 null
2025-01-14 Unveiling Provider Bias in Large Language Models for Code Generation Xiaoyu Zhang et.al. 2501.07849 null
2025-01-14 Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning Haoyu Han et.al. 2501.07845 null
2025-01-14 A Driver Advisory System Based on Large Language Model for High-speed Train Y. C. Luo et.al. 2501.07837 null
2025-01-14 Flow: A Modular Approach to Automated Agentic Workflow Generation Boye Niu et.al. 2501.07834 null
2025-01-14 Real-time Verification and Refinement of Language Model Text Generation Joonho Ko et.al. 2501.07824 null
2025-01-14 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding Haomiao Xiong et.al. 2501.07819 link
2025-01-14 A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models Kaustubh D. Dhole et.al. 2501.07818 null
2025-01-14 Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models Dhruv Dhamani et.al. 2501.07815 null
2025-01-14 Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering Feijie Wu et.al. 2501.07813 null
2025-01-14 CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation Ruwei Pan et.al. 2501.07811 null
2025-01-14 Visual Language Models as Operator Agents in the Space Domain Alejandro Carrasco et.al. 2501.07802 null
2025-01-14 Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Zhaokai Wang et.al. 2501.07783 link
2025-01-14 Symmetry-Aware Generative Modeling through Learned Canonicalization Kusha Sareen et.al. 2501.07773 null
2025-01-14 Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey Bingchen Liu et.al. 2501.07766 null
2025-01-14 On the Statistical Capacity of Deep Generative Models Edric Tam et.al. 2501.07763 link
2025-01-13 Advancing Student Writing Through Automated Syntax Feedback Kamyar Zeinalipour et.al. 2501.07740 null
2025-01-13 Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens Dongwon Kim et.al. 2501.07730 null
2025-01-13 LLMic: Romanian Foundation Language Model Vlad-Andrei Bădoiu et.al. 2501.07721 null
2025-01-13 CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory Haokun Zhao et.al. 2501.07674 null
2025-01-13 Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning Karishma Thakrar et.al. 2501.07663 null
2025-01-13 Large Language Models for Interpretable Mental Health Diagnosis Brian Hyeongseok Kim et.al. 2501.07653 null
2025-01-13 BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Weixi Feng et.al. 2501.07647 null
2025-01-13 GPT as a Monte Carlo Language Tree: A Probabilistic Perspective Kun-Peng Ning et.al. 2501.07641 null
2025-01-13 SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models Fabien Bernier et.al. 2501.07639 null
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-13 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Chengzu Li et.al. 2501.07542 null
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 null
2025-01-13 Investigating Large Language Models in Inferring Personality Traits from User Conversations Jianfeng Zhu et.al. 2501.07532 null
2025-01-13 RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment Difei Gu et.al. 2501.07525 link
2025-01-13 Parallel Key-Value Cache Fusion for Position Invariant RAG Philhoon Oh et.al. 2501.07523 null
2025-01-13 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards Yangsibo Huang et.al. 2501.07493 null
2025-01-13 TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models Thales Sales Almeida et.al. 2501.07482 null
2025-01-13 A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities Yihao Liu et.al. 2501.07468 null
2025-01-13 Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI Rolf Pfister et.al. 2501.07458 null
2025-01-13 Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection Xin Yin et.al. 2501.07425 null
2025-01-13 Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion Lala Shakti Swarup Ray et.al. 2501.07408 null
2025-01-13 OCORD: Open-Campus Object Removal Dataset Shuo Zhang et.al. 2501.07397 null
2025-01-13 Simulating the Hubbard Model with Equivariant Normalizing Flows Dominic Schuh et.al. 2501.07371 null
2025-01-13 Emergent effects of scaling on the functional hierarchies within large language models Paul C. Bogdan et.al. 2501.07359 null
2025-01-13 Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring Buse Sibel Korkmaz et.al. 2501.07324 link
2025-01-13 FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering Erik Henriksson et.al. 2501.07314 link
2025-01-13 The Lessons of Developing Process Reward Models in Mathematical Reasoning Zhenru Zhang et.al. 2501.07301 null
2025-01-13 GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction Oleg Kobzarev et.al. 2501.07295 null
2025-01-13 LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks Zan-Kai Chong et.al. 2501.07288 null
2025-01-13 Lifelong Learning of Large Language Model based Agents: A Roadmap Junhao Zheng et.al. 2501.07278 link
2025-01-13 Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation Amir Sartipi et.al. 2501.07276 null
2025-01-13 Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics Wonduk Seo et.al. 2501.07267 null
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-13 EdgeTAM: On-Device Track Anything Model Chong Zhou et.al. 2501.07256 null
2025-01-13 Large Language Models: New Opportunities for Access to Science Jutta Schnabel et.al. 2501.07250 null
2025-01-13 Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training Ziqing Wen et.al. 2501.07237 link
2025-01-13 Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction Qiaoqiao Ren et.al. 2501.07224 link
2025-01-13 Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing Laifa Tao et.al. 2501.07191 null
2025-01-13 Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study Huashan Chen et.al. 2501.07165 null
2025-01-13 AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model Bangchen Yin et.al. 2501.07155 link
2025-01-13 LLM360 K2: Scaling Up 360-Open-Source Large Language Models Zhengzhong Liu et.al. 2501.07124 null
2025-01-13 How GPT learns layer by layer Jason Du et.al. 2501.07108 link
2025-01-13 ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training Jiayang Wu et.al. 2501.07078 link
2025-01-13 D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation Zhejun Zhang et.al. 2501.07077 link
2025-01-13 Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values Jing Yao et.al. 2501.07071 null
2025-01-13 Enhancing Image Generation Fidelity via Progressive Prompts Zhen Xiong et.al. 2501.07070 link
2025-01-13 Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities ZeKe Xiao et.al. 2501.07058 null
2025-01-13 SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation Yee-Fan Tan et.al. 2501.07055 null
2025-01-13 PoAct: Policy and Action Dual-Control Agent for Generalized Applications Guozhi Yuan et.al. 2501.07054 null
2025-01-13 ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction Yan Zhang et.al. 2501.07051 link
2025-01-13 Unveiling the Potential of Text in High-Dimensional Time Series Forecasting Xin Zhou et.al. 2501.07048 link
2025-01-13 Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis Luwei Zeng et.al. 2501.07034 null
2025-01-13 A Proposed Large Language Model-Based Smart Search for Archive System Ha Dung Nguyen et.al. 2501.07024 null
2025-01-13 Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps Henry Li et.al. 2501.06999 link
2025-01-13 LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models Mozhgan Nasr Azadani et.al. 2501.06986 link
2025-01-13 Combining LLM decision and RL action selection to improve RL policy for adaptive interventions Karine Karine et.al. 2501.06980 null
2025-01-12 How is Google using AI for internal code migrations? Stoyan Nikolov et.al. 2501.06972 null
2025-01-12 Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives Xinyao Ma et.al. 2501.06964 null
2025-01-12 Comparison of Autoencoders for tokenization of ASL datasets Vouk Praun-Petrovic et.al. 2501.06942 null
2025-01-12 Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy Evgeny Ugolkov et.al. 2501.06939 link
2025-01-12 Harnessing Large Language Models for Disaster Management: A Survey Zhenyu Lei et.al. 2501.06932 null
2025-01-12 Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories Faaiq Waqar et.al. 2501.06921 null
2025-01-12 Risk-Averse Finetuning of Large Language Models Sapana Chaudhary et.al. 2501.06911 link
2025-01-12 Deep Learning and Foundation Models for Weather Prediction: A Survey Jimeng Shi et.al. 2501.06907 null
2025-01-12 A Foundational Generative Model for Breast Ultrasound Image Analysis Haojun Yu et.al. 2501.06869 null
2025-01-12 Transfer Learning of Tabular Data by Finetuning Large Language Models Shourav B. Rabbani et.al. 2501.06863 null
2025-01-12 A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context Noureldin Zahran et.al. 2501.06859 null
2025-01-12 SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training Tianjin Huang et.al. 2501.06842 link
2025-01-12 An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering Zaber Al Hassan Ayon et.al. 2501.06837 null
2025-01-12 X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding Wenqi Zhou et.al. 2501.06835 null
2025-01-12 LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents Augusto Gonzalez-Bonorino et.al. 2501.06834 link
2025-01-12 GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing Ruizhe Ou et.al. 2501.06828 null
2025-01-12 Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification Shijing Chen et.al. 2501.06827 null
2025-01-12 Event Argument Extraction with Enriched Prompts Chen Liang et.al. 2501.06825 link
2025-01-12 A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT Yizhou Zhou et.al. 2501.06819 null
2025-01-12 RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models Keyan Chen et.al. 2501.06809 link
2025-01-12 Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting Yongshuo Zhu et.al. 2501.06808 null
2025-01-12 MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference Wenxuan Zeng et.al. 2501.06807 null
2025-01-12 Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences Liu Yu et.al. 2501.06795 null
2025-01-12 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes Mahmoud Ahmed et.al. 2501.06785 link
2025-01-12 Cost-Effective Robotic Handwriting System with AI Integration Tianyi Huang et.al. 2501.06783 null
2025-01-12 Eliza: A Web3 friendly AI Agent Operating System Shaw Walters et.al. 2501.06781 link
2025-01-12 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Ji Soo Lee et.al. 2501.06761 link
2025-01-12 Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation Shunfan Zheng et.al. 2501.06741 null
2025-01-12 ZOQO: Zero-Order Quantized Optimization Noga Bar et.al. 2501.06736 null
2025-01-12 Better Prompt Compression Without Multi-Layer Perceptrons Edouardo Honig et.al. 2501.06730 null
2025-01-12 Measuring the Robustness of Reference-Free Dialogue Evaluation Systems Justin Vasselli et.al. 2501.06728 link
2025-01-12 Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G Zhiyan Liu et.al. 2501.06726 null
2025-01-12 DRDT3: Diffusion-Refined Decision Test-Time Training Model Xingshuai Huang et.al. 2501.06718 null
2025-01-12 ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian Mykyta Syromiatnikov et.al. 2501.06715 link
2025-01-12 Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management Liu Qianli et.al. 2501.06709 null
2025-01-12 Evaluating Sample Utility for Data Selection by Mimicking Model Weights Tzu-Heng Huang et.al. 2501.06708 null
2025-01-12 AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds Yinfang Chen et.al. 2501.06706 null
2025-01-12 Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese Jie Yang et.al. 2501.06704 null
2025-01-12 Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users’ Questions Aidan Hogan et.al. 2501.06699 null
2025-01-12 DVM: Towards Controllable LLM Agents in Social Deduction Games Zheng Zhang et.al. 2501.06695 null
2025-01-12 TAPO: Task-Referenced Adaptation for Prompt Optimization Wenxin Luo et.al. 2501.06689 link
2025-01-12 Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning Xiangen Hu et.al. 2501.06682 null
2025-01-12 Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving Haoxiang Gao et.al. 2501.06680 null
2025-01-11 Challenging reaction prediction models to generalize to novel chemistry John Bradshaw et.al. 2501.06669 link
2025-01-11 Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training Sanjit Kakarla et.al. 2501.06658 link
2025-01-11 FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings Tong Liu et.al. 2501.06645 null
2025-01-11 Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models Veronika Smilga et.al. 2501.06638 link
2025-01-11 Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach Mohammed Maree et.al. 2501.06628 null
2025-01-11 Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks Amr Almorsi et.al. 2501.06625 null
2025-01-11 Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks Xuanhao Luo et.al. 2501.06604 null
2025-01-11 ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation Xuanle Zhao et.al. 2501.06598 link
2025-01-11 ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning Xiangru Tang et.al. 2501.06590 link
2025-01-11 Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping Muru Zhang et.al. 2501.06589 link
2025-01-10 LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Omkar Thawakar et.al. 2501.06186 link
2025-01-10 PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Yangyu Huang et.al. 2501.06184 null
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-10 GenMol: A Drug Discovery Generalist with Discrete Diffusion Seul Lee et.al. 2501.06158 null
2025-01-10 Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories Gerd Kortemeyer et.al. 2501.06143 null
2025-01-10 Supervision policies can shape long-term risk management in general-purpose AI models Manuel Cebrian et.al. 2501.06137 link
2025-01-10 Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI Yuya Asano et.al. 2501.06129 null
2025-01-10 Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding Fabian David Schmidt et.al. 2501.06117 link
2025-01-10 From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy Elham Aghakhani et.al. 2501.06101 null
2025-01-10 Photokinetics of Photothermal Reactions Mounir Maafi et.al. 2501.06057 null
2025-01-10 AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery Johann Wenckstern et.al. 2501.06039 link
2025-01-10 Addressing speaker gender bias in large scale speech translation systems Shubham Bansal et.al. 2501.05989 null
2025-01-10 Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing Eklavya Sarkar et.al. 2501.05987 link
2025-01-10 Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys Divya Mani Adhikari et.al. 2501.05985 null
2025-01-10 Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea Eunjung Cho et.al. 2501.05981 null
2025-01-10 Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory Yunmeng Shu et.al. 2501.05965 null
2025-01-10 Effective faking of verbal deception detection with target-aligned adversarial attacks Bennett Kleinberg et.al. 2501.05962 null
2025-01-10 Reusable specimen-level inference in computational pathology Jakub R. Kaczmarzyk et.al. 2501.05945 link
2025-01-10 DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information Yongfan Lai et.al. 2501.05932 link
2025-01-10 LLMs Reproduce Stereotypes of Sexual and Gender Minorities Ruby Ostrow et.al. 2501.05926 null
2025-01-10 Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction Petraq Nako et.al. 2501.05925 null
2025-01-10 Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Ziheng Wu et.al. 2501.05901 link
2025-01-10 Prompt engineering and its implications on the energy consumption of Large Language Models Riccardo Rubei et.al. 2501.05899 link
2025-01-10 Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs Bianca Raimondi et.al. 2501.05891 link
2025-01-10 Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs Dabing Cheng et.al. 2501.05884 null
2025-01-10 VideoRAG: Retrieval-Augmented Generation over Video Corpus Soyeong Jeong et.al. 2501.05874 null
2025-01-10 ConSim: Measuring Concept-Based Explanations’ Effectiveness with Automated Simulatability Antonin Poché et.al. 2501.05855 link
2025-01-10 Understanding Impact of Human Feedback via Influence Functions Taywon Min et.al. 2501.05790 link
2025-01-10 Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models You Li et.al. 2501.05767 null
2025-01-10 Controlling Large Language Models Through Concept Activation Vectors Hanyu Zhang et.al. 2501.05764 null
2025-01-10 StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation Shangjin Zhai et.al. 2501.05763 null
2025-01-10 CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech Madhurananda Pahar et.al. 2501.05755 null
2025-01-10 Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models Sungjae Lee et.al. 2501.05752 null
2025-01-10 TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos Korawat Charoenpitaks et.al. 2501.05733 link
2025-01-10 Enabling Scalable Oversight via Self-Evolving Critic Zhengyang Tang et.al. 2501.05727 null
2025-01-10 I Can’t Share Code, but I need Translation – An Empirical Study on Code Translation through Federated LLM Jahnavi Kumar et.al. 2501.05724 null
2025-01-10 How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond Chen Huang et.al. 2501.05714 null
2025-01-10 Multi-Step Reasoning in Korean and the Emergent Mirage Guijin Son et.al. 2501.05712 null
2025-01-10 EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model Yi He et.al. 2501.05710 null
2025-01-10 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Vighnesh Subramaniam et.al. 2501.05707 null
2025-01-10 Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness Audrey Salmon et.al. 2501.05706 null
2025-01-10 Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection Feiyi Chen et.al. 2501.05675 null
2025-01-10 Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration Zuyuan Zhang et.al. 2501.05673 null
2025-01-10 Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models Zheqi Lv et.al. 2501.05662 null
2025-01-10 Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation Zheqi Lv et.al. 2501.05647 null
2025-01-10 Iconicity in Large Language Models Anna Marklová et.al. 2501.05643 null
2025-01-10 HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection Anant Mehta et.al. 2501.05631 link
2025-01-10 The Impact of Model Scaling on Seen and Unseen Language Performance Rhitabrat Pokharel et.al. 2501.05629 null
2025-01-09 Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study Zhenyu Qi et.al. 2501.05625 null
2025-01-09 Exploring Large Language Models for Translating Romanian Computational Problems into English Adrian Marius Dumitran et.al. 2501.05601 null
2025-01-09 Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics Gert Aarts et.al. 2501.05580 null
2025-01-09 Exploring Large Language Models (LLMs) through interactive Python activities Eugenio Tufino et.al. 2501.05577 link
2025-01-09 LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts Yuri Facanha Bezerra et.al. 2501.05554 link
2025-01-09 The dynamics of meaning through time: Assessment of Large Language Models Mohamed Taher Alrefaie et.al. 2501.05552 null
2025-01-09 Infecting Generative AI With Viruses David Noever et.al. 2501.05542 null
2025-01-09 NSChat: A Chatbot System To Rule Them All Zenon Lamprou et.al. 2501.05541 null
2025-01-09 ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Xingyu Fu et.al. 2501.05452 null
2025-01-09 Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Yifan Yu et.al. 2501.05446 link
2025-01-09 Consistent Flow Distillation for Text-to-3D Generation Runjie Yan et.al. 2501.05445 null
2025-01-09 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Yunzhuo Hao et.al. 2501.05444 null
2025-01-09 A survey of textual cyber abuse detection using cutting-edge language models and large language models Jose A. Diaz-Garcia et.al. 2501.05443 null
2025-01-09 Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation Xuyi Meng et.al. 2501.05427 null
2025-01-09 Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers Jerry Chongyi Hu et.al. 2501.05423 null
2025-01-09 Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation Darius Petermann et.al. 2501.05413 null
2025-01-10 Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics Maximilian Alber et.al. 2501.05409 null
2025-01-09 TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts Yu-Hao Huang et.al. 2501.05403 null
2025-01-09 Mechanistic understanding and validation of large AI models with SemanticLens Maximilian Dreyer et.al. 2501.05398 null
2025-01-09 FairCode: Evaluating Social Bias of LLMs in Code Generation Yongkang Du et.al. 2501.05396 link
2025-01-09 Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models Kristian G. Barman et.al. 2501.05382 null
2025-01-09 Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance Dimitrios Gerogiannis et.al. 2501.05379 null
2025-01-09 Accelerated Diffusion Models via Speculative Sampling Valentin De Bortoli et.al. 2501.05370 null
2025-01-09 Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction Hantao Lou et.al. 2501.05336 link
2025-01-09 “What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles Xuewen Luo et.al. 2501.05322 null
2025-01-09 Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning Nora Gourmelon et.al. 2501.05281 link
2025-01-09 CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models Fabian Hörst et.al. 2501.05269 link
2025-01-09 Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal Wanli Ma et.al. 2501.05265 null
2025-01-09 CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models Yewei Song et.al. 2501.05255 null
2025-01-09 From Scientific Texts to Verifiable Code: Automating the Process with Transformers Changjie Wang et.al. 2501.05252 null
2025-01-09 RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models Peizhuo Lv et.al. 2501.05249 null
2025-01-09 Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning Laura Puccioni et.al. 2501.05248 null
2025-01-09 Online Prompt and Solver Selection for Program Synthesis Yixuan Li et.al. 2501.05247 null
2025-01-09 Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs Artem Fedorchenko et.al. 2501.05234 null
2025-01-09 Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection Pei-Kang Lee et.al. 2501.05228 null
2025-01-09 Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Ludwic Leonard et.al. 2501.05226 null
2025-01-09 Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond Tomas Goldsack et.al. 2501.05224 null
2025-01-09 A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education Ziqing Li et.al. 2501.05220 null
2025-01-09 Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration Xuyang Liu et.al. 2501.05179 link
2025-01-09 Emergence of human-like polarization among large language model agents Jinghua Piao et.al. 2501.05171 null
2025-01-09 Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering Matteo Esposito et.al. 2501.05165 null
2025-01-09 Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier Yufei Shang et.al. 2501.05155 null
2025-01-09 DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving Xuran Zheng et.al. 2501.05081 null
2025-01-09 Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization Harshith Manjunath et.al. 2501.05079 null
2025-01-09 Analyzing Memorization in Large Language Models through the Lens of Model Attribution Tarun Ram Menta et.al. 2501.05078 link
2025-01-09 A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model Shuo Tong et.al. 2501.05075 null
2025-01-09 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Huabin Liu et.al. 2501.05069 null
2025-01-09 LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding Jiaxing Zhao et.al. 2501.05067 null
2025-01-09 Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators Leonard Lupin-Jimenez et.al. 2501.05058 null
2025-01-09 LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models Zengqi Peng et.al. 2501.05057 null
2025-01-09 On the Generalizability of Transformer Models to Code Completions of Different Lengths Nathan Cooper et.al. 2501.05051 null
2025-01-09 SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution Chengxing Xie et.al. 2501.05040 link
2025-01-09 Enhancing Human-Like Responses in Large Language Models Ethem Yağız Çalık et.al. 2501.05032 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-09 A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications Ofir Marom et.al. 2501.05030 null
2025-01-09 TreeKV: Smooth Key-Value Cache Compression with Tree Structures Ziwei He et.al. 2501.04987 null
2025-01-09 SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs Muhammad Salman et.al. 2501.04985 null
2025-01-09 V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer Hangzhou He et.al. 2501.04975 link
2025-01-09 Demystifying Domain-adaptive Post-training for Financial LLMs Zixuan Ke et.al. 2501.04961 link
2025-01-09 Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments Yifan Xu et.al. 2501.04947 null
2025-01-09 Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models Qingyu Ren et.al. 2501.04945 link
2025-01-09 Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency Shiji Zhao et.al. 2501.04931 null
2025-01-09 Investigating Numerical Translation with Large Language Models Wei Tang et.al. 2501.04927 null
2025-01-09 FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching Jun-Hak Yun et.al. 2501.04926 null
2025-01-09 HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers Yiyao Yang et.al. 2501.04908 link
2025-01-09 JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis Jun-Hyeok Cha et.al. 2501.04904 null
2025-01-09 ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries Keke Huang et.al. 2501.04901 null
2025-01-09 SUGAR: Leveraging Contextual Confidence for Smarter Retrieval Hanna Zubkova et.al. 2501.04899 null
2025-01-08 Leveraging Log Probabilities in Language Models to Forecast Future Events Tommaso Soru et.al. 2501.04880 null
2025-01-08 Real-Time Textless Dialogue Generation Long Mai et.al. 2501.04877 link
2025-01-08 Modelling complex proton transport phenomena – Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields Malte Grunert et.al. 2501.04876 null
2025-01-08 Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware Brandon J Walton et.al. 2501.04848 null
2025-01-08 Do Code LLMs Understand Design Patterns? Zhenyu Pan et.al. 2501.04835 null
2025-01-08 On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability Andreas Vogelsang et.al. 2501.04810 null
2025-01-08 IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX Erik Recio-Armengol et.al. 2501.04776 link
2025-01-08 Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations Kirandeep Kaur et.al. 2501.04762 null
2025-01-08 Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch Phillip Richter et.al. 2501.04755 null
2025-01-08 EditAR: Unified Conditional Generation with Autoregressive Models Jiteng Mu et.al. 2501.04699 null
2025-01-08 Re-ranking the Context for Multimodal Retrieval Augmented Generation Matin Mortaheb et.al. 2501.04695 null
2025-01-08 SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Zixuan Huang et.al. 2501.04689 null
2025-01-08 URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Ruilin Luo et.al. 2501.04686 link
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 null
2025-01-08 Assessing Language Comprehension in Large Language Models Using Construction Grammar Wesley Scivetti et.al. 2501.04661 null
2025-01-08 Multi-task retriever fine-tuning for domain-specific and efficient RAG Patrice Béchard et.al. 2501.04652 null
2025-01-08 FlairGPT: Repurposing LLMs for Interior Designs Gabrielle Littlefair et.al. 2501.04648 null
2025-01-08 Knowledge Retrieval Based on Generative AI Te-Lun Yang et.al. 2501.04635 null
2025-01-08 “Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era Giulio Antonio Abbo et.al. 2501.04633 null
2025-01-09 MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation Daniele Molino et.al. 2501.04614 null
2025-01-08 Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning Ivan Kankeu et.al. 2501.04591 link
2025-01-08 Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models Miaoyang He et.al. 2501.04582 null
2025-01-08 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Yuhang Liu et.al. 2501.04575 link
2025-01-09 OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis Run Luo et.al. 2501.04561 link
2025-01-08 The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? Christopher Lazik et.al. 2501.04543 null
2025-01-08 Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time Uri Berger et.al. 2501.04513 null
2025-01-08 CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection Ruijun Feng et.al. 2501.04510 null
2025-01-08 Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction Guofeng Yang et.al. 2501.04487 null
2025-01-08 When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages Archchana Sindhujan et.al. 2501.04473 null
2025-01-08 Hidden Entity Detection from GitHub Leveraging Large Language Models Lu Gan et.al. 2501.04455 link
2025-01-08 Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions Doaa Mahmud et.al. 2501.04437 null
2025-01-08 Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions Na Yan et.al. 2501.04436 null
2025-01-08 End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach H. M. Shadman Tabib et.al. 2501.04425 null
2025-01-08 SEO: Stochastic Experience Optimization for Large Language Models Jitao Xu et.al. 2501.04393 null
2025-01-08 iFADIT: Invertible Face Anonymization via Disentangled Identity Transform Lin Yuan et.al. 2501.04390 null
2025-01-08 DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications Feng Liu et.al. 2501.04366 link
2025-01-08 Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting Dong-Hai Zhu et.al. 2501.04341 link
2025-01-09 Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models Haonan Shi et.al. 2501.04323 null
2025-01-08 Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts Preethi Seshadri et.al. 2501.04316 link
2025-01-08 RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation Jun Liu et.al. 2501.04315 null
2025-01-08 Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models Kunpeng Zhang et.al. 2501.04312 null
2025-01-08 LLM4SR: A Survey on Large Language Models for Scientific Research Ziming Luo et.al. 2501.04306 link
2025-01-08 Multimodal Graph Constrastive Learning and Prompt for ChartQA Yue Dai et.al. 2501.04303 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 An Analysis of Model Robustness across Concurrent Distribution Shifts Myeongho Jeon et.al. 2501.04288 null
2025-01-08 Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models Bahman Torkamandi et.al. 2501.04286 null
2025-01-08 Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking Tianqi Ren et.al. 2501.04285 null
2025-01-08 OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments Yujie Tang et.al. 2501.04279 null
2025-01-08 Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering Christophe Bajan et.al. 2501.04277 link
2025-01-08 Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation Senwei Xie et.al. 2501.04268 null
2025-01-08 Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning Lang Xu et.al. 2501.04266 null
2025-01-08 IOLBENCH: Benchmarking LLMs on Linguistic Reasoning Satyam Goyal et.al. 2501.04249 link
2025-01-08 TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events Jian-Hua Fang et.al. 2501.04247 null
2025-01-08 Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks Rachel Longjohn et.al. 2501.04234 null
2025-01-07 Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation Alireza Salemi et.al. 2501.04167 null
2025-01-07 AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots Shaoyue Wen et.al. 2501.04156 link
2025-01-07 Multilingual Open QA on the MIA Shared Task Navya Yarrabelly et.al. 2501.04153 null
2025-01-07 The angular momentum spiral of the Milky Way disc in Gaia Rashid Yaaqib et.al. 2501.04095 null
2025-01-07 More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives Xiaoqing Zhang et.al. 2501.04070 link
2025-01-07 ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono Jingquan Wang et.al. 2501.04062 null
2025-01-07 LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Lingdong Kong et.al. 2501.04005 null
2025-01-07 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Haobo Yuan et.al. 2501.04001 link
2025-01-07 RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance Matin Mortaheb et.al. 2501.03995 null
2025-01-07 Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance Adil Rengim Cetingoz et.al. 2501.03993 null
2025-01-07 Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles Yuxi Xia et.al. 2501.03991 null
2025-01-07 (De)-Indexing and the Right to be Forgotten Salvatore Vilella et.al. 2501.03989 null
2025-01-07 VLM-driven Behavior Tree for Context-aware Task Planning Naoki Wake et.al. 2501.03968 link
2025-01-07 Vision Language Models as Values Detectors Giulio Antonio Abbo et.al. 2501.03957 null
2025-01-07 Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States Jurgita Kapočiūtė-Dzikienė et.al. 2501.03952 null
2025-01-07 Synthetic Data Privacy Metrics Amy Steier et.al. 2501.03941 null
2025-01-07 Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection Pablo Miralles-González et.al. 2501.03940 null
2025-01-07 A precise asymptotic analysis of learning diffusion models: theory and insights Hugo Cui et.al. 2501.03937 link
2025-01-07 Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study Ramya Jonnala et.al. 2501.03904 null
2025-01-07 LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Shaolei Zhang et.al. 2501.03895 link
2025-01-07 AlphaPO – Reward shape matters for LLM alignment Aman Gupta et.al. 2501.03884 null
2025-01-07 CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds Keonwoo Kim et.al. 2501.03879 null
2025-01-07 Progressive Document-level Text Simplification via Large Language Models Dengzhao Fang et.al. 2501.03857 null
2025-01-07 MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention Aadya Arora et.al. 2501.03839 null
2025-01-07 Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging Simon W. Penninga et.al. 2501.03825 null
2025-01-08 MADation: Face Morphing Attack Detection with Foundation Models Eduarda Caldeira et.al. 2501.03800 link
2025-01-07 KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration Chengyuan Li et.al. 2501.03786 null
2025-01-07 Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series Yuxiao Hu et.al. 2501.03747 null
2025-01-07 Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein Xiaotong Guo et.al. 2501.03722 null
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-07 SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment Yuchun Fan et.al. 2501.03681 link
2025-01-07 Effective and Efficient Mixed Precision Quantization of Speech Foundation Models Haoning Xu et.al. 2501.03643 null
2025-01-07 CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems Zhaonan Wu et.al. 2501.03626 link
2025-01-07 LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment Gaoussou Youssouf Kebe et.al. 2501.03624 null
2025-01-07 Cosmos World Foundation Model Platform for Physical AI NVIDIA et.al. 2501.03575 link
2025-01-07 From Code to Compliance: Assessing ChatGPT’s Utility in Designing an Accessible Webpage – A Case Study Ammar Ahmed et.al. 2501.03572 null
2025-01-07 What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs Muneera Bano et.al. 2501.03569 null
2025-01-07 Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities Benedikt Reitemeyer et.al. 2501.03566 null
2025-01-07 Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis Haoran Lai et.al. 2501.03565 null
2025-01-07 PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models Lingzhi Yuan et.al. 2501.03544 null
2025-01-07 Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions Weijieying Ren et.al. 2501.03540 null
2025-01-07 Deep Learning for Pathological Speech: A Survey Shakeel A. Sheikh et.al. 2501.03536 null
2025-01-08 SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving Xuewen Luo et.al. 2501.03535 null
2025-01-07 A generative approach for lensless imaging in low-light conditions Ziyang Liu et.al. 2501.03511 null
2025-01-07 A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models Shuyang Wang et.al. 2501.03508 null
2025-01-07 Textualize Visual Prompt for Image Editing via Diffusion Bridge Pengcheng Xu et.al. 2501.03495 null
2025-01-07 Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment Prashant Trivedi et.al. 2501.03486 null
2025-01-07 Reading with Intent – Neutralizing Intent Benjamin Reichman et.al. 2501.03475 null
2025-01-07 Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning Chuang Niu et.al. 2501.03469 link
2025-01-07 MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems Yannis Katsis et.al. 2501.03468 link
2025-01-07 ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation Yu-Cheng Liu et.al. 2501.03462 null
2025-01-07 Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation Xiao Wang et.al. 2501.03458 link
2025-01-07 CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering Jialiang Chen et.al. 2501.03447 null
2025-01-07 LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models Mohamad Fakih et.al. 2501.03446 null
2025-01-07 Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology Sarah E. Finch et.al. 2501.03441 link
2025-01-06 SALT: Sales Autocompletion Linked Business Tables Dataset Tassilo Klein et.al. 2501.03413 link
2025-01-06 BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations Simone Giovannini et.al. 2501.03403 null
2025-01-06 DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes Xuyang Wang et.al. 2501.03397 link
2025-01-06 Evolved Quantum Boltzmann Machines Michele Minervini et.al. 2501.03367 null
2025-01-06 CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets Tanay Agrawal et.al. 2501.03332 null
2025-01-06 LiLMaps: Learnable Implicit Language Maps Evgenii Kruzhkov et.al. 2501.03304 null
2025-01-06 A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval Shuo Tong et.al. 2501.03295 null
2025-01-06 Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model Naibo Wang et.al. 2501.03292 null
2025-01-06 ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning Pengwei Tang et.al. 2501.03291 null
2025-01-06 CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models Zhenyu Xu et.al. 2501.03288 null
2025-01-06 BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Beichen Zhang et.al. 2501.03226 link
2025-01-06 Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text Ayat Najjar et.al. 2501.03212 null
2025-01-06 Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity Ayat A. Najjar et.al. 2501.03203 null
2025-01-06 CLIX: Cross-Lingual Explanations of Idiomatic Expressions Aaron Gluck et.al. 2501.03191 null
2025-01-06 Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text Ali Al-Lawati et.al. 2501.03166 link
2025-01-06 Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy Risha Goel et.al. 2501.03153 link
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 null
2025-01-06 VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity Yerong Li et.al. 2501.03139 null
2025-01-07 PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Mingyang Song et.al. 2501.03124 link
2025-01-06 CAT: Content-Adaptive Image Tokenization Junhong Shen et.al. 2501.03120 null
2025-01-06 LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases Dylan Bouchard et.al. 2501.03112 link
2025-01-06 Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling Aseem Srivastava et.al. 2501.03088 null
2025-01-06 Retrieval-Augmented TLAPS Proof Generation with Large Language Models Yuhao Zhou et.al. 2501.03073 null
2025-01-06 ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events Duygu Sezen Islakoglu et.al. 2501.03040 null
2025-01-06 Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning Zhen Li et.al. 2501.03035 null
2025-01-06 TransPixar: Advancing Text-to-Video Generation with Transparency Luozhou Wang et.al. 2501.03006 link
2025-01-06 CALM: Curiosity-Driven Auditing for Large Language Models Xiang Zheng et.al. 2501.02997 link
2025-01-06 Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation Zhi Qu et.al. 2501.02979 link
2025-01-06 FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2501.02968 null
2025-01-07 Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild Wanpeng Hu et.al. 2501.02964 link
2025-01-07 SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild Jiawei Liu et.al. 2501.02962 null
2025-01-06 The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features Shi Bin Hoo et.al. 2501.02945 link
2025-01-07 Inhibition of bacterial growth by antibiotics Barnabe Ledoux et.al. 2501.02944 null
2025-01-06 Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions Jianhua Pei et.al. 2501.02928 null
2025-01-06 DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model Hao Yu et.al. 2501.02901 link
2025-01-06 FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection Guray Ozgur et.al. 2501.02892 link
2025-01-06 MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs Hui Sun et.al. 2501.02885 null
2025-01-06 IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment Yiming Zhang et.al. 2501.02869 null
2025-01-06 Large Language Models for Video Surveillance Applications Ulindu De Silva et.al. 2501.02850 null
2025-01-06 Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification Yubo Wang et.al. 2501.02844 null
2025-01-06 Foundations of GenIR Qingyao Ai et.al. 2501.02842 null
2025-01-06 An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models Dezhi Ran et.al. 2501.02829 null
2025-01-06 InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion Zhaoyi Yan et.al. 2501.02795 null
2025-01-06 CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation Yuanhong Chen et.al. 2501.02786 null
2025-01-06 GeAR: Generation Augmented Retrieval Haoyu Liu et.al. 2501.02772 null
2025-01-06 Visual Large Language Models for Generalized and Specialized Applications Yifan Li et.al. 2501.02765 link
2025-01-06 Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? Hongyi Miao et.al. 2501.02751 null
2025-01-06 Artificial Intelligence in Creative Industries: Advances Prior to 2025 Nantheera Anantrasirichai et.al. 2501.02725 null
2025-01-06 KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models Zaiyi Zheng et.al. 2501.02711 null
2025-01-06 QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance Binita Saha et.al. 2501.02702 null
2025-01-06 EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models Andrés Villa et.al. 2501.02699 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-05 Decoding specialised feature neurons in LLMs with the final projection layer Harry J Davies et.al. 2501.02688 null
2025-01-05 From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering Wen-ran Li et.al. 2501.02680 null
2025-01-05 A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model Shivaram Kalyanakrishnan et.al. 2501.02652 null
2025-01-05 Representation Learning of Lab Values via Masked AutoEncoder David Restrepo et.al. 2501.02648 link
2025-01-05 Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense Yang Ouyang et.al. 2501.02629 link
2025-01-05 Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets Mahmoud Jahanshahi et.al. 2501.02628 null
2025-01-05 HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Saleh Ashkboos et.al. 2501.02625 null
2025-01-05 LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment Yifei Liu et.al. 2501.02621 null
2025-01-05 TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms Jovan Stojkovic et.al. 2501.02600 null
2025-01-05 LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations Jiaping Wang et.al. 2501.02573 link
2025-01-05 Multi-LLM Collaborative Caption Generation in Scientific Documents Jaeyoung Kim et.al. 2501.02552 link
2025-01-05 Transformers Simulate MLE for Sequence Generation in Bayesian Networks Yuan Cao et.al. 2501.02547 null
2025-01-05 Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm Ljubisa Bojic et.al. 2501.02532 null
2025-01-05 Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI Ljubisa Bojic et.al. 2501.02531 null
2025-01-05 Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks Leo Franklin et.al. 2501.02527 null
2025-01-05 Unified Guidance for Geometry-Conditioned Molecular Generation Sirine Ayadi et.al. 2501.02526 null
2025-01-05 Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors Minglin Chen et.al. 2501.02519 null
2025-01-05 CHAIR-Classifier of Hallucination as Improver Ao Sun et.al. 2501.02518 link
2025-01-05 ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Junjie Ye et.al. 2501.02506 null
2025-01-05 Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons Sebastian Morel-Balbi et.al. 2501.02505 null
2025-01-05 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null
2025-01-05 LLMPC: Large Language Model Predictive Control Gabriel Maher et.al. 2501.02486 link
2025-01-05 Decoding News Bias: Multi Bias Detection in News Articles Bhushan Santosh Shah et.al. 2501.02482 null
2025-01-05 Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine Yishen Liu et.al. 2501.02471 null
2025-01-05 Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera Yuliang Guo et.al. 2501.02464 null
2025-01-05 Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications Zhe Chen et.al. 2501.02460 null
2025-01-05 Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap Hyunwoo Ko et.al. 2501.02448 null
2025-01-05 RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework Kun Wang et.al. 2501.02446 null
2025-01-05 A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models Yinpeng Cai et.al. 2501.02441 null
2025-01-05 Efficient Deployment of Large Language Models on Resource-constrained Devices Zhiwei Yao et.al. 2501.02438 null
2025-01-05 FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance Haicheng Wang et.al. 2501.02430 link
2025-01-05 GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems Mehmet Deniz Türkmen et.al. 2501.02408 null
2025-01-04 Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities Tara Radvand et.al. 2501.02406 null
2025-01-04 Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Markus J. Buehler et.al. 2501.02393 link
2025-01-04 Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations Kangyu Zhu et.al. 2501.02385 null
2025-01-04 Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison Tsz Kin Lam et.al. 2501.02370 null
2025-01-04 Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving Sanghyun Park et.al. 2501.02348 null
2025-01-04 Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support Florian Putz et.al. 2501.02346 null
2025-01-04 UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility Yonglin Tian et.al. 2501.02341 link
2025-01-04 AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference Zhuomin He et.al. 2501.02336 link
2025-01-04 Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications Jodi M. Casabianca et.al. 2501.02334 null
2025-01-04 Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance Marta Gentiloni-Silveri et.al. 2501.02298 null
2025-01-04 Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection Yachao Zhao et.al. 2501.02295 null
2025-01-04 Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control Yongjeong Oh et.al. 2501.02273 null
2025-01-04 What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph Yutao Jiang et.al. 2501.02268 link
2025-01-04 Unsupervised Class Generation to Expand Semantic Segmentation Datasets Javier Montalvo et.al. 2501.02264 null
2025-01-04 Financial Named Entity Recognition: How Far Can LLM Go? Yi-Te Lu et.al. 2501.02237 link
2025-01-04 Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends Camille Barboule et.al. 2501.02235 null
2025-01-04 Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection S M Mostaq Hossain et.al. 2501.02229 null
2025-01-04 Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation Shijie Wang et.al. 2501.02226 null
2025-01-04 Can ChatGPT implement finite element models for geotechnical engineering applications? Taegu Kim et.al. 2501.02199 null
2025-01-04 EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks Shixuan Liu et.al. 2501.02192 null
2025-01-04 On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing Jianwei Wang et.al. 2501.02191 link
2025-01-04 The Application of Large Language Models in Recommendation Systems Peiyang Yu et.al. 2501.02178 null
2025-01-04 The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit Huixue Zhou et.al. 2501.02173 null
2025-01-04 Personalized Graph-Based Retrieval for Large Language Models Steven Au et.al. 2501.02157 link
2025-01-04 Table as Thought: Exploring Structured Thoughts in LLM Reasoning Zhenjie Sun et.al. 2501.02152 null
2025-01-04 Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN Yanxi Chen et.al. 2501.02146 null
2025-01-03 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Chaoyou Fu et.al. 2501.01957 link
2025-01-03 Metadata Conditioning Accelerates Language Model Pre-training Tianyu Gao et.al. 2501.01956 link
2025-01-03 MADGEN – Mass-Spec attends to De Novo Molecular generation Yinkai Wang et.al. 2501.01950 null
2025-01-03 Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap Weizhi Zhang et.al. 2501.01945 link
2025-01-03 Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models Manh Duong Nguyen et.al. 2501.01932 link
2025-01-03 Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Yifan Du et.al. 2501.01904 link
2025-01-03 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Siyuan Huang et.al. 2501.01895 null
2025-01-03 Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions Rachneet Sachdeva et.al. 2501.01872 link
2025-01-03 Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification Xiangxiang Dai et.al. 2501.01849 link
2025-01-03 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 null
2025-01-03 Time Series Language Model for Descriptive Caption Generation Mohamed Trabelsi et.al. 2501.01832 null
2025-01-03 Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models Yanjiang Liu et.al. 2501.01830 null
2025-01-03 SDPO: Segment-Level Direct Preference Optimization for Social Agents Aobo Kong et.al. 2501.01821 link
2025-01-03 BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction Ferhat Ozgur Catak et.al. 2501.01802 link
2025-01-03 Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation Mohammad Khalil et.al. 2501.01793 link
2025-01-03 Efficient LLM Inference with Activation Checkpointing and Hybrid Caching Sanghyeon Lee et.al. 2501.01792 null
2025-01-03 Nonparametric estimation of a factorizable density using diffusion models Hyeok Kyu Kwon et.al. 2501.01783 null
2025-01-03 SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation Mingjie Li et.al. 2501.01765 null
2025-01-03 Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models Andrea Matteazzi et.al. 2501.01761 null
2025-01-03 MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling Simon Rouard et.al. 2501.01757 null
2025-01-03 Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation Kangcheng Luo et.al. 2501.01743 null
2025-01-03 How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models Simone Corbo et.al. 2501.01741 null
2025-01-03 AR4D: Autoregressive 4D Generation from Monocular Videos Hanxin Zhu et.al. 2501.01722 null
2025-01-03 Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models Guosheng Zhang et.al. 2501.01720 null
2025-01-03 LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries Michal Kuk et.al. 2501.01711 null
2025-01-03 MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders Jiajun Cao et.al. 2501.01709 null
2025-01-03 AgentRefine: Enhancing Agent Generalization through Refinement Tuning Dayuan Fu et.al. 2501.01702 null
2025-01-03 Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models Lei Tang et.al. 2501.01679 null
2025-01-03 Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption Zhang Ruoyan et.al. 2501.01672 null
2025-01-03 BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction Alaeddine Diaf et.al. 2501.01664 null
2025-01-03 Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning Danni Peng et.al. 2501.01653 null
2025-01-03 MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments Cai Yin et.al. 2501.01652 link
2025-01-03 HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding Heqing Zou et.al. 2501.01645 null
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-03 Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation Rini Smita Thakur et.al. 2501.01640 null
2025-01-03 A non-ergodic framework for understanding emergent capabilities in Large Language Models Javier Marin et.al. 2501.01638 null
2025-01-03 Revisiting Data Analysis with Pre-trained Foundation Models Chen Liang et.al. 2501.01631 null
2025-01-03 ICPC: In-context Prompt Compression with Faster Inference Ziyang Yu et.al. 2501.01625 null
2025-01-03 PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents Jingoo Lee et.al. 2501.01594 null
2025-01-03 (WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges Mohamed Hisham Abdellatif et.al. 2501.01588 null
2025-01-02 Predicting the Performance of Black-box LLMs through Self-Queries Dylan Sam et.al. 2501.01558 link
2025-01-02 Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs) Jumana Almahmoud et.al. 2501.01545 null
2025-01-02 Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information Rasul Tutnov et.al. 2501.01544 null
2025-01-02 Denoising Diffused Embeddings: a Generative Approach for Hypergraphs Shihao Wu et.al. 2501.01541 null
2025-01-02 BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery Kanishk Gandhi et.al. 2501.01540 link
2025-01-02 SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers Bhavna Gopal et.al. 2501.01529 null
2025-01-02 Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search Shuangtao Li et.al. 2501.01478 null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 link
2025-01-02 Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Jingfeng Yao et.al. 2501.01423 link
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers Seunghyun Lee et.al. 2501.01414 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios Xize Cheng et.al. 2501.01384 null
2025-01-02 ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI Neda Tavakoli et.al. 2501.01372 link
2025-01-02 Aligning Large Language Models for Faithful Integrity Against Opposing Argument Yong Zhao et.al. 2501.01336 link
2025-01-02 CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models Johan Wahréus et.al. 2501.01335 link
2025-01-02 Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension Yanbo Fang et.al. 2501.01332 null
2025-01-02 The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation Shuzheng Gao et.al. 2501.01329 null
2025-01-03 Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking Xiaoxue Cheng et.al. 2501.01306 null
2025-01-02 Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments – The Depression and Anxiety Case Kaushik Roy et.al. 2501.01305 null
2025-01-02 Does a Large Language Model Really Speak in Human-Like Language? Mose Park et.al. 2501.01273 null
2025-01-02 ProgCo: Program Helps Self-Correction of Large Language Models Xiaoshuai Song et.al. 2501.01264 null
2025-01-03 CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Shanghaoran Quan et.al. 2501.01257 null
2025-01-02 Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers? Manuel Weber et.al. 2501.01256 null
2025-01-02 Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion Qiyuan He et.al. 2501.01246 null
2025-01-02 SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization Yongle Huang et.al. 2501.01245 link
2025-01-02 Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants Lixiong Qin et.al. 2501.01243 null
2025-01-02 Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction Alexander Brinkmann et.al. 2501.01237 link
2025-01-03 TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer Jiayu Li et.al. 2501.01216 null
2025-01-02 Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects Abdullah Mushtaq et.al. 2501.01205 null
2025-01-02 HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation Runsong Jia et.al. 2501.01203 null
2025-01-02 LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge Kyoungkook Kang et.al. 2501.01197 null
2025-01-02 Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education Annika Bush et.al. 2501.01192 null
2025-01-02 Towards Interactive Deepfake Analysis Lixiong Qin et.al. 2501.01164 link
2025-01-02 TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions Vriksha Srihari et.al. 2501.01156 null
2025-01-02 A3: Android Agent Arena for Mobile GUI Agents Yuxiang Chai et.al. 2501.01149 null
2025-01-03 BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference Wonsuk Jang et.al. 2501.01144 link
2025-01-02 Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method Ruichen Zhang et.al. 2501.01141 null
2025-01-02 Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning Shuo Yu et.al. 2501.01124 null
2025-01-02 MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification Jimin Park et.al. 2501.01110 null
2025-01-03 MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Haina Zhu et.al. 2501.01108 link
2025-01-02 Graph Generative Pre-trained Transformer Xiaohui Chen et.al. 2501.01073 null
2025-01-02 Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models Yanwen Huang et.al. 2501.01059 null
2025-01-02 Risks of Cultural Erasure in Large Language Models Rida Qadri et.al. 2501.01056 null
2025-01-02 Dynamic Scaling of Unit Tests for Code Reward Modeling Zeyao Ma et.al. 2501.01054 null
2025-01-02 Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs Linhao Huang et.al. 2501.01042 null
2025-01-02 Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Bin Wang et.al. 2501.01034 link
2025-01-02 ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning Wonduk Seo et.al. 2501.01031 null
2025-01-03 KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Xinshuo Hu et.al. 2501.01028 link
2025-01-02 MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model Chengze Zhang et.al. 2501.01014 null
2025-01-02 FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving Zihao Ye et.al. 2501.01005 link
2025-01-02 Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory Zhou Yang et.al. 2501.00999 null
2025-01-02 Optimizing Noise Schedules of Generative Models in High Dimensionss Santiago Aranguri et.al. 2501.00988 null
2025-01-02 Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice Federico Ravenda et.al. 2501.00982 link
2025-01-01 IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs Junfeng Jiao et.al. 2501.00959 null
2025-01-01 Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors Junfeng Jiao et.al. 2501.00957 null
2025-01-01 Incremental Dialogue Management: Survey, Discussion, and Implications for HRI Casey Kennington et.al. 2501.00953 null
2025-01-01 SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering Shihab Ahmed et.al. 2501.00940 null
2025-01-01 Diffusion Policies for Generative Modeling of Spacecraft Trajectories Julia Briden et.al. 2501.00915 null
2025-01-01 Aligning LLMs with Domain Invariant Reward Models David Wu et.al. 2501.00911 link
2025-01-01 Population Aware Diffusion for Time Series Generation Yang Li et.al. 2501.00910 link
2025-01-01 Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things Talha Zeeshan et.al. 2501.00906 null
2025-01-01 Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model Chenyang Liu et.al. 2501.00895 null
2025-01-01 Evaluating Time Series Foundation Models on Noisy Periodic Time Series Syamantak Datta Gupta et.al. 2501.00889 null
2025-01-01 Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization Weiqi Wu et.al. 2501.00888 link
2025-01-01 Representation in large language models Cameron C. Yetman et.al. 2501.00885 null
2025-01-01 Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents Fouad Bousetouane et.al. 2501.00881 null
2025-01-01 Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction Teng Hu et.al. 2501.00880 null
2025-01-01 TrustRAG: Enhancing Robustness and Trustworthiness in RAG Huichi Zhou et.al. 2501.00879 link
2025-01-01 LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models Hieu Man et.al. 2501.00874 link
2025-01-01 Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation Mingjia Li et.al. 2501.00873 link
2025-01-01 Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation Shoutao Guo et.al. 2501.00868 link
2025-01-01 Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era Mihnea C. Moldoveanu et.al. 2501.00867 null
2025-01-01 Alzheimer’s disease detection based on large language model prompt engineering Tian Zheng et.al. 2501.00861 null
2025-01-01 LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions Adam Ishay et.al. 2501.00830 null
2025-01-01 An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems Haoxiang Tian et.al. 2501.00829 null
2025-01-01 LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management Yichen Luo et.al. 2501.00826 null
2025-01-01 Multimodal Large Models Are Effective Action Anticipators Binglu Wang et.al. 2501.00795 link
2025-01-01 Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models Minhao Bai et.al. 2501.00786 null
2025-01-01 NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model Yuzhi Lai et.al. 2501.00785 null
2025-01-01 REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization Huyen Nguyen et.al. 2501.00779 null
2025-01-01 FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation Qianli Wang et.al. 2501.00777 null
2025-01-01 Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis Jie Gao et.al. 2501.00775 null
2025-01-01 An AI-powered Bayesian generative modeling approach for causal inference in observational studies Qiao Liu et.al. 2501.00755 null
2025-01-01 Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform Cheonsu Jeong et.al. 2501.00750 null
2025-01-01 DIVE: Diversified Iterative Self-Improvement Yiwei Qin et.al. 2501.00747 link
2025-01-01 Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines Xiyang Hu et.al. 2501.00745 null
2025-01-01 A Distributional Evaluation of Generative Image Models Edric Tam et.al. 2501.00744 null
2025-01-01 New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory Madiha Ajmal et.al. 2501.00721 null
2025-01-01 Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection Hao Wang et.al. 2501.00700 null
2025-01-01 Adjoint sharding for very long context training of state space models Xingzi Xu et.al. 2501.00692 null
2025-01-01 Labels Generated by Large Language Model Helps Measuring People’s Empathy in Vitro Md Rakibul Hasan et.al. 2501.00691 null
2025-01-01 IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently Florian Dietz et.al. 2501.00684 null
2024-12-31 Grade Inflation in Generative Models Phuc Nguyen et.al. 2501.00664 null
2024-12-31 Finding Missed Code Size Optimizations in Compilers using LLMs Davide Italiano et.al. 2501.00655 null
2024-12-31 Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models Suttisak Wizadwongsa et.al. 2501.00651 null
2024-12-31 Efficient Standardization of Clinical Notes using Large Language Models Daniel B. Hier et.al. 2501.00644 null
2024-12-31 Enabling New HDLs with Agents Mark Zakharov et.al. 2501.00642 null
2024-12-31 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2024-12-31 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2024-12-31 Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation M. Ali Bayram et.al. 2501.00593 null
2024-12-31 Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Zhenpeng Huang et.al. 2501.00584 null
2024-12-31 Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders Yipeng Kang et.al. 2501.00581 null
2024-12-31 AI and Quantum Computing in Binary Photocatalytic Hydrogen Production Dennis Delali Kwesi Wayo et.al. 2501.00575 null
2024-12-31 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Xinhao Li et.al. 2501.00574 link
2024-12-31 Probing Visual Language Priors in VLMs Tiange Luo et.al. 2501.00569 null
2024-12-31 Robust and Adaptive Optimization under a Large Language Model Lens Dimitris Bertsimas et.al. 2501.00568 null
2024-12-30 Distributed Mixture-of-Agents for Edge Inference with Large Language Models Purbesh Mitra et.al. 2412.21200 link
2024-12-31 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Zhaojian Yu et.al. 2412.21199 link
2024-12-30 The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick Jonathan Berkheim et.al. 2412.21186 null
2024-12-30 Facilitating large language model Russian adaptation with Learned Embedding Propagation Mikhail Tikhomirov et.al. 2412.21140 link
2024-12-30 ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation Ruixuan Liu et.al. 2412.21123 null
2025-01-02 Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation Yuanbo Yang et.al. 2412.21117 null
2024-12-30 Varformer: Adapting VAR’s Generative Prior for Image Restoration Siyang Wang et.al. 2412.21063 link
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense Yuyang Zhou et.al. 2412.21051 link
2024-12-30 E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models Zhiyu Tan et.al. 2412.21044 null
2024-12-30 Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration Wanglong Lu et.al. 2412.21042 link
2024-12-30 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Chia-Yu Hung et.al. 2412.21037 link
2024-12-30 GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models Shangyu Xing et.al. 2412.21036 null
2024-12-30 MapQaTor: A System for Efficient Annotation of Map Query Datasets Mahir Labib Dihan et.al. 2412.21015 link
2024-12-31 Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria Joonwon Jang et.al. 2412.21006 null
2024-12-30 Plug-and-Play Training Framework for Preference Optimization Jingyuan Ma et.al. 2412.20996 null
2024-12-30 KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation Siyuan Fang et.al. 2412.20995 null
2024-12-30 Efficiently Serving LLM Reasoning Programs with Certaindex Yichao Fu et.al. 2412.20993 null
2024-12-30 QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing Shlomo Kashani et.al. 2412.20956 null
2024-12-30 AGON: Automated Design Framework for Customizing Processors from ISA Documents Chongxiao Li et.al. 2412.20954 null
2024-12-30 Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema Xiaohan Feng et.al. 2412.20942 null
2024-12-30 Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering Junxiao Xue et.al. 2412.20927 null
2024-12-30 ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation Ting Zhang et.al. 2412.20901 null
2024-12-30 Towards Compatible Fine-tuning for Vision-Language Model Updates Zhengbo Wang et.al. 2412.20895 null
2024-12-30 DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models Xiaolin Hu et.al. 2412.20891 null
2024-12-30 Enhancing Annotated Bibliography Generation with LLM Ensembles Sergio Bermejo et.al. 2412.20864 null
2024-12-30 Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs’ Memory Xingjian Tao et.al. 2412.20846 null
2024-12-30 Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment Jianfei Zhang et.al. 2412.20834 link
2024-12-30 Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model Runtao Ren et.al. 2412.20820 null
2024-12-30 TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting Huanyu Zhang et.al. 2412.20810 null
2024-12-30 Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves Chayan Chatterjee et.al. 2412.20789 null
2024-12-31 SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity Pengfei Jing et.al. 2412.20787 null
2024-12-30 Large Language Model Enabled Multi-Task Physical Layer Network Tianyue Zheng et.al. 2412.20772 null
2024-12-30 Attributing Culture-Conditioned Generations to Pretraining Corpora Huihan Li et.al. 2412.20760 link
2024-12-30 M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs Bei Yan et.al. 2412.20718 link
2024-12-30 HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images Sungik Choi et.al. 2412.20704 null
2024-12-30 UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design Zijie Chen et.al. 2412.20694 null
2024-12-30 Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks Yuhe Ding et.al. 2412.20682 null
2024-12-30 Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA Qingyun Jin et.al. 2412.20677 null
2024-12-30 Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner Yitong Zhou et.al. 2412.20662 link
2024-12-30 Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis Yousef Yeganeh et.al. 2412.20651 null
2024-12-30 SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy Md Mahadi Hasan Nahid et.al. 2412.20641 null
2024-12-30 Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble Yongchang Li et.al. 2412.20637 null
2024-12-30 EVOLVE: Emotion and Visual Output Learning via LLM Evaluation Jordan Sinclair et.al. 2412.20632 null
2024-12-29 Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study Yulin Fei et.al. 2412.20613 link
2024-12-29 NLP-based Regulatory Compliance – Using GPT 4.0 to Decode Regulatory Documents Bimal Kumar et.al. 2412.20602 null
2024-12-29 MATEY: multiscale adaptive foundation models for spatiotemporal physical systems Pei Zhang et.al. 2412.20601 null
2024-12-29 Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection Dmitri Roussinov et.al. 2412.20595 link
2024-12-29 Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches Madhavendra Thakur et.al. 2412.20584 null
2024-12-29 Counterfactual Samples Constructing and Training for Commonsense Statements Estimation Chong Liu et.al. 2412.20563 null
2024-12-29 Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces Linglingzhi Zhu et.al. 2412.20556 null
2024-12-29 The Impact of Prompt Programming on Function-Level Code Generation Ranim Khojah et.al. 2412.20545 link
2024-12-29 Goal-Conditioned Data Augmentation for Offline Reinforcement Learning Xingshuai Huang et.al. 2412.20519 null
2024-12-29 Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning Hang Ni et.al. 2412.20505 null
2024-12-29 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link
2024-12-29 TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication Zongwu Wang et.al. 2412.20501 link
2024-12-29 Multimodal Variational Autoencoder: a Barycentric View Peijie Qiu et.al. 2412.20487 null
2024-12-29 JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling Haorui Ji et.al. 2412.20470 null
2024-12-29 Improving Vision-Language-Action Models via Chain-of-Affordance Jinming Li et.al. 2412.20451 null
2024-12-29 Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs Pratik Rakesh Singh et.al. 2412.20440 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-29 Unlocking adaptive digital pathology through dynamic feature learning Jiawen Li et.al. 2412.20430 null
2024-12-29 AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models Mansi et.al. 2412.20427 null
2024-12-29 Bringing Objects to Life: 4D generation from 3D objects Ohad Rahamim et.al. 2412.20422 null
2024-12-29 Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection Kalin Kopanov et.al. 2412.20414 null
2024-12-29 Multi-Objective Large Language Model Unlearning Zibin Pan et.al. 2412.20412 link
2024-12-29 Open-Sora: Democratizing Efficient Video Production for All Zangwei Zheng et.al. 2412.20404 link
2024-12-29 Natural Language Fine-Tuning Jia Liu et.al. 2412.20382 link
2024-12-29 Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs) Jia Wei Sii et.al. 2412.20381 null
2024-12-29 FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation Yan Luo et.al. 2412.20374 link
2024-12-29 LLM2: Let Large Language Models Harness System 2 Reasoning Cheng Yang et.al. 2412.20372 link
2025-01-02 Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey Junqiao Wang et.al. 2412.20367 null
2024-12-29 HindiLLM: Large Language Model for Hindi Sanjay Chouhan et.al. 2412.20357 null
2024-12-29 Distilling Desired Comments for Enhanced Code Review with Large Language Models Yongda Yu et.al. 2412.20340 null
2024-12-29 Mind the Data Gap: Bridging LLMs to Enterprise Data Integration Moe Kayali et.al. 2412.20331 null
2024-12-29 GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions Tianyao Shi et.al. 2412.20322 null
2024-12-29 Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain Shintaro Ozaki et.al. 2412.20309 null
2024-12-28 FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration Jia Liu et.al. 2412.20297 null
2024-12-28 Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games Guan-Horng Liu et.al. 2412.20279 null
2024-12-28 Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues Henry J. Xie et.al. 2412.20264 link
2024-12-28 Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception Athanasios Karagounis et.al. 2412.20230 null
2024-12-28 LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning Shuguang Chen et.al. 2412.20227 null
2024-12-28 Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation Yeonhong Park et.al. 2412.20185 null
2024-12-28 LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System Hyucksung Kwon et.al. 2412.20166 null
2024-12-28 StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN Andrzej Bedychaj et.al. 2412.20164 null
2024-12-28 Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems Minhye Jeon et.al. 2412.20163 null
2024-12-28 Multi-Modality Driven LoRA for Adverse Condition Depth Estimation Guanglei Yang et.al. 2412.20162 null
2024-12-28 Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses Xinru Wen et.al. 2412.20154 null
2024-12-28 Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering Wei Zhou et.al. 2412.20145 null
2024-12-28 TradingAgents: Multi-Agents LLM Financial Trading Framework Yijia Xiao et.al. 2412.20138 null
2024-12-28 M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation Zhaopeng Feng et.al. 2412.20127 link
2024-12-28 Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers Tuomas Hakoniemi et.al. 2412.20114 null
2024-12-28 ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming Jiedong Zhuang et.al. 2412.20105 null
2024-12-28 On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs Atmane Ayoub Mansour Bahar et.al. 2412.20087 null
2024-12-31 Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset Chongjian Yue et.al. 2412.20072 null
2024-12-28 On the Compositional Generalization of Multimodal LLMs for Medical Imaging Zhenyang Cai et.al. 2412.20070 link
2024-12-28 VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition Lan Chen et.al. 2412.20064 link
2024-12-28 MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion Zechao Zhan et.al. 2412.20062 null
2024-12-28 Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts Yanxin Shen et.al. 2412.20061 null
2024-12-28 “My life is miserable, have to sign 500 autographs everyday”: Exposing Humblebragging, the Brags in Disguise Sharath Naganna et.al. 2412.20057 null
2024-12-27 Enhancing Whisper’s Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization Kumud Tripathi et.al. 2412.19785 null
2024-12-27 Can AI Help with Your Personal Finances? Oudom Hean et.al. 2412.19784 null
2024-12-27 Tensor Network Estimation of Distribution Algorithms John Gardiner et.al. 2412.19780 null
2024-12-27 Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration Le Chen et.al. 2412.19770 link
2024-12-27 Generative Video Propagation Shaoteng Liu et.al. 2412.19761 null
2024-12-27 On dual-projectively equivalent connections associated to second order superintegrable systems Andreas Vollmer et.al. 2412.19739 null
2024-12-27 Can Large Language Models Adapt to Other Agents In-Context? Matthew Riemer et.al. 2412.19726 null
2024-12-27 From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Jiawei Lin et.al. 2412.19712 null
2024-12-27 Toward Adaptive Reasoning in Large Language Models with Thought Rollback Sijia Chen et.al. 2412.19707 link
2024-12-27 A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization Jingchun Lian et.al. 2412.19685 null
2024-12-27 Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework Jiang Liu et.al. 2412.19684 null
2024-12-27 CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs Siyu Wang et.al. 2412.19663 null
2024-12-27 Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis Jiaqi Wang et.al. 2412.19654 link
2024-12-27 FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios Kaiyi Pang et.al. 2412.19652 null
2024-12-27 Xmodel-2 Technical Report Wang Qun et.al. 2412.19638 null
2024-12-27 IMTP: Search-based Code Generation for In-memory Tensor Programs Yongwon Shin et.al. 2412.19630 null
2024-12-27 Signatures of prediction during natural listening in MEG data? Sahel Azizpour et.al. 2412.19622 null
2024-12-27 Gradient Weight-normalized Low-rank Projection for Efficient LLM Training Jia-Hong Huang et.al. 2412.19616 link
2024-12-27 SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms Shashank Rao Marpally et.al. 2412.19595 null
2024-12-27 Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following Yuxiao Yang et.al. 2412.19562 null
2024-12-27 Diverse Rare Sample Generation with Pretrained GANs Subeen Lee et.al. 2412.19543 link
2024-12-27 Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations Yuanfei Huang et.al. 2412.19520 null
2024-12-27 Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model Hyunwoo Cho et.al. 2412.19517 null
2024-12-27 Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs Zhe Yang et.al. 2412.19513 link
2024-12-27 Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging Hua Farn et.al. 2412.19512 null
2024-12-27 Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion Koustav Ghosal et.al. 2412.19510 null
2024-12-27 MBQ: Modality-Balanced Quantization for Large Vision-Language Models Shiyao Li et.al. 2412.19509 link
2024-12-27 DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-27 Casevo: A Cognitive Agents and Social Evolution Simulator Zexun Jiang et.al. 2412.19498 link
2024-12-27 Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation Chengyang Ye et.al. 2412.19492 link
2024-12-27 Focusing Image Generation to Mitigate Spurious Correlations Xuewei Li et.al. 2412.19457 null
2024-12-27 Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models Hyeonseok Moon et.al. 2412.19450 link
2024-12-27 Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models Shuo Wang et.al. 2412.19449 null
2024-12-27 A Survey on Large Language Model Acceleration based on KV Cache Management Haoyang Li et.al. 2412.19442 link
2024-12-27 Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback Seong Jin Lee et.al. 2412.19436 null
2024-12-27 Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints Alberto Maté et.al. 2412.19424 null
2024-12-27 Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning Chen Li et.al. 2412.19422 link
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412 link
2024-12-27 MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios Jiaqi Fan et.al. 2412.19406 null
2024-12-27 An Engorgio Prompt Makes Large Language Model Babble on Jianshuo Dong et.al. 2412.19394 link
2024-12-26 Large Language Models for Market Research: A Data-augmentation Approach Mengxin Wang et.al. 2412.19363 null
2024-12-26 Dynamic Skill Adaptation for Large Language Models Jiaao Chen et.al. 2412.19361 null
2024-12-26 Identifying Split Vacancies with Foundation Models and Electrostatics Seán R. Kavanagh et.al. 2412.19330 null
2024-12-26 Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Ziang Yan et.al. 2412.19326 link
2024-12-26 Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami et.al. 2412.19325 null
2024-12-26 From Interets to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries Hugh Van Deventer et.al. 2412.19312 link
2024-12-26 Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso et.al. 2412.19304 null
2024-12-26 RecLM: Recommendation Instruction Tuning Yangqin Jiang et.al. 2412.19302 link
2024-12-26 RAG with Differential Privacy Nicolas Grislain et.al. 2412.19291 link
2024-12-26 Time Series Foundational Models: Their Role in Anomaly Detection and Prediction Chathurangi Shyalika et.al. 2412.19286 link
2024-12-26 PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing Michael Bezick et.al. 2412.19284 null
2024-12-26 MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes Asma Ben Abacha et.al. 2412.19260 link
2024-12-26 VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis Jaemin Jung et.al. 2412.19259 null
2024-12-26 Sentiment trading with large language models Kemal Kirtac et.al. 2412.19245 null
2024-12-26 SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model Xuyang Li et.al. 2412.19237 null
2024-12-26 Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining Yuxin You et.al. 2412.19211 null
2024-12-26 Multi-Attribute Constraint Satisfaction via Language Model Rewriting Ashutosh Baheti et.al. 2412.19198 null
2024-12-26 Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models Haonan He et.al. 2412.19191 null
2024-12-26 Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue Shuzhi Xu et.al. 2412.19154 null
2024-12-26 AskChart: Universal Chart Understanding through Textual Enhancement Xudong Yang et.al. 2412.19146 link
2024-12-26 SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis Senbin Zhu et.al. 2412.19140 link
2024-12-26 PlanLLM: Video Procedure Planning with Refinable Large Language Models Dejie Yang et.al. 2412.19139 link
2024-12-26 Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing Inpyo Hong et.al. 2412.19125 link
2024-12-26 Discrete vs. Continuous Trade-offs for Generative Models Jathin Korrapati et.al. 2412.19114 null
2024-12-26 SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values Yunfan Zhang et.al. 2412.19113 null
2024-12-26 Stochastic normalizing flows for Effective String Theory Michele Caselle et.al. 2412.19109 null
2024-12-26 “I’ve Heard of You!”: Generate Spoken Named Entity Recognition Data for Unseen Entities Jiawei Yu et.al. 2412.19102 null
2024-12-26 Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security Vasileios Alevizos et.al. 2412.19088 null
2024-12-26 Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation Haotian Qian et.al. 2412.19080 null
2024-12-26 CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers Jingyi Zheng et.al. 2412.19037 link
2024-12-26 Repository Structure-Aware Training Makes SLMs Better Issue Resolver Zexiong Ma et.al. 2412.19031 null
2024-12-26 Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation Yixin Chen et.al. 2412.19026 link
2024-12-26 Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication Xiqiang Qu et.al. 2412.19025 null
2024-12-26 Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation Tao Liu et.al. 2412.19021 null
2024-12-26 Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability Ruixi Lin et.al. 2412.19018 null
2024-12-25 How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study Alejandro Velasco et.al. 2412.18989 null
2024-12-25 ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement Zhefan Rao et.al. 2412.18966 null
2024-12-25 Musings About the Future of Search: A Return to the Past? Jimmy Lin et.al. 2412.18956 null
2024-12-25 A Power-Efficient Hardware Implementation of L-Mul Ruiqi Chen et.al. 2412.18948 null
2024-12-25 MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models Kaiwen Zuo et.al. 2412.18947 null
2024-12-25 Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations Yewon Kim et.al. 2412.18940 null
2024-12-25 Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference Libo Zhang et.al. 2412.18934 null
2024-12-25 UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation Lunhao Duan et.al. 2412.18928 null
2024-12-25 Exemplar-condensed Federated Class-incremental Learning Rui Sun et.al. 2412.18926 null
2024-12-25 Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model Yi-Chia Chen et.al. 2412.18917 link
2024-12-25 AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures Situo Zhang et.al. 2412.18910 null
2024-12-25 CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models Ping Guo et.al. 2412.18890 link
2024-12-25 MotionMap: Representing Multimodality in Human Pose Forecasting Reyhaneh Hosseininejad et.al. 2412.18883 null
2024-12-25 Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models Meltem Aksoy et.al. 2412.18863 null
2024-12-25 Improving the Readability of Automatically Generated Tests using Large Language Models Matteo Biagiola et.al. 2412.18843 null
2024-12-25 LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements Hao Zhang et.al. 2412.18835 null
2024-12-25 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Shujie Hu et.al. 2412.18832 null
2024-12-25 RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting Yilei Jiang et.al. 2412.18826 null
2024-12-25 CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection Wenbin Li et.al. 2412.18820 link
2024-12-25 LLM-assisted vector similarity search Md Riyadh et.al. 2412.18819 null
2024-12-25 DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search Lei Yang et.al. 2412.18811 null
2024-12-25 Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation Xinkai Du et.al. 2412.18800 null
2024-12-25 Torque-Aware Momentum Pranshu Malviya et.al. 2412.18790 null
2024-12-25 Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models Yu-An Liu et.al. 2412.18770 link
2024-12-25 The Impact of Input Order Bias on Large Language Models for Software Fault Localization Md Nakhla Rafi et.al. 2412.18750 null
2024-12-24 Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Zehan Wang et.al. 2412.18605 link
2024-12-24 Long-Form Speech Generation with Spoken Language Models Se Jin Park et.al. 2412.18603 link
2024-12-24 Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems Fernando Jia et.al. 2412.18601 link
2024-12-24 ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation Hongjie Li et.al. 2412.18600 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-24 A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs OpenMind et.al. 2412.18588 null
2024-12-24 Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control Sergey Sedov et.al. 2412.18582 null
2024-12-24 Zero-resource Speech Translation and Recognition with LLMs Karel Mundnich et.al. 2412.18566 null
2024-12-24 Distilling Fine-grained Sentiment Understanding from Large Language Models Yice Zhang et.al. 2412.18552 link
2024-12-24 Token-Budget-Aware LLM Reasoning Tingxu Han et.al. 2412.18547 link
2024-12-24 PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction Xingjian Xu et.al. 2412.18541 null
2024-12-24 Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation Derong Xu Xinhang Li et.al. 2412.18537 link
2024-12-24 Automated Code Review In Practice Umut Cihan et.al. 2412.18531 null
2024-12-24 Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving Hao Pang et.al. 2412.18511 null
2024-12-24 Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization Yi-Fu Fu et.al. 2412.18497 null
2024-12-24 GeFL: Model-Agnostic Federated Learning with Generative Models Honggu Kang et.al. 2412.18460 null
2024-12-24 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Tatiana Zemskova et.al. 2412.18450 link
2024-12-24 Is Large Language Model Good at Triple Set Prediction? An Empirical Study Yuan Yuan et.al. 2412.18443 null
2024-12-24 Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm O. Deniz Akyildiz et.al. 2412.18432 null
2024-12-24 GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent Kangjia Zhao et.al. 2412.18426 null
2024-12-24 Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models Zihan Zhou et.al. 2412.18419 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English Avinash Anand et.al. 2412.18415 link
2024-12-24 Discovery of 2D Materials via Symmetry-Constrained Diffusion Model Shihang Xu et.al. 2412.18414 null
2024-12-24 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli et.al. 2412.18407 link
2024-12-24 Extract Free Dense Misalignment from CLIP JeongYeon Nam et.al. 2412.18404 link
2024-12-24 RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction Wu Xiaoping et.al. 2412.18390 null
2024-12-24 MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs Qiuyi Gu et.al. 2412.18381 null
2024-12-24 Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents Kaiwen Ning et.al. 2412.18371 link
2024-12-24 Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.18351 null
2024-12-24 M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models Jiaxin Guo et.al. 2412.18299 null
2024-12-24 Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight Xi Ding et.al. 2412.18298 link
2024-12-24 Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases Christian Di Maio et.al. 2412.18295 null
2024-12-24 DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation Junyi Lu et.al. 2412.18291 null
2024-12-24 Improved Feature Generating Framework for Transductive Zero-shot Learning Zihan Ye et.al. 2412.18282 null
2024-12-24 GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications Zhenzhou Jin et.al. 2412.18281 null
2024-12-24 Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Jiacai Liu et.al. 2412.18279 null
2024-12-24 GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge Shammur Absar Chowdhury et.al. 2412.18274 null
2024-12-24 Annotating References to Mythological Entities in French Literature Thierry Poibeau et.al. 2412.18270 null
2024-12-24 Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study Xuefeng Jiang et.al. 2412.18260 link
2024-12-24 AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction Pufan Zou et.al. 2412.18255 null
2024-12-24 An Automatic Graph Construction Framework based on Large Language Models for Recommendation Rong Shan et.al. 2412.18241 link
2024-12-24 Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection Jango Zhang et.al. 2412.18225 null
2024-12-24 Expand VSR Benchmark for VLLM to Expertize in Spatial Rules Peijin Xie et.al. 2412.18224 link
2024-12-24 ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation Mengyang Wu et.al. 2412.18216 link
2024-12-24 Adapting Large Language Models for Improving TCP Fairness over WiFi Shyam Kumar Shrestha et.al. 2412.18200 null
2024-12-24 Robustness-aware Automatic Prompt Optimization Zeru Shi et.al. 2412.18196 link
2024-12-24 VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Shiduo Zhang et.al. 2412.18194 null
2024-12-24 TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization Yucong Luo et.al. 2412.18185 null
2024-12-24 Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation Yucong Luo et.al. 2412.18176 null
2024-12-24 INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Haohang Li et.al. 2412.18174 null
2024-12-24 Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models Xiaomeng Hu et.al. 2412.18171 null
2024-12-24 KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management Rongxin Cheng et.al. 2412.18169 null
2024-12-24 Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence Yinbin Han et.al. 2412.18164 null
2024-12-24 VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities Shray Mathur et.al. 2412.18161 null
2024-12-24 Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task Jinming Liu et.al. 2412.18158 null
2024-12-24 Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance Yaoyun Zhang et.al. 2412.18157 null
2024-12-24 scReader: Prompting Large Language Models to Interpret scRNA-seq Data Cong Li et.al. 2412.18156 null
2024-12-24 GeneSUM: Large Language Model-based Gene Summary Extraction Zhijian Chen et.al. 2412.18154 null
2024-12-24 CoAM: Corpus of All-Type Multiword Expressions Yusuke Ide et.al. 2412.18151 null
2024-12-24 EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation Shuhao Han et.al. 2412.18150 link
2024-12-24 Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction Xiao Guo et.al. 2412.18149 null
2024-12-24 Ensuring Consistency for In-Image Translation Chengpeng Fu et.al. 2412.18139 null
2024-12-24 LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment Binrui Zeng et.al. 2412.18135 null
2024-12-24 VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection Zhaohui Jin et.al. 2412.18124 null
2024-12-24 AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation Hao Wen et.al. 2412.18116 null
2024-12-24 AIGT: AI Generative Table Based on Prompt Mingming Zhang et.al. 2412.18111 null
2024-12-24 SlimGPT: Layer-wise Structured Pruning for Large Language Models Gui Ling et.al. 2412.18110 null
2024-12-24 Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach Jing Bi et.al. 2412.18108 null
2024-12-24 Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels Mingcong Song et.al. 2412.18106 null
2024-12-24 EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent Suyuan Wang et.al. 2412.18100 null
2024-12-24 Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) – a Large Language Model Chatbot for Perioperative Medicine Yu He Ke et.al. 2412.18096 null
2024-12-24 Molly: Making Large Language Model Agents Solve Python Problem More Logically Rui Xiao et.al. 2412.18093 null
2024-12-24 Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner Aizierjiang Aiersilan et.al. 2412.18086 link
2024-12-24 Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models Xuan Lin et.al. 2412.18084 link
2024-12-24 Improving Factuality with Explicit Working Memory Mingda Chen et.al. 2412.18069 null
2024-12-24 LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR Osama Hosam Abdellaif et.al. 2412.18063 link
2024-12-24 Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction Hyunbae Jeon et.al. 2412.18061 null
2024-12-24 An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM Wen Wen et.al. 2412.18060 null
2024-12-23 Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations Maya Patel et.al. 2412.18051 null
2024-12-23 AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data Mirko Zaffaroni et.al. 2412.18038 link
2024-12-23 Generating refactored code accurately using reinforcement learning Indranil Palit et.al. 2412.18035 null
2024-12-23 More than Chit-Chat: Developing Robots for Small-Talk Interactions Rebecca Ramnauth et.al. 2412.18023 null
2024-12-23 Trustworthy and Efficient LLMs Meet Databases Kyoungmin Kim et.al. 2412.18022 null
2024-12-23 StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs Hailin Chen et.al. 2412.18011 null
2024-12-23 CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models Ruibo Tu et.al. 2412.17970 link
2024-12-23 LMV-RPA: Large Model Voting-based Robotic Process Automation Osama Abdellatif et.al. 2412.17965 link
2024-12-23 Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models Antony Seabra et.al. 2412.17964 null
2024-12-23 Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models Ge Zhang et.al. 2412.17963 null
2024-12-23 Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents Antony Seabra et.al. 2412.17942 null
2024-12-23 BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism Martin Fajcik et.al. 2412.17933 null
2024-12-23 Causal Composition Diffusion Model for Closed-loop Traffic Generation Haohong Lin et.al. 2412.17920 null
2024-12-23 Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning Orson Mengara et.al. 2412.17908 null
2024-12-23 LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses Sara Riazi et.al. 2412.17892 null
2024-12-23 ChatGarment: Garment Estimation, Generation and Editing via Large Language Models Siyuan Bian et.al. 2412.17811 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 null
2024-12-23 Automating the Search for Artificial Life with Foundation Models Akarsh Kumar et.al. 2412.17799 link
2024-12-23 ResearchTown: Simulator of Human Research Community Haofei Yu et.al. 2412.17767 link
2024-12-23 ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback Wei Zhang et.al. 2412.17754 null
2024-12-23 Deliberation in Latent Space via Differentiable Cache Augmentation Luyang Liu et.al. 2412.17747 null
2024-12-23 YuLan-Mini: An Open Data-efficient Language Model Yiwen Hu et.al. 2412.17743 link
2024-12-23 **Reasoning to Attend: Try to Understand How Token Works** Rui Qian et.al. 2412.17741 link
2024-12-23 Knowledge Editing through Chain-of-Thought Changyue Wang et.al. 2412.17727 link
2024-12-23 Understanding the Logic of Direct Preference Alignment through Logic Kyle Richardson et.al. 2412.17696 null
2024-12-23 Large Language Model Safety: A Holistic Survey Dan Shi et.al. 2412.17686 link
2024-12-23 A Bias-Free Training Paradigm for More General AI-generated Image Detection Fabrizio Guillaro et.al. 2412.17671 null
2024-12-23 Generating Completions for Fragmented Broca’s Aphasic Sentences Using Large Language Models Sijbren van Vaals et.al. 2412.17669 link
2024-12-23 Detecting anxiety and depression in dialogues: a multi-label and explainable approach Francisco de Arriba-Pérez et.al. 2412.17651 null
2024-12-23 SCBench: A Sports Commentary Benchmark for Video LLMs Kuangzhi Ge et.al. 2412.17637 null
2024-12-23 ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance Renyang Liu et.al. 2412.17632 link
2024-12-23 Tracking the Feature Dynamics in LLM Training: A Mechanistic Study Yang Xu et.al. 2412.17626 null
2024-12-23 Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models Parham Rezaei et.al. 2412.17622 link
2024-12-23 Emerging Security Challenges of Large Language Models Herve Debar et.al. 2412.17614 null
2024-12-23 Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs Fabrizio Frasca et.al. 2412.17609 null
2024-12-23 EasyTime: Time Series Forecasting Made Easy Xiangfei Qiu et.al. 2412.17603 null
2024-12-23 LiveIdeaBench: Evaluating LLMs’ Scientific Creativity and Idea Generation with Minimal Context Kai Ruan et.al. 2412.17596 link
2024-12-23 Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation Chengbing Wang et.al. 2412.17593 null
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-23 S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field Zixi Liang et.al. 2412.17561 link
2024-12-23 GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Chao Zeng et.al. 2412.17560 null
2024-12-23 A Survey of Query Optimization in Large Language Models Mingyang Song et.al. 2412.17558 null
2024-12-23 Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing Prakash Aryan et.al. 2412.17548 link
2024-12-23 Retention Score: Quantifying Jailbreak Risks for Vision Language Models Zaitang Li et.al. 2412.17544 null
2024-12-23 Constructing Fair Latent Space for Intersection of Fairness and Explainability Hyungjun Joo et.al. 2412.17523 null
2024-12-23 DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak Hao Wang et.al. 2412.17522 null
2024-12-23 Improving the Noise Estimation of Latent Neural Stochastic Differential Equations Linus Heck et.al. 2412.17499 null
2024-12-23 Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings Jérémie Sublime et.al. 2412.17486 null
2024-12-23 Power- and Fragmentation-aware Online Scheduling for GPU Datacenters Francesco Lettich et.al. 2412.17484 link
2024-12-23 A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Chenlong Deng et.al. 2412.17483 null
2024-12-23 A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers Shuaihang Chen et.al. 2412.17481 link
2024-12-23 CALLIC: Content Adaptive Learning for Lossless Image Compression Daxin Li et.al. 2412.17464 null
2024-12-23 Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning Xiaodan Chen et.al. 2412.17456 null
2024-12-23 Applying LLM and Topic Modelling in Psychotherapeutic Contexts Alexander Vanin et.al. 2412.17449 null
2024-12-23 Measuring Contextual Informativeness in Child-Directed Text Maria Valentini et.al. 2412.17427 link
2024-12-23 Multimodal Preference Data Synthetic Alignment with Reward Model Robert Wijaya et.al. 2412.17417 link
2024-12-23 VidCtx: Context-aware Video Question Answering with Image Models Andreas Goulas et.al. 2412.17415 null
2024-12-23 Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance Muhammad Reza Qorib et.al. 2412.17408 link
2024-12-23 Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning Huchen Jiang et.al. 2412.17397 null
2024-12-23 WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models Huawen Feng et.al. 2412.17395 null
2024-12-23 Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement Hyeonjin Kim et.al. 2412.17387 link
2024-12-23 Interweaving Memories of a Siamese Large Language Model Xin Song et.al. 2412.17383 link
2024-12-23 MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models Beibei Yu et.al. 2412.17339 null
2024-12-23 A Dual-Perspective Metaphor Detection Framework Using Large Language Models Yujie Lin et.al. 2412.17332 link
2024-12-23 Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance Nicolas Devatine et.al. 2412.17321 null
2024-12-23 CodeV: Issue Resolving with Visual Data Linhao Zhang et.al. 2412.17315 link
2024-12-23 Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories Mahan Tafreshipour et.al. 2412.17298 null
2024-12-23 Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples Taewoong Kim et.al. 2412.17288 link
2024-12-23 LLM4AD: A Platform for Algorithm Design with Large Language Model Fei Liu et.al. 2412.17287 link
2024-12-23 Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning Rui Liang et.al. 2412.17285 null
2024-12-23 Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach Rafid Ishrak Jahan et.al. 2412.17255 link
2024-12-23 SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval Xiaopeng Li et.al. 2412.17250 null
2024-12-23 EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling Zichen Song et.al. 2412.17249 null
2024-12-23 On the Generalization Ability of Machine-Generated Text Detectors Yule Liu et.al. 2412.17242 link
2024-12-23 Brain-to-Text Benchmark ‘24: Lessons Learned Francis R. Willett et.al. 2412.17227 link
2024-12-23 CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder Lichen Ma et.al. 2412.17225 null
2024-12-22 Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension Jio Oh et.al. 2412.17189 null
2024-12-22 Foundation Model for Lossy Compression of Spatiotemporal Scientific Data Xiao Li et.al. 2412.17184 null
2024-12-22 Enhancing Item Tokenization for Generative Recommendation through Self-Improvement Runjin Chen et.al. 2412.17171 null
2024-12-22 Generative Diffusion Modeling: A Practical Handbook Zihan Ding et.al. 2412.17162 null
2024-12-22 LLM-based relevance assessment still can’t replace human relevance assessment Charles L. A. Clarke et.al. 2412.17156 null
2024-12-22 LLM Agent for Fire Dynamics Simulations Leidong Xu et.al. 2412.17146 null
2024-12-22 Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs Rushendra Sidibomma et.al. 2412.17131 null
2024-12-22 Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models Cameron R. Jones et.al. 2412.17128 null
2024-12-22 Learning to Adapt to Low-Resource Paraphrase Generation Zhigen Li et.al. 2412.17111 null
2024-12-22 DreamOmni: Unified Image Generation and Editing Bin Xia et.al. 2412.17098 null
2024-12-22 Analysis on LLMs Performance for Code Summarization Md. Ahnaf Akib et.al. 2412.17094 null
2024-12-22 SAIL: Sample-Centric In-Context Learning for Document Information Extraction Jinyu Zhang et.al. 2412.17092 link
2024-12-22 SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults Jinzhi Wang et.al. 2412.17077 null
2024-12-22 The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM’s Internal States Fabian Ridder et.al. 2412.17056 link
2024-12-22 DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately Huiwen Wu et.al. 2412.17053 null
2024-12-22 ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues Shaina Raza et.al. 2412.17052 link
2024-12-22 Modular Conversational Agents for Surveys and Interviews Jiangbo Yu et.al. 2412.17049 null
2024-12-22 Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective Hankun Wang et.al. 2412.17048 null
2024-12-22 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-22 HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories Eric Hedlin et.al. 2412.17040 null
2024-12-22 Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model Kevin Shen et.al. 2412.17039 null
2024-12-22 Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models Lang Gao et.al. 2412.17034 null
2024-12-22 MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge Jie He et.al. 2412.17032 null
2024-12-22 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos Zhengqian Wu et.al. 2412.17022 link
2024-12-22 GAS: Generative Auto-bidding with Post-training Search Yewen Li et.al. 2412.17018 null
2024-12-22 Robustness of Large Language Models Against Adversarial Attacks Yiyi Tao et.al. 2412.17011 null
2024-12-22 InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions Ronghui Li et.al. 2412.16982 null
2024-12-22 On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora Tzu-Chieh Chen et.al. 2412.16976 null
2024-12-22 Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs Alexander von Recum et.al. 2412.16974 null
2024-12-22 Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach Chunxu Zhang et.al. 2412.16969 link
2024-12-22 System-2 Mathematical Reasoning via Enriched Instruction Tuning Huanqia Cai et.al. 2412.16964 null
2024-12-22 Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework Jundong Xu et.al. 2412.16953 null
2024-12-22 A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation Ekai Hashimoto et.al. 2412.16943 null
2024-12-22 Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.16936 null
2024-12-22 Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models Kai Zheng et.al. 2412.16933 null
2024-12-22 Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs Bohan Jin et.al. 2412.16922 null
2024-12-22 Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection Yuhang Gan et.al. 2412.16918 null
2024-12-22 Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation Quan Dao et.al. 2412.16906 null
2024-12-22 Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model Songjun Tu et.al. 2412.16878 link
2024-12-20 HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding Chenxin Tao et.al. 2412.16158 null
2024-12-20 Can Generative Video Models Help Pose Estimation? Ruojin Cai et.al. 2412.16155 null
2024-12-20 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 link
2024-12-20 Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation Seyedreza Mohseni et.al. 2412.16135 null
2024-12-20 Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information Dirk Bergemann et.al. 2412.16132 null
2024-12-20 PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics Daniil Larionov et.al. 2412.16120 null
2024-12-20 Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts Muhammad Abdullah Sohail et.al. 2412.16119 link
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-20 The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse Mahyar Habibi et.al. 2412.16114 null
2024-12-20 Logical Consistency of Large Language Models in Fact-checking Bishwamittra Ghosh et.al. 2412.16100 null
2024-12-20 The Evolution of LLM Adoption in Industry Data Curation Practices Crystal Qian et.al. 2412.16089 null
2024-12-20 Efficient MedSAMs: Segment Anything in Medical Images on Laptop Jun Ma et.al. 2412.16085 link
2024-12-20 Formal Mathematical Reasoning: A New Frontier in AI Kaiyu Yang et.al. 2412.16075 null
2024-12-20 The Only Way is Ethics: A Guide to Ethical Research with Large Language Models Eddie L. Ungless et.al. 2412.16022 link
2024-12-20 Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support Qijiong Liu et.al. 2412.15973 link
2024-12-20 From General to Specific: Tailoring Large Language Models for Personalized Healthcare Ruize Shi et.al. 2412.15957 null
2024-12-20 Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring Markus Borg et.al. 2412.15948 null
2024-12-20 Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation Gautier Evennou et.al. 2412.15939 link
2024-12-20 Large Language Model assisted Hybrid Fuzzing Ruijie Meng et.al. 2412.15931 null
2024-12-20 MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection Andrea Moglia et.al. 2412.15925 link
2024-12-20 RiTTA: Modeling Event Relations in Text-to-Audio Generation Yuhang He et.al. 2412.15922 link
2024-12-20 Less is More: Towards Green Code Large Language Models via Unified Structural Pruning Guang Yang et.al. 2412.15921 null
2024-12-20 Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model Yosuke Yamagishi et.al. 2412.15907 null
2024-12-20 Evaluation of Reliability Criteria for News Publishers with Large Language Models Manuel Pratelli et.al. 2412.15896 null
2024-12-20 TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain Camille Barboule et.al. 2412.15891 null
2024-12-20 AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI Katja Bühler et.al. 2412.15876 null
2024-12-20 Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback Jiaming Ji et.al. 2412.15838 link
2024-12-20 WebLLM: A High-Performance In-Browser LLM Inference Engine Charlie F. Ruan et.al. 2412.15803 link
2024-12-20 Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Sungjin Park et.al. 2412.15797 null
2024-12-20 GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning Heming Zhang et.al. 2412.15790 null
2024-12-20 Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech Jonathan Heitz et.al. 2412.15772 link
2024-12-20 Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference Jorge García-Carrasco et.al. 2412.15750 link
2024-12-20 Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models Shamus Sim et.al. 2412.15748 null
2024-12-20 VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models Dexter Neo et.al. 2412.15739 null
2024-12-20 AutoLife: Automatic Life Journaling with Smartphones and LLMs Huatao Xu et.al. 2412.15714 null
2024-12-20 Contrastive Learning for Task-Independent SpeechLLM-Pretraining Maike Züfle et.al. 2412.15712 link
2024-12-20 Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback Niklas Ippisch et.al. 2412.15702 null
2024-12-20 Code Review Automation Via Multi-task Federated LLM – An Empirical Study Jahnavi Kumar et.al. 2412.15676 null
2024-12-20 Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline Guancheng Zeng et.al. 2412.15660 null
2024-12-20 Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class Annie D’souza et.al. 2412.15657 null
2024-12-20 MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Sieun Hyeon et.al. 2412.15655 link
2024-12-20 Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution Wentao Tan et.al. 2412.15650 null
2024-12-20 Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model Xin Du et.al. 2412.15634 link
2024-12-20 Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning? Mengyu Ye et.al. 2412.15628 null
2024-12-20 JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs Hongyi Li et.al. 2412.15623 null
2024-12-20 Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Zhi Gao et.al. 2412.15606 null
2024-12-20 Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan et.al. 2412.15605 link
2024-12-20 Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification Gyutae Park et.al. 2412.15603 null
2024-12-20 Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation Xiaoqiang Kang et.al. 2412.15594 link
2024-12-20 NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization Danial Kamali et.al. 2412.15588 link
2024-12-20 To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models Jessica Y. Bo et.al. 2412.15584 null
2024-12-20 A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation Ryien Hosseini et.al. 2412.15582 null
2024-12-20 Score-based Generative Diffusion Models for Social Recommendations Chengyi Liu et.al. 2412.15579 link
2024-12-20 QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning Xinyang Tong et.al. 2412.15576 null
2024-12-20 J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM Takero Yoshida et.al. 2412.15574 null
2024-12-20 Continual Learning Using a Kernel-Based Method Over Foundation Models Saleh Momeni et.al. 2412.15571 link
2024-12-20 DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation Yichun Tai et.al. 2412.15570 link
2024-12-20 In-context Continual Learning Assisted by an External Continual Learner Saleh Momeni et.al. 2412.15563 null
2024-12-20 NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning Zheyuan Zhang et.al. 2412.15547 null
2024-12-20 MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering Zhang Siyue et.al. 2412.15540 null
2024-12-20 XRAG: eXamining the Core – Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation Qianren Mao et.al. 2412.15529 link
2024-12-20 HREF: Human Response-Guided Evaluation of Instruction Following in Language Models Xinxi Lyu et.al. 2412.15524 link
2024-12-20 PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time Alireza Pourali et.al. 2412.15519 link
2024-12-20 Stylish and Functional: Guided Interpolation Subject to Physical Constraints Yan-Ying Chen et.al. 2412.15507 null
2024-12-20 Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework Zhenjie Xu et.al. 2412.15504 link
2024-12-20 Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models Zhisheng Tang et.al. 2412.15501 null
2024-12-20 TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use Junjie Ye et.al. 2412.15495 link
2024-12-20 PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search Jiaxin Wu et.al. 2412.15494 null
2024-12-20 GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators Hengjia Li et.al. 2412.15491 null
2024-12-20 Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Saehyung Lee et.al. 2412.15484 null
2024-12-20 Continual Learning Using Only Large Language Model Prompting Jiabao Qiu et.al. 2412.15479 null
2024-12-19 TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models Ammar N. Abbas et.al. 2412.15462 null
2024-12-19 Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization Sahil Wadhwa et.al. 2412.15453 null
2024-12-19 AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals Angela Mastrianni et.al. 2412.15444 null
2024-12-19 SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval Aakash Mahalingam et.al. 2412.15443 null
2024-12-19 Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models Tianchen Zhang et.al. 2412.15431 null
2024-12-19 MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale Swapnil Gandhi et.al. 2412.15411 null
2024-12-19 Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification Edoardo Allegrini et.al. 2412.15410 null
2024-12-19 Systematic Evaluation of Long-Context LLMs on Financial Concepts Lavanya Gupta et.al. 2412.15386 null
2024-12-19 Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation Joanne Boisson et.al. 2412.15375 link
2024-12-19 Automated Root Cause Analysis System for Complex Data Products Mathieu Demarne et.al. 2412.15374 null
2024-12-19 Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs Liam Seymour et.al. 2412.15352 link
2024-12-19 Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models Reza Shirkavand et.al. 2412.15341 null
2024-12-19 Complete background cosmology of parity-even quadratic metric-affine gravity Thomas Dyer et.al. 2412.15329 null
2024-12-19 OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving Shuo Xing et.al. 2412.15208 link
2024-12-19 MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark Qihao Zhao et.al. 2412.15194 link
2024-12-19 LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation Weijia Shi et.al. 2412.15188 null
2024-12-19 Tiled Diffusion Or Madar et.al. 2412.15185 null
2024-12-19 Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning Simon Frieder et.al. 2412.15184 null
2024-12-19 STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning Marius Memmel et.al. 2412.15182 null
2024-12-19 HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages Aman Chaturvedi et.al. 2412.15178 null
2024-12-19 Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying Federico Castagna et.al. 2412.15177 link
2024-12-19 Rethinking Uncertainty Estimation in Natural Language Generation Lukas Aichberger et.al. 2412.15176 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Language Models as Continuous Self-Evolving Data Engineers Peidong Wang et.al. 2412.15151 null
2024-12-19 Jet: A Modern Transformer-Based Normalizing Flow Alexander Kolesnikov et.al. 2412.15129 null
2024-12-19 Adaptive Pruning for Large Language Models with Structural Importance Awareness Haotian Zheng et.al. 2412.15127 null
2024-12-19 Outcome-Refining Process Supervision for Code Generation Zhuohao Yu et.al. 2412.15118 link
2024-12-19 Qwen2.5 Technical Report Qwen et.al. 2412.15115 link
2024-12-19 Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture Thomas F Burns et.al. 2412.15113 link
2024-12-19 Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation Yang Tian et.al. 2412.15109 link
2024-12-19 Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability Xiangsen Chen et.al. 2412.15101 null
2024-12-19 Nano-ESG: Extracting Corporate Sustainability Information from News Articles Fabian Billert et.al. 2412.15093 link
2024-12-19 Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation Haoran Liu et.al. 2412.15086 null
2024-12-19 ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots Bhupendra Acharya et.al. 2412.15072 null
2024-12-19 ConfliBERT: A Language Model for Political Conflict Patrick T. Brandt et.al. 2412.15060 link
2024-12-19 LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Felix Friedrich et.al. 2412.15035 null
2024-12-19 DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space Mang Ning et.al. 2412.15032 link
2024-12-19 Large Language Models and Code Security: A Systematic Literature Review Enna Basic et.al. 2412.15004 null
2024-12-19 HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs Pham Vu Tuan Dat et.al. 2412.14995 link
2024-12-19 RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning Raphael Memmesheimer et.al. 2412.14989 null
2024-12-19 Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts Ioana Buhnila et.al. 2412.14986 null
2024-12-19 AI and Cultural Context: An Empirical Investigation of Large Language Models’ Performance on Chinese Social Work Professional Standards Zia Qi et.al. 2412.14971 null
2024-12-19 Movie2Story: A framework for understanding videos and telling stories in the form of novel text Kangning Li et.al. 2412.14965 null
2024-12-19 Knowledge Injection via Prompt Distillation Kalle Kujanpää et.al. 2412.14964 null
2024-12-19 Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities Daniil Medyakov et.al. 2412.14935 null
2024-12-19 RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Junyu Luo et.al. 2412.14922 link
2024-12-19 Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation Zexiong Ma et.al. 2412.14905 null
2024-12-19 Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering Peize Li et.al. 2412.14880 null
2024-12-19 Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering Imed Keraghel et.al. 2412.14867 null
2024-12-19 Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling Junyi Li et.al. 2412.14860 null
2024-12-19 DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis Hongling Xu et.al. 2412.14849 link
2024-12-19 Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas Pietro Bernardelle et.al. 2412.14843 null
2024-12-19 Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis Greta Dolcetti et.al. 2412.14841 null
2024-12-19 Progressive Multimodal Reasoning via Active Retrieval Guanting Dong et.al. 2412.14835 null
2024-12-19 Answer Set Networks: Casting Answer Set Programming into Deep Learning Arseny Skryagin et.al. 2412.14814 link
2024-12-19 ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis Zeao Tu et.al. 2412.14809 link
2024-12-19 Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning Ziang Ye et.al. 2412.14780 null
2024-12-19 ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine Rabee Qasem et.al. 2412.14771 null
2024-12-19 PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children Yiqun Zhang et.al. 2412.14769 link
2024-12-19 CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering Ruida Hu et.al. 2412.14764 link
2024-12-19 Query pipeline optimization for cancer patient question answering systems Maolin He et.al. 2412.14751 null
2024-12-19 Active Inference and Human–Computer Interaction Roderick Murray-Smith et.al. 2412.14741 null
2024-12-19 On Verbalized Confidence Scores for LLMs Daniel Yang et.al. 2412.14737 link
2024-12-19 Creation of AI-driven Smart Spaces for Enhanced Indoor Environments – A Survey Aygün Varol et.al. 2412.14708 null
2024-12-19 LLMs as mediators: Can they diagnose conflicts accurately? Özgecan Koçak et.al. 2412.14675 null
2024-12-19 Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT Hassane Kissane et.al. 2412.14670 null
2024-12-19 IOHunter: Graph Foundation Model to Uncover Online Information Operations Marco Minici et.al. 2412.14663 link
2024-12-19 Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models Zijun Chen et.al. 2412.14660 link
2024-12-19 Length Controlled Generation for Black-box LLMs Yuxuan Gu et.al. 2412.14656 null
2024-12-19 Learning to Generate Research Idea with Dynamic Control Ruochen Li et.al. 2412.14626 null
2024-12-19 How good is GPT at writing political speeches for the White House? Jacques Savoy et.al. 2412.14617 null
2024-12-19 Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning Kepu Zhang et.al. 2412.14588 null
2024-12-19 HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning Minkuk Kim et.al. 2412.14585 null
2024-12-19 Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues Tao He et.al. 2412.14584 null
2024-12-19 CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation Youngwon Lee et.al. 2412.14581 null
2024-12-19 DiffSim: Taming Diffusion Models for Evaluating Visual Similarity Yiren Song et.al. 2412.14580 link
2024-12-19 Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models Wenhan Liu et.al. 2412.14574 link
2024-12-19 ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model Shunlin Lu et.al. 2412.14559 null
2024-12-19 The Current Challenges of Software Engineering in the Era of Large Language Models Cuiyun Gao et.al. 2412.14554 null
2024-12-19 Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models Xiao Cui et.al. 2412.14528 link
2024-12-19 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment Teng Xiao et.al. 2412.14516 link
2024-12-19 Relational Programming with Foundation Models Ziyang Li et.al. 2412.14515 null
2024-12-19 PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization Jiayi Wu et.al. 2412.14510 link
2024-12-19 Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs Yuzuki Arai et.al. 2412.14501 null
2024-12-19 Guided Diffusion Model for Sensor Data Obfuscation Xin Yang et.al. 2412.14499 null
2024-12-19 FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis Abdullah Khan et.al. 2412.14492 link
2024-12-19 Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities Amandeep Kaur et.al. 2412.14486 null
2024-12-19 DirectorLLM for Human-Centric Video Generation Kunpeng Song et.al. 2412.14484 null
2024-12-19 Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs Koshiro Saito et.al. 2412.14471 null
2024-12-19 Agent-SafetyBench: Evaluating the Safety of LLM Agents Zhexin Zhang et.al. 2412.14470 link
2024-12-19 From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research Xiang Cheng et.al. 2412.14461 null
2024-12-19 LEDiff: Latent Exposure Diffusion for HDR Generation Chao Wang et.al. 2412.14456 null
2024-12-19 Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems Genki Kusano et.al. 2412.14454 null
2024-12-19 Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation Shengqi Liu et.al. 2412.14453 null
2024-12-19 ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study Eric Modesitt et.al. 2412.14436 link
2024-12-19 All-in-One Tuning and Structural Pruning for Domain-Specific LLMs Lei Lu et.al. 2412.14426 null
2024-12-19 FedPIA – Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning Pramit Saha et.al. 2412.14424 null
2024-12-19 Enhancing Diffusion Models for High-Quality Image Generation Jaineet Shah et.al. 2412.14422 null
2024-12-18 ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers Haowei Liu et.al. 2412.14405 null
2024-12-18 Clinical Trials Ontology Engineering with Large Language Models Berkan Çakır et.al. 2412.14387 null
2024-12-18 ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling William Han et.al. 2412.14373 link
2024-12-18 Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models’ Character Understanding Evaluation Yuxuan Jiang et.al. 2412.14368 null
2024-12-18 Surrealistic-like Image Generation with Vision-Language Models Elif Ayten et.al. 2412.14366 link
2024-12-18 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena et.al. 2412.14363 link
2024-12-18 A Unifying Information-theoretic Perspective on Evaluating Generative Models Alexis Fox et.al. 2412.14340 null
2024-12-18 Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation Benjamin Steenhoek et.al. 2412.14308 null
2024-12-18 Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs David Restrepo et.al. 2412.14304 null
2024-12-18 Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data haina Raza et.al. 2412.14276 link
2024-12-18 Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Jihan Yang et.al. 2412.14171 link
2024-12-18 MetaMorph: Multimodal Understanding and Generation via Instruction Tuning Shengbang Tong et.al. 2412.14164 null
2024-12-18 TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Frank F. Xu et.al. 2412.14161 link
2024-12-18 Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models Atin Sakkeer Hussain et.al. 2412.14146 null
2024-12-18 LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research Tianyang Gu et.al. 2412.14141 null

Video Understanding

Publish Date Title Authors PDF Code
2025-01-31 Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search Yuta Oshima et.al. 2501.19252 null
2025-01-31 $\infty$ -Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Saul Santos et.al. 2501.19098 link
2025-01-30 Every Image Listens, Every Image Dances: Music-Driven Image Animation Zhikang Dong et.al. 2501.18801 null
2025-01-30 MAMS: Model-Agnostic Module Selection Framework for Video Captioning Sangho Lee et.al. 2501.18269 null
2025-01-28 Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding Yun Li et.al. 2501.16786 null
2025-01-28 CascadeV: An Implementation of Wurstchen Architecture for Video Generation Wenfeng Lin et.al. 2501.16612 link
2025-01-27 AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models Zheng Lian et.al. 2501.16566 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-26 TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Xingjian Zhang et.al. 2501.15513 link
2025-01-26 “See What I Imagine, Imagine What I See”: Human-AI Co-Creation System for 360 $^\circ$ Panoramic Video Generation in VR Yunge Wen et.al. 2501.15456 null
2025-01-25 HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding Jiaxing Zhao et.al. 2501.15111 null
2025-01-25 VideoPure: Diffusion-based Adversarial Purification for Video Recognition Kaixun Jiang et.al. 2501.14999 link
2025-01-11 HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators Le Chen et.al. 2501.14794 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-24 ENTER: Event Based Interpretable Reasoning for VideoQA Hammad Ayyubi et.al. 2501.14194 null
2025-01-30 Temporal Preference Optimization for Long-Form Video Understanding Rui Li et.al. 2501.13919 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Jianxin Liang et.al. 2501.13536 null
2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link
2025-01-23 EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion Jiangchuan Wei et.al. 2501.13452 null
2025-01-28 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-21 Taming Teacher Forcing for Masked Autoregressive Video Generation Deyu Zhou et.al. 2501.12389 null
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Yuhang Zang et.al. 2501.12368 link
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Zheng Chong et.al. 2501.11325 null
2025-02-03 HFGCN:Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition Pengcheng Dong et.al. 2501.11007 null
2025-01-18 EMO2: End-Effector Guided Audio-Driven Avatar Video Generation Linrui Tian et.al. 2501.10687 null
2025-01-17 DiffuEraser: A Diffusion Model for Video Inpainting Xiaowen Li et.al. 2501.10018 link
2025-02-02 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Yuefan Cao et.al. 2501.09982 null
2025-01-16 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Zhongwei Ren et.al. 2501.09781 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755 null
2025-01-14 Do generative video models learn physical principles from watching videos? Saman Motamed et.al. 2501.09038 link
2025-01-15 Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion Jingyuan Chen et.al. 2501.09019 null
2025-01-15 RepVideo: Rethinking Cross-Layer Representation for Video Generation Chenyang Si et.al. 2501.08994 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-31 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-14 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Weichen Fan et.al. 2501.08453 null
2025-01-14 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering Meenakshi Krishnan et.al. 2501.08370 null
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 GameFactory: Creating New Games with Generative Interactive Videos Jiwen Yu et.al. 2501.08325 null
2025-01-14 Diffusion Adversarial Post-Training for One-Step Video Generation Shanchuan Lin et.al. 2501.08316 null
2025-01-17 LayerAnimate: Layer-specific Control for Animation Yuxue Yang et.al. 2501.08295 null
2025-01-14 FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Yabo Zhang et.al. 2501.08225 link
2025-01-14 Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jiaxing Zhao et.al. 2501.07978 null
2025-01-24 Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Liping Yuan et.al. 2501.07888 link
2025-01-14 AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation Sitong Gong et.al. 2501.07810 link
2025-01-13 BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Weixi Feng et.al. 2501.07647 null
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-17 MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning Tieyuan Chen et.al. 2501.07227 null
2025-01-13 TimeLogic: A Temporal Logic Benchmark for Video QA Sirnam Swetha et.al. 2501.07214 null
2025-01-13 Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling Jiebin Yan et.al. 2501.07087 null
2025-01-12 X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding Wenqi Zhou et.al. 2501.06835 null
2025-01-12 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Ji Soo Lee et.al. 2501.06761 link
2025-01-11 Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning Maomao Li et.al. 2501.06438 null
2025-01-10 MEt3R: Measuring Multi-View Consistency in Generated Images Mohammad Asim et.al. 2501.06336 null
2025-01-10 Multi-subject Open-set Personalization in Video Generation Tsai-Shien Chen et.al. 2501.06187 null
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-13 Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Ziheng Wu et.al. 2501.05901 link
2025-01-10 Zero-shot Shark Tracking and Biometrics from Aerial Imagery Chinmay K Lalgudi et.al. 2501.05717 null
2025-01-10 From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities Dominick Reilly et.al. 2501.05711 link
2025-01-09 OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Yifei Li et.al. 2501.05510 link
2025-01-08 Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion Yongjia Ma et.al. 2501.05484 null
2025-01-09 Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces Aniruddha Mahapatra et.al. 2501.05442 null
2025-01-09 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Huabin Liu et.al. 2501.05069 null
2025-01-09 LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding Jiaxing Zhao et.al. 2501.05067 null
2025-01-09 LongViTU: Instruction Tuning for Long-Form Video Understanding Rujie Wu et.al. 2501.05037 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-08 ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Yuzhou Huang et.al. 2501.04698 null
2025-01-08 Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs Zeyi Huang et.al. 2501.04336 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition Bowen Hao et.al. 2501.04204 null
2024-12-18 FlexCache: Flexible Approximate Cache System for Video Diffusion Desen Sun et.al. 2501.04012 null
2025-01-07 Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Yuechen Zhang et.al. 2501.03931 link
2025-01-09 Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Zekai Gu et.al. 2501.03847 link
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-06 License Plate Images Generation with Diffusion Models Mariia Shpir et.al. 2501.03374 null
2025-01-03 Classifier-Guided Captioning Across Modalities Ariel Shaulov et.al. 2501.03183 null
2025-01-06 Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Guy Yariv et.al. 2501.03059 null
2025-01-20 TransPixeler: Advancing Text-to-Video Generation with Transparency Luozhou Wang et.al. 2501.03006 link
2025-01-06 MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Wenyi Hong et.al. 2501.02955 null
2025-01-06 Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising Yunlong Yuan et.al. 2501.02741 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-29 Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey Zongxia Li et.al. 2501.02189 link
2025-01-10 Gender Bias in Text-to-Video Generation Models: A case study of Sora Mohammad Nadeem et.al. 2501.01987 null
2024-12-30 FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models Tianyu Fu et.al. 2501.01986 link
2025-01-03 JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing Qili Wang et.al. 2501.01798 link
2025-01-03 HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding Heqing Zou et.al. 2501.01645 null
2025-01-07 VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Yuanpeng Tu et.al. 2501.01427 null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 link
2025-01-03 Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions Xincheng Shuai et.al. 2501.01425 null
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-29 Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform Cheonsu Jeong et.al. 2501.00750 null
2025-01-03 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2025-01-08 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2024-12-31 Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Zhenpeng Huang et.al. 2501.00584 null
2024-12-31 Fine-grained Video-Text Retrieval: A New Benchmark and Method Yifan Xu et.al. 2501.00513 null
2024-12-31 OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models Lala Shakti Swarup Ray et.al. 2501.00432 null
2025-01-09 Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding Yue Fan et.al. 2501.00358 null
2024-12-30 Detection-Fusion for Knowledge Graph Extraction from Videos Taniya Das et.al. 2501.00136 link
2024-12-30 LTX-Video: Realtime Video Latent Diffusion Yoav HaCohen et.al. 2501.00103 link
2024-12-30 Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model Yifei Huang et.al. 2412.21080 link
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 Hierarchical Banzhaf Interaction for General Video-Language Representation Learning Peng Jin et.al. 2412.20964 link
2024-12-30 ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation Ting Zhang et.al. 2412.20901 null
2024-12-30 Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling Min Zhang et.al. 2412.20725 null
2025-01-05 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link
2024-12-29 Open-Sora: Democratizing Efficient Video Production for All Zangwei Zheng et.al. 2412.20404 link
2024-12-28 DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments Xijun Wang et.al. 2412.20042 null
2025-01-17 MVTamperBench: Evaluating Robustness of Vision-Language Models Amit Agarwal et.al. 2412.19794 null
2024-12-27 Generative Video Propagation Shaoteng Liu et.al. 2412.19761 null
2024-12-30 VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models Tao Wu et.al. 2412.19645 null
2024-12-30 DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-26 Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso et.al. 2412.19304 null
2024-12-25 Accelerating Diffusion Transformers with Dual Feature Caching Chang Zou et.al. 2412.18911 link
2024-12-24 Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation Faraz Waseem et.al. 2412.18688 null
2024-12-24 Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models Jinhui Yi et.al. 2412.18609 link
2024-12-24 DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers Yuntao Chen et.al. 2412.18607 null
2024-12-24 ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation Hongjie Li et.al. 2412.18600 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-23 Large Motion Video Autoencoding with Cross-modal Video VAE Yazhou Xing et.al. 2412.17805 null
2024-12-23 VidTwin: Video VAE with Decoupled Structure and Dynamics Yuchi Wang et.al. 2412.17726 link
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-23 VidCtx: Context-aware Video Question Answering with Image Models Andreas Goulas et.al. 2412.17415 null
2024-12-23 FFA Sora, video generation as fundus fluorescein angiography simulator Xinyuan Wu et.al. 2412.17346 null
2024-12-23 Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory Xingyao Li et.al. 2412.17254 null
2024-12-22 SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults Jinzhi Wang et.al. 2412.17077 null
2025-01-08 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-22 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos Zhengqian Wu et.al. 2412.17022 link
2024-12-22 Video Domain Incremental Learning for Human Action Recognition in Home Environments Yuanda Hu et.al. 2412.16946 null
2024-12-21 GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space Souhaib Attaiki et.al. 2412.16717 null
2024-12-21 TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models Haocheng Huang et.al. 2412.16700 null
2024-12-21 VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation Chi Zhang et.al. 2412.16677 null
2024-12-25 Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance Beiyuan Zhang et.al. 2412.16495 null
2024-12-18 ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping Youxin Pang et.al. 2412.16212 null
2024-12-17 Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation Yiping Wang et.al. 2412.16211 null
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-20 DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization Zihan Ding et.al. 2412.15689 null
2024-12-23 CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training Xiuli Bi et.al. 2412.15646 link
2024-12-20 PolySmart @ TRECVid 2024 Medical Video Question Answering Jiaxin Wu et.al. 2412.15514 null
2024-12-19 AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation Moayed Haji-Ali et.al. 2412.15191 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Parallelized Autoregressive Visual Generation Yuqing Wang et.al. 2412.15119 null
2024-12-19 Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations Yucheng Hu et.al. 2412.14803 null
2024-12-19 HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning Minkuk Kim et.al. 2412.14585 null
2024-12-19 Consistent Human Image and Video Generation with Spatially Conditioned Diffusion Mingdeng Cao et.al. 2412.14531 link
2024-12-19 DirectorLLM for Human-Centric Video Generation Kunpeng Song et.al. 2412.14484 null
2024-12-18 Learning from Massive Human Videos for Universal Humanoid Pose Control Jiageng Mao et.al. 2412.14172 null
2024-12-18 Autoregressive Video Generation without Vector Quantization Haoge Deng et.al. 2412.14169 link
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167 null
2024-12-29 AKiRa: Augmentation Kit on Rays for optical video generation Xi Wang et.al. 2412.14158 null
2024-12-18 SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation Tong Chen et.al. 2412.14018 null
2024-12-18 InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models Cong Wei et.al. 2412.14006 link
2024-12-18 Do Language Models Understand Time? Xi Ding et.al. 2412.13845 link
2024-12-19 G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o Tony Cheng Tong et.al. 2412.13647 link
2024-12-18 Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning Yunbin Tu et.al. 2412.13543 null
2024-12-18 Real-time One-Step Diffusion-based Expressive Portrait Videos Generation Hanzhong Guo et.al. 2412.13479 link
2024-12-18 SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation Kazuki Shimada et.al. 2412.13462 null
2024-12-17 CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices Andrei Znobishchev et.al. 2412.13273 null
2025-01-07 MotionBridge: Dynamic Video Inbetweening with Flexible Controls Maham Tanveer et.al. 2412.13190 null
2024-12-17 VidTok: A Versatile and Open-Source Video Tokenizer Anni Tang et.al. 2412.13061 link
2024-12-17 FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering Zheng Cheng et.al. 2412.12833 null
2024-12-17 Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Shiping Ge et.al. 2412.12791 link
2024-12-17 ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries Wangyu Xue et.al. 2412.12675 null
2024-12-16 Can video generation replace cinematographers? Research on the cinematic language of generated video Xiaozhe Li et.al. 2412.12223 null
2024-12-16 CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding Guo Chen et.al. 2412.12075 null
2024-12-16 InterDyn: Controllable Interactive Dynamics with Video Diffusion Models Rick Akkerman et.al. 2412.11785 null
2024-12-16 Generative Inbetweening through Frame-wise Conditions-Driven Video Generation Tianyi Zhu et.al. 2412.11755 link
2024-12-16 VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Muhammet Furkan Ilaslan et.al. 2412.11621 link
2024-12-16 Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning Zhuyang Xie et.al. 2412.11467 null
2024-12-15 Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition Yulin Wang et.al. 2412.11228 link
2024-12-15 GenLit: Reformulating Single-Image Relighting as Video Generation Shrisha Bharadwaj et.al. 2412.11224 null
2024-12-15 DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes Jinxiu Liu et.al. 2412.11100 null
2024-12-15 Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track Deepak Gupta et.al. 2412.11056 null
2024-12-20 Video Diffusion Transformers are In-Context Learners Zhengcong Fei et.al. 2412.10783 link
2024-12-14 Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives Ji-jun Park et.al. 2412.10720 null
2024-12-13 SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device Yushu Wu et.al. 2412.10494 null
2024-12-12 VCA: Video Curious Agent for Long Video Understanding Zeyuan Yang et.al. 2412.10471 null
2024-12-17 SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization Zhentao Tan et.al. 2412.10443 null
2024-12-11 COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework Xin Dong et.al. 2412.10435 null
2024-12-13 Apollo: An Exploration of Video Understanding in Large Multimodal Models Orr Zohar et.al. 2412.10360 null
2024-12-16 TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation Xingrui Wang et.al. 2412.10275 null
2024-12-19 AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era Yudong Jiang et.al. 2412.10255 link
2024-12-13 B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens Zhuqiang Lu et.al. 2412.09919 link
2024-12-16 IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs Sosuke Yamao et.al. 2412.09907 null
2024-12-13 LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Hongjie Wang et.al. 2412.09856 null
2024-12-13 MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion Xunnong Xu et.al. 2412.09828 null
2024-12-17 ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Ali Athar et.al. 2412.09754 null
2024-12-11 Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model Junqi You et.al. 2412.09647 null
2024-12-16 Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Fan Zhang et.al. 2412.09645 link
2024-12-12 Doe-1: Closed-Loop Autonomous Driving with Large World Model Wenzhao Zheng et.al. 2412.09627 link
2024-12-12 OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation Weiqi Li et.al. 2412.09623 null
2024-12-12 PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models Chenyu Yang et.al. 2412.09613 null
2024-12-12 Owl-1: Omni World Model for Consistent Long Video Generation Yuanhui Huang et.al. 2412.09600 link
2024-12-12 LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors Yabo Chen et.al. 2412.09597 null
2024-12-12 Neptune: The Long Orbit to Benchmarking Long Video Understanding Arsha Nagrani et.al. 2412.09582 link
2024-12-12 Video Creation by Demonstration Yihong Sun et.al. 2412.09551 null
2024-12-12 Agent-based Video Trimming Lingfeng Yang et.al. 2412.09513 null
2024-12-12 UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer Delong Liu et.al. 2412.09389 link
2024-12-12 T-SVG: Text-Driven Stereoscopic Video Generation Qiao Jin et.al. 2412.09323 null
2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null
2024-12-12 Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering Sai Bhargav Rongali et.al. 2412.09230 null
2024-12-12 LVMark: Robust Watermark for latent video diffusion models MinHyuk Jang et.al. 2412.09122 null
2024-12-12 Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation Lianrui Mu et.al. 2412.08976 null
2024-12-12 Mojito: Motion Trajectory and Intensity Control for Video Generation Xuehai He et.al. 2412.08948 null
2024-12-11 Generative Semantic Communication: Architectures, Technologies, and Applications Jinke Ren et.al. 2412.08642 null
2024-12-13 Physical Informed Driving World Model Zhuoran Yang et.al. 2412.08410 null
2024-12-11 FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks Chongkai Gao et.al. 2412.08261 null
2024-12-11 VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation Zhiqiang Yuan et.al. 2412.08259 null
2024-12-10 3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark Wufei Ma et.al. 2412.07825 null
2024-12-11 UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Xi Chen et.al. 2412.07774 null
2024-12-10 From Slow Bidirectional to Fast Causal Video Generators Tianwei Yin et.al. 2412.07772 null
2024-12-10 SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Jianhong Bai et.al. 2412.07760 link
2024-12-10 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Xiao Fu et.al. 2412.07759 null
2024-12-10 Multi-Shot Character Consistency for Text-to-Video Generation Yuval Atzmon et.al. 2412.07750 null
2024-12-10 StyleMaster: Stylize Your Video with Artistic Generation and Translation Zixuan Ye et.al. 2412.07744 null
2024-12-10 STIV: Scalable Text and Image Conditioned Video Generation Zongyu Lin et.al. 2412.07730 null
2024-12-10 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu et.al. 2412.07720 link
2024-12-10 GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning Yicheng Wang et.al. 2412.07704 null
2024-12-10 Multimodal Contextualized Support for Enhancing Video Retrieval System Quoc-Bao Nguyen-Le et.al. 2412.07584 null
2024-12-19 Multi-Scale Contrastive Learning for Video Temporal Grounding Thong Thanh Nguyen et.al. 2412.07157 null
2024-12-09 SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations Zhaorun Chen et.al. 2412.06878 null
2024-12-09 VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features Sifei Li et.al. 2412.06296 null
2024-12-11 Towards Long Video Understanding via Fine-detailed Video Story Generation Zeng You et.al. 2412.06182 null
2024-12-08 Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training Zhenghong Zhou et.al. 2412.06029 null
2024-12-08 FlexDiT: Dynamic Token Density Control for Diffusion Transformer Shuning Chang et.al. 2412.06028 null
2024-12-10 Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation Hyeonho Jeong et.al. 2412.06016 null
2024-12-08 Accelerating Video Diffusion Models via Distribution Matching Yuanzhi Zhu et.al. 2412.05899 null
2024-12-08 MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation Shuwei Shi et.al. 2412.05848 null
2024-12-08 Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval Shanti Stewart et.al. 2412.05831 null
2024-12-08 Self-Guidance: Boosting Flow and Diffusion Generation on Their Own Tiancheng Li et.al. 2412.05827 null
2024-12-07 Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation Leonardo Pina et.al. 2412.05694 null
2024-12-11 Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model Lening Wang et.al. 2412.05280 link
2024-12-17 Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Zhe Chen et.al. 2412.05271 link
2024-12-06 Mind the Time: Temporally-Controlled Multi-Event Video Generation Ziyi Wu et.al. 2412.05263 null
2024-12-11 LinVT: Empower Your Image-level Large Language Model to Understand Videos Lishuai Gao et.al. 2412.05185 link
2024-12-06 Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection Khurram Azeem Hashmi et.al. 2412.04915 null
2024-12-06 UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving Rui Chen et.al. 2412.04842 link
2024-12-12 Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model Keunwoo Peter Yu et.al. 2412.04729 null
2024-12-05 Using Diffusion Priors for Video Amodal Segmentation Kaihua Chen et.al. 2412.04623 null