Updated on 2025.10.22

LLM Reasoning

Publish Date	Title	Authors	PDF	Code
2025-07-23	InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation	Shuai Yang et.al.	2507.17520	null
2025-07-23	MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs	Alexander R. Fabbri et.al.	2507.17476	null
2025-07-23	HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs	Zhaolin Cai et.al.	2507.17394	null
2025-07-23	Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance	Rishi Parekh et.al.	2507.17273	null
2025-07-22	Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning	Junhao Shen et.al.	2507.16814	null
2025-07-22	Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning	Ang Li et.al.	2507.16746	null
2025-07-23	WAKENLLM: Evaluating Reasoning Potential and Stability in LLMs via Fine-Grained Benchmarking	Zipeng Ling et.al.	2507.16199	null
2025-07-21	Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization	Shengchao Liu et.al.	2507.16110	null
2025-07-21	The Impact of Language Mixing on Bilingual LLM Reasoning	Yihao Li et.al.	2507.15849	null
2025-07-21	EgoPrune: Efficient Token Pruning for Egomotion Video Reasoning in Embodied Agent	Jiaao Li et.al.	2507.15428	null
2025-07-20	LEKIA: A Framework for Architectural Alignment via Expert Knowledge Injection	Boning Zhao et.al.	2507.14944	null
2025-07-18	A Simple “Try Again” Can Elicit Multi-Turn LLM Reasoning	Licheng Liu et.al.	2507.14295	null
2025-07-18	Team of One: Cracking Complex Video QA with Model Synergy	Jun Xie et.al.	2507.13820	null
2025-07-17	The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner	Zhouqi Hua et.al.	2507.13332	null
2025-07-17	Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark	Junsu Kim et.al.	2507.13314	null
2025-07-17	HATS: Hindi Analogy Test Set for Evaluating Reasoning in Large Language Models	Ashray Gupta et.al.	2507.13238	null
2025-07-17	Probabilistic Soundness Guarantees in LLM Reasoning Chains	Weiqiu You et.al.	2507.12948	null
2025-07-16	Reasoning Strategies in Large Language Models: Can They Follow, Prefer, and Optimize?	Yanjian Zhang et.al.	2507.11423	null
2025-07-15	KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?	Soumadeep Saha et.al.	2507.11408	null
2025-07-15	Guiding LLM Decision-Making with Fairness Reward Models	Zara Hall et.al.	2507.11344	null
2025-07-15	MSA at ImageCLEF 2025 Multimodal Reasoning: Multilingual Multimodal Reasoning With Ensemble Vision Language Models	Seif Ahmed et.al.	2507.11114	null
2025-07-15	Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation	Yanbo Wang et.al.	2507.11001	null
2025-07-15	Modeling Understanding of Story-Based Analogies Using Large Language Models	Kalit Inani et.al.	2507.10957	null
2025-07-14	Foundation Model Driven Robotics: A Comprehensive Review	Muhammad Tayyab Khan et.al.	2507.10087	null
2025-07-13	Reframing SAR Target Recognition as Visual Reasoning: A Chain-of-Thought Dataset with Multimodal LLMs	Chaoran Li et.al.	2507.09535	null
2025-07-11	GraphRunner: A Multi-Stage Framework for Efficient and Accurate Graph-Based Retrieval	Savini Kashmira et.al.	2507.08945	null
2025-07-11	Leanabell-Prover-V2: Verifier-integrated Reasoning for Formal Theorem Proving via Reinforcement Learning	Xingguang Ji et.al.	2507.08649	null
2025-07-11	ChainEdit: Propagating Ripple Effects in LLM Knowledge Editing through Logical Rule-Guided Chains	Zilu Dong et.al.	2507.08427	null
2025-07-10	ALCo-FM: Adaptive Long-Context Foundation Model for Accident Prediction	Pinaki Prasad Guha Neogi et.al.	2507.08153	null
2025-07-10	MIRA: A Novel Framework for Fusing Modalities in Medical RAG	Jinhong Wang et.al.	2507.07902	null
2025-07-10	The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs	Jierun Chen et.al.	2507.07562	null
2025-07-10	RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning	Hongzhi Zhang et.al.	2507.07451	null
2025-07-11	StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley	Weihao Tan et.al.	2507.07445	null
2025-07-09	MagiC: Evaluating Multimodal Cognition Toward Grounded Visual Reasoning	Chengfei Wu et.al.	2507.07297	null
2025-07-07	DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning	Shreyas Vinaya Sathyanarayana et.al.	2507.07060	null
2025-07-09	First Return, Entropy-Eliciting Explore	Tianyu Zheng et.al.	2507.07017	null
2025-07-09	Learning Deliberately, Acting Intuitively: Unlocking Test-Time Reasoning in Multimodal LLMs	Yahan Yu et.al.	2507.06999	null
2025-07-09	Are They All Good? Evaluating the Quality of CoTs in LLM-based Code Generation	Binquan Zhang et.al.	2507.06980	null
2025-07-10	Rethinking Verification for LLM Code Generation: From Generation to Testing	Zihan Ma et.al.	2507.06920	null
2025-07-09	From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization	Xinjie Chen et.al.	2507.06573	null
2025-07-13	Perception-Aware Policy Optimization for Multimodal Reasoning	Zhenhailong Wang et.al.	2507.06448	null
2025-07-08	Enhancing Scientific Visual Question Answering through Multimodal Reasoning and Ensemble Modeling	Prahitha Movva et.al.	2507.06183	null
2025-07-10	Skywork-R1V3 Technical Report	Wei Shen et.al.	2507.06167	null
2025-07-08	KERAG_R: Knowledge-Enhanced Retrieval-Augmented Generation for Recommendation	Zeyuan Meng et.al.	2507.05863	null
2025-07-09	Measuring how changes in code readability attributes affect code quality evaluation by Large Language Models	Igor Regis da Silva Simoes et.al.	2507.05289	null
2025-07-07	Spatio-Temporal LLM: Reasoning about Environments and Actions	Haozhen Zheng et.al.	2507.05258	null
2025-07-07	Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning	Yana Wei et.al.	2507.05255	null
2025-07-07	MARBLE: A Multi-Agent Rule-Based LLM Reasoning Engine for Accident Severity Prediction	Kaleem Ullah Qasim et.al.	2507.04893	null
2025-07-17	DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge	Wenyao Zhang et.al.	2507.04447	null
2025-07-05	CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes via Chain-of-Thought Reasoning	Jeonghyo Song et.al.	2507.03984	null
2025-07-04	Effects of structure on reasoning in instance-level Self-Discover	Sachith Gunasekara et.al.	2507.03347	null
2025-07-03	RCA Copilot: Transforming Network Data into Actionable Insights via Large Language Models	Alexander Shan et.al.	2507.03224	null
2025-07-03	Improving LLM Reasoning for Vulnerability Detection via Group Relative Policy Optimization	Marco Simoni et.al.	2507.03051	null
2025-07-02	Look-Back: Implicit Visual Re-focusing in MLLM Reasoning	Shuo Yang et.al.	2507.03019	null
2025-07-01	From Answers to Rationales: Self-Aligning Multimodal Reasoning with Answer-Oriented Chain-of-Thought	Wentao Tan et.al.	2507.02984	null
2025-06-26	Large Language Model Agent for Modular Task Execution in Drug Discovery	Janghoon Ock et.al.	2507.02925	null
2025-07-03	MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs	Purbesh Mitra et.al.	2507.02851	null
2025-07-03	Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous Evaluation	Jungkoo Kang et.al.	2507.02253	null
2025-07-02	Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs	Mohammad Ali Alomrani et.al.	2507.02076	null
2025-07-02	GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning	GLM-V Team et.al.	2507.01006	null
2025-07-01	HumanoidGen: Data Generation for Bimanual Dexterous Manipulation via LLM Reasoning	Zhi Jing et.al.	2507.00833	null
2025-07-01	Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning	Maggie Huan et.al.	2507.00432	null
2025-07-01	Causal Prompting for Implicit Sentiment Analysis with Large Language Models	Jing Ren et.al.	2507.00389	null
2025-06-22	TalentMine: LLM-Based Extraction and Question-Answering from Multimodal Talent Tables	Varun Mannam et.al.	2507.00041	null
2025-07-03	Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers	Zhaochen Su et.al.	2506.23918	null
2025-06-30	Interactive Reasoning: Visualizing and Controlling Chain-of-Thought Reasoning in Large Language Models	Rock Yuren Pang et.al.	2506.23678	null
2025-06-30	MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI	Huanjin Yao et.al.	2506.23563	null
2025-06-29	Are Large Language Models Capable of Deep Relational Reasoning? Insights from DeepSeek-R1 and Benchmark Comparisons	Chi Chiu So et.al.	2506.23128	null
2025-06-29	Decoding Memes: Benchmarking Narrative Role Classification across Multilingual and Multimodal Models	Shivam Sharma et.al.	2506.23122	null
2025-06-28	MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning	Yulun Jiang et.al.	2506.22992	null
2025-06-26	APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization	Minjie Hong et.al.	2506.21655	null
2025-06-24	FrankenBot: Brain-Morphic Modular Orchestration for Robotic Manipulation with Vision-Language Models	Shiyi Wang et.al.	2506.21627	null
2025-06-30	FinEval-KR: A Financial Domain Evaluation Framework for Large Language Models’ Knowledge and Reasoning	Shaoyu Dou et.al.	2506.21591	null
2025-06-11	Debunk and Infer: Multimodal Fake News Detection via Diffusion-Generated Evidence and LLM Reasoning	Kaiying Yan et.al.	2506.21557	null
2025-06-26	HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context	Qize Yang et.al.	2506.21277	null
2025-06-26	Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?	Haoang Chi et.al.	2506.21215	null
2025-06-25	MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering	Chinmay Gondhalekar et.al.	2506.20821	null
2025-06-25	Generative AI for Vulnerability Detection in 6G Wireless Networks: Advances, Case Study, and Future Directions	Shuo Yang et.al.	2506.20488	null
2025-06-24	KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs	Kelin Fu et.al.	2506.19527	null
2025-06-24	MSR-Align: Policy-Grounded Multimodal Alignment for Safety-Aware Reasoning in Vision-Language Models	Yinan Xia et.al.	2506.19257	null
2025-06-25	Thought Anchors: Which LLM Reasoning Steps Matter?	Paul C. Bogdan et.al.	2506.19143	null
2025-06-23	Finding Clustering Algorithms in the Transformer Architecture	Kenneth L. Clarkson et.al.	2506.19125	null
2025-06-23	Human-Aligned Faithfulness in Toxicity Explanations of LLMs	Ramaravind K. Mothilal et.al.	2506.19113	null
2025-06-23	Baba is LLM: Reasoning in a Game with Dynamic Rules	Fien van Wetten et.al.	2506.19095	null
2025-06-23	OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization	Yiyou Sun et.al.	2506.18880	null
2025-06-24	ReDit: Reward Dithering for Improved LLM Policy Optimization	Chenxing Wei et.al.	2506.18631	null
2025-06-22	Adapting Vision-Language Models for Evaluating World Models	Mariya Hendriksen et.al.	2506.17967	null
2025-06-20	Aha Moment Revisited: Are VLMs Truly Capable of Self Verification in Inference-time Scaling?	Mingyuan Wu et.al.	2506.17417	null
2025-06-14	CORONA: A Coarse-to-Fine Framework for Graph-based Recommendation with Large Language Models	Junze Chen et.al.	2506.17281	null
2025-06-25	No Free Lunch: Rethinking Internal Feedback for LLM Reasoning	Yanzhi Zhang et.al.	2506.17219	null
2025-06-20	Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens	Zeyuan Yang et.al.	2506.17218	link
2025-06-20	MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation	Shoubin Yu et.al.	2506.17113	link
2025-06-20	MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models	Xiaolong Wang et.al.	2506.17046	null
2025-06-20	LaVi: Efficient Large Vision-Language Models via Internal Feature Modulation	Tongtian Yue et.al.	2506.16691	null
2025-06-19	GeoGuess: Multimodal Reasoning based on Hierarchy of Visual Information in Street View	Fenghua Cheng et.al.	2506.16633	null
2025-06-19	History-Augmented Vision-Language Models for Frontier-Based Zero-Shot Object Navigation	Mobin Habibpour et.al.	2506.16623	null
2025-06-19	How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?	Giuseppe Lando et.al.	2506.16450	null
2025-06-19	TrajSceneLLM: A Multimodal Perspective on Semantic GPS Trajectory Analysis	Chunhou Ji et.al.	2506.16401	link
2025-07-17	SHREC: A Framework for Advancing Next-Generation Computational Phenotyping with Large Language Models	Sarah Pungitore et.al.	2506.16359	null
2025-06-19	GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning	Yi Chen et.al.	2506.16141	link
2025-06-23	SLR: An Automated Synthesis Framework for Scalable Logical Reasoning	Lukas Helff et.al.	2506.15787	null
2025-06-18	CC-LEARN: Cohort-based Consistency Learning	Xiao Ye et.al.	2506.15662	null
2025-06-18	MEGC2025: Micro-Expression Grand Challenge on Spot Then Recognize and Visual Question Answering	Xinqi Fan et.al.	2506.15298	null
2025-06-17	Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective	Zhoujun Cheng et.al.	2506.14965	link
2025-06-17	Structured Moral Reasoning in Language Models: A Value-Grounded Evaluation Framework	Mohna Chakraborty et.al.	2506.14948	null
2025-06-17	PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning	Yizhen Zhang et.al.	2506.14907	link
2025-06-12	FedNano: Toward Lightweight Federated Tuning for Pretrained Multimodal Large Language Models	Yao Zhang et.al.	2506.14824	null
2025-06-17	RadFabric: Agentic AI System with Reasoning Capability for Radiology	Wenting Chen et.al.	2506.14142	null
2025-06-17	A Hierarchical Test Platform for Vision Language Model (VLM)-Integrated Real-World Autonomous Driving	Yupeng Zhou et.al.	2506.14100	null
2025-06-16	How Does LLM Reasoning Work for Code? A Survey and a Call to Action	Ira Ceka et.al.	2506.13932	null
2025-06-16	VL-GenRM: Enhancing Vision-Language Verification via Vision Experts and Iterative Training	Jipeng Zhang et.al.	2506.13888	null
2025-06-16	LocationReasoner: Evaluating LLMs on Real-World Site Selection Reasoning	Miho Koda et.al.	2506.13841	link
2025-06-16	Steering LLM Thinking with Budget Guidance	Junyan Li et.al.	2506.13752	link
2025-06-16	Decompositional Reasoning for Graph Retrieval with Large Language Models	Valentin Six et.al.	2506.13380	null
2025-07-10	Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models	James Chua et.al.	2506.13206	null
2025-06-16	FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design	Kai Lan et.al.	2506.13066	null
2025-06-26	Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning	Haibo Qiu et.al.	2506.13056	null
2025-06-20	Domain Specific Benchmarks for Evaluating Multimodal Large Language Models	Khizar Anjum et.al.	2506.12958	null
2025-06-15	SciDA: Scientific Dynamic Assessor of LLMs	Junting Zhou et.al.	2506.12909	null
2025-06-14	Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs	Jiwei Fang et.al.	2506.12509	null
2025-06-14	Advances in LLMs with Focus on Reasoning, Adaptability, Efficiency and Ethics	Asifullah Khan et.al.	2506.12365	null
2025-06-22	MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval	Mingjun Xu et.al.	2506.12364	null
2025-06-13	Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making	Xiaopeng Yuan et.al.	2506.12012	null
2025-06-22	How Visual Representations Map to Language Feature Space in Multimodal LLMs	Constantin Venhoff et.al.	2506.11976	null
2025-06-13	LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?	Zihan Zheng et.al.	2506.11928	null
2025-06-13	EasyARC: Evaluating Vision Language Models on True Visual Reasoning	Mert Unsal et.al.	2506.11595	null
2025-06-13	VFaith: Do Large Multimodal Models Really Reason on Seen Images Rather than Previous Memories?	Jiachen Yu et.al.	2506.11571	null
2025-07-04	LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment	Shipeng Li et.al.	2506.11480	null
2025-06-09	KokushiMD-10: Benchmark for Evaluating Large Language Models on Ten Japanese National Healthcare Licensing Examinations	Junyu Liu et.al.	2506.11114	null
2025-06-13	MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Yuxuan Luo et.al.	2506.10963	null
2025-06-12	Improving Named Entity Transcription with Contextual LLM-based Revision	Viet Anh Trinh et.al.	2506.10779	null
2025-06-12	NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors	Numaan Naeem et.al.	2506.10627	link
2025-06-25	Scientists’ First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning	Yuhao Zhou et.al.	2506.10521	null
2025-06-12	Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs	Yilin Xiao et.al.	2506.10508	null
2025-06-16	Specification and Evaluation of Multi-Agent LLM Systems – Prototype and Cybersecurity Applications	Felix Härer et.al.	2506.10467	link
2025-06-12	Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty	Zehui Ling et.al.	2506.10446	null
2025-06-12	Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts	Zaijing Li et.al.	2506.10357	null
2025-06-12	Code Execution as Grounded Supervision for LLM Reasoning	Dongwon Jung et.al.	2506.10343	link
2025-06-11	ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering	Caijun Jia et.al.	2506.10116	null
2025-06-19	Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing	Junfei Wu et.al.	2506.09965	link
2025-06-11	Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning	Xiangning Yu et.al.	2506.09853	null
2025-06-11	AD^2-Bench: A Hierarchical CoT Benchmark for MLLM in Autonomous Driving under Adverse Conditions	Zhaoyang Wei et.al.	2506.09557	null
2025-06-11	Athena: Enhancing Multimodal Reasoning with Data-efficient Process Reward Models	Shuai Wang et.al.	2506.09532	null
2025-06-13	e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs	Amrith Setlur et.al.	2506.09026	null
2025-06-10	Learning to Reason Across Parallel Samples for LLM Reasoning	Jianing Qi et.al.	2506.09014	null
2025-06-10	SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning	Xiao Liang et.al.	2506.08989	link
2025-06-10	Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning	Kongcheng Zhang et.al.	2506.08745	link
2025-06-10	Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness	Yanwei Gong et.al.	2506.08532	null
2025-06-10	Reinforce LLM Reasoning through Multi-Agent Reflection	Yurun Yuan et.al.	2506.08379	null
2025-06-18	Wait, We Don’t Need to “Wait”! Removing Thinking Tokens Improves Reasoning Efficiency	Chenlong Wang et.al.	2506.08343	null
2025-06-09	From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium	Xie Yi et.al.	2506.08292	link
2025-06-09	Automatic Generation of Inference Making Questions for Reading Comprehension Assessments	Wanjing Anya Ma et.al.	2506.08260	link
2025-06-12	Play to Generalize: Learning to Reason Through Game Play	Yunfei Xie et.al.	2506.08011	link
2025-06-11	Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations	Yizhen Li et.al.	2506.07943	null
2025-06-09	WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning	Jie Yang et.al.	2506.07905	link
2025-06-10	Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation	Jiaxiang Chen et.al.	2506.07820	null
2025-06-11	AbstRaL: Augmenting LLMs’ Reasoning by Reinforcing Abstract Thinking	Silin Gao et.al.	2506.07751	null
2025-06-10	Synthesis by Design: Controlled Data Generation via Structural Guidance	Lei Xu et.al.	2506.07664	null
2025-06-11	SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems	Peiran Li et.al.	2506.07564	null
2025-06-09	SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition	Mengsong Wu et.al.	2506.07557	null
2025-06-09	Learning What Reinforcement Learning Can’t: Interleaved Online Fine-Tuning for Hardest Questions	Lu Ma et.al.	2506.07527	link
2025-06-11	MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models	Philip R. Liu et.al.	2506.07400	link
2025-06-09	Improving LLM Reasoning through Interpretable Role-Playing Steering	Anyi Wang et.al.	2506.07335	null
2025-06-08	Overclocking LLM Reasoning: Monitoring and Controlling Thinking Path Lengths in LLMs	Roy Eisenstadt et.al.	2506.07240	null
2025-06-08	Advancing Multimodal Reasoning Capabilities of Multimodal Large Language Models via Visual Perception Reward	Tong Xiao et.al.	2506.07218	null
2025-06-08	Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs	Wenrui Zhou et.al.	2506.07180	null
2025-06-08	Learning Compact Vision Tokens for Efficient Large Multimodal Models	Hao Tang et.al.	2506.07138	link
2025-06-08	Theorem-of-Thought: A Multi-Agent Framework for Abductive, Deductive, and Inductive Reasoning in Language Models	Samir Abdaljalil et.al.	2506.07106	null
2025-06-12	Chain-of-Code Collapse: Reasoning Failures in LLMs via Adversarial Prompting in Code Generation	Jaechul Roh et.al.	2506.06971	link
2025-06-07	Boosting LLM Reasoning via Spontaneous Self-Correction	Xutong Zhao et.al.	2506.06923	null
2025-06-07	Harnessing Vision-Language Models for Time Series Anomaly Detection	Zelin He et.al.	2506.06836	null
2025-06-07	VisioMath: Benchmarking Figure-based Mathematical Reasoning in LMMs	Can Li et.al.	2506.06727	null
2025-06-07	Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning	Shubham Parashar et.al.	2506.06632	null
2025-06-14	RARL: Improving Medical VLM Reasoning and Generalization with Reinforcement Learning and LoRA under Data and Hardware Constraints	Tan-Hanh Pham et.al.	2506.06600	null
2025-06-06	SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation	Yanwei Ren et.al.	2506.06470	null
2025-06-06	Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance	Ruizhong Qiu et.al.	2506.06444	link
2025-06-06	PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts	Hengzhi Li et.al.	2506.06211	null
2025-06-06	Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router	Chenyang Shao et.al.	2506.05901	null
2025-06-06	BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions	Saptarshi Sengupta et.al.	2506.05766	null
2025-06-05	MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning	Zikui Cai et.al.	2506.05523	null
2025-06-05	DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Tanmay Parekh et.al.	2506.05128	null
2025-06-09	Reason-to-Recommend: Using Interaction-of-Thought Reasoning to Enhance LLM Recommendation	Keyu Zhao et.al.	2506.05069	null
2025-06-12	Context Is Not Comprehension	Alex Pan et.al.	2506.04907	null
2025-06-05	ICPC-Eval: Probing the Frontiers of LLM Reasoning with Competitive Programming Contests	Shiyi Xu et.al.	2506.04894	link
2025-06-10	Evaluation is All You Need: Strategic Overclaiming of LLM Reasoning Capabilities Through Evaluation Design	Lin Sun et.al.	2506.04734	null
2025-06-05	Look Before You Leap: A GUI-Critic-R1 Model for Pre-Operative Error Diagnosis in GUI Automation	Yuyang Wanyan et.al.	2506.04614	null
2025-06-05	MuSciClaims: Multimodal Scientific Claim Verification	Yash Kumar Lal et.al.	2506.04585	null
2025-06-04	Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences	Hadi Hosseini et.al.	2506.04478	null
2025-06-04	RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought	Yi Lu et.al.	2506.04277	null
2025-06-04	Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning	Shuang Chen et.al.	2506.04207	null
2025-06-04	R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning	Qingfei Zhao et.al.	2506.04185	link
2025-06-04	MMR-V: What’s Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos	Kejian Zhu et.al.	2506.04141	null
2025-06-04	Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning	Junqi Gao et.al.	2506.03939	link
2025-06-04	Reason from Future: Reverse Thought Chain Enhances LLM Reasoning	Yinlong Xu et.al.	2506.03673	null
2025-06-16	Zero-Shot Temporal Interaction Localization for Egocentric Videos	Erhang Zhang et.al.	2506.03662	link
2025-06-04	MiMo-VL Technical Report	Xiaomi LLM-Core Team et.al.	2506.03569	link
2025-06-04	Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback	Xiaoying Zhang et.al.	2506.03106	null
2025-06-04	Demystifying Reasoning Dynamics with Mutual Information: Thinking Tokens are Information Peaks in LLM Reasoning	Chen Qian et.al.	2506.02867	link
2025-06-14	TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression	Zhong-Zhi Li et.al.	2506.02678	link
2025-06-03	A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning	Xuejiao Zhao et.al.	2506.02470	link
2025-06-02	Act Only When It Pays: Efficient Reinforcement Learning for LLM Reasoning via Selective Rollouts	Haizhong Zheng et.al.	2506.02177	null
2025-06-02	Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains	Juncheng Wu et.al.	2506.02126	null
2025-06-02	Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning	Shenzhi Wang et.al.	2506.01939	null
2025-06-02	Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books	Chen Zhang et.al.	2506.01796	null
2025-06-02	R2SM: Referring and Reasoning for Selective Masks	Yu-Lin Shih et.al.	2506.01795	null
2025-06-02	SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning	Zhongwei Wan et.al.	2506.01713	null
2025-06-02	K12Vista: Exploring the Boundaries of MLLMs in K-12 Education	Chong Li et.al.	2506.01676	null
2025-06-02	EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation	Bingqian Lin et.al.	2506.01551	null
2025-06-02	Compiler Optimization via LLM Reasoning for Efficient Model Serving	Sujun Tang et.al.	2506.01374	null
2025-06-02	The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning	Xinyu Zhu et.al.	2506.01347	link
2025-06-01	GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking	Yufei Zhan et.al.	2506.01078	link
2025-06-01	Enhancing LLM Reasoning for Time Series Classification by Tailored Thinking and Fused Decision	Jiahui Zhou et.al.	2506.00807	null
2025-05-31	Beyond Context to Cognitive Appraisal: Emotion Reasoning as a Theory of Mind Benchmark for Large Language Models	Gerard Christopher Yeo et.al.	2506.00334	null
2025-05-30	Tournament of Prompts: Evolving LLM Instructions Through Structured Debates and Elo Ratings	Anirudh Nair et.al.	2506.00178	null
2025-05-30	Werewolf: A Straightforward Game Framework with TTS for Improved User Engagement	Qihui Fan et.al.	2506.00160	null
2025-05-28	Rethinking Hybrid Retrieval: When Small Embeddings and LLM Re-ranking Beat Bigger Models	Arjun Rao et.al.	2506.00049	null
2025-05-30	Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents	Yaxin Luo et.al.	2505.24878	link
2025-05-30	Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks	Tajamul Ashraf et.al.	2505.24876	link
2025-05-30	Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning	Shuyao Xu et.al.	2505.24850	link
2025-05-30	Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success	Ben Griffin et.al.	2505.24622	null
2025-06-10	Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting	Jiahao Wang et.al.	2505.24511	link
2025-05-30	Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation	Ximing Xing et.al.	2505.24499	null
2025-05-30	How Much Backtracking is Enough? Exploring the Interplay of SFT and RL in Enhancing LLM Reasoning	Hongyi James Cai et.al.	2505.24273	null
2025-06-02	MIRAGE: Assessing Hallucination in Multimodal Reasoning Chains of MLLM	Bowen Dong et.al.	2505.24238	null
2025-05-30	Semi-structured LLM Reasoners Can Be Rigorously Audited	Jixuan Leng et.al.	2505.24217	null
2025-05-30	HardTests: Synthesizing High-Quality Test Cases for LLM Coding	Zhongmou He et.al.	2505.24098	null
2025-05-29	Preemptive Hallucination Reduction: An Input-Level Approach for Multimodal Language Model	Nokimul Hasan Arif et.al.	2505.24007	null
2025-05-29	VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL	Yichen Feng et.al.	2505.23977	null
2025-05-29	Infi-Med: Low-Resource Medical MLLMs with Robust Reasoning Evaluation	Zeyu Liu et.al.	2505.23867	null
2025-05-29	Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought	Yunze Man et.al.	2505.23766	null
2025-06-03	DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning	Ziyin Zhang et.al.	2505.23754	link
2025-05-29	Don’t Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models	Jinzhe Li et.al.	2505.23715	link
2025-05-29	Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation	Ziling Cheng et.al.	2505.23701	null
2025-05-29	Probability-Consistent Preference Optimization for Enhanced LLM Reasoning	Yunqiao Yang et.al.	2505.23540	link
2025-05-29	Diversity-Aware Policy Optimization for Large Language Model Reasoning	Jian Yao et.al.	2505.23433	null
2025-05-29	GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning	Jusheng Zhang et.al.	2505.23399	null
2025-06-05	MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration	Zhitao He et.al.	2505.23224	link
2025-05-29	Elicit and Enhance: Advancing Multimodal Reasoning in Medical Scenarios	Linjie Mu et.al.	2505.23118	null
2025-06-06	Infi-MMR: Curriculum-based Unlocking Multimodal Reasoning via Phased Reinforcement Learning in Multimodal Small Language Models	Zeyu Liu et.al.	2505.23091	null
2025-05-29	Case-Based Reasoning Enhances the Predictive Power of LLMs in Drug-Drug Interaction	Guangyi Liu et.al.	2505.23034	null
2025-05-29	StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs	Haohan Yuan et.al.	2505.22950	null
2025-05-28	VidText: Towards Comprehensive Evaluation for Video Text Understanding	Zhoufaran Yang et.al.	2505.22810	link
2025-05-28	Decomposing Elements of Problem Solving: What “Math” Does RL Teach?	Tian Qin et.al.	2505.22756	link
2025-05-28	AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models	Feng Luo et.al.	2505.22662	null
2025-05-28	SAM-R1: Leveraging SAM for Reward Feedback in Multimodal Segmentation via Reinforcement Learning	Jiaqi Huang et.al.	2505.22596	null
2025-05-28	ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM	Hoang Pham et.al.	2505.22552	null
2025-05-28	Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO	Lai Wei et.al.	2505.22453	link
2025-05-29	Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition	Hanting Chen et.al.	2505.22375	null
2025-05-28	Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start	Lai Wei et.al.	2505.22334	link
2025-05-28	If Pigs Could Fly… Can LLMs Logically Reason Through Counterfactuals?	Ishwar B Balappanawar et.al.	2505.22318	null
2025-05-28	Rethinking the Unsolvable: When In-Context Search Meets Test-Time Scaling	Fanzeng Xia et.al.	2505.22290	null
2025-05-28	What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning	Gangwei Jiang et.al.	2505.22148	null
2025-05-28	OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning	Shifang Zhao et.al.	2505.22039	null
2025-05-27	Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation	Tharindu Kumarage et.al.	2505.21784	null
2025-05-27	Don’t Think Longer, Think Wisely: Optimizing Thinking Dynamics for Large Reasoning Models	Sohyun An et.al.	2505.21765	null
2025-05-27	R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing	Tianyu Fu et.al.	2505.21600	link
2025-05-31	More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models	Chengzhi Liu et.al.	2505.21523	null
2025-05-27	Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?	Junhao Cheng et.al.	2505.21374	link
2025-05-27	MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs	Jiakang Yuan et.al.	2505.21327	null
2025-05-27	Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning	Mingyang Song et.al.	2505.21178	null
2025-05-27	DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response	Junjue Wang et.al.	2505.21089	null
2025-06-04	LLMs Think, But Not In Your Flow: Reasoning-Level Personalization for Black-Box Large Language Models	Jieyong Kim et.al.	2505.21082	null
2025-05-27	Def-DTS: Deductive Reasoning for Open-domain Dialogue Topic Segmentation	Seungmin Lee et.al.	2505.21033	null
2025-05-27	Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA	Xiangqing Shen et.al.	2505.20971	null
2025-05-28	VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models	Kui Wu et.al.	2505.20718	null
2025-05-27	Accelerating RL for LLM Reasoning with Optimal Advantage Regression	Kianté Brantley et.al.	2505.20686	null
2025-05-27	Can Past Experience Accelerate LLM Reasoning?	Bo Pan et.al.	2505.20643	null
2025-05-26	Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning	Shenao Zhang et.al.	2505.20561	null
2025-05-26	Enhancing Logical Reasoning in Language Models via Symbolically-Guided Monte Carlo Process Supervision	Xingwei Tan et.al.	2505.20415	null
2025-05-23	Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence	Amirhosein Ghasemabadi et.al.	2505.20325	null
2025-05-26	KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing	Rui Li et.al.	2505.20245	link
2025-06-04	DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning	Qi Cao et.al.	2505.20241	null
2025-05-26	THiNK: Can Large Language Models Think-aloud?	Yongan Yu et.al.	2505.20184	link
2025-05-26	Visual Abstract Thinking Empowers Multimodal Reasoning	Dairu Liu et.al.	2505.20164	link
2025-05-26	Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning	Jaehun Jung et.al.	2505.20161	null
2025-05-26	Agentic 3D Scene Generation with Spatially Contextualized VLMs	Xinhang Liu et.al.	2505.20129	null
2025-05-26	REARANK: Reasoning Re-ranking Agent via Reinforcement Learning	Le Zhang et.al.	2505.20046	link
2025-05-26	An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning	Andrew Zamai et.al.	2505.19954	null
2025-05-26	Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19952	null
2025-05-26	Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions	Siqi Kou et.al.	2505.19949	null
2025-05-26	HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation	Feng Xiong et.al.	2505.19866	null
2025-05-26	Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective	Junnan Liu et.al.	2505.19815	link
2025-05-26	MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19707	null
2025-05-26	Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning	Minheng Ni et.al.	2505.19702	null
2025-05-26	Large Language Models’ Reasoning Stalls: An Investigation into the Capabilities of Frontier Models	Lachlan McGinness et.al.	2505.19676	null
2025-05-26	Interleaved Reasoning for Large Language Models via Reinforcement Learning	Roy Xie et.al.	2505.19640	null
2025-05-26	Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering	Jiajun Zhu et.al.	2505.19410	null
2025-05-25	SituatedThinker: Grounding LLM Reasoning with Real-World through Situated Thinking	Junnan Liu et.al.	2505.19300	link
2025-05-28	VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use	Mingyuan Wu et.al.	2505.19255	null
2025-05-25	ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning	Yeyuan Wang et.al.	2505.19100	null
2025-05-30	SeePhys: Does Seeing Help Thinking? – Benchmarking Vision-Based Physics Reasoning	Kun Xiang et.al.	2505.19099	link
2025-05-25	SATORI-R1: Incentivizing Multimodal Reasoning with Spatial Grounding and Verifiable Rewards	Chuming Shen et.al.	2505.19094	link
2025-05-25	ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning	Tuan Van Vo et.al.	2505.19080	null
2025-05-25	Can Large Language Models Infer Causal Relationships from Real-World Text?	Ryan Saklad et.al.	2505.18931	null
2025-05-24	Don’t Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation	Jiwan Chung et.al.	2505.18842	null
2025-05-24	Enhancing LLMs’ Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning	Jinzheng Li et.al.	2505.18831	null
2025-05-24	How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark	Minglai Yang et.al.	2505.18761	link
2025-05-24	GainRAG: Preference Alignment in Retrieval-Augmented Generation through Gain Signal Synthesis	Yi Jiang et.al.	2505.18710	link
2025-05-24	Steering LLM Reasoning Through Bias-Only Adaptation	Viacheslav Sinii et.al.	2505.18706	null
2025-05-31	ChartGalaxy: A Dataset for Infographic Chart Understanding and Generation	Zhen Li et.al.	2505.18668	link
2025-05-24	Unraveling Misinformation Propagation in LLM Reasoning	Yiyang Feng et.al.	2505.18555	link
2025-05-23	One Demo Is All It Takes: Planning Domain Derivation with LLMs from A Single Demonstration	Jinbang Huang et.al.	2505.18382	null
2025-05-23	Seeing Beyond Words: MatVQA for Challenging Visual-Scientific Reasoning in Materials Science	Sifan Wu et.al.	2505.18319	null
2025-05-23	Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL	Che Liu et.al.	2505.17952	null
2025-05-23	Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs’ Reasoning	Zezhong Wang et.al.	2505.17829	null
2025-05-23	Don’t Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning	Michael Hassid et.al.	2505.17813	null
2025-05-23	Towards General Continuous Memory for Vision-Language Models	Wenyi Wu et.al.	2505.17670	null
2025-05-23	EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications	Ancheng Xu et.al.	2505.17654	null
2025-05-29	Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective	Deyang Kong et.al.	2505.17652	null
2025-05-27	Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration	Jingtong Gao et.al.	2505.17621	null
2025-05-23	MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation	Jihan Yao et.al.	2505.17613	null
2025-05-23	On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning	Yifan Zhang et.al.	2505.17508	null
2025-05-23	From Reasoning to Generalization: Knowledge-Augmented LLMs for ARC Benchmark	Chao Lei et.al.	2505.17482	null
2025-05-23	Hydra: Structured Cross-Source Enhanced Large Language Model Reasoning	Xingyu Tan et.al.	2505.17464	null
2025-05-23	LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization	Qi Zhang et.al.	2505.17447	null
2025-05-23	Misaligning Reasoning with Answers – A Framework for Assessing LLM CoT Robustness	Enyi Jiang et.al.	2505.17406	null
2025-05-22	LiloDriver: A Lifelong Learning Framework for Closed-loop Motion Planning in Long-tail Autonomous Driving Scenarios	Huaiyuan Yao et.al.	2505.17209	link
2025-05-21	NeSyGeo: A Neuro-Symbolic Framework for Multimodal Geometric Reasoning Data Generation	Weiming Wu et.al.	2505.17121	null
2025-05-21	Systematic Evaluation of Machine-Generated Reasoning and PHQ-9 Labeling for Depression Detection Using Large Language Models	Zongru Shao et.al.	2505.17119	null
2025-05-21	Swarm Intelligence Enhanced Reasoning: A Density-Driven Framework for LLM-Based Multi-Agent Optimization	Ying Zhu et.al.	2505.17115	null
2025-05-21	CAMA: Enhancing Multimodal In-Context Learning with Context-Aware Modulated Attention	Yanshu Li et.al.	2505.17097	null
2025-05-22	ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark	Sara Ghaboura et.al.	2505.17021	link
2025-05-22	SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward	Kaixuan Fan et.al.	2505.17018	link
2025-05-22	$\text{R}^2\text{ec}$: Towards Large Recommender Models with Reasoning	Runyang You et.al.	2505.16994	link
2025-05-22	Don’t “Overthink” Passage Reranking: Is Reasoning Truly Necessary?	Nour Jedidi et.al.	2505.16886	null
2025-05-26	DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation	Bowen Zheng et.al.	2505.16810	null
2025-05-22	Two-way Evidence self-Alignment based Dual-Gated Reasoning Enhancement	Kexin Zhang et.al.	2505.16806	null
2025-05-22	Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning	Xinghao Chen et.al.	2505.16782	link
2025-05-22	Collaboration among Multiple Large Language Models for Medical Question Answering	Kexin Shang et.al.	2505.16648	null
2025-05-27	Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains	Wenhui Tan et.al.	2505.16552	null
2025-05-22	SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning	Huanyu Liu et.al.	2505.16368	link
2025-05-22	EquivPruner: Boosting Efficiency and Quality in LLM-Based Search via Action Pruning	Jiawei Liu et.al.	2505.16312	link
2025-05-22	Augmenting LLM Reasoning with Dynamic Notes Writing for Complex QA	Rishabh Maheshwary et.al.	2505.16293	null
2025-05-22	Training-Free Reasoning and Reflection in MLLMs	Hongchen Wei et.al.	2505.16151	null
2025-05-22	Distilling the Implicit Multi-Branch Structure in LLMs’ Reasoning via Reinforcement Learning	Shicheng Xu et.al.	2505.16142	null
2025-05-26	Abstractions-of-Thought: Intermediate Representations for LLM Reasoning in Hardware Design	Matthew DeLorenzo et.al.	2505.15873	null
2025-05-21	LENS: Multi-level Evaluation of Multimodal Reasoning with Large Language Models	Ruilin Yao et.al.	2505.15616	null
2025-05-21	Chain-of-Focus: Adaptive Visual Search and Zooming for Multimodal Reasoning via RL	Xintong Zhang et.al.	2505.15436	null
2025-05-21	Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning	Yurun Yuan et.al.	2505.15311	null
2025-05-21	Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs	Jie Ma et.al.	2505.15210	link
2025-05-21	Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning	Jinghui Lu et.al.	2505.15154	null
2025-05-21	The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning	Shivam Agarwal et.al.	2505.15134	link
2025-05-21	Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision	Eric Hanchen Jiang et.al.	2505.14999	null
2025-05-20	Self-Evolving Curriculum for LLM Reasoning	Xiaoyin Chen et.al.	2505.14970	null
2025-05-20	MORALISE: A Structured Benchmark for Moral Alignment in Visual Language Models	Xiao Lin et.al.	2505.14728	null
2025-05-18	KGAlign: Joint Semantic-Structural Knowledge Encoding for Multimodal Fake News Detection	Tuan-Vinh La et.al.	2505.14714	link
2025-05-23	Emerging Properties in Unified Multimodal Pretraining	Chaorui Deng et.al.	2505.14683	null
2025-05-27	General-Reasoner: Advancing LLM Reasoning Across All Domains	Xueguang Ma et.al.	2505.14652	null
2025-05-22	TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning	Zhangchen Xu et.al.	2505.14625	link
2025-05-20	SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas	Anjiang Wei et.al.	2505.14615	null
2025-05-21	KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation	Jiajun Shi et.al.	2505.14552	link
2025-05-23	Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning	Zhaohui Yang et.al.	2505.14403	null
2025-05-26	DeepEyes: Incentivizing “Thinking with Images” via Reinforcement Learning	Ziwei Zheng et.al.	2505.14362	link
2025-05-20	Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning	Minwu Kim et.al.	2505.14216	link
2025-05-20	RL of Thoughts: Navigating LLM Reasoning with Inference-time Reinforcement Learning	Qianyue Hao et.al.	2505.14140	null
2025-05-20	Code2Logic: Game-Code-Driven Data Synthesis for Enhancing VLMs General Reasoning	Jingqi Tong et.al.	2505.13886	link
2025-05-20	Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning	Jiwon Song et.al.	2505.13866	link
2025-05-18	RAGXplain: From Explainable Evaluation to Actionable Guidance of RAG Pipelines	Dvir Cohen et.al.	2505.13538	null
2025-05-16	IRLBench: A Multi-modal, Culturally Grounded, Parallel Irish-English Benchmark for Open-Ended LLM Reasoning Evaluation	Khanh-Tung Tran et.al.	2505.13498	link
2025-05-19	MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision	Lingxiao Du et.al.	2505.13427	link
2025-05-19	MR. Judge: Multimodal Reasoner as a Judge	Renjie Pi et.al.	2505.13403	null
2025-05-20	Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning	Adam Štorek et.al.	2505.13353	null
2025-05-19	Thinking Short and Right Over Thinking Long: Serving LLM Reasoning Efficiently and Accurately	Yuhang Wang et.al.	2505.13326	null
2025-05-19	Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space	Hengli Li et.al.	2505.13308	link
2025-05-19	RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning	Qiguang Chen et.al.	2505.13307	link
2025-05-19	Unlocking the Potential of Difficulty Prior in RL-based Multimodal Reasoning	Mingrui Chen et.al.	2505.13261	null
2025-05-23	SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information	Chih-Kai Yang et.al.	2505.13237	link
2025-05-21	Hearing from Silence: Reasoning Audio Descriptions from Silent Videos via Vision-Language Model	Yong Ren et.al.	2505.13062	null
2025-05-25	Fractured Chain-of-Thought Reasoning	Baohao Liao et.al.	2505.12992	null
2025-05-19	DGRO: Enhancing LLM Reasoning via Exploration-Exploitation Control and Reward Variance Management	Xuerui Su et.al.	2505.12951	null
2025-05-19	The Traitors: Deception and Trust in Multi-Agent Language Model Simulations	Pedro M. P. Curvo et.al.	2505.12923	link
2025-05-19	AdaToken-3D: Dynamic Spatial Gating for Efficient 3D Large Multimodal-Models Reasoning	Kai Zhang et.al.	2505.12782	null
2025-05-19	Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation	Weiliang Tang et.al.	2505.12744	null
2025-05-18	Reasoning-CV: Fine-tuning Powerful Reasoning LLMs for Knowledge-Assisted Claim Verification	Zhi Zheng et.al.	2505.12348	link
2025-05-18	LogicOCR: Do Your Large Multimodal Models Excel at Logical Reasoning on Text-Rich Images?	Maoyuan Ye et.al.	2505.12307	link
2025-05-18	MMS-VPR: Multimodal Street-Level Visual Place Recognition Dataset and Benchmark	Yiwei Ou et.al.	2505.12254	null
2025-05-17	Do Code LLMs Do Static Analysis?	Chia-Yi Su et.al.	2505.12118	link
2025-05-17	Solve-Detect-Verify: Inference-Time Scaling with Flexible Generative Verifier	Jianyuan Zhong et.al.	2505.11966	null
2025-05-22	PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging	Quoc-Huy Trinh et.al.	2505.11872	null
2025-05-17	Not All Thoughts are Generated Equal: Efficient LLM Reasoning via Multi-Turn Reinforcement Learning	Yansong Ning et.al.	2505.11827	link
2025-05-16	REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning	Pawin Taechoyotin et.al.	2505.11718	null
2025-05-16	Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner	Wenchuan Zhang et.al.	2505.11404	link
2025-05-23	SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning	Zheng Li et.al.	2505.11274	null
2025-05-24	Human-Aligned Bench: Fine-Grained Assessment of Reasoning Ability in MLLMs vs. Humans	Yansheng Qiu et.al.	2505.11141	null
2025-05-16	Scaling Reasoning can Improve Factuality in Large Language Models	Mike Zhang et.al.	2505.11140	link
2025-05-16	Humans expect rationality and cooperation from LLM opponents in strategic games	Darija Barak et.al.	2505.11011	null
2025-05-16	Vaiage: A Multi-Agent Solution to Personalized Travel Planning	Binwen Liu et.al.	2505.10922	null
2025-05-15	Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning	Yoichi Ishibashi et.al.	2505.10182	null
2025-05-15	XRAG: Cross-lingual Retrieval-Augmented Generation	Wei Liu et.al.	2505.10089	null
2025-05-13	The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News	Yuhan Liu et.al.	2505.08532	null
2025-05-13	Learning Like Humans: Advancing LLM Reasoning Capabilities via Adaptive Difficulty Curriculum Learning and Expert-Guided Self-Reformulation	Enci Zhang et.al.	2505.08364	null
2025-05-12	KAQG: A Knowledge-Graph-Enhanced RAG for Difficulty-Controlled Question Generation	Ching Han Chen et.al.	2505.07618	null
2025-05-12	How well do LLMs reason over tabular data, really?	Cornelius Wolff et.al.	2505.07453	null
2025-05-12	Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning	Xiaokun Wang et.al.	2505.07263	null
2025-05-12	Critique Before Thinking: Mitigating Hallucination through Rationale-Augmented Instruction Tuning	Zexian Yang et.al.	2505.07172	null
2025-05-11	Seed1.5-VL Technical Report	Dong Guo et.al.	2505.07062	null
2025-05-17	Bridging AI and Carbon Capture: A Dataset for LLMs in Ionic Liquids and CBE Research	Gaurab Sarkar et.al.	2505.06964	link
2025-05-11	UniDiffGrasp: A Unified Framework Integrating VLM Reasoning and VLM-Guided Part Diffusion for Open-Vocabulary Constrained Grasping with Dual Arms	Xueyang Guo et.al.	2505.06832	null
2025-05-11	Overview of the NLPCC 2025 Shared Task 4: Multi-modal, Multilingual, and Multi-hop Medical Instructional Video Question Answering Challenge	Bin Li et.al.	2505.06814	null
2025-05-10	STRIVE: Structured Representation Integrating VLM Reasoning for Efficient Object Navigation	Haokun Zhu et.al.	2505.06729	null
2025-05-17	Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning	Hang Gao et.al.	2505.06321	link
2025-05-07	Q-Heart: ECG Question Answering via Knowledge-Informed Multimodal LLMs	Hung Manh Pham et.al.	2505.06296	null
2025-05-09	From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling	Vahid Rahimzadeh et.al.	2505.06184	null
2025-05-12	APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning	Azim Ospanov et.al.	2505.05758	null
2025-05-09	Evolutionary thoughts: integration of large language models and evolutionary algorithms	Antonio Jimeno Yepes et.al.	2505.05756	link
2025-05-08	Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models	Yunxin Li et.al.	2505.04921	link
2025-05-07	Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers	Kusha Sareen et.al.	2505.04842	null
2025-05-06	Advancing Conversational Diagnostic AI with Multimodal Reasoning	Khaled Saab et.al.	2505.04653	null
2025-05-07	SToLa: Self-Adaptive Touch-Language Framework with Tactile Commonsense Reasoning in Open-Ended Scenarios	Ning Cheng et.al.	2505.04201	null
2025-05-20	On-Device LLM for Context-Aware Wi-Fi Roaming	Ju-Hyung Lee et.al.	2505.04174	link
2025-05-06	X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains	Qianchu Liu et.al.	2505.03981	null
2025-04-30	When Reasoning Beats Scale: A 1.5B Reasoning Model Outranks 13B LLMs as Discriminator	Md Fahim Anjum et.al.	2505.03786	link
2025-05-06	The Steganographic Potentials of Language Models	Artem Karpov et.al.	2505.03439	null
2025-05-12	Geospatial Mechanistic Interpretability of Large Language Models	Stef De Sabbata et.al.	2505.03368	link
2025-05-03	Accelerating Large Language Model Reasoning via Speculative Search	Zhihai Wang et.al.	2505.02865	null
2025-05-05	HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking	Runquan Gui et.al.	2505.02322	null
2025-05-04	DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving	Xinmeng Hou et.al.	2505.02123	link
2025-05-04	R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation	Meng-Hao Guo et.al.	2505.02018	null
2025-05-02	VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos	Zongxia Li et.al.	2505.01481	link
2025-05-01	Reasoning Capabilities and Invariability of Large Language Models	Alessandro Raganato et.al.	2505.00776	link
2025-04-30	Audo-Sight: Enabling Ambient Interaction For Blind And Visually Impaired Individuals	Bhanuja Ainary et.al.	2505.00153	null
2025-05-02	Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning	Shaun Baek et.al.	2505.00001	null
2025-05-21	Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models	Guanghao Zhou et.al.	2504.21277	null
2025-05-09	Token-Efficient RL for LLM Reasoning	Alan Lee et.al.	2504.20834	null
2025-04-29	Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression	Yu Cui et.al.	2504.20493	null
2025-04-30	VideoMultiAgents: A Multi-Agent Framework for Video Question Answering	Noriyuki Kugo et.al.	2504.20091	link
2025-04-28	From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review	Mohamed Amine Ferrag et.al.	2504.19678	null
2025-05-17	SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning	Jiaqi Chen et.al.	2504.19162	null
2025-04-27	CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges	Yu Li et.al.	2504.19093	null
2025-04-24	Training Large Language Models to Reason via EM Policy Gradient	Tianbing Xu et.al.	2504.18587	null
2025-05-08	MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind	Zheng Zhang et.al.	2504.18039	null
2025-05-13	DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training	Xiaoyu Tian et.al.	2504.17565	null
2025-04-25	Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning	Chris et.al.	2504.16656	link
2025-04-27	Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL	Simone Papicchio et.al.	2504.15077	null
2025-04-20	a1: Steep Test-time Scaling Law via Environment Augmented Generation	Lingrui Mei et.al.	2504.14597	null
2025-04-20	CoLoTa: A Dataset for Entity-based Commonsense Reasoning over Long-Tail Knowledge	Armin Toroghi et.al.	2504.14462	null
2025-04-19	Improving RL Exploration for LLM Reasoning through Retrospective Replay	Shihan Dou et.al.	2504.14363	null
2025-05-21	An Empirical Study of LLM Reasoning Ability Under Strict Output Length Constraint	Yi Sun et.al.	2504.14350	null
2025-04-22	SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM	Xiaojiang Zhang et.al.	2504.14286	null
2025-04-19	CODECRASH: Stress Testing LLM Reasoning under Structural and Semantic Perturbations	Man Ho Lam et.al.	2504.14119	null
2025-04-18	Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods	Junlin Wang et.al.	2504.14047	null
2025-03-26	3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark	Ivan Sviridov et.al.	2504.13861	link
2025-05-16	Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?	Yang Yue et.al.	2504.13837	null
2025-04-18	Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning	Jianing Wang et.al.	2504.13500	link
2025-04-17	Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks	Amey Hengle et.al.	2504.12845	null
2025-05-19	GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks	Hao Xu et.al.	2504.12764	link
2025-04-17	Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning	Baining Zhao et.al.	2504.12680	link
2025-04-17	VLMGuard-R1: Proactive Safety Alignment for VLMs via Reasoning-Driven Prompt Optimization	Menglan Chen et.al.	2504.12661	null
2025-04-24	GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning	Liangyu Xu et.al.	2504.12597	null
2025-04-13	HM-RAG: Hierarchical Multi-Agent Multimodal Retrieval Augmented Generation	Pei Liu et.al.	2504.12330	link
2025-04-16	d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning	Siyan Zhao et.al.	2504.12216	null
2025-04-16	Could Thinking Multilingually Empower LLM Reasoning?	Changjiang Gao et.al.	2504.11833	link
2025-04-15	A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce	Wei Xiong et.al.	2504.11343	link
2025-04-15	MMC: Iterative Refinement of VLM Reasoning via MCTS-based Multimodal Critique	Shuhang Liu et.al.	2504.11009	null
2025-05-14	CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives	Ayoung Lee et.al.	2504.10823	null
2025-04-14	Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning	Saif Punjwani et.al.	2504.10646	link
2025-04-30	VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge	Yueqi Song et.al.	2504.10342	null
2025-04-14	SlowFastVAD: Video Anomaly Detection via Integrating Simple Detector and RAG-Enhanced Vision-Language Model	Zongcan Ding et.al.	2504.10320	null
2025-04-14	PRM-BAS: Enhancing Multimodal Reasoning through PRM-guided Beam Annealing Search	Pengfei Hu et.al.	2504.10222	null
2025-04-15	Breaking the Data Barrier – Building GUI Agents Through Task Generalization	Junlei Zhang et.al.	2504.10127	link
2025-04-14	CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation	Jia Li et.al.	2504.10046	null
2025-04-13	Short-Path Prompting in LLMs: Analyzing Reasoning Instability and Solutions for Robust Performance	Zuoli Tang et.al.	2504.09586	null
2025-04-13	Draw with Thought: Unleashing Multimodal Reasoning for Scientific Diagram Generation	Zhiqing Cui et.al.	2504.09479	null
2025-04-12	NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding	Aniket Pal et.al.	2504.09249	null
2025-04-12	A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems	Zixuan Ke et.al.	2504.09037	null
2025-04-11	Mixed Signals: Decoding VLMs’ Reasoning and Underlying Bias in Vision-Language Conflict	Pouya Pezeshkpour et.al.	2504.08974	null
2025-05-08	VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning	Haozhe Wang et.al.	2504.08837	null
2025-04-06	AdaptRec: A Self-Adaptive Framework for Sequential Recommendations with Large Language Models	Tong Zhang et.al.	2504.08786	null
2025-04-01	Accelerating Causal Network Discovery of Alzheimer Disease Biomarkers via Scientific Literature-based Retrieval Augmented Generation	Xiaofan Zhou et.al.	2504.08768	null
2025-04-11	Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning	Fangzhi Xu et.al.	2504.08672	link
2025-04-11	VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering	Qi Zhi Lim et.al.	2504.08269	null
2025-04-15	Kimi-VL Technical Report	Kimi Team et.al.	2504.07491	link
2025-04-02	DeepSeek-R1 Thoughtology: Let’s <think> about LLM Reasoning	Sara Vera Marjanović et.al.	2504.07128	null
2025-04-09	KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs	Elan Markowitz et.al.	2504.07087	null
2025-04-09	DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning	Atharva Pandey et.al.	2504.07080	null
2025-04-09	To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning	Tian Qin et.al.	2504.07052	null
2025-04-09	SCI-Reason: A Dataset with Chain-of-Thought Rationales for Complex Multimodal Reasoning in Academic Areas	Chenghao Ma et.al.	2504.06637	null
2025-04-08	FEABench: Evaluating Language Models on Multiphysics Reasoning Ability	Nayantara Mudur et.al.	2504.06260	link
2025-04-23	Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization	Qingyang Zhang et.al.	2504.05812	link
2025-04-08	MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models	Pengfei Zhou et.al.	2504.05782	link
2025-04-08	Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought	Yi Peng et.al.	2504.05599	null
2025-04-06	ZeroED: Hybrid Zero-shot Error Detection through Large Language Model Reasoning	Wei Ni et.al.	2504.05345	null
2025-04-07	Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning	Sugyeong Eo et.al.	2504.05047	null
2025-04-07	LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts	Yimu Wang et.al.	2504.04653	null
2025-04-06	Hierarchical Planning for Complex Tasks with Knowledge Graph-RAG and Symbolic Verification	Cristina Cornelio et.al.	2504.04578	null
2025-04-06	Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models	Rui Gan et.al.	2504.04562	link
2025-04-06	Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning	Xuerui Su et.al.	2504.04524	link
2025-04-06	Geo-OLM: Enabling Sustainable Earth Observation Studies with Cost-Efficient Open Language Models & State-Driven Workflows	Dimitrios Stamoulis et.al.	2504.04319	null
2025-04-04	Language Models Are Implicitly Continuous	Samuele Marro et.al.	2504.03933	link
2025-04-04	Have Large Language Models Learned to Reason? A Characterization via 3-SAT Phase Transition	Rishi Hazra et.al.	2504.03930	null
2025-04-07	MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models	Wulin Xie et.al.	2504.03641	null
2025-04-04	Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)	Jing Bi et.al.	2504.03151	null
2025-04-04	LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph	Tu Ao et.al.	2504.03137	null
2025-04-25	Generative Evaluation of Complex Reasoning in Large Language Models	Haowei Lin et.al.	2504.02810	link
2025-04-10	Affordable AI Assistants with Knowledge Graph of Thoughts	Maciej Besta et.al.	2504.02670	null
2025-04-03	LexPam: Legal Procedure Awareness-Guided Mathematical Reasoning	Kepu Zhang et.al.	2504.02590	null
2025-04-03	AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology	Xiang Feng et.al.	2504.02404	link
2025-04-02	A Survey of Scaling in Large Language Model Reasoning	Zihan Chen et.al.	2504.02181	null
2025-04-02	Exploring LLM Reasoning Through Controlled Prompt Variations	Giannis Chatziveroglou et.al.	2504.02111	link
2025-04-02	Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning	Yinggan Xu et.al.	2504.01911	null
2025-04-02	TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables	Abhilash Shankarampeta et.al.	2504.01879	null
2025-04-02	Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models	Zhiwei Yu et.al.	2504.01857	null
2025-04-03	GTR: Graph-Table-RAG for Cross-Table Question Answering	Jiaru Zou et.al.	2504.01346	null
2025-04-01	When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning	Nishad Singhi et.al.	2504.01005	null
2025-04-01	How Difficulty-Aware Staged Reinforcement Learning Enhances LLMs’ Reasoning Capabilities: A Preliminary Experimental Study	Yunjie Ji et.al.	2504.00829	null
2025-04-02	FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning	Jie Ma et.al.	2504.00487	link
2025-04-01	Agentic Multimodal AI for Hyperpersonalized B2B and B2C Advertising in Competitive Markets: An AI-Driven Competitive Advertising Framework	Sakhinana Sagar Srinivas et.al.	2504.00338	null
2025-03-31	Do Large Language Models Exhibit Spontaneous Rational Deception?	Samuel M. Taylor et.al.	2504.00285	null
2025-03-31	SVLA: A Unified Speech-Vision-Language Assistant with Multimodal Reasoning and Speech Generation	Ngoc Dung Huynh et.al.	2503.24164	null
2025-03-31	Boosting MLLM Reasoning with Text-Debiased Hint-GRPO	Qihan Huang et.al.	2503.23905	null
2025-03-31	WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization	Ine Gevers et.al.	2503.23779	null
2025-03-30	Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models	Sid Bharthulwar et.al.	2503.23503	null
2025-03-29	The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction	Yihuai Hong et.al.	2503.23084	null
2025-04-03	Cognitive Prompts Using Guilford’s Structure of Intellect Model	Oliver Kramer et.al.	2503.22036	null
2025-03-27	SWI: Speaking with Intent in Large Language Models	Yuwei Yin et.al.	2503.21544	link
2025-03-27	Cultivating Game Sense for Yourself: Making VLMs Gaming Experts	Wenxuan Lu et.al.	2503.21263	null
2025-03-27	Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning	Huajie Tan et.al.	2503.20752	null
2025-03-26	Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging	Han Wu et.al.	2503.20641	link
2025-03-25	Gemini Robotics: Bringing AI into the Physical World	Gemini Robotics Team et.al.	2503.20020	null
2025-03-25	VisualQuest: A Diverse Image Dataset for Evaluating Visual Recognition in LLMs	Kelaiti Xiao et.al.	2503.19936	null
2025-04-06	A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design	Jie Tian et.al.	2503.19889	null
2025-03-25	Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking	Xiaoyu Tian et.al.	2503.19855	null
2025-03-24	Training-Free Personalization via Retrieval and Reasoning on Fingerprints	Deepayan Das et.al.	2503.18623	null
2025-03-23	Mind with Eyes: from Language Reasoning to Multimodal Reasoning	Zhiyu Lin et.al.	2503.18071	null
2025-04-19	Reason2Attack: Jailbreaking Text-to-Image Models via LLM Reasoning	Chenyu Zhang et.al.	2503.17987	null
2025-03-23	MedPlan: A Two-Stage RAG-Based System for Personalized Medical Plan Generation	Hsin-Ling Hsu et.al.	2503.17900	null
2025-03-22	A Modular Dataset to Demonstrate LLM Abstraction Capability	Adam Atanas et.al.	2503.17645	null
2025-03-22	ConSol: Sequential Probability Ratio Testing to Find Consistent LLM Reasoning Paths Efficiently	Jaeyeon Lee et.al.	2503.17587	link
2025-03-21	LEMMA: Learning from Errors for MatheMatical Advancement in LLMs	Zhuoshi Pan et.al.	2503.17439	link
2025-03-21	V-Seek: Accelerating LLM Reasoning on Open-hardware Server-class RISC-V Platforms	Javier J. Poveda Rodrigo et.al.	2503.17422	null
2025-03-21	Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique	Yansi Li et.al.	2503.17363	null
2025-03-21	OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement	Yihe Deng et.al.	2503.17352	link
2025-03-21	LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language	Kun Chu et.al.	2503.17309	link
2025-03-21	Does Chain-of-Thought Reasoning Help Mobile GUI Agent? An Empirical Study	Li Zhang et.al.	2503.16788	link
2025-03-20	Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models	Chengkai Huang et.al.	2503.16734	null
2025-03-21	MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering	Feiyang Li et.al.	2503.16131	null
2025-03-20	Entropy-based Exploration Conduction for Multi-step Reasoning	Jinghan Zhang et.al.	2503.15848	null
2025-03-19	LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning	Federico Cocchi et.al.	2503.15621	link
2025-03-19	EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models	Yinan Liang et.al.	2503.15369	null
2025-04-01	Envisioning an AI-Enhanced Mental Health Ecosystem	Kellie Yu Hui Sim et.al.	2503.14883	null
2025-03-19	Think Like Human Developers: Harnessing Community Knowledge for Structured Code Reasoning	Chengran Yang et.al.	2503.14838	null
2025-03-18	Temporal Consistency for LLM Reasoning Process Error Identification	Jiacheng Guo et.al.	2503.14495	link
2025-03-21	Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment	Wei Chen et.al.	2503.13879	null
2025-03-18	Empowering GraphRAG with Knowledge Filtering and Integration	Kai Guo et.al.	2503.13804	null
2025-03-15	Cognitive Activation and Chaotic Dynamics in Large Language Models: A Quasi-Lyapunov Analysis of Reasoning Mechanisms	Xiaojian Li et.al.	2503.13530	null
2025-03-14	RAG-KG-IL: A Multi-Agent Hybrid Framework for Reducing Hallucinations and Enhancing LLM Reasoning through RAG and Incremental Knowledge Graph Learning Integration	Hong Qing Yu et.al.	2503.13514	null
2025-03-17	A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives	Weiqiang Jin et.al.	2503.13415	null
2025-03-17	MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research	James Burgess et.al.	2503.13399	link
2025-03-17	Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning	Hai-Long Sun et.al.	2503.13360	null
2025-03-17	Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning	Junming Liu et.al.	2503.12972	null
2025-03-17	R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization	Jingyi Zhang et.al.	2503.12937	link
2025-03-28	Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation	Songjun Tu et.al.	2503.12854	link
2025-03-18	DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding	Xinyu Ma et.al.	2503.12797	link
2025-03-16	MPBench: A Comprehensive Multimodal Reasoning Benchmark for Process Errors Identification	Zhaopan Xu et.al.	2503.12505	null
2025-03-31	Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic Cognition	Xiaoying Zhang et.al.	2503.12303	link
2025-03-20	Applications of Large Language Model Reasoning in Feature Generation	Dharani Chandra et.al.	2503.11989	null
2025-03-14	Neutralizing Bias in LLM Reasoning using Entailment Graphs	Liang Cheng et.al.	2503.11614	link
2025-03-14	VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity	Jing Bi et.al.	2503.11557	null
2025-03-14	RESPONSE: Benchmarking the Ability of Language Models to Undertake Commonsense Reasoning in Crisis Situation	Aissatou Diallo et.al.	2503.11348	null
2025-03-13	Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data	Paul Quinlan et.al.	2503.10883	null
2025-03-18	R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization	Yi Yang et.al.	2503.10615	link
2025-03-15	VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search	Yiming Jia et.al.	2503.10582	null
2025-03-13	VisualPRM: An Effective Process Reward Model for Multimodal Reasoning	Weiyun Wang et.al.	2503.10291	null
2025-03-18	“Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding	Hyunbin Jin et.al.	2503.10167	null
2025-03-13	How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game	Ziyue Wang et.al.	2503.10042	link
2025-04-08	Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning	Bowen Jin et.al.	2503.09516	link
2025-03-12	MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions	Zhe Xu et.al.	2503.09499	link
2025-03-12	A Survey on Enhancing Causal Reasoning Ability of Large Language Models	Xin Li et.al.	2503.09326	null
2025-03-11	Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework	Zhuo Zhi et.al.	2503.08308	null
2025-03-11	FASIONAD++: Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback	Kangan Qian et.al.	2503.08162	null
2025-03-05	An Optimization Algorithm for Multimodal Data Alignment	Wei Zhang et.al.	2503.07636	null
2025-03-11	LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL	Yingzhe Peng et.al.	2503.07536	null
2025-03-10	MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning	Fanqing Meng et.al.	2503.07365	link
2025-03-10	Dynamic Path Navigation for Motion Agents with LLM Reasoning	Yubo Zhao et.al.	2503.07323	null
2025-03-11	Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models	Wenxuan Huang et.al.	2503.06749	link
2025-03-09	Graph Retrieval-Augmented LLM for Conversational Recommendation Systems	Zhangchi Qiu et.al.	2503.06430	null
2025-03-08	Can Atomic Step Decomposition Enhance the Self-structured Reasoning of Multimodal Large Models?	Kun Xiang et.al.	2503.06252	link
2025-03-15	Integrating Chain-of-Thought for Multimodal Alignment: A Study on 3D Vision-Language Learning	Yanjun Chen et.al.	2503.06232	null
2025-03-08	KnowLogic: A Benchmark for Commonsense Reasoning via Knowledge-Driven Data Synthesis	Weidong Zhan et.al.	2503.06218	link
2025-03-07	Extracting and Emulsifying Cultural Explanation to Improve Multilingual Capability of LLMs	Hamin Koo et.al.	2503.05846	null
2025-03-07	Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning	Mufan Xu et.al.	2503.05193	null
2025-03-07	Rewarding Curse: Analyze and Mitigate Reward Modeling Issues for LLM Reasoning	Jiachun Li et.al.	2503.05188	null
2025-03-07	Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching	Simon A. Aytes et.al.	2503.05179	link
2025-03-10	R1-Zero’s “Aha Moment” in Visual Reasoning on a 2B Non-SFT Model	Hengguang Zhou et.al.	2503.05132	link
2025-03-04	Learning from Failures in Multi-Attempt Reinforcement Learning	Stephen Chung et.al.	2503.04808	null
2025-03-15	Can LLMs Reason About Program Semantics? A Comprehensive Evaluation of LLMs on Formal Specification Inference	Thanh Le-Cong et.al.	2503.04779	null
2025-03-06	Better Process Supervision with Bi-directional Rewarding Signals	Wenxiang Chen et.al.	2503.04618	null
2025-04-02	SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning	Chen Li et.al.	2503.04530	null
2025-03-07	Question-Aware Gaussian Experts for Audio-Visual Question Answering	Hongyeob Kim et.al.	2503.04459	link
2025-03-06	Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English	Runtao Zhou et.al.	2503.04099	null
2025-03-06	ReasonGraph: Visualisation of Reasoning Paths	Zongqian Li et.al.	2503.03979	link
2025-03-05	Process-based Self-Rewarding Language Models	Shimao Zhang et.al.	2503.03746	link
2025-03-05	COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence	Wentao Li et.al.	2503.03215	null
2025-03-04	The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models	Ke Ji et.al.	2503.02875	null
2025-03-04	Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models	Zhifei Xie et.al.	2503.02318	null
2025-03-04	LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation	Yunbo Long et.al.	2503.02161	null
2025-03-03	CorrA: Leveraging Large Language Models for Dynamic Obstacle Avoidance of Autonomous Vehicles	Shanting Wang et.al.	2503.02076	null
2025-03-03	Graph-Augmented Reasoning: Evolving Step-by-Step Knowledge Graph Retrieval for LLM Reasoning	Wenjie Wu et.al.	2503.01642	null
2025-03-03	Pragmatic Inference Chain (PIC) Improving LLMs’ Reasoning of Authentic Implicit Toxic Language	Xi Chen et.al.	2503.01539	null
2025-03-03	CognitiveDrone: A VLA Model and Evaluation Benchmark for Real-Time Cognitive Task Solving and Reasoning in UAVs	Artem Lykov et.al.	2503.01378	null
2025-03-06	SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph	Teng Lin et.al.	2503.01346	null
2025-03-03	MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation	Yi Wang et.al.	2503.01298	null
2025-02-28	Personalized Causal Graph Reasoning for LLMs: A Case Study on Dietary Recommendations	Zhongqi Yang et.al.	2503.00134	null
2025-02-28	Contextualizing biological perturbation experiments through language	Menghua Wu et.al.	2502.21290	link
2025-02-28	Rectifying Belief Space via Unlearning to Harness LLMs’ Reasoning	Ayana Niwa et.al.	2502.20620	null
2025-02-27	FINEREASON: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving	Guizhen Chen et.al.	2502.20238	link
2025-02-27	Collaborative Stance Detection via Small-Large Language Model Consistency Verification	Yu Yan et.al.	2502.19954	link
2025-02-27	Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models	Yuan Sui et.al.	2502.19918	null
2025-02-27	Order Doesn’t Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation	Qianxi He et.al.	2502.19907	null
2025-03-21	Towards Multimodal Large-Language Models for Parent-Child Interaction: A Focus on Joint Attention	Weiyan Shi et.al.	2502.19877	null
2025-03-05	Weaker LLMs’ Opinions Also Matter: Mixture of Opinions Enhances LLM’s Mathematical Reasoning	Yanan Chen et.al.	2502.19622	null
2025-02-26	General Reasoning Requires Learning to Reason from the Get-go	Seungwook Han et.al.	2502.19402	null
2025-02-26	BIG-Bench Extra Hard	Mehran Kazemi et.al.	2502.19187	link
2025-02-25	Scalable Best-of-N Selection for Large Language Models via Self-Certainty	Zhewei Kang et.al.	2502.18581	link
2025-02-25	SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution	Yuxiang Wei et.al.	2502.18449	null
2025-02-25	Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning	Wenkai Yang et.al.	2502.18080	null
2025-02-21	Improving Value-based Process Verifier via Structural Prior Injection	Zetian Sun et.al.	2502.17498	null
2025-02-24	Making LLMs Reason? The Intermediate Language Problem in Neurosymbolic Approaches	Alexander Beiser et.al.	2502.17216	null
2025-02-24	Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI	Syed Abdul Gaffar Shakhadri et.al.	2502.17092	null
2025-02-24	Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology	Longchao Da et.al.	2502.17026	null
2025-02-24	All-in-one: Understanding and Generation in Multimodal Reasoning with the MAIA Benchmark	Davide Testa et.al.	2502.16989	null
2025-02-24	AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models	Qin Zhu et.al.	2502.16906	link
2025-02-24	The Blessing of Reasoning: LLM-Based Contrastive Explanations in Black-Box Recommender Systems	Yuyan Wang et.al.	2502.16759	null
2025-02-23	Reasoning about Affordances: Causal and Compositional Reasoning in LLMs	Magnus F. Gjerde et.al.	2502.16606	null
2025-02-22	ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning	Shulin Huang et.al.	2502.16268	null
2025-02-27	Dynamic Parallel Tree Search for Efficient LLM Reasoning	Yifu Ding et.al.	2502.16235	null
2025-02-22	Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations	Chunyang Li et.al.	2502.16169	link
2025-03-04	Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models	Qianqi Yan et.al.	2502.16033	null
2025-02-21	MutaGReP: Execution-Free Repository-Grounded Plan Search for Code-Use	Zaid Khan et.al.	2502.15872	null
2025-02-21	Do Multilingual LLMs Think In English?	Lisa Schut et.al.	2502.15603	null
2025-02-21	Evaluating Social Biases in LLM Reasoning	Xuyang Wu et.al.	2502.15361	null
2025-02-21	Stepwise Informativeness Search for Improving LLM Reasoning	Siyuan Wang et.al.	2502.15335	null
2025-02-21	Latent Factor Models Meets Instructions: Goal-conditioned Latent Factor Discovery without Task Supervision	Zhouhang Xie et.al.	2502.15147	null
2025-02-19	SIFT: Grounding LLM Reasoning in Contexts via Stickers	Zihao Zeng et.al.	2502.14922	link
2025-02-18	Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence	Bhavik Agarwal et.al.	2502.14905	null
2025-03-04	Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison	Aiswarya Baby et.al.	2502.14827	null
2025-02-20	Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning	Tian Xie et.al.	2502.14768	link
2025-02-19	Enhancing LLM-Based Recommendations Through Personalized Reasoning	Jiahao Liu et.al.	2502.13845	link
2025-02-19	MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering	Guanming Xiong et.al.	2502.13428	null
2025-02-19	MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification	Linzhuang Sun et.al.	2502.13383	link
2025-02-22	Grounding LLM Reasoning with Knowledge Graphs	Alfonso Amayuelas et.al.	2502.13247	null
2025-02-18	Theorem Prover as a Judge for Synthetic Data Generation	Joshua Ong Jun Leang et.al.	2502.13137	null
2025-02-18	Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options	Lakshmi Nair et.al.	2502.12929	link
2025-02-18	S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning	Ruotian Ma et.al.	2502.12853	link
2025-02-18	CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base	Cong-Duy Nguyen et.al.	2502.12591	null
2025-02-18	Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights	Shubham Parashar et.al.	2502.12521	null
2025-02-18	HopRAG: Multi-Hop Reasoning for Logic-Aware Retrieval-Augmented Generation	Hao Liu et.al.	2502.12442	null
2025-02-17	Evaluating Step-by-step Reasoning Traces: A Survey	Jinu Lee et.al.	2502.12289	null
2025-02-17	SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs	Yige Xu et.al.	2502.12134	link
2025-02-17	TokenSkip: Controllable Chain-of-Thought Compression in LLMs	Heming Xia et.al.	2502.12067	link
2025-02-17	Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models	Hyunwoo Kim et.al.	2502.11881	null
2025-02-17	Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities	Hanbin Wang et.al.	2502.11829	link
2025-02-17	Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning	Yuqi Pang et.al.	2502.11751	link
2025-02-17	DeFiScope: Detecting Various DeFi Price Manipulations with LLM Reasoning	Juantao Zhong et.al.	2502.11521	null
2025-02-16	Don’t Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls	Ante Wang et.al.	2502.11183	link
2025-02-16	LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning	Tianshi Zheng et.al.	2502.11176	null
2025-02-15	A Tutorial on LLM Reasoning: Relevant Methods behind ChatGPT o1	Jun Wang et.al.	2502.10867	null
2025-02-28	USER-VLM 360: Personalized Vision Language Models with User-aware Tuning for Social Human-Robot Interactions	Hamed Rahimi et.al.	2502.10636	null
2025-02-14	Do Large Language Models Reason Causally Like Us? Even Better?	Hanna M. Dettki et.al.	2502.10215	null
2025-02-14	MathConstruct: Challenging LLM Reasoning with Constructive Proofs	Mislav Balunović et.al.	2502.10197	null
2025-02-13	MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency	Dongzhi Jiang et.al.	2502.09621	null
2025-02-14	EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges	Clinton J. Wang et.al.	2502.08859	null
2025-02-11	CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs	Lejla Skelic et.al.	2502.07980	null
2025-02-05	Reasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language Models Through Logic Unit Alignment	Cheryl Li et.al.	2502.07803	null
2025-02-17	Bag of Tricks for Inference-time Computation of LLM Reasoning	Fan Liu et.al.	2502.07191	link
2025-02-15	Self-Supervised Prompt Optimization	Jinyu Xiang et.al.	2502.06855	link
2025-02-06	Vision-Integrated LLMs for Autonomous Driving Assistance: Human Performance Comparison and Trust Evaluation	Namhee Kim et.al.	2502.06843	null
2025-02-04	Policy Guided Tree Search for Enhanced LLM Reasoning	Yang Li et.al.	2502.06813	null
2025-03-11	ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates	Ling Yang et.al.	2502.06772	link
2025-02-10	Resurrecting saturated LLM benchmarks with adversarial encoding	Igor Ivanov et.al.	2502.06738	null
2025-02-13	LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM	Zhi Zhou et.al.	2502.06572	link
2025-02-09	A Generative Framework for Bidirectional Image-Report Understanding in Chest Radiography	Nicholas Evans et.al.	2502.05926	null
2025-02-08	Evaluating Vision-Language Models for Emotion Recognition	Sree Bhattacharyya et.al.	2502.05660	null
2025-02-07	GSM-Infinite: How Do Your LLMs Behave over Infinitely Increasing Context Length and Reasoning Complexity?	Yang Zhou et.al.	2502.05252	link
2025-02-07	Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures	Tushar Pandey et.al.	2502.05078	link
2025-02-07	Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research	Junde Wu et.al.	2502.04644	link
2025-02-05	Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications	Bo Wen et.al.	2502.04384	link
2025-02-05	Limitations of Large Language Models in Clinical Problem-Solving Arising from Inflexible Reasoning	Jonathan Kim et.al.	2502.04381	null
2025-02-04	Investigating the Robustness of Deductive Reasoning with Large Language Models	Fabian Hoppe et.al.	2502.04352	null
2025-02-04	Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search	Maohao Shen et.al.	2502.02508	null
2025-02-04	CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning	Jianfeng Pan et.al.	2502.02390	null
2025-02-08	Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking	Jinyang Wu et.al.	2502.02339	null
2025-02-04	Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration	Younan Zhu et.al.	2502.01969	null
2025-01-31	Improving Rule-based Reasoning in LLMs via Neurosymbolic Representations	Varun Dhanraj et.al.	2502.01657	null
2025-02-03	Position: Empowering Time Series Reasoning with Multimodal LLMs	Yaxuan Kong et.al.	2502.01477	null
2025-02-03	ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning	Bill Yuchen Lin et.al.	2502.01100	null
2025-02-16	Learning Autonomous Code Integration for Math Language Models	Haozhe Wang et.al.	2502.00691	null
2025-02-13	Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning	Zhi Zhou et.al.	2502.00511	null
2025-02-14	Reward-Guided Speculative Decoding for Efficient LLM Reasoning	Baohao Liao et.al.	2501.19324	null
2025-01-31	Efficient Reasoning with Hidden Thinking	Xuan Shen et.al.	2501.19201	link
2025-01-31	BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning	Han Zhong et.al.	2501.18858	null
2025-01-28	A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process	Jack David Carson et.al.	2501.16783	null
2025-01-27	Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations	Pablo Valenzuela-Toledo et.al.	2501.16495	null
2025-01-27	Large Models in Dialogue for Active Perception and Anomaly Detection	Tzoulio Chamiti et.al.	2501.16300	link
2025-01-26	TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs	Yuxuan Gu et.al.	2501.15674	link
2025-01-28	Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning	Zeyu Gan et.al.	2501.15602	link
2025-01-26	Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Yuhong Sun et.al.	2501.15581	null
2025-02-15	Option-ID Based Elimination For Multiple Choice Questions	Zhenhao Zhu et.al.	2501.15175	link
2025-01-24	Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains	Xu Chu et.al.	2501.14431	null
2025-02-12	GraphSOS: Graph Sampling and Order Selection to Help LLMs Understand Graphs Better	Xu Chu et.al.	2501.14427	null
2025-01-23	Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks	Chang Gong et.al.	2501.13731	null
2025-02-10	Cognitive Paradigms for Evaluating VLMs on Visual Reasoning Task	Mohit Vaishnav et.al.	2501.13620	null
2025-01-22	EvidenceMap: Unleashing the Power of Small Language Models with Evidence Analysis for Biomedical Question Answering	Chang Zong et.al.	2501.12746	null
2025-01-17	LLM Reasoner and Automated Planner: A new NPC approach	Israel Puerta-Merino et.al.	2501.10106	null
2025-01-22	FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs	Zengyi Gao et.al.	2501.09957	null
2025-01-17	Evolving Deeper LLM Thinking	Kuang-Huei Lee et.al.	2501.09891	null
2025-01-23	Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models	Fengli Xu et.al.	2501.09686	null
2025-01-15	Multimodal LLMs Can Reason about Aesthetics in Zero-Shot	Ruixiang Jiang et.al.	2501.09012	link
2025-02-10	Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data	Jiaxing Qiu et.al.	2501.08413	link
2025-01-14	Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning	Haoyu Han et.al.	2501.07845	null
2025-01-09	Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark	Yunzhuo Hao et.al.	2501.05444	link
2025-01-08	Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations	Archita Srivastava et.al.	2501.04675	null
2025-01-08	DRIVINGVQA: Analyzing Visual Chain-of-Thought Reasoning of Vision Language Models in Real-World Scenarios with Driving Theory Tests	Charles Corbière et.al.	2501.04671	null
2025-01-08	Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting	Dong-Hai Zhu et.al.	2501.04341	link
2025-01-07	Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation	Alireza Salemi et.al.	2501.04167	null
2025-01-07	Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild	Wanpeng Hu et.al.	2501.02964	link
2025-01-06	KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models	Zaiyi Zheng et.al.	2501.02711	null
2025-01-04	Table as Thought: Exploring Structured Thoughts in LLM Reasoning	Zhenjie Sun et.al.	2501.02152	null
2025-01-03	Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models	Kaleem Ullah Qasim et.al.	2501.02026	null
2025-01-02	Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search	Shuangtao Li et.al.	2501.01478	null
2025-01-02	HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation	Runsong Jia et.al.	2501.01203	null
2025-01-03	Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents	Chengbo He et.al.	2501.00430	null
2024-12-31	EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta	Raymond Bernard et.al.	2501.00257	null
2024-12-30	Efficiently Serving LLM Reasoning Programs with Certaindex	Yichao Fu et.al.	2412.20993	null
2024-12-28	LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning	Shuguang Chen et.al.	2412.20227	null
2025-02-17	Token-Budget-Aware LLM Reasoning	Tingxu Han et.al.	2412.18547	link
2024-12-23	StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs	Hailin Chen et.al.	2412.18011	null
2025-02-09	Evaluating LLM Reasoning in the Operations Research Domain with ORQA	Mahdi Mostajabdaveh et.al.	2412.17874	link
2024-12-23	Diving into Self-Evolving Training for Multimodal Reasoning	Wei Liu et.al.	2412.17451	null
2024-12-21	SilVar: Speech Driven Multimodal Model for Reasoning Visual Question Answering and Object Localization	Tan-Hanh Pham et.al.	2412.16771	null
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	link
2024-12-19	Eliciting Causal Abilities in Large Language Models for Reasoning Tasks	Yajing Wang et.al.	2412.15314	link
2024-12-19	Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Federico Castagna et.al.	2412.15177	link
2024-12-19	Progressive Multimodal Reasoning via Active Retrieval	Guanting Dong et.al.	2412.14835	null
2024-12-19	FiVL: A Framework for Improved Vision-Language Alignment	Estelle Aflalo et.al.	2412.14672	null
2024-12-19	FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis	Abdullah Khan et.al.	2412.14492	link
2024-12-18	Cognition Chain for Explainable Psychological Stress Detection on Social Media	Xin Wang et.al.	2412.14009	link
2024-12-27	Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence	Jinghan He et.al.	2412.13949	null
2025-02-16	Do Language Models Understand Time?	Xi Ding et.al.	2412.13845	link
2024-12-18	Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games	Wenye Lin et.al.	2412.13602	link
2024-12-17	ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models	Yuxi Sun et.al.	2412.12848	null
2024-12-12	A NotSo Simple Way to Beat Simple Bench	Soham Sane et.al.	2412.12173	null
2024-12-11	What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis	Jiayu Liu et.al.	2412.12157	null
2025-02-18	A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges	Yibo Yan et.al.	2412.11936	null
2024-12-24	Stepwise Reasoning Error Disruption Attack of LLMs	Jingyu Peng et.al.	2412.11934	null
2024-12-16	Leveraging Retrieval-Augmented Tags for Large Vision-Language Understanding in Complex Scenes	Antonio Carlos Rivera et.al.	2412.11396	null
2024-12-15	SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation	Hang Zhang et.al.	2412.11026	null
2024-12-15	Entropy-Regularized Process Reward Model	Hanning Zhang et.al.	2412.11006	link
2024-12-14	Optimizing Vision-Language Interactions Through Decoder-Only Models	Kaito Tanaka et.al.	2412.10758	null
2024-12-14	Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation	Sukai Huang et.al.	2412.10675	null
2024-12-14	Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data	Xue Wu et.al.	2412.10654	null
2024-12-13	EVLM: Self-Reflective Multimodal Reasoning for Cross-Dimensional Visual Editing	Umar Khalid et.al.	2412.10566	null
2024-12-13	Atomic Learning Objectives Labeling: A High-Resolution Approach for Physics Education	Naiming Liu et.al.	2412.09914	null
2025-01-18	Neptune: The Long Orbit to Benchmarking Long Video Understanding	Arsha Nagrani et.al.	2412.09582	link
2025-02-14	Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning	Zhenni Bi et.al.	2412.09078	link
2024-12-11	Training Large Language Models to Reason in a Continuous Latent Space	Shibo Hao et.al.	2412.06769	link
2025-01-23	GameArena: Evaluating LLM Reasoning through Live Computer Games	Lanxiang Hu et.al.	2412.06394	null
2024-12-08	Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt	Damien de Mijolla et.al.	2412.05967	null
2024-12-06	MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale	Jarvis Guo et.al.	2412.05237	null
2024-12-05	Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction	Yiheng Xu et.al.	2412.04454	null
2024-12-05	SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactions	Bufang Yang et.al.	2412.04036	null
2024-12-04	DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation	Qingdong He et.al.	2412.03255	null
2024-12-03	Explainable CTR Prediction via LLM Reasoning	Xiaohan Yu et.al.	2412.02588	null
2025-02-12	NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers	Angel Yahir Loredo Lopez et.al.	2412.01621	null
2025-01-13	Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM’s Reasoning Capability	Zicheng Lin et.al.	2411.19943	link
2024-11-29	TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension	Zipeng Qiu et.al.	2411.19504	link
2024-11-29	COLD: Causal reasOning in cLosed Daily activities	Abhinav Joshi et.al.	2411.19500	link
2024-12-16	Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning	Di Zhang et.al.	2411.18203	null
2024-11-26	NEMO: Can Multimodal LLMs Identify Attribute-Modified Objects?	Jiaxuan Li et.al.	2411.17794	null
2024-11-25	Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision	Zhiheng Xi et.al.	2411.16579	null
2024-11-22	On the Impact of Fine-Tuning on Chain-of-Thought Reasoning	Elita Lobo et.al.	2411.15382	null
2024-11-21	Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models	Yuhao Dong et.al.	2411.14432	link
2024-11-20	BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games	Davide Paglieri et.al.	2411.13543	null
2024-11-20	Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving	Hao Zhou et.al.	2411.13076	null
2024-11-15	Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination	Haojie Zheng et.al.	2411.12591	link
2024-12-23	Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus	Terufumi Morishita et.al.	2411.12498	link
2024-11-18	Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation	Mingchao Qi et.al.	2411.11714	link
2024-12-31	Enhancing LLM Reasoning with Reward-guided Tree Search	Jinhao Jiang et.al.	2411.11694	null
2024-12-15	A dataset of questions on decision-theoretic reasoning in Newcomb-like problems	Caspar Oesterheld et.al.	2411.10588	link
2024-11-15	Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization	Weiyun Wang et.al.	2411.10442	null
2025-01-09	LLaVA-CoT: Let Vision Language Models Reason Step-by-Step	Guowei Xu et.al.	2411.10440	link
2024-11-15	Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level	Andong Deng et.al.	2411.09921	null
2024-11-14	Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering	Nghia Trung Ngo et.al.	2411.09213	null
2024-11-13	Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding	Deyi Ji et.al.	2411.08516	null
2024-11-18	What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?	Katie Kang et.al.	2411.07681	link
2024-11-27	Self-Training Meets Consistency: Improving LLMs’ Reasoning With Consistency-Driven Rationale Evaluation	Jaehyeok Lee et.al.	2411.06387	link
2024-11-09	A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization	Haoxin Liu et.al.	2411.06018	null
2024-11-11	LLMs as Method Actors: A Model for Prompt Engineering and Architecture	Colin Doyle et.al.	2411.05778	link
2024-11-12	Kwai-STaR: Transform LLMs into State-Transition Reasoners	Xingyu Lu et.al.	2411.04799	null
2024-11-21	Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding	Haolin Chen et.al.	2411.04282	link
2024-11-05	CrowdGenUI: Enhancing LLM-Based UI Widget Generation with a Crowdsourced Preference Library	Yimeng Liu et.al.	2411.03477	null
2025-01-27	MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs	Manar Abdelatty et.al.	2411.03471	link
2024-11-04	RuAG: Learned-rule-augmented Generation for Large Language Models	Yudi Zhang et.al.	2411.03349	null
2024-10-30	Vision-Language Models Can Self-Improve Reasoning via Reflection	Kanzhi Cheng et.al.	2411.00855	null
2024-11-01	Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling	Yiwen Ding et.al.	2411.00750	link
2024-11-01	STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing	Jiaru Zou et.al.	2411.00387	null
2024-11-08	GRS-QA – Graph Reasoning-Structured Question Answering Dataset	Anish Pahilajani et.al.	2411.00369	null
2024-10-31	Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning	Jinghan Zhang et.al.	2410.24155	null
2024-10-31	RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner	Fu-Chieh Chang et.al.	2410.23912	null
2024-10-31	OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models	Junda Wu et.al.	2410.23703	null
2024-10-30	ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning	Millennium Bismay et.al.	2410.23180	link
2024-10-30	On Memorization of Large Language Models in Logical Reasoning	Chulin Xie et.al.	2410.23123	null
2024-10-28	Causal Interventions on Causal Paths: Mapping GPT-2’s Reasoning From Syntax to Semantics	Isabelle Lee et.al.	2410.21353	null
2024-10-28	Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments	Sangmim Song et.al.	2410.20666	null
2024-10-25	Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models	Danqing Wang et.al.	2410.20007	null
2024-10-25	Can Stories Help LLMs Reason? Curating Information Space Through Narrative	Vahid Sadiri Javadi et.al.	2410.19221	null
2024-10-18	Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning	Pengfei He et.al.	2410.19000	link
2024-10-25	CLR-Bench: Evaluating Large Language Models in College-level Reasoning	Junnan Dong et.al.	2410.17558	null
2024-10-28	Non-myopic Generation of Language Models for Reasoning and Planning	Chang Ma et.al.	2410.17195	link
2024-11-06	Improving Causal Reasoning in Large Language Models: A Survey	Longxuan Yu et.al.	2410.16676	link
2024-10-22	A Statistical Analysis of LLMs’ Self-Evaluation Using Proverbs	Ryosuke Sonoda et.al.	2410.16640	null
2024-10-21	Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models’ Reasoning with Formal Logic	Jason Chan et.al.	2410.16502	null
2024-11-27	On Designing Effective RL Reward at Training Time for LLM Reasoning	Jiaxuan Gao et.al.	2410.15115	null
2025-01-28	Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning	Xingyu Tan et.al.	2410.14211	null
2024-10-21	Unconstrained Model Merging for Enhanced LLM Reasoning	Yiming Zhang et.al.	2410.13699	null
2024-10-16	Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models	Linhao Luo et.al.	2410.13080	link
2024-10-16	KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs	Yongqin Xu et.al.	2410.12480	null
2024-10-17	Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning	Qian Wang et.al.	2410.12464	link
2024-10-16	Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up	Jiahao Yuan et.al.	2410.12323	link
2024-10-16	Exploiting LLMs’ Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval	Hai-Long Nguyen et.al.	2410.12154	null
2024-10-15	Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming	Yilun Hao et.al.	2410.12112	null
2024-10-12	OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models	Jun Wang et.al.	2410.09671	null
2024-10-11	P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains	Simeng Han et.al.	2410.09207	null
2024-10-11	Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning	Yunpeng Gao et.al.	2410.08500	null
2024-10-10	SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation	Hang Yin et.al.	2410.08189	null
2024-10-10	Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning	Amrith Setlur et.al.	2410.08146	null
2024-10-10	Automatic Curriculum Expert Iteration for Reliable LLM Reasoning	Zirui Zhao et.al.	2410.07627	link
2024-10-09	Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis	Ahmed Abdullah et.al.	2410.06841	null
2024-10-09	Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning	Xiyao Wang et.al.	2410.06508	null
2025-01-02	Filtering Discomforting Recommendations with Large Language Models	Jiahao Liu et.al.	2410.05411	null
2024-10-05	Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification	Zhenwen Liang et.al.	2410.05318	null
2024-10-06	Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval	Pengcheng Jiang et.al.	2410.04585	link
2024-10-03	The Role of Deductive and Inductive Reasoning in Large Language Models	Chengkun Cai et.al.	2410.02892	null
2024-10-02	Not All LLM Reasoners Are Created Equal	Arian Hosseini et.al.	2410.01748	null
2024-12-25	Interpretable Contrastive Monte Carlo Tree Search Reasoning	Zitian Gao et.al.	2410.01707	link
2024-10-02	VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment	Amirhossein Kazemnejad et.al.	2410.01679	link
2024-10-02	AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses	Xiaotian Lu et.al.	2410.01246	null
2024-10-01	Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness	Xiao Peng et.al.	2410.00359	null
2024-10-01	Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis	Chun-Hsiao Yeh et.al.	2410.00292	null
2024-10-08	GUNDAM: Aligning Large Language Models with Graph Understanding	Sheng Ouyang et.al.	2409.20053	null
2024-09-27	Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs	Yanyuan Qiao et.al.	2409.18794	null
2024-10-23	Proof of Thought: Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning	Debargha Ganguly et.al.	2409.17270	null
2024-09-20	CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency	Kangsheng Wang et.al.	2409.17174	null
2024-09-20	Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM	Zheng Wei Lim et.al.	2409.13949	null
2024-09-19	SituationAdapt: Contextual UI Optimization in Mixed Reality with Situation Awareness via LLM Reasoning	Zhipeng Li et.al.	2409.12836	null
2024-10-04	Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning	Jiaxin Wen et.al.	2409.12452	link
2024-12-16	Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data	Jiaming Zhou et.al.	2409.12437	link
2024-09-18	MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning	Justin Chih-Yao Chen et.al.	2409.12147	link
2024-11-05	Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent	Fatemeh Haji et.al.	2409.11527	link
2024-09-16	Enhancing RL Safety with Counterfactual LLM Reasoning	Dennis Gross et.al.	2409.10188	link
2024-09-11	Think Together and Work Better: Combining Humans’ and LLMs’ Think-Aloud Outcomes for Effective Text Evaluation	SeongYeub Chu et.al.	2409.07355	link

LLM Evaluation

Publish Date	Title	Authors	PDF	Code
2025-07-22	Mind the Gap: Evaluating the Representativeness of Quantitative Medical Language Reasoning LLM Benchmarks for African Disease Burdens	Fred Mutisya et.al.	2507.16322	null
2025-07-18	Document Haystack: A Long Context Multimodal Image/Document Understanding Vision LLM Benchmark	Goeric Huybrechts et.al.	2507.15882	null
2025-07-21	Left Leaning Models: AI Assumptions on Economic Policy	Maxim Chupilkin et.al.	2507.15771	null
2025-07-21	From Queries to Criteria: Understanding How Astronomers Evaluate LLMs	Alina Hyk et.al.	2507.15715	null
2025-07-21	Evaluating Text Style Transfer: A Nine-Language Benchmark for Text Detoxification	Vitaly Protasov et.al.	2507.15557	null
2025-07-15	LLM-based ambiguity detection in natural language instructions for collaborative surgical robots	Ana Davila et.al.	2507.11525	null
2025-07-15	DCR: Quantifying Data Contamination in LLMs Evaluation	Cheng Xu et.al.	2507.11405	null
2025-07-17	SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks	Pavel Adamenko et.al.	2507.11059	null
2025-07-11	OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique	Wasi Uddin Ahmad et.al.	2507.09075	null
2025-07-18	From KMMLU-Redux to KMMLU-Pro: A Professional Korean Benchmark Suite for LLM Evaluation	Seokhee Hong et.al.	2507.08924	null
2025-07-11	A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench	David Schlangen et.al.	2507.08491	null
2025-07-07	Train-before-Test Harmonizes Language Model Rankings	Guanhua Zhang et.al.	2507.05195	null
2025-07-13	SymbolicThought: Integrating Language Models and Symbolic Reasoning for Consistent and Interpretable Human Relationship Understanding	Runcong Zhao et.al.	2507.04189	null
2025-07-09	Skewed Score: A statistical framework to assess autograders	Magda Dubois et.al.	2507.03772	null
2025-07-12	Eka-Eval: A Comprehensive Evaluation Framework for Large Language Models in Indian Languages	Samridhi Raj Sinha et.al.	2507.01853	null
2025-07-01	Pitfalls of Evaluating Language Models with Open Benchmarks	Md. Najib Hasan et.al.	2507.00460	null
2025-06-30	AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data	JiaRu Wu et.al.	2506.23735	null
2025-06-27	WildSpeech-Bench: Benchmarking Audio LLMs in Natural Speech Conversation	Jian Zhang et.al.	2506.21875	null
2025-06-25	DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs	Ruokai Yin et.al.	2506.20194	null
2025-06-23	Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection	Lei Yu et.al.	2506.18245	null
2025-06-22	The Democratic Paradox in Large Language Models’ Underestimation of Press Freedom	I. Loaiza et.al.	2506.18045	null
2025-06-21	CodeMorph: Mitigating Data Leakage in Large Language Model Assessment	Hongzhou Rao et.al.	2506.17627	null
2025-06-20	Re-Evaluating Code LLM Benchmarks Under Semantic Mutation	Zhiyuan Pan et.al.	2506.17369	null
2025-06-19	LMR-BENCH: Evaluating LLM Agent’s Ability on Reproducing Language Modeling Research	Shuo Yan et.al.	2506.17335	null
2025-06-20	Do We Need Large VLMs for Spotting Soccer Actions?	Ritabrata Chakraborty et.al.	2506.17144	null
2025-06-17	SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models	Gyuhak Kim et.al.	2506.15021	null
2025-06-19	MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation	Xueqing Peng et.al.	2506.14028	null
2025-06-18	The NordDRG AI Benchmark for Large Language Models	Tapio Pitkäranta et.al.	2506.13790	link
2025-06-20	Domain Specific Benchmarks for Evaluating Multimodal Large Language Models	Khizar Anjum et.al.	2506.12958	null
2025-06-06	The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs	Songyang Liu et.al.	2506.11094	null
2025-05-22	NSW-EPNews: A News-Augmented Benchmark for Electricity Price Forecasting with LLMs	Zhaoge Bi et.al.	2506.11050	null
2025-04-23	Impact of Comments on LLM Comprehension of Legacy Code	Rock Sabetto et.al.	2506.11007	null
2025-06-12	LLM-Driven Personalized Answer Generation and Evaluation	Mohammadreza Molavi et.al.	2506.10829	null
2025-06-11	Textual Bayes: Quantifying Uncertainty in LLM-Based Systems	Brendan Leigh Ross et.al.	2506.10060	null
2025-06-16	Metritocracy: Representative Metrics for Lite Benchmarks	Ariel Procaccia et.al.	2506.09813	null
2025-06-10	Breaking the ICE: Exploring promises and challenges of benchmarks for Inference Carbon & Energy estimation for LLMs	Samarth Sikand et.al.	2506.08727	null
2025-06-10	Sample Efficient Demonstration Selection for In-Context Learning	Kiran Purohit et.al.	2506.08607	link
2025-06-09	How Benchmark Prediction from Fewer Data Misses the Mark	Guanhua Zhang et.al.	2506.07673	link
2025-06-09	Beyond Benchmarks: A Novel Framework for Domain-Specific LLM Evaluation and Knowledge Mapping	Nitin Sharma et.al.	2506.07658	null
2025-06-09	Vuyko Mistral: Adapting LLMs for Low-Resource Dialectal Translation	Roman Kyslyi et.al.	2506.07617	null
2025-06-05	LLM-First Search: Self-Guided Exploration of the Solution Space	Nathan Herr et.al.	2506.05213	link
2025-06-05	Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation	Noy Sternlicht et.al.	2506.05062	link
2025-06-04	BEAR: BGP Event Analysis and Reporting	Hanqing Li et.al.	2506.04514	link
2025-06-04	N$^2$: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion	Caleb Chin et.al.	2506.04166	link
2025-06-04	Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis	Kejian Zhu et.al.	2506.04142	null
2025-06-03	NetPress: Dynamically Generated LLM Benchmarks for Network Applications	Yajie Zhou et.al.	2506.03231	link
2025-06-04	PC-MoE: Memory-Efficient and Privacy-Preserving Collaborative Training for Mixture-of-Experts LLMs	Ze Yu Zhang et.al.	2506.02965	null
2025-06-02	Multilingual Definition Modeling	Edison Marrese-Taylor et.al.	2506.01489	null
2025-06-01	Taming LLMs by Scaling Learning Rates with Gradient Grouping	Siyuan Li et.al.	2506.01049	null
2025-06-06	Data Swarms: Optimizable Generation of Synthetic Evaluation Data	Shangbin Feng et.al.	2506.00741	null
2025-05-31	AgentAuditor: Human-Level Safety and Security Evaluation for LLM Agents	Hanjun Luo et.al.	2506.00641	null
2025-05-31	BenchHub: A Unified Benchmark Suite for Holistic and Customizable LLM Evaluation	Eunsu Kim et.al.	2506.00482	null
2025-05-30	MetaFaith: Faithful Natural Language Uncertainty Expression in LLMs	Gabrielle Kaili-May Liu et.al.	2505.24858	link
2025-05-30	Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization	Utsav Maskey et.al.	2505.24621	null
2025-05-30	Simulating Training Data Leakage in Multiple-Choice Benchmarks for LLM Evaluation	Naila Shafirni Hidayat et.al.	2505.24263	link
2025-05-29	Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs	Yinong Oliver Wang et.al.	2505.23996	null
2025-05-29	Revisiting Uncertainty Estimation and Calibration of Large Language Models	Linwei Tao et.al.	2505.23854	null
2025-05-28	Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective	Qingchuan Ma et.al.	2505.23833	link
2025-06-24	MemAscend: System Memory Optimization for SSD-Offloaded LLM Fine-Tuning	Yong-Cheng Liaw et.al.	2505.23254	null
2025-07-03	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null
2025-05-29	Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition	Hanting Chen et.al.	2505.22375	null
2025-05-28	ReliableEval: A Recipe for Stochastic LLM Evaluation via Method of Moments	Gili Lior et.al.	2505.22169	null
2025-05-28	Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate	Ashim Gupta et.al.	2505.21999	null
2025-05-21	SIMCOPILOT: Evaluating Large Language Models for Copilot-Style Code Generation	Mingchao Jiang et.al.	2505.21514	null
2025-05-26	Dynamically Learned Test-Time Model Routing in Language Model Zoos with Service Level Guarantees	Herbert Woisetschläger et.al.	2505.19947	null
2025-05-26	BizFinBench: A Business-Driven Real-World Financial Benchmark for Evaluating LLMs	Guilong Lu et.al.	2505.19457	link
2025-05-25	Likert or Not: LLM Absolute Relevance Judgments on Fine-Grained Ordinal Scales	Charles Godfrey et.al.	2505.19334	null
2025-05-25	Can Large Language Models Infer Causal Relationships from Real-World Text?	Ryan Saklad et.al.	2505.18931	null
2025-05-24	MedScore: Factuality Evaluation of Free-Form Medical Answers	Heyuan Huang et.al.	2505.18452	link
2025-05-23	How Can I Publish My LLM Benchmark Without Giving the True Answers Away?	Takashi Ishida et.al.	2505.18102	null
2025-05-23	ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph Reconstruction	Yan Yu et.al.	2505.17691	null
2025-05-22	CaseReportBench: An LLM Benchmark Dataset for Dense Information Extraction in Clinical Case Reports	Xiao Yu Cindy Zhang et.al.	2505.17265	null
2025-05-21	NEXT-EVAL: Next Evaluation of Traditional and LLM Web Data Record Extraction	Soyeon Kim et.al.	2505.17125	null
2025-05-21	Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector	Haoyan Yang et.al.	2505.17100	null
2025-05-22	AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios	Yunjia Qi et.al.	2505.16944	link
2025-05-22	CASTILLO: Characterizing Response Length Distributions of Large Language Models	Daniel F. Perez-Ramirez et.al.	2505.16881	link
2025-05-21	Reverse Engineering Human Preferences with Reinforcement Learning	Lisa Alazraki et.al.	2505.15795	null
2025-05-21	An Empirical Study of the Anchoring Effect in LLMs: Existence, Mechanism, and Potential Mitigations	Yiming Huang et.al.	2505.15392	null
2025-05-21	Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory	Hongli Zhou et.al.	2505.15055	link
2025-05-20	FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain	Rohan Deb et.al.	2505.14826	null
2025-05-20	Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?	Bo Feng et.al.	2505.14321	null
2025-05-29	YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering	Jennifer D’Souza et.al.	2505.14279	null
2025-05-20	Think-J: Learning to Think for Generative LLM-as-a-Judge	Hui Huang et.al.	2505.14268	link
2025-05-19	4Hammer: a board-game reinforcement learning environment for the hour long time frame	Massimo Fioravanti et.al.	2505.13638	link
2025-05-18	KG-QAGen: A Knowledge-Graph-Based Framework for Systematic Question Generation and Long-Context LLM Evaluation	Nikita Tatarinov et.al.	2505.12495	link
2025-05-17	Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation	Vincent Koc et.al.	2505.12058	link
2025-05-21	Model Performance-Guided Evaluation Data Selection for Effective Prompt Optimization	Ximing Dong et.al.	2505.10736	null
2025-05-13	A suite of LMs comprehend puzzle statements as well as humans	Adele E Goldberg et.al.	2505.08996	null
2025-05-13	Towards Contamination Resistant Benchmarks	Rahmatullah Musawi et.al.	2505.08389	null
2025-05-12	A Case Study Investigating the Role of Generative AI in Quality Evaluations of Epics in Agile Software Development	Werner Geyer et.al.	2505.07664	null
2025-05-09	LLMs Get Lost In Multi-Turn Conversation	Philippe Laban et.al.	2505.06120	link
2025-05-15	Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information	Joshua Harris et.al.	2505.06046	null
2025-05-02	Cer-Eval: Certifiable and Cost-Efficient Evaluation Framework for LLMs	Ganghua Wang et.al.	2505.03814	null
2025-05-29	am-ELO: A Stable Framework for Arena-based LLM Evaluation	Zirui Liu et.al.	2505.03475	null
2025-05-05	Developing A Framework to Support Human Evaluation of Bias in Generated Free Response Text	Jennifer Healey et.al.	2505.03053	null
2025-05-01	Unlearning Sensitive Information in Multimodal LLMs: Benchmark and Attack-Defense Evaluation	Vaidehi Patil et.al.	2505.01456	link
2025-04-30	A Report on the llms evaluating the high school questions	Zhu Jiawei et.al.	2505.00057	null
2025-04-30	RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations	Jonas Gwozdz et.al.	2504.21605	null
2025-04-30	Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges	Xiao Xiao et.al.	2504.21303	null
2025-04-27	LLM-Evaluation Tropes: Perspectives on the Validity of LLM-Evaluations	Laura Dietz et.al.	2504.19076	null
2025-04-23	Agree to Disagree? A Meta-Evaluation of LLM Misgendering	Arjun Subramonian et.al.	2504.17075	link
2025-04-23	IberBench: LLM Evaluation on Iberian Languages	José Ángel González et.al.	2504.16921	null
2025-04-23	Private Federated Learning using Preference-Optimized Synthetic Data	Charlie Hou et.al.	2504.16438	link
2025-04-29	Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark	Jasper Götting et.al.	2504.16137	null
2025-05-16	DMind Benchmark: Toward a Holistic Assessment of LLM Capabilities across the Web3 Domain	Enhao Huang et.al.	2504.16116	null
2025-04-22	Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach	Ruizhe Li et.al.	2504.15784	null
2025-04-20	Meta-Thinking in LLMs via Multi-Agent Reinforcement Learning: A Survey	Ahsan Bilal et.al.	2504.14520	null
2025-04-20	Information Diffusion and Preferential Attachment in a Network of Large Language Models	Adit Jain et.al.	2504.14438	null
2025-04-18	MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks	Jaime Raldua Veuthey et.al.	2504.14039	null
2025-04-17	ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition	Haidar Khan et.al.	2504.12562	link
2025-04-17	ELAB: Extensive LLM Alignment Benchmark in Persian Language	Zahra Pourbahman et.al.	2504.12553	null
2025-04-16	MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models	Hang Yuan et.al.	2504.12234	null
2025-04-17	Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation	Julia Kreutzer et.al.	2504.11829	null
2025-04-14	HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving	Avinash Kumar et.al.	2504.10724	null
2025-05-19	Large Language Models Could Be Rote Learners	Yuyang Xu et.al.	2504.08300	null
2025-05-30	DeepSeek-R1 vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?	Daniil Larionov et.al.	2504.08120	null
2025-05-15	Model Utility Law: Evaluating LLMs beyond Performance through Mechanism Interpretable Metric	Yixin Cao et.al.	2504.07440	link
2025-06-20	TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models	Sher Badshah et.al.	2504.07385	null
2025-04-08	NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge	Firoj Alam et.al.	2504.05995	null
2025-04-09	How Accurately Do Large Language Models Understand Code?	Sabaat Haroon et.al.	2504.04372	null
2025-04-04	Do LLM Evaluators Prefer Themselves for a Reason?	Wei-Lin Chen et.al.	2504.03846	link
2025-04-15	Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning	Kai Ye et.al.	2504.03784	null
2025-04-04	Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency	Erik Johannes Husom et.al.	2504.03360	null
2025-04-02	YourBench: Easy Custom Evaluation Sets for Everyone	Sumuk Shashidhar et.al.	2504.01833	link
2025-04-08	Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?	Kai Yan et.al.	2504.00509	null
2025-04-01	HRET: A Self-Evolving LLM Evaluation Toolkit for Korean	Hanwool Lee et.al.	2503.22968	null
2025-03-27	CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?	Jiefu Ou et.al.	2503.21717	link
2025-03-27	Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach	Javier Coronado-Blázquez et.al.	2503.21613	null
2025-05-19	Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models	Haoxiang Sun et.al.	2503.21380	link
2025-03-25	FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models	Dahyun Jung et.al.	2503.19540	link
2025-05-30	LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages	Patrick Diehl et.al.	2503.19217	null
2025-03-28	Overtrained Language Models Are Harder to Fine-Tune	Jacob Mitchell Springer et.al.	2503.19206	null
2025-03-25	Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes	Kelly O. Marshall et.al.	2503.18155	null
2025-05-14	Evaluating Clinical Competencies of Large Language Models with a General Practice Benchmark	Zheqing Li et.al.	2503.17599	null
2025-03-20	The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination	Yifan Sun et.al.	2503.16402	link
2025-03-20	Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation	Shangqing Zhao et.al.	2503.15837	link
2025-06-08	Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering	Francesco Maria Molfese et.al.	2503.14996	null
2025-03-13	It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education	Shrutika Singh et.al.	2503.13508	null
2025-03-17	REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities	Alexander Pugachev et.al.	2503.13102	null
2025-03-14	V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning	Zixu Cheng et.al.	2503.11495	null
2025-06-03	OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses	Angela Lopez-Cardona et.al.	2503.10927	link
2025-03-13	Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data	Paul Quinlan et.al.	2503.10883	null
2025-03-13	Commenting Higher-level Code Unit: Full Code, Reduced Code, or Hierarchical Code Summarization	Weisong Sun et.al.	2503.10737	null
2025-03-12	Medical Large Language Model Benchmarks Should Prioritize Construct Validity	Ahmed Alaa et.al.	2503.10694	null
2025-04-17	ZeroSumEval: An Extensible Framework For Scaling LLM Evaluation with Inter-Model Competition	Hisham A. Alyahya et.al.	2503.10673	link
2025-05-20	RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs	Zhongzhan Huang et.al.	2503.10657	link
2025-05-26	MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation	Weihao Xuan et.al.	2503.10497	null
2025-03-12	Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts	Hongyu Chen et.al.	2503.09347	null
2025-03-08	SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?	Xudong Lu et.al.	2503.06029	null
2025-03-07	SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs	Samir Abdaljalil et.al.	2503.05980	null
2025-03-07	RocketEval: Efficient Automated LLM Evaluation via Grading Checklist	Tianjun Wei et.al.	2503.05142	link
2025-02-09	Peeking Behind Closed Doors: Risks of LLM Evaluation by Private Data Curators	Hritik Bansal et.al.	2503.04756	null
2025-03-07	Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm	Hyeonjun Kim et.al.	2503.03796	null
2025-03-04	SAGE: Steering and Refining Dialog Generation with State-Action Augmentation	Yizhe Zhang et.al.	2503.03040	link
2025-05-28	Position: Don’t Use the CLT in LLM Evals With Fewer Than a Few Hundred Datapoints	Sam Bowyer et.al.	2503.01747	null
2025-03-04	DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation	Eliya Habba et.al.	2503.01622	null
2025-03-03	None of the Above, Less of the Right: Parallel Patterns between Humans and LLMs on Multi-Choice Questions Answering	Zhi Rui Tam et.al.	2503.01550	null
2025-03-03	SwiLTra-Bench: The Swiss Legal Translation Benchmark	Joel Niklaus et.al.	2503.01372	null
2025-03-03	LLM-Advisor: An LLM Benchmark for Cost-efficient Path Planning across Multiple Terrains	Ling Xiao et.al.	2503.01236	null
2025-03-02	FunBench: Benchmarking Fundus Reading Skills of MLLMs	Qijie Wei et.al.	2503.00901	null
2025-03-02	Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks	Umar Ali Khan et.al.	2503.00781	null
2025-04-12	Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity	Yupu Hao et.al.	2503.00771	link
2025-03-01	U-NIAH: Unified RAG and LLM Evaluation for Long Context Needle-In-A-Haystack	Yunfan Gao et.al.	2503.00353	link
2025-02-28	Jawaher: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking	Samar M. Magdy et.al.	2503.00231	null
2025-02-28	Consistency Evaluation of News Article Summaries Generated by Large (and Small) Language Models	Colleen Gilhuly et.al.	2502.20647	null
2025-05-23	Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review	Sungduk Yu et.al.	2502.19614	null
2025-02-26	Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation	Yuxiang Wang et.al.	2502.18771	link
2025-02-23	Recent Advances in Large Language Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation	Simin Chen et.al.	2502.17521	link
2025-05-23	Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective	Chengyin Xu et.al.	2502.17262	null
2025-02-24	Detecting Benchmark Contamination Through Watermarking	Tom Sander et.al.	2502.17259	null
2025-02-24	Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation	Jaskaran Singh Walia et.al.	2502.17011	null
2025-02-24	AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha Decay	Ziyi Tang et.al.	2502.16789	link
2025-01-30	Retrieval Augmented Generation Based LLM Evaluation For Protocol State Machine Inference With Chain-of-Thought Reasoning	Youssef Maklad et.al.	2502.15727	null
2025-03-10	Prompt-to-Leaderboard	Evan Frick et.al.	2502.14855	link
2025-03-28	SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines	M-A-P Team et.al.	2502.14739	null
2025-02-20	SEA-HELM: Southeast Asian Holistic Evaluation of Language Models	Yosephine Susanto et.al.	2502.14301	null
2025-02-20	Transfer-Prompting: Enhancing Cross-Task Adaptation in Large Language Models via Dual-Stage Prompts Optimization	Yupeng Chang et.al.	2502.14211	link
2025-02-19	Which of These Best Describes Multiple Choice Evaluation with LLMs? A) Forced B) Flawed C) Fixable D) All of the Above	Nishant Balepur et.al.	2502.14127	null
2025-02-19	STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models	Narun Raman et.al.	2502.13119	null
2025-02-18	HPSS: Heuristic Prompting Strategy Search for LLM Evaluators	Bosi Wen et.al.	2502.13031	null
2025-05-23	None of the Others: a General Technique to Distinguish Reasoning from Memorization in Multiple-Choice LLM Evaluation Benchmarks	Eva Sánchez Salido et.al.	2502.12896	null
2025-04-08	Safe at the Margins: A General Approach to Safety Alignment in Low-Resource English Languages – A Singlish Case Study	Isaac Lim et.al.	2502.12485	null
2025-02-17	Deviation Ratings: A General, Clone-Invariant Rating Method	Luke Marris et.al.	2502.11645	null
2025-02-21	TituLLMs: A Family of Bangla LLMs with Comprehensive Benchmarking	Shahriar Kabir Nahin et.al.	2502.11187	null
2025-02-15	Rule-Bottleneck Reinforcement Learning: Joint Explanation and Decision Optimization for Resource Allocation with Language Agents	Mauricio Tec et.al.	2502.10732	null
2025-03-02	An Empirical Analysis of Uncertainty in Large Language Model Evaluations	Qiujie Xie et.al.	2502.10709	link
2025-02-25	Accelerating Unbiased LLM Evaluation via Synthetic Feedback	Zhaoyi Zhou et.al.	2502.10563	link
2025-02-14	MathConstruct: Challenging LLM Reasoning with Constructive Proofs	Mislav Balunović et.al.	2502.10197	null
2025-02-13	Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization	Amit Levi et.al.	2502.09755	null
2025-02-13	NestQuant: Nested Lattice Quantization for Matrix Products and LLMs	Semyon Savkin et.al.	2502.09720	null
2025-02-12	The Science of Evaluating Foundation Models	Jiayi Yuan et.al.	2502.09670	null
2025-02-13	Copilot Arena: A Platform for Code LLM Evaluation in the Wild	Wayne Chi et.al.	2502.09328	null
2025-02-12	Revisiting 3D LLM Benchmarks: Are We Really Testing 3D Capabilities?	Jiahe Jin et.al.	2502.08503	link
2025-02-11	Forget What You Know about LLMs Evaluations – LLMs are Like a Chameleon	Nurit Cohen-Inger et.al.	2502.07445	link
2025-02-10	Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring	Alex Heyman et.al.	2502.07087	link
2025-02-10	Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models	Lujain Ibrahim et.al.	2502.07077	null
2025-02-07	LLM-Supported Natural Language to Bash Translation	Finnian Westenfelder et.al.	2502.06858	link
2025-02-15	Self-Supervised Prompt Optimization	Jinyu Xiang et.al.	2502.06855	link
2025-02-10	Resurrecting saturated LLM benchmarks with adversarial encoding	Igor Ivanov et.al.	2502.06738	null
2025-02-10	Automatic Evaluation of Healthcare LLMs Beyond Question-Answering	Anna Arias-Duart et.al.	2502.06666	null
2025-02-10	Unbiased Evaluation of Large Language Models from a Causal Perspective	Meilin Chen et.al.	2502.06655	null
2025-02-10	LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks	Xin Zhou et.al.	2502.06215	null
2025-02-05	Aero-LLM: A Distributed Framework for Secure UAV Communication and Intelligent Decision-Making	Balakrishnan Dharmalingam et.al.	2502.05220	null
2025-02-06	TruthFlow: Truthful LLM Generation via Representation Flow Correction	Hanyu Wang et.al.	2502.04556	null
2025-02-05	How do Humans and Language Models Reason About Creativity? A Comparative Analysis	Antonio Laverghetta Jr. et.al.	2502.03253	null
2025-03-22	On Zero-Initialized Attention: Optimal Prompt and Gating Factor Estimation	Nghiem T. Diep et.al.	2502.03029	null
2025-02-02	LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient	Peiwen Yuan et.al.	2502.01683	link
2025-02-02	HASSLE-free: A unified Framework for Sparse plus Low-Rank Matrix Decomposition for LLMs	Mehdi Makni et.al.	2502.00899	null
2025-02-01	DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks	Zhiliang Chen et.al.	2502.00270	link
2025-01-30	Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation	Muhammed Yusuf Kocyigit et.al.	2501.18771	null
2025-01-31	ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation	Minghua He et.al.	2501.18460	null
2025-02-01	LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering	Beiming Liu et.al.	2501.17183	null
2025-03-18	An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue	Koji Inoue et.al.	2501.16643	null
2025-01-26	HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI	Tidor-Vlad Pricope et.al.	2501.15627	null
2025-01-23	Question Answering on Patient Medical Records with Private Fine-Tuned LLMs	Sara Kothari et.al.	2501.13687	null
2025-01-10	CodEv: An Automated Grading Framework Leveraging Large Language Models for Consistent and Constructive Feedback	En-Qi Tseng et.al.	2501.10421	null
2025-01-15	Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History	Yevhen Kostiuk et.al.	2501.09154	null
2025-01-13	Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles	Samia Touileb et.al.	2501.07718	null
2025-01-03	FLAME: Financial Large-Language Model Assessment and Metrics Evaluation	Jiayu Guo et.al.	2501.06211	link
2025-01-07	MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems	Yannis Katsis et.al.	2501.03468	link
2025-01-05	Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm	Ljubisa Bojic et.al.	2501.02532	null
2025-01-04	LLMzSzŁ: a comprehensive LLM benchmark for Polish	Krzysztof Jassem et.al.	2501.02266	null
2025-03-25	VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM	Yuqian Yuan et.al.	2501.00599	link
2025-01-04	Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation	M. Ali Bayram et.al.	2501.00593	null
2024-12-31	Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs	Weijia Xu et.al.	2501.00273	null
2024-12-30	EVOLVE: Emotion and Visual Output Learning via LLM Evaluation	Jordan Sinclair et.al.	2412.20632	null
2024-12-24	Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles	Zihan Wang et.al.	2412.18416	null
2024-12-24	A Statistical Framework for Ranking LLM-Based Chatbots	Siavash Ameli et.al.	2412.18407	link
2025-01-25	DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation	Junyi Lu et.al.	2412.18291	null
2024-12-23	CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models	Ruibo Tu et.al.	2412.17970	link
2025-01-02	Baichuan4-Finance Technical Report	Hanyu Zhang et.al.	2412.15270	null
2024-12-19	ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects	Qihang Cao et.al.	2412.14837	null
2024-12-18	AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge	Xiaobao Wu et.al.	2412.13670	link
2025-02-16	Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning	Eitan Wagner et.al.	2412.13631	null
2025-02-17	OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain	Shuting Wang et.al.	2412.13018	link
2024-12-10	How to Choose a Threshold for an Evaluation Metric for Large Language Models	Bhaskarjit Sarmah et.al.	2412.12148	null
2024-12-15	Dual Traits in Probabilistic Reasoning of Large Language Models	Shenxiong Li et.al.	2412.11009	link
2024-12-30	LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation	Eunsu Kim et.al.	2412.10424	null
2024-12-13	Cultural Evolution of Cooperation among LLM Agents	Aron Vallinder et.al.	2412.10270	null
2024-12-12	Towards Understanding the Robustness of LLM-based Evaluations under Perturbations	Manav Chaudhary et.al.	2412.09269	null
2024-12-10	BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities	Sahal Shaji Mullappilly et.al.	2412.07769	link
2025-02-28	PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models	Qian Zhang et.al.	2412.06287	link
2024-12-02	AI Benchmarks and Datasets for LLM Evaluation	Todor Ivanov et.al.	2412.01020	null
2024-11-30	Evaluating the Consistency of LLM Evaluators	Noah Lee et.al.	2412.00543	null
2024-11-29	MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks	John Francis et.al.	2411.19689	null
2024-11-29	Beyond Surface Structure: A Causal Assessment of LLMs’ Comprehension Ability	Yujin Han et.al.	2411.19456	link
2024-11-27	Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator	Frederic Kirstein et.al.	2411.18444	null
2025-01-17	CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity	Zhengmin Yu et.al.	2411.16239	link
2024-11-25	SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text	Reshmi Ghosh et.al.	2411.16077	null
2024-11-26	Do LLMs Agree on the Creativity Evaluation of Alternative Uses?	Abdullah Al Rabeyah et.al.	2411.15560	null
2025-02-17	Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat	Roland Daynauth et.al.	2411.14483	link
2024-11-21	Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models	Lovish Madaan et.al.	2411.14103	null
2024-11-21	An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture	Boming Xia et.al.	2411.13768	null
2024-11-21	A Framework for Evaluating LLMs Under Task Indeterminacy	Luke Guerdan et.al.	2411.13760	null
2024-11-12	Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning	Linyang He et.al.	2411.07533	null
2024-11-13	Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models	Yancheng He et.al.	2411.07140	null
2024-11-09	Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models	Xiaojun Wu et.al.	2411.06272	link
2025-02-09	ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding	Israel Abebe Azime et.al.	2411.05049	null
2024-11-07	Bayesian Calibration of Win Rate Estimation with LLM Evaluators	Yicheng Gao et.al.	2411.04424	link
2024-11-05	Enhancing LLM Evaluations: The Garbling Trick	William F. Bradley et.al.	2411.01533	null
2025-02-19	Varco Arena: A Tournament Approach to Reference-Free Benchmarking Large Language Models	Seonil Son et.al.	2411.01281	null
2025-02-07	Mastering the Craft of Data Synthesis for CodeLLMs	Meng Chen et.al.	2411.00005	link
2024-10-28	Project MPG: towards a generalized performance benchmark for LLM capabilities	Lucas Spangher et.al.	2410.22368	null
2024-10-29	Self-Preference Bias in LLM-as-a-Judge	Koki Wataoka et.al.	2410.21819	null
2024-10-28	Unveiling Context-Aware Criteria in Self-Assessing LLMs	Taneesh Gupta et.al.	2410.21545	null
2024-10-27	LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization	Jui-Nan Yen et.al.	2410.20625	link
2024-10-26	Limitations of the LLM-as-a-Judge Approach for Evaluating LLM Outputs in Expert Knowledge Tasks	Annalisa Szymanski et.al.	2410.20266	null
2024-10-23	MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning	Jingfan Zhang et.al.	2410.18035	null
2025-02-21	Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements	Isamu Isozaki et.al.	2410.17141	link
2024-10-21	CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution	Maosong Cao et.al.	2410.16256	link
2025-01-26	mHumanEval – A Multilingual Benchmark to Evaluate Large Language Models for Code Generation	Nishat Raihan et.al.	2410.15037	link
2024-10-19	CAP: Data Contamination Detection via Consistency Amplification	Yi Zhao et.al.	2410.15005	null
2024-10-18	Enabling Scalable Evaluation of Bias Patterns in Medical LLMs	Hamed Fayyaz et.al.	2410.14763	link
2024-11-06	Diverging Preferences: When do Annotators Disagree and do Models Know?	Michael JQ Zhang et.al.	2410.14632	null
2024-10-18	Combining Entropy and Matrix Nuclear Norm for Enhanced Evaluation of Language Models	James Vo et.al.	2410.14480	null
2024-10-21	BenTo: Benchmark Task Reduction with In-Context Transferability	Hongyu Zhao et.al.	2410.13804	link
2024-10-16	BenchmarkCards: Large Language Model and Risk Reporting	Anna Sokol et.al.	2410.12974	null
2025-02-01	Language Model Preference Evaluation with Multiple Weak Evaluators	Zhengyu Hu et.al.	2410.12869	link
2024-10-11	Enterprise Benchmarks for Large Language Model Evaluation	Bing Zhang et.al.	2410.12857	link
2024-10-16	An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation	Junjie Chen et.al.	2410.12265	null
2024-10-15	Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers	Lorenzo Pacchiardi et.al.	2410.11672	link
2024-10-15	Black-box Uncertainty Quantification Method for LLM-as-a-Judge	Nico Wagner et.al.	2410.11594	null
2024-10-14	Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting	Yifan Luo et.al.	2410.10150	null
2024-12-13	HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics	Jingxuan Fan et.al.	2410.09988	link
2024-10-15	LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models	Han Qiu et.al.	2410.09962	link
2024-10-17	Towards Multilingual LLM Evaluation for European Languages	Klaudia Thellmann et.al.	2410.08928	null
2024-10-11	Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example	Marcus Kessel et.al.	2410.08911	null
2024-10-10	Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks	Mathis Pink et.al.	2410.08133	null
2025-02-03	COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act	Philipp Guldimann et.al.	2410.07959	link
2024-11-06	News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News	Tarun Jain et.al.	2410.07520	null
2024-10-09	Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates	Xiaosen Zheng et.al.	2410.07137	link
2024-10-09	ReIFE: Re-evaluating Instruction-Following Evaluation	Yixin Liu et.al.	2410.07069	link
2024-10-08	Active Evaluation Acquisition for Efficient LLM Benchmarking	Yang Li et.al.	2410.05952	null
2024-10-07	TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles	Qingchen Yu et.al.	2410.05262	link
2024-10-01	Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model	Aidan Gilson et.al.	2410.03740	null
2024-10-04	TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation	Jonathan Cook et.al.	2410.03608	null
2024-10-04	Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores	Robert E. Blackwell et.al.	2410.03492	null
2024-10-29	AIME: AI System Optimization via Multiple LLM Evaluators	Bhrij Patel et.al.	2410.03131	null
2024-10-02	Comparing Criteria Development Across Domain Experts, Lay Users, and Models in Large Language Model Evaluation	Annalisa Szymanski et.al.	2410.02054	null
2024-10-02	Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models	Joseph Lee et.al.	2410.01795	link
2024-10-03	Extending Context Window of Large Language Models from a Distributional Perspective	Yingsheng Wu et.al.	2410.01490	link
2024-10-02	ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving	Yifan Qiao et.al.	2410.01228	null
2024-10-01	ViDAS: Vision-based Danger Assessment and Scoring	Pranav Gupta et.al.	2410.00477	null
2024-10-01	PclGPT: A Large Language Model for Patronizing and Condescending Language Detection	Hongbo Wang et.al.	2410.00361	link
2024-11-26	LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models	Haitao Li et.al.	2409.20288	link
2024-09-29	Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems	Xuyang Wu et.al.	2409.19804	link
2024-10-19	Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models	Xin Li et.al.	2409.19667	link
2024-10-05	IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation	Fan Lin et.al.	2409.18892	link
2024-12-13	A Character-Centric Creative Story Generation via Imagination	Kyeongman Park et.al.	2409.16667	null
2024-09-25	Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models	Sungjune Park et.al.	2409.16635	null
2024-12-18	Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino	Jann Railey Montalan et.al.	2409.15380	link
2024-12-16	MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators	Qingyu Lu et.al.	2409.14335	link
2024-09-21	ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models	Yuqing Huang et.al.	2409.13989	link
2024-12-17	AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs	Basel Mousi et.al.	2409.11404	null
2024-10-02	LLM-as-a-Judge & Reward Model: What They Can and Cannot Do	Guijin Son et.al.	2409.11239	null
2024-12-08	Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges	Vinay Samuel et.al.	2409.09927	link
2024-09-13	Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia	Fajri Koto et.al.	2409.08564	null
2024-09-09	Assessing SPARQL capabilities of Large Language Models	Lars-Peter Meyer et.al.	2409.05925	link
2024-10-08	LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs	Yuhao Wu et.al.	2409.02076	link
2024-10-14	Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation	Jasper Dekoninck et.al.	2409.00696	null
2024-08-26	Evaluating ChatGPT on Nuclear Domain-Specific Data	Muhammad Anwar et.al.	2409.00090	null
2024-08-28	LLMSecCode: Evaluating Large Language Models for Secure Coding	Anton Rydén et.al.	2408.16100	link
2024-08-26	LLM-3D Print: Large Language Models To Monitor and Control 3D Printing	Yayati Jadhav et.al.	2408.14307	null
2024-08-26	Epidemic Information Extraction for Event-Based Surveillance using Large Language Models	Sergio Consoli et.al.	2408.14277	null
2024-10-04	MobileQuant: Mobile-friendly Quantization for On-device Language Models	Fuwen Tan et.al.	2408.13933	link
2024-08-23	LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models	Chongyan Sun et.al.	2408.13338	null
2024-08-23	Open Llama2 Model for the Lithuanian Language	Artūras Nakvosas et.al.	2408.12963	null
2024-08-23	LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction	Songwei Li et.al.	2408.12832	link
2024-12-20	Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts	Jiaqing Liu et.al.	2408.09688	null
2024-08-20	Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge	Ravi Raju et.al.	2408.08808	null
2024-10-16	The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation	Samee Arif et.al.	2408.08688	link
2024-10-19	Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks	Junseok Kim et.al.	2408.08631	null

LLM MLLM

Publish Date	Title	Authors	PDF	Code
2025-07-23	Yume: An Interactive World Generation Model	Xiaofeng Mao et.al.	2507.17744	null
2025-07-23	Flow Matching Meets Biology and Life Science: A Survey	Zihao Li et.al.	2507.17731	null
2025-07-23	BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems	Malsha Ashani Mahawatta Dona et.al.	2507.17722	null
2025-07-23	AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer	Danny D. Leybzon et.al.	2507.17718	null
2025-07-23	HydraOpt: Navigating the Efficiency-Performance Trade-off of Adapter Merging	Taha Ceritli et.al.	2507.17706	null
2025-07-23	Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models	Changxin Tian et.al.	2507.17702	null
2025-07-23	Thinking Isn’t an Illusion: Overcoming the Limitations of Reasoning Models via Tool Augmentations	Zhao Song et.al.	2507.17699	null
2025-07-23	Symbiotic Agents: A Novel Paradigm for Trustworthy AGI-driven Networks	Ilias Chatzistefanidis et.al.	2507.17695	null
2025-07-23	Simulating multiple human perspectives in socio-ecological systems using large language models	Yongchao Zeng et.al.	2507.17680	null
2025-07-23	See the Forest and the Trees: A Synergistic Reasoning Framework for Knowledge-Based Visual Question Answering	Junjie Wang et.al.	2507.17659	null
2025-07-23	CNS-Bench: Benchmarking Image Classifier Robustness Under Continuous Nuisance Shifts	Olaf Dünkel et.al.	2507.17651	null
2025-07-23	Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries	Victor Hartman et.al.	2507.17636	null
2025-07-23	A Hybrid Early-Exit Algorithm for Large Language Models Based on Space Alignment Decoding (SPADE)	Bowen Zheng et.al.	2507.17618	null
2025-07-23	CodeReasoner: Enhancing the Code Reasoning Ability with Reinforcement Learning	Lingxiao Tang et.al.	2507.17548	null
2025-07-23	Anticipate, Simulate, Reason (ASR): A Comprehensive Generative AI Framework for Combating Messaging Scams	Xue Wen Tan et.al.	2507.17543	null
2025-07-23	AssertFlip: Reproducing Bugs via Inversion of LLM-Generated Passing Tests	Lara Khatib et.al.	2507.17542	null
2025-07-23	Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning	Xinyao Liu et.al.	2507.17539	null
2025-07-23	Enabling Cyber Security Education through Digital Twins and Generative AI	Vita Santa Barletta et.al.	2507.17518	null
2025-07-23	URPO: A Unified Reward & Policy Optimization Framework for Large Language Models	Songshuo Lu et.al.	2507.17515	null
2025-07-23	HOTA: Hamiltonian framework for Optimal Transport Advection	Nazar Buzun et.al.	2507.17513	null
2025-07-23	Unsupervised anomaly detection using Bayesian flow networks: application to brain FDG PET in the context of Alzheimer’s disease	Hugues Roy et.al.	2507.17486	null
2025-07-23	An Uncertainty-Driven Adaptive Self-Alignment Framework for Large Language Models	Haoran Sun et.al.	2507.17477	null
2025-07-23	MultiNRC: A Challenging and Native Multilingual Reasoning Evaluation Benchmark for LLMs	Alexander R. Fabbri et.al.	2507.17476	null
2025-07-23	BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles	Junhua Liu et.al.	2507.17472	null
2025-07-23	ERMV: Editing 4D Robotic Multi-view images to enhance embodied agents	Chang Nie et.al.	2507.17462	null
2025-07-23	Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning	Situo Zhang et.al.	2507.17448	null
2025-07-23	Each to Their Own: Exploring the Optimal Embedding in RAG	Shiting Chen et.al.	2507.17442	null
2025-07-23	A Comprehensive Evaluation on Quantization Techniques for Large Language Models	Yutong Liu et.al.	2507.17417	null
2025-07-23	HiProbe-VAD: Video Anomaly Detection via Hidden States Probing in Tuning-Free Multimodal LLMs	Zhaolin Cai et.al.	2507.17394	null
2025-07-23	Investigating Training Data Detection in AI Coders	Tianlin Li et.al.	2507.17389	null
2025-07-23	Confidence Calibration in Vision-Language-Action Models	Thomas P Zollo et.al.	2507.17383	null
2025-07-23	Language-Conditioned Open-Vocabulary Mobile Manipulation with Pretrained Models	Shen Tan et.al.	2507.17379	null
2025-07-23	DynaSearcher: Dynamic Knowledge Graph Augmented Search Agent via Multi-Reward Reinforcement Learning	Chuzhan Hao et.al.	2507.17365	null
2025-07-23	RoadBench: A Vision-Language Foundation Model and Benchmark for Road Damage Understanding	Xi Xiao et.al.	2507.17353	null
2025-07-23	CartoonAlive: Towards Expressive Live2D Modeling from Single Portraits	Chao He et.al.	2507.17327	null
2025-07-23	Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task	Milena Davudova et.al.	2507.17326	null
2025-07-23	R-Stitch: Dynamic Trajectory Stitching for Efficient Reasoning	Zhuokun Chen et.al.	2507.17307	null
2025-07-23	A Versatile Pathology Co-pilot via Reasoning Enhanced Multimodal Large Language Model	Zhe Xu et.al.	2507.17303	null
2025-07-23	Exploring the Potential of LLMs for Serendipity Evaluation in Recommender Systems	Li Kang et.al.	2507.17290	null
2025-07-23	Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge	Miaomiao Gao et.al.	2507.17288	null
2025-07-23	Fully Automated SAM for Single-source Domain Generalization in Medical Image Segmentation	Huanli Zhuo et.al.	2507.17281	null
2025-07-23	Leveraging Knowledge Graphs and LLM Reasoning to Identify Operational Bottlenecks for Warehouse Planning Assistance	Rishi Parekh et.al.	2507.17273	null
2025-07-23	Seed&Steer: Guiding Large Language Models with Compilable Prefix and Branch Signals for Unit Test Generation	Shuaiyu Zhou et.al.	2507.17271	null
2025-07-23	Understanding Prompt Programming Tasks and Questions	Jenny T. Liang et.al.	2507.17264	null
2025-07-23	Tab-MIA: A Benchmark Dataset for Membership Inference Attacks on Tabular Data in LLMs	Eyal German et.al.	2507.17259	null
2025-07-23	Agent Identity Evals: Measuring Agentic Identity	Elija Perrier et.al.	2507.17257	null
2025-07-23	Rethinking VAE: From Continuous to Discrete Representations Without Probabilistic Assumptions	Songxuan Shi et.al.	2507.17255	null
2025-07-23	R4ec: A Reasoning, Reflection, and Refinement Framework for Recommendation Systems	Hao Gu et.al.	2507.17249	null
2025-07-23	Perceptual Classifiers: Detecting Generative Images using Perceptual Features	Krishna Srikar Durbha et.al.	2507.17240	null
2025-07-23	MaskedCLIP: Bridging the Masked and CLIP Space for Semi-Supervised Medical Vision-Language Pre-training	Lei Zhu et.al.	2507.17239	null
2025-07-23	A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task	Mashiro Toyooka et.al.	2507.17232	null
2025-07-23	PIG-Nav: Key Insights for Pretrained Image Goal Navigation Models	Jiansong Wan et.al.	2507.17220	null
2025-07-23	The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models	Giuseppe Russo et.al.	2507.17216	null
2025-07-23	EFS: Evolutionary Factor Searching for Sparse Portfolio Optimization Using Large Language Models	Haochen Luo et.al.	2507.17211	null
2025-07-23	HypoChainer: A Collaborative System Combining LLMs and Knowledge Graphs for Hypothesis-Driven Scientific Discovery	Haoran Jiang et.al.	2507.17209	null
2025-07-23	Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation	Zixuan Wang et.al.	2507.17204	null
2025-07-23	DesignLab: Designing Slides Through Iterative Detection and Correction	Jooyeol Yun et.al.	2507.17202	null
2025-07-23	Vec2Face+ for Face Dataset Generation	Haiyu Wu et.al.	2507.17192	null
2025-07-23	LLM Meets the Sky: Heuristic Multi-Agent Reinforcement Learning for Secure Heterogeneous UAV Networks	Lijie Zheng et.al.	2507.17188	null
2025-07-23	SKA-Bench: A Fine-Grained Benchmark for Evaluating Structured Knowledge Understanding of LLMs	Zhiqiang Liu et.al.	2507.17178	null
2025-07-23	Improving LLMs’ Generalized Reasoning Abilities by Graph Problems	Qifan Zhang et.al.	2507.17168	null
2025-07-23	Can LLMs Write CI? A Study on Automatic Generation of GitHub Actions Configurations	Taher A. Ghaleb et.al.	2507.17165	null
2025-07-23	DOOMGAN: High-Fidelity Dynamic Identity Obfuscation Ocular Generative Morphing	Bharath Krishnamurthy et.al.	2507.17158	null
2025-07-23	UNICE: Training A Universal Image Contrast Enhancer	Ruodai Cui et.al.	2507.17157	null
2025-07-23	CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards	Cheng Liu et.al.	2507.17147	null
2025-07-23	SADA: Stability-guided Adaptive Diffusion Acceleration	Ting Jiang et.al.	2507.17135	null
2025-07-23	Resilient Multi-Agent Negotiation for Medical Supply Chains: Integrating LLMs and Blockchain for Transparent Coordination	Mariam ALMutairi et.al.	2507.17134	null
2025-07-23	BrownoutServe: SLO-Aware Inference Serving under Bursty Workloads for MoE-based LLMs	Jianmin Hu et.al.	2507.17133	null
2025-07-23	Enabling Self-Improving Agents to Learn at Test Time With Human-In-The-Loop Guidance	Yufei He et.al.	2507.17131	null
2025-07-23	BucketServe: Bucket-Based Dynamic Batching for Smart and Efficient LLM Inference Serving	Wanyi Zheng et.al.	2507.17120	null
2025-07-23	HySafe-AI: Hybrid Safety Architectural Analysis Framework for AI Systems: A Case Study	Mandar Pitale et.al.	2507.17118	null
2025-07-23	Probabilistic Graphical Models: A Concise Tutorial	Jacqueline Maasch et.al.	2507.17116	null
2025-07-23	Enhancing Transferability and Consistency in Cross-Domain Recommendations via Supervised Disentanglement	Yuhan Wang et.al.	2507.17112	null
2025-07-23	Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models	Andrii Balashov et.al.	2507.17107	null
2025-07-22	Risk In Context: Benchmarking Privacy Leakage of Foundation Models in Synthetic Tabular Data Generation	Jessup Byun et.al.	2507.17066	null
2025-07-22	Parallelism Meets Adaptiveness: Scalable Documents Understanding in Multi-Agent LLM Systems	Chengxuan Xia et.al.	2507.17061	null
2025-07-22	Toward Scalable Video Narration: A Training-free Approach Using Multimodal Large Language Models	Tz-Ying Wu et.al.	2507.17050	null
2025-07-22	Controllable Hybrid Captioner for Improved Long-form Video Understanding	Kuleen Sasse et.al.	2507.17047	null
2025-07-22	Write, Rank, or Rate: Comparing Methods for Studying Visualization Affordances	Chase Stokes et.al.	2507.17024	null
2025-07-22	Causal Graph Fuzzy LLMs: A First Introduction and Applications in Time Series Forecasting	Omid Orang et.al.	2507.17016	null
2025-07-22	Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?	Arduin Findeis et.al.	2507.17015	null
2025-07-22	Multi-Label Classification with Generative AI Models in Healthcare: A Case Study of Suicidality and Risk Factors	Ming Huang et.al.	2507.17009	null
2025-07-22	Bringing Balance to Hand Shape Classification: Mitigating Data Imbalance Through Generative Models	Gaston Gustavo Rios et.al.	2507.17008	null
2025-07-22	PyG 2.0: Scalable Learning on Real World Graphs	Matthias Fey et.al.	2507.16991	null
2025-07-22	Obscured but Not Erased: Evaluating Nationality Bias in LLMs via Name-Based Bias Benchmarks	Giulio Pelosio et.al.	2507.16989	null
2025-07-22	Leveraging Synthetic Data for Question Answering with Multilingual LLMs in the Agricultural Domain	Rishemjit Kaur et.al.	2507.16974	null
2025-07-22	LLM4MEA: Data-free Model Extraction Attacks on Sequential Recommenders via Large Language Models	Shilong Zhao et.al.	2507.16969	null
2025-07-22	Harnessing RLHF for Robust Unanswerability Recognition and Trustworthy Response Generation in LLMs	Shuyuan Lin et.al.	2507.16951	null
2025-07-22	AI-based Clinical Decision Support for Primary Care: A Real-World Study	Robert Korom et.al.	2507.16947	null
2025-07-22	AURA: A Multi-Modal Medical Agent for Understanding, Reasoning & Annotation	Nima Fathi et.al.	2507.16940	null
2025-07-22	SiLQ: Simple Large Language Model Quantization-Aware Training	Steven K. Esser et.al.	2507.16933	null
2025-07-22	Stellar Mass-Dispersion Measure Correlations Constrain Baryonic Feedback in Fast Radio Burst Host Galaxies	Calvin Leung et.al.	2507.16816	null
2025-07-22	LingBench++: A Linguistically-Informed Benchmark and Reasoning Framework for Multi-Step and Cross-Cultural Inference with LLMs	Da-Chen Lian et.al.	2507.16809	null
2025-07-22	Rethinking LLM-Based RTL Code Optimization Via Timing Logic Metamorphosis	Zhihao Xu et.al.	2507.16808	null
2025-07-23	Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning	Yanjun Zheng et.al.	2507.16802	null
2025-07-23	Test-Time-Matching: Decouple Personality, Memory, and Linguistic Style in LLM-based Role-Playing Language Agent	Xiaoyu Zhan et.al.	2507.16799	null
2025-07-22	Steering Out-of-Distribution Generalization with Concept Ablation Fine-Tuning	Helena Casademunt et.al.	2507.16795	null
2025-07-22	ChatChecker: A Framework for Dialogue System Testing and Evaluation Through Non-cooperative User Simulation	Roman Mayr et.al.	2507.16792	null
2025-07-22	Enhancing Domain Diversity in Synthetic Data Face Recognition with Dataset Fusion	Anjith George et.al.	2507.16790	null
2025-07-22	Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning	Hongyin Luo et.al.	2507.16784	null
2025-07-22	Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems	Imran Latif et.al.	2507.16781	null
2025-07-22	When LLMs Copy to Think: Uncovering Copy-Guided Attacks in Reasoning LLMs	Yue Li et.al.	2507.16773	null
2025-07-22	WGRAMMAR: Leverage Prior Knowledge to Accelerate Structured Decoding	Ran Wang et.al.	2507.16768	null
2025-07-22	Never Come Up Empty: Adaptive HyDE Retrieval for Improving LLM Developer Support	Fangjian Lei et.al.	2507.16754	null
2025-07-22	CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation	Shuai Chen et.al.	2507.16753	null
2025-07-22	Collaborative Inference and Learning between Edge SLMs and Cloud LLMs: A Survey of Algorithms, Execution, and Open Challenges	Senyao Li et.al.	2507.16731	null
2025-07-22	Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with constraints	Zhenyun Yin et.al.	2507.16727	null
2025-07-22	Enhancing Remote Sensing Vision-Language Models Through MLLM and LLM-Based High-Quality Image-Text Dataset Generation	Yiguo He et.al.	2507.16716	null
2025-07-22	Advancing Risk and Quality Assurance: A RAG Chatbot for Improved Regulatory Compliance	Lars Hillebrand et.al.	2507.16711	null
2025-07-22	Biases in LLM-Generated Musical Taste Profiles for Recommendation	Bruno Sguerra et.al.	2507.16708	null
2025-07-22	FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation	Pingyi Fan et.al.	2507.16696	null
2025-07-22	Generating Search Explanations using Large Language Models	Arif Laksito et.al.	2507.16692	null
2025-07-22	PICACO: Pluralistic In-Context Value Alignment of LLMs via Total Correlation Optimization	Han Jiang et.al.	2507.16679	null
2025-07-22	Custom Algorithm-based Fault Tolerance for Attention Layers in Transformers	Vasileios Titopoulos et.al.	2507.16676	null
2025-07-22	Meta-Learning for Cold-Start Personalization in Prompt-Tuned LLMs	Yushang Zhao et.al.	2507.16672	null
2025-07-22	VulCoCo: A Simple Yet Effective Method for Detecting Vulnerable Code Clones	Tan Bui et.al.	2507.16661	null
2025-07-22	P-CoT: A Pedagogically-motivated Participatory Chain-of-Thought Prompting for Phonological Reasoning in LLMs	Dongjun Jang et.al.	2507.16656	null
2025-07-22	Towards Automated Regulatory Compliance Verification in Financial Auditing with Large Language Models	Armin Berger et.al.	2507.16642	null
2025-07-22	Step-Audio 2 Technical Report	Boyong Wu et.al.	2507.16632	null
2025-07-22	Automatic Fine-grained Segmentation-assisted Report Generation	Frederic Jonske et.al.	2507.16623	null
2025-07-22	On the Effectiveness of LLM-as-a-judge for Code Generation and Summarization	Giuseppe Crupi et.al.	2507.16587	null
2025-07-22	LLMxCPG: Context-Aware Vulnerability Detection Through Code Property Graph-Guided Large Language Models	Ahmed Lekssays et.al.	2507.16585	null
2025-07-22	From Text to Actionable Intelligence: Automating STIX Entity and Relationship Extraction	Ahmed Lekssays et.al.	2507.16576	null
2025-07-22	Pixels to Principles: Probing Intuitive Physics Understanding in Multimodal Language Models	Mohamad Ballout et.al.	2507.16572	null
2025-07-22	TTMBA: Towards Text To Multiple Sources Binaural Audio Generation	Yuxuan He et.al.	2507.16564	null
2025-07-22	Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language	Kristin Gnadt et.al.	2507.16557	null
2025-07-22	Alternative Loss Function in Evaluation of Transformer Models	Jakub Michańków et.al.	2507.16548	null
2025-07-22	Learning Text Styles: A Study on Transfer, Attribution, and Verification	Zhiqiang Hu et.al.	2507.16530	null
2025-07-22	Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models	Xiaoyan Wang et.al.	2507.16524	null
2025-07-22	C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning	Xiuwei Chen et.al.	2507.16518	null
2025-07-22	The Ever-Evolving Science Exam	Junying Wang et.al.	2507.16514	null
2025-07-22	Agentic RAG with Knowledge Graphs for Complex Multi-Hop Reasoning in Real-World Applications	Jean Lelong et.al.	2507.16507	null
2025-07-22	ICR Probe: Tracking Hidden State Dynamics for Reliable Hallucination Detection in LLMs	Zhenliang Zhang et.al.	2507.16488	null
2025-07-22	ACT: Bridging the Gap in Code Translation through Synthetic Data Generation & Adaptive Training	Shreya Saxena et.al.	2507.16478	null
2025-07-22	Learning Temporal Abstractions via Variational Homomorphisms in Option-Induced Abstract MDPs	Chang Li et.al.	2507.16473	null
2025-07-22	Towards Enforcing Company Policy Adherence in Agentic Workflows	Naama Zwerdling et.al.	2507.16459	null
2025-07-22	An approach to measuring the performance of Automatic Speech Recognition (ASR) models in the context of Large Language Model (LLM) powered applications	Sujith Pulikodan et.al.	2507.16456	null
2025-07-22	VGGT-Long: Chunk it, Loop it, Align it – Pushing VGGT’s Limits on Kilometer-scale Long RGB Sequences	Kai Deng et.al.	2507.16443	null
2025-07-22	Exploring Large Language Models for Analyzing and Improving Method Names in Scientific Code	Gunnar Larsen et.al.	2507.16439	null
2025-07-22	Identifying Pre-training Data in LLMs: A Neuron Activation-Based Detection Framework	Hongyi Tang et.al.	2507.16414	null
2025-07-22	GG-BBQ: German Gender Bias Benchmark for Question Answering	Shalaka Satheesh et.al.	2507.16410	null
2025-07-22	Improving Code LLM Robustness to Prompt Perturbations via Layer-Aware Model Editing	Shuhan Liu et.al.	2507.16407	null
2025-07-22	Sparse-View 3D Reconstruction: Recent Advances and Open Challenges	Tanveer Younis et.al.	2507.16406	null
2025-07-22	LLM-Driven Collaborative Model for Untangling Commits via Explicit and Implicit Dependency Reasoning	Bo Hou et.al.	2507.16395	null
2025-07-22	Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?	Lazaro Janier Gonzalez-Sole et.al.	2507.16393	null
2025-07-22	A general model for frictional contacts in colloidal systems	Kay Hofmann et.al.	2507.16388	null
2025-07-22	Application of LLM Guided Reinforcement Learning in Formation Control with Collision Avoidance	Chenhao Yao et.al.	2507.16382	null
2025-07-22	Depth Gives a False Sense of Privacy: LLM Internal States Inversion	Tian Dong et.al.	2507.16372	null
2025-07-22	One Polyp Identifies All: One-Shot Polyp Segmentation with SAM via Cascaded Priors and Iterative Prompt Evolution	Xinyu Mao et.al.	2507.16337	null
2025-07-22	Re:Form – Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny	Chuanhao Yan et.al.	2507.16331	null
2025-07-22	DREAM: Scalable Red Teaming for Text-to-Image Generative Systems via Distribution Modeling	Boheng Li et.al.	2507.16329	null
2025-07-22	M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision	Kailai Zhou et.al.	2507.16318	null
2025-07-22	Perovskite-R1: A Domain-Specialized LLM for Intelligent Discovery of Precursor Additives and Experimental Design	Xin-De Wang et.al.	2507.16307	null
2025-07-22	Talking Like a Phisher: LLM-Based Attacks on Voice Phishing Classifiers	Wenhao Li et.al.	2507.16291	null
2025-07-22	Dens3R: A Foundation Model for 3D Geometry Prediction	Xianze Fang et.al.	2507.16290	null
2025-07-22	Time to Split: Exploring Data Splitting Strategies for Offline Evaluation of Sequential Recommenders	Danil Gusak et.al.	2507.16289	null
2025-07-22	Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition	Zefeng Qian et.al.	2507.16287	null
2025-07-22	Reducing GPU Memory Fragmentation via Spatio-Temporal Planning for Efficient Large-Scale Model Training	Zixiao Huang et.al.	2507.16274	null
2025-07-22	Beyond Isolated Dots: Benchmarking Structured Table Construction as Deep Knowledge Extraction	Tianyun Zhong et.al.	2507.16271	null
2025-07-22	iShumei-Chinchunmei at SemEval-2025 Task 4: A balanced forgetting and retention multi-task framework using effective unlearning loss	Yujian Sun et.al.	2507.16263	null
2025-07-22	Edge-case Synthesis for Fisheye Object Detection: A Data-centric Perspective	Seunghyeon Kim et.al.	2507.16254	null
2025-07-22	Efficient RL for optimizing conversation level outcomes with an LLM-based tutor	Hyunji Nam et.al.	2507.16252	null
2025-07-22	eX-NIDS: A Framework for Explainable Network Intrusion Detection Leveraging Large Language Models	Paul R. B. Houssel et.al.	2507.16241	null
2025-07-22	Scale Your Instructions: Enhance the Instruction-Following Fidelity of Unified Image Generation Model by Self-Adaptive Attention Scaling	Chao Zhou et.al.	2507.16240	null
2025-07-22	LLM-Enhanced Reranking for Complementary Product Recommendation	Zekun Xu et.al.	2507.16237	null
2025-07-22	Voice-based AI Agents: Filling the Economic Gaps in Digital Health Delivery	Bo Wen et.al.	2507.16229	null
2025-07-22	Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design	Dong Ben et.al.	2507.16226	null
2025-07-22	Towards Compute-Optimal Many-Shot In-Context Learning	Shahriar Golchin et.al.	2507.16217	null
2025-07-22	Advancing Visual Large Language Model for Multi-granular Versatile Perception	Wentao Xiang et.al.	2507.16213	null
2025-07-22	LOCOFY Large Design Models – Design to code conversion solution	Sohaib Muhammad et.al.	2507.16208	null
2025-07-22	A Human-Centered Approach to Identifying Promises, Risks, & Challenges of Text-to-Image Generative AI in Radiology	Katelyn Morrison et.al.	2507.16207	null
2025-07-22	RealBench: Benchmarking Verilog Generation Models with Real-World IP Designs	Pengwei Jin et.al.	2507.16200	null
2025-07-22	WakenLLM: A Fine-Grained Benchmark for Evaluating LLM Reasoning Potential and Reasoning Process Stability	Zipeng Ling et.al.	2507.16199	null
2025-07-22	Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task	Jared Moore et.al.	2507.16196	null
2025-07-22	Emergent Cognitive Convergence via Implementation: A Structured Loop Reflecting Four Theories of Mind (A Position Paper)	Myung Ho Kim et.al.	2507.16184	null
2025-07-22	LLM Data Selection and Utilization via Dynamic Bi-level Optimization	Yang Yu et.al.	2507.16178	null
2025-07-22	SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting	Shuhao Mei et.al.	2507.16145	null
2025-07-22	Disability Across Cultures: A Human-Centered Audit of Ableism in Western and Indic LLMs	Mahika Phutane et.al.	2507.16130	null
2025-07-22	Benchmarking LLM Privacy Recognition for Social Robot Decision Making	Dakota Sullivan et.al.	2507.16124	null
2025-07-22	PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation	Yaofang Liu et.al.	2507.16116	null
2025-07-21	Expert-Guided LLM Reasoning for Battery Discovery: From AI-Driven Hypothesis to Synthesis and Characterization	Shengchao Liu et.al.	2507.16110	null
2025-07-21	Efficient Compositional Multi-tasking for On-device Large Language Models	Ondrej Bohdal et.al.	2507.16083	null
2025-07-21	The Prompt Makes the Person(a): A Systematic Evaluation of Sociodemographic Persona Prompting for Large Language Models	Marlene Lutz et.al.	2507.16076	null
2025-07-21	Deep Researcher with Test-Time Diffusion	Rujun Han et.al.	2507.16075	null
2025-07-21	Compositional Coordination for Multi-Robot Teams with Large Language Models	Zhehui Huang et.al.	2507.16068	null
2025-07-21	AI-Powered Commit Explorer (APCE)	Yousab Grees et.al.	2507.16063	null
2025-07-21	AutoMeet: a proof-of-concept study of genAI to automate meetings in automotive engineering	Simon Baeuerle et.al.	2507.16054	null
2025-07-21	Making REST APIs Agent-Ready: From OpenAPI to Model Context Protocol Servers for Tool-Augmented LLMs	Meriem Mastouri et.al.	2507.16044	null
2025-07-21	A Pilot Study on LLM-Based Agentic Translation from Android to iOS: Pitfalls and Insights	Zhili Zeng et.al.	2507.16037	null
2025-07-21	From Logic to Language: A Trust Index for Problem Solving with LLMs	Tehseen Rug et.al.	2507.16028	null
2025-07-21	AI, Expert or Peer? – Examining the Impact of Perceived Feedback Source on Pre-Service Teachers Feedback Perception and Uptake	Lucas Jasper Jacobsen et.al.	2507.16013	null
2025-07-21	Diffusion Beats Autoregressive in Data-Constrained Settings	Mihir Prabhudesai et.al.	2507.15857	null
2025-07-21	Latent Denoising Makes Good Visual Tokenizers	Jiawei Yang et.al.	2507.15856	null
2025-07-21	Gemini 2.5 Pro Capable of Winning Gold at IMO 2025	Yichen Huang et.al.	2507.15855	null
2025-07-21	The Other Mind: How Language Models Exhibit Human Temporal Cognition	Lingyu Li et.al.	2507.15851	null
2025-07-21	3LM: Bridging Arabic, STEM, and Code through Benchmarking	Basma El Amel Boussaha et.al.	2507.15850	null
2025-07-21	The Impact of Language Mixing on Bilingual LLM Reasoning	Yihao Li et.al.	2507.15849	null
2025-07-21	FASTGEN: Fast and Cost-Effective Synthetic Tabular Data Generation with LLMs	Anh Nguyen et.al.	2507.15839	null
2025-07-21	Just Ask for Music (JAM): Multimodal and Personalized Natural Language Music Recommendation	Alessandro B. Melchiorre et.al.	2507.15826	null
2025-07-21	ACS: An interactive framework for conformal selection	Yu Gui et.al.	2507.15825	null
2025-07-21	Can Your Model Separate Yolks with a Water Bottle? Benchmarking Physical Commonsense Understanding in Video Generation Models	Enes Sanli et.al.	2507.15824	null
2025-07-21	Do AI models help produce verified bug fixes?	Li Huang et.al.	2507.15822	null
2025-07-21	LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra	Seth Karten et.al.	2507.15815	null
2025-07-21	Diffusion models for multivariate subsurface generation and efficient probabilistic inversion	Roberto Miele et.al.	2507.15809	null
2025-07-21	True Multimodal In-Context Learning Needs Attention to the Visual Context	Shuo Chen et.al.	2507.15807	null
2025-07-21	ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction	Danhui Chen et.al.	2507.15803	null
2025-07-21	Regularized Low-Rank Adaptation for Few-Shot Organ Segmentation	Ghassen Baklouti et.al.	2507.15793	null
2025-07-21	Small LLMs Do Not Learn a Generalizable Theory of Mind via Reinforcement Learning	Sneheel Sarangi et.al.	2507.15788	null
2025-07-21	Reservoir Computing as a Language Model	Felix Köster et.al.	2507.15779	null
2025-07-21	Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR	Jiakang Wang et.al.	2507.15778	null
2025-07-21	Left Leaning Models: AI Assumptions on Economic Policy	Maxim Chupilkin et.al.	2507.15771	null
2025-07-21	A Framework for Analyzing Abnormal Emergence in Service Ecosystems Through LLM-based Agent Intention Mining	Yifan Shen et.al.	2507.15770	null
2025-07-21	GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts	Jingyi Zheng et.al.	2507.15761	null
2025-07-21	Understanding Large Language Models’ Ability on Interdisciplinary Research	Yuanhao Shen et.al.	2507.15736	null
2025-07-21	Gaze-supported Large Language Model Framework for Bi-directional Human-Robot Interaction	Jens V. Rüppel et.al.	2507.15729	null
2025-07-21	TokensGen: Harnessing Condensed Tokens for Long Video Generation	Wenqi Ouyang et.al.	2507.15728	null
2025-07-21	A Practical Investigation of Spatially-Controlled Image Generation with Transformers	Guoxuan Xia et.al.	2507.15724	null
2025-07-21	BEnchmarking LLMs for Ophthalmology (BELO) for Ophthalmological Knowledge and Reasoning	Sahana Srinivasan et.al.	2507.15717	null
2025-07-21	Chinchunmei at SemEval-2025 Task 11: Boosting the Large Language Model’s Capability of Emotion Perception using Contrastive Learning	Tian Li et.al.	2507.15714	null
2025-07-21	Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?	Seok Hwan Song et.al.	2507.15707	null
2025-07-21	Estimating Rate-Distortion Functions Using the Energy-Based Model	Shitong Wu et.al.	2507.15700	null
2025-07-21	CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models	Congmin Zheng et.al.	2507.15698	null
2025-07-21	Surfacing Variations to Calibrate Perceived Reliability of MLLM-generated Image Descriptions	Meng Chen et.al.	2507.15692	null
2025-07-21	P3: Prompts Promote Prompting	Xinyu Zhang et.al.	2507.15675	null
2025-07-21	BugScope: Learn to Find Bugs Like Human	Jinyao Guo et.al.	2507.15671	null
2025-07-21	VeriRAG: A Retrieval-Augmented Framework for Automated RTL Testability Repair	Haomin Qi et.al.	2507.15664	null
2025-07-21	SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models	Giordano d’Aloisio et.al.	2507.15663	null
2025-07-21	HW-MLVQA: Elucidating Multilingual Handwritten Document Understanding with a Comprehensive VQA Benchmark	Aniket Pal et.al.	2507.15655	null
2025-07-21	Extracting Visual Facts from Intermediate Layers for Mitigating Hallucinations in Multimodal Large Language Models	Haoran Zhou et.al.	2507.15652	null
2025-07-21	Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training	Kailai Yang et.al.	2507.15640	null
2025-07-21	DHEvo: Data-Algorithm Based Heuristic Evolution for Generalizable MILP Solving	Zhihao Zhang et.al.	2507.15615	null
2025-07-21	Multi-Stage Prompt Inference Attacks on Enterprise LLM Systems	Andrii Balashov et.al.	2507.15613	null
2025-07-21	CylinderPlane: Nested Cylinder Representation for 3D-aware Image Generation	Ru Jia et.al.	2507.15606	null
2025-07-21	Applying the Chinese Wall Reverse Engineering Technique to Large Language Model Code Editing	Manatsawin Hanmongkolchai et.al.	2507.15599	null
2025-07-21	Learning to Extract Rational Evidence via Reinforcement Learning for Retrieval-Augmented Generation	Xinping Zhao et.al.	2507.15586	null
2025-07-21	DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding	Xiaoyi Bao et.al.	2507.15569	null
2025-07-21	Evaluating Text Style Transfer: A Nine-Language Benchmark for Text Detoxification	Vitaly Protasov et.al.	2507.15557	null
2025-07-21	Efficient Routing of Inference Requests across LLM Instances in Cloud-Edge Computing	Shibo Yu et.al.	2507.15553	null
2025-07-21	RankMixer: Scaling Up Ranking Models in Industrial Recommenders	Jie Zhu et.al.	2507.15551	null
2025-07-21	PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors	Yimeng Chen et.al.	2507.15550	null
2025-07-21	LLM world models are mental: Output layer evidence of brittle world model use in LLM mechanical reasoning	Cole Robertson et.al.	2507.15521	null
2025-07-21	HAMLET: Hyperadaptive Agent-based Modeling for Live Embodied Theatrics	Sizhou Chen et.al.	2507.15518	null
2025-07-21	Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models	Kaiyan Chang et.al.	2507.15512	null
2025-07-21	ASPERA: A Simulated Environment to Evaluate Planning for Complex Action Execution	Alexandru Coca et.al.	2507.15501	null
2025-07-21	PhishIntentionLLM: Uncovering Phishing Website Intentions through Multi-Agent Retrieval-Augmented Generation	Wenhao Li et.al.	2507.15419	null
2025-07-21	PDEformer-2: A Versatile Foundation Model for Two-Dimensional Partial Differential Equations	Zhanhong Ye et.al.	2507.15409	null
2025-07-21	PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants	Ruofan Liu et.al.	2507.15393	null
2025-07-21	DAViD: Data-efficient and Accurate Vision Models from Synthetic Data	Fatemeh Saleh et.al.	2507.15365	null
2025-07-21	Revisiting the Effect of Grid-Following Converter on Frequency Dynamics – Part I: Center of Inertia	Jiahao Liu et.al.	2507.15358	null
2025-07-21	Metaphor and Large Language Models: When Surface Features Matter More than Deep Understanding	Elisa Sanchez-Bayona et.al.	2507.15357	null
2025-07-21	RAD: Retrieval High-quality Demonstrations to Enhance Decision-making	Lu Guo et.al.	2507.15356	null
2025-07-21	Scaling Decentralized Learning with FLock	Zehua Cheng et.al.	2507.15349	null
2025-07-21	Probing Information Distribution in Transformer Architectures through Entropy Analysis	Amedeo Buonanno et.al.	2507.15347	null
2025-07-21	StackTrans: From Large Language Model to Large Pushdown Automata Model	Kechi Zhang et.al.	2507.15343	null
2025-07-21	Reasoning Models are Test Exploiters: Rethinking Multiple-Choice	Narun Raman et.al.	2507.15337	null
2025-07-21	On the Inevitability of Left-Leaning Political Bias in Aligned Language Models	Thilo Hagendorff et.al.	2507.15328	null
2025-07-21	BenchDepth: Are We on the Right Way to Evaluate Depth Foundation Models?	Zhenyu Li et.al.	2507.15321	null
2025-07-21	Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems	Qian Xiong et.al.	2507.15296	null
2025-07-21	A Novel Self-Evolution Framework for Large Language Models	Haoran Sun et.al.	2507.15281	null
2025-07-21	ChiMed 2.0: Advancing Chinese Medical Dataset in Facilitating Large Language Modeling	Yuanhe Tian et.al.	2507.15275	null
2025-07-21	Conditional Video Generation for High-Efficiency Video Compression	Fangqiu Yi et.al.	2507.15269	null
2025-07-21	IM-Chat: A Multi-agent LLM-based Framework for Knowledge Transfer in Injection Molding Industry	Junhyeong Lee et.al.	2507.15268	null
2025-07-21	VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving	Haichao Liu et.al.	2507.15266	null
2025-07-21	CHORDS: Diffusion Sampling Accelerator with Multi-core Hierarchical ODE Solvers	Jiaqi Han et.al.	2507.15260	null
2025-07-21	MEETI: A Multimodal ECG Dataset from MIMIC-IV-ECG with Signals, Images, Features and Interpretations	Deyun Zhang et.al.	2507.15255	null
2025-07-21	Input Reduction Enhanced LLM-based Program Repair	Boyang Yang et.al.	2507.15251	null
2025-07-21	FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers	Yanbing Zhang et.al.	2507.15249	null
2025-07-21	SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search	Xiaofeng Shi et.al.	2507.15245	null
2025-07-21	Mammo-SAE: Interpreting Breast Cancer Concept Learning with Sparse Autoencoders	Krishna Kanth Nakka et.al.	2507.15227	null
2025-07-21	Solving Formal Math Problems by Decomposition and Iterative Reflection	Yichi Zhou et.al.	2507.15225	null
2025-07-21	SimdBench: Benchmarking Large Language Models for SIMD-Intrinsic Code Generation	Yibo He et.al.	2507.15224	null
2025-07-21	Hierarchical Part-based Generative Model for Realistic 3D Blood Vessel	Siqi Chen et.al.	2507.15223	null
2025-07-21	Improving Joint Embedding Predictive Architecture with Diffusion Noise	Yuping Qiu et.al.	2507.15216	null
2025-07-21	Collaborative Distillation Strategies for Parameter-Efficient Language Model Deployment	Xiandong Meng et.al.	2507.15198	null
2025-07-21	Better Models and Algorithms for Learning Ising Models from Dynamics	Jason Gaitonde et.al.	2507.15173	null
2025-07-20	What Level of Automation is “Good Enough”? A Benchmark of Large Language Models for Meta-Analysis Data Extraction	Lingbo Li et.al.	2507.15152	null
2025-07-20	Enhancing Visual Planning with Auxiliary Tasks and Multi-token Prediction	Ce Zhang et.al.	2507.15130	null
2025-07-20	AnalogFed: Federated Discovery of Analog Circuit Topologies with Generative AI	Qiufeng Li et.al.	2507.15104	null
2025-07-20	Filling the Gap: Is Commonsense Knowledge Generation useful for Natural Language Inference?	Chathuri Jayaweera et.al.	2507.15100	null
2025-07-20	BleedOrigin: Dynamic Bleeding Source Localization in Endoscopic Submucosal Dissection via Dual-Stage Detection and Tracking	Mengya Xu et.al.	2507.15094	null
2025-07-20	A Penalty Goes a Long Way: Measuring Lexical Diversity in Synthetic Texts Under Prompt-Influenced Length Variations	Vijeta Deshpande et.al.	2507.15092	null
2025-07-20	Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR	Peirong Zhang et.al.	2507.15085	null
2025-07-20	Time-RA: Towards Time Series Reasoning for Anomaly with LLM Feedback	Yiyuan Yang et.al.	2507.15066	null
2025-07-20	WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization	Zhengwei Tao et.al.	2507.15061	null
2025-07-20	LibLMFuzz: LLM-Augmented Fuzz Target Generation for Black-box Libraries	Ian Hardgrove et.al.	2507.15058	null
2025-07-20	Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding	Yuanhan Zhang et.al.	2507.15028	null
2025-07-20	Deep Generative Models in Condition and Structural Health Monitoring: Opportunities, Limitations and Future Outlook	Xin Yang et.al.	2507.15026	null
2025-07-20	Survey of GenAI for Automotive Software Development: From Requirements to Executable Code	Nenad Petrovic et.al.	2507.15025	null
2025-07-20	RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback	Qiaoyu Tang et.al.	2507.15024	null
2025-07-20	EduThink4AI: Translating Educational Critical Thinking into Multi-Agent LLM Systems	Xinmeng Hou et.al.	2507.15015	null
2025-07-20	Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression	Roy H. Jennings et.al.	2507.14997	null
2025-07-18	Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning	Shashanka Venkataramanan et.al.	2507.14137	null
2025-07-18	NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining	Maksim Kuprashevich et.al.	2507.14119	null
2025-07-18	CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning	Xiaoya Li et.al.	2507.14111	null
2025-07-18	Automated Interpretation of Non-Destructive Evaluation Contour Maps Using Large Language Models for Bridge Condition Assessment	Viraj Nishesh Darji et.al.	2507.14107	null
2025-07-18	Generative AI-Driven High-Fidelity Human Motion Simulation	Hari Iyer et.al.	2507.14097	null
2025-07-18	Lessons from the TREC Plain Language Adaptation of Biomedical Abstracts (PLABA) track	Brian Ondov et.al.	2507.14096	null
2025-07-18	DPMT: Dual Process Multi-scale Theory of Mind Framework for Real-time Human-AI Collaboration	Xiyun Li et.al.	2507.14088	null
2025-07-18	DENSE: Longitudinal Progress Note Generation with Temporal Modeling of Heterogeneous Clinical Notes Across Hospital Visits	Garapati Keerthana et.al.	2507.14079	null
2025-07-18	Foundation Models as Class-Incremental Learners for Dermatological Image Classification	Mohamed Elkhayat et.al.	2507.14050	null
2025-07-18	Evaluating the Effectiveness of Cost-Efficient Large Language Models in Benchmark Biomedical Tasks	Israt Jahan et.al.	2507.14045	null
2025-07-18	TGIF: Talker Group-Informed Familiarization of Target Speaker Extraction	Tsun-An Hsieh et.al.	2507.14044	null
2025-07-18	Architecting Human-AI Cocreation for Technical Services – Interaction Modes and Contingency Factors	Jochen Wulf et.al.	2507.14034	null
2025-07-18	KROMA: Ontology Matching with Knowledge Retrieval and Large Language Models	Lam Nguyen et.al.	2507.14032	null
2025-07-18	Moodifier: MLLM-Enhanced Emotion-Driven Image Editing	Jiarong Ye et.al.	2507.14024	null
2025-07-18	Efficient Temporal Tokenization for Mobility Prediction with Large Language Models	Haoyu He et.al.	2507.14017	null
2025-07-18	Leveraging Pathology Foundation Models for Panoptic Segmentation of Melanoma in H&E Images	Jiaqi Lv et.al.	2507.13974	null
2025-07-18	DUALRec: A Hybrid Sequential and Language Model Framework for Context-Aware Movie Recommendation	Yitong Li et.al.	2507.13957	null
2025-07-18	Cross-modal Causal Intervention for Alzheimer’s Disease Prediction	Yutao Jin et.al.	2507.13956	null
2025-07-18	Exploiting Primacy Effect To Improve Large Language Models	Bianca Raimondi et.al.	2507.13949	null
2025-07-18	Generalist Forecasting with Frozen Video Models via Latent Diffusion	Jacob C Walker et.al.	2507.13942	null
2025-07-18	Preprint: Did I Just Browse A Website Written by LLMs?	Sichang “Steven” He et.al.	2507.13933	null
2025-07-18	Enhancing LiDAR Point Features with Foundation Model Priors for 3D Object Detection	Yujian Mo et.al.	2507.13899	null
2025-07-18	Using LLMs to identify features of personal and professional skills in an open-response situational judgment test	Cole Walsh et.al.	2507.13881	null
2025-07-18	Large Language Models as Innovators: A Framework to Leverage Latent Space Exploration for Novelty Discovery	Mateusz Bystroński et.al.	2507.13874	null
2025-07-18	SPARQL Query Generation with LLMs: Measuring the Impact of Training Data Memorization and Knowledge Injection	Aleksandr Gashkov et.al.	2507.13859	null
2025-07-18	InTraVisTo: Inside Transformer Visualisation Tool	Nicolò Brunello et.al.	2507.13858	null
2025-07-18	DistFlow: A Fully Distributed RL Framework for Scalable and Efficient LLM Post-Training	Zhixin Wang et.al.	2507.13833	null
2025-07-18	Question-Answer Extraction from Scientific Articles Using Knowledge Graphs and Large Language Models	Hosein Azarbonyad et.al.	2507.13827	null
2025-07-18	RAG-based Architectures for Drug Side Effect Retrieval in LLMs	Shad Nygren et.al.	2507.13822	null
2025-07-18	Team of One: Cracking Complex Video QA with Model Synergy	Jun Xie et.al.	2507.13820	null
2025-07-18	CodeEdu: A Multi-Agent Collaborative Platform for Personalized Coding Education	Jianing Zhao et.al.	2507.13814	null
2025-07-18	SkySense V2: A Unified Foundation Model for Multi-modal Remote Sensing	Yingying Zhang et.al.	2507.13812	null
2025-07-18	On-the-Fly Fine-Tuning of Foundational Neural Network Potentials: A Bayesian Neural Network Approach	Tim Rensmeyer et.al.	2507.13805	null
2025-07-18	MolPIF: A Parameter Interpolation Flow Model for Molecule Generation	Yaowei Jin et.al.	2507.13762	null
2025-07-18	PRIDE – Parameter-Efficient Reduction of Identity Discrimination for Equality in LLMs	Maluna Menke et.al.	2507.13743	null
2025-07-18	Can Synthetic Images Conquer Forgetting? Beyond Unexplored Doubts in Few-Shot Class-Incremental Learning	Junsu Kim et.al.	2507.13739	null
2025-07-18	DailyLLM: Context-Aware Activity Log Generation Using Multi-Modal Sensors and LLMs	Ye Tian et.al.	2507.13737	null
2025-07-18	The Judge Variable: Challenging Judge-Agnostic Legal Judgment Prediction	Guillaume Zambrano et.al.	2507.13732	null
2025-07-18	LLaPipe: LLM-Guided Reinforcement Learning for Automated Data Preparation Pipeline Construction	Jing Chang et.al.	2507.13712	null
2025-07-18	CogniQ-H: A Soft Hierarchical Reinforcement Learning Paradigm for Automated Data Preparation	Jing Chang et.al.	2507.13710	null
2025-07-18	Consistent Explainers or Unreliable Narrators? Understanding LLM-generated Group Recommendations	Cedric Waterschoot et.al.	2507.13705	null
2025-07-18	TopicAttack: An Indirect Prompt Injection Attack via Topic Transition	Yulin Chen et.al.	2507.13686	null
2025-07-18	LoopServe: An Adaptive Dual-phase LLM Inference Acceleration System for Multi-Turn Dialogues	Haoyang Li et.al.	2507.13681	null
2025-07-18	KiC: Keyword-inspired Cascade for Cost-Efficient Text Generation with LLMs	Woo-Chan Kim et.al.	2507.13666	null
2025-07-18	CU-ICU: Customizing Unsupervised Instruction-Finetuned Language Models for ICU Datasets via Text-to-Text Transfer Transformer	Teerapong Panboonyuen et.al.	2507.13655	null
2025-07-18	Towards channel foundation models (CFMs): Motivations, methodologies and opportunities	Jun Jiang et.al.	2507.13637	null
2025-07-18	Large Language Models in Cybersecurity: Applications, Vulnerabilities, and Defense Techniques	Niveen O. Jaffal et.al.	2507.13629	null
2025-07-18	BifrostRAG: Bridging Dual Knowledge Graphs for Multi-Hop Question Answering in Construction Safety	Yuxin Zhang et.al.	2507.13625	null
2025-07-18	Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters	Shanbo Cheng et.al.	2507.13618	null
2025-07-18	Linguistic and Embedding-Based Profiling of Texts generated by Humans and Large Language Models	Sergio E. Zanotto et.al.	2507.13614	null
2025-07-18	CoTasks: Chain-of-Thought based Video Instruction Tuning Tasks	Yanan Wang et.al.	2507.13609	null
2025-07-18	GIFT: Gradient-aware Immunization of diffusion models against malicious Fine-Tuning with safe concepts retention	Amro Abdalla et.al.	2507.13598	null
2025-07-17	A Collaborative Framework Integrating Large Language Model and Chemical Fragment Space: Mutual Inspiration for Lead Design	Hao Tuo et.al.	2507.13580	null
2025-07-17	Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries	Hyunji Nam et.al.	2507.13579	null
2025-07-17	LLM-Based Community Surveys for Operational Decision Making in Interconnected Utility Infrastructures	Adaeze Okeukwu-Ogbonnaya et.al.	2507.13577	null
2025-07-17	Apple Intelligence Foundation Language Models: Tech Report 2025	Hanzhi Zhou et.al.	2507.13575	null
2025-07-17	Temporal Adaptation of Pre-trained Foundation Models for Music Structure Analysis	Yixiao Zhang et.al.	2507.13572	null
2025-07-17	A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models	Kirill Borodin et.al.	2507.13563	null
2025-07-17	Demystifying Feature Requests: Leveraging LLMs to Refine Feature Requests in Open-Source Software	Pragyan K C et.al.	2507.13555	null
2025-07-17	GOFAI meets Generative AI: Development of Expert Systems by means of Large Language Models	Eduardo C. Garrido-Merchán et.al.	2507.13550	null
2025-07-17	A Computational Approach to Modeling Conversational Systems: Analyzing Large-Scale Quasi-Patterned Dialogue Flows	Mohamed Achref Ben Ammar et.al.	2507.13544	null
2025-07-17	Provable Low-Frequency Bias of In-Context Learning of Representations	Yongyi Yang et.al.	2507.13540	null
2025-07-17	Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation	Genki Kusano et.al.	2507.13525	null
2025-07-17	Humans learn to prefer trustworthy AI over human partners	Yaomin Jiang et.al.	2507.13524	null
2025-07-17	GraphTrafficGPT: Enhancing Traffic Management Through Graph-Based AI Agent Coordination	Nabil Abdelaziz Ferhat Taleb et.al.	2507.13511	null
2025-07-17	Fake or Real: The Impostor Hunt in Texts for Space Operations	Agata Kaczmarek et.al.	2507.13508	null
2025-07-17	Revisiting LLM Value Probing Strategies: Are They Robust and Expressive?	Siqi Shen et.al.	2507.13490	null
2025-07-17	Paper Summary Attack: Jailbreaking LLMs through LLM Safety Papers	Liang Lin et.al.	2507.13474	null
2025-07-17	ERR@HRI 2.0 Challenge: Multimodal Detection of Errors and Failures in Human-Robot Conversations	Shiye Cao et.al.	2507.13468	null
2025-07-17	“PhyWorldBench”: A Comprehensive Evaluation of Physical Realism in Text-to-Video Models	Jing Gu et.al.	2507.13428	null
2025-07-17	VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding	Shihao Wang et.al.	2507.13353	null
2025-07-17	Hierarchical Rectified Flow Matching with Mini-Batch Couplings	Yichi Zhang et.al.	2507.13350	null
2025-07-17	Imbalance in Balance: Online Concept Balancing in Generation Models	Yukai Shi et.al.	2507.13345	null
2025-07-17	Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes	Tyler Loakman et.al.	2507.13335	null
2025-07-17	A Survey of Context Engineering for Large Language Models	Lingrui Mei et.al.	2507.13334	null
2025-07-17	The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner	Zhouqi Hua et.al.	2507.13332	null
2025-07-17	GeoReg: Weight-Constrained Few-Shot Regression for Socio-Economic Estimation using LLM	Kyeongjin Ahn et.al.	2507.13323	null
2025-07-17	Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark	Junsu Kim et.al.	2507.13314	null
2025-07-17	The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations	Carlos Arriaga et.al.	2507.13302	null
2025-07-17	AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research	Yilun Zhao et.al.	2507.13300	null
2025-07-17	Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management	Luis Gasco et.al.	2507.13275	null
2025-07-17	Automating Steering for Safe Multimodal Large Language Models	Lyucheng Wu et.al.	2507.13255	null
2025-07-17	RemVerse: Supporting Reminiscence Activities for Older Adults through AI-Assisted Virtual Reality	Ruohao Li et.al.	2507.13247	null
2025-07-17	HATS: Hindi Analogy Test Set for Evaluating Reasoning in Large Language Models	Ashray Gupta et.al.	2507.13238	null
2025-07-17	Enhancing Cross-task Transfer of Large Language Models via Activation Steering	Xinyu Tang et.al.	2507.13236	null
2025-07-17	VITA: Vision-to-Action Flow Matching Policy	Dechen Gao et.al.	2507.13231	null
2025-07-18	MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling	Etienne Le Naour et.al.	2507.13207	null
2025-07-18	Automatically assessing oral narratives of Afrikaans and isiXhosa children	Retief Louw et.al.	2507.13205	null
2025-07-17	Black Box Deployed – Functional Criteria for Artificial Moral Agents in the LLM Era	Matthew E. Brophy et.al.	2507.13175	null
2025-07-17	SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks	Kutub Uddin et.al.	2507.13170	null
2025-07-17	Online Rounding for Set Cover under Subset Arrivals	Jarosław Byrka et.al.	2507.13159	null
2025-07-17	Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities	Hao Sun et.al.	2507.13158	null
2025-07-17	Multi-population GAN Training: Analyzing Co-Evolutionary Algorithms	Walter P. Casas et.al.	2507.13157	null
2025-07-17	SE-VLN: A Self-Evolving Vision-Language Navigation Framework Based on Multimodal Large Language Models	Xiangyu Dong et.al.	2507.13152	null
2025-07-17	DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model	Maulana Bisyir Azhari et.al.	2507.13145	null
2025-07-17	RIDAS: A Multi-Agent Framework for AI-RAN with Representation- and Intention-Driven Agents	Kuiyuan Ding et.al.	2507.13140	null
2025-07-17	Detecting LLM-generated Code with Subtle Modification by Adversarial Training	Xin Yin et.al.	2507.13123	null
2025-07-17	A Computational Framework to Identify Self-Aspects in Text	Jaya Caporusso et.al.	2507.13115	null
2025-07-17	R^2MoE: Redundancy-Removal Mixture of Experts for Lifelong Concept Learning	Xiaohan Guo et.al.	2507.13107	null
2025-07-17	Intelligent Virtual Sonographer (IVS): Enhancing Physician-Robot-Patient Communication	Tianyu Song et.al.	2507.13052	null
2025-07-17	MAD-Spear: A Conformity-Driven Prompt Injection Attack on Multi-Agent Debate Systems	Yu Cui et.al.	2507.13038	null
2025-07-17	Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities	Liuyi Wang et.al.	2507.13019	null
2025-07-17	Teach Old SAEs New Domain Tricks with Boosting	Nikita Koriagin et.al.	2507.12990	null
2025-07-17	A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints	Youssef Tawfilis et.al.	2507.12979	null
2025-07-17	UniSLU: Unified Spoken Language Understanding from Heterogeneous Cross-Task Datasets	Zhichao Sheng et.al.	2507.12951	null
2025-07-17	Insights into a radiology-specialised multimodal large language model with sparse autoencoders	Kenza Bouzid et.al.	2507.12950	null
2025-07-17	Probabilistic Soundness Guarantees in LLM Reasoning Chains	Weiqiu You et.al.	2507.12948	null
2025-07-17	Analysis of Image-and-Text Uncertainty Propagation in Multimodal Large Language Models with Cardiac MR-Based Applications	Yucheng Tang et.al.	2507.12945	null
2025-07-17	Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion	Caixia Dong et.al.	2507.12938	null
2025-07-17	Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models	Yifan Xu et.al.	2507.12916	null
2025-07-17	Agentar-DeepFinance-300K: A Large-Scale Financial Dataset via Systematic Chain-of-Thought Synthesis Optimization	Xiaoke Zhao et.al.	2507.12901	null
2025-07-17	Generalist Bimanual Manipulation via Foundation Video Diffusion Models	Yao Feng et.al.	2507.12898	null
2025-07-17	DiffRhythm+: Controllable and Flexible Full-Length Song Generation with Preference Optimization	Huakang Chen et.al.	2507.12890	null
2025-07-17	VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks	Jian Yao et.al.	2507.12885	null
2025-07-17	Generative Multi-Target Cross-Domain Recommendation	Jinqiu Jin et.al.	2507.12871	null
2025-07-17	Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)	Chongli Qin et.al.	2507.12856	null
2025-07-17	DEMONSTRATE: Zero-shot Language to Robotic Control via Multi-task Demonstration Learning	Rahel Rickenbach et.al.	2507.12855	null
2025-07-17	AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning	Yiming Ren et.al.	2507.12841	null
2025-07-17	Bridging the Gap: Leveraging Retrieval-Augmented Generation to Better Understand Public Concerns about Vaccines	Muhammad Javed et.al.	2507.12840	null
2025-07-17	MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval	Jeong-Woo Park et.al.	2507.12819	null
2025-07-17	Large Language Models’ Internal Perception of Symbolic Music	Andrew Shin et.al.	2507.12808	null
2025-07-17	Semantic-guided Fine-tuning of Foundation Model for Long-tailed Visual Recognition	Yufei Peng et.al.	2507.12807	null
2025-07-17	MCPEval: Automatic MCP-based Deep Evaluation for AI Agent Models	Zhiwei Liu et.al.	2507.12806	null
2025-07-17	DeQA-Doc: Adapting DeQA-Score to Document Image Quality Assessment	Junjie Gao et.al.	2507.12796	null
2025-07-17	Learning Robust Negation Text Representations	Thinh Hung Truong et.al.	2507.12782	null
2025-07-17	A Comprehensive Survey of Electronic Health Record Modeling: From Deep Learning Approaches to Large Language Models	Weijieying Ren et.al.	2507.12774	null
2025-07-17	Local Representative Token Guided Merging for Text-to-Image Generation	Min-Jeong Lee et.al.	2507.12771	null
2025-07-17	Think-Before-Draw: Decomposing Emotion Semantics & Fine-Grained Controllable Expressive Talking Head Generation	Hanlei Shi et.al.	2507.12761	null
2025-07-17	osmAG-LLM: Zero-Shot Open-Vocabulary Object Navigation via Semantic Maps and Large Language Models Reasoning	Fujing Xie et.al.	2507.12753	null
2025-07-17	Multimodal-Guided Dynamic Dataset Pruning for Robust and Efficient Data-Centric Learning	Suorong Yang et.al.	2507.12750	null
2025-07-17	Strategy Adaptation in Large Language Model Werewolf Agents	Fuya Nakamori et.al.	2507.12732	null
2025-07-17	PinFM: Foundation Model for User Activity Sequences at a Billion-scale Visual Discovery Platform	Xiangyi Chen et.al.	2507.12704	null
2025-07-17	Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images	Zahra TehraniNasab et.al.	2507.12698	null
2025-07-16	Improving Drug Identification in Overdose Death Surveillance using Large Language Models	Arthur J. Funnell et.al.	2507.12679	null
2025-07-16	ParaStudent: Generating and Evaluating Realistic Student Code by Teaching LLMs to Struggle	Mihran Miroyan et.al.	2507.12674	null
2025-07-16	The first open machine translation system for the Chechen language	Abu-Viskhan A. Umishov et.al.	2507.12672	null
2025-07-16	Single Conversation Methodology: A Human-Centered Protocol for AI-Assisted Software Development	Salvador D. Escobedo et.al.	2507.12665	null
2025-07-16	VLMgineer: Vision Language Models as Robotic Toolsmiths	George Jiayuan Gao et.al.	2507.12644	null
2025-07-16	NLI4VolVis: Natural Language Interaction for Volume Visualization via LLM Multi-Agents and Editable 3D Gaussian Splatting	Kuangshi Ai et.al.	2507.12621	null
2025-07-16	BootSeer: Analyzing and Mitigating Initialization Bottlenecks in Large-Scale LLM Training	Rui Li et.al.	2507.12619	null
2025-07-16	Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning	Prateek Chanda et.al.	2507.12612	null
2025-07-16	Enhancing In-Domain and Out-Domain EmoFake Detection via Cooperative Multilingual Speech Foundation Models	Orchid Chetia Phukan et.al.	2507.12595	null
2025-07-16	Assay2Mol: large language model-based drug design using BioAssay context	Yifan Deng et.al.	2507.12574	null
2025-07-16	Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models	Gen Luo et.al.	2507.12566	null
2025-07-17	PhysX: Physical-Grounded 3D Asset Generation	Ziang Cao et.al.	2507.12465	null
2025-07-16	CytoSAE: Interpretable Cell Embeddings for Hematology	Muhammed Furkan Dasdelen et.al.	2507.12464	null
2025-07-16	Mitigating Object Hallucinations via Sentence-Level Early Intervention	Shangpin Peng et.al.	2507.12455	null
2025-07-16	Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models	Yik Siu Chan et.al.	2507.12428	null
2025-07-16	Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data	Chandana Cheerla et.al.	2507.12425	null
2025-07-16	SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?	Xinyi He et.al.	2507.12415	null
2025-07-16	Modeling Feasible Locomotion of Nanobots for Cancer Detection and Treatment	Noble Harasha et.al.	2507.12400	null
2025-07-16	Assessing the Value of Visual Input: A Benchmark of Multimodal Large Language Models for Robotic Path Planning	Jacinto Colan et.al.	2507.12391	null
2025-07-16	Web-Browsing LLMs Can Access Social Media Profiles and Infer User Demographics	Meysam Alizadeh et.al.	2507.12372	null
2025-07-16	Beyond Single Models: Enhancing LLM Detection of Ambiguity in Requests through Debate	Ana Davila et.al.	2507.12370	null
2025-07-16	GitChameleon: Evaluating AI Code Generation Against Python Library Version Incompatibilities	Diganta Misra et.al.	2507.12367	null
2025-07-16	Thought Purity: Defense Paradigm For Chain-of-Thought Attack	Zihao Xue et.al.	2507.12314	null
2025-07-16	Chain-of-Descriptions: Improving Code LLMs for VHDL Code Generation and Summarization	Prashanth Vijayaraghavan et.al.	2507.12308	null
2025-07-16	Humans are more gullible than LLMs in believing common psychological myths	Bevan Koopman et.al.	2507.12296	null
2025-07-16	Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding	Feng Xiao et.al.	2507.12295	null
2025-07-16	SHACL Validation in the Presence of Ontologies: Semantics and Rewriting Techniques	Anouk Oudshoorn et.al.	2507.12286	null
2025-07-16	FADE: Adversarial Concept Erasure in Flow Models	Zixuan Fu et.al.	2507.12283	null
2025-07-17	Next-Gen Museum Guides: Autonomous Navigation and Visitor Interaction with an Agentic Robot	Luca Garello et.al.	2507.12273	null
2025-07-16	Improving Contextual ASR via Multi-grained Fusion with Large Language Models	Shilin Zhou et.al.	2507.12252	null
2025-07-16	Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language Models	Felix Nützel et.al.	2507.12236	null
2025-07-16	MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM	Tao Chen et.al.	2507.12232	null
2025-07-16	Xiangqi-R1: Enhancing Spatial Strategic Reasoning in LLMs for Chinese Chess via Reinforcement Learning	Yuhao Chen et.al.	2507.12215	null
2025-07-16	Draw an Ugly Person An Exploration of Generative AIs Perceptions of Ugliness	Garyoung Kim et.al.	2507.12212	null
2025-07-16	BuildEvo: Designing Building Energy Consumption Forecasting Heuristics via LLM-driven Evolution	Subin Lin et.al.	2507.12207	null
2025-07-16	Toward Efficient SpMV in Sparse LLMs via Block Extraction and Compressed Storage	Junqing Lin et.al.	2507.12205	null
2025-07-16	RODS: Robust Optimization Inspired Diffusion Sampling for Detecting and Reducing Hallucination in Generative Models	Yiqi Tian et.al.	2507.12201	null
2025-07-16	Multi-Component VAE with Gaussian Markov Random Field	Fouad Oubari et.al.	2507.12165	null
2025-07-16	PRISM: Distributed Inference for Foundation Models at Edge	Muhammad Azlan Qazi et.al.	2507.12145	null
2025-07-16	Overview of the Sensemaking Task at the ELOQUENT 2025 Lab: LLMs as Teachers, Students and Evaluators	Pavel Šindelář et.al.	2507.12143	null
2025-07-16	RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization	Vladimir Bogachev et.al.	2507.12142	null
2025-07-16	Room Impulse Response Generation Conditioned on Acoustic Parameters	Silvia Arellano et.al.	2507.12136	null
2025-07-16	Iterative Augmentation with Summarization Refinement (IASR) Evaluation for Unstructured Survey data Modeling and Analysis	Payal Bhattad et.al.	2507.12126	null
2025-07-16	Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph	Sergey Linok et.al.	2507.12123	null
2025-07-16	DeepShade: Enable Shade Simulation by Text-conditioned Image Generation	Longchao Da et.al.	2507.12103	null
2025-07-16	LLAMA: Multi-Feedback Smart Contract Fuzzing Framework with LLM-Guided Seed Generation	Keke Gai et.al.	2507.12084	null
2025-07-16	Findings of MEGA: Maths Explanation with LLMs using the Socratic Method for Active Learning	Tosin Adewumi et.al.	2507.12079	null
2025-07-16	Evaluating the Ability of Large Language Models to Reason about Cardinal Directions, Revisited	Anthony G Cohn et.al.	2507.12059	null
2025-07-16	FloGAN: Scenario-Based Urban Mobility Flow Generation via Conditional GANs and Dynamic Region Decoupling	Seanglidet Yean et.al.	2507.12053	null
2025-07-16	A Comparative Approach to Assessing Linguistic Creativity of Large Language Models and Humans	Anca Dinu et.al.	2507.12039	null
2025-07-16	3D-MoRe: Unified Modal-Contextual Reasoning for Embodied Question Answering	Rongtao Xu et.al.	2507.12026	null
2025-07-16	EME-TTS: Unlocking the Emphasis and Emotion Link in Speech Synthesis	Haoxun Li et.al.	2507.12015	null
2025-07-16	DSSD: Efficient Edge-Device Deployment and Collaborative Inference via Distributed Split Speculative Decoding	Jiahong Ning et.al.	2507.12000	null
2025-07-16	Can LLMs Find Fraudsters? Multi-level LLM Enhanced Graph Fraud Detection	Tairan Huang et.al.	2507.11997	null
2025-07-16	Robust Planning for Autonomous Vehicles with Diffusion-Based Failure Samplers	Juanran Wang et.al.	2507.11991	null
2025-07-16	Aime: Towards Fully-Autonomous Multi-Agent Framework	Yexuan Shi et.al.	2507.11988	null
2025-07-16	Simplifications are Absolutists: How Simplified Language Reduces Word Sense Awareness in LLM-Generated Definitions	Lukas Ellinger et.al.	2507.11981	null
2025-07-16	Value-Based Large Language Model Agent Simulation for Mutual Evaluation of Trust and Interpersonal Closeness	Yuki Sakamoto et.al.	2507.11979	null
2025-07-16	Graph Representations for Reading Comprehension Analysis using Large Language Model and Eye-Tracking Biomarker	Yuhong Zhang et.al.	2507.11972	null
2025-07-16	Watch, Listen, Understand, Mislead: Tri-modal Adversarial Attacks on Short Videos for Content Appropriateness Evaluation	Sahid Hossain Mustakim et.al.	2507.11968	null
2025-07-16	Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation	Ziyu Ge et.al.	2507.11966	null
2025-07-16	PoTPTQ: A Two-step Power-of-Two Post-training for LLMs	Xinyu Wang et.al.	2507.11959	null
2025-07-16	The benefits of query-based KGQA systems for complex and temporal questions in LLM era	Artem Alekseev et.al.	2507.11954	null
2025-07-16	BlockBPE: Parallel BPE Tokenization	Amos You et.al.	2507.11941	null
2025-07-16	A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning	Hao Chen et.al.	2507.11938	null
2025-07-16	A Survey of Deep Learning for Geometry Problem Solving	Jianzhe Ma et.al.	2507.11936	null
2025-07-16	Hyperphantasia: A Benchmark for Evaluating the Mental Visualization Capabilities of Multimodal LLMs	Mohammad Shahab Sepehri et.al.	2507.11932	null
2025-07-16	From Generative to Episodic: Sample-Efficient Replicable Reinforcement Learning	Max Hopkins et.al.	2507.11926	null
2025-07-16	Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models	Bo Zeng et.al.	2507.11882	null
2025-07-16	DualReward: A Dynamic Reinforcement Learning Framework for Cloze Tests Distractor Generation	Tianyou Huang et.al.	2507.11875	null
2025-07-16	CosmoFlow: Scale-Aware Representation Learning for Cosmology with Flow Matching	Sidharth Kannan et.al.	2507.11842	null
2025-07-16	The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist	Haoxuan Zhang et.al.	2507.11810	null
2025-07-16	Tracing Facts or just Copies? A critical investigation of the Competitions of Mechanisms in Large Language Models	Dante Campregher et.al.	2507.11809	null
2025-07-15	Enforcing Latent Euclidean Geometry in Single-Cell VAEs for Manifold Interpolation	Alessandro Palma et.al.	2507.11789	null
2025-07-15	Foundation Models for Brain Signals: A Critical Review of Current Progress and Future Directions	Gayal Kuruppu et.al.	2507.11783	null
2025-07-15	Large-scale distributed synchronization systems, using a cancel-on-completion redundancy mechanism	Alexander Stolyar et.al.	2507.11779	null
2025-07-15	Scaling laws for activation steering with Llama 2 models and refusal mechanisms	Sheikh Abdur Raheem Ali et.al.	2507.11771	null
2025-07-15	LLMs are Bayesian, in Expectation, not in Realization	Leon Chlon et.al.	2507.11768	null
2025-07-15	Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual Reasoning	Fan Shi et.al.	2507.11761	null
2025-07-15	CRABS: A syntactic-semantic pincer strategy for bounding LLM interpretation of Python notebooks	Meng Li et.al.	2507.11742	null
2025-07-15	Auto-Formulating Dynamic Programming Problems with Large Language Models	Chenyu Zhou et.al.	2507.11737	null
2025-07-15	Subgraph Generation for Generalizing on Out-of-Distribution Links	Jay Revolinsky et.al.	2507.11710	null
2025-07-15	MetaLint: Generalizable Idiomatic Code Quality Analysis through Instruction-Following and Easy-to-Hard Generalization	Atharva Naik et.al.	2507.11687	null
2025-07-15	Let’s Think in Two Steps: Mitigating Agreement Bias in MLLMs with Self-Grounded Verification	Moises Andrade et.al.	2507.11662	null
2025-07-15	Deep Generative Methods and Tire Architecture Design	Fouad Oubari et.al.	2507.11639	null
2025-07-15	Interpretable Prediction of Lymph Node Metastasis in Rectal Cancer MRI Using Variational Autoencoders	Benjamin Keel et.al.	2507.11638	null
2025-07-15	MapIQ: Benchmarking Multimodal Large Language Models for Map Question Answering	Varun Srivastava et.al.	2507.11625	null
2025-07-15	k-Contextuality as a Heuristic for Memory Separations in Learning	Mariesa H. Teo et.al.	2507.11604	null
2025-07-15	SToFM: a Multi-scale Foundation Model for Spatial Transcriptomics	Suyuan Zhao et.al.	2507.11588	null
2025-07-15	Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Zhen Xu et.al.	2507.11540	null
2025-07-15	Streaming 4D Visual Geometry Transformer	Dong Zhuo et.al.	2507.11539	null
2025-07-15	DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering	Yinsheng Li et.al.	2507.11527	null
2025-07-15	LLM-based ambiguity detection in natural language instructions for collaborative surgical robots	Ana Davila et.al.	2507.11525	null
2025-07-15	AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air	Shiyi Yang et.al.	2507.11515	null
2025-07-15	HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing	Pan Du et.al.	2507.11474	null
2025-07-15	LRMR: LLM-Driven Relational Multi-node Ranking for Lymph Node Metastasis Assessment in Rectal Cancer	Yaoxian Dong et.al.	2507.11457	null
2025-07-15	Implementing Adaptations for Vision AutoRegressive Model	Kaif Shaikh et.al.	2507.11441	null
2025-07-15	Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models	Paul A. Bereuter et.al.	2507.11427	null
2025-07-16	Reasoning Strategies in Large Language Models: Can They Follow, Prefer, and Optimize?	Yanjian Zhang et.al.	2507.11423	null
2025-07-15	Quantifying the Energy Consumption and Carbon Emissions of LLM Inference via Simulations	Miray Özcan et.al.	2507.11417	null
2025-07-15	Seq vs Seq: An Open Suite of Paired Encoders and Decoders	Orion Weller et.al.	2507.11412	null
2025-07-15	KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?	Soumadeep Saha et.al.	2507.11408	null
2025-07-15	EXAONE 4.0: Unified Large Language Models Integrating Non-reasoning and Reasoning Modes	LG AI Research et.al.	2507.11407	null
2025-07-15	DCR: Quantifying Data Contamination in LLMs Evaluation	Cheng Xu et.al.	2507.11405	null
2025-07-15	Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs	Gabriel Bo et.al.	2507.11371	null
2025-07-15	From Chaos to Automation: Enabling the Use of Unstructured Data for Robotic Process Automation	Kelly Kurowski et.al.	2507.11364	null
2025-07-15	What is the Best Process Model Representation? A Comparative Analysis for Process Modeling with Large Language Models	Alexis Brissard et.al.	2507.11356	null
2025-07-15	Foundation Models for Logistics: Toward Certifiable, Conversational Planning Interfaces	Yunhao Yang et.al.	2507.11352	null
2025-07-15	RefModel: Detecting Refactorings using Foundation Models	Pedro Simões et.al.	2507.11346	null
2025-07-15	Guiding LLM Decision-Making with Fairness Reward Models	Zara Hall et.al.	2507.11344	null
2025-07-15	MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network	Jianfei Jiang et.al.	2507.11333	null
2025-07-15	Automated Novelty Evaluation of Academic Paper: A Collaborative Approach Integrating Human and Large Language Model Knowledge	Wenqing Wu et.al.	2507.11330	null
2025-07-15	Internal Value Alignment in Large Language Models through Controlled Value Vector Activation	Haoran Jin et.al.	2507.11316	null
2025-07-15	LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification	Fengxiao Tang et.al.	2507.11310	null
2025-07-15	Dr.Copilot: A Multi-Agent Prompt Optimized Assistant for Improving Patient-Doctor Communication in Romanian	Andrei Niculae et.al.	2507.11299	null
2025-07-15	Opus: A Prompt Intention Framework for Complex Workflow Generation	Théo Fagnoni et.al.	2507.11288	null
2025-07-15	Taming Uncertainty via Automation: Observing, Analyzing, and Optimizing Agentic AI Systems	Dany Moshkovich et.al.	2507.11277	null
2025-07-15	FMC: Formalization of Natural Language Mathematical Competition Problems	Jiaxuan Xie et.al.	2507.11275	null
2025-07-15	KV-Latent: Dimensional-level KV Cache Reduction with Frequency-aware Rotary Positional Embedding	Luohe Shi et.al.	2507.11273	null
2025-07-15	An Empirical Study of Multi-Agent RAG for Real-World University Admissions Counseling	Anh Nguyen-Duc et.al.	2507.11272	null
2025-07-15	MFGDiffusion: Mask-Guided Smoke Synthesis for Enhanced Forest Fire Detection	Guanghao Wu et.al.	2507.11252	null
2025-07-15	Generative Click-through Rate Prediction with Applications to Search Advertising	Lingwei Kong et.al.	2507.11246	null
2025-07-15	NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models	X. Feng et.al.	2507.11245	null
2025-07-15	Sparse Autoencoders Can Capture Language-Specific Concepts Across Diverse Languages	Lyzander Marciano Andrylie et.al.	2507.11230	null
2025-07-15	An Agentic Flow for Finite State Machine Extraction using Prompt Chaining	Fares Wael et.al.	2507.11222	null
2025-07-15	EsBBQ and CaBBQ: The Spanish and Catalan Bias Benchmarks for Question Answering	Valle Ruiz-Fernández et.al.	2507.11216	null
2025-07-15	Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias	Rushia Harada et.al.	2507.11210	null
2025-07-15	Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding	Conrad Borchers et.al.	2507.11198	null
2025-07-15	Mixture of Experts in Large Language Models	Danyang Zhang et.al.	2507.11181	null
2025-07-15	Latent Space Consistency for Sparse-View CT Reconstruction	Duoyou Chen et.al.	2507.11152	null
2025-07-15	What Should LLMs Forget? Quantifying Personal Data in LLMs for Right-to-Be-Forgotten Requests	Dimitri Staufer et.al.	2507.11128	null
2025-07-15	MSA at ImageCLEF 2025 Multimodal Reasoning: Multilingual Multimodal Reasoning With Ensemble Vision Language Models	Seif Ahmed et.al.	2507.11114	null
2025-07-15	Multi-Trigger Poisoning Amplifies Backdoor Vulnerabilities in LLMs	Sanhanat Sivapiromrat et.al.	2507.11112	null
2025-07-15	KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model	Jie Yang et.al.	2507.11102	null
2025-07-15	The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs	Zichen Wen et.al.	2507.11097	null
2025-07-15	EditGen: Harnessing Cross-Attention Control for Instruction-Based Auto-Regressive Audio Editing	Vassilis Sioros et.al.	2507.11096	null
2025-07-15	Beyond Traditional Algorithms: Leveraging LLMs for Accurate Cross-Border Entity Identification	Andres Azqueta-Gavaldón et.al.	2507.11086	null
2025-07-15	Function-to-Style Guidance of LLMs for Code Translation	Longhui Zhang et.al.	2507.11083	null
2025-07-15	Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander	Li Wang et.al.	2507.11079	null
2025-07-15	LogTinyLLM: Tiny Large Language Models Based Contextual Log Anomaly Detection	Isaiah Thompson Ocansey et.al.	2507.11071	null
2025-07-15	SWE-MERA: A Dynamic Benchmark for Agenticly Evaluating Large Language Models on Software Engineering Tasks	Pavel Adamenko et.al.	2507.11059	null
2025-07-15	LLM-Augmented Symptom Analysis for Cardiovascular Disease Risk Prediction: A Clinical NLP	Haowei Yang et.al.	2507.11052	null
2025-07-15	Aligned Query Expansion: Efficient Query Expansion for Information Retrieval through LLM Alignment	Adam Yang et.al.	2507.11042	null
2025-07-15	Functional Emotion Modeling in Biomimetic Reinforcement Learning	Louis Wang et.al.	2507.11027	null
2025-07-15	Incentivizing Knowledge Transfers	Zhonghong Kuang et.al.	2507.11018	null
2025-07-15	First-Order Error Matters: Accurate Compensation for Quantized Large Language Models	Xingyu Zheng et.al.	2507.11017	null
2025-07-15	SIMCODE: A Benchmark for Natural Language to ns-3 Network Simulation Code Generation	Tasnim Ahmed et.al.	2507.11014	null
2025-07-15	Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation	Yanbo Wang et.al.	2507.11001	null
2025-07-15	Teach Me Sign: Stepwise Prompting LLM for Sign Language Production	Zhaoyi An et.al.	2507.10972	null
2025-07-15	DS@GT at eRisk 2025: From prompts to predictions, benchmarking early depression detection with conversational agent based assessments and temporal attention models	Anthony Miyaguchi et.al.	2507.10958	null
2025-07-15	Modeling Understanding of Story-Based Analogies Using Large Language Models	Kalit Inani et.al.	2507.10957	null
2025-07-15	Towards Practical Benchmarking of Data Cleaning Techniques: On Generating Authentic Errors via Large Language Models	Xinyuan Liu et.al.	2507.10934	null
2025-07-15	Artificial Finance: How AI Thinks About Money	Orhan Erdem et.al.	2507.10933	null
2025-07-15	Enhancing Safe and Controllable Protein Generation via Knowledge Preference Optimization	Yuhao Wang et.al.	2507.10923	null
2025-07-15	HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training	Seungho Choi et.al.	2507.10920	null
2025-07-15	LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation	Ziyan Wang et.al.	2507.10917	null
2025-07-15	Lessons Learned from Evaluation of LLM based Multi-agents in Safer Therapy Recommendation	Yicong Wu et.al.	2507.10911	null
2025-07-15	Evaluating Generated Commit Messages with Large Language Models	Qunhong Zeng et.al.	2507.10906	null
2025-07-15	LiLM-RDB-SFC: Lightweight Language Model with Relational Database-Guided DRL for Optimized SFC Provisioning	Parisa Fard Moshiri et.al.	2507.10903	null
2025-07-15	Object-Centric Mobile Manipulation through SAM2-Guided Perception and Imitation Learning	Wang Zhicheng et.al.	2507.10899	null
2025-07-15	LLMATCH: A Unified Schema Matching Framework with Large Language Models	Sha Wang et.al.	2507.10897	null
2025-07-15	Learning from Imperfect Data: Robust Inference of Dynamic Systems using Simulation-based Generative Model	Hyunwoo Cho et.al.	2507.10884	null
2025-07-15	From Alerts to Intelligence: A Novel LLM-Aided Framework for Host-based Intrusion Detection	Danyu Sun et.al.	2507.10873	null
2025-07-14	WhisperKit: On-device Real-time ASR with Billion-Scale Transformers	Atila Orhon et.al.	2507.10860	null
2025-07-14	MultiVox: Benchmarking Voice Assistants for Multimodal Interactions	Ramaneswaran Selvakumar et.al.	2507.10859	null
2025-07-14	LLMs on Trial: Evaluating Judicial Fairness for Large Language Models	Yiran Hu et.al.	2507.10852	null
2025-07-14	LLM-Guided Agentic Object Detection for Open-World Understanding	Furkan Mumcu et.al.	2507.10844	null
2025-07-14	REAL-IoT: Characterizing GNN Intrusion Detection Robustness under Practical Adversarial Attack	Zhonghao Zhan et.al.	2507.10836	null
2025-07-14	Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition	Mengzhe Geng et.al.	2507.10827	null
2025-07-14	Semantic Context for Tool Orchestration	Robert Müller et.al.	2507.10820	null
2025-07-14	How Robust are LLM-Generated Library Imports? An Empirical Study using Stack Overflow	Jasmine Latendresse et.al.	2507.10818	null
2025-07-14	Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection	Huiyi Wang et.al.	2507.10814	null
2025-07-14	Automated Thematic Analyses Using LLMs: Xylazine Wound Management Social Media Chatter Use Case	JaMor Hairston et.al.	2507.10803	null
2025-07-14	Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers	Yilun Zhao et.al.	2507.10787	null
2025-07-14	Warehouse Spatial Question Answering with LLM Agent	Hsiang-Wei Huang et.al.	2507.10778	null
2025-07-14	rt-RISeg: Real-Time Model-Free Robot Interactive Segmentation for Active Instance-Level Object Understanding	Howard H. Qian et.al.	2507.10776	null
2025-07-14	Spatial Reasoners for Continuous Variables in Any Domain	Bart Pogodzinski et.al.	2507.10768	null
2025-07-14	Integrating Biological Knowledge for Robust Microscopy Image Profiling on De Novo Cell Lines	Jiayuan Chen et.al.	2507.10737	null
2025-07-14	Bridging Brains and Machines: A Unified Frontier in Neuroscience, Artificial Intelligence, and Neuromorphic Systems	Sohan Shankar et.al.	2507.10722	null
2025-07-14	Exploring User Security and Privacy Attitudes and Concerns Toward the Use of General-Purpose LLM Chatbots for Mental Health	Jabari Kwesi et.al.	2507.10695	null
2025-07-14	Machine-learning inference of stellar properties using integrated photometric and spectroscopic data	Ilay Kamai et.al.	2507.10666	null
2025-07-14	Emulating Dark Matter Halo Merger Trees with Graph Generative Models	Tri Nguyen et.al.	2507.10652	null
2025-07-14	MP1: Mean Flow Tames Policy Learning in 1-step for Robotic Manipulation	Juyi Sheng et.al.	2507.10543	null
2025-07-14	Fusing LLM Capabilities with Routing Data	Tao Feng et.al.	2507.10540	null
2025-07-14	Graph World Model	Tao Feng et.al.	2507.10539	null
2025-07-14	CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks	Hongchao Jiang et.al.	2507.10535	null
2025-07-14	Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination	Mingqi Wu et.al.	2507.10532	null
2025-07-14	Accurate generation of chemical reaction transition states by conditional flow matching	Ping Tuo et.al.	2507.10530	null
2025-07-14	Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI	Jiangkai Wu et.al.	2507.10510	null
2025-07-14	Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance	Kyungtae Han et.al.	2507.10500	null
2025-07-14	Can You Detect the Difference?	İsmail Tarım et.al.	2507.10475	null
2025-07-14	MLAR: Multi-layer Large Language Model-based Robotic Process Automation Applicant Tracking	Mohamed T. Younes et.al.	2507.10472	null
2025-07-14	An Empirical Evaluation of AI-Powered Non-Player Characters’ Perceived Realism and Performance in Virtual Reality Environments	Mikko Korkiakoski et.al.	2507.10469	null
2025-07-14	Logic layer Prompt Control Injection (LPCI): A Novel Security Vulnerability Class in Agentic Systems	Hammad Atta et.al.	2507.10457	null
2025-07-14	Text-Visual Semantic Constrained AI-Generated Image Quality Assessment	Qiang Li et.al.	2507.10432	null
2025-07-14	Towards Emotion Co-regulation with LLM-powered Socially Assistive Robots: Integrating LLM Prompts and Robotic Behaviors to Support Parent-Neurodivergent Child Dyads	Jing Li et.al.	2507.10427	null
2025-07-14	Zorse: Optimizing LLM Training Efficiency on Heterogeneous GPU Clusters	Runsheng Benson Guo et.al.	2507.10392	null
2025-07-14	Test-Time Canonicalization by Foundation Models for Robust Perception	Utkarsh Singhal et.al.	2507.10375	null
2025-07-14	Using AI to replicate human experimental results: a motion study	Rosa Illan Castillo et.al.	2507.10342	null
2025-07-14	Grammar-Guided Evolutionary Search for Discrete Prompt Optimisation	Muzhaffar Hazman et.al.	2507.10326	null
2025-07-14	Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching	Yuhan Liu et.al.	2507.10318	null
2025-07-14	Recognizing Dementia from Neuropsychological Tests with State Space Models	Liming Wang et.al.	2507.10311	null
2025-07-14	DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs	Jiahe Zhao et.al.	2507.10302	null
2025-07-14	FaceLLM: A Multimodal Large Language Model for Face Understanding	Hatef Otroshi Shahreza et.al.	2507.10300	null
2025-07-14	Prompt Informed Reinforcement Learning for Visual Coverage Path Planning	Venkat Margapuri et.al.	2507.10284	null
2025-07-14	Cross-Timeslot Optimization for Distributed GPU Inference Using Reinforcement Learning	Chengze Du et.al.	2507.10259	null
2025-07-14	Synthesizing Near-Boundary OOD Samples for Out-of-Distribution Detection	Jinglun Li et.al.	2507.10225	null
2025-07-14	Absher: A Benchmark for Evaluating Large Language Models Understanding of Saudi Dialects	Renad Al-Monef et.al.	2507.10216	null
2025-07-14	A Training-Free, Task-Agnostic Framework for Enhancing MLLM Performance on High-Resolution Images	Jaeseong Lee et.al.	2507.10202	null
2025-07-14	History Matching under Uncertainty of Geological Scenarios with Implicit Geological Realism Control with Generative Deep Learning and Graph Convolutions	Gleb Shishaev et.al.	2507.10201	null
2025-07-14	Natural Language-based Assessment of L2 Oral Proficiency using LLMs	Stefano Bannò et.al.	2507.10200	null
2025-07-14	Breaking the Myth: Can Small Models Infer Postconditions Too?	Gehao Zhang et.al.	2507.10182	null
2025-07-14	Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving	Wonung Kim et.al.	2507.10178	null
2025-07-14	Abusive text transformation using LLMs	Rohitash Chandra et.al.	2507.10177	null
2025-07-14	Task-Based Flexible Feature Distillation for LLMs	Khouloud Saadi et.al.	2507.10155	null
2025-07-14	Past-Future Scheduler for LLM Serving under SLA Guarantees	Ruihao Gong et.al.	2507.10150	null
2025-07-14	DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation	Ivan Martinović et.al.	2507.10118	null
2025-07-14	Accelerating Automatic Program Repair with Dual Retrieval-Augmented Fine-Tuning and Patch Generation on Large Language Models	Hanyang Guo et.al.	2507.10103	null
2025-07-14	Fusing Large Language Models with Temporal Transformers for Time Series Forecasting	Chen Su et.al.	2507.10098	null
2025-07-14	Towards High Supervised Learning Utility Training Data Generation: Data Pruning and Column Reordering	Tung Sum Thomas Kwok et.al.	2507.10088	null
2025-07-14	Foundation Model Driven Robotics: A Comprehensive Review	Muhammad Tayyab Khan et.al.	2507.10087	null
2025-07-14	Cultural Bias in Large Language Models: Evaluating AI Agents through Moral Questionnaires	Simon Münker et.al.	2507.10073	null
2025-07-14	ElasticMM: Efficient Multimodal LLMs Serving with Elastic Multimodal Parallelism	Zedong Liu et.al.	2507.10069	null
2025-07-14	LLMShot: Reducing snapshot testing maintenance via LLMs	Ergün Batuhan Kaynak et.al.	2507.10062	null
2025-07-14	GeLaCo: An Evolutionary Approach to Layer Compression	David Ponce et.al.	2507.10059	null
2025-07-14	Explicit Vulnerability Generation with LLMs: An Investigation Beyond Adversarial Attacks	Emir Bosnak et.al.	2507.10054	null
2025-07-14	Automating SPARQL Query Translations between DBpedia and Wikidata	Malte Christian Bartels et.al.	2507.10045	null
2025-07-14	Towards Applying Large Language Models to Complement Single-Cell Foundation Models	Steven Palayew et.al.	2507.10039	null
2025-07-14	EAT: QoS-Aware Edge-Collaborative AIGC Task Scheduling via Attention-Guided Diffusion Reinforcement Learning	Zhifei Xu et.al.	2507.10026	null
2025-07-14	Qualitative Study for LLM-assisted Design Study Process: Strategies, Challenges, and Roles	Shaolun Ruan et.al.	2507.10024	null
2025-07-14	The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents	Lixu Wang et.al.	2507.10016	null
2025-07-14	(Almost) Free Modality Stitching of Foundation Models	Jaisidh Singh et.al.	2507.10015	null
2025-07-14	Protective Factor-Aware Dynamic Influence Learning for Suicide Risk Prediction on Social Media	Jun Li et.al.	2507.10008	null
2025-07-14	Deep Hidden Cognition Facilitates Reliable Chain-of-Thought Reasoning	Zijun Chen et.al.	2507.10007	null
2025-07-14	Differentially Private Federated Low Rank Adaptation Beyond Fixed-Matrix	Ming Wen et.al.	2507.09990	null
2025-07-14	Demonstrating the Octopi-1.5 Visual-Tactile-Language Model	Samson Yu et.al.	2507.09985	null
2025-07-14	Tiny Reward Models	Sarah Pan et.al.	2507.09973	null
2025-07-14	AnalogTester: A Large Language Model-Based Framework for Automatic Testbench Generation in Analog Circuit Design	Weiyu Chen et.al.	2507.09965	null
2025-07-14	DeepSeek: Paradigm Shifts and Technical Evolution in Large AI Models	Luolin Xiong et.al.	2507.09955	null
2025-07-14	Can GPT-4o mini and Gemini 2.0 Flash Predict Fine-Grained Fashion Product Attributes? A Zero-Shot Analysis	Shubham Shukla et.al.	2507.09950	null
2025-07-14	Iceberg: Enhancing HLS Modeling with Synthetic Data	Zijian Ding et.al.	2507.09948	null
2025-07-14	Green-LLM: Optimal Workload Allocation for Environmentally-Aware Distributed Inference	Jiaming Cheng et.al.	2507.09942	null
2025-07-14	Memorization Sinks: Isolating Memorization during LLM Training	Gaurav R. Ghosal et.al.	2507.09937	null
2025-07-14	Enhancing Retrieval Augmented Generation with Hierarchical Text Segmentation Chunking	Hai Toan Nguyen et.al.	2507.09935	null
2025-07-14	Mechanistic Interpretability of LoRA-Adapted Language Models for Nuclear Reactor Safety Applications	Yoon Pyo Lee et.al.	2507.09931	null
2025-07-14	Solving dynamic portfolio selection problems via score-based diffusion models	Ahmad Aghapour et.al.	2507.09916	null
2025-07-14	Crucial-Diff: A Unified Diffusion Model for Crucial Image and Annotation Synthesis in Data-scarce Scenarios	Siyue Yao et.al.	2507.09915	null
2025-07-14	TolerantECG: A Foundation Model for Imperfect Electrocardiogram	Huynh Nguyen Dang et.al.	2507.09887	null
2025-07-14	VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains	Xuzhao Li et.al.	2507.09884	null
2025-07-14	AdaBrain-Bench: Benchmarking Brain Foundation Models for Brain-Computer Interface Applications	Jiamin Wu et.al.	2507.09882	null
2025-07-14	Covering a Few Submodular Constraints and Applications	Tanvi Bajpai et.al.	2507.09879	null
2025-07-14	ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models	Yongheng Zhang et.al.	2507.09876	null
2025-07-14	Function Induction and Task Generalization: An Interpretability Study with Off-by-One Addition	Qinyuan Ye et.al.	2507.09875	null
2025-07-14	Turning the Tide: Repository-based Code Reflection	Wei Zhang et.al.	2507.09866	null
2025-07-14	A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends	Yihao Ding et.al.	2507.09861	null
2025-07-14	Model-Grounded Symbolic Artificial Intelligence Systems Learning and Reasoning with Model-Grounded Symbolic Artificial Intelligence Systems	Aniruddha Chattopadhyay et.al.	2507.09854	null
2025-07-14	Rethinking Prompt Optimization: Reinforcement, Diversification, and Migration in Blackbox LLMs	MohammadReza Davari et.al.	2507.09839	null
2025-07-14	Generative Audio Language Modeling with Continuous-valued Tokens and Masked Next-Token Prediction	Shu-wen Yang et.al.	2507.09834	null
2025-07-13	Generative Cognitive Diagnosis	Jiatong Li et.al.	2507.09831	null
2025-07-13	Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications	Jia Yi Goh et.al.	2507.09820	null
2025-07-13	VRU-Accident: A Vision-Language Benchmark for Video Question Answering and Dense Captioning for Accident Scene Understanding	Younggun Kim et.al.	2507.09815	null
2025-07-13	A Scalable and Efficient Signal Integration System for Job Matching	Ping Liu et.al.	2507.09797	null
2025-07-13	CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design	Prashant Govindarajan et.al.	2507.09792	null
2025-07-13	Prompting for Performance: Exploring LLMs for Configuring Software	Helge Spieker et.al.	2507.09790	null
2025-07-13	TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit	Paulo Salem et.al.	2507.09788	null
2025-07-13	Efficient Molecular Conformer Generation with SO(3)-Averaged Flow Matching and Reflow	Zhonglin Cao et.al.	2507.09785	null
2025-07-13	Do we need equivariant models for molecule generation?	Ewa M. Nowara et.al.	2507.09753	null
2025-07-13	Sound and Complete Neuro-symbolic Reasoning with LLM-Grounded Interpretations	Bradley P. Allen et.al.	2507.09751	null
2025-07-13	BrainFLORA: Uncovering Brain Concept Representation via Multimodal Neural Embeddings	Dongyang Li et.al.	2507.09747	null
2025-07-13	Enhancing Trading Performance Through Sentiment Analysis with Large Language Models: Evidence from the S&P 500	Haojie Liu et.al.	2507.09739	null
2025-07-13	Continental scale habitat modelling with artificial intelligence and multimodal earth observation	Sara Si-Moussi et.al.	2507.09732	null
2025-07-13	Large Language Models Encode Semantics in Low-Dimensional Linear Subspaces	Baturay Saglam et.al.	2507.09709	null
2025-07-13	MCEval: A Dynamic Framework for Fair Multilingual Cultural Evaluation of LLMs	Shulin Huang et.al.	2507.09701	null
2025-07-13	ExpStar: Towards Automatic Commentary Generation for Multi-discipline Scientific Experiments	Jiali Chen et.al.	2507.09693	null
2025-07-13	Prompt2DEM: High-Resolution DEMs for Urban and Open Environments from Global Prompts Using a Monocular Foundation Model	Osher Rafaeli et.al.	2507.09681	null
2025-07-13	Can AI Rely on the Systematicity of Truth? The Challenge of Modelling Normative Domains	Matthieu Queloz et.al.	2507.09676	null
2025-07-13	Is Quantization a Deal-breaker? Empirical Insights from Large Code Models	Saima Afrin et.al.	2507.09665	null
2025-07-13	Towards Concise and Adaptive Thinking in Large Reasoning Models: A Survey	Jason Zhu et.al.	2507.09662	null
2025-07-13	Negotiating Comfort: Simulating Personality-Driven LLM Agents in Shared Residential Social Networks	Ann Nedime Nese Rende et.al.	2507.09657	null
2025-07-13	Cultivating Pluralism In Algorithmic Monoculture: The Community Alignment Dataset	Lily Hong Zhang et.al.	2507.09650	null
2025-07-13	Can Group Relative Policy Optimization Improve Thai Legal Reasoning and Question Answering?	Pawitsapak Akarajaradwong et.al.	2507.09638	null
2025-07-13	Demystifying Flux Architecture	Or Greenberg et.al.	2507.09595	null
2025-07-11	Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective	Hangjie Yuan et.al.	2507.08801	null
2025-07-11	NeuralOS: Towards Simulating Operating Systems via Neural Generative Models	Luke Rivard et.al.	2507.08800	null
2025-07-11	One Token to Fool LLM-as-a-Judge	Yulai Zhao et.al.	2507.08794	null
2025-07-11	From One to More: Contextual Part Latents for 3D Generation	Shaocong Dong et.al.	2507.08772	null
2025-07-11	BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity	Chenyang Song et.al.	2507.08771	null
2025-07-11	Multilingual Multimodal Software Developer for Code Generation	Linzheng Chai et.al.	2507.08719	null
2025-07-11	Unreal is all you need: Multimodal ISAC Data Simulation with Only One Engine	Kongwu Huang et.al.	2507.08716	null
2025-07-11	KG-Attention: Knowledge Graph-Guided Attention at Test-Time via Bidirectional Information Aggregation	Songlin Zhai et.al.	2507.08704	null
2025-07-11	ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way	Rajarshi Roy et.al.	2507.08679	null
2025-07-11	LLMCup: Ranking-Enhanced Comment Updating with LLMs	Hua Ge et.al.	2507.08671	null
2025-07-11	KELPS: A Framework for Verified Multi-Language Autoformalization via Semantic-Syntactic Alignment	Jiyao Zhang et.al.	2507.08665	null
2025-07-11	Introspection of Thought Helps AI Agents	Haoran Sun et.al.	2507.08664	null
2025-07-11	Leanabell-Prover-V2: Verifier-integrated Reasoning for Formal Theorem Proving via Reinforcement Learning	Xingguang Ji et.al.	2507.08649	null
2025-07-11	DatasetAgent: A Novel Multi-Agent System for Auto-Constructing Datasets from Real-World Images	Haoran Sun et.al.	2507.08648	null
2025-07-11	NL in the Middle: Code Translation with LLMs and Intermediate Representations	Chi-en Amy Tai et.al.	2507.08627	null
2025-07-11	A comprehensive study of LLM-based argument classification: from LLAMA through GPT-4o to Deepseek-R1	Marcin Pietroń et.al.	2507.08621	null
2025-07-11	Agentic Large Language Models for Conceptual Systems Engineering and Design	Soheyl Massoudi et.al.	2507.08619	null
2025-07-11	AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs	Florian Grötschla et.al.	2507.08616	null
2025-07-11	Emergent Natural Language with Communication Games for Improving Image Captioning Capabilities without Additional Data	Parag Dutta et.al.	2507.08610	null
2025-07-11	Unlocking Speech Instruction Data Potential with Query Rewriting	Yonghua Hei et.al.	2507.08603	null
2025-07-11	Visual Semantic Description Generation with MLLMs for Image-Text Matching	Junyu Chen et.al.	2507.08590	null
2025-07-11	To Trade or Not to Trade: An Agentic Approach to Estimating Market Risk Improves Trading Decisions	Dimitrios Emmanoulopoulos et.al.	2507.08584	null
2025-07-11	Large Multi-modal Model Cartographic Map Comprehension for Textual Locality Georeferencing	Kalana Wijegunarathna et.al.	2507.08575	null
2025-07-11	AbbIE: Autoregressive Block-Based Iterative Encoder for Efficient Sequence Modeling	Preslav Aleksandrov et.al.	2507.08567	null
2025-07-11	FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation	Yuxuan Jiang et.al.	2507.08557	null
2025-07-11	White-Basilisk: A Hybrid Model for Code Vulnerability Detection	Ioannis Lamprou et.al.	2507.08540	null
2025-07-11	The AI Language Proficiency Monitor – Tracking the Progress of LLMs on Multilingual Benchmarks	David Pomerenke et.al.	2507.08538	null
2025-07-11	A Multi-granularity Concept Sparse Activation and Hierarchical Knowledge Graph Fusion Framework for Rare Disease Diagnosis	Mingda Zhang et.al.	2507.08529	null
2025-07-11	InferLog: Accelerating LLM Inference for Online Log Parsing via ICL-oriented Prefix Caching	Yilun Wang et.al.	2507.08523	null
2025-07-11	Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation	Liu He et.al.	2507.08513	null
2025-07-11	From Language to Logic: A Bi-Level Framework for Structured Reasoning	Keying Yang et.al.	2507.08501	null
2025-07-11	Semantic-Augmented Latent Topic Modeling with LLM-in-the-Loop	Mengze Hong et.al.	2507.08498	null
2025-07-11	LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural Planning	Shibo Sun et.al.	2507.08496	null
2025-07-11	A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench	David Schlangen et.al.	2507.08491	null
2025-07-11	ILT-Iterative LoRA Training through Focus-Feedback-Fix for Multilingual Speech Recognition	Qingliang Meng et.al.	2507.08477	null
2025-07-11	SynBridge: Bridging Reaction States via Discrete Flow for Bidirectional Reaction Prediction	Haitao Lin et.al.	2507.08475	null
2025-07-11	Using Large Language Models for Legal Decision-Making in Austrian Value-Added Tax Law: An Experimental Study	Marina Luketina et.al.	2507.08468	null
2025-07-11	F3-Net: Foundation Model for Full Abnormality Segmentation of Medical Images with Flexible Input Modality Requirement	Seyedeh Sahar Taheri Otaghsara et.al.	2507.08460	null
2025-07-11	Diagnosing Failures in Large Language Models’ Answers: Integrating Error Attribution into Evaluation Framework	Zishan Xu et.al.	2507.08459	null
2025-07-11	A document is worth a structured record: Principled inductive bias design for document recognition	Benjamin Meyer et.al.	2507.08458	null
2025-07-11	CUE-RAG: Towards Accurate and Cost-Efficient Graph-Based RAG via Multi-Partite Graph and Query-Driven Iterative Retrieval	Yaodong Su et.al.	2507.08445	null
2025-07-11	Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation	Anlin Zheng et.al.	2507.08441	null
2025-07-11	Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences	Selina Heller et.al.	2507.08440	null
2025-07-11	xpSHACL: Explainable SHACL Validation using Retrieval-Augmented Generation and Large Language Models	Gustavo Correa Publio et.al.	2507.08432	null
2025-07-11	ChainEdit: Propagating Ripple Effects in LLM Knowledge Editing through Logical Rule-Guided Chains	Zilu Dong et.al.	2507.08427	null
2025-07-11	Generative artificial intelligence and hybrid models to accelerate LES in reactive flows: Application to hydrogen/methane combustion	Xiangrui Zou et.al.	2507.08426	null
2025-07-11	A Survey of Large Language Models in Discipline-specific Research: Challenges, Methods and Opportunities	Lu Xiang et.al.	2507.08425	null
2025-07-11	InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes	Zesong Yang et.al.	2507.08416	null
2025-07-11	Multi-modal Mutual-Guidance Conditional Prompt Learning for Vision-Language Models	Shijun Yang et.al.	2507.08410	null
2025-07-11	PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models	Yongjian Zhang et.al.	2507.08400	null
2025-07-11	Understanding Driving Risks using Large Language Models: Toward Elderly Driver Assessment	Yuki Yoshihara et.al.	2507.08367	null
2025-07-11	Leveraging Machine Learning and Enhanced Parallelism Detection for BPMN Model Generation from Text	Phuong Nam Lê et.al.	2507.08362	null
2025-07-11	Cycle Context Verification for In-Context Medical Image Segmentation	Shishuai Hu et.al.	2507.08357	null
2025-07-11	Exploring Design of Multi-Agent LLM Dialogues for Research Ideation	Keisuke Ueda et.al.	2507.08350	null
2025-07-11	What Factors Affect LLMs and RLLMs in Financial Question Answering?	Peng Wang et.al.	2507.08339	null
2025-07-11	CoCo-Bot: Energy-based Composable Concept Bottlenecks for Interpretable Generative Models	Sangwon Kim et.al.	2507.08334	null
2025-07-11	CRMAgent: A Multi-Agent LLM System for E-Commerce CRM Message Template Generation	Yinzhu Quan et.al.	2507.08325	null
2025-07-11	Generative AI in Science: Applications, Challenges, and Emerging Questions	Ryan Harries et.al.	2507.08310	null
2025-07-11	Improving MLLM’s Document Image Machine Translation via Synchronously Self-reviewing Its OCR Proficiency	Yupu Liang et.al.	2507.08309	null
2025-07-11	M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning	Inclusion AI et.al.	2507.08306	null
2025-07-11	KAT-V1: Kwai-AutoThink Technical Report	Zizheng Zhan et.al.	2507.08297	null
2025-07-11	Invariant-based Robust Weights Watermark for Large Language Models	Qingxiao Guo et.al.	2507.08288	null
2025-07-11	Lightweight Safety Guardrails via Synthetic Data and RL-guided Adversarial Training	Aleksei Ilin et.al.	2507.08284	null
2025-07-11	Agent Safety Alignment via Reinforcement Learning	Zeyang Sha et.al.	2507.08270	null
2025-07-11	A Practical Two-Stage Recipe for Mathematical LLMs: Maximizing Accuracy with SFT and Efficiency with Reinforcement Learning	Hiroshi Yoshihara et.al.	2507.08267	null
2025-07-11	CL3R: 3D Reconstruction and Contrastive Learning for Enhanced Robotic Manipulation Representations	Wenbo Cui et.al.	2507.08262	null
2025-07-11	Quantum-Accelerated Neural Imputation with Large Language Models (LLMs)	Hossein Jamali et.al.	2507.08255	null
2025-07-11	Raptor: Scalable Train-Free Embeddings for 3D Medical Volumes Leveraging Pretrained 2D Foundation Models	Ulzee An et.al.	2507.08254	null
2025-07-11	Leveraging Large Language Models for Classifying App Users’ Feedback	Yasaman Abedini et.al.	2507.08250	null
2025-07-11	Time Variation in the TeV Cosmic Ray Anisotropy with IceCube and Energy Dependence of the Solar Dipole	Perri Zilberman et.al.	2507.08242	null
2025-07-11	Data Generation without Function Estimation	Hadi Daneshmand et.al.	2507.08239	null
2025-07-11	InsightBuild: LLM-Powered Causal Reasoning in Smart Building Systems	Pinaki Prasad Guha Neogi et.al.	2507.08235	null
2025-07-11	Can LLMs Reliably Simulate Real Students’ Abilities in Mathematics and Reading Comprehension?	KV Aditya Srivatsa et.al.	2507.08232	null
2025-07-11	Making VLMs More Robot-Friendly: Self-Critical Distillation of Low-Level Procedural Reasoning	Chan Young Park et.al.	2507.08224	null
2025-07-10	Effect of Static vs. Conversational AI-Generated Messages on Colorectal Cancer Screening Intent: a Randomized Controlled Trial	Neil K. R. Sehgal et.al.	2507.08211	null
2025-07-10	Reasoning and Behavioral Equilibria in LLM-Nash Games: From Mindsets to Actions	Quanyan Zhu et.al.	2507.08208	null
2025-07-10	A Dynamic Stackelberg Game Framework for Agentic AI Defense Against LLM Jailbreaking	Zhengye Han et.al.	2507.08207	null
2025-07-10	TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs	Duygu Nur Yaldiz et.al.	2507.08203	null
2025-07-10	Consciousness as a Jamming Phase	Kaichen Ouyang et.al.	2507.08197	null
2025-07-10	CTRLS: Chain-of-Thought Reasoning via Latent State-Transition	Junda Wu et.al.	2507.08182	null
2025-07-10	Analysis of Propaganda in Tweets From Politically Biased Sources	Vivek Sharma et.al.	2507.08169	null
2025-07-10	KP-A: A Unified Network Knowledge Plane for Catalyzing Agentic Network Intelligence	Yun Tang et.al.	2507.08164	null
2025-07-10	ALCo-FM: Adaptive Long-Context Foundation Model for Accident Prediction	Pinaki Prasad Guha Neogi et.al.	2507.08153	null
2025-07-10	Distilling Empathy from Large Language Models	Henry J. Xie et.al.	2507.08151	null
2025-07-10	Compactor: Calibrated Query-Agnostic KV Cache Compression with Approximate Leverage Scores	Vivek Chari et.al.	2507.08143	null
2025-07-10	GRASP: Generic Reasoning And SPARQL Generation across Knowledge Graphs	Sebastian Walter et.al.	2507.08107	null
2025-07-10	Low-rank Momentum Factorization for Memory Efficient Training	Pouria Mahdavinia et.al.	2507.08091	null
2025-07-10	Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions	Simon Matrenok et.al.	2507.08068	null
2025-07-10	Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs	Ziyue Li et.al.	2507.07996	null
2025-07-10	Multigranular Evaluation for Brain Visual Decoding	Weihao Xia et.al.	2507.07993	null
2025-07-10	Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs	Jeongseok Hyun et.al.	2507.07990	null
2025-07-10	Automating Expert-Level Medical Reasoning Evaluation of Large Language Models	Shuang Zhou et.al.	2507.07988	null
2025-07-10	OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding	JingLi Lin et.al.	2507.07984	null
2025-07-10	Performance and Practical Considerations of Large and Small Language Models in Clinical Decision Support in Rheumatology	Sabine Felde et.al.	2507.07983	null
2025-07-10	Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling	Haoyu Wu et.al.	2507.07982	null
2025-07-10	Defending Against Prompt Injection With a Few DefensiveTokens	Sizhe Chen et.al.	2507.07974	null
2025-07-10	Scaling RL to Long Videos	Yukang Chen et.al.	2507.07966	null
2025-07-10	Dynamic Chunking for End-to-End Hierarchical Sequence Modeling	Sukjun Hwang et.al.	2507.07955	null
2025-07-10	Input Conditioned Layer Dropping in Speech Foundation Models	Abdul Hannan et.al.	2507.07954	null
2025-07-10	Low Resource Reconstruction Attacks Through Benign Prompts	Sol Yarkoni et.al.	2507.07947	null
2025-07-10	Can Large Language Models Improve Phishing Defense? A Large-Scale Controlled Experiment on Warning Dialogue Explanations	Federico Maria Cau et.al.	2507.07916	null
2025-07-10	MIRA: A Novel Framework for Fusing Modalities in Medical RAG	Jinhong Wang et.al.	2507.07902	null
2025-07-10	An Integrated Framework of Prompt Engineering and Multidimensional Knowledge Graphs for Legal Dispute Analysis	Mingda Zhang et.al.	2507.07893	null
2025-07-10	Automating MD simulations for Proteins using Large language Models: NAMD-Agent	Achuth Chandrasekhar et.al.	2507.07887	null
2025-07-10	Opting Out of Generative AI: a Behavioral Experiment on the Role of Education in Perplexity AI Avoidance	Roberto Ulloa et.al.	2507.07881	null
2025-07-10	LISTEN: Lightweight Industrial Sound-representable Transformer for Edge Notification	Changheon Han et.al.	2507.07879	null
2025-07-10	Mitigating Watermark Stealing Attacks in Generative Models via Multi-Key Watermarking	Toluwani Aremu et.al.	2507.07871	null
2025-07-10	DocCHA: Towards LLM-Augmented Interactive Online diagnosis System	Xinyi Liu et.al.	2507.07870	null
2025-07-10	THUNDER: Tile-level Histopathology image UNDERstanding benchmark	Pierre Marza et.al.	2507.07860	null
2025-07-10	From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems	Youngjoon Jang et.al.	2507.07847	null
2025-07-10	Towards Benchmarking Foundation Models for Tabular Data With Text	Martin Mráz et.al.	2507.07829	null
2025-07-10	MoSE: Skill-by-Skill Mixture-of-Expert Learning for Autonomous Driving	Lu Xu et.al.	2507.07818	null
2025-07-10	Understanding and Controlling Repetition Neurons and Induction Heads in In-Context Learning	Nhi Hoai Doan et.al.	2507.07810	null
2025-07-10	SecureSpeech: Prompt-based Speaker and Content Protection	Belinda Soh Hui Hui et.al.	2507.07799	null
2025-07-10	Measuring AI Alignment with Human Flourishing	Elizabeth Hilliard et.al.	2507.07787	null
2025-07-10	Where are we with calibration under dataset shift in image classification?	Mélanie Roschewitz et.al.	2507.07780	null
2025-07-10	A Unified Empirical Risk Minimization Framework for Flexible N-Tuples Weak Supervision	Shuying Huang et.al.	2507.07771	null
2025-07-10	Structured Prompts, Better Outcomes? Exploring the Effects of a Structured Interface with ChatGPT in a Graduate Robotics Course	Jerome Brender et.al.	2507.07767	null
2025-07-10	When Large Language Models Meet Law: Dual-Lens Taxonomy, Technical Advances, and Ethical Governance	Peizhang Shao et.al.	2507.07748	null
2025-07-10	On the capabilities of LLMs for classifying and segmenting time series of fruit picking motions into primitive actions	Eleni Konstantinidou et.al.	2507.07745	null
2025-07-10	GuardVal: Dynamic Large Language Model Jailbreak Evaluation for Comprehensive Safety Testing	Peiyan Zhang et.al.	2507.07735	null
2025-07-10	Not All Preferences are What You Need for Post-Training: Selective Alignment Strategy for Preference Optimization	Zhijin Dong et.al.	2507.07725	null
2025-07-10	KeyKnowledgeRAG (K^2RAG): An Enhanced RAG method for improved LLM question-answering capabilities	Hruday Markondapatnaikuni et.al.	2507.07695	null
2025-07-10	From Domain Documents to Requirements: Retrieval-Augmented Generation in the Space Industry	Chetan Arora et.al.	2507.07689	null
2025-07-10	Rationale-Enhanced Decoding for Multi-modal Chain-of-Thought	Shin’ya Yamaguchi et.al.	2507.07685	null
2025-07-10	Accelerating Transposed Convolutions on FPGA-based Edge Devices	Jude Haris et.al.	2507.07683	null
2025-07-10	Prompt Engineering for Requirements Engineering: A Literature Review and Roadmap	Kaicheng Huang et.al.	2507.07682	null
2025-07-10	PlanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations	Fedor Rodionov et.al.	2507.07644	null
2025-07-10	FrugalRAG: Learning to retrieve and reason for multi-hop QA	Abhinav Java et.al.	2507.07634	null
2025-07-10	T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates	Zhitao Wang et.al.	2507.07633	null
2025-07-10	Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks	Joyeeta Datta et.al.	2507.07630	null
2025-07-10	SpatialViz-Bench: Automatically Generated Spatial Visualization Reasoning Tasks for MLLMs	Siting Wang et.al.	2507.07610	null
2025-07-10	Enhancing Vaccine Safety Surveillance: Extracting Vaccine Mentions from Emergency Department Triage Notes Using Fine-Tuned Large Language Models	Sedigh Khademi et.al.	2507.07599	null
2025-07-10	NexViTAD: Few-shot Unsupervised Cross-Domain Defect Detection via Vision Foundation Models and Multi-Task Learning	Tianwei Mu et.al.	2507.07579	null
2025-07-10	Single-to-mix Modality Alignment with Multimodal Large Language Model for Document Image Machine Translation	Yupu Liang et.al.	2507.07572	null
2025-07-10	CEA-LIST at CheckThat! 2025: Evaluating LLMs as Detectors of Bias and Opinion in Text	Akram Elbouanani et.al.	2507.07539	null
2025-07-10	MAPEX: Modality-Aware Pruning of Experts for Remote Sensing Foundation Models	Joelle Hanna et.al.	2507.07527	null
2025-07-10	Toward Real-World Chinese Psychological Support Dialogues: CPsDD Dataset and a Co-Evolving Multi-Agent System	Yuanchen Shi et.al.	2507.07509	null
2025-07-10	PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving	Mihir Parmar et.al.	2507.07495	null
2025-07-10	Sparse Autoencoders Reveal Interpretable Structure in Small Gene Language Models	Haoxiang Guan et.al.	2507.07486	null
2025-07-10	Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models	Kaiqu Liang et.al.	2507.07484	null
2025-07-10	General purpose models for the chemical sciences	Nawaf Alampara et.al.	2507.07456	null
2025-07-10	RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning	Hongzhi Zhang et.al.	2507.07451	null
2025-07-10	StarDojo: Benchmarking Open-Ended Behaviors of Agentic Multimodal LLMs in Production-Living Simulations with Stardew Valley	Weihao Tan et.al.	2507.07445	null
2025-07-10	SAND: Boosting LLM Agents with Self-Taught Action Deliberation	Yu Xia et.al.	2507.07441	null
2025-07-10	Towards Interpretable Time Series Foundation Models	Matthieu Boileau et.al.	2507.07439	null
2025-07-10	Neural networks leverage nominally quantum and post-quantum representations	Paul M. Riechers et.al.	2507.07432	null
2025-07-10	DrugMCTS: a drug repurposing framework combining multi-agent, RAG and Monte Carlo Tree Search	Zerui Yang et.al.	2507.07426	null
2025-07-10	Corvid: Improving Multimodal Large Language Models Towards Chain-of-Thought Reasoning	Jingjing Jiang et.al.	2507.07424	null
2025-07-10	May I have your Attention? Breaking Fine-Tuning based Prompt Injection Defenses using Architecture-Aware Attacks	Nishit V. Pandya et.al.	2507.07417	null
2025-07-10	EPIC: Efficient Prompt Interaction for Text-Image Classification	Xinyao Yu et.al.	2507.07415	null
2025-07-10	GNN-CNN: An Efficient Hybrid Model of Convolutional and Graph Neural Networks for Text Representation	Fardin Rastakhiz et.al.	2507.07414	null
2025-07-10	Hybrid LLM-Enhanced Intrusion Detection for Zero-Day Threats in IoT Networks	Mohammad F. Al-Hammouri et.al.	2507.07413	null
2025-07-10	Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models	Jikesh Thapa et.al.	2507.07406	null
2025-07-10	KVFlow: Efficient Prefix Caching for Accelerating LLM-Based Multi-Agent Workflows	Zaifeng Pan et.al.	2507.07400	null
2025-07-10	Behave Your Motion: Habit-preserved Cross-category Animal Motion Transfer	Zhimin Zhang et.al.	2507.07394	null
2025-07-10	Learning Collective Variables from Time-lagged Generation	Seonghyun Park et.al.	2507.07390	null
2025-07-10	Bradley-Terry and Multi-Objective Reward Modeling Are Complementary	Zhiwei Zhang et.al.	2507.07375	null
2025-07-10	PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency	Haotian Wang et.al.	2507.07374	null
2025-07-09	On the Impossibility of Separating Intelligence from Judgment: The Computational Intractability of Filtering for AI Alignment	Sarah Ball et.al.	2507.07341	null
2025-07-09	Bridging the Plausibility-Validity Gap by Fine-Tuning a Reasoning-Enhanced LLM for Chemical Synthesis and Discovery	Malikussaid et.al.	2507.07328	null
2025-07-09	Frontier LLMs Still Struggle with Simple Reasoning Tasks	Alan Malek et.al.	2507.07313	null
2025-07-09	Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation	Anirban Saha Anik et.al.	2507.07307	null
2025-07-09	Application of LLMs to Multi-Robot Path Planning and Task Allocation	Ashish Kumar et.al.	2507.07302	null
2025-07-09	Time Series Foundation Models for Multivariate Financial Time Series Forecasting	Ben A. Marconi et.al.	2507.07296	null
2025-07-09	Thermodynamic Prediction Enabled by Automatic Dataset Building and Machine Learning	Juejing Liu et.al.	2507.07293	null
2025-07-09	Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery	Licong Xu et.al.	2507.07257	null
2025-07-09	A Language-Driven Framework for Improving Personalized Recommendations: Merging LLMs with Traditional Algorithms	Aaron Goldstein et.al.	2507.07251	null
2025-07-09	Medical Red Teaming Protocol of Language Models: On the Importance of User Perspectives in Healthcare Settings	Minseon Kim et.al.	2507.07248	null
2025-07-09	Attentions Under the Microscope: A Comparative Study of Resource Utilization for Variants of Self-Attention	Zhengyu Tian et.al.	2507.07247	null
2025-07-09	An Information-Theoretic Perspective on Multi-LLM Uncertainty Estimation	Maya Kruse et.al.	2507.07236	null
2025-07-09	SynthTextEval: Synthetic Text Data Generation and Evaluation for High-Stakes Domains	Krithika Ramesh et.al.	2507.07229	null
2025-07-09	Compute Can’t Handle the Truth: Why Communication Tax Prioritizes Memory and Interconnects in Modern AI Infrastructure	Myoungsoo Jung et.al.	2507.07223	null
2025-07-09	Neurosymbolic Feature Extraction for Identifying Forced Labor in Supply Chains	Zili Wang et.al.	2507.07217	null
2025-07-09	Scale leads to compositional generalization	Florian Redhardt et.al.	2507.07207	null
2025-07-09	State-Inference-Based Prompting for Natural Language Trading with Game NPCs	Minkyung Kim et.al.	2507.07203	null
2025-07-09	A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality	Mohamed Elmoghany et.al.	2507.07202	null
2025-07-09	Combining Pre-Trained Models for Enhanced Feature Representation in Reinforcement Learning	Elia Piccoli et.al.	2507.07197	null
2025-07-09	Bridging the Last Mile of Prediction: Enhancing Time Series Forecasting with Conditional Guided Flow Matching	Huibo Xu et.al.	2507.07192	null
2025-07-09	Prompt Perturbations Reveal Human-Like Biases in LLM Survey Responses	Jens Rupprecht et.al.	2507.07188	null
2025-07-09	Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs	Itay Itzhak et.al.	2507.07186	null
2025-07-09	Interpretable EEG-to-Image Generation with Semantic Prompts	Arshak Rezvani et.al.	2507.07157	null
2025-07-09	Evaluating Retrieval-Augmented Generation Agents for Autonomous Scientific Discovery in Astrophysics	Xueqing Xu et.al.	2507.07155	null
2025-07-09	Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor	Vatsal Agarwal et.al.	2507.07106	null
2025-07-09	Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models	Tiezheng Zhang et.al.	2507.07104	null
2025-07-09	Evaluating Attribute Confusion in Fashion Text-to-Image Generation	Ziyue Liu et.al.	2507.07079	null
2025-07-09	5C Prompt Contracts: A Minimalist, Creative-Friendly, Token-Efficient Design Framework for Individual and SME LLM Usage	Ugur Ari et.al.	2507.07045	null
2025-07-09	UniConv: Unifying Retrieval and Response Generation for Large Language Models in Conversations	Fengran Mo et.al.	2507.07030	null
2025-07-09	First Return, Entropy-Eliciting Explore	Tianyu Zheng et.al.	2507.07017	null
2025-07-09	Integrating Pathology Foundation Models and Spatial Transcriptomics for Cellular Decomposition from Histology Images	Yutong Sun et.al.	2507.07013	null
2025-07-09	GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning	S M Taslim Uddin Raju et.al.	2507.07006	null
2025-07-09	Learning Deliberately, Acting Intuitively: Unlocking Test-Time Reasoning in Multimodal LLMs	Yahan Yu et.al.	2507.06999	null
2025-07-09	MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation	Qilong Xing et.al.	2507.06992	null
2025-07-09	Are They All Good? Evaluating the Quality of CoTs in LLM-based Code Generation	Binquan Zhang et.al.	2507.06980	null
2025-07-09	Hallucinating 360°: Panoramic Street-View Generation via Local Scenes Diffusion and Probabilistic Prompting	Fei Teng et.al.	2507.06971	null
2025-07-09	Scaling Towards the Information Boundary of Instruction Set: InfinityInstruct-Subject Technical Report	Li Du et.al.	2507.06968	null
2025-07-09	Investigating the Robustness of Retrieval-Augmented Generation at the Query Level	Sezen Perçin et.al.	2507.06956	null
2025-07-09	What Has a Foundation Model Found? Using Inductive Bias to Probe for World Models	Keyon Vafa et.al.	2507.06952	null
2025-07-09	Rethinking Verification for LLM Code Generation: From Generation to Testing	Zihan Ma et.al.	2507.06920	null
2025-07-09	Exploring LLMs for Predicting Tutor Strategy and Student Outcomes in Dialogues	Fareya Ikram et.al.	2507.06910	null
2025-07-09	MultiJustice: A Chinese Dataset for Multi-Party, Multi-Charge Legal Prediction	Xiao Wang et.al.	2507.06909	null
2025-07-09	SCoRE: Streamlined Corpus-based Relation Extraction using Multi-Label Contrastive Learning and Bayesian kNN	Luca Mariotti et.al.	2507.06895	null
2025-07-09	Developing and Maintaining an Open-Source Repository of AI Evaluations: Challenges and Insights	Alexandra Abbas et.al.	2507.06893	null
2025-07-09	Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model	Jing Liang et.al.	2507.06892	null
2025-07-09	DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models	Liang Wang et.al.	2507.06853	null
2025-07-09	The Dark Side of LLMs Agent-based Attacks for Complete Computer Takeover	Matteo Lupinacci et.al.	2507.06850	null
2025-07-09	Physics-Grounded Motion Forecasting via Equation Discovery for Trajectory-Guided Image-to-Video Generation	Tao Feng et.al.	2507.06830	null
2025-07-09	Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework	Zenan Xu et.al.	2507.06829	null
2025-07-09	Democratizing High-Fidelity Co-Speech Gesture Video Generation	Xu Yang et.al.	2507.06812	null
2025-07-09	Text to model via SysML: Automated generation of dynamical system computational models from unstructured natural language text via enhanced System Modeling Language diagrams	Matthew Anderson Hendricks et.al.	2507.06803	null
2025-07-09	Efficient Industrial sLLMs through Domain Adaptive Continual Pretraining: Method, Evaluation and Applications	Seonwu Kim et.al.	2507.06795	null
2025-07-09	Checklist Engineering Empowers Multilingual LLM Judges	Mohammad Ghiasvand Mohammadkhani et.al.	2507.06774	null
2025-07-09	Leveraging LLMs for Semantic Conflict Detection via Unit Test Generation	Nathalia Barbosa et.al.	2507.06762	null
2025-07-09	LOVON: Legged Open-Vocabulary Object Navigator	Daojie Peng et.al.	2507.06747	null
2025-07-09	PenTest2.0: Towards Autonomous Privilege Escalation Using GenAI	Haitham S. Al-Sinani et.al.	2507.06742	null
2025-07-09	Hierarchical Feature Alignment for Gloss-Free Sign Language Translation	Sobhan Asasi et.al.	2507.06732	null
2025-07-09	On the Effect of Uncertainty on Layer-wise Inference Dynamics	Sunwoo Kim et.al.	2507.06722	null
2025-07-09	A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding	Zhenyang Liu et.al.	2507.06719	null
2025-07-09	CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs	Garapati Keerthana et.al.	2507.06715	null
2025-07-09	Elite Polarization in European Parliamentary Speeches: a Novel Measurement Approach Using Large Language Models	Gennadii Iakovlev et.al.	2507.06658	null
2025-07-09	Deep Disentangled Representation Network for Treatment Effect Estimation	Hui Meng et.al.	2507.06650	null
2025-07-09	EXAONE Path 2.0: Pathology Foundation Model with End-to-End Supervision	Myungjang Pyeon et.al.	2507.06639	null
2025-07-09	UniOD: A Universal Model for Outlier Detection across Diverse Domains	Dazhi Fu et.al.	2507.06624	null
2025-07-09	Expediting data extraction using a large language model (LLM) and scoping review protocol: a methodological study within a complex scoping review	James Stewart-Evans et.al.	2507.06623	null
2025-07-09	FuDoBa: Fusing Document and Knowledge Graph-based Representations with Bayesian Optimisation	Boshko Koloski et.al.	2507.06622	null
2025-07-09	Denoising Multi-Beta VAE: Representation Learning for Disentanglement and Generation	Anshuk Uppal et.al.	2507.06613	null
2025-07-09	From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization	Xinjie Chen et.al.	2507.06573	null
2025-07-09	SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference	Qian Chen et.al.	2507.06567	null
2025-07-09	The Flaws of Others: An LLM-driven Framework for Scientific Knowledge Production	Juan B. Gutiérrez et.al.	2507.06565	null
2025-07-09	SkyVLN: Vision-and-Language Navigation and NMPC Control for UAVs in Urban Environments	Tianshun Li et.al.	2507.06564	null
2025-07-09	SPEAR: Subset-sampled Performance Evaluation via Automated Ground Truth Generation for RAG	Zou Yuheng et.al.	2507.06554	null
2025-07-09	Large Language Model for Extracting Complex Contract Information in Industrial Scenes	Yunyang Cao et.al.	2507.06539	null
2025-07-09	InvestAlign: Overcoming Data Scarcity in Aligning Large Language Models with Investor Decision-Making Processes under Herd Behavior	Huisheng Wang et.al.	2507.06528	null
2025-07-09	FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation	Liqiang Jing et.al.	2507.06523	null
2025-07-09	SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers	Zicong Tang et.al.	2507.06517	null
2025-07-09	QUEST: Query Optimization in Unstructured Document Analysis	Zhaoze Sun et.al.	2507.06515	null
2025-07-09	Towards LLM-based Root Cause Analysis of Hardware Design Failures	Siyu Qiu et.al.	2507.06512	null
2025-07-09	Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection	Yupeng Hu et.al.	2507.06510	null
2025-07-09	GR-LLMs: Recent Advances in Generative Recommendation Based on Large Language Models	Zhen Yang et.al.	2507.06507	null
2025-07-09	Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings	Russell Taylor et.al.	2507.06506	null
2025-07-09	MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting Models	Yiwen Liu et.al.	2507.06502	null
2025-07-09	On the Robustness of Verbal Confidence of LLMs in Adversarial Attacks	Stephen Obadinma et.al.	2507.06489	null
2025-07-09	Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning	Ziyang Wang et.al.	2507.06485	null
2025-07-09	3D-Generalist: Self-Improving Vision-Language-Action Models for Crafting 3D Worlds	Fan-Yun Sun et.al.	2507.06484	null
2025-07-09	Learning Japanese with Jouzu: Interaction Outcomes with Stylized Dialogue Fictional Agents	Zackary Rackauckas et.al.	2507.06483	null
2025-07-09	IMPACT: Industrial Machine Perception via Acoustic Cognitive Transformer	Changheon Han et.al.	2507.06481	null
2025-07-09	Generative Lagrangian data assimilation for ocean dynamics under extreme sparsity	Niloofar Asefi et.al.	2507.06479	null
2025-07-09	Foundation Model Self-Play: Open-Ended Strategy Innovation via Foundation Models	Aaron Dharna et.al.	2507.06466	null
2025-07-09	Evaluating Efficiency and Novelty of LLM-Generated Code for Graph Analysis	Atieh Barati Nia et.al.	2507.06463	null
2025-07-08	A Semantic Parsing Framework for End-to-End Time Normalization	Xin Su et.al.	2507.06450	null
2025-07-08	Perception-Aware Policy Optimization for Multimodal Reasoning	Zhenhailong Wang et.al.	2507.06448	null
2025-07-08	Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders	Shun Wang et.al.	2507.06427	null
2025-07-08	Reward Models Can Improve Themselves: Reward-Guided Adversarial Failure Mode Discovery for Robust Reward Modeling	Pankayaraj Pathmanathan et.al.	2507.06419	null
2025-07-08	PAST: A multimodal single-cell foundation model for histopathology and spatial transcriptomics in cancer	Changchun Yang et.al.	2507.06418	null
2025-07-08	Voltage Regulation in Distribution Systems with Data Center Loads	Yize Chen et.al.	2507.06416	null
2025-07-08	An AI-Driven Thermal-Fluid Testbed for Advanced Small Modular Reactors: Integration of Digital Twin and Large Language Models	Doyeong Lim et.al.	2507.06399	null
2025-07-08	SLDB: An End-To-End Heterogeneous System-on-Chip Benchmark Suite for LLM-Aided Design	Elisavet Lydia Alvanaki et.al.	2507.06376	null
2025-07-08	Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms	Tarek Gasmi et.al.	2507.06323	null
2025-07-08	Too Human to Model: The Uncanny Valley of LLMs in Social Simulation – When Generative Language Agents Misalign with Modelling Principles	Yongchao Zeng et.al.	2507.06310	null
2025-07-08	Humans overrely on overconfident language models, across languages	Neil Rathi et.al.	2507.06306	null
2025-07-08	RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models	Keyan Chen et.al.	2507.06231	null
2025-07-08	Efficiency-Effectiveness Reranking FLOPs for LLM-based Rerankers	Zhiyuan Peng et.al.	2507.06223	null
2025-07-08	Is Diversity All You Need for Scalable Robotic Manipulation?	Modi Shi et.al.	2507.06219	null
2025-07-08	A Survey on Latent Reasoning	Rui-Jie Zhu et.al.	2507.06203	null
2025-07-08	UQLM: A Python Package for Uncertainty Quantification in Large Language Models	Dylan Bouchard et.al.	2507.06196	null
2025-07-08	SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads	Jiale Lao et.al.	2507.06192	null
2025-07-08	Hidden Prompts in Manuscripts Exploit AI-Assisted Peer Review	Zhicheng Lin et.al.	2507.06185	null
2025-07-08	Data-Semantics-Aware Recommendation of Diverse Pivot Tables	Whanhee Cho et.al.	2507.06171	null
2025-07-09	Skywork-R1V3 Technical Report	Wei Shen et.al.	2507.06167	null
2025-07-08	Evaluation of Habitat Robotics using Large Language Models	William Li et.al.	2507.06157	null
2025-07-08	Large Language Models Predict Human Well-being – But Not Equally Everywhere	Pat Pataranutaporn et.al.	2507.06141	null
2025-07-08	Coding Triangle: How Does Large Language Model Understand Code?	Taolin Zhang et.al.	2507.06138	null
2025-07-08	PrefixAgent: An LLM-Powered Design Framework for Efficient Prefix Adder Optimization	Dongsheng Zuo et.al.	2507.06127	null
2025-07-09	Omni-Video: Democratizing Unified Video Understanding and Generation	Zhiyu Tan et.al.	2507.06119	null
2025-07-08	Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis	Xintong Hu et.al.	2507.06116	null
2025-07-08	Reflections Unlock: Geometry-Aware Reflection Disentanglement in 3D Gaussian Splatting for Photorealistic Scenes Rendering	Jiayi Song et.al.	2507.06103	null
2025-07-09	FEVO: Financial Knowledge Expansion and Reasoning Evolution for Large Language Models	Bo Pang et.al.	2507.06057	null
2025-07-08	Entropy-Memorization Law: Evaluating Memorization Difficulty of Data in LLMs	Yizhan Huang et.al.	2507.06056	null
2025-07-08	Kernel Trace Distance: Quantum Statistical Metric between Measures through RKHS Density Operators	Arturo Castellanos et.al.	2507.06055	null
2025-07-08	Hierarchical Interaction Summarization and Contrastive Prompting for Explainable Recommendations	Yibin Liu et.al.	2507.06044	null
2025-07-08	CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations	Xiaohu Li et.al.	2507.06043	null
2025-07-08	CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation	Kushal Gajjar et.al.	2507.06013	null
2025-07-08	DocIE@XLLM25: In-Context Learning for Information Extraction using Fully Synthetic Demonstrations	Nicholas Popovič et.al.	2507.05997	null
2025-07-08	Development and Evaluation of HopeBot: an LLM-based chatbot for structured and interactive PHQ-9 depression screening	Zhijun Guo et.al.	2507.05984	null
2025-07-08	Multi-Agent Debate Strategies to Enhance Requirements Engineering with Large Language Models	Marc Oriol et.al.	2507.05981	null
2025-07-08	RabakBench: Scaling Human Annotations to Construct Localized Multilingual Safety Benchmarks for Low-Resource Languages	Gabriel Chua et.al.	2507.05980	null
2025-07-08	Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval	Haiwen Li et.al.	2507.05970	null
2025-07-08	OpenFActScore: Open-Source Atomic Evaluation of Factuality in Text Generation	Lucas Fonseca Lage et.al.	2507.05965	null
2025-07-08	Evaluation of Large Language Model-Driven AutoML in Data and Model Management from Human-Centered Perspective	Jiapeng Yao et.al.	2507.05962	null
2025-07-08	A Wireless Foundation Model for Multi-Task Prediction	Yucheng Sheng et.al.	2507.05938	null
2025-07-08	BlueLM-2.5-3B Technical Report	Baojiao Xiong et.al.	2507.05934	null
2025-07-08	Few-shot text-based emotion detection	Teodor-George Marchitan et.al.	2507.05918	null
2025-07-08	Best-of-N through the Smoothing Lens: KL Divergence and Regret Analysis	Gholamali Aminian et.al.	2507.05913	null
2025-07-08	AI-Reporter: A Path to a New Genre of Scientific Communication	Gerd Graßhoff et.al.	2507.05903	null
2025-07-08	Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators	Sungjib Lim et.al.	2507.05890	null
2025-07-08	Current Practices for Building LLM-Powered Reasoning Tools Are Ad Hoc – and We Can Do Better	Aaron Bembenek et.al.	2507.05886	null
2025-07-08	RecRankerEval: A Flexible and Extensible Framework for Top-k LLM-based Recommendation	Zeyuan Meng et.al.	2507.05880	null
2025-07-08	KERAG_R: Knowledge-Enhanced Retrieval-Augmented Generation for Recommendation	Zeyuan Meng et.al.	2507.05863	null
2025-07-08	USIGAN: Unbalanced Self-Information Feature Transport for Weakly Paired Image IHC Virtual Staining	Yue Peng et.al.	2507.05843	null
2025-07-08	Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models	Léa Dubois et.al.	2507.05822	null
2025-07-08	2D Instance Editing in 3D Space	Yuhuan Xie et.al.	2507.05819	null
2025-07-08	Affective-ROPTester: Capability and Bias Analysis of LLMs in Predicting Retinopathy of Prematurity	Shuai Zhao et.al.	2507.05816	null
2025-07-08	Just Say Better or Worse: A Human-AI Collaborative Framework for Medical Image Segmentation Without Manual Annotations	Yizhe Zhang et.al.	2507.05815	null
2025-07-08	Improving Robustness of Foundation Models in Domain Adaptation with Soup-Adapters	Marco Roschkowski et.al.	2507.05807	null
2025-07-08	DREAM: Document Reconstruction via End-to-end Autoregressive Model	Xin Li et.al.	2507.05805	null
2025-07-08	Creating a customisable freely-accessible Socratic AI physics tutor	Eugenio Tufino et.al.	2507.05795	null
2025-07-08	TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model	Yujie Hu et.al.	2507.05790	null
2025-07-08	Flippi: End To End GenAI Assistant for E-Commerce	Anand A. Rajasekar et.al.	2507.05788	null
2025-07-08	Text-Guided Token Communication for Wireless Image Transmission	Bole Liu et.al.	2507.05781	null
2025-07-08	LeAD: The LLM Enhanced Planning System Converged with End-to-end Autonomous Driving	Yuhang Zhang et.al.	2507.05754	null
2025-07-08	Jigsaw: Training Multi-Billion-Parameter AI Weather Models with Optimized Model Parallelism	Deifilia Kieckhefen et.al.	2507.05753	null
2025-07-08	DocTalk: Scalable Graph-based Dialogue Synthesis for Enhancing LLM Conversational Capabilities	Jing Yang Lee et.al.	2507.05750	null
2025-07-08	Tissue Concepts v2: a Supervised Foundation Model for whole slide images	Till Nicke et.al.	2507.05742	null
2025-07-08	When Transformers Meet Recommenders: Integrating Self-Attentive Sequential Recommendation with Fine-Tuned LLMs	Kechen Liu et.al.	2507.05733	null
2025-07-08	ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark	He Wang et.al.	2507.05727	null
2025-07-08	Large Language Models for Agent-Based Modelling: Current and possible uses across the modelling cycle	Loïs Vanhée et.al.	2507.05723	null
2025-07-08	HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation	YiHan Jiao et.al.	2507.05714	null
2025-07-08	DRAGON: Dynamic RAG Benchmark On News	Fedor Chernogorskii et.al.	2507.05713	null
2025-07-08	Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs	SeungWon Ji et.al.	2507.05686	null
2025-07-08	MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos	Rongsheng Wang et.al.	2507.05675	null
2025-07-08	Integrating Diffusion-based Multi-task Learning with Online Reinforcement Learning for Robust Quadruped Robot Control	Xinyao Qin et.al.	2507.05674	null
2025-07-08	TuneShield: Mitigating Toxicity in Conversational AI while Fine-tuning on Untrusted Data	Aravind Cheruvu et.al.	2507.05660	null
2025-07-08	LLMs are Introvert	Litian Zhang et.al.	2507.05638	null
2025-07-08	SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression	Yiqiao Jin et.al.	2507.05633	null
2025-07-08	Enhancing Student Learning with LLM-Generated Retrieval Practice Questions: An Empirical Study in Data Science Courses	Yuan An et.al.	2507.05629	null
2025-07-08	DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation	Young Hun Kim et.al.	2507.05627	null
2025-07-08	Flipping Knowledge Distillation: Leveraging Small Models’ Expertise to Enhance LLMs in Text Matching	Mingzhe Li et.al.	2507.05617	null
2025-07-08	Domain adaptation of large language models for geotechnical applications	Lei Fan et.al.	2507.05613	null
2025-07-08	MMW: Side Talk Rejection Multi-Microphone Whisper on Smart Glasses	Yang Liu et.al.	2507.05609	null
2025-07-08	Structured Task Solving via Modular Embodied Intelligence: A Case Study on Rubik’s Cube	Chongshan Fan et.al.	2507.05607	null
2025-07-08	Self-Review Framework for Enhancing Instruction Following Capability of LLM	Sihyun Park et.al.	2507.05598	null
2025-07-08	PaddleOCR 3.0 Technical Report	Cheng Cui et.al.	2507.05595	null
2025-07-08	MLlm-DR: Towards Explainable Depression Recognition with MultiModal Large Language Models	Wei Zhang et.al.	2507.05591	null
2025-07-08	The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation	Alexander Xiong et.al.	2507.05578	null
2025-07-08	Beyond Retrieval: Ensembling Cross-Encoders and GPT Rerankers with LLMs for Biomedical QA	Shashank Verma et.al.	2507.05577	null
2025-07-08	Prompt Migration: Stabilizing GenAI Applications with Evolving Large Language Models	Shivani Tripathi et.al.	2507.05573	null
2025-07-08	ReLayout: Integrating Relation Reasoning for Content-aware Layout Generation with Multi-modal Large Language Models	Jiaxu Tian et.al.	2507.05568	null
2025-07-08	Search-based Selection of Metamorphic Relations for Optimized Robustness Testing of Large Language Models	Sangwon Hyun et.al.	2507.05565	null
2025-07-08	Enhancing Test-Time Scaling of Large Language Models with Hierarchical Retrieval-Augmented MCTS	Alex ZH Dou et.al.	2507.05557	null
2025-07-08	A Malliavin calculus approach to score functions in diffusion generative models	Ehsan Mirafzali et.al.	2507.05550	null
2025-07-07	SenseCF: LLM-Prompted Counterfactuals for Intervention and Sensor Data Augmentation	Shovito Barua Soumma et.al.	2507.05541	null
2025-07-07	Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment	Jiahuan Pei et.al.	2507.05528	null
2025-07-07	Empowering Healthcare Practitioners with Language Models: Structuring Speech Transcripts in Two Real-World Clinical Applications	Jean-Philippe Corbeil et.al.	2507.05517	null
2025-07-07	Tool for Supporting Debugging and Understanding of Normative Requirements Using LLMs	Alex Kleijwegt et.al.	2507.05504	null
2025-07-07	MolFORM: Multi-modal Flow Matching for Structure-Based Drug Design	Jie Huang et.al.	2507.05503	null
2025-07-07	Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents	Prahaladh Chandrahasan et.al.	2507.05495	null
2025-07-07	MBFormer: A General Transformer-based Learning Paradigm for Many-body Interactions in Real Materials	Bowen Hou et.al.	2507.05480	null
2025-07-07	Dense and comeager conjugacy classes in zero-dimensional dynamics	Michal Doucha et.al.	2507.05474	null
2025-07-07	Inaugural MOASEI Competition at AAMAS’2025: A Technical Report	Ceferino Patino et.al.	2507.05469	null
2025-07-07	Risk-Aware Aerocapture Guidance Through a Probabilistic Indicator Function	Grace E. Calkins et.al.	2507.05454	null
2025-07-07	On the Semantics of Large Language Models	Martin Schuele et.al.	2507.05448	null
2025-07-07	PhoniTale: Phonologically Grounded Mnemonic Generation for Typologically Distant Language Pairs	Sana Kang et.al.	2507.05444	null
2025-07-07	Mastering Regional 3DGS: Locating, Initializing, and Editing with Diverse 2D Priors	Lanqing Guo et.al.	2507.05426	null
2025-07-07	“Lost-in-the-Later”: Framework for Quantifying Contextual Grounding in Large Language Models	Yufei Tao et.al.	2507.05424	null
2025-07-07	Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning	Jaedong Hwang et.al.	2507.05418	null
2025-07-07	PBE Meets LLM: When Few Examples Aren’t Few-Shot Enough	Shuning Zhang et.al.	2507.05403	null
2025-07-07	Neural-Driven Image Editing	Pengfei Zhou et.al.	2507.05397	null
2025-07-07	Controlling What You Share: Assessing Language Model Adherence to Privacy Preferences	Guillem Ramírez et.al.	2507.05391	null
2025-07-07	From General to Specialized: The Need for Foundational Models in Agriculture	Vishal Nedungadi et.al.	2507.05390	null
2025-07-07	Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training	Song Lai et.al.	2507.05386	null
2025-07-07	Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing	Chun-Hsiao Yeh et.al.	2507.05259	null
2025-07-07	Spatio-Temporal LLM: Reasoning about Environments and Actions	Haozhen Zheng et.al.	2507.05258	null
2025-07-07	Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions	Yuanzhe Hu et.al.	2507.05257	null
2025-07-07	Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning	Yana Wei et.al.	2507.05255	null
2025-07-07	Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models	Ziqi Miao et.al.	2507.05248	null
2025-07-07	Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration	Benjamin Li et.al.	2507.05244	null
2025-07-07	StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling	Meng Wei et.al.	2507.05240	null
2025-07-07	All in One: Visual-Description-Guided Unified Point Cloud Segmentation	Zongyan Han et.al.	2507.05211	null
2025-07-07	MedGemma Technical Report	Andrew Sellergren et.al.	2507.05201	null
2025-07-07	CREW-WILDFIRE: Benchmarking Agentic Multi-Agent Collaborations at Scale	Jonathan Hyun et.al.	2507.05178	null
2025-07-07	OpenS2S: Advancing Open-Source End-to-End Empathetic Large Speech Language Model	Chen Wang et.al.	2507.05177	null
2025-07-07	A Dynamical Systems Perspective on the Analysis of Neural Networks	Dennis Chemnitz et.al.	2507.05164	null
2025-07-07	4DSloMo: 4D Reconstruction for High Speed Scene with Asynchronous Capture	Yutian Chen et.al.	2507.05163	null
2025-07-07	AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models	Chinnappa Guggilla et.al.	2507.05157	null
2025-07-07	Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization	Jaewook Lee et.al.	2507.05137	null
2025-07-07	LERa: Replanning with Visual Feedback in Instruction Following	Svyatoslav Pchelintsev et.al.	2507.05135	null
2025-07-07	An Evaluation of Large Language Models on Text Summarization Tasks Using Prompt Engineering Techniques	Walid Mohamed Aly et.al.	2507.05123	null
2025-07-07	LVM4CSI: Enabling Direct Application of Pre-Trained Large Vision Models for Wireless Channel Tasks	Jiajia Guo et.al.	2507.05121	null
2025-07-07	VerifyLLM: LLM-Based Pre-Execution Task Plan Verification for Robots	Danil S. Grigorev et.al.	2507.05118	null
2025-07-07	DICE: Discrete inverse continuity equation for learning population dynamics	Tobias Blickhan et.al.	2507.05107	null
2025-07-07	The Hidden Threat in Plain Text: Attacking RAG Data Loaders	Alberto Castagnaro et.al.	2507.05093	null
2025-07-07	Gaussian approximation for non-linearity parameter estimation in perturbed random fields on the sphere	Claudio Durastanti et.al.	2507.05074	null
2025-07-07	ICAS: Detecting Training Data from Autoregressive Image Generative Models	Hongyao Yu et.al.	2507.05068	null
2025-07-07	Replacing thinking with tool usage enables reasoning in small language models	Corrado Rainone et.al.	2507.05065	null
2025-07-07	What Shapes User Trust in ChatGPT? A Mixed-Methods Study of User Attributes, Trust Dimensions, Task Context, and Societal Perceptions among University Students	Kadija Bouyzourn et.al.	2507.05046	null
2025-07-07	MoLink: Distributed and Efficient Serving Framework for Large Models	Lewei Jin et.al.	2507.05043	null
2025-07-07	Beyond Scaling Curves: Internal Dynamics of Neural Networks Through the NTK Lens	Konstantin Nikolaou et.al.	2507.05035	null
2025-07-07	Estimating Object Physical Properties from RGB-D Vision and Depth Robot Sensors Using Deep Learning	Ricardo Cardoso et.al.	2507.05029	null
2025-07-07	A Generative Diffusion Model for Amorphous Materials	Kai Yang et.al.	2507.05024	null
2025-07-07	Co-DETECT: Collaborative Discovery of Edge Cases in Text Classification	Chenfei Xiong et.al.	2507.05010	null
2025-07-07	Multi-modal Representations for Fine-grained Multi-label Critical View of Safety Recognition	Britty Baby et.al.	2507.05007	null
2025-07-07	From Autonomy to Agency: Agentic Vehicles for Human-Centered Mobility Systems	Jiangbo Yu et.al.	2507.04996	null
2025-07-07	Parameterized Diffusion Optimization enabled Autoregressive Ordinal Regression for Diabetic Retinopathy Grading	Qinkai Yu et.al.	2507.04978	null
2025-07-07	Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models	Eunseop Yoon et.al.	2507.04976	null
2025-07-07	The Case for Instance-Optimized LLMs in OLAP Databases	Bardia Mohammadi et.al.	2507.04967	null
2025-07-07	EXPOTION: Facial Expression and Motion Control for Multimodal Music Generation	Fathinah Izzati et.al.	2507.04955	null
2025-07-07	ArtifactsBench: Bridging the Visual-Interactive Gap in LLM Code Generation Evaluation	Chenchen Zhang et.al.	2507.04952	null
2025-07-07	ReLoop: “Seeing Twice and Thinking Backwards” via Closed-loop Training to Mitigate Hallucinations in Multimodal understanding	Jianjiang Yang et.al.	2507.04943	null
2025-07-07	Contextual Light-Particle Interference	Brian Stout et.al.	2507.04935	null
2025-07-07	LIFT: Automating Symbolic Execution Optimization with Large Language Models for AI Networks	Ruoxi Wang et.al.	2507.04931	null
2025-07-07	HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding	Yuxuan Cai et.al.	2507.04909	null
2025-07-07	Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations	A. Bochkov et.al.	2507.04886	null
2025-07-07	DoPI: Doctor-like Proactive Interrogation LLM for Traditional Chinese Medicine	Zewen Sun et.al.	2507.04877	null
2025-07-07	Music Boomerang: Reusing Diffusion Models for Data Augmentation and Audio Manipulation	Alexander Fichtinger et.al.	2507.04864	null
2025-07-07	Supporting Software Formal Verification with Large Language Models: An Experimental Study	Weiqi Wang et.al.	2507.04857	null
2025-07-07	Semantically Consistent Discrete Diffusion for 3D Biological Graph Modeling	Chinmay Prabhakar et.al.	2507.04856	null
2025-07-07	Grahak-Nyay: Consumer Grievance Redressal through Large Language Models	Shrey Ganatra et.al.	2507.04854	null
2025-07-07	Dialogue-Based Multi-Dimensional Relationship Extraction from Novels	Yuchen Yan et.al.	2507.04852	null
2025-07-07	Spec-TOD: A Specialized Instruction-Tuned LLM Framework for Efficient Task-Oriented Dialogue Systems	Quang-Vinh Nguyen et.al.	2507.04841	null
2025-07-07	RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction	Johannes Künzel et.al.	2507.04839	null
2025-07-07	The Geopolitical Determinants of Economic Growth, 1960-2019	Tianyu Fan et.al.	2507.04833	null
2025-07-07	Harnessing Pairwise Ranking Prompting Through Sample-Efficient Ranking Distillation	Junru Wu et.al.	2507.04820	null
2025-07-07	Application and Evaluation of Large Language Models for Forecasting the Impact of Traffic Incidents	George Jagadeesh et.al.	2507.04803	null
2025-07-07	Generalization bounds for score-based generative models: a synthetic proof	Arthur Stéphanovitch et.al.	2507.04794	null
2025-07-07	Reason to Rote: Rethinking Memorization in Reasoning	Yupei Du et.al.	2507.04782	null
2025-07-07	From Imitation to Innovation: The Emergence of AI Unique Artistic Styles and the Challenge of Copyright Protection	Zexi Jia et.al.	2507.04769	null
2025-07-07	ABench-Physics: Benchmarking Physical Reasoning in LLMs via High-Difficulty and Dynamic Physics Problems	Yiming Zhang et.al.	2507.04766	null
2025-07-07	GraphBrep: Learning B-Rep in Graph Structure for Efficient CAD Generation	Weilin Lai et.al.	2507.04765	null
2025-07-07	Intervening to learn and compose disentangled representations	Alex Markham et.al.	2507.04754	null
2025-07-07	Large Language Models for Network Intrusion Detection Systems: Foundations, Implementations, and Future Directions	Shuo Yang et.al.	2507.04752	null
2025-07-07	LLMs as Architects and Critics for Multi-Source Opinion Summarization	Anuj Attri et.al.	2507.04751	null
2025-07-07	LLM-based Question-Answer Framework for Sensor-driven HVAC System Interaction	Sungmin Lee et.al.	2507.04748	null
2025-07-07	Activation Steering for Chain-of-Thought Compression	Seyedarmin Azizi et.al.	2507.04742	null
2025-07-07	ChipSeek-R1: Generating Human-Surpassing RTL with LLM via Hierarchical Reward-Driven Reinforcement Learning	Zhirong Chen et.al.	2507.04736	null
2025-07-07	An analysis of vision-language models for fabric retrieval	Francesco Giuliari et.al.	2507.04735	null
2025-07-07	“This Suits You the Best”: Query Focused Comparative Explainable Summarization	Arnav Attri et.al.	2507.04733	null
2025-07-07	Who’s the Mole? Modeling and Detecting Intention-Hiding Malicious Agents in LLM-Based Multi-Agent Systems	Yizhe Xie et.al.	2507.04724	null
2025-07-07	LOOM-Scope: a comprehensive and efficient LOng-cOntext Model evaluation framework	Zecheng Tang et.al.	2507.04723	null
2025-07-07	Geometric-Guided Few-Shot Dental Landmark Detection with Human-Centric Foundation Model	Anbang Wang et.al.	2507.04710	null
2025-07-07	Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce	Arnav Attri et.al.	2507.04708	null
2025-07-07	Tempo-R0: A Video-MLLM for Temporal Video Grounding through Efficient Temporal Sensing Reinforcement Learning	Feng Yue et.al.	2507.04702	null
2025-07-07	XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL	Yifu Liu et.al.	2507.04701	null
2025-07-07	A Visual Leap in CLIP Compositionality Reasoning through Generation of Counterfactual Sets	Zexi Jia et.al.	2507.04699	null
2025-07-07	Performance Evaluation of General Purpose Large Language Models for Basic Linear Algebra Subprograms Code Generation	Daichi Mukunoki et.al.	2507.04697	null
2025-07-07	AKEGEN: A LLM-based Tabular Corpus Generator for Evaluating Dataset Discovery in Data Lakes	Zhenwei Dai et.al.	2507.04687	null
2025-07-07	ChangeBridge: Spatiotemporal Image Generation with Multimodal Controls for Remote Sensing	Zhenghui Zhao et.al.	2507.04678	null
2025-07-07	VectorLLM: Human-like Extraction of Structured Building Contours via Multimodal LLMs	Tao Zhang et.al.	2507.04664	null
2025-07-07	MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding	Zhicheng Zhang et.al.	2507.04635	null
2025-07-07	Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models?	Yun Qu et.al.	2507.04632	null
2025-07-07	Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts	Yun Wang et.al.	2507.04631	null
2025-07-07	Heterogeneous User Modeling for LLM-based Recommendation	Honghui Bao et.al.	2507.04626	null
2025-07-07	Knowledge-Aware Self-Correction in Language Models via Structured Memory Graphs	Swayamjit Saha et.al.	2507.04625	null
2025-07-07	Hierarchical Intent-guided Optimization with Pluggable LLM-Driven Semantics for Session-based Recommendation	Jinpeng Chen et.al.	2507.04623	null
2025-07-07	Multimodal LLM Integrated Semantic Communications for 6G Immersive Experiences	Yusong Zhang et.al.	2507.04621	null
2025-07-07	any4: Learned 4-bit Numeric Representation for LLMs	Mostafa Elhoushi et.al.	2507.04610	null
2025-07-07	PRIME: Large Language Model Personalization with Cognitive Memory and Thought Processes	Xinliang Frederick Zhang et.al.	2507.04607	null
2025-07-07	QR-LoRA: Efficient and Disentangled Fine-tuning via QR Decomposition for Customized Generation	Jiahui Yang et.al.	2507.04599	null
2025-07-06	Evaluating LLMs on Real-World Forecasting Against Human Superforecasters	Janna Lu et.al.	2507.04562	null
2025-07-06	MambaVideo for Discrete Video Tokenization with Channel-Split Quantization	Dawit Mureja Argaw et.al.	2507.04559	null
2025-07-06	Self-supervised learning of speech representations with Dutch archival data	Nik Vaessen et.al.	2507.04554	null
2025-07-06	Greedy Dynamic Matching	Nick Arnosti et.al.	2507.04551	null
2025-07-06	DP-Fusion: Token-Level Differentially Private Inference for Large Language Models	Rushil Thareja et.al.	2507.04531	null
2025-07-06	DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging	Neha Verma et.al.	2507.04517	null
2025-07-06	Unveiling the Potential of Diffusion Large Language Model in Controllable Generation	Zhen Xiong et.al.	2507.04504	null
2025-07-06	A validity-guided workflow for robust large language model research in psychology	Zhicheng Lin et.al.	2507.04491	null
2025-07-06	Source Attribution in Retrieval-Augmented Generation	Ikhtiyor Nematov et.al.	2507.04480	null
2025-07-06	Model Inversion Attacks on Llama 3: Extracting PII from Large Language Models	Sathesh P. Sivashanmugam et.al.	2507.04478	null
2025-07-06	The role of large language models in UI/UX design: A systematic literature review	Ammar Ahmed et.al.	2507.04469	null
2025-07-06	GradOT: Training-free Gradient-preserving Offsite-tuning for Large Language Models	Kai Yao et.al.	2507.04455	null
2025-07-06	ESSA: Evolutionary Strategies for Scalable Alignment	Daria Korotyshova et.al.	2507.04453	null
2025-07-03	MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real	Renhao Wang et.al.	2507.02864	null
2025-07-03	RefTok: Reference-Based Tokenization for Video Generation	Xiang Fan et.al.	2507.02862	null
2025-07-03	Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching	Xin Zhou et.al.	2507.02860	null
2025-07-03	Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation	Jiaer Xia et.al.	2507.02859	null
2025-07-03	Requirements Elicitation Follow-Up Question Generation	Yuchen Shen et.al.	2507.02858	null
2025-07-03	MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs	Purbesh Mitra et.al.	2507.02851	null
2025-07-03	Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection	Ziqi Miao et.al.	2507.02844	null
2025-07-03	LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding	Yuchen Ma et.al.	2507.02843	null
2025-07-03	StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to Reason	Kaiyi Zhang et.al.	2507.02841	null
2025-07-03	ExPO: Unlocking Hard Reasoning with Self-Explanation-Guided Reinforcement Learning	Ruiyang Zhou et.al.	2507.02834	null
2025-07-03	SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model	Wencheng Zhang et.al.	2507.02822	null
2025-07-03	Multimodal Mathematical Reasoning with Diverse Solving Perspective	Wenhao Shi et.al.	2507.02804	null
2025-07-03	Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models	Riccardo Cantini et.al.	2507.02799	null
2025-07-03	No time to train! Training-Free Reference-Based Instance Segmentation	Miguel Espinosa et.al.	2507.02798	null
2025-07-03	From Long Videos to Engaging Clips: A Human-Inspired Video Editing Framework with Multimodal Narrative Understanding	Xiangfeng Wang et.al.	2507.02790	null
2025-07-03	Moral Responsibility or Obedience: What Do We Want from AI?	Joseph Boland et.al.	2507.02788	null
2025-07-03	Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs	Ken Tsui et.al.	2507.02778	null
2025-07-03	KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs	Yuzhang Xie et.al.	2507.02773	null
2025-07-03	Grounding Intelligence in Movement	Melanie Segado et.al.	2507.02771	null
2025-07-03	DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment	Ke-Han Lu et.al.	2507.02768	null
2025-07-03	Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work	Guangwei Zhang et.al.	2507.02760	null
2025-07-03	Fast and Simplex: 2-Simplicial Attention in Triton	Aurko Roy et.al.	2507.02754	null
2025-07-03	Who’s Sorry Now: User Preferences Among Rote, Empathic, and Explanatory Apologies from LLM Chatbots	Zahra Ashktorab et.al.	2507.02745	null
2025-07-03	Prompt learning with bounding box constraints for medical image segmentation	Mélanie Gaillochet et.al.	2507.02743	null
2025-07-03	Early Signs of Steganographic Capabilities in Frontier LLMs	Artur Zolkowski et.al.	2507.02737	null
2025-07-03	Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving	Matthieu Zimmer et.al.	2507.02726	null
2025-07-03	On the Convergence of Large Language Model Optimizer for Black-Box Network Management	Hoon Lee et.al.	2507.02689	null
2025-07-03	Embedding-Based Federated Data Sharing via Differentially Private Conditional VAEs	Francesco Di Salvo et.al.	2507.02671	null
2025-07-03	AIGI-Holmes: Towards Explainable and Generalizable AI-Generated Image Detection via Multimodal Large Language Models	Ziyin Zhou et.al.	2507.02664	null
2025-07-03	Hey AI, Generate Me a Hardware Code! Agentic AI-based Hardware Design & Verification	Deepak Narayan Gadde et.al.	2507.02660	null
2025-07-03	Medical Data Pecking: A Context-Aware Approach for Automated Quality Evaluation of Structured Medical Data	Irena Girshovitz et.al.	2507.02628	null
2025-07-03	VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning	Siran Chen et.al.	2507.02626	null
2025-07-03	FlowSpec: Continuous Pipelined Speculative Decoding for Efficient Distributed LLM Inference	Xing Liu et.al.	2507.02620	null
2025-07-03	Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory	Kenneth Payne et.al.	2507.02618	null
2025-07-03	DynamiCare: A Dynamic Multi-Agent Framework for Interactive and Open-Ended Medical Decision-Making	Tianqi Shang et.al.	2507.02616	null
2025-07-03	De-AntiFake: Rethinking the Protective Perturbations Against Voice Cloning Attacks	Wei Fan et.al.	2507.02606	null
2025-07-03	MPF: Aligning and Debiasing Language Models post Deployment via Multi Perspective Fusion	Xin Guan et.al.	2507.02595	null
2025-07-03	Revisiting Active Learning under (Human) Label Variation	Cornelia Gruber et.al.	2507.02593	null
2025-07-03	Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning	Buzhen Huang et.al.	2507.02565	null
2025-07-03	LLMREI: Automating Requirements Elicitation Interviews with LLMs	Alexander Korn et.al.	2507.02564	null
2025-07-03	Transformers Don’t Need LayerNorm at Inference Time: Scaling LayerNorm Removal to GPT-2 XL and the Implications for Mechanistic Interpretability	Luca Baroni et.al.	2507.02559	null
2025-07-03	Clarifying Before Reasoning: A Coq Prover with Structural Context	Yanzhen Lu et.al.	2507.02541	null
2025-07-03	Are You Listening to Me? Fine-Tuning Chatbots for Empathetic Dialogue	Paulo Ricardo Knob et.al.	2507.02537	null
2025-07-03	Meta-Fair: AI-Assisted Fairness Testing of Large Language Models	Miguel Romero-Arjona et.al.	2507.02533	null
2025-07-03	Open-Source System for Multilingual Translation and Cloned Speech Synthesis	Mateo Cámara et.al.	2507.02530	null
2025-07-03	RetrySQL: text-to-SQL training with retry data for self-correcting query generation	Alicja Rączkowska et.al.	2507.02529	null
2025-07-03	Continual Gradient Low-Rank Projection Fine-Tuning for LLMs	Chenxu Wang et.al.	2507.02503	null
2025-07-03	CrowdTrack: A Benchmark for Difficult Multiple Pedestrian Tracking in Real Scenarios	Teng Fu et.al.	2507.02479	null
2025-07-03	System-performance and cost modeling of Large Language Model training and inference	Wenzhe Guo et.al.	2507.02456	null
2025-07-03	Introducing a New Brexit-Related Uncertainty Index: Its Evolution and Economic Consequences	Ismet Gocer et.al.	2507.02439	null
2025-07-03	Toward a Robust and Generalizable Metamaterial Foundation Model	Namjung Kim et.al.	2507.02436	null
2025-07-03	Improving Consistency in Vehicle Trajectory Prediction Through Preference Optimization	Caio Azevedo et.al.	2507.02406	null
2025-07-03	Evaluating Language Models For Threat Detection in IoT Security Logs	Jorge J. Tejero-Fernández et.al.	2507.02390	null
2025-07-03	JoyTTS: LLM-based Spoken Chatbot With Voice Cloning	Fangru Zhou et.al.	2507.02380	null
2025-07-03	Efficient Code LLM Training via Distribution-Consistent and Diversity-Aware Data Selection	Weijie Lyu et.al.	2507.02378	null
2025-07-03	UVLM: Benchmarking Video Language Model for Underwater World Understanding	Xizhe Xue et.al.	2507.02373	null
2025-07-03	Holistic Tokenizer for Autoregressive Image Generation	Anlin Zheng et.al.	2507.02358	null
2025-07-03	Coling-UniA at SciVQA 2025: Few-Shot Example Retrieval and Confidence-Informed Ensembling for Multimodal Large Language Models	Christian Jaumann et.al.	2507.02357	null
2025-07-03	DoMIX: An Efficient Framework for Exploiting Domain Knowledge in Fine-Tuning	Dohoon Kim et.al.	2507.02302	null
2025-07-03	Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization	De Cheng et.al.	2507.02288	null
2025-07-03	Misaligned from Within: Large Language Models Reproduce Our Double-Loop Learning Blindness	Tim Rogers et.al.	2507.02283	null
2025-07-03	Content filtering methods for music recommendation: A review	Terence Zeng et.al.	2507.02282	null
2025-07-03	LaCo: Efficient Layer-wise Compression of Visual Tokens for Multimodal Large Language Models	Juntao Liu et.al.	2507.02279	null
2025-07-03	NLP4Neuro: Sequence-to-sequence learning for neural population decoding	Jacob J. Morra et.al.	2507.02264	null
2025-07-03	Uncertainty-aware Reward Design Process	Yang Yang et.al.	2507.02256	null
2025-07-03	Listwise Preference Alignment Optimization for Tail Item Recommendation	Zihao Li et.al.	2507.02255	null
2025-07-03	Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous Evaluation	Jungkoo Kang et.al.	2507.02253	null
2025-07-03	SurgVisAgent: Multimodal Agentic Model for Versatile Surgical Visual Enhancement	Zeyu Lei et.al.	2507.02252	null
2025-07-03	VERBA: Verbalizing Model Differences Using Large Language Models	Shravan Doda et.al.	2507.02241	null
2025-07-03	DecoRTL: A Run-time Decoding Framework for RTL Code Generation with LLMs	Mohammad Akyash et.al.	2507.02226	null
2025-07-03	GDC Cohort Copilot: An AI Copilot for Curating Cohorts from the Genomic Data Commons	Steven Song et.al.	2507.02221	null
2025-07-02	ESTR-CoT: Towards Explainable and Accurate Event Stream based Scene Text Recognition with Chain-of-Thought Reasoning	Xiao Wang et.al.	2507.02200	null
2025-07-02	EvalAssist: A Human-Centered Tool for LLM-as-a-Judge	Zahra Ashktorab et.al.	2507.02186	null
2025-07-02	Computer Science Education in the Age of Generative AI	Russell Beale et.al.	2507.02183	null
2025-07-02	Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models	Fangjian Lei et.al.	2507.02182	null
2025-07-02	The Revolution Has Arrived: What the Current State of Large Language Models in Education Implies for the Future	Russell Beale et.al.	2507.02180	null
2025-07-02	Data Diversification Methods In Alignment Enhance Math Performance In LLMs	Berkan Dokmeci et.al.	2507.02173	null
2025-07-02	Reasoning or Not? A Comprehensive Evaluation of Reasoning LLMs for Dialogue Summarization	Keyan Jin et.al.	2507.02145	null
2025-07-02	When LLMs Disagree: Diagnosing Relevance Filtering Bias and Retrieval Divergence in SDG Search	William A. Ingram et.al.	2507.02139	null
2025-07-02	Dissecting the Impact of Mobile DVFS Governors on LLM Inference Performance and Energy Efficiency	Zongpu Zhang et.al.	2507.02135	null
2025-07-02	BACTA-GPT: An AI-Based Bayesian Adaptive Clinical Trial Architect	Krishna Padmanabhan et.al.	2507.02130	null
2025-07-02	Generative Latent Diffusion for Efficient Spatiotemporal Data Reduction	Xiao Li et.al.	2507.02129	null
2025-07-02	CROP: Circuit Retrieval and Optimization with Parameter Guidance using LLMs	Jingyu Pan et.al.	2507.02128	null
2025-07-02	SAKURAONE: Empowering Transparent and Open AI Platforms through Private-Sector HPC Investment in Japan	Fumikazu Konishi et.al.	2507.02124	null
2025-07-02	PAL: Designing Conversational Agents as Scalable, Cooperative Patient Simulators for Palliative-Care Training	Neil K. R. Sehgal et.al.	2507.02122	null
2025-07-02	What Neuroscience Can Teach AI About Learning in Continuously Changing Environments	Daniel Durstewitz et.al.	2507.02103	null
2025-07-02	The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems	Reza Yousefi Maragheh et.al.	2507.02097	null
2025-07-02	Sample Complexity Bounds for Linear Constrained MDPs with a Generative Model	Xingtu Liu et.al.	2507.02089	null
2025-07-02	McBE: A Multi-task Chinese Bias Evaluation Benchmark for Large Language Models	Tian Lan et.al.	2507.02088	null
2025-07-02	Evaluating the Promise and Pitfalls of LLMs in Hiring Decisions	Eitan Anzenberg et.al.	2507.02087	null
2025-07-02	Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab	Haonan Duan et.al.	2507.02083	null
2025-07-02	Reasoning on a Budget: A Survey of Adaptive and Controllable Test-Time Compute in LLMs	Mohammad Ali Alomrani et.al.	2507.02076	null
2025-07-02	Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges	Sanjeda Akter et.al.	2507.02074	null
2025-07-02	MGC: A Compiler Framework Exploiting Compositional Blindness in Aligned LLMs for Malware Generation	Lu Yan et.al.	2507.02057	null
2025-07-02	How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks	Rahul Ramachandran et.al.	2507.01955	null
2025-07-02	Test-Time Scaling with Reflective Generative Model	Zixiao Wang et.al.	2507.01951	null
2025-07-02	Kwai Keye-VL Technical Report	Kwai Keye Team et.al.	2507.01949	null
2025-07-02	LongAnimation: Long Animation Generation with Dynamic Global-Local Memory	Nan Chen et.al.	2507.01945	null
2025-07-02	SpecCLIP: Aligning and Translating Spectroscopic Measurements for Stars	Xiaosheng Zhao et.al.	2507.01939	null
2025-07-02	The Thin Line Between Comprehension and Persuasion in LLMs	Adrian de Wynter et.al.	2507.01936	null
2025-07-02	Large Language Model-Driven Closed-Loop UAV Operation with Semantic Observations	Wenhao Wang et.al.	2507.01930	null
2025-07-02	A Survey on Vision-Language-Action Models: An Action Tokenization Perspective	Yifan Zhong et.al.	2507.01925	null
2025-07-02	Decision-oriented Text Evaluation	Yu-Shiang Huang et.al.	2507.01923	null
2025-07-02	Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models	Chengao Li et.al.	2507.01915	null
2025-07-02	Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning	Qingdong He et.al.	2507.01908	null
2025-07-02	AI4Research: A Survey of Artificial Intelligence for Scientific Research	Qiguang Chen et.al.	2507.01903	null
2025-07-02	High-Layer Attention Pruning with Rescaling	Songtao Liu et.al.	2507.01900	null
2025-07-02	MiCoTA: Bridging the Learnability Gap with Intermediate CoT and Teacher Assistants	Dongyi Ding et.al.	2507.01887	null
2025-07-02	Improving GANs by leveraging the quantum noise from real hardware	Hongni Jin et.al.	2507.01886	null
2025-07-02	A computationally frugal open-source foundation model for thoracic disease detection in lung cancer screening programs	Niccolò McConnell et.al.	2507.01881	null
2025-07-02	Towards Foundation Auto-Encoders for Time-Series Anomaly Detection	Gastón García González et.al.	2507.01875	null
2025-07-02	DIY-MKG: An LLM-Based Polyglot Language Learning System	Kenan Tang et.al.	2507.01872	null
2025-07-02	Bridging UI Design and chatbot Interactions: Applying Form-Based Principles to Conversational Agents	Sanjay Krishna Anbalagan et.al.	2507.01862	null
2025-07-02	TypeTele: Releasing Dexterity in Teleoperation by Dexterous Manipulation Types	Yuhao Lin et.al.	2507.01857	null
2025-07-02	Eka-Eval: A Comprehensive Evaluation Framework for Large Language Models in Indian Languages	Samridhi Raj Sinha et.al.	2507.01853	null
2025-07-02	Low-Perplexity LLM-Generated Sequences and Where To Find Them	Arthur Wuhrmann et.al.	2507.01844	null
2025-07-02	Out-of-Distribution Detection Methods Answer the Wrong Questions	Yucen Lily Li et.al.	2507.01831	null
2025-07-02	APRMCTS: Improving LLM-based Automated Program Repair with Iterative Tree Search	Haichuan Hu et.al.	2507.01827	null
2025-07-02	LoRA Fine-Tuning Without GPUs: A CPU-Efficient Meta-Generation Framework for LLMs	Reza Arabpour et.al.	2507.01806	null
2025-07-02	Towards Decentralized and Sustainable Foundation Model Training with the Edge	Leyang Xue et.al.	2507.01803	null
2025-07-02	HCNQA: Enhancing 3D VQA with Hierarchical Concentration Narrowing Supervision	Shengli Zhou et.al.	2507.01800	null
2025-07-02	Robust brain age estimation from structural MRI with contrastive learning	Carlo Alberto Barbano et.al.	2507.01794	null
2025-07-02	Machine learning prediction of a chemical reaction over 8 decades of energy	Daniel Julian et.al.	2507.01793	null
2025-07-02	FreeLoRA: Enabling Training-Free LoRA Fusion for Autoregressive Multi-Subject Personalization	Peng Zheng et.al.	2507.01792	null
2025-07-02	MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining	Zhixun Chen et.al.	2507.01785	null
2025-07-02	Frontiers of Generative AI for Network Optimization: Theories, Limits, and Visions	Bo Yang et.al.	2507.01773	null
2025-07-02	Enhanced Generative Model Evaluation with Clipped Density and Coverage	Nicolas Salvy et.al.	2507.01761	null
2025-07-02	Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis	Peng Zheng et.al.	2507.01756	null
2025-07-02	Tuning without Peeking: Provable Privacy and Generalization Bounds for LLM Post-Training	Ismail Labiad et.al.	2507.01752	null
2025-07-02	LLMs for Legal Subsumption in German Employment Contracts	Oliver Wardas et.al.	2507.01734	null
2025-07-02	Token Communication in the Era of Large Models: An Information Bottleneck-Based Approach	Hao Wei et.al.	2507.01728	null
2025-07-02	Generative flow-based warm start of the variational quantum eigensolver	Hang Zou et.al.	2507.01726	null
2025-07-02	Agent Ideate: A Framework for Product Idea Generation from Patents Using Agentic AI	Gopichand Kanumolu et.al.	2507.01717	null
2025-07-02	Generative modeling of convergence maps based on predicted one-point statistics	Vilasini Tinnaneri Sreekanth et.al.	2507.01707	null
2025-07-02	AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness	Zixin Chen et.al.	2507.01702	null
2025-07-02	Graph Representation-based Model Poisoning on Federated LLMs in CyberEdge Networks	Hanlin Cai et.al.	2507.01694	null
2025-07-02	GPT, But Backwards: Exactly Inverting Language Model Outputs	Adrians Skapars et.al.	2507.01693	null
2025-07-02	A generative modeling / Physics-Informed Neural Network approach to random differential equations	Georgios Arampatzis et.al.	2507.01687	null
2025-07-02	Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling	Zeyu Huang et.al.	2507.01679	null
2025-07-02	AsyncFlow: An Asynchronous Streaming RL Framework for Efficient LLM Post-Training	Zhenyu Han et.al.	2507.01663	null
2025-07-02	SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement	Weijie Yin et.al.	2507.01643	null
2025-07-02	DaiFu: In-Situ Crash Recovery for Deep Learning Systems	Zilong He et.al.	2507.01628	null
2025-07-02	Chart Question Answering from Real-World Analytical Narratives	Maeve Hutchinson et.al.	2507.01627	null
2025-07-02	Data Agent: A Holistic Architecture for Orchestrating Data+AI Ecosystems	Zhaoyan Sun et.al.	2507.01599	null
2025-07-02	Emotionally Intelligent Task-oriented Dialogue Systems: Architecture, Representation, and Optimisation	Shutong Feng et.al.	2507.01594	null
2025-07-02	A Gift from the Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation	Hao Wang et.al.	2507.01573	null
2025-07-02	Self-Guided Process Reward Optimization with Masked Step Advantage for Process Reinforcement Learning	Wu Fei et.al.	2507.01551	null
2025-07-02	Crafting Hanzi as Narrative Bridges: An AI Co-Creation Workshop for Elderly Migrants	Wen Zhan et.al.	2507.01548	null
2025-07-02	MARVIS: Modality Adaptive Reasoning over VISualizations	Benjamin Feuer et.al.	2507.01544	null
2025-07-02	Is External Information Useful for Stance Detection with LLMs?	Quang Minh Nguyen et.al.	2507.01543	null
2025-07-02	Efficient Out-of-Scope Detection in Dialogue Systems via Uncertainty-Driven LLM Routing	Álvaro Zaera et.al.	2507.01541	null
2025-07-02	Loss Functions in Diffusion Models: A Comparative Study	Dibyanshu Kumar et.al.	2507.01516	null
2025-07-02	SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism	Beitao Chen et.al.	2507.01513	null
2025-07-02	AVC-DPO: Aligned Video Captioning via Direct Preference Optimization	Jiyang Tang et.al.	2507.01492	null
2025-07-02	Agent-as-Tool: A Study on the Hierarchical Decision Making with Reinforcement Learning	Yanfei Zhang et.al.	2507.01489	null
2025-07-02	BioMARS: A Multi-Agent Robotic System for Autonomous Biological Experiments	Yibo Qiu et.al.	2507.01485	null
2025-07-02	Evaluating the Effectiveness of Direct Preference Optimization for Personalizing German Automatic Text Simplifications for Persons with Intellectual Disabilities	Yingqiang Gao et.al.	2507.01479	null
2025-07-02	Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think	Ge Wu et.al.	2507.01467	null
2025-07-02	NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation	Max Gandyra et.al.	2507.01463	null
2025-07-02	Using multi-agent architecture to mitigate the risk of LLM hallucinations	Abd Elrahman Amer et.al.	2507.01446	null
2025-07-02	A Large Language Model for Chemistry and Retrosynthesis Predictions	Yueqing Zhang et.al.	2507.01444	null
2025-07-02	EdgeLoRA: An Efficient Multi-Tenant LLM Serving System on Edge Devices	Zheyu Shen et.al.	2507.01438	null
2025-07-02	Challenges & Opportunities with LLM-Assisted Visualization Retargeting	Luke S. Snyder et.al.	2507.01436	null
2025-07-02	Pensieve Grader: An AI-Powered, Ready-to-Use Platform for Effortless Handwritten STEM Grading	Yoonseok Yang et.al.	2507.01431	null
2025-07-02	TriVLA: A Unified Triple-System-Based Unified Vision-Language-Action Model for General Robot Control	Zhenyang Liu et.al.	2507.01424	null
2025-07-02	Evaluating LLM Agent Collusion in Double Auctions	Kushal Agrawal et.al.	2507.01413	null
2025-07-02	BronchoGAN: Anatomically consistent and domain-agnostic image-to-image translation for video bronchoscopy	Ahmad Soliman et.al.	2507.01387	null
2025-07-02	RALLY: Role-Adaptive LLM-Driven Yoked Navigation for Agentic UAV Swarms	Ziyao Wang et.al.	2507.01378	null
2025-07-02	AI Agents and Agentic AI-Navigating a Plethora of Concepts for Future Manufacturing	Yinwang Ren et.al.	2507.01376	null
2025-07-02	Activation Reward Models for Few-Shot Model Alignment	Tianning Chai et.al.	2507.01368	null
2025-07-02	Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy	Chris Yuhao Liu et.al.	2507.01352	null
2025-07-02	SpeechAccentLLM: A Unified Framework for Foreign Accent Conversion and Text to Speech	Cheng Zhuangfei et.al.	2507.01348	null
2025-07-02	LEDOM: An Open and Fundamental Reverse Language Model	Xunjian Yin et.al.	2507.01335	null
2025-07-02	Symbolic or Numerical? Understanding Physics Problem Solving in Reasoning LLMs	Nifu Dan et.al.	2507.01334	null
2025-07-02	Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy	Xiaoyun Zhang et.al.	2507.01327	null
2025-07-02	ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks	Zhiyao Ren et.al.	2507.01321	null
2025-07-02	La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation	Kai Liu et.al.	2507.01299	null
2025-07-02	Beyond Black-Box AI: Interpretable Hybrid Systems for Dementia Care	Matthew JY Kang et.al.	2507.01282	null
2025-07-02	Rethinking All Evidence: Enhancing Trustworthy Retrieval-Augmented Generation via Conflict-Driven Summarization	Juan Chen et.al.	2507.01281	null
2025-07-02	Evaluating Large Language Models for Multimodal Simulated Ophthalmic Decision-Making in Diabetic Retinopathy and Glaucoma Screening	Cindy Lie Tabuse et.al.	2507.01278	null
2025-07-02	AI Meets Maritime Training: Precision Analytics for Enhanced Safety and Performance	Vishakha Lall et.al.	2507.01274	null
2025-07-02	PULSE: Practical Evaluation Scenarios for Large Multimodal Model Unlearning	Tatsuki Kawakami et.al.	2507.01271	null
2025-07-02	LLM-based Realistic Safety-Critical Driving Video Generation	Yongjie Fu et.al.	2507.01264	null
2025-07-02	GAIus: Combining Genai with Legal Clauses Retrieval for Knowledge-based Assistant	Michał Matak et.al.	2507.01259	null
2025-07-01	Beyond First-Order: Training LLMs with Stochastic Conjugate Subgradients and AdamW	Di Zhang et.al.	2507.01241	null
2025-07-01	PAE MobiLLM: Privacy-Aware and Efficient LLM Fine-Tuning on the Mobile Device via Additive Side-Tuning	Xingke Yang et.al.	2507.01216	null
2025-07-01	2024 NASA SUITS Report: LLM-Driven Immersive Augmented Reality User Interface for Robotics and Space Exploration	Kathy Zhuang et.al.	2507.01206	null
2025-07-01	Escaping Plato's Cave: JAM for Aligning Independently Trained Vision and Language Models	Hyoseo et.al.	2507.01201	null
2025-07-01	Are Large Brainwave Foundation Models Capable Yet? Insights from Fine-tuning	Na Lee et.al.	2507.01196	null
2025-07-01	FlashDP: Private Training Large Language Models with Efficient DP-SGD	Liangyu Wang et.al.	2507.01154	null
2025-07-01	SonoGym: High Performance Simulation for Challenging Surgical Tasks with Robotic Ultrasound	Yunke Ao et.al.	2507.01152	null
2025-07-01	Geometry-aware 4D Video Generation for Robot Manipulation	Zeyi Liu et.al.	2507.01099	null
2025-07-01	A theoretical prediction for the dipole in nearby distances using cosmography	Hayley J. Macpherson et.al.	2507.01095	null
2025-07-02	GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning	GLM-V Team et.al.	2507.01006	null
2025-07-01	Teaching Time Series to See and Speak: Forecasting with Aligned Visual and Textual Perspectives	Sixun Dong et.al.	2506.24124	null
2025-06-30	Calligrapher: Freestyle Text Image Customization	Yue Ma et.al.	2506.24123	null
2025-06-30	TextMesh4D: High-Quality Text-to-4D Mesh Generation	Sisi Dai et.al.	2506.24121	null
2025-06-30	Data Uniformity Improves Training Efficiency and More, with a Convergence Framework Beyond the NTK Regime	Yuqing Wang et.al.	2506.24120	null
2025-06-30	DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World	Xiangtai Li et.al.	2506.24102	null
2025-06-30	Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention	Wonwoong Cho et.al.	2506.24085	null
2025-06-30	Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models	Tung-Ling Li et.al.	2506.24056	null
2025-06-30	Agent.xpu: Efficient Scheduling of Agentic LLM Workloads on Heterogeneous SoC	Xinming Wei et.al.	2506.24045	null
2025-06-30	A Survey on Vision-Language-Action Models for Autonomous Driving	Sicong Jiang et.al.	2506.24044	null
2025-06-30	Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data	Shubhabrata Mukherjee et.al.	2506.24039	null
2025-06-30	Minimally dissipative multi-bit logical operations	Jérémie Klinger et.al.	2506.24021	null
2025-06-30	Ella: Embodied Social Agents with Lifelong Memory	Hongxin Zhang et.al.	2506.24019	null
2025-06-30	EXPERT: An Explainable Image Captioning Evaluation Metric with Structured Explanations	Hyunjong Kim et.al.	2506.24016	null
2025-06-30	Large Language Models Don’t Make Sense of Word Problems. A Scoping Review from a Mathematics Education Perspective	Anselm R. Strohmaier et.al.	2506.24006	null
2025-06-30	Auto-TA: Towards Scalable Automated Thematic Analysis (TA) via Multi-Agent Large Language Models with Reinforcement Learning	Seungjun Yi et.al.	2506.23998	null
2025-06-30	TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data Generation	Renren Jin et.al.	2506.23979	null
2025-06-30	Visual and Memory Dual Adapter for Multi-Modal Object Tracking	Boyue Xu et.al.	2506.23972	null
2025-06-30	UMA: A Family of Universal Models for Atoms	Brandon M. Wood et.al.	2506.23971	null
2025-06-30	Unveiling Decision-Making in LLMs for Text Classification: Extraction of influential and interpretable concepts with Sparse Autoencoders	Mathis Le Bail et.al.	2506.23951	null
2025-06-30	AI Risk-Management Standards Profile for General-Purpose AI (GPAI) and Foundation Models	Anthony M. Barrett et.al.	2506.23949	null
2025-07-01	Graft: Integrating the Domain Knowledge via Efficient Parameter Synergy for MLLMs	Yang Dai et.al.	2506.23940	null
2025-06-30	Leveraging the Potential of Prompt Engineering for Hate Speech Detection in Low-Resource Languages	Ruhina Tabasshum Prome et.al.	2506.23930	null
2025-06-30	IMPACT: Inflectional Morphology Probes Across Complex Typologies	Mohammed J. Saeed et.al.	2506.23929	null
2025-06-30	Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice	Akshit Kumar et.al.	2506.23924	null
2025-06-30	The Trilemma of Truth in Large Language Models	Germans Savcisens et.al.	2506.23921	null
2025-06-30	World4Omni: A Zero-Shot Framework from Image Generation World Model to Robotic Manipulation	Haonan Chen et.al.	2506.23919	null
2025-06-30	Advancing Multi-Step Mathematical Reasoning in Large Language Models through Multi-Layered Self-Reflection with Auto-Prompting	André de Souza Loureiro et.al.	2506.23888	null
2025-06-30	Scaling Self-Supervised Representation Learning for Symbolic Piano Performance	Louis Bradshaw et.al.	2506.23869	null
2025-06-30	Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression	Marc Ratkovic et.al.	2506.23862	null
2025-06-30	Email as the Interface to Generative AI Models: Seamless Administrative Automation	Andres Navarro et.al.	2506.23850	null
2025-06-30	A Survey on Autonomy-Induced Security Risks in Large Model-Based Agents	Hang Su et.al.	2506.23844	null
2025-06-30	Refine Any Object in Any Scene	Ziwei Chen et.al.	2506.23835	null
2025-06-30	Towards the “Digital Me”: A vision of authentic Conversational Agents powered by personal Human Digital Twins	Lluís C. Coll et.al.	2506.23826	null
2025-06-30	Flash-VStream: Efficient Real-Time Understanding for Long Video Streams	Haoji Zhang et.al.	2506.23825	null
2025-07-01	The Impact of AI on Educational Assessment: A Framework for Constructive Alignment	Patrick Stokkink et.al.	2506.23815	null
2025-06-30	Leveraging a Multi-Agent LLM-Based System to Educate Teachers in Hate Incidents Management	Ewelina Gajewska et.al.	2506.23774	null
2025-06-30	Software Engineering for Large Language Models: Research Status, Challenges and the Road Ahead	Hongzhou Rao et.al.	2506.23762	null
2025-06-30	A Survey of LLM-based Automated Program Repair: Taxonomies, Design Paradigms, and Applications	Boyang Yang et.al.	2506.23749	null
2025-07-01	Positional Bias in Binary Question Answering: How Uncertainty Shapes Model Preferences	Tiziano Labruna et.al.	2506.23743	null
2025-06-30	AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data	JiaRu Wu et.al.	2506.23735	null
2025-06-30	Radioactive Watermarks in Diffusion and Autoregressive Image Generative Models	Michel Meintz et.al.	2506.23731	null
2025-06-30	System-Embedded Diffusion Bridge Models	Bartlomiej Sobieski et.al.	2506.23726	null
2025-06-30	PAC Bench: Do Foundation Models Understand Prerequisites for Executing Manipulation Policies?	Atharva Gundawar et.al.	2506.23725	null
2025-06-30	MDPG: Multi-domain Diffusion Prior Guidance for MRI Reconstruction	Lingtong Zhang et.al.	2506.23701	null
2025-06-30	MedSAM-CA: A CNN-Augmented ViT with Attention-Enhanced Multi-Scale Fusion for Medical Image Segmentation	Peiting Tian et.al.	2506.23700	null
2025-06-30	If You Had to Pitch Your Ideal Software – Evaluating Large Language Models to Support User Scenario Writing for User Experience Experts and Laypersons	Patrick Stadler et.al.	2506.23694	null
2025-06-30	Agent4S: The Transformation of Research Paradigms from the Perspective of Large Language Models	Boyuan Zheng et.al.	2506.23692	null
2025-06-30	SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation	Shuai Tan et.al.	2506.23690	null
2025-06-30	PokéAI: A Goal-Generating, Battle-Optimizing Multi-agent System for Pokemon Red	Zihao Liu et.al.	2506.23689	null
2025-06-30	Interactive Reasoning: Visualizing and Controlling Chain-of-Thought Reasoning in Large Language Models	Rock Yuren Pang et.al.	2506.23678	null
2025-06-30	Efficient Interleaved Speech Modeling through Knowledge Distillation	Mohammadmahdi Nouriborji et.al.	2506.23670	null
2025-06-30	L0: Reinforcement Learning to Become General Agents	Junjie Zhang et.al.	2506.23667	null
2025-06-30	On the Domain Robustness of Contrastive Vision-Language Models	Mario Koddenbrock et.al.	2506.23663	null
2025-06-30	Multiscale Turbulence Synthesis: Validation in 2D Hydrodynamics	Pierre Lesaffre et.al.	2506.23659	null
2025-06-30	Act-With-Think: Chunk Auto-Regressive Modeling for Generative Recommendation	Yifan Wang et.al.	2506.23643	null
2025-06-30	VAP-Diffusion: Enriching Descriptions with MLLMs for Enhanced Medical Image Generation	Peng Huang et.al.	2506.23641	null
2025-06-30	Unified Multimodal Understanding via Byte-Pair Visual Encoding	Wanpeng Zhang et.al.	2506.23639	null
2025-06-30	Towards Building Private LLMs: Exploring Multi-Node Expert Parallelism on Apple Silicon for Mixture-of-Experts Large Language Model	Mu-Chi Chen et.al.	2506.23635	null
2025-06-30	TurboVSR: Fantastic Video Upscalers and Where to Find Them	Zhongdao Wang et.al.	2506.23618	null
2025-06-30	Evaluating the Simulation of Human Personality-Driven Susceptibility to Misinformation with LLMs	Manuel Pratelli et.al.	2506.23610	null
2025-06-30	PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum	Shiqi Zhang et.al.	2506.23607	null
2025-06-30	SG-LDM: Semantic-Guided LiDAR Generation via Latent-Aligned Diffusion	Zhengkang Xiang et.al.	2506.23606	null
2025-06-30	AI-Generated Lecture Slides for Improving Slide Element Detection and Retrieval	Suyash Maniyar et.al.	2506.23605	null
2025-06-30	SoK: Semantic Privacy in Large Language Models	Baihe Ma et.al.	2506.23603	null
2025-06-30	Semantic-guided Diverse Decoding for Large Language Model	Weijie Shi et.al.	2506.23601	null
2025-06-30	Transition Matching: Scalable and Flexible Generative Modeling	Neta Shaul et.al.	2506.23589	null
2025-06-30	Dataset Distillation via Vision-Language Category Prototype	Yawen Zou et.al.	2506.23580	null
2025-06-30	Evaluating Multi-Agent Defences Against Jailbreaking Attacks on Large Language Models	Maria Carolina Cornelia Wit et.al.	2506.23576	null
2025-06-30	MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI	Huanjin Yao et.al.	2506.23563	null
2025-06-30	JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching	Mingi Kwon et.al.	2506.23552	null
2025-06-30	Neural Langevin Machine: a local asymmetric learning rule can be creative	Zhendong Yu et.al.	2506.23546	null
2025-06-30	Comparative Analysis of the Code Generated by Popular Large Language Models (LLMs) for MISRA C++ Compliance	Malik Muhammad Umer et.al.	2506.23535	null
2025-06-30	On Recipe Memorization and Creativity in Large Language Models: Is Your Model a Creative Cook, a Bad Cook, or Merely a Plagiator?	Jan Kvapil et.al.	2506.23527	null
2025-06-30	NEU-ESC: A Comprehensive Vietnamese dataset for Educational Sentiment analysis and topic Classification toward multitask learning	Phan Quoc Hung Mai et.al.	2506.23524	null
2025-07-01	ChemActor: Enhancing Automated Extraction of Chemical Synthesis Actions with LLM-Generated Data	Yu Zhang et.al.	2506.23520	null
2025-06-30	Reinforcement Fine-Tuning Enables MLLMs Learning Novel Tasks Stably	Zhihao Zhang et.al.	2506.23508	null
2025-06-30	LLM-enhanced Action-aware Multi-modal Prompt Tuning for Image-Text Matching	Mengxiao Tian et.al.	2506.23502	null
2025-06-30	Thought-Augmented Planning for LLM-Powered Interactive Recommender Agent	Haocheng Yu et.al.	2506.23485	null
2025-06-30	MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting	Jun Huang et.al.	2506.23482	null
2025-06-30	Evaluation of Geolocation Capabilities of Multimodal Large Language Models and Analysis of Associated Privacy Risks	Xian Zhang et.al.	2506.23481	null
2025-06-30	What to Keep and What to Drop: Adaptive Table Filtering Framework	Jang Won June et.al.	2506.23463	null
2025-06-30	Can We Predict the Unpredictable? Leveraging DisasterNet-LLM for Multimodal Disaster Classification	Manaswi Kulahara et.al.	2506.23462	null
2025-06-30	General Signal Model and Capacity Limit for Rydberg Quantum Information System	Jieao Zhu et.al.	2506.23455	null
2025-06-30	PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions	Mahesh Bhosale et.al.	2506.23440	null
2025-06-29	TuCo: Measuring the Contribution of Fine-Tuning to Individual Responses of LLMs	Felipe Nuti et.al.	2506.23423	null
2025-06-29	Datasets for Fairness in Language Models: An In-Depth Survey	Jiale Zhang et.al.	2506.23411	null
2025-06-29	Do LLMs Dream of Discrete Algorithms?	Claudionor Coelho Jr et.al.	2506.23408	null
2025-06-29	Perspective Dial: Measuring Perspective of Text and Guiding LLM Outputs	Taejin Kim et.al.	2506.23377	null
2025-06-29	Federated Timeline Synthesis: Scalable and Private Methodology For Model Training and Deployment	Pawel Renc et.al.	2506.23358	null
2025-06-29	GeoProg3D: Compositional Visual Reasoning for City-Scale 3D Language Fields	Shunsuke Yasuki et.al.	2506.23352	null
2025-06-29	ATGen: A Framework for Active Text Generation	Akim Tsvigun et.al.	2506.23342	null
2025-06-29	Information Loss in LLMs’ Multilingual Translation: The Role of Training Data, Language Proximity, and Language Family	Yumeng Lin et.al.	2506.23340	null
2025-06-29	VALID-Mol: a Systematic Framework for Validated LLM-Assisted Molecular Design	Malikussaid et.al.	2506.23339	null
2025-06-29	XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs	Yitian Gong et.al.	2506.23325	null
2025-06-29	GATSim: Urban Mobility Simulation with Generative Agents	Qi Liu et.al.	2506.23306	null
2025-07-01	Exposing and Mitigating Calibration Biases and Demographic Unfairness in MLLM Few-Shot In-Context Learning for Medical Image Classification	Xing Shen et.al.	2506.23298	null
2025-06-29	Two Spelling Normalization Approaches Based on Large Language Models	Miguel Domingo et.al.	2506.23288	null
2025-06-29	MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition	Yuhuan Yang et.al.	2506.23283	null
2025-06-29	Autoregressive Denoising Score Matching is a Good Video Anomaly Detector	Hanwen Zhang et.al.	2506.23282	null
2025-06-29	Corrupted by Reasoning: Reasoning Language Models Become Free-Riders in Public Goods Games	David Guzman Piedrahita et.al.	2506.23276	null
2025-06-27	Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy	Yuhao Liu et.al.	2506.22432	null
2025-06-27	The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements	Bingchen Zhao et.al.	2506.22419	null
2025-06-27	HyperCLOVA X THINK Technical Report	NAVER Cloud HyperCLOVA X Team et.al.	2506.22403	null
2025-06-27	Refining Czech GEC: Insights from a Multi-Experiment Approach	Petr Pechman et.al.	2506.22402	null
2025-06-27	QuickSilver – Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization	Danush Khanna et.al.	2506.22396	null
2025-06-27	What Makes ChatGPT Effective for Software Issue Resolution? An Empirical Study of Developer-ChatGPT Conversations in GitHub	Ramtin Ehsani et.al.	2506.22390	null
2025-06-27	Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment	Yue Zhang et.al.	2506.22385	null
2025-06-27	Probabilistic Optimality for Inference-time Scaling	Youkang Wang et.al.	2506.22376	null
2025-06-27	Towards Fair Rankings: Leveraging LLMs for Gender Bias Detection and Measurement	Maryam Mousavian et.al.	2506.22372	null
2025-06-27	Can Large Language Models Help Students Prove Software Correctness? An Experimental Study with Dafny	Carolina Carreira et.al.	2506.22370	null
2025-06-27	Concept-Level AI for Telecom: Moving Beyond Large Language Models	Viswanath Kumarskandpriya et.al.	2506.22359	null
2025-06-27	Optimal Estimation of Watermark Proportions in Hybrid AI-Human Texts	Xiang Li et.al.	2506.22343	null
2025-06-27	Evaluating Scoring Bias in LLM-as-a-Judge	Qingquan Li et.al.	2506.22316	null
2025-06-27	Detection of Personal Data in Structured Datasets Using a Large Language Model	Albert Agisha Ntwali et.al.	2506.22305	null
2025-06-27	Unfolding Generative Flows with Koopman Operators: Fast and Interpretable Sampling	Erkan Turan et.al.	2506.22304	null
2025-06-27	Rethinking Visual Token Reduction in LVLMs under Cross-modal Misalignment	Rui Xu et.al.	2506.22283	null
2025-06-27	Public Service Algorithm: towards a transparent, explainable, and scalable content curation for news content based on editorial values	Ahmad Mel et.al.	2506.22270	null
2025-06-27	Towards Operational Data Analytics Chatbots – Virtual Knowledge Graph is All You Need	Junaid Ahmed Khan et.al.	2506.22267	null
2025-06-27	Projected Compression: Trainable Projection for Efficient Transformer Compression	Maciej Stefaniak et.al.	2506.22255	null
2025-06-27	Adapting University Policies for Generative AI: Opportunities, Challenges, and Policy Solutions in Higher Education	Russell Beale et.al.	2506.22231	null
2025-06-27	Cardiovascular disease classification using radiomics and geometric features from cardiac CT	Ajay Mittal et.al.	2506.22226	null
2025-06-27	Hybrid Generative Modeling for Incomplete Physics: Deep Grey-Box Meets Optimal Transport	Gurjeet Sangra Singh et.al.	2506.22204	null
2025-06-27	EFRame: Deeper Reasoning via Exploration-Filtering-Replay Reinforcement Learning Framework	Chen Wang et.al.	2506.22200	null
2025-06-27	Exploring Modularity of Agentic Systems for Drug Discovery	Laura van Weesep et.al.	2506.22189	null
2025-06-27	A Different Approach to AI Safety: Proceedings from the Columbia Convening on Openness in Artificial Intelligence and AI Safety	Camille François et.al.	2506.22183	null
2025-06-27	Training Language Model to Critique for Better Refinement	Tianshu Yu et.al.	2506.22157	null
2025-06-27	RetFiner: A Vision-Language Refinement Scheme for Retinal Foundation Models	Ronald Fecso et.al.	2506.22149	null
2025-06-27	SAGE: Spliced-Audio Generated Data for Enhancing Foundational Models in Low-Resource Arabic-English Code-Switched Speech Recognition	Muhammad Umar Farooq et.al.	2506.22143	null
2025-06-27	Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs	Shaojie Zhang et.al.	2506.22139	null
2025-06-27	Reasoning in machine vision: learning to think fast and slow	Shaheer U. Saeed et.al.	2506.22075	null
2025-06-27	Query as Test: An Intelligent Driving Test and Data Storage Method for Integrated Cockpit-Vehicle-Road Scenarios	Shengyue Yao et.al.	2506.22068	null
2025-06-27	Lost at the Beginning of Reasoning	Baohao Liao et.al.	2506.22058	null
2025-06-27	Decoding Machine Translationese in English-Chinese News: LLMs vs. NMTs	Delu Kong et.al.	2506.22050	null
2025-06-27	GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling	Tianhao Chen et.al.	2506.22049	null
2025-06-27	Few-Shot Identity Adaptation for 3D Talking Heads via Global Gaussian Field	Hong Nie et.al.	2506.22044	null
2025-06-27	UniCA: Adapting Time Series Foundation Model to General Covariate-Aware Forecasting	Lu Han et.al.	2506.22039	null
2025-06-27	Can Peter Pan Survive MT? A Stylometric Study of LLMs, NMTs, and HTs in Children’s Literature Translation	Delu Kong et.al.	2506.22038	null
2025-06-27	SiPipe: Bridging the CPU-GPU Utilization Gap for Efficient Pipeline-Parallel LLM Inference	Yongchao He et.al.	2506.22033	null
2025-06-27	LMPVC and Policy Bank: Adaptive voice control for industrial robots with code generating LLMs and reusable Pythonic policies	Ossi Parikka et.al.	2506.22028	null
2025-06-27	RoboEnvision: A Long-Horizon Video Generation Model for Multi-Task Robot Manipulation	Liudi Yang et.al.	2506.22007	null
2025-06-27	LeanConjecturer: Automatic Generation of Mathematical Conjectures for Theorem Proving	Naoto Onda et.al.	2506.22005	null
2025-06-27	R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning	Biao Wang et.al.	2506.21980	null
2025-06-27	TASeg: Text-aware RGB-T Semantic Segmentation based on Fine-tuning Vision Foundation Models	Meng Yu et.al.	2506.21975	null
2025-06-27	Don’t Trust Generative Agents to Mimic Communication on Social Networks Unless You Benchmarked their Empirical Realism	Simon Münker et.al.	2506.21974	null
2025-06-27	Advancing Jailbreak Strategies: A Hybrid Approach to Exploiting LLM Vulnerabilities and Bypassing Modern Defenses	Mohamed Ahmed et.al.	2506.21972	null
2025-06-27	Using Large Language Models to Suggest Informative Prior Distributions in Bayesian Statistics	Michael A. Riegler et.al.	2506.21964	null
2025-06-27	PapersPlease: A Benchmark for Evaluating Motivational Values of Large Language Models Based on ERG Theory	Junho Myung et.al.	2506.21961	null
2025-06-27	Optimal Return-to-Go Guided Decision Transformer for Auto-Bidding in Advertisement	Hao Jiang et.al.	2506.21956	null
2025-06-27	Universal Modelling of Autocovariance Functions via Spline Kernels	Lachlan Astfalck et.al.	2506.21953	null
2025-06-27	CAL-RAG: Retrieval-Augmented Multi-Agent Generation for Content-Aware Layout Design	Najmeh Forouzandehmehr et.al.	2506.21934	null
2025-06-27	ARAG: Agentic Retrieval Augmented Generation for Personalized Recommendation	Reza Yousefi Maragheh et.al.	2506.21931	null
2025-06-27	A Survey of LLM Inference Systems	James Pan et.al.	2506.21901	null
2025-06-27	Bias, Accuracy, and Trust: Gender-Diverse Perspectives on Large Language Models	Aimen Gaba et.al.	2506.21898	null
2025-06-27	Exploring Task-Solving Paradigm for Generalized Cross-Domain Face Anti-Spoofing via Reinforcement Fine-Tuning	Fangling Jiang et.al.	2506.21895	null
2025-06-27	Integrating Multi-Modal Sensors: A Review of Fusion Techniques for Intelligent Vehicles	Chuheng Wei et.al.	2506.21885	null
2025-06-27	A Dual-Layered Evaluation of Geopolitical and Cultural Bias in LLMs	Sean Kim et.al.	2506.21881	null
2025-06-27	WildSpeech-Bench: Benchmarking Audio LLMs in Natural Speech Conversation	Jian Zhang et.al.	2506.21875	null
2025-06-27	On the Feasibility of Poisoning Text-to-Image AI Models via Adversarial Mislabeling	Stanley Wu et.al.	2506.21874	null
2025-06-27	Grounding-Aware Token Pruning: Recovering from Drastic Performance Drops in Visual Grounding Caused by Pruning	Tzu-Chun Chien et.al.	2506.21873	null
2025-06-27	RiverEcho: Real-Time Interactive Digital System for Ancient Yellow River Culture	Haofeng Wang et.al.	2506.21865	null
2025-06-27	DeepTalk: Towards Seamless and Smart Speech Interaction with Adaptive Modality-Specific MoE	Hang Shao et.al.	2506.21864	null
2025-06-27	LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs	Boyuan Sun et.al.	2506.21862	null
2025-06-27	SPADE: Spatial Transcriptomics and Pathology Alignment Using a Mixture of Data Experts for an Expressive Latent Space	Ekaterina Redekop et.al.	2506.21857	null
2025-06-27	Skill-Nav: Enhanced Navigation with Versatile Quadrupedal Locomotion via Waypoint Interface	Dewei Wang et.al.	2506.21853	null
2025-06-27	The Consistency Hypothesis in Uncertainty Quantification for Large Language Models	Quan Xiao et.al.	2506.21849	null
2025-06-27	Adversarial Threats in Quantum Machine Learning: A Survey of Attacks and Defenses	Archisman Ghosh et.al.	2506.21842	null
2025-06-27	PARSI: Persian Authorship Recognition via Stylometric Integration	Kourosh Shahnazari et.al.	2506.21840	null
2025-06-27	ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts	Xiaoqi Wang et.al.	2506.21835	null
2025-06-27	TaleForge: Interactive Multimodal System for Personalized Story Creation	Minh-Loi Nguyen et.al.	2506.21832	null
2025-06-27	Few-Shot Segmentation of Historical Maps via Linear Probing of Vision Foundation Models	Rafael Sterzinger et.al.	2506.21826	null
2025-06-26	Exploring the change in scientific readability following the release of ChatGPT	Abdulkareem Alsudais et.al.	2506.21825	null
2025-06-26	Exploring the Structure of AI-Induced Language Change in Scientific English	Riley Galpin et.al.	2506.21817	null
2025-06-26	CAT-SG: A Large Dynamic Scene Graph Dataset for Fine-Grained Understanding of Cataract Surgery	Felix Holm et.al.	2506.21813	null
2025-06-26	Towards Transparent AI: A Survey on Explainable Large Language Models	Avash Palikhe et.al.	2506.21812	null
2025-06-26	CitySim: Modeling Urban Behaviors and City Dynamics with Large-Scale LLM-Driven Agent Simulation	Nicolas Bougie et.al.	2506.21805	null
2025-06-26	Multi-task parallelism for robust pre-training of graph foundation models on multi-source, multi-fidelity atomistic modeling data	Massimiliano Lupo Pasini et.al.	2506.21788	null
2025-06-26	MobiVerse: Scaling Urban Mobility Simulation with Hybrid Lightweight Domain-Specific Generator and Large Language Models	Yifan Liu et.al.	2506.21784	null
2025-06-26	Evaluating List Construction and Temporal Understanding capabilities of Large Language Models	Alexandru Dumitru et.al.	2506.21783	null
2025-06-26	M3PO: Massively Multi-Task Model-Based Policy Optimization	Aditya Narendra et.al.	2506.21782	null
2025-06-26	THE-Tree: Can Tracing Historical Evolution Enhance Scientific Verification and Reasoning?	Xin Wang et.al.	2506.21763	null
2025-06-26	(Fact) Check Your Bias	Eivind Morris Bakke et.al.	2506.21745	null
2025-06-26	Hierarchical Reasoning Model	Guan Wang et.al.	2506.21734	null
2025-06-26	Exploring Image Generation via Mutually Exclusive Probability Spaces and Local Correlation Hypothesis	Chenqiu Zhao et.al.	2506.21731	null
2025-06-26	FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering	Liangyu Zhong et.al.	2506.21710	null
2025-06-26	TanDiT: Tangent-Plane Diffusion Transformer for High-Quality 360° Panorama Generation	Hakan Çapuk et.al.	2506.21681	null
2025-06-26	Infrared foundations for quantum geometry I: Catalogue of totally symmetric rank-three field theories	Will Barker et.al.	2506.21662	null
2025-06-26	APO: Enhancing Reasoning Ability of MLLMs via Asymmetric Policy Optimization	Minjie Hong et.al.	2506.21655	null
2025-06-26	Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test	Ziyue Li et.al.	2506.21551	null
2025-06-26	mTSBench: Benchmarking Multivariate Time Series Anomaly Detection and Model Selection at Scale	Xiaona Zhou et.al.	2506.21550	null
2025-06-26	SAM4D: Segment Anything in Camera and LiDAR Streams	Jianyun Xu et.al.	2506.21547	null
2025-06-26	PsyLite Technical Report	Fangjun Ding et.al.	2506.21536	null
2025-06-26	Exploring the Design Space of 3D MLLMs for CT Report Generation	Mohammed Baharoon et.al.	2506.21535	null
2025-06-26	“What’s Up, Doc?”: Analyzing How Users Seek Health Information in Large-Scale Conversational AI Datasets	Akshay Paruchuri et.al.	2506.21532	null
2025-06-26	Potemkin Understanding in Large Language Models	Marina Mancoridis et.al.	2506.21521	null
2025-06-26	Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge	Boyu Gou et.al.	2506.21506	null
2025-06-26	Bridging Offline and Online Reinforcement Learning for LLMs	Jack Lanchantin et.al.	2506.21495	null
2025-06-26	Global and Local Entailment Learning for Natural World Imagery	Srikumar Sastry et.al.	2506.21476	null
2025-06-26	Efficient and Reuseable Cloud Configuration Search Using Discovery Spaces	Michael Johnston et.al.	2506.21467	null
2025-06-26	ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing	Huadai Liu et.al.	2506.21448	null
2025-06-26	Controllable 3D Placement of Objects with Scene-Aware Diffusion Models	Mohamed Omran et.al.	2506.21446	null
2025-06-26	Text2Cypher Across Languages: Evaluating Foundational Models Beyond English	Makbule Gulcin Ozsoy et.al.	2506.21445	null
2025-06-26	Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation	Sweta Banerjee et.al.	2506.21444	null
2025-06-26	Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection	Ali Şenol et.al.	2506.21443	null
2025-06-26	Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning	Prajwal Koirala et.al.	2506.21427	null
2025-06-26	XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation	Bowen Chen et.al.	2506.21416	null
2025-06-26	Distributed Cross-Channel Hierarchical Aggregation for Foundation Models	Aristeidis Tsaris et.al.	2506.21411	null
2025-06-26	Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference	Colin Samplawski et.al.	2506.21408	null
2025-06-26	TableMoE: Neuro-Symbolic Routing for Structured Expert Reasoning in Multimodal Table Understanding	Junwen Zhang et.al.	2506.21393	null
2025-06-26	Early Stopping Tabular In-Context Learning	Jaris Küken et.al.	2506.21387	null
2025-06-26	Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation	Guanting Dong et.al.	2506.21384	null
2025-06-26	Canonical Quantization of a Memristive Leaky Integrate-and-Fire Neuron Circuit	Dean Brand et.al.	2506.21363	null
2025-06-26	Structuralist Approach to AI Literary Criticism: Leveraging Greimas Semiotic Square for Large Language Models	Fangzhou Dong et.al.	2506.21360	null
2025-06-26	CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations	Julian Lorenz et.al.	2506.21357	null
2025-06-26	SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning	Melanie Rieff et.al.	2506.21355	null
2025-06-26	DynamicBench: Evaluating Real-Time Report Generation in Large Language Models	Jingyao Li et.al.	2506.21343	null
2025-06-26	Active Inference AI Systems for Scientific Discovery	Karthik Duraisamy et.al.	2506.21329	null
2025-06-26	Latent Prototype Routing: Achieving Near-Perfect Load Balancing in Mixture-of-Experts	Jiajie Yang et.al.	2506.21328	null
2025-06-26	DrishtiKon: Multi-Granular Visual Grounding for Text-Rich Document Images	Badri Vishal Kasuba et.al.	2506.21316	null
2025-06-26	Exploring Adapter Design Tradeoffs for Low Resource Music Generation	Atharva Mehta et.al.	2506.21298	null
2025-06-26	Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models	Bram Willemsen et.al.	2506.21294	null
2025-06-26	Small Encoders Can Rival Large Decoders in Detecting Groundedness	Istabrak Abbes et.al.	2506.21288	null
2025-06-26	Double-Checker: Enhancing Reasoning of Slow-Thinking LLMs via Self-Critical Fine-Tuning	Xin Xu et.al.	2506.21285	null
2025-06-26	Hyperspherical Variational Autoencoders Using Efficient Spherical Cauchy Distribution	Lukas Sablica et.al.	2506.21278	null
2025-06-26	HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context	Qize Yang et.al.	2506.21277	null
2025-06-26	Cat and Mouse – Can Fake Text Generation Outpace Detector Systems?	Andrea McGlinchey et.al.	2506.21274	null
2025-06-26	DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster	Ji Qi et.al.	2506.21263	null
2025-06-26	Agent-RewardBench: Towards a Unified Benchmark for Reward Modeling across Perception, Planning, and Safety in Real-World Multimodal Agents	Tianyi Men et.al.	2506.21252	null
2025-06-26	ACTLLM: Action Consistency Tuned Large Language Model	Jing Bi et.al.	2506.21250	null
2025-06-26	GANet-Seg: Adversarial Learning for Brain Tumor Segmentation with Hybrid Generative Models	Qifei Cui et.al.	2506.21245	null
2025-06-26	Zero-Shot Learning for Obsolescence Risk Forecasting	Elie Saad et.al.	2506.21240	null
2025-06-26	Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval	Yongchan Chun et.al.	2506.21222	null
2025-06-26	Complexity-aware fine-tuning	Andrey Goncharov et.al.	2506.21220	null
2025-06-26	Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?	Haoang Chi et.al.	2506.21215	null
2025-06-26	$T^3$: Multi-level Tree-based Automatic Program Repair with Large Language Models	Quanming Liu et.al.	2506.21211	null
2025-06-26	BitMark for Infinity: Watermarking Bitwise Autoregressive Image Generative Models	Louis Kerner et.al.	2506.21209	null
2025-06-26	MedPrompt: LLM-CNN Fusion with Weight Routing for Medical Image Segmentation and Classification	Shadman Sobhan et.al.	2506.21199	null
2025-06-26	Prompt-Guided Turn-Taking Prediction	Koji Inoue et.al.	2506.21191	null
2025-06-26	GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding	Zijun Lin et.al.	2506.21188	null
2025-06-26	Task-Aware KV Compression For Cost-Effective Long Video Understanding	Minghao Qin et.al.	2506.21184	null
2025-06-26	Generative Adversarial Evasion and Out-of-Distribution Detection for UAV Cyber-Attacks	Deepak Kumar Panda et.al.	2506.21142	null
2025-06-26	How Good Are Synthetic Requirements? Evaluating LLM-Generated Datasets for AI4RE	Abdelkarim El-Hajjami et.al.	2506.21138	null
2025-06-26	IPFormer-VideoLLM: Enhancing Multi-modal Video Understanding for Multi-shot Scenes	Yujia Liang et.al.	2506.21116	null
2025-06-26	OracleFusion: Assisting the Decipherment of Oracle Bone Script with Structurally Constrained Semantic Typography	Caoshuo Li et.al.	2506.21101	null
2025-06-26	Enhancing LLM Tool Use with High-quality Instruction Data from Knowledge Graph	Jingwei Wang et.al.	2506.21071	null
2025-06-26	MT2-CSD: A New Dataset and Multi-Semantic Knowledge Fusion Method for Conversational Stance Detection	Fuqiang Niu et.al.	2506.21053	null
2025-06-26	V2X-REALM: Vision-Language Model-Based Robust End-to-End Cooperative Autonomous Driving with Adaptive Long-Tail Modeling	Junwei You et.al.	2506.21041	null
2025-06-26	Little By Little: Continual Learning via Self-Activated Sparse Mixture-of-Rank Adaptive Learning	Haodong Lu et.al.	2506.21035	null
2025-06-26	BLOCKS: Blockchain-supported Cross-Silo Knowledge Sharing for Efficient LLM Services	Zhaojiacheng Zhou et.al.	2506.21033	null
2025-06-26	Large Language Models Acing Chartered Accountancy	Jatin Gupta et.al.	2506.21031	null
2025-06-26	STEP Planner: Constructing cross-hierarchical subgoal tree as an embodied long-horizon task planner	Zhou Tianxing et.al.	2506.21030	null
2025-06-26	Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation	Ze Wang et.al.	2506.21022	null
2025-06-26	Multimodal Prompt Alignment for Facial Expression Recognition	Fuyan Ma et.al.	2506.21017	null
2025-06-26	HybridQ: Hybrid Classical-Quantum Generative Adversarial Network for Skin Disease Image Generation	Qingyue Jiao et.al.	2506.21015	null
2025-06-26	Distilling Normalizing Flows	Steven Walton et.al.	2506.21003	null
2025-06-26	SAC: A Framework for Measuring and Inducing Personality Traits in LLMs with Dynamic Intensity Control	Adithya Chittem et.al.	2506.20993	null
2025-06-26	Segment Anything in Pathology Images with Natural Language	Zhixuan Chen et.al.	2506.20988	null
2025-06-26	Our Coding Adventure: Using LLMs to Personalise the Narrative of a Tangible Programming Robot for Preschoolers	Martin Ruskov et.al.	2506.20982	null
2025-06-26	Response Quality Assessment for Retrieval-Augmented Generation via Conditional Conformal Factuality	Naihe Feng et.al.	2506.20978	null
2025-06-26	Where is AIED Headed? Key Topics and Emerging Frontiers (2020-2024)	Shihui Feng et.al.	2506.20971	null
2025-06-26	Parallels Between VLA Model Post-Training and Human Motor Learning: Progress, Challenges, and Trends	Tian-Yu Xiang et.al.	2506.20966	null
2025-06-26	Evidence-based diagnostic reasoning with multi-agent copilot for human pathology	Chengkuan Chen et.al.	2506.20964	null
2025-06-26	EraRAG: Efficient and Incremental Retrieval Augmented Generation for Growing Corpora	Fangyuan Zhang et.al.	2506.20963	null
2025-06-26	Hierarchical Sub-action Tree for Continuous Sign Language Recognition	Dejie Yang et.al.	2506.20947	null
2025-06-26	Consistent Zero-shot 3D Texture Synthesis Using Geometry-aware Diffusion and Temporal Video Models	Donggoo Kang et.al.	2506.20946	null
2025-06-26	E-FreeM2: Efficient Training-Free Multi-Scale and Cross-Modal News Verification via MLLMs	Van-Hoang Phan et.al.	2506.20944	null
2025-06-26	Model State Arithmetic for Machine Unlearning	Keivan Rezaei et.al.	2506.20941	null
2025-06-26	ParEval-Repo: A Benchmark Suite for Evaluating LLMs with Repository-level HPC Translation Tasks	Joshua H. Davis et.al.	2506.20938	null
2025-06-26	LLM-guided Chemical Process Optimization with a Multi-Agent Approach	Tong Zeng et.al.	2506.20921	null
2025-06-26	FineWeb2: One Pipeline to Scale Them All – Adapting Pre-Training Data Processing to Every Language	Guilherme Penedo et.al.	2506.20920	null
2025-06-26	Metadata Enrichment of Long Text Documents using Large Language Models	Manika Lamba et.al.	2506.20918	null
2025-06-26	ZKPROV: A Zero-Knowledge Approach to Dataset Provenance for Large Language Models	Mina Namazi et.al.	2506.20915	null
2025-06-26	FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing	Advait Gupta et.al.	2506.20911	null
2025-06-25	Omniwise: Predicting GPU Kernels Performance with LLMs	Zixian Wang et.al.	2506.20886	null
2025-06-25	MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans	Shubhankar Borse et.al.	2506.20879	null
2025-06-25	3DGH: 3D Head Generation with Composable Hair and Face	Chengan He et.al.	2506.20875	null
2025-06-25	Engineering RAG Systems for Real-World Applications: Design, Development, and Evaluation	Md Toufique Hasan et.al.	2506.20869	null
2025-06-25	Leaner Training, Lower Leakage: Revisiting Memorization in LLM Fine-Tuning with LoRA	Fei Wang et.al.	2506.20856	null
2025-06-25	Vector Contrastive Learning For Pixel-Wise Pretraining In Medical Vision	Yuting He et.al.	2506.20850	null
2025-06-25	Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes	Quintin Myers et.al.	2506.20822	null
2025-06-25	MultiFinRAG: An Optimized Multimodal Retrieval-Augmented Generation (RAG) Framework for Financial Question Answering	Chinmay Gondhalekar et.al.	2506.20821	null
2025-06-25	GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization	Martin Andrews et.al.	2506.20807	null
2025-06-25	Poster: Enhancing GNN Robustness for Network Intrusion Detection via Agent-based Analysis	Zhonghao Zhan et.al.	2506.20806	null
2025-06-25	Predicting Readiness to Engage in Psychotherapy of People with Chronic Pain Based on their Pain-Related Narratives	Saar Draznin Shiran et.al.	2506.20805	null
2025-06-25	The Ideation-Execution Gap: Execution Outcomes of LLM-Generated versus Human Research Ideas	Chenglei Si et.al.	2506.20803	null
2025-06-25	Structural System Identification via Validation and Adaptation	Cristian López et.al.	2506.20799	null
2025-06-25	How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction?	Stephanie Käs et.al.	2506.20795	null
2025-06-25	Multi-lingual Functional Evaluation for Large Language Models	Victor Ojewale et.al.	2506.20793	null
2025-06-25	The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind	Andrei Lupu et.al.	2506.20664	null
2025-06-25	Memento: Note-Taking for Your Future Self	Chao Wan et.al.	2506.20642	null
2025-06-25	Telegrapher’s Generative Model via Kac Flows	Richard Duong et.al.	2506.20641	null
2025-06-25	Towards Community-Driven Agents for Machine Learning Engineering	Sijie Li et.al.	2506.20640	null
2025-06-25	DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation	Shansan Gong et.al.	2506.20639	null
2025-06-25	AI Assistants to Enhance and Exploit the PETSc Knowledge Base	Barry Smith et.al.	2506.20608	null
2025-06-25	Model Editing as a Double-Edged Sword: Steering Agent Ethical Behavior Toward Beneficence or Harm	Baixiang Huang et.al.	2506.20606	null
2025-06-25	Video Perception Models for 3D Scene Synthesis	Rui Huang et.al.	2506.20601	null
2025-06-25	SFNet: Fusion of Spatial and Frequency-Domain Features for Remote Sensing Image Forgery Detection	Ji Qi et.al.	2506.20599	null
2025-06-25	Large Language Model-Driven Code Compliance Checking in Building Information Modeling	Soumya Madireddy et.al.	2506.20551	null
2025-06-25	When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs	Ammar Khairi et.al.	2506.20544	null
2025-06-25	WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads	Hongzhen Huang et.al.	2506.20535	null
2025-06-25	Case-based Reasoning Augmented Large Language Model Framework for Decision Making in Realistic Safety-Critical Driving Scenarios	Wenbin Gan et.al.	2506.20531	null
2025-06-25	Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards	Charles Arnal et.al.	2506.20520	null
2025-06-25	OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling	Zengzhi Wang et.al.	2506.20512	null
2025-06-25	BotHash: Efficient and Training-Free Bot Detection Through Approximate Nearest Neighbor	Edoardo Di Paolo et.al.	2506.20503	null
2025-06-25	ReCode: Updating Code API Knowledge with Reinforcement Learning	Haoze Wu et.al.	2506.20495	null
2025-06-25	Behavior Foundation Model: Towards Next-Generation Whole-Body Control System of Humanoid Robots	Mingqi Yuan et.al.	2506.20487	null
2025-06-25	Counterfactual Influence as a Distributional Quantity	Matthieu Meeus et.al.	2506.20481	null
2025-06-25	GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching	Guinan Su et.al.	2506.20480	null
2025-06-25	Probing AI Safety with Source Code	Ujwal Narayan et.al.	2506.20471	null
2025-06-25	Automatic Demonstration Selection for LLM-based Tabular Data Classification	Shuchu Han et.al.	2506.20451	null
2025-06-25	Med-Art: Diffusion Transformer for 2D Medical Text-to-Image Generation	Changlu Guo et.al.	2506.20449	null
2025-06-25	An Agentic System for Rare Disease Diagnosis with Traceable Reasoning	Weike Zhao et.al.	2506.20430	null
2025-06-25	Semantic Caching for Improving Web Affordability	Hafsa Akbar et.al.	2506.20420	null
2025-06-25	SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models	Dipayan Saha et.al.	2506.20415	null
2025-06-25	TAPS: Tool-Augmented Personalisation via Structured Tagging	Ekaterina Taktasheva et.al.	2506.20409	null
2025-06-25	A Novel Large Vision Foundation Model (LVFM)-based Approach for Generating High-Resolution Canopy Height Maps in Plantations for Precision Forestry Management	Shen Tan et.al.	2506.20388	null
2025-06-25	TESSERA: Temporal Embeddings of Surface Spectra for Earth Representation and Analysis	Zhengpeng Feng et.al.	2506.20380	null
2025-06-25	Tabular Feature Discovery With Reasoning Type Exploration	Sungwon Han et.al.	2506.20357	null
2025-06-25	A foundation model with multi-variate parallel attention to generate neuronal activity	Francesco Carzaniga et.al.	2506.20354	null
2025-06-25	DipSVD: Dual-importance Protected SVD for Efficient LLM Compression	Xuan Ding et.al.	2506.20353	null
2025-06-25	Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content	Rian Touchent et.al.	2506.20331	null
2025-06-25	From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios	Changliang Xia et.al.	2506.20279	null
2025-06-25	Enterprise Large Language Model Evaluation Benchmark	Liya Wang et.al.	2506.20274	null
2025-06-25	Narrative Shift Detection: A Hybrid Approach of Dynamic Topic Models and Large Language Models	Kai-Robin Lange et.al.	2506.20269	null
2025-06-25	Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement	Kun Yuan et.al.	2506.20254	null
2025-06-25	Q-resafe: Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models	Kejia Chen et.al.	2506.20251	null
2025-06-25	Enhancing Large Language Models through Structured Reasoning	Yubo Dong et.al.	2506.20241	null
2025-06-25	Tomography for Plasma Imaging: a Unifying Framework for Bayesian Inference	D. Hamm et.al.	2506.20232	null
2025-06-25	UniCode$^2$: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation	Yanzhe Chen et.al.	2506.20214	null
2025-06-25	How to Retrieve Examples in In-context Learning to Improve Conversational Emotion Recognition using Large Language Models?	Mengqi Wang et.al.	2506.20199	null
2025-06-25	Zero-Shot Attribution for Large Language Models: A Distribution Testing Approach	Clément L. Canonne et.al.	2506.20197	null
2025-06-25	DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs	Ruokai Yin et.al.	2506.20194	null
2025-06-25	Breaking the Boundaries of Long-Context LLM Inference: Adaptive KV Management on a Single Commodity GPU	He Sun et.al.	2506.20187	null
2025-06-25	COIN: Uncertainty-Guarding Selective Question Answering for Foundation Models with Provable Risk Guarantees	Zhiyuan Wang et.al.	2506.20178	null
2025-06-25	Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition	Man Duc Chuc et.al.	2506.20174	null
2025-06-25	JsDeObsBench: Measuring and Benchmarking LLMs for JavaScript Deobfuscation	Guoqiang Chen et.al.	2506.20170	null
2025-06-25	Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models	Zhentao He et.al.	2506.20168	null
2025-06-25	SEED: A Structural Encoder for Embedding-Driven Decoding in Time Series Prediction with LLMs	Fengze Li et.al.	2506.20167	null
2025-06-25	AALC: Large Language Model Efficient Reasoning via Adaptive Accuracy-Length Control	Ruosen Li et.al.	2506.20160	null
2025-06-25	Irec: A Metacognitive Scaffolding for Self-Regulated Learning through Just-in-Time Insight Recall: A Conceptual Framework and System Prototype	Xuefei Hou et.al.	2506.20156	null
2025-06-25	EAR: Erasing Concepts from Unified Autoregressive Models	Haipeng Fan et.al.	2506.20151	null
2025-06-25	A Multi-Pass Large Language Model Framework for Precise and Efficient Radiology Report Error Detection	Songsoo Kim et.al.	2506.20112	null
2025-06-25	BrokenVideos: A Benchmark Dataset for Fine-Grained Artifact Localization in AI-Generated Videos	Jiahao Lin et.al.	2506.20103	null
2025-06-25	PSALM-V: Automating Symbolic Planning in Interactive Visual Environments with Large Language Models	Wang Bill Zhu et.al.	2506.20097	null
2025-06-25	ITFormer: Bridging Time Series and Natural Language for Multi-Modal QA with Large-Scale Multitask Dataset	Yilin Wang et.al.	2506.20093	null
2025-06-25	A Modular Multitask Reasoning Framework Integrating Spatio-temporal Models and LLMs	Kethmi Hirushini Hettige et.al.	2506.20073	null
2025-06-25	Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision	KMA Solaiman et.al.	2506.20070	null
2025-06-24	Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models	Zhicheng Zhang et.al.	2506.20061	null
2025-06-24	DiaLLMs: EHR Enhanced Clinical Conversational System for Clinical Test Recommendation and Diagnosis Prediction	Weijieying Ren et.al.	2506.20059	null
2025-06-24	Machine-Learning-Assisted Photonic Device Development: A Multiscale Approach from Theory to Characterization	Yuheng Chen et.al.	2506.20056	null
2025-06-24	Controlled Retrieval-augmented Context Evaluation for Long-form RAG	Jia-Huei Ju et.al.	2506.20051	null
2025-06-24	Cross-Layer Discrete Concept Discovery for Interpreting Language Models	Ankur Garg et.al.	2506.20040	null
2025-06-24	Persona-Assigned Large Language Models Exhibit Human-Like Motivated Reasoning	Saloni Dash et.al.	2506.20020	null
2025-06-24	Achieving Trustworthy Real-Time Decision Support Systems with Low-Latency Interpretable AI Models	Zechun Deng et.al.	2506.20018	null
2025-06-24	Accurate and Energy Efficient: Local Retrieval-Augmented Generation Models Outperform Commercial Large Language Models in Medical Tasks	Konstantinos Vrettos et.al.	2506.20009	null
2025-06-24	QHackBench: Benchmarking Large Language Models for Quantum Code Generation Using PennyLane Hackathon Challenges	Abdul Basit et.al.	2506.20008	null
2025-06-24	CoVE: Compressed Vocabulary Expansion Makes Better LLM-based Recommender Systems	Haochen Zhang et.al.	2506.19993	null
2025-06-24	HERCULES: Hierarchical Embedding-based Recursive Clustering Using LLMs for Efficient Summarization	Gabor Petnehazi et.al.	2506.19992	null
2025-06-24	Context Attribution with Multi-Armed Bandit Optimization	Deng Pan et.al.	2506.19977	null
2025-06-24	Inference Scaled GraphRAG: Improving Multi Hop Question Answering on Knowledge Graphs	Travis Thompson et.al.	2506.19967	null
2025-06-24	An ab initio foundation model of wavefunctions that accurately describes chemical bond breaking	Adam Foster et.al.	2506.19960	null
2025-06-24	CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation	Deepon Halder et.al.	2506.19952	null
2025-06-24	Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture	Shuchen Xue et.al.	2506.19935	null
2025-06-24	Prover Agent: An Agent-based Framework for Formal Mathematical Proofs	Kaito Baba et.al.	2506.19923	null
2025-06-24	Orthogonal Finetuning Made Scalable	Zeju Qiu et.al.	2506.19847	null
2025-06-24	JoyAgents-R1: Joint Evolution Dynamics for Versatile Multi-LLM Agents with Reinforcement Learning	Ai Han et.al.	2506.19846	null
2025-06-24	MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration	Yucheng Zhou et.al.	2506.19835	null
2025-06-24	A standard transformer and attention with linear biases for molecular conformer generation	Viatcheslav Gurev et.al.	2506.19834	null
2025-06-24	ProxelGen: Generating Proteins as 3D Densities	Felix Faltings et.al.	2506.19820	null
2025-06-24	Curating art exhibitions using machine learning	Eurico Covas et.al.	2506.19813	null
2025-06-24	KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality	Baochang Ren et.al.	2506.19807	null
2025-06-24	LLM-Based Social Simulations Require a Boundary	Zengqing Wu et.al.	2506.19806	null
2025-06-24	KnowML: Improving Generalization of ML-NIDS with Attack Knowledge Graphs	Xin Fan Guo et.al.	2506.19802	null
2025-06-24	Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study	Yuqi Zhu et.al.	2506.19794	null
2025-06-24	Line ratio identification of external photoevaporation	Tyger Peake et.al.	2506.19788	null
2025-06-24	SAGE: Strategy-Adaptive Generation Engine for Query Rewriting	Teng Wang et.al.	2506.19783	null
2025-06-24	Alleviating User-Sensitive bias with Fair Generative Sequential Recommendation Model	Yang Liu et.al.	2506.19777	null
2025-06-24	Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation	Jun Wang et.al.	2506.19774	null
2025-06-24	SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning	Yuqian Fu et.al.	2506.19767	null
2025-06-24	Arabic Dialect Classification using RNNs, Transformers, and Large Language Models: A Comparative Analysis	Omar A. Essameldin et.al.	2506.19753	null
2025-06-24	Breaking Barriers: Do Reinforcement Post Training Gains Transfer To Unseen Domains?	Chuxuan Hu et.al.	2506.19733	null
2025-06-24	Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders	Matyas Bohacek et.al.	2506.19708	null
2025-06-24	LLM-Driven Medical Document Analysis: Enhancing Trustworthy Pathology and Differential Diagnosis	Lei Kang et.al.	2506.19702	null
2025-06-24	Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models	Jungwoo Park et.al.	2506.19697	null
2025-06-24	Semantic Scene Graph for Ultrasound Image Explanation and Scanning Guidance	Xuesong Li et.al.	2506.19683	null
2025-06-24	Genome-Anchored Foundation Model Embeddings Improve Molecular Prediction from Histology Images	Cheng Jin et.al.	2506.19681	null
2025-06-25	Adaptive Request Scheduling for CodeLLM Serving with SLA Guarantees	Shi Chang et.al.	2506.19677	null
2025-06-24	A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures	Dezhang Kong et.al.	2506.19676	null
2025-06-24	Massive Atomic Diversity: a compact universal dataset for atomistic machine learning	Arslan Mazitov et.al.	2506.19674	null
2025-06-24	Recurrent Visual Feature Extraction and Stereo Attentions for CT Report Generation	Yuanhe Tian et.al.	2506.19665	null
2025-06-24	Multimodal large language models and physics visual tasks: comparative analysis of performance and costs	Giulia Polverini et.al.	2506.19662	null
2025-06-24	SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting	Yang Xing et.al.	2506.19658	null
2025-06-24	Tailored Conversations beyond LLMs: A RL-Based Dialogue Manager	Lucie Galland et.al.	2506.19652	null
2025-06-24	Tensor-Parallelism with Partially Synchronized Activations	Itay Lamprecht et.al.	2506.19645	null
2025-06-24	Varif.ai to Vary and Verify User-Driven Diversity in Scalable Image Generation	M. Michelessa et.al.	2506.19644	null
2025-06-24	HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions	Mrunmai Vivek Phatak et.al.	2506.19639	null
2025-06-24	Decompiling Smart Contracts with a Large Language Model	Isaac David et.al.	2506.19624	null
2025-06-24	Position: Intelligent Science Laboratory Requires the Integration of Cognitive and Embodied AI	Sha Zhang et.al.	2506.19613	null
2025-06-24	Correcting Hallucinations in News Summaries: Exploration of Self-Correcting LLM Methods with External Knowledge	Juraj Vladika et.al.	2506.19607	null
2025-06-24	ECCoT: A Framework for Enhancing Effective Cognition via Chain of Thought in Large Language Model	Zhenke Duan et.al.	2506.19599	null
2025-06-24	Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning	Harisankar Babu et.al.	2506.19592	null
2025-06-24	Generative model for optimal density estimation on unknown manifold	Arthur Stéphanovitch et.al.	2506.19587	null
2025-06-24	SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing Images	Gencer Sumbul et.al.	2506.19585	null
2025-06-24	Fake or Real, Can Robots Tell? Evaluating Embodied Vision-Language Models on Real and 3D-Printed Objects	Federico Tavella et.al.	2506.19579	null
2025-06-24	Bayesian community detection in assortative stochastic block model with unknown number of communities	Martina Amongero et.al.	2506.19576	null
2025-06-24	PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty	Jinwen He et.al.	2506.19563	null
2025-06-24	Regularity of the score function in generative models	Arthur Stéphanovitch et.al.	2506.19559	null
2025-06-24	General Methods Make Great Domain-specific Foundation Models: A Case-study on Fetal Ultrasound	Jakob Ambsdorf et.al.	2506.19552	null
2025-06-24	Programming Geotechnical Reliability Algorithms using Generative AI	Atma Sharma et.al.	2506.19536	null
2025-06-24	KnowMap: Efficient Knowledge-Driven Task Adaptation for LLMs	Kelin Fu et.al.	2506.19527	null
2025-06-24	Automatic Posology Structuration: What role for LLMs?	Natalia Bobkova et.al.	2506.19525	null
2025-06-24	AnTKV: Anchor Token-Aware Sub-Bit Vector Quantization for KV Cache in Large Language Models	Zeyu Li et.al.	2506.19505	null
2025-06-24	NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling	Yan Jiang et.al.	2506.19500	null
2025-06-24	The time course of visuo-semantic representations in the human brain is captured by combining vision and language models	Boyan Rong et.al.	2506.19497	null
2025-06-24	SceneCrafter: Controllable Multi-View Driving Scene Editing	Zehao Zhu et.al.	2506.19488	null
2025-06-24	Dialogic Pedagogy for Large Language Models: Aligning Conversational AI with Proven Theories of Learning	Russell Beale et.al.	2506.19484	null
2025-06-24	Commonsense Generation and Evaluation for Dialogue Systems using Large Language Models	Marcos Estecha-Garitagoitia et.al.	2506.19483	null
2025-06-24	LLM-based Multi-Agent System for Intelligent Refactoring of Haskell Code	Shahbaz Siddeeq et.al.	2506.19481	null
2025-06-24	Surgery-R1: Advancing Surgical-VQLA with Reasoning Multimodal Large Language Model via Reinforcement Learning	Pengfei Hao et.al.	2506.19469	null
2025-06-24	MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages	Wenhan Han et.al.	2506.19468	null
2025-06-24	Can Large Language Models Capture Human Annotator Disagreements?	Jingwei Ni et.al.	2506.19467	null
2025-06-24	KunLunBaizeRAG: Reinforcement Learning Driven Inference Performance Leap for Large Language Models	Cheng Li et.al.	2506.19466	null
2025-06-24	FuncVul: An Effective Function Level Vulnerability Detection Model using LLM and Code Chunk	Sajal Halder et.al.	2506.19453	null
2025-06-24	Low-Complexity Semantic Packet Aggregation for Token Communication via Lookahead Search	Seunghun Lee et.al.	2506.19451	null
2025-06-24	Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection	Yazhou Zhang et.al.	2506.19420	null
2025-06-24	Automated Detection of Pre-training Text in Black-box LLMs	Ruihan Hu et.al.	2506.19399	null
2025-06-24	Measuring and Guiding Monosemanticity	Ruben Härle et.al.	2506.19382	null
2025-06-24	SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation	Yunsung Chung et.al.	2506.19360	null
2025-06-24	Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended Generation	Jisu Shin et.al.	2506.19352	null
2025-06-24	Training-Free Motion Customization for Distilled Video Generators with Adaptive Test-Time Distillation	Jintao Rong et.al.	2506.19348	null
2025-06-24	FEAT: A Preference Feedback Dataset through a Cost-Effective Auto-Generation and Labeling Framework for English AI Tutoring	Hyein Seo et.al.	2506.19325	null
2025-06-24	Continual Retinal Vision-Language Pre-training upon Incremental Imaging Modalities	Yuang Yao et.al.	2506.19320	null
2025-06-24	Da Yu: Towards USV-Based Image Captioning for Waterway Surveillance and Scene Understanding	Runwei Guan et.al.	2506.19288	null
2025-06-24	Generating and Understanding Tests via Path-Aware Symbolic Execution with LLMs	Yaoxuan Wu et.al.	2506.19287	null
2025-06-24	EmoStage: A Framework for Accurate Empathetic Response Generation via Perspective-Taking and Phase Recognition	Zhiyang Qi et.al.	2506.19279	null
2025-06-24	Style Transfer: A Decade Survey	Tianshan Zhang et.al.	2506.19278	null
2025-06-24	Continuous-variable Quantum Diffusion Model for State Generation and Restoration	Haitao Huang et.al.	2506.19270	null
2025-06-24	What Matters in LLM-generated Data: Diversity and Its Effect on Model Fine-Tuning	Yuchang Zhu et.al.	2506.19262	null
2025-06-24	Automated Image Recognition Framework	Quang-Binh Nguyen et.al.	2506.19261	null
2025-06-24	Inference-Time Reward Hacking in Large Language Models	Hadi Khalaf et.al.	2506.19248	null
2025-06-24	RecLLM-R1: A Two-Stage Training Paradigm with Reinforcement Learning and Chain-of-Thought v1	Yu Xie et.al.	2506.19235	null
2025-06-24	Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification	Minghao Qin et.al.	2506.19225	null
2025-06-24	MedErr-CT: A Visual Question Answering Benchmark for Identifying and Correcting Errors in CT Reports	Sunggu Kyung et.al.	2506.19217	null
2025-06-24	Augmenting Multi-Agent Communication with State Delta Trajectory	Yichen Tang et.al.	2506.19209	null
2025-06-23	Spiritual-LLM: Gita Inspired Mental Health Therapy In the Era of LLMs	Janak Kapuriya et.al.	2506.19185	null
2025-06-23	MOSCARD – Causal Reasoning and De-confounding for Multimodal Opportunistic Screening of Cardiovascular Adverse Events	Jialu Pi et.al.	2506.19174	null
2025-06-23	Distilling Tool Knowledge into Language Models via Back-Translated Traces	Xingyue Huang et.al.	2506.19171	null
2025-06-23	GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model	Amir Faiyaz et.al.	2506.19164	null
2025-06-23	AgenticControl: An Automated Control Design Framework Using Large Language Models	Mohammad Narimani et.al.	2506.19160	null
2025-06-23	Thought Anchors: Which LLM Reasoning Steps Matter?	Paul C. Bogdan et.al.	2506.19143	null
2025-06-23	Command-V: Pasting LLM Behaviors via Activation Profiles	Barry Wang et.al.	2506.19140	null
2025-06-23	Local Learning Rules for Out-of-Equilibrium Physical Generative Models	Cyrill Bösch et.al.	2506.19136	null
2025-06-23	A Fourier Explanation of AI-music Artifacts	Darius Afchar et.al.	2506.19108	null
2025-06-23	Improving Student-AI Interaction Through Pedagogical Prompting: An Example in Computer Science Education	Ruiwei Xiao et.al.	2506.19107	null
2025-06-23	Baba is LLM: Reasoning in a Game with Dynamic Rules	Fien van Wetten et.al.	2506.19095	null
2025-06-23	Language Models Might Not Understand You: Evaluating Theory of Mind via Story Prompting	Nathaniel Getachew et.al.	2506.19089	null
2025-06-23	Finetuning a Weather Foundation Model with Lightweight Decoders for Unseen Physical Processes	Fanny Lehmann et.al.	2506.19088	null
2025-06-23	Enhancing Biosecurity in Tamper-Resistant Large Language Models With Quantum Gradient Descent	Fahmida Hai et.al.	2506.19086	null
2025-06-23	Benchmarking Music Generation Models and Metrics via Human Preference Studies	Florian Grötschla et.al.	2506.19085	null
2025-06-23	FairCauseSyn: Towards Causally Fair LLM-Augmented Synthetic Data Generation	Nitish Nagesh et.al.	2506.19082	null
2025-06-23	Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition	Iosif Tsangko et.al.	2506.19079	null
2025-06-23	MFTCXplain: A Multilingual Benchmark Dataset for Evaluating the Moral Reasoning of LLMs through Hate Speech Multi-hop Explanation	Jackson Trager et.al.	2506.19073	null
2025-06-23	NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis	Georgii Bychkov et.al.	2506.19051	null
2025-06-23	From Rows to Yields: How Foundation Models for Tabular Data Simplify Crop Yield Prediction	Filip Sabo et.al.	2506.19046	null
2025-06-23	FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation	Kaiyi Huang et.al.	2506.18899	null
2025-06-23	Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations	Jiaming Han et.al.	2506.18898	null
2025-06-23	MinD: Unified Visual Imagination and Control via Hierarchical World Models	Xiaowei Chi et.al.	2506.18897	null
2025-06-23	ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs	Jiaru Zou et.al.	2506.18896	null
2025-06-23	Universal Video Temporal Grounding with Generative Multi-modal Large Language Models	Zeqian Li et.al.	2506.18883	null
2025-06-23	Let Your Video Listen to Your Music!	Xinyu Zhang et.al.	2506.18881	null
2025-06-23	CommVQ: Commutative Vector Quantization for KV Cache Compression	Junyan Li et.al.	2506.18879	null
2025-06-23	OmniGen2: Exploration to Advanced Multimodal Generation	Chenyuan Wu et.al.	2506.18871	null
2025-06-23	OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation	Qijun Gan et.al.	2506.18866	null
2025-06-23	TAMMs: Temporal-Aware Multimodal Model for Satellite Image Change Understanding and Forecasting	Zhongbin Guo et.al.	2506.18862	null
2025-06-23	LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning	Yuhao Wu et.al.	2506.18841	null
2025-06-23	STU-PID: Steering Token Usage via PID Controller for Efficient Large Language Model Reasoning	Aryasomayajula Ram Bharadwaj et.al.	2506.18831	null
2025-06-23	Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories	Islem Bouzenia et.al.	2506.18824	null
2025-06-23	RWESummary: A Framework and Test for Choosing Large Language Models to Summarize Real-World Evidence (RWE) Studies	Arjun Mukerji et.al.	2506.18819	null
2025-06-23	Context-Aware CodeLLM Eviction for AI-assisted Coding	Kishanthan Thangarajah et.al.	2506.18796	null
2025-06-23	3D Arena: An Open Platform for Generative 3D Evaluation	Dylan Ebert et.al.	2506.18787	null
2025-06-23	TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation	Kamil Szczepanik et.al.	2506.18783	null
2025-06-23	Existing LLMs Are Not Self-Consistent For Simple Tasks	Zhenru Lin et.al.	2506.18781	null
2025-06-23	DefFusionNet: Learning Multimodal Goal Shapes for Deformable Object Manipulation via a Diffusion-based Probabilistic Model	Bao Thach et.al.	2506.18779	null
2025-06-23	Programming by Backprop: LLMs Acquire Reusable Algorithmic Abstractions During Code Training	Jonathan Cook et.al.	2506.18777	null
2025-06-23	ContinualFlow: Learning and Unlearning with Neural Flow Matching	Lorenzo Simone et.al.	2506.18747	null
2025-06-23	Towards Group Fairness with Multiple Sensitive Attributes in Federated Foundation Models	Yuning Yang et.al.	2506.18732	null
2025-06-23	MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners	Fang-Duo Tsai et.al.	2506.18729	null
2025-06-23	PARALLELPROMPT: Extracting Parallelism from Large Language Model Queries	Steven Kolawole et.al.	2506.18728	null
2025-06-23	LLM-enhanced Interactions in Human-Robot Collaborative Drawing with Older Adults	Marianne Bossema et.al.	2506.18711	null
2025-06-23	Benchmarking the Pedagogical Knowledge of Large Language Models	Maxime Lelièvre et.al.	2506.18710	null
2025-06-23	Matrix-Game: Interactive World Foundation Model	Yifan Zhang et.al.	2506.18701	null
2025-06-23	MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation	Tianchen Deng et.al.	2506.18678	null
2025-06-23	Is There a Case for Conversation Optimized Tokenizers in Large Language Models?	Raquel Ferrando et.al.	2506.18674	null
2025-06-23	Harnessing the Power of Reinforcement Learning for Language-Model-Based Information Retriever via Query-Document Co-Augmentation	Jingming Liu et.al.	2506.18670	null
2025-06-23	Benchmarking histopathology foundation models in a multi-center dataset for skin cancer subtyping	Pablo Meseguer et.al.	2506.18668	null
2025-06-23	A Random Matrix Analysis of In-context Memorization for Nonlinear Attention	Zhenyu Liao et.al.	2506.18656	null
2025-06-23	ReDit: Reward Dithering for Improved LLM Policy Optimization	Chenxing Wei et.al.	2506.18631	null
2025-06-23	AggTruth: Contextual Hallucination Detection using Aggregated Attention Scores in LLMs	Piotr Matys et.al.	2506.18628	null
2025-06-23	The Anatomy of Speech Persuasion: Linguistic Shifts in LLM-Modified Speeches	Alisa Barkar et.al.	2506.18621	null
2025-06-23	Simulation-Free Differential Dynamics through Neural Conservation Laws	Mengjian Hua et.al.	2506.18604	null
2025-06-23	BulletGen: Improving 4D Reconstruction with Bullet-Time Generation	Denys Rozumnyi et.al.	2506.18601	null
2025-06-23	Reply to “Emergent LLM behaviors are observationally equivalent to data leakage”	Ariel Flint Ashery et.al.	2506.18600	null
2025-06-23	No Training Wheels: Steering Vectors for Bias Correction at Inference Time	Aviral Gupta et.al.	2506.18598	null
2025-06-23	Search for dark-matter axions beyond the quantum limit: the Cosmological Axion Sarov Haloscope (CASH) proposal	Andrey L. Pankratov et.al.	2506.18595	null
2025-06-23	Parallel Continuous Chain-of-Thought with Jacobi Iteration	Haoyi Wu et.al.	2506.18582	null
2025-06-23	VisualChef: Generating Visual Aids in Cooking via Mask Inpainting	Oleh Kuzyk et.al.	2506.18569	null
2025-06-23	VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning	Xuanyu Zhang et.al.	2506.18564	null
2025-06-23	T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent	Hong Qing Yu et.al.	2506.18559	null
2025-06-23	Object-aware Sound Source Localization via Audio-Visual Scene Understanding	Sung Jin Um et.al.	2506.18557	null
2025-06-23	Security Assessment of DeepSeek and GPT Series Models against Jailbreak Attacks	Xiaodong Wu et.al.	2506.18543	null
2025-06-23	End-to-End Spoken Grammatical Error Correction	Mengjie Qian et.al.	2506.18532	null
2025-06-23	Auto-Regressively Generating Multi-View Consistent Images	JiaKui Hu et.al.	2506.18527	null
2025-06-23	MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis	Yuting Zhang et.al.	2506.18512	null
2025-06-23	Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance	Yu Han et.al.	2506.18511	null
2025-06-23	Smooth Operators: LLMs Translating Imperfect Hints into Disfluency-Rich Transcripts	Duygu Altinok et.al.	2506.18510	null
2025-06-23	Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey	Xinyao Li et.al.	2506.18504	null
2025-06-23	Comparative Evaluation of ChatGPT and DeepSeek Across Key NLP Tasks: Strengths, Weaknesses, and Domain-Specific Performance	Wael Etaiwi et.al.	2506.18501	null
2025-06-23	Leveraging neural network interatomic potentials for a foundation model of chemistry	So Yeon Kim et.al.	2506.18497	null
2025-06-23	MeRF: Motivation-enhanced Reinforcement Finetuning for Large Reasoning Models	Junjie Zhang et.al.	2506.18485	null
2025-06-23	GANs vs. Diffusion Models for virtual staining with the HER2match dataset	Pascal Klöckner et.al.	2506.18484	null
2025-06-23	AViLA: Asynchronous Vision-Language Agent for Streaming Multimodal Data Interaction	Gengyuan Zhang et.al.	2506.18472	null
2025-06-23	CODS: A Theoretical Model for Computational Design Based on Design Space	Nan Cao et.al.	2506.18455	null
2025-06-23	Benchmarking Foundation Models and Parameter-Efficient Fine-Tuning for Prognosis Prediction in Medical Imaging	Filippo Ruffini et.al.	2506.18434	null
2025-06-23	A Large Language Model-based Multi-Agent Framework for Analog Circuits’ Sizing Relationships Extraction	Chengjie Liu et.al.	2506.18424	null
2025-06-23	TReB: A Comprehensive Benchmark for Evaluating Table Reasoning Capabilities of Large Language Models	Ce Li et.al.	2506.18421	null
2025-06-23	Generative Diffusion Receivers: Achieving Pilot-Efficient MIMO-OFDM Communications	Yuzhi Yang et.al.	2506.18419	null
2025-06-23	What You Think Is What You Get: Bridge User Intent and Transfer Function Design through Multimodal Large Language Models	Yiyao Wang et.al.	2506.18407	null
2025-06-23	SafeClick: Error-Tolerant Interactive Segmentation of Any Medical Volumes via Hierarchical Expert Consensus	Yifan Gao et.al.	2506.18404	null
2025-06-23	Tracing Errors, Constructing Fixes: Repository-Level Memory Error Repair via Typestate-Guided Context Retrieval	Xiao Cheng et.al.	2506.18394	null
2025-06-23	RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models	Yeongtak Oh et.al.	2506.18369	null
2025-06-23	Recipe for Discovery: A Framework for Systematic Open Source Project Identification	Juanita Gomez et.al.	2506.18359	null
2025-06-23	SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation	Zichong Li et.al.	2506.18349	null
2025-06-23	Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team	Weilun Yu et.al.	2506.18348	null
2025-06-23	Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs	Kang Chen et.al.	2506.18341	null
2025-06-23	Controlled Generation with Equivariant Variational Flow Matching	Floor Eijkelboom et.al.	2506.18340	null
2025-06-23	Geometry-Aware Preference Learning for 3D Texture Generation	AmirHossein Zamani et.al.	2506.18331	null
2025-06-23	Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning	Lixin Wu et.al.	2506.18330	null
2025-06-23	NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation	Yu Xie et.al.	2506.18325	null
2025-06-23	Team LA at SCIDOCA shared task 2025: Citation Discovery via relation-based zero-shot retrieval	Trieu An et.al.	2506.18316	null
2025-06-23	Use Property-Based Testing to Bridge LLM Code Generation and Validation	Lehan He et.al.	2506.18315	null
2025-06-23	BrainSymphony: A Transformer-Driven Fusion of fMRI Time Series and Structural Connectivity	Moein Khajehnejad et.al.	2506.18314	null
2025-06-23	Large-Scale Training Data Attribution for Music Generative Models via Unlearning	Woosung Choi et.al.	2506.18312	null
2025-06-23	Enhancing Document Retrieval in COVID-19 Research: Leveraging Large Language Models for Hidden Relation Extraction	Hoang-An Trieu et.al.	2506.18311	null
2025-06-23	LettinGo: Explore User Profile Generation for Recommendation System	Lu Wang et.al.	2506.18309	null
2025-06-23	LLM-Integrated Digital Twins for Hierarchical Resource Allocation in 6G Networks	Majumder Haider et.al.	2506.18293	null
2025-06-23	Learning Causal Graphs at Scale: A Foundation Model Approach	Naiyu Yin et.al.	2506.18285	null
2025-06-23	Leveraging Large Language Models for Information Verification – an Engineering Approach	Nguyen Nang Hung et.al.	2506.18274	null
2025-06-23	Memory-Augmented Architecture for Long-Term Context Handling in Large Language Models	Haseeb Ullah Khan Shinwari et.al.	2506.18271	null
2025-06-23	Co-persona: Leveraging LLMs and Expert Collaboration to Understand User Personas through Social Media Data Analysis	Min Yin et.al.	2506.18269	null
2025-06-23	ARD-LoRA: Dynamic Rank Allocation for Parameter-Efficient Fine-Tuning of Foundation Models with Heterogeneous Adaptation Needs	Haseeb Ullah Khan Shinwari et.al.	2506.18267	null
2025-06-23	YouTube-Occ: Learning Indoor 3D Semantic Occupancy Prediction from YouTube Videos	Haoming Chen et.al.	2506.18266	null
2025-06-23	Advanced For-Loop for QML algorithm search	FuTe Wong et.al.	2506.18260	null
2025-06-23	TableVault: Managing Dynamic Data Collections for LLM-Augmented Workflows	Jinjin Zhao et.al.	2506.18257	null
2025-06-23	Semantic Structure-Aware Generative Attacks for Enhanced Adversarial Transferability	Jongoh Jeong et.al.	2506.18248	null
2025-06-23	Smart-LLaMA-DPO: Reinforced Large Language Model for Explainable Smart Contract Vulnerability Detection	Lei Yu et.al.	2506.18245	null
2025-06-23	The 4th Dimension for Scaling Model Size	Ruike Zhu et.al.	2506.18233	null
2025-06-23	Exact Conditional Score-Guided Generative Modeling for Amortized Inference in Uncertainty Quantification	Zezhong Zhang et.al.	2506.18227	null
2025-06-23	Make It Efficient: Dynamic Sparse Attention for Autoregressive Image Generation	Xunzhi Xiang et.al.	2506.18226	null
2025-06-22	Deciphering Emotions in Children Storybooks: A Comparative Analysis of Multimodal LLMs in Educational Applications	Bushra Asseri et.al.	2506.18201	null
2025-06-22	Prompt Engineering Techniques for Mitigating Cultural Bias Against Arabs and Muslims in Large Language Models: A Systematic Review	Bushra Asseri et.al.	2506.18199	null
2025-06-22	Integrating LLMs and Digital Twins for Adaptive Multi-Robot Task Allocation in Construction	Min Deng et.al.	2506.18178	null
2025-06-22	Understanding Reasoning in Thinking Language Models via Steering Vectors	Constantin Venhoff et.al.	2506.18167	null
2025-06-22	Chain-of-Memory: Enhancing GUI Agents for Cross-Application Navigation	Xinzge Gao et.al.	2506.18158	null
2025-06-22	AI Through the Human Lens: Investigating Cognitive Theories in Machine Psychology	Akash Kundu et.al.	2506.18156	null
2025-06-20	VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning	Zhangyang Qi et.al.	2506.17221	null
2025-06-20	No Free Lunch: Rethinking Internal Feedback for LLM Reasoning	Yanzhi Zhang et.al.	2506.17219	null
2025-06-20	Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency	Kathleen C. Fraser et.al.	2506.17209	null
2025-06-20	Dissecting the SWE-Bench Leaderboards: Profiling Submitters and Architectures of LLM- and Agent-Based Repair Systems	Matias Martinez et.al.	2506.17208	null
2025-06-20	DreamCube: 3D Panorama Generation via Multi-plane Synchronization	Yukun Huang et.al.	2506.17206	null
2025-06-20	Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction	Jiekai Ma et.al.	2506.17203	null
2025-06-20	Dex1B: Learning with 1B Demonstrations for Dexterous Manipulation	Jianglong Ye et.al.	2506.17198	null
2025-06-20	Schrödinger Bridge Matching for Tree-Structured Costs and Entropic Wasserstein Barycentres	Samuel Howard et.al.	2506.17197	null
2025-06-20	Detecting LLM-Generated Short Answers and Effects on Learner Performance	Shambhavi Bhushan et.al.	2506.17196	link
2025-06-20	Deep generative models as the probability transformation functions	Vitalii Bondar et.al.	2506.17171	null
2025-06-20	The MedPerturb Dataset: What Non-Content Perturbations Reveal About Human and Clinical LLM Decision Making	Abinitha Gourabathina et.al.	2506.17163	null
2025-06-20	Do We Need Large VLMs for Spotting Soccer Actions?	Ritabrata Chakraborty et.al.	2506.17144	null
2025-06-20	MeDi: Metadata-Guided Diffusion Models for Mitigating Biases in Tumor Classification	David Jacob Drexlin et.al.	2506.17140	null
2025-06-20	Large Language Model Unlearning for Source Code	Xue Jiang et.al.	2506.17125	null
2025-06-20	When Can Model-Free Reinforcement Learning be Enough for Thinking?	Josiah P. Hanna et.al.	2506.17124	null
2025-06-20	Are Bias Evaluation Methods Biased?	Lina Berrayana et.al.	2506.17111	null
2025-06-20	Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving	Chuxue Cao et.al.	2506.17104	null
2025-06-20	Chain-of-Thought Prompting Obscures Hallucination Cues in Large Language Models: An Empirical Evaluation	Jiahao Cheng et.al.	2506.17088	null
2025-06-20	Empowering Near-Field Communications in Low-Altitude Economy with LLM: Fundamentals, Potentials, Solutions, and Future Directions	Zhuo Xu et.al.	2506.17067	null
2025-06-20	Generative Modeling of Full-Atom Protein Conformations using Latent Diffusion on Graph Embeddings	Aditya Sengar et.al.	2506.17064	link
2025-06-20	Universal Music Representations? Evaluating Foundation Models on World Music Corpora	Charilaos Papaioannou et.al.	2506.17055	link
2025-06-20	MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models	Xiaolong Wang et.al.	2506.17046	null
2025-06-20	The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation	Giulia Bertazzini et.al.	2506.17016	null
2025-06-20	LLM-Generated Feedback Supports Learning If Learners Choose to Use It	Danielle R. Thomas et.al.	2506.17006	link
2025-06-20	PersonalAI: Towards digital twins in the graph form	Mikhail Menschikov et.al.	2506.17001	null
2025-06-20	TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMs	Sahil Kale et.al.	2506.16990	link
2025-06-20	SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization	Hao Zhang et.al.	2506.16981	null
2025-06-20	Latent Concept Disentanglement in Transformer-based Language Models	Guan Zhe Hong et.al.	2506.16975	null
2025-06-20	MM-AttacKG: A Multimodal Approach to Attack Graph Construction with Large Language Models	Yongheng Zhang et.al.	2506.16968	null
2025-06-20	Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs	Haoran Sun et.al.	2506.16962	link
2025-06-20	Reversing Flow for Image Restoration	Haina Qin et.al.	2506.16961	null
2025-06-20	From Data to Knowledge: Evaluating How Efficiently Language Models Learn Facts	Daniel Christoph et.al.	2506.16912	link
2025-06-20	Towards Effective Complementary Security Analysis using Large Language Models	Jonas Wagner et.al.	2506.16899	null
2025-06-20	AI’s Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario	Ciro Beneduce et.al.	2506.16898	null
2025-06-20	With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You	Fabian Gröger et.al.	2506.16895	null
2025-06-20	Multi-Objective Recommendation in the Era of Generative AI: A Survey of Recent Progress and Future Prospects	Zihan Hong et.al.	2506.16893	null
2025-06-20	Revolutionizing Validation and Verification: Explainable Testing Methodologies for Intelligent Automotive Decision-Making Systems	Halit Eris et.al.	2506.16876	null
2025-06-20	Reward-Agnostic Prompt Optimization for Text-to-Image Diffusion Models	Semin Kim et.al.	2506.16853	link
2025-06-20	CosmoGLINT: Cosmological Generative Model for Line Intensity Mapping with Transformer	Kana Moriwaki et.al.	2506.16843	link
2025-06-20	elicito: A Python Package for Expert Prior Elicitation	Florence Bockting et.al.	2506.16830	null
2025-06-20	Beyond Blur: A Fluid Perspective on Generative Diffusion Models	Grzegorz Gruszczynski et.al.	2506.16827	null
2025-06-20	Predicting New Research Directions in Materials Science using Large Language Models and Concept Graphs	Thomas Marwitz et.al.	2506.16824	null
2025-06-20	Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection	Yuchu Jiang et.al.	2506.16819	link
2025-06-20	FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation	Fan Yang et.al.	2506.16806	null
2025-06-20	Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation	Riccardo Corvi et.al.	2506.16802	null
2025-06-20	RealSR-R1: Reinforcement Learning for Real-World Image Super-Resolution with Vision-Language Chain-of-Thought	Junbo Qiao et.al.	2506.16796	link
2025-06-20	MIST: Jailbreaking Black-box Large Language Models via Iterative Semantic Tuning	Muyang Zheng et.al.	2506.16792	null
2025-06-20	TabArena: A Living Benchmark for Machine Learning on Tabular Data	Nick Erickson et.al.	2506.16791	null
2025-06-20	Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps	Jiashun Cheng et.al.	2506.16787	null
2025-06-20	DistillNote: LLM-based clinical note summaries improve heart failure diagnosis	Heloisa Oss Boll et.al.	2506.16777	null
2025-06-20	eSapiens: A Real-World NLP Framework for Multimodal Document Understanding and Enterprise Knowledge Processing	Isaac Shi et.al.	2506.16768	null
2025-06-20	SocialSim: Towards Socialized Simulation of Emotional Support Conversation	Zhuang Chen et.al.	2506.16756	null
2025-06-20	An improved example for an autoconvolution inequality	Christopher Boyer et.al.	2506.16750	link
2025-06-20	The Role of Model Confidence on Bias Effects in Measured Uncertainties	Xinyi Liu et.al.	2506.16724	null
2025-06-20	Large Language Models as Psychological Simulators: A Methodological Guide	Zhicheng Lin et.al.	2506.16702	null
2025-06-20	Exploring Traffic Simulation and Cybersecurity Strategies Using Large Language Models	Lu Gao et.al.	2506.16699	null
2025-06-20	From Prompts to Constructs: A Dual-Validity Framework for LLM Research in Psychology	Zhicheng Lin et.al.	2506.16697	null
2025-06-20	LegiGPT: Party Politics and Transport Policy with Large Language Model	Hyunsoo Yun et.al.	2506.16692	null
2025-06-20	LaVi: Efficient Large Vision-Language Models via Internal Feature Modulation	Tongtian Yue et.al.	2506.16691	null
2025-06-20	Fast and Stable Diffusion Planning through Variational Adaptive Weighting	Zhiying Qiu et.al.	2506.16688	null
2025-06-20	Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations	Ananth Agarwal et.al.	2506.16678	null
2025-06-20	A Minimalist Optimizer Design for LLM Pretraining	Athanasios Glentis et.al.	2506.16659	link
2025-06-20	Mesh-Informed Neural Operator: A Transformer Generative Approach	Yaozhong Shi et.al.	2506.16656	link
2025-06-19	Arch-Router: Aligning LLM Routing with Human Preferences	Co Tran et.al.	2506.16655	null
2025-06-19	Relational Deep Learning: Challenges, Foundations and Next-Generation Architectures	Vijay Prakash Dwivedi et.al.	2506.16654	null
2025-06-19	LLMs in Coding and their Impact on the Commercial Software Engineering Landscape	Vladislav Belozerov et.al.	2506.16653	null
2025-06-19	SemAgent: A Semantics Aware Program Repair Agent	Anvith Pabba et.al.	2506.16650	null
2025-06-19	Semantic Outlier Removal with Embedding Models and LLMs	Eren Akbiyik et.al.	2506.16644	null
2025-06-19	LLM-based Satisfiability Checking of String Requirements by Consistent Data and Checker Generation	Boqi Chen et.al.	2506.16639	null
2025-06-19	Latent Noise Injection for Private and Statistically Aligned Synthetic Data Generation	Rex Shen et.al.	2506.16636	null
2025-06-19	Initial Investigation of LLM-Assisted Development of Rule-Based Clinical NLP System	Jianlin Shi et.al.	2506.16628	null
2025-06-19	LDI: Localized Data Imputation	Soroush Omidvartehrani et.al.	2506.16616	link
2025-06-19	Aethorix v1.0: AI-Driven Inverse Design of Inorganic Materials for Scalable Industrial Innovation	Yingjie Shi et.al.	2506.16609	null
2025-06-19	FLAME: Towards Federated Fine-Tuning Large Language Models Through Adaptive SMoE	Khiem Le et.al.	2506.16600	null
2025-06-19	A Community-driven vision for a new Knowledge Resource for AI	Vinay K Chaudhri et.al.	2506.16596	null
2025-06-19	A Scoping Review of Synthetic Data Generation for Biomedical Research and Applications	Hanshu Rao et.al.	2506.16594	null
2025-06-19	AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions	Ihor Pysmennyi et.al.	2506.16586	link
2025-06-19	Measuring (a Sufficient) World Model in LLMs: A Variance Decomposition Framework	Nadav Kunievsky et.al.	2506.16584	null
2025-06-19	SafeTriage: Facial Video De-identification for Privacy-Preserving Stroke Triage	Tongan Cai et.al.	2506.16578	null
2025-06-19	Advancing Harmful Content Detection in Organizational Research: Integrating Large Language Models with Elo Rating System	Mustafa Akben et.al.	2506.16575	null
2025-06-19	Weight Factorization and Centralization for Continual Learning in Speech Recognition	Enes Yavuz Ugan et.al.	2506.16574	null
2025-06-19	Capturing Visualization Design Rationale	Maeve Hutchinson et.al.	2506.16571	null
2025-06-19	A Free Probabilistic Framework for Analyzing the Transformer-based Language Models	Swagatam Das et.al.	2506.16550	null
2025-06-19	Mr. Snuffleupagus at SemEval-2025 Task 4: Unlearning Factual Knowledge from LLMs Using Adaptive RMU	Arjun Dosajh et.al.	2506.16548	null
2025-06-19	Robust Reward Modeling via Causal Rubrics	Pragya Srivastava et.al.	2506.16507	null
2025-06-19	Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details	Zeqiang Lai et.al.	2506.16504	link
2025-06-19	Relic: Enhancing Reward Model Generalization for Low-Resource Indic Languages with Few-Shot Examples	Soumya Suvra Ghosal et.al.	2506.16502	null
2025-06-19	Grounding Language Models with Semantic Digital Twins for Robotic Planning	Mehreen Naeem et.al.	2506.16493	null
2025-06-19	Do We Talk to Robots Like Therapists, and Do They Respond Accordingly? Language Alignment in AI Emotional Support	Sophie Chiang et.al.	2506.16473	null
2025-06-19	How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?	Giuseppe Lando et.al.	2506.16450	null
2025-06-19	Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models	Biao Yi et.al.	2506.16447	null
2025-06-19	StoryWriter: A Multi-Agent Framework for Long Story Generation	Haotian Xia et.al.	2506.16445	null
2025-06-19	REIS: A High-Performance and Energy-Efficient Retrieval System with In-Storage Processing	Kangqi Chen et.al.	2506.16444	null
2025-06-19	Evaluating the Use of LLMs for Documentation to Code Traceability	Ebube Alor et.al.	2506.16440	null
2025-06-19	Optimizing MoE Routers: Design, Implementation, and Evaluation in Transformer Models	Daniel Fidel Harvey et.al.	2506.16419	null
2025-06-19	Unpacking Generative AI in Education: Computational Modeling of Teacher and Student Perspectives in Social Media Discourse	Paulina DeVito et.al.	2506.16412	null
2025-06-19	When Does Divide and Conquer Work for Long Context LLM? A Noise Decomposition Framework	Zhen Xu et.al.	2506.16411	null
2025-06-19	Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights	Zhiyuan Liang et.al.	2506.16406	null
2025-06-19	Generating Directed Graphs with Dual Attention and Asymmetric Encoding	Alba Carballo-Castro et.al.	2506.16404	link
2025-06-19	NepaliGPT: A Generative Language Model for the Nepali Language	Shushanta Pudasaini et.al.	2506.16399	null
2025-06-18	Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards	Qingming Liu et.al.	2506.15684	null
2025-06-18	PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning	Yuhui Shi et.al.	2506.15683	null
2025-06-18	Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model	Anirud Aggarwal et.al.	2506.15682	link
2025-06-18	GenRecal: Generation after Recalibration from Large to Small Vision-Language Models	Byung-Kwan Lee et.al.	2506.15681	null
2025-06-18	SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence	Yao Zhang et.al.	2506.15672	null
2025-06-18	CC-LEARN: Cohort-based Consistency Learning	Xiao Ye et.al.	2506.15662	null
2025-06-18	PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection	Wenhao Li et.al.	2506.15656	null
2025-06-18	deepSURF: Detecting Memory Safety Vulnerabilities in Rust Through Fuzzing LLM-Augmented Harnesses	Georgios Androutsopoulos et.al.	2506.15648	null
2025-06-18	Demystifying the Visual Quality Paradox in Multimodal Large Language Models	Shuo Xing et.al.	2506.15645	null
2025-06-18	Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability	Yusuke Sakai et.al.	2506.15629	null
2025-06-18	The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games	Lyle Goodyear et.al.	2506.15624	null
2025-06-18	The Compositional Architecture of Regret in Large Language Models	Xiangxiang Cui et.al.	2506.15617	null
2025-06-18	BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion	Yuqing Lan et.al.	2506.15610	null
2025-06-18	LoX: Low-Rank Extrapolation Robustifies LLM Safety Against Fine-tuning	Gabriel J. Perin et.al.	2506.15606	link
2025-06-18	From Model to Classroom: Evaluating Generated MCQs for Portuguese with Narrative and Difficulty Concerns	Bernardo Leite et.al.	2506.15598	null
2025-06-18	LiteGD: Lightweight and dynamic GPU Dispatching for Large-scale Heterogeneous Clusters	Kunming Zhang et.al.	2506.15595	null
2025-06-18	One-Step Diffusion for Detail-Rich and Temporally Consistent Video Super-Resolution	Yujing Sun et.al.	2506.15591	link
2025-06-18	SciVer: Evaluating Foundation Models for Multimodal Scientific Claim Verification	Chengye Wang et.al.	2506.15569	link
2025-06-18	Gender Inclusivity Fairness Index (GIFI): A Multilevel Framework for Evaluating Gender Diversity in Large Language Models	Zhengyang Shan et.al.	2506.15568	link
2025-06-18	Managing Complex Failure Analysis Workflows with LLM-based Reasoning and Acting Agents	Aline Dobrovsky et.al.	2506.15567	null
2025-06-18	PredGen: Accelerated Inference of Large Language Models through Input-Time Speculation for Real-Time Speech Interaction	Shufan Li et.al.	2506.15556	null
2025-06-18	Diff-TONE: Timestep Optimization for iNstrument Editing in Text-to-Music Diffusion Models	Teysir Baoueb et.al.	2506.15530	null
2025-06-18	Lessons from Training Grounded LLMs with Verifiable Rewards	Shang Hong Sim et.al.	2506.15522	null
2025-06-18	RePCS: Diagnosing Data Memorization in LLM-Powered Retrieval-Augmented Generation	Le Vu Anh et.al.	2506.15513	null
2025-06-18	Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach	Wenqi Guan et.al.	2506.15512	null
2025-06-18	SPARE: Single-Pass Annotation with Reference-Guided Evaluation for Automatic Process Supervision and Reward Modelling	Md Imbesat Hassan Rizvi et.al.	2506.15498	link
2025-06-18	Context-Informed Grounding Supervision	Hyunji Lee et.al.	2506.15480	link
2025-06-18	Creating User-steerable Projections with Interactive Semantic Mapping	Artur André Oliveira et.al.	2506.15479	null
2025-06-18	Multimodal Large Language Models for Medical Report Generation via Customized Prompt Tuning	Chunlei Li et.al.	2506.15477	null
2025-06-18	RE-IMAGINE: Symbolic Benchmark Synthesis for Reasoning Evaluation	Xinnuo Xu et.al.	2506.15455	null
2025-06-18	Uncovering Intention through LLM-Driven Code Snippet Description Generation	Yusuf Sulistyo Nugroho et.al.	2506.15453	null
2025-06-18	AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need	Zhouhong Gu et.al.	2506.15451	link
2025-06-18	Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material	Team Hunyuan3D et.al.	2506.15442	link
2025-06-18	Understanding GUI Agent Localization Biases through Logit Sharpness	Xingjian Tao et.al.	2506.15425	null
2025-06-18	Targeted Lexical Injection: Unlocking Latent Cross-Lingual Alignment in Lugha-Llama via Early-Layer LoRA Fine-Tuning	Stanley Ngugi et.al.	2506.15415	null
2025-06-18	Provable Maximum Entropy Manifold Exploration via Diffusion Models	Riccardo De Santi et.al.	2506.15385	null
2025-06-18	Sampling 3D Molecular Conformers with Diffusion Transformers	J. Thorben Frank et.al.	2506.15378	link
2025-06-18	SANSKRITI: A Comprehensive Benchmark for Evaluating Language Models’ Knowledge of Indian Culture	Arijit Maji et.al.	2506.15355	null
2025-06-18	DeVisE: Behavioral Testing of Medical Large Language Models	Camila Zurdo Tagliabue et.al.	2506.15339	null
2025-06-18	When and How Unlabeled Data Provably Improve In-Context Learning	Yingcong Li et.al.	2506.15329	null
2025-06-18	MapFM: Foundation Model-Driven HD Mapping with Multi-Task Contextual Learning	Leonid Ivanov et.al.	2506.15313	link
2025-06-18	SecFwT: Efficient Privacy-Preserving Fine-Tuning of Large Language Models Using Forward-Only Passes	Jinglong Luo et.al.	2506.15307	null
2025-06-18	Conditional Generative Modeling for Enhanced Credit Risk Management in Supply Chain Finance	Qingkai Zhang et.al.	2506.15305	null
2025-06-18	MEGC2025: Micro-Expression Grand Challenge on Spot Then Recognize and Visual Question Answering	Xinqi Fan et.al.	2506.15298	null
2025-06-18	Unlocking Post-hoc Dataset Inference with Synthetic Data	Bihe Zhao et.al.	2506.15271	null
2025-06-18	RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments	Yuchuan Fu et.al.	2506.15253	link
2025-06-18	Singular Value Decomposition on Kronecker Adaptation for Large Language Model	Yee Hin Chong et.al.	2506.15251	null
2025-06-18	TopClustRAG at SIGIR 2025 LiveRAG Challenge	Juli Bakagianni et.al.	2506.15246	null
2025-06-18	Research on Graph-Retrieval Augmented Generation Based on Historical Text Knowledge Graphs	Yang Fan et.al.	2506.15241	null
2025-06-18	Lost in Variation? Evaluating NLI Performance in Basque and Spanish Geographical Variants	Jaione Bengoetxea et.al.	2506.15239	null
2025-06-18	Large Language Models for Unit Testing: A Systematic Literature Review	Quanjun Zhang et.al.	2506.15227	null
2025-06-18	An efficient forgetting-aware fine-tuning framework for pretrained universal machine-learning interatomic potentials	Jisu Kim et.al.	2506.15223	link
2025-06-18	video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models	Changli Tang et.al.	2506.15220	null
2025-06-18	MinosEval: Distinguishing Factoid and Non-Factoid for Tailored Open-Ended QA Evaluation with LLMs	Yongqi Fan et.al.	2506.15215	link
2025-06-18	LLM vs. SAST: A Technical Analysis on Detecting Coding Bugs of GPT4-Advanced Data Analysis	Madjid G. Tehrani et.al.	2506.15212	null
2025-06-18	ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs	Feng He et.al.	2506.15211	null
2025-06-18	A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals	Andrea Cadeddu et.al.	2506.15208	null
2025-06-18	HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges	Xianliang Yang et.al.	2506.15196	link
2025-06-18	From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem	Yanxu Mao et.al.	2506.15170	null
2025-06-18	LLM Agent for Hyper-Parameter Optimization	Wanzhe Wang et.al.	2506.15167	null
2025-06-18	Robust Instant Policy: Leveraging Student’s t-Regression Model for Robust In-context Imitation Learning of Robot Manipulation	Hanbit Oh et.al.	2506.15157	null
2025-06-18	eLLM: Elastic Memory Management Framework for Efficient LLM Serving	Jiale Xu et.al.	2506.15155	null
2025-06-18	SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning	Anuradha Chopra et.al.	2506.15154	link
2025-06-18	Generative thermodynamic computing	Stephen Whitelam et.al.	2506.15121	null
2025-06-18	CKD-EHR: Clinical Knowledge Distillation for Electronic Health Records	Junke Wang et.al.	2506.15118	null
2025-06-18	CipherMind: The Longest Codebook in the World	Ming Nie et.al.	2506.15117	link
2025-06-18	Enhancement Report Approval Prediction: A Comparative Study of Large Language Models	Haosheng Zuo et.al.	2506.15098	link
2025-06-18	EMUSE: Evolutionary Map of the Universe Search Engine	Nikhel Gupta et.al.	2506.15090	link
2025-06-18	EmojiVoice: Towards long-term controllable expressivity in robot speech	Paige Tuttösí et.al.	2506.15085	null
2025-06-18	Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks	Yimian Ding et.al.	2506.15082	null
2025-06-18	Learning-Time Encoding Shapes Unlearning in LLMs	Ruihan Wu et.al.	2506.15076	link
2025-06-18	ChatModel: Automating Reference Model Design and Verification with LLMs	Jianmin Ye et.al.	2506.15066	null
2025-06-18	HEAL: An Empirical Study on Hallucinations in Embodied Agents Driven by Large Language Models	Trishna Chakraborty et.al.	2506.15065	null
2025-06-18	Truncated Proximal Policy Optimization	Tiantian Fan et.al.	2506.15050	null
2025-06-18	Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer’s and Dementia Caregivers	Jiayue Melissa Shi et.al.	2506.15047	null
2025-06-18	Identifying economic narratives in large text corpora – An integrated approach using Large Language Models	Tobias Schmidt et.al.	2506.15041	null
2025-06-18	Break Stylistic Sophon: Are We Really Meant to Confine the Imagination in Style Transfer?	Gary Song Yan et.al.	2506.15033	null
2025-06-17	Optimal Embedding Learning Rate in LLMs: The Effect of Vocabulary Size	Soufiane Hayou et.al.	2506.15025	null
2025-06-17	SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models	Gyuhak Kim et.al.	2506.15021	null
2025-06-17	Scaling Intelligence: Designing Data Centers for Next-Gen Language Models	Jesmin Jahan Tithi et.al.	2506.15006	null
2025-06-17	Memory Tokens: Large Language Models Can Generate Reversible Sentence Embeddings	Ignacio Sastre et.al.	2506.15001	link
2025-06-17	Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings	Harbin Hong et.al.	2506.14997	null
2025-06-17	Early Prediction of Multiple Sclerosis Disability Progression via Multimodal Foundation Model Benchmarks	Maxime Usdin et.al.	2506.14986	null
2025-06-17	Thinking in Directivity: Speech Large Language Model for Multi-Talker Directional Speech Recognition	Jiamin Xie et.al.	2506.14973	null
2025-06-17	FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization	Rajat Kumar Jenamani et.al.	2506.14968	null
2025-06-17	Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective	Zhoujun Cheng et.al.	2506.14965	link
2025-06-17	POCO: Scalable Neural Forecasting through Population Conditioning	Yu Duan et.al.	2506.14957	link
2025-06-17	From Chat to Checkup: Can Large Language Models Assist in Diabetes Prediction?	Shadman Sakib et.al.	2506.14949	link
2025-06-17	Structured Moral Reasoning in Language Models: A Value-Grounded Evaluation Framework	Mohna Chakraborty et.al.	2506.14948	null
2025-06-17	FedOne: Query-Efficient Federated Learning for Black-box Discrete Prompt Learning	Ganyu Wang et.al.	2506.14929	link
2025-06-17	FORTRESS: Frontier Risk Evaluation for National Security and Public Safety	Christina Q. Knight et.al.	2506.14922	null
2025-06-17	Recursive Variational Autoencoders for 3D Blood Vessel Generative Modeling	Paula Feldman et.al.	2506.14914	null
2025-06-17	Winter Soldier: Backdooring Language Models at Pre-Training with Indirect Data Poisoning	Wassim Bouaziz et.al.	2506.14913	null
2025-06-17	CrEst: Credibility Estimation for Contexts in LLMs via Weak Supervision	Dyah Adila et.al.	2506.14912	null
2025-06-17	Foundation Artificial Intelligence Models for Health Recognition Using Face Photographs (FAHR-Face)	Fridolin Haugg et.al.	2506.14909	null
2025-06-17	DETONATE: A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization	Renjith Prasad et.al.	2506.14903	null
2025-06-17	A Variational Framework for Improving Naturalness in Generative Spoken Language Models	Li-Wei Chen et.al.	2506.14767	link
2025-06-17	ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM	Yujun Wang et.al.	2506.14766	null
2025-06-17	Scaling-Up the Pretraining of the Earth Observation Foundation Model PhilEO to the MajorTOM Dataset	Nikolaos Dionelis et.al.	2506.14765	link
2025-06-17	Large Language Models – the Future of Fundamental Physics?	Caroline Heneka et.al.	2506.14757	null
2025-06-17	Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs	Ring Team et.al.	2506.14731	null
2025-06-17	AgentDistill: Training-Free Agent Distillation with Generalizable MCP Boxes	Jiahao Qiu et.al.	2506.14728	null
2025-06-17	Adaptive Accompaniment with ReaLchords	Yusong Wu et.al.	2506.14723	null
2025-06-17	Capacity Matters: a Proof-of-Concept for Transformer Memorization on Real-World Data	Anton Changalidis et.al.	2506.14704	link
2025-06-17	Unified Software Engineering agent as AI Software Engineer	Leonhard Applis et.al.	2506.14683	null
2025-06-17	AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models	Ads Dawson et.al.	2506.14682	link
2025-06-17	Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality	Yuto Harada et.al.	2506.14681	null
2025-06-17	GuiLoMo: Allocating Expert Number and Rank for LoRA-MoE via Bilevel Optimization with GuidedSelection Vectors	Hengyuan Zhang et.al.	2506.14646	link
2025-06-17	Passing the Turing Test in Political Discourse: Fine-Tuning LLMs to Mimic Polarized Social Media Comments	. Pazzaglia et.al.	2506.14645	null
2025-06-17	Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot	Xiang Cheng et.al.	2506.14641	null
2025-06-17	AIn’t Nothing But a Survey? Using Large Language Models for Coding German Open-Ended Survey Responses on Survey Motivation	Leah von der Heyde et.al.	2506.14634	null
2025-06-17	ACM Survey Draft on Formalising Software Requirements with Large Language Models	Arshad Beg et.al.	2506.14627	null
2025-06-17	Probabilistic Aggregation and Targeted Embedding Optimization for Collective Moral Reasoning in Large Language Models	Chenchen Yuan et.al.	2506.14625	link
2025-06-17	Exploring MLLMs Perception of Network Visualization Principles	Jacob Miller et.al.	2506.14611	null
2025-06-17	Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees	Ahmed Heakl et.al.	2506.14606	null
2025-06-17	Align Your Flow: Scaling Continuous-Time Flow Map Distillation	Amirmojtaba Sabour et.al.	2506.14603	null
2025-06-17	NetRoller: Interfacing General and Specialized Models for End-to-End Autonomous Driving	Ren Xin et.al.	2506.14589	link
2025-06-17	GenerationPrograms: Fine-grained Attribution with Executable Programs	David Wan et.al.	2506.14580	link
2025-06-17	TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization	Mingkang Zhu et.al.	2506.14574	null
2025-06-17	From Points to Places: Towards Human Mobility-Driven Spatiotemporal Foundation Models via Understanding Places	Mohammad Hashemi et.al.	2506.14570	null
2025-06-17	AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs	Di He et.al.	2506.14562	link
2025-06-17	Doppelgänger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack	Daewon Kang et.al.	2506.14539	null
2025-06-17	Automatic Qiskit Code Refactoring Using Large Language Models	José Manuel Suárez et.al.	2506.14535	null
2025-06-17	M2BeamLLM: Multimodal Sensing-empowered mmWave Beam Prediction with Large Language Models	Can Zheng et.al.	2506.14532	null
2025-06-17	Sharp Generalization Bounds for Foundation Models with Asymmetric Randomized Low-Rank Adapters	Anastasis Kratsios et.al.	2506.14530	null
2025-06-17	Automated Decision-Making on Networks with LLMs through Knowledge-Guided Evolution	Xiaohan Zheng et.al.	2506.14529	null
2025-06-17	RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge	Kun Ran et.al.	2506.14516	link
2025-06-17	SIRI-Bench: Challenging VLMs’ Spatial Intelligence through Complex Reasoning Tasks	Zijian Song et.al.	2506.14512	null
2025-06-17	Can Pretrained Vision-Language Embeddings Alone Guide Robot Navigation?	Nitesh Subedi et.al.	2506.14507	link
2025-06-17	LLM-Powered Swarms: A New Frontier or a Conceptual Stretch?	Muhammad Atta Ur Rahman et.al.	2506.14496	null
2025-06-17	LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops	Jiyuan Fu et.al.	2506.14493	null
2025-06-17	Reimagining Target-Aware Molecular Generation through Retrieval-Enhanced Aligned Diffusion	Dong Xu et.al.	2506.14488	null
2025-06-17	SimSpark: Interactive Simulation of Social Media Behaviors	Ziyue Lin et.al.	2506.14476	null
2025-06-17	LexiMark: Robust Watermarking via Lexical Substitutions to Enhance Membership Verification of an LLM’s Textual Training Data	Eyal German et.al.	2506.14474	link
2025-06-17	Foundation Model Insights and a Multi-Model Approach for Superior Fine-Grained One-shot Subset Selection	Zhijing Wan et.al.	2506.14473	link
2025-06-17	Dense360: Dense Understanding from Omnidirectional Panoramas	Yikang Zhou et.al.	2506.14471	null
2025-06-17	MalGuard: Towards Real-Time, Accurate, and Actionable Detection of Malicious Packages in PyPI Ecosystem	Xingan Gao et.al.	2506.14466	null
2025-06-17	Active Digital Twins via Active Inference	Matteo Torzoni et.al.	2506.14453	null
2025-06-17	How Far Can LLMs Improve from Experience? Measuring Test-Time Learning Ability in LLMs with Human Comparison	Jiayin Wang et.al.	2506.14448	link
2025-06-17	Vela: Scalable Embeddings with Voice Large Language Models for Multimodal Retrieval	Ruofan Hu et.al.	2506.14445	null
2025-06-17	MoORE: SVD-based Model MoE-ization for Conflict- and Oblivion-Resistant Multi-Task Adaptation	Shen Yuan et.al.	2506.14436	link
2025-06-17	M3SD: Multi-modal, Multi-scenario and Multi-language Speaker Diarization Dataset	Shilong Wu et.al.	2506.14427	null
2025-06-17	RAGtifier: Evaluating RAG Generation Approaches of State-of-the-Art RAG Systems for the SIGIR LiveRAG Competition	Tim Cofala et.al.	2506.14412	null
2025-06-17	Thunder-NUBench: A Benchmark for LLMs’ Sentence-Level Negation Understanding	Yeonkyoung So et.al.	2506.14397	null
2025-06-17	Don’t Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning	William F. Shen et.al.	2506.14387	null
2025-06-17	Excessive Reasoning Attack on Reasoning LLMs	Wai Man Si et.al.	2506.14374	null
2025-06-17	ELLIS Alicante at CQs-Gen 2025: Winning the critical thinking questions shared task: LLM-based question generation and selection	Lucile Favero et.al.	2506.14371	null
2025-06-17	EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization	Xiaoqi Wang et.al.	2506.14356	link
2025-06-17	A Vision for Geo-Temporal Deep Research Systems: Towards Comprehensive, Transparent, and Reproducible Geo-Temporal Information Synthesis	Bruno Martins et.al.	2506.14345	null
2025-06-17	LLM-Powered Intent-Based Categorization of Phishing Emails	Even Eilertsen et.al.	2506.14337	null
2025-06-17	AviationLLM: An LLM-based Knowledge System for Aviation Training	Jia’ang Wan et.al.	2506.14336	null
2025-06-17	ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies	Jinyan Yuan et.al.	2506.14315	null
2025-06-17	Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent	Xueyang Feng et.al.	2506.14302	null
2025-06-17	ADRD: LLM-Driven Autonomous Driving Based on Rule-based Decision Systems	Fanzhi Zeng et.al.	2506.14299	null
2025-06-17	Quality Assessment of Python Tests Generated by Large Language Models	Victor Alves et.al.	2506.14297	link
2025-06-17	Equivariance Everywhere All At Once: A Recipe for Graph Foundation Models	Ben Finkelshtein et.al.	2506.14291	link
2025-06-17	Large Language Model Empowered Design of Fluid Antenna Systems: Challenges, Frameworks, and Case Studies for 6G	Chao Wang et.al.	2506.14288	null
2025-06-17	From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents	Seongbo Jang et.al.	2506.14285	link
2025-06-17	Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment	Weiming Zhang et.al.	2506.14271	null
2025-06-17	RL-Obfuscation: Can Language Models Learn to Evade Latent-Space Monitors?	Rohan Gupta et.al.	2506.14261	link
2025-06-17	Re-Initialization Token Learning for Tool-Augmented Large Language Models	Chenghao Li et.al.	2506.14248	link
2025-06-17	Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs	Xumeng Wen et.al.	2506.14245	null
2025-06-17	Causes in neuron diagrams, and testing causal reasoning in Large Language Models. A glimpse of the future of philosophy?	Louis Vervoort et.al.	2506.14239	null
2025-06-17	Xolver: Multi-Agent Reasoning with Holistic Experience Learning Just Like an Olympiad Team	Md Tanzib Hosain et.al.	2506.14234	null
2025-06-17	ImpReSS: Implicit Recommender System for Support Conversations	Omri Haller et.al.	2506.14231	null
2025-06-17	From Black Boxes to Transparent Minds: Evaluating and Enhancing the Theory of Mind in Multimodal Large Language Models	Xinyang Li et.al.	2506.14224	null
2025-06-17	Can Large Language Models Improve Spectral Graph Neural Networks?	Kangkang Lu et.al.	2506.14220	null
2025-06-17	CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation	Jia-Chen Zhang et.al.	2506.14206	null
2025-06-17	DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion	Makoto Shing et.al.	2506.14202	null
2025-06-17	MAS-LitEval: Multi-Agent System for Literary Translation Quality Assessment	Junghwan Kim et.al.	2506.14199	null
2025-06-17	Mobile Application Review Summarization using Chain of Density Prompting	Shristi Shrestha et.al.	2506.14192	null
2025-06-17	Meta-SurDiff: Classification Diffusion Model Optimized by Meta Learning is Reliable for Online Surgical Phase Recognition	Yufei Li et.al.	2506.14181	null
2025-06-17	GRAM: A Generative Foundation Reward Model for Reward Generalization	Chenglong Wang et.al.	2506.14175	link
2025-06-17	Structured and Informed Probabilistic Modeling with the Thermodynamic Kolmogorov-Arnold Model	Prithvi Raj et.al.	2506.14167	link
2025-06-17	MIST: Towards Multi-dimensional Implicit Bias and Stereotype Evaluation of LLMs via Theory of Mind	Yanlin Li et.al.	2506.14161	null
2025-06-17	S$^4$C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models	Tao He et.al.	2506.14158	null
2025-06-17	Acoustic scattering AI for non-invasive object classifications: A case study on hair assessment	Long-Vu Hoang et.al.	2506.14148	null
2025-06-17	Collaborative Editable Model	Kaiwen Tang et.al.	2506.14146	null
2025-06-17	SceneAware: Scene-Constrained Pedestrian Trajectory Prediction with LLM-Guided Walkability	Juho Bai et.al.	2506.14144	null
2025-06-17	Less is More: Undertraining Experts Improves Model Upcycling	Stefan Horoi et.al.	2506.14126	null
2025-06-17	Abstract Meaning Representation for Hospital Discharge Summarization	Paul Landes et.al.	2506.14101	link
2025-06-17	Toward a Graph Foundation Model: Pre-Training Transformers With Random Walks	Ziyuan Tang et.al.	2506.14098	null
2025-06-17	Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems	Sanjeda Akter et.al.	2506.14096	null
2025-06-17	Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models	Haonan Yin et.al.	2506.14092	null
2025-06-17	Multi-Scale Finetuning for Encoder-based Time Series Foundation Models	Zhongzheng Qiao et.al.	2506.14087	null
2025-06-17	InsertRank: LLMs can reason over BM25 scores to Improve Listwise Reranking	Rahul Seetharaman et.al.	2506.14086	null
2025-06-17	Lightweight Relevance Grader in RAG	Taehee Jeong et.al.	2506.14084	link
2025-06-17	Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification	Nathaniel Pinckney et.al.	2506.14074	link
2025-06-16	A Regret Perspective on Online Selective Generation	Minjae Lee et.al.	2506.14067	null
2025-06-16	Ace-CEFR – A Dataset for Automated Evaluation of the Linguistic Difficulty of Conversational Texts for LLM Applications	David Kogan et.al.	2506.14046	null
2025-06-16	Discovering Temporal Structure: An Overview of Hierarchical Reinforcement Learning	Martin Klissarov et.al.	2506.14045	null
2025-06-16	MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation	Xueqing Peng et.al.	2506.14028	null
2025-06-16	Lost in the Mix: Evaluating LLM Understanding of Code-Switched Text	Amr Mohamed et.al.	2506.14012	link
2025-06-16	Unlearning Isn’t Invisible: Detecting Unlearning Traces in LLMs from Model Outputs	Yiwei Chen et.al.	2506.14003	link
2025-06-16	Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders	Siyu Chen et.al.	2506.14002	null
2025-06-16	ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users	Shahaf David et.al.	2506.13980	null
2025-06-16	Diagnosing and Improving Diffusion Models by Estimating the Optimal Loss Value	Yixian Xu et.al.	2506.13763	null
2025-06-16	Touch begins where vision ends: Generalizable policies for contact-rich manipulation	Zifan Zhao et.al.	2506.13762	null
2025-06-16	AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning	Zewei Zhou et.al.	2506.13757	link
2025-06-16	UltraZoom: Generating Gigapixel Images from Regular Photos	Jingwei Ma et.al.	2506.13756	null
2025-06-16	Steering LLM Thinking with Budget Guidance	Junyan Li et.al.	2506.13752	link
2025-06-16	Evaluating Large Language Models for Phishing Detection, Self-Consistency, Faithfulness, and Explainability	Shova Kuikel et.al.	2506.13746	link
2025-06-16	Instruction Following by Boosting Attention of Large Language Models	Vitoria Guardieiro et.al.	2506.13734	null
2025-06-16	Attribution-guided Pruning for Compression, Circuit Discovery, and Targeted Correction in LLMs	Sayed Mohammad Vakilzadeh Hatefi et.al.	2506.13727	link
2025-06-16	Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models	Arjun Krishna et.al.	2506.13726	null
2025-06-16	OTFusion: Bridging Vision-only and Vision-Language Models via Optimal Transport for Transductive Zero-Shot Learning	Qiyu Xu et.al.	2506.13723	null
2025-06-16	TimeMaster: Training Time-Series Multimodal LLMs to Reason via Reinforcement Learning	Junru Zhang et.al.	2506.13705	link
2025-06-16	Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry	Junyoung Seo et.al.	2506.13697	null
2025-06-16	Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systems	Shang-Chi Tsai et.al.	2506.13692	null
2025-06-16	UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions	Zhucun Xue et.al.	2506.13691	null
2025-06-16	What Happens During the Loss Plateau? Understanding Abrupt Learning in Transformers	Pulkit Gopalani et.al.	2506.13688	link
2025-06-16	Enforcing tail calibration when training probabilistic forecast models	Jakob Benjamin Wessel et.al.	2506.13687	link
2025-06-16	An LLM’s Apology: Outsourcing Awkwardness in the Age of AI	Twm Stone et.al.	2506.13685	link
2025-06-16	Prefix-Tuning+: Modernizing Prefix-Tuning through Attention Independent Prefix Data	Haonan Wang et.al.	2506.13674	null
2025-06-16	MultiViT2: A Data-augmented Multimodal Neuroimaging Prediction Framework via Latent Diffusion Model	Bi Yuda et.al.	2506.13667	null
2025-06-16	We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems	Junfeng Fang et.al.	2506.13666	link
2025-06-16	DesignCoder: Hierarchy-Aware and Self-Correcting UI Code Generation with Large Language Models	Yunnong Chen et.al.	2506.13663	null
2025-06-16	Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model	Shaolei Zhang et.al.	2506.13642	link
2025-06-16	EvolvTrip: Enhancing Literary Character Understanding with Temporal Theory-of-Mind Graphs	Bohao Yang et.al.	2506.13641	link
2025-06-16	An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability	Yusuke Yamauchi et.al.	2506.13639	null
2025-06-16	Assessing the Limits of In-Context Learning beyond Functions using Partially Ordered Relation	Debanjan Dutta et.al.	2506.13608	null
2025-06-16	CAMS: A CityGPT-Powered Agentic Framework for Urban Human Mobility Simulation	Yuwei Du et.al.	2506.13599	null
2025-06-16	Qwen vs. Gemma Integration with Whisper: A Comparative Study in Multilingual SpeechLLM Systems	Tuan Nguyen et.al.	2506.13596	null
2025-06-16	Calibrated Predictive Lower Bounds on Time-to-Unsafe-Sampling in LLMs	Hen Davidov et.al.	2506.13593	link
2025-06-16	Omni-AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented for Efficient Long Video Understanding	Zhucun Xue et.al.	2506.13589	null
2025-06-17	Model Context Protocol (MCP) at First Glance: Studying the Security and Maintainability of MCP Servers	Mohammed Mehedi Hasan et.al.	2506.13538	link
2025-06-16	Implicit and Explicit Research Quality Score Probabilities from ChatGPT	Mike Thelwall et.al.	2506.13525	null
2025-06-16	TensorSLM: Energy-efficient Embedding Compression of Sub-billion Parameter Language Models on Low-end Devices	Mingxue Xu et.al.	2506.13514	null
2025-06-17	Safe-Child-LLM: A Developmental Benchmark for Evaluating LLM Safety in Child-LLM Interactions	Junfeng Jiao et.al.	2506.13510	link
2025-06-16	BOW: Bottlenecked Next Word Exploration	Ming Shen et.al.	2506.13502	null
2025-06-16	A Survey on Imitation Learning for Contact-Rich Tasks in Robotics	Toshiaki Tsuji et.al.	2506.13498	null
2025-06-16	Watermarking LLM-Generated Datasets in Downstream Tasks	Yugeng Liu et.al.	2506.13494	null
2025-06-16	Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis	Martina Pastorino et.al.	2506.13484	null
2025-06-16	Position: Pause Recycling LoRAs and Prioritize Mechanisms to Uncover Limits and Effectiveness	Mei-Yen Chen et.al.	2506.13479	null
2025-06-16	Language Agents for Hypothesis-driven Clinical Decision Making with Reinforcement Learning	David Bani-Harouni et.al.	2506.13474	null
2025-06-17	ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models	Junho Yoon et.al.	2506.13472	null
2025-06-16	Unveiling the Learning Mind of Language Models: A Cognitive Framework and Empirical Study	Zhengyu Hu et.al.	2506.13464	null
2025-06-16	PRO: Projection Domain Synthesis for CT Imaging	Kang Chen et.al.	2506.13443	null
2025-06-16	From Promise to Peril: Rethinking Cybersecurity Red and Blue Teaming in the Age of LLMs	Alsharif Abuadbba et.al.	2506.13434	null
2025-06-16	Uncertainty-Aware Remaining Lifespan Prediction from Images	Tristan Kenneweg et.al.	2506.13430	null
2025-06-16	RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis	Pengzuo Wu et.al.	2506.13405	null
2025-06-16	Deflating Deflationism: A Critical Perspective on Debunking Arguments Against LLM Mentality	Alex Grzankowski et.al.	2506.13403	null
2025-06-16	Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR	Yizhou Peng et.al.	2506.13396	null
2025-06-16	Delving Into the Psychology of Machines: Exploring the Structure of Self-Regulated Learning via LLM-Generated Survey Responses	Leonie V. D. E. Vogelsmeier et.al.	2506.13384	null
2025-06-16	Decompositional Reasoning for Graph Retrieval with Large Language Models	Valentin Six et.al.	2506.13380	null
2025-06-17	Dual guidance: ROM-informed field reconstruction with generative models	Sajad Salavatidezfouli et.al.	2506.13369	null
2025-06-16	Socratic RL: A Novel Framework for Efficient Knowledge Acquisition through Iterative Reflection and Viewpoint Distillation	Xiangfan Wu et.al.	2506.13358	null
2025-06-16	StoryBench: A Dynamic Benchmark for Evaluating Long-Term Memory with Multi Turns	Luanbo Wan et.al.	2506.13356	null
2025-06-16	Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks	Yifei Xu et.al.	2506.13351	null
2025-06-16	LapDDPM: A Conditional Graph Diffusion Model for scRNA-seq Generation with Spectral Adversarial Perturbations	Lorenzo Bini et.al.	2506.13344	null
2025-06-16	NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025	Yizhou Peng et.al.	2506.13339	null
2025-06-16	Document-Level Tabular Numerical Cross-Checking: A Coarse-to-Fine Approach	Chaoxu Pang et.al.	2506.13328	null
2025-06-16	VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation	Bo Pan et.al.	2506.13326	link
2025-06-16	Towards Pervasive Distributed Agentic Generative AI – A State of The Art	Gianni Molinari et.al.	2506.13324	null
2025-06-16	Large Language Models as ‘Hidden Persuaders’: Fake Product Reviews are Indistinguishable to Humans and Machines	Weiyao Meng et.al.	2506.13313	null
2025-06-16	Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts	Solène Debuysère et.al.	2506.13307	null
2025-06-16	Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research	Salah Ghamizi et.al.	2506.13306	null
2025-06-16	Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs	Houcheng Jiang et.al.	2506.13285	null
2025-06-16	Navigating the Black Box: Leveraging LLMs for Effective Text-Level Graph Injection Attacks	Yuefei Lyu et.al.	2506.13276	null
2025-06-16	AdaLRS: Loss-Guided Adaptive Learning Rate Search for Efficient Foundation Model Pretraining	Hongyuan Dong et.al.	2506.13274	null
2025-06-16	Vector Ontologies as an LLM world view extraction method	Kaspar Rothenfusser et.al.	2506.13252	link
2025-06-16	A Game-Theoretic Negotiation Framework for Cross-Cultural Consensus in LLMs	Guoxi Zhang et.al.	2506.13245	null
2025-06-16	IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation	Zijie Lin et.al.	2506.13229	link
2025-06-16	Breaking Thought Patterns: A Multi-Dimensional Reasoning Framework for LLMs	Xintong Tang et.al.	2506.13192	null
2025-06-16	Multimodal “Puppeteer”: An Exploration of Robot Teleoperation Via Virtual Counterpart with LLM-Driven Voice and Gesture Interaction in Augmented Reality	Yuchong Zhang et.al.	2506.13189	null
2025-06-16	SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists	Lynn Khellaf et.al.	2506.13188	null
2025-06-16	Dynamic Context-oriented Decomposition for Task-aware Low-rank Adaptation with Less Forgetting and Faster Convergence	Yibo Yang et.al.	2506.13187	null
2025-06-16	Empirical Evaluation of Large Language Models in Automated Program Repair	Jiajun Sun et.al.	2506.13186	null
2025-06-16	From Empirical Evaluation to Context-Aware Enhancement: Repairing Regression Errors with LLMs	Anh Ho et.al.	2506.13182	null
2025-06-16	Align-then-Unlearn: Embedding Alignment for LLM Unlearning	Philipp Spohn et.al.	2506.13181	link
2025-06-16	Enhancing Large Language Models with Reliable Knowledge Graphs	Qinggang Zhang et.al.	2506.13178	null
2025-06-17	AI-Facilitated Analysis of Abstracts and Conclusions: Flagging Unsubstantiated Claims and Ambiguous Pronouns	Evgeny Markhasin et.al.	2506.13172	null
2025-06-16	Querying Large Automotive Software Models: Agentic vs. Direct LLM Approaches	Lukasz Mazur et.al.	2506.13171	null
2025-06-16	Using LLMs for Security Advisory Investigations: How Far Are We?	Bayu Fedra Abdullah et.al.	2506.13161	link
2025-06-16	Adapting LLMs for Minimal-edit Grammatical Error Correction	Ryszard Staruch et.al.	2506.13148	null
2025-06-16	ZINA: Multimodal Fine-grained Hallucination Detection and Editing	Yuiga Wada et.al.	2506.13130	null
2025-06-16	SAGDA: Open-Source Synthetic Agriculture Data for Africa	Abdelghani Belgaid et.al.	2506.13123	link
2025-06-16	Designing Deep Learning Frameworks for LLMs: Challenges, Expectations, and Opportunities	Yanzhou Mu et.al.	2506.13114	null
2025-06-16	GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction	Jinguang Tong et.al.	2506.13110	null
2025-06-16	Leveraging In-Context Learning for Language Model Agents	Shivanshu Gupta et.al.	2506.13109	null
2025-06-16	Rethinking Test-Time Scaling for Medical AI: Model and Task-Aware Strategies for LLMs and VLMs	Gyutaek Oh et.al.	2506.13102	null
2025-06-16	Detecting Hard-Coded Credentials in Software Repositories via LLMs	Chidera Biringa et.al.	2506.13090	null
2025-06-16	Discerning What Matters: A Multi-Dimensional Assessment of Moral Competence in LLMs	Daniel Kilov et.al.	2506.13082	null
2025-06-16	SuperPlace: The Renaissance of Classical Feature Aggregation for Visual Place Recognition in the Era of Foundation Models	Bingxi Liu et.al.	2506.13073	null
2025-06-16	CHILL at SemEval-2025 Task 2: You Can’t Just Throw Entities and Hope – Make Your LLM to Get Them Right	Jaebok Lee et.al.	2506.13070	null
2025-06-16	MotiveBench: How Far Are We From Human-Like Motivational Reasoning in Large Language Models?	Xixian Yong et.al.	2506.13065	null
2025-06-16	PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue	George Shaikovski et.al.	2506.13063	null
2025-06-16	Metis-RISE: RL Incentivizes and SFT Enhances Multimodal Reasoning Model Learning	Haibo Qiu et.al.	2506.13056	null
2025-06-16	CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model	Jiangtong Li et.al.	2506.13055	null
2025-06-16	Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning	Can Polat et.al.	2506.13051	link
2025-06-16	A Comprehensive Survey on Continual Learning in Generative Models	Haiyang Guo et.al.	2506.13045	link
2025-06-16	Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models	Muhammad Reza Qorib et.al.	2506.13044	null
2025-06-16	Evolution of ReID: From Early Methods to LLM Integration	Amran Bhuiyan et.al.	2506.13039	null
2025-06-16	Forecast-Then-Optimize Deep Learning Methods	Jinhang Jiang et.al.	2506.13036	null
2025-06-16	Knowledge Graph Fusion with Large Language Models for Accurate, Explainable Manufacturing Process Planning	Danny Hoang et.al.	2506.13026	null
2025-06-16	A Practical Guide for Evaluating LLMs and LLM-Reliant Systems	Ethan M. Rudd et.al.	2506.13023	null
2025-06-13	EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction	Hsi-Che Lin et.al.	2506.12015	null
2025-06-13	code_transformed: The Influence of Large Language Models on Code	Yuliang Xu et.al.	2506.12014	null
2025-06-13	Tracing LLM Reasoning Processes with Strategic Games: A Framework for Planning, Revision, and Resource-Constrained Decision Making	Xiaopeng Yuan et.al.	2506.12012	null
2025-06-13	Generative Representational Learning of Foundation Models for Recommendation	Zheli Zhou et.al.	2506.11999	null
2025-06-13	VGR: Visual Grounded Reasoning	Jiacong Wang et.al.	2506.11991	null
2025-06-13	How Visual Representations Map to Language Feature Space in Multimodal LLMs	Constantin Venhoff et.al.	2506.11976	null
2025-06-13	A Robust Local Fréchet Regression Using Unbalanced Neural Optimal Transport with Applications to Dynamic Single-cell Genomics Data	Binghao Yan et.al.	2506.11969	null
2025-06-13	Improving Large Language Model Safety with Contrastive Representation Learning	Samuel Simko et.al.	2506.11938	link
2025-06-13	LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?	Zihan Zheng et.al.	2506.11928	null
2025-06-13	Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation	Min-Seop Kwak et.al.	2506.11924	null
2025-06-13	Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache	Xiaoran Liu et.al.	2506.11886	null
2025-06-13	Addressing Bias in LLMs: Strategies and Application to Fair AI-based Recruitment	Alejandro Peña et.al.	2506.11880	null
2025-06-13	A Short Survey on Formalising Software Requirements using Large Language Models	Arshad Beg et.al.	2506.11874	null
2025-06-13	Post Persona Alignment for Multi-Session Dialogue Generation	Yi-Pei Chen et.al.	2506.11857	null
2025-06-13	TrustGLM: Evaluating the Robustness of GraphLLMs Against Prompt, Text, and Structure Attacks	Qihai Zhang et.al.	2506.11844	null
2025-06-13	Your Ride, Your Rules: Psychology and Cognition Enabled Automated Driving Systems	Zhipeng Bao et.al.	2506.11842	null
2025-06-13	CLEAN-MI: A Scalable and Efficient Pipeline for Constructing High-Quality Neurodata in Motor Imagery Paradigm	Dingkun Liu et.al.	2506.11830	null
2025-06-13	Revealing Political Bias in LLMs through Structured Multi-Agent Debate	Aishwarya Bandaru et.al.	2506.11825	link
2025-06-13	On the Performance of LLMs for Real Estate Appraisal	Margot Geerts et.al.	2506.11812	null
2025-06-13	Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?	Simeon Junker et.al.	2506.11807	null
2025-06-13	Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models	Maximilian Kreutner et.al.	2506.11798	null
2025-06-13	SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks	Hwiwon Lee et.al.	2506.11791	link
2025-06-13	Conversational AI as a Catalyst for Informal Learning: An Empirical Large-Scale Study on LLM Use in Everyday Learning	Nađa Terzimehić et.al.	2506.11789	null
2025-06-13	AgentSense: Virtual Sensor Data Generation Using LLM Agent in Simulated Home Environments	Zikang Leng et.al.	2506.11773	null
2025-06-13	CLIP Meets Diffusion: A Synergistic Approach to Anomaly Detection	Byeongchan Lee et.al.	2506.11772	null
2025-06-13	Long-Short Alignment for Effective Long-Context Modeling in LLMs	Tianqi Du et.al.	2506.11769	null
2025-06-13	Designing Effective LLM-Assisted Interfaces for Curriculum Development	Abdolali Faraji et.al.	2506.11767	null
2025-06-13	Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis	Zuzanna Skorniewska et.al.	2506.11753	null
2025-06-13	DART: Distilling Autoregressive Reasoning to Silent Thought	Nan Jiang et.al.	2506.11752	null
2025-06-13	Quizzard@INOVA Challenge 2025 – Track A: Plug-and-Play Technique in Interleaved Multi-Image Model	Dinh Viet Cuong et.al.	2506.11737	link
2025-06-13	Classification of Quality Characteristics in Online User Feedback using Linguistic Analysis, Crowdsourcing and LLMs	Eduard C. Groen et.al.	2506.11722	null
2025-06-13	Mitigating Hallucination Through Theory-Consistent Symmetric Multimodal Preference Optimization	Wenqi Liu et.al.	2506.11712	null
2025-06-13	Fusion of multi-source precipitation records via coordinate-based generative model	Sencan Sun et.al.	2506.11698	null
2025-06-13	LLMs for Sentence Simplification: A Hybrid Multi-Agent prompting Approach	Pratibha Zunjare et.al.	2506.11681	null
2025-06-13	LLMs on support of privacy and security of mobile apps: state of the art and research directions	Tran Thanh Lam Nguyen et.al.	2506.11679	null
2025-06-13	Dynamic Mixture of Curriculum LoRA Experts for Continual Multimodal Instruction Tuning	Chendi Ge et.al.	2506.11672	null
2025-06-13	Brain Network Analysis Based on Fine-tuned Self-supervised Model for Brain Disease Diagnosis	Yifei Tang et.al.	2506.11671	null
2025-06-13	Converting Annotated Clinical Cases into Structured Case Report Forms	Pietro Ferrazzi et.al.	2506.11666	null
2025-06-13	An Empirical study on LLM-based Log Retrieval for Software Engineering Metadata Management	Simin Sun et.al.	2506.11659	null
2025-06-13	LoRA-Gen: Specializing Large Language Model via Online LoRA Generation	Yicheng Xiao et.al.	2506.11638	null
2025-06-13	FAA Framework: A Large Language Model-Based Approach for Credit Card Fraud Investigations	Shaun Shuster et.al.	2506.11635	null
2025-06-13	Convergent Linear Representations of Emergent Misalignment	Anna Soligo et.al.	2506.11618	null
2025-06-13	Model Organisms for Emergent Misalignment	Edward Turner et.al.	2506.11613	null
2025-06-13	KEENHash: Hashing Programs into Function-Aware Embeddings for Large-Scale Binary Code Similarity Analysis	Zhijie Liu et.al.	2506.11612	null
2025-06-13	TongSearch-QR: Reinforced Query Reasoning for Retrieval	Xubo Qin et.al.	2506.11603	null
2025-06-13	Are LLMs Good Text Diacritizers? An Arabic and Yorùbá Case Study	Hawau Olamide Toyin et.al.	2506.11602	null
2025-06-13	GraphRAG-Causal: A novel graph-augmented framework for causal reasoning and annotation in news	Abdul Haque et.al.	2506.11600	null
2025-06-13	A$^2$LC: Active and Automated Label Correction for Semantic Segmentation	Youjin Jeon et.al.	2506.11599	null
2025-06-13	Collaborative LLM Inference via Planning for Efficient Reasoning	Byeongchan Lee et.al.	2506.11578	null
2025-06-13	Learn to Preserve Personality: Federated Foundation Models in Recommendations	Zhiwei Li et.al.	2506.11563	null
2025-06-13	Identifying Helpful Context for LLM-based Vulnerability Repair: A Preliminary Study	Gábor Antal et.al.	2506.11561	null
2025-06-13	Leveraging GPT-4 for Vulnerability-Witnessing Unit Test Generation	Gábor Antal et.al.	2506.11559	null
2025-06-13	DaMO: A Data-Efficient Multimodal Orchestrator for Temporal Reasoning with Video LLMs	Bo-Cheng Chiu et.al.	2506.11558	null
2025-06-13	From Persona to Person: Enhancing the Naturalness with Multiple Discourse Relations Graph Learning in Personalized Dialogue Generation	Chih-Hao Hsu et.al.	2506.11557	null
2025-06-13	RAG+: Enhancing Retrieval-Augmented Generation with Application-Aware Reasoning	Yu Wang et.al.	2506.11555	null
2025-06-13	Augmenting the Generality and Performance of Large Language Models for Software Engineering	Fabian C. Peña et.al.	2506.11548	null
2025-06-13	Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing	Thanapat Trachu et.al.	2506.11542	null
2025-06-13	Foundation Models in Autonomous Driving: A Survey on Scenario Generation and Scenario Analysis	Yuan Gao et.al.	2506.11526	link
2025-06-13	Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models	Jinming Wen et.al.	2506.11521	null
2025-06-13	Brewing Knowledge in Context: Distillation Perspectives on In-Context Learning	Chengye Li et.al.	2506.11516	null
2025-06-13	Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs	Xiao Xu et.al.	2506.11515	null
2025-06-13	Prioritizing Alignment Paradigms over Task-Specific Model Customization in Time-Series LLMs	Wei Li et.al.	2506.11512	link
2025-06-13	Lag-Relative Sparse Attention In Long Context Training	Manlai Liang et.al.	2506.11498	null
2025-06-13	Relational Schemata in BERT Are Inducible, Not Emergent: A Study of Performance vs. Competence in Language Models	Cole Gawin et.al.	2506.11485	null
2025-06-13	VulStamp: Vulnerability Assessment using Large Language Model	Haoshen et.al.	2506.11484	null
2025-06-13	LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment	Shikun Li et.al.	2506.11480	null
2025-06-13	ImmunoFOMO: Are Language Models missing what oncologists see?	Aman Sinha et.al.	2506.11478	null
2025-06-13	FAME: A Lightweight Spatio-Temporal Network for Model Attribution of Face-Swap Deepfakes	Wasim Ahmad et.al.	2506.11477	link
2025-06-13	LiLAC: A Lightweight Latent ControlNet for Musical Audio Generation	Tom Baker et.al.	2506.11476	null
2025-06-13	Med-PRM: Medical Reasoning Models with Stepwise, Guideline-verified Process Rewards	Jaehoon Yun et.al.	2506.11474	null
2025-06-13	A Gamified Evaluation and Recruitment Platform for Low Resource Language Machine Translation Systems	Carlos Rafael Catalan et.al.	2506.11467	null
2025-06-13	Leveraging Reference Documents for Zero-Shot Ranking via Large Language Models	Jieran Li et.al.	2506.11452	null
2025-06-13	ReVeal: Self-Evolving Code Agents via Iterative Generation-Verification	Yiyang Jin et.al.	2506.11442	null
2025-06-13	AbsenceBench: Language Models Can’t Tell What’s Missing	Harvey Yiyun Fu et.al.	2506.11440	link
2025-06-13	TAViS: Text-bridged Audio-Visual Segmentation with Foundation Models	Ziyang Luo et.al.	2506.11436	null
2025-06-13	KoGEC: Korean Grammatical Error Correction with Pre-trained Translation Models	Taeeun Kim et.al.	2506.11432	null
2025-06-13	Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards	Jeff Da et.al.	2506.11425	null
2025-06-13	Efficient Long-Context LLM Inference via KV Cache Clustering	Jie Hu et.al.	2506.11418	null
2025-06-13	Stop learning it all to mitigate visual hallucination, Focus on the hallucination target	Dokyoon Yoon et.al.	2506.11417	null
2025-06-13	Bias Amplification in RAG: Poisoning Knowledge Retrieval to Steer LLMs	Linlin Wang et.al.	2506.11415	null
2025-06-13	Predicting Early-Onset Colorectal Cancer with Large Language Models	Wilson Lau et.al.	2506.11410	null
2025-06-13	LoRA Users Beware: A Few Spurious Tokens Can Manipulate Your Finetuned Model	Pradyut Sekhsaria et.al.	2506.11402	link
2025-06-13	Curriculum-Guided Layer Scaling for Language Model Pretraining	Karanpartap Singh et.al.	2506.11389	null
2025-06-13	Large Language Model-Powered Conversational Agent Delivering Problem-Solving Therapy (PST) for Family Caregivers: Enhancing Empathy and Therapeutic Alliance Using In-Context Learning	Liying Wang et.al.	2506.11376	null
2025-06-13	Benchmarking Multimodal LLMs on Recognition and Understanding over Chemical Tables	Yitong Zhou et.al.	2506.11375	null
2025-06-13	A Watermark for Auto-Regressive Image Generation Models	Yihan Wu et.al.	2506.11371	null
2025-06-12	The Biased Samaritan: LLM biases in Perceived Kindness	Jack H Fagan et.al.	2506.11361	null
2025-06-12	From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review	Yaohui Zhang et.al.	2506.11343	null
2025-06-12	Uncovering Reliable Indicators: Improving IoC Extraction from Threat Reports	Evangelos Froudakis et.al.	2506.11325	null
2025-06-12	HyBiomass: Global Hyperspectral Imagery Benchmark Dataset for Evaluating Geospatial Foundation Models in Forest Aboveground Biomass Estimation	Aaron Banze et.al.	2506.11314	null
2025-06-12	SwiftSpec: Ultra-Low Latency LLM Decoding by Scaling Asynchronous Speculative Decoding	Ziyi Zhang et.al.	2506.11309	null
2025-06-12	Don’t Pay Attention	Mohammad Hammoud et.al.	2506.11305	null
2025-06-12	Domain-Constrained Diffusion Models to Synthesize Tabular Data: A Case Study in Power Systems	Milad Hoseinpour et.al.	2506.11281	null
2025-06-12	Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation	Benjamin Elder et.al.	2506.11266	null
2025-06-12	Gondola: Grounded Vision Language Planning for Generalizable Robotic Manipulation	Shizhe Chen et.al.	2506.11261	null
2025-06-12	Lifting Data-Tracing Machine Unlearning to Knowledge-Tracing for Foundation Models	Yuwen Tan et.al.	2506.11253	null
2025-06-12	Can Time-Series Foundation Models Perform Building Energy Management Tasks?	Ozan Baris Mulayim et.al.	2506.11250	null
2025-06-12	No Universal Prompt: Unifying Reasoning through Adaptive Prompting for Temporal Table Reasoning	Kushagra Dixit et.al.	2506.11246	null
2025-06-12	LLM-as-a-Fuzzy-Judge: Fine-Tuning Large Language Models as a Clinical Evaluation Judge with Fuzzy Logic	Weibing Zheng et.al.	2506.11221	link
2025-06-12	SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis	Weiliang Chen et.al.	2506.10981	null
2025-06-12	GenWorld: Towards Detecting AI-generated Real-world Simulation Videos	Weiliang Chen et.al.	2506.10975	null
2025-06-12	AutoMind: Adaptive Knowledgeable Agent for Automated Data Science	Yixin Ou et.al.	2506.10974	link
2025-06-12	Farseer: A Refined Scaling Law in Large Language Models	Houyi Li et.al.	2506.10972	link
2025-06-12	Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs	Qizhe Zhang et.al.	2506.10967	link
2025-06-12	GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation	Ning Gao et.al.	2506.10966	null
2025-06-13	MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Yuxuan Luo et.al.	2506.10963	null
2025-06-12	ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark	Kangwei Liu et.al.	2506.10960	link
2025-06-12	Distillation of atomistic foundation models across architectures and chemical domains	John L. A. Gardner et.al.	2506.10956	link
2025-06-12	SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks	Lianghong Guo et.al.	2506.10954	link
2025-06-12	Build the web for agents, not agents for the web	Xing Han Lù et.al.	2506.10953	null
2025-06-12	Execution Guided Line-by-Line Code Generation	Boaz Lavon et.al.	2506.10948	link
2025-06-12	GUARD: Guided Unlearning and Retention via Data Attribution for Large Language Models	Evelyn Ma et.al.	2506.10946	null
2025-06-12	Self-Adapting Language Models	Adam Zweiger et.al.	2506.10943	null
2025-06-12	Dynamic Epistemic Friction in Dialogue	Timothy Obiso et.al.	2506.10934	null
2025-06-12	The Role of Generative AI in Facilitating Social Interactions: A Scoping Review	T. T. J. E. Arets et.al.	2506.10927	null
2025-06-12	Robustly Improving LLM Fairness in Realistic Settings via Interpretability	Adam Karvonen et.al.	2506.10922	link
2025-06-12	Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization	Or Shafran et.al.	2506.10920	link
2025-06-12	Foundation Models for Causal Inference via Prior-Data Fitted Networks	Yuchen Ma et.al.	2506.10914	null
2025-06-12	Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular Detoxification?	Fei Lin et.al.	2506.10912	null
2025-06-12	NoLoCo: No-all-reduce Low Communication Training Method for Large Models	Jari Kolehmainen et.al.	2506.10911	link
2025-06-12	Probably Approximately Correct Labels	Emmanuel J. Candès et.al.	2506.10908	link
2025-06-12	Beyond Gold Standards: Epistemic Ensemble of LLM Judges for Formal Mathematical Reasoning	Lan Zhang et.al.	2506.10903	null
2025-06-12	GenPlanX. Generation of Plans and Execution	Daniel Borrajo et.al.	2506.10897	null
2025-06-12	AIR: Zero-shot Generative Model Adaptation with Iterative Refinement	Guimeng Liu et.al.	2506.10895	link
2025-06-12	Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers	Yixiao Huang et.al.	2506.10887	null
2025-06-12	Slimming Down LLMs Without Losing Their Minds	Qingda et.al.	2506.10885	null
2025-06-12	Precise Zero-Shot Pointwise Ranking with LLMs through Post-Aggregated Global Context Information	Kehan Long et.al.	2506.10859	link
2025-06-12	A Study on Individual Spatiotemporal Activity Generation Method Using MCP-Enhanced Chain-of-Thought Large Language Models	Yu Zhang et.al.	2506.10853	link
2025-06-13	Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Principles	Qingyan Wei et.al.	2506.10848	link
2025-06-12	Evaluating Large Language Models on Non-Code Software Engineering Tasks	Fabian C. Peña et.al.	2506.10833	link
2025-06-12	LLM-Driven Personalized Answer Generation and Evaluation	Mohammadreza Molavi et.al.	2506.10829	null
2025-06-12	Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches	Andrea Moglia et.al.	2506.10825	null
2025-06-12	ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization	Zhensheng Jin et.al.	2506.10822	link
2025-06-12	VideoDeepResearch: Long Video Understanding With Agentic Tool Using	Huaying Yuan et.al.	2506.10821	link
2025-06-12	Prompts to Summaries: Zero-Shot Language-Guided Video Summarization	Mario Barbara et.al.	2506.10807	null
2025-06-13	Detecting High-Stakes Interactions with Activation Probes	Alex McKenzie et.al.	2506.10805	null
2025-06-12	Mitigating Negative Interference in Multilingual Sequential Knowledge Editing through Null-Space Constraints	Wei Sun et.al.	2506.10800	null
2025-06-12	FASCIST-O-METER: Classifier for Neo-fascist Discourse Online	Rudy Alexandro Garrido Veliz et.al.	2506.10789	null
2025-06-12	What Users Value and Critique: Large-Scale Analysis of User Feedback on AI-Powered Mobile Apps	Vinaik Chhetri et.al.	2506.10785	null
2025-06-12	Improving Named Entity Transcription with Contextual LLM-based Revision	Viet Anh Trinh et.al.	2506.10779	null
2025-06-12	Different Questions, Different Models: Fine-Grained Evaluation of Uncertainty and Calibration in Clinical QA with LLMs	Alberto Testoni et.al.	2506.10769	null
2025-06-12	One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers	Diana Abagyan et.al.	2506.10766	null
2025-06-12	OPT-BENCH: Evaluating LLM Agent on Large-Scale Search Spaces Optimization Problems	Xiaozhe Li et.al.	2506.10764	link
2025-06-12	Integrating Large Language Models into Text Animation: An Intelligent Editing System with Inline and Chat Interaction	Bao Zhang et.al.	2506.10762	null
2025-06-12	Grounded Vision-Language Navigation for UAVs with Open-Vocabulary Goal Understanding	Yuhang Zhang et.al.	2506.10756	null
2025-06-12	TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora	Priyanka Kargupta et.al.	2506.10737	link
2025-06-12	Inferring Adjective Hypernyms with Language Models to Increase the Connectivity of Open English Wordnet	Lorenzo Augello et.al.	2506.10715	null
2025-06-12	ConTextTab: A Semantics-Aware Tabular In-Context Learner	Marco Spinaci et.al.	2506.10707	link
2025-06-12	Formalising Software Requirements using Large Language Models	Arshad Beg et.al.	2506.10704	null
2025-06-12	Large Language Models for Detection of Life-Threatening Texts	Thanh Thi Nguyen et.al.	2506.10687	null
2025-06-12	Unsourced Adversarial CAPTCHA: A Bi-Phase Adversarial CAPTCHA Framework	Xia Du et.al.	2506.10685	null
2025-06-12	Automated Validation of Textual Constraints Against AutomationML via LLMs and SHACL	Tom Westermann et.al.	2506.10678	link
2025-06-12	TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving	Vincenzo Colle et.al.	2506.10674	null
2025-06-12	Large Language Models-Empowered Wireless Networks: Fundamentals, Architecture, and Challenges	Latif U. Khan et.al.	2506.10651	null
2025-06-12	Data Shifts Hurt CoT: A Theoretical Study	Lang Yin et.al.	2506.10647	null
2025-06-12	Spelling-out is not Straightforward: LLMs’ Capability of Tokenization from Token to Characters	Tatsuya Hiraoka et.al.	2506.10641	null
2025-06-12	Conversational Search: From Fundamentals to Frontiers in the LLM Era	Fengran Mo et.al.	2506.10635	null
2025-06-12	Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models	Francisco Caetano et.al.	2506.10634	null
2025-06-12	Hessian Geometry of Latent Space in Generative Models	Alexander Lobashev et.al.	2506.10632	link
2025-06-12	NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors	Numaan Naeem et.al.	2506.10627	link
2025-06-12	SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis	Sergio Burdisso et.al.	2506.10622	link
2025-06-12	SoK: Evaluating Jailbreak Guardrails for Large Language Models	Xunguang Wang et.al.	2506.10597	link
2025-06-12	IDEA: Augmenting Design Intelligence through Design Space Exploration	Chuer Chen et.al.	2506.10587	null
2025-06-12	Primender Sequence: A Novel Mathematical Construct for Testing Symbolic Inference and AI Reasoning	Mohd Anwar Jamal Faiz et.al.	2506.10585	null
2025-06-12	Equations of state and stability condition of mixed p-spin glass model	Ali Talebi et.al.	2506.10579	null
2025-06-12	Harmonizing Geometry and Uncertainty: Diffusion with Hyperspheres	Muskan Dosi et.al.	2506.10576	null
2025-06-12	Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning	Chun-Mei Feng et.al.	2506.10575	null
2025-06-12	DanceChat: Large Language Model-Guided Music-to-Dance Generation	Qing Wang et.al.	2506.10574	null
2025-06-12	Unitary Scrambling and Collapse: A Quantum Diffusion Framework for Generative Modeling	Yihua Li et.al.	2506.10571	link
2025-06-12	From Images to Insights: Explainable Biodiversity Monitoring with Plain Language Habitat Explanations	Yutong Zhou et.al.	2506.10559	null
2025-06-12	StepProof: Step-by-step verification of natural language mathematical proofs	Xiaolin Hu et.al.	2506.10558	link
2025-06-12	Nowcasting the euro area with social media data	Konstantin Boss et.al.	2506.10546	null
2025-06-12	AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation	Haoyuan Shi et.al.	2506.10540	null
2025-06-12	Equivariant Neural Diffusion for Molecule Generation	François Cornet et.al.	2506.10532	link
2025-06-12	LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs	Yanan Cai et.al.	2506.10527	null
2025-06-12	AdaptiveLLM: A Framework for Selecting Optimal Cost-Efficient LLM for Code-Generation Based on CoT Length	Junhang Cheng et.al.	2506.10525	link
2025-06-13	Scientists’ First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning	Yuhao Zhou et.al.	2506.10521	null
2025-06-12	CogStream: Context-guided Streaming Video Question Answering	Zicheng Zhao et.al.	2506.10516	null
2025-06-12	Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs	Yilin Xiao et.al.	2506.10508	null
2025-06-12	Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models	Sangmin Song et.al.	2506.10504	null
2025-06-12	BugGen: A Self-Correcting Multi-Agent LLM Pipeline for Realistic RTL Bug Synthesis	Surya Jasper et.al.	2506.10501	null
2025-06-12	Surface Fairness, Deep Bias: A Comparative Study of Bias in Language Models	Aleksandra Sorokovikova et.al.	2506.10491	null
2025-06-12	EXPEREPAIR: Dual-Memory Enhanced LLM-based Repository-Level Program Repair	Fangwen Mu et.al.	2506.10484	link
2025-06-12	LLMs Are Not Yet Ready for Deepfake Image Detection	Shahroz Tariq et.al.	2506.10474	null
2025-06-12	MedSeg-R: Reasoning Segmentation in Medical Images with Multimodal Large Language Models	Yu Huang et.al.	2506.10465	null
2025-06-12	Are We Generalizing from the Exception? An In-the-Wild Study on Group-Sensitive Conversation Design in Human-Agent Interactions	Ana Müller et.al.	2506.10462	null
2025-06-12	Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty	Zehui Ling et.al.	2506.10446	null
2025-06-12	MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices	Zhaode Wang et.al.	2506.10443	null
2025-06-12	Towards Understanding Bugs in Distributed Training and Inference Frameworks for Large Language Models	Xiao Yu et.al.	2506.10426	null
2025-06-12	SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks	Kaiyuan Zhang et.al.	2506.10424	link
2025-06-12	PAL: Probing Audio Encoders via LLMs – A Study of Information Transfer from Audio Encoders to LLMs	Tony Alex et.al.	2506.10423	null
2025-06-12	Beyond the Battlefield: Framing Analysis of Media Coverage in Conflict Reporting	Avneet Kaur et.al.	2506.10421	null
2025-06-12	Can Sound Replace Vision in LLaVA With Token Substitution?	Ali Vosoughi et.al.	2506.10416	null
2025-06-12	Burn After Reading: Do Multimodal Large Language Models Truly Capture Order of Events in Image Sequences?	Yingjin Song et.al.	2506.10415	null
2025-06-12	Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges	Jintao Liang et.al.	2506.10408	null
2025-06-12	PAG: Multi-Turn Reinforced LLM Self-Correction with Policy as Generative Verifier	Yuhua Jiang et.al.	2506.10406	null
2025-06-12	Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation	Tzu-Heng Huang et.al.	2506.10403	null
2025-06-12	Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation	Zhiyang Xu et.al.	2506.10395	null
2025-06-12	Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills	Yuquan Xie et.al.	2506.10387	null
2025-06-11	Flipping Against All Odds: Reducing LLM Coin Flip Bias via Verbalized Rejection Sampling	Tim Z. Xiao et.al.	2506.09998	null
2025-06-11	From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring	Yang Li et.al.	2506.09996	null
2025-06-11	Large Language Models for Toxic Language Detection in Low-Resource Balkan Languages	Amel Muminovic et.al.	2506.09992	link
2025-06-11	Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation	Xinyu Yang et.al.	2506.09991	null
2025-06-11	V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning	Mido Assran et.al.	2506.09985	link
2025-06-11	Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs	Hiroshi Matsuda et.al.	2506.09983	link
2025-06-11	AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation	Zijie Wu et.al.	2506.09982	null
2025-06-11	When Detection Fails: The Power of Fine-Tuned Models to Generate Human-Like Social Media Text	Hillary Dawkins et.al.	2506.09975	null
2025-06-11	SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance	Wentao Ge et.al.	2506.09968	null
2025-06-11	Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing	Junfei Wu et.al.	2506.09965	link
2025-06-11	Kvasir-VQA-x1: A Multimodal Dataset for Medical Reasoning and Robust MedVQA in Gastrointestinal Endoscopy	Sushant Gautam et.al.	2506.09958	null
2025-06-11	LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge	Sahar Abdelnabi et.al.	2506.09956	link
2025-06-11	VerIF: Verification Engineering for Reinforcement Learning in Instruction Following	Hao Peng et.al.	2506.09942	link
2025-06-11	PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants	Zheng Zhao et.al.	2506.09902	link
2025-06-11	The Emergence of Abstract Thought in Large Language Models Beyond Any Language	Yuxin Chen et.al.	2506.09890	null
2025-06-11	Attention Head Embeddings with Trainable Deep Kernels for Hallucination Detection in LLMs	Rodion Oblovatny et.al.	2506.09886	null
2025-06-11	3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation	Seonho Lee et.al.	2506.09883	link
2025-06-11	Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation	Siyu Chen et.al.	2506.09881	link
2025-06-11	Foundation Model-Aided Deep Reinforcement Learning for RIS-Assisted Wireless Communication	Mohammad Ghassemi et.al.	2506.09855	null
2025-06-11	Causal Sufficiency and Necessity Improves Chain-of-Thought Reasoning	Xiangning Yu et.al.	2506.09853	null
2025-06-11	Dataset of News Articles with Provenance Metadata for Media Relevance Assessment	Tomas Peterka et.al.	2506.09847	null
2025-06-11	A Deep Generative Model for the Simulation of Discrete Karst Networks	Dany Lauzon et.al.	2506.09832	null
2025-06-11	EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection	Christoph Schuhmann et.al.	2506.09827	null
2025-06-11	Superstudent intelligence in thermodynamics	Rebecca Loubet et.al.	2506.09822	null
2025-06-11	DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision	Xiandong Zou et.al.	2506.09814	null
2025-06-11	Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?	Andreas Säuberli et.al.	2506.09796	null
2025-06-11	Accurate and efficient zero-shot 6D pose estimation with frozen foundation models	Andrea Caraffa et.al.	2506.09784	null
2025-06-11	Intelligent Design 4.0: Paradigm Evolution Toward the Agentic AI Era	Shuo Jiang et.al.	2506.09755	null
2025-06-11	Large Language Models for Design Structure Matrix Optimization	Shuo Jiang et.al.	2506.09749	null
2025-06-11	Hierarchical Image Matching for UAV Absolute Visual Localization via Semantic and Structural Constraints	Xiangkai Zhang et.al.	2506.09748	null
2025-06-11	Feature Engineering for Agents: An Adaptive Cognitive Architecture for Interpretable ML Monitoring	Gusseppe Bravo-Rocca et.al.	2506.09742	null
2025-06-11	Towards Multi-modal Graph Large Language Model	Xin Wang et.al.	2506.09738	null
2025-06-11	Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning	Yuting Li et.al.	2506.09736	link
2025-06-11	Generative Models for Parameter Space Reduction applied to Reduced Order Modelling	Guglielmo Padula et.al.	2506.09721	null
2025-06-11	A First Look at Bugs in LLM Inference Engines	Mugeng Liu et.al.	2506.09713	link
2025-06-11	TRIDENT: Temporally Restricted Inference via DFA-Enhanced Neural Traversal	Vincenzo Collura et.al.	2506.09701	null
2025-06-11	Inv-Entropy: A Fully Probabilistic Framework for Uncertainty Quantification in Language Models	Haoyi Song et.al.	2506.09684	link
2025-06-11	Assessing the Quality of Denoising Diffusion Models in Wasserstein Distance: Noisy Score and Optimal Bounds	Vahan Arsenyan et.al.	2506.09681	null
2025-06-11	Is Fine-Tuning an Effective Solution? Reassessing Knowledge Editing for Unstructured Data	Hao Xiong et.al.	2506.09672	null
2025-06-11	DHoTT: A Temporal Extension of Homotopy Type Theory for Semantic Drift	Iman Poernomo et.al.	2506.09671	null
2025-06-11	Query-Level Uncertainty in Large Language Models	Lihu Chen et.al.	2506.09669	null
2025-06-11	Intent Factored Generation: Unleashing the Diversity in Your Language Model	Eltayeb Ahmed et.al.	2506.09659	null
2025-06-11	Bridging the Gap Between Open-Source and Proprietary LLMs in Table QA	Nikolas Evkarpidi et.al.	2506.09657	link
2025-06-11	Application-Driven Value Alignment in Agentic AI Systems: Survey and Perspectives	Wei Zeng et.al.	2506.09656	null
2025-06-11	DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy	Kaixuan Xu et.al.	2506.09655	null
2025-06-11	Real-Time Network Traffic Forecasting with Missing Data: A Generative Model Approach	Lei Deng et.al.	2506.09647	null
2025-06-11	Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering	Tianjun Yao et.al.	2506.09645	link
2025-06-11	DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning	Dongxu Liu et.al.	2506.09644	null
2025-06-11	Using Sign Language Production as Data Augmentation to enhance Sign Language Translation	Harry Walsh et.al.	2506.09643	null
2025-06-11	FedVLMBench: Benchmarking Federated Fine-Tuning of Vision-Language Models	Weiying Zheng et.al.	2506.09638	null
2025-06-11	HSENet: Hybrid Spatial Encoding Network for 3D Medical Vision-Language Understanding	Yanzhao Shi et.al.	2506.09634	null
2025-06-11	Ties of Trust: a bowtie model to uncover trustor-trustee relationships in LLMs	Eva Paraschou et.al.	2506.09632	null
2025-06-11	In-Context Bias Propagation in LLM-Based Tabular Data Generation	Pol G. Recasens et.al.	2506.09630	null
2025-06-11	Benchmarking Debiasing Methods for LLM-based Parameter Estimates	Nicolas Audinet de Pieuchon et.al.	2506.09627	null
2025-06-11	Analytic Task Scheduler: Recursive Least Squares Based Method for Continual Learning in Embodied Foundation Models	Lipei Xie et.al.	2506.09623	null
2025-06-11	Latent geometry emerging from network-driven processes	Andrea Filippo Beretta et.al.	2506.09616	null
2025-06-12	Consistent Story Generation with Asymmetry Zigzag Sampling	Mingxiao Li et.al.	2506.09612	link
2025-06-11	ASTAGEN: Empirical Evaluation of Automated SATD Taxonomy Generation with LLMs	Sota Nakashima et.al.	2506.09601	null
2025-06-11	Generic models of licci ideals parametrized by Schur functors	Lorenzo Guerrieri et.al.	2506.09598	null
2025-06-11	Beyond Overconfidence: Foundation Models Redefine Calibration in Deep Neural Networks	Achim Hekler et.al.	2506.09593	null
2025-06-11	Integrating Quantized LLMs into Robotics Systems as Edge AI to Leverage their Natural Language Processing Capabilities	Miguel Á. González-Santamarta et.al.	2506.09581	null
2025-06-11	From Symbolic to Neural and Back: Exploring Knowledge Graph-Large Language Model Synergies	Blaž Škrlj et.al.	2506.09566	null
2025-06-11	Towards Open Foundation Language Model and Corpus for Macedonian: A Low-Resource Language	Stefan Krsteski et.al.	2506.09560	null
2025-06-12	Understanding the Performance and Power of LLM Inferencing on Edge Accelerators	Mayank Arya et.al.	2506.09554	null
2025-06-11	Automated Synthesis of Formally Verified Multi-Abstraction Function Summaries	Fanpeng Yang et.al.	2506.09550	null
2025-06-11	LLM-Powered CPI Prediction Inference with Online Text Time Series	Yingying Fan et.al.	2506.09516	link
2025-06-11	ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning	Yu Sun et.al.	2506.09513	link
2025-06-11	Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning	Jiayi Yuan et.al.	2506.09501	null
2025-06-11	Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression	Dingcheng Zhen et.al.	2506.09482	link
2025-06-11	Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning	Cheng Chen et.al.	2506.09473	null
2025-06-11	ArcNeural: A Multi-Modal Database for the Gen-AI Era	Wu Min et.al.	2506.09467	null
2025-06-11	Towards Bridging the Reward-Generation Gap in Direct Alignment Algorithms	Zeguan Xiao et.al.	2506.09457	null
2025-06-11	From Partial to Monadic: Combinatory Algebra with Effects	Liron Cohen et.al.	2506.09453	link
2025-06-11	Learning Obfuscations Of LLM Embedding Sequences: Stained Glass Transform	Jay Roberts et.al.	2506.09452	null
2025-06-11	UniToMBench: Integrating Perspective-Taking to Improve Theory of Mind in LLMs	Prameshwar Thiyagarajan et.al.	2506.09450	null
2025-06-11	OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary	Yui Sudo et.al.	2506.09448	null
2025-06-11	LLMs Cannot Reliably Judge (Yet?): A Comprehensive Assessment on the Robustness of LLM-as-a-Judge	Songze Li et.al.	2506.09443	link
2025-06-11	GigaChat Family: Efficient Russian Language Modeling Through Mixture of Experts Architecture	GigaChat team et.al.	2506.09440	null
2025-06-11	Mitigating Spurious Correlations in LLMs via Causality-Aware Post-Training	Shurui Gui et.al.	2506.09433	null
2025-06-11	Improved Supervised Fine-Tuning for Large Language Models to Mitigate Catastrophic Forgetting	Fei Ding et.al.	2506.09428	null
2025-06-11	Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings	Md Messal Monem Miah et.al.	2506.09424	null
2025-06-11	Time-Unified Diffusion Policy with Action Discrimination for Robotic Manipulation	Ye Niu et.al.	2506.09422	null
2025-06-11	A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy	Henry Peng Zou et.al.	2506.09420	link
2025-06-11	PGDA-KGQA: A Prompt-Guided Generative Framework with Multiple Data Augmentation Strategies for Knowledge Graph Question Answering	Xiujun Zhou et.al.	2506.09414	null
2025-06-11	Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models	Jui-Ming Yao et.al.	2506.09408	null
2025-06-11	A theoretical basis for model collapse in recursive training	Vivek Shripad Borkar et.al.	2506.09401	null
2025-06-11	SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving	Xiangchen Li et.al.	2506.09397	null
2025-06-11	Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models	Zongjie Li et.al.	2506.09396	null
2025-06-11	Comparing human and LLM politeness strategies in free production	Haoran Zhao et.al.	2506.09391	null
2025-06-11	Beyond Nash Equilibrium: Bounded Rationality of LLMs and humans in Strategic Decision-making	Kehan Zheng et.al.	2506.09390	null
2025-06-11	Revisiting Diffusion Models: From Generative Pre-training to One-Step Generation	Bowen Zheng et.al.	2506.09376	null
2025-06-11	Anomaly Detection and Generation with Diffusion Models: A Survey	Yang Liu et.al.	2506.09368	null
2025-06-11	Taming SQL Complexity: LLM-Based Equivalence Evaluation for Text-to-SQL	Qingyun Zeng et.al.	2506.09359	null
2025-06-11	“Is This Really a Human Peer Supporter?”: Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions	Kellie Yu Hui Sim et.al.	2506.09354	null
2025-06-11	DIVE into MoE: Diversity-Enhanced Reconstruction of Large Language Models from Dense into Mixture-of-Experts	Yuchen Feng et.al.	2506.09351	null
2025-06-11	Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation	Shanchuan Lin et.al.	2506.09350	null
2025-06-11	OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment	Chao-Hong Tan et.al.	2506.09349	link
2025-06-11	RePO: Replay-Enhanced Policy Optimization	Siheng Li et.al.	2506.09340	link
2025-06-11	Know What You Don’t Know: Uncertainty Calibration of Process Reward Models	Young-Jin Park et.al.	2506.09338	null
2025-06-11	Natural Language Guided Ligand-Binding Protein Design	Zhenqiao Song et.al.	2506.09332	null
2025-06-10	VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning	Li Kang et.al.	2506.09049	null
2025-06-10	Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation	Xiaowen Ma et.al.	2506.09046	null
2025-06-10	Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Xuanchi Ren et.al.	2506.09042	link
2025-06-10	AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions	Polina Kirichenko et.al.	2506.09038	link
2025-06-10	FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed	Sizhe Dang et.al.	2506.09034	null
2025-06-10	Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning	Haozhen Zhang et.al.	2506.09033	link
2025-06-10	Diffuse and Disperse: Image Generation with Representation Regularization	Runqian Wang et.al.	2506.09027	null
2025-06-11	Do Multiple Instance Learning Models Transfer?	Daniel Shao et.al.	2506.09022	link
2025-06-10	Edit Flows: Flow Matching with Edit Operations	Marton Havasi et.al.	2506.09018	null
2025-06-10	SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning	Ruiqi Zhang et.al.	2506.09016	link
2025-06-10	Learning to Reason Across Parallel Samples for LLM Reasoning	Jianing Qi et.al.	2506.09014	null
2025-06-10	Branched Schrödinger Bridge Matching	Sophia Tang et.al.	2506.09007	null
2025-06-11	Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models	Bei Chu et.al.	2506.09002	null
2025-06-10	Do Concept Replacement Techniques Really Erase Unacceptable Concepts?	Anudeep Das et.al.	2506.08991	null
2025-06-10	SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning	Xiao Liang et.al.	2506.08989	link
2025-06-11	On Finetuning Tabular Foundation Models	Ivan Rubachev et.al.	2506.08982	link
2025-06-11	Towards Better Code Generation: Adaptive Decoding with Uncertainty Guidance	Kaifeng He et.al.	2506.08980	null
2025-06-10	Propositional Logic for Probing Generalization in Neural Networks	Anna Langedijk et.al.	2506.08978	null
2025-06-10	Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System	Yuan Guo et.al.	2506.08972	null
2025-06-10	ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations	Amirreza Rouhi et.al.	2506.08968	null
2025-06-10	ORIDa: Object-centric Real-world Image Composition Dataset	Jinwoo Kim et.al.	2506.08964	null
2025-06-10	Evaluating Generative Vehicle Trajectory Models for Traffic Intersection Dynamics	Yash Ranjan et.al.	2506.08963	null
2025-06-10	WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis	Liangliang Chen et.al.	2506.08962	null
2025-06-10	IntTrajSim: Trajectory Prediction for Simulating Multi-Vehicle driving at Signalized Intersections	Yash Ranjan et.al.	2506.08957	null
2025-06-10	Segment Concealed Objects with Incomplete Supervision	Chunming He et.al.	2506.08955	null
2025-06-11	Can LLMs Ground when they (Don’t) Know: A Study on Direct and Loaded Political Questions	Clara Lachenmaier et.al.	2506.08952	null
2025-06-10	SSS: Semi-Supervised SAM-2 with Efficient Prompting for Medical Imaging Segmentation	Hongjie Zhu et.al.	2506.08949	link
2025-06-10	FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation	Qinggang Zhang et.al.	2506.08938	link
2025-06-10	BioLangFusion: Multimodal Fusion of DNA, mRNA, and Protein Language Models	Amina Mollaysa et.al.	2506.08936	null
2025-06-10	Can A Gamer Train A Mathematical Reasoning Model?	Andrew Shin et.al.	2506.08935	link
2025-06-10	What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities	Wendong Bu et.al.	2506.08933	null
2025-06-10	PropMEND: Hypernetworks for Knowledge Propagation in LLMs	Zeyu Leo Liu et.al.	2506.08920	link
2025-06-10	Quantifying Mix Network Privacy Erosion with Generative Models	Vasilios Mavroudis et.al.	2506.08918	null
2025-06-10	Dialect Normalization using Large Language Models and Morphological Rules	Antonios Dimakis et.al.	2506.08907	link
2025-06-10	Intention-Conditioned Flow Occupancy Models	Chongyi Zheng et.al.	2506.08902	link
2025-06-11	MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis	José Morano et.al.	2506.08900	link
2025-06-10	From Legal Texts to Defeasible Deontic Logic via LLMs: A Study in Automated Semantic Analysis	Elias Horner et.al.	2506.08899	null
2025-06-10	Product of Experts for Visual Generation	Yunzhi Zhang et.al.	2506.08894	null
2025-06-10	InfoDPCCA: Information-Theoretic Dynamic Probabilistic Canonical Correlation Analysis	Shiqin Tang et.al.	2506.08884	link
2025-06-11	Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis	Jingguo Qu et.al.	2506.08849	link
2025-06-11	Design Patterns for Securing LLM Agents against Prompt Injections	Luca Beurer-Kellner et.al.	2506.08837	null
2025-06-10	The impact of fine tuning in LLaMA on hallucinations for named entity extraction in legal documentation	Francisco Vargas et.al.	2506.08827	null
2025-06-10	FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency	Yifei Su et.al.	2506.08822	null
2025-06-10	Measuring Data Science Automation: A Survey of Evaluation Tools for AI Assistants and Agents	Irene Testini et.al.	2506.08800	null
2025-06-10	Landsat-Bench: Datasets and Benchmarks for Landsat Foundation Models	Isaac Corley et.al.	2506.08780	link
2025-06-10	RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation	Jiayi Song et.al.	2506.08772	link
2025-06-10	Paths to Causality: Finding Informative Subgraphs Within Knowledge Graphs for Knowledge-Based Causal Discovery	Yuni Susanti et.al.	2506.08771	link
2025-06-11	AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP	Ahmed Hasanaath et.al.	2506.08768	null
2025-06-10	EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements	Issa Sugiura et.al.	2506.08762	link
2025-06-10	Enhancing Accuracy and Maintainability in Nuclear Plant Data Retrieval: A Function-Calling LLM Approach Over NL-to-SQL	Mishca de Costa et.al.	2506.08757	null
2025-06-10	Bayesian Inverse Physics for Neuro-Symbolic Robot Learning	Octavio Arriaga et.al.	2506.08756	null
2025-06-10	Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data	Muhammad Anwar et.al.	2506.08750	null
2025-06-10	Towards Secure and Private Language Models for Nuclear Power Plants	Muhammad Anwar et.al.	2506.08746	null
2025-06-10	Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning	Kongcheng Zhang et.al.	2506.08745	link
2025-06-10	Breaking the ICE: Exploring promises and challenges of benchmarks for Inference Carbon & Energy estimation for LLMs	Samarth Sikand et.al.	2506.08727	null
2025-06-10	Improved LLM Agents for Financial Document Question Answering	Nelvin Tan et.al.	2506.08726	null
2025-06-10	Explainable Compliance Detection with Multi-Hop Natural Language Inference on Assurance Case Structure	Fariz Ikhwantri et.al.	2506.08713	null
2025-06-10	ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Large Language Model Preference Optimization	Hee Suk Yoon et.al.	2506.08712	null
2025-06-10	Educators’ Perceptions of Large Language Models as Tutors: Comparing Human and AI Tutors in a Blind Text-only Setting	Sankalan Pal Chowdhury et.al.	2506.08702	null
2025-06-10	On the Ethics of Using LLMs for Offensive Security	Andreas Happe et.al.	2506.08693	null
2025-06-10	Brevity is the soul of sustainability: Characterizing LLM response lengths	Soham Poddar et.al.	2506.08686	link
2025-06-11	Mitigating Reward Over-optimization in Direct Alignment Algorithms with Importance Sampling	Phuc Minh Nguyen et.al.	2506.08681	link
2025-06-10	Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search	Dongge Han et.al.	2506.08669	null
2025-06-10	Summarization for Generative Relation Extraction in the Microbiome Domain	Oumaima El Khettari et.al.	2506.08647	null
2025-06-10	MEMETRON: Metaheuristic Mechanisms for Test-time Response Optimization of Large Language Models	Son The Nguyen et.al.	2506.08643	null
2025-06-10	Time Series Representations for Classification Lie Hidden in Pretrained Vision Transformers	Simon Roschmann et.al.	2506.08641	null
2025-06-10	Orientation Matters: Making 3D Generative Models Orientation-Aligned	Yichong Lu et.al.	2506.08640	null
2025-06-10	RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping	Yang Bai et.al.	2506.08632	null
2025-06-11	Leveraging LLMs to Evaluate Usefulness of Document	Xingzhu Wang et.al.	2506.08626	null
2025-06-10	Generalizing while preserving monotonicity in comparison-based preference learning models	Julien Fageot et.al.	2506.08616	null
2025-06-10	RE-oriented Model Development with LLM Support and Deduction-based Verification	Radoslaw Klimek et.al.	2506.08606	null
2025-06-10	Hateful Person or Hateful Model? Investigating the Role of Personas in Hate Speech Detection by Large Language Models	Shuzhou Yuan et.al.	2506.08593	null
2025-06-10	CounselBench: A Large-Scale Expert Evaluation and Adversarial Benchmark of Large Language Models in Mental Health Counseling	Yahan Li et.al.	2506.08584	link
2025-06-10	The Geometries of Truth Are Orthogonal Across Tasks	Waiss Azizian et.al.	2506.08572	null
2025-06-11	Detecting State Manipulation Vulnerabilities in Smart Contracts Using LLM and Static Analysis	Hao Wu et.al.	2506.08561	null
2025-06-10	From Pixels to Graphs: using Scene and Knowledge Graphs for HD-EPIC VQA Challenge	Agnese Taluzzi et.al.	2506.08553	null
2025-06-10	Efficient Post-Training Refinement of Latent Reasoning in Large Language Models	Xinyuan Wang et.al.	2506.08552	null
2025-06-11	DeepForm: Reasoning Large Language Model for Communication System Formulation	Panlong Wu et.al.	2506.08551	null
2025-06-10	Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness	Yanwei Gong et.al.	2506.08532	null
2025-06-11	Teaching Physical Awareness to LLMs through Sounds	Weiguo Wang et.al.	2506.08524	null
2025-06-10	MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding	Zhiyi Zhu et.al.	2506.08512	null
2025-06-10	The Predictive Brain: Neural Correlates of Word Expectancy Align with Large Language Model Prediction Probabilities	Nikola Kölbl et.al.	2506.08511	null
2025-06-10	MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning	Kuo Yang et.al.	2506.08507	null
2025-06-10	DRAGged into Conflicts: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs	Arie Cattan et.al.	2506.08500	link
2025-06-10	RHealthTwin: Towards Responsible and Multimodal Digital Twins for Personalized Well-being	Rahatara Ferdousi et.al.	2506.08486	null
2025-06-10	Efficient Fireworks Algorithm Equipped with an Explosion Mechanism based on Student’s T-distribution	Cen Shipeng et.al.	2506.08484	null
2025-06-10	Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning	Fengjun Pan et.al.	2506.08477	null
2025-06-11	AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin	Shuo Yang et.al.	2506.08473	link
2025-06-10	Hybrid Reasoning for Perception, Explanation, and Autonomous Action in Manufacturing	Christos Margadji et.al.	2506.08462	null
2025-06-10	A Review on Score-based Generative Models for Audio Applications	Ge Zhu et.al.	2506.08457	null
2025-06-10	A Survey on Large Language Models for Mathematical Reasoning	Peng-Yuan Wang et.al.	2506.08446	null
2025-06-10	SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills	Kazuki Kawamura et.al.	2506.08443	null
2025-06-10	Olica: Efficient Structured Pruning of Large Language Models without Retraining	Jiujun He et.al.	2506.08436	link
2025-06-11	Low-resource domain adaptation while minimizing energy and hardware resource consumption	Hernán Maina et.al.	2506.08433	null
2025-06-10	CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models	Ziqi Liu et.al.	2506.08430	null
2025-06-10	Better Reasoning with Less Data: Enhancing VLMs Through Unified Modality Scoring	Mingjie Xu et.al.	2506.08429	null
2025-06-10	Know-MRI: A Knowledge Mechanisms Revealer&Interpreter for Large Language Models	Jiaxiang Liu et.al.	2506.08427	link
2025-06-11	SHIELD: Multi-task Multi-distribution Vehicle Routing Solver with Sparsity and Hierarchy	Yong Liang Goh et.al.	2506.08424	null
2025-06-11	Transforming Expert Knowledge into Scalable Ontology via Large Language Models	Ikkei Itoku et.al.	2506.08422	null
2025-06-10	Improved Scaling Laws in Linear Regression via Data Reuse	Licong Lin et.al.	2506.08415	null
2025-06-09	GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior	Penghao Wu et.al.	2506.08012	null
2025-06-09	Play to Generalize: Learning to Reason Through Game Play	Yunfei Xie et.al.	2506.08011	link
2025-06-09	Reinforcement Pre-Training	Qingxiu Dong et.al.	2506.08007	null
2025-06-09	Dreamland: Controllable World Creation with Simulator and Generative Models	Sicheng Mo et.al.	2506.08006	null
2025-06-09	Reparameterized LLM Training via Orthogonal Equivalence Transformation	Zeju Qiu et.al.	2506.08001	null
2025-06-09	MADFormer: Mixed Autoregressive and Diffusion Transformers for Continuous Image Generation	Junhao Chen et.al.	2506.07999	null
2025-06-09	Generative Modeling of Weights: Generalization or Memorization?	Boya Zeng et.al.	2506.07998	link
2025-06-09	Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System	Fan Yang et.al.	2506.07997	null
2025-06-09	HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Hongzheng Chen et.al.	2506.07972	link
2025-06-09	CyberV: Cybernetics for Test-time Scaling in Video Understanding	Jiahao Meng et.al.	2506.07971	link
2025-06-09	SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence	Ziyang Gong et.al.	2506.07966	link
2025-06-09	Reinforcing Multimodal Understanding and Generation with Dual Self-rewards	Jixiang Hong et.al.	2506.07963	null
2025-06-09	Correlated Errors in Large Language Models	Elliot Kim et.al.	2506.07962	null
2025-06-09	TokenBreak: Bypassing Text Classification Models Through Token Manipulation	Kasimir Schulz et.al.	2506.07948	null
2025-06-09	Statistical Hypothesis Testing for Auditing Robustness in Language Models	Paulius Rauba et.al.	2506.07947	null
2025-06-09	ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols	Arnav Sheth et.al.	2506.07945	link
2025-06-09	Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations	Yizhen Li et.al.	2506.07943	null
2025-06-09	Adversarial Attack Classification and Robustness Testing for Large Language Models for Code	Yang Liu et.al.	2506.07942	null
2025-06-09	Gradients: When Markets Meet Fine-tuning – A Distributed Approach to Model Optimisation	Christopher Subia-Waud et.al.	2506.07940	null
2025-06-09	Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor	Rishit Dagli et.al.	2506.07932	null
2025-06-09	Solving Inequality Proofs with Large Language Models	Jiayi Sheng et.al.	2506.07927	link
2025-06-09	LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement	Dimitris Panagopoulos et.al.	2506.07915	null
2025-06-09	WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning	Jie Yang et.al.	2506.07905	link
2025-06-09	FunDiff: Diffusion Models over Function Spaces for Physics-Informed Generative Modeling	Sifan Wang et.al.	2506.07902	link
2025-06-09	MiniCPM4: Ultra-Efficient LLMs on End Devices	MiniCPM Team et.al.	2506.07900	link
2025-06-09	GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution	Shuja Khalid et.al.	2506.07897	null
2025-06-09	Evaluating Large Language Models on the Frame and Symbol Grounding Problems: A Zero-shot Benchmark	Shoko Oka et.al.	2506.07896	link
2025-06-09	Video Unlearning via Low-Rank Refusal Vector	Simone Facchiano et.al.	2506.07891	null
2025-06-09	SoK: Data Reconstruction Attacks Against Machine Learning Models: Definition, Metrics, and Benchmark	Rui Wen et.al.	2506.07888	null
2025-06-09	EgoM2P: Egocentric Multimodal Multitask Pretraining	Gen Li et.al.	2506.07886	null
2025-06-09	A Temporal FRBR/FRBRoo-Based Model for Component-Level Versioning of Legal Norms	Hudson de Martim et.al.	2506.07853	null
2025-06-09	Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning	Yiju Guo et.al.	2506.07851	null
2025-06-09	SAM2Auto: Auto Annotation Using FLASH	Arash Rocky et.al.	2506.07850	null
2025-06-09	Jarzynski Reweighting and Sampling Dynamics for Training Energy-Based Models: Theoretical Analysis of Different Transition Kernels	Davide Carbone et.al.	2506.07843	null
2025-06-09	Diffusion models under low-noise regime	Elizabeth Pavlova et.al.	2506.07841	link
2025-06-09	HAIBU-ReMUD: Reasoning Multimodal Ultrasound Dataset and Model Bridging to General Specific Domains	Shijie Wang et.al.	2506.07837	link
2025-06-09	Improving large language models with concept-aware fine-tuning	Michael K. Chen et.al.	2506.07833	link
2025-06-09	R3D2: Realistic 3D Asset Insertion via Diffusion for Autonomous Driving Simulation	William Ljungbergh et.al.	2506.07826	null
2025-06-09	Addition in Four Movements: Mapping Layer-wise Information Trajectories in LLMs	Yao Yan et.al.	2506.07824	null
2025-06-09	Guideline Forest: Experience-Induced Multi-Guideline Reasoning with Stepwise Aggregation	Jiaxiang Chen et.al.	2506.07820	null
2025-06-09	WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code	Zhiyu Lin et.al.	2506.07818	link
2025-06-09	Self-Cascaded Diffusion Models for Arbitrary-Scale Image Super-Resolution	Junseo Bang et.al.	2506.07813	null
2025-06-09	LLM Unlearning Should Be Form-Independent	Xiaotian Ye et.al.	2506.07795	null
2025-06-09	Language-Vision Planner and Executor for Text-to-Visual Reasoning	Yichang Xu et.al.	2506.07778	null
2025-06-09	REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models	Diego Forniés-Tabuenca et.al.	2506.07759	null
2025-06-09	Augmenting LLMs’ Reasoning by Reinforcing Abstract Thinking	Silin Gao et.al.	2506.07751	null
2025-06-09	Research quality evaluation by AI in the era of Large Language Models: Advantages, disadvantages, and systemic effects	Mike Thelwall et.al.	2506.07748	null
2025-06-09	Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images	Yingping Liang et.al.	2506.07740	null
2025-06-09	ArchiLense: A Framework for Quantitative Analysis of Architectural Styles Based on Vision Large Language Models	Jing Zhong et.al.	2506.07739	null
2025-06-09	RSafe: Incentivizing proactive reasoning to build robust and adaptive LLM safeguards	Jingnan Zheng et.al.	2506.07736	null
2025-06-09	Language Embedding Meets Dynamic Graph: A New Exploration for Neural Architecture Representation Learning	Haizhao Jing et.al.	2506.07735	null
2025-06-09	NeurIPS 2025 E2LM Competition: Early Training Evaluation of Language Models	Mouadh Yagoubi et.al.	2506.07731	null
2025-06-09	Interaction Analysis by Humans and AI: A Comparative Perspective	Maryam Teimouri et.al.	2506.07707	null
2025-06-09	Training Superior Sparse Autoencoders for Instruct Models	Jiaming Li et.al.	2506.07691	null
2025-06-09	QUITE: A Query Rewrite System Beyond Rules with LLM Agents	Yuyang Song et.al.	2506.07675	null
2025-06-09	How Benchmark Prediction from Fewer Data Misses the Mark	Guanhua Zhang et.al.	2506.07673	link
2025-06-09	Foundation Model Empowered Synesthesia of Machines (SoM): AI-native Intelligent Multi-Modal Sensing-Communication Integration	Xiang Cheng et.al.	2506.07647	null
2025-06-09	Evaluating LLMs Robustness in Less Resourced Languages with Proxy Models	Maciej Chrabąszcz et.al.	2506.07645	null
2025-06-09	TreeReview: A Dynamic Tree of Questions Framework for Deep and Efficient LLM-based Scientific Peer Review	Yuan Chang et.al.	2506.07642	null
2025-06-09	SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling	Haoran Wang et.al.	2506.07636	null
2025-06-09	Event-Priori-Based Vision-Language Model for Efficient Visual Understanding	Haotong Qin et.al.	2506.07627	null
2025-06-09	Intent Matters: Enhancing AI Tutoring with Fine-Grained Pedagogical Intent Annotation	Kseniia Petukhova et.al.	2506.07626	null
2025-06-09	LoRMA: Low-Rank Multiplicative Adaptation for LLMs	Harsh Bihany et.al.	2506.07621	null
2025-06-09	Vuyko Mistral: Adapting LLMs for Low-Resource Dialectal Translation	Roman Kyslyi et.al.	2506.07617	null
2025-06-09	PolitiSky24: U.S. Political Bluesky Dataset with User Stance Labels	Peyman Rostami et.al.	2506.07606	null
2025-06-09	SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis	Jianhui Wei et.al.	2506.07603	null
2025-06-09	SceneRAG: Scene-level Retrieval-Augmented Generation for Video Understanding	Nianbo Zeng et.al.	2506.07600	null
2025-06-09	Instructing Large Language Models for Low-Resource Languages: A Systematic Study for Basque	Oscar Sainz et.al.	2506.07597	null
2025-06-09	TwinBreak: Jailbreaking LLM Security Alignments based on Twin Prompts	Torsten Krauß et.al.	2506.07596	null
2025-06-09	Evaluating LLMs Effectiveness in Detecting and Correcting Test Smells: An Empirical Study	E. G. Santana Jr et.al.	2506.07594	link
2025-06-09	MalGEN: A Generative Agent Framework for Modeling Malicious Software in Cybersecurity	Bikash Saha et.al.	2506.07586	null
2025-06-09	MIRA: Medical Time Series Foundation Model for Real-World Health Data	Hao Li et.al.	2506.07584	null
2025-06-09	Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models	Ramakrishna Appicharla et.al.	2506.07583	null
2025-06-09	Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding	Boyu Chen et.al.	2506.07576	null
2025-06-09	SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems	Peiran Li et.al.	2506.07564	null
2025-06-09	Cross-channel Perception Learning for H&E-to-IHC Virtual Staining	Hao Yang et.al.	2506.07559	null
2025-06-09	SELT: Self-Evaluation Tree Search for LLMs with Task Decomposition	Mengsong Wu et.al.	2506.07557	null
2025-06-09	ChemAgent: Enhancing LLMs for Chemistry and Materials Science through Tree-Search Based Tool Learning	Mengsong Wu et.al.	2506.07551	link
2025-06-09	APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs	Bowen Liu et.al.	2506.07542	null
2025-06-09	Bit-level BPE: Below the byte boundary	Sangwhan Moon et.al.	2506.07541	null
2025-06-09	MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts	Wei Tao et.al.	2506.07533	null
2025-06-09	BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation	Hongyu Wang et.al.	2506.07530	link
2025-06-09	Learning What Reinforcement Learning Can’t: Interleaved Online Fine-Tuning for Hardest Questions	Lu Ma et.al.	2506.07527	link
2025-06-09	Towards Large Language Models with Self-Consistent Natural Language Explanations	Sahar Admoni et.al.	2506.07523	null
2025-06-09	LeVo: High-Quality Song Generation with Multi-Preference Alignment	Shun Lei et.al.	2506.07520	link
2025-06-09	Taking Flight with Dialogue: Enabling Natural Language Control for PX4-based Drone Agent	Shoon Kit Lim et.al.	2506.07509	link
2025-06-09	Large Language Models for Multilingual Vulnerability Detection: How Far Are We?	Honglin Shu et.al.	2506.07503	link
2025-06-09	Explicit Preference Optimization: No Need for an Implicit Reward Model	Xiangkun Hu et.al.	2506.07492	link
2025-06-09	SpatialLM: Training Large Language Models for Structured Indoor Modeling	Yongsen Mao et.al.	2506.07491	null
2025-06-09	A Framework for Creating Non-Regressive Test Cases via Branch Consistency Analysis Driven by Descriptions	Yuxiang Zhang et.al.	2506.07486	null
2025-06-09	A Hybrid GA LLM Framework for Structured Task Optimization	Berry Feng et.al.	2506.07483	link
2025-06-09	Improving Fairness of Large Language Models in Multi-document Summarization	Haoyuan Li et.al.	2506.07479	link
2025-06-09	DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO	Jinyoung Park et.al.	2506.07464	null
2025-06-09	CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models	Guang Liu et.al.	2506.07463	null
2025-06-09	From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered	Siddartha Devic et.al.	2506.07461	null
2025-06-09	ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning	Ziwen Wang et.al.	2506.07459	null
2025-06-09	KScope: A Framework for Characterizing the Knowledge Status of Language Models	Yuxin Xiao et.al.	2506.07458	null
2025-06-09	PhysiInter: Integrating Physical Mapping for High-Fidelity Human Interaction Generation	Wei Yao et.al.	2506.07456	null
2025-06-09	Language-Grounded Hierarchical Planning and Execution with Multi-Robot 3D Scene Graphs	Jared Strader et.al.	2506.07454	null
2025-06-09	When Style Breaks Safety: Defending Language Models Against Superficial Style Alignment	Yuxin Xiao et.al.	2506.07452	link
2025-06-06	TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation	Muhammad Sohail Danish et.al.	2506.06281	null
2025-06-06	Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias	Yuanzhe Hu et.al.	2506.06280	null
2025-06-06	CoMemo: LVLMs Need Image Context with Image Memory	Shi Liu et.al.	2506.06279	null
2025-06-06	STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis	Jiatao Gu et.al.	2506.06276	null
2025-06-06	AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization	Mukur Gupta et.al.	2506.06273	null
2025-06-06	RecGPT: A Foundation Model for Sequential Recommendation	Yangqin Jiang et.al.	2506.06270	link
2025-06-06	Cartridges: Lightweight and general-purpose long context representations via self-study	Sabri Eyuboglu et.al.	2506.06266	null
2025-06-06	PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time	Weizhi Zhang et.al.	2506.06254	null
2025-06-06	DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation	Jingyu Xiao et.al.	2506.06251	link
2025-06-06	Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models	Zahra Babaiee et.al.	2506.06242	null
2025-06-06	Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge	Yi Sui et.al.	2506.06240	null
2025-06-06	Challenging Vision-Language Models with Surgical Data: A New Dataset and Broad Benchmarking Study	Leon Mayer et.al.	2506.06232	null
2025-06-06	CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports	Peter Pirkelbauer et.al.	2506.06227	null
2025-06-06	PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems	Yi Huang et.al.	2506.06226	null
2025-06-06	STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving	Christian Fruhwirth-Reisinger et.al.	2506.06218	link
2025-06-06	Can Theoretical Physics Research Benefit from Language Agents?	Sirui Lu et.al.	2506.06214	null
2025-06-06	Model-Driven Graph Contrastive Learning	Ali Azizpour et.al.	2506.06212	null
2025-06-06	PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts	Hengzhi Li et.al.	2506.06211	null
2025-06-06	Does It Run and Is That Enough? Revisiting Text-to-Chart Generation with a Multi-Agent Approach	James Ford et.al.	2506.06175	null
2025-06-06	Technical Report for Egocentric Mistake Detection for the HoloAssist Challenge	Constantin Patsch et.al.	2506.06174	null
2025-06-06	The Lock-in Hypothesis: Stagnation by Algorithm	Tianyi Alex Qiu et.al.	2506.06166	null
2025-06-06	ENMA: Tokenwise Autoregression for Generative Neural PDE Operators	Armand Kassaï Koupaï et.al.	2506.06158	null
2025-06-06	Masked Language Models are Good Heterogeneous Graph Generalizers	Jinyu Yang et.al.	2506.06157	link
2025-06-06	Personalized Large Language Models Can Increase the Belief Accuracy of Social Networks	Adiba Mahbub Proma et.al.	2506.06153	null
2025-06-06	Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems	Haowei Wang et.al.	2506.06151	link
2025-06-06	Let’s CONFER: A Dataset for Evaluating Natural Language Inference Models on CONditional InFERence and Presupposition	Tara Azin et.al.	2506.06133	null
2025-06-06	Bridging the Gap: In-Context Learning for Modeling Human Disagreement	Benedetta Muscato et.al.	2506.06113	null
2025-06-06	Text-to-LoRA: Instant Transformer Adaption	Rujikorn Charakorn et.al.	2506.06105	null
2025-06-06	VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning	Zikang Wang et.al.	2506.06097	null
2025-06-06	Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU	Wenhao Dai et.al.	2506.06095	null
2025-06-06	Reinforcing Code Generation: Improving Text-to-SQL with Execution-Based Learning	Atharv Kulkarni et.al.	2506.06093	null
2025-06-06	Full Conformal Adaptation of Medical Vision-Language Models	Julio Silva-Rodríguez et.al.	2506.06076	null
2025-06-06	Zero-Shot Detection of LLM-Generated Code via Approximated Task Conditioning	Maor Ashkenazi et.al.	2506.06069	null
2025-06-06	Conversational Interfaces for Parametric Conceptual Architectural Design: Integrating Mixed Reality with LLM-driven Interaction	Ruochen Ji et.al.	2506.06066	null
2025-06-06	Simple Yet Effective: Extracting Private Data Across Clients in Federated Fine-Tuning of Large Language Models	Yingqi Hu et.al.	2506.06060	null
2025-06-06	Hey, That’s My Data! Label-Only Dataset Inference in Large Language Models	Chen Xiong et.al.	2506.06057	null
2025-06-06	CP-Bench: Evaluating Large Language Models for Constraint Modelling	Kostis Michailidis et.al.	2506.06052	null
2025-06-06	HAVIR: HierArchical Vision to Image Reconstruction using CLIP-Guided Versatile Diffusion	Shiyi Zhang et.al.	2506.06035	null
2025-06-06	MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems?	Zhitao He et.al.	2506.06034	null
2025-06-06	Large Language Models are Demonstration Pre-Selectors for Themselves	Jiarui Jin et.al.	2506.06033	null
2025-06-06	On Inverse Problems, Parameter Estimation, and Domain Generalization	Deborah Pereg et.al.	2506.06024	null
2025-06-06	When to Trust Context: Self-Reflective Debates for Context Reliability	Zeqi Zhou et.al.	2506.06020	null
2025-06-06	Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models	Chaoyi Zhu et.al.	2506.06018	link
2025-06-06	AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search	Yu Li et.al.	2506.06017	null
2025-06-06	On the Merits of LLM-Based Corpus Enrichment	Gal Zur et.al.	2506.06015	null
2025-06-06	Unlocking Recursive Thinking of LLMs: Alignment via Refinement	Haoke Zhang et.al.	2506.06009	null
2025-06-06	Token Signature: Predicting Chain-of-Thought Gains with Token Decoding Feature in Large Language Models	Peijie Liu et.al.	2506.06008	null
2025-06-06	Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models	Yifu Qiu et.al.	2506.06006	link
2025-06-06	LightGTS: A Lightweight General Time Series Forecasting Model	Yihang Wang et.al.	2506.06005	null
2025-06-06	A Culturally-Rich Romanian NLP Dataset from “Who Wants to Be a Millionaire?” Videos	Alexandru-Gabriel Ganea et.al.	2506.05991	null
2025-06-06	Leveraging Generative AI for Enhancing Automated Assessment in Programming Education Contests	Stefan Dascalescu et.al.	2506.05990	null
2025-06-06	Audio-Aware Large Language Models as Judges for Speaking Styles	Cheng-Han Chiang et.al.	2506.05984	null
2025-06-06	CrimeMind: Simulating Urban Crime with Multi-Modal LLM Agents	Qingbin Zeng et.al.	2506.05981	null
2025-06-06	Mitigating Catastrophic Forgetting with Adaptive Transformer Block Expansion in Federated Fine-Tuning	Yujia Huo et.al.	2506.05977	null
2025-06-06	Let’s Put Ourselves in Sally’s Shoes: Shoes-of-Others Prefixing Improves Theory of Mind in Large Language Models	Kazutoshi Shinoda et.al.	2506.05970	null
2025-06-06	Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning	Motoki Omura et.al.	2506.05968	link
2025-06-06	Preference Learning for AI Alignment: a Causal Perspective	Katarzyna Kobalczyk et.al.	2506.05967	null
2025-06-06	AQUATIC-Diff: Additive Quantization for Truly Tiny Compressed Diffusion Models	Adil Hasan et.al.	2506.05960	null
2025-06-06	Elementary Math Word Problem Generation using Large Language Models	Nimesh Ariyarathne et.al.	2506.05950	null
2025-06-06	IntentionESC: An Intention-Centered Framework for Enhancing Emotional Support in Dialogue Systems	Xinjie Zhang et.al.	2506.05947	null
2025-06-06	Exponential Family Variational Flow Matching for Tabular Data Generation	Andrés Guzmán-Cordero et.al.	2506.05940	null
2025-06-06	Respecting Temporal-Causal Consistency: Entity-Event Knowledge Graphs for Retrieval-Augmented Generation	Ze Yu Zhang et.al.	2506.05939	null
2025-06-06	DynamicMind: A Tri-Mode Thinking System for Large Language Models	Wei Li et.al.	2506.05936	null
2025-06-06	MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models	Jie Cao et.al.	2506.05928	null
2025-06-06	Small Models, Big Support: A Local LLM Framework for Teacher-Centric Content Creation and Assessment using RAG and CAG	Zarreen Reza et.al.	2506.05925	null
2025-06-06	Generating Grounded Responses to Counter Misinformation via Learning Efficient Fine-Grained Critiques	Xiaofei Xu et.al.	2506.05924	null
2025-06-06	Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router	Chenyang Shao et.al.	2506.05901	null
2025-06-06	Explainability in Context: A Multilevel Framework Aligning AI Explanations with Stakeholder with LLMs	Marilyn Bello et.al.	2506.05887	null
2025-06-06	Human-AI Alignment of Multimodal Large Language Models with Speech-Language Pathologists in Parent-Child Interactions	Weiyan Shi et.al.	2506.05879	null
2025-06-06	Research on Personalized Financial Product Recommendation by Integrating Large Language Models and Graph Neural Networks	Yushang Zhao et.al.	2506.05873	null
2025-06-06	Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection	Yu Li et.al.	2506.05872	null
2025-06-06	BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures	Xiannan Hu et.al.	2506.05871	null
2025-06-06	Stealix: Model Stealing via Prompt Evolution	Zhixiong Zhuang et.al.	2506.05867	null
2025-06-06	CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy	Jiakai Zhang et.al.	2506.05864	null
2025-06-06	Microstructural Studies Using Generative Adversarial Network (GAN): a Case Study	Owais Ahmad et.al.	2506.05860	link
2025-06-06	Towards Next-Generation Intelligent Maintenance: Collaborative Fusion of Large and Small Models	Xiaoyi Yuan et.al.	2506.05854	null
2025-06-06	Training-Free Query Optimization via LLM-Based Plan Similarity	Nikita Vasilenko et.al.	2506.05853	null
2025-06-06	Cross-lingual Collapse: How Language-Centric Foundation Models Shape Reasoning in Large Language Models	Cheonbok Park et.al.	2506.05850	null
2025-06-06	Heartcare Suite: Multi-dimensional Understanding of ECG with Raw Multi-lead Signal Modeling	Yihan Xie et.al.	2506.05831	null
2025-06-06	CodeContests+: High-Quality Test Case Generation for Competitive Programming	Zihan Wang et.al.	2506.05817	null
2025-06-06	LLIA – Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models	Haojie Yu et.al.	2506.05806	null
2025-06-06	Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models	Yuke Lin et.al.	2506.05796	null
2025-06-06	Discrete Minds in a Continuous World: Do Language Models Know Time Passes?	Minghan Wang et.al.	2506.05790	null
2025-06-06	EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs	Ivan Rodin et.al.	2506.05787	null
2025-06-06	dots.llm1 Technical Report	Bi Huo et.al.	2506.05767	null
2025-06-06	BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions	Saptarshi Sengupta et.al.	2506.05766	null
2025-06-06	BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning	Yunpeng Qing et.al.	2506.05762	null
2025-06-06	Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning	Xuanyu Lei et.al.	2506.05760	null
2025-06-06	LLM-Symbolic Integration for Robust Temporal Tabular Reasoning	Atharv Kulkarni et.al.	2506.05746	null
2025-06-06	Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence Awareness	Rongzhe Wei et.al.	2506.05735	null
2025-06-06	Large Language Models are Good Relational Learners	Fang Wu et.al.	2506.05725	null
2025-06-06	Advancement and Field Evaluation of a Dual-arm Apple Harvesting Robot	Keyi Zhu et.al.	2506.05714	null
2025-06-06	Come Together, But Not Right Now: A Progressive Strategy to Boost Low-Rank Adaptation	Zhan Zhuang et.al.	2506.05713	null
2025-06-06	Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition	Mu Yang et.al.	2506.05706	null
2025-06-06	RKEFino1: A Regulation Knowledge-Enhanced Large Language Model	Yan Wang et.al.	2506.05700	null
2025-06-06	Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework	Lingyuan Liu et.al.	2506.05695	link
2025-06-06	SafeGenBench: A Benchmark Framework for Security Vulnerability Detection in LLM-Generated Code	Xinghang Li et.al.	2506.05692	null
2025-06-06	When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation	Zhishang Xiang et.al.	2506.05690	link
2025-06-06	Pts3D-LLM: Studying the Impact of Token Structure for 3D Scene Understanding With Large Language Models	Hugues Thomas et.al.	2506.05689	null
2025-06-06	Voice Impression Control in Zero-Shot TTS	Keinichi Fujita et.al.	2506.05688	null
2025-06-05	Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets	Lei Hsiung et.al.	2506.05346	null
2025-06-05	SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs	Jiahui Wang et.al.	2506.05344	link
2025-06-05	ContentV: Efficient Training of Video Generation Models with Limited Compute	Wenfeng Lin et.al.	2506.05343	null
2025-06-05	Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning	Xingjian Ran et.al.	2506.05341	null
2025-06-05	VideoMolmo: Spatio-Temporal Grounding Meets Pointing	Ghazi Shazan Ahmad et.al.	2506.05336	link
2025-06-05	Search Arena: Analyzing Search-Augmented LLMs	Mihran Miroyan et.al.	2506.05334	link
2025-06-05	MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning	Xinyan Chen et.al.	2506.05331	link
2025-06-05	LSM-2: Learning from Incomplete Wearable Sensor Data	Maxwell A. Xu et.al.	2506.05321	null
2025-06-05	Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay	Yifan Sun et.al.	2506.05316	null
2025-06-05	Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models	Taha Entesari et.al.	2506.05314	null
2025-06-05	Learning normalized image densities via dual score matching	Florentin Guth et.al.	2506.05310	null
2025-06-05	ProRefine: Inference-time Prompt Refinement with Textual Feedback	Deepak Pandita et.al.	2506.05305	null
2025-06-05	Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos	Weifeng Lin et.al.	2506.05302	null
2025-06-05	Power Law Guided Dynamic Sifting for Efficient Attention	Nirav Koley et.al.	2506.05300	null
2025-06-05	Sample Complexity and Representation Ability of Test-time Scaling Paradigms	Baihe Huang et.al.	2506.05295	null
2025-06-05	EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?	Yuqian Yuan et.al.	2506.05287	null
2025-06-05	Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning	Nan Huo et.al.	2506.05278	null
2025-06-05	How to Unlock Time Series Editing? Diffusion-Driven Approach with Multi-Grained Control	Hao Yu et.al.	2506.05276	null
2025-06-05	Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams	Mohammed Almutairi et.al.	2506.05265	null
2025-06-05	Can Foundation Models Generalise the Presentation Attack Detection Capabilities on ID Cards?	Juan E. Tapia et.al.	2506.05263	null
2025-06-05	LeanPO: Lean Preference Optimization for Likelihood Alignment in Video-LLMs	Xiaodong Wang et.al.	2506.05260	link
2025-06-05	SECNEURON: Reliable and Flexible Abuse Control in Local LLMs via Hybrid Neuron Encryption	Zhiqiang Wang et.al.	2506.05242	null
2025-06-05	Aligning Latent Spaces with Flow Priors	Yizhuo Li et.al.	2506.05240	null
2025-06-05	DSG-World: Learning a 3D Gaussian World Model from Dual State Videos	Wenhao Hu et.al.	2506.05217	null
2025-06-05	LLM-First Search: Self-Guided Exploration of the Solution Space	Nathan Herr et.al.	2506.05213	link
2025-06-05	Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation	Jan Ackermann et.al.	2506.05210	null
2025-06-05	The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text	Nikhil Kandpal et.al.	2506.05209	null
2025-06-05	RELIC: Evaluating Compositional Instruction Following via Language Recognition	Jackson Petty et.al.	2506.05205	null
2025-06-05	Transformers Meet In-Context Learning: A Universal Approximation Theory	Gen Li et.al.	2506.05200	null
2025-06-05	Quantifying Cross-Modality Memorization in Vision-Language Models	Yuxin Wen et.al.	2506.05198	null
2025-06-05	Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis	Neeraj Kumar et.al.	2506.05184	null
2025-06-05	TreeRPO: Tree Relative Policy Optimization	Zhicheng Yang et.al.	2506.05183	null
2025-06-05	On the Comprehensibility of Multi-structured Financial Documents using LLMs and Pre-processing Tools	Shivani Upadhyay et.al.	2506.05182	link
2025-06-05	Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models	Yanzhao Zhang et.al.	2506.05176	null
2025-06-06	ECoRAG: Evidentiality-guided Compression for Long Context RAG	Yeonseok Jeong et.al.	2506.05167	link
2025-06-06	Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective	Bhavik Chandna et.al.	2506.05166	null
2025-06-05	Do Large Language Models Judge Error Severity Like Humans?	Diege Sun et.al.	2506.05142	null
2025-06-05	DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Tanmay Parekh et.al.	2506.05128	null
2025-06-05	PixCell: A generative foundation model for digital histopathology images	Srikar Yellapragada et.al.	2506.05127	null
2025-06-05	Membership Inference Attacks on Sequence Models	Lorenzo Rossi et.al.	2506.05126	null
2025-06-05	The NTNU System at the S&I Challenge 2025 SLA Open Track	Hong-Yun Lin et.al.	2506.05121	null
2025-06-05	DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models	Revant Teotia et.al.	2506.05108	null
2025-06-05	Survey on the Evaluation of Generative Models in Music	Alexander Lerch et.al.	2506.05104	null
2025-06-05	Privacy Amplification Through Synthetic Data: Insights from Linear Regression	Clément Pierquin et.al.	2506.05101	null
2025-06-05	Interpretable Multimodal Framework for Human-Centered Street Assessment: Integrating Visual-Language Models for Perceptual Urban Diagnostics	HaoTian Lan et.al.	2506.05087	null
2025-06-05	Parking, Perception, and Retail: Street-Level Determinants of Community Vitality in Harbin	HaoTian Lan et.al.	2506.05080	null
2025-06-05	Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation	Soumitra Ghosh et.al.	2506.05073	null
2025-06-05	RIVAL: Reinforcement Learning with Iterative and Adversarial Optimization for Machine Translation	Tianjiao Li et.al.	2506.05070	null
2025-06-05	Reason-to-Recommend: Using Interaction-of-Thought Reasoning to Enhance LLM Recommendation	Keyu Zhao et.al.	2506.05069	null
2025-06-06	Does It Make Sense to Speak of Introspection in Large Language Models?	Iulia M. Comsa et.al.	2506.05068	null
2025-06-05	A Survey on Vietnamese Document Analysis and Recognition: Challenges and Future Directions	Anh Le et.al.	2506.05061	null
2025-06-05	TALL – A Trainable Architecture for Enhancing LLM Performance in Low-Resource Languages	Moshe Ofer et.al.	2506.05057	null
2025-06-05	Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers	Yutao Hou et.al.	2506.05038	null
2025-06-05	Tuning the Right Foundation Models is What you Need for Partial Label Learning	Kuang He et.al.	2506.05027	link
2025-06-05	Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System	Haokun Liu et.al.	2506.05020	null
2025-06-05	UAV4D: Dynamic Neural Rendering of Human-Centric UAV Imagery using Gaussian Splatting	Jaehoon Choi et.al.	2506.05011	null
2025-06-05	ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development	Zhenran Xu et.al.	2506.05010	link
2025-06-05	Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting	Alfred T. Christiansen et.al.	2506.05009	null
2025-06-05	QiMeng: Fully Automated Hardware and Software Design for Processor Chip	Rui Zhang et.al.	2506.05007	null
2025-06-05	SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View	Yongjie Xiao et.al.	2506.05000	null
2025-06-05	Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning	Mehdi Azarafza et.al.	2506.04998	null
2025-06-05	BacPrep: An Experimental Platform for Evaluating LLM-Based Bacalaureat Assessment	Dumitran Adrian Marius et.al.	2506.04989	null
2025-06-05	FPTQuant: Function-Preserving Transforms for LLM Quantization	Boris van Breugel et.al.	2506.04985	null
2025-06-05	TextVidBench: A Benchmark for Long Video Scene Text Understanding	Yangyang Zhong et.al.	2506.04983	null
2025-06-05	Agentic AI for Intent-Based Industrial Automation	Marcos Lima Romero et.al.	2506.04980	link
2025-06-05	Evaluating Prompt-Driven Chinese Large Language Models: The Influence of Persona Assignment on Stereotypes and Safeguards	Geng Liu et.al.	2506.04975	null
2025-06-05	From Struggle (06-2024) to Mastery (02-2025) LLMs Conquer Advanced Algorithm Exams and Pave the Way for Editorial Generation	Adrian Marius Dumitran et.al.	2506.04965	null
2025-06-06	PoCGen: Generating Proof-of-Concept Exploits for Vulnerabilities in Npm Packages	Deniz Simsek et.al.	2506.04962	null
2025-06-05	APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval	Hong Gao et.al.	2506.04953	null
2025-06-05	Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback	Junior Cedric Tonga et.al.	2506.04920	null
2025-06-05	When Thinking LLMs Lie: Unveiling the Strategic Deception in Representations of Reasoning Models	Kai Wang et.al.	2506.04909	null
2025-06-05	Verbose ListOps (VLO): Beyond Long Context – Unmasking LLM’s Reasoning Blind Spots	Alex Pan et.al.	2506.04907	null
2025-06-05	From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes	Tianxu Wang et.al.	2506.04897	null
2025-06-05	ICPC-Eval: Probing the Frontiers of LLM Reasoning with Competitive Programming Contests	Shiyi Xu et.al.	2506.04894	link
2025-06-05	Evaluating the Effectiveness of Linguistic Knowledge in Pretrained Language Models: A Case Study of Universal Dependencies	Wenxi Li et.al.	2506.04887	null
2025-06-05	Consciousness via MIPT?	Alexander Gorsky et.al.	2506.04875	null
2025-06-05	LLMs for sensory-motor control: Combining in-context and iterative learning	Jônata Tyska Carvalho et.al.	2506.04867	link
2025-06-05	Adapting Online Customer Reviews for Blind Users: A Case Study of Restaurant Reviews	Mohan Sunkara et.al.	2506.04865	null
2025-06-06	Sparse Autoencoders, Again?	Yin Lu et.al.	2506.04859	null
2025-06-05	Prompting LLMs: Length Control for Isometric Machine Translation	Dávid Javorský et.al.	2506.04855	null
2025-06-05	Multiple-Choice Question Generation Using Large Language Models: Methodology and Educator Insights	Giorgio Biancini et.al.	2506.04851	null
2025-06-05	On Automating Security Policies with Contemporary LLMs	Pablo Fernández Saura et.al.	2506.04838	null
2025-06-05	OpenMaskDINO3D: Reasoning 3D Segmentation via Large Language Model	Kunshen Zhang et.al.	2506.04837	link
2025-06-05	Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models	Changyue Wang et.al.	2506.04832	link
2025-06-05	DualX-VSR: Dual Axial Spatial × Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation	Shuo Cao et.al.	2506.04830	null
2025-06-05	Evaluating Vision-Language and Large Language Models for Automated Student Assessment in Indonesian Classrooms	Nurul Aisyah et.al.	2506.04822	null
2025-06-05	LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning	Zhen Hao Wong et.al.	2506.04821	link
2025-06-05	Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study	Yujun Zhou et.al.	2506.04810	link
2025-06-05	Towards LLM-Centric Multimodal Fusion: A Survey on Integration Strategies and Techniques	Jisu An et.al.	2506.04788	null
2025-06-05	MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark	Dingdong Wang et.al.	2506.04779	link
2025-06-05	Fine-Grained Interpretation of Political Opinions in Large Language Models	Jingyu Hu et.al.	2506.04774	null
2025-06-05	GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval	Lingyuan Liu et.al.	2506.04762	link
2025-06-05	Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion	Lingyuan Liu et.al.	2506.04760	link
2025-06-05	Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning	Shenshen Li et.al.	2506.04755	link
2025-06-05	Multi-Layer GRPO: Enhancing Reasoning and Self-Correction in Large Language Models	Fei Ding et.al.	2506.04746	null
2025-06-05	SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs	Shuhan Xu et.al.	2506.04743	null
2025-06-05	Lifelong Evolution: Collaborative Learning between Large and Small Language Models for Continuous Emergent Fake News Detection	Ziyi Zhou et.al.	2506.04739	null
2025-06-05	Towards Holistic Visual Quality Assessment of AI-Generated Videos: A LLM-Based Multi-Dimensional Evaluation Model	Zelu Qi et.al.	2506.04715	link
2025-06-05	UNO: Unlearning via Orthogonalization in Generative models	Pinak Mandal et.al.	2506.04712	null
2025-06-05	LLM-based phoneme-to-grapheme for phoneme-based speech recognition	Te Ma et.al.	2506.04711	null
2025-06-04	OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Junting Chen et.al.	2506.04217	link
2025-06-04	Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector	Boyong He et.al.	2506.04211	link
2025-06-04	Language-Image Alignment with Fixed Text Encoders	Jingfeng Yang et.al.	2506.04209	null
2025-06-04	Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning	Shuang Chen et.al.	2506.04207	null
2025-06-04	EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation	Jinghan Jia et.al.	2506.04205	link
2025-06-04	Cascadia: A Cascade Serving System for Large Language Models	Youhe Jiang et.al.	2506.04203	null
2025-06-04	TracLLM: A Generic Framework for Attributing Long Context LLMs	Yanting Wang et.al.	2506.04202	link
2025-06-04	R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning	Qingfei Zhao et.al.	2506.04185	link
2025-06-04	SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models	Yuhao Wu et.al.	2506.04180	null
2025-06-04	SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling	Anhao Zhao et.al.	2506.04179	null
2025-06-04	Does Prompt Design Impact Quality of Data Imputation by LLMs?	Shreenidhi Srinivasan et.al.	2506.04172	null
2025-06-04	Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints	Utkarsh Utkarsh et.al.	2506.04171	null
2025-06-04	VISCA: Inferring Component Abstractions for Automated End-to-End Testing	Parsa Alian et.al.	2506.04161	null
2025-06-04	A Dataset for Addressing Patient’s Information Needs related to Clinical Course of Hospitalization	Sarvesh Soni et.al.	2506.04156	null
2025-06-04	Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis	Kejian Zhu et.al.	2506.04142	null
2025-06-04	MMR-V: What’s Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos	Kejian Zhu et.al.	2506.04141	null
2025-06-04	TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems	Shaina Raza et.al.	2506.04133	null
2025-06-04	Guided Speculative Inference for Efficient Test-Time Alignment of LLMs	Jonathan Geuter et.al.	2506.04118	link
2025-06-05	Rectified Sparse Attention	Yutao Sun et.al.	2506.04108	null
2025-06-04	TextAtari: 100K Frames Game Playing with Language Agents	Wenhao Li et.al.	2506.04098	link
2025-06-04	AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment	Anastasiia Ivanova et.al.	2506.04089	link
2025-06-04	Multimodal Tabular Reasoning with Privileged Structured Information	Jun-Peng Jiang et.al.	2506.04088	null
2025-06-04	EuroLLM-9B: Technical Report	Pedro Henrique Martins et.al.	2506.04079	null
2025-06-04	LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation	Ming Zhang et.al.	2506.04078	link
2025-06-04	A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions	Chung-Chun Wang et.al.	2506.04077	null
2025-06-04	A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis	Esteban Gutiérrez et.al.	2506.04073	null
2025-06-04	Controlling Difficulty of Generated Text for AI-Assisted Language Learning	Meiqing Jin et.al.	2506.04072	null
2025-06-04	Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning	Muling Wu et.al.	2506.04065	null
2025-06-04	Crowd-SFT: Crowdsourcing for LLM Alignment	Alex Sotiropoulos et.al.	2506.04063	null
2025-06-04	Towards generating more interpretable counterfactuals via concept vectors: a preliminary study on chest X-rays	Bulat Maksudov et.al.	2506.04058	link
2025-06-04	High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning	Tim Franzmeyer et.al.	2506.04051	null
2025-06-04	Explainability-Based Token Replacement on LLM-Generated Text	Hadi Mohammadi et.al.	2506.04050	null
2025-06-04	Lacuna Inc. at SemEval-2025 Task 4: LoRA-Enhanced Influence-Based Unlearning for LLMs	Aleksey Kudelya et.al.	2506.04044	null
2025-06-04	Think Like a Person Before Responding: A Multi-Faceted Evaluation of Persona-Guided LLMs for Countering Hate	Mikel K. Ngueajio et.al.	2506.04043	link
2025-06-04	Unveiling and Eliminating the Shortcut Learning for Locate-Then-Edit Knowledge Editing via Both Subject and Relation Awareness	Xiyu Liu et.al.	2506.04042	null
2025-06-04	Mitigating Hallucinations in Large Vision-Language Models via Entity-Centric Multimodal Preference Optimization	Jiulong Wu et.al.	2506.04039	null
2025-06-04	Generating Automotive Code: Large Language Models for Software Development and Verification in Safety-Critical Systems	Sven Kirchner et.al.	2506.04038	null
2025-06-04	Privacy and Security Threat for OpenAI GPTs	Wei Wenying et.al.	2506.04036	null
2025-06-04	AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in LLM-Based Agents	Akshat Naik et.al.	2506.04018	null
2025-06-04	Dreaming up scale invariance via inverse renormalization group	Adam Rançon et.al.	2506.04016	link
2025-06-04	GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems	Tiehua Mei et.al.	2506.04015	null
2025-06-04	Large deviations for scaled families of Schrödinger bridges with reflection	Viktor Nilsson et.al.	2506.03999	null
2025-06-04	Seeing What Tastes Good: Revisiting Multimodal Distributional Semantics in the Billion Parameter Era	Dan Oneata et.al.	2506.03994	null
2025-06-04	From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding	Chiwei Zhu et.al.	2506.03968	null
2025-06-04	Lower Ricci Curvature for Hypergraphs	Shiyi Yang et.al.	2506.03943	null
2025-06-04	Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning	Junqi Gao et.al.	2506.03939	link
2025-06-04	VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation	Yuansheng Ni et.al.	2506.03930	null
2025-06-04	Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample	Ze Feng et.al.	2506.03928	null
2025-06-04	More or Less Wrong: A Benchmark for Directional Bias in LLM Comparative Reasoning	Mohammadamin Shafiei et.al.	2506.03923	null
2025-06-04	HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models	Zhaolu Kang et.al.	2506.03922	link
2025-06-05	Magic Mushroom: A Customizable Benchmark for Fine-grained Analysis of Retrieval Noise Erosion in RAG Systems	Yuxin Zhang et.al.	2506.03901	null
2025-06-04	RadialRouter: Structured Representation for Efficient and Robust Large Language Models Routing	Ruihan Jin et.al.	2506.03880	null
2025-06-04	Evaluating Apple Intelligence’s Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets	Mohd. Farhan Israk Soumik et.al.	2506.03870	null
2025-06-04	EuroGEST: Investigating gender stereotypes in multilingual language models	Jacqueline Rowe et.al.	2506.03867	null
2025-06-04	PulseReddit: A Novel Reddit Dataset for Benchmarking MAS in High-Frequency Cryptocurrency Trading	Qiuhan Han et.al.	2506.03861	null
2025-06-04	Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation	Mingxuan Xia et.al.	2506.03857	null
2025-06-04	Algorithm- and Data-Dependent Generalization Bounds for Score-Based Generative Models	Benjamin Dupuis et.al.	2506.03849	null
2025-06-04	Enhancing Safety of Foundation Models for Visual Navigation through Collision Avoidance via Repulsive Estimation	Joonkyung Kim et.al.	2506.03834	null
2025-06-04	AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance	Dhaval Patel et.al.	2506.03828	link
2025-06-04	Multi-objective Aligned Bidword Generation Model for E-commerce Search Advertising	Zhenhui Liu et.al.	2506.03827	null
2025-06-04	From Theory to Practice: Real-World Use Cases on Trustworthy LLM-Driven Process Modeling, Prediction and Automation	Peter Pfeiffer et.al.	2506.03801	null
2025-06-04	STELLA: Towards Protein Function Prediction with Multimodal LLMs Integrating Sequence-Structure Representations	Hongwang Xiao et.al.	2506.03800	null
2025-06-04	Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts	Sidharth Pulipaka et.al.	2506.03793	null
2025-06-05	Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons	Isik Baran Sandan et.al.	2506.03785	null
2025-06-04	Unifying Uniform and Binary-coding Quantization for Accurate Compression of Large Language Models	Seungcheol Park et.al.	2506.03781	null
2025-06-04	ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations	Quang Hieu Pham et.al.	2506.03763	null
2025-06-04	AhaKV: Adaptive Holistic Attention-Driven KV Cache Eviction for Efficient Inference of Large Language Models	Yifeng Gu et.al.	2506.03762	null
2025-06-04	Act-as-Pet: Benchmarking the Abilities of Large Language Models as E-Pets in Social Network Services	Hongcheng Guo et.al.	2506.03761	null
2025-06-04	Understanding Physical Properties of Unseen Deformable Objects by Leveraging Large Language Models and Robot Actions	Changmin Park et.al.	2506.03760	null
2025-06-04	Frame-Level Real-Time Assessment of Stroke Rehabilitation Exercises from Video-Level Labeled Data: Task-Specific vs. Foundation Models	Gonçalo Mesquita et.al.	2506.03752	null
2025-06-04	Spatiotemporal Prediction of Electric Vehicle Charging Load Based on Large Language Models	Hang Fan et.al.	2506.03728	null
2025-06-04	Sign-SGD is the Golden Gate between Multi-Node to Single-Node Learning: Significant Boost via Parameter-Free Optimization	Daniil Medyakov et.al.	2506.03725	null
2025-06-04	Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision	Chaeyun Jang et.al.	2506.03723	null
2025-06-04	On the Closed-Form of Flow Matching: Generalization Does Not Arise from Target Stochasticity	Quentin Bertrand et.al.	2506.03719	null
2025-06-04	AetherVision-Bench: An Open-Vocabulary RGB-Infrared Benchmark for Multi-Angle Segmentation across Aerial and Ground Perspectives	Aniruddh Sikdar et.al.	2506.03709	null
2025-06-04	ScoreRAG: A Retrieval-Augmented Generation Framework with Consistency-Relevance Scoring and Structured Summarization for News Generation	Pei-Yun Lin et.al.	2506.03704	null
2025-06-04	Learning-at-Criticality in Large Language Models for Quantum Field Theory and Beyond	Xiansheng Cai et.al.	2506.03703	null
2025-06-04	AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism	Zhepei Wei et.al.	2506.03700	link
2025-06-04	Scaling Transformers for Discriminative Recommendation via Generative Pretraining	Chunqi Wang et.al.	2506.03699	null
2025-06-04	Robust Preference Optimization via Dynamic Target Margins	Jie Sun et.al.	2506.03690	null
2025-06-04	Out-of-Distribution Graph Models Merging	Yidi Wang et.al.	2506.03674	null
2025-06-04	Trustworthy Medical Question Answering: An Evaluation-Centric Survey	Yinuo Wang et.al.	2506.03659	null
2025-06-04	Client-Side Zero-Shot LLM Inference for Comprehensive In-Browser URL Analysis	Avihay Cohen et.al.	2506.03656	null
2025-06-04	Facts are Harder Than Opinions – A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability	Lorraine Saju et.al.	2506.03655	null
2025-06-04	RewardAnything: Generalizable Principle-Following Reward Models	Zhuohao Yu et.al.	2506.03637	null
2025-06-04	Robustness of Prompting: Enhancing Robustness of Large Language Models Against Prompting Attacks	Lin Mu et.al.	2506.03627	null
2025-06-04	Do Large Language Models Know Folktales? A Case Study of Yokai in Japanese Folktales	Ayuto Tsutsumi et.al.	2506.03619	null
2025-06-04	Learning to Insert [PAUSE] Tokens for Better Reasoning	Eunki Kim et.al.	2506.03616	null
2025-06-04	Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games	Dongmin Park et.al.	2506.03610	null
2025-06-04	Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision	Tomoya Yoshida et.al.	2506.03605	null
2025-06-04	Auto prompt sql: a resource-efficient architecture for text-to-sql translation in constrained environments	Zetong Tang et.al.	2506.03598	null
2025-06-04	Resolving Task Objective Conflicts in Unified Multimodal Understanding and Generation via Task-Aware Mixture-of-Experts	Jiaxing Zhang et.al.	2506.03591	null
2025-06-04	Preface to the Special Issue of the TAL Journal on Scholarly Document Processing	Florian Boudin et.al.	2506.03587	null
2025-06-04	Improving LLM-Based Fault Localization with External Memory and Project Context	Inseok Yeo et.al.	2506.03585	null
2025-06-04	Exchange of Perspective Prompting Enhances Reasoning in Large Language Models	Lin Sun et.al.	2506.03573	null
2025-06-04	FreePRM: Training Process Reward Models Without Ground Truth Process Labels	Lin Sun et.al.	2506.03570	null
2025-06-04	POSS: Position Specialist Generates Better Draft for Speculative Decoding	Langlin Huang et.al.	2506.03566	link
2025-06-04	ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch	Jiawei Chen et.al.	2506.03558	null
2025-06-04	BPO: Revisiting Preference Modeling in Direct Preference Optimization	Lin Sun et.al.	2506.03557	null
2025-06-04	From Virtual Agents to Robot Teams: A Multi-Robot Framework Evaluation in High-Stakes Healthcare Context	Yuanchen Bai et.al.	2506.03546	null
2025-06-03	Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM	Pralaypati Ta et.al.	2506.03145	null
2025-06-03	Not All Tokens Are Meant to Be Forgotten	Xiangyu Zhou et.al.	2506.03142	null
2025-06-03	SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation	Siqi Chen et.al.	2506.03139	null
2025-06-03	Native-Resolution Image Synthesis	Zidong Wang et.al.	2506.03131	null
2025-06-03	AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation	Lu Qiu et.al.	2506.03126	null
2025-06-03	AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation	Prashanth Vijayaraghavan et.al.	2506.03122	null
2025-06-03	Targeted Forgetting of Image Subgroups in CLIP Models	Zeliang Zhang et.al.	2506.03117	null
2025-06-03	Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback	Xiaoying Zhang et.al.	2506.03106	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	SG2VID: Scene Graphs Enable Fine-Grained Control for Video Synthesis	Ssharvien Kumar Sivakumar et.al.	2506.03082	null
2025-06-03	ORV: 4D Occupancy-centric Robot Video Generation	Xiuyu Yang et.al.	2506.03079	link
2025-06-03	EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models	Mingzhe Li et.al.	2506.03067	null
2025-06-03	Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models	Ram Potham et.al.	2506.03056	null
2025-06-03	Facts Do Care About Your Language: Assessing Answer Quality of Multilingual LLMs	Yuval Kansal et.al.	2506.03051	null
2025-06-03	Sample complexity of Schrödinger potential estimation	Nikita Puchkin et.al.	2506.03043	null
2025-06-03	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2506.03038	null
2025-06-03	Leveraging Information Retrieval to Enhance Spoken Language Understanding Prompts in Few-Shot Learning	Pierre Lepagnol et.al.	2506.03035	null
2025-06-03	TestAgent: An Adaptive and Intelligent Expert for Human Assessment	Junhao Yu et.al.	2506.03032	null
2025-06-03	GenFair: Systematic Test Generation for Fairness Fault Detection in Large Language Models	Madhusudan Srinivasan et.al.	2506.03024	null
2025-06-03	Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech	Florian Ludwig et.al.	2506.03009	null
2025-06-03	DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models	Jiarui Wang et.al.	2506.03007	null
2025-06-03	A Preference-Driven Methodology for High-Quality Solidity Code Generation	Zhiyuan Peng et.al.	2506.03006	null
2025-06-03	Linear Spatial World Models Emerge in Large Language Models	Matthieu Tehenan et.al.	2506.02996	null
2025-06-03	It’s Not a Walk in the Park! Challenges of Idiom Translation in Speech-to-text Systems	Iuliia Zaitova et.al.	2506.02995	null
2025-06-03	Mitigating Manipulation and Enhancing Persuasion: A Reflective Multi-Agent Approach for Legal Argument Generation	Li Zhang et.al.	2506.02992	null
2025-06-03	Performance of leading large language models in May 2025 in Membership of the Royal College of General Practitioners-style examination questions: a cross-sectional analysis	Richard Armitage et.al.	2506.02987	null
2025-06-03	Astrophotography turbulence mitigation via generative models	Joonyeoup Kim et.al.	2506.02981	null
2025-06-03	On the Robustness of Tabular Foundation Models: Test-Time Attacks and In-Context Defenses	Mohamed Djilani et.al.	2506.02978	null
2025-06-03	Expanding before Inferring: Enhancing Factuality in Large Language Models through Premature Layers Interpolation	Dingwei Chen et.al.	2506.02973	null
2025-06-04	PC-MoE: Memory-Efficient and Privacy-Preserving Collaborative Training for Mixture-of-Experts LLMs	Ze Yu Zhang et.al.	2506.02965	null
2025-06-03	FORLA: Federated Object-centric Representation Learning with Slot Attention	Guiqiu Liao et.al.	2506.02964	null
2025-06-03	FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models	Yan Gao et.al.	2506.02961	null
2025-06-03	HACo-Det: A Study Towards Fine-Grained Machine-Generated Text Detection under Human-AI Coauthoring	Zhixiong Su et.al.	2506.02959	null
2025-06-03	UniConFlow: A Unified Constrained Generalization Framework for Certified Motion Planning with Flow Matching Models	Zewen Yang et.al.	2506.02955	null
2025-06-03	Towards More Effective Fault Detection in LLM-Based Unit Test Generation	Guancheng Wang et.al.	2506.02954	null
2025-06-03	Adaptive Graph Pruning for Multi-Agent Communication	Boyi Li et.al.	2506.02951	null
2025-06-03	Quantitative LLM Judges	Aishwarya Sahoo et.al.	2506.02945	null
2025-06-03	A Multi-agent LLM-based JUnit Test Generation with Strong Oracles	Qinghua Xu et.al.	2506.02943	null
2025-06-03	Memory-Efficient Split Federated Learning for LLM Fine-Tuning on Heterogeneous Mobile Devices	Xiaopei Chen et.al.	2506.02940	null
2025-06-03	Elasticity of substitution and general model of economic growth	Constantin Chilarescu et.al.	2506.02936	null
2025-06-03	Large Processor Chip Model	Kaiyan Chang et.al.	2506.02929	null
2025-06-03	INESC-ID @ eRisk 2025: Exploring Fine-Tuned, Similarity-Based, and Prompt-Based Approaches to Depression Symptom Identification	Diogo A. P. Nunes et.al.	2506.02924	null
2025-06-03	Sample, Predict, then Proceed: Self-Verification Sampling for Tool Use of LLMs	Shangmin Guo et.al.	2506.02918	null
2025-06-03	Towards Auto-Annotation from Annotation Guidelines: A Benchmark through 3D LiDAR Detection	Yechi Ma et.al.	2506.02914	null
2025-06-03	Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning	Yin Fang et.al.	2506.02911	link
2025-06-03	Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency	Bunlong Lay et.al.	2506.02908	null
2025-06-03	Scaling Fine-Grained MoE Beyond 50B Parameters: Empirical Evaluation and Practical Insights	Jakub Krajewski et.al.	2506.02890	null
2025-06-03	CoT is Not True Reasoning, It Is Just a Tight Constraint to Imitate: A Theory Perspective	Jintian Shao et.al.	2506.02878	null
2025-06-03	It’s the Thought that Counts: Evaluating the Attempts of Frontier LLMs to Persuade on Harmful Topics	Matthew Kowal et.al.	2506.02873	null
2025-06-03	Token and Span Classification for Entity Recognition in French Historical Encyclopedias	Ludovic Moncla et.al.	2506.02872	null
2025-06-03	Pan-Arctic Permafrost Landform and Human-built Infrastructure Feature Detection with Vision Transformers and Location Embeddings	Amal S. Perera et.al.	2506.02868	null
2025-06-03	BNPO: Beta Normalization Policy Optimization	Changyi Xiao et.al.	2506.02864	null
2025-06-03	Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs	Wenjing Tang et.al.	2506.02860	null
2025-06-03	ATAG: AI-Agent Application Threat Assessment with Attack Graphs	Parth Atulbhai Gandhi et.al.	2506.02859	null
2025-06-03	Enhancing Abnormality Identification: Robust Out-of-Distribution Strategies for Deepfake Detection	Luca Maiano et.al.	2506.02857	null
2025-06-03	METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding	Mengyue Wang et.al.	2506.02850	link
2025-06-03	CLONE: Customizing LLMs for Efficient Latency-Aware Inference at the Edge	Chunlin Tian et.al.	2506.02847	null
2025-06-03	TaxAgent: How Large Language Model Designs Fiscal Policy	Jizhou Wang et.al.	2506.02838	null
2025-06-03	High-speed control and navigation for quadrupedal robots on complex and discrete terrain	Hyeongjun Kim et.al.	2506.02835	null
2025-06-03	TO-GATE: Clarifying Questions and Summarizing Responses with Trajectory Optimization for Eliciting Human Preference	Yulin Dou et.al.	2506.02827	null
2025-06-03	ProcrustesGPT: Compressing LLMs with Structured Matrices and Orthogonal Transformations	Ekaterina Grishina et.al.	2506.02818	null
2025-06-03	CART-based Synthetic Tabular Data Generation for Imbalanced Regression	António Pedro Pinheiro et.al.	2506.02811	null
2025-06-03	Rethinking the effects of data contamination in Code Intelligence	Zhen Yang et.al.	2506.02791	null
2025-06-03	Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization	Ruilong Wu et.al.	2506.02787	null
2025-06-03	Reuse or Generate? Accelerating Code Editing via Edit-Oriented Speculative Decoding	Peiding Wang et.al.	2506.02780	null
2025-06-03	Rethinking Machine Unlearning in Image Generation Models	Renyang Liu et.al.	2506.02761	link
2025-06-03	Exploiting the English Vocabulary Profile for L2 word-level vocabulary assessment with LLMs	Stefano Bannò et.al.	2506.02758	null
2025-06-03	Enriching Location Representation with Detailed Semantic Information	Junyuan Liu et.al.	2506.02744	null
2025-06-03	Why do AI agents communicate in human language?	Pengcheng Zhou et.al.	2506.02739	null
2025-06-03	RACE-Align: Retrieval-Augmented and Chain-of-Thought Enhanced Preference Alignment for Large Language Models	Qihang Yan et.al.	2506.02726	null
2025-06-03	Benchmarking and Advancing Large Language Models for Local Life Services	Xiaochong Lan et.al.	2506.02720	null
2025-06-03	Expansion-contraction duality breaking in a Planck-scale sensitive cosmological quantum simulator	S. Mahesh Chandran et.al.	2506.02719	null
2025-06-03	Heterogeneous Group-Based Reinforcement Learning for LLM-based Multi-Agent Systems	Guanzhong Chen et.al.	2506.02718	null
2025-06-03	Open-Set Living Need Prediction with Large Language Models	Xiaochong Lan et.al.	2506.02713	null
2025-06-03	Smoothed Preference Optimization via ReNoise Inversion for Aligning Diffusion Models with Varied Human Preferences	Yunhong Lu et.al.	2506.02698	null
2025-06-03	Shaking to Reveal: Perturbation-Based Detection of LLM Hallucinations	Jinyuan Luo et.al.	2506.02696	null
2025-06-03	Large-scale Self-supervised Video Foundation Model for Intelligent Surgery	Shu Yang et.al.	2506.02692	null
2025-06-04	MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching	Liang Yue et.al.	2506.02689	null
2025-06-03	Decompose, Plan in Parallel, and Merge: A Novel Paradigm for Large Language Models based Planning with Multiple Constraints	Zhengdong Lu et.al.	2506.02683	null
2025-06-03	Solving Inverse Problems with FLAIR	Julius Erbach et.al.	2506.02680	null
2025-06-03	TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression	Zhong-Zhi Li et.al.	2506.02678	link
2025-06-03	EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving	Shihan Dou et.al.	2506.02672	null
2025-06-03	Are Economists Always More Introverted? Analyzing Consistency in Persona-Assigned LLMs	Manon Reusens et.al.	2506.02659	null
2025-06-04	Computational Thinking Reasoning in Large Language Models	Kechi Zhang et.al.	2506.02658	null
2025-06-03	From Prompts to Protection: Large Language Model-Enabled In-Context Learning for Smart Public Safety UAV	Yousef Emami et.al.	2506.02649	null
2025-06-03	Truly Assessing Fluid Intelligence of Large Language Models through Dynamic Reasoning Evaluation	Yue Yang et.al.	2506.02648	null
2025-06-03	KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider	Jiahao Wang et.al.	2506.02634	link
2025-06-03	HAM: A Hyperbolic Step to Regulate Implicit Bias	Tom Jacobs et.al.	2506.02630	null
2025-06-03	Hyperspectral Image Generation with Unmixing Guided Diffusion Model	Shiyu Shen et.al.	2506.02601	null
2025-06-03	EssayBench: Evaluating Large Language Models in Multi-Genre Chinese Essay Writing	Fan Gao et.al.	2506.02596	null
2025-06-03	EALG: Evolutionary Adversarial Generation of Language Model-Guided Generators for Combinatorial Optimization	Ruibo Duan et.al.	2506.02594	null
2025-06-03	Beyond the Surface: Measuring Self-Preference in LLM Judgments	Zhi-Yuan Chen et.al.	2506.02592	null
2025-06-03	On Generalization across Measurement Systems: LLMs Entail More Test-Time Compute for Underrepresented Cultures	Minh Duc Bui et.al.	2506.02591	null
2025-06-03	Evaluating Named Entity Recognition Models for Russian Cultural News Texts: From BERT to LLM	Maria Levchenko et.al.	2506.02589	null
2025-06-03	IndoSafety: Culturally Grounded Safety for LLMs in Indonesian Languages	Muhammad Falensi Azmi et.al.	2506.02573	null
2025-06-03	HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference	Ping Gong et.al.	2506.02572	link
2025-06-03	MLaGA: Multimodal Large Language and Graph Assistant	Dongzhe Fan et.al.	2506.02568	null
2025-06-03	Pruning General Large Language Models into Customized Expert Models	Yirao Zhao et.al.	2506.02561	null
2025-06-03	Kernel-based Unsupervised Embedding Alignment for Enhanced Visual Representation in Vision-language Models	Shizhan Gong et.al.	2506.02557	null
2025-06-03	SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence	Zhitao Zeng et.al.	2506.02555	null
2025-05-30	ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL	Yu Zhang et.al.	2505.24875	null
2025-05-30	The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models	Adam Stein et.al.	2505.24874	link
2025-05-30	MiniMax-Remover: Taming Bad Noise Helps Video Object Removal	Bojia Zi et.al.	2505.24873	null
2025-05-30	MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning	Yiqing Liang et.al.	2505.24871	null
2025-05-30	GenSpace: Benchmarking Spatially-Aware Image Generation	Zehan Wang et.al.	2505.24870	null
2025-05-30	SiLVR: A Simple Language-based Video Reasoning Framework	Ce Zhang et.al.	2505.24869	link
2025-05-30	TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection	Xinqi Xiong et.al.	2505.24866	null
2025-05-30	ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models	Mingjie Liu et.al.	2505.24864	link
2025-05-30	ViStoryBench: Comprehensive Benchmark Suite for Story Visualization	Cailin Zhuang et.al.	2505.24862	link
2025-05-30	MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning	Jingyan Shen et.al.	2505.24846	null
2025-05-30	Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning	Wanyun Xie et.al.	2505.24844	link
2025-05-30	Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck	Yuwen Tan et.al.	2505.24840	null
2025-05-30	VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software	Brandon Man et.al.	2505.24838	link
2025-05-30	Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs	Juraj Vladika et.al.	2505.24830	null
2025-05-30	LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text	Li Yunhan et.al.	2505.24826	link
2025-05-30	PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models	Yinggan Xu et.al.	2505.24823	null
2025-05-30	Bi-Manual Joint Camera Calibration and Scene Representation	Haozhan Tang et.al.	2505.24819	null
2025-06-02	Guiding Generative Storytelling with Knowledge Graphs	Zhijun Pan et.al.	2505.24803	null
2025-05-30	Inference Acceleration of Autoregressive Normalizing Flows by Selective Jacobi Decoding	Jiaru Zhang et.al.	2505.24791	null
2025-05-30	Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation	Yucheng Zhou et.al.	2505.24787	link
2025-05-30	AXIOM: Learning to Play Games in Minutes with Expanding Object-Centric Models	Conor Heins et.al.	2505.24784	null
2025-06-03	EVA-MILP: Towards Standardized Evaluation of MILP Instance Generation	Yidong Luo et.al.	2505.24779	link
2025-05-30	Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models’ Uncertainty?	Jiayu Liu et.al.	2505.24778	link
2025-05-30	Diffusion-Based Symbolic Regression	Zachary Bastiani et.al.	2505.24776	null
2025-05-30	AFLoRA: Adaptive Federated Fine-Tuning of Large Language Models with Resource-Aware Low-Rank Adaption	Yajie Zhou et.al.	2505.24773	null
2025-05-30	Generalization Dynamics of Linear Diffusion Models	Claudia Merger et.al.	2505.24769	null
2025-05-30	From Macro to Micro: Probing Dataset Diversity in Language Model Fine-Tuning	Haoyu Li et.al.	2505.24768	null
2025-05-30	A survey of using EHR as real-world evidence for discovering and validating new drug indications	Nabasmita Talukdar et.al.	2505.24767	null
2025-05-30	LGAR: Zero-Shot LLM-Guided Neural Ranking for Abstract Screening in Systematic Literature Reviews	Christian Jaumann et.al.	2505.24757	link
2025-05-30	SUMO: Subspace-Aware Moment-Orthogonalization for Accelerating Memory-Efficient LLM Training	Yehonathan Refael et.al.	2505.24749	null
2025-05-30	DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds	Jiaxu Zhang et.al.	2505.24733	null
2025-05-30	Circuit Stability Characterizes Language Model Generalization	Alan Sun et.al.	2505.24731	link
2025-05-30	Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning	Shelly Bensal et.al.	2505.24726	null
2025-05-30	HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts	Neil He et.al.	2505.24722	link
2025-06-03	Reinforcing Video Reasoning with Focused Thinking	Jisheng Dang et.al.	2505.24718	link
2025-05-30	PDE-Transformer: Efficient and Versatile Transformers for Physics Simulations	Benjamin Holzschuh et.al.	2505.24717	link
2025-05-30	Towards Scalable Schema Mapping using Large Language Models	Christopher Buss et.al.	2505.24716	null
2025-05-30	FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation	Junyu Luo et.al.	2505.24714	link
2025-05-30	HESEIA: A community-based dataset for evaluating social biases in large language models, co-designed in real school settings in Latin America	Guido Ivetta et.al.	2505.24712	null
2025-05-30	Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting	Wei Chen et.al.	2505.24710	link
2025-05-30	Multi-Domain ABSA Conversation Dataset Generation via LLMs for Real-World Evaluation and Model Comparison	Tejul Pandit et.al.	2505.24701	null
2025-05-30	Conformal Prediction for Zero-Shot Models	Julio Silva-Rodríguez et.al.	2505.24693	link
2025-05-30	BPE Stays on SCRIPT: Structured Encoding for Robust Multilingual Pretokenization	Sander Land et.al.	2505.24689	link
2025-05-30	Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration	Qinglin Zhu et.al.	2505.24688	null
2025-05-30	A Simple Linear Patch Revives Layer-Pruned Large Language Models	Xinrui Chen et.al.	2505.24680	null
2025-05-30	TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis	Xiaorui Wu et.al.	2505.24672	link
2025-05-30	Multiple LLM Agents Debate for Equitable Cultural Alignment	Dayeon Ki et.al.	2505.24671	link
2025-05-30	Can LLMs and humans be friends? Uncovering factors affecting human-AI intimacy formation	Yeseon Hong et.al.	2505.24658	null
2025-05-30	Adaptable Cardiovascular Disease Risk Prediction from Heterogeneous Data using Large Language Models	Frederike Lübeck et.al.	2505.24655	null
2025-05-30	Efficient Text Encoders for Labor Market Analysis	Jens-Joris Decorte et.al.	2505.24640	null
2025-05-30	Disentangling Language and Culture for Evaluating Multilingual Large Language Models	Jiahao Ying et.al.	2505.24635	null
2025-05-30	The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models	Junyi Li et.al.	2505.24630	link
2025-05-30	Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors	Duo Zheng et.al.	2505.24625	null
2025-05-30	Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success	Ben Griffin et.al.	2505.24622	null
2025-05-30	Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization	Utsav Maskey et.al.	2505.24621	null
2025-05-30	Eye of Judgement: Dissecting the Evaluation of Russian-speaking LLMs with POLLUX	Nikita Martynov et.al.	2505.24616	null
2025-05-30	Harnessing Large Language Models for Scientific Novelty Detection	Yan Liu et.al.	2505.24615	null
2025-05-30	Mixture-of-Experts for Personalized and Semantic-Aware Next Location Prediction	Shuai Liu et.al.	2505.24597	null
2025-05-30	A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement	Jie Zhang et.al.	2505.24576	null
2025-05-30	Bench4KE: Benchmarking Automated Competency Question Generation	Anna Sofia Lippolis et.al.	2505.24554	link
2025-05-30	CREFT: Sequential Multi-Agent LLM for Character Relation Extraction	Ye Eun Chun et.al.	2505.24553	null
2025-05-30	Cross-Attention Speculative Decoding	Wei Zhong et.al.	2505.24544	null
2025-05-30	Mixpert: Mitigating Multimodal Learning Conflicts with Efficient Mixture-of-Vision-Experts	Xin He et.al.	2505.24541	null
2025-06-03	Localizing Persona Representations in LLMs	Celia Cintas et.al.	2505.24539	null
2025-05-30	Don’t Erase, Inform! Detecting and Contextualizing Harmful Language in Cultural Heritage Collections	Orfeas Menis Mastromichalakis et.al.	2505.24538	link
2025-05-30	Beyond Linear Steering: Unified Multi-Attribute Control for Language Models	Narmeen Oozeer et.al.	2505.24535	null
2025-05-30	Transformers Are Universally Consistent	Sagar Ghosh et.al.	2505.24531	null
2025-05-30	Geospatial Foundation Models to Enable Progress on Sustainable Development Goals	Pedram Ghamisi et.al.	2505.24528	null
2025-05-30	Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors	Andrea Pedrotti et.al.	2505.24523	link
2025-05-30	UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation	Yang-Tian Sun et.al.	2505.24521	null
2025-05-30	un$^2$CLIP: Improving CLIP’s Visual Detail Capturing Ability via Inverting unCLIP	Yinqi Li et.al.	2505.24517	link
2025-05-30	TimeHC-RL: Temporal-aware Hierarchical Cognitive Reinforcement Learning for Enhancing LLMs’ Social Intelligence	Guiyang Hou et.al.	2505.24500	link
2025-05-30	Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation	Ximing Xing et.al.	2505.24499	null
2025-05-30	MELT: Towards Automated Multimodal Emotion Data Annotation by Leveraging LLM Embedded Knowledge	Xin Jing et.al.	2505.24493	link
2025-05-30	Object Centric Concept Bottlenecks	David Steinmann et.al.	2505.24492	null
2025-05-30	Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation	Sania Nayab et.al.	2505.24479	null
2025-05-30	Optimizing the Interface Between Knowledge Graphs and LLMs for Complex Reasoning	Vasilije Markovic et.al.	2505.24478	null
2025-05-30	Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model	Yuting Zhang et.al.	2505.24476	link
2025-05-30	SA-Person: Text-Based Person Retrieval with Scene-aware Re-ranking	Yingjia Xu et.al.	2505.24466	null
2025-05-30	SEAR: A Multimodal Dataset for Analyzing AR-LLM-Driven Social Engineering Behaviors	Tianlong Yu et.al.	2505.24458	null
2025-05-30	LPASS: Linear Probes as Stepping Stones for vulnerability detection using compressed LLMs	Luis Ibanez-Lissen et.al.	2505.24451	null
2025-05-30	Exploring the Impact of Occupational Personas on Domain-Specific QA	Eojin Kang et.al.	2505.24448	null
2025-05-30	Learning Safety Constraints for Large Language Models	Xin Chen et.al.	2505.24445	link
2025-05-30	RMoA: Optimizing Mixture-of-Agents through Diversity Maximization and Residual Compensation	Zhentao Xie et.al.	2505.24442	link
2025-05-30	SORCE: Small Object Retrieval in Complex Environments	Chunxu Liu et.al.	2505.24441	link
2025-05-30	Model Unlearning via Sparse Autoencoder Subspace Guided Projections	Xu Wang et.al.	2505.24428	null
2025-05-30	MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs	Zhiwei Liu et.al.	2505.24423	link
2025-05-30	LLMs Are Globally Multilingual Yet Locally Monolingual: Exploring Knowledge Transfer via Language and Thought Theory	Eojin Kang et.al.	2505.24409	null
2025-05-30	IRBridge: Solving Image Restoration Bridge with Pre-trained Generative Diffusion Models	Hanting Wang et.al.	2505.24406	link
2025-05-30	ClueAnchor: Clue-Anchored Knowledge Reasoning Exploration and Optimization for Retrieval-Augmented Generation	Hao Chen et.al.	2505.24388	link
2025-05-30	Breaking the Gold Standard: Extracting Forgotten Data under Exact Unlearning in Large Language Models	Xiaoyu Wu et.al.	2505.24379	null
2025-05-30	LLM Inference Enhanced by External Knowledge: A Survey	Yu-Hsuan Lin et.al.	2505.24377	link
2025-05-30	Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering	Md Intisar Chowdhury et.al.	2505.24371	null
2025-05-30	ReCalKV: Low-Rank KV Cache Compression via Head Reordering and Offline Calibration	Xianglong Yan et.al.	2505.24357	null
2025-05-30	Multilingual Gloss-free Sign Language Translation: Towards Building a Sign Language Foundation Model	Sihan Tan et.al.	2505.24355	link
2025-05-30	Unifying Language Agent Algorithms with Graph-based Orchestration Engine for Reproducible Agent Research	Qianqian Zhang et.al.	2505.24354	null
2025-05-30	Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings	Shujian Yang et.al.	2505.24341	link
2025-05-30	GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models	Gilles Quentin Hacheme et.al.	2505.24340	link
2025-05-30	Pangu DeepDiver: Adaptive Search Intensity Scaling via Open-Web Reinforcement Learning	Wenxuan Shi et.al.	2505.24332	null
2025-05-30	DisTime: Distribution-based Time Representation for Video Large Language Models	Yingsen Zeng et.al.	2505.24329	link
2025-05-29	Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought	Yunze Man et.al.	2505.23766	null
2025-05-29	From Chat Logs to Collective Insights: Aggregative Question Answering	Wentao Zhang et.al.	2505.23765	null
2025-05-29	MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence	Sihan Yang et.al.	2505.23764	null
2025-05-29	DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning	Ziyin Zhang et.al.	2505.23754	link
2025-05-29	ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks	Akashah Shabbir et.al.	2505.23752	link
2025-05-29	Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?	Paul Gölz et.al.	2505.23749	null
2025-05-29	Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence	Diankun Wu et.al.	2505.23747	null
2025-05-29	MAGREF: Masked Guidance for Any-Reference Video Generation	Yufan Deng et.al.	2505.23742	link
2025-05-29	Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time	Mohamad Chehade et.al.	2505.23729	null
2025-05-29	PixelThink: Towards Efficient Chain-of-Pixel Reasoning	Song Wang et.al.	2505.23727	null
2025-05-29	FMG-Det: Foundation Model Guided Robust Object Detection	Darryl Hannan et.al.	2505.23726	null
2025-05-29	MuLoCo: Muon is a practical inner optimizer for DiLoCo	Benjamin Thérien et.al.	2505.23725	null
2025-05-29	SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA	Minrui Luo et.al.	2505.23724	null
2025-05-29	ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering	Zexi Liu et.al.	2505.23723	link
2025-05-29	Label-Guided In-Context Learning for Named Entity Recognition	Fan Bai et.al.	2505.23722	link
2025-05-29	TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning	Andreas Auer et.al.	2505.23719	link
2025-05-29	Don’t Take the Premise for Granted: Evaluating the Premise Critique Ability of Large Language Models	Jinzhe Li et.al.	2505.23715	link
2025-05-29	SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models	Zixiang Xu et.al.	2505.23713	link
2025-05-29	Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation	Ziling Cheng et.al.	2505.23701	null
2025-05-29	Fortune: Formula-Driven Reinforcement Learning for Symbolic Table Reasoning in Language Models	Lang Cao et.al.	2505.23667	null
2025-05-29	LoLA: Low-Rank Linear Attention With Sparse Caching	Luke McDermott et.al.	2505.23666	null
2025-05-29	ToolHaystack: Stress-Testing Tool-Augmented Language Models in Realistic Long-Term Interactions	Beong-woo Kwak et.al.	2505.23662	link
2025-05-29	OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation	Size Wu et.al.	2505.23661	link
2025-05-29	D-AR: Diffusion via Autoregressive Models	Ziteng Gao et.al.	2505.23660	link
2025-05-29	Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation	Hongxiang Zhang et.al.	2505.23657	null
2025-05-29	VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	Xiangdong Zhang et.al.	2505.23656	link
2025-05-29	ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs	Mohamed Elaraby et.al.	2505.23654	null
2025-05-29	How does Transformer Learn Implicit Reasoning?	Jiaran Ye et.al.	2505.23653	link
2025-05-29	Optimization-Free Diffusion Model – A Perturbation Theory Approach	Yuehaw Khoo et.al.	2505.23652	null
2025-05-29	MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment	John Halloran et.al.	2505.23634	null
2025-05-29	AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora	Jiaxin Bai et.al.	2505.23628	link
2025-05-29	ZeroSep: Separate Anything in Audio with Zero Training	Chao Huang et.al.	2505.23625	null
2025-05-29	Few-Shot Speech Deepfake Detection Adaptation with Gaussian Processes	Neta Glazer et.al.	2505.23619	link
2025-05-29	Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model	Qingyu Shi et.al.	2505.23606	link
2025-05-29	A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis	Shengyuan Liu et.al.	2505.23601	null
2025-05-29	LLM Performance for Code Generation on Noisy Tasks	Radzim Sendyka et.al.	2505.23598	link
2025-05-29	MAPLE: A Mobile Assistant with Persistent Finite State Machines for Recovery Reasoning	Linqiang Guo et.al.	2505.23596	null
2025-05-29	Jigsaw-R1: A Study of Rule-based Visual Reinforcement Learning with Jigsaw Puzzles	Zifu Wang et.al.	2505.23590	link
2025-05-29	On-Policy RL with Optimal Reward Baseline	Yaru Hao et.al.	2505.23585	link
2025-05-29	BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model	Adibvafa Fallahpour et.al.	2505.23579	link
2025-05-29	Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms	Jane Cleland-Huang et.al.	2505.23576	null
2025-05-29	Evaluating AI capabilities in detecting conspiracy theories on YouTube	Leonardo La Rocca et.al.	2505.23570	link
2025-05-29	Maximum Likelihood Learning of Latent Dynamics Without Reconstruction	Samo Hromadka et.al.	2505.23569	null
2025-05-29	Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models	Yiran Guo et.al.	2505.23564	link
2025-05-29	Merge Hijacking: Backdoor Attacks to Model Merging of Large Language Models	Zenghui Yuan et.al.	2505.23561	null
2025-05-29	SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents	Kunlun Zhu et.al.	2505.23559	link
2025-05-29	Adaptive Federated LoRA in Heterogeneous Wireless Networks with Independent Sampling	Yanzhao Hou et.al.	2505.23555	null
2025-05-29	Sustainable Carbon-Aware and Water-Efficient LLM Scheduling in Geo-Distributed Cloud Datacenters	Hayden Moore et.al.	2505.23554	null
2025-05-29	LLM-based Property-based Test Generation for Guardrailing Cyber-Physical Systems	Khashayar Etemadi et.al.	2505.23549	null
2025-05-29	Translation in the Wild	Yuri Balashov et.al.	2505.23548	null
2025-05-29	Position Paper: Metadata Enrichment Model: Integrating Neural Networks and Semantic Knowledge Graphs for Cultural Heritage Applications	Jan Ignatowicz et.al.	2505.23543	null
2025-05-29	Probability-Consistent Preference Optimization for Enhanced LLM Reasoning	Yunqiao Yang et.al.	2505.23540	link
2025-05-29	Domain-Aware Tensor Network Structure Search	Giorgos Iacovides et.al.	2505.23537	null
2025-05-29	AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity	Yu Zhang et.al.	2505.23520	link
2025-05-29	DeepFilterGAN: A Full-band Real-time Speech Enhancement System with GAN-based Stochastic Regeneration	Sanberk Serbest et.al.	2505.23515	null
2025-05-29	VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning	Liyun Zhu et.al.	2505.23504	link
2025-05-29	Can Large Language Models Challenge CNNs in Medical Image Analysis?	Shibbir Ahmed et.al.	2505.23503	null
2025-05-29	Identity resolution of software metadata using Large Language Models	Eva Martín del Pico et.al.	2505.23500	null
2025-05-29	R2I-Bench: Benchmarking Reasoning-Driven Text-to-Image Generation	Kaijie Chen et.al.	2505.23493	null
2025-05-29	Autoformalization in the Era of Large Language Models: A Survey	Ke Weng et.al.	2505.23486	link
2025-05-29	VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation	Shi-Xue Zhang et.al.	2505.23484	link
2025-05-29	Revisiting Overthinking in Long Chain-of-Thought from the Perspective of Self-Doubt	Keqin Peng et.al.	2505.23480	null
2025-05-29	Evaluating the performance and fragility of large language models on the self-assessment for neurological surgeons	Krithik Vishwanath et.al.	2505.23477	null
2025-05-29	EVOREFUSE: Evolutionary Prompt Optimization for Evaluation and Mitigation of LLM Over-Refusal to Pseudo-Malicious Instructions	Xiaorui Wu et.al.	2505.23473	null
2025-05-29	Synthesizing Performance Constraints for Evaluating and Improving Code Efficiency	Jun Yang et.al.	2505.23471	null
2025-05-29	Diffusion Guidance Is a Controllable Policy Improvement Operator	Kevin Frans et.al.	2505.23458	link
2025-05-29	What About Emotions? Guiding Fine-Grained Emotion Extraction from Mobile App Reviews	Quim Motger et.al.	2505.23452	link
2025-05-30	CMIE: Combining MLLM Insights with External Evidence for Explainable Out-of-Context Misinformation Detection	Fanxiao Li et.al.	2505.23449	null
2025-05-29	Diversity-Aware Policy Optimization for Large Language Model Reasoning	Jian Yao et.al.	2505.23433	null
2025-05-29	SWE-bench Goes Live!	Linghao Zhang et.al.	2505.23419	link
2025-05-29	KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction	Jang-Hyun Kim et.al.	2505.23416	link
2025-05-29	Bidirectional predictive coding	Gaspard Oliviers et.al.	2505.23415	null
2025-05-29	From Parameters to Prompts: Understanding and Mitigating the Factuality Gap between Fine-Tuned LLMs	Xuan Gong et.al.	2505.23410	null
2025-05-29	A Practical Guide for Supporting Formative Assessment and Feedback Using Generative AI	Sapolnach Prompiengchai et.al.	2505.23405	null
2025-05-29	Adaptive Jailbreaking Strategies Based on the Semantic Understanding Capabilities of Large Language Models	Mingyu Yu et.al.	2505.23404	null
2025-05-29	Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation	Sanggyun Ma et.al.	2505.23400	null
2025-05-29	Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization	Mingzhe Du et.al.	2505.23387	null
2025-05-29	UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning	Weijia Mao et.al.	2505.23380	link
2025-05-29	Threading the Needle: Reweaving Chain-of-Thought Reasoning to Explain Human Label Variation	Beiduo Chen et.al.	2505.23368	link
2025-05-29	Discriminative Policy Optimization for Token-Level Reward Models	Hongzhan Chen et.al.	2505.23363	link
2025-05-29	VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?	Yuanxin Liu et.al.	2505.23359	link
2025-05-29	Representing local protein environments with atomistic foundation models	Meital Bojan et.al.	2505.23354	null
2025-05-29	Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems	Xu Shen et.al.	2505.23352	link
2025-05-29	Towards Reward Fairness in RLHF: From a Resource Allocation Perspective	Sheng Ouyang et.al.	2505.23349	link
2025-05-29	Fine-Tuning Next-Scale Visual Autoregressive Models with Group Relative Policy Optimization	Matteo Gallici et.al.	2505.23331	null
2025-05-29	Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis	Hengyuan Cao et.al.	2505.23325	null
2025-05-29	Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO	Kaiyang Guo et.al.	2505.23316	null
2025-05-29	TRACE: Trajectory-Constrained Concept Erasure in Diffusion Models	Finn Carter et.al.	2505.23312	null
2025-05-29	Towards LLM-based Generation of Human-Readable Proofs in Polynomial Formal Verification	Rolf Drechsler et.al.	2505.23311	null
2025-05-29	Score-based Generative Modeling for Conditional Independence Testing	Yixin Ren et.al.	2505.23309	link
2025-05-29	Data-efficient Meta-models for Evaluation of Context-based Questions and Answers in LLMs	Julia Belikova et.al.	2505.23299	null
2025-05-29	EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian	Daryna Dementieva et.al.	2505.23297	null
2025-05-29	How Does Response Length Affect Long-Form Factuality	James Xu Zhao et.al.	2505.23295	link
2025-05-29	Federated Unsupervised Semantic Segmentation	Evangelos Charalampakis et.al.	2505.23292	null
2025-05-29	GenCAD-Self-Repairing: Feasibility Enhancement for 3D CAD Generation	Chikaha Tsuji et.al.	2505.23287	null
2025-05-29	MathArena: Evaluating LLMs on Uncontaminated Math Competitions	Mislav Balunović et.al.	2505.23281	link
2025-05-29	Sentinel: Attention Probing of Proxy Models for LLM Context Compression with an Understanding Perspective	Yong Zhang et.al.	2505.23277	link
2025-05-29	The Arabic AI Fingerprint: Stylometric Analysis and Detection of Large Language Models Text	Maged S. Al-Shaibani et.al.	2505.23276	link
2025-05-29	Wireless Agentic AI with Retrieval-Augmented Multimodal Semantic Perception	Guangyuan Liu et.al.	2505.23275	null
2025-05-29	Does Machine Unlearning Truly Remove Model Knowledge? A Framework for Auditing Unlearning in LLMs	Haokun Chen et.al.	2505.23270	null
2025-05-28	Zero-Shot Vision Encoder Grafting via LLM Surrogates	Kaiyu Yue et.al.	2505.22664	link
2025-05-28	AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models	Feng Luo et.al.	2505.22662	null
2025-05-28	GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning	Qingchen Yu et.al.	2505.22661	null
2025-05-28	3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model	Wenbo Hu et.al.	2505.22657	null
2025-05-28	Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents	Michael Kirchhof et.al.	2505.22655	null
2025-05-28	The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason	Ang Lv et.al.	2505.22653	null
2025-05-28	Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese	Hanjia Lyu et.al.	2505.22645	link
2025-05-28	Understanding (Un)Reliability of Steering Vectors in Language Models	Joschka Braun et.al.	2505.22637	null
2025-05-28	Learning Composable Chains-of-Thought	Fangcong Yin et.al.	2505.22635	null
2025-05-28	Spatial Knowledge Graph-Guided Multimodal Synthesis	Yida Xue et.al.	2505.22633	null
2025-05-28	Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs	Ziling Cheng et.al.	2505.22630	null
2025-05-28	Principled Out-of-Distribution Generalization via Simplicity	Jiawei Ge et.al.	2505.22622	null
2025-05-28	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null
2025-05-28	RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction	Yuchi Wang et.al.	2505.22613	null
2025-05-28	Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates	Haoning Xu et.al.	2505.22608	null
2025-05-28	Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning	Erxin Yu et.al.	2505.22591	null
2025-05-28	Precise In-Parameter Concept Erasure in Large Language Models	Yoav Gur-Arieh et.al.	2505.22586	null
2025-05-28	Less, but Better: Efficient Multilingual Expansion for LLMs via Layer-wise Mixture-of-Experts	Xue Zhang et.al.	2505.22582	null
2025-05-28	Fusion Steering: Prompt-Specific Activation Control	Waldemar Chang et.al.	2505.22572	null
2025-05-29	Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems	Hoang Pham et.al.	2505.22571	null
2025-05-28	Universal Visuo-Tactile Video Understanding for Embodied Interaction	Yifan Xie et.al.	2505.22566	null
2025-05-28	Do Large Language Models Think Like the Brain? Sentence-Level Evidence from fMRI and Hierarchical Embeddings	Yu Lei et.al.	2505.22563	null
2025-05-28	ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM	Hoang Pham et.al.	2505.22552	null
2025-05-28	DES-LOC: Desynced Low Communication Adaptive Optimizers for Training Foundation Models	Alex Iacob et.al.	2505.22549	null
2025-05-28	TabularQGAN: A Quantum Generative Model for Tabular Data	Pallavi Bhardwaj et.al.	2505.22533	null
2025-05-28	Symplectic Generative Networks (SGNs): A Hamiltonian Framework for Invertible Deep Generative Modeling	Agnideep Aich et.al.	2505.22527	null
2025-05-28	PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models	Junwen Chen et.al.	2505.22523	null
2025-05-28	Multi-MLLM Knowledge Distillation for Out-of-Context News Detection	Yimeng Gu et.al.	2505.22517	null
2025-05-28	EvolveSearch: An Iterative Self-Evolving Search Agent	Dingchu Zhang et.al.	2505.22501	null
2025-05-28	ProSpero: Active Learning for Robust Protein Design Beyond Wild-Type Neighborhoods	Michal Kmicikiewicz et.al.	2505.22494	null
2025-05-28	Understanding Adversarial Training with Energy-based Models	Mujtaba Hussain Mirza et.al.	2505.22486	null
2025-05-29	Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems	Jiaxi Yang et.al.	2505.22467	null
2025-05-28	Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO	Lai Wei et.al.	2505.22453	link
2025-05-28	Position: All Current Generative Fidelity and Diversity Metrics are Flawed	Ossi Räisä et.al.	2505.22450	null
2025-05-28	Privacy-preserving Prompt Personalization in Federated Learning for Multimodal Large Language Models	Sizai Hou et.al.	2505.22447	null
2025-05-28	Does Johnny Get the Message? Evaluating Cybersecurity Notifications for Everyday Users	Victor Jüttner et.al.	2505.22435	null
2025-05-28	Scaling Reasoning without Attention	Xueliang Zhao et.al.	2505.22425	null
2025-05-28	Frugal Incremental Generative Modeling using Variational Autoencoders	Victor Enescu et.al.	2505.22408	null
2025-05-28	Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs	Xudong Li et.al.	2505.22396	null
2025-05-28	PacTure: Efficient PBR Texture Generation on Packed Views with Visual Autoregressive Models	Fan Fei et.al.	2505.22394	null
2025-05-28	Physics-Informed Distillation of Diffusion Models for PDE-Constrained Generation	Yi Zhang et.al.	2505.22391	null
2025-05-29	Pangu Embedded: An Efficient Dual-system LLM Reasoner with Metacognition	Hanting Chen et.al.	2505.22375	null
2025-05-28	AgentDNS: A Root Domain Naming System for LLM Agents	Enfang Cui et.al.	2505.22368	null
2025-05-28	Identity-Preserving Text-to-Image Generation via Dual-Level Feature Decoupling and Expert-Guided Fusion	Kewen Chen et.al.	2505.22360	null
2025-05-28	Budget-Adaptive Adapter Tuning in Orthogonal Subspaces for Continual Learning in LLMs	Zhiyi Wan et.al.	2505.22358	null
2025-05-28	ChatPD: An LLM-driven Paper-Dataset Networking System	Anjie Xu et.al.	2505.22349	null
2025-05-28	Task-Driven Implicit Representations for Automated Design of LiDAR Systems	Nikhil Behari et.al.	2505.22344	null
2025-05-28	Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start	Lai Wei et.al.	2505.22334	link
2025-05-28	NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment	Antonia Karamolegkou et.al.	2505.22327	null
2025-05-28	Advancing Expert Specialization for Better MoE	Hongcan Guo et.al.	2505.22323	null
2025-05-28	Chain-of-Thought for Large Language Model-empowered Wireless Communications	Xudong Wang et.al.	2505.22320	null
2025-05-28	If Pigs Could Fly… Can LLMs Logically Reason Through Counterfactuals?	Ishwar B Balappanawar et.al.	2505.22318	null
2025-05-29	Skywork Open Reasoner 1 Technical Report	Jujie He et.al.	2505.22312	link
2025-05-28	From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications	Feibo Jiang et.al.	2505.22311	null
2025-05-28	CADReview: Automatically Reviewing CAD Programs with Error Detection and Correction	Jiali Chen et.al.	2505.22304	null
2025-05-28	Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing	Yifan Lu et.al.	2505.22298	null
2025-05-28	Compensating for Data with Reasoning: Low-Resource Machine Translation with LLMs	Samuel Frontull et.al.	2505.22293	null
2025-05-28	Rethinking the Unsolvable: When In-Context Search Meets Test-Time Scaling	Fanzeng Xia et.al.	2505.22290	null
2025-05-28	New Tools are Needed for Tracking Adherence to AI Model Behavioral Use Clauses	Daniel McDuff et.al.	2505.22287	null
2025-05-28	Test-Time Immunization: A Universal Defense Framework Against Jailbreaks for (Multimodal) Large Language Models	Yongcan Yu et.al.	2505.22271	null
2025-05-28	Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition	Yuan Tseng et.al.	2505.22251	null
2025-05-28	BioHopR: A Benchmark for Multi-Hop, Multi-Answer Reasoning in Biomedical Domain	Yunsoo Kim et.al.	2505.22240	null
2025-05-28	Judging Quality Across Languages: A Multilingual Approach to Pretraining Data Filtering with Language Models	Mehdi Ali et.al.	2505.22232	link
2025-05-28	Look & Mark: Leveraging Radiologist Eye Fixations and Bounding boxes in Multimodal Large Language Models for Chest X-ray Report Generation	Yunsoo Kim et.al.	2505.22222	null
2025-05-28	A Survey on Training-free Open-Vocabulary Semantic Segmentation	Naomi Kombol et.al.	2505.22209	null
2025-05-28	Efficient Leave-one-out Approximation in LLM Multi-agent Debate Based on Introspection	Yue Cui et.al.	2505.22192	null
2025-05-29	Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design	Yudi Zhang et.al.	2505.22179	link
2025-05-28	Reverse Preference Optimization for Complex Instruction Following	Xiang Huang et.al.	2505.22172	null
2025-05-28	Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers	Weilun Feng et.al.	2505.22167	null
2025-05-28	InComeS: Integrating Compression and Selection Mechanisms into LLMs for Efficient Model Editing	Shuaiyi Li et.al.	2505.22156	null
2025-05-28	What Makes a Good Reasoning Chain? Uncovering Structural Patterns in Long Chain-of-Thought Reasoning	Gangwei Jiang et.al.	2505.22148	null
2025-05-28	Developing a Top-tier Framework in Naturalistic Conditions Challenge for Categorized Emotion Prediction: From Speech Foundation Models and Learning Objective to Data Augmentation and Engineering Choices	Tiantian Feng et.al.	2505.22133	link
2025-05-28	EULER: Enhancing the Reasoning Ability of Large Language Models through Error-Induced Learning	Zhuoyang Wu et.al.	2505.22131	null
2025-05-28	SridBench: Benchmark of Scientific Research Illustration Drawing of Image Generation Model	Yifan Chang et.al.	2505.22126	null
2025-05-28	LoKI: Low-damage Knowledge Implanting of Large Language Models	Runyu Wang et.al.	2505.22120	link
2025-05-28	THINK-Bench: Evaluating Thinking Efficiency and Chain-of-Thought Quality of Large Reasoning Models	Zhiyuan Li et.al.	2505.22113	null
2025-05-28	Visual Large Language Models Exhibit Human-Level Cognitive Flexibility in the Wisconsin Card Sorting Test	Guangfu Hao et.al.	2505.22112	null
2025-05-28	Curse of High Dimensionality Issue in Transformer for Long-context Modeling	Shuhai Zhang et.al.	2505.22107	link
2025-05-28	MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models	Zhiyu Li et.al.	2505.22101	null
2025-05-28	Knowledge Base Construction for Knowledge-Augmented Text-to-SQL	Jinheon Baek et.al.	2505.22096	null
2025-05-28	Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning	Chunyi Peng et.al.	2505.22095	null
2025-05-28	VIRAL: Vision-grounded Integration for Reward design And Learning	Valentin Cuzin-Rambaud et.al.	2505.22092	link
2025-05-28	ArgInstruct: Specialized Instruction Fine-Tuning for Computational Argumentation	Maja Stahl et.al.	2505.22076	null
2025-05-28	On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition	Shujie HU et.al.	2505.22072	null
2025-05-28	Beyond path selection: Better LLMs for Scientific Information Extraction with MimicSFT and Relevance and Rule-induced (R$^2$) GRPO	Ran Li et.al.	2505.22068	null
2025-05-28	Weakly Supervised Data Refinement and Flexible Sequence Compression for Efficient Thai LLM-based ASR	Mingchen Shao et.al.	2505.22063	null
2025-05-28	Safeguarding Privacy of Retrieval Data against Membership Inference Attacks: Is This Query Too Close to Home?	Yujin Choi et.al.	2505.22061	null
2025-05-28	Estimating the Effects of Sample Training Orders for Large Language Models without Retraining	Hao Yang et.al.	2505.22042	null
2025-05-28	Detecting Undesired Process Behavior by Means of Retrieval Augmented Generation	Michael Grohs et.al.	2505.22041	null
2025-05-28	Jailbreak Distillation: Renewable Safety Benchmarking	Jingyu Zhang et.al.	2505.22037	null
2025-05-28	CoThink: Token-Efficient Reasoning via Instruct Models Guiding Reasoning Models	Siqi Fan et.al.	2505.22017	null
2025-05-28	PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms	Yifei Xia et.al.	2505.22016	null
2025-05-28	VulBinLLM: LLM-powered Vulnerability Detection for Stripped Binaries	Nasir Hussain et.al.	2505.22010	null
2025-05-28	Efficiently Enhancing General Agents With Hierarchical-categorical Memory	Changze Qiao et.al.	2505.22006	null
2025-05-28	Legal Assist AI: Leveraging Transformer-Based Model for Effective Legal Assistance	Jatin Gupta et.al.	2505.22003	null
2025-05-28	Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate	Ashim Gupta et.al.	2505.21999	null
2025-05-28	Leveraging Interview-Informed LLMs to Model Survey Responses: Comparative Insights from AI-Generated and Human Data	Jihong Zhang et.al.	2505.21997	null
2025-05-28	Learning World Models for Interactive Video Generation	Taiye Chen et.al.	2505.21996	null
2025-05-28	ACE: Exploring Activation Cosine Similarity and Variance for Accurate and Calibration-Efficient LLM Pruning	Zhendong Mi et.al.	2505.21987	null
2025-05-28	Learning Compositional Behaviors from Demonstration and Language	Weiyu Liu et.al.	2505.21981	null
2025-05-27	Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making	Yihan Wang et.al.	2505.21503	null
2025-05-27	Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers	Wei Pang et.al.	2505.21497	link
2025-05-27	Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment	Xiaojun Jia et.al.	2505.21494	link
2025-05-27	Reinforcing General Reasoning without Verifiers	Xiangxin Zhou et.al.	2505.21493	link
2025-05-27	Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming	Yang Yang et.al.	2505.21486	null
2025-05-27	Are Language Models Consequentialist or Deontological Moral Reasoners?	Keenan Samway et.al.	2505.21479	null
2025-05-27	Policy Optimized Text-to-Image Pipeline Design	Uri Gadot et.al.	2505.21478	null
2025-05-27	Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration	Zijun Liu et.al.	2505.21471	link
2025-05-27	PropMolFlow: Property-guided Molecule Generation with Geometry-Complete Flow Matching	Cheng Zeng et.al.	2505.21469	null
2025-05-27	Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance	Shintaro Ozaki et.al.	2505.21458	null
2025-05-27	Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO	Muzhi Zhu et.al.	2505.21457	null
2025-05-27	Designing Cyclic Peptides via Harmonic SDE with Atom-Bond Modeling	Xiangxin Zhou et.al.	2505.21452	null
2025-05-27	Can Large Reasoning Models Self-Train?	Sheikh Shafayat et.al.	2505.21444	null
2025-05-27	Hume: Introducing System-2 Thinking in Visual-Language-Action Model	Haoming Song et.al.	2505.21432	null
2025-05-27	Policy Induction: Predicting Startup Success via Explainable Memory-Augmented In-Context Learning	Xianling Mu et.al.	2505.21427	null
2025-05-27	GUARD: Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation	Naizhu Jin et.al.	2505.21425	null
2025-05-27	Autonomous Multi-Modal LLM Agents for Treatment Planning in Focused Ultrasound Ablation Surgery	Lina Zhao et.al.	2505.21418	null
2025-05-27	RefTool: Enhancing Model Reasoning with Reference-Guided Tool Creation	Xiao Liu et.al.	2505.21413	null
2025-05-28	Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity	Yehui Tang et.al.	2505.21411	null
2025-05-27	RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models	Dario Satriani et.al.	2505.21409	null
2025-05-27	A Convergence Theory for Diffusion Language Models: An Information-Theoretic Perspective	Gen Li et.al.	2505.21400	null
2025-05-27	Factual Self-Awareness in Language Models: Representation, Robustness, and Scaling	Hovhannes Tamoyan et.al.	2505.21399	null
2025-05-27	DecisionFlow: Advancing Large Language Model as Principled Decision Maker	Xiusi Chen et.al.	2505.21397	null
2025-05-27	Improving Research Idea Generation Through Data: An Empirical Investigation in Social Science	Xiao Liu et.al.	2505.21396	null
2025-05-27	AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs	Xuanwen Ding et.al.	2505.21389	link
2025-05-27	DeCAF: Decentralized Consensus-And-Factorization for Low-Rank Adaptation of Foundation Models	Nastaran Saadati et.al.	2505.21382	null
2025-05-27	GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution	Fengxiang Wang et.al.	2505.21375	link
2025-05-27	Improving LLM-based Global Optimization with Search Space Partitioning	Andrej Schwanke et.al.	2505.21372	link
2025-05-27	When Experimental Economics Meets Large Language Models: Tactics with Evidence	Shu Wang et.al.	2505.21371	null
2025-05-27	Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders	James Oldfield et.al.	2505.21364	null
2025-05-27	Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History	Qishuai Zhong et.al.	2505.21362	link
2025-05-28	AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Crop Mapping	Wenyuan Li et.al.	2505.21357	link
2025-05-28	Towards Robust Automated Perceptual Voice Quality Assessment with Speech Foundation Models	Whenty Ariyanti et.al.	2505.21356	null
2025-05-27	Leveraging Large Language Models for Bengali Math Word Problem Solving with Chain of Thought Reasoning	Bidyarthi Paul et.al.	2505.21354	null
2025-05-27	Out of the Past: An AI-Enabled Pipeline for Traffic Simulation from Noisy, Multimodal Detector Data and Stakeholder Feedback	Rex Chen et.al.	2505.21349	null
2025-05-27	The Multilingual Divide and Its Impact on Global AI Safety	Aidan Peppin et.al.	2505.21344	null
2025-05-28	PEDANTIC: A Dataset for the Automatic Examination of Definiteness in Patent Claims	Valentin Knappich et.al.	2505.21342	null
2025-05-28	HoliTom: Holistic Token Merging for Fast Video Large Language Models	Kele Shao et.al.	2505.21334	link
2025-05-27	MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios	Yang Shi et.al.	2505.21333	null
2025-05-27	MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs	Jiakang Yuan et.al.	2505.21327	null
2025-05-27	Leveraging large language models and traditional machine learning ensembles for ADHD detection from narrative transcripts	Yuxin Zhu et.al.	2505.21324	null
2025-05-27	Assured Autonomy with Neuro-Symbolic Perception	R. Spencer Hallyburton et.al.	2505.21322	null
2025-05-27	Beyond Chemical QA: Evaluating LLM’s Chemical Reasoning with Modular Chemical Operations	Hao Li et.al.	2505.21318	null
2025-05-27	A Cross Modal Knowledge Distillation & Data Augmentation Recipe for Improving Transcriptomics Representations through Morphological Features	Ihab Bendidi et.al.	2505.21317	null
2025-05-27	Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead	Jesujoba O. Alabi et.al.	2505.21315	null
2025-05-27	Large Language Models Miss the Multi-Agent Mark	Emanuele La Malfa et.al.	2505.21298	null
2025-05-27	rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset	Yifei Liu et.al.	2505.21297	link
2025-05-27	Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework	Saman Marandi et.al.	2505.21291	null
2025-05-27	PACT: A Contract-Theoretic Framework for Pricing Agentic AI Services Powered by Large Language Models	Ya-Ting Yang et.al.	2505.21286	null
2025-05-27	RLJP: Legal Judgment Prediction via First-Order Logic Rule-enhanced with Large Language Models	Yue Zhang et.al.	2505.21281	null
2025-05-27	Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space	Yao Huang et.al.	2505.21277	link
2025-05-27	JavaSith: A Client-Side Framework for Analyzing Potentially Malicious Extensions in Browsers, VS Code, and NPM Packages	Avihay Cohen et.al.	2505.21263	null
2025-05-27	ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision	Dosung Lee et.al.	2505.21250	null
2025-05-27	Evaluation of LLMs in Medical Text Summarization: The Role of Vocabulary Adaptation in High OOV Settings	Gunjan Balde et.al.	2505.21242	null
2025-05-27	LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners	Yu He et.al.	2505.21239	null
2025-05-27	Unfolding A Few Structures for The Many: Memory-Efficient Compression of Conformer and Speech Foundation Models	Zhaoqing Li et.al.	2505.21237	null
2025-05-27	Pretrained LLMs Learn Multiple Types of Uncertainty	Roi Cohen et.al.	2505.21218	null
2025-05-27	Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM’s Instruction-Following Capabilities	Junyan Zhang et.al.	2505.21191	null
2025-05-27	Exploring the Latent Capacity of LLMs for One-Step Text Generation	Gleb Mezentsev et.al.	2505.21189	null
2025-05-27	PoisonSwarm: Universal Harmful Information Synthesis via Model Crowdsourcing	Yu Yan et.al.	2505.21184	null
2025-05-27	Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning	Mingyang Song et.al.	2505.21178	null
2025-05-27	SOLIDGEO: Measuring Multimodal Spatial Math Reasoning in Solid Geometry	Peijie Wang et.al.	2505.21177	null
2025-05-27	TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment	Zheng Li et.al.	2505.21172	null
2025-05-27	STEB: In Search of the Best Evaluation Approach for Synthetic Time Series	Michael Stenger et.al.	2505.21160	null
2025-05-27	Assessment of L2 Oral Proficiency using Speech Large Language Models	Rao Ma et.al.	2505.21148	null
2025-05-27	IKMo: Image-Keyframed Motion Generation with Trajectory-Pose Conditioned Motion Diffusion Model	Yang Zhao et.al.	2505.21146	null
2025-05-27	Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis	Tianyi Xu et.al.	2505.21138	null
2025-05-27	Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction	Mengjie Qian et.al.	2505.21137	null
2025-05-27	Named Entity Swapping for Metadata Anonymization in a Text Corpus	Jan Greve et.al.	2505.21128	null
2025-05-27	Creativity in LLM-based Multi-Agent Systems: A Survey	Yi-Cheng Lin et.al.	2505.21116	null
2025-05-27	Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA	Sergey Pletenev et.al.	2505.21115	null
2025-05-27	Simulating Ethics: Using LLM Debate Panels to Model Deliberation on Medical Dilemmas	Hazem Zohny et.al.	2505.21112	null
2025-05-27	A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction	Bogdan Bogachov et.al.	2505.21109	null
2025-05-27	Thinker: Learning to Think Fast and Slow	Stephen Chung et.al.	2505.21097	null
2025-05-27	BLUCK: A Benchmark Dataset for Bengali Linguistic Understanding and Cultural Knowledge	Daeen Kabir et.al.	2505.21092	null
2025-05-27	Position is Power: System Prompts as a Mechanism of Bias in Large Language Models (LLMs)	Anna Neumann et.al.	2505.21091	null
2025-05-27	LLMs Think, But Not In Your Flow: Reasoning-Level Personalization for Black-Box Large Language Models	Jieyong Kim et.al.	2505.21082	null
2025-05-27	Uni3D-MoE: Scalable Multimodal 3D Scene Understanding via Mixture of Experts	Yue Zhang et.al.	2505.21079	null
2025-05-27	Efficient Large Language Model Inference with Neural Block Linearization	Mete Erdogan et.al.	2505.21077	null
2025-05-27	DynamicVL: Benchmarking Multimodal Large Language Models for Dynamic City Understanding	Weihao Xuan et.al.	2505.21076	null
2025-05-27	Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation	Ekaterina Fadeeva et.al.	2505.21072	null
2025-05-27	CXXCrafter: An LLM-Based Agent for Automated C/C++ Open Source Software Building	Zhengmin Yu et.al.	2505.21069	null
2025-05-27	Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning	Xiao Hu et.al.	2505.21067	null
2025-05-27	Agent-Environment Alignment via Automated Interface Generation	Kaiming Liu et.al.	2505.21055	null
2025-05-27	SHE-LoRA: Selective Homomorphic Encryption for Federated Tuning with Heterogeneous LoRA	Jianmin Liu et.al.	2505.21051	null
2025-05-28	Advancing high-fidelity 3D and Texture Generation with 2.5D latents	Xin Yang et.al.	2505.21050	null
2025-05-27	Large Language Model-enhanced Reinforcement Learning for Low-Altitude Economy Networking	Lingyi Cai et.al.	2505.21045	null
2025-05-28	FCKT: Fine-Grained Cross-Task Knowledge Transfer with Semantic Contrastive Learning for Targeted Sentiment Analysis	Wei Chen et.al.	2505.21040	null
2025-05-27	RainFusion: Adaptive Video Generation Acceleration via Multi-Dimensional Visual Redundancy	Aiyue Chen et.al.	2505.21036	null
2025-05-27	LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms	Wenhu Li et.al.	2505.21034	link
2025-05-27	Def-DTS: Deductive Reasoning for Open-domain Dialogue Topic Segmentation	Seungmin Lee et.al.	2505.21033	null
2025-05-27	Uncertainty Unveiled: Can Exposure to More In-context Examples Mitigate Uncertainty for Large Language Models?	Yifei Wang et.al.	2505.21003	null
2025-05-27	Who Reasons in the Large Language Models?	Jie Shao et.al.	2505.20993	null
2025-05-27	LifeIR at the NTCIR-18 Lifelog-6 Task	Jiahan Chen et.al.	2505.20987	null
2025-05-27	Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution	Minghao Han et.al.	2505.20984	null
2025-05-27	Evaluating and Steering Modality Preferences in Multimodal Large Language Model	Yu Zhang et.al.	2505.20977	null
2025-05-27	Contrastive Learning on LLM Back Generation Treebank for Cross-domain Constituency Parsing	Peiming Guo et.al.	2505.20976	null
2025-05-28	Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement	Keheliya Gallaba et.al.	2505.20973	null
2025-05-27	Research Community Perspectives on “Intelligence” and Large Language Models	Bertram Højer et.al.	2505.20959	null
2025-05-27	IRCopilot: Automated Incident Response with Large Language Models	Xihuan Lin et.al.	2505.20945	null
2025-05-26	Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs	Hanting Chen et.al.	2505.20155	null
2025-05-26	UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models	Xueyan Zhang et.al.	2505.20154	null
2025-05-26	MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents	Ziming Wei et.al.	2505.20148	link
2025-05-26	FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities	Jin Wang et.al.	2505.20147	null
2025-05-26	StructEval: Benchmarking LLMs’ Capabilities to Generate Structural Outputs	Jialin Yang et.al.	2505.20139	null
2025-05-26	Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers	Zhengliang Shi et.al.	2505.20128	link
2025-05-26	Agentic AI Process Observability: Discovering Behavioral Variability	Fabiana Fournier et.al.	2505.20127	null
2025-05-26	Understanding Generalization in Diffusion Models via Probability Flow Distance	Huijie Zhang et.al.	2505.20123	null
2025-05-27	TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent	Dominik Meier et.al.	2505.20118	link
2025-05-26	Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi’s Zibaldone	Cristian Santini et.al.	2505.20113	null
2025-05-26	ResSVD: Residual Compensated SVD for Large Language Model Compression	Haolei Bai et.al.	2505.20112	null
2025-05-26	Proxy-Free GFlowNet	Ruishuo Chen et.al.	2505.20110	null
2025-05-26	Language-Agnostic Suicidal Risk Detection Using Large Language Models	June-Woo Kim et.al.	2505.20109	null
2025-05-26	Adaptive Deep Reasoning: Triggering Deep Thinking When Needed	Yunhao Wang et.al.	2505.20101	null
2025-05-26	AdaTP: Attention-Debiased Token Pruning for Video Large Language Models	Fengyuan Sun et.al.	2505.20100	null
2025-05-26	Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities	Chuangtao Ma et.al.	2505.20099	link
2025-05-26	S2LPP: Small-to-Large Prompt Prediction across LLMs	Liang Cheng et.al.	2505.20097	null
2025-05-26	Multi-Domain Explainability of Preferences	Nitay Calderon et.al.	2505.20088	null
2025-05-26	Inference-time Alignment in Continuous Space	Yige Yuan et.al.	2505.20081	link
2025-05-26	Incentivizing Reasoning from Weak Supervision	Yige Yuan et.al.	2505.20072	link
2025-05-26	On the Same Page: Dimensions of Perceived Shared Understanding in Human-AI Interaction	Qingyu Liang et.al.	2505.20068	null
2025-05-26	SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety	Geon-Hyeong Kim et.al.	2505.20065	null
2025-05-26	Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	Zheqi Lv et.al.	2505.20053	link
2025-05-26	Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks	Debargha Ganguly et.al.	2505.20047	null
2025-05-26	REARANK: Reasoning Re-ranking Agent via Reinforcement Learning	Le Zhang et.al.	2505.20046	link
2025-05-26	Uncertainty-Aware Attention Heads: Efficient Unsupervised Uncertainty Quantification for LLMs	Artem Vazhentsev et.al.	2505.20045	null
2025-05-26	ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving	Xueyi Liu et.al.	2505.20024	link
2025-05-26	Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking	Yihan Chen et.al.	2505.20023	null
2025-05-26	Ontology- and LLM-based Data Harmonization for Federated Learning in Healthcare	Natallia Kokash et.al.	2505.20020	null
2025-05-26	Does Rationale Quality Matter? Enhancing Mental Disorder Detection via Selective Reasoning Distillation	Hoyun Song et.al.	2505.20014	link
2025-05-26	WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback	Minda Hu et.al.	2505.20013	null
2025-05-26	TabPFN: One Model to Rule Them All?	Qiong Zhang et.al.	2505.20003	link
2025-05-26	NEXT: Multi-Grained Mixture of Experts via Text-Modulation for Multi-Modal Object Re-ID	Shihao Li et.al.	2505.20001	null
2025-05-26	Embracing Imperfection: Simulating Students with Diverse Cognitive Levels Using LLM-based Agents	Tao Wu et.al.	2505.19997	null
2025-05-26	Automatic Metadata Extraction for Text-to-SQL	Vladislav Shkapenyuk et.al.	2505.19988	null
2025-05-26	How Well Do Large Reasoning Models Translate? A Comprehensive Evaluation for Multi-Domain Machine Translation	Yongshi Ye et.al.	2505.19987	link
2025-05-26	Rethinking Probabilistic Circuit Parameter Learning	Anji Liu et.al.	2505.19982	null
2025-05-26	DFIR-Metric: A Benchmark Dataset for Evaluating Large Language Models in Digital Forensics and Incident Response	Bilel Cherif et.al.	2505.19973	null
2025-05-26	CP-Router: An Uncertainty-Aware Router Between LLM and LRM	Jiayuan Su et.al.	2505.19970	null
2025-05-26	Learning to Select In-Context Demonstration Preferred by Large Language Model	Zheng Zhang et.al.	2505.19966	null
2025-05-26	Adaptive Location Hierarchy Learning for Long-Tailed Mobility Prediction	Yu Wang et.al.	2505.19965	null
2025-05-26	The Limits of Preference Data for Post-Training	Eric Zhao et.al.	2505.19964	null
2025-05-26	MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models	Zhongzhan Huang et.al.	2505.19959	link
2025-05-26	DCG-SQL: Enhancing In-Context Learning for Text-to-SQL with Deep Contextual Schema Link Graph	Jihyung Lee et.al.	2505.19956	null
2025-05-26	An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning	Andrew Zamai et.al.	2505.19954	null
2025-05-26	Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19952	null
2025-05-26	Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions	Siqi Kou et.al.	2505.19949	null
2025-05-26	ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs	Pooneh Mousavi et.al.	2505.19937	null
2025-05-26	Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making	Yejin Son et.al.	2505.19933	null
2025-05-26	TCP: a Benchmark for Temporal Constraint-Based Planning	Zifeng Ding et.al.	2505.19927	null
2025-05-26	Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles	Jiangjie Chen et.al.	2505.19914	null
2025-05-26	APE: A Data-Centric Benchmark for Efficient LLM Adaptation in Text Summarization	Javier Marín et.al.	2505.19912	link
2025-05-27	Dynamic-I2V: Exploring Image-to-Video Generation Models via Multimodal LLM	Peng Liu et.al.	2505.19901	null
2025-05-26	ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows	Qiushi Sun et.al.	2505.19897	null
2025-05-26	Large Language Models as Autonomous Spacecraft Operators in Kerbal Space Program	Alejandro Carrasco et.al.	2505.19896	link
2025-05-26	ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining	Melis Ilayda Bal et.al.	2505.19893	null
2025-05-26	Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging	Yongxian Wei et.al.	2505.19892	link
2025-05-27	Generalized and Personalized Federated Learning with Foundation Models via Orthogonal Transformations	Eun Gyung Kong et.al.	2505.19888	null
2025-05-26	Deconstructing Obfuscation: A four-dimensional framework for evaluating Large Language Models assembly code deobfuscation capabilities	Anton Tkachenko et.al.	2505.19887	null
2025-05-26	Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought	Chao Huang et.al.	2505.19877	link
2025-05-26	A fully automated urban PV parameterization framework for improved estimation of energy production profiles	Bowen Tian et.al.	2505.19876	null
2025-05-26	StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation	Yi Wu et.al.	2505.19874	null
2025-05-26	Deep Active Inference Agents for Delayed and Long-Horizon Environments	Yavar Taheri Yeganeh et.al.	2505.19867	link
2025-05-26	HS-STAR: Hierarchical Sampling for Self-Taught Reasoners via Difficulty Estimation and Budget Reallocation	Feng Xiong et.al.	2505.19866	null
2025-05-26	CPA-RAG: Covert Poisoning Attacks on Retrieval-Augmented Generation in Large Language Models	Chunyang Li et.al.	2505.19864	null
2025-05-26	FruitNeRF++: A Generalized Multi-Fruit Counting Method Utilizing Contrastive Learning and Neural Radiance Fields	Lukas Meyer et.al.	2505.19863	link
2025-05-26	Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?	Zexi Li et.al.	2505.19855	null
2025-05-26	Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages	Gulfarogh Azam et.al.	2505.19851	null
2025-05-26	Improving Multilingual Math Reasoning for African Languages	Odunayo Ogundepo et.al.	2505.19848	null
2025-05-26	FoodTaxo: Generating Food Taxonomies with Large Language Models	Pascal Wullschleger et.al.	2505.19838	link
2025-05-26	SecVulEval: Benchmarking LLMs for Real-World C/C++ Vulnerability Detection	Md Basim Uddin Ahmed et.al.	2505.19828	link
2025-05-26	Foundation Models for Tabular Data within Systemic Contexts Need Grounding	Tassilo Klein et.al.	2505.19825	null
2025-05-26	FinLoRA: Benchmarking LoRA Methods for Fine-Tuning LLMs on Financial Datasets	Dannong Wang et.al.	2505.19819	link
2025-05-26	Deciphering Trajectory-Aided LLM Reasoning: An Optimization Perspective	Junnan Liu et.al.	2505.19815	link
2025-05-26	Efficient Multi-modal Long Context Learning for Training-free Adaptation	Zehong Ma et.al.	2505.19812	link
2025-05-26	Exploring Consciousness in LLMs: A Systematic Survey of Theories, Implementations, and Frontier Risks	Sirui Chen et.al.	2505.19806	link
2025-05-26	Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation	Siyuan Li et.al.	2505.19804	link
2025-05-26	Integrating emotional intelligence, memory architecture, and gestures to achieve empathetic humanoid robot interaction in an educational setting	Fuze Sun et.al.	2505.19803	null
2025-05-26	MOLE: Metadata Extraction and Validation in Scientific Papers Using LLMs	Zaid Alyafeai et.al.	2505.19800	link
2025-05-26	Advancements in Medical Image Classification through Fine-Tuning Natural Domain Foundation Models	Mobina Mansoori et.al.	2505.19779	link
2025-05-26	DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation	Prabash Reddy Male et.al.	2505.19774	null
2025-05-26	What Really Matters in Many-Shot Attacks? An Empirical Study of Long-Context Vulnerabilities in LLMs	Sangyeop Kim et.al.	2505.19773	null
2025-05-26	SGM: A Framework for Building Specification-Guided Moderation Filters	Masoomali Fatehkia et.al.	2505.19766	null
2025-05-26	Agentic Predictor: Performance Prediction for Agentic Workflows via Multi-View Encoding	Patara Trirat et.al.	2505.19764	link
2025-05-26	Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement Learning	Zican Hu et.al.	2505.19761	link
2025-05-26	NeuSym-RAG: Hybrid Neural Symbolic Retrieval with Multiview Structuring for PDF Question Answering	Ruisheng Cao et.al.	2505.19754	null
2025-05-27	Token-level Accept or Reject: A Micro Alignment Approach for Large Language Models	Yang Zhang et.al.	2505.19743	link
2025-05-26	ReChisel: Effective Automatic Chisel Code Generation by LLM with Reflection	Juxin Niu et.al.	2505.19734	link
2025-05-26	Accelerating Nash Learning from Human Feedback via Mirror Prox	Daniil Tiapkin et.al.	2505.19731	null
2025-05-26	Distilling Closed-Source LLM’s Knowledge for Locally Stable and Economic Biomedical Entity Linking	Yihao Ai et.al.	2505.19722	null
2025-05-26	Extremum Flow Matching for Offline Goal Conditioned Reinforcement Learning	Quentin Rouxel et.al.	2505.19717	null
2025-05-26	MT$^{3}$: Scaling MLLM-based Text Image Machine Translation via Multi-Task Reinforcement Learning	Zhaopeng Feng et.al.	2505.19714	null
2025-05-26	On the Relation between Rectified Flows and Optimal Transport	Johannes Hertrich et.al.	2505.19712	null
2025-05-26	MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19707	null
2025-05-26	Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision	Tej Deep Pala et.al.	2505.19706	link
2025-05-26	Point-RFT: Improving Multimodal Reasoning with Visually Grounded Reinforcement Finetuning	Minheng Ni et.al.	2505.19702	null
2025-05-26	Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models	Yi Liu et.al.	2505.19700	null
2025-05-26	Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments	Junming Liu et.al.	2505.19699	null
2025-05-26	DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving	Wenchao Sun et.al.	2505.19692	link
2025-05-26	Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation	Victor M. Tenorio et.al.	2505.19685	null
2025-05-26	VisCRA: A Visual Chain Reasoning Attack for Jailbreaking Multimodal Large Language Models	Bingrui Sima et.al.	2505.19684	null
2025-05-26	Large Language Models for Planning: A Comprehensive and Systematic Survey	Pengfei Cao et.al.	2505.19683	link
2025-05-23	Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs	Wafa Alghallabi et.al.	2505.18152	link
2025-05-23	Generative Distribution Embeddings	Nic Fishman et.al.	2505.18150	link
2025-05-23	First Finish Search: Efficient Test-Time Scaling in Large Language Models	Aradhye Agarwal et.al.	2505.18149	null
2025-05-23	Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find	Owen Bianchi et.al.	2505.18148	null
2025-05-23	Gaming Tool Preferences in Agentic LLMs	Kazem Faghih et.al.	2505.18135	link
2025-05-23	Reward Model Overoptimisation in Iterated RLHF	Lorenz Wolf et.al.	2505.18126	null
2025-05-23	TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations	Alan Arazi et.al.	2505.18125	null
2025-05-23	UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification	Poojah Ganesan et.al.	2505.18122	null
2025-05-23	ProgRM: Build Better GUI Agents with Progress Rewards	Danyang Zhang et.al.	2505.18121	null
2025-05-23	Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models	Jiongran Wu et.al.	2505.18120	null
2025-05-23	Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM	Zinuo Li et.al.	2505.18110	null
2025-05-23	ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework	Lisheng Huang et.al.	2505.18105	link
2025-05-23	How Can I Publish My LLM Benchmark Without Giving the True Answers Away?	Takashi Ishida et.al.	2505.18102	null
2025-05-23	Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL	Joey Hong et.al.	2505.18098	null
2025-05-23	DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations	Ziqiao Peng et.al.	2505.18096	null
2025-05-23	QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization	Weizhou Shen et.al.	2505.18092	null
2025-05-23	Data Mixing Can Induce Phase Transitions in Knowledge Acquisition	Xinran Gu et.al.	2505.18091	null
2025-05-23	Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding	Xiaoyi Zhang et.al.	2505.18079	null
2025-05-23	Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals	Jia-Nan Li et.al.	2505.18071	null
2025-05-23	Emergence of Hebbian Dynamics in Regularized Non-Local Learners	David Koplow et.al.	2505.18069	null
2025-05-23	Reward Model Generalization for Compute-Aware Test-Time Reasoning	Zeen Song et.al.	2505.18065	null
2025-05-23	A Foundation Model Framework for Multi-View MRI Classification of Extramural Vascular Invasion and Mesorectal Fascia Invasion in Rectal Cancer	Yumeng Zhang et.al.	2505.18058	null
2025-05-23	MathEDU: Towards Adaptive Feedback for Student Mathematical Problem-Solving	Wei-Ling Hsu et.al.	2505.18056	null
2025-05-23	SpikeGen: Generative Framework for Visual Spike Stream Processing	Gaole Dai et.al.	2505.18049	null
2025-05-23	Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition	Minxue Niu et.al.	2505.18040	link
2025-05-23	Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation	Li Zhong et.al.	2505.18039	null
2025-05-23	RemoteSAM: Towards Segment Anything for Earth Observation	Liang Yao et.al.	2505.18022	link
2025-05-23	LLM assisted web application functional requirements generation: A case study of four popular LLMs over a Mess Management System	Rashmi Gupta et.al.	2505.18019	null
2025-05-23	Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling	Matthieu Blanke et.al.	2505.18017	link
2025-05-23	Training with Pseudo-Code for Instruction Following	Prince Kumar et.al.	2505.18011	null
2025-05-23	Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective	Jintian Shao et.al.	2505.17997	null
2025-05-23	Outcome-based Reinforcement Learning to Predict the Future	Benjamin Turtel et.al.	2505.17989	null
2025-05-23	Towards Revealing the Effectiveness of Small-Scale Fine-tuning in R1-style Reinforcement Learning	Yutong Chen et.al.	2505.17988	link
2025-05-23	ADLGen: Synthesizing Symbolic, Event-Triggered Sensor Sequences for Human Activity Modeling	Weihang You et.al.	2505.17987	null
2025-05-23	SmartNote: An LLM-Powered, Personalised Release Note Generator That Just Works	Farbod Daneshyan et.al.	2505.17977	link
2025-05-23	Generalized Fisher-Weighted SVD: Scalable Kronecker-Factored Fisher Approximation for Compressing Large Language Models	Viktoriia Chekalina et.al.	2505.17974	null
2025-05-23	Explainable Anatomy-Guided AI for Prostate MRI: Foundation Models and In Silico Clinical Trials for Virtual Biopsy-based Risk Assessment	Danial Khan et.al.	2505.17971	null
2025-05-23	Are Large Language Models Reliable AI Scientists? Assessing Reverse-Engineering of Black-Box Systems	Jiayi Geng et.al.	2505.17968	null
2025-05-23	SVD-Free Low-Rank Adaptive Gradient Optimization for Large Language Models	Ionut-Vlad Modoranu et.al.	2505.17967	null
2025-05-23	Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL	Che Liu et.al.	2505.17952	null
2025-05-23	Survival Games: Human-LLM Strategic Showdowns under Severe Resource Scarcity	Zhihong Chen et.al.	2505.17937	link
2025-05-23	AutoMiSeg: Automatic Medical Image Segmentation via Test-Time Adaptation of Foundation Models	Xingjian Li et.al.	2505.17931	null
2025-05-23	LLM Meeting Decision Trees on Tabular Data	Hangting Ye et.al.	2505.17918	null
2025-05-23	Flexible MOF Generation with Torsion-Aware Flow Matching	Nayoung Kim et.al.	2505.17914	null
2025-05-23	ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback	Litao Guo et.al.	2505.17908	link
2025-05-23	T2I-Eval-R1: Reinforcement Learning-Driven Reasoning for Interpretable Text-to-Image Evaluation	Zi-Ao Ma et.al.	2505.17897	null
2025-05-23	DataRater: Meta-Learned Dataset Curation	Dan A. Calian et.al.	2505.17895	null
2025-05-23	Pixels to Prognosis: Harmonized Multi-Region CT-Radiomics and Foundation-Model Signatures Across Multicentre NSCLC Data	Shruti Atul Mali et.al.	2505.17893	null
2025-05-23	LLM4SP: Large Language Models for Scatterer Prediction via Synesthesia of Machines	Zengrui Han et.al.	2505.17879	null
2025-05-23	MOOSE-Chem3: Toward Experiment-Guided Hypothesis Ranking via Simulated Experimental Feedback	Wanhao Liu et.al.	2505.17873	link
2025-05-23	Mixture of Low Rank Adaptation with Partial Parameter Sharing for Time Series Forecasting	Licheng Pan et.al.	2505.17872	null
2025-05-23	The emergence of sparse attention: impact of data distribution and benefits of repetition	Nicolas Zucchet et.al.	2505.17863	null
2025-05-23	Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities	Ziwei Zhou et.al.	2505.17862	link
2025-05-23	Superplatforms Have to Attack AI Agents	Jianghao Lin et.al.	2505.17861	null
2025-05-23	Automated Testing of the GUI of a Real-Life Engineering Software using Large Language Models	Tim Rosenbach et.al.	2505.17839	null
2025-05-23	Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs’ Reasoning	Zezhong Wang et.al.	2505.17829	null
2025-05-23	Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models	Xuchen Pan et.al.	2505.17826	link
2025-05-23	Evaluation Faking: Unveiling Observer Effects in Safety Evaluation of Frontier AI Systems	Yihe Fan et.al.	2505.17815	null
2025-05-23	Don’t Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning	Michael Hassid et.al.	2505.17813	null
2025-05-23	A Coreset Selection of Coreset Selection Literature: Introduction and Recent Advances	Brian B. Moser et.al.	2505.17799	null
2025-05-23	DialogXpert: Driving Intelligent and Emotion-Aware Conversations through Online Value-Based Reinforcement Learning with LLM Priors	Tazeek Bin Abdur Rakib et.al.	2505.17795	null
2025-05-23	Titanus: Enabling KV Cache Pruning and Quantization On-the-Fly for LLM Acceleration	Peilin Chen et.al.	2505.17787	link
2025-05-23	Generative Data Augmentation for Object Point Cloud Segmentation	Dekai Zhu et.al.	2505.17783	null
2025-05-23	C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models	Amir Hossein Rahmati et.al.	2505.17773	null
2025-05-23	Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models	Patrick Leask et.al.	2505.17769	link
2025-05-23	R-Genie: Reasoning-Guided Generative Image Editing	Dong Zhang et.al.	2505.17768	null
2025-05-23	The Real Barrier to LLM Agent Usability is Agentic ROI	Weiwen Liu et.al.	2505.17767	null
2025-05-23	Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs	Ziyu Ge et.al.	2505.17762	link
2025-05-23	But what is your honest answer? Aiding LLM-judges with honest alternatives using steering vectors	Leon Eshuijs et.al.	2505.17760	null
2025-05-23	Fast Quiet-STaR: Thinking Without Thought Tokens	Wei Huang et.al.	2505.17746	null
2025-05-23	Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios	Xueyang Zhou et.al.	2505.17735	null
2025-05-23	Slot-MLLM: Object-Centric Visual Tokenization for Multimodal LLM	Donghwan Chi et.al.	2505.17726	null
2025-05-23	SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation	Dekai Zhu et.al.	2505.17721	null
2025-05-23	Get Experience from Practice: LLM Agents with Record & Replay	Erhu Feng et.al.	2505.17716	null
2025-05-23	Understanding How Value Neurons Shape the Generation of Specified Values in LLMs	Yi Su et.al.	2505.17712	null
2025-05-23	LLM Contribution Summarization in Software Projects	Rafael Corsi Ferrao et.al.	2505.17710	null
2025-05-23	CIKT: A Collaborative and Iterative Knowledge Tracing Framework with Large Language Models	Runze Li et.al.	2505.17705	null
2025-05-23	Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek	Xueyang Li et.al.	2505.17702	null
2025-05-23	COUNTDOWN: Contextually Sparse Activation Filtering Out Unnecessary Weights in Down Projection	Jaewon Cheon et.al.	2505.17701	null
2025-05-23	Activation Control for Efficiently Eliciting Long Chain-of-thought Ability of Language Models	Zekai Zhao et.al.	2505.17697	null
2025-05-23	ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph Reconstruction	Yan Yu et.al.	2505.17691	null
2025-05-23	Tuning Language Models for Robust Prediction of Diverse User Behaviors	Fanjin Meng et.al.	2505.17682	null
2025-05-23	Patterns with long and short-range order in monolayers of binary mixtures with competing interactions	M. Litniewski et.al.	2505.17675	null
2025-05-23	Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States	Yang Xiao et.al.	2505.17663	link
2025-05-23	Automated scientific minimization of regret	Marcel Binz et.al.	2505.17661	null
2025-05-23	Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling	Xiaolong Tang et.al.	2505.17659	null
2025-05-23	Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs	Hexiang Tan et.al.	2505.17656	null
2025-05-23	EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications	Ancheng Xu et.al.	2505.17654	null
2025-05-23	GeoGramBench: Benchmarking the Geometric Program Reasoning in Modern LLMs	Shixian Luo et.al.	2505.17653	null
2025-05-23	Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective	Deyang Kong et.al.	2505.17652	null
2025-05-23	Simulating Macroeconomic Expectations using LLM Agents	Jianhao Lin et.al.	2505.17648	null
2025-05-23	Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives	Huanran Chen et.al.	2505.17646	null
2025-05-23	HoloLLM: Multisensory Foundation Model for Language-Grounded Human Sensing and Reasoning	Chuhao Zhou et.al.	2505.17645	null
2025-05-23	PreMoe: Lightening MoEs on Constrained Memory by Expert Pruning and Retrieval	Zehua Pei et.al.	2505.17639	null
2025-05-23	ReqBrain: Task-Specific Instruction Tuning of LLMs for AI-Assisted Requirements Generation	Mohammad Kasra Habib et.al.	2505.17632	null
2025-05-23	BehaveGPT: A Foundation Model for Large-scale User Behavior Modeling	Jiahui Gong et.al.	2505.17631	null
2025-05-23	GIM: Improved Interpretability for Large Language Models	Joakim Edin et.al.	2505.17630	null
2025-05-23	Enhancing Large Vision-Language Models with Layout Modality for Table Question Answering on Japanese Annual Securities Reports	Hayato Aida et.al.	2505.17625	null
2025-05-23	Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration	Jingtong Gao et.al.	2505.17621	null
2025-05-23	CAS-IQA: Teaching Vision-Language Models for Synthetic Angiography Quality Assessment	Bo Wang et.al.	2505.17619	null
2025-05-22	GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning	Chengqi Duan et.al.	2505.17022	link
2025-05-22	CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms	Shilin Yan et.al.	2505.17020	link
2025-05-22	Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework	Chenhao Zhang et.al.	2505.17019	link
2025-05-22	SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward	Kaixuan Fan et.al.	2505.17018	link
2025-05-22	Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO	Chengzhuo Tong et.al.	2505.17017	link
2025-05-22	Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models	Runsen Xu et.al.	2505.17015	null
2025-05-22	SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding	Haoning Wu et.al.	2505.17012	link
2025-05-22	R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning	Huatong Song et.al.	2505.17005	link
2025-05-22	Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?	Jin Jiang et.al.	2505.16998	link
2025-05-22	DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization	Chao Zhang et.al.	2505.16995	null
2025-05-22	Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding	Runpeng Yu et.al.	2505.16990	link
2025-05-22	T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning	Amartya Chakraborty et.al.	2505.16986	null
2025-05-22	UFT: Unifying Supervised and Reinforcement Fine-Tuning	Mingyang Liu et.al.	2505.16984	link
2025-05-22	LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding	Junlong Tong et.al.	2505.16983	link
2025-05-22	Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine	Adib Bazgir et.al.	2505.16982	null
2025-05-22	HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation	Weizhi Tang et.al.	2505.16978	link
2025-05-22	SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development	Yaxin Du et.al.	2505.16975	link
2025-05-22	Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models	Junjie Xiong et.al.	2505.16957	null
2025-05-22	A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization	Shengyu Feng et.al.	2505.16952	null
2025-05-22	From Reality to Virtual Worlds: The Role of Photogrammetry in Game Development	Santiago Berrezueta-Guzman et.al.	2505.16951	null
2025-05-22	Bottlenecked Transformers: Periodic KV Cache Abstraction for Generalised Reasoning	Adnan Oomerjee et.al.	2505.16950	null
2025-05-22	MixAT: Combining Continuous and Discrete Adversarial Training for LLMs	Csaba Dékány et.al.	2505.16947	link
2025-05-22	AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios	Yunjia Qi et.al.	2505.16944	link
2025-05-23	FoMoH: A clinically meaningful foundation model evaluation for structured electronic health records	Chao Pang et.al.	2505.16941	link
2025-05-22	In-Context Watermarks for Large Language Models	Yepeng Liu et.al.	2505.16934	null
2025-05-22	LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning	Zebin You et.al.	2505.16933	null
2025-05-22	UNCLE: Uncertainty Expressions in Long-Form Generation	Ruihan Yang et.al.	2505.16922	null
2025-05-22	Scalable and Interpretable Contextual Bandits: A Literature Review and Retail Offer Prototype	Nikola Tankovic et.al.	2505.16918	null
2025-05-22	Backdoor Cleaning without External Guidance in MLLM Fine-tuning	Xuankun Rong et.al.	2505.16916	link
2025-05-22	Unsupervised Prompting for Graph Neural Networks	Peyman Baghershahi et.al.	2505.16903	null
2025-05-22	Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks	Hongyuan Tao et.al.	2505.16901	null
2025-05-23	Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality	Jintian Shao et.al.	2505.16900	link
2025-05-22	Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs	Zeyu Wei et.al.	2505.16894	null
2025-05-22	CAIN: Hijacking LLM-Humans Conversations via a Two-Stage Malicious System Prompt Generation and Refining Framework	Viet Pham et.al.	2505.16888	null
2025-05-22	Don’t “Overthink” Passage Reranking: Is Reasoning Truly Necessary?	Nour Jedidi et.al.	2505.16886	null
2025-05-22	CASTILLO: Characterizing Response Length Distributions of Large Language Models	Daniel F. Perez-Ramirez et.al.	2505.16881	link
2025-05-22	MPO: Multilingual Safety Alignment via Reward Gap Optimization	Weixiang Zhao et.al.	2505.16869	link
2025-05-22	Conditional Panoramic Image Generation via Masked Autoregressive Modeling	Chaoyang Wang et.al.	2505.16862	null
2025-05-22	Walk&Retrieve: Simple Yet Effective Zero-shot Retrieval-Augmented Generation via Knowledge Graph Walks	Martin Böckling et.al.	2505.16849	link
2025-05-22	R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search	Yibo Wang et.al.	2505.16838	link
2025-05-22	SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis	Shuang Sun et.al.	2505.16834	link
2025-05-22	From EduVisBench to EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization	Haonian Ji et.al.	2505.16832	link
2025-05-22	Unlearning Isn’t Deletion: Investigating Reversibility of Machine Unlearning in LLMs	Xiaoyu Xu et.al.	2505.16831	link
2025-05-22	KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning	Wei Sun et.al.	2505.16826	link
2025-05-22	Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts	Taewon Kang et.al.	2505.16819	null
2025-05-22	DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation	Bowen Zheng et.al.	2505.16810	null
2025-05-22	Two-way Evidence self-Alignment based Dual-Gated Reasoning Enhancement	Kexin Zhang et.al.	2505.16806	null
2025-05-22	Learning Beyond Limits: Multitask Learning and Synthetic Data for Low-Resource Canonical Morpheme Segmentation	Changbing Yang et.al.	2505.16800	null
2025-05-22	REOBench: Benchmarking Robustness of Earth Observation Foundation Models	Xiang Li et.al.	2505.16793	link
2025-05-22	Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability	Punya Syon Pandey et.al.	2505.16789	link
2025-05-22	CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models	Zhenzhen Ren et.al.	2505.16785	null
2025-05-22	Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning	Xinghao Chen et.al.	2505.16782	link
2025-05-22	IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models	Yiming Gao et.al.	2505.16774	link
2025-05-22	When Safety Detectors Aren’t Enough: A Stealthy and Effective Jailbreak Attack on LLMs via Steganographic Techniques	Jianing Geng et.al.	2505.16765	null
2025-05-22	TRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven Pruning	Florentin Beck et.al.	2505.16743	link
2025-05-22	Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization	Chengcan Wu et.al.	2505.16737	link
2025-05-22	Forward-only Diffusion Probabilistic Models	Ziwei Luo et.al.	2505.16733	link
2025-05-22	Masked Conditioning for Deep Generative Models	Phillip Mueller et.al.	2505.16725	null
2025-05-22	Advancing Brainwave Modeling with a Codebook-Based Foundation Model	Konstantinos Barmpas et.al.	2505.16724	null
2025-05-22	Breaking mBad! Supervised Fine-tuning for Cross-Lingual Detoxification	Himanshu Beniwal et.al.	2505.16722	link
2025-05-22	Training Long-Context LLMs Efficiently via Chunk-wise Optimization	Wenhao Li et.al.	2505.16710	link
2025-05-22	A Novel Generative Model with Causality Constraint for Mitigating Biases in Recommender Systems	Jianfeng Deng et.al.	2505.16708	null
2025-05-22	KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models	Yongliang Wu et.al.	2505.16707	null
2025-05-22	Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs	Zeping Yu et.al.	2505.16703	null
2025-05-22	MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models	Xuanqi Gao et.al.	2505.16700	null
2025-05-22	Software Architecture Meets LLMs: A Systematic Literature Review	Larissa Schmid et.al.	2505.16697	null
2025-05-22	Sensitivity of ECG QRS Complexes to His-Purkinje Structure in Computational Heart Models	Preetam V. Tanikella et.al.	2505.16696	null
2025-05-22	Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence	Gouki Minegishi et.al.	2505.16694	null
2025-05-22	Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator	Beier Luo et.al.	2505.16690	null
2025-05-22	Semantic Compression of 3D Objects for Open and Collaborative Virtual Worlds	Jordan Dotzel et.al.	2505.16679	null
2025-05-22	Hybrid Parameterized Quantum States for Variational Quantum Learning	Chen-Yu Liu et.al.	2505.16676	null
2025-05-22	R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO	Huanjin Yao et.al.	2505.16673	link
2025-05-22	BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models	Xiaobei Yan et.al.	2505.16670	null
2025-05-22	SD-MAD: Sign-Driven Few-shot Multi-Anomaly Detection in Medical Images	Kaiyu Guo et.al.	2505.16659	null
2025-05-22	Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding	Feilong Tang et.al.	2505.16652	null
2025-05-22	Collaboration among Multiple Large Language Models for Medical Question Answering	Kexin Shang et.al.	2505.16648	null
2025-05-23	SMART: Self-Generating and Self-Validating Multi-Dimensional Assessment for LLMs’ Mathematical Problem Solving	Yujie Hou et.al.	2505.16646	null
2025-05-22	From Evaluation to Defense: Advancing Safety in Video Large Language Models	Yiwei Sun et.al.	2505.16643	null
2025-05-23	SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation	Wenjie Yang et.al.	2505.16637	link
2025-05-22	WikiDBGraph: Large-Scale Database Graph of Wikidata for Collaborative Learning	Zhaomin Wu et.al.	2505.16635	null
2025-05-22	Steering Large Language Models for Machine Translation Personalization	Daniel Scalena et.al.	2505.16612	link
2025-05-22	From Generic Empathy to Personalized Emotional Support: A Self-Evolution Framework for User Preference Alignment	Jing Ye et.al.	2505.16610	null
2025-05-22	Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering	Bowen Jiang et.al.	2505.16591	null
2025-05-22	Beyond LLMs: An Exploration of Small Open-source Language Models in Logging Statement Generation	Renyi Zhong et.al.	2505.16590	null
2025-05-22	A Survey on the Application of Large Language Models in Scenario-Based Testing of Automated Driving Systems	Yongqi Zhao et.al.	2505.16587	link
2025-05-22	O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering	Jianbiao Mei et.al.	2505.16582	link
2025-05-22	Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning	Siqu Ou et.al.	2505.16579	link
2025-05-22	Large Language Model-Empowered Interactive Load Forecasting	Yu Zuo et.al.	2505.16577	null
2025-05-22	EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions	Spencer Hong et.al.	2505.16576	link
2025-05-22	URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training	Dongyang Fan et.al.	2505.16570	null
2025-05-23	Finetuning-Activated Backdoors in LLMs	Thibaud Gloaguen et.al.	2505.16567	link
2025-05-22	ScholarBench: A Bilingual Benchmark for Abstraction, Comprehension, and Reasoning Evaluation in Academic Contexts	Dongwon Noh et.al.	2505.16566	null
2025-05-22	CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning	Biao Yi et.al.	2505.16559	null
2025-05-22	Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning	Junchi Yao et.al.	2505.16557	null
2025-05-23	Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains	Wenhui Tan et.al.	2505.16552	null
2025-05-22	Incremental Sequence Classification with Temporal Consistency	Lucas Maystre et.al.	2505.16548	null
2025-05-22	TextureSAM: Towards a Texture Aware Foundation Model for Segmentation	Inbal Cohen et.al.	2505.16540	null
2025-05-22	Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models	Ercong Nie et.al.	2505.16538	null
2025-05-22	HOFT: Householder Orthogonal Fine-tuning	Alejandro Moreno Arcas et.al.	2505.16531	null
2025-05-22	DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection	Yuliang Yan et.al.	2505.16530	link
2025-05-21	On the creation of narrow AI: hierarchy and nonlocality of neural network skills	Eric J. Michaud et.al.	2505.15811	link
2025-05-21	MMaDA: Multimodal Large Diffusion Language Models	Ling Yang et.al.	2505.15809	link
2025-05-21	Neural Conditional Transport Maps	Carlos Rodriguez-Pardo et.al.	2505.15808	null
2025-05-21	The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation	Patrick Kahardipraja et.al.	2505.15807	link
2025-05-21	Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering	Hwan Chang et.al.	2505.15805	link
2025-05-21	STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs	Zongzhao Li et.al.	2505.15804	link
2025-05-21	VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models	Yuchen Yan et.al.	2505.15801	null
2025-05-21	Interspatial Attention for Efficient 4D Human Video Generation	Ruizhi Shao et.al.	2505.15800	null
2025-05-21	Reverse Engineering Human Preferences with Reinforcement Learning	Lisa Alazraki et.al.	2505.15795	null
2025-05-21	HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving	Zhiwen Chen et.al.	2505.15793	null
2025-05-21	Large Language Models as Computable Approximations to Solomonoff Induction	Jun Wan et.al.	2505.15784	null
2025-05-21	IA-T2I: Internet-Augmented Text-to-Image Generation	Chuanhao Li et.al.	2505.15779	null
2025-05-21	ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning	Changtai Zhu et.al.	2505.15776	link
2025-05-21	Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention	Huanxuan Liao et.al.	2505.15774	link
2025-05-21	MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling	Cheng Yifan et.al.	2505.15772	null
2025-05-21	Constructing a 3D Town from a Single Image	Kaizhi Zheng et.al.	2505.15765	null
2025-05-21	An Empirical Analysis of Vulnerability Detection Tools for Solidity Smart Contracts Using Line Level Manually Annotated Vulnerabilities	Francesco Salzano et.al.	2505.15756	null
2025-05-21	Exploring The Visual Feature Space for Multimodal Neural Decoding	Weihao Xia et.al.	2505.15755	null
2025-05-21	Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval	Taiye Chen et.al.	2505.15753	null
2025-05-21	Multi-modal Integration Analysis of Alzheimer’s Disease Using Large Language Models and Knowledge Graphs	Kanan Kiguchi et.al.	2505.15747	null
2025-05-21	Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications	Dikshit Chauhan et.al.	2505.15741	null
2025-05-21	HybridProver: Augmenting Theorem Proving with LLM-Driven Proof Synthesis and Refinement	Jilin Hu et.al.	2505.15740	null
2025-05-21	Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses	Xiaoxue Yang et.al.	2505.15738	link
2025-05-21	DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning	Gaurav Srivastava et.al.	2505.15734	null
2025-05-21	VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models	Heyang Liu et.al.	2505.15727	link
2025-05-21	Shared Path: Unraveling Memorization in Multilingual LLMs through Language Similarities	Xiaoyu Luo et.al.	2505.15722	null
2025-05-21	Privacy-Preserving Conformal Prediction Under Local Differential Privacy	Coby Penso et.al.	2505.15721	link
2025-05-21	Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling	He Hu et.al.	2505.15715	null
2025-05-21	TurnaboutLLM: A Deductive Reasoning Benchmark from Detective Games	Yuan Yuan et.al.	2505.15712	null
2025-05-21	Advancing LLM Safe Alignment with Safety Representation Ranking	Tianqi Du et.al.	2505.15710	null
2025-05-21	LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing	Peng Wang et.al.	2505.15702	link
2025-05-21	HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph Databases	Pingqing Zheng et.al.	2505.15701	link
2025-05-21	Can Large Language Models be Effective Online Opinion Miners?	Ryang Heo et.al.	2505.15695	link
2025-05-21	Toward Open Earth Science as Fast and Accessible as Natural Language	Marquita Ellis et.al.	2505.15690	null
2025-05-21	From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems	Xiuchao Sui et.al.	2505.15685	link
2025-05-21	ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy	Gengyang Li et.al.	2505.15684	null
2025-05-21	UniErase: Unlearning Token as a Universal Erasure Primitive for Language Models	Miao Yu et.al.	2505.15674	link
2025-05-21	Graph Conditional Flow Matching for Relational Data Generation	Davide Scassola et.al.	2505.15668	link
2025-05-21	Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization	Jiaming Zhou et.al.	2505.15660	link
2025-05-21	Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!	Zhexin Zhang et.al.	2505.15656	link
2025-05-21	Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models	Zihao Li et.al.	2505.15634	null
2025-05-21	Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions	David Thulke et.al.	2505.15633	null
2025-05-21	Can LLMs $\textit{understand}$ Math? – Exploring the Pitfalls in Mathematical Reasoning	Tiasa Singha Roy et.al.	2505.15623	null
2025-05-21	DS-Bench: A Realistic Benchmark for Data Science Code Generation	Shuyin Ouyang et.al.	2505.15621	link
2025-05-21	LENS: Multi-level Evaluation of Multimodal Reasoning with Large Language Models	Ruilin Yao et.al.	2505.15616	null
2025-05-21	From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning	David Dinucu-Jianu et.al.	2505.15607	link
2025-05-21	Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off	Yury Belousov et.al.	2505.15594	null
2025-05-21	Federated Learning with Unlabeled Clients: Personalization Can Happen in Low Dimensions	Hossein Zakerinia et.al.	2505.15579	null
2025-05-21	Bridging the Domain Gap in Equation Distillation with Reinforcement Feedback	Wangyang Ying et.al.	2505.15572	null
2025-05-21	Moonbeam: A MIDI Foundation Model Using Both Absolute and Relative Music Attributes	Zixun Guo et.al.	2505.15559	null
2025-05-21	DayDreamer at CQs-Gen 2025: Generating Critical Questions through Argument Scheme Completion	Wendi Zhou et.al.	2505.15554	null
2025-05-21	Social Bias in Popular Question-Answering Benchmarks	Angelie Kraft et.al.	2505.15553	null
2025-05-21	Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs	Lang Gao et.al.	2505.15524	null
2025-05-21	Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution Shifts	Debarshi Brahma et.al.	2505.15506	link
2025-05-21	Protoknowledge Shapes Behaviour of LLMs in Downstream Tasks: Memorization and Generalization with Knowledge Graphs	Federico Ranaldi et.al.	2505.15501	null
2025-05-21	KaFT: Knowledge-aware Fine-tuning for Boosting LLMs’ Domain-specific Question-Answering Performance	Qihuang Zhong et.al.	2505.15480	null
2025-05-21	LFTF: Locating First and Then Fine-Tuning for Mitigating Gender Bias in Large Language Models	Zhanyue Qin et.al.	2505.15475	null
2025-05-21	PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions	Song Dai et.al.	2505.15472	null
2025-05-21	CoLA: Collaborative Low-Rank Adaptation	Yiyun Zhou et.al.	2505.15471	link
2025-05-21	A Qualitative Investigation into LLM-Generated Multilingual Code Comments and Automatic Evaluation Metrics	Jonathan Katzy et.al.	2505.15469	null
2025-05-21	Joint Flashback Adaptation for Forgetting-Resistant Instruction Tuning	Yukun Zhao et.al.	2505.15467	null
2025-05-21	Teaching Language Models to Evolve with Users: Dynamic Profile Modeling for Personalized Alignment	Weixiang Zhao et.al.	2505.15456	null
2025-05-21	ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning	Ziqiang Xu et.al.	2505.15447	null
2025-05-21	On the Generalization vs Fidelity Paradox in Knowledge Distillation	Suhas Kamasetty Ramesh et.al.	2505.15442	link
2025-05-21	Bridging Sign and Spoken Languages: Pseudo Gloss Generation for Sign Language Translation	Jianyuan Guo et.al.	2505.15438	null
2025-05-21	Set-LLM: A Permutation-Invariant LLM	Beni Egressy et.al.	2505.15433	null
2025-05-21	Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought	Ao Liu et.al.	2505.15431	null
2025-05-21	Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign Queries	Yuhao Wang et.al.	2505.15420	null
2025-05-21	ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs	Bahar Radmehr et.al.	2505.15410	link
2025-05-21	Reranking with Compressed Document Representation	Hervé Déjean et.al.	2505.15394	null
2025-05-21	An Empirical Study of the Anchoring Effect in LLMs: Existence, Mechanism, and Potential Mitigations	Yiming Huang et.al.	2505.15392	null
2025-05-21	RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection	Yiming Huang et.al.	2505.15386	null
2025-05-21	X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System	Peng Wang et.al.	2505.15372	link
2025-05-21	AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals	Stefan Pasch et.al.	2505.15365	null
2025-05-21	NL-Debugging: Exploiting Natural Language as an Intermediate Representation for Code Debugging	Weiming Zhang et.al.	2505.15356	null
2025-05-21	FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management	Xiang Liu et.al.	2505.15347	null
2025-05-21	SSR: Speculative Parallel Scaling Reasoning in Test-time	Yuanlin Chu et.al.	2505.15340	null
2025-05-21	Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors	Hao Fang et.al.	2505.15337	null
2025-05-21	Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification	Bernardin Ligan et.al.	2505.15334	null
2025-05-21	Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models	Ria Shekhawat et.al.	2505.15332	null
2025-05-21	Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack	Silvia Cappelletti et.al.	2505.15323	null
2025-05-21	Emotional Supporters often Use Multiple Strategies in a Single Turn	Xin Bai et.al.	2505.15316	null
2025-05-21	Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning	Yurun Yuan et.al.	2505.15311	null
2025-05-21	Towards Pre-training an Effective Respiratory Audio Foundation Model	Daisuke Niizumi et.al.	2505.15307	link
2025-05-21	Multiple Weaks Win Single Strong: Large Language Models Ensemble Weak Reinforcement Learning Agents into a Supreme One	Yiwen Song et.al.	2505.15306	null
2025-05-21	Chinese Toxic Language Mitigation via Sentiment Polarity Consistent Rewrites	Xintong Wang et.al.	2505.15297	null
2025-05-21	LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models	Qianyue Hao et.al.	2505.15293	null
2025-05-21	Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization	Joonho Yang et.al.	2505.15291	null
2025-05-21	Web-Shepherd: Advancing PRMs for Reinforcing Web Agents	Hyungjoo Chae et.al.	2505.15277	link
2025-05-21	Scaling Diffusion Transformers Efficiently via μP	Chenyu Zheng et.al.	2505.15270	link
2025-05-21	LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval	Zhenyu Ning et.al.	2505.15269	null
2025-05-21	Blind Spot Navigation: Evolutionary Discovery of Sensitive Semantic Concepts for LVLMs	Zihao Pan et.al.	2505.15265	null
2025-05-21	gen2seg: Generative Models Enable Generalizable Instance Segmentation	Om Khangaonkar et.al.	2505.15263	null
2025-05-21	ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search	Hyunseok Lee et.al.	2505.15259	null
2025-05-21	When Less Language is More: Language-Reasoning Disentanglement Makes LLMs Better Multilingual Reasoners	Weixiang Zhao et.al.	2505.15257	null
2025-05-21	MentalMAC: Enhancing Large Language Models for Detecting Mental Manipulation via Multi-Task Anti-Curriculum Distillation	Yuansheng Gao et.al.	2505.15255	null
2025-05-21	Towards Explainable Temporal Reasoning in Large Language Models: A Structure-Aware Generative Framework	Zihao Jiang et.al.	2505.15245	link
2025-05-21	Adaptive Plan-Execute Framework for Smart Contract Security Auditing	Zhiyuan Wei et.al.	2505.15242	null
2025-05-21	Multilingual Prompting for Improving LLM Generation Diversity	Qihan Wang et.al.	2505.15229	null
2025-05-21	Multimodal Conditional Information Bottleneck for Generalizable AI-Generated Image Detection	Haotian Qin et.al.	2505.15217	link
2025-05-20	Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning	Haolei Xu et.al.	2505.14684	null
2025-05-20	Emerging Properties in Unified Multimodal Pretraining	Chaorui Deng et.al.	2505.14683	null
2025-05-20	UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation	Rui Tian et.al.	2505.14682	null
2025-05-20	NExT-Search: Rebuilding User Feedback Ecosystem for Generative AI Search	Sunhao Dai et.al.	2505.14680	null
2025-05-20	UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models	Xiaojie Gu et.al.	2505.14679	link
2025-05-20	Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning	Jiaer Xia et.al.	2505.14677	null
2025-05-20	Reward Reasoning Model	Jiaxin Guo et.al.	2505.14674	null
2025-05-20	Training-Free Watermarking for Autoregressive Image Generation	Yu Tong et.al.	2505.14673	link
2025-05-20	Quartet: Native FP4 Training Can Be Optimal for Large Language Models	Roberto L. Castro et.al.	2505.14669	link
2025-05-20	ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions	Bufang Yang et.al.	2505.14668	null
2025-05-20	Beyond Words: Multimodal LLM Knows When to Speak	Zikai Liao et.al.	2505.14654	null
2025-05-20	General-Reasoner: Advancing LLM Reasoning Across All Domains	Xueguang Ma et.al.	2505.14652	null
2025-05-20	Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits	Tiantian Feng et.al.	2505.14648	link
2025-05-21	Think Only When You Need with Large Hybrid-Reasoning Models	Lingjie Jiang et.al.	2505.14631	null
2025-05-20	KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models	Fnu Mohbat et.al.	2505.14629	link
2025-05-20	Debating for Better Reasoning: An Unsupervised Multimodal Approach	Ashutosh Adhikari et.al.	2505.14627	null
2025-05-20	TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning	Zhangchen Xu et.al.	2505.14625	link
2025-05-20	Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs	Morgan Lindsay Heisler et.al.	2505.14620	null
2025-05-20	Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models	Sahar Abdelnabi et.al.	2505.14617	link
2025-05-20	SATBench: Benchmarking LLMs’ Logical Reasoning via Automated Puzzle Generation from SAT Formulas	Anjiang Wei et.al.	2505.14615	null
2025-05-20	sudoLLM: On Multi-role Alignment of Language Models	Soumadeep Saha et.al.	2505.14607	null
2025-05-20	Towards a Foundation Model for Communication Systems	Davide Buffelli et.al.	2505.14603	null
2025-05-20	Toward Reliable Biomedical Hypothesis Generation: Evaluating Truthfulness and Hallucination in Large Language Models	Guangzhi Xiong et.al.	2505.14599	link
2025-05-20	Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning	Wenbin Hu et.al.	2505.14585	null
2025-05-20	TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring	Sohaila Eltanbouly et.al.	2505.14577	null
2025-05-20	Neural Inverse Scattering with Score-based Regularization	Yuan Gao et.al.	2505.14560	null
2025-05-21	KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation	Jiajun Shi et.al.	2505.14552	link
2025-05-20	Can Large Language Models Really Recognize Your Name?	Dzung Pham et.al.	2505.14549	link
2025-05-20	Time to Embed: Unlocking Foundation Models for Time Series with Channel Descriptions	Utsav Dutta et.al.	2505.14543	null
2025-05-20	Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders	Agam Goyal et.al.	2505.14536	null
2025-05-20	Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs	Zhipeng Yang et.al.	2505.14530	link
2025-05-20	BugRepro: Enhancing Android Bug Reproduction with Domain-Specific Knowledge Integration	Hongrong Yin et.al.	2505.14528	null
2025-05-20	Guarded Query Routing for Large Language Models	Richard Šléher et.al.	2505.14524	link
2025-05-21	Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling	Zhihao Li et.al.	2505.14521	null
2025-05-20	Teaching Audio-Aware Large Language Models What Does Not Hear: Mitigating Hallucinations through Synthesized Negative Samples	Chun-Yi Kuan et.al.	2505.14518	null
2025-05-20	Latent Flow Transformer	Yen-Chen Wu et.al.	2505.14513	link
2025-05-20	ModRWKV: Transformer Multimodality in Linear Time	Jiale Kang et.al.	2505.14505	link
2025-05-20	Enhanced Multimodal Aspect-Based Sentiment Analysis by LLM-Generated Rationales	Jun Cao et.al.	2505.14499	null
2025-05-20	Reasoning Models Better Express Their Confidence	Dongkeun Yoon et.al.	2505.14489	link
2025-05-20	MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance	Agam Goyal et.al.	2505.14483	null
2025-05-20	Towards Reliable Proof Generation with LLMs: A Neuro-Symbolic Approach	Oren Sultan et.al.	2505.14479	null
2025-05-20	Enhancing Interpretability of Sparse Latent Representations with Class Information	Farshad Sangari Abiz et.al.	2505.14476	null
2025-05-20	Attributional Safety Failures in Large Language Models under Code-Mixed Perturbations	Somnath Banerjee et.al.	2505.14469	null
2025-05-20	ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs	Yifan Sui et.al.	2505.14468	null
2025-05-20	VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank	Tianhe Wu et.al.	2505.14460	link
2025-05-20	Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models	Xuyang Liu et.al.	2505.14454	link
2025-05-20	Creative Preference Optimization	Mete Ismayilzada et.al.	2505.14442	null
2025-05-20	S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models	Yuanbo Fang et.al.	2505.14438	link
2025-05-20	Neural Incompatibility: The Unbridgeable Gap of Cross-Scale Parametric Knowledge Transfer in Large Language Models	Yuqiao Tan et.al.	2505.14436	link
2025-05-20	Choosing a Model, Shaping a Future: Comparing LLM Perspectives on Sustainability and its Relationship with AI	Annika Bush et.al.	2505.14435	null
2025-05-20	Rank-K: Test-Time Reasoning for Listwise Reranking	Eugene Yang et.al.	2505.14432	link
2025-05-20	From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning	Chalamalasetti Kranti et.al.	2505.14425	null
2025-05-20	MindVote: How LLMs Predict Human Decision-Making in Social Media Polls	Xutao Mao et.al.	2505.14422	null
2025-05-20	Hidden Ghost Hand: Unveiling Backdoor Vulnerabilities in MLLM-Powered Mobile GUI Agents	Pengzhou Cheng et.al.	2505.14418	null
2025-05-20	Towards Non-Euclidean Foundation Models: Advancing AI Beyond Euclidean Frameworks	Menglin Yang et.al.	2505.14417	null
2025-05-20	Table Foundation Models: on knowledge pre-training for tabular learning	Myung Jun Kim et.al.	2505.14415	null
2025-05-20	Diving into the Fusion of Monocular Priors for Generalized Stereo Matching	Chengtang Yao et.al.	2505.14414	link
2025-05-20	Byte Pair Encoding for Efficient Time Series Forecasting	Leon Götz et.al.	2505.14411	null
2025-05-21	Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis	Haoming Huang et.al.	2505.14406	null
2025-05-20	OmniGenBench: A Modular Platform for Reproducible Genomic Foundation Models Benchmarking	Heng Yang et.al.	2505.14402	link
2025-05-20	Log-Augmented Generation: Scaling Test-Time Reasoning with Reusable Computation	Peter Baile Chen et.al.	2505.14398	null
2025-05-20	Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds	Gaël Gendron et.al.	2505.14396	link
2025-05-20	MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language	Seyoung Song et.al.	2505.14395	link
2025-05-20	Knowledge Graph Based Repository-Level Code Generation	Mihir Athale et.al.	2505.14394	null
2025-05-20	SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation	Yuyang Dong et.al.	2505.14381	null
2025-05-20	AutoRev: Automatic Peer Review System for Academic Research Papers	Maitreya Prafulla Chitale et.al.	2505.14376	null
2025-05-20	Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs	Jiawen Wang et.al.	2505.14368	null
2025-05-21	Dual Decomposition of Weights and Singular Value Low Rank Adaptation	Jialong Han et.al.	2505.14367	null
2025-05-20	Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives	Xingxing Weng et.al.	2505.14361	null
2025-05-20	Dual Data Alignment Makes AI-Generated Image Detector Easier Generalizable	Ruoxin Chen et.al.	2505.14359	null
2025-05-20	PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs	Sho Inoue et.al.	2505.14356	link
2025-05-20	WirelessMathBench: A Mathematical Modeling Benchmark for LLMs in Wireless Communications	Xin Li et.al.	2505.14354	null
2025-05-21	OSoRA: Output-Dimension and Singular-Value Initialized Low-Rank Adaptation	Jialong Han et.al.	2505.14350	null
2025-05-20	QA-prompting: Improving Summarization with Large Language Models using Question-Answering	Neelabh Sinha et.al.	2505.14347	link
2025-05-20	Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach	Umberto Cappellazzo et.al.	2505.14336	null
2025-05-20	Handloom Design Generation Using Generative Networks	Rajat Kanti Bhattacharjee et.al.	2505.14330	null
2025-05-20	RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection	Wenjun Hou et.al.	2505.14318	link
2025-05-20	Exploring Jailbreak Attacks on LLMs through Intent Concealment and Diversion	Tiehan Cui et.al.	2505.14316	null
2025-05-20	Low-Cost FlashAttention with Fused Exponential and Multiplication Hardware Operators	Kosmas Alexandridis et.al.	2505.14314	null
2025-05-20	A MIND for Reasoning: Meta-learning for In-context Deduction	Leonardo Bertolazzi et.al.	2505.14313	link
2025-05-20	HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing	Shamsuddeen Hassan Muhammad et.al.	2505.14311	null
2025-05-20	JOLT-SQL: Joint Loss Tuning of Text-to-SQL with Confusion-aware Noisy Schema Sampling	Jinwang Song et.al.	2505.14305	link
2025-05-20	Scaling Law for Quantization-Aware Training	Mengzhao Chen et.al.	2505.14302	null
2025-05-20	SafetyNet: Detecting Harmful Outputs in LLMs by Modeling and Monitoring Deceptive Behaviors	Maheep Chaudhary et.al.	2505.14300	null
2025-05-20	Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy	Zihao Feng et.al.	2505.14299	null
2025-05-20	Cross-Lingual Optimization for Language Transfer in Large Language Models	Jungseob Lee et.al.	2505.14297	null
2025-05-20	Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs	Rao Ma et.al.	2505.14286	null
2025-05-20	YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering	Jennifer D’Souza et.al.	2505.14279	null
2025-05-20	Think-J: Learning to Think for Generative LLM-as-a-Judge	Hui Huang et.al.	2505.14268	link
2025-05-20	AAPO: Enhance the Reasoning Capabilities of LLMs with Advantage Momentum	Jian Xiong et.al.	2505.14264	null
2025-05-20	Speculative Decoding Reimagined for Multimodal Large Language Models	Luxi Lin et.al.	2505.14260	link
2025-05-20	FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation	Shaolin Zhu et.al.	2505.14256	null
2025-05-20	TransBench: Benchmarking Machine Translation for Industrial-Scale Applications	Haijun Li et.al.	2505.14244	null
2025-05-20	ABBA: Highly Expressive Hadamard Product Adaptation for Large Language Models	Raghav Singhal et.al.	2505.14238	link
2025-05-20	UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning	Sule Bai et.al.	2505.14231	null
2025-05-20	“Haet Bhasha aur Diskrimineshun”: Phonetic Perturbations in Code-Mixed Hinglish to Red-Team LLMs	Darpan Aswal et.al.	2505.14226	null
2025-05-20	Automatic Dataset Generation for Knowledge Intensive Question Answering Tasks	Sizhe Yuen et.al.	2505.14212	null
2025-05-20	Challenges and Limitations in the Synthetic Generation of mHealth Sensor Data	Flavio Di Martino et.al.	2505.14206	null
2025-05-20	Capturing the Effects of Quantization on Trojans in Code LLMs	Aftab Hussain et.al.	2505.14200	null
2025-05-20	Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method	Xinshen Zhang et.al.	2505.14197	null
2025-05-19	Mean Flows for One-step Generative Modeling	Zhengyang Geng et.al.	2505.13447	null
2025-05-19	Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards	Xiaoyuan Liu et.al.	2505.13445	link
2025-05-19	Optimizing Anytime Reasoning via Budget Relative Policy Optimization	Penghui Qi et.al.	2505.13438	link
2025-05-19	SMOTExT: SMOTE meets Large Language Models	Mateusz Bystroński et.al.	2505.13434	null
2025-05-19	Synthetic-Powered Predictive Inference	Meshi Bashari et.al.	2505.13432	link
2025-05-19	Fine-tuning Quantized Neural Networks with Zeroth-order Optimization	Sifeng Shang et.al.	2505.13430	link
2025-05-19	MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision	Lingxiao Du et.al.	2505.13427	link
2025-05-19	Learnware of Language Models: Specialized Small Language Models Can Do Big	Zhi-Hao Tan et.al.	2505.13425	link
2025-05-19	Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard	Si-Yang Liu et.al.	2505.13421	null
2025-05-19	FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning	Zhuozhao Hu et.al.	2505.13419	link
2025-05-19	CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process	Jinhe Bi et.al.	2505.13408	null
2025-05-19	AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database	Rong Bian et.al.	2505.13406	null
2025-05-19	MR. Judge: Multimodal Reasoner as a Judge	Renjie Pi et.al.	2505.13403	null
2025-05-19	CompeteSMoE – Statistically Guaranteed Mixture of Experts Training via Competition	Nam V. Nguyen et.al.	2505.13380	link
2025-05-19	Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation	Yasi Zhang et.al.	2505.13377	null
2025-05-19	Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots	Dan BW Choe et.al.	2505.13376	null
2025-05-19	Minimum-Excess-Work Guidance	Christopher Kolloff et.al.	2505.13375	null
2025-05-19	One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling	Nimrod Berman et.al.	2505.13358	link
2025-05-19	Multi-Armed Bandits Meet Large Language Models	Djallel Bouneffouf et.al.	2505.13355	null
2025-05-20	Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning	Adam Štorek et.al.	2505.13353	null
2025-05-19	Investigating the Vulnerability of LLM-as-a-Judge Architectures to Prompt-Injection Attacks	Narek Maloyan et.al.	2505.13348	null
2025-05-19	J4R: Learning to Judge with Equivalent Initial State Group Relative Preference Optimization	Austin Xu et.al.	2505.13346	null
2025-05-19	Thinking Short and Right Over Thinking Long: Serving LLM Reasoning Efficiently and Accurately	Yuhang Wang et.al.	2505.13326	null
2025-05-19	VesselGPT: Autoregressive Modeling of Vascular Geometry	Paula Feldman et.al.	2505.13318	null
2025-05-19	GUARD: Generation-time LLM Unlearning via Adaptive Restriction and Detection	Zhijie Deng et.al.	2505.13312	null
2025-05-19	Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space	Hengli Li et.al.	2505.13308	link
2025-05-19	RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning	Qiguang Chen et.al.	2505.13307	link
2025-05-19	I’ll believe it when I see it: Images increase misinformation sharing in Vision-Language Models	Alice Plebe et.al.	2505.13302	link
2025-05-19	TimeSeriesGym: A Scalable Benchmark for (Time Series) Machine Learning Engineering Agents	Yifu Cai et.al.	2505.13291	link
2025-05-19	Hybrid Voting-Based Task Assignment in Modular Construction Scenarios	Daniel Weiner et.al.	2505.13278	null
2025-05-19	CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement Learning	Lei Sheng et.al.	2505.13271	link
2025-05-19	Distilling a speech and music encoder with task arithmetic	Fabian Ritter-Gutierrez et.al.	2505.13270	null
2025-05-19	Are requirements really all you need? A case study of LLM-driven configuration code generation for automotive simulations	Krzysztof Lebioda et.al.	2505.13263	null
2025-05-19	From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery	Tianshi Zheng et.al.	2505.13259	link
2025-05-19	Effective and Transparent RAG: Adaptive-Reward Reinforcement Learning for Decision Traceability	Jingyi Ren et.al.	2505.13258	link
2025-05-19	Policy Contrastive Decoding for Robotic Foundation Models	Shihan Wu et.al.	2505.13255	link
2025-05-19	HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding	Siran Liu et.al.	2505.13254	null
2025-05-19	RN-F: A Novel Approach for Mitigating Contaminated Data in Large Language Models	Le Vu Anh et.al.	2505.13249	link
2025-05-19	JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models	Jieying Xue et.al.	2505.13244	link
2025-05-19	Conformalized Decision Risk Assessment	Wenbin Zhou et.al.	2505.13243	null
2025-05-19	SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information	Chih-Kai Yang et.al.	2505.13237	link
2025-05-19	From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection	Lincan Cai et.al.	2505.13233	link
2025-05-19	Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis	Tianbao Xie et.al.	2505.13227	null
2025-05-19	SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science	Jie Ying et.al.	2505.13220	link
2025-05-19	Diffusion Models with Double Guidance: Generate with aggregated datasets	Yanfeng Yang et.al.	2505.13213	null
2025-05-19	Quantum Knowledge Distillation for Large Language Models	Lingxiao Li et.al.	2505.13205	null
2025-05-19	Alignment-Augmented Speculative Decoding with Alignment Sampling and Conditional Verification	Jikai Wang et.al.	2505.13204	null
2025-05-19	A Physics-Inspired Optimizer: Velocity Regularized Adam	Pranav Vaidhyanathan et.al.	2505.13196	null
2025-05-19	Adversarial Testing in LLMs: Insights into Decision-Making Vulnerabilities	Lili Zhang et.al.	2505.13195	null
2025-05-19	True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics	Christoph Jürgen Hemmer et.al.	2505.13192	null
2025-05-19	A Malliavin-Gamma calculus approach to Score Based Diffusion Generative models for random fields	Giacomo Greco et.al.	2505.13189	null
2025-05-19	ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models	Matteo Merler et.al.	2505.13180	link
2025-05-19	ToolSpectrum: Towards Personalized Tool Utilization for Large Language Models	Zihao Cheng et.al.	2505.13176	null
2025-05-19	Enhancing LLMs for Time Series Forecasting via Structure-Guided Cross-Modal Alignment	Siming Sun et.al.	2505.13175	null
2025-05-19	A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs	V. S. D. S. Mahesh Akavarapu et.al.	2505.13173	link
2025-05-19	Positional Fragility in LLMs: How Offset Effects Reshape Our Understanding of Memorization Risks	Yixuan Xu et.al.	2505.13171	null
2025-05-19	Role-Playing Evaluation for Large Language Models	Yassine El Boudouri et.al.	2505.13157	link
2025-05-19	Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice	Zhi Liu et.al.	2505.13156	null
2025-05-19	Zero-Shot Adaptation of Behavioral Foundation Models to Unseen Dynamics	Maksim Bobrin et.al.	2505.13150	link
2025-05-20	What if Deception Cannot be Detected? A Cross-Linguistic Study on the Limits of Deception Detection from Text	Aswathy Velutharambath et.al.	2505.13147	null
2025-05-19	Auditing Meta-Cognitive Hallucinations in Reasoning Large Language Models	Haolang Lu et.al.	2505.13143	null
2025-05-19	Understanding Cross-Lingual Inconsistency in Large Language Models	Zheng Wei Lim et.al.	2505.13141	null
2025-05-19	CacheFlow: Fast Human Motion Prediction by Cached Normalizing Flow	Takahiro Maeda et.al.	2505.13140	null
2025-05-19	Optimizing Retrieval Augmented Generation for Object Constraint Language	Kevin Chenhao Li et.al.	2505.13129	null
2025-05-19	Benchmarking and Confidence Evaluation of LALMs For Temporal Reasoning	Debarpan Bhattacharya et.al.	2505.13115	link
2025-05-19	Why Knowledge Distillation Works in Generative Models: A Minimal Working Explanation	Sungmin Cha et.al.	2505.13111	null
2025-05-19	FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference	Guangda Liu et.al.	2505.13109	null
2025-05-19	Fixing 7,400 Bugs for 1$: Cheap Crash-Site Program Repair	Han Zheng et.al.	2505.13103	null
2025-05-20	Industrial Synthetic Segment Pre-training	Shinichi Mae et.al.	2505.13099	null
2025-05-19	LLM-KG-Bench 3.0: A Compass for SemanticTechnology Capabilities in the Ocean of LLMs	Lars-Peter Meyer et.al.	2505.13098	link
2025-05-19	The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation	David Stap et.al.	2505.13090	null
2025-05-19	Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning	Xiaoyu Yang et.al.	2505.13081	null
2025-05-19	The Hidden Dangers of Browsing AI Agents	Mykyta Mudryi et.al.	2505.13076	null
2025-05-19	Structure-Aware Corpus Construction and User-Perception-Aligned Metrics for Large-Language-Model Code Completion	Dengfeng Liu et.al.	2505.13073	null
2025-05-19	Hearing from Silence: Reasoning Audio Descriptions from Silent Videos via Vision-Language Model	Yong Ren et.al.	2505.13062	null
2025-05-19	Automatic mixed precision for optimizing gained time with constrained loss mean-squared-error based on model partition to sequential sub-graphs	Shmulik Markovich-Golan et.al.	2505.13060	null
2025-05-19	CAIM: Development and Evaluation of a Cognitive AI Memory Framework for Long-Term Interaction with Intelligent Agents	Rebecca Westhäußer et.al.	2505.13044	null
2025-05-19	KIT’s Offline Speech Translation and Instruction Following Submission for IWSLT 2025	Sai Koneru et.al.	2505.13036	null
2025-05-19	MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their Mix	Ziyang Ma et.al.	2505.13032	link
2025-05-19	MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO	Yicheng Xiao et.al.	2505.13031	link
2025-05-19	MDDM: A Multi-view Discriminative Enhanced Diffusion-based Model for Speech Enhancement	Nan Xu et.al.	2505.13029	null
2025-05-20	Evaluating the efficacy of LLM Safety Solutions: The Palit Benchmark Dataset	Sayon Palit et.al.	2505.13028	null
2025-05-19	Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs	Jack Chen et.al.	2505.13026	null
2025-05-19	Unveiling and Steering Connectome Organization with Interpretable Latent Variables	Yubin Li et.al.	2505.13011	null
2025-05-19	Generative Modeling of Random Fields from Limited Data via Constrained Latent Flow Matching	James E. Warner et.al.	2505.13007	link
2025-05-19	Fractured Chain-of-Thought Reasoning	Baohao Liao et.al.	2505.12992	null
2025-05-19	An Empirical Study of Many-to-Many Summarization with Large Language Models	Jiaan Wang et.al.	2505.12983	null
2025-05-20	From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents	Liangxuan Wu et.al.	2505.12981	null
2025-05-19	A Structured Literature Review on Traditional Approaches in Current Natural Language Processing	Robin Jegan et.al.	2505.12970	null
2025-05-19	MA-COIR: Leveraging Semantic Search Index and Generative Models for Ontology-Driven Biomedical Concept Recognition	Shanshan Liu et.al.	2505.12964	link
2025-05-19	DGRO: Enhancing LLM Reasoning via Exploration-Exploitation Control and Reward Variance Management	Xuerui Su et.al.	2505.12951	null
2025-05-19	GuRE: Generative Query REwriter for Legal Passage Retrieval	Daehee Kim et.al.	2505.12950	link
2025-05-19	A3: an Analytical Low-Rank Approximation Framework for Attention	Jeffrey T. H. Wong et.al.	2505.12942	null
2025-05-19	Leveraging LLM Inconsistency to Boost Pass@k Performance	Uri Dalal et.al.	2505.12938	null
2025-05-19	Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs	Zhihe Yang et.al.	2505.12929	link
2025-05-19	CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming	Han Deng et.al.	2505.12925	link
2025-05-19	The Traitors: Deception and Trust in Multi-Agent Language Model Simulations	Pedro M. P. Curvo et.al.	2505.12923	link
2025-05-19	Sinusoidal Initialization, Time for a New Start	Alberto Fernández-Hernández et.al.	2505.12909	null
2025-05-19	AutoGEEval: A Multimodal and Automated Framework for Geospatial Code Generation on GEE with Large Language Models	Shuyang Hou et.al.	2505.12900	null
2025-05-19	On the Thinking-Language Modeling Gap in Large Language Models	Chenxi Liu et.al.	2505.12896	null
2025-05-16	QVGen: Pushing the Limit of Quantized Video Generative Models	Yushi Huang et.al.	2505.11497	null
2025-05-16	msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML	Zhaolan Huang et.al.	2505.11483	link
2025-05-16	Improving Assembly Code Performance with Large Language Models via Reinforcement Learning	Anjiang Wei et.al.	2505.11480	null
2025-05-16	HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages	Zhilin Wang et.al.	2505.11475	null
2025-05-16	Disentangling Reasoning and Knowledge in Medical Large Language Models	Rahul Thapa et.al.	2505.11462	null
2025-05-16	ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks	Zhixiong Zhuang et.al.	2505.11459	null
2025-05-16	LLMs unlock new paths to monetizing exploits	Nicholas Carlini et.al.	2505.11449	null
2025-05-16	Is Compression Really Linear with Code Intelligence?	Xianzhen Luo et.al.	2505.11441	null
2025-05-16	GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art	Chenkai Zhang et.al.	2505.11436	link
2025-05-16	MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production	Chao Jin et.al.	2505.11432	null
2025-05-16	When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs	Xiaomin Li et.al.	2505.11423	null
2025-05-16	EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions	Patryk Bartkowiak et.al.	2505.11417	link
2025-05-16	MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems	Yinsicheng Jiang et.al.	2505.11415	null
2025-05-16	CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs	Sijia Chen et.al.	2505.11413	null
2025-05-16	Visual Planning: Let’s Think Only with Images	Yi Xu et.al.	2505.11409	link
2025-05-16	Large Language Model Use Impact Locus of Control	Jenny Xiyu Fu et.al.	2505.11406	null
2025-05-16	EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models	Bohao Xing et.al.	2505.11405	link
2025-05-16	GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents	Lingxiao Diao et.al.	2505.11368	null
2025-05-16	Phare: A Safety Probe for Large Language Models	Pierre Le Jeune et.al.	2505.11365	link
2025-05-16	LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors	Rao Ma et.al.	2505.11352	null
2025-05-16	Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning	Yuanzhao Zhang et.al.	2505.11349	null
2025-05-16	Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models	Blanca Calvo Figueras et.al.	2505.11341	null
2025-05-16	XtraGPT: LLMs for Human-AI Collaboration on Controllable Academic Paper Revision	Nuo Chen et.al.	2505.11336	null
2025-05-16	TokenWeave: Efficient Compute-Communication Overlap for Distributed LLM Inference	Raja Gond et.al.	2505.11329	link
2025-05-16	Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors	Thomas Nagler et.al.	2505.11325	null
2025-05-16	A Fourier Space Perspective on Diffusion Models	Fabian Falck et.al.	2505.11278	null
2025-05-16	Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs	Yaorui Shi et.al.	2505.11277	link
2025-05-16	TCC-Bench: Benchmarking the Traditional Chinese Culture Understanding Capabilities of MLLMs	Pengju Xu et.al.	2505.11275	link
2025-05-16	Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models	Camille Couturier et.al.	2505.11271	null
2025-05-16	TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes	Chao Zhang et.al.	2505.11270	null
2025-05-16	DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models	Giulia Bertazzini et.al.	2505.11257	null
2025-05-16	LD-Scene: LLM-Guided Diffusion for Controllable Generation of Adversarial Safety-Critical Driving Scenarios	Mingxing Peng et.al.	2505.11247	null
2025-05-16	Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification	Wenhao Qian et.al.	2505.11237	link
2025-05-16	Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs	Zhangying Feng et.al.	2505.11227	null
2025-05-16	HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization	Chengyu Huang et.al.	2505.11225	link
2025-05-16	Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation	Donghoon Lee et.al.	2505.11221	link
2025-05-16	Unveiling the Potential of Vision-Language-Action Models with Open-Ended Multimodal Instructions	Wei Zhao et.al.	2505.11214	null
2025-05-16	Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese	Xihuai Wang et.al.	2505.11200	null
2025-05-16	Multi-Modal Multi-Task (M3T) Federated Foundation Models for Embodied AI: Potentials and Challenges for Edge Integration	Kasra Borazjani et.al.	2505.11191	null
2025-05-16	Can Global XAI Methods Reveal Injected Bias in LLMs? SHAP vs Rule Extraction vs RuleSHAP	Francesco Sovrano et.al.	2505.11189	link
2025-05-16	On Next-Token Prediction in LLMs: How End Goals Determine the Consistency of Decoding Algorithms	Jacob Trauger et.al.	2505.11183	null
2025-05-16	Feasibility with Language Models for Open-World Compositional Zero-Shot Learning	Jae Myung Kim et.al.	2505.11181	null
2025-05-16	mmRAG: A Modular Benchmark for Retrieval-Augmented Generation over Text, Tables, and Knowledge Graphs	Chuan Xu et.al.	2505.11180	link
2025-05-16	Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline	Hrishit Madhavi et.al.	2505.11177	link
2025-05-16	Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training	Myeonghwan Ahn et.al.	2505.11170	null
2025-05-16	SoLoPO: Unlocking Long-Context Capabilities in LLMs via Short-to-Long Preference Optimization	Huashan Sun et.al.	2505.11166	null
2025-05-16	Foundation Time-Series AI Model for Realized Volatility Forecasting	Anubha Goel et.al.	2505.11163	null
2025-05-16	Diffusion Model in Hyperspectral Image Processing and Analysis: A Review	Xing Hu et.al.	2505.11158	null
2025-05-16	MPMA: Preference Manipulation Attack Against Model Context Protocol	Zihan Wang et.al.	2505.11154	null
2025-05-16	Human-Aligned Bench: Fine-Grained Assessment of Reasoning Ability in MLLMs vs. Humans	Yansheng Qiu et.al.	2505.11141	null
2025-05-16	Scaling Reasoning can Improve Factuality in Large Language Models	Mike Zhang et.al.	2505.11140	link
2025-05-16	PhiNet v2: A Mask-Free Brain-Inspired Vision Foundation Model from Video	Makoto Yamada et.al.	2505.11129	link
2025-05-16	Risk theory in a finite customer-pool setting	Michel Mandjes et.al.	2505.11127	link
2025-05-16	GraphOracle: A Foundation Model for Knowledge Graph Reasoning	Enjun Du et.al.	2505.11125	null
2025-05-16	Navigating the Alpha Jungle: An LLM-Powered MCTS Framework for Formulaic Factor Mining	Yu Shi et.al.	2505.11122	null
2025-05-16	Redundancy-Aware Pretraining of Vision-Language Foundation Models in Remote Sensing	Mathis Jürgen Adler et.al.	2505.11121	null
2025-05-16	Deepfake Forensic Analysis: Source Dataset Attribution and Legal Implications of Synthetic Media Manipulation	Massimiliano Cassia et.al.	2505.11110	null
2025-05-16	MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark	Florinel-Alin Croitoru et.al.	2505.11109	null
2025-05-16	Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity	Chan-Jan Hsu et.al.	2505.11107	null
2025-05-16	Towards Better Evaluation for Generated Patent Claims	Lekang Jiang et.al.	2505.11095	link
2025-05-16	ShiQ: Bringing back Bellman to LLMs	Pierre Clavier et.al.	2505.11081	null
2025-05-16	$\mathcal{A}LLM4ADD$: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection	Hao Gu et.al.	2505.11079	null
2025-05-16	LLM-Enhanced Symbolic Control for Safety-Critical Applications	Amir Bayat et.al.	2505.11077	null
2025-05-16	Addition is almost all you need: Compressing neural networks with double binary factorization	Vladimír Boža et.al.	2505.11076	link
2025-05-16	Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment Benchmarking	Changlun Li et.al.	2505.11065	link
2025-05-16	Conceptual framework for the application of deep neural networks to surface composition reconstruction from Mercury’s exospheric data	Adrian Kazakov et.al.	2505.11053	null
2025-05-16	OntoURL: A Benchmark for Evaluating Large Language Models on Symbolic Ontological Understanding, Reasoning and Learning	Xiao Zhang et.al.	2505.11031	link
2025-05-16	Exploiting the Asymmetric Uncertainty Structure of Pre-trained VLMs on the Unit Hypersphere	Li Ju et.al.	2505.11029	null
2025-05-16	Logo-LLM: Local and Global Modeling with Large Language Models for Time Series Forecasting	Wenjie Ou et.al.	2505.11017	link
2025-05-16	WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?	An-Lan Wang et.al.	2505.11015	null
2025-05-16	Humans expect rationality and cooperation from LLM opponents in strategic games	Darija Barak et.al.	2505.11011	null
2025-05-16	Review-Instruct: A Review-Driven Multi-Turn Conversations Generation Method for Large Language Models	Jiangxu Wu et.al.	2505.11010	null
2025-05-16	Space Group Equivariant Crystal Diffusion	Rees Chang et.al.	2505.10994	null
2025-05-16	Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges	Yuan Zhang et.al.	2505.10993	null
2025-05-16	ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks	Feiran You et.al.	2505.10992	null
2025-05-16	GenoArmory: A Unified Evaluation Framework for Adversarial Attacks on Genomic Foundation Models	Haozheng Luo et.al.	2505.10983	link
2025-05-16	Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory	Yexiang Liu et.al.	2505.10981	link
2025-05-16	Group-in-Group Policy Optimization for LLM Agent Training	Lang Feng et.al.	2505.10978	link
2025-05-16	Can Large Language Models Correctly Interpret Equations with Errors?	Lachlan McGinness et.al.	2505.10966	null
2025-05-16	MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation	Zhenwen Liang et.al.	2505.10962	null
2025-05-16	SubGCache: Accelerating Graph-based RAG with Subgraph-level KV Cache	Qiuyu Zhu et.al.	2505.10951	null
2025-05-16	Shackled Dancing: A Bit-Locked Diffusion Algorithm for Lossless and Controllable Image Steganography	Tianshuo Zhang et.al.	2505.10950	null
2025-05-16	The Way We Prompt: Conceptual Blending, Neural Dynamics, and Prompt-Induced Transitions in LLMs	Makoto Sato et.al.	2505.10948	null
2025-05-16	ToDMA: Large Model-Driven Token-Domain Multiple Access for Semantic Communications	Li Qiao et.al.	2505.10946	null
2025-05-16	Semantic Aware Linear Transfer by Recycling Pre-trained Language Models for Cross-lingual Transfer	Seungyoon Lee et.al.	2505.10945	null
2025-05-16	Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation	Qing Yu et.al.	2505.10940	null
2025-05-16	GenKnowSub: Improving Modularity and Reusability of LLMs through General Knowledge Subtraction	Mohammadtaha Bagherifard et.al.	2505.10939	link
2025-05-16	Accurate KV Cache Quantization with Outlier Tokens Tracing	Yi Su et.al.	2505.10938	link
2025-05-16	Connecting the Dots: A Chain-of-Collaboration Prompting Framework for LLM Agents	Jiaxing Zhao et.al.	2505.10936	null
2025-05-16	Physics-informed Temporal Alignment for Auto-regressive PDE Foundation Models	Congcong Zhu et.al.	2505.10930	link
2025-05-16	Vaiage: A Multi-Agent Solution to Personalized Travel Planning	Binwen Liu et.al.	2505.10922	null
2025-05-16	A Physics-Informed Convolutional Long Short Term Memory Statistical Model for Fluid Thermodynamics Simulations	Luca Menicali et.al.	2505.10919	link
2025-05-16	VISTA: Enhancing Vision-Text Alignment in MLLMs via Cross-Modal Mutual Information Maximization	Mingxiao Li et.al.	2505.10917	null
2025-05-16	Explain What You Mean: Intent Augmented Knowledge Graph Recommender Built With LLM	Wenqing Zheng et.al.	2505.10900	null
2025-05-16	Multi-Objective Preference Optimization: Improving Human Alignment of Generative Models	Akhil Agnihotri et.al.	2505.10892	null
2025-05-16	Approximation and Generalization Abilities of Score-based Neural Network Generative Models for Sub-Gaussian Distributions	Guoji Fu et.al.	2505.10880	null
2025-05-16	A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision	Alexey Magay et.al.	2505.10875	null
2025-05-16	REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning?	Chenxi Jiang et.al.	2505.10872	null
2025-05-16	Improve Rule Retrieval and Reasoning with Self-Induction and Relevance ReEstimate	Ziyang Huang et.al.	2505.10870	null
2025-05-16	Have Multimodal Large Language Models (MLLMs) Really Learned to Tell the Time on Analog Clocks?	Tairan Fu et.al.	2505.10862	null
2025-05-15	End-to-End Vision Tokenizer Tuning	Wenxuan Wang et.al.	2505.10562	null
2025-05-15	T2A-Feedback: Improving Basic Capabilities of Text-to-Audio Generation via Fine-grained AI Feedback	Zehan Wang et.al.	2505.10561	null
2025-05-15	Neural Thermodynamic Laws for Large Language Model Training	Ziming Liu et.al.	2505.10559	null
2025-05-15	Flowing Through Hilbert Space: Quantum-Enhanced Generative Models for Lattice Field Theory	Jehu Martinez et.al.	2505.10553	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning	Milan Ganai et.al.	2505.10547	null
2025-05-15	Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models	Annie Wong et.al.	2505.10543	link
2025-05-15	Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis	Pengfei Wang et.al.	2505.10541	link
2025-05-15	S3C2 Summit 2024-09: Industry Secure Software Supply Chain Summit	Imranur Rahman et.al.	2505.10538	null
2025-05-15	CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs	Raman Dutt et.al.	2505.10496	link
2025-05-15	RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs	Vibha Belavadi et.al.	2505.10495	null
2025-05-15	Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective	Yutao Mou et.al.	2505.10494	link
2025-05-15	CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning	Shaohan Wang et.al.	2505.10493	null
2025-05-15	Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns	Leon Hannig et.al.	2505.10490	null
2025-05-15	UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation	Yi Li et.al.	2505.10483	null
2025-05-15	Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI	Agnik Saha et.al.	2505.10472	null
2025-05-15	AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges	Ranjan Sapkota et.al.	2505.10468	null
2025-05-15	Superposition Yields Robust Neural Scaling	Yizhou Liu et.al.	2505.10465	link
2025-05-15	Are Large Language Models Robust in Understanding Code Against Semantics-Preserving Mutations?	Pedro Orvalho et.al.	2505.10443	null
2025-05-15	Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs	Jingyao Wang et.al.	2505.10425	null
2025-05-15	Hierarchical Document Refinement for Long-context Retrieval-augmented Generation	Jiajie Jin et.al.	2505.10413	link
2025-05-15	Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation	Yue Guo et.al.	2505.10409	null
2025-05-15	Two-Stage Generative Model for Intracranial Aneurysm Meshes with Morphological Marker Conditioning	Wenhao Ding et.al.	2505.10407	link
2025-05-15	Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding	Jianhao Huang et.al.	2505.10405	null
2025-05-15	Multi-domain Multilingual Sentiment Analysis in Industry: Predicting Aspect-based Opinion Quadruples	Benjamin White et.al.	2505.10389	null
2025-05-15	Are Sparse Autoencoders Useful for Java Function Bug Detection?	Rui Melo et.al.	2505.10375	link
2025-05-15	FactsR: A Safer Method for Producing High Quality Healthcare Documentation	Victor Petrén Bach Hansen et.al.	2505.10360	null
2025-05-15	NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning	Le Shi et.al.	2505.10359	null
2025-05-16	LDIR: Low-Dimensional Dense and Interpretable Text Embeddings with Relative Representations	Yile Wang et.al.	2505.10354	link
2025-05-15	Non-Markovian dynamics with a driven three-level giant atom in a semi-infinite photonic waveguide	S. J. Sun et.al.	2505.10340	null
2025-05-15	AutoPentest: Enhancing Vulnerability Management With Autonomous LLM Agents	Julius Henke et.al.	2505.10321	link
2025-05-15	One For All: Formally Verifying Protocols which use Aggregate Signatures (extended version)	Xenia Hofmeier et.al.	2505.10316	null
2025-05-15	Empirically evaluating commonsense intelligence in large language models with large-scale human judgments	Tuan Dung Nguyen et.al.	2505.10309	null
2025-05-15	MIPHEI-ViT: Multiplex Immunofluorescence Prediction from H&E Images using ViT Foundation Models	Guillaume Balezo et.al.	2505.10294	link
2025-05-15	From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making	Dubai Li et.al.	2505.10282	link
2025-05-15	The Evolving Landscape of Generative Large Language Models and Traditional Natural Language Processing in Medicine	Rui Yang et.al.	2505.10261	null
2025-05-15	Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data	Poli Apollinaire Nemkova et.al.	2505.10260	link
2025-05-15	Towards Safe Robot Foundation Models Using Inductive Biases	Maximilian Tölle et.al.	2505.10219	null
2025-05-15	Informed Forecasting: Leveraging Auxiliary Knowledge to Boost LLM Performance on Time Series Forecasting	Mohammadmahdi Ghasemloo et.al.	2505.10213	null
2025-05-15	Do LLMs Memorize Recommendation Datasets? A Preliminary Study on MovieLens-1M	Dario Di Palma et.al.	2505.10212	link
2025-05-15	VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits	Jintian Shao et.al.	2505.10202	null
2025-05-15	The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think	Seongyun Lee et.al.	2505.10185	null
2025-05-15	Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning	Yoichi Ishibashi et.al.	2505.10182	null
2025-05-15	GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs	Longchao Da et.al.	2505.10143	null
2025-05-15	Large Wireless Localization Model (LWLM): A Foundation Model for Positioning in 6G Networks	Guangjin Pan et.al.	2505.10134	link
2025-05-15	Learning Virtual Machine Scheduling in Cloud Computing through Language Agents	JieHao Wu et.al.	2505.10117	null
2025-05-15	What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs	Xinlan Yan et.al.	2505.10113	null
2025-05-15	EmbodiedMAE: A Unified 3D Multi-Modal Representation for Robot Manipulation	Zibin Dong et.al.	2505.10105	null
2025-05-15	From Text to Network: Constructing a Knowledge Graph of Taiwan-Based China Studies Using Generative AI	Hsuan-Lei Shao et.al.	2505.10093	null
2025-05-15	ChronoSteer: Bridging Large Language Model and Time Series Foundation Model via Synthetic Data	Chengsen Wang et.al.	2505.10083	null
2025-05-16	Leveraging Graph Retrieval-Augmented Generation to Support Learners’ Understanding of Knowledge Concepts in MOOCs	Mohamed Abdelmagied et.al.	2505.10074	null
2025-05-15	Dark LLMs: The Growing Threat of Unaligned AI Models	Michael Fire et.al.	2505.10066	null
2025-05-15	CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability	Han Peng et.al.	2505.10063	null
2025-05-15	Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis	Bingda Tang et.al.	2505.10046	link
2025-05-15	DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs	Lake Yin et.al.	2505.10013	null
2025-05-15	ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts	Jing-Cheng Pang et.al.	2505.10010	link
2025-05-15	SVA-ICL: Improving LLM-based Software Vulnerability Assessment via In-Context Learning and Information Fusion	Chaoyang Gao et.al.	2505.10008	link
2025-05-15	AI2MMUM: AI-AI Oriented Multi-Modal Universal Model Leveraging Telecom Domain Large Model	Tianyu Jiao et.al.	2505.10003	null
2025-05-15	ServeGen: Workload Characterization and Generation of Large Language Model Serving in Production	Yuxing Xiang et.al.	2505.09999	link
2025-05-15	Physical regularized Hierarchical Generative Model for Metallic Glass Structural Generation and Energy Prediction	Qiyuan Chen et.al.	2505.09977	null
2025-05-15	Analysing Safety Risks in LLMs Fine-Tuned with Pseudo-Malicious Cyber Security Data	Adel ElZemity et.al.	2505.09974	null
2025-05-15	Pre-Act: Multi-Step Planning and Reasoning Improves Acting in LLM Agents	Mrinal Rawat et.al.	2505.09970	null
2025-05-15	Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors	Ahmed S. Abdelrahman et.al.	2505.09949	null
2025-05-15	Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph	Deeksha Prahlad et.al.	2505.09945	link
2025-05-15	Design and Evaluation of Generative Agent-based Platform for Human-Assistant Interaction Research: A Tale of 10 User Studies	Ziyi Xuan et.al.	2505.09938	null
2025-05-15	CartoAgent: a multimodal large language model-powered multi-agent cartographic framework for map style transfer and evaluation	Chenglong Wang et.al.	2505.09936	null
2025-05-15	Rethinking Prompt Optimizers: From Prompt Merits to Optimization	Zixiao Zhu et.al.	2505.09930	link
2025-05-15	Reinforced Interactive Continual Learning via Real-time Noisy Human Feedback	Yutao Yang et.al.	2505.09925	null
2025-05-16	From Trade-off to Synergy: A Versatile Symbiotic Watermarking Framework for Large Language Models	Yidan Wang et.al.	2505.09924	link
2025-05-15	Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity	Zichen Liu et.al.	2505.09922	null
2025-05-16	PIG: Privacy Jailbreak Attack on LLMs via Gradient-based Iterative In-Context Optimization	Yidan Wang et.al.	2505.09921	link
2025-05-15	UICopilot: Automating UI Synthesis via Hierarchical Code Generation from Webpage Designs	Yi Gui et.al.	2505.09904	link
2025-05-15	Crossing Borders Without Crossing Boundaries: How Sociolinguistic Awareness Can Optimize User Engagement with Localized Spanish AI Models Across Hispanophone Countries	Martin Capdevila et.al.	2505.09902	null
2025-05-15	Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit Tasks	Ziyuan Zhang et.al.	2505.09901	link
2025-05-16	Characterizing Unintended Consequences in Human-GUI Agent Collaboration for Web Browsing	Shuning Zhang et.al.	2505.09875	null
2025-05-14	Do Large Language Models Know Conflict? Investigating Parametric vs. Non-Parametric Knowledge of LLMs for Conflict Forecasting	Apollinaire Poli Nemkova et.al.	2505.09852	null
2025-05-14	Evaluating Large Language Models for the Generation of Unit Tests with Equivalence Partitions and Boundary Values	Martín Rodríguez et.al.	2505.09830	null
2025-05-14	KRISTEVA: Close Reading as a Novel Task for Benchmarking Interpretive Reasoning	Peiqi Sui et.al.	2505.09825	null
2025-05-14	Adversarial Attack on Large Language Models using Exponentiated Gradient Descent	Sajib Biswas et.al.	2505.09820	link
2025-05-14	Lossless Compression for LLM Tensor Incremental Snapshots	Daniel Waddington et.al.	2505.09810	null
2025-05-14	Contextual Phenotyping of Pediatric Sepsis Cohort Using Large Language Models	Aditya Nagori et.al.	2505.09805	null
2025-05-14	A Multimodal Multi-Agent Framework for Radiology Report Generation	Ziruo Yi et.al.	2505.09787	null
2025-05-14	Regularized Operator Extrapolation Method For Stochastic Bilevel Variational Inequality Problems	Mohammad Khalafi et.al.	2505.09778	null
2025-05-14	A Survey on Large Language Models in Multimodal Recommender Systems	Alejo Lopez-Avila et.al.	2505.09777	null
2025-05-14	Self-Consuming Generative Models with Adversarially Curated Data	Xiukun Wei et.al.	2505.09768	null
2025-05-14	Trustless Autonomy: Understanding Motivations, Benefits and Governance Dilemma in Self-Sovereign Decentralized AI Agents	Botao Amber Hu et.al.	2505.09757	null
2025-05-14	FAS-LLM: Large Language Model-Based Channel Prediction for OTFS-Enabled Satellite-FAS Links	Halvin Yang et.al.	2505.09751	null
2025-05-14	VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts	Xin Liu et.al.	2505.09701	null
2025-05-14	EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models	Hu Yue et.al.	2505.09694	link
2025-05-14	System Prompt Optimization with Meta-Learning	Yumin Choi et.al.	2505.09666	null
2025-05-16	Tales of the 2025 Los Angeles Fire: Hotwash for Public Health Concerns in Reddit via LLM-Enhanced Topic Modeling	Sulong Zhou et.al.	2505.09665	null
2025-05-14	Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors	Nicolas Dupuis et.al.	2505.09610	null
2025-05-14	Adversarial Suffix Filtering: a Defense Pipeline for LLMs	David Khachaturov et.al.	2505.09602	null
2025-05-15	How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference	Nidhal Jegham et.al.	2505.09598	null
2025-05-14	WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models	Abdullah Mushtaq et.al.	2505.09595	null
2025-05-15	Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media	Yuchen Wu et.al.	2505.09583	null
2025-05-14	Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach	Shannon Lodoen et.al.	2505.09576	null
2025-05-14	MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8	Linbo Liu et.al.	2505.09569	link
2025-05-14	Using Foundation Models as Pseudo-Label Generators for Pre-Clinical 4D Cardiac CT Segmentation	Anne-Marie Rickmann et.al.	2505.09564	null
2025-05-14	PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning	Zongqian Li et.al.	2505.09519	link
2025-05-15	Towards Fair In-Context Learning with Tabular Foundation Models	Patrik Kenfack et.al.	2505.09503	null
2025-05-14	Layered Unlearning for Adversarial Relearning	Timothy Qian et.al.	2505.09500	link
2025-05-14	Card Sorting Simulator: Augmenting Design of Logical Information Architectures with Large Language Models	Eduard Kuric et.al.	2505.09478	null
2025-05-14	Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities	Zachary Ravichandran et.al.	2505.09477	null
2025-05-14	Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment	Paul Tschisgale et.al.	2505.09438	null
2025-05-14	CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios	Raghav Garg et.al.	2505.09436	link
2025-05-14	Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records	Yili He et.al.	2505.09435	null
2025-05-15	SafePath: Conformal Prediction for Safe LLM-Based Autonomous Navigation	Achref Doula et.al.	2505.09427	null
2025-05-14	FaceShield: Explainable Face Anti-Spoofing with Multimodal Large Language Models	Hongyang Wang et.al.	2505.09415	null
2025-05-14	The Influence of Human-inspired Agentic Sophistication in LLM-driven Strategic Reasoners	Vince Trencsenyi et.al.	2505.09396	null
2025-05-14	Quantum-Enhanced Parameter-Efficient Learning for Typhoon Trajectory Forecasting	Chen-Yu Liu et.al.	2505.09395	null
2025-05-14	Qwen3 Technical Report	An Yang et.al.	2505.09388	link
2025-05-14	MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment	Siyuan Yan et.al.	2505.09372	link
2025-05-14	Efficient Modelling of Lyman-α opacity fluctuations during late EoR	Barun Maity et.al.	2505.09369	null
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358	link
2025-05-14	Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures	Chenggang Zhao et.al.	2505.09343	null
2025-05-14	Access Controls Will Solve the Dual-Use Dilemma	Evžen Wybitul et.al.	2505.09341	null
2025-05-14	RAG-Enabled Intent Reasoning for Application-Network Interaction	Salwa Mostafa et.al.	2505.09339	link
2025-05-14	BioVFM-21M: Benchmarking and Scaling Self-Supervised Vision Foundation Models for Biomedical Image Analysis	Jiarun Liu et.al.	2505.09329	null
2025-05-14	Statistical Modeling and Uncertainty Estimation of LLM Inference Systems	Kaustabha Ray et.al.	2505.09319	null
2025-05-14	Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging	Hongjin Qian et.al.	2505.09316	null
2025-05-14	Reproducibility Study of “Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents”	Pedro M. P. Curvo et.al.	2505.09289	link
2025-05-14	A Scalable Unsupervised Framework for multi-aspect labeling of Multilingual and Multi-Domain Review Data	Jiin Park et.al.	2505.09286	null
2025-05-14	Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations	Panqi Chen et.al.	2505.09284	null
2025-05-14	A Note on Semantic Diffusion	Alexander P. Ryjov et.al.	2505.09283	null
2025-05-14	Recent Advances in Medical Imaging Segmentation: A Survey	Fares Bougourzi et.al.	2505.09274	link
2025-05-14	MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning	Bin-Bin Gao et.al.	2505.09265	null
2025-05-14	Few-Shot Anomaly-Driven Generation for Anomaly Classification and Segmentation	Guan Gui et.al.	2505.09263	link
2025-05-14	Instantiating Standards: Enabling Standard-Driven Text TTP Extraction with Evolvable Memory	Cheng Meng et.al.	2505.09261	null
2025-05-14	Zero-Shot Multi-modal Large Language Model v.s. Supervised Deep Learning: A Comparative Study on CT-Based Intracranial Hemorrhage Subtyping	Yinuo Wang et.al.	2505.09252	link
2025-05-14	Focus, Merge, Rank: Improved Question Answering Based on Semi-structured Knowledge Bases	Derian Boer et.al.	2505.09246	link
2025-05-14	InvDesFlow-AL: Active Learning-based Workflow for Inverse Design of Functional Materials	Xiao-Qi Han et.al.	2505.09203	link
2025-05-15	UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System	Yitao Zhu et.al.	2505.09178	null
2025-05-14	A Multi-Task Foundation Model for Wireless Channel Representation Using Contrastive and Masked Autoencoder Learning	Berkay Guler et.al.	2505.09160	null
2025-05-14	AMSnet 2.0: A Large AMS Database with AI Segmentation for Net Detection	Yichen Shi et.al.	2505.09155	null
2025-05-14	ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor	Seungbeom Choi et.al.	2505.09142	null
2025-05-14	Sensing-Assisted Channel Prediction in Complex Wireless Environments: An LLM-Based Approach	Junjie He et.al.	2505.09141	null
2025-05-14	Beyond General Prompts: Automated Prompt Refinement using Contrastive Class Alignment Scores for Disambiguating Objects in Vision-Language Models	Lucas Choi et.al.	2505.09139	null
2025-05-14	FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis	Yuxing Chen et.al.	2505.09109	null
2025-05-14	Air-Ground Collaboration for Language-Specified Missions in Unknown Environments	Fernando Cladera et.al.	2505.09108	null
2025-05-14	Ornithologist: Towards Trustworthy “Reasoning” about Central Bank Communications	Dominic Zaun Eu Jones et.al.	2505.09083	null
2025-05-14	CEC-Zero: Chinese Error Correction Solution Based on LLM	Sophie Zhang et.al.	2505.09082	null
2025-05-14	S-DAT: A Multilingual, GenAI-Driven Framework for Automated Divergent Thinking Assessment	Jennifer Haase et.al.	2505.09068	null
2025-05-14	Variational Prefix Tuning for Diverse and Accurate Code Summarization Using Pre-trained Language Models	Junda Zhao et.al.	2505.09062	link
2025-05-14	A Comprehensive Analysis of Large Language Model Outputs: Similarity, Diversity, and Bias	Brandon Smith et.al.	2505.09056	null
2025-05-14	Atomic Consistency Preference Optimization for Long-Form Question Answering	Jingfeng Chen et.al.	2505.09039	link
2025-05-13	Improving the Reliability of LLMs: Combining CoT, RAG, Self-Consistency, and Self-Verification	Adarsh Kumar et.al.	2505.09031	null
2025-05-13	Tests as Prompt: A Test-Driven-Development Benchmark for LLM Code Generation	Yi Cui et.al.	2505.09027	null
2025-05-13	Automated Meta Prompt Engineering for Alignment with the Theory of Mind	Aaron Baughman et.al.	2505.09024	null
2025-05-13	Block-Biased Mamba for Long-Range Sequence Processing	Annan Yu et.al.	2505.09022	null
2025-05-13	AI-Mediated Code Comment Improvement	Maria Dhakal et.al.	2505.09021	null
2025-05-13	A suite of LMs comprehend puzzle statements as well as humans	Adele E Goldberg et.al.	2505.08996	null
2025-05-13	ITERA-LLM: Boosting Sub-8-Bit Large Language Model Inference via Iterative Tensor Decomposition	Keran Zheng et.al.	2505.08981	null
2025-05-13	Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training	Yangyi Chen et.al.	2505.08971	link
2025-05-13	Parameter-Efficient Fine-Tuning of Vision Foundation Model for Forest Floor Segmentation from UAV Imagery	Mohammad Wasil et.al.	2505.08932	link
2025-05-13	Assessing and Advancing Benchmarks for Evaluating Large Language Models in Software Engineering Tasks	Xing Hu et.al.	2505.08903	null
2025-05-13	Optimized Couplings for Watermarking Large Language Models	Dor Tsur et.al.	2505.08878	link
2025-05-13	Generative AI for Autonomous Driving: Frontiers and Opportunities	Yuping Wang et.al.	2505.08854	link
2025-05-13	Improved Algorithms for Differentially Private Language Model Alignment	Keyu Chen et.al.	2505.08849	null
2025-05-13	CellTypeAgent: Trustworthy cell type annotation with Large Language Models	Jiawen Chen et.al.	2505.08844	link
2025-05-13	PCS-UQ: Uncertainty Quantification via the Predictability-Computability-Stability Framework	Abhineet Agarwal et.al.	2505.08784	null
2025-05-13	CodePDE: An Inference Framework for LLM-driven PDE Solver Generation	Shanda Li et.al.	2505.08783	link
2025-05-13	HealthBench: Evaluating Large Language Models Towards Improved Human Health	Rahul K. Arora et.al.	2505.08775	link
2025-05-13	Generative Molecular Design with Steerable and Granular Synthesizability Control	Jeff Guo et.al.	2505.08774	link
2025-05-14	Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology	Yatai Ji et.al.	2505.08765	null
2025-05-13	AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models	Yanxi Zhang et.al.	2505.08750	link
2025-05-13	DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models	Xiaoyang Chen et.al.	2505.08744	link
2025-05-13	Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies	Xiaoliang Luo et.al.	2505.08739	link
2025-05-13	Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data	James Giroux et.al.	2505.08736	link
2025-05-13	NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context	Ben Yao et.al.	2505.08734	null
2025-05-13	Securing RAG: A Risk Assessment and Mitigation Framework	Lukas Ammann et.al.	2505.08728	null
2025-05-13	TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series	Xiaolei Qin et.al.	2505.08723	link
2025-05-13	PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts	Yang Su et.al.	2505.08719	null
2025-05-13	LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs	K M Sajjadul Islam et.al.	2505.08704	null
2025-05-13	A Survey of Deep Learning for Complex Speech Spectrograms	Yuying Xie et.al.	2505.08694	null
2025-05-13	VizCV: AI-assisted visualization of researchers’ publications tracks	Vladimír Lazárik et.al.	2505.08691	null
2025-05-13	Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation	Sheng Liang et.al.	2505.08690	null
2025-05-13	A Social Robot with Inner Speech for Dietary Guidance	Valerio Belcamino et.al.	2505.08664	link
2025-05-13	Revealing economic facts: LLMs know more than they say	Marcus Buckmann et.al.	2505.08662	null
2025-05-13	Enhancing Software Development with Context-Aware Conversational Agents: A User Study on Developer Interactions with Chatbots	Glaucia Melo et.al.	2505.08648	null
2025-05-13	Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models	Donghoon Kim et.al.	2505.08622	null
2025-05-13	Resource-Efficient Language Models: Quantization for Fast and Accessible Inference	Tollef Emil Jørgensen et.al.	2505.08620	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607	null
2025-05-13	Automatic Task Detection and Heterogeneous LLM Speculative Decoding	Danying Ge et.al.	2505.08600	null
2025-05-13	Enhancing Thyroid Cytology Diagnosis with RAG-Optimized LLMs and Pathology Foundation Models	Hussien Al-Asi et.al.	2505.08590	null
2025-05-13	Small but Significant: On the Promise of Small Language Models for Accessible AIED	Yumou Wei et.al.	2505.08588	null
2025-05-13	Reinforcement Learning meets Masked Video Modeling: Trajectory-Guided Adaptive Token Selection	Ayush K. Rai et.al.	2505.08561	null
2025-05-13	DFA-CON: A Contrastive Learning Approach for Detecting Copyright Infringement in DeepFake Art	Haroon Wahab et.al.	2505.08552	null
2025-05-13	Guiding LLM-based Smart Contract Generation with Finite State Machine	Hao Luo et.al.	2505.08542	null
2025-05-13	The Truth Becomes Clearer Through Debate! Multi-Agent Systems with Large Language Models Unmask Fake News	Yuhan Liu et.al.	2505.08532	null
2025-05-13	Building-Block Aware Generative Modeling for 3D Crystals of Metal Organic Frameworks	Chenru Duan et.al.	2505.08531	link
2025-05-13	ExEBench: Benchmarking Foundation Models on Extreme Earth Events	Shan Zhao et.al.	2505.08529	link
2025-05-13	A Deep Learning-Driven Framework for Inhalation Injury Grading Using Bronchoscopy Images	Yifan Li et.al.	2505.08517	null
2025-05-13	TrialMatchAI: An End-to-End AI-powered Clinical Trial Recommendation System to Streamline Patient-to-Trial Matching	Majd Abdallah et.al.	2505.08508	null
2025-05-13	InfoPO: On Mutual Information Maximization for Large Language Model Alignment	Teng Xiao et.al.	2505.08507	null
2025-05-13	LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models	Takumi Shibata et.al.	2505.08498	null
2025-05-13	BizChat: Scaffolding AI-Powered Business Planning for Small Business Owners Across Digital Skill Levels	Quentin Romero Lauro et.al.	2505.08493	null
2025-05-13	Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions	Lata Pangtey et.al.	2505.08464	null
2025-05-13	Strategy-Augmented Planning for Large Language Models via Opponent Exploitation	Shuai Xu et.al.	2505.08459	link
2025-05-13	IterKey: Iterative Keyword Generation with LLMs for Enhanced Retrieval Augmented Generation	Kazuki Hayashi et.al.	2505.08450	null
2025-05-13	Scalable UAV Multi-Hop Networking via Multi-Agent Reinforcement Learning with Large Language Models	Yanggang Xu et.al.	2505.08448	null
2025-05-13	Optimizing Retrieval-Augmented Generation: Analysis of Hyperparameter Impact on Performance and Efficiency	Adel Ammar et.al.	2505.08445	null
2025-05-13	Symbolically-Guided Visual Plan Inference from Uncurated Video Data	Wenyan Yang et.al.	2505.08444	null
2025-05-13	A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court	Matteo Marulli et.al.	2505.08439	null
2025-05-13	Visual Image Reconstruction from Brain Activity via Latent Representation	Yukiyasu Kamitani et.al.	2505.08429	null
2025-05-13	An integrated language-vision foundation model for conversational diagnostics and triaging in primary eye care	Zhi Da Soh et.al.	2505.08414	null
2025-05-13	TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers	Aiyao He et.al.	2505.08402	null
2025-05-13	Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping	Ren Zhuang et.al.	2505.08392	null
2025-05-13	Towards Contamination Resistant Benchmarks	Rahmatullah Musawi et.al.	2505.08389	null
2025-05-13	Learning Like Humans: Advancing LLM Reasoning Capabilities via Adaptive Difficulty Curriculum Learning and Expert-Guided Self-Reformulation	Enci Zhang et.al.	2505.08364	null
2025-05-13	Hamiltonian replica exchange augmented with diffusion-based generative models and importance sampling to assess biomolecular conformational basins and barriers	Zakarya Benayad et.al.	2505.08357	null
2025-05-13	Alignment Drift in CEFR-prompted LLMs for Interactive Spanish Tutoring	Mina Almasi et.al.	2505.08351	null
2025-05-13	Benchmarking AI scientists in omics data-driven biological research	Erpai Luo et.al.	2505.08341	link
2025-05-13	Evaluating the Effectiveness of Black-Box Prompt Optimization as the Scale of LLMs Continues to Grow	Ziyu Zhou et.al.	2505.08303	null
2025-05-13	A Practical Introduction to Deep Reinforcement Learning	Yinghan Sun et.al.	2505.08295	null
2025-05-13	Ultra Lowrate Image Compression with Semantic Residual Coding and Compression-aware Diffusion	Anle Ke et.al.	2505.08281	link
2025-05-13	LLM Enhancers for GNNs: An Analysis from the Perspective of Causal Mechanism Identification	Hang Gao et.al.	2505.08265	null
2025-05-13	LLM-Based Detection of Tangled Code Changes for Higher-Quality Method-Level Bug Datasets	Md Nahidul Islam Opu et.al.	2505.08263	null
2025-05-13	Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration	Rishabh Agrawal et.al.	2505.08261	null
2025-05-13	Evaluating LLM Metrics Through Real-World Capabilities	Justin K Miller et.al.	2505.08253	null
2025-05-13	Identifying Memorization of Diffusion Models through p-Laplace Analysis	Jonathan Brokman et.al.	2505.08246	link
2025-05-13	Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement	Haoran Ye et.al.	2505.08245	link
2025-05-13	Unveiling the Best Practices for Applying Speech Foundation Models to Speech Intelligibility Prediction for Hearing-Impaired People	Haoshuai Zhou et.al.	2505.08215	null
2025-05-13	A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs	Artem Shelmanov et.al.	2505.08200	null
2025-05-13	Aitomia: Your Intelligent Assistant for AI-Driven Atomistic and Quantum Chemical Simulations	Jinming Hu et.al.	2505.08195	null
2025-05-13	CLTP: Contrastive Language-Tactile Pre-training for 3D Contact Geometry Understanding	Wenxuan Ma et.al.	2505.08194	null
2025-05-13	DSADF: Thinking Fast and Slow for Decision Making	Alex Zhihao Dou et.al.	2505.08189	null
2025-05-14	Fusing Bidirectional Chains of Thought and Reward Mechanisms: A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage	Ruilin Liu et.al.	2505.08167	null
2025-05-13	Decoding Neighborhood Environments with Large Language Models	Andrew Cart et.al.	2505.08163	null
2025-05-13	Foundation Models Knowledge Distillation For Battery Capacity Degradation Forecast	Joey Chan et.al.	2505.08151	link
2025-05-13	A Large-Scale Empirical Analysis of Custom GPTs’ Vulnerabilities in the OpenAI Ecosystem	Sunday Oyinlola Ogundoyin et.al.	2505.08148	link
2025-05-13	Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information	Jiawei Zhou et.al.	2505.08143	null
2025-05-13	Lost in Transmission: When and Why LLMs Fail to Reason Globally	Tobias Schnabel et.al.	2505.08140	null
2025-05-13	Large Language Models for Computer-Aided Design: A Survey	Licheng Zhang et.al.	2505.08137	link
2025-05-13	Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions	Keita Teranishi et.al.	2505.08135	null
2025-05-13	ALOHA: Empowering Multilingual Agent for University Orientation with Hierarchical Retrieval	Mingxu Tao et.al.	2505.08130	null
2025-05-12	Will Your Next Pair Programming Partner Be Human? An Empirical Evaluation of Generative AI as a Collaborative Teammate in a Semester-Long Classroom Setting	Wenhan Lyu et.al.	2505.08119	null
2025-05-12	Are LLMs complicated ethical dilemma analyzers?	Jiashen et.al.	2505.08106	link
2025-05-12	Visually Interpretable Subtask Reasoning for Visual Question Answering	Yu Cheng et.al.	2505.08084	null
2025-05-12	LLMs to Support K-12 Teachers in Culturally Relevant Pedagogy: An AI Literacy Example	Jiayi Wang et.al.	2505.08083	null
2025-05-12	Fréchet Power-Scenario Distance: A Metric for Evaluating Generative AI Models across Multiple Time-Scales in Smart Grids	Yuting Cai et.al.	2505.08082	null
2025-05-12	Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders	Dong Shu et.al.	2505.08080	null
2025-05-12	FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning	Zhehao Zhang et.al.	2505.08054	null
2025-05-12	Protein FID: Improved Evaluation of Protein Structure Generative Models	Felix Faltings et.al.	2505.08041	null
2025-05-12	Opportunities and Applications of GenAI in Smart Cities: A User-Centric Survey	Ankit Shetgaonkar et.al.	2505.08034	null
2025-05-12	Large Language Models and Arabic Content: A Review	Haneh Rhel et.al.	2505.08004	null
2025-05-12	Vision Foundation Model Embedding-Based Semantic Anomaly Detection	Max Peter Ronecker et.al.	2505.07998	null
2025-05-12	Spec2Assertion: Automatic Pre-RTL Assertion Generation using Large Language Models with Progressive Regularization	Fenghua Wu et.al.	2505.07995	null
2025-05-12	MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing	Aybora Koksal et.al.	2505.07984	null
2025-05-12	Assessing and Mitigating Medical Knowledge Drift and Conflicts in Large Language Models	Weiyi Wu et.al.	2505.07968	null
2025-05-12	Symbolic Regression with Multimodal Large Language Models and Kolmogorov Arnold Networks	Thomas R. Harvey et.al.	2505.07956	null
2025-05-12	H$^{\mathbf{3}}$DP: Triply-Hierarchical Diffusion Policy for Visuomotor Learning	Yiyang Lu et.al.	2505.07819	null
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Continuous Visual Autoregressive Generation via Score Maximization	Chenze Shao et.al.	2505.07812	link
2025-05-12	Improving Trajectory Stitching with Flow Models	Reece O’Mahoney et.al.	2505.07802	null
2025-05-12	Learning Dynamics in Continual Pre-Training for Large Language Models	Xingjin Wang et.al.	2505.07796	null
2025-05-12	Domain Regeneration: How well do LLMs match syntactic properties of text domains?	Da Ju et.al.	2505.07784	null
2025-05-12	Relative Overfitting and Accept-Reject Framework	Yanxin Liu et.al.	2505.07783	null
2025-05-12	MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering	Rushi Qiang et.al.	2505.07782	link
2025-05-12	Synthesizing Diverse Network Flow Datasets with Scalable Dynamic Multigraph Generation	Arya Grayeli et.al.	2505.07777	null
2025-05-12	Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving	Xinji Mai et.al.	2505.07773	link
2025-05-12	Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding	Yifeng Di et.al.	2505.07768	link
2025-05-12	BodyGPS: Anatomical Positioning System	Halid Ziya Yerebakan et.al.	2505.07744	null
2025-05-12	Assessing the Chemical Intelligence of Large Language Models	Nicholas T. Runcie et.al.	2505.07735	link
2025-05-12	LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention	Jiangling Zhang et.al.	2505.07734	null
2025-05-12	Spoken Language Understanding on Unseen Tasks With In-Context Learning	Neeraj Agrawal et.al.	2505.07731	null
2025-05-12	Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations	Pranav Sinha et.al.	2505.07711	null
2025-05-12	PatchTrack: A Comprehensive Analysis of ChatGPT’s Influence on Pull Request Outcomes	Daniel Ogenrwot et.al.	2505.07700	null
2025-05-12	S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models	Muzhi Dai et.al.	2505.07686	null
2025-05-12	Multimodal Survival Modeling in the Age of Foundation Models	Steven Song et.al.	2505.07683	link
2025-05-12	SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models	Hang Wu et.al.	2505.07680	null
2025-05-13	OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit	Arun S. Maiya et.al.	2505.07672	link
2025-05-12	Benchmarking Retrieval-Augmented Generation for Chemistry	Xianrui Zhong et.al.	2505.07671	null
2025-05-12	A Case Study Investigating the Role of Generative AI in Quality Evaluations of Epics in Agile Software Development	Werner Geyer et.al.	2505.07664	null
2025-05-12	JobHop: A Large-Scale Dataset of Career Trajectories	Iman Johary et.al.	2505.07653	null
2025-05-12	Neural Brain: A Neuroscience-inspired Framework for Embodied Agents	Jian Liu et.al.	2505.07634	link
2025-05-12	Diffused Responsibility: Analyzing the Energy Consumption of Generative Text-to-Audio Diffusion Models	Riccardo Passoni et.al.	2505.07615	null
2025-05-12	Concept-Level Explainability for Auditing & Steering LLM Responses	Kenza Amara et.al.	2505.07610	link
2025-05-12	TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining	Paul Primus et.al.	2505.07609	null
2025-05-12	MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining	Xiaomi LLM-Core Team et.al.	2505.07608	link
2025-05-12	Characterizing the Investigative Methods of Fictional Detectives with Large Language Models	Edirlei Soares de Lima et.al.	2505.07601	null
2025-05-12	Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent	Ziyang Huang et.al.	2505.07596	null
2025-05-12	A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models	Junjie Ye et.al.	2505.07591	link
2025-05-12	SecReEvalBench: A Multi-turned Security Resilience Evaluation Benchmark for Large Language Models	Huining Cui et.al.	2505.07584	null
2025-05-12	YuLan-OneSim: Towards the Next Generation of Social Simulator with Large Language Models	Lei Wang et.al.	2505.07581	link
2025-05-12	Direct Density Ratio Optimization: A Statistically Consistent Approach to Aligning Large Language Models	Rei Higuchi et.al.	2505.07558	null
2025-05-12	Injecting Knowledge Graphs into Large Language Models	Erica Coppolillo et.al.	2505.07554	null
2025-05-12	Towards Requirements Engineering for RAG Systems	Tor Sporsem et.al.	2505.07553	null
2025-05-12	GRADA: Graph-based Reranker against Adversarial Documents Attack	Jingjie Zheng et.al.	2505.07546	link
2025-05-12	RAI: Flexible Agent Framework for Embodied AI	Kajetan Rachwał et.al.	2505.07532	link
2025-05-12	Byam: Fixing Breaking Dependency Updates with Large Language Models	Frank Reyes et.al.	2505.07522	link
2025-05-12	ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution	Xu Huang et.al.	2505.07512	null
2025-05-12	Learning to Reason and Navigate: Parameter Efficient Action Planning with Large Language Models	Bahram Mohammadi et.al.	2505.07500	null
2025-05-12	Web-Bench: A LLM Code Benchmark Based on Web Standards and Frameworks	Kai Xu et.al.	2505.07473	link
2025-05-12	A Survey on Collaborative Mechanisms Between Large and Small Language Models	Yi Chen et.al.	2505.07460	null
2025-05-12	Why Uncertainty Estimation Methods Fall Short in RAG: An Axiomatic Analysis	Heydar Soudani et.al.	2505.07459	null
2025-05-12	Can Generative AI agents behave like humans? Evidence from laboratory market experiments	R. Maria del Rio-Chanona et.al.	2505.07457	null
2025-05-12	How well do LLMs reason over tabular data, really?	Cornelius Wolff et.al.	2505.07453	null
2025-05-13	Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model	Wei Li et.al.	2505.07449	link
2025-05-12	Unified Continuous Generative Models	Peng Sun et.al.	2505.07447	link
2025-05-12	DiffCrysGen: A Score-Based Diffusion Model for Design of Diverse Inorganic Crystalline Materials	Sourav Mal et.al.	2505.07442	null
2025-05-12	LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning	Xiaotian Lin et.al.	2505.07437	link
2025-05-12	A Systematic Literature Review on Neural Code Translation	Xiang Chen et.al.	2505.07425	null
2025-05-12	AI in Money Matters	Nadine Sandjo Tchatchoua et.al.	2505.07393	null
2025-05-12	Examining the Role of LLM-Driven Interactions on Attention and Cognitive Engagement in Virtual Classrooms	Suleyman Ozdel et.al.	2505.07377	null
2025-05-12	A Preliminary Study of Large Language Models for Multilingual Vulnerability Detection	Junji Yu et.al.	2505.07376	null
2025-05-12	Synthetic Code Surgery: Repairing Bugs and Vulnerabilities with LLMs and Synthetic Data	David de-Fitero-Dominguez et.al.	2505.07372	null
2025-05-12	GAN-based synthetic FDG PET images from T1 brain MRI can serve to improve performance of deep unsupervised anomaly detection models	Daria Zotova et.al.	2505.07364	null
2025-05-12	BinMetric: A Comprehensive Binary Analysis Benchmark for Large Language Models	Xiuwei Shang et.al.	2505.07360	null
2025-05-12	From Search To Sampling: Generative Models For Robust Algorithmic Recourse	Prateek Garg et.al.	2505.07351	link
2025-05-12	QUPID: Quantified Understanding for Enhanced Performance, Insights, and Decisions in Korean Search Engines	Ohjoon Kwon et.al.	2505.07345	null
2025-05-12	Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption	Jordan Frery et.al.	2505.07329	null
2025-05-12	Uncertainty Profiles for LLMs: Uncertainty Source Decomposition and Adaptive Model-Metric Selection	Pei-Fu Guo et.al.	2505.07309	null
2025-05-12	L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers	Sofia Casarin et.al.	2505.07300	null
2025-05-12	Semantic Retention and Extreme Compression in LLMs: Can We Have Both?	Stanislas Laborde et.al.	2505.07289	null
2025-05-12	Piloting Structure-Based Drug Design via Modality-Specific Optimal Schedule	Keyue Qiu et.al.	2505.07286	link
2025-05-12	Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains	Ibne Farabi Shihab et.al.	2505.07274	link
2025-05-12	Automated Repair of Ambiguous Natural Language Requirements	Haoxiang Jia et.al.	2505.07270	link
2025-05-12	No Query, No Access	Wenqiang Wang et.al.	2505.07258	null
2025-05-12	Synthetic Similarity Search in Automotive Production	Christoph Huber et.al.	2505.07256	null
2025-05-12	SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models	Peichao Lai et.al.	2505.07247	link
2025-05-12	Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity	Guang Yan et.al.	2505.07239	null
2025-05-12	DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation	Jiashuo Sun et.al.	2505.07233	link
2025-05-12	Spatial Confounding in Multivariate Areal Data Analysis	Kyle Lin Wu et.al.	2505.07232	link
2025-05-12	Measuring General Intelligence with Generated Games	Vivek Verma et.al.	2505.07215	link
2025-05-12	Towards user-centered interactive medical image segmentation in VR with an assistive AI agent	Pascal Spiegler et.al.	2505.07214	null
2025-05-12	Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030	Mouxiao Bian et.al.	2505.07205	null
2025-05-12	PrefillOnly: An Inference Engine for Prefill-only Workloads in Large Language Model Applications	Kuntai Du et.al.	2505.07203	null
2025-05-12	Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMs	Yifan Wei et.al.	2505.07184	link
2025-05-12	Internet of Agents: Fundamentals, Applications, and Challenges	Yuntao Wang et.al.	2505.07176	null
2025-05-12	Metrics that matter: Evaluating image quality metrics for medical image generation	Yash Deo et.al.	2505.07175	link
2025-05-12	One Trigger Token Is Enough: A Defense Strategy for Balancing Safety and Usability in Large Language Models	Haoran Gu et.al.	2505.07167	null
2025-05-12	KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification	Hajar Sakai et.al.	2505.07162	null
2025-05-12	HAMLET: Healthcare-focused Adaptive Multilingual Learning Embedding-based Topic Modeling	Hajar Sakai et.al.	2505.07157	null
2025-05-12	Reassessing Large Language Model Boolean Query Generation for Systematic Reviews	Shuai Wang et.al.	2505.07155	null
2025-05-13	Exploring Anthropomorphism in Conversational Agents for Environmental Sustainability	Mathyas Giudici et.al.	2505.07142	null
2025-05-11	KOKKAI DOC: An LLM-driven framework for scaling parliamentary representatives	Ken Kato et.al.	2505.07118	null
2025-05-11	Knowledge Distillation for Enhancing Walmart E-commerce Search Relevance Using Large Language Models	Hongwei Shang et.al.	2505.07105	null
2025-05-11	RefPentester: A Knowledge-Informed Self-Reflective Penetration Testing Framework Based on Large Language Models	Hanzheng Dai et.al.	2505.07089	null
2025-05-11	Architectural Precedents for General Agents using Large Language Models	Robert E. Wray et.al.	2505.07087	null
2025-05-11	Multi-Objective-Guided Discrete Flow Matching for Controllable Biological Sequence Design	Tong Chen et.al.	2505.07086	null
2025-05-11	DriveSOTIF: Advancing Perception SOTIF Through Multimodal Large Language Models	Shucheng Huang et.al.	2505.07084	link
2025-05-11	Can LLM-based Financial Investing Strategies Outperform the Market in Long Run?	Weixian Waylon Li et.al.	2505.07078	link
2025-05-11	ParaView-MCP: An Autonomous Visualization Agent with Direct Tool Use	Shusen Liu et.al.	2505.07064	null
2025-05-11	Seed1.5-VL Technical Report	Dong Guo et.al.	2505.07062	null
2025-05-11	LLM-Augmented Chemical Synthesis and Design Decision Programs	Haorui Wang et.al.	2505.07027	null
2025-05-11	A Vision-Language Foundation Model for Leaf Disease Identification	Khang Nguyen Quoc et.al.	2505.07019	link
2025-05-11	MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception	Zhengye Zhang et.al.	2505.07007	link
2025-05-11	GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance	Jinuk Kim et.al.	2505.07004	link
2025-05-11	Convert Language Model into a Value-based Strategic Planner	Xiaoyu Wang et.al.	2505.06987	null
2025-05-11	Web Page Classification using LLMs for Crawling Support	Yuichi Sasazawa et.al.	2505.06972	link
2025-05-09	Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks	Christos Plachouras et.al.	2505.06224	link
2025-05-09	Adapting a Segmentation Foundation Model for Medical Image Classification	Pengfei Gu et.al.	2505.06217	null
2025-05-09	From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling	Vahid Rahimzadeh et.al.	2505.06184	null
2025-05-09	A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows	Linjiang Cao et.al.	2505.06178	null
2025-05-09	MonetGPT: Solving Puzzles Enhances MLLMs’ Image Retouching Skills	Niladri Shekhar Dutt et.al.	2505.06176	null
2025-05-09	Turbo-ICL: In-Context Learning-Based Turbo Equalization	Zihang Song et.al.	2505.06175	null
2025-05-09	A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets	Ryan Lagasse et.al.	2505.06150	null
2025-05-09	Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study	Faeze Ghorbanpour et.al.	2505.06149	null
2025-05-09	Constraints to Lorentz violation and ultrahigh-energy electrons in D-foamy space-times	Chengyi Li et.al.	2505.06121	null
2025-05-09	LLMs Get Lost In Multi-Turn Conversation	Philippe Laban et.al.	2505.06120	link
2025-05-09	Photovoltaic Defect Image Generator with Boundary Alignment Smoothing Constraint for Domain Shift Mitigation	Dongying Li et.al.	2505.06117	null
2025-05-09	LLMs Outperform Experts on Challenging Biology Benchmarks	Lennart Justen et.al.	2505.06108	null
2025-05-09	Free and Fair Hardware: A Pathway to Copyright Infringement-Free Verilog Generation using LLMs	Sam Bush et.al.	2505.06096	null
2025-05-09	Assessing Tenstorrent’s RISC-V MatMul Acceleration Capabilities	Hiari Pizzini Cavagna et.al.	2505.06085	null
2025-05-09	Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information	Joshua Harris et.al.	2505.06046	null
2025-05-09	Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation	Stefan Vasilev et.al.	2505.06027	null
2025-05-09	ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding	Shuai Wang et.al.	2505.06020	null
2025-05-09	Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition	Congqi Cao et.al.	2505.06002	link
2025-05-09	Offline Multi-agent Reinforcement Learning via Score Decomposition	Dan Qiao et.al.	2505.05968	null
2025-05-09	GEORCE: A Fast New Control Algorithm for Computing Geodesics	Frederik Möbius Rygaard et.al.	2505.05961	link
2025-05-09	NeoQA: Evidence-based Question Answering with Generated News Events	Max Glockner et.al.	2505.05949	link
2025-05-09	Summarisation of German Judgments in conjunction with a Class-based Evaluation	Bianca Steffes et.al.	2505.05947	link
2025-05-09	Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2	Vytenis Šliogeris et.al.	2505.05946	null
2025-05-09	Autoencoder-Based Hybrid Replay for Class-Incremental Learning	Milad Khademi Nori et.al.	2505.05926	null
2025-05-09	CAPE: Context-Aware Prompt Perturbation Mechanism with Differential Privacy	Haoqi Wu et.al.	2505.05922	null
2025-05-09	Generative Discovery of Partial Differential Equations by Learning from Math Handbooks	Hao Xu et.al.	2505.05869	null
2025-05-09	Evolutionary ecology of words	Reiji Suzuki et.al.	2505.05863	null
2025-05-09	AgentXploit: End-to-End Redteaming of Black-Box AI Agents	Zhun Wang et.al.	2505.05849	null
2025-05-09	Augmented Body Communicator: Enhancing daily body expression for people with upper limb limitations through LLM and a robotic arm	Songchen Zhou et.al.	2505.05832	null
2025-05-09	Tell Me Who Your Students Are: GPT Can Generate Valid Multiple-Choice Questions When Students’ (Mis)Understanding Is Hinted	Machi Shimmei et.al.	2505.05815	null
2025-05-09	What Is Next for LLMs? Next-Generation AI Computing Hardware Using Photonic Chips	Renjie Li et.al.	2505.05794	null
2025-05-09	A Day in Their Shoes: Using LLM-Based Perspective-Taking Interactive Fiction to Reduce Stigma Toward Dirty Work	Xiangzhe Yuan et.al.	2505.05786	null
2025-05-09	Sparse Attention Remapping with Clustering for Efficient LLM Decoding on PIM	Zehao Fan et.al.	2505.05772	null
2025-05-09	Multi-Agent Systems for Robotic Autonomy with LLMs	Junhong Chen et.al.	2505.05762	null
2025-05-09	APOLLO: Automated LLM and Lean Collaboration for Advanced Formal Reasoning	Azim Ospanov et.al.	2505.05758	null
2025-05-09	Evolutionary thoughts: integration of large language models and evolutionary algorithms	Antonio Jimeno Yepes et.al.	2505.05756	link
2025-05-09	Harnessing LLMs Explanations to Boost Surrogate Models in Tabular Data Classification	Ruxue Shi et.al.	2505.05744	null
2025-05-09	Multimodal Integrated Knowledge Transfer to Large Language Models through Preference Optimization with Biomedical Applications	Da Wu et.al.	2505.05736	link
2025-05-09	Automated Learning of Semantic Embedding Representations for Diffusion Models	Limai Jiang et.al.	2505.05732	null
2025-05-09	Understanding Stragglers in Large Model Training Using What-if Analysis	Jinkun Lin et.al.	2505.05713	link
2025-05-09	LLM-Text Watermarking based on Lagrange Interpolation	Jarosław Janas et.al.	2505.05712	null
2025-05-09	HyperspectralMAE: The Hyperspectral Imagery Classification Model using Fourier-Encoded Dual-Branch Masked Autoencoder	Wooyoung Jeong et.al.	2505.05710	null
2025-05-09	Assessing Robustness to Spurious Correlations in Post-Training Language Models	Julia Shuieh et.al.	2505.05704	null
2025-05-08	Fine-Tuning Video-Text Contrastive Model for Primate Behavior Retrieval from Unlabeled Raw Videos	Giulio Cesare Mastrocinque Santo et.al.	2505.05681	null
2025-05-08	From Bias To Improved Prompts: A Case Study of Bias Mitigation of Clone Detection Models	QiHong Chen et.al.	2505.05679	null
2025-05-08	InstanceGen: Image Generation with Instance-level Instructions	Etai Sella et.al.	2505.05678	link
2025-05-08	Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval	Alexander Most et.al.	2505.05666	null
2025-05-08	Adaptive Stress Testing Black-Box LLM Planners	Neeloy Chakraborty et.al.	2505.05665	null
2025-05-08	Not Like Us, Hunty: Measuring Perceptions and Behavioral Effects of Minoritized Anthropomorphic Cues in LLMs	Jeffrey Basoah et.al.	2505.05660	null
2025-05-08	The Moon’s Many Faces: A Single Unified Transformer for Multimodal Lunar Reconstruction	Tom Sander et.al.	2505.05644	null
2025-05-08	Looking Beyond Language Priors: Enhancing Visual Comprehension and Attention in Multimodal Models	Aarti Ghatkesar et.al.	2505.05626	null
2025-05-08	CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory	Weichen Zhang et.al.	2505.05622	link
2025-05-08	LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities	Kalyan Nakka et.al.	2505.05619	link
2025-05-08	Leveraging Large Language Models for enzymatic reaction prediction and characterization	Lorenzo Di Fruscia et.al.	2505.05616	link
2025-05-08	scDrugMap: Benchmarking Large Foundation Models for Drug Response Prediction	Qing Wang et.al.	2505.05612	link
2025-05-08	HiBayES: A Hierarchical Bayesian Modeling Framework for AI Evaluation Statistics	Lennart Luettgau et.al.	2505.05602	link
2025-05-08	Enhancing Large Language Models with Faster Code Preprocessing for Vulnerability Detection	José Gonçalves et.al.	2505.05600	link
2025-05-08	PRIMG: Efficient LLM-driven Test Generation Using Mutant Prioritization	Mohamed Salah Bouafif et.al.	2505.05584	link
2025-05-08	KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification	Qianbo Zang et.al.	2505.05583	link
2025-05-08	PyTDC: A multimodal machine learning training, evaluation, and inference platform for biomedical foundation models	Alejandro Velez-Arce et.al.	2505.05577	link
2025-05-08	Griffin: Towards a Graph-Centric Relational Database Foundation Model	Yanbo Wang et.al.	2505.05568	link
2025-05-08	3D Scene Generation: A Survey	Beichen Wen et.al.	2505.05474	link
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472	null
2025-05-08	Generating Physically Stable and Buildable LEGO Designs from Text	Ava Pun et.al.	2505.05469	link
2025-05-08	StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant	Haibo Wang et.al.	2505.05467	null
2025-05-08	ComPO: Preference Alignment via Comparison Oracles	Peter Chen et.al.	2505.05465	null
2025-05-08	Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging	Shiqi Chen et.al.	2505.05464	link
2025-05-08	UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections	Fatima Haouari et.al.	2505.05459	null
2025-05-08	Conversational Process Model Redesign	Nataliia Klievtsova et.al.	2505.05453	null
2025-05-08	clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations	Chalamalasetti Kranti et.al.	2505.05445	null
2025-05-08	GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality	Xiyun Hu et.al.	2505.05441	null
2025-05-09	EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation	Biao Yi et.al.	2505.05440	null
2025-05-08	Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data	Yudong Wang et.al.	2505.05427	null
2025-05-09	LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering	Ran Zhang et.al.	2505.05423	link
2025-05-08	Crosslingual Reasoning through Test-Time Scaling	Zheng-Xin Yong et.al.	2505.05408	link
2025-05-08	Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans?	Valeria Pastorino et.al.	2505.05406	null
2025-05-08	A Pain Assessment Framework based on multimodal data and Deep Machine Learning methods	Stefanos Gkikas et.al.	2505.05396	null
2025-05-08	Modelling and Verifying Neuronal Archetypes in Coq	Abdorrahim Bahrami et.al.	2505.05362	link
2025-05-08	DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning	Wenru Liu et.al.	2505.05360	null
2025-05-08	Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization	Sooyoung Park et.al.	2505.05343	link
2025-05-08	ICon: In-Context Contribution for Automatic Data Selection	Yixin Yang et.al.	2505.05327	null
2025-05-08	Toward Reasonable Parrots: Why Large Language Models Should Argue with Us by Design	Elena Musi et.al.	2505.05298	null
2025-05-08	Benchmarking Ophthalmology Foundation Models for Clinically Significant Age Macular Degeneration Detection	Benjamin A. Cohen et.al.	2505.05291	null
2025-05-08	HEXGEN-TEXT2SQL: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL Workflow	You Peng et.al.	2505.05286	link
2025-05-09	Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents	Kaixin Wang et.al.	2505.05283	null
2025-05-08	MTL-UE: Learning to Learn Nothing for Multi-Task Learning	Yi Yu et.al.	2505.05279	null
2025-05-08	PADriver: Towards Personalized Autonomous Driving	Genghua Kou et.al.	2505.05240	null
2025-05-08	Latte: Transfering LLMs’ Latent-level Knowledge for Few-shot Tabular Learning	Ruxue Shi et.al.	2505.05237	null
2025-05-08	ChemRxivQuest: A Curated Chemistry Question-Answer Database Extracted from ChemRxiv Preprints	Mahmoud Amiri et.al.	2505.05232	null
2025-05-08	QualBench: Benchmarking Chinese LLMs with Localized Professional Qualifications for Vertical Domain Evaluation	Mengze Hong et.al.	2505.05225	null
2025-05-08	Diffusion Model Quantization: A Review	Qian Zeng et.al.	2505.05215	link
2025-05-08	Stealthy LLM-Driven Data Poisoning Attacks Against Embedding-Based Retrieval-Augmented Recommender Systems	Fatemeh Nazary et.al.	2505.05196	null
2025-05-08	Revealing Weaknesses in Text Watermarking Through Self-Information Rewrite Attacks	Yixin Cheng et.al.	2505.05190	link
2025-05-08	Biomed-DPT: Dual Modality Prompt Tuning for Biomedical Vision-Language Models	Wei Peng et.al.	2505.05189	link
2025-05-08	MARK: Memory Augmented Refinement of Knowledge	Anish Ganguli et.al.	2505.05177	null
2025-05-08	FedTDP: A Privacy-Preserving and Unified Framework for Trajectory Data Preparation via Federated Learning	Zhihao Zeng et.al.	2505.05155	null
2025-05-08	Overcoming Dimensional Factorization Limits in Discrete Diffusion Models through Quantum Joint Distribution Learning	Chuangtao Chen et.al.	2505.05151	link
2025-05-08	Text2Cypher: Data Pruning using Hard Example Selection	Makbule Gulcin Ozsoy et.al.	2505.05122	null
2025-05-08	Enhancing Text2Cypher with Schema Filtering	Makbule Gulcin Ozsoy et.al.	2505.05118	null
2025-05-08	Unveiling Language-Specific Features in Large Language Models via Sparse Autoencoders	Boyi Deng et.al.	2505.05111	null
2025-05-08	Multi-agent Embodied AI: Advances and Future Directions	Zhaohan Feng et.al.	2505.05108	null
2025-05-08	A Weighted Byzantine Fault Tolerance Consensus Driven Trusted Multiple Large Language Models Network	Haoxiang Luo et.al.	2505.05103	null
2025-05-08	X-Driver: Explainable Autonomous Driving with Vision-Language Models	Wei Liu et.al.	2505.05098	null
2025-05-08	Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction	Xiaowei Zhu et.al.	2505.05084	null
2025-05-08	ItDPDM: Information-Theoretic Discrete Poisson Diffusion Model	Sagnik Bhattacharya et.al.	2505.05082	null
2025-05-08	Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization	Ajwad Abrar et.al.	2505.05070	null
2025-05-08	WaterDrum: Watermarking for Data-centric Unlearning Metric	Xinyang Lu et.al.	2505.05064	link
2025-05-08	CodeMixBench: Evaluating Large Language Models on Code Generation with Code-Mixed Prompts	Manik Sheokand et.al.	2505.05063	null
2025-05-08	ULFine: Unbiased Lightweight Fine-tuning for Foundation-Model-Assisted Long-Tailed Semi-Supervised Learning	Enhao Zhang et.al.	2505.05062	null
2025-05-08	Towards Mitigating API Hallucination in Code Generated by LLMs with Hierarchical Dependency Aware	Yujia Chen et.al.	2505.05057	link
2025-05-08	Statistical method for A-RNA and B-DNA	Marco Zoli et.al.	2505.05053	null
2025-05-09	UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model	Timo Kaiser et.al.	2505.05049	link
2025-05-08	LSRP: A Leader-Subordinate Retrieval Framework for Privacy-Preserving Cloud-Device Collaboration	Yingyi Zhang et.al.	2505.05031	link
2025-05-08	A Reputation System for Large Language Model-based Multi-agent Systems to Avoid the Tragedy of the Commons	Siyue Ren et.al.	2505.05029	null
2025-05-08	Generative Models for Long Time Series: Approximately Equivariant Recurrent Network Structures for an Adjusted Training Scheme	Ruwen Fulek et.al.	2505.05020	link
2025-05-08	Generating Reliable Synthetic Clinical Trial Data: The Role of Hyperparameter Optimization and Domain Constraints	Waldemar Hahn et.al.	2505.05019	null
2025-05-08	Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization	Yuntai Bao et.al.	2505.05017	link
2025-05-08	The Pitfalls of Growing Group Complexity: LLMs and Social Choice-Based Aggregation for Group Recommendations	Cedric Waterschoot et.al.	2505.05016	null
2025-05-08	Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication	Jinhe Huang et.al.	2505.04996	null
2025-05-08	Rethinking Invariance in In-context Learning	Lizhe Fang et.al.	2505.04994	null
2025-05-08	Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes	Zhuocheng Gong et.al.	2505.04993	null
2025-05-08	Boosting Statistic Learning with Synthetic Data from Pretrained Large Models	Jialong Jiang et.al.	2505.04992	null
2025-05-08	LVLM-MPC Collaboration for Autonomous Driving: A Safety-Aware and Task-Scalable Control Architecture	Kazuki Atsuta et.al.	2505.04980	null
2025-05-08	ReAlign: Bilingual Text-to-Motion Generation via Step-Aware Reward-Guided Alignment	Wanjiang Weng et.al.	2505.04974	null
2025-05-08	DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding	Henry Zheng et.al.	2505.04965	null
2025-05-08	Learning Item Representations Directly from Multimodal Features for Effective Recommendation	Xin Zhou et.al.	2505.04960	link
2025-05-08	Graffe: Graph Representation Learning via Diffusion Probabilistic Models	Dingshuo Chen et.al.	2505.04956	null
2025-05-08	Chain-of-Thought Tokens are Computer Program Variables	Fangwei Zhu et.al.	2505.04955	link
2025-05-08	Position: Epistemic Artificial Intelligence is Essential for Machine Learning Models to Know When They Do Not Know	Shireen Kudukkil Manchingal et.al.	2505.04950	null
2025-05-08	Prompt-Based LLMs for Position Bias-Aware Reranking in Personalized Recommendations	Md Aminul Islam et.al.	2505.04948	link
2025-05-08	T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models	Xuyang Guo et.al.	2505.04946	null
2025-05-08	An Open-Source Dual-Loss Embedding Model for Semantic Retrieval in Higher Education	Ramteja Sajja et.al.	2505.04916	null
2025-05-08	SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models	Shun Taguchi et.al.	2505.04911	null
2025-05-08	Clustering with Communication: A Variational Framework for Single Cell Representation Learning	Cong Qi et.al.	2505.04891	null
2025-05-08	A Multi-Agent AI Framework for Immersive Audiobook Production through Spatial Audio and Neural Narration	Shaja Arul Selvamani et.al.	2505.04885	null
2025-05-08	GroverGPT-2: Simulating Grover’s Algorithm via Chain-of-Thought Reasoning and Quantum-Native Tokenization	Min Chen et.al.	2505.04880	null
2025-05-08	From First Draft to Final Insight: A Multi-Agent Approach for Feedback Generation	Jie Cao et.al.	2505.04869	null
2025-05-08	Mix-QSAM: Mixed-Precision Quantization of the Segment Anything Model	Navin Ranjan et.al.	2505.04861	null
2025-05-07	CRAFT: Cultural Russian-Oriented Dataset Adaptation for Focused Text-to-Image Generation	Viacheslav Vasilev et.al.	2505.04851	null
2025-05-07	HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights	Ozan Gokdemir et.al.	2505.04846	null
2025-05-07	Comparative Study of Generative Models for Early Detection of Failures in Medical Devices	Binesh Sadanandan et.al.	2505.04845	link
2025-05-07	Osiris: A Lightweight Open-Source Hallucination Detection System	Alex Shan et.al.	2505.04844	null
2025-05-07	Large Language Models are Autonomous Cyber Defenders	Sebastián R. Castro et.al.	2505.04843	link
2025-05-07	Steerable Scene Generation with Post Training and Inference-Time Search	Nicholas Pfaff et.al.	2505.04831	link
2025-05-07	Guide your favorite protein sequence generative model	Junhao Xiong et.al.	2505.04823	null
2025-05-07	WIR3D: Visually-Informed and Geometry-Aware 3D Shape Abstraction	Richard Liu et.al.	2505.04813	null
2025-05-07	Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs	Chetan Pathade et.al.	2505.04806	null
2025-05-07	ORBIT-2: Scaling Exascale Vision Foundation Models for Weather and Climate Downscaling	Xiao Wang et.al.	2505.04802	null
2025-05-07	Safeguard-by-Development: A Privacy-Enhanced Development Paradigm for Multi-Agent Collaboration Systems	Jian Cui et.al.	2505.04799	null
2025-05-07	A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models	Pedro Pinacho-Davidson et.al.	2505.04784	null
2025-05-07	When Bad Data Leads to Good Models	Kenneth Li et.al.	2505.04741	null
2025-05-07	The Promise and Limits of LLMs in Constructing Proofs and Hints for Logic Problems in Intelligent Tutoring Systems	Sutapa Dey Tithi et.al.	2505.04736	null
2025-05-07	QBD-RankedDataGen: Generating Custom Ranked Datasets for Improving Query-By-Document Search Using LLM-Reranking with Reduced Human Effort	Sriram Gopalakrishnan et.al.	2505.04732	null
2025-05-07	SOAEsV2-7B/72B: Full-Pipeline Optimization for State-Owned Enterprise LLMs via Continual Pre-Training, Domain-Progressive SFT and Distillation-Enhanced Speculative Decoding	Jingyang Deng et.al.	2505.04723	null
2025-05-07	Lay-Your-Scene: Natural Scene Layout Generation with Diffusion Transformers	Divyansh Srivastava et.al.	2505.04718	null
2025-05-07	EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning	Zhenghao Xing et.al.	2505.04623	link
2025-05-07	On Path to Multimodal Generalist: General-Level and General-Bench	Hao Fei et.al.	2505.04620	null
2025-05-07	OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution	Lianghong Guo et.al.	2505.04606	link
2025-05-07	OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning	Xianhang Li et.al.	2505.04601	null
2025-05-08	MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection	Zhihao Zhang et.al.	2505.04594	null
2025-05-07	ZeroSearch: Incentivize the Search Capability of LLMs without Searching	Hao Sun et.al.	2505.04588	link
2025-05-07	SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions	Chloe Qianhui Zhao et.al.	2505.04584	link
2025-05-07	Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization	Wenjun Cao et.al.	2505.04578	null
2025-05-07	Comparative Analysis of Carbon Footprint in Manual vs. LLM-Assisted Code Development	Kuen Sum Cheung et.al.	2505.04521	null
2025-05-07	Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs	Yehui Tang et.al.	2505.04519	null
2025-05-07	Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts	Ilya Koziev et.al.	2505.04507	null
2025-05-08	Defining and Quantifying Creative Behavior in Popular Image Generators	Aditi Ramaswamy et.al.	2505.04497	null
2025-05-07	Efficient Flow Matching using Latent Variables	Anirban Samaddar et.al.	2505.04486	null
2025-05-07	CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation	Jiahao Li et.al.	2505.04481	null
2025-05-07	TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution	Zhikai Zhao et.al.	2505.04480	link
2025-05-07	Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration	Shigeki Karita et.al.	2505.04457	link
2025-05-07	M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation	Qianru Zhang et.al.	2505.04445	null
2025-05-07	Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs	Mirazul Haque et.al.	2505.04441	null
2025-05-07	OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models	Xiaoyu Xu et.al.	2505.04416	null
2025-05-07	DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception	Junjie Wang et.al.	2505.04410	link
2025-05-07	YABLoCo: Yet Another Benchmark for Long Context Code Generation	Aidar Valeev et.al.	2505.04406	null
2025-05-07	Large Means Left: Political Bias in Large Language Models Increases with Their Number of Parameters	David Exler et.al.	2505.04393	null
2025-05-07	The Aloe Family Recipe for Open and Specialized Healthcare LLMs	Dario Garcia-Gasulla et.al.	2505.04388	null
2025-05-07	CDE-Mapper: Using Retrieval-Augmented Language Models for Linking Clinical Data Elements to Controlled Vocabularies	Komal Gilani et.al.	2505.04365	null
2025-05-07	Benchmarking LLMs’ Swarm intelligence	Kai Ruan et.al.	2505.04364	link
2025-05-07	Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows	Wenhao Li et.al.	2505.04354	null
2025-05-07	CountDiffusion: Text-to-Image Synthesis with Training-Free Counting-Guidance Diffusion	Yanyu Li et.al.	2505.04347	null
2025-05-07	Riemannian Denoising Diffusion Probabilistic Models	Zichen Liu et.al.	2505.04338	null
2025-05-07	GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance	Sofia Jamil et.al.	2505.04284	link
2025-05-07	Non-stationary Diffusion For Probabilistic Time Series Forecasting	Weiwei Ye et.al.	2505.04278	link
2025-05-07	Weaponizing Language Models for Cybersecurity Offensive Operations: Automating Vulnerability Assessment Report Validation; A Review Paper	Abdulrahman S Almuhaidib et.al.	2505.04265	null
2025-05-07	Steerable Chatbots: Personalizing LLMs with Preference-Based Activation Steering	Jessica Y. Bo et.al.	2505.04260	null
2025-05-07	LLM-Independent Adaptive RAG: Let the Question Speak for Itself	Maria Marina et.al.	2505.04253	null
2025-05-07	A Large Language Model for Feasible and Diverse Population Synthesis	Sung Yoo Lim et.al.	2505.04196	null
2025-05-07	AutoPatch: Multi-Agent Framework for Patching Real-World CVE Vulnerabilities	Minjae Seo et.al.	2505.04195	link
2025-05-07	On-Device LLM for Context-Aware Wi-Fi Roaming	Ju-Hyung Lee et.al.	2505.04174	link
2025-05-07	DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion	Zixiao Wang et.al.	2505.04173	null
2025-05-07	Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts	Nouar Aldahoul et.al.	2505.04171	null
2025-05-07	Can Language Models Understand Social Behavior in Clinical Conversations?	Manas Satish Bedmutha et.al.	2505.04152	null
2025-05-07	Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety	Variath Madhupal Gautham Nair et.al.	2505.04146	null
2025-05-07	NAMO-LLM: Efficient Navigation Among Movable Obstacles with Large Language Model Guidance	Yuqing Zhang et.al.	2505.04141	link
2025-05-07	Enhancing Granular Sentiment Classification with Chain-of-Thought Prompting in Large Language Models	Vihaan Miriyala et.al.	2505.04135	null
2025-05-07	RFNNS: Robust Fixed Neural Network Steganography with Popular Deep Generative Models	Yu Cheng et.al.	2505.04116	null
2025-05-07	Alpha Excel Benchmark	David Noever et.al.	2505.04110	null
2025-05-08	MAISY: Motion-Aware Image SYnthesis for Medical Image Motion Correction	Andrew Zhang et.al.	2505.04105	null
2025-05-07	LLMs’ Suitability for Network Security: A Case Study of STRIDE Threat Modeling	AbdulAziz AbdulGhaffar et.al.	2505.04101	null
2025-05-07	An Empirical Study of OpenAI API Discussions on Stack Overflow	Xiang Chen et.al.	2505.04084	null
2025-05-07	QStore: Quantization-Aware Compressed Model Storage	Raunak Shah et.al.	2505.04081	link
2025-05-07	LLM-e Guess: Can LLMs Capabilities Advance Without Hardware Progress?	Teddy Foley et.al.	2505.04075	link
2025-05-07	Natural Language Generation in Healthcare: A Review of Methods and Applications	Mengxian Lyu et.al.	2505.04073	null
2025-05-07	Advancing and Benchmarking Personalized Tool Invocation for LLMs	Xu Huang et.al.	2505.04072	link
2025-05-07	Shadow Wireless Intelligence: Large Language Model-Driven Reasoning in Covert Communications	Yuanai Xie et.al.	2505.04068	null
2025-05-07	BuildingBlock: A Hybrid Approach for Structured Building Generation	Junming Huang et.al.	2505.04051	null
2025-05-07	Identification and Optimization of Redundant Code Using Large Language Models	Shamse Tasnim Cynthia et.al.	2505.04040	null
2025-05-06	Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving	Shan Yu et.al.	2505.04021	null
2025-05-06	SLOT: Structuring the Output of Large Language Models	Darren Yow-Bang Wang et.al.	2505.04016	null
2025-05-06	Can Large Language Models Predict Parallel Code Performance?	Gregory Bolet et.al.	2505.03988	null
2025-05-06	LogiDebrief: A Signal-Temporal Logic based Automated Debriefing Approach with Large Language Models Integration	Zirong Chen et.al.	2505.03985	null
2025-05-06	Diffusion Models are Secretly Exchangeable: Parallelizing DDPMs via Autospeculation	Hengyuan Hu et.al.	2505.03983	null
2025-05-06	A Reasoning-Focused Legal Retrieval Benchmark	Lucia Zheng et.al.	2505.03970	null
2025-05-06	nuGAN: Generative Adversarial Emulator for Cosmic Web with Neutrinos	Neerav Kaushal et.al.	2505.03936	null
2025-05-06	MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models	Asif Rahman et.al.	2505.03906	null
2025-05-06	Unveiling the Role of ChatGPT in Software Development: Insights from Developer-ChatGPT Interactions on GitHub	Ruiyin Li et.al.	2505.03901	null
2025-05-06	Machine Learning: a Lecture Note	Kyunghyun Cho et.al.	2505.03861	null
2025-05-06	VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model	Zuwei Long et.al.	2505.03739	link
2025-05-06	Meta-Optimization and Program Search using Language Models for Task and Motion Planning	Denis Shcherba et.al.	2505.03725	null
2025-05-06	Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech	Susmita Bhattacharjee et.al.	2505.03697	null
2025-05-06	Graph Drawing for LLMs: An Empirical Evaluation	Walter Didimo et.al.	2505.03678	null
2025-05-06	Binding threshold units with artificial oscillatory neurons	Vladimir Fanaskov et.al.	2505.03648	link
2025-05-06	PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing	Yiping Xie et.al.	2505.03621	null
2025-05-06	From Pixels to Polygons: A Survey of Deep Learning Approaches for Medical Image-to-Mesh Reconstruction	Fengming Lin et.al.	2505.03599	null
2025-05-06	DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes	Sergey Linok et.al.	2505.03581	link
2025-05-06	LlamaFirewall: An open source guardrail system for building secure AI agents	Sahana Chennabasappa et.al.	2505.03574	null
2025-05-06	Say It Another Way: A Framework for User-Grounded Paraphrasing	Cléa Chataigner et.al.	2505.03563	null
2025-05-06	Real-Time Person Image Synthesis Using a Flow Matching Model	Jiwoo Jeong et.al.	2505.03562	link
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-06	A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning	Kolawole E. Ogunsina et.al.	2505.03553	null
2025-05-06	STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game	Eric Zhou et.al.	2505.03547	null
2025-05-06	Faster MoE LLM Inference for Extremely Large Models	Haoqi Yang et.al.	2505.03531	null
2025-05-06	Causal Intervention Framework for Variational Auto Encoder Mechanistic Interpretability	Dip Roy et.al.	2505.03530	null
2025-05-06	Ruled by the Representation Space: On the University’s Embrace of Large Language Models	Katia Schwerzmann et.al.	2505.03513	null
2025-05-06	Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking	Shenglan Li et.al.	2505.03507	link
2025-05-06	BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models	Zihan Wang et.al.	2505.03501	null
2025-05-06	A new membership inference attack that spots memorization in generative and predictive models: Loss-Based with Reference Model algorithm (LBRM)	Faiz Taleb et.al.	2505.03490	null
2025-05-06	am-ELO: A Stable Framework for Arena-based LLM Evaluation	Zirui Liu et.al.	2505.03475	null
2025-05-06	Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models	Bin Yu et.al.	2505.03469	link
2025-05-06	Uncertainty-Aware Large Language Models for Explainable Disease Diagnosis	Shuang Zhou et.al.	2505.03467	null
2025-05-06	LogisticsVLN: Vision-Language Navigation For Low-Altitude Terminal Delivery Based on Agentic UAVs	Xinyuan Zhang et.al.	2505.03460	null
2025-05-06	The Steganographic Potentials of Language Models	Artem Karpov et.al.	2505.03439	null
2025-05-06	Procedural Memory Is Not All You Need: Bridging Cognitive Gaps in LLM-Based Agents	Schaun Wheeler et.al.	2505.03434	null
2025-05-06	Wasserstein Convergence of Score-based Generative Models under Semiconvexity and Discontinuous Gradients	Stefano Bruno et.al.	2505.03432	null
2025-05-06	MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks	Mouath Abu Daoud et.al.	2505.03427	link
2025-05-06	Phenotype-Guided Generative Model for High-Fidelity Cardiac MRI Synthesis: Advancing Pretraining and Clinical Applications	Ziyu Li et.al.	2505.03426	null
2025-05-06	Directed Greybox Fuzzing via Large Language Model	Hanxiang Xu et.al.	2505.03425	null
2025-05-06	Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey	Da Zheng et.al.	2505.03418	null
2025-05-06	Lightweight Clinical Decision Support System using QLoRA-Fine-Tuned LLMs and Retrieval-Augmented Generation	Mohammad Shoaib Ansari et.al.	2505.03406	null
2025-05-06	Automatic Calibration for Membership Inference Attack on Large Language Models	Saleh Zare Zade et.al.	2505.03392	link
2025-05-06	SPAP: Structured Pruning via Alternating Optimization and Penalty Methods	Hanyu Hu et.al.	2505.03373	null
2025-05-06	Validating the Effectiveness of a Large Language Model-based Approach for Identifying Children’s Development across Various Free Play Settings in Kindergarten	Yuanyuan Yang et.al.	2505.03369	null
2025-05-06	Geospatial Mechanistic Interpretability of Large Language Models	Stef De Sabbata et.al.	2505.03368	link
2025-05-06	Domain Adversarial Training for Mitigating Gender Bias in Speech-based Mental Health Detection	June-Woo Kim et.al.	2505.03359	null
2025-05-06	Elevating Cyber Threat Intelligence against Disinformation Campaigns with LLM-based Concept Extraction and the FakeCTI Dataset	Domenico Cotroneo et.al.	2505.03345	link
2025-05-06	Avoid Recommending Out-of-Domain Items: Constrained Generative Recommendation with LLMs	Hao Liao et.al.	2505.03336	link
2025-05-07	Absolute Zero: Reinforced Self-play Reasoning with Zero Data	Andrew Zhao et.al.	2505.03335	link
2025-05-06	AI-Driven Scholarly Peer Review via Persistent Workflow Prompting, Meta-Prompting, and Meta-Reasoning	Evgeny Markhasin et.al.	2505.03332	null
2025-05-06	Artificial Behavior Intelligence: Technology, Challenges, and Future Directions	Kanghyun Jo et.al.	2505.03315	null
2025-05-06	Towards Efficient Benchmarking of Foundation Models in Remote Sensing: A Capabilities Encoding Approach	Pierre Adorni et.al.	2505.03299	link
2025-05-06	Capability-Driven Skill Generation with LLMs: A RAG-Based Approach for Reusing Existing Libraries and Interfaces	Luis Miguel Vieira da Silva et.al.	2505.03295	null
2025-05-06	Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback	Shijing Zhu et.al.	2505.03293	null
2025-05-06	RAG-MCP: Mitigating Prompt Bloat in LLM Tool Selection via Retrieval-Augmented Generation	Tiantian Gan et.al.	2505.03275	null
2025-05-06	SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation	Zhaoxi Mu et.al.	2505.03273	null
2025-05-06	Synthline: A Product Line Approach for Synthetic Requirements Engineering Data Generation using Large Language Models	Abdelkarim El-Hajjami et.al.	2505.03265	link
2025-05-06	SonicRAG: High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation	Yu-Ren Guo et.al.	2505.03244	null
2025-05-06	RobotxR1: Enabling Embodied Robotic Intelligence on Large Language Models through Closed-Loop Reinforcement Learning	Liam Boyle et.al.	2505.03238	null
2025-05-06	GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data	Shengliang Deng et.al.	2505.03233	null
2025-05-06	DocSpiral: A Platform for Integrated Assistive Document Annotation through Human-in-the-Spiral	Qiang Sun et.al.	2505.03214	null
2025-05-06	DYSTIL: Dynamic Strategy Induction with Large Language Models for Reinforcement Learning	Borui Wang et.al.	2505.03209	null
2025-05-06	Transformers for Learning on Noisy and Task-Level Manifolds: Approximation and Generalization Insights	Zhaiming Shen et.al.	2505.03205	null
2025-05-06	A Trustworthy Multi-LLM Network: Challenges, Solutions, and A Use Case	Haoxiang Luo et.al.	2505.03196	null
2025-05-06	Patterns and Mechanisms of Contrastive Activation Engineering	Yixiong Hao et.al.	2505.03189	null
2025-05-06	VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making	Jake Grigsby et.al.	2505.03181	null
2025-05-06	Bridging Expertise Gaps: The Role of LLMs in Human-AI Collaboration for Cybersecurity	Shahroz Tariq et.al.	2505.03179	null
2025-05-06	CombiBench: Benchmarking LLM Capability for Combinatorial Mathematics	Junqi Liu et.al.	2505.03171	link
2025-05-06	The Impact of Large Language Models on K-12 Education in Rural India: A Thematic Analysis of Student Volunteer’s Perspectives	Harshita Goyal et.al.	2505.03163	null
2025-05-06	An LLM-based Self-Evolving Security Framework for 6G Space-Air-Ground Integrated Networks	Qi Qin et.al.	2505.03161	null
2025-05-06	StableMotion: Training Motion Cleanup Models with Unpaired Corrupted Data	Yuxuan Mu et.al.	2505.03154	null
2025-05-06	Towards Effective Identification of Attack Techniques in Cyber Threat Intelligence Reports using Large Language Models	Hoang Cuong Nguyen et.al.	2505.03147	link
2025-05-06	Holmes: Automated Fact Check with Large Language Models	Haoran Ou et.al.	2505.03135	null
2025-05-06	VISLIX: An XAI Framework for Validating Vision Models with Slice Discovery and Analysis	Xinyuan Yan et.al.	2505.03132	null
2025-05-06	Plug-and-Play AMC: Context Is King in Training-Free, Open-Set Modulation with LLMs	Mohammad Rostami et.al.	2505.03112	link
2025-05-06	Towards a standardized methodology and dataset for evaluating LLM-based digital forensic timeline analysis	Hudan Studiawan et.al.	2505.03100	null
2025-05-06	Assessing and Enhancing the Robustness of LLM-based Multi-Agent Systems Through Chaos Engineering	Joshua Owotogbe et.al.	2505.03096	null
2025-05-05	Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models	Zhengliang Shi et.al.	2505.03075	link
2025-05-05	Variational diffusion transformers for conditional sampling of supernovae spectra	Yunyi Shen et.al.	2505.03063	null
2025-05-05	Improving Model Alignment Through Collective Intelligence of Open-Source LLMS	Junlin Wang et.al.	2505.03059	null
2025-05-05	34 Examples of LLM Applications in Materials Science and Chemistry: Towards Automation, Assistants, Agents, and Accelerated Scientific Discovery	Yoel Zimmermann et.al.	2505.03049	null
2025-05-05	MORE: Mobile Manipulation Rearrangement Through Grounded Language Reasoning	Mohammad Mohammadi et.al.	2505.03035	null
2025-05-05	Evaluating the Impact of AI-Powered Audiovisual Personalization on Learner Emotion, Focus, and Learning Outcomes	George Xi Wang et.al.	2505.03033	null
2025-05-05	Radio: Rate-Distortion Optimization for Large Language Model Compression	Sean I. Young et.al.	2505.03031	null
2025-05-05	UCSC at SemEval-2025 Task 3: Context, Models and Prompt Optimization for Automated Hallucination Detection in LLM Output	Sicong Huang et.al.	2505.03030	null
2025-05-05	Memorization or Interpolation? Detecting LLM Memorization through Input Perturbation Analysis	Albérick Euraste Djiré et.al.	2505.03019	null
2025-05-05	Lesion-Aware Generative Artificial Intelligence for Virtual Contrast-Enhanced Mammography in Breast Cancer	Aurora Rofena et.al.	2505.03018	null
2025-05-05	GIF: Generative Inspiration for Face Recognition at Scale	Saeed Ebrahimi et.al.	2505.03012	null
2025-05-05	Modeling the Impact of Group Interactions on Climate-related Opinion Change in Reddit	Alessia Antelmi et.al.	2505.02989	link
2025-05-05	Generative modelling of multivariate geometric extremes using normalising flows	Lambert De Monte et.al.	2505.02957	null
2025-05-05	RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference	Yaoqi Chen et.al.	2505.02922	null
2025-05-05	When Your Own Output Becomes Your Training Data: Noise-to-Meaning Loops and a Formal RSI Trigger	Rintaro Ando et.al.	2505.02888	link
2025-05-05	Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation	Lu Ling et.al.	2505.02836	null
2025-05-05	R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning	Yi-Fan Zhang et.al.	2505.02835	link
2025-05-07	No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves	Dengyang Jiang et.al.	2505.02831	link
2025-05-05	LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery	Jerome Quenum et.al.	2505.02829	null
2025-05-05	ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations	Dmitriy Shopkhoev et.al.	2505.02819	link
2025-05-05	Towards Quantifying the Hessian Structure of Neural Networks	Zhaorui Dong et.al.	2505.02809	link
2025-05-05	Generating HomeAssistant Automations Using an LLM-based Chatbot	Mathyas Giudici et.al.	2505.02802	null
2025-05-05	HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models	Zheng Lin et.al.	2505.02795	null
2025-05-05	Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control	Nam H. Le et.al.	2505.02766	null
2025-05-05	Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models	Matthew Dahl et.al.	2505.02763	null
2025-05-05	Using Knowledge Graphs to harvest datasets for efficient CLIP model training	Simon Ging et.al.	2505.02746	link
2025-05-06	Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation	Gerard Pons et.al.	2505.02737	null
2025-05-05	FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models	Zhouliang Yu et.al.	2505.02735	link
2025-05-05	Enhancing LLMs’ Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry	Junu Kim et.al.	2505.02722	link
2025-05-05	Less is More: Efficient Weight Farcasting with 1-Layer Neural Network	Xiao Shou et.al.	2505.02714	null
2025-05-05	Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	Yemin Shi et.al.	2505.02707	link
2025-05-05	AI Standardized Patient Improves Human Conversations in Advanced Cancer Care	Kurtis Haut et.al.	2505.02694	link
2025-05-05	Predicting Movie Hits Before They Happen with LLMs	Shaghayegh Agah et.al.	2505.02693	null
2025-05-05	Sailing AI by the Stars: A Survey of Learning from Rewards in Post-Training and Test-Time Scaling of Large Language Models	Xiaobao Wu et.al.	2505.02686	link
2025-05-05	Multimodal Deep Learning for Stroke Prediction and Detection using Retinal Imaging and Clinical Data	Saeed Shurrab et.al.	2505.02677	null
2025-05-05	A Survey on Progress in LLM Alignment from the Perspective of Reward Design	Miaomiao Ji et.al.	2505.02666	null
2025-05-05	A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law	Qianjun Pan et.al.	2505.02665	null
2025-05-06	A Note on Statistically Accurate Tabular Data Generation Using Large Language Models	Andrey Sidorenko et.al.	2505.02659	link
2025-05-05	Hierarchical random measures without tables	Marta Catalano et.al.	2505.02653	null
2025-05-05	Enhancing Chemical Reaction and Retrosynthesis Prediction with Large Language Model and Dual-task Learning	Xuan Lin et.al.	2505.02639	null
2025-05-05	Parameter-Efficient Fine-Tuning with Attributed Patch Semantic Graph for Automated Patch Correctness Assessment	Zhenyu Yang et.al.	2505.02629	link
2025-05-05	DeepSparse: A Foundation Model for Sparse-View CBCT Reconstruction	Yiqun Lin et.al.	2505.02628	null
2025-05-05	Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal Large Language Models	Sassan Mokhtar et.al.	2505.02626	link
2025-05-05	LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis	Qingkai Fang et.al.	2505.02625	link
2025-05-05	Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era	Chenxi Liu et.al.	2505.02583	link
2025-05-06	EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning	Lingxiao Kong et.al.	2505.02579	link
2025-05-05	Recursive Decomposition with Dependencies for Generic Divide-and-Conquer Reasoning	Sergio Hernández-Gutiérrez et.al.	2505.02576	null
2025-05-05	Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities	Xinjie Zhang et.al.	2505.02567	link
2025-05-06	Evaluating Contrastive Feedback for Effective User Simulations	Andreas Konstantin Kruff et.al.	2505.02560	link
2025-05-05	The Turing Test Is More Relevant Than Ever	Avraham Rahimov et.al.	2505.02558	null
2025-05-05	Large Language Model Partitioning for Low-Latency Inference at the Edge	Dimitrios Kafetzis et.al.	2505.02533	null
2025-05-05	Text to Image Generation and Editing: A Survey	Pengfei Yang et.al.	2505.02527	null
2025-05-05	Unveiling the Landscape of LLM Deployment in the Wild: An Empirical Study	Xinyi Hou et.al.	2505.02502	null
2025-05-05	Automating Automotive Software Development: A Synergy of Generative AI and Formal Methods	Fengjunjie Pan et.al.	2505.02500	null
2025-05-05	Beyond the model: Key differentiators in large language models and multi-agent services	Muskaan Goyal et.al.	2505.02489	null
2025-05-05	Hypothesis testing and Stein’s lemma in general probability theories with Euclidean Jordan algebra and its quantum realization	Kanta Sonoda et.al.	2505.02487	null
2025-05-05	SEFE: Superficial and Essential Forgetting Eliminator for Multimodal Continual Instruction Tuning	Jinpeng Chen et.al.	2505.02486	link
2025-05-05	Automated Hybrid Reward Scheduling via Large Language Models for Robotic Skill Learning	Changxin Huang et.al.	2505.02483	null
2025-05-05	Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality	Xueguang Ma et.al.	2505.02466	link
2025-05-05	Incentivizing Inclusive Contributions in Model Sharing Markets	Enpei Zhang et.al.	2505.02462	null
2025-05-05	Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs	Elisa Forcada Rodríguez et.al.	2505.02456	null
2025-05-05	Can LLM-Simulated Practice and Feedback Upskill Human Counselors? A Randomized Study with 90+ Novice Counselors	Ryan Louie et.al.	2505.02428	null
2025-05-05	Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks	Baoxia Du et.al.	2505.02413	null
2025-05-05	Estimating Commonsense Scene Composition on Belief Scene Graphs	Mario A. V. Saucedo et.al.	2505.02405	null
2025-05-05	Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL	Jiarui Yao et.al.	2505.02391	link
2025-05-05	RM-R1: Reward Modeling as Reasoning	Xiusi Chen et.al.	2505.02387	link
2025-05-06	EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices	Arnab Sanyal et.al.	2505.02380	null
2025-05-05	LAMeD: LLM-generated Annotations for Memory Leak Detection	Ekaterina Shemetova et.al.	2505.02376	null
2025-05-05	Advancing Email Spam Detection: Leveraging Zero-Shot Learning and Large Language Models	Ghazaleh Shirvani et.al.	2505.02362	link
2025-05-05	An End-to-End Model For Logits Based Large Language Models Watermarking	Kahim Wong et.al.	2505.02344	link
2025-05-05	VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection	Hao Cheng et.al.	2505.02331	link
2025-05-05	From Course to Skill: Evaluating LLM Performance in Curricular Analytics	Zhen Xu et.al.	2505.02324	link
2025-05-05	HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking	Runquan Gui et.al.	2505.02322	null
2025-05-05	Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering	Jihao Zhao et.al.	2505.02311	link
2025-05-05	Bayesian inference for cluster-randomized trials with multivariate outcomes subject to both truncation by death and missingness	Guangyu Tong et.al.	2505.02310	null
2025-05-05	Optimizing LLMs for Resource-Constrained Environments: A Survey of Model Compression Techniques	Sanjay Surendranath Girija et.al.	2505.02309	null
2025-05-05	Generative Sign-description Prompts with Multi-positive Contrastive Learning for Sign Language Recognition	Siyu Liang et.al.	2505.02304	null
2025-05-05	Entropy-Guided Sampling of Flat Modes in Discrete Spaces	Pinaki Mohanty et.al.	2505.02296	link
2025-05-04	A survey of agent interoperability protocols: Model Context Protocol (MCP), Agent Communication Protocol (ACP), Agent-to-Agent Protocol (A2A), and Agent Network Protocol (ANP)	Abul Ehtesham et.al.	2505.02279	null
2025-05-04	Real-time Spatial Retrieval Augmented Generation for Urban Environments	David Nazareno Campo et.al.	2505.02271	null
2025-05-04	Enhancing AI Face Realism: Cost-Efficient Quality Improvement in Distilled Diffusion Models with a Fully Synthetic Dataset	Jakub Wąsala et.al.	2505.02255	null
2025-05-04	Personalisation or Prejudice? Addressing Geographic Bias in Hate Speech Detection using Debias Tuning in Large Language Models	Paloma Piot et.al.	2505.02252	link
2025-05-04	Improving Physical Object State Representation in Text-to-Image Generative Systems	Tianle Chen et.al.	2505.02236	link
2025-05-04	Prompt-responsive Object Retrieval with Memory-augmented Student-Teacher Learning	Malte Mosbach et.al.	2505.02232	null
2025-05-04	An Empirical Study of Qwen3 Quantization	Xingyu Zheng et.al.	2505.02214	link
2025-05-04	Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes	Matthew T. Dearing et.al.	2505.02184	null
2025-05-04	Robust AI-Generated Face Detection with Imbalanced Data	Yamini Sri Krubha et.al.	2505.02182	link
2025-05-04	Sparfels: Fast Reconstruction from Sparse Unposed Imagery	Shubhendu Jena et.al.	2505.02178	null
2025-05-04	Measuring Hong Kong Massive Multi-Task Language Understanding	Chuxue Cao et.al.	2505.02177	null
2025-05-04	Identifying Legal Holdings with LLMs: A Systematic Study of Performance, Scale, and Memorization	Chuck Arvin et.al.	2505.02172	link
2025-05-04	A New HOPE: Domain-agnostic Automatic Evaluation of Text Chunking	Henrik Brådland et.al.	2505.02171	null
2025-05-04	Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions	Cunxin Fan et.al.	2505.02152	null
2025-05-04	Large Language Models are overconfident and amplify human bias	Fengfei Sun et.al.	2505.02151	null
2025-05-04	QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach	Shouyang Dong et.al.	2505.02146	null
2025-05-04	Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study	Xiaoyu Tian et.al.	2505.02142	null
2025-05-06	Efficient Multivariate Time Series Forecasting via Calibrated Language Models with Privileged Knowledge Distillation	Chenxi Liu et.al.	2505.02138	link
2025-05-04	Enhancing LLM Code Generation: A Systematic Evaluation of Multi-Agent Collaboration and Runtime Debugging for Improved Accuracy, Reliability, and Latency	Nazmus Ashrafi et.al.	2505.02133	link
2025-05-04	Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data	Zhong Guan et.al.	2505.02130	link
2025-05-04	GRAIL: Graph Edit Distance and Node Alignment Using LLM-Generated Code	Samidha Verma et.al.	2505.02124	link
2025-05-04	DriveAgent: Multi-Agent Structured Reasoning with LLM and Multimodal Sensor Fusion for Autonomous Driving	Xinmeng Hou et.al.	2505.02123	link
2025-05-04	MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based Agents	Zeyu Zhang et.al.	2505.02099	link
2025-05-04	LLM-OptiRA: LLM-Driven Optimization of Resource Allocation for Non-Convex Problems in Wireless Communications	Xinyue Peng et.al.	2505.02091	link
2025-05-04	Retrieval-augmented in-context learning for multimodal large language models in disease classification	Zaifu Zhan et.al.	2505.02087	null
2025-05-04	LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning	Joy Lim Jia Yin et.al.	2505.02078	link
2025-05-04	Leveraging LLM Agents and Digital Twins for Fault Handling in Process Plants	Milapji Singh Gill et.al.	2505.02076	link
2025-05-04	Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation	Volodymyr Havrylov et.al.	2505.02075	link
2025-05-04	Lightweight Defense Against Adversarial Attacks in Time Series Classification	Yi Han et.al.	2505.02073	link
2025-05-06	RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video	Shuhang Xun et.al.	2505.02064	link
2025-05-04	TxP: Reciprocal Generation of Ground Pressure Dynamics and Activity Descriptions for Improving Human Activity Recognition	Lala Shakti Swarup Ray et.al.	2505.02052	link
2025-05-04	Secrets of GFlowNets’ Learning Behavior: A Theoretical Study	Tianshu Yu et.al.	2505.02035	null
2025-05-04	From Mind to Machine: The Rise of Manus AI as a Fully Autonomous Digital Agent	Minjie Shen et.al.	2505.02024	null
2025-05-04	Wide & Deep Learning for Node Classification	Yancheng Chen et.al.	2505.02020	link
2025-05-04	Learning the Simplest Neural ODE	Yuji Okamoto et.al.	2505.02019	null
2025-05-04	MLLM-Enhanced Face Forgery Detection: A Vision-Language Fusion Solution	Siran Peng et.al.	2505.02013	null
2025-05-04	Testing Database Systems with Large Language Model Synthesized Fragments	Suyang Zhong et.al.	2505.02012	null
2025-05-02	GENMO: A GENeralist Model for Human MOtion	Jiefeng Li et.al.	2505.01425	null
2025-05-02	How Effective are Large Time Series Models in Hydrology? A Study on Water Level Forecasting in Everglades	Rahuul Rangaraj et.al.	2505.01415	null
2025-05-02	Provable Efficiency of Guidance in Diffusion Models for General Data Distribution	Gen Li et.al.	2505.01382	null
2025-05-02	FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors	Chenxi Li et.al.	2505.01322	null
2025-05-02	Helping Big Language Models Protect Themselves: An Enhanced Filtering and Summarization System	Sheikh Samit Muhaimin et.al.	2505.01315	null
2025-05-02	Enhancing SPARQL Query Rewriting for Complex Ontology Alignments	Anicet Lepetit Ondo et.al.	2505.01309	null
2025-05-02	Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments	Regan Bolton et.al.	2505.01307	null
2025-05-02	ViSA-Flow: Accelerating Robot Skill Learning via Large-Scale Video Semantic Action Flow	Changhe Chen et.al.	2505.01288	null
2025-05-02	Scoring-Assisted Generative Exploration for Proteins (SAGE-Prot): A Framework for Multi-Objective Protein Optimization via Iterative Sequence Generation and Evaluation	Hocheol Lim et.al.	2505.01277	link
2025-05-02	FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing	Gaoxiang Cong et.al.	2505.01263	null
2025-05-02	Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications	Elie Saad et.al.	2505.01261	null
2025-05-02	Digital Pathway Curation (DPC): a comparative pipeline to assess the reproducibility, consensus and accuracy across Gemini, PubMed, and scientific reviewers in biomedical research	Flavio Lichtenstein et.al.	2505.01259	null
2025-05-02	Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging	Elena Mulero Ayllón et.al.	2505.01239	null
2025-05-02	CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning	Tsai-Ning Wang et.al.	2505.01199	null
2025-05-02	A Combinatorial Proof of Universal Optimality for Computing a Planar Convex Hull	Ivor van der Hoog et.al.	2505.01194	null
2025-05-02	TSTMotion: Training-free Scene-aware Text-to-motion Generation	Ziyan Guo et.al.	2505.01182	null
2025-05-02	LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures	Francisco Aguilera-Martínez et.al.	2505.01177	null
2025-05-02	Methodological Foundations for AI-Driven Survey Question Generation	Ted K. Mburu et.al.	2505.01150	null
2025-05-02	Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications	Jiawei He et.al.	2505.01146	null
2025-05-02	Evaluating the Impact of Data Cleaning on the Quality of Generated Pull Request Descriptions	Kutay Tire et.al.	2505.01120	null
2025-05-02	Incorporating Inductive Biases to Energy-based Generative Models	Yukun Li et.al.	2505.01111	null
2025-05-02	MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning	Murtadha Ahmed et.al.	2505.01110	null
2025-05-02	Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study	Ali Mammadov et.al.	2505.01109	link
2025-05-02	Any-to-Any Vision-Language Model for Multimodal X-ray Imaging and Radiological Report Generation	Daniele Molino et.al.	2505.01091	null
2025-05-02	MADIL: An MDL-based Framework for Efficient Program Synthesis in the ARC Benchmark	Sébastien Ferré et.al.	2505.01081	null
2025-05-02	Zero-Shot Document-Level Biomedical Relation Extraction via Scenario-based Prompt Design in Two-Stage with LLM	Lei Zhao et.al.	2505.01077	null
2025-05-02	Federated Adapter on Foundation Models: An Out-Of-Distribution Approach	Yiyuan Yang et.al.	2505.01075	null
2025-05-02	WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks	Jingwen Tong et.al.	2505.01074	link
2025-05-02	Retrieval Augmented Learning: A Retrial-based Large Language Model Self-Supervised Learning and Autonomous Knowledge Generation	Zongyuan Li et.al.	2505.01073	null
2025-05-02	A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories	Ziqi Ding et.al.	2505.01067	null
2025-05-02	Good News for Script Kiddies? Evaluating Large Language Models for Automated Exploit Generation	David Jin et.al.	2505.01065	null
2025-05-02	Efficient Vocabulary-Free Fine-Grained Visual Recognition in the Age of Multimodal LLMs	Hari Chandana Kuchibhotla et.al.	2505.01064	null
2025-05-02	Transferable Adversarial Attacks on Black-Box Vision-Language Models	Kai Hu et.al.	2505.01050	null
2025-05-02	Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities	Zhiwei Hao et.al.	2505.01043	null
2025-05-02	Do We Need a Detailed Rubric for Automated Essay Scoring using Large Language Models?	Lui Yoshida et.al.	2505.01035	null
2025-05-02	Improving Large Language Model Planning with Action Sequence Similarity	Xinran Zhao et.al.	2505.01009	null
2025-05-02	Where’s the liability in the Generative Era? Recovery-based Black-Box Detection of AI-Generated Content	Haoyue Bai et.al.	2505.01008	null
2025-05-02	Togedule: Scheduling Meetings with Large Language Models and Adaptive Representations of Group Availability	Jaeyoon Song et.al.	2505.01000	link
2025-05-02	Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis	Yu Hua et.al.	2505.00998	null
2025-05-02	Position: Enough of Scaling LLMs! Lets Focus on Downscaling	Ayan Sengupta et.al.	2505.00985	link
2025-05-02	Multi-agents based User Values Mining for Recommendation	Lijian Chen et.al.	2505.00981	null
2025-05-02	Synthesize-on-Graph: Knowledgeable Synthetic Data Generation for Continue Pre-training of Large Language Models	Xuhui Jiang et.al.	2505.00979	null
2025-05-02	Attack and defense techniques in large language models: A survey and new perspectives	Zhiyu Liao et.al.	2505.00976	null
2025-05-02	Seeking to Collide: Online Safety-Critical Scenario Generation for Autonomous Driving with Retrieval Augmented Large Language Models	Yuewen Mei et.al.	2505.00972	null
2025-05-02	Tree-Sliced Wasserstein Distance with Nonlinear Projection	Thanh Tran et.al.	2505.00968	null
2025-05-02	Preserving Privacy and Utility in LLM-Based Product Recommendations	Tina Khezresmaeilzadeh et.al.	2505.00951	null
2025-05-02	SSRLBot: Designing and Developing an LLM-based Agent using Socially Shared Regulated Learning	Xiaoshan Huang et.al.	2505.00945	null
2025-05-02	Large Language Model-Driven Dynamic Assessment of Grammatical Accuracy in English Language Learner Writing	Timur Jaganov et.al.	2505.00931	null
2025-05-02	How Transformers Learn Regular Language Recognition: A Theoretical Study on Training Dynamics and Implicit Bias	Ruiquan Huang et.al.	2505.00926	null
2025-05-01	Multivariate Conformal Selection	Tian Bai et.al.	2505.00917	null
2025-05-01	NeMo-Inspector: A Visualization Tool for LLM Generation Analysis	Daria Gitman et.al.	2505.00903	link
2025-05-01	Towards Explainable Temporal User Profiling with LLMs	Milad Sabouri et.al.	2505.00886	link
2025-05-01	Protocol-agnostic and Data-free Backdoor Attacks on Pre-trained Models in RF Fingerprinting	Tianya Zhao et.al.	2505.00881	link
2025-05-01	LLM Ethics Benchmark: A Three-Dimensional Assessment System for Evaluating Moral Reasoning in Large Language Models	Junfeng Jiao et.al.	2505.00853	link
2025-05-01	ICQuant: Index Coding enables Low-bit LLM Quantization	Xinlin Li et.al.	2505.00850	null
2025-05-01	OET: Optimization-based prompt injection Evaluation Toolkit	Jinsheng Pan et.al.	2505.00843	link
2025-05-01	From Texts to Shields: Convergence of Large Language Models and Cybersecurity	Tao Li et.al.	2505.00841	null
2025-05-01	Multi-site modelling and reconstruction of past extreme skew surges along the French Atlantic coast	Nathan Huet et.al.	2505.00835	link
2025-05-01	SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation	Quang P. M. Pham et.al.	2505.00831	link
2025-05-01	Data-Driven Optical To Thermal Inference in Pool Boiling Using Generative Adversarial Networks	Qianxi Fu et.al.	2505.00823	null
2025-05-01	Should AI Mimic People? Understanding AI-Supported Writing Technology Among Black Users	Jeffrey Basoah et.al.	2505.00821	null
2025-05-01	HMCF: A Human-in-the-loop Multi-Robot Collaboration Framework Based on Large Language Models	Zhaoxing Li et.al.	2505.00820	null
2025-05-01	Spill The Beans: Exploiting CPU Cache Side-Channels to Leak Tokens from Large Language Models	Andrew Adiletta et.al.	2505.00817	null
2025-05-01	Reasoning Capabilities and Invariability of Large Language Models	Alessandro Raganato et.al.	2505.00776	link
2025-05-01	Multi-Modal Language Models as Text-to-Image Model Evaluators	Jiahui Chen et.al.	2505.00759	null
2025-05-01	A Survey on Large Language Model based Human-Agent Systems	Henry Peng Zou et.al.	2505.00753	link
2025-05-01	T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT	Dongzhi Jiang et.al.	2505.00703	link
2025-05-01	GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution	Aditya Arora et.al.	2505.00687	null
2025-05-01	Steering Large Language Models with Register Analysis for Arbitrary Style Transfer	Xinchen Yang et.al.	2505.00679	null
2025-05-01	Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions	Yiming Du et.al.	2505.00675	link
2025-05-01	DeepCritic: Deliberate Critique with Large Language Models	Wenkai Yang et.al.	2505.00662	link
2025-05-01	On the generalization of language models from in-context learning and finetuning: a controlled study	Andrew K. Lampinen et.al.	2505.00661	null
2025-05-01	Large Language Models Understanding: an Inherent Ambiguity Barrier	Daniel N. Nissani et.al.	2505.00654	null
2025-05-01	Open-Source LLM-Driven Federated Transformer for Predictive IoV Management	Yazan Otoum et.al.	2505.00651	null
2025-05-01	Investigating Task Arithmetic for Zero-Shot Information Retrieval	Marco Braga et.al.	2505.00649	link
2025-05-01	Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis	Zhongying Deng et.al.	2505.00627	null
2025-05-01	The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)	Zihao Wang et.al.	2505.00626	null
2025-05-02	SA-GAT-SR: Self-Adaptable Graph Attention Networks with Symbolic Regression for high-fidelity material property prediction	Liu Junchi et.al.	2505.00625	link
2025-05-01	FineScope: Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation	Chaitali Bhattacharyya et.al.	2505.00624	null
2025-05-01	Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction	Simon Giebenhain et.al.	2505.00615	null
2025-05-01	Combining LLMs with Logic-Based Framework to Explain MCTS	Ziyan An et.al.	2505.00610	null
2025-05-01	Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4	Phanish Puranam et.al.	2505.00603	null
2025-05-02	Fast and Low-Cost Genomic Foundation Models via Outlier Removal	Haozheng Luo et.al.	2505.00598	link
2025-05-01	Block Circulant Adapter for Large Language Models	Xinyu Ding et.al.	2505.00582	null
2025-05-01	Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors	Xinyu Ding et.al.	2505.00580	null
2025-05-01	FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension	Jushi Kai et.al.	2505.00570	null
2025-05-01	Triggering Hallucinations in LLMs: A Quantitative Study of Prompt-Induced Hallucination in Large Language Models	Makoto Sato et.al.	2505.00557	null
2025-05-02	100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models	Chong Zhang et.al.	2505.00551	null
2025-05-01	Leveraging Partial SMILES Validation Scheme for Enhanced Drug Design in Reinforcement Learning Frameworks	Xinyu Wang et.al.	2505.00530	null
2025-05-01	HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection	Deanna Emery et.al.	2505.00506	null
2025-05-01	UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces	Alaa Saleh et.al.	2505.00472	null
2025-05-01	A General Model for Linearly Polarized Optical Vector Beams	Jonathan Nichols et.al.	2505.00471	null
2025-05-01	Red Teaming Large Language Models for Healthcare	Vahid Balazadeh et.al.	2505.00467	null
2025-05-01	Data Therapist: Eliciting Domain Knowledge from Subject Matter Experts Using Large Language Models	Sungbok Shin et.al.	2505.00455	null
2025-05-01	Distributed Retrieval-Augmented Generation	Chenhao Xu et.al.	2505.00443	link
2025-05-01	CSE-SFP: Enabling Unsupervised Sentence Representation Learning via a Single Forward Pass	Bowen Zhang et.al.	2505.00389	link
2025-05-01	Urban Air Mobility as a System of Systems: An LLM-Enhanced Holonic Approach	Ahmed R. Sadik et.al.	2505.00368	null
2025-05-01	KoACD: The First Korean Adolescent Dataset for Cognitive Distortion Analysis	JunSeo Kim et.al.	2505.00367	null
2025-05-01	R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training	Albert Ge et.al.	2505.00358	null
2025-05-01	LLMPrism: Black-box Performance Diagnosis for Production LLM Training Platforms	Zhihan Jiang et.al.	2505.00342	null
2025-05-01	T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation	Xuyang Guo et.al.	2505.00337	null
2025-05-01	Quaternion Wavelet-Conditioned Diffusion Models for Image Super-Resolution	Luigi Sigillo et.al.	2505.00334	null
2025-05-01	Communication-Efficient Wireless Federated Fine-Tuning for Large-Scale AI Models	Bumjun Kim et.al.	2505.00333	null
2025-05-01	Mixture of Sparse Attention: Content-Based Learnable Sparse Attention via Expert-Choice Routing	Piotr Piękos et.al.	2505.00315	link
2025-05-01	Large Language Models as AI Agents for Digital Atoms and Molecules: Catalyzing a New Era in Computational Biophysics	Yijie Xia et.al.	2505.00270	null
2025-05-01	EnronQA: Towards Personalized RAG over Private Documents	Michael J. Ryan et.al.	2505.00263	null
2025-05-01	LLM-Based Threat Detection and Prevention Framework for IoT Ecosystems	Yazan Otoum et.al.	2505.00240	null
2025-05-02	Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks	Vishnu Sarukkai et.al.	2505.00234	null
2025-05-01	Scaling On-Device GPU Inference for Large Generative Models	Jiuqiang Tang et.al.	2505.00232	null
2025-05-01	ReXGradient-160K: A Large-Scale Publicly Available Dataset of Chest Radiographs with Free-text Reports	Xiaoman Zhang et.al.	2505.00228	null
2025-04-30	RAIL in the Wild: Operationalizing Responsible AI Evaluation Using Anthropic’s Value Dataset	Sumit Verma et.al.	2505.00204	null
2025-04-30	Generative Multimodal Multiscale Data Fusion for Digital Twins in Aerosol Jet Electronics Printing	Fatemeh Elhambakhsh et.al.	2505.00176	null
2025-04-30	GEOM-Drugs Revisited: Toward More Chemically Accurate Benchmarks for 3D Molecule Generation	Filipp Nikitin et.al.	2505.00169	link
2025-04-30	V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving	Jannik Lübberstedt et.al.	2505.00156	null
2025-04-30	Audo-Sight: Enabling Ambient Interaction For Blind And Visually Impaired Individuals	Bhanuja Ainary et.al.	2505.00153	null
2025-04-30	AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models	Yinghui He et.al.	2505.00147	null
2025-04-30	When Deep Learning Meets Information Retrieval-based Bug Localization: A Survey	Feifei Niu et.al.	2505.00144	null
2025-04-30	Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs	Jinyan Su et.al.	2505.00127	null
2025-04-30	Fine-Tuning LLMs for Low-Resource Dialect Translation: The Case of Lebanese	Silvana Yakhni et.al.	2505.00114	link
2025-04-30	CoordField: Coordination Field for Agentic UAV Task Allocation In Low-altitude Urban Scenarios	Tengchao Zhang et.al.	2505.00091	null
2025-04-30	Materials discovery acceleration by using condition generative methodology	Caiyuan Ye et.al.	2505.00076	link
2025-04-30	ConSens: Assessing context grounding in open-book question answering	Ivan Vankov et.al.	2505.00065	null
2025-04-30	GDI-Bench: A Benchmark for General Document Intelligence with Vision and Reasoning Decoupling	Siqi Li et.al.	2505.00063	null
2025-04-30	Enhancing Security and Strengthening Defenses in Automated Short-Answer Grading Systems	Sahar Yarmohammadtoosky et.al.	2505.00061	null
2025-04-30	Fact-Consistency Evaluation of Text-to-SQL Generation for Business Intelligence Using Exaone 3.5	Jeho Choi et.al.	2505.00060	null
2025-04-30	A Report on the llms evaluating the high school questions	Zhu Jiawei et.al.	2505.00057	null
2025-04-30	ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction	Qihao Liu et.al.	2504.21855	null
2025-04-30	TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments	Sichang Tu et.al.	2504.21851	null
2025-04-30	COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning	Xindi Wu et.al.	2504.21850	null
2025-04-30	3D Stylization via Large Reconstruction Model	Ipek Oztas et.al.	2504.21836	null
2025-04-30	From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems	Huan Zhang et.al.	2504.21815	null
2025-04-30	Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields	Yixin Gao et.al.	2504.21814	null
2025-04-30	A simple and effective approach for body part recognition on CT scans based on projection estimation	Franko Hrzic et.al.	2504.21810	null
2025-04-30	An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding	Xiuwei Shang et.al.	2504.21803	null
2025-04-30	DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition	Z. Z. Ren et.al.	2504.21801	link
2025-04-30	MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness	Junsheng Huang et.al.	2504.21773	null
2025-04-30	Anatomical Similarity as a New Metric to Evaluate Brain Generative Models	Bahram Jafrasteh et.al.	2504.21771	null
2025-04-30	LASHED: LLMs And Static Hardware Analysis for Early Detection of RTL Bugs	Baleegh Ahmad et.al.	2504.21770	null
2025-04-30	LLM-based Interactive Imitation Learning for Robotic Manipulation	Jonas Werner et.al.	2504.21769	link
2025-04-30	Investigating Literary Motifs in Ancient and Medieval Novels with Large Language Models	Emelie Hallenberg et.al.	2504.21742	null
2025-04-30	TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training	Shengqian Wang et.al.	2504.21735	null
2025-04-30	XBreaking: Explainable Artificial Intelligence for Jailbreaking LLMs	Marco Arazzi et.al.	2504.21700	null
2025-04-30	Visual Text Processing: A Comprehensive Review and Unified Evaluation	Yan Shu et.al.	2504.21682	link
2025-04-30	Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs	Pan Suo et.al.	2504.21680	null
2025-04-30	Traceback of Poisoning Attacks to Retrieval-Augmented Generation	Baolei Zhang et.al.	2504.21668	null
2025-04-30	From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising	Jingwen Cai et.al.	2504.21667	null
2025-04-30	AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization	Haotian Luo et.al.	2504.21659	link
2025-04-30	Sadeed: Advancing Arabic Diacritization Through Small Language Model	Zeina Aldallal et.al.	2504.21635	null
2025-04-30	Meeseeks: An Iterative Benchmark Evaluating LLMs Multi-Turn Instruction-Following Ability	Jiaming Wang et.al.	2504.21625	null
2025-04-30	RDF-Based Structured Quality Assessment Representation of Multilingual LLM Evaluations	Jonas Gwozdz et.al.	2504.21605	null
2025-04-30	Leveraging Pre-trained Large Language Models with Refined Prompting for Online Task and Motion Planning	Huihui Guo et.al.	2504.21596	null
2025-04-30	MF-LLM: Simulating Collective Decision Dynamics via a Mean-Field Large Language Model Framework	Qirui Mi et.al.	2504.21582	null
2025-04-30	Generative AI in Financial Institution: A Global Survey of Opportunities, Threats, and Regulation	Bikash Saha et.al.	2504.21574	null
2025-04-29	A Systematic Literature Review of Parameter-Efficient Fine-Tuning for Large Code Models	Md Zahidul Haque et.al.	2504.21569	link
2025-04-30	eNCApsulate: NCA for Precision Diagnosis on Capsule Endoscopes	Henry John Krumb et.al.	2504.21562	null
2025-04-30	Iterative Trajectory Exploration for Multimodal Agents	Pengxiang Li et.al.	2504.21561	null
2025-04-30	Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models	Lucas Maisonnave et.al.	2504.21553	null
2025-04-30	Consistency-aware Fake Videos Detection on Short Video Platforms	Junxi Wang et.al.	2504.21495	link
2025-04-30	GarmentDiffusion: 3D Garment Sewing Pattern Generation with Multimodal Diffusion Transformers	Xinyu Li et.al.	2504.21476	null
2025-04-30	Rethinking Visual Layer Selection in Multimodal LLMs	Haoran Chen et.al.	2504.21447	null
2025-04-30	SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding	Chenkai Zhang et.al.	2504.21435	link
2025-04-30	UAV-VLN: End-to-End Vision Language guided Navigation for UAVs	Pranav Saxena et.al.	2504.21432	null
2025-04-30	Diff-Prompt: Diffusion-Driven Prompt Generator with Mask Supervision	Weicai Yan et.al.	2504.21423	null
2025-04-30	Galvatron: An Automatic Distributed System for Efficient Foundation Model Training	Xinyi Liu et.al.	2504.21411	link
2025-04-30	Who Gets the Callback? Generative AI and Gender Bias	Sugat Chaturvedi et.al.	2504.21400	null
2025-04-30	Sparse-to-Sparse Training of Diffusion Models	Inês Cardoso Oliveira et.al.	2504.21380	null
2025-04-30	Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction	Máté Gedeon et.al.	2504.21372	null
2025-04-30	Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing	Hong Zhang et.al.	2504.21356	link
2025-04-30	Generative QoE Modeling: A Lightweight Approach for Telecom Networks	Vinti Nayar et.al.	2504.21353	null
2025-04-30	UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation	Linshan Wu et.al.	2504.21336	link
2025-04-30	Simple Visual Artifact Detection in Sora-Generated Videos	Misora Sugiyama et.al.	2504.21334	null
2025-04-30	Does the Prompt-based Large Language Model Recognize Students’ Demographics and Introduce Bias in Essay Scoring?	Kaixun Yang et.al.	2504.21330	null
2025-04-30	Covert Prompt Transmission for Secure Large Language Model Services	Ruichen Zhang et.al.	2504.21311	null
2025-04-30	An Evaluation of a Visual Question Answering Strategy for Zero-shot Facial Expression Recognition in Still Images	Modesto Castrillón-Santana et.al.	2504.21309	null
2025-04-30	Confidence in Large Language Model Evaluation: A Bayesian Approach to Limited-Sample Challenges	Xiao Xiao et.al.	2504.21303	null
2025-04-30	BiasGuard: A Reasoning-enhanced Bias Detection Tool For Large Language Models	Zhiting Fan et.al.	2504.21299	null
2025-04-30	NEP89: Universal neuroevolution potential for inorganic and organic materials across 89 elements	Ting Liang et.al.	2504.21286	link
2025-04-30	Birdie: Natural Language-Driven Table Discovery Using Differentiable Search Index	Yuxiang Guo et.al.	2504.21282	null
2025-04-30	Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models	Guanghao Zhou et.al.	2504.21277	null
2025-04-30	CoCoDiff: Diversifying Skeleton Action Features via Coarse-Fine Text-Co-Guided Latent Diffusion	Zhifu Zhao et.al.	2504.21266	null
2025-04-30	Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA	Xuanzhao Dong et.al.	2504.21252	link
2025-04-30	Memorization and Knowledge Injection in Gated LLMs	Xu Pan et.al.	2504.21239	null
2025-04-30	Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math	Haoran Xu et.al.	2504.21233	null
2025-04-29	CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks	Rui Wang et.al.	2504.21228	null
2025-04-29	Theoretical Foundations for Semantic Cognition in Artificial Intelligence	Sebastian Dumbrava et.al.	2504.21218	null
2025-04-29	A Cost-Effective LLM-based Approach to Identify Wildlife Trafficking in Online Marketplaces	Juliana Barbosa et.al.	2504.21211	null
2025-04-29	Automatic Legal Writing Evaluation of LLMs	Ramon Pires et.al.	2504.21202	link
2025-04-29	Graph Synthetic Out-of-Distribution Exposure with Large Language Models	Haoyan Xu et.al.	2504.21198	null
2025-04-29	Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare	Lovedeep Gondara et.al.	2504.21191	null
2025-04-29	LIFT: LLM-Based Pragma Insertion for HLS via GNN Supervised Fine-Tuning	Neha Prakriya et.al.	2504.21187	null
2025-04-29	GLIP-OOD: Zero-Shot Graph OOD Detection with Foundation Model	Haoyan Xu et.al.	2504.21186	null
2025-05-01	AI-in-the-Loop Planning for Transportation Electrification: Case Studies from Austin, Texas	Seung Jun Choi et.al.	2504.21185	null
2025-04-29	Efficient LLMs with AMP: Attention Heads and MLP Pruning	Leandro Giusti Mugnaini et.al.	2504.21174	null
2025-04-29	Detecting Manipulated Contents Using Knowledge-Grounded Inference	Mark Huasong Meng et.al.	2504.21165	link
2025-04-29	LLM Enhancer: Merged Approach using Vector Embedding for Reducing Large Language Model Hallucinations with External Knowledge	Naheed Rayhan et.al.	2504.21132	null
2025-04-29	Optimized Quantum Embedding: A Universal Minor-Embedding Framework for Large Complete Bipartite Graph	Salvatore Sinno et.al.	2504.21112	null
2025-04-29	A Survey on Parameter-Efficient Fine-Tuning for Foundation Models in Federated Learning	Jieming Bian et.al.	2504.21099	null
2025-04-29	ProT-GFDM: A Generative Fractional Diffusion Model for Protein Generation	Xiao Liang et.al.	2504.21092	null
2025-04-29	On the Potential of Large Language Models to Solve Semantics-Aware Process Mining Tasks	Adrian Rebmann et.al.	2504.21074	null
2025-04-29	YoChameleon: Personalized Vision and Language Generation	Thao Nguyen et.al.	2504.20998	null
2025-04-29	Toward Efficient Exploration by Large Language Model Agents	Dilip Arumugam et.al.	2504.20997	null
2025-04-29	X-Fusion: Introducing New Modality to Frozen Large Language Models	Sicheng Mo et.al.	2504.20996	null
2025-04-29	TesserAct: Learning 4D Embodied World Models	Haoyu Zhen et.al.	2504.20995	null
2025-04-29	ACE: A Security Architecture for LLM-Integrated App Systems	Evan Li et.al.	2504.20984	null
2025-04-29	Real-Time Wayfinding Assistant for Blind and Low-Vision Users	Dabbrata Das et.al.	2504.20976	null
2025-04-29	SetKE: Knowledge Editing for Knowledge Elements Overlap	Yifan Wei et.al.	2504.20972	null
2025-04-29	OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification	Shangyu Li et.al.	2504.20964	link
2025-04-29	Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models	Maryna Vyshnyvetska et.al.	2504.20951	null
2025-04-30	Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition	Tyler McDonald et.al.	2504.20946	null
2025-04-29	ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification	Ziqing Fan et.al.	2504.20930	link
2025-04-30	End-to-end Audio Deepfake Detection from RAW Waveforms: a RawNet-Based Approach with Cross-Dataset Evaluation	Andrea Di Pierno et.al.	2504.20923	link
2025-04-29	An Empirical Study on the Capability of LLMs in Decomposing Bug Reports	Zhiyuan Chen et.al.	2504.20911	null
2025-04-29	Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers	Quentin Guimard et.al.	2504.20902	null
2025-04-29	Evaluating Generative Models for Tabular Data: Novel Metrics and Benchmarking	Dayananda Herurkar et.al.	2504.20900	null
2025-04-29	LELANTE: LEveraging LLM for Automated ANdroid TEsting	Shamit Fatin et.al.	2504.20896	null
2025-04-29	The Leaderboard Illusion	Shivalika Singh et.al.	2504.20879	null
2025-04-29	AI-GenBench: A New Ongoing Benchmark for AI-Generated Image Detection	Lorenzo Pellegrini et.al.	2504.20865	null
2025-04-29	Universal language model with the intervention of quantum theory	D.-F. Qin et.al.	2504.20839	null
2025-04-29	Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning	Hongfei Xue et.al.	2504.20835	null
2025-04-29	Reinforcement Learning for LLM Reasoning Under Memory Constraints	Alan Lee et.al.	2504.20834	null
2025-04-30	Ascendra: Dynamic Request Prioritization for Efficient LLM Serving	Azam Ikram et.al.	2504.20828	null
2025-04-29	Secure Coding with AI, From Creation to Inspection	Vladislav Belozerov et.al.	2504.20814	null
2025-04-30	Unlocking User-oriented Pages: Intention-driven Black-box Scanner for Real-world Web Applications	Weizhe Wang et.al.	2504.20801	null
2025-04-29	Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges	Yunseo Lee et.al.	2504.20799	null
2025-04-29	Q-Fusion: Diffusing Quantum Circuits	Collin Beaudoin et.al.	2504.20794	null
2025-04-29	Using LLMs in Generating Design Rationale for Software Architecture Decisions	Xiyu Zhou et.al.	2504.20781	link
2025-04-29	Turing Machine Evaluation for Large Language Model	Haitao Wu et.al.	2504.20771	link
2025-04-29	Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption	Wenxiao Wang et.al.	2504.20769	null
2025-04-29	Understanding Large Language Model Supply Chain: Structure, Domain, and Vulnerabilities	Yanzhe Hu et.al.	2504.20763	null
2025-04-29	DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs	Hao Luan et.al.	2504.20754	null
2025-04-29	Learning a General Model: Folding Clothing with Topological Dynamics	Yiming Liu et.al.	2504.20720	null
2025-04-29	Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think	Hasan Abed Al Kader Hammoud et.al.	2504.20708	null
2025-04-29	What’s Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models	Jan Kapar et.al.	2504.20687	link
2025-04-29	Identifying Uncertainty in Self-Adaptive Robotics with Large Language Models	Hassan Sartaj et.al.	2504.20684	null
2025-04-29	CoCo-Bench: A Comprehensive Code Benchmark For Multi-task Large Language Model Evaluation	Wenjing Yin et.al.	2504.20673	null
2025-04-29	A Generative-AI-Driven Claim Retrieval System Capable of Detecting and Retrieving Claims from Social Media Platforms in Multiple Languages	Ivan Vykopal et.al.	2504.20668	link
2025-04-29	ComplexVCoder: An LLM-Driven Framework for Systematic Generation of Complex Verilog Code	Jian Zuo et.al.	2504.20653	null
2025-04-29	Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection	Ziqing Fan et.al.	2504.20644	null
2025-04-29	Cooking Up Creativity: A Cognitively-Inspired Approach for Enhancing LLM Creativity through Structured Representations	Moran Mizrahi et.al.	2504.20643	link
2025-04-29	Bridging the Generalisation Gap: Synthetic Data Generation for Multi-Site Clinical Model Validation	Bradley Segal et.al.	2504.20635	link
2025-04-29	ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting	Yu Zhang et.al.	2504.20630	null
2025-04-29	Cognitive maps are generative programs	Marta Kryven et.al.	2504.20628	null
2025-04-29	DiffusionRIR: Room Impulse Response Interpolation using Diffusion Models	Sagi Della Torre et.al.	2504.20625	null
2025-04-29	PaRT: Enhancing Proactive Social Chatbots with Personalized Real-Time Retrieval	Zihan Niu et.al.	2504.20624	null
2025-04-29	The Hidden Risks of LLM-Generated Web Application Code: A Security-Centric Evaluation of Code Generation Capabilities in Large Language Models	Swaroop Dora et.al.	2504.20612	null
2025-04-29	Information Retrieval in the Age of Generative AI: The RGB Model	Michele Garetto et.al.	2504.20610	link
2025-04-29	WenyanGPT: A Large Language Model for Classical Chinese Tasks	Xinyu Yao et.al.	2504.20609	null
2025-04-29	Reinforcement Learning for Reasoning in Large Language Models with One Training Example	Yiping Wang et.al.	2504.20571	link
2025-04-29	ReCIT: Reconstructing Full Private Data from Gradient in Parameter-Efficient Fine-Tuning of Large Language Models	Jin Xie et.al.	2504.20570	null
2025-04-29	BrAIcht, a theatrical agent that speaks like Bertolt Brecht’s characters	Baz Roland et.al.	2504.20552	null
2025-04-29	TriniMark: A Robust Generative Speech Watermarking Method for Trinity-Level Attribution	Yue Li et.al.	2504.20532	null
2025-04-30	Conversations with AI Chatbots Increase Short-Term Vaccine Intentions But Do Not Outperform Standard Public Health Messaging	Neil K. R. Sehgal et.al.	2504.20519	null
2025-04-29	MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living	Xi Chen et.al.	2504.20505	null
2025-04-29	SAM-Guided Robust Representation Learning for One-Shot 3D Medical Image Segmentation	Jia Wang et.al.	2504.20501	null
2025-04-29	UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation	Huimin Lu et.al.	2504.20500	link
2025-04-29	Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression	Yu Cui et.al.	2504.20493	null
2025-04-29	Enhancing LLM Language Adaption through Cross-lingual In-Context Pre-training	Linjuan Wu et.al.	2504.20484	null
2025-04-29	Robustness via Referencing: Defending against Prompt Injection Attacks by Referencing the Executed Instruction	Yulin Chen et.al.	2504.20472	null
2025-04-29	Fane at SemEval-2025 Task 10: Zero-Shot Entity Framing with Large Language Models	Enfa Fane et.al.	2504.20469	link
2025-04-29	A Summary on GUI Agents with Foundation Models Enhanced by Reinforcement Learning	Jiahao Li et.al.	2504.20464	null
2025-04-30	TAMO: Fine-Grained Root Cause Analysis via Tool-Assisted LLM Agent with Multi-Modality Observation Data	Qi Wang et.al.	2504.20462	null
2025-04-29	SAS-Prompt: Large Language Models as Numerical Optimizers for Robot Self-Improvement	Heni Ben Amor et.al.	2504.20459	null
2025-04-29	Enhancing News Recommendation with Hierarchical LLM Prompting	Hai-Dang Kieu et.al.	2504.20452	null
2025-04-29	GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection	DiJia Su et.al.	2504.20437	null
2025-04-29	RV-Syn: Rational and Verifiable Mathematical Reasoning Data Synthesis based on Structured Function Library	Jiapeng Wang et.al.	2504.20426	null
2025-04-29	Plant Disease Detection through Multimodal Large Language Models and Convolutional Neural Networks	Konstantinos I. Roumeliotis et.al.	2504.20419	null
2025-04-29	Enhancing Leakage Attacks on Searchable Symmetric Encryption Using LLM-Based Synthetic Data Generation	Joshua Chiu et.al.	2504.20414	link
2025-04-29	CrashFixer: A crash resolution agent for the Linux kernel	Alex Mathai et.al.	2504.20412	null
2025-04-29	Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs	Paiheng Xu et.al.	2504.20406	null
2025-04-29	FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding	Yanan Guo et.al.	2504.20384	null
2025-04-29	Generative Learning for Slow Manifolds and Bifurcation Diagrams	Ellis R. Crabtree et.al.	2504.20375	null
2025-04-29	DMDTEval: An Evaluation and Analysis of LLMs on Disambiguation in Multi-domain Translation	Zhibo Man et.al.	2504.20371	null
2025-04-29	Thoughtful, Confused, or Untrustworthy: How Text Presentation Influences Perceptions of AI Writing Tools	David Zhou et.al.	2504.20365	null
2025-04-29	PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking	Xiatao Sun et.al.	2504.20359	null
2025-04-29	Local Prompt Optimization	Yash Jain et.al.	2504.20355	null
2025-04-29	CarbonCall: Sustainability-Aware Function Calling for Large Language Models on Edge Devices	Varatheepan Paramanayakam et.al.	2504.20348	null
2025-04-29	“I’ve talked to ChatGPT about my issues last night.”: Examining Mental Health Conversations with Large Language Models through Reddit Analysis	Kyuha Jung et.al.	2504.20320	null
2025-04-28	DeepAndes: A Self-Supervised Vision Foundation Model for Multi-Spectral Remote Sensing Imagery of the Andes	Junlin Guo et.al.	2504.20303	null
2025-04-28	FALCO: a Foundation model of Astronomical Light Curves for time dOmain astronomy	Xiaoxiong Zuo et.al.	2504.20290	null
2025-04-28	Image Interpolation with Score-based Riemannian Metrics of Diffusion Models	Shinnosuke Saito et.al.	2504.20288	null
2025-04-28	Enhancing Systematic Reviews with Large Language Models: Using GPT-4 and Kimi	Dandan Chen Kaptur et.al.	2504.20276	null
2025-04-28	Can Large Language Models Learn Formal Logic? A Data-Driven Training and Evaluation Framework	Yuan Xia et.al.	2504.20213	null
2025-04-28	Prompting LLMs for Code Editing: Struggles and Remedies	Daye Nam et.al.	2504.20196	null
2025-04-28	BLADE: Benchmark suite for LLM-driven Automated Design and Evolution of iterative optimisation heuristics	Niki van Stein et.al.	2504.20183	null
2025-04-28	Integration Flow Models	Jingjing Wang et.al.	2504.20179	null
2025-04-28	Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models	Zae Myung Kim et.al.	2504.20157	link
2025-04-28	AutoJudge: Judge Decoding Without Manual Annotation	Roman Garipov et.al.	2504.20039	null
2025-04-28	SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning	Wufei Ma et.al.	2504.20024	null
2025-04-28	Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages	Pritika Rohera et.al.	2504.20022	null
2025-04-28	Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models	Xin Wang et.al.	2504.20020	null
2025-04-29	LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation	Beizhe Hu et.al.	2504.20013	null
2025-04-28	Towards Automated Scoping of AI for Social Good Projects	Jacob Emmerson et.al.	2504.20010	null
2025-04-28	Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom	Rishika Sen et.al.	2504.20000	null
2025-04-28	HJRNO: Hamilton-Jacobi Reachability with Neural Operators	Yankai Li et.al.	2504.19989	null
2025-04-28	TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons	Emre Can Acikgoz et.al.	2504.19982	null
2025-04-28	Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets	Adam Younsi et.al.	2504.19981	null
2025-04-29	From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification	Junhao Ye et.al.	2504.19959	null
2025-04-28	Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI	Hugo Georgenthum et.al.	2504.19918	null
2025-04-28	Can AI Agents Design and Implement Drug Discovery Pipelines?	Khachik Smbatyan et.al.	2504.19912	null
2025-04-28	GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets	Mingqian He et.al.	2504.19898	null
2025-04-28	CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition	Quynh Phung et.al.	2504.19894	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876	link
2025-04-28	semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage	Ke Hong et.al.	2504.19867	null
2025-04-28	CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback	Chenhan Jiang et.al.	2504.19860	null
2025-04-29	The Automation Advantage in AI Red Teaming	Rob Mulla et.al.	2504.19855	null
2025-04-28	Do You Know the Way? Human-in-the-Loop Understanding for Fast Traversability Estimation in Mobile Robotics	Andre Schreiber et.al.	2504.19851	link
2025-04-28	Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration	Juhan Park et.al.	2504.19847	null
2025-04-28	LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects	Guangyi Liu et.al.	2504.19838	link
2025-04-28	PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping	Feng Chen et.al.	2504.19818	link
2025-04-28	Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance	Takuya Tamura et.al.	2504.19811	null
2025-04-28	Contextures: The Mechanism of Representation Learning	Runtian Zhai et.al.	2504.19792	null
2025-04-28	Heterophily-informed Message Passing	Haishan Wang et.al.	2504.19785	null
2025-04-29	If Concept Bottlenecks are the Question, are Foundation Models the Answer?	Nicola Debole et.al.	2504.19774	link
2025-04-28	Moral Reasoning Across Languages: The Critical Role of Low-Resource Languages in LLMs	Huichi Zhou et.al.	2504.19759	null
2025-04-28	Reconstructing Context: Evaluating Advanced Chunking Strategies for Retrieval-Augmented Generation	Carlo Merola et.al.	2504.19754	link
2025-04-28	FineQ: Software-Hardware Co-Design for Low-Bit Fine-Grained Mixed-Precision Quantization of LLMs	Xilong Xie et.al.	2504.19746	null
2025-04-28	LLM-Assisted Automated Deductive Coding of Dialogue Data: Leveraging Dialogue-Specific Characteristics to Enhance Contextual Understanding	Ying Na et.al.	2504.19734	null
2025-04-28	RepText: Rendering Visual Text via Replicating	Haofan Wang et.al.	2504.19724	null
2025-04-28	Taming the Titans: A Survey of Efficient LLM Inference Serving	Ranran Zhen et.al.	2504.19720	link
2025-04-28	Pixels2Points: Fusing 2D and 3D Features for Facial Skin Segmentation	Victoria Yue Chen et.al.	2504.19718	null
2025-04-28	Guided Tensor Lifting	Yixuan Li et.al.	2504.19705	null
2025-04-28	From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review	Mohamed Amine Ferrag et.al.	2504.19678	null
2025-04-28	Annif at SemEval-2025 Task 5: Traditional XMTC augmented by LLMs	Osma Suominen et.al.	2504.19675	link
2025-04-28	SAGE: A Generic Framework for LLM Safety Evaluation	Madhur Jindal et.al.	2504.19674	link
2025-04-28	A Tripartite Perspective on GraphRAG	Michael Banf et.al.	2504.19667	null
2025-04-28	Decentralization of Generative AI via Mixture of Experts for Wireless Networks: A Comprehensive Survey	Yunting Xu et.al.	2504.19660	null
2025-04-28	Intelligent4DSE: Optimizing High-Level Synthesis Design Space Exploration with Graph Neural Networks and Large Language Models	Lei Xu et.al.	2504.19649	null
2025-04-28	Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search	Fei Liu et.al.	2504.19636	null
2025-04-28	DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer	Junpeng Jiang et.al.	2504.19614	null
2025-04-28	Scene2Hap: Combining LLMs and Physical Modeling for Automatically Generating Vibrotactile Signals for Full VR Scenes	Arata Jingu et.al.	2504.19611	null
2025-04-28	Coreference Resolution for Vietnamese Narrative Texts	Hieu-Dai Tran et.al.	2504.19606	null
2025-04-28	GVPO: Group Variance Policy Optimization for Large Language Model Post-Training	Kaichen Zhang et.al.	2504.19599	null
2025-04-28	Towards Robust Multimodal Physiological Foundation Models: Handling Arbitrary Missing Modalities	Xi Fu et.al.	2504.19596	null
2025-04-28	Mapping the Italian Telegram Ecosystem	Lorenzo Alvisi et.al.	2504.19594	null
2025-04-28	Graph-Based Spectral Decomposition for Parameter Coordination in Language Model Fine-Tuning	Hanlu Zhang et.al.	2504.19583	null
2025-04-28	m-KAILIN: Knowledge-Driven Agentic Scientific Corpus Distillation Framework for Biomedical Large Language Models Training	Meng Xiao et.al.	2504.19565	null
2025-04-28	Quantifying Memory Utilization with Effective State-Size	Rom N. Parnichkun et.al.	2504.19561	null
2025-04-28	Detecting Effects of AI-Mediated Communication on Language Complexity and Sentiment	Kristen Sussman et.al.	2504.19556	null
2025-04-28	DEEMO: De-identity Multimodal Emotion Recognition and Reasoning	Deng Li et.al.	2504.19549	null
2025-04-28	Towards Faster and More Compact Foundation Models for Molecular Property Prediction	Yasir Ghunaim et.al.	2504.19538	link
2025-04-28	LR-IAD: Mask-Free Industrial Anomaly Detection with Logical Reasoning	Peijian Zeng et.al.	2504.19524	null
2025-04-28	FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation	Ke Hong et.al.	2504.19519	null
2025-04-28	Evolution of Cooperation in LLM-Agent Societies: A Preliminary Study Using Different Punishment Strategies	Kavindu Warnakulasuriya et.al.	2504.19487	null
2025-04-28	Improving Reasoning Performance in Large Language Models via Representation Engineering	Bertram Højer et.al.	2504.19483	null
2025-04-28	An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination	Dixiao Wei et.al.	2504.19480	null
2025-04-28	BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text	Jiageng Wu et.al.	2504.19467	link
2025-04-28	Do Automatic Comment Generation Techniques Fall Short? Exploring the Influence of Method Dependencies on Code Understanding	Md Mustakim Billah et.al.	2504.19459	null
2025-04-28	Towards Long Context Hallucination Detection	Siyi Liu et.al.	2504.19457	null
2025-04-28	Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition	Yuki Hirakawa et.al.	2504.19455	null
2025-04-28	R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference	Zhenyu Zhang et.al.	2504.19449	null
2025-04-28	Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks	Yi-Long Lu et.al.	2504.19445	null
2025-04-28	Large Language Models are Qualified Benchmark Builders: Rebuilding Pre-Training Datasets for Advancing Code Intelligence Tasks	Kang Yang et.al.	2504.19444	null
2025-04-28	Context-Guided Dynamic Retrieval for Improving Generation Quality in RAG Models	Jacky He et.al.	2504.19436	null
2025-04-29	MER 2025: When Affective Computing Meets Large Language Models	Zheng Lian et.al.	2504.19423	null
2025-04-28	Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory	Prateek Chhikara et.al.	2504.19413	null
2025-04-29	Context Selection and Rewriting for Video-based Educational Question Generation	Mengxia Yu et.al.	2504.19406	link
2025-04-27	LLMs for Engineering: Teaching Models to Design High Powered Rockets	Toby Simonds et.al.	2504.19394	null
2025-04-27	From Inductive to Deductive: LLMs-Based Qualitative Data Analysis in Requirements Engineering	Syed Tauhid Ullah Shah et.al.	2504.19384	link
2025-04-27	Flow Along the K-Amplitude for Generative Modeling	Weitao Du et.al.	2504.19353	null
2025-04-27	Contextual Online Uncertainty-Aware Preference Learning for Human Feedback	Nan Lu et.al.	2504.19342	null
2025-04-27	OpenFOAMGPT 2.0: end-to-end, trustworthy automation for computational fluid dynamics	Jingsen Feng et.al.	2504.19338	null
2025-04-29	Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing	James O’ Neill et.al.	2504.19333	null
2025-04-27	BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese	Peilin Zhou et.al.	2504.19314	link
2025-04-27	AndroidGen: Building an Android Language Agent under Data Scarcity	Hanyu Lai et.al.	2504.19298	link
2025-04-27	Multiscale Roughness of Upper Mantle Discontinuities Inferred from the USArray: Dependence on Tomography Models	Yinzhi Wang et.al.	2504.19290	null
2025-04-27	Generalized Score Matching: Bridging $f$-Divergence and Statistical Estimation Under Correlated Noise	Yirong Shen et.al.	2504.19288	null
2025-04-27	Small Models, Big Tasks: An Exploratory Empirical Study on Small Language Models for Function Calling	Ishan Kavathekar et.al.	2504.19277	link
2025-04-27	Anyprefer: An Agentic Framework for Preference Data Synthesis	Yiyang Zhou et.al.	2504.19276	null
2025-04-27	OpenFusion++: An Open-vocabulary Real-time Scene Understanding System	Xiaofeng Jin et.al.	2504.19266	null
2025-04-27	The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach	Chad Coleman et.al.	2504.19255	null
2025-04-27	Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers	Dylan Bouchard et.al.	2504.19254	link
2025-04-27	CARL: Camera-Agnostic Representation Learning for Spectral Image Analysis	Alexander Baumann et.al.	2504.19223	null
2025-04-27	AlphaFuse: Learn ID Embeddings for Sequential Recommendation in Null Space of Language Embeddings	Guoqing Hu et.al.	2504.19218	link
2025-04-27	WuNeng: Hybrid State with Attention	Liu Xiao et.al.	2504.19191	null
2025-04-27	Different behaviors of diffusing diffusivity dynamics based on three different definitions of fractional Brownian motion	Wei Wang et.al.	2504.19190	null
2025-04-27	Hierarchical Attention Generates Better Proofs	Jianlong Chen et.al.	2504.19188	link
2025-04-27	Segmenting Objectiveness and Task-awareness Unknown Region for Autonomous Driving	Mi Zheng et.al.	2504.19183	null
2025-04-27	SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning	Jiaqi Chen et.al.	2504.19162	null
2025-04-27	Muyan-TTS: A Trainable Text-to-Speech Model Optimized for Podcast Scenarios with a $50K Budget	Xin Li et.al.	2504.19146	link
2025-04-27	ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development	Bowei Wang et.al.	2504.19144	link
2025-04-27	APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries	Huajian Xin et.al.	2504.19110	null
2025-04-27	A Multi-Language Perspective on the Robustness of LLM Code Generation	Fazle Rabbi et.al.	2504.19108	link
2025-04-27	Harmonizing Generalization and Personalization in Ring-topology Decentralized Federated Learning	Shunxin Guo et.al.	2504.19103	null
2025-04-27	Privacy-Preserving Federated Embedding Learning for Localized Retrieval-Augmented Generation	Qianren Mao et.al.	2504.19101	null
2025-04-27	VeriDebug: A Unified LLM for Verilog Debugging via Contrastive Embedding and Guided Correction	Ning Wang et.al.	2504.19099	null
2025-04-27	CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges	Yu Li et.al.	2504.19093	null
2025-04-25	Generalization Capability for Imitation Learning	Yixiao Wang et.al.	2504.18538	null
2025-04-25	TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation	Gwen Yidou Weng et.al.	2504.18535	null
2025-04-25	Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation	Shivam Duggal et.al.	2504.18509	null
2025-04-25	Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional	Sanjeev Raja et.al.	2504.18506	null
2025-04-25	Investigating Co-Constructive Behavior of Large Language Models in Explanation Dialogues	Leandra Fichtel et.al.	2504.18483	null
2025-04-25	Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation	Peiyuan Jing et.al.	2504.18453	null
2025-04-25	Kimi-Audio Technical Report	KimiTeam et.al.	2504.18425	link
2025-04-25	LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning	Rui Li et.al.	2504.18424	null
2025-04-25	LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection	Rajesh Yarra et.al.	2504.18423	null
2025-04-25	BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs	Hongyu Wang et.al.	2504.18415	null
2025-04-25	An Empirical Study of Evaluating Long-form Question Answering	Ning Xian et.al.	2504.18413	link
2025-04-25	Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers	Jared Moore et.al.	2504.18412	link
2025-04-25	HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?	Yusen Zhang et.al.	2504.18406	null
2025-04-25	HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models	Jens Hooge et.al.	2504.18405	null
2025-04-25	Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization	Kesen Zhao et.al.	2504.18397	link
2025-04-25	Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation	Qidong Liu et.al.	2504.18383	null
2025-04-25	Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant	Lei Shen et.al.	2504.18373	link
2025-04-25	ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications	Felix Viktor Jedrzejewski et.al.	2504.18369	null
2025-04-25	Enhanced Sampling, Public Dataset and Generative Model for Drug-Protein Dissociation Dynamics	Maodong Li et.al.	2504.18367	null
2025-04-25	Testing Individual Fairness in Graph Neural Networks	Roya Nasiri et.al.	2504.18353	null
2025-04-25	Revisiting Data Auditing in Large Vision-Language Models	Hongyu Zhu et.al.	2504.18349	null
2025-04-25	Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review	Toghrul Abbasli et.al.	2504.18346	null
2025-04-25	Large Language Models to Accelerate Organic Chemistry Synthesis	Yu Zhang et.al.	2504.18340	null
2025-04-25	SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations	Shuting Zhao et.al.	2504.18332	null
2025-04-25	Towards Adaptive Software Agents for Debugging	Yacine Majdoub et.al.	2504.18316	null
2025-04-25	Artificial Intelligence health advice accuracy varies across languages and contexts	Prashant Garg et.al.	2504.18310	null
2025-04-25	Seeing Soundscapes: Audio-Visual Generation and Separation from Soundscapes Using Audio-Visual Separator	Minjae Kang et.al.	2504.18283	null
2025-04-25	LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method	Tao Wu et.al.	2504.18271	link
2025-04-25	TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation	Shintaro Ozaki et.al.	2504.18269	null
2025-04-25	MAGI: Multi-Agent Guided Interview for Psychiatric Assessment	Guanqun Bi et.al.	2504.18260	null
2025-04-25	SSL4Eco: A Global Seasonal Dataset for Geospatial Foundation Models in Ecology	Elena Plekhanova et.al.	2504.18256	null
2025-04-25	Efficient Single-Pass Training for Multi-Turn Reasoning	Ritesh Goru et.al.	2504.18246	null
2025-04-25	What is the Added Value of UDA in the VFM Era?	Brunó B. Englert et.al.	2504.18190	null
2025-04-25	Offline Learning of Controllable Diverse Behaviors	Mathieu Petitbois et.al.	2504.18160	null
2025-04-25	Leveraging Decoder Architectures for Learned Sparse Retrieval	Jingfen Qiao et.al.	2504.18151	null
2025-04-25	NoEsis: Differentially Private Knowledge Transfer in Modular LLM Adaptation	Rob Romijnders et.al.	2504.18147	null
2025-04-25	Score-Based Deterministic Density Sampling	Vasily Ilin et.al.	2504.18130	null
2025-04-25	Think, Prune, Train, Improve: Scaling Reasoning without Scaling Models	Caia Costello et.al.	2504.18116	null
2025-04-25	Comparative Study on the Discourse Meaning of Chinese and English Media in the Paris Olympics Based on LDA Topic Modeling Technology and LLM Prompt Engineering	Yinglong Yu et.al.	2504.18106	null
2025-04-25	Application and Optimization of Large Models Based on Prompt Tuning for Fact-Check-Worthiness Estimation	Yinglong Yu et.al.	2504.18104	null
2025-04-25	Random-Set Large Language Models	Muhammad Mubashar et.al.	2504.18085	null
2025-04-25	Automating Function-Level TARA for Automotive Full-Lifecycle Security	Yuqiao Yang et.al.	2504.18083	null
2025-04-25	Stabilizing Reasoning in Medical LLMs with Continued Pretraining and Reasoning Preference Optimization	Wataru Kawakami et.al.	2504.18080	null
2025-04-25	PropRAG: Guiding Retrieval with Beam Search over Proposition Paths	Jingjin Wang et.al.	2504.18070	null
2025-04-25	LLM-Guided Open RAN: Empowering Hierarchical RAN Intelligent Control	Lingyan Bao et.al.	2504.18062	null
2025-04-25	DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models	Jianyu Liu et.al.	2504.18053	link
2025-04-25	Validating Network Protocol Parsers with Traceable RFC Document Interpretation	Mingwei Zheng et.al.	2504.18050	null
2025-04-25	RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models	Bang An et.al.	2504.18041	null
2025-04-25	MultiMind: Enhancing Werewolf Agents with Multimodal Reasoning and Theory of Mind	Zheng Zhang et.al.	2504.18039	null
2025-04-25	Federated Client-tailored Adapter for Medical Image Segmentation	Guyue Hu et.al.	2504.18020	null
2025-04-25	Diffusion-Driven Universal Model Inversion Attack for Face Recognition	Hanrui Wang et.al.	2504.18015	null
2025-04-25	Sky-Drive: A Distributed Multi-Agent Simulation Platform for Socially-Aware and Human-AI Collaborative Future Transportation	Zilin Huang et.al.	2504.18010	null
2025-04-25	Assessing the Utility of Audio Foundation Models for Heart and Respiratory Sound Analysis	Daisuke Niizumi et.al.	2504.18004	null
2025-04-25	Self-Balancing, Memory Efficient, Dynamic Metric Space Data Maintenance, for Rapid Multi-Kernel Estimation	Aditya S Ellendula et.al.	2504.18003	null
2025-04-25	Streaming, Fast and Slow: Cognitive Load-Aware Streaming for Efficient LLM Serving	Chang Xiao et.al.	2504.17999	null
2025-04-24	Optimism, Expectation, or Sarcasm? Multi-Class Hope Speech Detection in Spanish and English	Sabur Butt et.al.	2504.17974	null
2025-04-24	LLM Agent Swarm for Hypothesis-Driven Drug Discovery	Kevin Song et.al.	2504.17967	null
2025-04-24	Evaluating Machine Expertise: How Graduate Students Develop Frameworks for Assessing GenAI Content	Celia Chen et.al.	2504.17964	null
2025-04-24	Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents	Chaoran Chen et.al.	2504.17934	null
2025-04-24	The Role of Open-Source LLMs in Shaping the Future of GeoAI	Xiao Huang et.al.	2504.17833	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null
2025-04-24	Replay to Remember: Retaining Domain Knowledge in Streaming Language Models	Sneh Pillai et.al.	2504.17780	null
2025-04-24	Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT	Anuja Tayal et.al.	2504.17753	null
2025-04-24	Towards Robust LLMs: an Adversarial Robustness Measurement Framework	Natan Levy et.al.	2504.17723	null
2025-04-24	Multilingual Performance Biases of Large Language Models in Education	Vansh Gupta et.al.	2504.17720	null
2025-04-24	PICO: Reconstructing 3D People In Contact with Objects	Alpár Cseke et.al.	2504.17695	null
2025-04-24	Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks	Haru-Tada Sato et.al.	2504.17685	null
2025-04-24	INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models	Jarne Thys et.al.	2504.17677	null
2025-04-24	Energy Considerations of Large Language Model Inference and Efficiency Optimizations	Jared Fernandez et.al.	2504.17674	null
2025-04-24	Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation	Ying Zhu et.al.	2504.17672	null
2025-04-24	DiMeR: Disentangled Mesh Reconstruction Model	Lutao Jiang et.al.	2504.17670	link
2025-04-24	Towards a HIPAA Compliant Agentic AI System in Healthcare	Subash Neupane et.al.	2504.17669	null
2025-04-24	Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics	Zena Al-Khalili et.al.	2504.17665	null
2025-04-24	Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models	Julius Vetter et.al.	2504.17660	null
2025-04-24	Likelihood-Free Variational Autoencoders	Chen Xu et.al.	2504.17622	null
2025-04-24	L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference	Qingyuan Liu et.al.	2504.17584	null
2025-04-25	DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training	Xiaoyu Tian et.al.	2504.17565	null
2025-04-24	HalluLens: LLM Hallucination Benchmark	Yejin Bang et.al.	2504.17550	null
2025-04-24	A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task	Jiaqi Deng et.al.	2504.17547	null
2025-04-24	Auditing the Ethical Logic of Generative AI Models	W. Russell Neuman et.al.	2504.17544	null
2025-04-24	Large Language Model-Driven Concolic Execution for Highly Structured Test Input Generation	Haoxin Tu et.al.	2504.17542	null
2025-04-24	Towards Machine-Generated Code for the Resolution of User Intentions	Justus Flerlage et.al.	2504.17531	link
2025-04-26	Combining GCN Structural Learning with LLM Chemical Knowledge for Enhanced Virtual Screening	Radia Berreziga et.al.	2504.17497	null
2025-04-24	Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation	Xin Yi et.al.	2504.17480	null
2025-04-24	Unveiling Hidden Vulnerabilities in Digital Human Generation via Adversarial Attacks	Zhiying Li et.al.	2504.17457	null
2025-04-24	Adaptive Orchestration of Modular Generative Information Access Systems	Mohanna Hoveyda et.al.	2504.17454	link
2025-04-24	Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs	Tiancheng Gu et.al.	2504.17432	null
2025-04-24	Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation	Guojia An et.al.	2504.17427	null
2025-04-24	Towards Leveraging Large Language Model Summaries for Topic Modeling in Source Code	Michele Carissimi et.al.	2504.17426	null
2025-04-24	Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks	Yang Liu et.al.	2504.17421	null
2025-04-24	Assessing the Capability of Large Language Models for Domain-Specific Ontology Generation	Anna Sofia Lippolis et.al.	2504.17402	null
2025-04-24	Fine-tune Smarter, Not Harder: Parameter-Efficient Fine-Tuning for Geospatial Foundation Models	Francesc Marti-Escofet et.al.	2504.17397	null
2025-04-25	On the workflow, opportunities and challenges of developing foundation model in geophysics	Hanlin Sheng et.al.	2504.17384	null
2025-04-24	On-Device Qwen2.5: Efficient LLM Inference with Model Compression and Hardware Acceleration	Maoyang Xiang et.al.	2504.17376	null
2025-04-24	LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams	Yongxuan Wu et.al.	2504.17366	link
2025-04-25	TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation	Ling You et.al.	2504.17365	null
2025-04-24	PatientDx: Merging Large Language Models for Protecting Data-Privacy in Healthcare	Jose G. Moreno et.al.	2504.17360	null
2025-04-24	Comprehend, Divide, and Conquer: Feature Subspace Exploration via Multi-Agent Hierarchical Reinforcement Learning	Weiliang Zhang et.al.	2504.17356	null
2025-04-24	DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition	Yiyan Xu et.al.	2504.17349	null
2025-04-24	TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos	Linli Yao et.al.	2504.17343	link
2025-04-24	Bridging Cognition and Emotion: Empathy-Driven Multimodal Misinformation Detection	Zihan Wang et.al.	2504.17332	null
2025-04-24	Exploring Context-aware and LLM-driven Locomotion for Immersive Virtual Reality	Süleyman Özdel et.al.	2504.17331	null
2025-04-24	Dargana: fine-tuning EarthPT for dynamic tree canopy mapping from space	Michael J. Smith et.al.	2504.17321	null
2025-04-25	Class-Conditional Distribution Balancing for Group Robust Classification	Miaoyun Zhao et.al.	2504.17314	null
2025-04-24	FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation	Yulia Otmakhova et.al.	2504.17311	null
2025-04-24	CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality	Junyan Zhang et.al.	2504.17309	null
2025-04-24	AI-Enhanced Business Process Automation: A Case Study in the Insurance Domain Using Object-Centric Process Mining	Shahrzad Khayatbashi et.al.	2504.17295	null
2025-04-24	Combining Static and Dynamic Approaches for Mining and Testing Constraints for RESTful API Testing	Hieu Huynh et.al.	2504.17287	null
2025-04-24	MV-Crafter: An Intelligent System for Music-guided Video Generation	Chuer Chen et.al.	2504.17267	null
2025-04-24	JurisCTC: Enhancing Legal Judgment Prediction via Cross-Domain Transfer and Contrastive Learning	Zhaolu Kang et.al.	2504.17264	null
2025-04-24	Symbolic Representation for Any-to-Any Generative Tasks	Jiaqi Chen et.al.	2504.17261	null
2025-04-24	Targeted AMP generation through controlled diffusion with efficient embeddings	Diogo Soares et.al.	2504.17247	null
2025-04-24	FLAG: Formal and LLM-assisted SVA Generation for Formal Specifications of On-Chip Communication Protocols	Yu-An Shih et.al.	2504.17226	null
2025-04-24	Visual and textual prompts for enhancing emotion recognition in video	Zhifeng Wang et.al.	2504.17224	null
2025-04-24	Towards Generalizable Deepfake Detection with Spatial-Frequency Collaborative Learning and Hierarchical Cross-Modal Fusion	Mengyu Qiao et.al.	2504.17223	null
2025-04-24	Does Knowledge Distillation Matter for Large Language Model based Bundle Generation?	Kaidong Feng et.al.	2504.17220	null
2025-04-24	Enhancing Variational Autoencoders with Smooth Robust Latent Encoding	Hyomin Lee et.al.	2504.17219	null
2025-04-24	Synthetic Power Flow Data Generation Using Physics-Informed Denoising Diffusion Probabilistic Models	Junfei Wang et.al.	2504.17210	null
2025-04-24	Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation	Phillip Y. Lee et.al.	2504.17207	null
2025-04-24	High-Fidelity And Complex Test Data Generation For Real-World SQL Code Generation Services	Shivasankari Kannan et.al.	2504.17203	null
2025-04-24	A RAG-Based Multi-Agent LLM System for Natural Hazard Resilience and Adaptation	Yangxinyu Xie et.al.	2504.17200	null
2025-04-24	Automatically Generating Rules of Malicious Software Packages via Large Language Model	XiangRui Zhang et.al.	2504.17198	null
2025-04-24	Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning	Minju Seo et.al.	2504.17192	link
2025-04-25	We’ll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback	Minkyu Choi et.al.	2504.17180	null
2025-04-24	A Genealogy of Multi-Sensor Foundation Models in Remote Sensing	Kevin Lane et.al.	2504.17177	null
2025-04-23	MIRAGE: A Metric-Intensive Benchmark for Retrieval-Augmented Generation Evaluation	Chanhee Park et.al.	2504.17137	null
2025-04-23	Steering the CensorShip: Uncovering Representation Vectors for LLM “Thought” Control	Hannah Cyberey et.al.	2504.17130	link
2025-04-23	Physiological neural representation for personalised tracer kinetic parameter estimation from dynamic PET	Kartikay Tehlan et.al.	2504.17122	link
2025-04-25	The Rise of Small Language Models in Healthcare: A Comprehensive Survey	Muskan Garg et.al.	2504.17119	null
2025-04-23	Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments	Yuran Li et.al.	2504.17087	null
2025-04-23	Scene-Aware Location Modeling for Data Augmentation in Automotive Object Detection	Jens Petersen et.al.	2504.17076	null
2025-04-23	Robo-Troj: Attacking LLM-based Task Planners	Mohaiminul Al Nahian et.al.	2504.17070	null
2025-04-23	Distilling semantically aware orders for autoregressive image generation	Rishav Pramanik et.al.	2504.17069	null
2025-04-23	Statistical Guarantees in Synthetic Data through Conformal Adversarial Generation	Rahul Vishwakarma et.al.	2504.17058	null
2025-04-23	Do Words Reflect Beliefs? Evaluating Belief Depth in Large Language Models	Shariar Kabir et.al.	2504.17052	null
2025-04-23	DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs	Zhenhailong Wang et.al.	2504.17040	null
2025-04-23	Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation	Luca Moroni et.al.	2504.17025	null
2025-04-23	LLM impact on BLV programming	Prashant Chandrasekar et.al.	2504.17018	null
2025-04-23	(Im)possibility of Automated Hallucination Detection in Large Language Models	Amin Karbasi et.al.	2504.17004	null
2025-04-23	Safety Pretraining: Toward the Next Generation of Safe AI	Pratyush Maini et.al.	2504.16980	null
2025-04-23	Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light	Ali Hassani et.al.	2504.16922	link
2025-04-23	IberBench: LLM Evaluation on Iberian Languages	José Ángel González et.al.	2504.16921	null
2025-04-23	DreamO: A Unified Framework for Image Customization	Chong Mou et.al.	2504.16915	null
2025-04-23	BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation	Ruotong Wang et.al.	2504.16907	null
2025-04-23	Practical approaches for crystal structure predictions with inpainting generation and universal interatomic potentials	Peichen Zhong et.al.	2504.16893	null
2025-04-23	Do Large Language Models know who did what to whom?	Joseph M. Denning et.al.	2504.16884	null
2025-04-23	Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models	Xuyang Zhu et.al.	2504.16883	null
2025-04-23	Context-Enhanced Vulnerability Detection Based on Large Language Model	Yixin Yang et.al.	2504.16877	null
2025-04-24	Exploring How LLMs Capture and Represent Domain-Specific Knowledge	Mirian Hipolito Garcia et.al.	2504.16871	null
2025-04-23	Common Functional Decompositions Can Mis-attribute Differences in Outcomes Between Populations	Manuel Quintero et.al.	2504.16864	null
2025-04-23	Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Alexander Shvets et.al.	2504.16856	null
2025-04-23	Monte Carlo Planning with Large Language Model for Text-Based Game Agents	Zijing Shi et.al.	2504.16855	null
2025-04-25	Improving Significant Wave Height Prediction Using Chronos Models	Yilin Zhai et.al.	2504.16834	null
2025-04-23	LRASGen: LLM-based RESTful API Specification Generation	Sida Deng et.al.	2504.16833	null
2025-04-23	GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning	Luu Quy Tung et.al.	2504.16832	null
2025-04-23	Decoupled Global-Local Alignment for Improving Compositional Understanding	Xiaoxing Hu et.al.	2504.16801	null
2025-04-23	MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores	Fengwei Zhou et.al.	2504.16786	null
2025-04-23	Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation	Tixiao Shan et.al.	2504.16782	null
2025-04-23	Advanced Chest X-Ray Analysis via Transformer-Based Image Descriptors and Cross-Model Attention Mechanism	Lakshita Agarwal et.al.	2504.16774	null
2025-04-23	How Effective are Generative Large Language Models in Performing Requirements Classification?	Waad Alhoshan et.al.	2504.16768	null
2025-04-23	Tri-FusionNet: Enhancing Image Description Generation with Transformer-based Fusion Network and Dual Attention Mechanism	Lakshita Agarwal et.al.	2504.16761	null
2025-04-23	Lightweight Latent Verifiers for Efficient Meta-Generation Strategies	Bartosz Piotrowski et.al.	2504.16760	null
2025-04-23	HEMA: A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations	Kwangseob Ahn et.al.	2504.16754	null
2025-04-23	Feature Mixing Approach for Detecting Intraoperative Adverse Events in Laparoscopic Roux-en-Y Gastric Bypass Surgery	Rupak Bose et.al.	2504.16749	null
2025-04-23	A Survey of AI Agent Protocols	Yingxuan Yang et.al.	2504.16736	null
2025-04-23	IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery	Aniketh Garikaparthi et.al.	2504.16728	link
2025-04-23	Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator	Chenhao Li et.al.	2504.16680	null
2025-04-23	A Post-trainer’s Guide to Multilingual Training Data: Uncovering Cross-lingual Transfer Dynamics	Luisa Shimabucoro et.al.	2504.16677	null
2025-04-23	LLMCode: Evaluating and Enhancing Researcher-AI Alignment in Qualitative Analysis	Joel Oksanen et.al.	2504.16671	null
2025-04-23	MAYA: Addressing Inconsistencies in Generative Password Guessing through a Unified Benchmark	William Corrias et.al.	2504.16651	link
2025-04-23	ParetoHqD: Fast Offline Multiobjective Alignment of Large Language Models using Pareto High-quality Data	Haoran Gu et.al.	2504.16628	null
2025-04-23	Federated EndoViT: Pretraining Vision Transformers via Federated Learning on Endoscopic Image Collections	Max Kirchner et.al.	2504.16612	null
2025-04-23	Debunking with Dialogue? Exploring AI-Generated Counterspeech to Challenge Conspiracy Theories	Mareike Lisker et.al.	2504.16604	null
2025-04-23	Comparing Large Language Models and Traditional Machine Translation Tools for Translating Medical Consultation Summaries: A Pilot Study	Andy Li et.al.	2504.16601	null
2025-04-23	Case Study: Fine-tuning Small Language Models for Accurate and Private CWE Detection in Python Code	Md. Azizul Hakim Bappy et.al.	2504.16584	null
2025-04-24	Hyper-Transforming Latent Diffusion Models	Ignacio Peis et.al.	2504.16580	null
2025-04-23	PIS: Linking Importance Sampling and Attention Mechanisms for Efficient Prompt Compression	Lizhe Chen et.al.	2504.16574	null
2025-04-23	PsyCounAssist: A Full-Cycle AI-Powered Psychological Counseling Assistant System	Xianghe Liu et.al.	2504.16573	null
2025-04-23	Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution	Junjie Chen et.al.	2504.16563	link
2025-04-23	Exploring human-SAV interaction using large language models: The impact of psychological ownership and anthropomorphism on user experience	Lirui Guo et.al.	2504.16548	null
2025-04-23	Tinkering Against Scaling	Bolun Zhang et.al.	2504.16546	null
2025-04-23	6G EdgeAI: Performance Evaluation and Analysis	Chien-Sheng Yang et.al.	2504.16529	null
2025-04-23	QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining	Fengze Liu et.al.	2504.16511	null
2025-04-23	A Comprehensive Survey of Synthetic Tabular Data Generation	Ruxue Shi et.al.	2504.16506	link
2025-04-23	TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance	Meng Chu et.al.	2504.16505	null
2025-04-23	Intelligent Depression Prevention via LLM-Based Dialogue Analysis: Overcoming the Limitations of Scale-Dependent Diagnosis through Precise Emotional Pattern Recognition	Zhenguang Zhong et.al.	2504.16504	null
2025-04-23	Amplified Vulnerabilities: Structured Jailbreak Attacks on LLM-based Multi-Agent Debate	Senmao Qi et.al.	2504.16489	null
2025-04-23	Harden and Catch for Just-in-Time Assured LLM-Based Software Testing: Open Research Challenges	Mark Harman et.al.	2504.16472	null
2025-04-23	Killing Two Birds with One Stone: Unifying Retrieval and Ranking with a Single Generative Recommendation Model	Luankang Zhang et.al.	2504.16454	null
2025-04-23	EMRModel: A Large Language Model for Extracting Medical Consultation Dialogues into Structured Medical Records	Shuguang Zhao et.al.	2504.16448	null
2025-04-23	Give LLMs a Security Course: Securing Retrieval-Augmented Code Generation via Knowledge Injection	Bo Lin et.al.	2504.16429	null
2025-04-24	Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark	Hanlei Zhang et.al.	2504.16427	link
2025-04-23	Advancing Radar Hand Gesture Recognition: A Hybrid Spectrum Synthetic Framework Merging Simulation with Neural Networks	Jiaqi Tang et.al.	2504.16423	null
2025-04-23	A Survey of Foundation Model-Powered Recommender Systems: From Feature-Based, Generative to Agentic Paradigms	Chengkai Huang et.al.	2504.16420	null
2025-04-23	Evaluating Multi-Hop Reasoning in Large Language Models: A Chemistry-Centric Case Study	Mohammad Khodadad et.al.	2504.16414	null
2025-04-23	Out-of-the-Box Conditional Text Embeddings from Large Language Models	Kosuke Yamada et.al.	2504.16411	null
2025-04-23	EEmo-Bench: A Benchmark for Multi-modal Large Language Models on Image Evoked Emotion Assessment	Lancheng Gao et.al.	2504.16405	null
2025-04-23	Study of Auto-igniting Spray Flame in Vitiated Swirling Hot Coflow using flamelet generated model	Zafar Alam et.al.	2504.16384	null
2025-04-23	SplitReason: Learning To Offload Reasoning	Yash Akhauri et.al.	2504.16379	null
2025-04-23	Text-to-TrajVis: Enabling Trajectory Data Visualizations from Natural Language Questions	Tian Bai et.al.	2504.16358	null
2025-04-23	DP2FL: Dual Prompt Personalized Federated Learning in Foundation Models	Ying Chang et.al.	2504.16357	null
2025-04-23	Transitive Array: An Efficient GEMM Accelerator with Result Reuse	Cong Guo et.al.	2504.16339	null
2025-04-23	ClarifyCoder: Clarification-Aware Fine-Tuning for Programmatic Problem Solving	Jie JW Wu et.al.	2504.16331	null
2025-04-22	Media Content Atlas: A Pipeline to Explore and Investigate Multidimensional Media Space using Multimodal LLMs	Merve Cerit et.al.	2504.16323	link
2025-04-22	SignX: The Foundation Model for Sign Recognition	Sen Fang et.al.	2504.16315	null
2025-04-22	Capturing Symmetry and Antisymmetry in Language Models through Symmetry-Aware Training Objectives	Zhangdie Yuan et.al.	2504.16312	null
2025-04-22	Improving Automated Secure Code Reviews: A Synthetic Dataset for Code Vulnerability Flaws	Leonardo Centellas-Claros et.al.	2504.16310	null
2025-04-22	The Paradox of Poetic Intent in Back-Translation: Evaluating the Quality of Large Language Models in Chinese Translation	Li Weigang et.al.	2504.16286	null
2025-04-22	Investigating LLMs in Clinical Triage: Promising Capabilities, Persistent Intersectional Biases	Joseph Lee et.al.	2504.16273	null
2025-04-22	Learning Explainable Dense Reward Shapes via Bayesian Optimization	Ryan Koo et.al.	2504.16272	null
2025-04-22	TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs	Ye Qiao et.al.	2504.16266	null
2025-04-22	Learning Energy-Based Generative Models via Potential Flow: A Variational Principle Approach to Probability Density Homotopy Matching	Junn Yong Loo et.al.	2504.16262	null
2025-04-22	FinNLI: Novel Dataset for Multi-Genre Financial Natural Language Inference Benchmarking	Jabez Magomere et.al.	2504.16188	null
2025-04-22	DATETIME: A new benchmark to measure LLM translation and reasoning capabilities	Edward Gaere et.al.	2504.16155	link
2025-04-22	Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room	Danial Hooshyar et.al.	2504.16148	null
2025-04-22	TTRL: Test-Time Reinforcement Learning	Yuxin Zuo et.al.	2504.16084	link
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	null
2025-04-22	LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities	Thomas Schmied et.al.	2504.16078	null
2025-04-22	PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models	Shi Qiu et.al.	2504.16074	null
2025-04-22	Boosting Generative Image Modeling via Joint Image-Feature Synthesis	Theodoros Kouzelis et.al.	2504.16064	null
2025-04-23	Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach	Penghui Li et.al.	2504.16057	null
2025-04-22	Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability	Daniel Hendriks et.al.	2504.16056	null
2025-04-22	Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis	Frank Li et.al.	2504.16047	null
2025-04-23	Certified Mitigation of Worst-Case LLM Copyright Infringement	Jingyu Zhang et.al.	2504.16046	null
2025-04-22	LLMs meet Federated Learning for Scalable and Secure IoT Management	Yazan Otoum et.al.	2504.16032	null
2025-04-22	LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale	Joya Chen et.al.	2504.16030	null
2025-04-22	Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3	Ahmed R. Sadik et.al.	2504.16027	null
2025-04-23	CAPO: Cost-Aware Prompt Optimization	Tom Zehle et.al.	2504.16005	link
2025-04-23	From Human Memory to AI Memory: A Survey on Memory Mechanisms in the Era of LLMs	Yaxiong Wu et.al.	2504.15965	null
2025-04-22	Deep learning of point processes for modeling high-frequency data	Yoshihiro Gyotoku et.al.	2504.15944	null
2025-04-22	FairTranslate: An English-French Dataset for Gender Bias Evaluation in Machine Translation by Overcoming Gender Binarity	Fanny Jourdan et.al.	2504.15941	link
2025-04-22	Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning	Wang Lin et.al.	2504.15932	null
2025-04-22	StreamRL: Scalable, Heterogeneous, and Elastic RL for LLMs with Disaggregated Stream Generation	Yinmin Zhong et.al.	2504.15930	null
2025-04-22	Towards Test Generation from Task Description for Mobile Testing with Multi-modal Reasoning	Hieu Huynh et.al.	2504.15917	link
2025-04-22	Automated Bug Report Prioritization in Large Open-Source Projects	Riley Pierson et.al.	2504.15912	link
2025-04-24	Synergizing RAG and Reasoning: A Systematic Review	Yunfan Gao et.al.	2504.15909	null
2025-04-23	Impact of Noise on LLM-Models Performance in Abstraction and Reasoning Corpus (ARC) Tasks with Model Temperature Considerations	Nikhil Khandalkar et.al.	2504.15903	null
2025-04-22	SARI: Structured Audio Reasoning via Curriculum-Guided Reinforcement Learning	Cheng Wen et.al.	2504.15900	null
2025-04-22	Exploring Cognitive and Aesthetic Causality for Multimodal Aspect-Based Sentiment Analysis	Luwei Xiao et.al.	2504.15848	null
2025-04-22	Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model	Junshu Pan et.al.	2504.15843	null
2025-04-22	DualOptim: Enhancing Efficacy and Stability in Machine Unlearning with Dual Optimizers	Xuyang Zhong et.al.	2504.15827	null
2025-04-22	What’s the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns	Michael A. Hedderich et.al.	2504.15815	link
2025-04-22	Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback	Ning Wang et.al.	2504.15804	null
2025-04-22	A closer look at how large language models trust humans: patterns and biases	Valeria Lerman et.al.	2504.15801	null
2025-04-23	FinDER: Financial Dataset for Question Answering and Evaluating Retrieval-Augmented Generation	Chanyeol Choi et.al.	2504.15800	null
2025-04-22	WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents	Siyu Zhou et.al.	2504.15785	link
2025-04-22	Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach	Ruizhe Li et.al.	2504.15784	null
2025-04-22	TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving	Daocheng Fu et.al.	2504.15780	null
2025-04-22	Clifford Group Equivariant Diffusion Models for 3D Molecular Generation	Cong Liu et.al.	2504.15773	null
2025-04-22	Grounded in Context: Retrieval-Based Method for Hallucination Detection	Assaf Gerner et.al.	2504.15771	null
2025-04-22	Riemannian Neural Geodesic Interpolant	Jiawen Wu et.al.	2504.15736	null
2025-04-22	BBAL: A Bidirectional Block Floating Point-Based Quantisation Accelerator for Large Language Models	Xiaomeng Han et.al.	2504.15721	null
2025-04-22	SeaLLM: Service-Aware and Latency-Optimized Resource Sharing for Large Language Model Inference	Yihao Zhao et.al.	2504.15720	null
2025-04-22	Implementing Rational Choice Functions with LLMs and Measuring their Alignment with User Preferences	Anna Karnysheva et.al.	2504.15719	null
2025-04-22	DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models	Jie Zhu et.al.	2504.15716	link
2025-04-22	Advancing Embodied Agent Security: From Safety Benchmarks to Input Moderation	Ning Wang et.al.	2504.15699	null
2025-04-22	DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining	Wei Zhuo et.al.	2504.15669	null
2025-04-22	FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning	Ju Yeon Kang et.al.	2504.15663	null
2025-04-22	VeriCoder: Enhancing LLM-Based RTL Code Generation through Functional Correctness Validation	Anjiang Wei et.al.	2504.15659	null
2025-04-22	Cost-Effective Text Clustering with Large Language Models	Hongtao Wang et.al.	2504.15640	null
2025-04-22	DR.FIX: Automatically Fixing Data Races at Industry Scale	Farnaz Behrang et.al.	2504.15637	link
2025-04-22	Exploiting Contextual Knowledge in LLMs through V-usable Information based Layer Enhancement	Xiaowei Yuan et.al.	2504.15630	null
2025-04-22	CiteFix: Enhancing RAG Accuracy Through Post-Processing Citation Correction	Harsh Maheshwari et.al.	2504.15629	null
2025-04-22	ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?	Doanh C. Bui et.al.	2504.15627	null
2025-04-22	FaceInsight: A Multimodal Large Language Model for Face Perception	Jingzhi Li et.al.	2504.15624	null
2025-04-22	Exploring the Role of Large Language Models in Cybersecurity: A Systematic Survey	Shuang Tian et.al.	2504.15622	null
2025-04-22	AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization	Jinda Lu et.al.	2504.15619	null
2025-04-23	A LoRA-Based Approach to Fine-Tuning LLMs for Educational Guidance in Resource-Constrained Settings	Md Millat Hosen et.al.	2504.15610	link
2025-04-22	Research on Navigation Methods Based on LLMs	Anlong Zhang et.al.	2504.15600	null
2025-04-22	MetaMolGen: A Neural Graph Motif Generation Model for De Novo Molecular Design	Zimo Yan et.al.	2504.15587	null
2025-04-22	A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment	Kun Wang et.al.	2504.15585	null
2025-04-22	A Large-scale Class-level Benchmark Dataset for Code Generation with LLMs	Musfiqur Rahman et.al.	2504.15564	null
2025-04-22	A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models	Gengxian Cao et.al.	2504.15552	null
2025-04-22	Do It For Me vs. Do It With Me: Investigating User Perceptions of Different Paradigms of Automation in Copilots for Feature-Rich Software	Anjali Khurana et.al.	2504.15549	null
2025-04-22	LLM-based Semantic Augmentation for Harmful Content Detection	Elyas Meguellati et.al.	2504.15548	null
2025-04-22	A Framework for Testing and Adapting REST APIs as LLM Tools	Jayachandu Bandlamudi et.al.	2504.15546	null
2025-04-22	IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property	Qiyao Wang et.al.	2504.15524	null
2025-04-22	The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks	Minghao Wu et.al.	2504.15521	null
2025-04-23	Transport f divergences	Wuchen Li et.al.	2504.15515	null
2025-04-22	SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation	Keqi Deng et.al.	2504.15509	null
2025-04-21	Application of Deep Generative Models for Anomaly Detection in Complex Financial Transactions	Tengda Tang et.al.	2504.15491	null
2025-04-21	Unifying Image Counterfactuals and Feature Attributions with Latent-Space Adversarial Attacks	Jeremy Goldwasser et.al.	2504.15479	null
2025-04-21	In-context Ranking Preference Optimization	Junda Wu et.al.	2504.15477	null
2025-04-21	From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System	Rohan Surana et.al.	2504.15476	null
2025-04-21	Speculative Sampling via Exponential Races	Szymon Kobus et.al.	2504.15475	null
2025-04-21	Agent for User: Testing Multi-User Interactive Features in TikTok	Sidong Feng et.al.	2504.15474	null
2025-04-21	Emergence and Evolution of Interpretable Concepts in Diffusion Models	Berk Tinaz et.al.	2504.15473	null
2025-04-21	LAPP: Large Language Model Feedback for Preference-Driven Reinforcement Learning	Pingcheng Jian et.al.	2504.15472	null
2025-04-21	Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images	Jonathan Brokman et.al.	2504.15470	link
2025-04-21	Improving Human-AI Coordination through Adversarial Training and Generative Models	Paresh Chaudhary et.al.	2504.15457	null
2025-04-21	Prize-Collecting Forest with Submodular Penalties: Improved Approximation	Ali Ahmadi et.al.	2504.15445	null
2025-04-21	Demand for LLMs: Descriptive Evidence on Substitution, Market Expansion, and Multihoming	Andrey Fradkin et.al.	2504.15440	null
2025-04-21	Combating Toxic Language: A Review of LLM-Based Strategies for Software Engineering	Hao Zhuo et.al.	2504.15439	null
2025-04-21	TVR: Automotive System Requirement Traceability Validation and Recovery Through Retrieval-Augmented Generation	Feifei Niu et.al.	2504.15427	null
2025-04-21	LLM-Assisted Translation of Legacy FORTRAN Codes to C++: A Cross-Platform Study	Nishath Rajiv Ranasinghe et.al.	2504.15424	null
2025-04-21	IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs	David Ma et.al.	2504.15415	link
2025-04-21	MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World	Ankit Dhiman et.al.	2504.15397	null
2025-04-21	Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection	Myrthe Reuver et.al.	2504.15392	link
2025-04-21	Solving New Tasks by Adapting Internet Video Knowledge	Calvin Luo et.al.	2504.15369	null
2025-04-21	Measuring Interest Group Positions on Legislation: An AI-Driven Analysis of Lobbying Reports	Jiseon Kim et.al.	2504.15333	link
2025-04-21	Med-CoDE: Medical Critique based Disagreement Evaluation Framework	Mohit Gupta et.al.	2504.15330	null
2025-04-21	Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs	Chun-Hsiao Yeh et.al.	2504.15280	link
2025-04-21	VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models	Weiye Xu et.al.	2504.15279	null
2025-04-21	Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning	Jie Cheng et.al.	2504.15275	link
2025-04-21	Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning	Ehsan Ahmadi et.al.	2504.15263	null
2025-04-21	CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation	Anirudh Khatry et.al.	2504.15254	link
2025-04-21	Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators	Yilun Zhou et.al.	2504.15253	link
2025-04-21	MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning	Yahan Yang et.al.	2504.15241	null
2025-04-21	A Self-Improving Coding Agent	Maxime Robeyns et.al.	2504.15228	null
2025-04-21	EvalAgent: Discovering Implicit Evaluation Criteria from the Web	Manya Wadhwa et.al.	2504.15219	null
2025-04-21	DRAGON: Distributional Rewards Optimize Diffusion Generative Models	Yatong Bai et.al.	2504.15217	null
2025-04-21	Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs	Marina Sakharova et.al.	2504.15210	null
2025-04-21	Compute-Optimal LLMs Provably Generalize Better With Scale	Marc Finzi et.al.	2504.15208	null
2025-04-21	Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges	Nandan Thakur et.al.	2504.15205	null
2025-04-22	Synergistic Weak-Strong Collaboration by Aligning Preferences	Yizhu Jiao et.al.	2504.15188	null
2025-04-21	Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform	Xianpan Zhou et.al.	2504.15182	null
2025-04-21	DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution	Miaomiao Cai et.al.	2504.15176	null
2025-04-21	The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks	Joan C. Timoneda et.al.	2504.15160	null
2025-04-21	KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking	Juyeon Kim et.al.	2504.15135	link
2025-04-21	EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models	Ziwen Xu et.al.	2504.15133	link
2025-04-21	Kuwain 1.5B: An Arabic SLM via Language Injection	Khalil Hennara et.al.	2504.15120	null
2025-04-21	Rethinking the Potential of Multimodality in Collaborative Problem Solving Diagnosis with Large Language Models	K. Wong et.al.	2504.15093	null
2025-04-21	Safety Co-Option and Compromised National Security: The Self-Fulfilling Prophecy of Weakened AI Risk Thresholds	Heidy Khlaaf et.al.	2504.15088	null
2025-04-21	Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs	Chen Xie et.al.	2504.15080	null
2025-04-21	Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL	Simone Papicchio et.al.	2504.15077	null
2025-04-21	The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models	Ronak Pradeep et.al.	2504.15068	null
2025-04-21	Testing LLMs’ Capabilities in Annotating Translations Based on an Error Typology Designed for LSP Translation: First Experiments with ChatGPT	Joachim Minder et.al.	2504.15052	null
2025-04-21	ScanEdit: Hierarchically-Guided Functional 3D Scan Editing	Mohamed el amine Boudjoghra et.al.	2504.15049	null
2025-04-21	RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search	Quy-Anh Dang et.al.	2504.15047	link
2025-04-21	A Call for New Recipes to Enhance Spatial Reasoning in MLLMs	Huanyu Zhang et.al.	2504.15037	null
2025-04-21	SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation	Yue Li et.al.	2504.15035	null
2025-04-21	DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation	Weijie He et.al.	2504.15032	null
2025-04-21	DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models	Chengyu Wang et.al.	2504.15027	null
2025-04-21	Stay Hungry, Stay Foolish: On the Extended Reading Articles Generation with LLMs	Yow-Fu Liou et.al.	2504.15013	null
2025-04-21	Efficient Pretraining Length Scaling	Bohong Wu et.al.	2504.14992	null
2025-04-21	aiXamine: LLM Safety and Security Simplified	Fatih Deniz et.al.	2504.14985	null
2025-04-21	RealisDance-DiT: Simple yet Strong Baseline towards Controllable Character Animation in the Wild	Jingkai Zhou et.al.	2504.14977	null
2025-04-21	Evaluating LLMs on Chinese Topic Constructions: A Research Proposal Inspired by Tian et al. (2024)	Xiaodong Yang et.al.	2504.14969	null
2025-04-21	SLO-Aware Scheduling for Large Language Model Inferences	Jinqi Huang et.al.	2504.14966	null
2025-04-21	Evaluating Code Generation of LLMs in Advanced Computer Science Problems	Emir Catir et.al.	2504.14964	null
2025-04-21	Efficient Document Retrieval with G-Retriever	Manthankumar Solanki et.al.	2504.14955	link
2025-04-21	Generative Semantic Communications: Principles and Practices	Xiaojun Yuan et.al.	2504.14947	null
2025-04-22	WindVE: Collaborative CPU-NPU Vector Embedding	Jinqi Huang et.al.	2504.14941	null
2025-04-21	TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models	Mazharul Islam Rakib et.al.	2504.14933	null
2025-04-21	EducationQ: Evaluating LLMs’ Teaching Capabilities Through Multi-Agent Dialogue Framework	Yao Shi et.al.	2504.14928	null
2025-04-21	POLYRAG: Integrating Polyviews into Retrieval-Augmented Generation for Medical Applications	Chunjing Gan et.al.	2504.14917	null
2025-04-21	StableQuant: Layer Adaptive Post-Training Quantization for Speech Foundation Models	Yeona Hong et.al.	2504.14915	null
2025-04-21	CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs	Yingming Zheng et.al.	2504.14905	link
2025-04-21	Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation	Chenjie Cao et.al.	2504.14899	link
2025-04-21	Expected Free Energy-based Planning as Variational Inference	Bert de Vries et.al.	2504.14898	null
2025-04-21	Hardware-based Heterogeneous Memory Management for Large Language Model Inference	Soojin Hwang et.al.	2504.14893	null
2025-04-21	Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey	Aoran Gan et.al.	2504.14891	null
2025-04-21	Latent Bayesian Optimization via Autoregressive Normalizing Flows	Seunghun Lee et.al.	2504.14889	null
2025-04-21	Efficient Function Orchestration for Large Language Models	Xiaoxia Liu et.al.	2504.14872	null
2025-04-21	Natural Fingerprints of Large Language Models	Teppei Suzuki et.al.	2504.14871	null
2025-04-21	OTC: Optimal Tool Calls via Reinforcement Learning	Hongru Wang et.al.	2504.14870	null
2025-04-21	Transparentize the Internal and External Knowledge Utilization in LLMs with Trustworthy Citation	Jiajun Shen et.al.	2504.14856	null
2025-04-21	Uncertainty quantification of neural network models of evolving processes via Langevin sampling	Cosmin Safta et.al.	2504.14854	null
2025-04-21	APIRAT: Integrating Multi-source API Knowledge for Enhanced Code Translation with LLMs	Chaofan Wang et.al.	2504.14852	link
2025-04-21	Language Models for Materials Discovery and Sustainability: Progress, Challenges, and Opportunities	Zongrui Pei et.al.	2504.14849	null
2025-04-21	Enhancing the Patent Matching Capability of Large Language Models via the Memory Graph	Qiushi Xiong et.al.	2504.14845	link
2025-04-21	Establishing Reliability Metrics for Reward Models in Large Language Models	Yizhou Chen et.al.	2504.14838	null
2025-04-21	SQL-Factory: A Multi-Agent Framework for High-Quality and Large-Scale SQL Generation	Jiahui Li et.al.	2504.14837	link
2025-04-21	Protecting Your Voice: Temporal-aware Robust Watermarking	Yue Li et.al.	2504.14832	null
2025-04-21	Completing A Systematic Review in Hours instead of Months with Interactive AI Agents	Rui Qiu et.al.	2504.14822	link
2025-04-21	DONOD: Robust and Generalizable Instruction Fine-Tuning for LLMs via Model-Intrinsic Dataset Pruning	Jucheng Hu et.al.	2504.14810	null
2025-04-21	On Self-improving Token Embeddings	Mario M. Kubek et.al.	2504.14808	null
2025-04-21	Automatic Evaluation Metrics for Document-level Translation: Overview, Challenges and Trends	Jiaxin GUO et.al.	2504.14804	null
2025-04-21	Automated Duplicate Bug Report Detection in Large Open Bug Repositories	Clare E. Laney et.al.	2504.14797	null
2025-04-21	Enhanced Data-driven Topology Design Methodology with Multi-level Mesh and Correlation-based Mutation for Stress-related Multi-objective Optimization	Jun Yang et.al.	2504.14790	null
2025-04-21	The 1st EReL@MIR Workshop on Efficient Representation Learning for Multimodal Information Retrieval	Junchen Fu et.al.	2504.14788	null
2025-04-21	gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling	Tianyu Guo et.al.	2504.14775	link
2025-04-20	Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions	Luyang Fang et.al.	2504.14772	null
2025-04-20	The Memorization Problem: Can We Trust LLMs’ Economic Forecasts?	Alejandro Lopez-Lira et.al.	2504.14765	null
2025-04-20	Steering Semantic Data Processing With DocWrangler	Shreya Shankar et.al.	2504.14764	null
2025-04-20	SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs	Minh V. T. Pham et.al.	2504.14757	null
2025-04-20	PROMPTEVALS: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines	Reya Vir et.al.	2504.14738	null
2025-04-20	Pairwise or Pointwise? Evaluating Feedback Protocols for Bias in LLM-Based Evaluation	Tuhina Tripathi et.al.	2504.14716	null
2025-04-22	AI with Emotions: Exploring Emotional Expressions in Large Language Models	Shin-nosuke Ishikawa et.al.	2504.14706	null
2025-04-20	Video-MMLU: A Massive Multi-Discipline Lecture Understanding Benchmark	Enxin Song et.al.	2504.14693	link
2025-04-20	FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models	Mehrnoush Shamsfard et.al.	2504.14690	null
2025-04-20	Evaluating Temporal Plasticity in Foundation Time Series Models for Incremental Fine-tuning	Jia Liu et.al.	2504.14677	null
2025-04-20	Trans-Zero: Self-Play Incentivizes Large Language Models for Multilingual Translation Without Parallel Data	Wei Zou et.al.	2504.14669	link
2025-04-20	Efficient Federated Split Learning for Large Language Models over Communication Networks	Kai Zhao et.al.	2504.14667	null
2025-04-20	Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens	Kaihang Pan et.al.	2504.14666	null
2025-04-20	A Case Study Exploring the Current Landscape of Synthetic Medical Record Generation with Commercial LLMs	Yihan Lin et.al.	2504.14657	null
2025-04-20	LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs	Yunhui Xia et.al.	2504.14655	null
2025-04-20	A Framework for Benchmarking and Aligning Task-Planning Safety in LLM-Based Embodied Agents	Yuting Huang et.al.	2504.14650	null
2025-04-20	Relation-R1: Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relational Comprehension	Lin Li et.al.	2504.14642	null
2025-04-20	HLSTester: Efficient Testing of Behavioral Discrepancies with LLMs for High-Level Synthesis	Kangwei Xu et.al.	2504.14641	null
2025-04-20	Risk Assessment Framework for Code LLMs via Leveraging Internal States	Yuheng Huang et.al.	2504.14640	null
2025-04-20	Harnessing Generative LLMs for Enhanced Financial Event Entity Extraction Performance	Soo-joon Choi et.al.	2504.14633	null
2025-04-20	Towards Optimal Circuit Generation: Multi-Agent Collaboration Meets Collective Intelligence	Haiyan Qin et.al.	2504.14625	link
2025-04-20	A Hierarchical Framework for Measuring Scientific Paper Innovation via Large Language Models	Hongming Tan et.al.	2504.14620	null
2025-04-20	Translation Analytics for Freelancers: I. Introduction, Data Preparation, Baseline Evaluations	Yuri Balashov et.al.	2504.14619	null
2025-04-20	UFO2: The Desktop AgentOS	Chaoyun Zhang et.al.	2504.14603	link
2025-04-20	a1: Steep Test-time Scaling Law via Environment Augmented Generation	Lingrui Mei et.al.	2504.14597	null
2025-04-20	HealthGenie: Empowering Users with Healthy Dietary Guidance through Knowledge Graph and Large Language Models	Fan Gao et.al.	2504.14594	null
2025-04-20	Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction	Wenke Xia et.al.	2504.14588	link
2025-04-20	Using street view imagery and deep generative modeling for estimating the health of urban forests	Akshit Gupta et.al.	2504.14583	null
2025-04-20	Prompt-Hacking: The New p-Hacking?	Thomas Kosch et.al.	2504.14571	null
2025-04-18	Generative AI Act II: Test Time Scaling Drives Cognition Engineering	Shijie Xia et.al.	2504.13828	link
2025-04-18	Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models	Junjie Yang et.al.	2504.13825	null
2025-04-18	CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning	Yang Yue et.al.	2504.13820	link
2025-04-18	Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning	Yixuan Even Xu et.al.	2504.13818	null
2025-04-18	BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models	Zhengxian Wu et.al.	2504.13775	null
2025-04-18	DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs	Tamim Al Mahmud et.al.	2504.13774	link
2025-04-18	Detecting Malicious Source Code in PyPI Packages with LLMs: Does RAG Come in Handy?	Motunrayo Ibiyo et.al.	2504.13769	null
2025-04-18	Scaling sparse feature circuit finding for in-context learning	Dmitrii Kharlapenko et.al.	2504.13756	null
2025-04-18	ESPLoRA: Enhanced Spatial Precision with Low-Rank Adaption in Text-to-Image Diffusion Models for High-Definition Synthesis	Andrea Rigo et.al.	2504.13745	null
2025-04-18	Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence	Paul K. Mandal et.al.	2504.13730	link
2025-04-18	MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection	Lin Yuan et.al.	2504.13726	null
2025-04-18	OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation	Yichen Wu et.al.	2504.13707	null
2025-04-18	Exploring Multimodal Prompt for Visualization Authoring with Large Language Models	Zhen Wen et.al.	2504.13700	null
2025-04-17	Deep literature reviews: an application of fine-tuned language models to migration research	Stefano M. Iacus et.al.	2504.13685	null
2025-04-18	Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation	Xiangrong et.al.	2504.13684	null
2025-04-18	Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm	Russell Beale et.al.	2504.13667	null
2025-04-18	Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code	Antonio Della Porta et.al.	2504.13656	null
2025-04-18	Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs	Gabriel Freedman et.al.	2504.13644	link
2025-04-18	Divergent LLM Adoption and Heterogeneous Convergence Paths in Research Writing	Cong William Lin et.al.	2504.13629	null
2025-04-18	PV-VLM: A Multimodal Vision-Language Approach Incorporating Sky Images for Intra-Hour Photovoltaic Power Forecasting	Huapeng Lin et.al.	2504.13624	null
2025-04-18	Compile Scene Graphs with Reinforcement Learning	Zuyao Chen et.al.	2504.13617	link
2025-04-18	Long-context Non-factoid Question Answering in Indic Languages	Ritwik Mishra et.al.	2504.13615	link
2025-04-18	Continual Pre-Training is (not) What You Need in Domain Adaption	Pin-Er Chen et.al.	2504.13603	null
2025-04-18	HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering	Alexander Rusnak et.al.	2504.13590	null
2025-04-18	Towards End-to-End Network Intent Management with Large Language Models	Lam Dinh et.al.	2504.13589	null
2025-04-18	RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines	Quentin Romero Lauro et.al.	2504.13587	null
2025-04-18	Contextualizing Spotify’s Audiobook List Recommendations with Descriptive Shelves	Gustavo Penha et.al.	2504.13572	null
2025-04-18	DETAM: Defending LLMs Against Jailbreak Attacks via Targeted Attention Modification	Yu Li et.al.	2504.13562	null
2025-04-18	Zero-Shot Industrial Anomaly Segmentation with Image-Aware Prompt Generation	SoYoung Park et.al.	2504.13560	link
2025-04-18	Integrating LLMs for Grading and Appeal Resolution in Computer Science Education	I. Aytutuldu et.al.	2504.13557	null
2025-04-18	MusFlow: Multimodal Music Generation via Conditional Flow Matching	Jiahao Song et.al.	2504.13535	null
2025-04-18	CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models	Feiyang Li et.al.	2504.13534	null
2025-04-18	Designing a reliable lateral movement detector using a graph foundation model	Corentin Larroche et.al.	2504.13527	null
2025-04-18	Large Language Models for Validating Network Protocol Parsers	Mingwei Zheng et.al.	2504.13515	link
2025-04-18	Prejudge-Before-Think: Enhancing Large Language Models at Test-Time by Process Prejudge Reasoning	Jianing Wang et.al.	2504.13500	link
2025-04-18	U-Shape Mamba: State Space Model for faster diffusion	Alex Ergasti et.al.	2504.13499	link
2025-04-18	Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing	Joowon Kim et.al.	2504.13490	null
2025-04-18	LLM Sensitivity Evaluation Framework for Clinical Diagnosis	Chenwei Yan et.al.	2504.13475	null
2025-04-18	Everything You Wanted to Know About LLM-based Vulnerability Detection But Were Afraid to Ask	Yue Li et.al.	2504.13474	null
2025-04-18	CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation	Xinchen Wang et.al.	2504.13472	null
2025-04-18	From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs	Jiliang Ni et.al.	2504.13471	null
2025-04-18	Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization	Hongwei Ji et.al.	2504.13460	null
2025-04-18	SatelliteCalculator: A Multi-Task Vision Foundation Model for Quantitative Remote Sensing Inversion	Zhenyu Yu et.al.	2504.13442	null
2025-04-18	D-GEN: Automatic Distractor Generation and Evaluation for Reliable Assessment of Generative Model	Grace Byun et.al.	2504.13439	null
2025-04-18	Secure Multifaceted-RAG for Enterprise: Hybrid Knowledge Retrieval with Security Filtering	Grace Byun et.al.	2504.13425	null
2025-04-18	Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction	Wenyu Li et.al.	2504.13419	null
2025-04-18	STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings	Saksham Rastogi et.al.	2504.13416	link
2025-04-18	CytoFM: The first cytology foundation model	Vedrana Ivezić et.al.	2504.13402	null
2025-04-18	Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving Safety	Shashank Shriram et.al.	2504.13399	link
2025-04-18	POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation	Evans Xu Han et.al.	2504.13392	null
2025-04-17	SMPL-GPTexture: Dual-View 3D Human Texture Estimation using Text-to-Image Generation Models	Mingxiao Tu et.al.	2504.13378	null
2025-04-17	On the minimax optimality of Flow Matching through the connection to kernel density estimation	Lea Kunkel et.al.	2504.13336	null
2025-04-17	Predicting Forced Responses of Probability Distributions via the Fluctuation-Dissipation Theorem and Generative Modeling	Ludovico T. Giorgini et.al.	2504.13333	null
2025-04-17	Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes	Andreas Lau Hansen et.al.	2504.13297	link
2025-04-17	LIFT+: Lightweight Fine-Tuning for Long-Tail Learning	Jiang-Xin Shi et.al.	2504.13282	link
2025-04-17	Using LLMs for Library Migration	Md Mohayeminul Islam et.al.	2504.13272	null
2025-04-17	Causal-Copilot: An Autonomous Causal Analysis Agent	Xinyue Wang et.al.	2504.13263	null
2025-04-17	CPG-EVAL: A Multi-Tiered Benchmark for Evaluating the Chinese Pedagogical Grammar Competence of Large Language Models	Dong Wang et.al.	2504.13261	null
2025-04-17	ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs	Yan Yang et.al.	2504.13237	null
2025-04-17	NNTile: a machine learning framework capable of training extremely large GPT language models on a single node	Aleksandr Mikhalev et.al.	2504.13236	null
2025-04-17	Auto-FEDUS: Autoregressive Generative Modeling of Doppler Ultrasound Signals from Fetal Electrocardiograms	Alireza Rafiei et.al.	2504.13233	null
2025-04-17	Aligning Constraint Generation with Design Intent in Parametric CAD	Evan Casey et.al.	2504.13178	null
2025-04-17	It’s All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization	Ali Behrouz et.al.	2504.13173	null
2025-04-17	SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs	Haoxuan Li et.al.	2504.13172	null
2025-04-17	Sleep-time Compute: Beyond Inference Scaling at Test-time	Kevin Lin et.al.	2504.13171	link
2025-04-17	Digital Twin Generation from Visual Data: A Survey	Andrew Melnik et.al.	2504.13159	link
2025-04-18	Exploring Expert Failures Improves LLM Agent Tuning	Li-Cheng Lan et.al.	2504.13145	null
2025-04-18	Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo	João Loula et.al.	2504.13139	null
2025-04-17	Energy-Based Reward Models for Robust Language Model Alignment	Anamika Lochab et.al.	2504.13134	link
2025-04-17	Science-T2I: Addressing Scientific Illusions in Image Synthesis	Jialuo Li et.al.	2504.13129	null
2025-04-17	LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard	Varun Rao et.al.	2504.13125	null
2025-04-17	Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training	Xinsong Zhang et.al.	2504.13123	null
2025-04-17	VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models	Haojian Huang et.al.	2504.13122	link
2025-04-17	Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification	Kumar Manas et.al.	2504.13111	link
2025-04-17	UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models	Guanlong Jiao et.al.	2504.13109	null
2025-04-17	EventVAD: Training-Free Event-Aware Video Anomaly Detection	Yihua Shao et.al.	2504.13092	null
2025-04-17	Retrieval-Augmented Generation with Conflicting Evidence	Han Wang et.al.	2504.13079	link
2025-04-17	An All-Atom Generative Model for Designing Protein Complexes	Ruizhe Chen et.al.	2504.13075	link
2025-04-18	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-17	Early Accessibility: Automating Alt-Text Generation for UI Icons During App Development	Sabrina Haque et.al.	2504.13069	null
2025-04-17	Accuracy is Not Agreement: Expert-Aligned Evaluation of Crash Narrative Classification Models	Sudesh Ramesh Bhagat et.al.	2504.13068	null
2025-04-17	ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models	Linkang Du et.al.	2504.13061	link
2025-04-17	RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins	Yao Mu et.al.	2504.13059	null
2025-04-17	Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation	Yichao Feng et.al.	2504.13054	null
2025-04-17	GraphAttack: Exploiting Representational Blindspots in LLM Safety Mechanisms	Sinan He et.al.	2504.13052	null
2025-04-17	Multi-modal single-cell foundation models via dynamic token adaptation	Wenmin Zhao et.al.	2504.13049	null
2025-04-17	Design Topological Materials by Reinforcement Fine-Tuned Generative Model	Haosheng Xu et.al.	2504.13048	null
2025-04-17	How Large Language Models Are Changing MOOC Essay Answers: A Comparison of Pre- and Post-LLM Responses	Leo Leppänen et.al.	2504.13038	null
2025-04-18	Towards Cardiac MRI Foundation Models: Comprehensive Visual-Tabular Representations for Whole-Heart Assessment and Beyond	Yundi Zhang et.al.	2504.13037	link
2025-04-17	InstructRAG: Leveraging Retrieval-Augmented Generation on Instruction Graphs for LLM-Based Task Planning	Zheng Wang et.al.	2504.13032	null
2025-04-17	ChatEXAONEPath: An Expert-level Multimodal Large Language Model for Histopathology Using Whole Slide Images	Sangwook Kim et.al.	2504.13023	null
2025-04-17	SHA256 at SemEval-2025 Task 4: Selective Amnesia – Constrained Unlearning for Large Language Models via Knowledge Isolation	Saransh Agrawal et.al.	2504.12996	link
2025-04-17	Chain-of-Thought Prompting for Out-of-Distribution Samples: A Latent-Variable Study	Yu Wang et.al.	2504.12991	link
2025-04-17	A Virtual Machine for Arbitrary Low-Precision GPGPU Computation in LLM Serving	Yaoyao Ding et.al.	2504.12984	null
2025-04-17	Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild	Jiatai Wang et.al.	2504.12982	null
2025-04-17	Sparks of Science: Hypothesis Generation Using Structured Paper Data	Charles O’Neill et.al.	2504.12976	null
2025-04-17	QLLM: Do We Really Need a Mixing Network for Credit Assignment in Multi-Agent Reinforcement Learning?	Zhouyang Jiang et.al.	2504.12961	null
2025-04-17	Are Retrials All You Need? Enhancing Large Language Model Reasoning Without Verbalized Feedback	Nearchos Potamitis et.al.	2504.12951	null
2025-04-18	Customizing Emotional Support: How Do Individuals Construct and Interact With LLM-Powered Chatbots	Xi Zheng et.al.	2504.12943	null
2025-04-17	Explainable AI in Usable Privacy and Security: Challenges and Opportunities	Vincent Freiberger et.al.	2504.12931	null
2025-04-17	ConExion: Concept Extraction with Large Language Models	Ebrahim Norouzi et.al.	2504.12915	link
2025-04-17	MAIN: Mutual Alignment Is Necessary for instruction tuning	Fanyi Yang et.al.	2504.12913	null
2025-04-17	Benchmarking Multi-National Value Alignment for Large Language Models	Chengyi Ju et.al.	2504.12911	null
2025-04-17	FashionDPO: Fine-tune Fashion Outfit Generation Model using Direct Preference Optimization	Mingzhe Yu et.al.	2504.12900	link
2025-04-17	Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models	Zhouhao Sun et.al.	2504.12898	null
2025-04-18	EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting	Guanrou Yang et.al.	2504.12867	null
2025-04-17	DashChat: Interactive Authoring of Industrial Dashboard Design Prototypes through Conversation with LLM-Powered Agents	S. Shen et.al.	2504.12865	null
2025-04-17	Enhancing Decentralization in Blockchain Decision-Making Through Quadratic Voting and Its Generalization	Lyudmila Kovalchuk et.al.	2504.12859	null
2025-04-17	3D-PNAS: 3D Industrial Surface Anomaly Synthesis with Perlin Noise	Yifeng Cheng et.al.	2504.12856	null
2025-04-17	Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval and haystacks	Amey Hengle et.al.	2504.12845	link
2025-04-17	TwoSquared: 4D Generation from 2D Image Pairs	Lu Sang et.al.	2504.12825	null
2025-04-17	Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation	Takaya Arita et.al.	2504.12805	null
2025-04-17	EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery	Wei Zhang et.al.	2504.12795	null
2025-04-17	Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration	Yicheng Pan et.al.	2504.12773	link
2025-04-17	GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks	Hao Xu et.al.	2504.12764	link
2025-04-17	Trajectory Adaptation using Large Language Models	Anurag Maurya et.al.	2504.12755	null
2025-04-17	Stronger, Steadier & Superior: Geometric Consistency in Depth VFM Forges Domain Generalized Semantic Segmentation	Siyu Chen et.al.	2504.12753	link
2025-04-17	Pandora: A Code-Driven Large Language Model Agent for Unified Reasoning Across Diverse Structured Knowledge	Yongrui Chen et.al.	2504.12734	null
2025-04-17	Validating LLM-Generated Relevance Labels for Educational Resource Search	Ratan J. Sebastian et.al.	2504.12732	null
2025-04-17	SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation	Nicolas Bougie et.al.	2504.12722	null
2025-04-17	Post-pre-training for Modality Alignment in Vision-Language Foundation Models	Shin’ya Yamaguchi et.al.	2504.12717	link
2025-04-17	SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding	Qianqian Sun et.al.	2504.12704	null
2025-04-17	Collaborative Perception Datasets for Autonomous Driving: A Review	Naibang Wang et.al.	2504.12696	link
2025-04-17	Why and How LLMs Hallucinate: Connecting the Dots with Subsequence Associations	Yiyou Sun et.al.	2504.12691	link
2025-04-17	Data-efficient LLM Fine-tuning for Code Generation	Weijie Lv et.al.	2504.12687	link
2025-04-17	SOPHY: Generating Simulation-Ready Objects with Physical Materials	Junyi Cao et.al.	2504.12684	null
2025-04-17	GRAIL: Gradient-Based Adaptive Unlearning for Privacy and Copyright in LLMs	Kun-Woo Kim et.al.	2504.12681	null
2025-04-17	Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning	Baining Zhao et.al.	2504.12680	link
2025-04-17	Persona-judge: Personalized Alignment of Large Language Models via Token-level Self-judgment	Xiaotian Zhang et.al.	2504.12663	null
2025-04-17	Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation	Linda He et.al.	2504.12637	null
2025-04-17	Towards Characterizing Subjectivity of Individuals through Modeling Value Conflicts and Trade-offs	Younghun Lee et.al.	2504.12633	null
2025-04-17	SAM-Based Building Change Detection with Distribution-Aware Fourier Adaptation and Edge-Constrained Warping	Yun-Cheng Li et.al.	2504.12619	null
2025-04-17	Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation	Mingwei Liu et.al.	2504.12608	null
2025-04-17	GeoSense: Evaluating Identification and Application of Geometric Principles in Multimodal Reasoning	Liangyu Xu et.al.	2504.12597	null
2025-04-17	Identifying and Mitigating the Influence of the Prior Distribution in Large Language Models	Liyi Zhang et.al.	2504.12585	link
2025-04-17	Provable Secure Steganography Based on Adaptive Dynamic Sampling	Kaiyi Pang et.al.	2504.12579	null
2025-04-17	Prompt-Driven and Training-Free Forgetting Approach and Dataset for Large Language Models	Zhenyu Yu et.al.	2504.12574	null
2025-04-17	ZeroSumEval: Scaling LLM Evaluation with Inter-Model Competition	Haidar Khan et.al.	2504.12562	link
2025-04-17	CDF-RAG: Causal Dynamic Feedback for Adaptive Retrieval-Augmented Generation	Elahe Khatibi et.al.	2504.12560	link
2025-04-17	Benchmarking LLM-based Relevance Judgment Methods	Negar Arabzadeh et.al.	2504.12558	link
2025-04-17	ELAB: Extensive LLM Alignment Benchmark in Persian Language	Zahra Pourbahman et.al.	2504.12553	null
2025-04-17	Privacy-Preserving Operating Room Workflow Analysis using Digital Twins	Alejandra Perez et.al.	2504.12552	null
2025-04-17	Knowledge Acquisition on Mass-shooting Events via LLMs for AI-Driven Justice	Benign John Ihugba et.al.	2504.12545	null
2025-04-16	Memorization vs. Reasoning: Updating LLMs with New Knowledge	Aochong Oliver Li et.al.	2504.12523	null
2025-04-16	Evaluating the Diversity and Quality of LLM Generated Content	Alexander Shypula et.al.	2504.12522	null
2025-04-16	Multimodal LLM Augmented Reasoning for Interpretable Visual Perception Analysis	Shravan Chaudhari et.al.	2504.12511	null
2025-04-16	Towards Conversational AI for Human-Machine Collaborative MLOps	George Fatouros et.al.	2504.12477	null
2025-04-16	Integrating Structural and Semantic Signals in Text-Attributed Graphs with BiGTex	Azadeh Beiranvand et.al.	2504.12474	link
2025-04-16	You Don’t Need All Attentions: Distributed Dynamic Fine-Tuning for Foundation Models	Shiwei Ding et.al.	2504.12471	null
2025-04-16	SLURG: Investigating the Feasibility of Generating Synthetic Online Fallacious Discourse	Cal Blanco et.al.	2504.12466	null
2025-04-16	PlanGlow: Personalized Study Planning with an Explainable and Controllable LLM-Driven System	Jiwon Chun et.al.	2504.12452	link
2025-04-16	Position: The Most Expensive Part of an LLM should be its Training Data	Nikhil Kandpal et.al.	2504.12427	null
2025-04-16	Don’t Just Translate, Agitate: Using Large Language Models as Devil’s Advocates for AI Explanations	Ashley Suh et.al.	2504.12424	null
2025-04-16	Mitigating LLM Hallucinations with Knowledge Graphs: A Case Study	Harry Li et.al.	2504.12422	null
2025-04-16	A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment	Negar Arabzadeh et.al.	2504.12408	link
2025-04-16	Activated LoRA: Fine-tuned LLMs for Intrinsics	Kristjan Greenewald et.al.	2504.12397	link
2025-04-16	BitNet b1.58 2B4T Technical Report	Shuming Ma et.al.	2504.12285	null
2025-04-16	HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks	Stefan Abi-Karam et.al.	2504.12268	link
2025-04-16	VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate	Zhihang Yuan et.al.	2504.12259	link
2025-04-16	FLIP Reasoning Challenge	Andreas Plesner et.al.	2504.12256	link
2025-04-16	AnomalyGen: An Automated Semantic Log Sequence Generation Framework with LLM for Anomaly Detection	Xinyu Li et.al.	2504.12250	null
2025-04-16	MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models	Hang Yuan et.al.	2504.12234	null
2025-04-16	Watermarking Needs Input Repetition Masking	David Khachaturov et.al.	2504.12229	null
2025-04-16	Coding-Prior Guided Diffusion Network for Video Deblurring	Yike Liu et.al.	2504.12222	null
2025-04-16	d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning	Siyan Zhao et.al.	2504.12216	null
2025-04-16	What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure	Céline Budding et.al.	2504.12187	null
2025-04-16	SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data	Suyoung Bae et.al.	2504.12185	null
2025-04-16	Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification	Jaime E. Cuellar et.al.	2504.12180	null
2025-04-16	Deep Generative Models for Bayesian Inference on High-Rate Sensor Data: Applications in Automotive Radar and Medical Imaging	Tristan S. W. Stevens et.al.	2504.12154	null
2025-04-16	Multilingual Contextualization of Large Language Models for Document-Level Machine Translation	Miguel Moura Ramos et.al.	2504.12140	null
2025-04-16	Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation	Anfu Tang et.al.	2504.12113	null
2025-04-16	Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Shizhan Cai et.al.	2504.12108	null
2025-04-16	Gauging Overprecision in LLMs: An Empirical Study	Adil Bahaj et.al.	2504.12098	null
2025-04-16	Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework	Jack Preuveneers et.al.	2504.12090	null
2025-04-16	Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection	Yumin Kim et.al.	2504.12082	null
2025-04-16	Subitizing-Inspired Large Language Models for Floorplanning	Shao-Chien Lu et.al.	2504.12076	null
2025-04-16	Generative Deep Learning Framework for Inverse Design of Fuels	Kiran K. Yalamanchi et.al.	2504.12075	null
2025-04-16	Optimizing Compound Retrieval Systems	Harrie Oosterhuis et.al.	2504.12063	null
2025-04-16	Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM	Zirui Pan et.al.	2504.12048	null
2025-04-16	Instruction-augmented Multimodal Alignment for Image-Text and Element Matching	Xinli Yue et.al.	2504.12018	null
2025-04-16	Purposefully Induced Psychosis (PIP): Embracing Hallucination as Imagination in Large Language Models	Kris Pilcher et.al.	2504.12012	null
2025-04-16	Generative Recommendation with Continuous-Token Diffusion	Haohao Qu et.al.	2504.12007	null
2025-04-16	A Complex-valued SAR Foundation Model Based on Physically Inspired Representation Learning	Mengyu Wang et.al.	2504.11999	null
2025-04-16	Language Models as Quasi-Crystalline Thought: Structure, Constraint, and Emergence in Generative Systems	Jose Manuel Guevara-Vela et.al.	2504.11986	null
2025-04-16	SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration Mistakes	Raúl Vázquez et.al.	2504.11975	null
2025-04-16	LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA	Xanh Ho et.al.	2504.11972	link
2025-04-16	Mind2Matter: Creating 3D Models from EEG Signals	Xia Deng et.al.	2504.11936	link
2025-04-16	An LLM-as-a-judge Approach for Scalable Gender-Neutral Translation Evaluation	Andrea Piergentili et.al.	2504.11934	null
2025-04-16	Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading	Qianjin Yu et.al.	2504.11919	null
2025-04-16	AnomalyR1: A GRPO-based End-to-end MLLM for Industrial Anomaly Detection	Yuhao Chao et.al.	2504.11914	null
2025-04-16	Finding Flawed Fictions: Evaluating Complex Reasoning in Language Models via Plot Hole Detection	Kabir Ahuja et.al.	2504.11900	null
2025-04-16	Search is All You Need for Few-shot Anomaly Detection	Qishan Wang et.al.	2504.11895	link
2025-04-16	Rethinking LLM-Based Recommendations: A Query Generation-Based, Training-Free Approach	Donghee Han et.al.	2504.11889	null
2025-04-16	Boosting Multi-View Stereo with Depth Foundation Model in the Absence of Real-World Labels	Jie Zhu et.al.	2504.11845	null
2025-04-16	Evaluating the Goal-Directedness of Large Language Models	Tom Everitt et.al.	2504.11844	link
2025-04-16	FiSMiness: A Finite State Machine Based Paradigm for Emotional Support Conversations	Yue Zhao et.al.	2504.11837	null
2025-04-16	Could Thinking Multilingually Empower LLM Reasoning?	Changjiang Gao et.al.	2504.11833	link
2025-04-16	Déjà Vu: Multilingual LLM Evaluation through the Lens of Machine Translation Evaluation	Julia Kreutzer et.al.	2504.11829	null
2025-04-16	Towards Forceful Robotic Foundation Models: a Literature Survey	William Xie et.al.	2504.11827	null
2025-04-16	Real-World Depth Recovery via Structure Uncertainty Modeling and Inaccurate GT Depth Fitting	Delong Suzhang et.al.	2504.11820	null
2025-04-16	Efficient and Adaptive Simultaneous Speech Translation with Fully Unidirectional Architecture	Biao Fu et.al.	2504.11809	null
2025-04-16	Résumé abstractif à partir d’une transcription audio	Ilia Derkach et.al.	2504.11803	null
2025-04-17	Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification	Yue Li et.al.	2504.11793	null
2025-04-16	Large Language Models for Drug Overdose Prediction from Longitudinal Medical Records	Md Sultan Al Nahian et.al.	2504.11792	null
2025-04-16	Enhancing Web Agents with Explicit Rollback Mechanisms	Zhisong Zhang et.al.	2504.11788	null
2025-04-16	The Digital Cybersecurity Expert: How Far Have We Come?	Dawei Wang et.al.	2504.11783	link
2025-04-16	Bridging the Semantic Gaps: Improving Medical VQA Consistency with LLM-Augmented Question Sets	Yongpei Ma et.al.	2504.11777	null
2025-04-16	Shared Disk KV Cache Management for Efficient Multi-Instance Inference in RAG-Powered LLMs	Hyungwoo Lee et.al.	2504.11765	null
2025-04-16	Characterizing and Optimizing LLM Inference Workloads on CPU-GPU Coupled Architectures	Prabhu Vellaisamy et.al.	2504.11750	null
2025-04-16	The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation	Bingjie Gao et.al.	2504.11739	null
2025-04-16	Recent Advance in 3D Object and Scene Generation: A Survey	Xiang Tang et.al.	2504.11734	null
2025-04-16	EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos	Jilan Xu et.al.	2504.11732	null
2025-04-16	EdgePrompt: A Distributed Key-Value Inference Framework for LLMs in 6G Networks	Jiahong Ning et.al.	2504.11729	null
2025-04-16	Probing the Unknown: Exploring Student Interactions with Probeable Problems at Scale in Introductory Programming	Paul Denny et.al.	2504.11723	null
2025-04-17	The Hitchhiker’s Guide to Program Analysis, Part II: Deep Thoughts by LLMs	Haonan Li et.al.	2504.11711	link
2025-04-16	Learning What NOT to Count	Adriano D’Alessandro et.al.	2504.11705	null
2025-04-16	A Library of LLM Intrinsics for Retrieval-Augmented Generation	Marina Danilevsky et.al.	2504.11704	null
2025-04-16	Progent: Programmable Privilege Control for LLM Agents	Tianneng Shi et.al.	2504.11703	link
2025-04-16	A New Paradigm of User-Centric Wireless Communication Driven by Large Language Models	Kuiyuan Ding et.al.	2504.11696	null
2025-04-16	Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics	Yiran He et.al.	2504.11686	null
2025-04-16	Higher-Order Binding of Language Model Virtual Personas: a Study on Approximating Political Partisan Misperceptions	Minwoo Kang et.al.	2504.11673	null
2025-04-16	Steering Prosocial AI Agents: Computational Basis of LLM’s Decision Making in Social Simulation	Ji Ma et.al.	2504.11671	null
2025-04-15	Improving LLM Interpretability and Performance via Guided Embedding Refinement for Sequential Recommendation	Nanshan Jia et.al.	2504.11658	null
2025-04-15	70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float	Tianyi Zhang et.al.	2504.11651	link
2025-04-15	Making Acoustic Side-Channel Attacks on Noisy Keyboards Viable with LLM-Assisted Spectrograms’ “Typo” Correction	Seyyed Ali Ayati et.al.	2504.11622	link
2025-04-15	Towards Interpretable Deep Generative Models via Causal Representation Learning	Gemma E. Moran et.al.	2504.11609	null
2025-04-15	GraphicBench: A Planning Benchmark for Graphic Design with Language Agents	Dayeon Ki et.al.	2504.11571	null
2025-04-15	Probabilistic causal graphs as categorical data synthesizers: Do they do better than Gaussian Copulas and Conditional Tabular GANs?	Olha Shaposhnyk et.al.	2504.11547	null
2025-04-15	NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes	Tianyang Xu et.al.	2504.11544	null
2025-04-15	HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis Generation	Haokun Liu et.al.	2504.11524	null
2025-04-15	FACT: Foundation Model for Assessing Cancer Tissue Margins with Mass Spectrometry	Mohammad Farahmand et.al.	2504.11519	link
2025-04-15	Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Ziqi Pang et.al.	2504.11457	link
2025-04-16	Elucidating the Design Space of Multimodal Protein Language Models	Cheng-Yen Hsieh et.al.	2504.11454	null
2025-04-15	TextArena	Leon Guertler et.al.	2504.11442	link
2025-04-15	Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models	Maria Teleki et.al.	2504.11431	link
2025-04-15	A Dual-Space Framework for General Knowledge Distillation of Large Language Models	Xue Zhang et.al.	2504.11426	null
2025-04-15	Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts	Quanyu Long et.al.	2504.11420	null
2025-04-15	DataDecide: How to Predict Best Pretraining Data with Small Experiments	Ian Magnusson et.al.	2504.11393	null
2025-04-15	RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models	Juan Diego Rodriguez et.al.	2504.11381	link
2025-04-15	Ring Artifacts Correction Based on Global-Local Features Interaction Guidance in the Projection Domain	Yunze Liu et.al.	2504.11375	null
2025-04-15	Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions	Wang Bill Zhu et.al.	2504.11373	link
2025-04-15	OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution	Lucio La Cava et.al.	2504.11369	null
2025-04-15	Teaching Large Language Models to Reason through Learning and Forgetting	Tianwei Ni et.al.	2504.11364	link
2025-04-15	Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning	Haiming Wang et.al.	2504.11354	link
2025-04-16	Seedream 3.0 Technical Report	Yu Gao et.al.	2504.11346	null
2025-04-15	A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce	Wei Xiong et.al.	2504.11343	link
2025-04-15	Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Ruicheng Ao et.al.	2504.11320	link
2025-04-15	Learning to Be A Doctor: Searching for Effective Medical Agent Architectures	Yangyang Zhuang et.al.	2504.11301	null
2025-04-16	Automated Python Translation	Joshua Otten et.al.	2504.11290	null
2025-04-15	The Obvious Invisible Threat: LLM-Powered GUI Agents’ Vulnerability to Fine-Print Injections	Chaoran Chen et.al.	2504.11281	null
2025-04-15	From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs	Guocong Li et.al.	2504.11277	null
2025-04-15	Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution	Xinning Chai et.al.	2504.11271	link
2025-04-15	Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning	Juan Garcia Giraldo et.al.	2504.11268	null
2025-04-15	Nondeterministic Polynomial-time Problem Challenge: An Ever-Scaling Reasoning Benchmark for LLMs	Chang Yang et.al.	2504.11239	link
2025-04-15	Video Summarization with Large Language Models	Min Jung Lee et.al.	2504.11199	null
2025-04-15	R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning	Lijun Sheng et.al.	2504.11195	link
2025-04-15	Enhancing multimodal analogical reasoning with Logic Augmented Generation	Anna Sofia Lippolis et.al.	2504.11190	link
2025-04-15	Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items	Minjie Zou et.al.	2504.11186	null
2025-04-15	Exploring Backdoor Attack and Defense for LLM-empowered Recommendations	Liangbo Ning et.al.	2504.11182	null
2025-04-15	TerraMesh: A Planetary Mosaic of Multimodal Earth Observation Data	Benedikt Blumenstiel et.al.	2504.11172	null
2025-04-15	TerraMind: Large-Scale Generative Multimodality for Earth Observation	Johannes Jakubik et.al.	2504.11171	null
2025-04-15	MuSeD: A Multimodal Spanish Dataset for Sexism Detection in Social Media Videos	Laura De Grazia et.al.	2504.11169	link
2025-04-15	Bypassing Prompt Injection and Jailbreak Detection in LLM Guardrails	William Hackett et.al.	2504.11168	null
2025-04-15	Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation	Linus Jern et.al.	2504.11109	link
2025-04-15	Using LLMs as prompt modifier to avoid biases in AI image generators	René Peinl et.al.	2504.11104	null
2025-04-15	AI-guided Antibiotic Discovery Pipeline from Target Selection to Compound Identification	Maximilian G. Schuh et.al.	2504.11091	null
2025-04-15	TD-Suite: All Batteries Included Framework for Technical Debt Classification	Karthik Shivashankar et.al.	2504.11085	link
2025-04-15	QAMA: Quantum annealing multi-head attention operator with classical deep learning framework	Peng Du et.al.	2504.11083	null
2025-04-15	DPS: Design Pattern Summarisation Using Code Features	Najam Nazar et.al.	2504.11081	link
2025-04-15	Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models	Andrea Tirinzoni et.al.	2504.11054	link
2025-04-15	Leveraging LLMs and attention-mechanism for automatic annotation of historical maps	Yunshuang Yuan et.al.	2504.11050	null
2025-04-15	LazyReview: A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews	Sukannya Purkayastha et.al.	2504.11042	link
2025-04-15	Defending Against Frequency-Based Attacks with Diffusion Models	Fatemeh Amerehi et.al.	2504.11034	null
2025-04-16	GATE3D: Generalized Attention-based Task-synergized Estimation in 3D	Eunsoo Im et.al.	2504.11014	null
2025-04-15	MMC: Iterative Refinement of VLM Reasoning via MCTS-based Multimodal Critique	Shuhang Liu et.al.	2504.11009	null
2025-04-15	Dynamic Compressing Prompts for Efficient Inference of Large Language Models	Jinwu Hu et.al.	2504.11004	null
2025-04-15	Dopamine Audiobook: A Training-free MLLM Agent for Emotional and Human-like Audiobook Generation	Yan Rong et.al.	2504.11002	null
2025-04-15	ReZero: Enhancing LLM search ability by trying one-more-time	Alan Dao et.al.	2504.11001	null
2025-04-16	Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs	Yingjian Chen et.al.	2504.10982	null
2025-04-15	When is Task Vector Provably Effective for Model Editing? A Generalization Analysis of Nonlinear Transformers	Hongkang Li et.al.	2504.10957	null
2025-04-15	Unveiling Challenges for LLMs in Enterprise Data Engineering	Jan-Micha Bodensohn et.al.	2504.10950	link
2025-04-15	Can LLMs Leverage Observational Data? Towards Data-Driven Causal Discovery with LLMs	Yuni Susanti et.al.	2504.10936	null
2025-04-15	Transfer Learning for Temporal Link Prediction	Ayan Chatterjee et.al.	2504.10925	link
2025-04-15	MSCRS: Multi-modal Semantic Graph Prompt Learning Framework for Conversational Recommender Systems	Yibiao Wei et.al.	2504.10921	link
2025-04-15	Adaptive Human-Agent Teaming: A Review of Empirical Studies from the Process Dynamics Perspective	Mengyao Wang et.al.	2504.10918	null
2025-04-15	Towards A Universal Graph Structural Encoder	Jialin Chen et.al.	2504.10917	null
2025-04-15	Understanding LLMs’ Cross-Lingual Context Retrieval: How Good It Is And Where It Comes From	Changjiang Gao et.al.	2504.10906	null
2025-04-15	Bridging Distribution Gaps in Time Series Foundation Model Pretraining with Prototype-Guided Normalization	Peiliang Gong et.al.	2504.10900	null
2025-04-15	ARise: Towards Knowledge-Augmented Reasoning via Risk-Adaptive Search	Yize Zhang et.al.	2504.10893	null
2025-04-15	Exploring Persona-dependent LLM Alignment for the Moral Machine Experiment	Jiseon Kim et.al.	2504.10886	null
2025-04-15	Large Language Model-Informed Feature Discovery Improves Prediction and Interpretation of Credibility Perceptions of Visual Content	Yilang Peng et.al.	2504.10878	null
2025-04-15	LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation	Hanning Chen et.al.	2504.10854	null
2025-04-15	Enhancing Features in Long-tailed Data Using Large Vision Model	Pengxiao Han et.al.	2504.10852	null
2025-04-15	How to Enhance Downstream Adversarial Robustness (almost) without Touching the Pre-Trained Foundation Model?	Meiqi Liu et.al.	2504.10850	null
2025-04-15	Moving Beyond Next-Token Prediction: Transformers are Context-Sensitive Language Generators	Phill Kyu Rhee et.al.	2504.10845	null
2025-04-15	LayoutCoT: Unleashing the Deep Reasoning Potential of Large Language Models for Layout Generation	Hengyu Shi et.al.	2504.10829	null
2025-04-15	CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives	Ayoung Lee et.al.	2504.10823	null
2025-04-15	IlluSign: Illustrating Sign Language Videos by Leveraging the Attention Mechanism	Janna Bruner et.al.	2504.10822	null
2025-04-15	CSPLADE: Learned Sparse Retrieval with Causal Language Models	Zhichao Xu et.al.	2504.10816	null
2025-04-15	Tabular foundation model to detect empathy from visual cues	Md Rakibul Hasan et.al.	2504.10808	null
2025-04-15	Can Large Language Models Trade? Testing Financial Theories with LLM Agents in Market Simulations	Alejandro Lopez-Lira et.al.	2504.10789	null
2025-04-15	The Art of Audience Engagement: LLM-Based Thin-Slicing of Scientific Talks	Ralf Schmälzle et.al.	2504.10768	null
2025-04-14	How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients	Ming Li et.al.	2504.10766	link
2025-04-14	CleanMAP: Distilling Multimodal LLMs for Confidence-Driven Crowdsourced HD Map Updates	Ankit Kumar Shaw et.al.	2504.10738	null
2025-04-14	Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization	Darryl Hannan et.al.	2504.10727	null
2025-04-14	HELIOS: Adaptive Model And Early-Exit Selection for Efficient LLM Inference Serving	Avinash Kumar et.al.	2504.10724	null
2025-04-14	Can LLMs Classify CVEs? Investigating LLMs Capabilities in Computing CVSS Vectors	Francesco Marchiori et.al.	2504.10713	link
2025-04-14	Distinct hydrologic response patterns and trends worldwide revealed by physics-embedded learning	Haoyu Ji et.al.	2504.10707	null
2025-04-14	Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE	Jesun Firoz et.al.	2504.10700	null
2025-04-14	The Jailbreak Tax: How Useful are Your Jailbreak Outputs?	Kristina Nikolić et.al.	2504.10694	link
2025-04-14	Load Balancing with Network Latencies via Distributed Gradient Descent	Santiago R. Balseiro et.al.	2504.10693	null
2025-04-14	Introducing Large Language Models as the Next Challenging Internet Traffic Source	Nataliia Koneva et.al.	2504.10688	link
2025-04-14	EMAFusion: A Self-Optimizing System for Seamless LLM Selection and Integration	Soham Shah et.al.	2504.10681	null
2025-04-14	Relation-Rich Visual Document Generator for Visual Information Extraction	Zi-Han Jiang et.al.	2504.10659	link
2025-04-14	MatterTune: An Integrated, User-Friendly Platform for Fine-Tuning Atomistic Foundation Models to Accelerate Materials Simulation and Discovery	Lingyu Kong et.al.	2504.10655	null
2025-04-14	Un marco conceptual para la generación de requerimientos de software de calidad	Mauro José Pacchiotti et.al.	2504.10654	null
2025-04-14	Weight-of-Thought Reasoning: Exploring Neural Network Weights for Enhanced LLM Reasoning	Saif Punjwani et.al.	2504.10646	link
2025-04-14	Who is More Bayesian: Humans or ChatGPT?	Tianshi Mu et.al.	2504.10636	null
2025-04-14	Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models	Thilo Hagendorff et.al.	2504.10615	null
2025-04-14	Energy Matching: Unifying Flow Matching and Energy-Based Models for Generative Modeling	Michal Balcerak et.al.	2504.10612	link
2025-04-15	InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models	Jinguo Zhu et.al.	2504.10479	link
2025-04-14	MIEB: Massive Image Embedding Benchmark	Chenghao Xiao et.al.	2504.10471	link
2025-04-14	Art3D: Training-Free 3D Generation from Flat-Colored Illustration	Xiaoyan Cong et.al.	2504.10466	null
2025-04-14	Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding	Tao Zhang et.al.	2504.10465	link
2025-04-14	The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer	Weixian Lei et.al.	2504.10462	link
2025-04-15	GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents	Xiaobo Xia et.al.	2504.10458	null
2025-04-14	M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models	Junxiong Wang et.al.	2504.10449	link
2025-04-14	Multimodal Long Video Modeling Based on Temporal Dynamic Context	Haoran Hao et.al.	2504.10443	link
2025-04-14	Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Taihang Hu et.al.	2504.10434	link
2025-04-14	LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models	Minqian Liu et.al.	2504.10430	null
2025-04-14	Foundation models for electronic health records: representation dynamics and transferability	Michael C. Burkhart et.al.	2504.10422	link
2025-04-14	Can We Edit LLMs for Long-Tail Biomedical Knowledge?	Xinhao Yi et.al.	2504.10421	link
2025-04-14	Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA	Michał Turski et.al.	2504.10419	link
2025-04-14	CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation	Jing Chen et.al.	2504.10418	null
2025-04-14	LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models	Parshin Shojaee et.al.	2504.10415	link
2025-04-14	Performance of Large Language Models in Supporting Medical Diagnosis and Treatment	Diogo Sousa et.al.	2504.10405	null
2025-04-14	Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks	Yan Zhu et.al.	2504.10403	null
2025-04-14	Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling?	Olha Shaposhnyk et.al.	2504.10397	null
2025-04-14	SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning	Yiting Wang et.al.	2504.10369	null
2025-04-14	Multimodal Representation Learning Techniques for Comprehensive Facial State Analysis	Kaiwen Zheng et.al.	2504.10351	null
2025-04-14	VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge	Yueqi Song et.al.	2504.10342	null
2025-04-14	Forecasting from Clinical Textual Time Series: Adaptations of the Encoder and Decoder Language Model Families	Shahriar Noroozizadeh et.al.	2504.10340	null
2025-04-14	MorphTok: Morphologically Grounded Tokenization for Indian Languages	Maharaj Brahma et.al.	2504.10335	null
2025-04-14	AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference	Yangshen Deng et.al.	2504.10326	null
2025-04-14	CROSSAN: Towards Efficient and Effective Adaptation of Multiple Multimodal Foundation Models for Sequential Recommendation	Junchen Fu et.al.	2504.10307	link
2025-04-14	Characterizing LLM-driven Social Network: The Chirper.ai Case	Yiming Zhu et.al.	2504.10286	null
2025-04-14	$\alpha$-Flow: A Unified Framework for Continuous-State Discrete Flow Matching Models	Chaoran Cheng et.al.	2504.10283	null
2025-04-14	Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials	Jingyun Yang et.al.	2504.10281	null
2025-04-14	XY-Cut++: Advanced Layout Ordering via Hierarchical Mask Mechanism on a Novel Benchmark	Shuai Liu et.al.	2504.10258	link
2025-04-14	GNN-ACLP: Graph Neural Networks based Analog Circuit Link Prediction	Guanyuan Pan et.al.	2504.10240	null
2025-04-14	A Model Zoo of Vision Transformers	Damian Falk et.al.	2504.10231	link
2025-04-14	Probing then Editing Response Personality of Large Language Models	Tianjie Ju et.al.	2504.10227	link
2025-04-14	PRM-BAS: Enhancing Multimodal Reasoning through PRM-guided Beam Annealing Search	Pengfei Hu et.al.	2504.10222	null
2025-04-14	Can Competition Enhance the Proficiency of Agents Powered by Large Language Models in the Realm of News-driven Time Series Forecasting?	Yuxuan Zhang et.al.	2504.10210	null
2025-04-14	DioR: Adaptive Cognitive Detection and Contextual Retrieval Optimization for Dynamic Retrieval-Augmented Generation	Hanghui Guo et.al.	2504.10198	null
2025-04-14	Localized Cultural Knowledge is Conserved and Controllable in Large Language Models	Veniamin Veselovsky et.al.	2504.10191	null
2025-04-14	Efficient Generative Model Training via Embedded Representation Warmup	Deyuan Liu et.al.	2504.10188	link
2025-04-14	LLM Unlearning Reveals a Stronger-Than-Expected Coreset Effect in Current Benchmarks	Soumyadeep Pal et.al.	2504.10185	link
2025-04-14	A New Paradigm in IBR Modeling for Power Flow and Short Circuit Analysis	Zahid Javid et.al.	2504.10181	null
2025-04-14	The Future of MLLM Prompting is Adaptive: A Comprehensive Experimental Evaluation of Prompt Engineering Methods for Robust Multimodal Performance	Anwesha Mohanty et.al.	2504.10179	null
2025-04-14	MSCoT: Structured Chain-of-Thought Generation for Multiple Programming Languages	Naizhu Jin et.al.	2504.10178	link
2025-04-14	HalluSearch at SemEval-2025 Task 3: A Search-Enhanced RAG Pipeline for Hallucination Detection	Mohamed A. Abdallah et.al.	2504.10168	null
2025-04-14	C-FAITH: A Chinese Fine-Grained Benchmark for Automated Hallucination Evaluation	Xu Zhang et.al.	2504.10167	null
2025-04-14	Fact-Checking with Contextual Narratives: Leveraging Retrieval-Augmented LLMs for Social Media Analysis	Arka Ujjal Dey et.al.	2504.10166	null
2025-04-14	MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning	Zhaopeng Feng et.al.	2504.10160	link
2025-04-14	COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts	Jiansheng Li et.al.	2504.10158	null
2025-04-14	SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users	Xinnong Zhang et.al.	2504.10157	link
2025-04-14	HistLLM: A Unified Framework for LLM-Based Multimodal Recommendation with User History Encoding and Compression	Chen Zhang et.al.	2504.10150	null
2025-04-14	Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers	Chunyang Zhang et.al.	2504.10148	null
2025-04-14	Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design	Andreas Happe et.al.	2504.10112	null
2025-04-14	Enhancing LLM-based Recommendation through Semantic-Aligned Collaborative Knowledge	Zihan Wang et.al.	2504.10107	null
2025-04-14	CameraBench: Benchmarking Visual Reasoning in MLLMs via Photography	I-Sheng Fang et.al.	2504.10090	null
2025-04-14	RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability	Yichi Zhang et.al.	2504.10081	null
2025-04-15	MMKB-RAG: A Multi-Modal Knowledge-Based Retrieval-Augmented Generation Framework	Zihan Ling et.al.	2504.10074	null
2025-04-14	Mavors: Multi-granularity Video Representation for Multimodal Large Language Model	Yang Shi et.al.	2504.10068	null
2025-04-14	Hallucination Detection in LLMs via Topological Divergence on Attention Graphs	Alexandra Bazarova et.al.	2504.10063	null
2025-04-15	Emotional Strain and Frustration in LLM Interactions in Software Engineering	Cristina Martinez Montes et.al.	2504.10050	null
2025-04-14	CodeRAG: Supportive Code Retrieval on Bigraph for Real-World Code Generation	Jia Li et.al.	2504.10046	null
2025-04-14	CHARM: Calibrating Reward Models With Chatbot Arena Scores	Xiao Zhu et.al.	2504.10045	link
2025-04-14	DataMosaic: Explainable and Verifiable Multi-Modal Data Analytics through Extract-Reason-Verify	Zhengxuan Zhang et.al.	2504.10036	null
2025-04-14	The Mirage of Performance Gains: Why Contrastive Decoding Fails to Address Multimodal Hallucination	Hao Yin et.al.	2504.10020	null
2025-04-14	Training LLMs on HPC Systems: Best Practices from the OpenGPT-X Project	Carolin Penke et.al.	2504.10013	null
2025-04-15	GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting	Junlin Hao et.al.	2504.10001	null
2025-04-14	Do We Really Need Curated Malicious Data for Safety Alignment in Multi-modal Large Language Models?	Yanbo Wang et.al.	2504.10000	null
2025-04-14	Enhancing Multi-task Learning Capability of Medical Generalist Foundation Model via Image-centric Multi-annotation Data	Xun Zhu et.al.	2504.09967	null
2025-04-14	Privacy Meets Explainability: Managing Confidential Data and Transparency Policies in LLM-Empowered Science	Yashothara Shanmugarasa et.al.	2504.09961	null
2025-04-14	C-MTCSD: A Chinese Multi-Turn Conversational Stance Detection Dataset	Fuqiang Niu et.al.	2504.09958	null
2025-04-14	Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes	Huijie Liu et.al.	2504.09948	null
2025-04-14	KeepKV: Eliminating Output Perturbation in KV Cache Compression for Efficient LLMs Inference	Yuxuan Tian et.al.	2504.09936	null
2025-04-14	Constrained Auto-Regressive Decoding Constrains Generative Retrieval	Shiguang Wu et.al.	2504.09935	null
2025-04-14	FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding	Zheng Liu et.al.	2504.09925	link
2025-04-14	Guiding Reasoning in Small Language Models with LLM Assistance	Yujin Kim et.al.	2504.09923	null
2025-04-14	Learning to Erase Private Knowledge from Multi-Documents for Retrieval-Augmented Large Language Models	Yujing Wang et.al.	2504.09910	null
2025-04-14	Refining Financial Consumer Complaints through Multi-Scale Model Interaction	Bo-Wei Chen et.al.	2504.09903	null
2025-04-14	TAMP: Token-Adaptive Layerwise Pruning in Multimodal Large Language Models	Jaewoo Lee et.al.	2504.09897	link
2025-04-14	Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data	Shuai Zhao et.al.	2504.09895	null
2025-04-14	LangPert: Detecting and Handling Task-level Perturbations for Robust Object Rearrangement	Xu Yin et.al.	2504.09893	null
2025-04-14	Ember: A Compiler for Efficient Embedding Operations on Decoupled Access-Execute Architectures	Marco Siracusa et.al.	2504.09870	null
2025-04-14	RadarLLM: Empowering Large Language Models to Understand Human Motion from Millimeter-wave Point Cloud Sequence	Zengyuan Lai et.al.	2504.09862	null
2025-04-14	EthosGPT: Mapping Human Value Diversity to Advance Sustainable Development Goals (SDGs)	Luyao Zhang et.al.	2504.09861	link
2025-04-14	SUMART: SUMmARizing Translation from Wordy to Concise Expression	Naoto Nishida et.al.	2504.09860	null
2025-04-14	Working with Large Language Models to Enhance Messaging Effectiveness for Vaccine Confidence	Lucinda Gullison et.al.	2504.09857	null
2025-04-14	PestMA: LLM-based Multi-Agent System for Informed Pest Management	Hongrui Shi et.al.	2504.09855	null
2025-04-14	A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science	Jie Feng et.al.	2504.09848	null
2025-04-14	$\mathbb{Z}_N$ generalizations of three-dimensional stabilizer codes	Chanbeen Lee et.al.	2504.09847	null
2025-04-14	OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training	Juntao Zhao et.al.	2504.09844	null
2025-04-14	StruPhantom: Evolutionary Injection Attacks on Black-Box Tabular Agents Powered by Large Language Models	Yang Feng et.al.	2504.09841	null
2025-04-14	Score Matching Diffusion Based Feedback Control and Planning of Nonlinear Systems	Karthik Elamvazhuthi et.al.	2504.09836	null
2025-04-14	RAKG: Document-level Retrieval Augmented Knowledge Graph Construction	Hairong Zhang et.al.	2504.09823	link
2025-04-14	Transferable text data distillation by trajectory matching	Rong Yao et.al.	2504.09818	null
2025-04-14	Augmented Relevance Datasets with Fine-Tuned Small LLMs	Quentin Fitte-Rey et.al.	2504.09816	null
2025-04-14	See or Recall: A Sanity Check for the Role of Vision in Solving Visualization Question Answer Tasks with Multimodal LLMs	Zhimin Li et.al.	2504.09809	null
2025-04-14	Training Small Reasoning LLMs with Cognitive Preference Alignment	Wenrui Cai et.al.	2504.09802	null
2025-04-14	ReadMe.LLM: A Framework to Help LLMs Understand Your Library	Sandya Wijaya et.al.	2504.09798	null
2025-04-14	Reasoning Court: Combining Reasoning, Action, and Judgment for Multi-Hop Reasoning	Jingtian Wu et.al.	2504.09781	null
2025-04-14	Reasoning without Regret	Tarun Chitra et.al.	2504.09777	null
2025-04-14	An Investigation of Large Language Models and Their Vulnerabilities in Spam Detection	Qiyao Tang et.al.	2504.09776	null
2025-04-14	Understanding and Optimizing Multi-Stage AI Inference Pipelines	Abhimanyu Rajeshkumar Bambhaniya et.al.	2504.09775	null
2025-04-14	Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning	Can Jin et.al.	2504.09772	link
2025-04-14	Socratic Chart: Cooperating Multiple Agents for Robust SVG Chart Understanding	Yuyang Ji et.al.	2504.09764	null
2025-04-11	Quantum Large Language Model Fine-Tuning	Sang Hyub Kim et.al.	2504.08732	null
2025-04-11	DocAgent: A Multi-Agent System for Automated Code Documentation Generation	Dayu Yang et.al.	2504.08725	link
2025-04-11	SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents	Muhammad Shihab Rashid et.al.	2504.08703	link
2025-04-11	Large Language Models as Span Annotators	Zdeněk Kasner et.al.	2504.08697	null
2025-04-11	TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning	Hang Ni et.al.	2504.08694	null
2025-04-11	Fast-Slow-Thinking: Complex Task Solving with Large Language Models	Yiliu Sun et.al.	2504.08690	null
2025-04-11	Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing	Jiho Kim et.al.	2504.08687	null
2025-04-11	Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Team Seawead et.al.	2504.08685	null
2025-04-11	Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis	Alexandre Bazin et.al.	2504.08666	null
2025-04-11	Safe Flow Matching: Robot Motion Planning with Control Barrier Functions	Xiaobing Dai et.al.	2504.08661	null
2025-04-11	Quality evaluation of Tabby coding assistant using real source code snippets	Marta Borek et.al.	2504.08650	link
2025-04-11	Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents	Alessio Buscemi et.al.	2504.08640	null
2025-04-11	Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging	Gabriele Lozupone et.al.	2504.08635	link
2025-04-11	Analyzing 16,193 LLM Papers for Fun and Profits	Zhiqiu Xia et.al.	2504.08619	null
2025-04-11	ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration	Yongsheng Yu et.al.	2504.08591	null
2025-04-11	Playpen: An Environment for Exploring Learning Through Conversational Interaction	Nicola Horst et.al.	2504.08590	link
2025-04-11	COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails	Miguel Espinosa et.al.	2504.08548	null
2025-04-11	Slicing the Gaussian Mixture Wasserstein Distance	Moritz Piening et.al.	2504.08544	link
2025-04-11	UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection	Frances Laureano De Leon et.al.	2504.08543	null
2025-04-11	Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions	Tommaso Galliena et.al.	2504.08531	null
2025-04-11	Task Memory Engine (TME): Enhancing State Awareness for Multi-Step LLM Agent Tasks	Ye Ye et.al.	2504.08525	link
2025-04-11	Adopting Large Language Models to Automated System Integration	Robin D. Pesl et.al.	2504.08490	null
2025-04-11	TickIt: Leveraging Large Language Models for Automated Ticket Escalation	Fengrui Liu et.al.	2504.08475	null
2025-04-11	On the Design of Diffusion-based Neural Speech Codecs	Pietro Foti et.al.	2504.08470	null
2025-04-11	Diffusion Models for Robotic Manipulation: A Survey	Rosa Wolf et.al.	2504.08438	null
2025-04-11	Customizing Spider Silk: Generative Models with Mechanical Property Conditioning for Protein Engineering	Neeru Dubey et.al.	2504.08437	null
2025-04-11	A Reproducibility Study of Graph-Based Legal Case Retrieval	Gregor Donabauer et.al.	2504.08400	null
2025-04-11	Beyond Self-Reports: Multi-Observer Agents for Personality Assessment in Large Language Models	Yin Jou Huang et.al.	2504.08399	null
2025-04-11	PCA-RAG: Principal Component Analysis for Efficient Retrieval-Augmented Generation	Arman Khaledian et.al.	2504.08386	null
2025-04-11	Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash	Fucheng Jia et.al.	2504.08378	null
2025-04-11	MedRep: Medical Concept Representation for General Electronic Health Record Foundation Models	Junmo Kim et.al.	2504.08329	link
2025-04-11	SortBench: Benchmarking LLMs based on their ability to sort lists	Steffen Herbold et.al.	2504.08312	null
2025-04-11	DSM: Building A Diverse Semantic Map for 3D Visual Grounding	Qinghongbing Xie et.al.	2504.08307	null
2025-04-11	Large language models could be rote learners	Yuyang Xu et.al.	2504.08300	null
2025-04-11	ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation	Vishal Gandhi et.al.	2504.08281	null
2025-04-11	To See or Not to See – Fingerprinting Devices in Adversarial Environments Amid Advanced Machine Learning	Justin Feng et.al.	2504.08264	null
2025-04-11	Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare	Yonchanok Khaokaew et.al.	2504.08260	null
2025-04-11	CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model	Ruohao Zhan et.al.	2504.08259	null
2025-04-11	RAG-VR: Leveraging Retrieval-Augmented Generation for 3D Question Answering in VR Environments	Shiyi Ding et.al.	2504.08256	link
2025-04-11	Understanding the Impact of Data Domain Extraction on Synthetic Data Privacy	Georgi Ganev et.al.	2504.08254	null
2025-04-11	Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices	Shengyuan Ye et.al.	2504.08242	null
2025-04-11	Optimal Transport-Based Generative Models for Bayesian Posterior Sampling	Ke Li et.al.	2504.08214	null
2025-04-11	How Good Are Large Language Models for Course Recommendation in MOOCs?	Boxuan Ma et.al.	2504.08208	null
2025-04-11	DRAFT-ing Architectural Design Decisions using LLMs	Rudra Dhar et.al.	2504.08207	link
2025-04-11	Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models	Yu Fu et.al.	2504.08202	null
2025-04-11	Neural Encoding and Decoding at Scale	Yizi Zhang et.al.	2504.08201	null
2025-04-11	A Vulnerability Code Intent Summary Dataset	Yifan Huang et.al.	2504.08180	null
2025-04-11	SynthFM: Training Modality-agnostic Foundation Models for Medical Image Segmentation without Real Medical Data	Sourya Sengupta et.al.	2504.08177	null
2025-04-11	GenXSS: an AI-Driven Framework for Automated Detection of XSS Attacks in WAFs	Vahid Babaey et.al.	2504.08176	null
2025-04-10	Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora	Alex Warstadt et.al.	2504.08165	link
2025-04-10	Information bounds on the accuracy of cell polarization	Tau-Mu Yi et.al.	2504.08164	null
2025-04-10	Investigating Vision-Language Model for Point Cloud-based Vehicle Classification	Yiqiao Li et.al.	2504.08154	null
2025-04-10	LoRAX: LoRA eXpandable Networks for Continual Synthetic Image Attribution	Danielle Sullivan-Pao et.al.	2504.08149	link
2025-04-10	Orchestrating Agents and Data for Enterprise: A Blueprint Architecture for Compound AI	Eser Kandogan et.al.	2504.08148	null
2025-04-10	Empowering Vector Architectures for ML: The CAMP Architecture for Matrix Multiplication	Mohammadreza Esmali Nojehdeh et.al.	2504.08137	null
2025-04-10	Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects	Shalini Maiti et.al.	2504.08125	null
2025-04-10	DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?	Daniil Larionov et.al.	2504.08120	null
2025-04-10	Test Amplification for REST APIs via Single and Multi-Agent LLM Systems	Robbe Nooyens et.al.	2504.08113	null
2025-04-10	Scaling Laws of Graph Neural Networks for Atomistic Materials Modeling	Chaojian Li et.al.	2504.08112	null
2025-04-10	POEM: Precise Object-level Editing via MLLM control	Marco Schouten et.al.	2504.08111	null
2025-04-10	Optimal Investment in Equity and Credit Default Swaps in the Presence of Default	Zhe Fei et.al.	2504.08085	null
2025-04-10	Teaching Humans Subtle Differences with DIFFusion	Mia Chiquier et.al.	2504.08046	null
2025-04-10	Can Reasoning LLMs Enhance Clinical Document Classification?	Akram Mustafa et.al.	2504.08040	null
2025-04-10	Emergence of psychopathological computations in large language models	Soo Yong Lee et.al.	2504.08016	link
2025-04-10	C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing	Zhongyang Li et.al.	2504.07964	link
2025-04-10	PixelFlow: Pixel-Space Generative Models with Flow	Shoufa Chen et.al.	2504.07963	link
2025-04-10	GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation	Lang Lin et.al.	2504.07962	null
2025-04-10	VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning	Zhong-Yu Li et.al.	2504.07960	null
2025-04-10	Detect Anything 3D in the Wild	Hanxue Zhang et.al.	2504.07958	link
2025-04-10	MM-IFEngine: Towards Multimodal Instruction Following	Shengyuan Ding et.al.	2504.07957	link
2025-04-10	VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning	Yukun Qi et.al.	2504.07956	null
2025-04-11	Pushing the Accuracy Limit of Foundation Neural Network Models with Quantum Monte Carlo Forces and Path Integrals	Anouar Benali et.al.	2504.07948	null
2025-04-10	We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy	Jordi Linares-Pellicer et.al.	2504.07936	null
2025-04-10	Porting an LLM based Application from ChatGPT to an On-Premise Environment	Teemu Paloniemi et.al.	2504.07907	null
2025-04-10	Redefining Machine Translation on Social Network Services with Large Language Models	Hongcheng Guo et.al.	2504.07901	link
2025-04-10	How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective	Qi Liu et.al.	2504.07898	link
2025-04-10	Fast Adaptation with Behavioral Foundation Models	Harshit Sikchi et.al.	2504.07896	null
2025-04-10	DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows	Mashrur M. Morshed et.al.	2504.07894	null
2025-04-10	Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge	Riccardo Cantini et.al.	2504.07887	link
2025-04-10	Token Level Routing Inference System for Edge Devices	Jianshu She et.al.	2504.07878	null
2025-04-10	SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos	Joshua Li et.al.	2504.07867	null
2025-04-11	Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs	Yichun Yin et.al.	2504.07866	null
2025-04-10	Robust Hallucination Detection in LLMs via Adaptive Token Selection	Mengjia Niu et.al.	2504.07863	null
2025-04-10	Horizons, throats and bounces in hybrid metric-Palatini gravity with a non-zero potential	Gabriel I. Róis et.al.	2504.07861	null
2025-04-10	2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization	Mengyang Li et.al.	2504.07856	null
2025-04-10	The KL3M Data Project: Copyright-Clean Training Resources for Large Language Models	Michael J Bommarito II et.al.	2504.07854	link
2025-04-10	Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines	Cansu Koyuturk et.al.	2504.07840	null
2025-04-10	Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models	Hongcheng Guo et.al.	2504.07807	link
2025-04-10	A System for Comprehensive Assessment of RAG Frameworks	Mattia Rengo et.al.	2504.07803	link
2025-04-10	FairEval: Evaluating Fairness in LLM-Based Recommendations with Personality Awareness	Chandan Kumar Sah et.al.	2504.07801	null
2025-04-10	Plan-and-Refine: Diverse and Comprehensive Retrieval-Augmented Generation	Alireza Salemi et.al.	2504.07794	link
2025-04-10	Revisiting Likelihood-Based Out-of-Distribution Detection by Modeling Representations	Yifan Ding et.al.	2504.07793	link
2025-04-10	Fairness Mediator: Neutralize Stereotype Associations to Mitigate Bias in Large Language Models	Yisong Xiao et.al.	2504.07787	null
2025-04-10	Exploring a Patch-Wise Approach for Privacy-Preserving Fake ID Detection	Javier Muñoz-Haro et.al.	2504.07761	null
2025-04-10	Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation	Bo Zhang et.al.	2504.07754	link
2025-04-10	SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding	Yangliu Hu et.al.	2504.07745	null
2025-04-10	Zero-Shot Cross-Domain Code Search without Fine-Tuning	Keyu Liang et.al.	2504.07740	link
2025-04-10	Automated Construction of a Knowledge Graph of Nuclear Fusion Energy for Effective Elicitation and Retrieval of Information	A. Loreti et.al.	2504.07738	null
2025-04-10	DeepGreen: Effective LLM-Driven Green-washing Monitoring System Designed for Empirical Testing – Evidence from China	Congluo Xu et.al.	2504.07733	null
2025-04-10	MRD-RAG: Enhancing Medical Diagnosis with Multi-Round Retrieval-Augmented Generation	Yixiang Chen et.al.	2504.07724	link
2025-04-10	PR-Attack: Coordinated Prompt-RAG Attacks on Retrieval-Augmented Generation in Large Language Models via Bilevel Optimization	Yang Jiao et.al.	2504.07717	null
2025-04-10	Proactive User Information Acquisition via Chats on User-Favored Topics	Shiki Sato et.al.	2504.07698	null
2025-04-10	Conformalized Generative Bayesian Imaging: An Uncertainty Quantification Framework for Computational Imaging	Canberk Ekmekci et.al.	2504.07696	null
2025-04-10	FMNV: A Dataset of Media-Published News Videos for Fake News Detection	Yihao Wang et.al.	2504.07687	null
2025-04-10	Synthetic Fluency: Hallucinations, Confabulations, and the Creation of Irish Words in LLM-Generated Translations	Sheila Castilho et.al.	2504.07680	null
2025-04-10	Data Requirement Goal Modeling for Machine Learning Systems	Asma Yamani et.al.	2504.07664	null
2025-04-10	Unveiling the Impact of Multimodal Features on Chinese Spelling Correction: From Analysis to Design	Xiaowu Zhang et.al.	2504.07661	link
2025-04-10	Synthesizing High-Quality Programming Tasks with LLM-based Expert and Student Agents	Manh Hung Nguyen et.al.	2504.07655	null
2025-04-10	On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data	Alfredo Garrachón Ruiz et.al.	2504.07646	null
2025-04-10	Enhancing Large Language Models through Neuro-Symbolic Integration and Ontological Reasoning	Ruslan Idelfonso Magana Vsevolodovna et.al.	2504.07640	link
2025-04-10	Agent That Debugs: Dynamic State-Guided Vulnerability Repair	Zhengyao Liu et.al.	2504.07634	null
2025-04-10	ConceptFormer: Towards Efficient Use of Knowledge-Graph Embeddings in Large Language Models	Joel Barmettler et.al.	2504.07624	null
2025-04-10	Beating Transformers using Synthetic Cognition	Alfredo Ibias et.al.	2504.07619	null
2025-04-10	VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model	Haozhan Shen et.al.	2504.07615	link
2025-04-11	Boosting Universal LLM Reward Design through Heuristic Reward Observation Space Evolution	Zen Kit Heng et.al.	2504.07596	null
2025-04-10	REANIMATOR: Reanimate Retrieval Test Collections with Extracted and Synthetic Resources	Björn Engelmann et.al.	2504.07584	link
2025-04-10	Exploring Human-Like Thinking in Search Simulations with Large Language Models	Erhan Zhang et.al.	2504.07570	link
2025-04-10	Benchmarking Image Embeddings for E-Commerce: Evaluating Off-the Shelf Foundation Models, Fine-Tuning Strategies and Practical Trade-offs	Urszula Czerwinska et.al.	2504.07567	null
2025-04-11	Using LLMs for Analyzing AIS Data	Gaspard Merten et.al.	2504.07557	null
2025-04-10	TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs	Zijian Zhang et.al.	2504.07556	null
2025-04-10	A taxonomy of epistemic injustice in the context of AI and the case for generative hermeneutical erasure	Warmhold Jan Thomas Mollema et.al.	2504.07531	null
2025-04-10	Automating the Path: An R&D Agenda for Human-Centered AI and Visualization	Niklas Elmqvist et.al.	2504.07529	null
2025-04-10	Supervised Optimism Correction: Be Confident When LLMs Are Sure	Junjie Zhang et.al.	2504.07527	null
2025-04-10	Why We Feel: Breaking Boundaries in Emotional Reasoning with Multimodal Large Language Models	Yuxiang Lin et.al.	2504.07521	link
2025-04-10	VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding	Henghao Zhao et.al.	2504.07519	null
2025-04-10	Enhancements for Developing a Comprehensive AI Fairness Assessment Standard	Avinash Agarwal et.al.	2504.07516	null
2025-04-10	GPT Carry-On: Training Foundation Model for Customization Could Be Simple, Scalable and Affordable	Jianqiao Wangni et.al.	2504.07513	null
2025-04-10	Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving	Shihong Gao et.al.	2504.07494	link
2025-04-10	UniCAIM: A Unified CAM/CIM Architecture with Static-Dynamic KV Cache Pruning for Efficient Long-Context LLM Inference	Weikai Xu et.al.	2504.07479	null
2025-04-10	Defense against Prompt Injection Attacks via Mixture of Encodings	Ruiyi Zhang et.al.	2504.07467	link
2025-04-10	Learning Universal Features for Generalizable Image Forgery Localization	Hengrun Zhao et.al.	2504.07462	link
2025-04-10	Achilles Heel of Distributed Multi-Agent Systems	Yiting Zhang et.al.	2504.07461	null
2025-04-10	Beyond LLMs: A Linguistic Approach to Causal Graph Generation from Narrative Texts	Zehan Li et.al.	2504.07459	null
2025-04-10	How Can Objects Help Video-Language Understanding?	Zitian Tang et.al.	2504.07454	null
2025-04-10	LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation	Juzheng Zhang et.al.	2504.07448	link
2025-04-10	Revisiting LLM Evaluation through Mechanism Interpretability: a New Metric and Model Utility Law	Yixin Cao et.al.	2504.07440	link
2025-04-10	LLM4Ranking: An Easy-to-use Framework of Utilizing Large Language Models for Document Reranking	Qi Liu et.al.	2504.07439	link
2025-04-10	From Token to Line: Enhancing Code Generation with a Long-Term Perspective	Tingwei Lu et.al.	2504.07433	null
2025-04-10	LLM-Enabled Data Transmission in End-to-End Semantic Communication	Shavbo Salehi et.al.	2504.07431	null
2025-04-10	Task-oriented Age of Information for Remote Inference with Hybrid Language Models	Shuying Gan et.al.	2504.07428	null
2025-04-10	Conditional Data Synthesis Augmentation	Xinyu Tian et.al.	2504.07426	null
2025-04-10	Enhancing Player Enjoyment with a Two-Tier DRL and LLM-Based Agent System for Fighting Games	Shouren Wang et.al.	2504.07425	null
2025-04-10	Routing to the Right Expertise: A Trustworthy Judge for Instruction-based Image Editing	Chenxi Sun et.al.	2504.07424	null
2025-04-10	RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability	Jonggwon Park et.al.	2504.07416	null
2025-04-10	Leveraging LLMs for Multimodal Retrieval-Augmented Radiology Report Generation via Key Phrase Extraction	Kyoyun Choi et.al.	2504.07415	null
2025-04-10	AI Coding with Few-Shot Prompting for Thematic Analysis	Samuel Flanders et.al.	2504.07408	null
2025-04-10	FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation	Linyan Huang et.al.	2504.07405	null
2025-04-10	Automating quantum feature map design via large language models	Kenya Sakka et.al.	2504.07396	link
2025-04-10	ID-Booth: Identity-consistent Face Generation with Diffusion Models	Darian Tomašević et.al.	2504.07392	link
2025-04-10	TALE: A Tool-Augmented Framework for Reference-Free Evaluation of Large Language Models	Sher Badshah et.al.	2504.07385	null
2025-04-10	Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction	Qingchao Jiang et.al.	2504.07382	link
2025-04-10	Structure-Property Relationship in Disordered Hyperuniform Materials: Microstructure Representation, Field Fluctuations and Effective Properties	Liyu Zhong et.al.	2504.07380	null
2025-04-10	Towards Distribution Matching between Collaborative and Language Spaces for Generative Recommendation	Yi Zhang et.al.	2504.07363	link
2025-04-10	Enhancing Time Series Forecasting via Multi-Level Text Alignment with LLMs	Taibiao Zhao et.al.	2504.07360	link
2025-04-10	Revisiting Prompt Optimization with Large Reasoning Models-A Case Study on Event Extraction	Saurabh Srivastava et.al.	2504.07357	null
2025-04-10	Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents	Yueying Li et.al.	2504.07347	null
2025-04-09	Code Generation with Small Language Models: A Deep Evaluation on Codeforces	Débora Souza et.al.	2504.07343	null
2025-04-09	Leveraging deep learning for plant disease identification: a bibliometric analysis in SCOPUS from 2018 to 2024	Enow Takang Achuo Albert et.al.	2504.07342	null
2025-04-09	Zeus: Zero-shot LLM Instruction for Union Segmentation in Multimodal Medical Imaging	Siyuan Dai et.al.	2504.07336	null
2025-04-09	Objaverse++: Curated 3D Object Dataset with Quality Annotations	Chendi Lin et.al.	2504.07334	link
2025-04-09	Alice: Proactive Learning with Teacher’s Demonstrations for Weak-to-Strong Generalization	Shujin Wu et.al.	2504.07316	link
2025-04-09	PAYADOR: A Minimalist Approach to Grounding Language Models on Structured Data for Interactive Storytelling and Role-playing Games	Santiago Góngora et.al.	2504.07304	link
2025-04-09	Modeling Response Consistency in Multi-Agent LLM Systems: A Comparative Analysis of Shared and Separate Context Approaches	Tooraj Helmi et.al.	2504.07303	null
2025-04-09	MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning	Yangning Li et.al.	2504.07288	null
2025-04-09	RAISE: Reinforenced Adaptive Instruction Selection For Large Language Models	Lv Qingsong et.al.	2504.07282	null
2025-04-09	Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning	Nikhil Shivakumar Nayak et.al.	2504.07097	link
2025-04-09	Are We Done with Object-Centric Learning?	Alexander Rubinstein et.al.	2504.07092	link
2025-04-09	KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs	Elan Markowitz et.al.	2504.07087	null
2025-04-09	Identifying Unknown Stochastic Dynamics via Finite expression methods	Senwei Liang et.al.	2504.07085	null
2025-04-09	DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning	Atharva Pandey et.al.	2504.07080	null
2025-04-09	A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models	Zhouhang Xie et.al.	2504.07070	null
2025-04-09	HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification	Bibek Paudel et.al.	2504.07069	null
2025-04-09	Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer	Shi Pan et.al.	2504.07061	null
2025-04-09	TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling	Liang-Hsuan Tseng et.al.	2504.07053	link
2025-04-09	To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning	Tian Qin et.al.	2504.07052	null
2025-04-09	Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety	Chad Melton et.al.	2504.07022	null
2025-04-09	LLM-IFT: LLM-Powered Information Flow Tracking for Secure Hardware	Nowfel Mashnoor et.al.	2504.07015	null
2025-04-09	Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies	Jonas Loos et.al.	2504.07008	link
2025-04-09	Towards LLMs Robustness to Changes in Prompt Format Styles	Lilian Ngweta et.al.	2504.06969	null
2025-04-09	Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation	Thomas Kerdreux et.al.	2504.06962	null
2025-04-09	VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning	Xinhao Li et.al.	2504.06958	null
2025-04-09	RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts	Natalia Loukachevitch et.al.	2504.06947	link
2025-04-09	Review of Case-Based Reasoning for LLM Agents: Theoretical Foundations, Architectural Components, and Cognitive Integration	Kostas Hatalis et.al.	2504.06943	null
2025-04-09	FeedbackEval: A Benchmark for Evaluating Large Language Models in Feedback-Driven Code Repair Tasks	Dekun Dai et.al.	2504.06939	link
2025-04-09	The Importance of Being Discrete: Measuring the Impact of Discretization in End-to-End Differentially Private Synthetic Data	Georgi Ganev et.al.	2504.06923	null
2025-04-09	Data Augmentation for Fake Reviews Detection in Multiple Languages and Multiple Domains	Ming Liu et.al.	2504.06917	null
2025-04-09	UKBOB: One Billion MRI Labeled Masks for Generalizable 3D Medical Image Segmentation	Emmanuelle Bourigault et.al.	2504.06908	null
2025-04-09	MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking	Chang Nie et.al.	2504.06863	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	null
2025-04-09	Integrating Cognitive Processing Signals into Language Models: A Review of Advances, Applications and Future Directions	Angela Lopez-Cardona et.al.	2504.06843	null
2025-04-09	LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding	Ziyi Wang et.al.	2504.06835	null
2025-04-09	IAAO: Interactive Affordance Learning for Articulated Objects in 3D Environments	Can Zhang et.al.	2504.06827	null
2025-04-09	Open Problems and a Hypothetical Path Forward in LLM Knowledge Paradigms	Xiaotian Ye et.al.	2504.06823	null
2025-04-09	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	link
2025-04-09	A Meaningful Perturbation Metric for Evaluating Explainability Methods	Danielle Cohen et.al.	2504.06800	null
2025-04-09	Zero-Shot Image-Based Large Language Model Approach to Road Pavement Monitoring	Shuoshuo Xu et.al.	2504.06785	null
2025-04-09	CHIME: A Compressive Framework for Holistic Interest Modeling	Yong Bai et.al.	2504.06780	null
2025-04-09	FamilyTool: A Multi-hop Personalized Tool Use Benchmark	Yuxin Wang et.al.	2504.06766	link
2025-04-09	Robust Capacity Expansion Modelling for Renewable Energy Systems under Weather and Demand Uncertainty	Sebastian Kebrich et.al.	2504.06750	link
2025-04-09	Plastic tensor networks for interpretable generative modeling	Katsuya O. Akamatsu et.al.	2504.06722	null
2025-04-09	Toward Holistic Evaluation of Recommender Systems Powered by Generative Models	Yashar Deldjoo et.al.	2504.06667	null
2025-04-09	Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception	Ruotian Peng et.al.	2504.06666	null
2025-04-09	SEE: Continual Fine-tuning with Sequential Ensemble of Experts	Zhilin Wang et.al.	2504.06664	link
2025-04-09	Bridging the Gap Between Preference Alignment and Machine Unlearning	Xiaohua Feng et.al.	2504.06659	null
2025-04-09	A Neuro-inspired Interpretation of Unlearning in Large Language Models through Sample-level Unlearning Difficulty	Xiaohua Feng et.al.	2504.06658	null
2025-04-09	ThoughtProbe: Classifier-Guided Thought Space Exploration Leveraging LLM Intrinsic Reasoning	Zijian Wang et.al.	2504.06650	null
2025-04-09	SCI-Reason: A Dataset with Chain-of-Thought Rationales for Complex Multimodal Reasoning in Academic Areas	Chenghao Ma et.al.	2504.06637	null
2025-04-09	BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation	Kaiyuan Li et.al.	2504.06636	null
2025-04-09	The Method for Storing Patterns in Neural Networks-Memorization and Recall of QR code Patterns-	Hiroshi Inazawa et.al.	2504.06631	null
2025-04-09	Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program	Minghe Gao et.al.	2504.06606	link
2025-04-09	Automated Business Process Analysis: An LLM-Based Approach to Value Assessment	William De Michele et.al.	2504.06600	link
2025-04-09	A Multi-Modal Interaction Framework for Efficient Human-Robot Collaborative Shelf Picking	Abhinav Pathak et.al.	2504.06593	null
2025-04-09	Right Prediction, Wrong Reasoning: Uncovering LLM Misalignment in RA Disease Diagnosis	Umakanta Maharana et.al.	2504.06581	link
2025-04-09	Bypassing Safety Guardrails in LLMs Using Humor	Pedro Cisneros-Velarde et.al.	2504.06577	null
2025-04-09	NeedleInATable: Exploring Long-Context Capability of Large Language Models towards Long-Structured Tables	Lanrui Wang et.al.	2504.06560	null
2025-04-09	Societal Impacts Research Requires Benchmarks for Creative Composition Tasks	Judy Hanwen Shen et.al.	2504.06549	null
2025-04-09	DiffusionCom: Structure-Aware Multimodal Diffusion Model for Multimodal Knowledge Graph Completion	Wei Huang et.al.	2504.06543	null
2025-04-09	Lugha-Llama: Adapting Large Language Models for African Languages	Happy Buzaaba et.al.	2504.06536	null
2025-04-08	Towards Holistic Prompt Craft	Joseph Lindley et.al.	2504.06496	null
2025-04-08	Mind the Gap: Evaluating Vision Systems in Small Data Applications	Samuel Stevens et.al.	2504.06486	link
2025-04-08	Can LLMs Simulate Personas with Reversed Performance? A Benchmark for Counterfactual Instruction Following	Sai Adith Senthil Kumar et.al.	2504.06460	null
2025-04-08	Can you Finetune your Binoculars? Embedding Text Watermarks into the Weights of Large Language Models	Fay Elhassan et.al.	2504.06446	null
2025-04-08	Don’t Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning	Yuehan Qin et.al.	2504.06438	null
2025-04-08	Language-Dependent Political Bias in AI: A Study of ChatGPT and Gemini	Dogus Yuksel et.al.	2504.06436	null
2025-04-08	Human Trust in AI Search: A Large-Scale Experiment	Haiwen Li et.al.	2504.06435	null
2025-04-08	S’MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning	Hanqing Zeng et.al.	2504.06426	null
2025-04-08	Releasing Differentially Private Event Logs Using Generative Models	Frederik Wangelik et.al.	2504.06418	link
2025-04-08	Unifying Autoregressive and Diffusion-Based Sequence Generation	Nima Fathi et.al.	2504.06416	null
2025-04-08	Comparing Self-Disclosure Themes and Semantics to a Human, a Robot, and a Disembodied Agent	Sophie Chiang et.al.	2504.06374	null
2025-04-08	Query Understanding in LLM-based Conversational Information Seeking	Yifei Yuan et.al.	2504.06356	null
2025-04-08	A Geometric-Aware Perspective and Beyond: Hybrid Quantum-Classical Machine Learning Methods	Azadeh Alavia et.al.	2504.06328	null
2025-04-08	From Stability to Inconsistency: A Study of Moral Preferences in LLMs	Monika Jotautaite et.al.	2504.06324	null
2025-04-08	Mosaic: Composite Projection Pruning for Resource-efficient LLMs	Bailey J. Eccles et.al.	2504.06323	null
2025-04-09	GOLLuM: Gaussian Process Optimized LLMs – Reframing LLM Finetuning through Bayesian Optimization	Bojana Ranković et.al.	2504.06265	link
2025-04-08	OmniSVG: A Unified Scalable Vector Graphics Generation Model	Yiying Yang et.al.	2504.06263	null
2025-04-09	Hogwild! Inference: Parallel LLM Generation via Concurrent Attention	Gleb Rodionov et.al.	2504.06261	link
2025-04-08	FEABench: Evaluating Language Models on Multiphysics Reasoning Ability	Nayantara Mudur et.al.	2504.06260	link
2025-04-08	Electronic Structure Guided Inverse Design Using Generative Models	Shuyi Jia et.al.	2504.06249	link
2025-04-08	Orb-v3: atomistic simulation at scale	Benjamin Rhodes et.al.	2504.06231	link
2025-04-08	LExT: Towards Evaluating Trustworthiness of Natural Language Explanations	Krithi Shailya et.al.	2504.06227	null
2025-04-08	Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation	Biao Zhang et.al.	2504.06225	null
2025-04-09	Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation	Xiaoxing Hu et.al.	2504.06220	link
2025-04-08	Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs	Dongyang Fan et.al.	2504.06219	null
2025-04-08	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models	Chejian Xu et.al.	2504.06214	null
2025-04-08	TxGemma: Efficient and Agentic LLMs for Therapeutics	Eric Wang et.al.	2504.06196	null
2025-04-08	A Self-Supervised Framework for Space Object Behaviour Characterisation	Ian Groves et.al.	2504.06176	null
2025-04-08	Assessing how hyperparameters impact Large Language Models’ sarcasm detection performance	Montgomery Gole et.al.	2504.06166	null
2025-04-09	Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups	Rijul Magu et.al.	2504.06160	null
2025-04-08	A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning	Akash Kumar et.al.	2504.06153	null
2025-04-08	V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models	Xiangxi Zheng et.al.	2504.06148	link
2025-04-08	ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs	Tooraj Helmi et.al.	2504.06143	null
2025-04-10	A Multimedia Analytics Model for the Foundation Model Era	Marcel Worring et.al.	2504.06138	null
2025-04-08	QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform	Movina Moses et.al.	2504.06136	null
2025-04-08	FaceCloak: Learning to Protect Face Templates	Sudipta Banerjee et.al.	2504.06131	link
2025-04-08	Knowledge Graph Completion with Relation-Aware Anchor Enhancement	Duanyang Yuan et.al.	2504.06129	link
2025-04-08	Multi-Sense Embeddings for Language Models and Knowledge Distillation	Qitong Wang et.al.	2504.06036	null
2025-04-08	Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi	Monojit Choudhury et.al.	2504.06011	null
2025-04-08	Optuna vs Code Llama: Are LLMs a New Paradigm for Hyperparameter Tuning?	Roman Kochnev et.al.	2504.06006	null
2025-04-08	Note on the Universality of Parameterized IQP Circuits with Hidden Units for Generating Probability Distributions	Andrii Kurkin et.al.	2504.05997	null
2025-04-08	NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge	Firoj Alam et.al.	2504.05995	null
2025-04-08	An Empirical Study of GPT-4o Image Generation Capabilities	Sixiang Chen et.al.	2504.05979	link
2025-04-08	AVP-AP: Self-supervised Automatic View Positioning in 3D cardiac CT via Atlas Prompting	Xiaolin Fan et.al.	2504.05966	null
2025-04-08	Unsupervised Location Mapping for Narrative Corpora	Eitan Wagner et.al.	2504.05954	null
2025-04-08	InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Control	Ruixiang Wu et.al.	2504.05946	null
2025-04-08	Assessing Thai Dialect Performance in LLMs with Automatic Benchmarks and Human Evaluation	Peerat Limkonchotiwat et.al.	2504.05898	null
2025-04-08	KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection	Xingyuan Li et.al.	2504.05878	null
2025-04-08	Agent Guide: A Simple Agent Behavioral Watermarking Framework	Kaibo Huang et.al.	2504.05871	null
2025-04-08	CTI-HAL: A Human-Annotated Dataset for Cyber Threat Intelligence Analysis	Sofia Della Penna et.al.	2504.05866	null
2025-04-08	Are Generative AI Agents Effective Personalized Financial Advisors?	Takehiro Takayanagi et.al.	2504.05862	link
2025-04-08	Enhancing Coreference Resolution with Pretrained Language Models: Bridging the Gap Between Syntax and Semantics	Xingzu Liu et.al.	2504.05855	null
2025-04-08	Physics-aware generative models for turbulent fluid flows through energy-consistent stochastic interpolants	Nikolaj T. Mücke et.al.	2504.05852	link
2025-04-08	PathGPT: Leveraging Large Language Models for Personalized Route Generation	Steeve Cuthbert Marcelyn et.al.	2504.05846	null
2025-04-08	Leveraging Robust Optimization for LLM Alignment under Distribution Shifts	Mingye Zhu et.al.	2504.05831	null
2025-04-08	Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models	Jiahao Chen et.al.	2504.05815	null
2025-04-08	Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization	Qingyang Zhang et.al.	2504.05812	link
2025-04-08	PaMi-VDPO: Mitigating Video Hallucinations by Prompt-Aware Multi-Instance Video Preference Learning	Xinpeng Ding et.al.	2504.05810	null
2025-04-08	StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization	Yiming Tang et.al.	2504.05804	link
2025-04-08	From Superficial to Deep: Integrating External Knowledge for Follow-up Question Generation Using Knowledge Graph and LLM	Jianyu Liu et.al.	2504.05801	null
2025-04-08	DefMamba: Deformable Visual State Space Model	Leiye Liu et.al.	2504.05794	null
2025-04-08	ViralQC: A Tool for Assessing Completeness and Contamination of Predicted Viral Contigs	Cheng Peng et.al.	2504.05790	link
2025-04-08	How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM	Jirong Zha et.al.	2504.05786	null
2025-04-08	MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models	Pengfei Zhou et.al.	2504.05782	link
2025-04-08	SEA-LION: Southeast Asian Languages in One Network	Raymond Ng et.al.	2504.05747	null
2025-04-08	LLM-assisted Mutation for Whitebox API Testing	Jia Li et.al.	2504.05738	null
2025-04-08	Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring	Yida Cai et.al.	2504.05736	null
2025-04-08	LLM×MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources	Haoyu Wang et.al.	2504.05732	link
2025-04-08	Retrieval Augmented Generation with Collaborative Filtering for Personalized Text Generation	Teng Shi et.al.	2504.05731	link
2025-04-08	Unified Generative Search and Recommendation	Teng Shi et.al.	2504.05730	null
2025-04-08	Single-Agent vs. Multi-Agent LLM Strategies for Automated Student Reflection Assessment	Gen Li et.al.	2504.05716	null
2025-04-08	Automated Archival Descriptions with Federated Intelligence of LLMs	Jinghua Groppe et.al.	2504.05711	null
2025-04-08	Large Language Models Enhanced Hyperbolic Space Recommender Systems	Wentao Cheng et.al.	2504.05694	null
2025-04-08	STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation	Aniket Deroy et.al.	2504.05693	null
2025-04-08	StayLTC: A Cost-Effective Multimodal Framework for Hospital Length of Stay Forecasting	Sudeshna Jana et.al.	2504.05691	null
2025-04-09	STAGE: Stemmed Accompaniment Generation through Prefix-Based Conditioning	Giorgio Strano et.al.	2504.05690	null
2025-04-08	Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators	Xitao Li et.al.	2504.05689	null
2025-04-08	Towards Smarter Hiring: Are Zero-Shot and Few-Shot Pre-trained LLMs Ready for HR Spoken Interview Transcript Analysis?	Subhankar Maity et.al.	2504.05683	null
2025-04-08	VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs	Dongjun Qian et.al.	2504.05673	null
2025-04-08	Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing	Tianchi Liu et.al.	2504.05657	link
2025-04-08	Sugar-Coated Poison: Benign Generation Unlocks LLM Jailbreaking	Yu-Hang Wu et.al.	2504.05652	link
2025-04-08	iEBAKER: Improved Remote Sensing Image-Text Retrieval Framework via Eliminate Before Align and Keyword Explicit Reasoning	Yan Zhang et.al.	2504.05644	null
2025-04-08	Leveraging Prompt-Tuning for Bengali Grammatical Error Explanation Using Large Language Models	Subhankar Maity et.al.	2504.05642	null
2025-04-08	TAGC: Optimizing Gradient Communication in Distributed Transformer Training	Igor Polyakov et.al.	2504.05638	link
2025-04-08	Model-Agnostic Policy Explanations with Large Language Models	Zhang Xi-Jia et.al.	2504.05625	null
2025-04-08	Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement	Yichen Dong et.al.	2504.05614	null
2025-04-08	Falcon: Fractional Alternating Cut with Overcoming Minima in Unsupervised Segmentation	Xiao Zhang et.al.	2504.05613	null
2025-04-08	FactGuard: Leveraging Multi-Agent Systems to Generate Answerable and Unanswerable Questions for Enhanced Long-Context LLM Extraction	Qian-Wen Zhang et.al.	2504.05607	link
2025-04-08	On the Impact of Language Nuances on Sentiment Analysis with Large Language Models: Paraphrasing, Sarcasm, and Emojis	Naman Bhargava et.al.	2504.05603	null
2025-04-08	Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought	Yi Peng et.al.	2504.05599	null
2025-04-08	DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding	Hossein Entezari Zarch et.al.	2504.05598	null
2025-04-08	Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions	Oded Ovadia et.al.	2504.05571	null
2025-04-07	Can Large Language Models Match Tutoring System Adaptivity? A Benchmarking Study	Conrad Borchers et.al.	2504.05570	null
2025-04-07	From Fairness to Truthfulness: Rethinking Data Valuation Design	Dongyang Fan et.al.	2504.05563	null
2025-04-07	SciSciGPT: Advancing Human-AI Collaboration in the Science of Science	Erzhuo Shao et.al.	2504.05559	null
2025-04-07	COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values	M-A-P Team et.al.	2504.05535	null
2025-04-07	Bridging Industrial Expertise and XR with LLM-Powered Conversational Agents	Despina Tomkou et.al.	2504.05527	null
2025-04-07	Pretraining Language Models for Diachronic Linguistic Change Discovery	Elisabeth Fittschen et.al.	2504.05523	null
2025-04-07	User Feedback Alignment for LLM-powered Exploration in Large-scale Recommendation Systems	Jianling Wang et.al.	2504.05522	null
2025-04-07	Efficient Reinforcement Finetuning via Adaptive Curriculum Learning	Taiwei Shi et.al.	2504.05520	link
2025-04-07	Evaluating the Generalization Capabilities of Large Language Models on Code Reasoning	Rem Yang et.al.	2504.05518	null
2025-04-07	SelfMAD: Enhancing Generalization and Robustness in Morphing Attack Detection via Self-Supervised Learning	Marija Ivanovska et.al.	2504.05504	null
2025-04-07	Prism: Dynamic and Flexible Benchmarking of LLMs Code Generation with Monte Carlo Tree Search	Vahid Majdinasab et.al.	2504.05500	null
2025-04-07	A Survey on Hypothesis Generation for Scientific Discovery in the Era of Large Language Models	Atilla Kaan Alkan et.al.	2504.05496	null
2025-04-07	REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding	Sakib Reza et.al.	2504.05491	null
2025-04-07	GraphRAFT: Retrieval Augmented Fine-Tuning for Knowledge Graphs on Graph Databases	Alfred Clemedtson et.al.	2504.05478	link
2025-04-07	Generative Adversarial Networks with Limited Data: A Survey and Benchmarking	Omar De Mitri et.al.	2504.05456	null
2025-04-07	Connecting Feedback to Choice: Understanding Educator Preferences in GenAI vs. Human-Created Lesson Plans in K-12 Education – A Comparative Analysis	Shawon Sarkar et.al.	2504.05449	null
2025-04-07	EP-Diffuser: An Efficient Diffusion Model for Traffic Scene Generation and Prediction via Polynomial Representations	Yue Yao et.al.	2504.05422	null
2025-04-07	Less but Better: Parameter-Efficient Fine-Tuning of Large Language Models for Personality Detection	Lingzhi Shen et.al.	2504.05411	null
2025-04-07	EduPlanner: LLM-Based Multi-Agent Systems for Customized and Intelligent Instructional Design	Xueqiao Zhang et.al.	2504.05370	null
2025-04-07	URECA: Unique Region Caption Anything	Sangbeom Lim et.al.	2504.05305	null
2025-04-07	InteractVLM: 3D Interaction Reasoning from 2D Foundational Models	Sai Kumar Dwivedi et.al.	2504.05303	link
2025-04-07	Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations	Pedro Ferreira et.al.	2504.05294	null
2025-04-07	The challenge of uncertainty quantification of large language models in medicine	Zahra Atf et.al.	2504.05278	null
2025-04-07	Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation	Yucheng Chu et.al.	2504.05276	null
2025-04-07	Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models	Yang Yan et.al.	2504.05262	null
2025-04-07	Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models	Adrián Bazaga et.al.	2504.05258	null
2025-04-07	Explaining Low Perception Model Competency with High-Competency Counterfactuals	Sara Pohland et.al.	2504.05254	null
2025-04-07	LLM-based Automated Grading with Human-in-the-Loop	Hang Li et.al.	2504.05239	null
2025-04-08	Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG	Hengran Zhang et.al.	2504.05220	null
2025-04-07	Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Hengran Zhang et.al.	2504.05216	null
2025-04-07	Post-Training Language Models for Continual Relation Extraction	Sefika Efeoglu et.al.	2504.05214	null
2025-04-07	Quantum Program Linting with LLMs: Emerging Results from a Comparative Study	Seung Yeob Shin et.al.	2504.05204	null
2025-04-07	P2Mark: Plug-and-play Parameter-intrinsic Watermarking for Neural Speech Generation	Yong Ren et.al.	2504.05197	null
2025-04-07	Training state-of-the-art pathology foundation models with orders of magnitude less data	Mikhail Karasikov et.al.	2504.05186	null
2025-04-07	Concise Reasoning via Reinforcement Learning	Mehdi Fatemi et.al.	2504.05185	link
2025-04-07	BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks	Wei Li et.al.	2504.05180	null
2025-04-07	Learning symmetries in datasets	Veronica Sanz et.al.	2504.05174	null
2025-04-07	Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness	Dongzhuoran Zhou et.al.	2504.05163	null
2025-04-07	DDPM Score Matching and Distribution Learning	Sinho Chewi et.al.	2504.05161	null
2025-04-07	PanoDreamer: Consistent Text to 360-Degree Scene Generation	Zhexiao Xiong et.al.	2504.05152	null
2025-04-07	Prεεmpt: Sanitizing Sensitive Prompts for LLMs	Amrita Roy Chowdhury et.al.	2504.05147	link
2025-04-07	Query Smarter, Trust Better? Exploring Search Behaviours for Verifying News Accuracy	David Elsweiler et.al.	2504.05146	null
2025-04-07	DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation	Xinglin Lyu et.al.	2504.05122	link
2025-04-07	Algorithm Discovery With LLMs: Evolutionary Search Meets Reinforcement Learning	Anja Surina et.al.	2504.05108	null
2025-04-07	Speech-to-Trajectory: Learning Human-Like Verbal Guidance for Robot Motion	Eran Beeri Bamani et.al.	2504.05084	null
2025-04-07	The Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning	Tianshi Zheng et.al.	2504.05081	null
2025-04-07	On the Performance of an Explainable Language Model on PubMedQA	Venkat Srinivasan et.al.	2504.05074	null
2025-04-08	Not All Data Are Unlearned Equally	Aravind Krishnan et.al.	2504.05058	link
2025-04-07	Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models	Jiawei Lian et.al.	2504.05050	null
2025-04-07	Debate Only When Necessary: Adaptive Multiagent Collaboration for Efficient LLM Reasoning	Sugyeong Eo et.al.	2504.05047	null
2025-04-07	InstructionBench: An Instructional Video Understanding Benchmark	Haiwan Wei et.al.	2504.05040	null
2025-04-07	Mixture-of-Personas Language Models for Population Simulation	Ngoc Bui et.al.	2504.05019	null
2025-04-07	Surveying Professional Writers on AI: Limitations, Expectations, and Fears	Anastasiia Ivanova et.al.	2504.05008	link
2025-04-07	Enhancing Smart Contract Vulnerability Detection in DApps Leveraging Fine-Tuned LLM	Jiuyang Bu et.al.	2504.05006	null
2025-04-07	Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs	Ling Hu et.al.	2504.04994	null
2025-04-07	RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model	Congcong Wen et.al.	2504.04988	null
2025-04-07	Low-Rate Semantic Communication with Codebook-based Conditional Generative Models	Kailang Ye et.al.	2504.04977	null
2025-04-07	A Domain-Based Taxonomy of Jailbreak Vulnerabilities in Large Language Models	Carlos Peláez-González et.al.	2504.04976	null
2025-04-07	Towards Visual Text Grounding of Multimodal Large Language Model	Ming Li et.al.	2504.04974	null
2025-04-07	The Dream Within Huang Long Cave: AI-Driven Interactive Narrative for Family Storytelling and Emotional Reflection	Jiayang Huang et.al.	2504.04968	null
2025-04-07	A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization	Wenyuan Xu et.al.	2504.04950	null
2025-04-07	One Quantizer is Enough: Toward a Lightweight Audio Codec	Linwei Zhai et.al.	2504.04949	link
2025-04-07	A Llama walks into the ‘Bar’: Efficient Supervised Fine-Tuning for Legal Reasoning in the Multi-state Bar Exam	Rean Fernandes et.al.	2504.04945	null
2025-04-07	Lemmanaid: Neuro-Symbolic Lemma Conjecturing	Yousef Alhessi et.al.	2504.04942	null
2025-04-07	Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration	Ran Xu et.al.	2504.04915	link
2025-04-07	Video-Bench: Human-Aligned Video Generation Benchmark	Hui Han et.al.	2504.04907	null
2025-04-07	SCAM: A Real-World Typographic Robustness Evaluation for Multimodal Foundation Models	Justus Westerhoff et.al.	2504.04893	link
2025-04-07	Leveraging Large Language Models for Cost-Effective, Multilingual Depression Detection and Severity Assessment	Longdi Xian et.al.	2504.04891	null
2025-04-07	SoK: LLM-based Log Parsing	Viktor Beck et.al.	2504.04877	link
2025-04-07	Simulating Persuasive Dialogues on Meat Reduction with Generative Agents	Georg Ahnert et.al.	2504.04872	link
2025-04-07	BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents	Haoxuan Li et.al.	2504.04855	null
2025-04-07	Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models	Ruikang Liu et.al.	2504.04823	link
2025-04-07	Beyond Answers: How LLMs Can Pursue Strategic Thinking in Education	Eleonora Grassucci et.al.	2504.04815	null
2025-04-07	Select Me! When You Need a Tool: A Black-box Text Attack on Tool Selection	Liuji Chen et.al.	2504.04809	null
2025-04-07	ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines	Tengjun Jin et.al.	2504.04808	link
2025-04-07	OrderChain: A General Prompting Paradigm to Improve Ordinal Understanding Ability of MLLM	Jinhong Wang et.al.	2504.04801	null
2025-04-07	Topological Schrödinger Bridge Matching	Maosheng Yang et.al.	2504.04799	link
2025-04-07	TabRep: Training Tabular Diffusion Models with a Simple and Effective Continuous Representation	Jacob Si et.al.	2504.04798	link
2025-04-07	Addressing the Curse of Scenario and Task Generalization in AI-6G: A Multi-Modal Paradigm	Tianyu Jiao et.al.	2504.04797	null
2025-04-07	Weak-for-Strong: Training Weak Meta-Agent to Harness Strong Executors	Fan Nie et.al.	2504.04785	link
2025-04-07	OCC-MLLM-CoT-Alpha: Towards Multi-stage Occlusion Recognition Based on Large Language Models via 3D-Aware Supervision and Chain-of-Thoughts Guidance	Chaoyi Wang et.al.	2504.04781	null
2025-04-07	Improving Multilingual Retrieval-Augmented Language Models through Dialectic Reasoning Argumentations	Leonardo Ranaldi et.al.	2504.04771	null
2025-04-07	Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches	Eloi Moliner et.al.	2504.04751	null
2025-04-07	Can LLMs Interpret and Leverage Structured Linguistic Representations? A Case Study with AMRs	Ankush Raut et.al.	2504.04745	null
2025-04-07	AnyArtisticGlyph: Multilingual Controllable Artistic Glyph Generation	Xiongbo Lu et.al.	2504.04743	null
2025-04-07	Enhancing Compositional Reasoning in Vision-Language Models with Synthetic Preference Data	Samarth Mishra et.al.	2504.04740	link
2025-04-07	TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context	Shubham Kumar Nigam et.al.	2504.04737	null
2025-04-07	Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Anna Goldie et.al.	2504.04736	null
2025-04-07	Can LLM-Driven Hard Negative Sampling Empower Collaborative Filtering? Findings and Potentials	Chu Zhao et.al.	2504.04726	link
2025-04-08	Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models	Yubo Li et.al.	2504.04717	link
2025-04-07	Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs	Will Cai et.al.	2504.04715	link
2025-04-07	Sequential-NIAH: A Needle-In-A-Haystack Benchmark for Extracting Sequential Needles from Long Contexts	Yifei Yu et.al.	2504.04713	null
2025-04-07	Generalising from Self-Produced Data: Model Training Beyond Human Constraints	Alfath Daryl Alhajir et.al.	2504.04711	null
2025-04-07	LagKV: Lag-Relative Information of the KV Cache Tells Which Tokens Are Important	Manlai Liang et.al.	2504.04704	link
2025-04-07	Causal Retrieval with Semantic Consideration	Hyunseo Shin et.al.	2504.04700	null
2025-04-07	R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation	Martin Weyssow et.al.	2504.04699	link
2025-04-07	scAgent: Universal Single-Cell Annotation via a LLM Agent	Yuren Mao et.al.	2504.04698	null
2025-04-07	Generative Large Language Model usage in Smart Contract Vulnerability Detection	Peter Ince et.al.	2504.04685	null
2025-04-07	ACE-RLHF: Automated Code Evaluation and Socratic Feedback Generation Tool using Large Language Models and Reinforcement Learning with Human Feedback	Tasnia Rahman et.al.	2504.04657	null
2025-04-07	LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts	Yimu Wang et.al.	2504.04653	null
2025-04-06	Splits! A Flexible Dataset for Evaluating a Model’s Demographic Social Inference	Eylon Caplan et.al.	2504.04640	link
2025-04-06	Foundation Models for Software Engineering of Cyber-Physical Systems: the Road Ahead	Chengjie Lu et.al.	2504.04630	null
2025-04-06	SECQUE: A Benchmark for Evaluating Real-World Financial Analysis Capabilities	Noga Ben Yoash et.al.	2504.04596	null
2025-04-08	Your Image Generator Is Your New Private Dataset	Nicolo Resmini et.al.	2504.04582	null
2025-04-06	Hierarchical Planning for Complex Tasks with Knowledge Graph-RAG and Symbolic Verification	Cristina Cornelio et.al.	2504.04578	null
2025-04-06	DexTOG: Learning Task-Oriented Dexterous Grasp with Language	Jieyi Zhang et.al.	2504.04573	null
2025-04-06	Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language Models	Rui Gan et.al.	2504.04562	link
2025-04-06	Chain of Understanding: Supporting Code Understanding with Large Language Models	Jie Gao et.al.	2504.04553	null
2025-04-06	Advancing Egocentric Video Question Answering with Multimodal Large Language Models	Alkesh Patel et.al.	2504.04550	null
2025-04-06	Opening the black box of deep learning: Validating the statistical association between explainable artificial intelligence (XAI) and clinical domain knowledge in fundus image-based glaucoma diagnosis	Han Yuan et.al.	2504.04549	null
2025-04-06	The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?	Weichen Zhang et.al.	2504.04540	null
2025-04-06	An Empirical Comparison of Text Summarization: A Multi-Dimensional Evaluation of Large Language Models	Anantharaman Janakiraman et.al.	2504.04534	null
2025-04-06	Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning	Xuerui Su et.al.	2504.04524	link
2025-04-06	Hessian of Perplexity for Large Language Models by PyTorch autograd (Open Source)	Ivan Ilin et.al.	2504.04520	link
2025-04-06	Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection	Jiancheng Pan et.al.	2504.04517	link
2025-04-06	Saliency-driven Dynamic Token Pruning for Large Language Models	Yao Tao et.al.	2504.04514	null
2025-04-06	Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification	Shijian Wang et.al.	2504.04510	null
2025-04-06	VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT	Zhuo Zhi et.al.	2504.04471	null
2025-04-06	Domain Generalization for Face Anti-spoofing via Content-aware Composite Prompt Engineering	Jiabao Guo et.al.	2504.04470	null
2025-04-04	MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models	Wulin Xie et.al.	2504.03641	null
2025-04-04	Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning	Xinyi Wang et.al.	2504.03635	null
2025-04-04	Enhancing Causal Effect Estimation with Diffusion-Generated Data	Li Chen et.al.	2504.03630	null
2025-04-04	Align to Structure: Aligning Large Language Models with Structural Information	Zae Myung Kim et.al.	2504.03622	null
2025-04-04	VISTA-OCR: Towards generative and interactive end to end OCR models	Laziz Hamdi et.al.	2504.03621	null
2025-04-04	Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task	Leonardo Ranaldi et.al.	2504.03616	null
2025-04-04	Autonomous and Self-Adapting System for Synthetic Media Detection and Attribution	Aref Azizpour et.al.	2504.03615	null
2025-04-04	AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset	Bingxiang He et.al.	2504.03612	null
2025-04-04	MedSAM2: Segment Anything in 3D Medical Images and Videos	Jun Ma et.al.	2504.03600	link
2025-04-04	EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline	Peter Baile Chen et.al.	2504.03598	null
2025-04-04	PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector	Kaidong Li et.al.	2504.03563	null
2025-04-04	Agentic Knowledgeable Self-awareness	Shuofei Qiao et.al.	2504.03553	link
2025-04-04	HumanDreamer-X: Photorealistic Single-image Human Avatars Reconstruction via Gaussian Restoration	Boyuan Wang et.al.	2504.03536	null
2025-04-04	RANa: Retrieval-Augmented Navigation	Gianluca Monaci et.al.	2504.03524	null
2025-04-04	Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles	Chen Wei Kuo et.al.	2504.03520	null
2025-04-04	Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej	Shubham Kumar Nigam et.al.	2504.03486	null
2025-04-04	D-Garment: Physics-Conditioned Latent Diffusion for Dynamic Garment Deformations	Antoine Dumoulin et.al.	2504.03468	null
2025-04-04	Generating ensembles of spatially-coherent in-situ forecasts using flow matching	David Landry et.al.	2504.03463	null
2025-04-04	Conditioning Diffusions Using Malliavin Calculus	Jakiw Pidstrigach et.al.	2504.03461	null
2025-04-04	Optimizing Specific and Shared Parameters for Efficient Parameter Tuning	Van-Anh Nguyen et.al.	2504.03450	null
2025-04-04	LLMSched: Uncertainty-Aware Workload Scheduling for Compound LLM Applications	Botao Zhu et.al.	2504.03444	null
2025-04-04	Know What You do Not Know: Verbalized Uncertainty Estimation Robustness on Corrupted Images in Vision-Language Models	Mirko Borszukovszki et.al.	2504.03440	null
2025-04-04	Locations of Characters in Narratives: Andersen and Persuasion Datasets	Batuhan Ozyurt et.al.	2504.03434	link
2025-04-04	BitHEP – The Limits of Low-Precision ML in HEP	Claudius Krause et.al.	2504.03387	link
2025-04-04	Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning	Sanghwan Bae et.al.	2504.03380	null
2025-04-04	MultiClear: Multimodal Soft Exoskeleton Glove for Transparent Object Grasping Assistance	Chen Hu et.al.	2504.03379	null
2025-04-04	Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency	Erik Johannes Husom et.al.	2504.03360	null
2025-04-04	BabyLM’s First Words: Word Segmentation as a Phonological Probing Task	Zébulon Goriely et.al.	2504.03338	null
2025-04-04	Steerable Anatomical Shape Synthesis with Implicit Neural Representations	Bram de Wilde et.al.	2504.03313	link
2025-04-04	Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices	Luís Couto Seller et.al.	2504.03312	null
2025-04-04	Noise Augmented Fine Tuning for Mitigating Hallucinations in Large Language Models	Afshin Khadangi et.al.	2504.03302	link
2025-04-04	Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task	Bingqian Wang et.al.	2504.03295	null
2025-04-04	Towards Effective EU E-Participation: The Development of AskThePublic	Kilian Sprenkamp et.al.	2504.03287	null
2025-04-04	Do Large Language Models Solve the Problems of Agent-Based Modeling? A Critical Review of Generative Social Simulations	Maik Larooij et.al.	2504.03274	null
2025-04-04	Inherent and emergent liability issues in LLM-based agentic systems: a principal-agent perspective	Garry A. Gabison et.al.	2504.03255	null
2025-04-04	Seeing is Believing: Belief-Space Planning with Foundation Models as Uncertainty Estimators	Linfeng Zhao et.al.	2504.03245	null
2025-04-04	Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video	Jiaxin Guo et.al.	2504.03198	null
2025-04-04	Explain with Visual Keypoints Like a Real Mentor! A Benchmark for Multimodal Solution Explanation	Jaewoo Park et.al.	2504.03197	null
2025-04-04	Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation	Xin Zhang et.al.	2504.03193	link
2025-04-04	Learning Natural Language Constraints for Safe Reinforcement Learning of Language Agents	Jaymari Chua et.al.	2504.03185	null
2025-04-04	RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation	Hanbo Bi et.al.	2504.03166	null
2025-04-04	Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation	Weitao Li et.al.	2504.03165	link
2025-04-04	DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments	Yuxiang Zheng et.al.	2504.03160	link
2025-04-04	Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction	Junlang Qian et.al.	2504.03159	link
2025-04-04	TokenFLEX: Unified VLM Training for Flexible Visual Tokens Inference	Junshan Hu et.al.	2504.03154	null
2025-04-04	Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1)	Jing Bi et.al.	2504.03151	null
2025-04-04	A Human Digital Twin Architecture for Knowledge-based Interactions and Context-Aware Conversations	Abdul Mannan Mohammed et.al.	2504.03147	null
2025-04-04	LightPROF: A Lightweight Reasoning Framework for Large Language Model on Knowledge Graph	Tu Ao et.al.	2504.03137	null
2025-04-04	Les Dissonances: Cross-Tool Harvesting and Polluting in Multi-Tool Empowered LLM Agents	Zichuan Li et.al.	2504.03111	null
2025-04-04	Single-Pass Document Scanning for Question Answering	Weili Cao et.al.	2504.03101	link
2025-04-03	Unlocking the AMD Neural Processing Unit for ML Training on the Client Using Bare-Metal-Programming Tools	André Rösti et.al.	2504.03083	null
2025-04-03	AD-GPT: Large Language Models in Alzheimer’s Disease	Ziyu Liu et.al.	2504.03071	null
2025-04-03	Design of AI-Powered Tool for Self-Regulation Support in Programming Education	Huiyong Li et.al.	2504.03068	null
2025-04-03	Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models	Chengyang He et.al.	2504.03051	null
2025-04-03	Extending CREAMT: Leveraging Large Language Models for Literary Translation Post-Editing	Antonio Castaldo et.al.	2504.03045	null
2025-04-03	Ontologies in Design: How Imagining a Tree Reveals Possibilites and Assumptions in Large Language Models	Nava Haghighi et.al.	2504.03029	null
2025-04-03	AuDeRe: Automated Strategy Decision and Realization in Robot Planning and Control via LLMs	Yue Meng et.al.	2504.03015	link
2025-04-03	What People Share With a Robot When Feeling Lonely and Stressed and How It Helps Over Time	Guy Laban et.al.	2504.02991	null
2025-04-03	Language Models Guidance with Multi-Aspect-Cueing: A Case Study for Competitor Analysis	Amir Hadifar et.al.	2504.02984	null
2025-04-03	Hummus: A Dataset of Humorous Multimodal Metaphor Use	Xiaoyu Tong et.al.	2504.02983	link
2025-04-03	Digital Forensics in the Age of Large Language Models	Zhipeng Yin et.al.	2504.02963	null
2025-04-03	Cultural Learning-Based Culture Adaptation of Language Models	Chen Cecilia Liu et.al.	2504.02953	link
2025-04-03	VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning	Xianwei Zhuang et.al.	2504.02949	link
2025-04-03	HyperRAG: Enhancing Quality-Efficiency Tradeoffs in Retrieval-Augmented Generation with Reranker KV-Cache Reuse	Yuwei An et.al.	2504.02921	null
2025-04-03	Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments	Chenyu Zhang et.al.	2504.02918	null
2025-04-03	Bias in Large Language Models Across Clinical Applications: A Systematic Review	Thanathip Suenghataiphorn et.al.	2504.02917	null
2025-04-03	Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models	Mateusz Pach et.al.	2504.02821	link
2025-04-03	Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization	Kangle Deng et.al.	2504.02817	null
2025-04-03	Generative Evaluation of Complex Reasoning in Large Language Models	Haowei Lin et.al.	2504.02810	link
2025-04-03	MegaMath: Pushing the Limits of Open Math Corpora	Fan Zhou et.al.	2504.02807	link
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	link
2025-04-04	A Survey of Large Language Models in Mental Health Disorder Detection on Social Media	Zhuohan Ge et.al.	2504.02800	null
2025-04-03	A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models	Gaurav Verma et.al.	2504.02793	null
2025-04-03	Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets	Chuning Zhu et.al.	2504.02792	null
2025-04-03	A Framework for Robust Cognitive Evaluation of LLMs	Karin de Langis et.al.	2504.02789	null
2025-04-03	From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks	Joshua Holstein et.al.	2504.02780	null
2025-04-03	BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs	Alexander Leszczynski et.al.	2504.02779	link
2025-04-03	How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?	Andres Algaba et.al.	2504.02767	link
2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null
2025-04-03	Echoes of the hidden: Uncovering coordination beyond network structure	Shahar Somin et.al.	2504.02757	null
2025-04-04	RBT4DNN: Requirements-based Testing of Neural Networks	Nusrat Jahan Mozumder et.al.	2504.02737	link
2025-04-03	Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study	Aryan Agrawal et.al.	2504.02733	link
2025-04-04	Why do LLMs attend to the first token?	Federico Barbero et.al.	2504.02732	null
2025-04-03	ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization	Kehua Feng et.al.	2504.02725	null
2025-04-03	TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models	Xinquan Wang et.al.	2504.02712	null
2025-04-03	The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context	Nikhil Verma et.al.	2504.02708	null
2025-04-03	LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems	Zishuo Liu et.al.	2504.02671	null
2025-04-03	Affordable AI Assistants with Knowledge Graph of Thoughts	Maciej Besta et.al.	2504.02670	null
2025-04-03	Prompt Optimization with Logged Bandit Data	Haruka Kiyohara et.al.	2504.02646	null
2025-04-03	Towards Computation- and Communication-efficient Computational Pathology	Chu Han et.al.	2504.02628	null
2025-04-03	Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions	PeiJie Yu et.al.	2504.02623	link
2025-04-03	Exploring undercurrents of learning tensions in an LLM-enhanced landscape: A student-centered qualitative perspective on LLM vs Search	Rahul R. Divekar et.al.	2504.02622	null
2025-04-03	Efficient Model Editing with Task-Localized Sparse Fine-tuning	Leonardo Iurada et.al.	2504.02620	link
2025-04-03	Variational Online Mirror Descent for Robust Learning in Schrödinger Bridge	Dong-Sig Han et.al.	2504.02618	null
2025-04-03	Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation	Jiwoo Chung et.al.	2504.02612	link
2025-04-03	Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving	Daoguang Zan et.al.	2504.02605	link
2025-04-04	Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme	Yan Ma et.al.	2504.02587	link
2025-04-03	Language Models reach higher Agreement than Humans in Historical Interpretation	Fabio Celli et.al.	2504.02572	null
2025-04-04	Leveraging LLM For Synchronizing Information Across Multilingual Tables	Siddharth Khincha et.al.	2504.02559	null
2025-04-03	Exploring Individual Factors in the Adoption of LLMs for Specific Software Engineering Tasks	Stefano Lambiase et.al.	2504.02553	null
2025-04-03	GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning	Xiangxiang Chu et.al.	2504.02546	link
2025-04-03	UNDO: Understanding Distillation as Optimization	Kushal Jain et.al.	2504.02521	null
2025-04-03	A Memory-Augmented LLM-Driven Method for Autonomous Merging of 3D Printing Work Orders	Yuhao Liu et.al.	2504.02509	null
2025-04-03	ZClip: Adaptive Spike Mitigation for LLM Pre-Training	Abhay Kumar et.al.	2504.02507	link
2025-04-03	Inference-Time Scaling for Generalist Reward Modeling	Zijun Liu et.al.	2504.02495	null
2025-04-03	MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities	Bizhu Wu et.al.	2504.02478	link
2025-04-03	Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision	Xiaofeng Han et.al.	2504.02477	null
2025-04-03	Retrieval-Augmented Purifier for Robust LLM-Empowered Recommendation	Liangbo Ning et.al.	2504.02458	null
2025-04-03	Cognitive Memory in Large Language Models	Lianlei Shan et.al.	2504.02441	null
2025-04-03	A Multi-Level Sentiment Analysis Framework for Financial Texts	Yiwei Liu et.al.	2504.02429	link
2025-04-03	Adapting Large Language Models for Multi-Domain Retrieval-Augmented-Generation	Alexandre Misrahi et.al.	2504.02411	null
2025-04-03	AnesBench: Multi-Dimensional Evaluation of LLM Reasoning in Anesthesiology	Xiang Feng et.al.	2504.02404	link
2025-04-03	DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers	Max Müller-Eberstein et.al.	2504.02403	null
2025-04-03	CrystalFormer-RL: Reinforcement Fine-Tuning for Materials Design	Zhendong Cao et.al.	2504.02367	link
2025-04-03	ReuseDroid: A VLM-empowered Android UI Test Migrator Boosted by Active Feedback	Xiaolei Li et.al.	2504.02357	null
2025-04-03	All-day Depth Completion via Thermal-LiDAR Fusion	Janghyun Kim et.al.	2504.02356	null
2025-04-03	Agglomerating Large Vision Encoders via Distillation for VFSS Segmentation	Chengxi Zeng et.al.	2504.02351	null
2025-04-03	Toward General and Robust LLM-enhanced Text-attributed Graph Learning	Zihao Zhang et.al.	2504.02343	null
2025-04-03	LearNAT: Learning NL2SQL with AST-guided Task Decomposition for Large Language Models	Weibin Liao et.al.	2504.02327	null
2025-04-03	CoTAL: Human-in-the-Loop Prompt Engineering, Chain-of-Thought Reasoning, and Active Learning for Generalizable Formative Assessment Scoring	Clayton Cohn et.al.	2504.02323	null
2025-04-03	OmniCam: Unified Multimodal Video Generation via Camera Control	Xiaoda Yang et.al.	2504.02312	null
2025-04-03	Relativistic compact object in Generalised Tolman-Kuchowicz spacetime with quadratic equation of state	Hemani R. Acharya et.al.	2504.02311	null
2025-04-03	Improving Harmful Text Detection with Joint Retrieval and External Knowledge	Zidong Yu et.al.	2504.02310	null
2025-04-03	Measurement of LLM’s Philosophies of Human Nature	Minheng Ni et.al.	2504.02304	link
2025-04-03	Parallel Market Environments for FinRL Contests	Keyi Wang et.al.	2504.02281	null
2025-04-03	LLM-Guided Evolution: An Autonomous Model Optimization for Object Detection	YiMing Yu et.al.	2504.02280	null
2025-04-03	Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models	Hung Le et.al.	2504.02273	null
2025-04-03	MinkOcc: Towards real-time label-efficient semantic occupancy prediction	Samuel Sze et.al.	2504.02270	null
2025-04-03	MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism	Ruidong Zhu et.al.	2504.02263	null
2025-04-03	LLMs as Deceptive Agents: How Role-Based Prompting Induces Semantic Ambiguity in Puzzle Tasks	Seunghyun Yoo et.al.	2504.02254	null
2025-04-03	LLM Social Simulations Are a Promising Research Method	Jacy Reese Anthis et.al.	2504.02234	null
2025-04-03	The Plot Thickens: Quantitative Part-by-Part Exploration of MLLM Visualization Literacy	Matheus Valentim et.al.	2504.02217	null
2025-04-03	LLM-Augmented Graph Neural Recommenders: Integrating User Reviews	Hiroki Kanezashi et.al.	2504.02195	null
2025-04-03	More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment	Yifan Wang et.al.	2504.02193	null
2025-04-02	A Survey of Scaling in Large Language Model Reasoning	Zihan Chen et.al.	2504.02181	null
2025-04-02	Subasa – Adapting Language Models for Low-resourced Offensive Language Detection in Sinhala	Shanilka Haturusinghe et.al.	2504.02178	null
2025-04-02	Responsible Innovation: A Strategic Framework for Financial LLM Integration	Ahmadreza Tavasoli et.al.	2504.02165	null
2025-04-02	OmniCellTOSG: The First Cell Text-Omic Signaling Graphs Dataset for Joint LLM and GNN Modeling	Heming Zhang et.al.	2504.02148	link
2025-04-02	LL4G: Self-Supervised Dynamic Optimization for Graph-Based Personality Detection	Lingzhi Shen et.al.	2504.02146	null
2025-04-02	On Simulation-Guided LLM-based Code Generation for Safe Autonomous Driving Software	Ali Nouri et.al.	2504.02141	null
2025-04-02	One Pic is All it Takes: Poisoning Visual Document Retrieval Augmented Generation with a Single Image	Ezzeldin Shereen et.al.	2504.02132	null
2025-04-02	Achieving Unanimous Consensus in Decision Making Using Multi-Agents	Apurba Pokharel et.al.	2504.02128	null
2025-04-02	Efficient Model Selection for Time Series Forecasting via LLMs	Wang Wei et.al.	2504.02119	null
2025-04-02	LLMPi: Optimizing LLMs for High-Throughput on Raspberry Pi	Mahsa Ardakani et.al.	2504.02118	null
2025-04-02	PolyG: Effective and Efficient GraphRAG with Adaptive Graph Traversal	Renjie Liu et.al.	2504.02112	null
2025-04-02	Exploring LLM Reasoning Through Controlled Prompt Variations	Giannis Chatziveroglou et.al.	2504.02111	link
2025-04-02	ScreenAudit: Detecting Screen Reader Accessibility Errors in Mobile Apps Using Large Language Models	Mingyuan Zhong et.al.	2504.02110	null
2025-04-02	TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining	Jeffrey Li et.al.	2504.02107	link
2025-04-02	ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation	Xiao Wang et.al.	2504.02106	null
2025-04-02	FlowDistill: Scalable Traffic Flow Prediction via Distillation from LLMs	Chenyang Yu et.al.	2504.02094	link
2025-04-02	Increasing happiness through conversations with artificial intelligence	Joseph Heffner et.al.	2504.02091	null
2025-04-02	Evolving Security in LLMs: A Study of Jailbreak Attacks and Defenses	Zhengchun Shang et.al.	2504.02080	null
2025-04-02	Trapped by Expectations: Functional Fixedness in LLM-Enabled Chat Search	Jiqun Liu et.al.	2504.02074	null
2025-04-02	From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP	Fabio Yáñez-Romero et.al.	2504.02064	null
2025-04-02	Aligned Better, Listen Better for Audio-Visual Large Language Models	Yuxin Guo et.al.	2504.02061	null
2025-04-02	Towards Operationalizing Heterogeneous Data Discovery	Jin Wang et.al.	2504.02059	null
2025-04-02	MageSQL: Enhancing In-context Learning for Text-to-SQL Applications with Large Language Models	Chen Shen et.al.	2504.02055	null
2025-04-02	From Prompts to Templates: A Systematic Prompt Template Analysis for Real-world LLMapps	Yuetian Mao et.al.	2504.02052	null
2025-04-02	WorldPrompter: Traversable Text-to-Scene Generation	Zhaoyang Zhang et.al.	2504.02045	null
2025-04-02	Slot-Level Robotic Placement via Visual Imitation from Single Human Video	Dandan Shan et.al.	2504.01959	null
2025-04-03	VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step	Hanyang Wang et.al.	2504.01956	null
2025-04-02	Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities	Jing Liu et.al.	2504.01954	null
2025-04-02	The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data	Massimiliano Luca et.al.	2504.01951	null
2025-04-02	OpenCodeReasoning: Advancing Data Distillation for Competitive Coding	Wasi Uddin Ahmad et.al.	2504.01943	null
2025-04-02	A Unified Approach to Analysis and Design of Denoising Markov Models	Yinuo Ren et.al.	2504.01938	null
2025-04-02	Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?	Celine Lee et.al.	2504.01935	link
2025-04-02	A thorough benchmark of automatic text classification: From traditional approaches to large language models	Washington Cunha et.al.	2504.01930	link
2025-04-02	Gen-C: Populating Virtual Worlds with Generative Crowds	Andreas Panayiotou et.al.	2504.01924	null
2025-04-02	Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation	Baban Gain et.al.	2504.01919	null
2025-04-02	Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning	Yinggan Xu et.al.	2504.01911	null
2025-04-02	Multi-fidelity Parameter Estimation Using Conditional Diffusion Models	Caroline Tatsuoka et.al.	2504.01894	null
2025-04-02	TransientTables: Evaluating LLMs’ Reasoning on Temporally Evolving Semi-structured Tables	Abhilash Shankarampeta et.al.	2504.01879	null
2025-04-02	Interpreting Emergent Planning in Model-Free Reinforcement Learning	Thomas Bush et.al.	2504.01871	null
2025-04-02	From Code Generation to Software Testing: AI Copilot with Context-Based RAG	Yuchen Wang et.al.	2504.01866	null
2025-04-02	Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models	Zhiwei Yu et.al.	2504.01857	null
2025-04-02	Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks	Ali Al-Kaswan et.al.	2504.01850	null
2025-04-02	BOGausS: Better Optimized Gaussian Splatting	Stéphane Pateux et.al.	2504.01844	null
2025-04-02	LARGE: Legal Retrieval Augmented Generation Evaluation Tool	Minhu Park et.al.	2504.01840	link
2025-04-02	YourBench: Easy Custom Evaluation Sets for Everyone	Sumuk Shashidhar et.al.	2504.01833	link
2025-04-02	Spatial-R1: Enhancing MLLMs in Video Spatial Reasoning	Kun Ouyang et.al.	2504.01805	link
2025-04-02	Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training	Zhijun Wang et.al.	2504.01801	link
2025-04-02	UniViTAR: Unified Vision Transformer with Native Resolution	Limeng Qiao et.al.	2504.01792	null
2025-04-02	OpenThaiGPT 1.6 and R1: Thai-Centric Open Source and Reasoning Large Language Models	Sumeth Yuenyong et.al.	2504.01789	null
2025-04-02	BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing	Yunqi Gu et.al.	2504.01786	link
2025-04-02	Leveraging Embedding Techniques in Multimodal Machine Learning for Mental Illness Assessment	Abdelrahaman A. Hassan et.al.	2504.01767	null
2025-04-02	AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization	Chaohu Liu et.al.	2504.01735	null
2025-04-03	InfiniteICL: Breaking the Limit of Context Window Size via Long Short-term Memory Transformation	Bowen Cao et.al.	2504.01707	null
2025-04-02	ToM-RL: Reinforcement Learning Unlocks Theory of Mind in Small LLMs	Yi-Long Lu et.al.	2504.01698	link
2025-04-02	System Level Synthesis for Affine Control Policies: Model Based and Data-Driven Settings	Lukas Schüepp et.al.	2504.01677	link
2025-04-03	Testing Low-Resource Language Support in LLMs Using Language Proficiency Exams: the Case of Luxembourgish	Cedric Lothritz et.al.	2504.01667	null
2025-04-02	Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning	Yiting Lu et.al.	2504.01655	link
2025-04-02	FlowR: Flowing from Sparse to Dense 3D Reconstructions	Tobias Fischer et.al.	2504.01647	null
2025-04-02	Proposition of Affordance-Driven Environment Recognition Framework Using Symbol Networks in Large Language Models	Kazuma Arii et.al.	2504.01644	null
2025-04-02	LLM-mediated Dynamic Plan Generation with a Multi-Agent Approach	Reo Abe et.al.	2504.01637	null
2025-04-02	Horizon Scans can be accelerated using novel information retrieval and artificial intelligence tools	Lena Schmidt et.al.	2504.01627	null
2025-04-02	Comment Staytime Prediction with LLM-enhanced Comment Understanding	Changshuo Zhang et.al.	2504.01602	link
2025-04-02	Integrating experimental feedback improves generative models for biological sequences	Francesco Calvanese et.al.	2504.01593	null
2025-04-03	Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval	Adriano Fragomeni et.al.	2504.01591	null
2025-04-02	Building Knowledge from Interactions: An LLM-Based Architecture for Adaptive Tutoring and Social Reasoning	Luca Garello et.al.	2504.01588	null
2025-04-02	Pro-DG: Procedural Diffusion Guidance for Architectural Facade Generation	Aleksander Plocharski et.al.	2504.01571	null
2025-04-02	GPT Adoption and the Impact of Disclosure Policies	Cathy Yang et.al.	2504.01566	null
2025-04-02	Bhakti: A Lightweight Vector Database Management System for Endowing Large Language Models with Semantic Search Capabilities and Memory	Zihao Wu et.al.	2504.01553	link
2025-04-02	Representation Bending for Large Language Model Safety	Ashkan Yousefpour et.al.	2504.01550	link
2025-04-02	Semi-Supervised Biomedical Image Segmentation via Diffusion Models and Teacher-Student Co-Training	Luca Ciampi et.al.	2504.01547	link
2025-04-02	Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation	Amanda Myntti et.al.	2504.01542	null
2025-04-02	Hyperbolic Diffusion Recommender Model	Meng Yuan et.al.	2504.01541	null
2025-04-02	LightDefense: A Lightweight Uncertainty-Driven Defense against Jailbreaks via Shifted Token Distribution	Zhuoran Yang et.al.	2504.01533	null
2025-04-02	Adapting Knowledge Prompt Tuning for Enhanced Automated Program Repair	Xuemeng Cai et.al.	2504.01523	link
2025-04-02	Redefining technology for indigenous languages	Silvia Fernandez-Sabido et.al.	2504.01522	null
2025-04-02	Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model	Jincheng Zhong et.al.	2504.01521	link
2025-04-02	Chain of Correction for Full-text Speech Recognition with Large Language Models	Zhiyuan Tang et.al.	2504.01519	null
2025-04-02	PROPHET: An Inferable Future Forecasting Benchmark with Causal Intervened Likelihood Estimation	Zhengwei Tao et.al.	2504.01509	link
2025-04-02	Are Autonomous Web Agents Good Testers?	Antoine Chevrot et.al.	2504.01495	null
2025-04-02	ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction	Yuejiao Su et.al.	2504.01472	null
2025-04-02	A Prefixed Patch Time Series Transformer for Two-Point Boundary Value Problems in Three-Body Problems	Akira Hatakeyama et.al.	2504.01464	null
2025-04-03	GeoRAG: A Question-Answering Approach from a Geographical Perspective	Jian Wang et.al.	2504.01458	null
2025-04-02	LLM-VPRF: Large Language Model Based Vector Pseudo Relevance Feedback	Hang Li et.al.	2504.01448	null
2025-04-02	Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality	Philipp Mondorf et.al.	2504.01445	link
2025-04-02	PiCo: Jailbreaking Multimodal Large Language Models via $\textbf{Pi}$ctorial $\textbf{Co}$de Contextualization	Aofan Liu et.al.	2504.01444	null
2025-04-02	Refining Interactions: Enhancing Anisotropy in Graph Neural Networks with Language Semantics	Zhaoxing Li et.al.	2504.01429	null
2025-04-02	Dynamic Incentive Strategies for Smart EV Charging Stations: An LLM-Driven User Digital Twin Approach	Yichen Sun et.al.	2504.01423	null
2025-04-02	FAIRE: Assessing Racial and Gender Bias in AI-Driven Resume Evaluations	Athena Wen et.al.	2504.01420	link
2025-04-02	LLM4SZZ: Enhancing SZZ Algorithm with Context-Enhanced Assessment on Large Language Models	Lingxiao Tang et.al.	2504.01404	null
2025-04-02	Generative Retrieval and Alignment Model: A New Paradigm for E-commerce Retrieval	Ming Pang et.al.	2504.01403	null
2025-04-02	ToolACE-R: Tool Learning with Adaptive Self-Refinement	Xingshan Zeng et.al.	2504.01400	null
2025-04-02	An Illusion of Progress? Assessing the Current State of Web Agents	Tianci Xue et.al.	2504.01382	link
2025-04-02	UniFault: A Fault Diagnosis Foundation Model from Bearing Data	Emadeldeen Eldele et.al.	2504.01373	null
2025-04-02	Slow-Fast Architecture for Video Multi-Modal Large Language Models	Min Shi et.al.	2504.01328	link
2025-04-02	Adaptive Rectification Sampling for Test-Time Compute Scaling	Zhendong Tan et.al.	2504.01317	link
2025-04-02	Safeguarding Vision-Language Models: Mitigating Vulnerabilities to Gaussian Noise in Perturbation-based Attacks	Jiawei Wang et.al.	2504.01308	link
2025-04-02	Real-time Ad retrieval via LLM-generative Commercial Intention for Sponsored Search Advertising	Tongtong Liu et.al.	2504.01304	null
2025-04-02	Extracting Formal Specifications from Documents Using LLMs for Automated Testing	Hui Li et.al.	2504.01294	link
2025-04-02	Prompt-Reverse Inconsistency: LLM Self-Inconsistency Beyond Generative Randomness and Prompt Paraphrasing	Jihyun Janice Ahn et.al.	2504.01282	null
2025-04-03	Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding	Sakhinana Sagar Srinivas et.al.	2504.01281	null
2025-04-02	Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning	Si Chen et.al.	2504.01278	null
2025-04-02	Facilitating Instructors-LLM Collaboration for Problem Design in Introductory Programming Classrooms	Muntasir Hoq et.al.	2504.01259	null
2025-04-01	Grade Guard: A Smart System for Short Answer Automated Grading	Niharika Dadu et.al.	2504.01253	null
2025-04-01	Plan-and-Act using Large Language Models for Interactive Agreement	Kazuhiro Sasabuchi et.al.	2504.01252	null
2025-04-01	Automated Factual Benchmarking for In-Car Conversational Systems using Large Language Models	Rafael Giebisch et.al.	2504.01248	null
2025-04-01	Catastrophic Forgetting in LLMs: A Comparative Analysis Across Language Tasks	Naimul Haque et.al.	2504.01241	null
2025-04-01	Towards Resilient Federated Learning in CyberEdge Networks: Recent Advances and Future Trends	Kai Li et.al.	2504.01240	null
2025-04-01	Prompting Forgetting: Unlearning in GANs via Textual Guidance	Piyush Nagasubramaniam et.al.	2504.01218	null
2025-04-01	Detecting PTSD in Clinical Interviews: A Comparative Analysis of NLP Methods and Large Language Models	Feng Chen et.al.	2504.01216	null
2025-04-01	Articulated Kinematics Distillation from Video Diffusion Models	Xuan Li et.al.	2504.01204	null
2025-04-01	Medical large language models are easily distracted	Krithik Vishwanath et.al.	2504.01201	link
2025-04-01	$μ$KE: Matryoshka Unstructured Knowledge Editing of Large Language Models	Zian Su et.al.	2504.01196	null
2025-04-01	Predicting Field Experiments with Large Language Models	Yaoyu Chen et.al.	2504.01167	null
2025-04-01	Beyond Quacking: Deep Integration of Language Models and RAG into DuckDB	Anas Dorbani et.al.	2504.01157	null
2025-04-01	Catch Me if You Search: When Contextual Web Search Results Affect the Detection of Hallucinations	Mahjabin Nahar et.al.	2504.01153	link
2025-04-01	MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs)	Bikash Saha et.al.	2504.01145	link
2025-04-01	Can LLMs Grasp Implicit Cultural Values? Benchmarking LLMs’ Metacognitive Cultural Intelligence with CQ-Bench	Ziyi Liu et.al.	2504.01127	link
2025-04-01	ShieldGemma 2: Robust and Tractable Image Content Moderation	Wenjun Zeng et.al.	2504.01081	null
2025-04-01	MixerMDM: Learnable Composition of Human Motion Diffusion Models	Pablo Ruiz-Ponce et.al.	2504.01019	null
2025-04-01	Self-Routing RAG: Binding Selective Retrieval with Knowledge Verbalization	Di Wu et.al.	2504.01018	null
2025-04-01	AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction	Junhao Cheng et.al.	2504.01014	link
2025-04-01	When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning	Nishad Singhi et.al.	2504.01005	null
2025-04-01	Token embeddings violate the manifold hypothesis	Michael Robinson et.al.	2504.01002	null
2025-04-01	MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization	Siyuan Li et.al.	2504.00999	link
2025-04-01	MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs	Juncheng Wu et.al.	2504.00993	link
2025-03-31	Consistent Subject Generation via Contrastive Instantiated Concepts	Lee Hsin-Ying et.al.	2503.24387	null
2025-03-31	Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation	Shengqiong Wu et.al.	2503.24379	null
2025-03-31	Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models	Rui Wang et.al.	2503.24377	link
2025-03-31	Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1	Yi Chen et.al.	2503.24376	link
2025-03-31	Effectively Controlling Reasoning Models through Thinking Intervention	Tong Wu et.al.	2503.24370	null
2025-03-31	Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation	Xiaoran Zhang et.al.	2503.24368	null
2025-03-31	ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion	Rana Muhammad Shahroz Khan et.al.	2503.24354	null
2025-03-31	PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks	Fang Yan et.al.	2503.24345	null
2025-03-31	Can Test-Time Scaling Improve World Foundation Model?	Wenyan Cong et.al.	2503.24320	link
2025-03-31	BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models	Alok Abhishek et.al.	2503.24310	null
2025-03-31	A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG	Arshia Kermani et.al.	2503.24307	null
2025-03-31	Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning	Jiacheng Lin et.al.	2503.24289	link
2025-03-31	Style Quantization for Data-Efficient GAN Training	Jian Wang et.al.	2503.24282	null
2025-03-31	Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality	Sewoong Lee et.al.	2503.24277	link
2025-03-31	Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation	Dun Yuan et.al.	2503.24245	null
2025-03-31	What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models	Qiyuan Zhang et.al.	2503.24235	link
2025-03-31	Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes	Daichi Otsuka et.al.	2503.24229	null
2025-03-31	Synthetic News Generation for Fake News Classification	Abdul Sittar et.al.	2503.24206	null
2025-03-31	TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers’ Guidance	Jingxian Xu et.al.	2503.24198	null
2025-03-31	Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval	Enrico Palumbo et.al.	2503.24193	null
2025-03-31	Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms	Shuoming Zhang et.al.	2503.24191	null
2025-03-31	Foundation Models For Seismic Data Processing: An Extensive Review	Fabian Fuchs et.al.	2503.24166	link
2025-03-31	LLM4FS: Leveraging Large Language Models for Feature Selection and How to Improve It	Jianhao Li et.al.	2503.24157	null
2025-03-31	AI-Assisted Colonoscopy: Polyp Detection and Segmentation using Foundation Models	Uxue Delaquintana-Aramendi et.al.	2503.24138	link
2025-03-31	It’s a (Blind) Match! Towards Vision-Language Correspondence without Parallel Data	Dominik Schnaus et.al.	2503.24129	link
2025-04-01	TeleAntiFraud-28k: An Audio-Text Slow-Thinking Dataset for Telecom Fraud Detection	Zhiming Ma et.al.	2503.24115	link
2025-03-31	PolypSegTrack: Unified Foundation Model for Colonoscopy Video Analysis	Anwesa Choudhuri et.al.	2503.24108	null
2025-03-31	Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?	Yewei Song et.al.	2503.24102	null
2025-03-31	TransMamba: Flexibly Switching between Transformer and Mamba	Yixing Li et.al.	2503.24067	null
2025-03-31	Artificial Conversations, Real Results: Fostering Language Detection with Synthetic Data	Fatemeh Mohammadi et.al.	2503.24062	null
2025-03-31	ReaLM: Reliable and Efficient Large Language Model Inference with Statistical Algorithm-Based Fault Tolerance	Tong Xie et.al.	2503.24053	link
2025-04-01	A Deep Learning Framework for the Electronic Structure of Water: Towards a Universal Model	Xinyuan Liang et.al.	2503.24050	null
2025-03-31	Towards Scientific Intelligence: A Survey of LLM-based Scientific Agents	Shuo Ren et.al.	2503.24047	null
2025-03-31	IntelliCircos: A Data-driven and AI-powered Authoring Tool for Circos Plots	Mingyang Gu et.al.	2503.24021	null
2025-03-31	H2VU-Benchmark: A Comprehensive Benchmark for Hierarchical Holistic Video Understanding	Qi Wu et.al.	2503.24008	null
2025-03-31	Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving	Wei Gao et.al.	2503.24000	link
2025-03-31	SALT: A Flexible Semi-Automatic Labeling Tool for General LiDAR Point Clouds with Cross-Scene Adaptability and 4D Consistency	Yanbo Wang et.al.	2503.23980	link
2025-04-01	Local Information Matters: Inference Acceleration For Grounded Conversation Generation Models Through Adaptive Local-Aware Token Pruning	Bizhe Bai et.al.	2503.23959	null
2025-03-31	Green MLOps to Green GenOps: An Empirical Study of Energy Consumption in Discriminative and Generative AI Operations	Adrián Sánchez-Mompó et.al.	2503.23934	null
2025-03-31	Model Hemorrhage and the Robustness Limits of Large Language Models	Ziyang Ma et.al.	2503.23924	null
2025-03-31	Entropy-Based Adaptive Weighting for Self-Training	Xiaoxuan Wang et.al.	2503.23913	link
2025-03-31	HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment	Zhichao Liao et.al.	2503.23907	null
2025-03-31	Rubrik’s Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset	Diana Galvan-Sosa et.al.	2503.23899	null
2025-03-31	Better wit than wealth: Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge Enhancement	Yuqiao Tan et.al.	2503.23895	link
2025-03-31	SchemaAgent: A Multi-Agents Framework for Generating Relational Database Schema	Qin Wang et.al.	2503.23886	link
2025-03-31	GenSwarm: Scalable Multi-Robot Code-Policy Generation and Deployment via Language Models	Wenkang Ji et.al.	2503.23875	link
2025-03-31	Exploring In-Context Learning Capabilities of ChatGPT for Pathological Speech Detection	Mahdi Amiri et.al.	2503.23873	null
2025-03-31	Communication-Efficient and Personalized Federated Foundation Model Fine-Tuning via Tri-Matrix Adaptation	Yongle Li et.al.	2503.23869	null
2025-04-01	Evaluating small vision-language models as AI assistants for radio astronomical source analysis tasks	S. Riggi et.al.	2503.23859	link
2025-03-31	FlexiMo: A Flexible Remote Sensing Foundation Model	Xuyang Li et.al.	2503.23844	null
2025-03-31	OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training	Yijie Zheng et.al.	2503.23830	null
2025-04-01	Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains	Yi Su et.al.	2503.23829	null
2025-03-31	Aud-Sur: An Audio Analyzer Assistant for Audio Surveillance Applications	Phat Lam et.al.	2503.23827	null
2025-03-31	Conformal uncertainty quantification to evaluate predictive fairness of foundation AI model for skin lesion classes across patient demographics	Swarnava Bhattacharyya et.al.	2503.23819	null
2025-03-31	MVDRAM: Enabling GeMV Execution in Unmodified DRAM for Low-Bit LLM Acceleration	Tatsuya Kubo et.al.	2503.23817	null
2025-04-01	Did ChatGPT or Copilot use alter the style of internet news headlines? A time series regression analysis	Chris Brogly et.al.	2503.23811	null
2025-03-31	Adaptive Attention-Based Model for 5G Radio-based Outdoor Localization	Ilayda Yaman et.al.	2503.23810	null
2025-03-31	Get the Agents Drunk: Memory Perturbations in Autonomous Agent-based Recommender Systems	Shiyi Yang et.al.	2503.23804	null
2025-03-31	Adaptive Layer-skipping in Pre-trained LLMs	Xuan Luo et.al.	2503.23798	null
2025-04-01	On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices	Bosung Kim et.al.	2503.23796	link
2025-03-31	LLMigrate: Transforming “Lazy” Large Language Models into Efficient Source Code Migrators	Yuchen Liu et.al.	2503.23791	null
2025-03-31	MGD-SAM2: Multi-view Guided Detail-enhanced Segment Anything Model 2 for High-Resolution Class-agnostic Segmentation	Haoran Shen et.al.	2503.23786	link
2025-03-31	ObfusQate: Unveiling the First Quantum Program Obfuscation Framework	Nilhil Bartake et.al.	2503.23785	null
2025-03-31	DebFlow: Automating Agent Creation via Agent Debate	Jinwei Su et.al.	2503.23781	null
2025-03-31	WinoWhat: A Parallel Corpus of Paraphrased WinoGrande Sentences with Common Sense Categorization	Ine Gevers et.al.	2503.23779	null
2025-03-31	CONGRAD: Conflicting Gradient Filtering for Multilingual Preference Alignment	Jiangnan Li et.al.	2503.23777	null
2025-03-31	XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?	Fengxiang Wang et.al.	2503.23771	null
2025-03-31	Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism	Linghao Feng et.al.	2503.23767	null
2025-03-31	Accelerating High-Efficiency Organic Photovoltaic Discovery via Pretrained Graph Neural Networks and Generative Reinforcement Learning	Jiangjie Qiu et.al.	2503.23766	null
2025-03-31	STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?	Yun Li et.al.	2503.23765	null
2025-03-31	UniSep: Universal Target Audio Separation with Language Models at Scale	Yuanyuan Wang et.al.	2503.23762	null
2025-03-31	Short-video Propagation Influence Rating: A New Real-world Dataset and A New Large Graph Model	Dizhan Xue et.al.	2503.23746	link
2025-03-31	LANID: LLM-assisted New Intent Discovery	Lu Fan et.al.	2503.23740	link
2025-03-31	AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization	Yiyang Du et.al.	2503.23733	link
2025-03-31	Detecting Functional Bugs in Smart Contracts through LLM-Powered and Bug-Oriented Composite Analysis	Binbin Zhao et.al.	2503.23718	null
2025-03-31	HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation	Kun Liu et.al.	2503.23715	null
2025-03-31	Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models	Youmi Ma et.al.	2503.23714	null
2025-03-31	A Conceptual Framework for Human-AI Collaborative Genome Annotation	Xiaomei Li et.al.	2503.23691	null
2025-03-31	Mapping Geopolitical Bias in 11 Large Language Models: A Bilingual, Dual-Framing Analysis of U.S.-China Tensions	William Guey et.al.	2503.23688	null
2025-03-31	Large Language Models Pass the Turing Test	Cameron R. Jones et.al.	2503.23674	null
2025-03-31	WHERE and WHICH: Iterative Debate for Biomedical Synthetic Data Augmentation	Zhengyi Zhao et.al.	2503.23673	null
2025-03-31	Context-Independent OCR with Multimodal LLMs: Effects of Image Resolution and Visual Complexity	Kotaro Inoue et.al.	2503.23667	null
2025-03-31	DeepDubber-V1: Towards High Quality and Dialogue, Narration, Monologue Adaptive Movie Dubbing Via Multi-Modal Chain-of-Thoughts Reasoning Guidance	Junjie Zheng et.al.	2503.23660	null
2025-04-01	GIScience in the Era of Artificial Intelligence: A Research Agenda Towards Autonomous GIS	Zhenlong Li et.al.	2503.23633	null
2025-03-30	Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation	Zahra TehraniNasab et.al.	2503.23623	null
2025-03-30	Leveraging Vision-Language Foundation Models to Reveal Hidden Image-Attribute Relationships in Medical Imaging	Amar Kumar et.al.	2503.23618	null
2025-03-30	Graph-Eq: Discovering Mathematical Equations using Graph Generative Models	Nisal Ranasinghe et.al.	2503.23617	null
2025-03-30	Make Autoregressive Great Again: Diffusion-Free Graph Generation with Next-Scale Prediction	Samuel Belkadi et.al.	2503.23612	null
2025-03-30	Exploring GPT-4 for Robotic Agent Strategy with Real-Time State Feedback and a Reactive Behaviour Framework	Thomas O’Brien et.al.	2503.23601	null
2025-03-30	When LLM Therapists Become Salespeople: Evaluating Large Language Models for Ethical Motivational Interviewing	Haein Kong et.al.	2503.23566	null
2025-03-30	Modelling the impact of phenotypic heterogeneity on cell migration: a continuum framework derived from individual-based principles	Rebecca M. Crossley et.al.	2503.23545	link
2025-03-30	Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages	Xabier de Zuazo et.al.	2503.23542	link
2025-03-30	Enhancing Creative Generation on Stable Diffusion-based Models	Jiyeon Han et.al.	2503.23538	link
2025-03-30	Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models	Haochen Liu et.al.	2503.23523	link
2025-03-30	If an LLM Were a Character, Would It Know Its Own Story? Evaluating Lifelong Learning in LLMs	Siqi Fan et.al.	2503.23514	null
2025-03-30	RARE: Retrieval-Augmented Reasoning Modeling	Zhengren Wang et.al.	2503.23513	link
2025-03-30	SCORE: Story Coherence and Retrieval Enhancement for AI Narratives	Qiang Yi et.al.	2503.23512	null
2025-03-30	Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model	Jannik Endres et.al.	2503.23502	link
2025-03-30	DNA and Human Language: Epigenetic Memory and Redundancy in Linear Sequence	Li Yang et.al.	2503.23494	null
2025-03-30	POINT$^{2}$: A Polymer Informatics Training and Testing Database	Jiaxin Xu et.al.	2503.23491	link
2025-03-28	Q-Insight: Understanding Image Quality via Visual Reinforcement Learning	Weiqi Li et.al.	2503.22679	link
2025-03-28	DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness	Ruining Li et.al.	2503.22677	null
2025-03-28	QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?	Belinda Z. Li et.al.	2503.22674	link
2025-03-28	Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers	Francesca Pezzuti et.al.	2503.22672	link
2025-03-28	Unicorn: Text-Only Data Synthesis for Vision Language Model Training	Xiaomin Yu et.al.	2503.22655	link
2025-03-28	Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users	Antonia Karamolegkou et.al.	2503.22610	null
2025-03-28	On the Alignment of Post-Publication Reviews & Bibliometric and Altmetric Impact – A Case Study on Expert Statements from the Science Media Center Germany	Dirk Tunger et.al.	2503.22594	null
2025-03-28	LLM-enabled Instance Model Generation	Fengjunjie Pan et.al.	2503.22587	null
2025-03-28	Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish	Kevin Cohen et.al.	2503.22585	link
2025-03-28	Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation	Sarubi Thillainathan et.al.	2503.22582	null
2025-03-28	RELD: Regularization by Latent Diffusion Models for Image Restoration	Pasquale Cascarano et.al.	2503.22563	null
2025-03-28	Niyama: Breaking the Silos of LLM Inference Serving	Kanishk Goel et.al.	2503.22562	null
2025-03-28	Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation	Zhuo-Yang Song et.al.	2503.22547	null
2025-03-28	Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities	Raman Dutt et.al.	2503.22517	null
2025-03-28	Assessing Foundation Models for Sea Ice Type Segmentation in Sentinel-1 SAR Imagery	Samira Alkaee Taleghan et.al.	2503.22516	null
2025-03-28	Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model	Wangtao Sun et.al.	2503.22480	null
2025-03-28	WorkTeam: Constructing Workflows from Natural Language with Multi-Agents	Hanchao Liu et.al.	2503.22473	null
2025-03-28	Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey	Shengyue Guan et.al.	2503.22458	null
2025-03-28	Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning	Abdullah Vanlioglu et.al.	2503.22456	null
2025-03-28	STADE: Standard Deviation as a Pruning Metric	Diego Coello de Portugal Mecke et.al.	2503.22451	link
2025-03-28	CoSIL: Software Issue Localization via LLM-Driven Code Repository Graph Searching	Zhonghao Jiang et.al.	2503.22424	link
2025-03-28	Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis	Jiangyong Huang et.al.	2503.22420	link
2025-03-28	Training Large Language Models for Advanced Typosquatting Detection	Jackson Welch et.al.	2503.22406	null
2025-03-28	Generative Reliability-Based Design Optimization Using In-Context Learning Capabilities of Large Language Models	Zhonglin Jiang et.al.	2503.22401	null
2025-03-28	GAITGen: Disentangled Motion-Pathology Impaired Gait Generative Model – Bringing Motion Generation to the Clinical Domain	Vida Adeli et.al.	2503.22397	null
2025-03-28	Negation: A Pink Elephant in the Large Language Models’ Room?	Tereza Vrabcová et.al.	2503.22395	null
2025-03-28	Supposedly Equivalent Facts That Aren’t? Entity Frequency in Pre-training Induces Asymmetry in LLMs	Yuan He et.al.	2503.22362	link
2025-03-28	EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation	Hadrien Reynaud et.al.	2503.22357	null
2025-03-28	Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions	Yubo Li et.al.	2503.22353	null
2025-03-28	Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization	Barış Batuhan Topal et.al.	2503.22352	null
2025-03-28	Using a Large Language Model as Design Material for an Interactive Museum Installation	Maria Padilla Engstrøm et.al.	2503.22345	null
2025-03-28	SKDU at De-Factify 4.0: Natural Language Features for AI-Generated Text-Detection	Shrikant Malviya et.al.	2503.22338	link
2025-03-28	A Refined Analysis of Massive Activations in LLMs	Louis Owen et.al.	2503.22329	link
2025-03-28	Large Language Models Are Democracy Coders with Attitudes	Nils B. Weidmann et.al.	2503.22315	null
2025-03-28	BanglAssist: A Bengali-English Generative AI Chatbot for Code-Switching and Dialect-Handling in Customer Service	Francesco Kruk et.al.	2503.22283	null
2025-03-28	MultiClaimNet: A Massively Multilingual Dataset of Fact-Checked Claim Clusters	Rrubaa Panchendrarajan et.al.	2503.22280	null
2025-03-28	Make Some Noise: Towards LLM audio reasoning and generation using sound tokens	Shivam Mehta et.al.	2503.22275	null
2025-03-28	Beyond the Script: Testing LLMs for Authentic Patient Communication Styles in Healthcare	Anna Bodonhelyi et.al.	2503.22250	null
2025-03-28	FLAM: Foundation Model-Based Body Stabilization for Humanoid Locomotion and Manipulation	Xianqi Zhang et.al.	2503.22249	null
2025-03-28	Agent-Centric Personalized Multiple Clustering with Multi-Modal LLMs	Ziye Chen et.al.	2503.22241	null
2025-03-28	Integrating LLMs in Software Engineering Education: Motivators, Demotivators, and a Roadmap Towards a Framework for Finnish Higher Education Institutes	Maryam Khan et.al.	2503.22238	null
2025-03-28	SCHNet: SAM Marries CLIP for Human Parsing	Kunliang Liu et.al.	2503.22237	null
2025-03-28	CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving	Yishen Ji et.al.	2503.22231	null
2025-03-28	Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback	Wei Shen et.al.	2503.22230	null
2025-03-28	DeepSound-V1: Start to Think Step-by-Step in the Audio Generation from Videos	Yunming Liang et.al.	2503.22208	null
2025-03-28	Enhance Generation Quality of Flow Matching V2A Model via Multi-Step CoT-Like Guidance and Combined Preference Optimization	Haomin Zhang et.al.	2503.22200	null
2025-03-28	EdgeInfinite: A Memory-Efficient Infinite-Context Transformer for Edge Devices	Jiyu Chen et.al.	2503.22196	null
2025-03-28	Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation	Minho Park et.al.	2503.22172	null
2025-03-28	Reasoning of Large Language Models over Knowledge Graphs with Super-Relations	Song Wang et.al.	2503.22166	null
2025-03-28	Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models	Zhanke Zhou et.al.	2503.22165	link
2025-03-28	PharmAgents: Building a Virtual Pharma with Large Language Model Agents	Bowen Gao et.al.	2503.22164	null
2025-03-28	EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos	Yuxuan Li et.al.	2503.22152	link
2025-03-28	Tokenization of Gaze Data	Tim Rolff et.al.	2503.22145	null
2025-03-28	FRASE: Structured Representations for Generalizable SPARQL Query Generation	Papa Abdou Karim Karou Diallo et.al.	2503.22144	null
2025-03-28	A Self-Supervised Learning of a Foundation Model for Analog Layout Design Automation	Sungyu Jeong et.al.	2503.22143	null
2025-03-28	Score-Based Turbo Message Passing for Plug-and-Play Compressive Image Recovery	Chang Cai et.al.	2503.22140	null
2025-03-28	Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model	Changchang Sun et.al.	2503.22138	null
2025-03-28	Sharpe Ratio-Guided Active Learning for Preference Optimization in RLHF	Syrine Belakaria et.al.	2503.22137	null
2025-03-28	Detecting Localized Deepfake Manipulations Using Action Unit-Guided Video Representations	Tharun Anand et.al.	2503.22121	null
2025-03-28	Beyond Single-Sentence Prompts: Upgrading Value Alignment Benchmarks with Dialogues and Stories	Yazhou Zhang et.al.	2503.22115	null
2025-03-28	Few-Shot Graph Out-of-Distribution Detection with LLMs	Haoyan Xu et.al.	2503.22097	null
2025-03-28	Leveraging LLMs for Predicting Unknown Diagnoses from Clinical Notes	Dina Albassam et.al.	2503.22092	null
2025-03-28	A Survey on Remote Sensing Foundation Models: From Vision to Multimodality	Ziyue Huang et.al.	2503.22081	link
2025-03-28	Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation	Chuan-Wei Kuo et.al.	2503.22074	null
2025-03-28	Arch-LLM: Taming LLMs for Neural Architecture Generation via Unsupervised Discrete Representation Learning	Deshani Geethika Poddenige et.al.	2503.22063	null
2025-03-27	ThinkEdit: Interpretable Weight Editing to Mitigate Overly Short Thinking in Reasoning Models	Chung-En Sun et.al.	2503.22048	link
2025-03-27	The Risks of Using Large Language Models for Text Annotation in Social Science Research	Hao Lin et.al.	2503.22040	null
2025-03-27	Debate-Driven Multi-Agent LLMs for Phishing Email Detection	Ngoc Tuong Vy Nguyen et.al.	2503.22038	null
2025-03-27	Cognitive Prompts Using Guilford’s Structure of Intellect Model	Oliver Kramer et.al.	2503.22036	null
2025-03-27	AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification	Earl Ranario et.al.	2503.22019	link
2025-03-27	Tune It Up: Music Genre Transfer and Prediction	Fidan Samet et.al.	2503.22008	link
2025-03-27	BOOTPLACE: Bootstrapped Object Placement with Detection Transformers	Hang Zhou et.al.	2503.21991	link
2025-03-27	Socially Constructed Treatment Plans: Analyzing Online Peer Interactions to Understand How Patients Navigate Complex Medical Conditions	Madhusudan Basak et.al.	2503.21986	null
2025-03-27	Improving Equivariant Networks with Probabilistic Symmetry Breaking	Hannah Lawrence et.al.	2503.21985	null
2025-03-27	RocketPPA: Ultra-Fast LLM-Based PPA Estimator at Code-Level Abstraction	Armin Abdollahi et.al.	2503.21971	null
2025-03-27	Data-Agnostic Robotic Long-Horizon Manipulation with Vision-Language-Guided Closed-Loop Feedback	Yuan Meng et.al.	2503.21969	link
2025-03-27	Benchmarking Deep Learning-Based Methods for Irradiance Nowcasting with Sky Images	Lorenzo F. C. Varaschin et.al.	2503.21966	null
2025-03-27	Entropy-Aware Branching for Improved Mathematical Reasoning	Xianzhi Li et.al.	2503.21961	null
2025-03-27	Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad	Ivo Petrov et.al.	2503.21934	null
2025-03-27	Multimodal Data Integration for Sustainable Indoor Gardening: Tracking Anyplant with Time Series Foundation Model	Seyed Hamidreza Nabaei et.al.	2503.21932	null
2025-03-27	Local Normalization Distortion and the Thermodynamic Formalism of Decoding Strategies for Large Language Models	Tom Kempton et.al.	2503.21929	null
2025-03-27	Hybrid Emotion Recognition: Enhancing Customer Interactions Through Acoustic and Textual Analysis	Sahan Hewage Wewelwala et.al.	2503.21927	null
2025-03-27	AutoPsyC: Automatic Recognition of Psychodynamic Conflicts from Semi-structured Interviews with Large Language Models	Sayed Muddashir Hossain et.al.	2503.21911	null
2025-03-27	AssistPDA: An Online Video Surveillance Assistant for Video Anomaly Prediction, Detection, and Analysis	Zhiwei Yang et.al.	2503.21904	null
2025-03-27	OntoAligner: A Comprehensive Modular and Robust Python Toolkit for Ontology Alignment	Hamed Babaei Giglou et.al.	2503.21902	link
2025-03-27	StarFlow: Generating Structured Workflow Outputs From Sketch Images	Patrice Bechard et.al.	2503.21889	null
2025-03-27	RedditESS: A Mental Health Social Support Interaction Dataset – Understanding Effective Social Support to Refine AI-Driven Support Tools	Zeyad Alghamdi et.al.	2503.21888	null
2025-03-27	Video-R1: Reinforcing Video Reasoning in MLLMs	Kaituo Feng et.al.	2503.21776	link
2025-03-27	A Unified Image-Dense Annotation Generation Model for Underwater Scenes	Hongkai Lin et.al.	2503.21771	link
2025-03-27	Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence	Haolin Liu et.al.	2503.21766	null
2025-03-27	Exploring the Evolution of Physics Cognition in Video Generation: A Survey	Minghui Lin et.al.	2503.21765	link
2025-03-27	Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video	David Yifan Yao et.al.	2503.21761	link
2025-03-27	MemInsight: Autonomous Memory Augmentation for LLM Agents	Rana Salama et.al.	2503.21760	null
2025-03-27	A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One	Minyoung Kim et.al.	2503.21756	null
2025-03-27	VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness	Dian Zheng et.al.	2503.21755	link
2025-03-27	3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models	Yuhan Zhang et.al.	2503.21745	null
2025-03-27	GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics	Arsham Gholamzadeh Khoee et.al.	2503.21735	null
2025-03-29	Effective Skill Unlearning through Intervention and Abstention	Yongce Li et.al.	2503.21730	link
2025-03-27	Collab: Controlled Decoding using Mixture of Agents for LLM Alignment	Souradip Chakraborty et.al.	2503.21720	null
2025-03-27	Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs	Boyang Yang et.al.	2503.21710	null
2025-03-27	Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data	Zhiyuan Ma et.al.	2503.21694	link
2025-03-27	LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning	Hui Wang et.al.	2503.21683	null
2025-03-27	JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human Self-Destructive Behavior Content in Jirai Community	Yunze Xiao et.al.	2503.21679	null
2025-03-27	How do language models learn facts? Dynamics, curricula and hallucinations	Nicolas Zucchet et.al.	2503.21676	null
2025-03-27	Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base	Satvik Verma et.al.	2503.21674	link
2025-03-27	A friendly introduction to triangular transport	Maximilian Ramgraber et.al.	2503.21673	null
2025-03-27	UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning	Zhengxi Lu et.al.	2503.21620	link
2025-03-27	Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach	Javier Coronado-Blázquez et.al.	2503.21613	null
2025-03-27	GenEdit: Compounding Operators and Continuous Improvement to Tackle Text-to-SQL in the Enterprise	Karime Maamari et.al.	2503.21602	null
2025-03-27	Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing	Johan Wahréus et.al.	2503.21598	null
2025-03-27	Critical Iterative Denoising: A Discrete Generative Model Applied to Graphs	Yoann Boget et.al.	2503.21592	null
2025-03-27	Cooking Task Planning using LLM and Verified by Graph Network	Ryunosuke Takebayashi et.al.	2503.21564	null
2025-03-27	debug-gym: A Text-Based Environment for Interactive Debugging	Xingdi Yuan et.al.	2503.21557	null
2025-03-27	SWI: Speaking with Intent in Large Language Models	Yuwei Yin et.al.	2503.21544	link
2025-03-27	Combining Artificial Users and Psychotherapist Assessment to Evaluate Large Language Model-based Mental Health Chatbots	Florian Onur Kuhlmeier et.al.	2503.21540	null
2025-03-27	Exploring the Energy Landscape of RBMs: Reciprocal Space Insights into Bosons, Hierarchical Learning and Symmetry Breaking	J. Quetzalcóatl Toledo-Marin et.al.	2503.21536	null
2025-03-27	Uncertainty-aware Bayesian machine learning modelling of land cover classification	Samuel Bilson et.al.	2503.21510	null
2025-03-27	Keyword-Oriented Multimodal Modeling for Euphemism Identification	Yuxue Hu et.al.	2503.21504	link
2025-03-27	Double Blind Imaging with Generative Modeling	Brett Levac et.al.	2503.21501	null
2025-03-27	OpenHuEval: Evaluating Large Language Model on Hungarian Specifics	Haote Yang et.al.	2503.21500	link
2025-03-28	OmniVox: Zero-Shot Emotion Recognition with Omni-LLMs	John Murzaku et.al.	2503.21480	null
2025-03-27	DeepRV: pre-trained spatial priors for accelerated disease mapping	Jhonathan Navott et.al.	2503.21473	null
2025-03-27	Harnessing Chain-of-Thought Metadata for Task Routing and Adversarial Prompt Detection	Ryan Marinelli et.al.	2503.21464	link
2025-03-27	Large Language Model Agent: A Survey on Methodology, Applications and Challenges	Junyu Luo et.al.	2503.21460	link
2025-03-27	FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs	Xiaoqin Wang et.al.	2503.21457	link
2025-03-27	CMADiff: Cross-Modal Aligned Diffusion for Controllable Protein Generation	Changjian Zhou et.al.	2503.21450	null
2025-03-27	Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving	Lucas Nunes et.al.	2503.21449	link
2025-03-27	From Deep Learning to LLMs: A survey of AI in Quantitative Investment	Bokai Cao et.al.	2503.21422	null
2025-03-28	Neuroplasticity in Artificial Intelligence – An Overview and Inspirations on Drop In & Out Learning	Yupei Li et.al.	2503.21419	null
2025-03-27	Exploring the Roles of Large Language Models in Reshaping Transportation Systems: A Survey, Framework, and Roadmap	Tong Nie et.al.	2503.21411	link
2025-03-27	VALLR: Visual ASR Language Model for Lip Reading	Marshall Thomas et.al.	2503.21408	null
2025-03-27	An evaluation of LLMs and Google Translate for translation of selected Indian languages via sentiment and semantic analyses	Rohitash Chandra et.al.	2503.21393	null
2025-03-27	Controlling Large Language Model with Latent Actions	Chengxing Jia et.al.	2503.21383	link
2025-03-27	Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models	Haoxiang Sun et.al.	2503.21380	link
2025-03-27	Generative Decoding for Quantum Error-correcting Codes	Hanyan Cao et.al.	2503.21374	null
2025-03-27	From User Preferences to Optimization Constraints Using Large Language Models	Manuela Sanguinetti et.al.	2503.21360	null
2025-03-27	Using large language models to produce literature reviews: Usages and systematic biases of microphysics parametrizations in 2699 publications	Tianhang Zhang et.al.	2503.21352	null
2025-03-27	Fine-Tuning LLMs on Small Medical Datasets: Text Classification and Normalization Effectiveness on Cardiology reports and Discharge records	Noah Losch et.al.	2503.21349	null
2025-03-27	Scalable Expectation Estimation with Subtractive Mixture Models	Lena Zellinger et.al.	2503.21346	null
2025-03-27	Large Language Models for Traffic and Transportation Research: Methodologies, State of the Art, and Future Opportunities	Yimo Yan et.al.	2503.21330	null
2025-03-27	Structural bias in three-dimensional autoregressive generative machine learning of organic molecules	Zsuzsanna Koczor-Benda et.al.	2503.21328	null
2025-03-27	Tricking Retrievers with Influential Tokens: An Efficient Black-Box Corpus Poisoning Attack	Cheng Wang et.al.	2503.21315	null
2025-03-27	InternVL-X: Advancing and Accelerating InternVL Series with Efficient Visual Token Compression	Dongchen Lu et.al.	2503.21307	link
2025-03-27	R-PRM: Reasoning-Driven Process Reward Modeling	Shuaijie She et.al.	2503.21295	link
2025-03-27	Reinforced Model Merging	Jiaqi Han et.al.	2503.21272	link
2025-03-27	ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition	Yujie Liu et.al.	2503.21248	null
2025-03-27	DynamiCtrl: Rethinking the Basic Structure and the Role of Text for High-quality Human Image Animation	Haoyu Zhao et.al.	2503.21246	link
2025-03-27	Exploring the Rastall Gravity Cosmological Model using Gong-Zhang parameterization with Latest Observational Data and Deep Learning Techniques	Vinod Kumar Bhardwaj et.al.	2503.21243	null
2025-03-27	Bias-Aware Agent: Enhancing Fairness in AI-Driven Knowledge Retrieval	Karanbir Singh et.al.	2503.21237	link
2025-03-27	LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models	Hengyuan Zhao et.al.	2503.21227	null
2025-03-27	Efficient Learning for Entropy-regularized Markov Decision Processes via Multilevel Monte Carlo	Matthieu Meunier et.al.	2503.21224	null
2025-03-27	Rethinking Graph Structure Learning in the Era of LLMs	Zhihan Zhang et.al.	2503.21223	null
2025-03-27	GenFusion: Closing the Loop between Reconstruction and Generation via Videos	Sibo Wu et.al.	2503.21219	null
2025-03-27	Resource-Efficient Federated Fine-Tuning Large Language Models for Heterogeneous Data	Jun Liu et.al.	2503.21213	null
2025-03-27	FakeReasoning: Towards Generalizable Forgery Detection and Reasoning	Yueying Gao et.al.	2503.21210	null
2025-03-27	PilotANN: Memory-Bounded GPU Acceleration for Vector Search	Yuntao Gui et.al.	2503.21206	link
2025-03-27	Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering	Erika Mori et.al.	2503.21190	null
2025-03-27	DGSUnet: An Improved Unet Model with DINO-Guided SAM2 for Multi-Scale Feature Collaboration	Yimin Xu et.al.	2503.21187	link
2025-03-27	Integrating Large Language Models For Monte Carlo Simulation of Chemical Reaction Networks	Sadikshya Gyawali et.al.	2503.21178	null
2025-03-27	Model as a Game: On Numerical and Spatial Consistency for Generative Games	Jingye Chen et.al.	2503.21172	null
2025-03-27	Integrating Travel Behavior Forecasting and Generative Modeling for Predicting Future Urban Mobility and Spatial Transformations	Eugene Denteh et.al.	2503.21158	null
2025-03-27	Embedding Domain-Specific Knowledge from LLMs into the Feature Engineering Pipeline	João Eduardo Batista et.al.	2503.21155	null
2025-03-27	Expressive Timing in Hindustani Vocal Music	Yash Bhake et.al.	2503.21142	null
2025-03-27	MoQa: Rethinking MoE Quantization with Multi-stage Data-model Distribution Awareness	Zihao Zheng et.al.	2503.21135	null
2025-03-27	Collaborative Evolution: Multi-Round Learning Between Large and Small Language Models for Emergent Fake News Detection	Ziyi Zhou et.al.	2503.21127	null
2025-03-27	De Novo Functional Protein Sequence Generation: Overcoming Data Scarcity through Regeneration and Large Models	Chenyu Ren et.al.	2503.21123	null
2025-03-27	Leveraging Large Language Models for Risk Assessment in Hyperconnected Logistic Hub Network Deployment	Yinzhu Quan et.al.	2503.21115	null
2025-03-27	Alleviating LLM-based Generative Retrieval Hallucination in Alipay Search	Yedan Shen et.al.	2503.21098	null
2025-03-27	ZJUKLAB at SemEval-2025 Task 4: Unlearning via Model Merging	Haoming Xu et.al.	2503.21088	link
2025-03-28	EQ-Negotiator: An Emotion-Reasoning LLM Agent in Credit Dialogues	Yuhan Liu et.al.	2503.21080	null
2025-03-27	Online Reasoning Video Segmentation with Just-in-Time Digital Twins	Yiqing Shen et.al.	2503.21056	null
2025-03-27	What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning	Chi-Hsi Kung et.al.	2503.21055	null
2025-03-26	Operating Room Workflow Analysis via Reasoning Segmentation over Digital Twins	Yiqing Shen et.al.	2503.21054	null
2025-03-26	Scalability Evaluation of HPC Multi-GPU Training for ECG-based LLMs	Dimitar Mileski et.al.	2503.21033	null
2025-03-26	Two for the Price of One: Integrating Large Language Models to Learn Biophysical Interactions	Joseph D. Clark et.al.	2503.21017	null
2025-03-26	Can Large Language Models Predict Associations Among Human Attitudes?	Ana Ma et.al.	2503.21011	null
2025-03-26	Evaluating Large Language Models for Automated Clinical Abstraction in Pulmonary Embolism Registries: Performance Across Model Sizes, Versions, and Parameters	Mahmoud Alwakeel et.al.	2503.21004	null
2025-03-26	Multi-head Reward Aggregation Guided by Entropy	Xiaomin Li et.al.	2503.20995	null
2025-03-26	FinAudio: A Benchmark for Audio Large Language Models in Financial Applications	Yupeng Cao et.al.	2503.20990	null
2025-03-26	Patients Speak, AI Listens: LLM-based Analysis of Online Reviews Uncovers Key Drivers for Urgent Care Satisfaction	Xiaoran Xu et.al.	2503.20981	null
2025-03-26	ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction	Yiqiao Jin et.al.	2503.20978	null
2025-03-26	Sociotechnical Effects of Machine Translation	Joss Moorkens et.al.	2503.20959	null
2025-03-26	DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care	Yutong Song et.al.	2503.20950	null
2025-03-26	Hacia la interpretabilidad de la detección anticipada de riesgos de depresión utilizando grandes modelos de lenguaje	Horacio Thompson et.al.	2503.20939	null
2025-03-26	Leveraging LLMs, IDEs, and Semantic Embeddings for Automated Move Method Refactoring	Fraol Batole et.al.	2503.20934	null
2025-03-26	D4R – Exploring and Querying Relational Graphs Using Natural Language and Large Language Models – the Case of Historical Documents	Michel Boeglin et.al.	2503.20914	null
2025-03-26	Assessing Generative Models for Structured Data	Reilly Cannon et.al.	2503.20903	null
2025-03-26	Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark	Sondos Mahmoud Bsharat et.al.	2503.20786	link
2025-03-26	Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency	Tianqi Liu et.al.	2503.20785	link
2025-03-26	Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields	Shijie Zhou et.al.	2503.20776	null
2025-03-26	Reliable algorithm selection for machine learning-guided design	Clara Fannjiang et.al.	2503.20767	null
2025-03-26	UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines	Chen Tang et.al.	2503.20748	null
2025-03-26	MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams	Yanpeng Sun et.al.	2503.20745	null
2025-03-26	Continual learning via probabilistic exchangeable sequence modelling	Hanwen Xing et.al.	2503.20725	null
2025-03-26	Dynamic Motion Blending for Versatile Motion Editing	Nan Jiang et.al.	2503.20724	null
2025-03-26	From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models	Nikita Neveditsin et.al.	2503.20715	null
2025-03-26	Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control	Eloy Anguiano Batanero et.al.	2503.20688	null
2025-03-27	Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound	Yuhao Huang et.al.	2503.20685	null
2025-03-27	Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy	Yinan Sun et.al.	2503.20673	null
2025-03-26	BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation	Yuyang Peng et.al.	2503.20672	null
2025-03-26	TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews	Huimin Xu et.al.	2503.20666	null
2025-03-26	ARMO: Autoregressive Rigging for Multi-Category Objects	Mingze Sun et.al.	2503.20663	null
2025-03-26	Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging	Han Wu et.al.	2503.20641	link
2025-03-26	Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions	Alessandro Maisto et.al.	2503.20623	null
2025-03-26	Diffusion Counterfactuals for Image Regressors	Trung Duc Ha et.al.	2503.20595	link
2025-03-26	Supply chain network rewiring dynamics at the firm-level	Tobias Reisch et.al.	2503.20594	link
2025-03-26	What to Retrieve for Effective Retrieval-Augmented Code Generation? An Empirical Study and Beyond	Wenchao Gu et.al.	2503.20589	null
2025-03-26	LLPut: Investigating Large Language Models for Bug Report-Based Input Generation	Alif Al Hasan et.al.	2503.20578	null
2025-03-26	Optimizing Case-Based Reasoning System for Functional Test Script Generation with Large Language Models	Siyuan Guo et.al.	2503.20576	null
2025-03-26	Stochastic Transport Maps in Diffusion Models and Sampling	Xicheng Zhang et.al.	2503.20573	null
2025-03-26	Low-resource Information Extraction with the European Clinical Case Corpus	Soumitra Ghosh et.al.	2503.20568	null
2025-03-26	TerraTorch: The Geospatial Foundation Models Toolkit	Carlos Gomes et.al.	2503.20563	link
2025-03-26	A Theoretical Framework for Prompt Engineering: Approximating Smooth Functions with Transformer Prompts	Ryumei Nakada et.al.	2503.20561	null
2025-03-26	Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation	Yunkai Liang et.al.	2503.20552	link
2025-03-26	Knowledge-Based Multi-Agent Framework for Automated Software Architecture Design	Yiran Zhang et.al.	2503.20536	null
2025-03-26	StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs	Zhicheng Guo et.al.	2503.20527	link
2025-03-26	GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving	Lloyd Russell et.al.	2503.20523	null
2025-03-26	MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation	Jinnan Chen et.al.	2503.20519	null
2025-03-26	Exploring the Effect of Robotic Embodiment and Empathetic Tone of LLMs on Empathy Elicitation	Liza Darwesh et.al.	2503.20518	null
2025-03-26	Explainable ICD Coding via Entity Linking	Leonor Barreiros et.al.	2503.20508	null
2025-03-26	Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering	Zehui Liao et.al.	2503.20504	null
2025-03-26	MLLM-Selector: Necessity and Diversity-driven High-Value Data Selection for Enhanced Visual Instruction Tuning	Yiwei Ma et.al.	2503.20502	null
2025-03-26	FireRedTTS-1S: An Upgraded Streamable Foundation Text-to-Speech System	Hao-Han Guo et.al.	2503.20499	null
2025-03-26	VPO: Aligning Text-to-Video Generation Models with Prompt Optimization	Jiale Cheng et.al.	2503.20491	link
2025-03-26	Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability	Yingdong Shi et.al.	2503.20483	null
2025-03-26	From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment	Yucheng Suo et.al.	2503.20472	null
2025-03-26	Data-driven Seasonal Climate Predictions via Variational Inference and Transformers	Lluís Palma et.al.	2503.20466	null
2025-03-26	Attention Xception UNet (AXUNet): A Novel Combination of CNN and Self-Attention for Brain Tumor Segmentation	Farzan Moodi et.al.	2503.20446	null
2025-03-26	RALLRec+: Retrieval Augmented Large Language Model Recommendation with Reasoning	Sichun Luo et.al.	2503.20430	link
2025-03-26	CFunModel: A “Funny” Language Model Capable of Chinese Humor Generation and Processing	Zhenghan Yu et.al.	2503.20417	null
2025-03-26	MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation	Rongyu Zhang et.al.	2503.20384	null
2025-03-26	Dewey Long Context Embedding Model: A Technical Report	Dun Zhang et.al.	2503.20376	null
2025-03-26	VideoGEM: Training-free Action Grounding in Videos	Felix Vogel et.al.	2503.20348	null
2025-03-26	Dynamic Pyramid Network for Efficient Multimodal Large Language Model	Hao Ai et.al.	2503.20322	null
2025-03-26	Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models	Shih-Wen Ke et.al.	2503.20320	null
2025-03-26	Wan: Open and Advanced Large-Scale Video Generative Models	WanTeam et.al.	2503.20314	link
2025-03-26	Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs	Zitian Wang et.al.	2503.20309	null
2025-03-27	Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics	Lee Chae-Yeon et.al.	2503.20308	null
2025-03-26	A Multilingual, Culture-First Approach to Addressing Misgendering in LLM Applications	Sunayana Sitaram et.al.	2503.20302	link
2025-03-26	Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model	Yuhan Wang et.al.	2503.20297	null
2025-03-26	QualiSpeech: A Speech Quality Assessment Dataset with Natural Language Reasoning and Descriptions	Siyin Wang et.al.	2503.20290	null
2025-03-26	Faster Parameter-Efficient Tuning with Token Redundancy Reduction	Kwonyoung Kim et.al.	2503.20282	link
2025-03-26	sudo rm -rf agentic_security	Sejin Lee et.al.	2503.20279	link
2025-03-26	The cell as a token: high-dimensional geometry in language models and cell embeddings	William Gilpin et.al.	2503.20278	null
2025-03-26	ViLBench: A Suite for Vision-Language Process Reward Modeling	Haoqin Tu et.al.	2503.20271	null
2025-03-26	L4: Diagnosing Large-scale LLM Training Failures via Automated Log Analysis	Zhihan Jiang et.al.	2503.20263	null
2025-03-26	LGR: LLM-Guided Ranking of Frontiers for Object Goal Navigation	Mitsuaki Uno et.al.	2503.20241	null
2025-03-26	Automated UI Interface Generation via Diffusion Models: Enhancing Personalization and Efficiency	Yifei Duan et.al.	2503.20229	null
2025-03-26	TeleLoRA: Teleporting Model-Specific Alignment Across LLMs	Xiao Lin et.al.	2503.20228	null
2025-03-26	DINeMo: Learning Neural Mesh Models with no 3D Annotations	Weijie Guo et.al.	2503.20220	null
2025-03-26	Qwen2.5-Omni Technical Report	Jin Xu et.al.	2503.20215	null
2025-03-26	SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain	Nan Gao et.al.	2503.20202	null
2025-03-26	Beyond Words: Advancing Long-Text Image Generation via Multimodal Autoregressive Models	Alex Jinpeng Wang et.al.	2503.20198	null
2025-03-26	Enhancing the Robustness of LLM-Generated Code: Empirical Study and Framework	ZiKe Li et.al.	2503.20197	link
2025-03-26	GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization	Zhouhong Gu et.al.	2503.20194	link
2025-03-26	Maya: Optimizing Deep Learning Training Workloads using Emulated Virtual Accelerators	Srihas Yarlagadda et.al.	2503.20191	null
2025-03-26	Cross-Modal Prototype Allocation: Unsupervised Slide Representation Learning via Patch-Text Contrast in Computational Pathology	Yuxuan Chen et.al.	2503.20190	null
2025-03-26	Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector	Xiao Guo et.al.	2503.20188	link
2025-03-26	Leveraging Implicit Sentiments: Enhancing Reliability and Validity in Psychological Trait Evaluation of LLMs	Huanhuan Ma et.al.	2503.20182	link
2025-03-26	Can We Make Code Green? Understanding Trade-Offs in LLMs vs. Human Code Optimizations	Pooja Rani et.al.	2503.20126	null
2025-03-26	Synthesizing world models for bilevel planning	Zergham Ahmed et.al.	2503.20124	null
2025-03-25	Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors	Yuke Lou et.al.	2503.20118	null
2025-03-25	VibE: A Visual Analytics Workflow for Semantic Error Analysis of CVML Models at Subgroup Level	Jun Yuan et.al.	2503.20112	null
2025-03-25	Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data	Dominika Ciupek et.al.	2503.20107	null
2025-03-25	Direct Post-Training Preference Alignment for Multi-Agent Motion Generation Models Using Implicit Feedback from Pre-training Demonstrations	Ran Tian et.al.	2503.20105	null
2025-03-25	Bigger But Not Better: Small Neural Language Models Outperform Large Language Models in Detection of Thought Disorder	Changye Li et.al.	2503.20103	link
2025-03-25	Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success	Sophie Hao et.al.	2503.20088	null
2025-03-25	Can Multi-modal (reasoning) LLMs work as deepfake detectors?	Simiao Ren et.al.	2503.20084	null
2025-03-27	Cross-Tokenizer Distillation via Approximate Likelihood Matching	Benjamin Minixhofer et.al.	2503.20083	link
2025-03-25	Poor Alignment and Steerability of Large Language Models: Evidence from College Admission Essays	Jinsook Lee et.al.	2503.20062	null
2025-03-25	Deep Learning Approaches for Blood Disease Diagnosis Across Hematopoietic Lineages	Gabriel Bo et.al.	2503.20049	link
2025-03-25	Warm Start Adaptive-Bias Quantum Approximate Optimization Algorithm	Yunlong Yu et.al.	2503.20048	null
2025-03-25	Unlocking Multi-Task Electric Energy System Intelligence: Data Scaling Laws and Performance with Limited Fine-Tuning	Shaohuai Liu et.al.	2503.20040	null
2025-03-25	OmniNova: A General Multimodal Agent Framework	Pengfei Du et.al.	2503.20028	null
2025-03-25	Gemini Robotics: Bringing AI into the Physical World	Gemini Robotics Team et.al.	2503.20020	null
2025-03-25	LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?	Kexian Tang et.al.	2503.19990	null
2025-03-25	ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback	Bohan Zhai et.al.	2503.19988	link
2025-03-25	Conditional Deep Generative Models for Simultaneous Simulation and Reconstruction of Entire Events	Etienne Dreyer et.al.	2503.19981	link
2025-03-25	SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining	Xiang Xu et.al.	2503.19912	link
2025-03-25	CoLLM: A Large Language Model for Composed Image Retrieval	Chuong Huynh et.al.	2503.19910	link
2025-03-25	FullDiT: Multi-Task Video Generative Foundation Model with Full Attention	Xuan Ju et.al.	2503.19907	null
2025-03-25	ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models	Fernando Julio Cendra et.al.	2503.19902	null
2025-03-25	A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design	Jie Tian et.al.	2503.19889	null
2025-03-25	CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation	Nengbo Wang et.al.	2503.19878	null
2025-03-25	SLA-Awareness for AI-assisted coding	Kishanthan Thangarajah et.al.	2503.19876	null
2025-03-25	Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking	Xiaoyu Tian et.al.	2503.19855	null
2025-03-25	Towards Online Multi-Modal Social Interaction Understanding	Xinpeng Li et.al.	2503.19851	link
2025-03-25	FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs	Carlos Plou et.al.	2503.19850	null
2025-03-25	A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950	Zhao Fang et.al.	2503.19844	null
2025-03-25	TopoGEN: topology-driven microstructure generation for in silico modeling of fiber network mechanics	Sara Cardona et.al.	2503.19832	null
2025-03-25	IgCraft: A versatile sequence generation framework for antibody discovery and engineering	Matthew Greenig et.al.	2503.19821	link
2025-03-25	Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning	Pratibha Kumari et.al.	2503.19819	null
2025-03-25	SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI	Zhiyang Liu et.al.	2503.19801	null
2025-03-25	SemEval-2025 Task 9: The Food Hazard Detection Challenge	Korbinian Randl et.al.	2503.19800	null
2025-03-25	PAVE: Patching and Adapting Video Large Language Models	Zhuoming Liu et.al.	2503.19794	link
2025-03-25	Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models	Kartik Thakral et.al.	2503.19783	null
2025-03-25	ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation	Haoyu Fu et.al.	2503.19755	null
2025-03-25	Inducing Personality in LLM-Based Honeypot Agents: Measuring the Effect on Human-Like Agenda Generation	Lewis Newsham et.al.	2503.19752	null
2025-03-25	Optimizing Photonic Structures with Large Language Model Driven Algorithm Discovery	Haoran Yin et.al.	2503.19742	null
2025-03-25	Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings	Chengan Che et.al.	2503.19740	link
2025-03-26	FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion	Pihai Sun et.al.	2503.19739	link
2025-03-25	AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation	Itay Nakash et.al.	2503.19693	link
2025-03-25	CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation	Rupak Bose et.al.	2503.19661	null
2025-03-25	BiblioPage: A Dataset of Scanned Title Pages for Bibliographic Metadata Extraction	Jan Kohút et.al.	2503.19658	null
2025-03-25	OpenSDI: Spotting Diffusion-Generated Images in the Open World	Yabin Wang et.al.	2503.19653	link
2025-03-25	HausaNLP at SemEval-2025 Task 3: Towards a Fine-Grained Model-Aware Hallucination Detection	Maryam Bala et.al.	2503.19650	null
2025-03-25	Show or Tell? Effectively prompting Vision-Language Models for semantic segmentation	Niccolo Avogaro et.al.	2503.19647	null
2025-03-25	1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training	Han Zhao et.al.	2503.19633	null
2025-03-25	Optimization through In-Context Learning and Iterative LLM Prompting for Nuclear Engineering Design Problems	M. Rizki Oktavian et.al.	2503.19620	null
2025-03-25	Exploring Next Token Prediction For Optimizing Databases	Yeasir Rayhan et.al.	2503.19619	null
2025-03-25	RL-finetuning LLMs from on- and off-policy data with a single algorithm	Yunhao Tang et.al.	2503.19612	null
2025-03-25	Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation	Max W. Y. Lam et.al.	2503.19611	null
2025-03-25	Innate Reasoning is Not Enough: In-Context Learning Enhances Reasoning Large Language Models with Less Overthinking	Yuyao Ge et.al.	2503.19602	null
2025-03-25	HoarePrompt: Structural Reasoning About Program Correctness in Natural Language	Dimitrios Stamatios Bouras et.al.	2503.19599	link
2025-03-25	Context-Efficient Retrieval with Factual Decomposition	Yanhong Li et.al.	2503.19574	null
2025-03-25	Motif Counting in Complex Networks: A Comprehensive Survey	Haozhe Yin et.al.	2503.19573	null
2025-03-25	Dance Like a Chicken: Low-Rank Stylization for Human Motion Diffusion	Haim Sawdayee et.al.	2503.19557	null
2025-03-26	Scaling Laws of Synthetic Data for Language Models	Zeyu Qin et.al.	2503.19551	null
2025-03-25	FLEX: A Benchmark for Evaluating Robustness of Fairness in Large Language Models	Dahyun Jung et.al.	2503.19540	link
2025-03-25	VectorFit: Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models	Suhas G Hegde et.al.	2503.19530	null
2025-03-25	Conditional Autoencoder for Generating BNS Waveforms with Tidal and Precession Effects	Mengfei Sun et.al.	2503.19512	null
2025-03-25	SparSamp: Efficient Provably Secure Steganography Based on Sparse Sampling	Yaofei Wang et.al.	2503.19499	null
2025-03-25	DomainCQA: Crafting Expert-Level QA from Domain-Specific Charts	Ling Zhong et.al.	2503.19498	null
2025-03-25	Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage	Zhengwentai Sun et.al.	2503.19486	null
2025-03-25	KSHSeek: Data-Driven Approaches to Mitigating and Detecting Knowledge-Shortcut Hallucinations in Generative Models	Zhiwei Wang et.al.	2503.19482	null
2025-03-25	GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers	Shijie Ma et.al.	2503.19480	null
2025-03-25	A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition	Yaomin Shen et.al.	2503.19474	null
2025-03-25	ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning	Mingyang Chen et.al.	2503.19470	null
2025-03-25	G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation	Juntao Jian et.al.	2503.19457	null
2025-03-25	Data-centric Federated Graph Learning with Large Language Models	Bo Yan et.al.	2503.19455	null
2025-03-25	VecTrans: LLM Transformation Framework for Better Auto-vectorization on High-performance CPU	Zhongchun Zheng et.al.	2503.19449	null
2025-03-25	Enhanced Bloom’s Educational Taxonomy for Fostering Information Literacy in the Era of Large Language Models	Yiming Luo et.al.	2503.19434	null
2025-03-25	DeCAP: Context-Adaptive Prompt Generation for Debiasing Zero-shot Question Answering in Large Language Models	Suyoung Bae et.al.	2503.19426	null
2025-03-25	Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing	Jaihoon Kim et.al.	2503.19385	null
2025-03-25	MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation	Yukang Lin et.al.	2503.19383	null
2025-03-25	Interpretable Generative Models through Post-hoc Concept Bottlenecks	Akshay Kulkarni et.al.	2503.19377	link
2025-03-26	EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models	Yufei Cai et.al.	2503.19369	link
2025-03-25	ImageSet2Text: Describing Sets of Images through Text	Piera Riccio et.al.	2503.19361	null
2025-03-25	QUAD: Quantization and Parameter-Efficient Tuning of LLM with Activation Decomposition	Yuxuan Hu et.al.	2503.19353	link
2025-03-25	Membership Inference Attacks on Large-Scale Models: A Survey	Hengyu Wu et.al.	2503.19338	null
2025-03-25	Process or Result? Manipulated Ending Tokens Can Mislead Reasoning LLMs to Ignore the Correct Reasoning Steps	Yu Cui et.al.	2503.19326	null
2025-03-25	LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer Text	Weizhi Chen et.al.	2503.19311	link
2025-03-25	Iterative Hypothesis Generation for Scientific Discovery with Monte Carlo Nash Equilibrium Self-Refining Trees	Gollam Rabby et.al.	2503.19309	null
2025-03-25	UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design	Xiangzhe Kong et.al.	2503.19300	link
2025-03-25	Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications	Ben Rahman et.al.	2503.19276	null
2025-03-25	MARS: Memory-Enhanced Agents with Reflective Self-improvement	Xuechen Liang et.al.	2503.19271	null
2025-03-25	PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping	Sarah Pungitore et.al.	2503.19265	null
2025-03-25	DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning	Fucai Ke et.al.	2503.19263	null
2025-03-25	Linguistic Blind Spots of Large Language Models	Jiali Cheng et.al.	2503.19260	null
2025-03-25	SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings	Farhana Keya et.al.	2503.19257	null
2025-03-24	LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages	Patrick Diehl et.al.	2503.19217	null
2025-03-24	A Survey of Large Language Model Agents for Question Answering	Murong Yue et.al.	2503.19213	null
2025-03-24	Overtrained Language Models Are Harder to Fine-Tune	Jacob Mitchell Springer et.al.	2503.19206	null
2025-03-24	A Shared Low-Rank Adaptation Approach to Personalized RLHF	Renpu Liu et.al.	2503.19201	null
2025-03-24	Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces	Chenyangguang Zhang et.al.	2503.19199	null
2025-03-24	Evaluating Bias in LLMs for Job-Resume Matching: Gender, Race, and Education	Hayate Iso et.al.	2503.19182	null
2025-03-24	Language Model Uncertainty Quantification with Attention Chain	Yinghao Li et.al.	2503.19168	link
2025-03-24	Reconstructing hadronically decaying tau leptons with a jet foundation model	Laurits Tani et.al.	2503.19165	null
2025-03-24	HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models	Mingzhen Huang et.al.	2503.19157	null
2025-03-24	Risk-Based Thresholding for Reliable Anomaly Detection in Concentrated Solar Power Plants	Yorick Estievenart et.al.	2503.19146	null
2025-03-24	Compositional Caching for Training-free Open-vocabulary Attribute Detection	Marco Garosi et.al.	2503.19145	null
2025-03-24	MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks	Wenhao You et.al.	2503.19134	null
2025-03-24	Understanding and Improving Information Preservation in Prompt Compression for LLMs	Weronika Łajewska et.al.	2503.19114	null
2025-03-24	Masks and Mimicry: Strategic Obfuscation and Impersonation Attacks on Authorship Verification	Kenneth Alperin et.al.	2503.19099	null
2025-03-24	Rankers, Judges, and Assistants: Towards Understanding the Interplay of LLMs in Information Retrieval Evaluation	Krisztian Balog et.al.	2503.19092	null
2025-03-24	LLM-Based Insight Extraction for Contact Center Analytics and Cost-Efficient Deployment	Varsha Embar et.al.	2503.19090	null
2025-03-24	Paving the way for scientific foundation models: enhancing generalization and robustness in PDEs with constraint-aware pre-training	Amin Totounferoush et.al.	2503.19081	null
2025-03-24	Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization	Zhanda Zhu et.al.	2503.19050	link
2025-03-24	LookAhead Tuning: Safer Language Models via Partial Answer Previews	Kangwei Liu et.al.	2503.19041	link
2025-03-24	Equivariant Image Modeling	Ruixiao Dong et.al.	2503.18948	link
2025-03-25	Aether: Geometric-Aware Unified World Modeling	Aether Team et.al.	2503.18945	null
2025-03-24	DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Karim Abou Zeid et.al.	2503.18944	link
2025-03-24	SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding	Mingze Xu et.al.	2503.18943	null
2025-03-24	Video-T1: Test-Time Scaling for Video Generation	Fangfu Liu et.al.	2503.18942	null
2025-03-24	Exploring Training and Inference Scaling Laws in Generative Retrieval	Hongru Cai et.al.	2503.18941	link
2025-03-24	CoMP: Continual Multimodal Pre-training for Vision Foundation Models	Yitong Chen et.al.	2503.18931	link
2025-03-24	Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training	Brian R. Bartoldson et.al.	2503.18929	null
2025-03-24	FFN Fusion: Rethinking Sequential Computation in Large Language Models	Akhiad Bercovich et.al.	2503.18908	null
2025-03-24	xKV: Cross-Layer SVD for KV-Cache Compression	Chi-Chih Chang et.al.	2503.18893	link
2025-03-24	AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration	Zhexuan Wang et.al.	2503.18891	link
2025-03-24	Toward building next-generation Geocoding systems: a systematic review	Zhengcong Yin et.al.	2503.18888	null
2025-03-24	I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders	Andrey Galichin et.al.	2503.18878	link
2025-03-24	Efficient Self-Supervised Adaptation for Medical Image Analysis	Moein Sorkhei et.al.	2503.18873	link
2025-03-24	Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design	Rui Xie et.al.	2503.18869	null
2025-03-24	Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations	Junlan Chen et.al.	2503.18865	null
2025-03-24	3DSwapping: Texture Swapping For 3D Object From Single Reference Image	Xiao Cao et.al.	2503.18853	null
2025-03-24	Defeating Prompt Injections by Design	Edoardo Debenedetti et.al.	2503.18813	null
2025-03-24	Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code	Augusto B. Corrêa et.al.	2503.18809	null
2025-03-24	REALM: A Dataset of Real-World LLM Use Cases	Jingwen Cheng et.al.	2503.18792	null
2025-03-24	BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache	Dayou Du et.al.	2503.18773	link
2025-03-24	AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning	Alan Dao et.al.	2503.18769	null
2025-03-24	RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation	Chengbo Yuan et.al.	2503.18738	null
2025-03-24	Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving	Hongkuan Zhou et.al.	2503.18730	null
2025-03-24	LLaVAction: evaluating and training multi-modal large language models for action recognition	Shaokai Ye et.al.	2503.18712	link
2025-03-24	Revisiting Automatic Data Curation for Vision Foundation Models in Digital Pathology	Boqi Chen et.al.	2503.18709	null
2025-03-24	OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad	Luyao Tang et.al.	2503.18695	link
2025-03-25	Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models	Yazhou Zhang et.al.	2503.18681	null
2025-03-24	NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping	Tianyi Wang et.al.	2503.18678	null
2025-03-24	Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark	Bingchen Miao et.al.	2503.18665	link
2025-03-24	From Fragment to One Piece: A Survey on AI-Driven Graphic Design	Xingxing Zou et.al.	2503.18641	null
2025-03-24	Adaptive Machine Learning for Resource-Constrained Environments	Sebastián A. Cajas Ordóñez et.al.	2503.18634	link
2025-03-24	Generative Dataset Distillation using Min-Max Diffusion Model	Junqiao Fan et.al.	2503.18626	null
2025-03-24	Scaling Laws for Emulation of Stellar Spectra	Tomasz Różański et.al.	2503.18617	null
2025-03-24	LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment	Jong Myoung Kim et.al.	2503.18603	null
2025-03-24	Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization	Minsu Kim et.al.	2503.18599	null
2025-03-24	A Universal Model Combining Differential Equations and Neural Networks for Ball Trajectory Prediction	Zhiwei Shi et.al.	2503.18584	null
2025-03-24	Anchor-based oversampling for imbalanced tabular data via contrastive and adversarial learning	Hadi Mohammadi et.al.	2503.18569	null
2025-03-24	Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures	Abdoul Majid O. Thiombiano et.al.	2503.18565	null
2025-03-24	Power-fractional distributions and branching processes	Gerold Alsmeyer et.al.	2503.18563	null
2025-03-24	Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models	Nariman Naderi et.al.	2503.18562	null
2025-03-25	AMD-Hummingbird: Towards an Efficient Text-to-Video Model	Takashi Isobe et.al.	2503.18559	link
2025-03-24	HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications	Guneet Mutreja et.al.	2503.18540	null
2025-03-24	SciClaims: An End-to-End Generative System for Biomedical Claim Analysis	Raúl Ortega et.al.	2503.18526	null
2025-03-24	P3Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction	Yufeng Zhong et.al.	2503.18525	null
2025-03-24	Can Text-to-Video Generation help Video-Language Alignment?	Luca Zanella et.al.	2503.18507	null
2025-03-24	Autoregressive Language Models for Knowledge Base Population: A case study in the space mission domain	Andrés García-Silva et.al.	2503.18502	null
2025-03-24	Verbal Process Supervision Elicits Better Coding Agents	Hao-Yuan Chen et.al.	2503.18494	null
2025-03-24	Safeguarding Mobile GUI Agent via Logic-based Action Verification	Jungjae Lee et.al.	2503.18492	null
2025-03-24	Large Language Models powered Network Attack Detection: Architecture, Opportunities and Case Study	Xinggong Zhang et.al.	2503.18487	null
2025-03-24	Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification	Zequn Zeng et.al.	2503.18483	link
2025-03-24	Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding	Xiangrui Liu et.al.	2503.18478	null
2025-03-24	PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models	Tadeusz Dziarmaga et.al.	2503.18462	link
2025-03-24	MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing	Lingting Zhu et.al.	2503.18461	null
2025-03-24	ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation	Jiahui Xiang et.al.	2503.18460	null
2025-03-24	SEAlign: Alignment Training for Software Engineering Agent	Kechi Zhang et.al.	2503.18455	null
2025-03-24	InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment	Yunhong Lu et.al.	2503.18454	link
2025-03-24	ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation	Guosheng Zhao et.al.	2503.18438	null
2025-03-24	A Simple yet Effective Layout Token in Large Language Models for Document Understanding	Zhaoqing Zhu et.al.	2503.18434	null
2025-03-24	Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning	Junsong Li et.al.	2503.18432	null
2025-03-24	Breaking the Encoder Barrier for Seamless Video-Language Understanding	Handong Li et.al.	2503.18422	null
2025-03-25	Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning	Sherry X. Chen et.al.	2503.18406	link
2025-03-24	Solving Situation Puzzles with Large Language Model and External Reformulation	Kun Li et.al.	2503.18394	null
2025-03-24	Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors	Joshua Krook et.al.	2503.18387	null
2025-03-24	Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance	Sicong Feng et.al.	2503.18386	null
2025-03-24	Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs	Chang Gao et.al.	2503.18377	null
2025-03-24	J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain	Yiran Hu et.al.	2503.18360	link
2025-03-24	Mitigating Cache Noise in Test-Time Adaptation for Large Vision-Language Models	Haotian Zhai et.al.	2503.18334	null
2025-03-24	Optimizing Influence Campaigns: Nudging under Bounded Confidence	Yen-Shao Chen et.al.	2503.18331	null
2025-03-24	Towards Training-free Anomaly Detection with Vision and Language Foundation Models	Jinjin Zhang et.al.	2503.18325	link
2025-03-24	Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions	Dong Jing et.al.	2503.18320	null
2025-03-24	Knowledge Transfer from LLMs to Provenance Analysis: A Semantic-Augmented Method for APT Detection	Fei Zuo et.al.	2503.18316	null
2025-03-24	DeepFund: Will LLM be Professional at Fund Investment? A Live Arena Perspective	Changlun Li et.al.	2503.18313	null
2025-03-24	Enhancing LLM-based Code Translation in Repository Context via Triple Knowledge-Augmented	Guangsheng Ou et.al.	2503.18305	null
2025-03-24	How to Capture and Study Conversations Between Research Participants and ChatGPT: GPT for Researchers (g4r.org)	Jin Kim et.al.	2503.18303	null
2025-03-24	Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module	Yishen Liu et.al.	2503.18297	null
2025-03-24	Surgical Action Planning with Large Language Models	Mengya Xu et.al.	2503.18296	null
2025-03-24	Fact-checking AI-generated news reports: Can LLMs catch their own lies?	Jiayi Yao et.al.	2503.18293	null
2025-03-24	Jenga: Effective Memory Management for Serving LLM with Heterogeneity	Chen Zhang et.al.	2503.18292	null
2025-03-24	Sun-Shine: A Large Language Model for Tibetan Culture	Cheng Huang et.al.	2503.18288	link
2025-03-24	CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI	Siyuan Cheng et.al.	2503.18286	link
2025-03-24	Analyzing Islamophobic Discourse Using Semi-Coded Terms and LLMs	Raza Ul Mustafa et.al.	2503.18273	null
2025-03-24	Efficient Inference for Covariate-adjusted Bradley-Terry Model with Covariate Shift	Xiudi Li et.al.	2503.18256	null
2025-03-24	Surface-Aware Distilled 3D Semantic Features	Lukas Uzolas et.al.	2503.18254	null
2025-03-24	Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages	Tadesse Destaw Belay et.al.	2503.18253	null
2025-03-23	CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation	Jungsoo Lee et.al.	2503.18244	null
2025-03-23	ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices	Aneesh Vathul et.al.	2503.18242	null
2025-03-23	Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters	Roberto Garcia et.al.	2503.18216	link
2025-03-23	LakotaBERT: A Transformer-based Model for Low Resource Lakota Language	Kanishka Parankusham et.al.	2503.18212	null
2025-03-23	The Power of Small LLMs in Geometry Generation for Physical Simulations	Ossama Shafiq et.al.	2503.18178	null
2025-03-23	Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering	Zixin Chen et.al.	2503.18172	null
2025-03-23	Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes	Kelly O. Marshall et.al.	2503.18155	null
2025-03-23	LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space	Zhangyu Wang et.al.	2503.18142	null
2025-03-23	AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs	Diwei Wang et.al.	2503.18141	null
2025-03-23	MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation	Jiaxin Huang et.al.	2503.18135	null
2025-03-23	MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection	Yibo Yan et.al.	2503.18132	null
2025-03-23	Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization	Juntao Dai et.al.	2503.18130	null
2025-03-23	GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks	Varvara Krechetova et.al.	2503.18129	link
2025-03-23	$D^2LoRA$: Data-Driven LoRA Initialization for Low Resource Tasks	Javad SeraJ et.al.	2503.18089	null
2025-03-23	Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms	Nachuan Ma et.al.	2503.18082	null
2025-03-21	Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique	Yansi Li et.al.	2503.17363	null
2025-03-21	Position: Interactive Generative Video as Next-Generation Game Engine	Jiwen Yu et.al.	2503.17359	null
2025-03-21	HCAST: Human-Calibrated Autonomy Software Tasks	David Rein et.al.	2503.17354	link
2025-03-21	NdLinear Is All You Need for Representation Learning	Alex Reneau et.al.	2503.17353	link
2025-03-21	OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement	Yihe Deng et.al.	2503.17352	link
2025-03-21	Capturing Individual Human Preferences with Reward Features	André Barreto et.al.	2503.17338	null
2025-03-21	Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs	Reem Gody et.al.	2503.17336	null
2025-03-21	CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities	Yuxuan Zhu et.al.	2503.17332	link
2025-03-21	LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language	Kun Chu et.al.	2503.17309	link
2025-03-21	Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests	John Naulty et.al.	2503.17302	null
2025-03-21	Offline Model-Based Optimization: Comprehensive Review	Minsu Kim et.al.	2503.17286	link
2025-03-21	CASE – Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement	Gaifan Zhang et.al.	2503.17279	null
2025-03-21	Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras	Shuang Guo et.al.	2503.17262	link
2025-03-21	SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging	Aladin Djuhera et.al.	2503.17239	link
2025-03-21	FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs	Albert Sawczyn et.al.	2503.17229	null
2025-03-21	Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation	Giacomo Savazzi et.al.	2503.17224	null
2025-03-21	Automating Adjudication of Cardiovascular Events Using Large Language Models	Sonish Sivarajkumar et.al.	2503.17222	null
2025-03-21	TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning	Sheng Wang et.al.	2503.17195	null
2025-03-21	LLMs Love Python: A Study of LLMs’ Bias for Programming Languages and Libraries	Lukas Twist et.al.	2503.17181	link
2025-03-21	D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens	Panpan Wang et.al.	2503.17155	null
2025-03-21	Modifying Large Language Model Post-Training for Diverse Creative Writing	John Joon Young Chung et.al.	2503.17126	link
2025-03-21	Large Language Model Compression via the Nested Activation-Aware Decomposition	Jun Lu et.al.	2503.17101	null
2025-03-21	Deterministic AI Agent Personality Expression through Standard Psychological Diagnostics	J. M. Diederik Kruijssen et.al.	2503.17085	null
2025-03-21	A Study into Investigating Temporal Robustness of LLMs	Jonas Wallat et.al.	2503.17073	null
2025-03-21	PVChat: Personalized Video Chat with One-Shot Learning	Yufei Shi et.al.	2503.17069	null
2025-03-21	Problem Framing in the AI era: a new model	Matteo Tuveri et.al.	2503.17040	null
2025-03-21	AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process	Junjie Hu et.al.	2503.17029	null
2025-03-21	RiboFlow: Conditional De Novo RNA Sequence-Structure Co-Design via Synergistic Flow Matching	Runze Ma et.al.	2503.17007	null
2025-03-21	Text2Model: Generating dynamic chemical reactor models using large language models (LLMs)	Sophia Rupprecht et.al.	2503.17004	null
2025-03-21	A Survey on Personalized Alignment – The Missing Piece for Large Language Models in Real-World Applications	Jian Guan et.al.	2503.17003	null
2025-03-21	Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation	Qinghe Ma et.al.	2503.16997	link
2025-03-21	TRACE: Time SeRies PArameter EffiCient FinE-tuning	Yuze Li et.al.	2503.16991	null
2025-03-21	Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models	Haichao Zhang et.al.	2503.16980	null
2025-03-21	Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks	Julian Junyan Wang et.al.	2503.16974	null
2025-03-21	Distilling Monocular Foundation Model for Fine-grained Depth Completion	Yingping Liang et.al.	2503.16970	null
2025-03-21	HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis	Mengtian Li et.al.	2503.16944	null
2025-03-21	TEMPO: Temporal Preference Optimization of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment	Shicheng Li et.al.	2503.16929	link
2025-03-21	RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation	Linxi Liang et.al.	2503.16922	link
2025-03-21	Malliavin-Bismut Score-based Diffusion Models	Ehsan Mirafzali et.al.	2503.16917	null
2025-03-21	FAIT: Fault-Aware Fine-Tuning for Better Code Generation	Lishui Fan et.al.	2503.16913	null
2025-03-21	Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation	Jingzhi Fang et.al.	2503.16893	null
2025-03-21	Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation	Jiangcheng Qin et.al.	2503.16875	null
2025-03-21	MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization	Jian Zhang et.al.	2503.16874	null
2025-03-21	Lie Detector: Unified Backdoor Detection via Cross-Examination Framework	Xuan Wang et.al.	2503.16872	null
2025-03-21	Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs	Anshumann et.al.	2503.16870	null
2025-03-21	Nonparametric Factor Analysis and Beyond	Yujia Zheng et.al.	2503.16865	null
2025-03-21	MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering	Jialin Chen et.al.	2503.16858	link
2025-03-21	Generative Compositor for Few-Shot Visual Information Extraction	Zhibo Yang et.al.	2503.16854	null
2025-03-21	Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models	Suho Yoo et.al.	2503.16853	null
2025-03-21	Towards LLM Guardrails via Sparse Representation Steering	Zeqing He et.al.	2503.16851	null
2025-03-21	LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models	Jian Liang et.al.	2503.16843	null
2025-03-21	Downstream Analysis of Foundational Medical Vision Models for Disease Progression	Basar Demir et.al.	2503.16842	null
2025-03-21	When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts	Jun Seong Kim et.al.	2503.16826	null
2025-03-21	When Debate Fails: Bias Reinforcement in Large Language Models	Jihwan Oh et.al.	2503.16814	null
2025-03-21	Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models	Mengsong Wu et.al.	2503.16779	link
2025-03-21	Current and Future Use of Large Language Models for Knowledge Work	Michelle Brachman et.al.	2503.16774	null
2025-03-21	On Explaining (Large) Language Models For Code Using Global Code-Based Explanations	David N. Palacio et.al.	2503.16771	null
2025-03-20	Automated Harmfulness Testing for Code Large Language Models	Honghao Tan et.al.	2503.16740	null
2025-03-20	Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models	Chengkai Huang et.al.	2503.16734	null
2025-03-20	Natural Language Generation	Emiel van Miltenburg et.al.	2503.16728	null
2025-03-20	Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding	Jinlong Li et.al.	2503.16707	link
2025-03-20	APPA: Agentic Preformulation Pathway Assistant	Julius Lange et.al.	2503.16698	null
2025-03-20	GAIR: Improving Multimodal Geo-Foundation Model with Geo-Aligned Implicit Representations	Zeping Liu et.al.	2503.16683	null
2025-03-20	Echoes of Power: Investigating Geopolitical Bias in US and China Large Language Models	Andre G. C. Pacheco et.al.	2503.16679	null
2025-03-20	Accelerating Transformer Inference and Training with 2:4 Activation Sparsity	Daniel Haziza et.al.	2503.16672	null
2025-03-20	Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms	Niki van Stein et.al.	2503.16668	null
2025-03-20	A preliminary data fusion study to assess the feasibility of Foundation Process-Property Models in Laser Powder Bed Fusion	Oriol Vendrell-Gallart et.al.	2503.16667	null
2025-03-20	Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs	Maxime Delmas et.al.	2503.16655	null
2025-03-20	Leveraging Large Language Models for Explainable Activity Recognition in Smart Homes: A Critical Evaluation	Michele Fiori et.al.	2503.16622	null
2025-03-20	A Recipe for Generating 3D Worlds From a Single Image	Katja Schwarz et.al.	2503.16611	null
2025-03-20	Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions	Hadi Amini et.al.	2503.16585	link
2025-03-22	Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation	Yuqing Wang et.al.	2503.16430	null
2025-03-20	DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding	Keyan Chen et.al.	2503.16426	link
2025-03-20	SynCity: Training-Free Generation of 3D Worlds	Paul Engstler et.al.	2503.16420	null
2025-03-20	Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models	Yang Sui et.al.	2503.16419	link
2025-03-20	M3: 3D-Spatial MultiModal Memory	Xueyan Zou et.al.	2503.16413	link
2025-03-20	DreamTexture: Shape from Virtual Texture with Analysis by Augmentation	Ananta R. Bhattarai et.al.	2503.16412	null
2025-03-20	VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness	SeungJu Cha et.al.	2503.16406	link
2025-03-20	The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination	Yifan Sun et.al.	2503.16402	link
2025-03-20	Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them	Guanyu Chen et.al.	2503.16401	null
2025-03-20	Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation	Yijia Luo et.al.	2503.16385	link
2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null
2025-03-20	JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse	Muyao Li et.al.	2503.16365	null
2025-03-20	CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners	Yunzhi Yao et.al.	2503.16356	link
2025-03-20	Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences	Krithik Ramesh et.al.	2503.16351	null
2025-03-20	LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates	Ying Shen et.al.	2503.16334	null
2025-03-20	OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence	Long Yuan et.al.	2503.16326	null
2025-03-20	Issue2Test: Generating Reproducing Test Cases from Issue Reports	Noor Nashid et.al.	2503.16320	null
2025-03-21	Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1	Peiran Gu et.al.	2503.16304	null
2025-03-20	SceneMI: Motion In-betweening for Modeling Human-Scene Interactions	Inwoo Hwang et.al.	2503.16289	null
2025-03-21	Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens	Shuqi Lu et.al.	2503.16278	link
2025-03-20	Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data	Zijian Li et.al.	2503.16260	null
2025-03-20	Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models	Keda Tao et.al.	2503.16257	null
2025-03-21	Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning	Zhaowei Liu et.al.	2503.16252	link
2025-03-20	Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t	Quy-Anh Dang et.al.	2503.16219	link
2025-03-20	MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion	Qizhi Pei et.al.	2503.16212	link
2025-03-20	VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis	Chia-Yi Hsu et.al.	2503.16195	null
2025-03-21	Affective Polarization Amongst Swedish Politicians	François t’Serstevens et.al.	2503.16193	link
2025-03-20	Large Language Models for Water Distribution Systems Modeling and Decision-Making	Yinon Goldshtein et.al.	2503.16191	null
2025-03-20	CLS-RL: Image Classification with Rule-Based Reinforcement Learning	Ming Li et.al.	2503.16188	link
2025-03-20	Narrowing Class-Wise Robustness Gaps in Adversarial Training	Fatemeh Amerehi et.al.	2503.16179	null
2025-03-20	CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models	Hong Yi Lin et.al.	2503.16167	null
2025-03-20	SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs	Shibo Jie et.al.	2503.16163	null
2025-03-20	Towards Lighter and Robust Evaluation for Retrieval Augmented Generation	Alex-Razvan Ispas et.al.	2503.16161	link
2025-03-20	Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems	Shenbin Qian et.al.	2503.16158	link
2025-03-20	Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models	Mats Faulborn et.al.	2503.16148	null
2025-03-20	Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs	Djamel Eddine Khelladi et.al.	2503.16144	null
2025-03-21	MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering	Feiyang Li et.al.	2503.16131	null
2025-03-20	The Impact of Revealing Large Language Model Stochasticity on Trust, Reliability, and Anthropomorphization	Chelse Swoopes et.al.	2503.16114	null
2025-03-20	OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP	Mohamad Hassan N C et.al.	2503.16106	link
2025-03-20	Cultural Alignment in Large Language Models Using Soft Prompt Tuning	Reem I. Masoud et.al.	2503.16094	null
2025-03-20	Quantum Chebyshev Probabilistic Models for Fragmentation Functions	Jorge J. Martínez de Lejarza et.al.	2503.16073	null
2025-03-20	Tuning LLMs by RAG Principles: Towards LLM-native Memory	Jiale Wei et.al.	2503.16071	link
2025-03-20	SALT: Singular Value Adaptation with Low-Rank Transformation	Abdelrahman Elsayed et.al.	2503.16055	link
2025-03-20	Meta-Learning Neural Mechanisms rather than Bayesian Priors	Michael Goodale et.al.	2503.16048	null
2025-03-20	Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation	Zhiyu Cao et.al.	2503.16043	null
2025-03-20	GreenIQ: A Deep Search Platform for Comprehensive Carbon Market Analysis and Automated Report Generation	Bisola Faith Kayode et.al.	2503.16041	null
2025-03-20	Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond	Yaoyao Yu et.al.	2503.16040	null
2025-03-20	Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models	Zhihang Liu et.al.	2503.16036	link
2025-03-20	The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement	Ruihan Yang et.al.	2503.16024	null
2025-03-20	BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models	Zenghui Yuan et.al.	2503.16023	null
2025-03-20	Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models	Mario Sanz-Guerrero et.al.	2503.16022	link
2025-03-21	Autonomous AI imitators increase diversity in homogeneous information ecosystems	Emil Bakkensen Johansen et.al.	2503.16021	null
2025-03-20	GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions	Xiaomeng Chu et.al.	2503.16013	null
2025-03-20	“This could save us months of work” – Use Cases of AI and Automation Support in Investigative Journalism	Besjon Cifliku et.al.	2503.16011	null
2025-03-20	ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph	Langming Liu et.al.	2503.15990	null
2025-03-20	A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli	Pengyu Liu et.al.	2503.15978	null
2025-03-20	Stability of Schrödinger bridges and Sinkhorn semigroups for log-concave models	Pierre Del Moral et.al.	2503.15963	null
2025-03-20	GAN-enhanced Simulation-driven DNN Testing in Absence of Ground Truth	Mohammed Attaoui et.al.	2503.15953	null
2025-03-20	From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models	Jinyi Liu et.al.	2503.15944	null
2025-03-21	Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment	Gaole Dai et.al.	2503.15937	null
2025-03-20	Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning	Peiyi Lin et.al.	2503.15924	null
2025-03-20	SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models	Fahao Chen et.al.	2503.15921	null
2025-03-20	Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras	Beilei Cui et.al.	2503.15917	null
2025-03-20	From Structured Prompts to Open Narratives: Measuring Gender Bias in LLMs Through Open-Ended Storytelling	Evan Chen et.al.	2503.15904	null
2025-03-20	Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models	Baolong Bi et.al.	2503.15888	link
2025-03-21	Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance	Hui Liu et.al.	2503.15886	null
2025-03-20	DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System	Kai Chen et.al.	2503.15876	null
2025-03-20	MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations	Kyungho Bae et.al.	2503.15871	null
2025-03-20	TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data	Rohit Kundu et.al.	2503.15867	null
2025-03-20	DroidTTP: Mapping Android Applications with TTP for Cyber Threat Intelligence	Dincy R Arikkat et.al.	2503.15866	link
2025-03-20	VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling	Hyojun Go et.al.	2503.15855	null
2025-03-20	Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey	Xiaoou Liu et.al.	2503.15850	null
2025-03-20	Entropy-based Exploration Conduction for Multi-step Reasoning	Jinghan Zhang et.al.	2503.15848	null
2025-03-20	Automatic Generation of Safety-compliant Linear Temporal Logic via Large Language Model: A Self-supervised Framework	Junle Li et.al.	2503.15840	null
2025-03-20	Enhancing LLM Code Generation with Ensembles: A Similarity-Based Selection Approach	Tarek Mahmud et.al.	2503.15838	null
2025-03-20	Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation	Shangqing Zhao et.al.	2503.15837	link
2025-03-20	Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection	Haotian Ma et.al.	2503.15818	null
2025-03-20	A Vision Centric Remote Sensing Benchmark	Abduljaleel Adejumo et.al.	2503.15816	null
2025-03-20	Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing	Vishnu Asutosh Dasu et.al.	2503.15815	null
2025-03-20	ChatGPT and U(X): A Rapid Review on Measuring the User Experience	Katie Seaborn et.al.	2503.15808	null
2025-03-20	Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture	Cheng Li et.al.	2503.15807	null
2025-03-20	DNA Bench: When Silence is Smarter – Benchmarking Over-Reasoning in Reasoning LLMs	Masoud Hashemi et.al.	2503.15793	null
2025-03-20	RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models	Parham Saremi et.al.	2503.15784	null
2025-03-20	Grammar and Gameplay-aligned RL for Game Description Generation with LLMs	Tsunehiko Tanaka et.al.	2503.15783	null
2025-03-20	AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models	Boshra Khalili et.al.	2503.15778	null
2025-03-20	Detecting LLM-Written Peer Reviews	Vishisht Rao et.al.	2503.15772	link
2025-03-20	Towards Agentic AI Networking in 6G: A Generative Foundation Model-as-Agent Approach	Yong Xiao et.al.	2503.15764	null
2025-03-20	Dialogic Learning in Child-Robot Interaction: A Hybrid Approach to Personalized Educational Content Generation	Elena Malnatsky et.al.	2503.15762	null
2025-03-20	GraPLUS: Graph-based Placement Using Semantics for Image Composition	Mir Mohammad Khaleghi et.al.	2503.15761	null
2025-03-20	AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration	Andy Zhou et.al.	2503.15754	null
2025-03-20	Using Language Models to Decipher the Motivation Behind Human Behaviors	Yutong Xie et.al.	2503.15752	null
2025-03-19	Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat	Joseph Emmanuel DL Dayo et.al.	2503.15726	null
2025-03-21	Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication	Sin-Yu Huang et.al.	2503.15722	null
2025-03-19	Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View	Mathilde Aguiar et.al.	2503.15718	link
2025-03-19	Safety Aware Task Planning via Large Language Models in Robotics	Azal Ahmad Khan et.al.	2503.15707	null
2025-03-19	GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving	William Ljungbergh et.al.	2503.15672	null
2025-03-19	Enhancing Pancreatic Cancer Staging with Large Language Models: The Role of Retrieval-Augmented Generation	Hisashi Johno et.al.	2503.15664	null
2025-03-19	R$^2$: A LLM Based Novel-to-Screenplay Generation Framework with Causal Plot Graphs	Zefeng Lin et.al.	2503.15655	null
2025-03-19	LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning	Federico Cocchi et.al.	2503.15621	link
2025-03-19	Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings	Austin Xu et.al.	2503.15620	link
2025-03-19	SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks	Yifei Zhou et.al.	2503.15478	link
2025-03-19	Cube: A Roblox View of 3D Intelligence	Foundation AI Team et.al.	2503.15475	link
2025-03-19	EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining	Boshen Xu et.al.	2503.15470	link
2025-03-19	From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment	Jia-Nan Li et.al.	2503.15463	link
2025-03-19	Di$\mathtt{[M]}$O: Distilling Masked Diffusion Models into One-step Generator	Yuanzhi Zhu et.al.	2503.15457	null
2025-03-19	SkyLadder: Better and Faster Pretraining via Context Window Scheduling	Tongyao Zhu et.al.	2503.15450	link
2025-03-19	Visual Position Prompt for MLLM based Visual Grounding	Wei Tang et.al.	2503.15426	link
2025-03-19	Probing the topology of the space of tokens with structured prompts	Michael Robinson et.al.	2503.15421	null
2025-03-19	LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding	Amirhossein Kazerouni et.al.	2503.15420	null
2025-03-19	Temporal Regularization Makes Your Video Generator Stronger	Harold Haodong Chen et.al.	2503.15417	null
2025-03-19	Visual Persona: Foundation Model for Full-Body Human Customization	Jisu Nam et.al.	2503.15406	null
2025-03-19	FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation	Yumin Zhang et.al.	2503.15390	null
2025-03-19	Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers	Corentin Vazia et.al.	2503.15383	null
2025-03-19	EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models	Yinan Liang et.al.	2503.15369	null
2025-03-19	SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation	Thomas Pickard et.al.	2503.15358	null
2025-03-19	SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models	I-Fan Lin et.al.	2503.15351	null
2025-03-19	TruthLens: A Training-Free Paradigm for DeepFake Detection	Ritabrata Chakraborty et.al.	2503.15342	null
2025-03-19	Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs	Yuqi Zhu et.al.	2503.15341	null
2025-03-19	Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context	Junyi Ao et.al.	2503.15338	link
2025-03-19	Euclid Quick Data Release (Q1) Exploring galaxy properties with a multi-modal foundation model	Euclid Collaboration et.al.	2503.15312	link
2025-03-19	Euclid Quick Data Release (Q1): First visual morphology catalogue	Euclid Collaboration et.al.	2503.15310	link
2025-03-19	aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion	Jia Li et.al.	2503.15301	null
2025-03-19	Inside-Out: Hidden Factual Knowledge in LLMs	Zorik Gekhman et.al.	2503.15299	null
2025-03-19	SENAI: Towards Software Engineering Native Generative Artificial Intelligence	Mootez Saad et.al.	2503.15282	null
2025-03-19	MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration	David Wan et.al.	2503.15272	null
2025-03-19	Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning?	Roberto Araya et.al.	2503.15268	null
2025-03-19	LEGION: Learning to Ground and Explain for Synthetic Image Detection	Hengrui Kang et.al.	2503.15264	null
2025-03-19	Efficient allocation of image recognition and LLM tasks on multi-GPU system	Marcin Lawenda et.al.	2503.15252	null
2025-03-19	Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study	Jomar Thomas Almonte et.al.	2503.15248	null
2025-03-19	Exploring Large Language Models for Word Games: Who is the Spy?	Chentian Wei et.al.	2503.15235	link
2025-03-19	When LLMs Meet API Documentation: Can Retrieval Augmentation Aid Code Generation Just as It Helps Developers?	Jingyi Chen et.al.	2503.15231	null
2025-03-19	A Personalized Data-Driven Generative Model of Human Motion	Angelo Di Porzio et.al.	2503.15225	null
2025-03-19	A Foundation Model for Patient Behavior Monitoring and Suicide Detection	Rodrigo Oliver et.al.	2503.15221	null
2025-03-19	Context-Aware Vision Language Foundation Models for Ocular Disease Screening in Retinal Images	Lucie Berger et.al.	2503.15212	null
2025-03-19	DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation	Jiazhe Guo et.al.	2503.15208	null
2025-03-19	Benchmarking Large Language Models for Handwritten Text Recognition	Giorgia Crosilla et.al.	2503.15195	null
2025-03-19	Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems	Sejong Kim et.al.	2503.15191	link
2025-03-19	Foundation models may exhibit staged progression in novel CBRN threat disclosure	Kevin M Esvelt et.al.	2503.15182	null
2025-03-19	A Review on Large Language Models for Visual Analytics	Navya Sonal Agarwal et.al.	2503.15176	null
2025-03-19	Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks	Yuting Guo et.al.	2503.15169	null
2025-03-19	Object-Centric Pretraining via Target Encoder Bootstrapping	Nikola Đukić et.al.	2503.15141	null
2025-03-19	VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention	Mingzhe Zheng et.al.	2503.15138	null
2025-03-19	Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models	Man Fai Wong et.al.	2503.15129	null
2025-03-19	Text-Derived Relational Graph-Enhanced Network for Skeleton-Based Action Segmentation	Haoyu Ji et.al.	2503.15126	null
2025-03-19	Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification	Shichen Li et.al.	2503.15117	null
2025-03-19	DeCaFlow: A Deconfounding Causal Generative Model	Alejandro Almodóvar et.al.	2503.15114	link
2025-03-19	Reasoning Effort and Problem Complexity: A Scaling Analysis in LLMs	Benjamin Estermann et.al.	2503.15113	null
2025-03-19	OpenLLM-RTL: Open Dataset and Benchmark for LLM-Aided Design RTL Generation	Shang Liu et.al.	2503.15112	link
2025-03-19	VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making	Mohamed Salim Aissi et.al.	2503.15108	null
2025-03-19	Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings	Zonghao Ying et.al.	2503.15092	link
2025-03-19	Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs	Yao Cheng et.al.	2503.15091	null
2025-03-19	LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents	Ke Zhang et.al.	2503.15079	null
2025-03-19	Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis	Imanol G. Estepa et.al.	2503.15060	null
2025-03-19	ELTEX: A Framework for Domain-Driven Synthetic Data Generation	Arina Razmyslovich et.al.	2503.15055	link
2025-03-19	Studying and Understanding the Effectiveness and Failures of Conversational LLM-Based Repair	Aolin Chen et.al.	2503.15050	null
2025-03-19	SPADE: Systematic Prompt Framework for Automated Dialogue Expansion in Machine-Generated Text Detection	Haoyi Li et.al.	2503.15044	null
2025-03-19	DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling	Jianbo Zhao et.al.	2503.15029	null
2025-03-19	Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene	Shengqiong Wu et.al.	2503.15019	null
2025-03-19	LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones?	Amr Keleg et.al.	2503.15003	null
2025-03-19	Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering	Francesco Maria Molfese et.al.	2503.14996	null
2025-03-19	ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents	Hao Liang et.al.	2503.14948	null
2025-03-19	Generating Multimodal Driving Scenes via Next-Scene Prediction	Yanhao Wu et.al.	2503.14945	null
2025-03-19	UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation	Qihui Zhang et.al.	2503.14941	null
2025-03-19	VisNumBench: Evaluating Number Sense of Multimodal Large Language Models	Tengjin Weng et.al.	2503.14939	null
2025-03-19	Proceedings of the 3rd Italian Conference on Big Data and Data Science (ITADATA2024)	Nicola Bena et.al.	2503.14937	null
2025-03-19	FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding	Chongjun Tu et.al.	2503.14935	null
2025-03-19	Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices	Ziyao Wang et.al.	2503.14932	null
2025-03-19	GenM$^3$: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation	Junyu Shi et.al.	2503.14919	null
2025-03-19	MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models	Jiazheng Li et.al.	2503.14917	null
2025-03-19	Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology	Siyuan Yan et.al.	2503.14911	link
2025-03-19	POSTA: A Go-to Framework for Customized Artistic Poster Generation	Haoyu Chen et.al.	2503.14908	null
2025-03-19	Deep Contrastive Unlearning for Language Models	Estrid He et.al.	2503.14900	null
2025-03-19	When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach	Vaibhav Rathore et.al.	2503.14897	null
2025-03-19	Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations	Shuo Li et.al.	2503.14895	null
2025-03-19	MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer	Honglin Lin et.al.	2503.14891	link
2025-03-19	Pseudo-Relevance Feedback Can Improve Zero-Shot LLM-Based Dense Retrieval	Hang Li et.al.	2503.14887	null
2025-03-19	Envisioning an AI-Enhanced Mental Health Ecosystem	Kellie Yu Hui Sim et.al.	2503.14883	null
2025-03-19	Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks	Kai Zhang et.al.	2503.14882	null
2025-03-19	Chemical Foundation Model Guided Design of High Ionic Conductivity Electrolyte Formulations	Murtaza Zohair et.al.	2503.14878	null
2025-03-19	Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection	Peipeng Yu et.al.	2503.14853	null
2025-03-19	LogLLaMA: Transformer-based log anomaly detection with LLaMA	Zhuoyi Yang et.al.	2503.14849	null
2025-03-19	Think Like Human Developers: Harnessing Community Knowledge for Structured Code Reasoning	Chengran Yang et.al.	2503.14838	null
2025-03-19	Robust Transmission of Punctured Text with Large Language Model-based Recovery	Sojeong Park et.al.	2503.14831	null
2025-03-19	MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models	Chejian Xu et.al.	2503.14827	null
2025-03-18	Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection	Matt Franchi et.al.	2503.14754	link
2025-03-18	Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence	Sophia Hager et.al.	2503.14749	null
2025-03-18	GR00T N1: An Open Foundation Model for Generalist Humanoid Robots	NVIDIA et.al.	2503.14734	null
2025-03-18	CodingGenie: A Proactive LLM-Powered Programming Assistant	Sebastian Zhao et.al.	2503.14724	link
2025-03-18	Generating Medically-Informed Explanations for Depression Detection using LLMs	Xiangyong Chen et.al.	2503.14671	null
2025-03-18	RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving	Wenqi Jiang et.al.	2503.14649	null
2025-03-18	Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache	Hanchen Li et.al.	2503.14647	null
2025-03-18	Reinforcement learning-based motion imitation for physiologically plausible musculoskeletal motor control	Merkourios Simos et.al.	2503.14637	link
2025-03-18	Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving	Priscylla Silva et.al.	2503.14630	link
2025-03-18	Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives	Sara Sarto et.al.	2503.14604	link
2025-03-18	Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM	Yazeed Alnumay et.al.	2503.14603	null
2025-03-18	Aligning Multimodal LLM with Human Preference: A Survey	Tao Yu et.al.	2503.14504	link
2025-03-18	Deeply Supervised Flow-Based Generative Models	Inkyu Shin et.al.	2503.14494	null
2025-03-18	Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control	NVIDIA et.al.	2503.14492	link
2025-03-18	Engineering Scientific Assistants using Interactive Structured Induction of Programs	Shraddha Surana et.al.	2503.14488	null
2025-03-18	Gricean Norms as a Basis for Effective Collaboration	Fardin Saad et.al.	2503.14484	link
2025-03-18	ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing	Yulin Pan et.al.	2503.14482	null
2025-03-18	Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM	Xinyu Fang et.al.	2503.14478	link
2025-03-18	The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models	Erminia Calabrese et.al.	2503.14454	null
2025-03-18	Bolt3D: Generating 3D Scenes in Seconds	Stanislaw Szymanowicz et.al.	2503.14445	null
2025-03-18	EnvBench: A Benchmark for Automated Environment Setup	Aleksandra Eliseeva et.al.	2503.14443	link
2025-03-18	LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers	Nikhil Abhyankar et.al.	2503.14434	link
2025-03-18	PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play	Wei Fang et.al.	2503.14432	null
2025-03-18	Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models	Siwei Zhang et.al.	2503.14411	null
2025-03-18	Large Language Models for Virtual Human Gesture Selection	Parisa Ghanad Torshizi et.al.	2503.14408	null
2025-03-18	DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers	Mert Bulent Sariyildiz et.al.	2503.14405	null
2025-03-18	Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance	Lisha Li et.al.	2503.14402	null
2025-03-18	From “Hallucination” to “Suture”: Insights from Language Philosophy to Enhance Large Language Models	Qiantong Wang et.al.	2503.14392	null
2025-03-18	How much do LLMs learn from negative examples?	Shadi Hamdan et.al.	2503.14391	link
2025-03-18	Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation	Rikuto Tsuchida et.al.	2503.14382	null
2025-03-18	On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller?	Pouria Sarhadi et.al.	2503.14379	link
2025-03-18	Impossible Videos	Zechen Bai et.al.	2503.14378	null
2025-03-18	RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment	Chao Wang et.al.	2503.14358	null
2025-03-18	MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts	Runqi Meng et.al.	2503.14355	null
2025-03-18	MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration	Yisen Xu et.al.	2503.14340	null
2025-03-18	DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies	Wei Song et.al.	2503.14324	link
2025-03-18	COPA: Comparing the Incomparable to Explore the Pareto Front	Adrián Javaloy et.al.	2503.14321	null
2025-03-18	RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT	Yuheng Li et.al.	2503.14304	null
2025-03-18	Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs	Nicolas Le Roux et.al.	2503.14286	null
2025-03-18	DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal	Vaibhav Aggarwal et.al.	2503.14269	link
2025-03-18	Quantization-Free Autoregressive Action Transformer	Ziyad Sheebaelhamd et.al.	2503.14259	link
2025-03-18	InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being	Guang Dai et.al.	2503.14257	null
2025-03-18	Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search	Yu Feng et.al.	2503.14251	null
2025-03-19	KG-IRAG: A Knowledge Graph-Based Iterative Retrieval-Augmented Generation Framework for Temporal Reasoning	Ruiyi Yang et.al.	2503.14234	null
2025-03-18	CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models	Yuyang Xue et.al.	2503.14232	null
2025-03-18	Decision Tree Induction Through LLMs via Semantically-Aware Evolution	Tennison Liu et.al.	2503.14217	null
2025-03-18	Inferring Event Descriptions from Time Series with Language Models	Mingtian Tan et.al.	2503.14190	link
2025-03-18	Towards Harmless Multimodal Assistants with Blind Preference Optimization	Yongqi Li et.al.	2503.14189	null
2025-03-18	Can LLMs Enable Verification in Mainstream Programming?	Aleksandr Shefer et.al.	2503.14183	null
2025-03-18	EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models	Zongyun Zhang et.al.	2503.14162	null
2025-03-18	Speculative Decoding for Verilog: Speed and Quality, All in One	Changran Xu et.al.	2503.14153	null
2025-03-18	Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding	Zining Wang et.al.	2503.14140	null
2025-03-18	CARE: A QLoRA-Fine Tuned Multi-Domain Chatbot With Fast Learning On Minimal Hardware	Ankit Dutta et.al.	2503.14136	null
2025-03-18	Inference-Time Intervention in Large Language Models for Reliable Requirement Verification	Paul Darm et.al.	2503.14130	null
2025-03-18	SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models	Subhadeep Koley et.al.	2503.14129	null
2025-03-18	PET-MAD, a universal interatomic potential for advanced materials modeling	Arslan Mazitov et.al.	2503.14118	link
2025-03-18	DangerMaps: Personalized Safety Advice for Travel in Urban Environments using a Retrieval-Augmented Language Model	Jonas Oppenlaender et.al.	2503.14103	null
2025-03-18	Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency	Jiangxuan Long et.al.	2503.14076	null
2025-03-18	Fast Autoregressive Video Generation with Diagonal Decoding	Yang Ye et.al.	2503.14070	null
2025-03-18	AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark	Xinhao Xiang et.al.	2503.14064	link
2025-03-18	Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach	Tianshu Wu et.al.	2503.14051	null
2025-03-18	Learning on LLM Output Signatures for gray-box LLM Behavior Analysis	Guy Bar-Shalom et.al.	2503.14043	link
2025-03-18	Intra and Inter Parser-Prompted Transformers for Effective Image Restoration	Cong Wang et.al.	2503.14037	link
2025-03-18	Synthetic Data Generation Using Large Language Models: Advances in Text and Code	Mihai Nadas et.al.	2503.14023	null
2025-03-18	MP-GUI: Modality Perception with MLLMs for GUI Understanding	Ziwei Wang et.al.	2503.14021	link
2025-03-18	Predicting Human Choice Between Textually Described Lotteries	Eyal Marantz et.al.	2503.14004	null
2025-03-18	MeshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative Modeling	Damian Boborzi et.al.	2503.14002	link
2025-03-18	The KoLMogorov Test: Compression by Code Generation	Ori Yoran et.al.	2503.13992	null
2025-03-18	Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks	Mykyta Syromiatnikov et.al.	2503.13988	link
2025-03-18	DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection	Jaewoo Song et.al.	2503.13985	null
2025-03-18	SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability	Jiankang Wang et.al.	2503.13983	null
2025-03-18	Empowering LLMs in Decision Games through Algorithmic Data Synthesis	Haolin Wang et.al.	2503.13980	null
2025-03-18	FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks	Siqi Zhang et.al.	2503.13966	null
2025-03-18	MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding	Siwei Han et.al.	2503.13964	link
2025-03-18	Survey of Adversarial Robustness in Multimodal Large Language Models	Chengze Jiang et.al.	2503.13962	null
2025-03-18	Improving LLM Video Understanding with 16 Frames Per Second	Yixuan Li et.al.	2503.13956	null
2025-03-18	ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models	Alexey Karev et.al.	2503.13923	null
2025-03-18	MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments	Zhengsheng Guo et.al.	2503.13882	null
2025-03-18	MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation	Donggon Jang et.al.	2503.13881	link
2025-03-18	Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment	Wei Chen et.al.	2503.13879	null
2025-03-18	Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations	Rui Yang et.al.	2503.13857	null
2025-03-18	MDTeamGPT: A Self-Evolving LLM-based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation	Kai Chen et.al.	2503.13856	null
2025-03-18	Causal Discovery from Data Assisted by Large Language Models	Kamyar Barakati et.al.	2503.13833	null
2025-03-18	Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection	Chunlei Li et.al.	2503.13828	link
2025-03-18	LLM-Empowered IoT for 6G Networks: Architecture, Challenges, and Solutions	Xiaopei Chen et.al.	2503.13819	null
2025-03-18	Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models	Mingming Peng et.al.	2503.13813	null
2025-03-18	The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations	Suyash Fulay et.al.	2503.13812	null
2025-03-18	Empowering GraphRAG with Knowledge Filtering and Integration	Kai Guo et.al.	2503.13804	null
2025-03-18	LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation	Yang Zhou et.al.	2503.13794	null
2025-03-18	Mapping the Trust Terrain: LLMs in Software Engineering – Insights and Perspectives	Dipin Khati et.al.	2503.13793	null
2025-03-17	Mitigating KV Cache Competition to Enhance User Experience in LLM Inference	Haiying Shen et.al.	2503.13773	null
2025-03-17	Do Large Language Models Understand Performance Optimization?	Bowen Cui et.al.	2503.13772	null
2025-03-17	Continual Unlearning for Foundational Text-to-Image Models without Generalization Erosion	Kartik Thakral et.al.	2503.13769	null
2025-03-17	AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications	Haiying Shen et.al.	2503.13737	null
2025-03-17	CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings	Daniil Orel et.al.	2503.13733	null
2025-03-17	FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models	Minghan Li et.al.	2503.13684	null
2025-03-17	Pensez: Less Data, Better Reasoning – Rethinking French LLM	Huy Hoang Ha et.al.	2503.13661	null
2025-03-17	INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations	Qian Meng et.al.	2503.13660	null
2025-03-17	SOSecure: Safer Code Generation with RAG and StackOverflow Discussions	Manisha Mukherjee et.al.	2503.13654	null
2025-03-17	Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos	Chiara Plizzari et.al.	2503.13646	link
2025-03-17	Plasmon-Plasmon Interaction in Nanoparticle Assemblies: Role of the Dipole-Quadrupole Coupling	Olivier Masset et.al.	2503.13645	null
2025-03-17	Evaluating Programming Language Confusion	Micheline Bénédicte Moumoula et.al.	2503.13620	null
2025-03-17	MetaScale: Test-Time Scaling with Evolving Meta-Thoughts	Qin Liu et.al.	2503.13447	null
2025-03-17	MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation	Zhenyu Wu et.al.	2503.13446	null
2025-03-17	Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance	Noah Y. Siegel et.al.	2503.13445	null
2025-03-17	VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning	Ye Liu et.al.	2503.13444	link
2025-03-17	Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images	Tianhao Wu et.al.	2503.13439	null
2025-03-17	xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference	Maximilian Beck et.al.	2503.13427	link
2025-03-17	Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation	Xinyu Lian et.al.	2503.13424	null
2025-03-17	A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives	Weiqiang Jin et.al.	2503.13415	null
2025-03-18	DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective	Dengyun Peng et.al.	2503.13413	link
2025-03-17	Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis	Alexander Ku et.al.	2503.13401	null
2025-03-17	MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research	James Burgess et.al.	2503.13399	link
2025-03-17	Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning	Mengyao Lyu et.al.	2503.13383	null
2025-03-17	Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning	Hai-Long Sun et.al.	2503.13360	null
2025-03-17	Agents Play Thousands of 3D Video Games	Zhongwen Xu et.al.	2503.13356	null
2025-03-17	Valid Text-to-SQL Generation with Unification-based DeepStochLog	Ying Jiao et.al.	2503.13342	link
2025-03-17	LearnMate: Enhancing Online Education with LLM-Powered Personalized Learning Plans and Support	Xinyu Jessica Wang et.al.	2503.13340	null
2025-03-17	LEAVS: An LLM-based Labeler for Abdominal CT Supervision	Ricardo Bigolin Lanfredi et.al.	2503.13330	link
2025-03-17	Edit Transfer: Learning Image Editing via Vision In-Context Relations	Lan Chen et.al.	2503.13327	null
2025-03-17	Computation Mechanism Behind LLM Position Generalization	Chi Han et.al.	2503.13305	null
2025-03-17	LIMCA: LLM for Automating Analog In-Memory Computing Architecture Design Exploration	Deepak Vungarala et.al.	2503.13301	null
2025-03-17	A Survey on Transformer Context Extension: Approaches and Evaluation	Yijun Liu et.al.	2503.13299	null
2025-03-17	LLM-Match: An Open-Sourced Patient Matching Model Based on Large Language Models and Retrieval-Augmented Generation	Xiaodi Li et.al.	2503.13281	null
2025-03-17	Knowledge-Aware Iterative Retrieval for Multi-Agent Systems	Seyoung Song et.al.	2503.13275	null
2025-03-17	Graph Generative Models Evaluation with Masked Autoencoder	Chengen Wang et.al.	2503.13271	null
2025-03-17	TablePilot: Recommending Human-Preferred Tabular Data Analysis with Large Language Models	Deyin Yi et.al.	2503.13262	null
2025-03-17	MindEye-OmniAssist: A Gaze-Driven LLM-Enhanced Assistive Robot System for Implicit Intention Recognition and Task Execution	Zejia Zhang et.al.	2503.13250	null
2025-03-17	Can Language Models Follow Multiple Turns of Entangled Instructions?	Chi Han et.al.	2503.13222	link
2025-03-17	Dense Policy: Bidirectional Autoregressive Learning of Actions	Yue Su et.al.	2503.13217	null
2025-03-17	MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis	Marvin Seyfarth et.al.	2503.13211	null
2025-03-17	Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach	Sinan Fan et.al.	2503.13208	null
2025-03-17	MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways	Zhen Chen et.al.	2503.13205	null
2025-03-17	3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o	Dingning Liu et.al.	2503.13185	null
2025-03-17	Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs	Jasmin Wachter et.al.	2503.13149	null
2025-03-17	Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images	Yaxi Chen et.al.	2503.13131	null
2025-03-17	3D Human Interaction Generation: A Survey	Siyuan Fan et.al.	2503.13120	null
2025-03-17	VeriLeaky: Navigating IP Protection vs Utility in Fine-Tuning for LLM-Driven Verilog Coding	Zeng Wang et.al.	2503.13116	null
2025-03-17	MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs	Erik Daxberger et.al.	2503.13111	null
2025-03-17	DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry	Jing Li et.al.	2503.13110	link
2025-03-17	Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences	Kedi Chen et.al.	2503.13109	null
2025-03-17	Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference	Hao Yin et.al.	2503.13108	link
2025-03-17	ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large Language Models	Hao Yin et.al.	2503.13107	link
2025-03-17	Managing Hybrid Solid-State Drives Using Large Language Models	Qian Wei et.al.	2503.13105	null
2025-03-17	REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities	Alexander Pugachev et.al.	2503.13102	null
2025-03-17	Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa	Babangida Sani et.al.	2503.13101	link
2025-03-17	ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning	Baohao Liao et.al.	2503.13089	null
2025-03-17	A Framework to Assess Multilingual Vulnerabilities of LLMs	Likai Tang et.al.	2503.13081	null
2025-03-17	Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation	Yihong Luo et.al.	2503.13070	null
2025-03-17	Do Vision Models Develop Human-Like Progressive Difficulty Understanding?	Zeyi Huang et.al.	2503.13058	null
2025-03-17	MaskSDM with Shapley values to improve flexibility, robustness, and explainability in species distribution modeling	Robin Zbinden et.al.	2503.13057	null
2025-03-17	Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided, Self-Consistent MLLMs for Food Preparation Task Planning	Yu-Hong Shen et.al.	2503.13055	null
2025-03-17	InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving	Ruiqi Song et.al.	2503.13047	null
2025-03-17	Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task	Junjie Chen et.al.	2503.13038	null
2025-03-17	How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark	Roba Al Majzoub et.al.	2503.12990	link
2025-03-17	A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models	Palakorn Achananuparp et.al.	2503.12989	null
2025-03-17	ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM	Wenqiang Wang et.al.	2503.12988	null
2025-03-17	Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning	Junming Liu et.al.	2503.12972	null
2025-03-17	Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity	Eliot Beyler et.al.	2503.12966	null
2025-03-17	Training Video Foundation Models with NVIDIA NeMo	Zeeshan Patel et.al.	2503.12964	null
2025-03-17	HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding	Jiahe Zhao et.al.	2503.12955	null
2025-03-17	HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model	Haiyang Guo et.al.	2503.12941	null
2025-03-17	R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization	Jingyi Zhang et.al.	2503.12937	link
2025-03-17	Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs	Wei Hung et.al.	2503.12932	null
2025-03-17	MirrorGuard: Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting	Rui Pu et.al.	2503.12931	null
2025-03-17	Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models	Zhiyi Huang et.al.	2503.12923	null
2025-03-17	ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs	Pengcheng Wen et.al.	2503.12918	null
2025-03-17	HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models	Xinyan Jiang et.al.	2503.12908	link
2025-03-17	Optimizing Ansatz Design in Quantum Generative Adversarial Networks Using Large Language Models	Kento Ueda et.al.	2503.12884	null
2025-03-17	nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity	Tianqi Luo et.al.	2503.12880	null
2025-03-17	An interpretable approach to automating the assessment of biofouling in video footage	Evelyn J. Mannix et.al.	2503.12875	link
2025-03-17	UniReg: Foundation Model for Controllable Medical Image Registration	Zi Li et.al.	2503.12868	null
2025-03-17	Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English	Duke Nguyen et.al.	2503.12858	null
2025-03-17	Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation	Songjun Tu et.al.	2503.12854	link
2025-03-17	ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing	Aditi Tiwari et.al.	2503.12852	null
2025-03-17	GuideDog: A Real-World Egocentric Multimodal Dataset for Blind and Low-Vision Accessibility-Aware Guidance	Junhyeok Kim et.al.	2503.12844	null
2025-03-18	Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data	Haozhe Si et.al.	2503.12843	null
2025-03-17	A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules	Kairong Luo et.al.	2503.12811	link
2025-03-17	Grounded Chain-of-Thought for Multimodal Large Language Models	Qiong Wu et.al.	2503.12799	link
2025-03-18	DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding	Xinyu Ma et.al.	2503.12797	link
2025-03-17	Quantum-Enhanced LLM Efficient Fine Tuning	Xiaofei Kong et.al.	2503.12790	null
2025-03-17	SAM2 for Image and Video Segmentation: A Comprehensive Survey	Zhang Jiaxing et.al.	2503.12781	null
2025-03-17	NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models	Sung-Yeon Park et.al.	2503.12772	null
2025-03-17	A Survey on Human Interaction Motion Generation	Kewei Sui et.al.	2503.12763	link
2025-03-17	RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning	Jerry Huang et.al.	2503.12759	null
2025-03-17	VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis	Zhifeng Wang et.al.	2503.12758	null
2025-03-17	MAP: Multi-user Personalization with Collaborative LLM-powered Agents	Christine Lee et.al.	2503.12757	link
2025-03-17	Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering	Kenneth J. K. Ong et.al.	2503.12722	null
2025-03-17	Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective	Luca Collini et.al.	2503.12721	null
2025-03-16	AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration	Javier Tirado-Garín et.al.	2503.12701	null
2025-03-16	A Continual Learning-driven Model for Accurate and Generalizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT	Dazhou Guo et.al.	2503.12698	null
2025-03-16	AI Agents: Evolution, Architecture, and Real-World Applications	Naveen Krishnan et.al.	2503.12687	null
2025-03-16	ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory	Liangyu Wang et.al.	2503.12668	link
2025-03-16	Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility	Jacob Chmura et.al.	2503.12667	null
2025-03-16	Quantum Chemistry Driven Molecular Inverse Design with Data-free Reinforcement Learning	Francesco Calcagno et.al.	2503.12653	null
2025-03-16	UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing	Tsu-Jui Fu et.al.	2503.12652	null
2025-03-16	VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures	Yoo Yeon Sung et.al.	2503.12651	null
2025-03-16	FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization	Hao Mark Chen et.al.	2503.12649	link
2025-03-16	LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization	Alessio Spagnoletti et.al.	2503.12615	null
2025-03-16	VISO-Grasp: Vision-Language Informed Spatial Object-centric 6-DoF Active View Planning and Grasping in Clutter and Invisibility	Yitian Shi et.al.	2503.12609	null
2025-03-16	Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey	Yaoting Wang et.al.	2503.12605	link
2025-03-16	SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language Models	Kunyang Sun et.al.	2503.12602	link
2025-03-14	From few to many maps: A fast map-level emulator for extreme augmentation of CMB systematics datasets	P. Campeti et.al.	2503.11643	link
2025-03-14	Gradient-bridged Posterior: Bayesian Inference for Models with Implicit Functions	Cheng Zeng et.al.	2503.11637	null
2025-03-14	ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning	Xinyi Wang et.al.	2503.11617	link
2025-03-14	Pathology Image Compression with Pre-trained Autoencoders	Srikar Yellapragada et.al.	2503.11591	null
2025-03-14	Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space	Zhiliang Chen et.al.	2503.11586	link
2025-03-14	SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion	Ahmed Nassar et.al.	2503.11576	null
2025-03-14	Synthesizing Access Control Policies using Large Language Models	Adarsh Vatsa et.al.	2503.11573	null
2025-03-14	Implicit Bias-Like Patterns in Reasoning Models	Messi H. J. Lee et.al.	2503.11572	null
2025-03-14	VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity	Jing Bi et.al.	2503.11557	null
2025-03-14	AugGen: Synthetic Augmentation Can Improve Discriminative Models	Parsa Rahimi et.al.	2503.11544	null
2025-03-14	Potential of large language model-powered nudges for promoting daily water and energy conservation	Zonghan Li et.al.	2503.11531	null
2025-03-14	Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models	Hao Cheng et.al.	2503.11519	null
2025-03-14	HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models	Ziqin Zhou et.al.	2503.11513	null
2025-03-14	Perfect Stabilization of Biomolecular Adhesions under Load	Anton F. Burnet et.al.	2503.11510	null
2025-03-14	V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning	Zixu Cheng et.al.	2503.11495	null
2025-03-14	A Review of DeepSeek Models’ Key Innovative Techniques	Chengen Wang et.al.	2503.11486	null
2025-03-14	Exponential Quantum Advantage for Simulating Open Classical Systems	Agi Villanyi et.al.	2503.11483	null
2025-03-14	T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation	Seyed Mohammad Hadi Hosseini et.al.	2503.11481	null
2025-03-14	Integrating LLMs in Gamified Systems	Carlos J. Costa et.al.	2503.11458	null
2025-03-14	D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning	Jia Zhang et.al.	2503.11441	null
2025-03-14	Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models	Xu Liu et.al.	2503.11411	null
2025-03-14	A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving	Tin Stribor Sohn et.al.	2503.11400	null
2025-03-14	Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages	Jiyeong Kim et.al.	2503.11384	null
2025-03-14	Modeling Subjectivity in Cognitive Appraisal with Language Models	Yuxiang Zhou et.al.	2503.11381	null
2025-03-14	Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches	Panggih Kusuma Ningrum et.al.	2503.11376	null
2025-03-14	Cornstarch: Distributed Multimodal Training Must Be Multimodality-Aware	Insu Jang et.al.	2503.11367	link
2025-03-14	PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vision Language Models	Mayank Nautiyal et.al.	2503.11360	null
2025-03-14	Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data Analysis	Zhenyi Zhang et.al.	2503.11347	null
2025-03-14	AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation	Fengyu Li et.al.	2503.11346	link
2025-03-14	Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models	Aissatou Diallo et.al.	2503.11336	null
2025-03-14	Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking	Ziyi Wang et.al.	2503.11324	null
2025-03-14	MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens	Jeong Hun Yeo et.al.	2503.11315	link
2025-03-14	Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering	Xinyu Tang et.al.	2503.11314	link
2025-03-14	Are formal and functional linguistic mechanisms dissociated?	Michael Hanna et.al.	2503.11302	link
2025-03-14	GNNs as Predictors of Agentic Workflow Performances	Yuanshuo Zhang et.al.	2503.11301	link
2025-03-14	BriLLM: Brain-inspired Large Language Model	Hai Zhao et.al.	2503.11299	null
2025-03-14	High-Dimensional Interlingual Representations of Large Language Models	Bryan Wilie et.al.	2503.11280	null
2025-03-14	When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective	Alireza Mousavi-Hosseini et.al.	2503.11272	link
2025-03-14	CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy	Jonas Utz et.al.	2503.11266	null
2025-03-14	Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model	Haoyang Huang et.al.	2503.11251	link
2025-03-14	Reasoning-Grounded Natural Language Explanations for Language Models	Vojtech Cahlik et.al.	2503.11248	link
2025-03-14	LLMPerf: GPU Performance Modeling meets Large Language Models	Khoi N. M. Nguyen et.al.	2503.11244	link
2025-03-14	PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders	Ahmed Frikha et.al.	2503.11232	null
2025-03-14	Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment	Ke Wang et.al.	2503.11229	null
2025-03-14	GKG-LLM: A Unified Framework for Generalized Knowledge Graph Construction	Jian Zhang et.al.	2503.11227	null
2025-03-14	Heterogeneously structured compartmental models of epidemiological systems: from individual-level processes to population-scale dynamics	Emanuele Bernardi et.al.	2503.11225	null
2025-03-14	Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty?	Giacomo Camposampiero et.al.	2503.11207	link
2025-03-14	LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs	Leqi Shen et.al.	2503.11205	null
2025-03-14	Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering	Gang Li et.al.	2503.11197	link
2025-03-14	FastVID: Dynamic Density Pruning for Fast Video Large Language Models	Leqi Shen et.al.	2503.11187	link
2025-03-14	Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification	Yingjie Zhang et.al.	2503.11185	null
2025-03-14	Palette of Language Models: A Solver for Controlled Text Generation	Zhe Yang et.al.	2503.11182	null
2025-03-14	Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity	Chi Xu et.al.	2503.11164	null
2025-03-14	Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models	Shaotian Yan et.al.	2503.11154	null
2025-03-14	SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets	Hao Liu et.al.	2503.11133	null
2025-03-14	Don’t Forget It! Conditional Sparse Autoencoder Clamping Works for Unlearning	Matthew Khoriaty et.al.	2503.11127	null
2025-03-14	Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers	Yifang Chen et.al.	2503.11108	null
2025-03-14	Quantifying Interpretability in CLIP Models with Concept Consistency	Avinash Madasu et.al.	2503.11103	null
2025-03-14	Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space	Weichen Zhan et.al.	2503.11094	link
2025-03-14	OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning	Yuan Liu et.al.	2503.11093	null
2025-03-14	EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks	Yi Zhang et.al.	2503.11089	null
2025-03-14	A Survey of Cross-domain Graph Learning: Progress and Future Directions	Haihong Zhao et.al.	2503.11086	link
2025-03-14	Prompt Alchemy: Automatic Prompt Refinement for Enhancing Code Generation	Sixiang Ye et.al.	2503.11085	link
2025-03-14	LLMs are Bug Replicators: An Empirical Study on LLMs’ Capability in Completing Bug-prone Code	Liwei Guo et.al.	2503.11082	link
2025-03-14	Understanding Flatness in Generative Models: Its Role and Benefits	Taehwan Lee et.al.	2503.11078	null
2025-03-14	Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities	Xueyang Zhou et.al.	2503.11074	null
2025-03-14	Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models	Hongyang Wei et.al.	2503.11073	link
2025-03-14	Falcon: A Remote Sensing Vision-Language Foundation Model	Kelu Yao et.al.	2503.11070	link
2025-03-14	API Agents vs. GUI Agents: Divergence and Convergence	Chaoyun Zhang et.al.	2503.11069	null
2025-03-14	DeepSeek Powered Solid Dosage Formulation Design and Development	Leqi Lin et.al.	2503.11068	null
2025-03-14	Generative Modelling for Mathematical Discovery	Jordan S. Ellenberg et.al.	2503.11061	link
2025-03-14	BannerAgency: Advertising Banner Design with Multimodal LLM Agents	Heng Wang et.al.	2503.11060	null
2025-03-14	Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization	Kyle Sargent et.al.	2503.11056	null
2025-03-14	Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning	Jieyi Tan et.al.	2503.11051	null
2025-03-14	PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing	Hasan Iqbal et.al.	2503.11044	null
2025-03-14	Beyond A Single AI Cluster: A Survey of Decentralized LLM Training	Haotian Dong et.al.	2503.11023	null
2025-03-14	An LLM’s Attempts to Adapt to Diverse Software Engineers’ Problem-Solving Styles: More Inclusive & Equitable?	Andrew Anderson et.al.	2503.11018	null
2025-03-14	RONA: Pragmatically Diverse Image Captioning with Coherence Relations	Aashish Anantha Ramakrishnan et.al.	2503.10997	link
2025-03-14	TigerLLM – A Family of Bangla Large Language Models	Nishat Raihan et.al.	2503.10995	link
2025-03-14	Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium	Kaizhao Liu et.al.	2503.10990	link
2025-03-14	From Dionysius Emerges Apollo – Learning Patterns and Abstractions from Perceptual Sequences	Shuchen Wu et.al.	2503.10973	null
2025-03-14	Combinatorial Optimization for All: Using LLMs to Aid Non-Experts in Improving Optimization Algorithms	Camilo Chacón Sartori et.al.	2503.10968	null
2025-03-13	Empirical Computation	Eric Tang et.al.	2503.10954	null
2025-03-13	Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations	Piyush Gupta et.al.	2503.10941	null
2025-03-13	ChatGPT Encounters Morphing Attack Detection: Zero-Shot MAD with Multi-Modal Large Language Models and General Vision Models	Haoyu Zhang et.al.	2503.10937	null
2025-03-13	OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses	Angela Lopez-Cardona et.al.	2503.10927	link
2025-03-13	Learning to Inference Adaptively for Multimodal Large Language Models	Zhuoyan Xu et.al.	2503.10905	null
2025-03-13	Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification	Nathaniel Lesperance et.al.	2503.10886	null
2025-03-13	Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data	Paul Quinlan et.al.	2503.10883	null
2025-03-13	SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable	Jiaxin Zhang et.al.	2503.10881	null
2025-03-13	Teamwork makes the dream work: LLMs-Based Agents for GitHub README.MD Summarization	Duc S. H. Nguyen et.al.	2503.10876	null
2025-03-13	Panopticon: Advancing Any-Sensor Foundation Models for Earth Observation	Leonard Waldmann et.al.	2503.10845	link
2025-03-13	Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs?	So Young Lee et.al.	2503.10838	link
2025-03-13	Exploiting Concavity Information in Gaussian Process Contextual Bandit Optimization	Kevin Li et.al.	2503.10836	null
2025-03-13	Thinking Machines: A Survey of LLM based Reasoning Strategies	Dibyanayan Bandyopadhyay et.al.	2503.10814	null
2025-03-13	HALURust: Exploiting Hallucinations of Large Language Models to Detect Vulnerabilities in Rust	Yu Luo et.al.	2503.10793	null
2025-03-13	Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview	Norbert Tihanyi et.al.	2503.10784	null
2025-03-13	Large-scale Pre-training for Grounded Video Caption Generation	Evangelos Kazakos et.al.	2503.10781	link
2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link
2025-03-13	HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model	Jiaming Liu et.al.	2503.10631	null
2025-03-13	UniGoal: Towards Universal Zero-shot Goal-oriented Navigation	Hang Yin et.al.	2503.10630	null
2025-03-13	From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM	Kshitij Ambilduke et.al.	2503.10620	link
2025-03-13	Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search	Andy Zhou et.al.	2503.10619	null
2025-03-13	Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models	Andy Zhou et.al.	2503.10617	null
2025-03-13	R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization	Yi Yang et.al.	2503.10615	link
2025-03-13	CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing	Advait Gupta et.al.	2503.10613	link
2025-03-13	TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention	Jinhao Duan et.al.	2503.10602	link
2025-03-13	CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models	Hao He et.al.	2503.10592	null
2025-03-13	Unlock the Power of Unlabeled Data in Language Driving Model	Chaoqun Wang et.al.	2503.10586	null
2025-03-13	Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures	Nina Vesseron et.al.	2503.10576	null
2025-03-13	Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models	Afrar Jahin et.al.	2503.10573	null
2025-03-13	ASIDE: Architectural Separation of Instructions and Data in Language Models	Egor Zverev et.al.	2503.10566	null
2025-03-13	Short-term AI literacy intervention does not reduce over-reliance on incorrect ChatGPT recommendations	Brett Puppart et.al.	2503.10556	null
2025-03-13	KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation	Zixian Liu et.al.	2503.10546	null
2025-03-13	DP-GPL: Differentially Private Graph Prompt Learning	Jing Xu et.al.	2503.10544	null
2025-03-13	Foundation Models for Atomistic Simulation of Chemistry and Materials	Eric C.-Y. Yuan et.al.	2503.10538	null
2025-03-13	PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models	Zilu Guo et.al.	2503.10529	null
2025-03-13	Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set	Florian Eichin et.al.	2503.10515	link
2025-03-13	Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression	Hooman Shahrokhi et.al.	2503.10512	null
2025-03-13	SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models	Sahar Admoni et.al.	2503.10509	null
2025-03-13	TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models	Xudong Tan et.al.	2503.10501	link
2025-03-13	MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation	Weihao Xuan et.al.	2503.10497	null
2025-03-13	Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents	Hanxu Hu et.al.	2503.10494	link
2025-03-13	Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion	Evgeniia Vu et.al.	2503.10488	null
2025-03-13	LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions	Gaurav Kumar Gupta et.al.	2503.10486	null
2025-03-13	Siamese Foundation Models for Crystal Structure Prediction	Liming Wu et.al.	2503.10471	null
2025-03-13	DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation	Wenhao Hu et.al.	2503.10452	null
2025-03-13	4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models	Wanhua Li et.al.	2503.10437	link
2025-03-13	Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback	Derun Li et.al.	2503.10434	null
2025-03-13	BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models	Can Zheng et.al.	2503.10432	null
2025-03-13	Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning	Jonathan Shaki et.al.	2503.10408	null
2025-03-13	RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models	Yijing Lin et.al.	2503.10406	null
2025-03-13	RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing	Fengxiang Wang et.al.	2503.10392	link
2025-03-13	CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance	Yufan Deng et.al.	2503.10391	null
2025-03-13	SPPO: Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading	Qiaoling Chen et.al.	2503.10377	null
2025-03-13	Probabilistic Forecasting via Autoregressive Flow Matching	Ahmed El-Gazzar et.al.	2503.10375	null
2025-03-13	G-Boost: Boosting Private SLMs with General LLMs	Yijiang Fan et.al.	2503.10367	null
2025-03-13	Piece it Together: Part-Based Concepting with IP-Priors	Elad Richardson et.al.	2503.10365	null
2025-03-13	BioSerenity-E1: a self-supervised EEG model for medical applications	Ruggero G. Bettinardi et.al.	2503.10362	null
2025-03-13	Collaborative Speculative Inference for Efficient LLM Inference Serving	Luyao Gao et.al.	2503.10325	null
2025-03-13	IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification	Yuhao Wang et.al.	2503.10324	null
2025-03-13	Towards Fast, Memory-based and Data-Efficient Vision-Language Policy	Haoxuan Li et.al.	2503.10322	null
2025-03-13	Capturing Semantic Flow of ML-based Systems	Shin Yoo et.al.	2503.10310	null
2025-03-13	Test Amplification for REST APIs Using “Out-of-the-box” Large Language Models	Tolgahan Bardakci et.al.	2503.10306	null
2025-03-13	CoDiPhy: A General Framework for Applying Denoising Diffusion Models to the Physical Layer of Wireless Communication Systems	Peyman Neshaastegaran et.al.	2503.10297	null
2025-03-13	VisualPRM: An Effective Process Reward Model for Multimodal Reasoning	Weiyun Wang et.al.	2503.10291	null
2025-03-13	MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment	Hao Zhou et.al.	2503.10287	null
2025-03-13	An Expanded Massive Multilingual Dataset for High-Performance Language Technologies	Laurie Burchell et.al.	2503.10267	link
2025-03-13	Numerical Error Analysis of Large Language Models	Stanislav Budzinskiy et.al.	2503.10251	null
2025-03-13	LLM Agents Display Human Biases but Exhibit Distinct Learning Patterns	Idan Horowitz et.al.	2503.10248	null
2025-03-13	MinorBench: A hand-built benchmark for content-based risks for children	Shaun Khoo et.al.	2503.10242	null
2025-03-13	SCOOP: A Framework for Proactive Collaboration and Social Continual Learning through Natural Language Interaction and Causal Reasoning	Dimitri Ognibene et.al.	2503.10241	null
2025-03-13	Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA	Zhixuan Li et.al.	2503.10225	null
2025-03-13	Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout	Shilong Wang et.al.	2503.10217	null
2025-03-13	Adaptive Preference Aggregation	Benjamin Heymann et.al.	2503.10215	null
2025-03-13	Singular Value Fine-tuning for Few-Shot Class-Incremental Learning	Zhiwu Wang et.al.	2503.10214	null
2025-03-13	Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation	Henglyu Liu et.al.	2503.10211	null
2025-03-13	LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents	Boyu Chen et.al.	2503.10200	null
2025-03-13	Robustness Tokens: Towards Adversarial Robustness of Transformers	Brian Pulfer et.al.	2503.10191	link
2025-03-13	“Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding	Hyunbin Jin et.al.	2503.10167	null
2025-03-13	Retrieval-Augmented Generation with Hierarchical Knowledge	Haoyu Huang et.al.	2503.10150	link
2025-03-13	Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding	Jinze Li et.al.	2503.10135	null
2025-03-13	PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models	Runze He et.al.	2503.10127	null
2025-03-13	Hybrid Agents for Image Restoration	Bingchen Li et.al.	2503.10120	null
2025-03-13	StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error	Shu-Xun Yang et.al.	2503.10105	link
2025-03-13	AgentDAO: Synthesis of Proposal Transactions Via Abstract DAO Semantics	Lin Ao et.al.	2503.10099	null
2025-03-13	Semantic Latent Motion for Portrait Video Generation	Qiyuan Zhang et.al.	2503.10096	null
2025-03-13	Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text	Avinash Patil et.al.	2503.10095	link
2025-03-13	Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model	Qiyuan Deng et.al.	2503.10093	null
2025-03-13	Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset	Xintong Dong et.al.	2503.10092	null
2025-03-13	Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective	Xiang Zhang et.al.	2503.10084	null
2025-03-13	AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption	Joonsung Jeon et.al.	2503.10081	link
2025-03-13	Information Density Principle for MLLM Benchmarks	Chunyi Li et.al.	2503.10079	link
2025-03-13	VMBench: A Benchmark for Perception-Aligned Video Motion Generation	Xinrang Ling et.al.	2503.10076	link
2025-03-13	SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation	Xiangyu Shi et.al.	2503.10069	null
2025-03-13	Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations	Ho Hin Lee et.al.	2503.10057	link
2025-03-13	Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy	Ziqi Jia et.al.	2503.10049	null
2025-03-13	How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game	Ziyue Wang et.al.	2503.10042	link
2025-03-13	NumScout: Unveiling Numerical Defects in Smart Contracts using LLM-Pruning Symbolic Execution	Jiachi Chen et.al.	2503.10041	link
2025-03-13	OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model	Bowen Zhang et.al.	2503.10009	link
2025-03-13	TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs	Yunxiao Wang et.al.	2503.09994	null
2025-03-13	Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes	JunYong Choi et.al.	2503.09993	null
2025-03-13	From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs	Rohan Bhatnagar et.al.	2503.09986	null
2025-03-13	ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content	Bhavik Chandna et.al.	2503.09964	null
2025-03-13	Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification	Jiayu Jiang et.al.	2503.09962	link
2025-03-13	RMG: Real-Time Expressive Motion Generation with Self-collision Avoidance for 6-DOF Companion Robotic Arms	Jiansheng Li et.al.	2503.09959	null
2025-03-13	Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey	Yu Qiao et.al.	2503.09956	null
2025-03-13	UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?	Yuanxin Liu et.al.	2503.09949	link
2025-03-13	PluralLLM: Pluralistic Alignment in LLMs via Federated Learning	Mahmoud Srewa et.al.	2503.09925	null
2025-03-13	Inter-environmental world modeling for continuous and compositional dynamics	Kohei Hayashi et.al.	2503.09911	null
2025-03-12	Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets	Zahra Abbasiantaeb et.al.	2503.09902	link
2025-03-12	Improving the Reusability of Conversational Search Test Collections	Zahra Abbasiantaeb et.al.	2503.09899	link
2025-03-12	What’s In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models	Abhipsha Das et.al.	2503.09894	link
2025-03-12	On the contraction properties of Sinkhorn semigroups	O. Deniz Akyildiz et.al.	2503.09887	null
2025-03-12	CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation	Hariprasath Govindarajan et.al.	2503.09878	null
2025-03-12	LuciBot: Automated Robot Policy Learning from Generated Videos	Xiaowen Qiu et.al.	2503.09871	null
2025-03-12	Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis	Nahid Ul Islam et.al.	2503.09860	link
2025-03-12	Media and responsible AI governance: a game-theoretic and LLM analysis	Nataliya Balabanova et.al.	2503.09858	null
2025-03-12	MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System	Jihao Zhao et.al.	2503.09600	link
2025-03-12	How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation	Ruohao Guo et.al.	2503.09598	link
2025-03-12	PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop	Chenyu Li et.al.	2503.09595	link
2025-03-12	SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment	Katrin Renz et.al.	2503.09594	null
2025-03-12	BIMBA: Selective-Scan Compression for Long-Range Video Question Answering	Md Mohaiminul Islam et.al.	2503.09590	link
2025-03-12	Minimax Optimality of the Probability Flow ODE for Diffusion Models	Changxiao Cai et.al.	2503.09583	null
2025-03-12	Cost-Optimal Grouped-Query Attention for Long-Context LLMs	Yingfa Chen et.al.	2503.09579	link
2025-03-12	Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks	Lutfi Eren Erdogan et.al.	2503.09572	null
2025-03-13	Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models	Qiguang Chen et.al.	2503.09567	null
2025-03-12	GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals	Shuokang Huang et.al.	2503.09537	null
2025-03-13	Large Language Models for Multi-Facility Location Mechanism Design	Nguyen Thach et.al.	2503.09533	null
2025-03-12	Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning	Bowen Jin et.al.	2503.09516	link
2025-03-12	ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning	Ziyu Wan et.al.	2503.09501	link
2025-03-12	Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection	Romain Thoreau et.al.	2503.09493	null
2025-03-12	DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction	Junjie Zhou et.al.	2503.09491	link
2025-03-12	Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness	Beier Zhu et.al.	2503.09487	null
2025-03-12	BAMBI: Developing Baby Language Models for Italian	Alice Suozzi et.al.	2503.09481	null
2025-03-12	Explicit Learning and the LLM in Machine Translation	Malik Marmonier et.al.	2503.09454	link
2025-03-12	How Well Does Your Tabular Generator Learn the Structure of Tabular Data?	Xiangjian Jiang et.al.	2503.09453	link
2025-03-12	Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models	Julian Spravil et.al.	2503.09443	null
2025-03-12	CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection	Richard A. Dubniczky et.al.	2503.09433	link
2025-03-12	Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter	Kechun Xu et.al.	2503.09423	null
2025-03-12	VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary	Kevin Qinghong Lin et.al.	2503.09402	link
2025-03-12	ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation	Tobias Christian Nauen et.al.	2503.09399	link
2025-03-12	Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training	Jiatong Xia et.al.	2503.09396	null
2025-03-12	Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with LLMs	Jiani Huang et.al.	2503.09382	link
2025-03-12	Towards Graph Foundation Models: A Transferability Perspective	Yuxiang Wang et.al.	2503.09363	null
2025-03-12	Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X	Katharina Prasse et.al.	2503.09361	null
2025-03-12	RetSTA: An LLM-Based Approach for Standardizing Clinical Fundus Image Reports	Jiushen Cai et.al.	2503.09358	null
2025-03-12	Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts	Hongyu Chen et.al.	2503.09347	null
2025-03-12	NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model	Yuzhi Lai et.al.	2503.09335	link
2025-03-12	CyberLLMInstruct: A New Dataset for Analysing Safety of Fine-Tuned LLMs Using Cyber Security Data	Adel ElZemity et.al.	2503.09334	link
2025-03-12	A Survey on Enhancing Causal Reasoning Ability of Large Language Models	Xin Li et.al.	2503.09326	null
2025-03-12	Revealing the Implicit Noise-based Imprint of Generative Models	Xinghan Li et.al.	2503.09314	null
2025-03-12	xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation	Elio Musacchio et.al.	2503.09313	null
2025-03-12	Adaptive political surveys and GPT-4: Tackling the cold start problem with simulated user interactions	Fynn Bachmann et.al.	2503.09311	link
2025-03-12	Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference	Mohammad Siavashi et.al.	2503.09304	null
2025-03-12	Prompt Inference Attack on Distributed Large Language Model Inference Frameworks	Xinjian Luo et.al.	2503.09291	null
2025-03-12	Crowdsourced Homophily Ties Based Graph Annotation Via Large Language Model	Yu Bu et.al.	2503.09281	null
2025-03-12	Fine-Tuning Large Language Models for Educational Support: Leveraging Gagne’s Nine Events of Instruction for Lesson Planning	Linzhao Jia et.al.	2503.09276	null
2025-03-12	COLA: A Scalable Multi-Agent Framework For Windows UI Task Automation	Di Zhao et.al.	2503.09263	link
2025-03-13	DeepInnovation AI: A Global Dataset Mapping the AI innovation from Academic Research to Industrial Patents	Haixing Gong et.al.	2503.09257	null
2025-03-12	City Models: Past, Present and Future Prospects	Helge Ritter et.al.	2503.09237	null
2025-03-12	LREF: A Novel LLM-based Relevance Framework for E-commerce	Tian Tang et.al.	2503.09223	null
2025-03-12	Rethinking Prompt-based Debiasing in Large Language Models	Xinyi Yang et.al.	2503.09219	null
2025-03-12	Why LLMs Cannot Think and How to Fix It	Marius Jahrens et.al.	2503.09211	null
2025-03-12	Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model	Ali Vosoughi et.al.	2503.09205	null
2025-03-12	Token Weighting for Long-Range Language Modeling	Falko Helm et.al.	2503.09202	link
2025-03-12	WonderVerse: Extendable 3D Scene Generation with Video Generative Models	Hao Feng et.al.	2503.09160	null
2025-03-12	FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models	Fufangchen Zhao et.al.	2503.09158	null
2025-03-12	AdaptAI: A Personalized Solution to Sense Your Stress, Fix Your Mess, and Boost Productivity	Rushiraj Gadhvi et.al.	2503.09150	link
2025-03-12	Generative Frame Sampler for Long Video Understanding	Linli Yao et.al.	2503.09146	null
2025-03-12	Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding	Haoyu Zhang et.al.	2503.09143	null
2025-03-12	AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks	Jin Li et.al.	2503.09124	null
2025-03-12	Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?	Yuechen Xie et.al.	2503.09122	link
2025-03-12	GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models	Yue Wang et.al.	2503.09117	null
2025-03-12	VaxGuard: A Multi-Generator, Multi-Type, and Multi-Role Dataset for Detecting LLM-Generated Vaccine Misinformation	Syed Talal Ahmad et.al.	2503.09103	null
2025-03-12	Multi-Modal Foundation Models for Computational Pathology: A Survey	Dong Li et.al.	2503.09091	null
2025-03-12	Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows	Chengyue Gong et.al.	2503.09069	null
2025-03-12	Probing Latent Subspaces in LLM for AI Security: Identifying and Manipulating Adversarial States	Xin Wei Chia et.al.	2503.09066	null
2025-03-12	Discovering Influential Neuron Path in Vision Transformers	Yifan Wang et.al.	2503.09046	null
2025-03-12	ManeuverGPT Agentic Control for Safe Autonomous Stunt Maneuvers	Shawn Azdam et.al.	2503.09035	link
2025-03-12	Teaching LLMs How to Learn with Contextual Fine-Tuning	Younwoo Choi et.al.	2503.09032	null
2025-03-12	DAST: Difficulty-Aware Self-Training on Large Language Models	Boyang Xue et.al.	2503.09029	link
2025-03-12	Aligning to What? Limits to RLHF Based Alignment	Logan Barnhart et.al.	2503.09025	null
2025-03-13	Prompt Inversion Attack against Collaborative Inference of Large Language Models	Wenjie Qu et.al.	2503.09022	null
2025-03-12	Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning	Yuan Jiang et.al.	2503.09020	link
2025-03-12	Natural Humanoid Robot Locomotion with Generative Motion Prior	Haodong Zhang et.al.	2503.09015	null
2025-03-12	Leveraging Retrieval Augmented Generative LLMs For Automated Metadata Description Generation to Enhance Data Catalogs	Mayank Singh et.al.	2503.09003	null
2025-03-12	KNighter: Transforming Static Analysis with LLM-Synthesized Checkers	Chenyuan Yang et.al.	2503.09002	link
2025-03-12	JBFuzz: Jailbreaking LLMs Efficiently and Effectively Using Fuzzing	Vasudev Gohil et.al.	2503.08990	null
2025-03-12	I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data?	Yuhang Liu et.al.	2503.08980	null
2025-03-12	Large Language Models-Aided Program Debloating	Bo Lin et.al.	2503.08969	null
2025-03-11	Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation	Yu Wang et.al.	2503.08963	null
2025-03-11	FP3: A 3D Foundation Policy for Robotic Manipulation	Rujia Yang et.al.	2503.08950	null
2025-03-11	Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model	Zilong Deng et.al.	2503.08934	null
2025-03-11	ARCHED: A Human-Centered Framework for Transparent, Responsible, and Collaborative AI-Assisted Instructional Design	Hongming Li et.al.	2503.08931	null
2025-03-11	Enhancing Large Language Models for Hardware Verification: A Novel SystemVerilog Assertion Dataset	Anand Menon et.al.	2503.08923	link
2025-03-11	Backtracking for Safety	Bilgehan Sel et.al.	2503.08919	null
2025-03-11	Multilevel Generative Samplers for Investigating Critical Phenomena	Ankur Singha et.al.	2503.08918	link
2025-03-11	Reconstruct Anything Model: a lightweight foundation model for computational imaging	Matthieu Terris et.al.	2503.08915	null
2025-03-11	Interpreting the Repeated Token Phenomenon in Large Language Models	Itay Yona et.al.	2503.08908	link
2025-03-11	A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation	Forough Fazeliasl et.al.	2503.08902	null
2025-03-11	Seeing What’s Not There: Spurious Correlation in Multimodal LLMs	Parsa Hosseini et.al.	2503.08884	null
2025-03-11	LLMs Know What to Drop: Self-Attention Guided KV Cache Eviction for Efficient Long-Context Inference	Guangtao Wang et.al.	2503.08879	null
2025-03-11	Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs	Rafael Carranza et.al.	2503.08857	null
2025-03-11	Contrastive Speaker-Aware Learning for Multi-party Dialogue Generation with LLMs	Tianyu Sun et.al.	2503.08842	null
2025-03-11	ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness	Ce Guo et.al.	2503.08823	null
2025-03-11	Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations	Danielle Villa et.al.	2503.08815	null
2025-03-11	Robust Multi-Objective Controlled Decoding of Large Language Models	Seongho Son et.al.	2503.08796	link
2025-03-11	Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs	Ariba Khan et.al.	2503.08688	link
2025-03-11	OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models	Jialv Zou et.al.	2503.08686	link
2025-03-11	Self-Taught Self-Correction for Small Language Models	Viktor Moskvoretskii et.al.	2503.08681	null
2025-03-11	GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing	Yuanhao Wang et.al.	2503.08678	null
2025-03-12	OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting	Yongsheng Yu et.al.	2503.08677	null
2025-03-11	Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields	Tobias Kreiman et.al.	2503.08674	null
2025-03-11	REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder	Yitian Zhang et.al.	2503.08665	null
2025-03-11	Generating Robot Constitutions & Benchmarks for Semantic Safety	Pierre Sermanet et.al.	2503.08663	null
2025-03-11	Exploring the Word Sense Disambiguation Capabilities of Large Language Models	Pierpaolo Basile et.al.	2503.08662	null
2025-03-11	YuE: Scaling Open Foundation Models for Long-Form Music Generation	Ruibin Yuan et.al.	2503.08638	link
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-11	EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments	Dongping Li et.al.	2503.08604	link
2025-03-11	NSF-SciFy: Mining the NSF Awards Database for Scientific Claims	Delip Rao et.al.	2503.08600	null
2025-03-11	3D Point Cloud Generation via Autoregressive Up-sampling	Ziqiao Meng et.al.	2503.08594	null
2025-03-11	Proc4Gem: Foundation models for physical agency through procedural generation	Yixin Lin et.al.	2503.08593	null
2025-03-11	HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding	Shehreen Azad et.al.	2503.08585	null
2025-03-11	RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding	Xichen Tan et.al.	2503.08576	null
2025-03-11	DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process	Minjun Zhu et.al.	2503.08569	null
2025-03-11	Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies	Chen Xu et.al.	2503.08558	null
2025-03-11	Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs	Wanyong Feng et.al.	2503.08551	null
2025-03-11	Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling	Craig Messner et.al.	2503.08550	null
2025-03-11	Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation	Xian Gao et.al.	2503.08549	null
2025-03-11	DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering	Sher Badshah et.al.	2503.08542	null
2025-03-11	Mellow: a small audio language model for reasoning	Soham Deshmukh et.al.	2503.08540	link
2025-03-11	Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation	Andres M Bran et.al.	2503.08537	link
2025-03-11	ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems	Siddhant Arora et.al.	2503.08533	null
2025-03-11	GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training	Tong Wei et.al.	2503.08525	null
2025-03-11	Position-Aware Depth Decay Decoding ($D^3$): Boosting Large Language Model Inference Efficiency	Siqi Fan et.al.	2503.08524	null
2025-03-11	High-Quality 3D Head Reconstruction from Any Single Portrait Image	Jianfu Zhang et.al.	2503.08516	null
2025-03-11	LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning	Weijie Zhou et.al.	2503.08508	link
2025-03-11	Referring to Any Person	Qing Jiang et.al.	2503.08507	link
2025-03-11	ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews	Xian Gao et.al.	2503.08506	null
2025-03-11	Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models	Han Cao et.al.	2503.08495	null
2025-03-11	TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting	Fengyi Zhang et.al.	2503.08485	null
2025-03-11	Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum	Shengpeng Xiao et.al.	2503.08484	null
2025-03-11	PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability	Weijie Zhou et.al.	2503.08481	link
2025-03-11	FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework	Jianian Zhu et.al.	2503.08461	null
2025-03-11	KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents	Hsin-Ling Hsu et.al.	2503.08452	link
2025-03-11	LLM-Pack: Intuitive Grocery Handling for Logistics Applications	Yannik Blei et.al.	2503.08445	null
2025-03-11	TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems	Feiyang Wu et.al.	2503.08415	link
2025-03-11	Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information	Elizaveta Kuznetsova et.al.	2503.08404	null
2025-03-11	OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning	Jiawei Zhou et.al.	2503.08398	null
2025-03-11	Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens	Qingsong Xie et.al.	2503.08377	null
2025-03-11	nnInteractive: Redefining 3D Promptable Segmentation	Fabian Isensee et.al.	2503.08373	link
2025-03-11	MetaFold: Language-Guided Multi-Category Garment Folding Framework via Trajectory Generation and Foundation Model	Haonan Chen et.al.	2503.08372	null
2025-03-11	Robust Latent Matters: Boosting Image Generation with Sampling Error	Kai Qiu et.al.	2503.08354	link
2025-03-12	Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs	Chongjun Tu et.al.	2503.08342	null
2025-03-11	Trinity: A Modular Humanoid Robot AI System	Jingkai Sun et.al.	2503.08338	null
2025-03-11	Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos	Soumya Shamarao Jahagirdar et.al.	2503.08335	null
2025-03-11	KiteRunner: Language-Driven Cooperative Local-Global Navigation Policy with UAV Mapping in Outdoor Environments	Shibo Huang et.al.	2503.08330	null
2025-03-11	Towards Scalable and Cross-Lingual Specialist Language Models for Oncology	Morteza Rohanian et.al.	2503.08323	null
2025-03-11	Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference	Pol G. Recasens et.al.	2503.08311	null
2025-03-11	Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework	Zhuo Zhi et.al.	2503.08308	null
2025-03-11	General-Purpose Aerial Intelligent Agents Empowered by Large Language Models	Ji Zhao et.al.	2503.08302	null
2025-03-12	Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study	Xian-Rong Zhang et.al.	2503.08301	null
2025-03-11	Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges	Xiaoxiao Liu et.al.	2503.08292	link
2025-03-11	PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net	Jun Yin et.al.	2503.08276	null
2025-03-11	LangTime: A Language-Guided Unified Model for Time Series Forecasting with Proximal Policy Optimization	Wenzhe Niu et.al.	2503.08271	null
2025-03-11	DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness	Yiming Zhong et.al.	2503.08257	link
2025-03-11	Aligning Text to Image in Diffusion Models is Easier Than You Think	Jaa-Yeon Lee et.al.	2503.08250	link
2025-03-11	Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices	Tao Shen et.al.	2503.08223	null
2025-03-11	EgoBlind: Towards Egocentric Visual Assistance for the Blind People	Junbin Xiao et.al.	2503.08221	link
2025-03-11	S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction	Guangting Zheng et.al.	2503.08217	null
2025-03-11	To Use or Not to Use a Universal Force Field	Denan Li et.al.	2503.08207	null
2025-03-11	Route Sparse Autoencoder to Interpret Large Language Models	Wei Shi et.al.	2503.08200	link
2025-03-11	A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models	Miao Zhang et.al.	2503.08199	null
2025-03-11	Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation	Wenlong Meng et.al.	2503.08195	link
2025-03-11	Automating Violence Detection and Categorization from Ancient Texts	Alhassan Abdelhalim et.al.	2503.08192	null
2025-03-11	RigoChat 2: an adapted language model to Spanish using a bounded dataset and reduced hardware	Gonzalo Santamaría Gómez et.al.	2503.08188	null
2025-03-11	Mutation Testing via Iterative Large Language Model-Driven Scientific Debugging	Philipp Straubinger et.al.	2503.08182	null
2025-03-12	ProtTeX: Structure-In-Context Reasoning and Editing of Proteins with Large Language Models	Zicheng Ma et.al.	2503.08179	null
2025-03-11	Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study	Veronica Bot et.al.	2503.08174	null
2025-03-11	Towards All-in-One Medical Image Re-Identification	Yuan Tian et.al.	2503.08173	link
2025-03-11	TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement	Miao Zhang et.al.	2503.08168	null
2025-03-11	FASIONAD++: Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback	Kangan Qian et.al.	2503.08162	null
2025-03-12	OASIS: Order-Augmented Strategy for Improved Code Search	Zuchen Gao et.al.	2503.08161	null
2025-03-11	Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model	Yufan Chen et.al.	2503.08156	null
2025-03-11	WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation	Jing Wang et.al.	2503.08153	null
2025-03-11	Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features	Hanbyul Lee et.al.	2503.08148	null
2025-03-11	FilmComposer: LLM-Driven Music Production for Silent Film Clips	Zhifeng Xie et.al.	2503.08147	null
2025-03-11	Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method	Fei Wang et.al.	2503.08144	null
2025-03-11	FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems	Jeongsol Kim et.al.	2503.08136	null
2025-03-11	Large Scale Multi-Task Bayesian Optimization with Large Language Models	Yimeng Zeng et.al.	2503.08131	null
2025-03-11	LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence	Renxuan Tan et.al.	2503.08123	null
2025-03-11	Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments	Soonwoo Kwon et.al.	2503.08122	null
2025-03-11	Uni$\textbf{F}^2$ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models	Junzhe Li et.al.	2503.08120	null
2025-03-11	Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models	Weiguo Gao et.al.	2503.08117	null
2025-03-11	AI-native Memory 2.0: Second Me	Jiale Wei et.al.	2503.08102	null
2025-03-12	PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models	Kyeongkook Seo et.al.	2503.08085	link
2025-03-11	Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation	Fangyuan Wang et.al.	2503.08084	null
2025-03-11	Seeing Beyond Haze: Generative Nighttime Image Dehazing	Beibei Lin et.al.	2503.08073	null
2025-03-11	Flow Matching for Discrete Systems: Efficient Free Energy Sampling Across Lattice Sizes and Temperatures	Ping Tuo et.al.	2503.08063	null
2025-03-11	Odysseus Navigates the Sirens’ Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation	Wen Luo et.al.	2503.08057	null
2025-03-11	Counterfactual Language Reasoning for Explainable Recommendation Systems	Guanrong Li et.al.	2503.08051	null
2025-03-11	SphOR: A Representation Learning Perspective on Open-set Recognition for Identifying Unknown Classes in Deep Learning Models	Nadarasar Bahavan et.al.	2503.08049	link
2025-03-11	LongProLIP: A Probabilistic Vision-Language Model with Long Context Text	Sanghyuk Chun et.al.	2503.08048	link
2025-03-11	Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection	Ying Fu Lim et.al.	2503.08045	null
2025-03-11	ObjectMover: Generative Object Movement with Video Prior	Xin Yu et.al.	2503.08037	null
2025-03-11	Learning to Search Effective Example Sequences for In-Context Learning	Xiang Gao et.al.	2503.08030	null
2025-03-11	In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents	Zhen Tan et.al.	2503.08026	null
2025-03-10	V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation	Guiwei Zhang et.al.	2503.07493	link
2025-03-10	LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?	Bangyan Li et.al.	2503.07487	null
2025-03-10	Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction	Zongzheng Zhang et.al.	2503.07485	link
2025-03-10	GenAIReading: Augmenting Human Cognition with Interactive Digital Textbooks Using Large Language Models and Image Generation Models	Ryugo Morita et.al.	2503.07463	null
2025-03-10	MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning	Xiangru Tang et.al.	2503.07459	link
2025-03-10	LLMs syntactically adapt their language use to their conversational partner	Florian Kandra et.al.	2503.07457	null
2025-03-10	Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration	Dylan J. Foster et.al.	2503.07453	null
2025-03-10	From Idea to Implementation: Evaluating the Influence of Large Language Models in Software Development – An Opinion Paper	Sargam Yadav et.al.	2503.07450	null
2025-03-10	From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics	Jaewook Lee et.al.	2503.07429	null
2025-03-10	RePO: ReLU-based Preference Optimization	Junkang Wu et.al.	2503.07426	link
2025-03-10	REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding	Yan Tai et.al.	2503.07413	link
2025-03-10	Towards Safe Robot Foundation Models	Maximilian Tölle et.al.	2503.07404	null
2025-03-10	Keeping Representation Similarity in Finetuning for Medical Image Analysis	Wenqiang Zu et.al.	2503.07399	null
2025-03-10	Revisiting Noise in Natural Language Processing for Computational Social Science	Nadav Borenstein et.al.	2503.07395	null
2025-03-10	Process-Supervised LLM Recommenders via Flow-guided Tuning	Chongming Gao et.al.	2503.07377	link
2025-03-10	Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future	Yannick Oswald et.al.	2503.07364	null
2025-03-10	RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing	Yiqing Xie et.al.	2503.07358	link
2025-03-10	Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment	Xing Xie et.al.	2503.07334	link
2025-03-10	Assessing the Macro and Micro Effects of Random Seeds on Fine-Tuning Large Language Models	Hao Zhou et.al.	2503.07329	null
2025-03-10	Dynamic Path Navigation for Motion Agents with LLM Reasoning	Yubo Zhao et.al.	2503.07323	null
2025-03-10	Experimental Exploration: Investigating Cooperative Interaction Behavior Between Humans and Large Language Model Agents	Guanxuan Jiang et.al.	2503.07320	null
2025-03-10	Self-Corrective Task Planning by Inverse Prompting with Large Language Models	Jiho Lee et.al.	2503.07317	null
2025-03-10	Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies	Luyi Jiang et.al.	2503.07306	null
2025-03-10	A Graph-based Verification Framework for Fact-Checking	Yani Huang et.al.	2503.07282	null
2025-03-10	COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition	Baiyu Chen et.al.	2503.07259	link
2025-03-10	CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting	Haicheng Liao et.al.	2503.07234	null
2025-03-10	Control Flow-Augmented Decompiler based on Large Language Model	Peipei Liu et.al.	2503.07215	null
2025-03-10	Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion	Mona Sheikh Zeinoddin et.al.	2503.07204	null
2025-03-10	A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding	Bingchen Liu et.al.	2503.07202	null
2025-03-10	Effective and Efficient Masked Image Generation Models	Zebin You et.al.	2503.07197	link
2025-03-10	Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems	Lia Shahnazaryan et.al.	2503.07195	null
2025-03-10	Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms	Jiaming Song et.al.	2503.07154	null
2025-03-10	MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark	Shengkun Ma et.al.	2503.07144	link
2025-03-10	Application of Multiple Chain-of-Thought in Contrastive Reasoning for Implicit Sentiment Analysis	Liwei Yang et.al.	2503.07140	null
2025-03-10	VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation	Hanzhi Chen et.al.	2503.07135	null
2025-03-10	Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation	Sihao Lin et.al.	2503.07125	null
2025-03-10	Quantizing Large Language Models for Code Generation: A Differentiated Replication	Alessandro Giagnorio et.al.	2503.07103	null
2025-03-10	A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images	Xiaoyi Liang et.al.	2503.07094	null
2025-03-10	Linguistic Knowledge Transfer Learning for Speech Enhancement	Kuo-Hsuan Hung et.al.	2503.07078	null
2025-03-10	DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs	Jongwoo Ko et.al.	2503.07067	link
2025-03-10	Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning	Huilin Deng et.al.	2503.07065	link
2025-03-10	TIDE: Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation	Victor Shea-Jay Huang et.al.	2503.07050	null
2025-03-10	Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion	Yongle Zhang et.al.	2503.07047	null
2025-03-10	Conditional Generative Modeling for Amorphous Multi-Element Materials	Honglin Li et.al.	2503.07043	link
2025-03-10	TCM-3CEval: A Triaxial Benchmark for Assessing Responses from Large Language Models in Traditional Chinese Medicine	Tianai Huang et.al.	2503.07041	null
2025-03-10	Bot Wars Evolved: Orchestrating Competing LLMs in a Counterstrike Against Phone Scams	Nardine Basta et.al.	2503.07036	null
2025-03-10	Multimodal Human-AI Synergy for Medical Imaging Quality Control: A Hybrid Intelligence Framework with Adaptive Dataset Curation and Closed-Loop Evaluation	Zhi Qin et.al.	2503.07032	null
2025-03-10	Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense	Yuting Hu et.al.	2503.07020	null
2025-03-10	Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning	Xintong Li et.al.	2503.07018	link
2025-03-10	HELM: Human-Preferred Exploration with Language Models	Shuhao Liao et.al.	2503.07006	null
2025-03-10	Large Language Models Often Say One Thing and Do Another	Ruoxi Xu et.al.	2503.07003	link
2025-03-10	Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning	Jiazheng Liu et.al.	2503.07002	null
2025-03-10	Utilizing Jailbreak Probability to Attack and Safeguard Multimodal LLMs	Wenzhuo Xu et.al.	2503.06989	null
2025-03-10	Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations	Jiho Jin et.al.	2503.06987	null
2025-03-10	Learning Decision Trees as Amortized Structure Inference	Mohammed Mahfoud et.al.	2503.06985	link
2025-03-10	Exploring Multimodal Perception in Large Language Models Through Perceptual Strength Ratings	Jonghyun Lee et.al.	2503.06980	null
2025-03-10	Lightweight Multimodal Artificial Intelligence Framework for Maritime Multi-Scene Recognition	Xinyu Xi et.al.	2503.06978	null
2025-03-10	Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation	Pengchen Liang et.al.	2503.06976	null
2025-03-10	ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA	Zhao Xinjie et.al.	2503.06951	null
2025-03-10	CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation	Runqi Sui et.al.	2503.06950	null
2025-03-11	LexPro-1.0 Technical Report	Haotian Chen et.al.	2503.06949	link
2025-03-10	Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection	Wentao Wu et.al.	2503.06948	null
2025-03-10	Handle Object Navigation as Weighted Traveling Repairman Problem	Ruimeng Liu et.al.	2503.06937	link
2025-03-10	Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping	Ning Ding et.al.	2503.06930	null
2025-03-10	Effect of Selection Format on LLM Performance	Yuchen Han et.al.	2503.06926	null
2025-03-10	Combinatorial Optimization via LLM-driven Iterated Fine-tuning	Pranjal Awasthi et.al.	2503.06917	null
2025-03-10	Beyond Code Generation: LLM-supported Exploration of the Program Design Space	J. D. Zamfirescu-Pereira et.al.	2503.06911	null
2025-03-10	A Query Optimization Method Utilizing Large Language Models	Zhiming Yao et.al.	2503.06902	null
2025-03-10	DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation	Xiaoliang Ju et.al.	2503.06900	null
2025-03-10	SafePlan: Leveraging Formal Logic and Chain-of-Thought Reasoning for Enhanced Safety in LLM-based Robotic Task Planning	Ike Obi et.al.	2503.06892	null
2025-03-10	ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks	Yan Yang et.al.	2503.06885	null
2025-03-10	Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help	Yuefan Cao et.al.	2503.06884	null
2025-03-10	ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration	Mengting Ai et.al.	2503.06881	link
2025-03-10	Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception	Wanjing Huang et.al.	2503.06866	link
2025-03-10	FIGLUT: An Energy-Efficient Accelerator Design for FP-INT GEMM Using Look-Up Tables	Gunho Park et.al.	2503.06862	null
2025-03-10	Enhanced Multi-Tuple Extraction for Alloys: Integrating Pointer Networks and Augmented Attention	Mengzhe Hei et.al.	2503.06861	null
2025-03-10	MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification	Xiangyan Qu et.al.	2503.06847	null
2025-03-10	GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-Thought	Sungsik Kim et.al.	2503.06832	link
2025-03-10	Towards a Multimodal MRI-Based Foundation Model for Multi-Level Feature Exploration in Segmentation, Molecular Subtyping, and Grading of Glioma	Somayeh Farahani et.al.	2503.06828	null
2025-03-10	eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference	Suraiya Tairin et.al.	2503.06823	null
2025-03-10	HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors	Siyu Li et.al.	2503.06821	link
2025-03-10	Towards Fine-Grained Video Question Answering	Wei Dai et.al.	2503.06820	null
2025-03-09	Privacy Auditing of Large Language Models	Ashwinee Panda et.al.	2503.06808	null
2025-03-09	VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation	Hritik Bansal et.al.	2503.06800	null
2025-03-09	Multimodal AI-driven Biomarker for Early Detection of Cancer Cachexia	Sabeen Ahmed et.al.	2503.06797	null
2025-03-09	RoboDesign1M: A Large-scale Dataset for Robot Design Understanding	Tri Le et.al.	2503.06796	null
2025-03-09	AutoMisty: A Multi-Agent LLM Framework for Automated Code Generation in the Misty Social Robot	Xiao Wang et.al.	2503.06791	null
2025-03-09	Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models	Tianyi Zhang et.al.	2503.06784	null
2025-03-09	Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting	Yufei Li et.al.	2503.06781	null
2025-03-09	Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators	Feng Gu et.al.	2503.06778	null
2025-03-09	Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints	Max Buckley et.al.	2503.06751	null
2025-03-09	Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models	Wenxuan Huang et.al.	2503.06749	link
2025-03-09	CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving	Rui Song et.al.	2503.06744	null
2025-03-09	Delusions of Large Language Models	Hongshen Xu et.al.	2503.06709	null
2025-03-09	Alignment for Efficient Tool Calling of Large Language Models	Hongshen Xu et.al.	2503.06708	null
2025-03-09	PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts	Ming Zhang et.al.	2503.06706	link
2025-03-09	InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models	Yuchen Yan et.al.	2503.06692	null
2025-03-09	DependEval: Benchmarking LLMs for Repository Dependency Understanding	Junjia Du et.al.	2503.06689	link
2025-03-09	UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion	Gongbo Zhang et.al.	2503.06687	null
2025-03-09	FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation	Wei Li et.al.	2503.06680	null
2025-03-09	Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets	Tommaso Bendinelli et.al.	2503.06664	null
2025-03-07	Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints	Parameswaran Kamalaruban et.al.	2503.05684	null
2025-03-07	Understanding the Limits of Lifelong Knowledge Editing in LLMs	Lukas Thede et.al.	2503.05683	null
2025-03-07	AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data	Zengqun Zhao et.al.	2503.05665	link
2025-03-07	A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval	Yu Zhang et.al.	2503.05659	link
2025-03-07	A functional approach for curve alignment and shape analysis	Issam-Ali Moindjié et.al.	2503.05632	null
2025-03-07	Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings	Xuanqing Liu et.al.	2503.05620	null
2025-03-07	A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models	Dong Shu et.al.	2503.05613	null
2025-03-07	From Theory to Application: A Practical Introduction to Neural Operators in Scientific Computing	Prashant K. Jha et.al.	2503.05598	link
2025-03-07	R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning	Huatong Song et.al.	2503.05592	null
2025-03-07	Evaluating open-source Large Language Models for automated fact-checking	Nicolò Fontana et.al.	2503.05565	null
2025-03-07	Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance	Bryan Etzine et.al.	2503.05551	null
2025-03-07	Leveraging Approximate Caching for Faster Retrieval-Augmented Generation	Shai Bergman et.al.	2503.05530	null
2025-03-07	PoSSUM: A Protocol for Surveying Social-media Users with Multimodal LLMs	Roberto Cerina et.al.	2503.05529	null
2025-03-07	Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations	Eren Erogullari et.al.	2503.05522	link
2025-03-07	Cognitive Bias Detection Using Advanced Prompt Engineering	Frederic Lemieux et.al.	2503.05516	null
2025-03-07	Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering	Yusong Ke et.al.	2503.05505	null
2025-03-07	Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders	Qijiong Liu et.al.	2503.05493	null
2025-03-07	Statistical Deficiency for Task Inclusion Estimation	Loïc Fosse et.al.	2503.05491	null
2025-03-07	Maximum Hallucination Standards for Domain-Specific Large Language Models	Tingmingke Lu et.al.	2503.05481	null
2025-03-07	The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence	Noah Mamie et.al.	2503.05473	null
2025-03-07	De Novo Design of Protein-Binding Peptides by Quantum Computing	Lars Meuser et.al.	2503.05458	null
2025-03-07	LLM-based Iterative Approach to Metamodeling in Automotive	Nenad Petrovic et.al.	2503.05449	null
2025-03-07	Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts	Weigao Sun et.al.	2503.05447	link
2025-03-07	Are Your LLM-based Text-to-SQL Models Secure? Exploring SQL Injection via Backdoor Attacks	Meiyu Lin et.al.	2503.05445	null
2025-03-07	Static Program Analysis Guided LLM Based Unit Test Generation	Sujoy Roychowdhury et.al.	2503.05394	null
2025-03-07	Ontology Generation using Large Language Models	Anna Sofia Lippolis et.al.	2503.05388	link
2025-03-07	VLMs Play StarCraft II: A Benchmark and Multimodal Decision Method	Weiyu Ma et.al.	2503.05383	link
2025-03-07	R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning	Jiaxing Zhao et.al.	2503.05379	null
2025-03-07	Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs	Zara Siddique et.al.	2503.05371	null
2025-03-07	Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter	Weixiang Zhao et.al.	2503.05362	null
2025-03-07	GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation	Zhenxuan Zhang et.al.	2503.05347	link
2025-03-07	AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications	Leming Shen et.al.	2503.05346	link
2025-03-07	PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?	Martin Spitznagel et.al.	2503.05333	null
2025-03-07	Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models	Anar Yeginbergen et.al.	2503.05328	null
2025-03-07	Routing for Large ML Models	Ofir Cohen et.al.	2503.05324	link
2025-03-07	Riemannian Metric Learning: Closer to You than You Imagine	Samuel Gruffaz et.al.	2503.05321	null
2025-03-07	Disentangling Task Interference within Neurons: Model Merging in Alignment with Neuronal Mechanisms	Zitao Fang et.al.	2503.05320	null
2025-03-07	Escaping Plato’s Cave: Towards the Alignment of 3D and Text Latent Spaces	Souhail Hadgi et.al.	2503.05283	null
2025-03-07	Similarity-Based Domain Adaptation with LLMs	Jie He et.al.	2503.05281	null
2025-03-07	Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching	Bowen Pang et.al.	2503.05248	link
2025-03-07	L-FUSION: Laplacian Fetal Ultrasound Segmentation & Uncertainty Estimation	Johanna P. Müller et.al.	2503.05245	null
2025-03-07	WritingBench: A Comprehensive Benchmark for Generative Writing	Yuning Wu et.al.	2503.05244	link
2025-03-07	MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio	Xuenan Xu et.al.	2503.05242	link
2025-03-07	Unveiling Biases in AI: ChatGPT’s Political Economy Perspectives and Human Comparisons	Leonardo Becchetti et.al.	2503.05234	null
2025-03-07	Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction	Shuo Jiang et.al.	2503.05231	null
2025-03-07	ARbiter: Generating Dialogue Options and Communication Support in Augmented Reality	Julián Méndez et.al.	2503.05220	null
2025-03-07	Knowledge Updating? No More Model Editing! Just Selective Contextual Reasoning	Guoxiu He et.al.	2503.05212	null
2025-03-07	Path Pooling: Train-Free Structure Enhancement for Efficient Knowledge Graph Retrieval-Augmented Generation	Hairu Wang et.al.	2503.05203	null
2025-03-07	ORANSight-2.0: Foundational LLMs for O-RAN	Pranshav Gajjar et.al.	2503.05200	null
2025-03-07	Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning	Mufan Xu et.al.	2503.05193	null
2025-03-07	Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions	Chan Hur et.al.	2503.05186	null
2025-03-07	Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching	Simon A. Aytes et.al.	2503.05179	link
2025-03-07	Development and Enhancement of Text-to-Image Diffusion Models	Rajdeep Roshan Sahu et.al.	2503.05149	null
2025-03-07	RocketEval: Efficient Automated LLM Evaluation via Grading Checklist	Tianjun Wei et.al.	2503.05142	link
2025-03-07	Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs	Ling Team et.al.	2503.05139	null
2025-03-07	R1-Zero’s “Aha Moment” in Visual Reasoning on a 2B Non-SFT Model	Hengguang Zhou et.al.	2503.05132	link
2025-03-07	Dilu: Enabling GPU Resourcing-on-Demand for Serverless DL Serving via Introspective Elasticity	Cunchi Lv et.al.	2503.05130	link
2025-03-07	Can Large Language Models Grasp Concepts in Visual Content? A Case Study on YouTube Shorts about Depression	Jiaying “Lizzy” Liu et.al.	2503.05109	null
2025-03-07	AutoTestForge: A Multidimensional Automated Testing Framework for Natural Language Processing Models	Hengrui Xing et.al.	2503.05102	null
2025-03-07	SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding	Kaiyu Huang et.al.	2503.05096	null
2025-03-07	S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information	Feng Jiang et.al.	2503.05085	null
2025-03-07	On a Connection Between Imitation Learning and RLHF	Teng Xiao et.al.	2503.05079	link
2025-03-07	PromptPex: Automatic Test Generation for Language Model Prompts	Reshabh K Sharma et.al.	2503.05070	link
2025-03-07	Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts	Shwai He et.al.	2503.05066	null
2025-03-07	No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding	Michael Krumdick et.al.	2503.05061	null
2025-03-06	Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets	Preetam Prabhu Srikar Dammu et.al.	2503.05049	null
2025-03-06	Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference	Grace Proebsting et.al.	2503.05047	null
2025-03-06	Continual Pre-training of MoEs: How robust is your router?	Benjamin Thérien et.al.	2503.05029	null
2025-03-06	ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids	Hannes Stark et.al.	2503.05025	link
2025-03-06	Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety	Yuyou Zhang et.al.	2503.05021	null
2025-03-06	LLMs’ Reshaping of People, Processes, Products, and Society in Software Development: A Comprehensive Exploration with Early Adopters	Benyamin Tabarsi et.al.	2503.05012	null
2025-03-06	Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation	Bryan Li et.al.	2503.05010	null
2025-03-06	Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models	Benyamin Jamialahmadi et.al.	2503.05005	link
2025-03-06	Wanda++: Pruning Large Language Models via Regional Gradients	Yifan Yang et.al.	2503.04992	null
2025-03-06	DP-GTR: Differentially Private Prompt Protection via Group Text Rewriting	Mingchen Li et.al.	2503.04990	null
2025-03-06	Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review	Boris Malashenko et.al.	2503.04983	null
2025-03-06	LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression	Souvik Kundu et.al.	2503.04982	null
2025-03-06	Quantifying the Relevance of Youth Research Cited in the US Policy Documents	Miftahul Jannat Mokarrama et.al.	2503.04977	link
2025-03-06	Energy-Weighted Flow Matching for Offline Reinforcement Learning	Shiyuan Zhang et.al.	2503.04975	null
2025-03-06	Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning	Giulio Corallo et.al.	2503.04973	null
2025-03-06	Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge	Songyuan Li et.al.	2503.04971	null
2025-03-06	DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL	Haoyuan Ma et.al.	2503.04959	null
2025-03-06	Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems	Jooyoung Lee et.al.	2503.04945	null
2025-03-06	HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models	Yao Ge et.al.	2503.04930	null
2025-03-06	Metadata-free Georegistration of Ground and Airborne Imagery	Adam Bredvik et.al.	2503.04927	null
2025-03-06	FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement	Ian Huang et.al.	2503.04919	null
2025-03-06	L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling	Zhuo Chen et.al.	2503.04725	link
2025-03-07	Shifting Long-Context LLMs Research from Input to Output	Yuhao Wu et.al.	2503.04723	null
2025-03-06	Enough Coin Flips Can Make LLMs Act Bayesian	Ritwik Gupta et.al.	2503.04722	null
2025-03-06	Predictable Scale: Part I – Optimal Hyperparameter Scaling Law in Large Language Model Pretraining	Houyi Li et.al.	2503.04715	null
2025-03-07	Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size	Alireza Behtash et.al.	2503.04704	null
2025-03-06	UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets	Wenyu Wang et.al.	2503.04693	null
2025-03-06	Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases	Pengcheng Qiu et.al.	2503.04691	null
2025-03-06	LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue	Sangyeop Kim et.al.	2503.04675	null
2025-03-06	What Are You Doing? A Closer Look at Controllable Human Video Generation	Emanuele Bugliarello et.al.	2503.04666	null
2025-03-06	CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models	Shengzhuang Chen et.al.	2503.04655	link
2025-03-06	Transferable Foundation Models for Geometric Tasks on Point Cloud Representations: Geometric Neural Operators	Blaine Quackenbush et.al.	2503.04649	link
2025-03-06	Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment	Wen Yang et.al.	2503.04647	link
2025-03-06	Simulating the Real World: A Unified Survey of Multimodal Generative Models	Yuqi Hu et.al.	2503.04641	link
2025-03-06	Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation	Aishik Konwer et.al.	2503.04639	null
2025-03-06	Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking	Yijie Xu et.al.	2503.04636	null
2025-03-06	3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs	Artin Saberpour Abadian et.al.	2503.04635	null
2025-03-06	Better Process Supervision with Bi-directional Rewarding Signals	Wenxiang Chen et.al.	2503.04618	null
2025-03-06	Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning	Mohammad Amin Ghanizadeh et.al.	2503.04611	null
2025-03-06	HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization	Zhijian Zhuo et.al.	2503.04598	link
2025-03-06	The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy	Xinyi Hou et.al.	2503.04596	null
2025-03-06	Learning Generalizable Language-Conditioned Cloth Manipulation from Long Demonstrations	Hanyi Zhao et.al.	2503.04557	null
2025-03-06	Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation	Armel Zebaze et.al.	2503.04554	null
2025-03-06	Benchmarking Reasoning Robustness in Large Language Models	Tong Yu et.al.	2503.04550	null
2025-03-06	Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model	Wenke Huang et.al.	2503.04543	link
2025-03-06	SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning	Chen Li et.al.	2503.04530	null
2025-03-06	Multi-modal Summarization in Model-Based Engineering: Automotive Software Development Case Study	Nenad Petrovic et.al.	2503.04506	null
2025-03-06	Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training	Adrian Chang et.al.	2503.04496	null
2025-03-06	Large Language Models in Bioinformatics: A Survey	Zhenyu Wang et.al.	2503.04490	null
2025-03-06	InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference	Tianyu Cui et.al.	2503.04483	null
2025-03-06	ToolFuzz – Automated Agent Tool Testing	Ivan Milev et.al.	2503.04479	null
2025-03-06	Semantic Alignment of Unimodal Medical Text and Vision Representations	Maxime Di Folco et.al.	2503.04478	null
2025-03-06	Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges	Francisco Eiras et.al.	2503.04474	null
2025-03-06	Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification	Van Bach Nguyen et.al.	2503.04463	null
2025-03-06	TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction	Chao Wang et.al.	2503.04457	null
2025-03-06	Activation Space Interventions Can Be Transferred Between Large Language Models	Narmeen Oozeer et.al.	2503.04429	link
2025-03-06	AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services	Xiaoqi Wang et.al.	2503.04418	null
2025-03-06	Can Large Language Models Predict Antimicrobial Resistance Gene?	Hyunwoo Yoo et.al.	2503.04413	null
2025-03-06	Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search	Kou Misaki et.al.	2503.04412	null
2025-03-06	Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling	Yan Li et.al.	2503.04398	null
2025-03-06	TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models	Xinyi He et.al.	2503.04396	null
2025-03-06	Shaping Shared Languages: Human and Large Language Models’ Inductive Biases in Emergent Communication	Tom Kouwenhoven et.al.	2503.04395	null
2025-03-06	AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management	Junyuan Mao et.al.	2503.04392	null
2025-03-06	TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge	Cheng-Han Chiang et.al.	2503.04381	link
2025-03-06	Lost in Literalism: How Supervised Training Shapes Translationese in LLMs	Yafu Li et.al.	2503.04369	null
2025-03-06	A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery	Yiheng Zhu et.al.	2503.04362	null
2025-03-06	LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding	Jia Li et.al.	2503.04359	null
2025-03-06	scDD: Latent Codes Based scRNA-seq Dataset Distillation with Foundation Model Knowledge	Zhen Yu et.al.	2503.04357	null
2025-03-06	Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling	Zhenghua Wang et.al.	2503.04355	null
2025-03-06	Large Language Models for Zero-shot Inference of Causal Structures in Biology	Izzy Newsham et.al.	2503.04347	null
2025-03-06	TRANSIT your events into a new mass: Fast background interpolation for weakly-supervised anomaly searches	Ivan Oleksiyuk et.al.	2503.04342	link
2025-03-06	In-depth Analysis of Graph-based RAG in a Unified Framework	Yingli Zhou et.al.	2503.04338	null
2025-03-06	The Challenge of Identifying the Origin of Black-Box Large Language Models	Ziqing Yang et.al.	2503.04332	null
2025-03-06	Solving Word-Sense Disambiguation and Word-Sense Induction with Dictionary Examples	Tadej Škvorc et.al.	2503.04328	null
2025-03-06	Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation	Christian Rondanini et.al.	2503.04302	null
2025-03-06	Mapping AI Benchmark Data to Quantitative Risk Estimates Through Expert Elicitation	Malcolm Murray et.al.	2503.04299	null
2025-03-06	MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs	Tianyang Zhang et.al.	2503.04291	null
2025-03-06	How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale	Jeanette Falk et.al.	2503.04290	null
2025-03-06	Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models	Niccolò Turcato et.al.	2503.04280	null
2025-03-06	VirtualXAI: A User-Centric Framework for Explainability Assessment Leveraging GPT-Generated Personas	Georgios Makridis et.al.	2503.04261	null
2025-03-06	Knowledge Retention for Continual Model-Based Reinforcement Learning	Yixiang Sun et.al.	2503.04256	null
2025-03-06	ADOR: A Design Exploration Framework for LLM Serving with Enhanced Latency and Throughput	Junsoo Kim et.al.	2503.04253	null
2025-03-06	An Egocentric Vision-Language Model based Portable Real-time Smart Assistant	Yifei Huang et.al.	2503.04250	link
2025-03-06	How to Mitigate Overfitting in Weak-to-strong Generalization?	Junhao Shi et.al.	2503.04249	null
2025-03-06	ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions	Julian Aron Prenner et.al.	2503.04241	link
2025-03-06	DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models	Ruizhe Chen et.al.	2503.04240	null
2025-03-06	SemaSK: Answering Semantics-aware Spatial Keyword Queries with Large Language Models	Zesong Zhang et.al.	2503.04234	null
2025-03-06	FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion	Ziyi Yang et.al.	2503.04222	link
2025-03-06	Knowledge-Decoupled Synergetic Learning: An MLLM based Collaborative Approach to Few-shot Multimodal Dialogue Intention Recognition	Bin Chen et.al.	2503.04201	null
2025-03-06	MASTER: Multimodal Segmentation with Text Prompts	Fuyang Liu et.al.	2503.04199	null
2025-03-06	Measuring temporal effects of agent knowledge by date-controlled tool use	R. Patrick Xian et.al.	2503.04188	null
2025-03-06	TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records	Hejie Cui et.al.	2503.04176	null
2025-03-06	DuCos: Duality Constrained Depth Super-Resolution via Foundation Model	Zhiqiang Yan et.al.	2503.04171	null
2025-03-06	CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation	Yuki Tanaka et.al.	2503.04164	null
2025-03-06	VLA Model-Expert Collaboration for Bi-directional Manipulation Learning	Tian-Yu Xiang et.al.	2503.04163	null
2025-03-06	Semantic Retrieval Augmented Contrastive Learning for Sequential Recommendation	Ziqiang Cui et.al.	2503.04162	null
2025-03-06	KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease	Yongchao Long et.al.	2503.04153	link
2025-03-06	Ticktack: Long Span Temporal Alignment of Large Language Models Leveraging Sexagenary Cycle Time Expression	Xue Han et.al.	2503.04150	null
2025-03-06	Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination	Simin Chen et.al.	2503.04149	null
2025-03-06	Biological Sequence with Language Model Prompting: A Survey	Jiyue Jiang et.al.	2503.04135	null
2025-03-06	Token-Efficient Long Video Understanding for Multimodal LLMs	Jindong Jiang et.al.	2503.04130	null
2025-03-06	TimeFound: A Foundation Model for Time Series Forecasting	Congxi Xiao et.al.	2503.04118	null
2025-03-06	InterChat: Enhancing Generative Visual Analytics using Multimodal Interactions	Juntong Chen et.al.	2503.04110	null
2025-03-06	WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining	Haoran Wang et.al.	2503.04106	link
2025-03-06	LLMs Can Generate a Better Answer by Aggregating Their Own Responses	Zichong Li et.al.	2503.04104	null
2025-03-06	Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English	Runtao Zhou et.al.	2503.04099	null
2025-03-07	Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts	Xiangnan Chen et.al.	2503.04095	null
2025-03-06	PokéChamp: an Expert-level Minimax Language Agent	Seth Karten et.al.	2503.04094	null
2025-03-06	Beyond Memorization: Evaluating the True Type Inference Capabilities of LLMs for Java Code Snippets	Yiwen Dong et.al.	2503.04076	null
2025-03-06	PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks	Feng Ni et.al.	2503.04065	link
2025-03-06	Uncovering inequalities in new knowledge learning by large language models across different languages	Chenglong Wang et.al.	2503.04064	link
2025-03-06	EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models	Haiyang Yu et.al.	2503.04058	null
2025-03-06	Insights from Rights and Wrongs: A Large Language Model for Solving Assertion Failures in RTL Design	Jie Zhou et.al.	2503.04057	link
2025-03-06	GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding	Xihan Wang et.al.	2503.04034	null
2025-03-06	Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting	Jiyue Jiang et.al.	2503.04013	null
2025-03-06	DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation	Amin Karimi et.al.	2503.04006	null
2025-03-06	Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows	Xiangxin Zhou et.al.	2503.03989	null
2025-03-06	RetinalGPT: A Retinal Clinical Preference Conversational Assistant Powered by Large Vision-Language Models	Wenhui Zhu et.al.	2503.03987	null
2025-03-06	ReasonGraph: Visualisation of Reasoning Paths	Zongqian Li et.al.	2503.03979	link
2025-03-05	Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge	Fanwen Wang et.al.	2503.03971	link
2025-03-05	Model Behavior Specification by Leveraging LLM Self-Playing and Self-Improving	Soya Park et.al.	2503.03967	null
2025-03-05	The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems	Richard Ren et.al.	2503.03750	null
2025-03-05	Process-based Self-Rewarding Language Models	Shimao Zhang et.al.	2503.03746	link
2025-03-05	Towards Understanding Distilled Reasoning Models: A Representational Approach	David D. Baek et.al.	2503.03730	null
2025-03-05	Improving LLM Safety Alignment with Dual-Objective Optimization	Xuandong Zhao et.al.	2503.03710	link
2025-03-05	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link
2025-03-05	Effective LLM Knowledge Learning via Model Generalization	Mingkang Zhu et.al.	2503.03705	null
2025-03-05	A Practical Memory Injection Attack against LLM Agents	Shen Dong et.al.	2503.03704	null
2025-03-05	Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models	Jiyue Jiang et.al.	2503.03702	null
2025-03-05	Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks	Zihao Zhao et.al.	2503.03687	link
2025-03-05	Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Bar Karov et.al.	2503.03669	link
2025-03-05	Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction	Gustaw Opiełka et.al.	2503.03666	link
2025-03-05	A Generative Approach to High Fidelity 3D Reconstruction from Text Data	Venkat Kumar R et.al.	2503.03664	null
2025-03-05	Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset	Jessica Hoffmann et.al.	2503.03654	null
2025-03-05	Token-Level Privacy in Large Language Models	Re’em Harel et.al.	2503.03652	null
2025-03-05	DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles	Rui Zhao et.al.	2503.03651	link
2025-03-05	Psy-Copilot: Visual Chain of Thought for Counseling	Keqi Chen et.al.	2503.03645	null
2025-03-05	Large language models in finance: estimating financial sentiment for stock prediction	Kemal Kirtac et.al.	2503.03612	null
2025-03-05	Enhancing the Accuracy and Comprehensibility in Architectural Tactics Detection via Small Model-Augmented Prompt Engineering	Lingli Cao et.al.	2503.03609	link
2025-03-05	Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling	Keqi Chen et.al.	2503.03607	null
2025-03-05	Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders	Kristian Kuznetsov et.al.	2503.03601	null
2025-03-05	PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention	Lida Chen et.al.	2503.03588	null
2025-03-05	“You don’t need a university degree to comprehend data protection this way”: LLM-Powered Interactive Privacy Policy Assessment	Vincent Freiberger et.al.	2503.03587	null
2025-03-05	Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories	Alperen Yildiz et.al.	2503.03586	null
2025-03-05	Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection	Wenqiao Li et.al.	2503.03562	null
2025-03-05	Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation	Xiaomeng Zhu et.al.	2503.03556	null
2025-03-05	Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems	Yaoru Li et.al.	2503.03505	link
2025-03-05	Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization	Jiajun Yu et.al.	2503.03503	link
2025-03-05	CURVALID: Geometrically-guided Adversarial Prompt Detection	Canaan Yung et.al.	2503.03502	link
2025-03-05	TEDDY: A Family Of Foundation Models For Understanding Single Cell Biology	Alexis Chevalier et.al.	2503.03485	null
2025-03-05	Generative Artificial Intelligence in Robotic Manipulation: A Survey	Kun Zhang et.al.	2503.03464	null
2025-03-05	Open-Source Large Language Models as Multilingual Crowdworkers: Synthesizing Open-Domain Dialogues in Several Languages With No Examples in Targets and No Machine Translation	Ahmed Njifenjou et.al.	2503.03462	null
2025-03-05	Visualising Policy-Reward Interplay to Inform Zeroth-Order Preference Optimisation of Large Language Models	Alessio Galatolo et.al.	2503.03460	link
2025-03-05	Unified Mind Model: Reimagining Autonomous Agents in the LLM Era	Pengbo Hu et.al.	2503.03459	null
2025-03-05	Taxation Perspectives from Large Language Models: A Case Study on Additional Tax Penalties	Eunkyung Choi et.al.	2503.03444	null
2025-03-05	RASD: Retrieval-Augmented Speculative Decoding	Guofeng Quan et.al.	2503.03434	null
2025-03-05	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-05	Leveraging Large Language Models to Develop Heuristics for Emerging Optimization Problems	Thomas Bömer et.al.	2503.03350	null
2025-03-05	EnigmaToM: Improve LLMs’ Theory-of-Mind Reasoning Capabilities with Neural Knowledge Base of Entity States	Hainiu Xu et.al.	2503.03340	link
2025-03-05	LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models	Xi Zhu et.al.	2503.03313	link
2025-03-05	SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection	Yi-Fan Lu et.al.	2503.03303	null
2025-03-05	Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters	Julia Hindel et.al.	2503.03299	null
2025-03-05	A 262 TOPS Hyperdimensional Photonic AI Accelerator powered by a Si3N4 microcomb laser	Christos Pappas et.al.	2503.03263	null
2025-03-05	Can Frontier LLMs Replace Annotators in Biomedical Text Mining? Analyzing Challenges and Exploring Solutions	Yichong Zhao et.al.	2503.03261	link
2025-03-05	Exploring the Potential of Large Language Models as Predictors in Dynamic Text-Attributed Graphs	Runlin Lei et.al.	2503.03258	null
2025-03-05	PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary Algorithms	Shady Ali et.al.	2503.03239	link
2025-03-05	FANS – Formal Answer Selection for Natural Language Math Reasoning Using Lean4	Jiarui Yao et.al.	2503.03238	null
2025-03-05	Targeted Distillation for Sentiment Analysis	Yice Zhang et.al.	2503.03225	null
2025-03-05	Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture	Zhumei Wang et.al.	2503.03222	null
2025-03-05	COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence	Wentao Li et.al.	2503.03215	null
2025-03-05	PolyVer: A Compositional Approach for Polyglot System Modeling and Verification	Pei-Wei Chen et.al.	2503.03207	null
2025-03-05	An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models	Binxu Wang et.al.	2503.03206	null
2025-03-05	MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving	Ruida Wang et.al.	2503.03205	link
2025-03-05	Find Matching Faces Based On Face Parameters	Setu A. Bhatt et.al.	2503.03204	null
2025-03-05	Towards Robust Universal Information Extraction: Benchmark, Evaluation, and Solution	Jizhao Zhu et.al.	2503.03201	null
2025-03-05	Structured Outputs Enable General-Purpose LLMs to be Medical Experts	Guangfu Guo et.al.	2503.03194	null
2025-03-05	Enhancing Memory Efficiency in Large Language Model Training Through Chronos-aware Pipeline Parallelism	Xinyuan Lin et.al.	2503.03182	null
2025-03-05	Enhancing Cybersecurity in Critical Infrastructure with LLM-Assisted Explainable IoT Systems	Ashutosh Ghimire et.al.	2503.03180	null
2025-03-05	AttackSeqBench: Benchmarking Large Language Models’ Understanding of Sequential Patterns in Cyber Attacks	Javier Yong et.al.	2503.03170	link
2025-03-05	Dango: A Mixed-Initiative Data Wrangling System using Large Language Model	Wei-Hao Chen et.al.	2503.03154	null
2025-03-05	Position: Model Collapse Does Not Mean What You Think	Rylan Schaeffer et.al.	2503.03150	null
2025-03-05	DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models	YiQiu Guo et.al.	2503.03149	null
2025-03-05	PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Function Secret Sharing	Zhichao You et.al.	2503.03146	null
2025-03-05	A Survey of Foundation Models for Environmental Science	Runlong Yu et.al.	2503.03142	null
2025-03-05	StarFlow: Leveraging Normalizing Flows for Stellar Age Estimation in SDSS-V DR19	Alexander Stone-Martinez et.al.	2503.03138	null
2025-03-05	Bridging Molecular Graphs and Large Language Models	Runze Wang et.al.	2503.03135	link
2025-03-05	Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability	Chenhui Xu et.al.	2503.03128	null
2025-03-05	The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models	Zichao Li et.al.	2503.03122	link
2025-03-05	PromAssistant: Leveraging Large Language Models for Text-to-PromQL	Chenxi Zhang et.al.	2503.03114	null
2025-03-05	SoK: Knowledge is All You Need: Last Mile Delivery for Automated Provenance-based Intrusion Detection with LLMs	Wenrui Cheng et.al.	2503.03108	null
2025-03-05	Monitoring Decoding: Mitigating Hallucination via Evaluating the Factuality of Partial Response during Generation	Yurui Chang et.al.	2503.03106	null
2025-03-05	BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving	Katharina Winter et.al.	2503.03074	link
2025-03-04	Unification of Stochastic and Quantum Thermodynamics in Scalar Field Theory via a Model with Brownian Thermostat	T. Koide et.al.	2503.03059	null
2025-03-04	SAGE: Steering and Refining Dialog Generation with State-Action Augmentation	Yizhe Zhang et.al.	2503.03040	link
2025-03-04	SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs	Samir Abdaljalil et.al.	2503.03032	null
2025-03-04	Generative Active Adaptation for Drifting and Imbalanced Network Intrusion Detection	Ragini Gupta et.al.	2503.03022	null
2025-03-04	Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems?	Evan Scope Crafts et.al.	2503.03007	link
2025-03-04	Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment	Matthew DosSantos DiSorbo et.al.	2503.02976	null
2025-03-04	LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation	Jude Khouja et.al.	2503.02972	null
2025-03-04	Multilingual Relative Clause Attachment Ambiguity Resolution in Large Language Models	So Young Lee et.al.	2503.02971	link
2025-03-04	InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model	Siqi Ouyang et.al.	2503.02969	link
2025-03-04	Privacy-Preserving Fair Synthetic Tabular Data	Fatima J. Sarmin et.al.	2503.02968	null
2025-03-04	KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding	Zhangchen Xu et.al.	2503.02951	link
2025-03-04	Train on classical, deploy on quantum: scaling generative quantum machine learning to a thousand qubits	Erik Recio-Armengol et.al.	2503.02934	link
2025-03-04	Optimizing open-domain question answering with graph-based retrieval augmented generation	Joyce Cahoon et.al.	2503.02922	null
2025-03-04	ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models	Qinyu Zhao et.al.	2503.02883	link
2025-03-04	Wikipedia in the Era of LLMs: Evolution and Risks	Siming Huang et.al.	2503.02879	link
2025-03-04	SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models	Dmitry Nechaev et.al.	2503.02876	link
2025-03-04	The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models	Ke Ji et.al.	2503.02875	null
2025-03-04	Prompting Generative AI with Interaction-Augmented Instructions	Leixian Shen et.al.	2503.02874	null
2025-03-05	FairSense-AI: Responsible AI Meets Sustainability	Shaina Raza et.al.	2503.02865	null
2025-03-04	Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework	Ziang Zhou et.al.	2503.02863	null
2025-03-04	Privacy and Accuracy-Aware AI/ML Model Deduplication	Hong Guan et.al.	2503.02862	null
2025-03-04	Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers	Zicong He et.al.	2503.02851	link
2025-03-04	Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs	Yuzhe Gu et.al.	2503.02846	link
2025-03-04	SeqFusion: Sequential Fusion of Pre-Trained Models for Zero-Shot Time-Series Forecasting	Ting-Ji Huang et.al.	2503.02836	link
2025-03-04	AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation	Songming Zhang et.al.	2503.02832	null
2025-03-04	Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging	Yujin Oh et.al.	2503.02824	null
2025-03-04	A Multimodal Symphony: Integrating Taste and Sound through Generative AI	Matteo Spanio et.al.	2503.02823	link
2025-03-04	Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts	Marta Skreta et.al.	2503.02819	link
2025-03-04	RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration	Alicia Russell-Gilbert et.al.	2503.02800	null
2025-03-04	Multimodal AI predicts clinical outcomes of drug combinations from preclinical data	Yepeng Huang et.al.	2503.02781	link
2025-03-04	Implicit Bias in LLMs: A Survey	Xinru Lin et.al.	2503.02776	null
2025-03-04	InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training	Dingdong Wang et.al.	2503.02769	null
2025-03-04	BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression	Daniil Larionov et.al.	2503.02756	null
2025-03-04	Large Language Models for Multilingual Previously Fact-Checked Claim Detection	Ivan Vykopal et.al.	2503.02737	link
2025-03-04	RedChronos: A Large Language Model-Based Log Analysis System for Insider Threat Detection in Enterprises	Chenyu Li et.al.	2503.02702	null
2025-03-04	MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality	Shuaike Li et.al.	2503.02701	link
2025-03-04	Zero-Shot Complex Question-Answering on Long Scientific Documents	Wanting Wang et.al.	2503.02695	link
2025-03-04	FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting	Congluo Xu et.al.	2503.02692	null
2025-03-04	Generative Modeling of Microweather Wind Velocities for Urban Air Mobility	Tristan A. Shah et.al.	2503.02690	link
2025-03-04	MPO: Boosting LLM Agents with Meta Plan Optimization	Weimin Xiong et.al.	2503.02682	link
2025-03-04	Multidimensional Consistency Improves Reasoning in Language Models	Huiyuan Lai et.al.	2503.02670	null
2025-03-04	LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models	Pengwei Tang et.al.	2503.02659	null
2025-03-04	The Effectiveness of Large Language Models in Transforming Unstructured Text to Standardized Formats	William Brach et.al.	2503.02650	link
2025-03-04	YARE-GAN: Yet Another Resting State EEG-GAN	Yeganeh Farahzadi et.al.	2503.02636	link
2025-03-04	Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective	Haotian Li et.al.	2503.02631	null
2025-03-04	Towards Event Extraction with Massive Types: LLM-based Collaborative Annotation and Partitioning Extraction	Wenxuan Liu et.al.	2503.02628	null
2025-03-04	Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models	Paul Stangel et.al.	2503.02623	null
2025-03-04	OkraLong: A Flexible Retrieval-Augmented Framework for Long-Text Query Processing	Yulong Hui et.al.	2503.02603	null
2025-03-04	Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs	Wei-Yao Wang et.al.	2503.02597	link
2025-03-04	StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts	Zhaoxing Gan et.al.	2503.02595	null
2025-03-04	MciteBench: A Benchmark for Multimodal Citation Text Generation in MLLMs	Caiyu Hu et.al.	2503.02589	link
2025-03-04	Playing games with Large language models: Randomness and strategy	Alicia Vidler et.al.	2503.02582	null
2025-03-04	LLM-Safety Evaluations Lack Robustness	Tim Beyer et.al.	2503.02574	null
2025-03-04	SpecInF: Exploiting Idle GPU Resources in Distributed DL Training via Speculative Inference Filling	Cunchi Lv et.al.	2503.02550	null
2025-03-04	PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks	Sheng Shang et.al.	2503.02547	null
2025-03-04	SAGE-Amine: Generative Amine Design with Multi-Property Optimization for Efficient CO2 Capture	Hocheol Lim et.al.	2503.02534	link
2025-03-04	Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development	Dimitri Ognibene et.al.	2503.02532	null
2025-03-04	Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent	Xingzuo Li et.al.	2503.02519	link
2025-03-04	Deepfake Detection via Knowledge Injection	Tonghui Li et.al.	2503.02503	null
2025-03-04	LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs	Jianghao Chen et.al.	2503.02502	null
2025-03-04	PennyLang: Pioneering LLM-Based Quantum Code Generation with a Novel PennyLane-Centric Dataset	Haider Asif et.al.	2503.02497	null
2025-03-04	BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA	Zhengyang Ji et.al.	2503.02476	link
2025-03-04	It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation	Sohan Patnaik et.al.	2503.02463	null
2025-03-04	Don’t Get Too Excited – Eliciting Emotions in LLMs	Gino Franco Fazzi et.al.	2503.02457	null
2025-03-04	Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations	Yuhao Yang et.al.	2503.02453	null
2025-03-04	Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization	Yilun Qiu et.al.	2503.02450	link
2025-03-04	AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking	Iraklis Premptis et.al.	2503.02443	null
2025-03-04	AILS-NTUA at SemEval-2025 Task 3: Leveraging Large Language Models and Translation Strategies for Multilingual Hallucination Detection	Dimitra Karkani et.al.	2503.02442	null
2025-03-04	Artificial Intelligence in Reactor Physics: Current Status and Future Prospects	Ruizhi Zhang et.al.	2503.02440	null
2025-03-04	Beyond the Leland strategies	Emmanuel Lepinette et.al.	2503.02419	null
2025-03-04	Building 3D In-Context Learning Universal Model in Neuroimaging	Jiesi Hu et.al.	2503.02410	link
2025-03-04	Wyckoff Transformer: Generation of Symmetric Crystals	Nikita Kazeev et.al.	2503.02407	link
2025-03-04	Hierarchical Re-ranker Retriever (HRR)	Ashish Singh et.al.	2503.02401	null
2025-03-04	Promptware Engineering: Software Engineering for LLM Prompt Development	Zhenpeng Chen et.al.	2503.02400	null
2025-03-04	PersonaX: A Recommendation Agent Oriented User Modeling Framework for Long Behavior Sequence	Yunxiao Shi et.al.	2503.02398	link
2025-03-04	ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks	Heng Zhou et.al.	2503.02390	link
2025-03-04	RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking	Yifeng Xu et.al.	2503.02387	null
2025-03-04	An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning	Wei Sun et.al.	2503.02382	link
2025-03-04	Teaching Metric Distance to Autoregressive Multimodal Foundational Models	Jiwan Chung et.al.	2503.02379	null
2025-03-04	MedEthicEval: Evaluating Large Language Models Based on Chinese Medical Ethics	Haoan Jin et.al.	2503.02374	null
2025-03-04	EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports	Lama Moukheiber et.al.	2503.02365	null
2025-03-04	Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm	Zhuo Li et.al.	2503.02359	null
2025-03-04	Efficient Long Context Fine-tuning with Chunk Flow	Xiulong Yuan et.al.	2503.02356	null
2025-03-04	CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory	Jiashun Suo et.al.	2503.02354	null
2025-03-04	DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability	Yunzhen He et.al.	2503.02343	link
2025-03-04	GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning	Zhun Mou et.al.	2503.02341	null
2025-03-04	Limited Effectiveness of LLM-based Data Augmentation for COVID-19 Misinformation Stance Detection	Eun Cheol Choi et.al.	2503.02328	null
2025-03-04	PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models	Xueliang Zhao et.al.	2503.02324	link
2025-03-04	Generative Model-Assisted Demosaicing for Cross-multispectral Cameras	Jiahui Luo et.al.	2503.02322	null
2025-03-04	Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration	Pengchen Liang et.al.	2503.02321	null
2025-03-04	A Token-level Text Image Foundation Model for Document Understanding	Tongkun Guan et.al.	2503.02304	null
2025-03-04	Towards Large Language Model Guided Kernel Direct Fuzzing	Xie Li et.al.	2503.02301	null
2025-03-04	Towards Explainable Doctor Recommendation with Large Language Models	Ziyang Zeng et.al.	2503.02298	null
2025-03-04	Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions	Wentao Chen et.al.	2503.02296	null
2025-03-04	spike: A tool to drizzle HST, JWST, and Roman PSFs for improved analyses	Ava Polzin et.al.	2503.02288	link
2025-03-04	AppAgentX: Evolving GUI Agents as Proficient Smartphone Users	Wenjia Jiang et.al.	2503.02268	null
2025-03-04	Large Language Models as Natural Selector for Embodied Soft Robot Design	Changhe Chen et.al.	2503.02249	null
2025-03-04	Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts	Tong Liang et.al.	2503.02248	null
2025-03-04	From Code to Courtroom: LLMs as the New Software Judges	Junda He et.al.	2503.02246	null
2025-03-04	OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale	Haoyang Li et.al.	2503.02240	link
2025-03-04	V2X-LLM: Enhancing V2X Integration and Understanding in Connected Vehicle Corridors	Keshu Wu et.al.	2503.02239	null
2025-03-04	Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions	Zirui Wu et.al.	2503.02238	link
2025-03-04	Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling	Hang Zheng et.al.	2503.02233	null
2025-03-04	ATLaS: Agent Tuning via Learning Critical Steps	Zhixun Chen et.al.	2503.02197	null
2025-03-04	DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models	Saeed Ranjbar Alvar et.al.	2503.02175	link
2025-03-04	Leveraging Large Language Models for Enhanced Digital Twin Modeling: Trends, Methods, and Challenges	Linyao Yang et.al.	2503.02167	null
2025-03-04	X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning	Jianzhong You et.al.	2503.02162	null
2025-03-04	LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation	Yunbo Long et.al.	2503.02161	null
2025-03-04	Tabby: Tabular Data Synthesis with Language Models	Sonia Cromp et.al.	2503.02152	null
2025-03-04	Malware Classification from Memory Dumps Using Machine Learning, Transformers, and Large Language Models	Areej Dweib et.al.	2503.02144	null
2025-03-04	Measuring Intrinsic Dimension of Token Embeddings	Takuya Kataiwa et.al.	2503.02142	null
2025-03-04	Network Traffic Classification Using Machine Learning, Transformer, and Large Language Models	Ahmad Antari et.al.	2503.02141	null
2025-03-03	TMIQ: Quantifying Test and Measurement Domain Intelligence in Large Language Models	Emmanuel A. Olowe et.al.	2503.02123	null
2025-02-28	LLM Post-Training: A Deep Dive into Reasoning Large Language Models	Komal Kumar et.al.	2502.21321	link
2025-02-28	How far can we go with ImageNet for Text-to-Image generation?	L. Degeorge et.al.	2502.21318	null
2025-02-28	FANformer: Improving Large Language Models Through Effective Periodicity Modeling	Yihong Dong et.al.	2502.21309	link
2025-02-28	Contextualizing biological perturbation experiments through language	Menghua Wu et.al.	2502.21290	link
2025-02-28	Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion	Kulin Shah et.al.	2502.21278	null
2025-02-28	Adaptive Keyframe Sampling for Long Video Understanding	Xi Tang et.al.	2502.21271	null
2025-03-03	Foundation Models – A Panacea for Artificial Intelligence in Pathology?	Nita Mulliqi et.al.	2502.21264	null
2025-02-28	Modeling Human Beliefs about AI Behavior for Scalable Oversight	Leon Lang et.al.	2502.21262	null
2025-02-28	RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete	Yuheng Ji et.al.	2502.21257	null
2025-02-28	TimesBERT: A BERT-Style Foundation Model for Time Series Understanding	Haoran Zhang et.al.	2502.21245	null
2025-03-04	Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs	Xiaomin Li et.al.	2502.21239	null
2025-02-28	Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication	Daniil Filienko et.al.	2502.21236	null
2025-02-28	ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs	Hao Ge et.al.	2502.21231	null
2025-03-03	ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer	Omer Goldman et.al.	2502.21228	null
2025-02-28	Dynamic Markov Blanket Detection for Macroscopic Physics Discovery	Jeff Beck et.al.	2502.21217	link
2025-02-28	Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought	Jianhao Huang et.al.	2502.21212	null
2025-02-28	Chronologically Consistent Large Language Models	Songrun He et.al.	2502.21206	null
2025-03-04	SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training	Fakrul Islam Tushar et.al.	2502.21187	null
2025-02-28	Δ-model correction of Foundation Model based on the models own understanding	Mads-Peter Verner Christiansen et.al.	2502.21179	null
2025-03-03	Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models	Ruta Binkyte et.al.	2502.21123	null
2025-02-28	Optimizing Large Language Models for ESG Activity Detection in Financial Texts	Mattia Birti et.al.	2502.21112	link
2025-02-28	Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure?	Charles Dawson et.al.	2502.21110	null
2025-02-28	Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization	Lie Meng Pang et.al.	2502.21108	null
2025-02-28	Generating patient cohorts from electronic health records using two-step retrieval-augmented text-to-SQL generation	Angelo Ziletti et.al.	2502.21107	null
2025-02-28	A Non-contrast Head CT Foundation Model for Comprehensive Neuro-Trauma Triage	Youngjin Yoo et.al.	2502.21106	null
2025-02-28	Re-evaluating Theory of Mind evaluation in large language models	Jennifer Hu et.al.	2502.21098	null
2025-02-28	An LLM-based Delphi Study to Predict GenAI Evolution	Francesco Bertolotti et.al.	2502.21092	null
2025-02-28	PASemiQA: Plan-Assisted Agent for Question Answering on Semi-Structured Data with Text and Relational Information	Hansi Yang et.al.	2502.21087	null
2025-02-28	Are foundation models useful feature extractors for electroencephalography analysis?	Özgün Turgut et.al.	2502.21086	null
2025-02-28	Spatial Reasoning with Denoising Models	Christopher Wewer et.al.	2502.21075	null
2025-02-28	CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation	Zhenyi Shen et.al.	2502.21074	null
2025-02-28	GUIDE: LLM-Driven GUI Generation Decomposition for Automated Prototyping	Kristian Kolthoff et.al.	2502.21068	null
2025-02-28	Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport	Jingru Fu et.al.	2502.21049	link
2025-02-28	Incorporating Long-Range Interactions via the Multipole Expansion into Ground and Excited-State Molecular Simulations	Rhyan Barrett et.al.	2502.21045	null
2025-02-28	The amplifier effect of artificial agents in social contagion	Eric Hitz et.al.	2502.21037	null
2025-02-28	Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs	José I. Orlicki et.al.	2502.21030	null
2025-02-28	Measuring and identifying factors of individuals’ trust in Large Language Models	Edoardo Sebastiano De Duro et.al.	2502.21028	null
2025-02-28	PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues	Fangxu Yu et.al.	2502.21017	null
2025-02-28	Explainable Biomedical Claim Verification with Large Language Models	Siting Liang et.al.	2502.21014	null
2025-02-28	Merging Clinical Knowledge into Large Language Models for Medical Research and Applications: A Survey	Qiyuan Li et.al.	2502.20988	null
2025-02-28	UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation	Thanet Markchom et.al.	2502.20984	null
2025-02-28	Set-Theoretic Compositionality of Sentence Embeddings	Naman Bansal et.al.	2502.20975	null
2025-02-28	TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval	Chien-Yu Lin et.al.	2502.20969	null
2025-02-28	Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs	Weixiang Zhao et.al.	2502.20968	null
2025-02-28	Fine-Grained Retrieval-Augmented Generation for Visual Question Answering	Zhengxuan Zhang et.al.	2502.20964	null
2025-02-28	Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content	Hongyuan Shen et.al.	2502.20952	null
2025-02-28	Generative Uncertainty in Diffusion Models	Metod Jazbec et.al.	2502.20946	null
2025-02-28	A Deep User Interface for Exploring LLaMa	Divya Perumal et.al.	2502.20938	null
2025-02-28	Large Language Models Are Innate Crystal Structure Generators	Jingru Gan et.al.	2502.20933	null
2025-02-28	Automated Evaluation of Meter and Rhyme in Russian Generative and Human-Authored Poetry	Ilya Koziev et.al.	2502.20931	null
2025-02-28	A database to support the evaluation of gender biases in GPT-4o output	Luise Mehner et.al.	2502.20898	null
2025-02-28	Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals’ Subjective Text Perceptions	Matthias Orlikowski et.al.	2502.20897	null
2025-02-28	PathVG: A New Benchmark and Dataset for Pathology Visual Grounding	Chunlin Zhong et.al.	2502.20869	null
2025-02-28	ProBench: Benchmarking Large Language Models in Competitive Programming	Lei Yang et.al.	2502.20868	null
2025-02-28	The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents	Yifan Duan et.al.	2502.20859	null
2025-02-28	Learning to Substitute Components for Compositional Generalization	Zhaoyi Li et.al.	2502.20834	null
2025-02-28	CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval	Zelong Sun et.al.	2502.20826	null
2025-02-28	Can We Simplify Slide-level Fine-tuning of Pathology Foundation Models?	Jiawen Li et.al.	2502.20823	null
2025-02-28	Towards Reliable Vector Database Management Systems: A Software Testing Roadmap for 2030	Shenao Wang et.al.	2502.20812	null
2025-02-28	HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models	Xiao Wang et.al.	2502.20811	null
2025-02-28	PFD: Automatically Generating Machine Learning Force Fields from Universal Models	Ruoyu Wang et.al.	2502.20809	link
2025-03-03	MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts	Peijie Wang et.al.	2502.20808	null
2025-02-28	Digital Player: Evaluating Large Language Models based Human-like Agent in Games	Jiawei Wang et.al.	2502.20807	link
2025-02-28	Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation	Kuang-Da Wang et.al.	2502.20795	null
2025-02-28	Cyber Defense Reinvented: Large Language Models as Threat Intelligence Copilots	Xiaoqun Liu et.al.	2502.20791	null
2025-02-28	Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision	Dawei Zhu et.al.	2502.20790	null
2025-02-28	Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective	Yuko Nakagi et.al.	2502.20779	null
2025-02-28	FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference	Xunhao Lai et.al.	2502.20766	link
2025-02-28	Collective Reasoning Among LLMs: A Framework for Answer Validation Without Ground Truth	Seyed Pouyan Mousavi Davoudi et.al.	2502.20758	null
2025-02-28	The Rise of Darkness: Safety-Utility Trade-Offs in Role-Playing Dialogue Agents	Yihong Tang et.al.	2502.20757	null
2025-02-28	SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models	Yichi Zhang et.al.	2502.20749	link
2025-02-28	Teach-to-Reason with Scoring: Self-Explainable Rationale-Driven Multi-Trait Essay Scoring	Heejin Do et.al.	2502.20748	null
2025-02-28	Measuring Determinism in Large Language Models for Software Code Review	Eugene Klishevich et.al.	2502.20747	null
2025-02-28	CADDreamer: CAD object Generation from Single-view Images	Yuan Li et.al.	2502.20732	null
2025-02-28	SPD: Sync-Point Drop for efficient tensor parallelism of Large Language Models	Han-Byul Kim et.al.	2502.20727	null
2025-02-28	Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition	Yifei Duan et.al.	2502.20726	link
2025-02-28	Generating Clinically Realistic EHR Data via a Hierarchy- and Semantics-Guided Transformer	Guanglin Zhou et.al.	2502.20719	null
2025-02-28	Why Trust in AI May Be Inevitable	Nghi Truong et.al.	2502.20701	null
2025-02-28	Towards General Visual-Linguistic Face Forgery Detection (V2)	Ke Sun et.al.	2502.20698	link
2025-02-28	WorldModelBench: Judging Video Generation Models As World Models	Dacheng Li et.al.	2502.20694	null
2025-02-28	Unleashing the Potential of Two-Tower Models: Diffusion-Based Cross-Interaction for Large-Scale Matching	Yihan Wang et.al.	2502.20687	null
2025-02-28	JAM: Controllable and Responsible Text Generation via Causal Reasoning and Latent Vector Manipulation	Yingbing Huang et.al.	2502.20684	null
2025-02-28	STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding	Aaryan Garg et.al.	2502.20678	null
2025-02-28	SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition	Shanshan Wan et.al.	2502.20676	null
2025-02-28	Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA	Ojonugwa Oluwafemi Ejiga Peter et.al.	2502.20667	null
2025-02-28	Consistency Evaluation of News Article Summaries Generated by Large (and Small) Language Models	Colleen Gilhuly et.al.	2502.20647	null
2025-02-28	LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation	Haitao Li et.al.	2502.20640	link
2025-02-28	Can LLM Assist in the Evaluation of the Quality of Machine Learning Explanations?	Bo Wang et.al.	2502.20635	null
2025-02-28	Are LLMs Ready for Practical Adoption for Assertion Generation?	Vaishnavi Pulavarthi et.al.	2502.20633	null
2025-02-28	Rectifying Belief Space via Unlearning to Harness LLMs’ Reasoning	Ayana Niwa et.al.	2502.20620	null
2025-02-28	Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems	Jędrzej Warczyński et.al.	2502.20609	null
2025-02-28	NutriGen: Personalized Meal Plan Generator Leveraging Large Language Models to Enhance Dietary and Nutritional Adherence	Saman Khamesian et.al.	2502.20601	link
2025-02-27	Few-Shot, No Problem: Descriptive Continual Relation Extraction	Nguyen Xuan Thanh et.al.	2502.20596	null
2025-02-27	Multi$^2$: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing	Juntai Cao et.al.	2502.20592	null
2025-02-27	LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis	Saeif Alhazbi et.al.	2502.20589	null
2025-03-04	InstaFace: Identity-Preserving Facial Editing with Single Image Inference	MD Wahiduzzaman Khan et.al.	2502.20577	null
2025-02-27	ECCOS: Efficient Capability and Cost Coordinated Scheduling for Multi-LLM Serving	Kai Mei et.al.	2502.20576	link
2025-02-27	Visual Reasoning at Urban Intersections: FineTuning GPT-4o for Traffic Conflict Detection	Sari Masri et.al.	2502.20573	null
2025-02-27	Stochastic Rounding for LLM Training: Theory and Practice	Kaan Ozkara et.al.	2502.20566	null
2025-02-27	LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks	Joana C. Costa et.al.	2502.20562	link
2025-02-27	R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts	Zhongyang Li et.al.	2502.20395	link
2025-02-27	InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions	Sirui Xu et.al.	2502.20390	link
2025-02-27	Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation	Sucheng Ren et.al.	2502.20388	link
2025-02-27	Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis	Jeffrey Yang Fan Chiang et.al.	2502.20383	null
2025-02-27	Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers	Shalev Lifshitz et.al.	2502.20379	null
2025-02-27	PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation	Albert Gong et.al.	2502.20377	link
2025-02-27	Constrained Generative Modeling with Manually Bridged Diffusion Models	Saeid Naderiparizi et.al.	2502.20371	null
2025-02-27	Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Ryan C. Barron et.al.	2502.20364	link
2025-02-27	Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs	Kuan Lok Zhou et.al.	2502.20356	null
2025-02-27	KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model	Kai Zhang et.al.	2502.20350	null
2025-02-27	Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models	Yi Jing et.al.	2502.20344	null
2025-02-27	Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners	Daniele Paliotta et.al.	2502.20339	null
2025-02-27	Expertise Is What We Want	Alan Ashworth et.al.	2502.20335	null
2025-02-27	Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models	Yukang Yang et.al.	2502.20332	null
2025-02-27	Long-Context Inference with Retrieval-Augmented Speculative Decoding	Guanzheng Chen et.al.	2502.20330	link
2025-02-27	EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants	Franck Cappello et.al.	2502.20309	link
2025-02-27	M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging	Jinghao Feng et.al.	2502.20301	null
2025-02-27	An exploration of features to improve the generalisability of fake news detection models	Nathaniel Hoy et.al.	2502.20299	null
2025-02-27	Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription	Benjamin Gutteridge et.al.	2502.20295	link
2025-02-27	Conformal Tail Risk Control for Large Language Model Alignment	Catherine Yu-Chi Chen et.al.	2502.20285	null
2025-02-27	Evaluating Human Trust in LLM-Based Planners: A Preliminary Study	Shenghui Chen et.al.	2502.20284	null
2025-02-27	Large Language Models as Attribution Regularizers for Efficient Model Training	Davor Vukadin et.al.	2502.20268	link
2025-02-27	Vector-Quantized Vision Foundation Models for Object-Centric Learning	Rongzhen Zhao et.al.	2502.20263	link
2025-02-27	LLM as a Broken Telephone: Iterative Generation Distorts Information	Amr Mohamed et.al.	2502.20258	link
2025-02-27	Do computer vision foundation models learn the low-level characteristics of the human visual system?	Yancheng Cai et.al.	2502.20256	null
2025-02-27	Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets	Chichien Tsai et.al.	2502.20246	null
2025-02-27	From Retrieval to Generation: Comparing Different Approaches	Abdelrahman Abdallah et.al.	2502.20245	null
2025-02-27	FINEREASON: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving	Guizhen Chen et.al.	2502.20238	link
2025-02-27	AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions	Clare Grogan et.al.	2502.20231	link
2025-02-27	Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars	Tobias Kirschstein et.al.	2502.20220	null
2025-02-27	ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models	Haibin Chen et.al.	2502.20196	link
2025-02-27	Model Checking Linear Temporal Logic with Standpoint Modalities	Rajab Aghamov et.al.	2502.20193	null
2025-02-27	Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge	Yan-Lun Chen et.al.	2502.20186	null
2025-02-27	DGFM: Full Body Dance Generation Driven by Music Foundation Models	Xinran Liu et.al.	2502.20176	null
2025-02-27	An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs	Kaustubh Vyas et.al.	2502.20175	null
2025-02-27	Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think	Liang Chen et.al.	2502.20172	link
2025-02-27	Re-evaluating Open-ended Evaluation of Large Language Models	Siqi Liu et.al.	2502.20170	null
2025-02-27	Adaptive H&E-IHC information fusion staining framework based on feature extra	Yifan Jia et.al.	2502.20156	link
2025-02-27	Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale	Max M. Lang et.al.	2502.20140	null
2025-02-27	Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking	Yifan Zhang et.al.	2502.20129	null
2025-02-27	Self-Training Elicits Concise Reasoning in Large Language Models	Tergel Munkhbat et.al.	2502.20122	link
2025-02-27	LongRoPE2: Near-Lossless LLM Context Window Scaling	Ning Shang et.al.	2502.20082	link
2025-02-27	Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents	Haochen Sun et.al.	2502.20073	link
2025-02-27	A Generative Model Enhanced Multi-Agent Reinforcement Learning Method for Electric Vehicle Charging Navigation	Tianyang Qi et.al.	2502.20068	null
2025-02-27	Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish	Marta Lango et.al.	2502.20046	null
2025-02-27	3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds	Hengshuo Chu et.al.	2502.20041	null
2025-02-27	AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs	Xuyang Wei et.al.	2502.20035	link
2025-02-27	Erasing Without Remembering: Safeguarding Knowledge Forgetting in Large Language Models	Huazheng Wang et.al.	2502.19982	link
2025-02-27	The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs	Tanja Baeumel et.al.	2502.19981	null
2025-02-27	Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios	Chao Wang et.al.	2502.19973	null
2025-02-27	Deterministic or probabilistic? The psychology of LLMs as random number generators	Javier Coronado-Blázquez et.al.	2502.19965	null
2025-02-27	SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model	Xinghao Wang et.al.	2502.19960	link
2025-02-27	Collaborative Stance Detection via Small-Large Language Model Consistency Verification	Yu Yan et.al.	2502.19954	link
2025-02-27	GeoEdit: Geometric Knowledge Editing for Large Language Models	Yujie Feng et.al.	2502.19953	null
2025-02-27	Algebraic Machine Learning: Learning as computing an algebraic decomposition of a task	Fernando Martin-Maroto et.al.	2502.19944	link
2025-02-27	Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation	Xiang Geng et.al.	2502.19941	null
2025-02-27	Playing Pokémon Red via Deep Reinforcement Learning	Marco Pleines et.al.	2502.19920	link
2025-02-27	Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models	Yuan Sui et.al.	2502.19918	null
2025-02-27	Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents	Zhenyu Liu et.al.	2502.19917	link
2025-02-27	LLM-driven Effective Knowledge Tracing by Integrating Dual-channel Difficulty	Jiahui Cen et.al.	2502.19915	null
2025-02-27	SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous Networks	Nikolay Blagoev et.al.	2502.19913	link
2025-02-27	Order Doesn’t Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation	Qianxi He et.al.	2502.19907	null
2025-02-27	Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy	Zaijing Li et.al.	2502.19902	null
2025-02-27	GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors	An Li et.al.	2502.19896	null
2025-02-27	Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models	Sibo Yi et.al.	2502.19883	null
2025-02-27	Towards Multimodal Large-Language Models for Parent-Child Interaction: A Focus on Joint Attention	Weiyan Shi et.al.	2502.19877	null
2025-02-27	MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge	Yuntao Du et.al.	2502.19870	link
2025-02-27	MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue	Yujia Chen et.al.	2502.19860	null
2025-02-27	ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments	Hojae Han et.al.	2502.19852	null
2025-02-27	One-for-More: Continual Diffusion Model for Anomaly Detection	Xiaofan Li et.al.	2502.19848	link
2025-02-27	ProAPO: Progressively Automatic Prompt Optimization for Visual Classification	Xiangyan Qu et.al.	2502.19844	link
2025-02-27	Shared Stochastic Gaussian Process Latent Variable Models: A Multi-modal Generative Model for Quasar Spectra	Vidhi Lalchand et.al.	2502.19824	link
2025-02-27	Foot-In-The-Door: A Multi-turn Jailbreak for LLMs	Zixuan Weng et.al.	2502.19820	link
2025-02-27	Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts	Shulai Zhang et.al.	2502.19811	link
2025-02-27	Implicit Search via Discrete Diffusion: A Study on Chess	Jiacheng Ye et.al.	2502.19805	link
2025-02-27	Developmental Support Approach to AI’s Autonomous Growth: Toward the Realization of a Mutually Beneficial Stage Through Experiential Learning	Taichiro Endo et.al.	2502.19798	null
2025-02-27	ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model	Chuanliu Fan et.al.	2502.19794	null
2025-02-27	Mixtera: A Data Plane for Foundation Model Training	Maximilian Böther et.al.	2502.19790	link
2025-02-27	Advancements in Natural Language Processing for Automatic Text Summarization	Nevidu Jayatilleke et.al.	2502.19773	null
2025-02-27	Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models	Heeseung Kim et.al.	2502.19759	null
2025-02-27	PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation	Nathan Roll et.al.	2502.19756	null
2025-02-27	Beneath the Surface: How Large Language Models Reflect Hidden Bias	Jinhao Pan et.al.	2502.19749	link
2025-02-27	HaLoRA: Hardware-aware Low-Rank Adaptation for Large Language Models Based on Hybrid Compute-in-Memory Architecture	Taiqiang Wu et.al.	2502.19747	null
2025-02-27	R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning	Minggui He et.al.	2502.19735	null
2025-02-27	Preference Learning Unlocks LLMs’ Psycho-Counseling Skills	Mian Zhang et.al.	2502.19731	null
2025-02-27	Do Expressions Change Decisions? Exploring the Impact of AI’s Explanation Tone on Decision-Making	Ayano Okoso et.al.	2502.19730	null
2025-02-27	Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training	Toan Tran et.al.	2502.19726	null
2025-02-27	Few-Shot Multilingual Open-Domain QA from 5 Examples	Fan Jiang et.al.	2502.19722	link
2025-02-27	Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs	Hannah Cyberey et.al.	2502.19721	link
2025-02-27	Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data Augmentation	Manveer Singh Tamber et.al.	2502.19712	link
2025-02-27	AoECR: AI-ization of Elderly Care Robot	Linkun Zhou et.al.	2502.19706	null
2025-02-27	You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving	Guangfeng Jiang et.al.	2502.19698	null
2025-02-27	M-LLM Based Video Frame Selection for Efficient Video Understanding	Kai Hu et.al.	2502.19680	null
2025-02-27	Old Experience Helps: Leveraging Survey Methodology to Improve AI Text Annotation Reliability in Social Sciences	Linzhuo Li et.al.	2502.19679	null
2025-02-27	Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack	Chenhe Gu et.al.	2502.19672	null
2025-02-27	SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning	Mingsheng Cai et.al.	2502.19668	null
2025-02-27	Taxonomy, Opportunities, and Challenges of Representation Engineering for Large Language Models	Jan Wehner et.al.	2502.19649	null
2025-02-27	cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning	Micha Livne et.al.	2502.19642	null
2025-02-26	Agentic Mixture-of-Workflows for Multi-Modal Chemical Search	Tiffany J. Callahan et.al.	2502.19629	null
2025-02-26	Treatment Non-Adherence Bias in Clinical Machine Learning: A Real-World Study on Hypertension Medication	Zhongyuan Liang et.al.	2502.19625	null
2025-02-26	Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing	Akshat Gupta et.al.	2502.19416	null
2025-02-26	Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs	Dayu Yang et.al.	2502.19411	link
2025-02-26	Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices	Xinru Wang et.al.	2502.19410	null
2025-02-26	ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models	Danae Sánchez Villegas et.al.	2502.19409	null
2025-02-26	Learning Code-Edit Embedding to Model Student Debugging Behavior	Hasnain Heickal et.al.	2502.19407	null
2025-02-26	General Reasoning Requires Learning to Reason from the Get-go	Seungwook Han et.al.	2502.19402	null
2025-02-26	TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding	Max Ku et.al.	2502.19400	null
2025-02-26	Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis	Minjoo Lim et.al.	2502.19390	null
2025-02-26	LiDAR Registration with Visual Foundation Models	Niclas Vödisch et.al.	2502.19374	null
2025-02-26	Deep Learning For Time Series Analysis With Application On Human Motion	Ali Ismail-Fawaz et.al.	2502.19364	null
2025-02-26	DataMan: Data Manager for Pre-training Large Language Models	Ru Peng et.al.	2502.19363	null
2025-02-26	Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?	Yancheng He et.al.	2502.19361	link
2025-02-26	Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets	Tohida Rehman et.al.	2502.19339	null
2025-02-26	Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems	Hao Peng et.al.	2502.19328	link
2025-02-26	Shh, don’t say that! Domain Certification in LLMs	Cornelius Emde et.al.	2502.19320	null
2025-02-26	Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond	Qizhou Wang et.al.	2502.19301	null
2025-02-26	Agent-centric Information Access	Evangelos Kanoulas et.al.	2502.19298	null
2025-02-26	Complex LLM Planning via Automated Heuristics Discovery	Hongyi Ling et.al.	2502.19295	null
2025-02-26	Efficient Federated Search for Retrieval-Augmented Generation	Rachid Guerraoui et.al.	2502.19280	null
2025-02-26	ArtInsight: Enabling AI-Powered Artwork Engagement for Mixed Visual-Ability Families	Arnavi Chheda-Kothary et.al.	2502.19263	null
2025-02-26	AI-Powered Bayesian Inference	Veronika Ročková et.al.	2502.19231	null
2025-02-26	Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time	Jiazheng Li et.al.	2502.19230	null
2025-02-26	A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images	Nikita Shvetsov et.al.	2502.19217	null
2025-02-26	A Hybrid Transformer Architecture with a Quantized Self-Attention Mechanism Applied to Molecular Generation	Anthony M. Smaldone et.al.	2502.19214	link
2025-02-26	Negation-Induced Forgetting in LLMs	Francesca Capuano et.al.	2502.19211	null
2025-02-26	Bi’an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation	Zhouyu Jiang et.al.	2502.19209	null
2025-02-26	Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms	Jinyu Cai et.al.	2502.19193	null
2025-02-26	BIG-Bench Extra Hard	Mehran Kazemi et.al.	2502.19187	link
2025-02-26	INFO-SEDD: Continuous Time Markov Chains as Scalable Information Metrics Estimators	Alberto Foresti et.al.	2502.19183	null
2025-02-26	UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering	Langming Liu et.al.	2502.19178	link
2025-02-26	MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis	Daniel Rose et.al.	2502.19175	null
2025-02-26	A Model-Centric Review of Deep Learning for Protein Design	Gregory W. Kyro et.al.	2502.19173	null
2025-02-26	CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation	Kaiwen Yan et.al.	2502.19166	link
2025-02-26	TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency	Henry Peng Zou et.al.	2502.19163	link
2025-02-26	Detecting Linguistic Indicators for Stereotype Assessment with Large Language Models	Rebekka Görge et.al.	2502.19160	null
2025-02-26	A Sliding Layer Merging Method for Efficient Depth-Wise Pruning in LLMs	Xuan Ding et.al.	2502.19159	link
2025-02-26	When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning	Yijiang River Dong et.al.	2502.19158	null
2025-02-26	Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval	Jiarong Wu et.al.	2502.19149	null
2025-02-26	Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs	Zhaowei Zhang et.al.	2502.19148	null
2025-02-26	Identification Under the Semantic Effective Secrecy Constraint	Abdalla Ibrahim et.al.	2502.19142	null
2025-02-26	A Temporal Planning Framework for Multi-Agent Systems via LLM-Aided Knowledge Base Management	Enrico Saccon et.al.	2502.19135	null
2025-02-26	Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement	Siyuan Zhang et.al.	2502.19127	null
2025-02-26	A Survey on Foundation-Model-Based Industrial Defect Detection	Tianle Yang et.al.	2502.19106	null
2025-02-26	Evaluating Gender Bias in German Machine Translation	Michelle Kappl et.al.	2502.19104	link
2025-02-26	LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm	Siwei Wu et.al.	2502.19103	null
2025-02-26	Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation	Humza Sami et.al.	2502.19091	link
2025-02-26	EndoMamba: An Efficient Foundation Model for Endoscopic Videos	Qingyao Tian et.al.	2502.19090	link
2025-02-26	Sparse Brains are Also Adaptive Brains: Cognitive-Load-Aware Dynamic Activation for LLMs	Yiheng Yang et.al.	2502.19078	null
2025-02-26	IndicEval-XL: Bridging Linguistic Diversity in Code Generation Across Indic Languages	Ujjwal Singh et.al.	2502.19067	link
2025-02-26	Can Large Language Models Outperform Non-Experts in Poetry Evaluation? A Comparative Study Using the Consensual Assessment Technique	Piotr Sawicki et.al.	2502.19064	null
2025-02-26	MathClean: A Benchmark for Synthetic Mathematical Data Cleaning	Hao Liang et.al.	2502.19058	null
2025-02-26	Beyond Surface-Level Patterns: An Essence-Driven Defense Framework Against Jailbreak Attacks in LLMs	Shiyu Xiang et.al.	2502.19041	null
2025-02-26	FungalZSL: Zero-Shot Fungal Classification with Image Captioning Using a Synthetic Data Approach	Anju Rani et.al.	2502.19038	null
2025-02-26	InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model	Fengbin Guan et.al.	2502.19026	null
2025-02-26	Binary Neural Networks for Large Language Model: A Survey	Liangdong Liu et.al.	2502.19008	null
2025-02-26	The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training	Jinbo Wang et.al.	2502.19002	null
2025-02-26	MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering	Teng Lin et.al.	2502.18993	null
2025-02-26	OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models	Hui Feng et.al.	2502.18992	link
2025-02-26	GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation	Jie He et.al.	2502.18990	null
2025-02-26	PEToolLLM: Towards Personalized Tool Learning in Large Language Models	Qiancheng Xu et.al.	2502.18980	null
2025-02-26	Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning	Hongyi Cal et.al.	2502.18978	null
2025-02-26	(Mis)Fitting: A Survey of Scaling Laws	Margaret Li et.al.	2502.18969	link
2025-02-26	Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles	Kuang Wang et.al.	2502.18968	link
2025-02-26	OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment	Jiaxin Deng et.al.	2502.18965	null
2025-02-26	DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model	Lei Zhao et.al.	2502.18952	null
2025-02-26	Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models	Yu He et.al.	2502.18943	null
2025-02-26	JailBench: A Comprehensive Chinese Security Assessment Benchmark for Large Language Models	Shuyi Liu et.al.	2502.18935	null
2025-02-26	Talking like Piping and Instrumentation Diagrams (P&IDs)	Achmad Anggawirya Alimin et.al.	2502.18928	null
2025-02-26	ClassInvGen: Class Invariant Synthesis using Large Language Models	Chuyue Sun et.al.	2502.18917	null
2025-02-26	END: Early Noise Dropping for Efficient and Effective Context Denoising	Hongye Jin et.al.	2502.18915	null
2025-02-26	CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning	Ping Zhang et.al.	2502.18910	null
2025-02-26	An Empirical Study on Commit Message Generation using LLMs via In-Context Learning	Yifan Wu et.al.	2502.18904	link
2025-02-26	From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens	Tong Wu et.al.	2502.18890	link
2025-02-26	Letters from Future Self: Augmenting the Letter-Exchange Exercise with LLM-based Agents to Enhance Young Adults’ Career Exploration	Hayeon Jeon et.al.	2502.18881	null
2025-02-26	Learning to Generate Structured Output with Schema Reinforcement Learning	Yaxi Lu et.al.	2502.18878	null
2025-02-26	Learning to Align Multi-Faceted Evaluation: A Unified and Robust Framework	Kaishuai Xu et.al.	2502.18874	null
2025-02-26	Multi-LLM Collaborative Search for Complex Problem Solving	Sen Yang et.al.	2502.18873	null
2025-02-26	A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops	Shi Fu et.al.	2502.18865	null
2025-02-26	Sherlock: Towards Multi-scene Video Abnormal Event Extraction and Localization via a Global-local Spatial-sensitive LLM	Junxiao Ma et.al.	2502.18863	null
2025-02-26	A Causal Lens for Evaluating Faithfulness Metrics	Kerem Zaman et.al.	2502.18848	null
2025-02-26	Sliding Window Attention Training for Efficient Large Language Models	Zichuan Fu et.al.	2502.18845	null
2025-02-26	Evidence-Driven Marker Extraction for Social Media Suicide Risk Detection	Carter Adams et.al.	2502.18823	null
2025-02-26	Data-Efficient Multi-Agent Spatial Planning with LLMs	Huangyuan Su et.al.	2502.18822	null
2025-02-26	CAMEx: Curvature-aware Merging of Experts	Dung V. Nguyen et.al.	2502.18821	link
2025-02-26	Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models	Shuliang Liu et.al.	2502.18817	null
2025-02-26	Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal	Weipeng Jiang et.al.	2502.18810	null
2025-02-26	Optimal Stochastic Trace Estimation in Generative Modeling	Xinyang Liu et.al.	2502.18808	null
2025-02-26	SolEval: Benchmarking Large Language Models for Repository-level Solidity Code Generation	Zhiyuan Peng et.al.	2502.18793	null
2025-02-26	Active Few-Shot Learning for Text Classification	Saeed Ahmadnia et.al.	2502.18782	null
2025-02-26	Towards Optimal Multi-draft Speculative Decoding	Zhengmian Hu et.al.	2502.18779	null
2025-02-26	M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance	Qingpei Guo et.al.	2502.18778	null
2025-02-26	Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance	Xueqing Peng et.al.	2502.18772	null
2025-02-26	Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation	Yuxiang Wang et.al.	2502.18771	link
2025-02-26	Reward Shaping to Mitigate Reward Hacking in RLHF	Jiayi Fu et.al.	2502.18770	link
2025-02-26	CommGPT: A Graph and Retrieval-Augmented Multimodal Communication Foundation Model	Feibo Jiang et.al.	2502.18763	null
2025-02-26	Training Large Recommendation Models via Graph-Language Token Alignment	Mingdai Yang et.al.	2502.18757	null
2025-02-26	M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type	Weiming Hu et.al.	2502.18755	null
2025-02-26	AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms	Yuwei Yan et.al.	2502.18754	link
2025-02-26	Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking	Shaheer Mohamed et.al.	2502.18748	null
2025-02-26	Automatic Prompt Optimization via Heuristic Search: A Survey	Wendi Cui et.al.	2502.18746	null
2025-02-25	DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers	Xueguang Ma et.al.	2502.18460	link
2025-02-25	LLM-Based Design Pattern Detection	Christian Schindler et.al.	2502.18458	null
2025-02-25	FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response	Mollie Shichman et.al.	2502.18452	null
2025-02-25	SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution	Yuxiang Wei et.al.	2502.18449	null
2025-02-25	MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning	Chanwoo Park et.al.	2502.18439	null
2025-02-25	TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning	Frederikus Hudi et.al.	2502.18431	link
2025-02-25	OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference	Xiangyu Zhao et.al.	2502.18411	link
2025-02-25	Enhancing DNA Foundation Models to Address Masking Inefficiencies	Monireh Safari et.al.	2502.18405	null
2025-02-25	Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods	Nicola Cecere et.al.	2502.18389	null
2025-02-25	How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities	Minhua Lin et.al.	2502.18387	null
2025-02-25	MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning	Sepehr Asgarian et.al.	2502.18371	null
2025-02-25	Sparse Bayesian Generative Modeling for Joint Parameter and Channel Estimation	Benedikt Böck et.al.	2502.18369	null
2025-02-25	ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation	Yifan Pu et.al.	2502.18364	null
2025-02-25	Responsible AI Agents	Deven R. Desai et.al.	2502.18359	null
2025-02-25	Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation	Jessica He et.al.	2502.18357	null
2025-02-25	BRIDO: Bringing Democratic Order to Abstractive Summarization	Junhyun Lee et.al.	2502.18342	null
2025-02-25	Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology	Romy Beauté et.al.	2502.18318	null
2025-02-25	GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music	Xinran Liu et.al.	2502.18309	null
2025-02-25	RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction	Jianhao Yan et.al.	2502.18308	null
2025-02-25	LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation	Pengzhi Li et.al.	2502.18302	null
2025-02-25	Bayesian Computation in Deep Learning	Wenlong Chen et.al.	2502.18300	null
2025-02-25	DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis	Zeju Li et.al.	2502.18297	null
2025-02-25	AMPO: Active Multi-Preference Optimization	Taneesh Gupta et.al.	2502.18293	null
2025-02-25	Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases	Shanshan Xu et.al.	2502.18282	null
2025-02-25	Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support	Guoxin Wang et.al.	2502.18274	link
2025-02-25	Imperfect Knowledge Management (IKM) in GEFRED (GENeralized model for Fuzzy RElational Databases)	Leoncio Jimenez et.al.	2502.18255	null
2025-02-25	Iterative Counterfactual Data Augmentation	Mitchell Plyler et.al.	2502.18249	link
2025-02-25	Unveiling and Causalizing CoT: A Causal Perspective	Jiarun Fu et.al.	2502.18239	null
2025-02-25	Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints	Mihaela Cătălina Stoian et.al.	2502.18237	link
2025-02-25	Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent	Xiaofeng Wang et.al.	2502.18228	null
2025-02-25	From ChatGPT to DeepSeek: Can LLMs Simulate Humanity?	Qian Wang et.al.	2502.18210	null
2025-02-25	LAG: LLM agents for Leaderboard Auto Generation on Demanding	Jian Wu et.al.	2502.18209	null
2025-02-25	Grandes modelos de lenguaje: de la predicción de palabras a la comprensión?	Carlos Gómez-Rodríguez et.al.	2502.18205	null
2025-02-25	Intersubjective Model of AI-mediated Communication: Augmenting Human-Human Text Chat through LLM-based Adaptive Agent Pair	Shutaro Aoyama et.al.	2502.18201	null
2025-02-25	Task-Agnostic Semantic Communication with Multimodal Foundation Models	Jiangjing Hu et.al.	2502.18200	null
2025-02-25	Agnostic calculation of atomic free energies with the descriptor density of states	Thomas D Swinburne et.al.	2502.18191	link
2025-02-25	ChatMotion: A Multimodal Multi-Agent for Human Motion Analysis	Li Lei et.al.	2502.18180	null
2025-02-25	Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs	Gaye Colakoglu et.al.	2502.18179	link
2025-02-25	CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification	Mingkun Zhang et.al.	2502.18176	link
2025-02-25	SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models	Zhang Yuxuan et.al.	2502.18168	null
2025-02-25	Can LLMs Explain Themselves Counterfactually?	Zahra Dehghanighobadi et.al.	2502.18156	null
2025-02-25	Carbon and Silicon, Coexist or Compete? A Survey on Human-AI Interactions in Agent-based Modeling and Simulation	Ziyue Lin et.al.	2502.18145	null
2025-02-25	LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers	Zhuocheng Zhang et.al.	2502.18139	link
2025-02-25	Large Language Model Driven Agents for Simulating Echo Chamber Formation	Chenhao Gu et.al.	2502.18138	null
2025-02-25	Inverse Materials Design by Large Language Model-Assisted Generative Framework	Yun Hao et.al.	2502.18127	link
2025-02-25	HyperG: Hypergraph-Enhanced LLMs for Structured Knowledge	Sirui Huang et.al.	2502.18125	null
2025-02-25	Bayesian Optimization for Controlled Image Editing via LLMs	Chengkun Cai et.al.	2502.18116	null
2025-02-25	PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching	Han Nie et.al.	2502.18104	link
2025-02-25	Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models	Cao Yuxuan et.al.	2502.18101	link
2025-02-25	Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning	Wenkai Yang et.al.	2502.18080	null
2025-02-25	Examining the Threat Landscape: Foundation Models and Model Stealing	Ankita Raj et.al.	2502.18077	null
2025-02-25	MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration	Yishuai Cai et.al.	2502.18072	link
2025-02-25	Golden Ratio Mixing of Real and Synthetic Data for Stabilizing Generative Model Training	Hengzhi He et.al.	2502.18049	null
2025-02-25	AutoCas: Autoregressive Cascade Predictor in Social Networks via Large Language Models	Yuhao Zheng et.al.	2502.18040	null
2025-02-25	Harnessing Multiple Large Language Models: A Survey on LLM Ensemble	Zhijun Chen et.al.	2502.18036	link
2025-02-25	Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference	Zhuo Chen et.al.	2502.18023	null
2025-02-25	AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages	Joshua Sakthivel Raju et.al.	2502.18020	null
2025-02-25	NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms	Yashan Wang et.al.	2502.18008	null
2025-02-25	Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning	Xinghao Chen et.al.	2502.18001	link
2025-02-25	Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation	Guang Lin et.al.	2502.17972	null
2025-02-25	LLM Knows Geometry Better than Algebra: Numerical Understanding of LLM-Based Agents in A Trading Arena	Tianmi Ma et.al.	2502.17967	link
2025-02-25	Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments	Patomporn Payoungkhamdee et.al.	2502.17956	null
2025-02-25	DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning	Pusheng Xu et.al.	2502.17947	null
2025-02-25	Assessing Large Language Models in Agentic Multilingual National Bias	Qianying Liu et.al.	2502.17945	null
2025-02-25	CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation	Haitao Li et.al.	2502.17943	link
2025-02-25	Advantage-Guided Distillation for Preference Alignment in Small Language Models	Shiping Gao et.al.	2502.17927	link
2025-02-25	LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction	Suozhi Huang et.al.	2502.17925	null
2025-02-25	FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models	Hongzhan Lin et.al.	2502.17924	link
2025-02-25	Towards Sustainable Web Agents: A Plea for Transparency and Dedicated Metrics for Energy Consumption	Lars Krupp et.al.	2502.17903	null
2025-02-25	Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs	Che Liu et.al.	2502.17900	null
2025-02-25	Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation	Tong Li et.al.	2502.17899	null
2025-02-25	FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real	Weiheng Liu et.al.	2502.17894	null
2025-02-25	RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts	Mingyan Wu et.al.	2502.17888	link
2025-02-25	Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers	Hannah Calzi Kleidermacher et.al.	2502.17882	null
2025-02-25	EEGM2: An Efficient Mamba-2-Based Self-Supervised Framework for Long-Sequence EEG Modeling	Jiazhen Hong et.al.	2502.17873	link
2025-02-25	A Survey: Spatiotemporal Consistency in Video Generation	Zhiyu Yin et.al.	2502.17863	null
2025-02-25	HRR: Hierarchical Retrospection Refinement for Generated Image Detection	Peipei Yuan et.al.	2502.17862	null
2025-02-25	LR$^{2}$Bench: Evaluating Long-chain Reflective Reasoning Capabilities of Large Language Models via Constraint Satisfaction Problems	Jianghao Chen et.al.	2502.17848	null
2025-02-25	Quantifying interdisciplinary synergy in higher STEM education	Gahyoun Gim et.al.	2502.17841	null
2025-02-25	A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation	Beibei Xiong et.al.	2502.17840	null
2025-02-25	TagGAN: A Generative Model for Data Tagging	Muhammad Nawaz et.al.	2502.17836	null
2025-02-25	MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks	Hyeonjeong Ha et.al.	2502.17832	link
2025-02-25	A General Framework to Enhance Fine-tuning-based LLM Unlearning	Jie Ren et.al.	2502.17823	link
2025-02-25	An Overview of Large Language Models for Statisticians	Wenlong Ji et.al.	2502.17814	null
2025-02-25	Can Multimodal LLMs Perform Time Series Anomaly Detection?	Xiongxiao Xu et.al.	2502.17812	link
2025-02-25	URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models	Ruiqi Yan et.al.	2502.17810	null
2025-02-25	DocPuzzle: A Process-Aware Benchmark for Evaluating Realistic Long-Context Reasoning Capabilities	Tianyi Zhuang et.al.	2502.17807	null
2025-02-25	Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training	Yihang Yao et.al.	2502.17800	null
2025-02-25	AIR: Complex Instruction Generation via Automatic Iterative Refinement	Wei Liu et.al.	2502.17787	link
2025-02-25	Exploring the Potential of Large Language Models for Estimating the Reading Comprehension Question Difficulty	Yoshee Jain et.al.	2502.17785	null
2025-02-25	Tip of the Tongue Query Elicitation for Simulated Evaluation	Yifan He et.al.	2502.17776	link
2025-02-25	FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks	Tanawan Premsri et.al.	2502.17775	link
2025-02-25	Uncertainty Quantification for LLM-Based Survey Simulations	Chengpiao Huang et.al.	2502.17773	null
2025-02-25	DeepSeek vs. ChatGPT: A Comparative Study for Scientific Computing and Scientific Machine Learning Tasks	Qile Jiang et.al.	2502.17764	null
2025-02-25	Design and implementation of a distributed security threat detection system integrating federated learning and multimodal LLM	Yuqing Wang et.al.	2502.17763	null
2025-02-25	Detection of LLM-Paraphrased Code and Identification of the Responsible LLM Using Coding Style Features	Shinwoo Park et.al.	2502.17749	null
2025-02-24	LLM Inference Acceleration via Efficient Operation Fusion	Mahsa Salmani et.al.	2502.17728	null
2025-02-24	Can Score-Based Generative Modeling Effectively Handle Medical Image Classification?	Sushmita Sarker et.al.	2502.17727	link
2025-02-24	Spontaneous Giving and Calculated Greed in Language Models	Yuxuan Li et.al.	2502.17720	null
2025-02-24	Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures	Akhila Yerukola et.al.	2502.17710	link
2025-02-24	Fractal Generative Models	Tianhong Li et.al.	2502.17437	link
2025-02-24	Introducing Visual Perception Token into Multimodal Large Language Model	Runpeng Yu et.al.	2502.17425	link
2025-02-24	MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs	Jiarui Zhang et.al.	2502.17422	link
2025-02-24	LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification	Penghui Yang et.al.	2502.17421	link
2025-02-24	The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence	Tom Wollschläger et.al.	2502.17420	null
2025-02-24	From System 1 to System 2: A Survey of Reasoning Large Language Models	Zhong-Zhi Li et.al.	2502.17419	link
2025-02-24	Reasoning with Latent Thoughts: On the Power of Looped Transformers	Nikunj Saunshi et.al.	2502.17416	null
2025-02-24	COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs	Liming Liu et.al.	2502.17410	link
2025-02-24	Large Language Models are Powerful EHR Encoders	Stefan Hegselmann et.al.	2502.17403	link
2025-02-24	What is a Good Question? Utility Estimation with LLM-based Simulations	Dong-Ho Lee et.al.	2502.17383	null
2025-02-24	KV-Edit: Training-Free Image Editing for Precise Background Preservation	Tianrui Zhu et.al.	2502.17363	link
2025-02-24	A Closer Look at TabPFN v2: Strength, Limitation, and Extension	Han-Jia Ye et.al.	2502.17361	null
2025-02-24	RELICT: A Replica Detection Framework for Medical Image Generation	Orhun Utku Aydin et.al.	2502.17360	link
2025-02-24	On Relation-Specific Neurons in Large Language Models	Yihong Liu et.al.	2502.17355	link
2025-02-24	How Scientists Use Large Language Models to Program	Gabrielle O’Brien et.al.	2502.17348	null
2025-02-24	Time series forecasting based on optimized LLM for fault prediction in distribution power grid insulators	João Pedro Matos-Carvalho et.al.	2502.17341	null
2025-02-24	HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization	Zhenghao Liu et.al.	2502.17315	link
2025-02-24	Delta Decompression for MoE-based LLMs Compression	Hao Gu et.al.	2502.17298	link
2025-02-24	Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts	Zhenghao Liu et.al.	2502.17297	link
2025-02-24	Integrating protein sequence embeddings with structure via graph-based deep learning for the prediction of single-residue properties	Kevin Michalewicz et.al.	2502.17294	link
2025-02-24	Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing	Yi-Kai Zhang et.al.	2502.17282	link
2025-02-24	MonoTODia: Translating Monologue Requests to Task-Oriented Dialogues	Sebastian Steindl et.al.	2502.17268	link
2025-02-24	Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective	Chengyin Xu et.al.	2502.17262	null
2025-02-24	Detecting Benchmark Contamination Through Watermarking	Tom Sander et.al.	2502.17259	null
2025-02-24	REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective	Simon Geisler et.al.	2502.17254	link
2025-02-24	Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search	Boyan Li et.al.	2502.17248	null
2025-02-24	Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction	Tianpeng Li et.al.	2502.17239	link
2025-02-24	Making LLMs Reason? The Intermediate Language Problem in Neurosymbolic Approaches	Alexander Beiser et.al.	2502.17216	null
2025-02-24	CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought	Boxuan Zhang et.al.	2502.17214	link
2025-02-24	Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following	Jie Zeng et.al.	2502.17204	link
2025-02-24	IGDA: Interactive Graph Discovery through Large Language Model Agents	Alex Havrilla et.al.	2502.17189	null
2025-02-24	Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks	Andrei Chernov et.al.	2502.17187	null
2025-02-24	Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric	Yuming Yang et.al.	2502.17184	link
2025-02-24	Unsupervised Accelerated MRI Reconstruction via Ground-Truth-Free Flow Matching	Xinzhe Luo et.al.	2502.17174	null
2025-02-24	Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch	Xueru Wen et.al.	2502.17173	null
2025-02-24	Logic Haystacks: Probing LLMs Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding)	Damien Sileo et.al.	2502.17169	null
2025-02-24	JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning	Huanghai Liu et.al.	2502.17166	link
2025-02-24	MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation	María Andrea Cruz Blandón et.al.	2502.17163	link
2025-02-24	Real-time Monitoring of Economic Shocks using Company Websites	Michael Koenig et.al.	2502.17161	null
2025-02-24	A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis	Yuli Wu et.al.	2502.17160	null
2025-02-24	Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation	Fanhu Zeng et.al.	2502.17159	null
2025-02-24	CodeSwift: Accelerating LLM Inference for Efficient Code Generation	Qianhui Zhao et.al.	2502.17139	null
2025-02-24	Evaluating the Effectiveness of Large Language Models in Automated News Article Summarization	Lionel Richy Panlap Houamegni et.al.	2502.17136	null
2025-02-24	Applications of Large Models in Medicine	YunHe Su et.al.	2502.17132	null
2025-02-24	Thus Spake Long-Context Large Language Model	Xiaoran Liu et.al.	2502.17129	null
2025-02-24	Adversarial Training for Defense Against Label Poisoning Attacks	Melis Ilayda Bal et.al.	2502.17121	link
2025-02-24	Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions	Zhong Li et.al.	2502.17119	link
2025-02-24	SFLD: Reducing the content bias for AI-generated Image Detection	Seoyeon Gye et.al.	2502.17105	null
2025-02-24	Generative Models in Decision Making: A Survey	Yinchuan Li et.al.	2502.17100	null
2025-02-24	Improved Diffusion-based Generative Model with Better Adversarial Robustness	Zekun Wang et.al.	2502.17099	link
2025-02-24	Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies	Julieth Katherine Riveros et.al.	2502.17087	link
2025-02-24	Automatically Evaluating the Paper Reviewing Capability of Large Language Models	Hyungyu Shin et.al.	2502.17086	null
2025-02-24	Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence	Bolin Chen et.al.	2502.17085	null
2025-02-24	Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability	Ashhadul Islam et.al.	2502.17071	null
2025-02-24	LLM-QE: Improving Query Expansion by Aligning Large Language Models with Ranking Preferences	Sijia Yao et.al.	2502.17057	link
2025-02-24	PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance	Haoran Li et.al.	2502.17041	link
2025-02-24	Evolution 6.0: Evolving Robotic Capabilities Through Generative Design	Muhammad Haris Khan et.al.	2502.17034	null
2025-02-24	Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology	Longchao Da et.al.	2502.17026	null
2025-02-24	Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization	Zixuan Gong et.al.	2502.17024	null
2025-02-24	Quantifying Logical Consistency in Transformers via Query-Key Alignment	Eduard Tulchinskii et.al.	2502.17017	null
2025-02-24	Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation	Jaskaran Singh Walia et.al.	2502.17011	null
2025-02-24	Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators	Shixin Zhao et.al.	2502.17006	null
2025-02-24	An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT	Shreya Singh et.al.	2502.17000	null
2025-02-24	Active Learning for Conditional Inverse Design with Crystal Generation and Foundation Atomic Models	Zhuoyuan Li et.al.	2502.16984	null
2025-02-24	LongSafety: Evaluating Long-Context Safety of Large Language Models	Yida Lu et.al.	2502.16971	link
2025-02-24	Autoregressive Image Generation Guided by Chains of Thought	Miaomiao Cai et.al.	2502.16965	null
2025-02-24	Make LLM Inference Affordable to Everyone: Augmenting GPU Memory with NDP-DIMM	Lian Liu et.al.	2502.16963	null
2025-02-24	UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings	Layba Fiaz et.al.	2502.16961	null
2025-02-24	Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance	Chenghua Huang et.al.	2502.16944	null
2025-02-24	Reasoning Does Not Necessarily Improve Role-Playing Ability	Xiachong Feng et.al.	2502.16940	null
2025-02-24	BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference	Zewen Jin et.al.	2502.16927	null
2025-02-24	FilterLLM: Text-To-Distribution LLM for Billion-Scale Cold-Start Recommendation	Ruochen Liu et.al.	2502.16924	null
2025-02-24	A Systematic Survey of Automatic Prompt Optimization Techniques	Kiran Ramnath et.al.	2502.16923	null
2025-02-24	Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties	Zhenglin Wang et.al.	2502.16922	link
2025-02-24	SS-MPC: A Sequence-Structured Multi-Party Conversation System	Yoonjin Jang et.al.	2502.16920	null
2025-02-24	Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model	Kang Fu et.al.	2502.16915	link
2025-02-24	SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models	Kevin Miller et.al.	2502.16911	null
2025-02-24	AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models	Qin Zhu et.al.	2502.16906	link
2025-02-24	GuidedBench: Equipping Jailbreak Evaluation with Guidelines	Ruixuan Huang et.al.	2502.16903	null
2025-02-24	Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement	Suchae Jeong et.al.	2502.16902	null
2025-02-24	Char-mander Use mBackdoor! A Study of Cross-lingual Backdoor Attacks in Multilingual LLMs	Himanshu Beniwal et.al.	2502.16901	link
2025-02-24	Zero-shot Load Forecasting for Integrated Energy Systems: A Large Language Model-based Framework with Multi-task Learning	Jiaheng Li et.al.	2502.16896	null
2025-02-24	Unlocking Scientific Concepts: How Effective Are LLM-Generated Analogies for Student Understanding and Classroom Practice?	Zekai Shao et.al.	2502.16895	null
2025-02-24	Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment	Chenghao Fan et.al.	2502.16894	link
2025-02-24	Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data	Yejian Zhang et.al.	2502.16892	null
2025-02-24	Unveiling Institution-Specific Bias in Pathology Foundation Models: Detriments, Causes, and Potential Solutions	Weiping Lin et.al.	2502.16889	null
2025-02-24	DBudgetKV: Dynamic Budget in KV Cache Compression for Ensuring Optimal Performance	Xuanfan Ni et.al.	2502.16886	null
2025-02-24	CORAL: Learning Consistent Representations across Multi-step Training with Lighter Speculative Drafter	Yepeng Weng et.al.	2502.16880	null
2025-02-24	A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis	Yuzhi Hao et.al.	2502.16879	null
2025-02-24	Graphy’our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data	Longbin Lai et.al.	2502.16868	null
2025-02-24	Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment	Kartik Nagpal et.al.	2502.16863	null
2025-02-24	LongAttn: Selecting Long-context Training Data via Token-level Attention	Longyun Wu et.al.	2502.16860	link
2025-02-24	Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models	Avinash Trivedi et.al.	2502.16857	null
2025-02-24	Improving LLM General Preference Alignment via Optimistic Online Mirror Descent	Yuheng Zhang et.al.	2502.16852	null
2025-02-24	Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models	Yaqi Sun et.al.	2502.16842	null
2025-02-24	Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives	Dilermando Queiroz et.al.	2502.16841	null
2025-02-24	In-context learning of evolving data streams with tabular foundational models	Afonso Lourenço et.al.	2502.16840	null
2025-02-24	“Actionable Help” in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts	Rabindra Lamsal et.al.	2502.16839	null
2025-02-24	REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction	Omar Sharif et.al.	2502.16838	null
2025-02-24	Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization	Yao Xiao et.al.	2502.16825	null
2025-02-21	ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval	Guanqi Zhan et.al.	2502.15682	null
2025-02-21	Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training	Jaydeep Borkar et.al.	2502.15680	link
2025-02-21	FLEKE: Federated Locate-then-Edit Knowledge Editing	Zongkai Zhao et.al.	2502.15677	link
2025-02-21	AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind	Zhining Zhang et.al.	2502.15676	link
2025-02-21	VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Florent Bartoccioni et.al.	2502.15672	link
2025-02-21	Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing	Shoumik Saha et.al.	2502.15666	link
2025-02-21	Machine-generated text detection prevents language model collapse	George Drayson et.al.	2502.15654	link
2025-02-21	Empowering LLMs with Logical Reasoning: A Comprehensive Survey	Fengxiang Cheng et.al.	2502.15652	null
2025-02-21	Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models	Anirudh Sundar et.al.	2502.15639	null
2025-02-21	Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification	Vasilii Feofanov et.al.	2502.15637	link
2025-02-21	The Relationship Between Reasoning and Performance in Large Language Models – o3 (mini) Thinks Harder, Not Longer	Marthe Ballon et.al.	2502.15631	link
2025-02-21	Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing	Qi Le et.al.	2502.15618	link
2025-02-21	On the Robustness of Transformers against Context Hijacking for Linear Classification	Tianle Li et.al.	2502.15609	null
2025-02-21	Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance	Akos Nagy et.al.	2502.15604	null
2025-02-21	Do Multilingual LLMs Think In English?	Lisa Schut et.al.	2502.15603	null
2025-02-21	WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents	Xinhang Liu et.al.	2502.15601	null
2025-02-21	SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention	Jiaqi Wu et.al.	2502.15594	null
2025-02-21	Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning	Wenhao Zhu et.al.	2502.15592	link
2025-02-21	LightThinker: Thinking Step-by-Step Compression	Jintian Zhang et.al.	2502.15589	null
2025-02-21	Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid	Yunfeng Li et.al.	2502.15583	null
2025-02-21	Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning	Mariia Radova et.al.	2502.15582	null
2025-02-21	Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders	Xuansheng Wu et.al.	2502.15576	null
2025-02-21	DReSD: Dense Retrieval for Speculative Decoding	Milan Gritta et.al.	2502.15572	link
2025-02-21	A Cautionary Tale About “Neutrally” Informative AI Tools Ahead of the 2025 Federal Elections in Germany	Ina Dormuth et.al.	2502.15568	null
2025-02-21	PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning	Pengcheng Huang et.al.	2502.15543	link
2025-02-21	Accurate and efficient machine learning interatomic potentials for finite temperature modeling of molecular crystals	Flaviano Della Pia et.al.	2502.15530	null
2025-02-21	Scaling Sparse and Dense Retrieval in Decoder-Only LLMs	Hansi Zeng et.al.	2502.15526	link
2025-02-21	Towards Swift Serverless LLM Cold Starts with ParaServe	Chiheng Lou et.al.	2502.15524	null
2025-02-21	Activation Steering in Neural Theorem Provers	Shashank Kirtania et.al.	2502.15507	null
2025-02-21	Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing	Masaya Kobayashi et.al.	2502.15506	null
2025-02-21	Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models	Ya Wang et.al.	2502.15499	link
2025-02-21	Programmers Aren’t Obsolete Yet: A Syllabus for Teaching CS Students to Responsibly Use Large Language Models for Code Generation	Bruno Pereira Cipriano et.al.	2502.15493	null
2025-02-21	ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models	Martina Miliani et.al.	2502.15487	null
2025-02-21	Enhancing RWKV-based Language Models for Long-Sequence Text Generation	Xinghan Pan et.al.	2502.15485	link
2025-02-21	FaultGPT: Industrial Fault Diagnosis Question Answering System by Vision Language Models	Jiao Chen et.al.	2502.15481	null
2025-02-21	PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System	Yintao He et.al.	2502.15470	null
2025-02-21	Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation	Wenxuan Wang et.al.	2502.15466	null
2025-02-21	Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs	Gengyuan Zhang et.al.	2502.15457	null
2025-02-21	R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning	Jinda Liu et.al.	2502.15455	link
2025-02-21	A fast convergence algorithm based on binary integer programming for expert load balancing in MoE LLMs	Yuan Sun et.al.	2502.15451	link
2025-02-21	When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models	Weilan Wang et.al.	2502.15443	null
2025-02-21	On the Effectiveness of Large Language Models in Writing Alloy Formulas	Yang Hong et.al.	2502.15441	null
2025-02-21	Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning	Raghav Singhal et.al.	2502.15436	link
2025-02-21	Single-pass Detection of Jailbreaking Input in Large Language Models	Leyla Naz Candogan et.al.	2502.15435	null
2025-02-21	Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation	Yue Zhou et.al.	2502.15434	link
2025-02-21	Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations	Lihu Chen et.al.	2502.15429	link
2025-02-21	Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs	Giulio Zizzo et.al.	2502.15427	link
2025-02-21	Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking	Yi-Ling Chung et.al.	2502.15419	link
2025-02-21	MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models	Suraj Racha et.al.	2502.15418	link
2025-02-21	HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings	Rasmus Aavang et.al.	2502.15411	link
2025-02-21	Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning	Xuetao Ma et.al.	2502.15401	null
2025-02-21	Beyond Tools: Understanding How Heavy Users Integrate LLMs into Everyday Tasks and Decision-Making	Eunhye Kim et.al.	2502.15395	null
2025-02-21	Chitrarth: Bridging Vision and Language for a Billion People	Shaharukh Khan et.al.	2502.15392	null
2025-02-21	MOVE: A Mixture-of-Vision-Encoders Approach for Domain-Focused Vision-Language Processing	Matvey Skripkin et.al.	2502.15381	null
2025-02-21	Weakly Supervised Video Scene Graph Generation via Natural Language Supervision	Kibum Kim et.al.	2502.15370	link
2025-02-21	Identifying Features that Shape Perceived Consciousness in Large Language Model-based AI: A Quantitative Study of Human Responses	Kang Bongsu et.al.	2502.15365	null
2025-02-21	Evaluating Social Biases in LLM Reasoning	Xuyang Wu et.al.	2502.15361	null
2025-02-21	ARS: Automatic Routing Solver with Large Language Models	Kai Li et.al.	2502.15359	link
2025-02-21	AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms	Feiyang Chen et.al.	2502.15349	link
2025-02-21	Constructing a Norm for Children’s Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models	Yi Zhang et.al.	2502.15348	null
2025-02-21	Efficiently Solving Discounted MDPs with Predictions on Transition Matrices	Lixing Lyu et.al.	2502.15345	null
2025-02-21	Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions	Shoubin Chen et.al.	2502.15336	null
2025-02-21	Stepwise Informativeness Search for Improving LLM Reasoning	Siyuan Wang et.al.	2502.15335	null
2025-02-21	Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment	Pedram Zaree et.al.	2502.15334	null
2025-02-21	Detecting Future-related Contexts of Entity Mentions	Puneet Prashar et.al.	2502.15332	null
2025-02-21	DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation	Luzhou Ge et.al.	2502.15309	link
2025-02-21	SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention	Hong Yankun et.al.	2502.15304	null
2025-02-21	Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference	Yaohua Tang et.al.	2502.15294	null
2025-02-21	Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models	Jianming Chang et.al.	2502.15292	null
2025-02-21	BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization	Tonghan Wang et.al.	2502.15283	null
2025-02-21	A Training-free LLM-based Approach to General Chinese Character Error Correction	Houquan Zhou et.al.	2502.15266	link
2025-02-21	Retrieval-Augmented Speech Recognition Approach for Domain Challenges	Peng Shen et.al.	2502.15264	null
2025-02-21	LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design	Renjie Wei et.al.	2502.15260	null
2025-02-21	An approach for API synthesis using large language models	Hua Zhong et.al.	2502.15246	null
2025-02-21	Comparative Analysis of Large Language Models for Context-Aware Code Completion using SAFIM Framework	Hang Zhang et.al.	2502.15243	null
2025-02-21	From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants	Manisha Mukherjee et.al.	2502.15237	null
2025-02-21	A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation	Shilong Hou et.al.	2502.15233	link
2025-02-21	User Experience with LLM-powered Conversational Recommendation Systems: A Case of Music Recommendation	Sojeong Yun et.al.	2502.15229	null
2025-02-21	Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews	Mengqiao Liu et.al.	2502.15226	link
2025-02-21	Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs	Tingting Chen et.al.	2502.15224	null
2025-02-21	FormalSpecCpp: A Dataset of C++ Formal Specifications created using LLMs	Madhurima Chakraborty et.al.	2502.15217	link
2025-02-21	The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning	Sheila Schoepp et.al.	2502.15214	null
2025-02-21	Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing	Zhilin Wang et.al.	2502.15208	null
2025-02-21	Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis	Yifan Jiang et.al.	2502.15204	link
2025-02-21	TETRIS: Optimal Draft Token Selection for Batch Speculative Decoding	Zhaoxuan Wu et.al.	2502.15197	null
2025-02-21	LEDD: Large Language Model-Empowered Data Discovery in Data Lakes	Qi An et.al.	2502.15182	null
2025-02-21	Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders	Weiqiao Shan et.al.	2502.15178	null
2025-02-21	Methods and Trends in Detecting Generated Images: A Comprehensive Review	Arpan Mahara et.al.	2502.15176	null
2025-02-21	M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment	Chuan Cui et.al.	2502.15167	link
2025-02-21	Extreme Speech Classification in the Era of LLMs: Exploring Open-Source and Proprietary Models	Sarthak Mahajan et.al.	2502.15155	null
2025-02-21	Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems	Tianjie Ju et.al.	2502.15153	link
2025-02-21	Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between Language Models and Human Error Patterns	Naiming Liu et.al.	2502.15140	null
2025-02-21	Chain-of-Rank: Enhancing Large Language Models for Domain-Specific RAG in Edge Device	Juntae Lee et.al.	2502.15134	null
2025-02-21	TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba	Xiuwei Chen et.al.	2502.15130	null
2025-02-20	LUME: LLM Unlearning with Multitask Evaluations	Anil Ramakrishna et.al.	2502.15097	null
2025-02-20	Detecting Student Intent for Chat-Based Intelligent Tutoring Systems	Ella Cutler et.al.	2502.15096	null
2025-02-20	Judging It, Washing It: Scoring and Greenwashing Corporate Climate Disclosures using Large Language Models	Marianne Chuang et.al.	2502.15094	null
2025-02-20	Optimizing Singular Spectrum for Large Language Model Compression	Dengjie Li et.al.	2502.15092	null
2025-02-20	Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans	Masha Fedzechkina et.al.	2502.15090	null
2025-02-20	Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models	Yeonjun In et.al.	2502.15086	link
2025-02-20	LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention	Shang Yang et.al.	2502.14866	link
2025-02-20	Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning	Shuyue Stella Li et.al.	2502.14860	link
2025-02-20	FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling	Weilin Zhao et.al.	2502.14856	null
2025-02-20	Prompt-to-Leaderboard	Evan Frick et.al.	2502.14855	link
2025-02-20	GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks	Jianwen Luo et.al.	2502.14848	link
2025-02-20	Red-Teaming LLM Multi-Agent Systems via Communication Attacks	Pengfei He et.al.	2502.14847	null
2025-02-20	Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation	Yue Yang et.al.	2502.14846	null
2025-02-20	Revealing and Mitigating Over-Attention in Knowledge Editing	Pinzheng Wang et.al.	2502.14838	link
2025-02-20	Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs	Danni Liu et.al.	2502.14830	link
2025-02-20	A Survey of Model Architectures in Information Retrieval	Zhichao Xu et.al.	2502.14822	null
2025-02-20	eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables	Luis Antonio Gutiérrez Guanilo et.al.	2502.14820	null
2025-02-20	Dynamic Low-Rank Sparse Adaptation for Large Language Models	Weizhong Huang et.al.	2502.14816	link
2025-02-20	FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis	Fadillah Maani et.al.	2502.14807	link
2025-02-20	From RAG to Memory: Non-Parametric Continual Learning for Large Language Models	Bernal Jiménez Gutiérrez et.al.	2502.14802	link
2025-02-20	A Multi-Agent Perspective on Modern Information Retrieval	Haya Nachimovsky et.al.	2502.14796	null
2025-02-20	Rapid Word Learning Through Meta In-Context Learning	Wentao Wang et.al.	2502.14791	null
2025-02-20	DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models	Hongji Yang et.al.	2502.14779	null
2025-02-20	SurveyX: Academic Survey Automation via Large Language Models	Xun Liang et.al.	2502.14776	null
2025-02-20	Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective	Weizhong Huang et.al.	2502.14770	null
2025-02-20	Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis	Priyanka Kargupta et.al.	2502.14767	link
2025-02-20	EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations	Haotian Zhai et.al.	2502.14760	link
2025-02-20	On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems	Juraj Vladika et.al.	2502.14759	link
2025-02-20	TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators	Jianling Li et.al.	2502.14752	link
2025-02-20	Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs	Zongxia Li et.al.	2502.14748	null
2025-02-20	Multi-Agent Coordination across Diverse Applications: A Survey	Lijun Sun et.al.	2502.14743	null
2025-02-20	SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines	M-A-P Team et.al.	2502.14739	null
2025-02-20	EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration	Minjie Hong et.al.	2502.14735	null
2025-02-20	WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models	Yifu Chen et.al.	2502.14727	null
2025-02-20	I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search	Zujie Liang et.al.	2502.14693	link
2025-02-20	Bridging the Gap: Transforming Natural Language Questions into SQL Queries via Abstract Query Pattern and Contextual Schema Markup	Yonghui Kong et.al.	2502.14682	null
2025-02-20	How to Get Your LLM to Generate Challenging Problems for Evaluation	Arkil Patel et.al.	2502.14678	link
2025-02-20	Data-Constrained Synthesis of Training Data for De-Identification	Thomas Vakili et.al.	2502.14677	null
2025-02-20	Explanations of Deep Language Models Explain Language Representations in the Brain	Maryam Rahimi et.al.	2502.14671	null
2025-02-20	AlphaMaze: Enhancing Large Language Models’ Spatial Intelligence via GRPO	Alan Dao et.al.	2502.14669	link
2025-02-20	Beyond the Surface: Uncovering Implicit Locations with LLMs for Personalized Local News	Gali Katz et.al.	2502.14660	null
2025-02-20	Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs	Yuchen Wu et.al.	2502.14645	null
2025-02-20	LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning	Yansheng Mao et.al.	2502.14644	null
2025-02-20	Length-Controlled Margin-Based Preference Optimization without Reference Model	Gengxu Li et.al.	2502.14643	link
2025-02-20	ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation	Angxiao Yue et.al.	2502.14637	link
2025-02-20	CER: Confidence Enhanced Reasoning in LLMs	Ali Razghandi et.al.	2502.14634	link
2025-02-20	Augmenting Coaching with GenAI: Insights into Use, Effectiveness, and Future Potential	Jennifer Haase et.al.	2502.14632	null
2025-02-20	Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery	Minh-Quyet Ha et.al.	2502.14631	null
2025-02-20	PEARL: Towards Permutation-Resilient LLMs	Liang Chen et.al.	2502.14628	link
2025-02-20	Reward Models Identify Consistency, Not Causality	Yuhui Xu et.al.	2502.14619	null
2025-02-20	Serving Models, Fast and Slow: Optimizing Heterogeneous LLM Inferencing Workloads at Scale	Shashwat Jaiswal et.al.	2502.14617	null
2025-02-20	FIND: Fine-grained Information Density Guided Adaptive Retrieval-Augmented Generation for Disease Diagnosis	Mingyi Jia et.al.	2502.14614	null
2025-02-20	Behavioral Analysis of Information Salience in Large Language Models	Jan Trienes et.al.	2502.14613	link
2025-02-20	“Don’t Forget the Teachers”: Towards an Educator-Centered Understanding of Harms from Large Language Models in Education	Emma Harvey et.al.	2502.14592	null
2025-02-20	Vision Foundation Models in Medical Image Analysis: Advances and Challenges	Pengchen Liang et.al.	2502.14584	null
2025-02-20	A Theory for Conditional Generative Modeling on Multiple Data Sources	Rongzhen Wang et.al.	2502.14583	link
2025-02-20	ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification	Hyunseok Lee et.al.	2502.14565	null
2025-02-20	Plan-over-Graph: Towards Parallelable LLM Agent Schedule	Shiqi Zhang et.al.	2502.14563	link
2025-02-20	Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs	Paris Koloveas et.al.	2502.14561	link
2025-02-20	Less is More: Improving LLM Alignment via Preference Data Selection	Xun Deng et.al.	2502.14560	null
2025-02-20	Multiscale Byte Language Models – A Hierarchical Architecture for Causal Million-Length Sequence Modeling	Eric Egli et.al.	2502.14553	link
2025-02-20	Position: Graph Learning Will Lose Relevance Due To Poor Benchmarks	Maya Bechler-Speicher et.al.	2502.14546	null
2025-02-20	LLM-based User Profile Management for Recommender System	Seunghwan Bang et.al.	2502.14541	null
2025-02-20	LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization	Yupeng Chang et.al.	2502.14538	link
2025-02-20	CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models	Zhenhong Zhou et.al.	2502.14529	link
2025-02-20	Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation	Austin A. Barr et.al.	2502.14523	link
2025-02-20	Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases	Rena Gao et.al.	2502.14507	link
2025-02-20	How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?	Sergey Pletenev et.al.	2502.14502	link
2025-02-20	MLGym: A New Framework and Benchmark for Advancing AI Research Agents	Deepak Nathani et.al.	2502.14499	null
2025-02-20	StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following	Jinnan Li et.al.	2502.14494	link
2025-02-20	How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation	Zhuohang Long et.al.	2502.14486	null
2025-02-20	NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models	Chenlu Guo et.al.	2502.14482	link
2025-02-20	Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression	Haoyu Wang et.al.	2502.14477	null
2025-02-20	Argument-Based Comparative Question Answering Evaluation Benchmark	Irina Nikishina et.al.	2502.14476	null
2025-02-20	Enhancing Smart Environments with Context-Aware Chatbots using Large Language Models	Aurora Polo-Rodríguez et.al.	2502.14469	null
2025-02-20	Narrative-Driven Travel Planning: Geoculturally-Grounded Script Generation with Evolutionary Itinerary Optimization	Ran Ding et.al.	2502.14456	link
2025-02-20	Optimal word order for non-causal text generation with Large Language Models: the Spanish case	Andrea Busto-Castiñeira et.al.	2502.14451	null
2025-02-20	LLM4FaaS: No-Code Application Development using LLMs and FaaS	Minghe Wang et.al.	2502.14450	null
2025-02-20	PredictaBoard: Benchmarking LLM Score Predictability	Lorenzo Pacchiardi et.al.	2502.14445	link
2025-02-20	Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models	Artem Vazhentsev et.al.	2502.14427	link
2025-02-20	A Survey on Data Contamination for Large Language Models	Yuxing Cheng et.al.	2502.14425	link
2025-02-20	ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model	Zhongyi Zhou et.al.	2502.14420	link
2025-02-20	Towards Efficient Automatic Self-Pruning of Large Language Models	Weizhong Huang et.al.	2502.14413	null
2025-02-20	Evaluating Precise Geolocation Inference Capabilities of Vision Language Models	Neel Jay et.al.	2502.14412	link
2025-02-20	Unstructured Evidence Attribution for Long Context Query Focused Summarization	Dustin Wright et.al.	2502.14409	link
2025-02-20	HPS: Hard Preference Sampling for Human Preference Alignment	Xiandong Zou et.al.	2502.14400	null
2025-02-20	Enhancing Portuguese Variety Identification with Cross-Domain Approaches	Hugo Sousa et.al.	2502.14394	null
2025-02-20	Leveraging Small LLMs for Argument Mining in Education: Argument Component Identification, Classification, and Assessment	Lucile Favero et.al.	2502.14389	null
2025-02-20	S*: Test Time Scaling for Code Generation	Dacheng Li et.al.	2502.14382	link
2025-02-20	PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization	Xinpeng Shou et.al.	2502.14370	null
2025-02-20	Entropy-UID: A Method for Optimizing Information Density	Xinpeng Shou et.al.	2502.14366	null
2025-02-20	Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning	Jiachen Zhu et.al.	2502.14361	null
2025-02-20	SR-LLM: Rethinking the Structured Representation in Large Language Model	Jiahuan Zhang et.al.	2502.14352	null
2025-02-20	SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images	Yichi Zhang et.al.	2502.14351	link
2025-02-20	FlowAgent: Achieving Compliance and Flexibility for Workflow Agents	Yuchen Shi et.al.	2502.14345	link
2025-02-20	Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective	Ruichen Shao et.al.	2502.14340	link
2025-02-20	A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics	Ting-Ruen Wei et.al.	2502.14333	null
2025-02-20	SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation	Junjie Sheng et.al.	2502.14328	null
2025-02-20	ChemHTS: Hierarchical Tool Stacking for Enhancing Chemical Agents	Zhucong Li et.al.	2502.14327	link
2025-02-20	Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems	Bingyu Yan et.al.	2502.14321	null
2025-02-20	Line Goes Up? Inherent Limitations of Benchmarks for Evaluating Large Language Models	James Fodor et.al.	2502.14318	null
2025-02-20	ParallelComp: Parallel Long-Context Compressor for Length Extrapolation	Jing Xiong et.al.	2502.14317	null
2025-02-20	Unveiling Cultural Blind Spots: Analyzing the Limitations of mLLMs in Procedural Text Comprehension	Amir Hossein Yari et.al.	2502.14315	null
2025-02-20	Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications	Kayhan Behdin et.al.	2502.14305	null
2025-02-20	MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models	Shrey Pandit et.al.	2502.14302	null
2025-02-20	SEA-HELM: Southeast Asian Holistic Evaluation of Language Models	Yosephine Susanto et.al.	2502.14301	null
2025-02-19	Where’s the Bug? Attention Probing for Scalable Fault Localization	Adam Stein et.al.	2502.13966	null
2025-02-19	Autellix: An Efficient Serving Engine for LLM Agents as General Programs	Michael Luo et.al.	2502.13965	null
2025-02-19	MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads	Weihao Liu et.al.	2502.13963	link
2025-02-19	Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering	William Jurayj et.al.	2502.13962	null
2025-02-19	LIDDIA: Language-based Intelligent Drug Discovery Agent	Reza Averly et.al.	2502.13959	null
2025-02-19	Neurosymbolic artificial intelligence via large language models and coherence-driven inference	Steve Huntsman et.al.	2502.13953	null
2025-02-19	Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region	Chak Tou Leong et.al.	2502.13946	null
2025-02-19	Image compositing is all you need for data augmentation	Ang Jia Ning Shermaine et.al.	2502.13936	null
2025-02-19	LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization	Guanzheng Chen et.al.	2502.13922	link
2025-02-19	Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis	Jiahao Gai et.al.	2502.13921	null
2025-02-19	Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health	Xingbo Wang et.al.	2502.13920	link
2025-02-19	How Do LLMs Perform Two-Hop Reasoning in Context?	Tianyu Guo et.al.	2502.13913	null
2025-02-19	Lost in Sequence: Do Large Language Models Understand Sequential Recommendation?	Sein Kim et.al.	2502.13909	link
2025-02-19	Judging the Judges: A Collection of LLM-Generated Relevance Judgements	Hossein A. Rahmani et.al.	2502.13908	link
2025-02-19	DataSciBench: An LLM Agent Benchmark for Data Science	Dan Zhang et.al.	2502.13897	link
2025-02-19	NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants	Yiran Qin et.al.	2502.13894	null
2025-02-19	Refining embeddings with fill-tuning: data-efficient generalised performance improvements for materials foundation models	Matthew P. Wilson et.al.	2502.13886	link
2025-02-19	SPEX: Scaling Feature Interaction Explanations for LLMs	Justin Singh Kang et.al.	2502.13870	link
2025-02-19	MagicGeo: Training-Free Text-Guided Geometric Diagram Generation	Junxiao Wang et.al.	2502.13855	null
2025-02-19	Enhancing LLM-Based Recommendations Through Personalized Reasoning	Jiahao Liu et.al.	2502.13845	link
2025-02-19	Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents	Jiahao Liu et.al.	2502.13843	link
2025-02-19	Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking	Yilong Chen et.al.	2502.13842	null
2025-02-19	Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models	Peter Carragher et.al.	2502.13836	null
2025-02-19	Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning	Zenan Li et.al.	2502.13834	link
2025-02-19	ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities	Chanjin Zheng et.al.	2502.13832	link
2025-02-19	LESA: Learnable LLM Layer Scaling-Up	Yifei Yang et.al.	2502.13794	link
2025-02-19	From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions	Nathanaël Carraz Rakotonirina et.al.	2502.13791	link
2025-02-19	From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education	Yi-Fan Zhang et.al.	2502.13789	null
2025-02-19	Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics	Matthew Wood et.al.	2502.13785	link
2025-02-19	Generative Large Recommendation Models: Emerging Trends in LLMs for Recommendation	Hao Wang et.al.	2502.13783	null
2025-02-19	Translation in the Hands of Many: Centering Lay Users in Machine Translation Interactions	Beatrice Savoldi et.al.	2502.13780	null
2025-02-19	VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare	Anudeex Shetty et.al.	2502.13775	null
2025-02-19	AI Software Engineer: Programming with Trust	Abhik Roychoudhury et.al.	2502.13767	null
2025-02-19	SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning	Renxi Wang et.al.	2502.13753	link
2025-02-19	Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions	Xinwei Shen et.al.	2502.13747	null
2025-02-19	Enhancing Input-Label Mapping in In-Context Learning with Contrastive Decoding	Keqin Peng et.al.	2502.13738	null
2025-02-19	CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models	Nikolaos Dionelis et.al.	2502.13734	null
2025-02-19	Adapting Large Language Models for Time Series Modeling via a Novel Parameter-efficient Adaptation Method	Juyuan Zhang et.al.	2502.13725	null
2025-02-19	Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values	Hongbo Zhang et.al.	2502.13723	null
2025-02-19	TALKPLAY: Multimodal Music Recommendation with Large Language Models	Seungheon Doh et.al.	2502.13713	null
2025-02-19	Is This Collection Worth My LLM’s Time? Automatically Measuring Information Potential in Text Corpora	Tristan Karch et.al.	2502.13691	null
2025-02-19	An LLM-based Agent for Reliable Docker Environment Configuration	Ruida Hu et.al.	2502.13681	link
2025-02-19	SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation	Song Duong et.al.	2502.13674	null
2025-02-19	Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models	Liyang He et.al.	2502.13656	link
2025-02-19	C2T: A Classifier-Based Tree Construction Method in Speculative Decoding	Feiye Huo et.al.	2502.13652	null
2025-02-19	Reliability Across Parametric and External Knowledge: Understanding Knowledge Handling in LLMs	Youna Kim et.al.	2502.13648	null
2025-02-19	D.Va: Validate Your Demonstration First Before You Use It	Qi Zhang et.al.	2502.13646	null
2025-02-19	Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts	Maiya Goloburda et.al.	2502.13640	null
2025-02-19	Concept Layers: Enhancing Interpretability and Intervenability via LLM Conceptualization	Or Raphael Bidusa et.al.	2502.13632	null
2025-02-19	AI-Empowered Catalyst Discovery: A Survey from Classical Machine Learning Approaches to Large Language Models	Yuanyuan Xu et.al.	2502.13626	null
2025-02-19	REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models	DongGeon Lee et.al.	2502.13622	null
2025-02-19	Complex Ontology Matching with Large Language Model Embeddings	Guilherme Sousa et.al.	2502.13619	null
2025-02-19	LaVCa: LLM-assisted Visual Cortex Captioning	Takuya Matsuyama et.al.	2502.13606	null
2025-02-19	BeamLoRA: Beam-Constraint Low-Rank Adaptation	Naibin Gu et.al.	2502.13604	null
2025-02-19	MMTEB: Massive Multilingual Text Embedding Benchmark	Kenneth Enevoldsen et.al.	2502.13595	link
2025-02-19	Don’t Stop the Multi-Party! On Generating Synthetic Multi-Party Conversations with Constraints	Nicolò Penzo et.al.	2502.13592	link
2025-02-19	Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts	Xin Li et.al.	2502.13577	null
2025-02-19	LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation	Xin Li et.al.	2502.13568	null
2025-02-19	Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs	Joonatan Laato et.al.	2502.13566	null
2025-02-19	PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models	Guangwei Li et.al.	2502.13564	link
2025-02-19	Are Large Language Models In-Context Graph Learners?	Jintang Li et.al.	2502.13562	null
2025-02-19	Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs	Yushi Feng et.al.	2502.13555	link
2025-02-19	STaR-SQL: Self-Taught Reasoner for Text-to-SQL	Mingqian He et.al.	2502.13550	null
2025-02-19	Detecting Linguistic Bias in Government Documents Using Large Language Models	Milena de Swart et.al.	2502.13548	null
2025-02-19	From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN	Peiwen Yuan et.al.	2502.13544	null
2025-02-19	Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference	Qingfa Xiao et.al.	2502.13542	null
2025-02-19	Bursting Filter Bubble: Enhancing Serendipity Recommendations with Aligned Large Language Models	Yunjia Xi et.al.	2502.13539	null
2025-02-19	Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Jun Zhang et.al.	2502.13533	link
2025-02-19	Exploiting Prefix-Tree in Structured Output Interfaces for Enhancing Jailbreak Attacking	Yanzeng Li et.al.	2502.13527	link
2025-02-19	SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin	Hao Yi et.al.	2502.13516	null
2025-02-19	Unlocking Multimodal Integration in EHRs: A Prompt Learning Framework for Language and Time Series Fusion	Shuai Niu et.al.	2502.13509	null
2025-02-19	Reproducing NevIR: Negation in Neural Information Retrieval	Coen van Elsen et.al.	2502.13506	link
2025-02-19	PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference	Burc Gokden et.al.	2502.13502	link
2025-02-19	Towards Geo-Culturally Grounded LLM Generations	Piyawat Lertvittayakumjorn et.al.	2502.13497	null
2025-02-19	What are Models Thinking about? Understanding Large Language Model Hallucinations “Psychology” through Model Inner State Analysis	Peiran Wang et.al.	2502.13490	null
2025-02-19	LLM4Tag: Automatic Tagging System for Information Retrieval via Large Language Models	Ruiming Tang et.al.	2502.13481	null
2025-02-19	Integration of Agentic AI with 6G Networks for Mission-Critical Applications: Use-case and Challenges	Sunder Ali Khowaja et.al.	2502.13476	null
2025-02-19	LLM should think and action as a human	Haun Leung et.al.	2502.13475	null
2025-02-19	Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models	Chenyu Zhu et.al.	2502.13474	null
2025-02-19	ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails	Xiaofei Wen et.al.	2502.13458	link
2025-02-19	Interleaved Gibbs Diffusion for Constrained Generation	Gautham Govind Anil et.al.	2502.13450	null
2025-02-19	Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning	Yang Yan et.al.	2502.13447	null
2025-02-19	TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation	Jialin Ouyang et.al.	2502.13442	link
2025-02-19	The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding?	Yutao Sun et.al.	2502.13441	null
2025-02-19	MATS: An Audio Language Model under Text-only Supervision	Wen Wang et.al.	2502.13433	null
2025-02-19	Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning	Hao Ma et.al.	2502.13430	null
2025-02-19	MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering	Guanming Xiong et.al.	2502.13428	null
2025-02-19	TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition	Yuxiang Wang et.al.	2502.13422	null
2025-02-19	RLTHF: Targeted Human Feedback for LLM Alignment	Yifei Xu et.al.	2502.13417	null
2025-02-19	Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning	Ningke Li et.al.	2502.13416	null
2025-02-19	Explore-Construct-Filter: An Automated Framework for Rich and Reliable API Knowledge Graph Construction	Yanbang Sun et.al.	2502.13412	null
2025-02-19	Generative Predictive Control: Flow Matching Policies for Dynamic and Difficult-to-Demonstrate Tasks	Vince Kurtz et.al.	2502.13406	null
2025-02-19	$\mathtt{GeLLM^3O}$: Generalizing Large Language Models for Multi-property Molecule Optimization	Vishal Dey et.al.	2502.13398	link
2025-02-19	Prompting a Weighting Mechanism into LLM-as-a-Judge in Two-Step: A Case Study	Wenwen Xie et.al.	2502.13396	null
2025-02-19	Flow-based generative models as iterative algorithms in probability space	Yao Xie et.al.	2502.13394	null
2025-02-19	Reasoning with Reinforced Functional Token Tuning	Kongcheng Zhang et.al.	2502.13389	link
2025-02-19	Reflection of Episodes: Learning to Play Game from Expert and Self Experiences	Xiaojie Xu et.al.	2502.13388	null
2025-02-19	MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification	Linzhuang Sun et.al.	2502.13383	link
2025-02-19	AutoTEE: Automated Migration and Protection of Programs in Trusted Execution Environments	Ruidong Han et.al.	2502.13379	link
2025-02-19	Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor	Barys Liskavets et.al.	2502.13374	null
2025-02-18	Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization	Shuo Xing et.al.	2502.13146	link
2025-02-18	Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Bencheng Liao et.al.	2502.13145	link
2025-02-18	Pre-training Auto-regressive Robotic Models with 4D Representations	Dantong Niu et.al.	2502.13142	null
2025-02-18	UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models	Huawei Lin et.al.	2502.13141	link
2025-02-18	AIDE: AI-Driven Exploration in the Space of Code	Zhengyao Jiang et.al.	2502.13138	link
2025-02-18	Theorem Prover as a Judge for Synthetic Data Generation	Joshua Ong Jun Leang et.al.	2502.13137	null
2025-02-18	AV-Flow: Transforming Text to Audio-Visual Human-like Interactions	Aggelina Chatziagapi et.al.	2502.13133	null
2025-02-18	Learning to Defer for Causal Discovery with Imperfect Experts	Oscar Clivio et.al.	2502.13132	null
2025-02-18	Rethinking Diverse Human Preference Learning through Principal Component Analysis	Feng Luo et.al.	2502.13131	null
2025-02-18	Magma: A Foundation Model for Multimodal AI Agents	Jianwei Yang et.al.	2502.13130	link
2025-02-18	Is Noise Conditioning Necessary for Denoising Generative Models?	Qiao Sun et.al.	2502.13129	null
2025-02-18	Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning	Jingyang Lin et.al.	2502.13127	null
2025-02-18	RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises	Zenan Zhai et.al.	2502.13125	link
2025-02-18	Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context	Marion Bartl et.al.	2502.13120	null
2025-02-18	STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models	Narun Raman et.al.	2502.13119	null
2025-02-18	Performance Evaluation of Large Language Models in Statistical Programming	Xinyi Song et.al.	2502.13117	link
2025-02-18	MatterChat: A Multi-Modal LLM for Material Science	Yingheng Tang et.al.	2502.13107	null
2025-02-18	Text2World: Benchmarking Large Language Models for Symbolic World Model Generation	Mengkang Hu et.al.	2502.13092	null
2025-02-18	A Neural Difference-of-Entropies Estimator for Mutual Information	Haoran Ni et.al.	2502.13085	null
2025-02-18	Personalized Image Generation with Deep Generative Models: A Decade Survey	Yuxiang Wei et.al.	2502.13081	link
2025-02-18	SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models	Xianfu Cheng et.al.	2502.13059	null
2025-02-18	LAMD: Context-driven Android Malware Detection and Classification with LLMs	Xingzhi Qian et.al.	2502.13055	null
2025-02-18	Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction	Nils Constantin Hellwig et.al.	2502.13044	null
2025-02-18	HPSS: Heuristic Prompting Strategy Search for LLM Evaluators	Bosi Wen et.al.	2502.13031	null
2025-02-18	A deep learning framework for efficient pathology image analysis	Peter Neidlinger et.al.	2502.13027	null
2025-02-18	Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks	Markus J. Buehler et.al.	2502.13025	link
2025-02-18	Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation	Sha Li et.al.	2502.13019	null
2025-02-18	Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents	Chaoran Chen et.al.	2502.13012	null
2025-02-18	Adaptive Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge	Mohammad Reza Rezaei et.al.	2502.13010	null
2025-02-18	You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations	Frederic Kirstein et.al.	2502.13001	null
2025-02-18	Personalized Top-k Set Queries Over Predicted Scores	Sohrab Namazi Nia et.al.	2502.12998	null
2025-02-18	Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs	Zixiao Wang et.al.	2502.12988	null
2025-02-18	Towards Variational Flow Matching on General Geometries	Olga Zaghen et.al.	2502.12981	null
2025-02-18	Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search	Yifan Ji et.al.	2502.12974	link
2025-02-18	Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking	Junda Zhu et.al.	2502.12970	link
2025-02-18	Trust Me, I’m Wrong: High-Certainty Hallucinations in LLMs	Adi Simhi et.al.	2502.12964	null
2025-02-18	Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing	Xiaoju Ye et.al.	2502.12962	null
2025-02-18	Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger	Wenjun Li et.al.	2502.12961	null
2025-02-18	Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression	Jaemoon Lee et.al.	2502.12951	null
2025-02-18	Fake It Till You Make It: Using Synthetic Data and Domain Knowledge for Improved Text-Based Learning for LGE Detection	Athira J Jacob et.al.	2502.12948	null
2025-02-18	Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models	Gyeongman Kim et.al.	2502.12947	null
2025-02-18	LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation	Junchen Fu et.al.	2502.12945	null
2025-02-18	Performance of Zero-Shot Time Series Foundation Models on Cloud Data	William Toner et.al.	2502.12944	link
2025-02-18	Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options	Lakshmi Nair et.al.	2502.12929	link
2025-02-18	Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts	Leiyu Pan et.al.	2502.12928	null
2025-02-18	SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems	Mike Zhang et.al.	2502.12927	link
2025-02-18	Towards more Contextual Agents: An extractor-Generator Optimization Framework	Mourad Aouini et.al.	2502.12926	null
2025-02-18	Keep what you need: extracting efficient subnetworks from large audio representation models	David Genova et.al.	2502.12925	link
2025-02-18	Conditioning LLMs to Generate Code-Switched Text: A Methodology Grounded in Naturally Occurring Data	Maite Heredia et.al.	2502.12924	link
2025-02-18	On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation	Rune Birkmose et.al.	2502.12923	link
2025-02-18	Q-STRUM Debate: Query-Driven Contrastive Summarization for Recommendation Comparison	George-Kirollos Saad et.al.	2502.12921	link
2025-02-18	Lightweight Online Adaption for Time Series Foundation Model Forecasts	Thomas L. Lee et.al.	2502.12920	null
2025-02-18	GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning	Sifan Zhou et.al.	2502.12913	null
2025-02-18	Probabilistic neural operators for functional uncertainty quantification	Christopher Bülte et.al.	2502.12902	link
2025-02-18	Soundwave: Less is More for Speech-Text Alignment in LLMs	Yuhao Zhang et.al.	2502.12900	link
2025-02-18	Multilingual European Language Models: Benchmarking Approaches and Challenges	Fabio Barth et.al.	2502.12895	null
2025-02-18	CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image	Kaixin Yao et.al.	2502.12894	null
2025-02-18	Are Multilingual Language Models an Off-ramp for Under-resourced Languages? Will we arrive at Digital Language Equality in Europe in 2030?	Georg Rehm et.al.	2502.12886	null
2025-02-18	How desirable is alignment between LLMs and linguistically diverse human users?	Pia Knoeferle et.al.	2502.12884	null
2025-02-18	Continuous Learning Conversational AI: A Personalized Agent Framework via A2C Reinforcement Learning	Nandakishor M et.al.	2502.12876	null
2025-02-18	RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution	Emmanuel K. Raptis et.al.	2502.12862	link
2025-02-18	PAFT: Prompt-Agnostic Fine-Tuning	Chenxing Wei et.al.	2502.12859	null
2025-02-18	Rejected Dialects: Biases Against African American Language in Reward Models	Joel Mire et.al.	2502.12858	link
2025-02-18	MeMo: Towards Language Models with Associative Memory Mechanisms	Fabio Massimo Zanzotto et.al.	2502.12851	null
2025-02-18	MOLLM: Multi-Objective Large Language Model for Molecular Design – Optimizing with Experts	Nian Ran et.al.	2502.12845	null
2025-02-18	Towards Adaptive Feedback with AI: Comparing the Feedback Quality of LLMs and Teachers on Experimentation Protocols	Kathrin Seßler et.al.	2502.12842	null
2025-02-18	Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing	Berk Yilmaz et.al.	2502.12838	null
2025-02-18	An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation	Mohammad Feli et.al.	2502.12836	null
2025-02-18	KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan	Mukhammed Togmanov et.al.	2502.12829	null
2025-02-18	Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models	Rubing Lu et.al.	2502.12825	null
2025-02-18	Pitfalls of Scale: Investigating the Inverse Task of Redefinition in Large Language Models	Elena Stringli et.al.	2502.12821	null
2025-02-18	Simulating User Diversity in Task-Oriented Dialogue Systems using Large Language Models	Adnan Ahmad et.al.	2502.12813	null
2025-02-18	Towards Text-Image Interleaved Retrieval	Xin Zhang et.al.	2502.12799	link
2025-02-18	RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models	Tanqiu Jiang et.al.	2502.12794	link
2025-02-18	Commonsense Reasoning in Arab Culture	Abdelrahman Sadallah et.al.	2502.12788	null
2025-02-18	Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models	Daiki Chijiwa et.al.	2502.12776	null
2025-02-18	How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild	Saad Obaid ul Islam et.al.	2502.12769	link
2025-02-18	R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs	Sumin Jo et.al.	2502.12767	link
2025-02-18	One-bit Compressed Sensing using Generative Models	Swatantra Kafle et.al.	2502.12762	null
2025-02-18	Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models	Kamer Ali Yuksel et.al.	2502.12755	link
2025-02-18	Architect of the Bits World: Masked Autoregressive Modeling for Circuit Generation Guided by Truth Table	Haoyuan Wu et.al.	2502.12751	null
2025-02-18	Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation	Yong Zhang et.al.	2502.12744	null
2025-02-18	“I know myself better, but not really greatly”: Using LLMs to Detect and Explain LLM-Generated Texts	Jiazhou Ji et.al.	2502.12743	null
2025-02-18	Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment	Haoyuan Wu et.al.	2502.12732	null
2025-02-18	TREND: A Whitespace Replacement Information Hiding Method	Malte Hellmeier et.al.	2502.12710	null
2025-02-18	Multi-Novelty: Improve the Diversity and Novelty of Contents Generated by Large Language Models via inference-time Multi-Views Brainstorming	Arash Lagzian et.al.	2502.12700	null
2025-02-18	Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees	Yongtao Wu et.al.	2502.12678	null
2025-02-18	Baichuan-M1: Pushing the Medical Capability of Large Language Models	Bingning Wang et.al.	2502.12671	null
2025-02-18	Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research	Xiang Liu et.al.	2502.12669	null
2025-02-18	Evaluation of Best-of-N Sampling Strategies for Language Model Alignment	Yuki Ichihara et.al.	2502.12668	null
2025-02-18	A$^2$ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization	Junhui He et.al.	2502.12665	null
2025-02-18	Demystifying Multilingual Chain-of-Thought in Process Reward Modeling	Weixuan Wang et.al.	2502.12663	null
2025-02-18	The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1	Kaiwen Zhou et.al.	2502.12659	null
2025-02-18	R.R.: Unveiling LLM Training Privacy through Recollection and Ranking	Wenlong Meng et.al.	2502.12658	link
2025-02-18	NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation	Zhiyuan Liu et.al.	2502.12638	link
2025-02-18	Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning	Yunhao Gou et.al.	2502.12635	null
2025-02-18	One Size doesn’t Fit All: A Personalized Conversational Tutoring Agent for Mathematics Instruction	Ben Liu et.al.	2502.12633	null
2025-02-18	Automating Prompt Leakage Attacks on Large Language Models Using Agentic Approach	Tvrtko Sternak et.al.	2502.12630	link
2025-02-18	DeepResonance: Enhancing Multimodal Music Understanding via Music-centric Multi-way Instruction Tuning	Zhuoyuan Mao et.al.	2502.12623	null
2025-02-18	Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions	Leonardo Ranaldi et.al.	2502.12616	null
2025-02-17	Idiosyncrasies in Large Language Models	Mingjie Sun et.al.	2502.12150	link
2025-02-17	HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation	Ling Yang et.al.	2502.12148	link
2025-02-17	Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control	Jinyan Su et.al.	2502.12145	link
2025-02-17	Small Models Struggle to Learn from Strong Reasoners	Yuetai Li et.al.	2502.12143	null
2025-02-17	SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs	Yige Xu et.al.	2502.12134	link
2025-02-17	Transformer Dynamics: A neuroscientific approach to interpretability of large language models	Jesseba Fernando et.al.	2502.12131	null
2025-02-17	Scaling Autonomous Agents via Automatic Reward Modeling And Planning	Zhenfang Chen et.al.	2502.12130	null
2025-02-17	LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities	Florian Sestak et.al.	2502.12128	link
2025-02-17	Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA	Patryk Marszałek et.al.	2502.12122	link
2025-02-17	LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws	Prasanna Mayilvahanan et.al.	2502.12120	null
2025-02-17	PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection	Jinhe Bi et.al.	2502.12119	null
2025-02-17	A-MEM: Agentic Memory for LLM Agents	Wujiang Xu et.al.	2502.12110	link
2025-02-17	Personality Structured Interview for Large Language Model Simulation in Personality Research	Pengda Wang et.al.	2502.12109	null
2025-02-17	Relational Norms for Human-AI Cooperation	Brian D. Earp et.al.	2502.12102	null
2025-02-17	Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications	Li Qiao et.al.	2502.12096	null
2025-02-17	How compositional generalization and creativity improve as diffusion models are trained	Alessandro Favero et.al.	2502.12089	null
2025-02-17	Meta-Statistical Learning: Supervised Learning of Statistical Inference	Maxime Peyrard et.al.	2502.12088	null
2025-02-17	APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs	Yuxiang Huang et.al.	2502.12085	link
2025-02-17	Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation	Zhongyi Qiu et.al.	2502.12073	null
2025-02-17	TokenSkip: Controllable Chain-of-Thought Compression in LLMs	Heming Xia et.al.	2502.12067	link
2025-02-17	CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models	Yifan Zhang et.al.	2502.12066	null
2025-02-17	AI-generated Text Detection with a GLTR-based Approach	Lucía Yan Wu et.al.	2502.12064	null
2025-02-17	Designing Role Vectors to Improve LLM Inference Behaviour	Daniele Potertì et.al.	2502.12055	null
2025-02-17	PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning	Xinyu Zhang et.al.	2502.12054	null
2025-02-17	A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond	Shreya Shukla et.al.	2502.12048	null
2025-02-17	KnowPath: Knowledge-enhanced Reasoning via LLM-generated Inference Paths over Knowledge Graphs	Qi Zhao et.al.	2502.12029	null
2025-02-17	SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities	Fengqing Jiang et.al.	2502.12025	null
2025-02-17	Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving	Xin Xu et.al.	2502.12022	null
2025-02-17	Atom of Thoughts for Markov LLM Test-Time Scaling	Fengwei Teng et.al.	2502.12018	link
2025-02-17	Unsupervised Structural-Counterfactual Generation under Domain Shift	Krishn Vishwas Kher et.al.	2502.12013	null
2025-02-17	Design Considerations Based on Stability for a Class of TCP Algorithms	Sreekanth Prabhakar et.al.	2502.11983	null
2025-02-17	Image Inversion: A Survey from GANs to Diffusion and Beyond	Yinan Chen et.al.	2502.11974	link
2025-02-17	Generating Text from Uniform Meaning Representation	Emma Markle et.al.	2502.11973	link
2025-02-17	A MIMO Wireless Channel Foundation Model via CIR-CSI Consistency	Jun Jiang et.al.	2502.11965	null
2025-02-17	Navigating the Helpfulness-Truthfulness Trade-Off with Uncertainty-Aware Instruction Fine-Tuning	Tianyi Wu et.al.	2502.11962	null
2025-02-17	On Representational Dissociation of Language and Arithmetic in Large Language Models	Riku Kisako et.al.	2502.11932	null
2025-02-17	GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs	Yi Fang et.al.	2502.11925	null
2025-02-17	From Text to Trust: Empowering AI-assisted Decision Making with Adaptive LLM-powered Analysis	Zhuoyan Li et.al.	2502.11919	null
2025-02-17	EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models	Jiamin Su et.al.	2502.11916	link
2025-02-17	Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives	Leo Schwinn et.al.	2502.11910	null
2025-02-17	MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation	Haochen Xue et.al.	2502.11903	null
2025-02-17	DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation	Zhihang Yuan et.al.	2502.11897	link
2025-02-17	CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning	Yanxiao Zhao et.al.	2502.11896	null
2025-02-17	Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models?	Jacob Nielsen et.al.	2502.11895	null
2025-02-17	Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration	Shao Zhang et.al.	2502.11882	link
2025-02-17	Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models	Hyunwoo Kim et.al.	2502.11881	null
2025-02-17	Bitnet.cpp: Efficient Edge Inference for Ternary LLMs	Jinheng Wang et.al.	2502.11880	link
2025-02-17	JoLT: Joint Probabilistic Predictions on Tabular Data Using LLMs	Aliaksandra Shysheya et.al.	2502.11877	link
2025-02-17	FedEAT: A Robustness Optimization Framework for Federated LLMs	Yahao Pang et.al.	2502.11863	null
2025-02-17	Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu	Renhao Pei et.al.	2502.11862	link
2025-02-17	Exploring Large Language Models in Healthcare: Insights into Corpora Sources, Customization Strategies, and Evaluation Metrics	Shuqi Yang et.al.	2502.11861	null
2025-02-17	StructTransform: A Scalable Attack Surface for Safety-Aligned Large Language Models	Shehel Yoosuf et.al.	2502.11853	link
2025-02-17	BaxBench: Can LLMs Generate Correct and Secure Backends?	Mark Vero et.al.	2502.11844	null
2025-02-17	Can LLM Agents Maintain a Persona in Discourse?	Pranav Bhandari et.al.	2502.11843	null
2025-02-17	Model Generalization on Text Attribute Graphs: Principles with Large Language Models	Haoyu Wang et.al.	2502.11836	link
2025-02-17	HAAN: A Holistic Approach for Accelerating Normalization Operations in Large Language Models	Tianfan Peng et.al.	2502.11832	null
2025-02-17	Intuitive physics understanding emerges from self-supervised pretraining on natural videos	Quentin Garrido et.al.	2502.11831	link
2025-02-17	Text Classification in the LLM Era - Where do we stand?	Sowmya Vajjala et.al.	2502.11830	null
2025-02-17	Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities	Hanbin Wang et.al.	2502.11829	link
2025-02-17	M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis	Chengyan Wu et.al.	2502.11824	link
2025-02-17	Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis	Xu Wang et.al.	2502.11812	null
2025-02-17	FineFilter: A Fine-grained Noise Filtering Mechanism for Retrieval-Augmented Large Language Models	Qianchi Zhang et.al.	2502.11811	null
2025-02-17	Exploring Translation Mechanism of Large Language Models	Hongbin Zhang et.al.	2502.11806	null
2025-02-17	Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning	Peiying Yu et.al.	2502.11799	link
2025-02-17	Personality Editing for Language Models through Relevant Knowledge Editing	Seojin Hwang et.al.	2502.11789	null
2025-02-17	Efficient Response Generation Method Selection for Fine-Tuning Large Language Models	Xuan Ren et.al.	2502.11779	null
2025-02-17	video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model	Guangzhi Sun et.al.	2502.11775	link
2025-02-17	The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It	Leonardo Bertolazzi et.al.	2502.11771	link
2025-02-17	Cognitive-Aligned Document Selection for Retrieval-augmented Generation	Bingyu Wan et.al.	2502.11770	null
2025-02-17	From Selection to Generation: A Survey of LLM-based Active Learning	Yu Xia et.al.	2502.11767	null
2025-02-17	Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation	Zengkui Sun et.al.	2502.11766	link
2025-02-17	HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims	Michiel van der Meer et.al.	2502.11753	null
2025-02-17	Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning	Yuqi Pang et.al.	2502.11751	link
2025-02-17	ILIAS: Instance-Level Image retrieval At Scale	Giorgos Kordopatis-Zilos et.al.	2502.11748	null
2025-02-17	SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL	Shuai Lyu et.al.	2502.11741	link
2025-02-17	ReviewEval: An Evaluation Framework for AI-Generated Reviews	Chavvi Kirtani et.al.	2502.11736	null
2025-02-17	Plant in Cupboard, Orange on Table, Book on Shelf. Benchmarking Practical Reasoning and Situation Modelling in a Text-Simulated Situated Environment	Jonathan Jordan et.al.	2502.11733	null
2025-02-17	Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption	Alireza Nik et.al.	2502.11723	null
2025-02-17	Enhancing Recommendation Explanations through User-Centric Refinement	Jingsen Zhang et.al.	2502.11721	null
2025-02-17	Can you pass that tool?: Implications of Indirect Speech in Physical Human-Robot Collaboration	Yan Zhang et.al.	2502.11720	null
2025-02-17	Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection	Xuan Tong et.al.	2502.11712	null
2025-02-17	Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models	Sherzod Hakimov et.al.	2502.11707	null
2025-02-17	LLM Agents Making Agent Tools	Georg Wölflein et.al.	2502.11705	link
2025-02-17	CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation	Guangya Yu et.al.	2502.11703	null
2025-02-17	MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow	Hanzhuo Huang et.al.	2502.11697	null
2025-02-17	Improve LLM-as-a-Judge Ability as a General Ability	Jiachen Yu et.al.	2502.11689	null
2025-02-17	MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task	Yuchen Yan et.al.	2502.11684	null
2025-02-17	RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars	Yuncheng Hua et.al.	2502.11681	link
2025-02-17	Exploring LLM-based Student Simulation for Metacognitive Cultivation	Haoxuan Li et.al.	2502.11678	null
2025-02-17	Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception	Shiyu Ni et.al.	2502.11677	null
2025-02-17	Diversity-Oriented Data Augmentation with Large Language Models	Zaitian Wang et.al.	2502.11671	null
2025-02-17	VRoPE: Rotary Position Embedding for Video Large Language Models	Zikang Liu et.al.	2502.11664	link
2025-02-17	An Innovative Brain-Computer Interface Interaction System Based on the Large Language Model	Jing Jina et.al.	2502.11659	null
2025-02-17	Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation	Amin Qasmi et.al.	2502.11649	null
2025-02-17	DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing	Yi Wang et.al.	2502.11647	null
2025-02-17	Hyperspherical Energy Transformer with Recurrent Depth	Yunzhe Hu et.al.	2502.11646	null
2025-02-17	Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI	Yuxia Wang et.al.	2502.11614	null
2025-02-17	Maximum Entropy Reinforcement Learning with Diffusion Policy	Xiaoyi Dong et.al.	2502.11612	link
2025-02-17	Accuracy Assessment of OpenAlex and Clarivate Scholar ID with an LLM-Assisted Benchmark	Renyu Zhao et.al.	2502.11610	null
2025-02-17	GraphThought: Graph Combinatorial Optimization with Thought Generation	Zixiao Huang et.al.	2502.11607	null
2025-02-14	MM-RLHF: The Next Step Forward in Multimodal LLM Alignment	Yi-Fan Zhang et.al.	2502.10391	null
2025-02-14	Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction	WonJin Yoon et.al.	2502.10388	null
2025-02-14	Robustness tests for biomedical foundation models should tailor to specification	R. Patrick Xian et.al.	2502.10374	link
2025-02-14	AffinityFlow: Guided Flows for Antibody Affinity Maturation	Can Chen et.al.	2502.10365	null
2025-02-14	Enhancing Multilingual LLM Pretraining with Model-Based Data Selection	Bettina Messmer et.al.	2502.10361	null
2025-02-14	Dimension-free Score Matching and Time Bootstrapping for Diffusion Models	Syamantak Kumar et.al.	2502.10354	null
2025-02-14	Organize the Web: Constructing Domains Enhances Pre-Training Data Curation	Alexander Wettig et.al.	2502.10341	null
2025-02-14	Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering	Nick Ferguson et.al.	2502.10338	null
2025-02-14	Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions	Leo Zhang et.al.	2502.10328	null
2025-02-14	LLM-Powered Preference Elicitation in Combinatorial Assignment	Ermis Soumalias et.al.	2502.10308	null
2025-02-14	SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models	Aditya Mishra et.al.	2502.10307	null
2025-02-14	Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2	Saem Hasan et.al.	2502.10299	null
2025-02-14	Probabilistic Super-Resolution for High-Fidelity Physical System Simulations with Uncertainty Quantification	Pengyu Zhang et.al.	2502.10280	null
2025-02-14	Are Large Language Models the future crowd workers of Linguistics?	Iris Ferrazzo et.al.	2502.10266	null
2025-02-14	Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers	Aivin V. Solatorio et.al.	2502.10263	link
2025-02-14	VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models	Gokul Karthik Kumar et.al.	2502.10250	null
2025-02-14	Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model	Guoqing Ma et.al.	2502.10248	link
2025-02-14	Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices	Mohamed Aboelenien Ahmed et.al.	2502.10239	null
2025-02-14	Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control	Thomas Jiralerspong et.al.	2502.10236	null
2025-02-14	AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting	Abdelhakim Benechehab et.al.	2502.10235	link
2025-02-14	Do Large Language Models Reason Causally Like Us? Even Better?	Hanna M. Dettki et.al.	2502.10215	null
2025-02-14	Can Post-Training Quantization Benefit from an Additional QLoRA Integration?	Xiliang Zhu et.al.	2502.10202	null
2025-02-14	Prediction hubs are context-informed frequent tokens in LLMs	Beatrix M. G. Nielsen et.al.	2502.10201	null
2025-02-14	MathConstruct: Challenging LLM Reasoning with Constructive Proofs	Mislav Balunović et.al.	2502.10197	null
2025-02-14	Translating Common Security Assertions Across Processor Designs: A RISC-V Case Study	Sharjeel Imtiaz et.al.	2502.10194	null
2025-02-14	VideoDiff: Human-AI Video Co-Creation with Alternatives	Mina Huh et.al.	2502.10190	null
2025-02-14	Modeling biases in binary decision-making within the generalized nonlinear q-voter model	Maciej Doniec et.al.	2502.10172	link
2025-02-14	Video Soundtrack Generation by Aligning Emotions and Temporal Boundaries	Serkan Sulun et.al.	2502.10154	null
2025-02-14	Semantica: Decentralized Search using a LLM-Guided Semantic Tree Overlay	Petru Neague et.al.	2502.10151	link
2025-02-14	Cooperative Multi-Agent Planning with Adaptive Skill Synthesis	Zhiyuan Li et.al.	2502.10148	null
2025-02-14	Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages	Daniil Gurgurov et.al.	2502.10140	null
2025-02-14	Physics-Informed Generative Modeling of Wireless Channels	Benedikt Böck et.al.	2502.10137	null
2025-02-14	ScamFerret: Detecting Scam Websites Autonomously with Large Language Models	Hiroki Nakano et.al.	2502.10110	link
2025-02-14	NeuroXVocal: Detection and Explanation of Alzheimer’s Disease through Non-invasive Analysis of Picture-prompted Speech	Nikolaos Ntampakis et.al.	2502.10108	null
2025-02-14	A novel approach to data generation in generative model	JaeHong Kim et.al.	2502.10092	null
2025-02-14	Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations	Tianyu Song et.al.	2502.10088	link
2025-02-14	DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery	Utkarsh Mall et.al.	2502.10060	null
2025-02-14	A Generalized Modeling Approach to Liquid-driven Ballooning Membranes	Mirroyal Ismayilov et.al.	2502.10057	null
2025-02-14	ORI: O Routing Intelligence	Ahmad Shadid et.al.	2502.10051	null
2025-02-14	A Survey on LLM-powered Agents for Recommender Systems	Qiyao Peng et.al.	2502.10050	null
2025-02-14	ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in Arbitrary Virtual Environments	Juyeong Hwang et.al.	2502.10046	null
2025-02-14	POI-Enhancer: An LLM-based Semantic Enhancement Framework for POI Representation Learning	Jiawei Cheng et.al.	2502.10038	null
2025-02-14	Probabilistic Lexical Manifold Construction in Large Language Models via Hierarchical Vector Field Interpolation	Clive Pendleton et.al.	2502.10013	null
2025-02-14	ChatGPT and Deepseek: Can They Predict the Stock Market and Macroeconomy?	Jian Chen et.al.	2502.10008	null
2025-02-14	EmbBERT-Q: Breaking Memory Barriers in Embedded NLP	Riccardo Bravin et.al.	2502.10001	null
2025-02-14	Decision Information Meets Large Language Models: The Future of Explainable Operations Research	Yansen Zhang et.al.	2502.09994	link
2025-02-14	Large Language Diffusion Models	Shen Nie et.al.	2502.09992	null
2025-02-14	V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models	Hsu-kuang Chiu et.al.	2502.09980	null
2025-02-14	LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing	Kuan Li et.al.	2502.09977	null
2025-02-14	Has My System Prompt Been Used? Large Language Model Prompt Membership Inference	Roman Levin et.al.	2502.09974	null
2025-02-14	KGGen: Extracting Knowledge Graphs from Plain Text with Language Models	Belinda Mo et.al.	2502.09956	null
2025-02-14	A Preliminary Exploration with GPT-4o Voice Mode	Yu-Xiang Lin et.al.	2502.09940	null
2025-02-14	Precise Parameter Localization for Textual Generation in Diffusion Models	Łukasz Staniszewski et.al.	2502.09935	null
2025-02-14	MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning	Kai Yan et.al.	2502.09933	null
2025-02-14	Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence	Granite Vision Team et.al.	2502.09927	null
2025-02-14	λScale: Enabling Fast Scaling for Serverless Large Language Model Inference	Minchen Yu et.al.	2502.09922	null
2025-02-14	INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing	Hongsun Jang et.al.	2502.09921	null
2025-02-14	AutoS$^2$earch: Unlocking the Reasoning Potential of Large Models for Web-based Source Search	Zhengqiu Zhu et.al.	2502.09913	null
2025-02-14	Insect-Foundation: A Foundation Model and Large Multimodal Dataset for Vision-Language Insect Understanding	Thanh-Dat Truong et.al.	2502.09906	null
2025-02-14	The Ann Arbor Architecture for Agent-Oriented Programming	Wei Dong et.al.	2502.09903	link
2025-02-14	Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond	Kehan Guo et.al.	2502.09897	null
2025-02-14	ChatIoT: Large Language Model-based Security Assistant for Internet of Things with Retrieval-Augmented Generation	Ye Dong et.al.	2502.09896	null
2025-02-14	ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation	Shu Wang et.al.	2502.09891	null
2025-02-14	Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos	Weirui Ye et.al.	2502.09886	null
2025-02-14	Solvable Dynamics of Self-Supervised Word Embeddings and the Emergence of Analogical Reasoning	Dhruva Karkada et.al.	2502.09863	null
2025-02-14	Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge	Naoyuki Kamo et.al.	2502.09859	null
2025-02-14	Automated Hypothesis Validation with Agentic Sequential Falsifications	Kexin Huang et.al.	2502.09858	link
2025-02-14	Port-LLM: A Port Prediction Method for Fluid Antenna based on Large Language Models	Yali Zhang et.al.	2502.09857	null
2025-02-14	Efficient Multitask Learning in Small Language Models Through Upside-Down Reinforcement Learning	Yu-Chen Lin et.al.	2502.09854	null
2025-02-14	HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation	Tianwei Lin et.al.	2502.09838	link
2025-02-13	A Solver-Aided Hierarchical Language for LLM-Driven CAD Design	Benjamin T. Jones et.al.	2502.09819	null
2025-02-13	Statistical Coherence Alignment for Large Language Model Representation Learning Through Tensor Field Convergence	Jonathan Gale et.al.	2502.09815	null
2025-02-13	INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages	Hao Yu et.al.	2502.09814	null
2025-02-13	AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration	Jizhou Chen et.al.	2502.09809	null
2025-02-13	Unit Testing Past vs. Present: Examining LLMs’ Impact on Defect Detection and Efficiency	Rudolf Ramler et.al.	2502.09801	null
2025-02-13	Co-designing Large Language Model Tools for Project-Based Learning with K12 Educators	Prerna Ravi et.al.	2502.09799	null
2025-02-13	A Survey on LLM-based News Recommender Systems	Rongyao Wang et.al.	2502.09797	null
2025-02-13	TableTalk: Scaffolding Spreadsheet Development with a Language Agent	Jenny T. Liang et.al.	2502.09787	null
2025-02-13	Improving Acoustic Side-Channel Attacks on Keyboards Using Transformers and Large Language Models	Jin Hyun Park et.al.	2502.09782	null
2025-02-13	CellFlow: Simulating Cellular Morphology Changes via Flow Matching	Yuhui Zhang et.al.	2502.09775	null
2025-02-13	Non-Markovian Discrete Diffusion with Causal Language Models	Yangtian Zhang et.al.	2502.09767	null
2025-02-13	LLM-Generated Microservice Implementations from RESTful API Definitions	Saurabh Chauhan et.al.	2502.09766	link
2025-02-13	Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization	Amit Levi et.al.	2502.09755	null
2025-02-13	Vote-Tree-Planner: Optimizing Execution Order in LLM-based Task Planning Pipeline via Voting	Chaoyuan Zhang et.al.	2502.09749	null
2025-02-13	The Widespread Adoption of Large Language Model-Assisted Writing Across Society	Weixin Liang et.al.	2502.09747	null
2025-02-13	Fine-Tuning Foundation Models with Federated Learning for Privacy Preserving Medical Time Series Forecasting	Mahad Ali et.al.	2502.09744	null
2025-02-13	FoNE: Precise Single-Token Number Embeddings via Fourier Features	Tianyi Zhou et.al.	2502.09741	null
2025-02-13	Making Them a Malicious Database: Exploiting Query Code to Jailbreak Aligned Large Language Models	Qingsong Zou et.al.	2502.09723	link
2025-02-13	NestQuant: Nested Lattice Quantization for Matrix Products and LLMs	Semyon Savkin et.al.	2502.09720	null
2025-02-13	Genetic Data Governance in Crisis: Policy Recommendations for Safeguarding Privacy and Preventing Discrimination	Vivek Ramanan et.al.	2502.09716	null
2025-02-13	MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency	Dongzhi Jiang et.al.	2502.09621	null
2025-02-13	Exploring the Potential of Encoder-free Architectures in 3D LMMs	Yiwen Tang et.al.	2502.09620	link
2025-02-13	Designing a Conditional Prior Distribution for Flow-Based Generative Models	Noam Issachar et.al.	2502.09611	null
2025-02-14	Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions	Tejas Jayashankar et.al.	2502.09609	null
2025-02-13	Human-LLM Coevolution: Evidence from Academic Writing	Mingmeng Geng et.al.	2502.09606	null
2025-02-13	SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models	Yung-Sung Chuang et.al.	2502.09604	link
2025-02-13	Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs	Siyan Zhao et.al.	2502.09597	link
2025-02-13	KIMAs: A Configurable Knowledge Integrated Multi-Agent System	Zitao Li et.al.	2502.09596	null
2025-02-13	Logical forms complement probability in understanding language model (and human) performance	Yixuan Wang et.al.	2502.09589	null
2025-02-13	Rolling Ahead Diffusion for Traffic Scene Simulation	Yunpeng Liu et.al.	2502.09587	null
2025-02-13	Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks	Qian Wan et.al.	2502.09577	null
2025-02-13	Zero-shot generation of synthetic neurosurgical data with large language models	Austin A. Barr et.al.	2502.09566	link
2025-02-13	MDCrow: Automating Molecular Dynamics Workflows with Large Language Models	Quintina Campbell et.al.	2502.09565	link
2025-02-13	EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents	Rui Yang et.al.	2502.09560	null
2025-02-13	Explainable AI-assisted Optimization for Feynman Integral Reduction	Zhuo-Yang Song et.al.	2502.09544	null
2025-02-13	Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages	Shreyan Biswas et.al.	2502.09532	null
2025-02-13	SQ-GAN: Semantic Image Communications Using Masked Vector Quantization	Francesco Pezone et.al.	2502.09520	link
2025-02-13	Diffusion Models for Molecules: A Survey of Methods and Tasks	Liang Wang et.al.	2502.09511	link
2025-02-13	EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling	Theodoros Kouzelis et.al.	2502.09509	null
2025-02-13	Improve LLM-based Automatic Essay Scoring with Linguistic Features	Zhaoyi Joey Hou et.al.	2502.09497	null
2025-02-13	Foundation Neural-Network Quantum States	Riccardo Rende et.al.	2502.09488	null
2025-02-13	Objective quantification of mood states using large language models	Jakub Onysk et.al.	2502.09487	null
2025-02-13	DiffRenderGAN: Addressing Training Data Scarcity in Deep Segmentation Networks for Quantitative Nanomaterial Analysis through Differentiable Rendering and Generative Modelling	Dennis Possart et.al.	2502.09477	null
2025-02-13	Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction	Ziyi Chen et.al.	2502.09423	null
2025-02-13	ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation	Rotem Shalev-Arkushin et.al.	2502.09411	null
2025-02-13	SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models	Daniel Fleischer et.al.	2502.09390	link
2025-02-13	Truth Knows No Language: Evaluating Truthfulness Beyond English	Blanca Calvo Figueras et.al.	2502.09387	link
2025-02-13	APT-LLM: Embedding-Based Anomaly Detection of Cyber Advanced Persistent Threats Using Large Language Models	Sidahmed Benabderrahmane et.al.	2502.09385	null
2025-02-13	LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won’t Fail)	Junsu Kim et.al.	2502.09376	null
2025-02-13	Inverse problems with experiment-guided AlphaFold	Advaith Maddipatla et.al.	2502.09372	null
2025-02-13	Language Agents as Digital Representatives in Collective Decision-Making	Daniel Jarrett et.al.	2502.09369	null
2025-02-13	Machine learning for modelling unstructured grid data in computational physics: a review	Sibo Cheng et.al.	2502.09346	null
2025-02-13	ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments	Youhe Jiang et.al.	2502.09334	null
2025-02-13	Beyond English: The Impact of Prompt Translation Strategies across Languages and Tasks in Multilingual LLMs	Itai Mondshine et.al.	2502.09331	null
2025-02-13	Copilot Arena: A Platform for Code LLM Evaluation in the Wild	Wayne Chi et.al.	2502.09328	null
2025-02-13	A Benchmark for Crime Surveillance Video Analysis with Large Models	Haoran Chen et.al.	2502.09325	null
2025-02-13	A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis	Kentaro Imajo et.al.	2502.09316	link
2025-02-13	When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models	Samuel Joseph Amouyal et.al.	2502.09307	null
2025-02-13	Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling	Paula Cordero-Encinar et.al.	2502.09306	null
2025-02-13	KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG	Yiqian Huang et.al.	2502.09304	link
2025-02-13	When do neural networks learn world models?	Tianren Zhang et.al.	2502.09297	null
2025-02-13	SparQLe: Speech Queries to Text Translation Through LLMs	Amirbek Djanibekov et.al.	2502.09284	link
2025-02-13	GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation	Hongyin Zhang et.al.	2502.09268	null
2025-02-13	AnomalyGFM: Graph Foundation Model for Zero/Few-shot Anomaly Detection	Hezhe Qiao et.al.	2502.09254	link
2025-02-13	From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine	Lukas Buess et.al.	2502.09242	null
2025-02-13	OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart Logistics	Junhui Wang et.al.	2502.09238	null
2025-02-13	Reliable Conversational Agents under ASP Control that Understand Natural Language	Yankai Zeng et.al.	2502.09237	null
2025-02-13	Data2Concept2Text: An Explainable Multilingual Framework for Data Analysis Narration	Flavio Bertini et.al.	2502.09218	null
2025-02-13	LP-LM: No Hallucinations in Question Answering with Logic Programming	Katherine Wu et.al.	2502.09212	link
2025-02-13	Visual Graph Question Answering with ASP and LLMs for Language Parsing	Jakob Johannes Bauer et.al.	2502.09211	null
2025-02-13	On LLM-generated Logic Programs and their Inference Execution Methods	Paul Tarau et.al.	2502.09209	null
2025-02-13	Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York	Sanskar Sehgal et.al.	2502.09204	null
2025-02-13	XAInomaly: Explainable and Interpretable Deep Contractive Autoencoder for O-RAN Traffic Anomaly Detection	Osman Tugay Basaran et.al.	2502.09194	null
2025-02-13	Thinking beyond the anthropomorphic paradigm benefits LLM research	Lujain Ibrahim et.al.	2502.09192	null
2025-02-13	Matina: A Large-Scale 73B Token Persian Text Corpus	Sara Bourbour Hosseinbeigi et.al.	2502.09188	null
2025-02-13	RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation	Changzhi Zhou et.al.	2502.09183	null
2025-02-13	FLAME: Flexible LLM-Assisted Moderation Engine	Ivan Bakulin et.al.	2502.09175	null
2025-02-13	Two-Stage Representation Learning for Analyzing Movement Behavior Dynamics in People Living with Dementia	Jin Cui et.al.	2502.09173	null
2025-02-13	Improving TCM Question Answering through Tree-Organized Self-Reflective Retrieval with LLMs	Chang Liu et.al.	2502.09156	null
2025-02-13	Finite-Time Analysis of Discrete-Time Stochastic Interpolants	Yuhao Liu et.al.	2502.09130	null
2025-02-13	One-shot Federated Learning Methods: A Practical Guide	Xiang Liu et.al.	2502.09104	null
2025-02-13	Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation	Zongyu Chang et.al.	2502.09101	null
2025-02-13	Logical Reasoning in Large Language Models: A Survey	Hanmeng Liu et.al.	2502.09100	null
2025-02-13	Show Me the Work: Fact-Checkers’ Requirements for Explainable Automated Fact-Checking	Greta Warren et.al.	2502.09083	null
2025-02-13	CoSER: Coordinating LLM-Based Persona Simulation of Established Roles	Xintao Wang et.al.	2502.09082	link
2025-02-13	Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables	Xuzhao Geng et.al.	2502.09073	null
2025-02-13	Unleashing the Power of Large Language Model for Denoising Recommendation	Shuyao Wang et.al.	2502.09058	null
2025-02-13	An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging	Kunat Pipatanakul et.al.	2502.09056	null
2025-02-13	Game Theory Meets Large Language Models: A Systematic Survey	Haoran Sun et.al.	2502.09053	null
2025-02-13	Typhoon T1: An Open Thai Reasoning Model	Pittawat Taveekitworachai et.al.	2502.09042	null
2025-02-13	Implementation of a Fuzzy Relational Database. Case Study: Chilean Cardboard Industry in the Maule Region	Leoncio Jimenez et.al.	2502.09035	null
2025-02-13	MTDP: Modulated Transformer Diffusion Policy Model	Qianhao Wang et.al.	2502.09029	null
2025-02-13	EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition	Xiao Wang et.al.	2502.09020	link
2025-02-13	Diversity Enhances an LLM’s Performance in RAG and Long-context Task	Zhchao Wang et.al.	2502.09017	null
2025-02-13	Hope vs. Hate: Understanding User Interactions with LGBTQ+ News Content in Mainstream US News Media through the Lens of Hope Speech	Jonathan Pofcher et.al.	2502.09004	null
2025-02-13	RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models	Quan Wei et.al.	2502.09003	null
2025-02-13	End-to-End triplet loss based fine-tuning for network embedding in effective PII detection	Rishika Kohli et.al.	2502.09002	null
2025-02-13	Task Generalization With AutoRegressive Compositional Structure: Can Learning From $D$ Tasks Generalize to $D^{T}$ Tasks?	Amirhesam Abedsoltan et.al.	2502.08991	null
2025-02-13	Prophet Inequalities for Bandits, Cabinets, and DAGs	Robin Bowers et.al.	2502.08976	null
2025-02-13	Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning	Leon Nissen et.al.	2502.08954	link
2025-02-13	Structured Convergence in Large Language Model Representations via Hierarchical Latent Space Folding	Fenella Harcourt et.al.	2502.08947	null
2025-02-13	Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis	Wenbo Zhang et.al.	2502.08943	null
2025-02-13	Escaping Collapse: The Strength of Weak Data for Large Language Model Training	Kareem Amin et.al.	2502.08924	null
2025-02-13	Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models	Xin Zhou et.al.	2502.08922	null
2025-02-13	Detecting Malicious Concepts Without Image Generation in AIGC	Kun Xu et.al.	2502.08921	null
2025-02-13	InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU	Heejun Lee et.al.	2502.08910	null
2025-02-13	Towards Automated Fact-Checking of Real-World Claims: Exploring Task Formulation and Assessment with LLMs	Premtim Sahitaj et.al.	2502.08909	null
2025-02-13	Reinforced Large Language Model is a formal theorem prover	Zhiling Luo et.al.	2502.08908	link
2025-02-13	DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation	Tangyu Jiang et.al.	2502.08905	null
2025-02-13	MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training	Xinxin You et.al.	2502.08904	null
2025-02-13	3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning	Guoqin Tang et.al.	2502.08903	null
2025-02-13	Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication	Weicheng Ma et.al.	2502.08896	null
2025-02-13	ShapeLib: designing a library of procedural 3D shape abstractions with Large Language Models	R. Kenny Jones et.al.	2502.08884	null
2025-02-13	Utilizing Pre-trained and Large Language Models for 10-K Items Segmentation	Hsin-Min Lu et.al.	2502.08875	null
2025-02-13	Harnessing Vision Models for Time Series Analysis: A Survey	Jingchao Ni et.al.	2502.08869	link
2025-02-13	A Systematic Evaluation of Generative Models on Tabular Transportation Data	Chengen Wang et.al.	2502.08856	link
2025-02-12	Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation	Mohammad Mahdi Abootorabi et.al.	2502.08826	link
2025-02-12	DejAIvu: Identifying and Explaining AI Art on the Web in Real-Time with Saliency Maps	Jocelyn Dzuong et.al.	2502.08821	link
2025-02-12	Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model	Emre Can Acikgoz et.al.	2502.08820	null
2025-02-12	Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation	Koinis Vassilis et.al.	2502.08818	null
2025-02-12	Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples	Andrianos Michail et.al.	2502.08638	null
2025-02-12	Ensemble based approach to quantifying uncertainty of LLM based classifications	Srijith Rajamohan et.al.	2502.08631	null
2025-02-12	Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model	Saurabh Kataria et.al.	2502.08612	null
2025-02-12	Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors	Vishwanath Pratap Singh et.al.	2502.08587	null
2025-02-12	Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks	Ang Li et.al.	2502.08586	null
2025-02-12	Statistically validated projection of bipartite signed networks	Anna Gallo et.al.	2502.08567	null
2025-02-12	QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval	Wonduk Seo et.al.	2502.08557	null
2025-02-12	Human-Centric Foundation Models: Perception, Generation and Agentic Modeling	Shixiang Tang et.al.	2502.08556	link
2025-02-12	Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies	Sunnie S. Y. Kim et.al.	2502.08554	null
2025-02-12	LLMs can implicitly learn from mistakes in-context	Lisa Alazraki et.al.	2502.08550	null
2025-02-12	LLM Pretraining with Continuous Concepts	Jihoon Tack et.al.	2502.08524	null
2025-02-12	FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices	Dezhong Yao et.al.	2502.08518	link
2025-02-12	The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data	Evgenii Evstafev et.al.	2502.08515	null
2025-02-12	Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation	Mahnaz Koupaee et.al.	2502.08514	link
2025-02-12	Measuring Diversity in Synthetic Datasets	Yuchang Zhu et.al.	2502.08512	link
2025-02-12	Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction	Wei Li et.al.	2502.08507	link
2025-02-12	Salamandra Technical Report	Aitor Gonzalez-Agirre et.al.	2502.08489	link
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-12	Computed fingertip touch for the instrumental control of musical sound with an excursion on the computed retinal afterimage	Staas de Jong et.al.	2502.08471	null
2025-02-12	mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data	Haonan Chen et.al.	2502.08468	link
2025-02-12	From Haystack to Needle: Label Space Reduction for Zero-shot Classification	Nathan Vandemoortele et.al.	2502.08436	null
2025-02-12	IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance	Paul Röttger et.al.	2502.08395	null
2025-02-12	ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification	Jiangbo Shi et.al.	2502.08391	link
2025-02-12	Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding	Konstantin Berestizshevsky et.al.	2502.08363	link
2025-02-12	Systematic Knowledge Injection into Large Language Models via Diverse Augmentation for Domain-Specific RAG	Kushagra Bhushan et.al.	2502.08356	link
2025-02-12	Trustworthy GNNs with LLMs: A Systematic Review and Taxonomy	Ruizhan Xue et.al.	2502.08353	null
2025-02-12	Graph Foundation Models for Recommendation: A Comprehensive Survey	Bin Wu et.al.	2502.08346	null
2025-02-12	Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact	Mohsin Bilal et.al.	2502.08333	null
2025-02-12	Modification and Generated-Text Detection: Achieving Dual Detection Capabilities for the Outputs of LLM by Watermark	Yuhang Cai et.al.	2502.08332	null
2025-02-12	Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning	Barnaby Schmitt et.al.	2502.08323	null
2025-02-12	MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection	Lubna Al-Henaki et.al.	2502.08319	null
2025-02-12	Word Synchronization Challenge: A Benchmark for Word Association Responses for LLMs	Tanguy Cazalets et.al.	2502.08312	null
2025-02-12	Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model	Bencheng Yan et.al.	2502.08309	null
2025-02-12	HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting	Shibo Feng et.al.	2502.08302	link
2025-02-12	Compromising Honesty and Harmlessness in Language Models via Deception Attacks	Laurène Vaugrante et.al.	2502.08301	null
2025-02-12	Improving Existing Optimization Algorithms with LLMs	Camilo Chacón Sartori et.al.	2502.08298	null
2025-02-12	Redefining Simplicity: Benchmarking Large Language Models from Lexical to Document Simplification	Jipeng Qiang et.al.	2502.08281	null
2025-02-12	MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation	Min Hou et.al.	2502.08271	null
2025-02-12	Exploring the Potential of Large Language Models to Simulate Personality	Maria Molchanova et.al.	2502.08265	link
2025-02-12	GenIAS: Generator for Instantiating Anomalies in time Series	Zahra Zamanzadeh Darban et.al.	2502.08262	null
2025-02-12	FixDrive: Automatically Repairing Autonomous Vehicle Driving Behaviour for $0.08 per Violation	Yang Sun et.al.	2502.08260	link
2025-02-12	Learning Human Skill Generators at Key-Step Levels	Yilu Wu et.al.	2502.08234	null
2025-02-12	Flow-of-Action: SOP Enhanced LLM-Based Multi-Agent System for Root Cause Analysis	Changhua Pei et.al.	2502.08224	null
2025-02-12	Memory Offloading for Large Language Model Inference with Latency SLO Guarantees	Chenxiang Ma et.al.	2502.08182	null
2025-02-12	Enhancing LLM Character-Level Manipulation via Divide and Conquer	Zhen Xiong et.al.	2502.08180	null
2025-02-12	ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation	Ruobing Yao et.al.	2502.08178	null
2025-02-12	SycEval: Evaluating LLM Sycophancy	Aaron Fanous et.al.	2502.08177	null
2025-02-12	Intention is All You Need: Refining Your Code from Your Intention	Qi Guo et.al.	2502.08172	null
2025-02-12	Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling	Yang Cao et.al.	2502.08150	null
2025-02-12	ACCESS: A Benchmark for Abstract Causal Event Discovery and Reasoning	Vy Vo et.al.	2502.08148	null
2025-02-12	Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers	Siddharth Singh et.al.	2502.08145	null
2025-02-12	Bridging the Safety Gap: A Guardrail Pipeline for Trustworthy LLM Inferences	Shanshan Han et.al.	2502.08142	null
2025-02-12	LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits	Zikai Zhou et.al.	2502.08141	null
2025-02-12	Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models	Sonam Gupta et.al.	2502.08130	null
2025-02-12	Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance	Lingfei Qian et.al.	2502.08127	link
2025-02-12	HuDEx: Integrating Hallucination Detection and Explainability for Enhancing the Reliability of LLM responses	Sujeong Lee et.al.	2502.08109	null
2025-02-12	Large language models perpetuate bias in palliative care: development and analysis of the Palliative Care Adversarial Dataset (PCAD)	Naomi Akhras et.al.	2502.08073	null
2025-02-12	On Mechanistic Circuits for Extractive Question-Answering	Samyadeep Basu et.al.	2502.08059	null
2025-02-12	Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs	Mohsinul Kabir et.al.	2502.08045	null
2025-02-12	Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery	Fan Jiang et.al.	2502.08037	null
2025-02-12	Stochastic Kinetics of Transcription: Analysis and Computation	Yuntao Lu et.al.	2502.08028	null
2025-02-12	Contextual Subspace Manifold Projection for Structural Refinement of Large Language Model Representations	Alistair Wren et.al.	2502.08026	null
2025-02-11	Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding	Ziyao Wang et.al.	2502.08020	null
2025-02-11	The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models	Artem Kirsanov et.al.	2502.08009	null
2025-02-11	An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models	Kasra Ahmadi et.al.	2502.08008	link
2025-02-11	Towards Training One-Step Diffusion Models Without Distillation	Mingtian Zhang et.al.	2502.08005	null
2025-02-11	Universal Adversarial Attack on Aligned Multimodal LLMs	Temurbek Rahmatullaev et.al.	2502.07987	null
2025-02-11	Deep Semantic Graph Learning via LLM based Node Enhancement	Chuanqi Shi et.al.	2502.07982	null
2025-02-11	CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs	Lejla Skelic et.al.	2502.07980	null
2025-02-11	From Hazard Identification to Controller Design: Proactive and LLM-Supported Safety Engineering for ML-Powered Systems	Yining Hong et.al.	2502.07974	null
2025-02-11	Caught in the Web of Words: Do LLMs Fall for Spin in Medical Literature?	Hye Sun Yun et.al.	2502.07963	link
2025-02-11	Accelerating Scientific Research Through a Multi-LLM Framework	Joaquin Ramirez-Medina et.al.	2502.07960	null
2025-02-11	Bridging HCI and AI Research for the Evaluation of Conversational SE Assistants	Jonan Richards et.al.	2502.07956	null
2025-02-11	Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs	Ruichen Zhang et.al.	2502.07942	null
2025-02-11	Discrete Markov Probabilistic Models	Le-Tuyet-Nhi Pham et.al.	2502.07939	null
2025-02-11	Distributed Approach to Haskell Based Applications Refactoring with LLMs Based Multi-Agent Systems	Shahbaz Siddeeq et.al.	2502.07928	null
2025-02-11	Sign Operator for Coping with Heavy-Tailed Noise: High Probability Convergence Bounds with Extensions to Distributed Optimization and Comparison Oracle	Nikita Kornilov et.al.	2502.07923	null
2025-02-11	Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning	Rujing Yao et.al.	2502.07912	link
2025-02-11	DeepSeek on a Trip: Inducing Targeted Visual Hallucinations via Representation Vulnerabilities	Chashi Mahiul Islam et.al.	2502.07905	null
2025-02-11	Intelligent Legal Assistant: An Interactive Clarification System for Legal Question Answering	Rujing Yao et.al.	2502.07904	null
2025-02-11	HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment	Youhe Jiang et.al.	2502.07903	null
2025-02-11	TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation	Alex Jinpeng Wang et.al.	2502.07870	link
2025-02-11	TransMLA: Multi-head Latent Attention Is All You Need	Fanxu Meng et.al.	2502.07864	link
2025-02-11	BalanceKV: KV Cache Compression through Discrepancy Theory	Insu Han et.al.	2502.07861	null
2025-02-11	Pippo: High-Resolution Multi-View Humans from a Single Image	Yash Kant et.al.	2502.07785	null
2025-02-11	DarwinLM: Evolutionary Structured Pruning of Large Language Models	Shengkun Tang et.al.	2502.07780	link
2025-02-11	Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection	Anirudh Sundara Rajan et.al.	2502.07778	null
2025-02-11	Auditing Prompt Caching in Language Model APIs	Chenchen Gu et.al.	2502.07776	link
2025-02-11	Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming	Azizjon Kobilov et.al.	2502.07772	null
2025-02-11	Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers	Italo Santos et.al.	2502.07763	null
2025-02-11	Scalable Fingerprinting of Large Language Models	Anshul Nasery et.al.	2502.07760	null
2025-02-11	Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension	Wenbo Gong et.al.	2502.07752	null
2025-02-11	WHODUNIT: Evaluation benchmark for culprit detection in mystery stories	Kshitij Gupta et.al.	2502.07747	link
2025-02-11	The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing	Dirk Bergemann et.al.	2502.07736	null
2025-02-11	Revisiting Non-Acyclic GFlowNets in Discrete Environments	Nikita Morozov et.al.	2502.07735	link
2025-02-11	Economics of Sourcing Human Data	Sebastin Santy et.al.	2502.07732	null
2025-02-11	Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK	Marcos Cramer et.al.	2502.07728	null
2025-02-11	Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning	Aya Kayal et.al.	2502.07715	null
2025-02-11	Magic 1-For-1: Generating One Minute Video Clips within One Minute	Hongwei Yi et.al.	2502.07701	link
2025-02-11	A Framework for LLM-powered Design Assistants	Swaroop Panda et.al.	2502.07698	null
2025-02-11	Large Language Models as Proxies for Theories of Human Linguistic Cognition	Imry Ziv et.al.	2502.07687	null
2025-02-11	Steering Protein Family Design through Profile Bayesian Flow	Jingjing Gong et.al.	2502.07671	null
2025-02-11	Guiding Time-Varying Generative Models with Natural Gradients on Exponential Family Manifold	Song Liu et.al.	2502.07650	null
2025-02-11	SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models	Shihao Xia et.al.	2502.07644	null
2025-02-11	FoQA: A Faroese Question-Answering Dataset	Annika Simonsen et.al.	2502.07642	null
2025-02-11	Distributional Instrumental Variable Method	Anastasiia Holovchak et.al.	2502.07641	link
2025-02-11	Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving	Yong Lin et.al.	2502.07640	link
2025-02-11	Consistency Training with Physical Constraints	Che-Chia Chang et.al.	2502.07636	null
2025-02-11	Exploring Mobile Touch Interaction with Large Language Models	Tim Zindulka et.al.	2502.07629	null
2025-02-11	Tractable Transformers for Flexible Conditional Generation	Anji Liu et.al.	2502.07616	null
2025-02-11	Beyond Prompting: Time2Lang – Bridging Time-Series Foundation Models and Large Language Models for Health Sensing	Arvind Pillai et.al.	2502.07608	link
2025-02-11	Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models	Jiacong Xu et.al.	2502.07601	null
2025-02-11	Towards spatial computing: recent advances in multimodal natural interaction for XR headsets	Zhimin Wang et.al.	2502.07598	null
2025-02-11	SEMU: Singular Value Decomposition for Efficient Machine Unlearning	Marcin Sendera et.al.	2502.07587	null
2025-02-11	Generative Modeling with Bayesian Sample Inference	Marten Lienen et.al.	2502.07580	link
2025-02-11	PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference	Yufeng Gu et.al.	2502.07578	link
2025-02-11	Automated Capability Discovery via Model Self-Exploration	Cong Lu et.al.	2502.07577	link
2025-02-11	JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation	Shenyi Zhang et.al.	2502.07557	link
2025-02-11	O1 Embedder: Let Retrievers Think Before Action	Ruin Yan et.al.	2502.07555	null
2025-02-11	Grammar Control in Dialogue Response Generation for Language Learning Chatbots	Dominik Glandorf et.al.	2502.07544	link
2025-02-11	NatureLM: Deciphering the Language of Nature for Scientific Discovery	Yingce Xia et.al.	2502.07527	null
2025-02-11	The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation	Raman Dutt et.al.	2502.07516	link
2025-02-11	Enhance-A-Video: Better Generated Video for Free	Yang Luo et.al.	2502.07508	link
2025-02-11	Towards THz-based Obstacle Sensing: A Generative Radio Environment Awareness Framework	Tianyu Hu et.al.	2502.07504	null
2025-02-11	Unified Graph Networks (UGN): A Deep Neural Framework for Solving Graph Problems	Rudrajit Dawn et.al.	2502.07500	null
2025-02-11	LLM-Sketch: Enhancing Network Sketches with LLM	Yuanpeng Li et.al.	2502.07495	link
2025-02-11	Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More	Xialie Zhuang et.al.	2502.07490	link
2025-02-11	Improving Adaptive Moment Optimization via Preconditioner Diagonalization	Son Nguyen et.al.	2502.07488	null
2025-02-11	ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model	Xiaochen Liu et.al.	2502.07474	null
2025-02-11	JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata	Abhinaba Roy et.al.	2502.07461	link
2025-02-11	Logarithmic Regret for Online KL-Regularized Reinforcement Learning	Heyang Zhao et.al.	2502.07460	null
2025-02-11	PerCul: A Story-Driven Cultural Evaluation of LLMs in Persian	Erfan Moosavi Monazzah et.al.	2502.07459	null
2025-02-11	RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation	Viacheslav Vasilev et.al.	2502.07455	link
2025-02-11	Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon	Nurit Cohen-Inger et.al.	2502.07445	link
2025-02-11	Towards a Foundation Model for Physics-Informed Neural Networks: Multi-PDE Learning with Active Sampling	Keon Vin Park et.al.	2502.07425	null
2025-02-11	RomanLens: Latent Romanization and its role in Multilinguality in LLMs	Alan Saji et.al.	2502.07424	null
2025-02-11	Entity Linking using LLMs for Automated Product Carbon Footprint Estimation	Steffen Castle et.al.	2502.07418	null
2025-02-11	EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering	Sheng Zhou et.al.	2502.07411	link
2025-02-11	MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification	Anh-Tien Nguyen et.al.	2502.07409	link
2025-02-11	On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o	Rundong Liu et.al.	2502.07399	link
2025-02-11	FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents	Mostapha Benhenda et.al.	2502.07393	link
2025-02-11	LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!	Dacheng Li et.al.	2502.07374	link
2025-02-11	EvoFlow: Evolving Diverse Agentic Workflows On The Fly	Guibin Zhang et.al.	2502.07373	null
2025-02-11	LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation	Zican Dong et.al.	2502.07365	null
2025-02-11	Bridging the Evaluation Gap: Leveraging Large Language Models for Topic Model Evaluation	Zhiyin Tan et.al.	2502.07352	link
2025-02-11	KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems	Jusheng Zhang et.al.	2502.07350	null
2025-02-11	BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models	Xu Huang et.al.	2502.07346	link
2025-02-11	Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering	Shuzheng Si et.al.	2502.07340	link
2025-02-11	Music for All: Exploring Multicultural Representations in Music Generation Models (Camera Ready)	Atharva Mehta et.al.	2502.07328	link
2025-02-11	Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos	Haowen Gao et.al.	2502.07327	null
2025-02-11	MEMIT-Merge: Addressing MEMIT’s Key-Value Conflicts in Same-Subject Batch Editing for LLMs	Zilu Dong et.al.	2502.07322	null
2025-02-11	CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction	Junlong Li et.al.	2502.07316	link
2025-02-11	Prompt-Based Document Modifications In Ranking Competitions	Niv Bardas et.al.	2502.07315	null
2025-02-11	CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry	Xiaopeng Ye et.al.	2502.07307	link
2025-02-11	TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation	Navid Rajabi et.al.	2502.07306	null
2025-02-11	Flow Matching for Collaborative Filtering	Chengkai Liu et.al.	2502.07303	link
2025-02-11	Generation of Drug-Induced Cardiac Reactions towards Virtual Clinical Trials	Qian Shao et.al.	2502.07297	null
2025-02-11	Small Language Model Makes an Effective Long Text Extractor	Yelin Chen et.al.	2502.07286	link
2025-02-11	Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization	Aditya Vora et.al.	2502.07278	null
2025-02-11	Cost-Efficient Continual Learning with Sufficient Exemplar Memory	Dongkyu Cho et.al.	2502.07274	null
2025-02-11	GENERator: A Long-Context Generative Genomic Foundation Model	Wei Wu et.al.	2502.07272	link
2025-02-11	When More is Less: Understanding Chain-of-Thought Length in LLMs	Yuyang Wu et.al.	2502.07266	null
2025-02-11	DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization	Xuefeng Liu et.al.	2502.07237	null
2025-02-11	A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models	Yiming Chen et.al.	2502.07222	null
2025-02-11	MLLM4PUE: Toward Universal Embeddings in Computational Pathology through Multimodal LLMs	Qifeng Zhou et.al.	2502.07221	null
2025-02-11	LUNAR: LLM Unlearning via Neural Activation Redirection	William F. Shen et.al.	2502.07218	null
2025-02-11	Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion	Xingpei Ma et.al.	2502.07203	null
2025-02-11	Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits	Long-Fei Li et.al.	2502.07193	link
2025-02-11	Bag of Tricks for Inference-time Computation of LLM Reasoning	Fan Liu et.al.	2502.07191	link
2025-02-11	A Large-Scale Benchmark for Vietnamese Sentence Paraphrases	Sang Quang Nguyen et.al.	2502.07188	link
2025-02-11	Refine Knowledge of Large Language Models via Adaptive Contrastive Learning	Yinghui Li et.al.	2502.07184	null
2025-02-11	Does Training on Synthetic Data Make Models Less Robust?	Lingze Zhang et.al.	2502.07164	null
2025-02-11	Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning	Feng Chen et.al.	2502.07154	link
2025-02-11	Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning	Jiayuan Zhu et.al.	2502.07143	null
2025-02-11	Language-TPP: Integrating Temporal Point Processes with Language Models for Event Analysis	Quyu Kong et.al.	2502.07139	null
2025-02-10	Cardiverse: Harnessing LLMs for Novel Card Game Prototyping	Danrui Li et.al.	2502.07128	null
2025-02-10	Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation	Denis Bakushev et.al.	2502.07124	null
2025-02-10	Online Scheduling for LLM Inference with KV Cache Constraints	Patrick Jaillet et.al.	2502.07115	null
2025-02-10	Generative Distribution Prediction: A Unified Approach to Multimodal Learning	Xinyu Tian et.al.	2502.07090	null
2025-02-10	Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring	Alex Heyman et.al.	2502.07087	link
2025-02-10	MPFBench: A Large Scale Dataset for SciML of Multi-Phase-Flows: Droplet and Bubble Dynamics	Mehdi Shadkhah et.al.	2502.07080	null
2025-02-10	Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models	Lujain Ibrahim et.al.	2502.07077	null
2025-02-10	IRepair: An Intent-Aware Approach to Repair Data-Driven Errors in Large Language Models	Sayem Mohammad Imtiaz et.al.	2502.07072	null
2025-02-10	Specializing Large Language Models to Simulate Survey Response Distributions for Global Populations	Yong Cao et.al.	2502.07068	link
2025-02-10	Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT	Dongyang Liu et.al.	2502.06782	null
2025-02-10	Enhancing Performance of Explainable AI Models with Constrained Concept Refinement	Geyu Liang et.al.	2502.06775	null
2025-02-10	Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions	Jaeyeon Kim et.al.	2502.06768	null
2025-02-10	Rationalization Models for Text-to-SQL	Gaetano Rossiello et.al.	2502.06759	null
2025-02-10	Accelerating Data Processing and Benchmarking of AI Models for Pathology	Andrew Zhang et.al.	2502.06750	link
2025-02-10	Gradient Multi-Normalization for Stateless and Scalable LLM Training	Meyer Scetbon et.al.	2502.06742	null
2025-02-10	VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data	Thomas Zeng et.al.	2502.06737	null
2025-02-10	Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists	Bojia Zi et.al.	2502.06734	null
2025-02-10	Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining	Daouda Sow et.al.	2502.06733	null
2025-02-10	Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling	Runze Liu et.al.	2502.06703	link
2025-02-10	No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers	Jiajun He et.al.	2502.06685	null
2025-02-10	EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks	Michael Arbel et.al.	2502.06684	null
2025-02-10	Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations	Rui Chen et.al.	2502.06669	null
2025-02-10	Automatic Evaluation of Healthcare LLMs Beyond Question-Answering	Anna Arias-Duart et.al.	2502.06666	null
2025-02-10	Evaluation of Deep Audio Representations for Hearables	Fabian Gröger et.al.	2502.06664	null
2025-02-10	EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models	Xingrun Xing et.al.	2502.06663	null
2025-02-10	Unbiased Evaluation of Large Language Models from a Causal Perspective	Meilin Chen et.al.	2502.06655	null
2025-02-10	In-Context Learning (and Unlearning) of Length Biases	Stephanie Schoch et.al.	2502.06653	null
2025-02-10	Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A	Anna Leschanowsky et.al.	2502.06652	null
2025-02-10	Automatic Annotation Augmentation Boosts Translation between Molecules and Natural Language	Zhiqiang Zhong et.al.	2502.06634	null
2025-02-10	Combining Large Language Models with Static Analyzers for Code Review Generation	Imen Jaoua et.al.	2502.06633	link
2025-02-10	Multi-Scale Feature Fusion with Image-Driven Spatial Integration for Left Atrium Segmentation from Cardiac MRI Images	Bipasha Kundu et.al.	2502.06615	null
2025-02-10	A Large-scale AI-generated Image Inpainting Benchmark	Paschalis Giakoumoglou et.al.	2502.06593	null
2025-02-10	Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training	Yuchen Zhuang et.al.	2502.06589	null
2025-02-10	A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems	Linxiao Gong et.al.	2502.06581	null
2025-02-10	LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM	Zhi Zhou et.al.	2502.06572	link
2025-02-10	Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation	Chengwen Qi et.al.	2502.06563	link
2025-02-10	Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data?	Marika Swanberg et.al.	2502.06555	null
2025-02-10	Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments	Marc Felix Brinner et.al.	2502.06551	null
2025-02-10	Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning	Jean Vassoyan et.al.	2502.06533	null
2025-02-10	Properties of Wasserstein Gradient Flows for the Sliced-Wasserstein Distance	Christophe Vauthier et.al.	2502.06525	null
2025-02-10	GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing	Jinhao Duan et.al.	2502.06494	null
2025-02-10	Recent Advances in Discrete Speech Tokens: A Review	Yiwei Guo et.al.	2502.06490	null
2025-02-10	Adaptive Prompting: Ad-hoc Prompt Composition for Social Bias Detection	Maximilian Spliethöver et.al.	2502.06487	null
2025-02-10	WyckoffDiff - A Generative Diffusion Model for Crystal Symmetry	Filip Ekström Kelvinius et.al.	2502.06485	link
2025-02-10	UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths	Weijia Mao et.al.	2502.06474	null
2025-02-10	KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment	Yuxing Lu et.al.	2502.06472	link
2025-02-10	A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks	Hieu Minh “Jord” Nguyen et.al.	2502.06470	null
2025-02-10	MATH-Perturb: Benchmarking LLMs’ Math Reasoning Abilities against Hard Perturbations	Kaixuan Huang et.al.	2502.06453	null
2025-02-10	FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model	Anna Tegon et.al.	2502.06438	null
2025-02-10	Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising	Huaqiu Li et.al.	2502.06432	link
2025-02-10	CoS: Chain-of-Shot Prompting for Long Video Understanding	Jian Hu et.al.	2502.06428	null
2025-02-10	Generating Privacy-Preserving Personalized Advice with Zero-Knowledge Proofs and LLMs	Hiroki Watanabe et.al.	2502.06425	null
2025-02-10	Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models	Tianshuo Xu et.al.	2502.06419	null
2025-02-10	Systematic Outliers in Large Language Models	Yongqi An et.al.	2502.06415	link
2025-02-10	AppVLM: A Lightweight Vision Language Model for Online App Control	Georgios Papoudakis et.al.	2502.06395	null
2025-02-10	How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators	Shang Liu et.al.	2502.06387	null
2025-02-10	Simulation as Reality? The Effectiveness of LLM-Generated Data in Open-ended Question Assessment	Long Zhang et.al.	2502.06371	null
2025-02-10	Calibrating LLMs with Information-Theoretic Evidential Deep Learning	Yawei Li et.al.	2502.06351	link
2025-02-10	Can AI Examine Novelty of Patents?: Novelty Evaluation Based on the Correspondence between Patent Claim and Prior Art	Hayato Ikoma et.al.	2502.06316	null
2025-02-10	Latent Convergence Modulation in Large Language Models: A Novel Approach to Iterative Contextual Realignment	Patricia Porretta et.al.	2502.06302	null
2025-02-10	SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia	Chaoqun Liu et.al.	2502.06298	null
2025-02-10	Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases?	Qingshan Hou et.al.	2502.06289	null
2025-02-10	Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE	Haiduo Huang et.al.	2502.06282	link
2025-02-10	DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models	Utkarsh Tiwari et.al.	2502.06279	null
2025-02-10	Emergent Response Planning in LLM	Zhichen Dong et.al.	2502.06258	null
2025-02-10	K-ON: Stacking Knowledge On the Head Layer of Large Language Model	Lingbing Guo et.al.	2502.06257	null
2025-02-10	Find Central Dogma Again	Wang Liang et.al.	2502.06253	null
2025-02-10	Amplifying Minority Voices: AI-Mediated Devil’s Advocate System for Inclusive Group Decision-Making	Soohwan Lee et.al.	2502.06251	null
2025-02-10	PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts	Zeman Li et.al.	2502.06244	null
2025-02-10	Fully Exploiting Vision Foundation Model’s Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing	Sicen Guo et.al.	2502.06219	null
2025-02-10	LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks	Xin Zhou et.al.	2502.06215	null
2025-02-10	Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement	Junyu Lu et.al.	2502.06207	link
2025-02-10	C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation	Guoxin Chen et.al.	2502.06205	null
2025-02-10	Non-literal Understanding of Number Words by Language Models	Polina Tsvilodub et.al.	2502.06204	null
2025-02-10	Timing Matters: How Using LLMs at Different Timings Influences Writers’ Perceptions and Ideation Outcomes in AI-Assisted Ideation	Peinuan Qin et.al.	2502.06197	null
2025-02-10	Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering	Ruiqi Wang et.al.	2502.06193	null
2025-02-10	Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis	Sanket Jantre et.al.	2502.06173	null
2025-02-10	A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation	Wenhui Lei et.al.	2502.06171	null
2025-02-10	Universal Approximation of Visual Autoregressive Transformers	Yifang Chen et.al.	2502.06167	null
2025-02-10	Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy	Kamyar Kazari et.al.	2502.06150	null
2025-02-10	Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection	Yan Weng et.al.	2502.06148	null
2025-02-10	LegalViz: Legal Text Visualization by Text To Diagram Generation	Eri Onami et.al.	2502.06147	null
2025-02-10	LCIRC: A Recurrent Compression Approach for Efficient Long-form Context and Query Dependent Modeling in LLMs	Sumin An et.al.	2502.06139	null
2025-02-10	Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models	Ce Zhang et.al.	2502.06130	link
2025-02-10	Foundation Model of Electronic Medical Records for Adaptive Risk Estimation	Pawel Renc et.al.	2502.06124	link
2025-02-10	Task-driven Layerwise Additive Activation Intervention	Hieu Trung Nguyen et.al.	2502.06115	null
2025-02-10	CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories	Yijia Xiao et.al.	2502.06111	null
2025-02-10	RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning	Jian Xu et.al.	2502.06101	link
2025-02-10	ConMeC: A Dataset for Metonymy Resolution with Common Nouns	Saptarshi Ghosh et.al.	2502.06087	link
2025-02-10	Physics-Guided Foundation Model for Scientific Discovery: An Application to Aquatic Science	Runlong Yu et.al.	2502.06084	link
2025-02-10	Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo	Cheuk Kit Lee et.al.	2502.06079	null
2025-02-09	Deconstructing Depression Stigma: Integrating AI-driven Data Collection and Analysis with Causal Knowledge Graphs	Han Meng et.al.	2502.06075	null
2025-02-09	Allegro-FM: Towards Equivariant Foundation Model for Exascale Molecular Dynamics Simulations	Ken-ichi Nomura et.al.	2502.06073	null
2025-02-09	Benchmarking Prompt Sensitivity in Large Language Models	Amirhossein Razavi et.al.	2502.06065	null
2025-02-09	Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization	Jiajun Fan et.al.	2502.06061	null
2025-02-09	Benchmarking Prompt Engineering Techniques for Secure Code Generation with GPT Models	Marc Bruni et.al.	2502.06039	null
2025-02-09	Investigating Compositional Reasoning in Time Series Foundation Models	Willa Potosnak et.al.	2502.06037	link
2025-02-09	A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions	Elisa Negrini et.al.	2502.06026	link
2025-02-09	Dual Caption Preference Optimization for Diffusion Models	Amir Saeidi et.al.	2502.06023	link
2025-02-09	Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding	Xingjian Diao et.al.	2502.06020	link
2025-02-09	Media Bias Detector: Designing and Implementing a Tool for Real-Time Selection and Framing Bias Analysis in News Coverage	Jenny S Wang et.al.	2502.06009	null
2025-02-09	Analysis of LLM as a grammatical feature tagger for African American English	Rahul Porwal et.al.	2502.06004	null
2025-02-09	HamRaz: A Culture-Based Persian Conversation Dataset for Person-Centered Therapy Using LLM Agents	Mohammad Amin Abbasi et.al.	2502.05982	null
2025-02-09	$μ$nit Scaling: Simple and Scalable FP8 LLM Training	Saaketh Narayan et.al.	2502.05967	null
2025-02-09	Redefining Robot Generalization Through Interactive Intelligence	Sharmita Dey et.al.	2502.05963	null
2025-02-09	MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents	Jiabin Tang et.al.	2502.05957	link
2025-02-09	Cyri: A Conversational AI-based Assistant for Supporting the Human User in Detecting and Responding to Phishing Attacks	Antonio La Torre et.al.	2502.05951	null
2025-02-09	Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention	Zhendong Zhang et.al.	2502.05947	null
2025-02-09	“Let the AI conspiracy begin…” Language Model coordination is just one inference-intervention away	Paul Darm et.al.	2502.05945	link
2025-02-07	Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy	Yunhang Shen et.al.	2502.05177	link
2025-02-07	Fillerbuster: Multi-View Scene Completion for Casual Captures	Ethan Weber et.al.	2502.05175	null
2025-02-07	NoLiMa: Long-Context Evaluation Beyond Literal Matching	Ali Modarressi et.al.	2502.05167	link
2025-02-07	Multitwine: Multi-Object Compositing with Text and Layout Control	Gemma Canet Tarrés et.al.	2502.05165	null
2025-02-07	DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails	Yihe Deng et.al.	2502.05163	link
2025-02-07	A Lightweight Method to Disrupt Memorized Sequences in LLM	Parjanya Prajakta Prashant et.al.	2502.05159	null
2025-02-07	Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation	Steffen Eger et.al.	2502.05151	link
2025-02-07	CodeSCM: Causal Analysis for Multi-Modal Code Generation	Mukur Gupta et.al.	2502.05150	link
2025-02-07	An Annotated Reading of ‘The Singer of Tales’ in the LLM Era	Kush R. Varshney et.al.	2502.05148	null
2025-02-07	Chest X-ray Foundation Model with Global and Local Representations Integration	Zefan Yang et.al.	2502.05142	link
2025-02-07	Latent Swap Joint Diffusion for Long-Form Audio Generation	Yusheng Dai et.al.	2502.05130	null
2025-02-07	Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning	Matt von Hippel et.al.	2502.05121	null
2025-02-07	Flexible and Efficient Grammar-Constrained Decoding	Kanghee Park et.al.	2502.05111	null
2025-02-07	Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs	Rohit Saxena et.al.	2502.05092	null
2025-02-07	Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs	Thierry Bossy et.al.	2502.05087	link
2025-02-07	Causality can systematically address the monsters under the bench(marks)	Felix Leeb et.al.	2502.05085	null
2025-02-07	ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework	Xiaoyu Deng et.al.	2502.05084	null
2025-02-07	Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures	Tushar Pandey et.al.	2502.05078	link
2025-02-07	Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images	Aditya Kumar et.al.	2502.05066	link
2025-02-07	nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow	Geliang Ouyang et.al.	2502.05036	link
2025-02-07	Prospects for detecting generic fast-time features in the neutrino lightcurve of nearby supernovae in neutrino telescopes	Jakob Beise et.al.	2502.05024	null
2025-02-07	QuEST: Stable Training of LLMs with 1-Bit Weights and Activations	Andrei Panferov et.al.	2502.05003	link
2025-02-07	Aligning Black-box Language Models with Human Judgments	Gerrit J. J. van den Burg et.al.	2502.04997	null
2025-02-07	C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features	Chenxing Sun et.al.	2502.04991	null
2025-02-07	MoGraphGPT: Creating Interactive Scenes Using Modular LLM and Graphical Control	Hui Ye et.al.	2502.04983	null
2025-02-07	Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits	Finn Rietz et.al.	2502.04979	null
2025-02-07	Towards Multimodal Empathetic Response Generation: A Rich Text-Speech-Vision Avatar-based Benchmark	Han Zhang et.al.	2502.04976	null
2025-02-07	CoCoA: A Generalized Approach to Uncertainty Quantification by Integrating Confidence and Consistency of LLM Outputs	Roman Vashurin et.al.	2502.04964	null
2025-02-07	The Rising Threat to Emerging AI-Powered Search Engines	Zeren Luo et.al.	2502.04951	null
2025-02-07	Mobile Network-specialized Large Language Models for 6G: Architectures, Innovations, Challenges, and Future Trends	Abdelaali Chaoub et.al.	2502.04933	null
2025-02-07	Generative-enhanced optimization for knapsack problems: an industry-relevant study	Yelyzaveta Vodovozova et.al.	2502.04928	null
2025-02-07	Classification or Prompting: A Case Study on Legal Requirements Traceability	Romina Etezadi et.al.	2502.04916	null
2025-02-07	Goku: Flow Based Video Generative Foundation Models	Shoufa Chen et.al.	2502.04896	null
2025-02-07	A Foundational Brain Dynamics Model via Stochastic Optimal Control	Joonhyeong Park et.al.	2502.04892	null
2025-02-07	Training-free Task-oriented Grasp Generation	Jiaming Wang et.al.	2502.04873	null
2025-02-07	Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration	Yifeng Yu et.al.	2502.04849	null
2025-02-07	Developmentally-plausible Working Memory Shapes a Critical Period for Language Acquisition	Masato Mita et.al.	2502.04795	null
2025-02-07	S$^2$-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency	Yuting Zeng et.al.	2502.04790	null
2025-02-07	Probing Internal Representations of Multi-Word Verbs in Large Language Models	Hassane Kissane et.al.	2502.04789	null
2025-02-07	Enhancing SQL Injection Detection and Prevention Using Generative Models	Naga Sai Dasari et.al.	2502.04786	null
2025-02-07	SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning	Wanjia Zhao et.al.	2502.04780	link
2025-02-07	SeDi-Instruct: Enhancing Alignment of Language Models through Self-Directed Instruction Generation	Jungwoo Kim et.al.	2502.04774	null
2025-02-07	Enhancing Phishing Email Identification with Large Language Models	Catherine Lee et.al.	2502.04759	null
2025-02-07	Concept Navigation and Classification via Open Source Large Language Model Processing	Maël Kubli et.al.	2502.04756	null
2025-02-07	Every Software as an Agent: Blueprint and Case Study	Mengwei Xu et.al.	2502.04747	null
2025-02-07	PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational Autoencoders	Tianyu Xie et.al.	2502.04730	link
2025-02-07	Generating Symbolic World Models via Test-time Scaling of Large Language Models	Zhouliang Yu et.al.	2502.04728	link
2025-02-07	Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics?	Sourabrata Mukherjee et.al.	2502.04718	null
2025-02-07	Enhancing Impression Change Prediction in Speed Dating Simulations Based on Speakers’ Personalities	Kazuya Matsuo et.al.	2502.04706	null
2025-02-07	STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion	Zhenwei Wu et.al.	2502.04692	null
2025-02-07	ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning	Yuwei Yin et.al.	2502.04689	link
2025-02-07	M-IFEval: Multilingual Instruction-Following Evaluation	Antoine Dussolle et.al.	2502.04688	link
2025-02-07	Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization	Zelai Xu et.al.	2502.04686	null
2025-02-07	G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models	Mengdi Liu et.al.	2502.04684	null
2025-02-07	CALF-SBM: A Covariate-Assisted Latent Factor Stochastic Block Model	Sydney Louit et.al.	2502.04681	null
2025-02-07	LLM Query Scheduling with Prefix Reuse and Latency Constraints	Gregory Dexter et.al.	2502.04677	null
2025-02-07	AdParaphrase: Paraphrase Dataset for Analyzing Linguistic Features toward Generating Attractive Ad Texts	Soichiro Murakami et.al.	2502.04674	link
2025-02-07	Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization	Xinhao Yao et.al.	2502.04667	link
2025-02-07	Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy	Rishabh Upadhyay et.al.	2502.04666	null
2025-02-07	Importance Sampling via Score-based Generative Models	Heasung Kim et.al.	2502.04646	null
2025-02-07	Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research	Junde Wu et.al.	2502.04644	link
2025-02-07	Confidence Elicitation: A New Attack Vector for Large Language Models	Brian Formento et.al.	2502.04643	link
2025-02-07	Contrastive Learning-Enhanced Large Language Models for Monolith-to-Microservice Decomposition	Khaled Sellami et.al.	2502.04604	null
2025-02-07	Extracting and Understanding the Superficial Knowledge in Alignment	Runjin Chen et.al.	2502.04602	link
2025-02-07	The $α$-Alternator: Dynamic Adaptation To Varying Noise Levels In Sequences Using The Vendi Score For Improved Robustness and Performance	Mohammad Reza Rezaei et.al.	2502.04593	null
2025-02-07	Position-aware Automatic Circuit Discovery	Tal Haklay et.al.	2502.04577	link
2025-02-06	My LLM might Mimic AAE – But When Should it?	Sandra C. Sandoval et.al.	2502.04564	link
2025-02-06	Speeding up Speculative Decoding via Approximate Verification	Meiyu Zhong et.al.	2502.04557	null
2025-02-06	TruthFlow: Truthful LLM Generation via Representation Flow Correction	Hanyu Wang et.al.	2502.04556	null
2025-02-06	Contextual Gradient Flow Modeling for Large Language Model Generalization in Multi-Scale Feature Spaces	Daphne Quillington et.al.	2502.04548	null
2025-02-06	Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection	Minseok Jung et.al.	2502.04528	null
2025-02-06	Safety is Essential for Responsible Open-Ended Systems	Ivaxi Sheth et.al.	2502.04512	null
2025-02-06	ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization	Zijun Wu et.al.	2502.04501	null
2025-02-06	Verifiable Format Control for Large Language Model Generations	Zhaoyang Wang et.al.	2502.04498	null
2025-02-06	Multi-Agent Reinforcement Learning with Focal Diversity Optimization	Selim Furkan Tekin et.al.	2502.04492	link
2025-02-06	Building A Unified AI-centric Language System: analysis, framework and future work	Edward Hong Wang et.al.	2502.04488	null
2025-02-06	Active Task Disambiguation with LLMs	Katarzyna Kobalczyk et.al.	2502.04485	link
2025-02-06	The ML Supply Chain in the Era of Software 2.0: Lessons Learned from Hugging Face	Trevor Stalnaker et.al.	2502.04484	null
2025-02-06	Near-Optimal Sample Complexity for MDPs via Anchoring	Jongmin Lee et.al.	2502.04477	null
2025-02-06	ADIFF: Explaining audio difference using natural language	Soham Deshmukh et.al.	2502.04476	link
2025-02-06	Augmented Conditioning Is Enough For Effective Training Image Generation	Jiahui Chen et.al.	2502.04475	null
2025-02-06	Iterative Importance Fine-tuning of Diffusion Models	Alexander Denker et.al.	2502.04468	null
2025-02-06	FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks	Luca Della Libera et.al.	2502.04465	null
2025-02-06	Training Language Models to Reason Efficiently	Daman Arora et.al.	2502.04463	link
2025-02-06	Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization	Yu-Neng Chuang et.al.	2502.04428	null
2025-02-06	Decoding AI Judgment: How LLMs Assess News Credibility and Bias	Edoardo Loru et.al.	2502.04426	null
2025-02-06	EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models	He Hu et.al.	2502.04424	null
2025-02-06	Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment	Zuyan Liu et.al.	2502.04328	link
2025-02-06	Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness	Karolina Rudnicka et.al.	2502.04324	null
2025-02-06	Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions	Yik Siu Chan et.al.	2502.04322	link
2025-02-06	ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features	Alec Helbling et.al.	2502.04320	link
2025-02-06	sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views	Eyvaz Najafli et.al.	2502.04318	null
2025-02-06	ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters	Kamer Ali Yuksel et.al.	2502.04315	link
2025-02-06	ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization	Yinjie Wang et.al.	2502.04306	link
2025-02-06	MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Jinbo Xing et.al.	2502.04299	null
2025-02-06	Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression	Lirui Wang et.al.	2502.04296	null
2025-02-06	Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization	Yuanye Liu et.al.	2502.04295	link
2025-02-06	PILAF: Optimal Human Preference Sampling for Reward Modeling	Yunzhen Feng et.al.	2502.04270	null
2025-02-06	Efficient Randomized Experiments Using Foundation Models	Piersilvio De Bartolomeis et.al.	2502.04262	link
2025-02-06	Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention	Ayush K. Varshney et.al.	2502.04260	null
2025-02-06	MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion	Xintong Hao et.al.	2502.04235	null
2025-02-06	Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks	Andreas Happe et.al.	2502.04227	link
2025-02-06	Keep It Light! Simplifying Image Clustering Via Text-Free Adapters	Yicen Li et.al.	2502.04226	null
2025-02-06	Éclair – Extracting Content and Layout with Integrated Reading Order for Documents	Ilia Karmanov et.al.	2502.04223	null
2025-02-06	Sports and Women’s Sports: Gender Bias in Text Generation with Olympic Data	Laura Biester et.al.	2502.04218	null
2025-02-06	Algorithmic causal structure emerging through compression	Liang Wendong et.al.	2502.04210	null
2025-02-06	“Short-length” Adversarial Training Helps LLMs Defend “Long-length” Jailbreak Attacks: Theoretical and Empirical Evidence	Shaopeng Fu et.al.	2502.04204	link
2025-02-06	The Best Instruction-Tuning Data are Those That Fit	Dylan Zhang et.al.	2502.04194	null
2025-02-06	PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models?	Mennatullah Siam et.al.	2502.04192	link
2025-02-06	Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models	Carlos Eduardo Duarte et.al.	2502.04188	null
2025-02-06	Multi-agent Architecture Search via Agentic Supernet	Guibin Zhang et.al.	2502.04180	link
2025-02-06	MRAMG-Bench: A BeyondText Benchmark for Multimodal Retrieval-Augmented Multimodal Generation	Qinhan Yu et.al.	2502.04176	link
2025-02-06	Diffusion-based mass map reconstruction from weak lensing data	Supranta S. Boruah et.al.	2502.04158	null
2025-02-06	UltraIF: Advancing Instruction Following from the Wild	Kaikai An et.al.	2502.04153	link
2025-02-06	The Order Effect: Investigating Prompt Sensitivity in Closed-Source LLMs	Bryan Guan et.al.	2502.04134	null
2025-02-06	Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis	Zhen Ye et.al.	2502.04128	link
2025-02-06	Generative Adversarial Networks Bridging Art and Machine Intelligence	Junhao Song et.al.	2502.04116	null
2025-02-06	VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output	Eason Chen et.al.	2502.04103	null
2025-02-06	LLMs to Support a Domain Specific Knowledge Assistant	Maria-Flavia Lovin et.al.	2502.04095	null
2025-02-06	AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference	Qingyue Yang et.al.	2502.04077	link
2025-02-06	Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency	Shangkun Sun et.al.	2502.04076	link
2025-02-06	Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training	Changhao Jiang et.al.	2502.04066	link
2025-02-06	TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers	Younghye Hwang et.al.	2502.04056	null
2025-02-06	Exploring Imbalanced Annotations for Effective In-Context Learning	Hongfu Gao et.al.	2502.04037	null
2025-02-06	Fine, I’ll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging	Guinan Su et.al.	2502.04030	link
2025-02-06	Echo-Teddy: Preliminary Design and Development of Large Language Model-based Social Robot for Autistic Students	Unggi Lee et.al.	2502.04029	null
2025-02-06	Quantification of Biodiversity from Historical Survey Text with LLM-based Best-Worst Scaling	Thomas Haider et.al.	2502.04022	null
2025-02-06	Automating a Complete Software Test Process Using LLMs: An Automotive Case Study	Shuai Wang et.al.	2502.04008	null
2025-02-06	CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing	Yu Yuan et.al.	2502.03997	null
2025-02-06	Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering	Longquan Jiang et.al.	2502.03992	link
2025-02-06	Tight Bounds on Jensen’s Gap: Novel Approach with Applications in Generative Modeling	Marcin Mazur et.al.	2502.03988	null
2025-02-06	MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation	YoonJe Kang et.al.	2502.03966	null
2025-02-06	MAQInstruct: Instruction-based Unified Event Relation Extraction	Jun Xu et.al.	2502.03954	null
2025-02-06	LR0.FM: Low-Resolution Zero-shot Classification Benchmark For Foundation Models	Priyank Pathak et.al.	2502.03950	link
2025-02-06	Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond	Mardhiyah Sanni et.al.	2502.03945	null
2025-02-06	Unravelling Causal Genetic Biomarkers of Alzheimer’s Disease via Neuron to Gene-token Backtracking in Neural Architecture: A Groundbreaking Reverse-Gene-Finder Approach	Victor OK Li et.al.	2502.03938	null
2025-02-06	Quantifying Correlations of Machine Learning Models	Yuanyuan Li et.al.	2502.03937	link
2025-02-06	HEP-JEPA: A foundation model for collider physics using joint embedding predictive architecture	Jai Bardhan et.al.	2502.03933	null
2025-02-06	Experiments with Large Language Models on Retrieval-Augmented Generation for Closed-Source Simulation Software	Andreas Baumann et.al.	2502.03916	null
2025-02-06	No Free Lunch in Annotation either: An objective evaluation of foundation models for streamlining annotation in animal tracking	Emil Mededovic et.al.	2502.03907	link
2025-02-06	LeAP: Consistent multi-domain 3D labeling using Foundation Models	Simon Gebraad et.al.	2502.03901	null
2025-02-06	InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers	Chenchen Shou et.al.	2502.03885	null
2025-02-06	Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning	Peizhuang Cong et.al.	2502.03884	null
2025-02-06	BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation	Bo Pang et.al.	2502.03860	null
2025-02-06	PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication	Zhuohui Zhang et.al.	2502.03845	null
2025-02-06	Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis	Lin Yuan et.al.	2502.03843	null
2025-02-06	FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing	Jinya Sakurai et.al.	2502.03826	null
2025-02-06	Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation	Tianhao Li et.al.	2502.03825	null
2025-02-06	PsyPlay: Personality-Infused Role-Playing Conversational Agents	Tao Yang et.al.	2502.03821	null
2025-02-06	Large Language Models for Multi-Robot Systems: A Survey	Peihan Li et.al.	2502.03814	link
2025-02-06	Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective	Yuan Feng et.al.	2502.03805	link
2025-02-06	Understanding and Supporting Formal Email Exchange by Answering AI-Generated Questions	Yusuke Miura et.al.	2502.03804	link
2025-02-06	Enhancing Hallucination Detection through Noise Injection	Litian Liu et.al.	2502.03799	null
2025-02-06	Distribution learning via neural differential equations: minimal energy regularization and approximation theory	Youssef Marzouk et.al.	2502.03795	null
2025-02-06	It’s All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers	Benjamin Clavié et.al.	2502.03793	null
2025-02-06	Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence	Jacob Fein-Ashley et.al.	2502.03787	null
2025-02-06	GistVis: Automatic Generation of Word-scale Visualizations from Data-rich Documents	Ruishi Zou et.al.	2502.03784	link
2025-02-06	Adaptive Semantic Prompt Caching with VectorQ	Luis Gaspar Schroeder et.al.	2502.03771	null
2025-02-06	Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models	Meiquan Dong et.al.	2502.03766	null
2025-02-06	Rethinking the Residual Distribution of Locate-then-Editing Methods in Model Editing	Xiaopeng Li et.al.	2502.03748	null
2025-02-06	Speaking the Language of Teamwork: LLM-Guided Credit Assignment in Multi-Agent Reinforcement Learning	Muhan Lin et.al.	2502.03723	null
2025-02-06	Boosting Knowledge Graph-based Recommendations through Confidence-Aware Augmentation with Large Language Models	Rui Cai et.al.	2502.03715	null
2025-02-06	MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers	Nicole Cho et.al.	2502.03711	null
2025-02-06	Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers	Daniel Beaglehole et.al.	2502.03708	null
2025-02-06	LLM Alignment as Retriever Optimization: An Information Retrieval Perspective	Bowen Jin et.al.	2502.03699	null
2025-02-06	A Comparison of DeepSeek and Other LLMs	Tianchen Gao et.al.	2502.03688	null
2025-02-06	Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free	Gian Mario Favero et.al.	2502.03687	null
2025-02-06	Controlled LLM Decoding via Discrete Auto-regressive Biasing	Patrick Pynadath et.al.	2502.03685	null
2025-02-05	Reflection-Window Decoding: Text Generation with Selective Refinement	Zeyu Tang et.al.	2502.03678	null
2025-02-05	Advancing Reasoning in Large Language Models: Promising Methods and Approaches	Avinash Patil et.al.	2502.03671	null
2025-02-05	Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set	Yikai Wu et.al.	2502.03669	null
2025-02-05	Privacy-Preserving Generative Models: A Comprehensive Survey	Debalina Padariya et.al.	2502.03668	null
2025-02-05	Context-Preserving Gradient Modulation for Large Language Models: A Novel Approach to Semantic Consistency in Long-Form Text Generation	Nirola Kobanov et.al.	2502.03643	null
2025-02-05	SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models	Daniel Levy et.al.	2502.03638	link
2025-02-05	AdaPhish: AI-Powered Adaptive Defense and Education Resource Against Deceptive Emails	Rei Meguro et.al.	2502.03622	null
2025-02-05	Bilevel ZOFO: Bridging Parameter-Efficient and Zeroth-Order Techniques for Efficient LLM Fine-Tuning and Meta-Training	Reza Shirkavand et.al.	2502.03604	null
2025-02-05	HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference	Zeyu Zhang et.al.	2502.03589	null
2025-02-05	A Mixed-Methods Evaluation of LLM-Based Chatbots for Menopause	Roshini Deva et.al.	2502.03579	null
2025-02-05	Code Simulation as a Proxy for High-order Tasks in Large Language Models	Emanuele La Malfa et.al.	2502.03568	null
2025-02-05	Kronecker Mask and Interpretive Prompts are Language-Action Video Learners	Jingyi Yang et.al.	2502.03549	link
2025-02-05	YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment	Amitava Das et.al.	2502.03512	null
2025-02-05	Do Large Language Model Benchmarks Test Reliability?	Joshua Vendrow et.al.	2502.03461	link
2025-02-05	Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training	Boyao Wang et.al.	2502.03460	null
2025-02-05	A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs)	Yiye Chen et.al.	2502.03450	null
2025-02-05	Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics	Xuan Li et.al.	2502.03449	null
2025-02-05	BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving	Ran Xin et.al.	2502.03438	null
2025-02-05	Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization	Yu-Han Wu et.al.	2502.03435	null
2025-02-05	On Fairness of Unified Multimodal Large Language Model for Image Generation	Ming Liu et.al.	2502.03429	null
2025-02-05	Harnessing Large Language Models for Curated Code Reviews	Oussama Ben Sghaier et.al.	2502.03425	link
2025-02-05	Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation	Alexey A. Novikov et.al.	2502.03420	null
2025-02-05	Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts	Nikta Gohari Sadr et.al.	2502.03418	null
2025-02-05	SPRI: Aligning Large Language Models with Context-Situated Principles	Hongli Zhan et.al.	2502.03397	null
2025-02-05	Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications	Issar Arab et.al.	2502.03395	null
2025-02-05	LIMO: Less is More for Reasoning	Yixin Ye et.al.	2502.03387	link
2025-02-05	Transformers and Their Roles as Time Series Foundation Models	Dennis Wu et.al.	2502.03383	null
2025-02-05	Demystifying Long Chain-of-Thought Reasoning in LLMs	Edward Yeo et.al.	2502.03373	link
2025-02-05	PalimpChat: Declarative and Interactive AI analytics	Chunwei Liu et.al.	2502.03368	null
2025-02-05	RadVLM: A Multitask Conversational Vision-Language Model for Radiology	Nicolas Deperrois et.al.	2502.03333	null
2025-02-05	ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model	Qiguang Chen et.al.	2502.03325	null
2025-02-05	Out-of-Distribution Detection using Synthetic Data Generation	Momin Abbas et.al.	2502.03323	null
2025-02-05	Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques	Sangjun Han et.al.	2502.03321	null
2025-02-05	Intent Representation Learning with Large Language Model for Recommendation	Yu Wang et.al.	2502.03307	link
2025-02-05	Harmony in Divergence: Towards Fast, Accurate, and Memory-efficient Zeroth-order LLM Fine-tuning	Qitao Tan et.al.	2502.03304	null
2025-02-05	MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters	Amin Dada et.al.	2502.03298	null
2025-02-05	SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs	Ben Liu et.al.	2502.03283	null
2025-02-05	Posterior SBC: Simulation-Based Calibration Checking Conditional on Data	Teemu Säilynoja et.al.	2502.03279	link
2025-02-05	Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning	DiJia Su et.al.	2502.03275	null
2025-02-05	ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models	Ying Zhang et.al.	2502.03266	link
2025-02-05	General Time-series Model for Universal Knowledge Representation of Multivariate Time-Series data	Cheng He et.al.	2502.03264	null
2025-02-05	CARROT: A Cost Aware Rate Optimal Router	Seamus Somerstep et.al.	2502.03261	null
2025-02-05	RiemannGFM: Learning a Graph Foundation Model from Riemannian Geometry	Li Sun et.al.	2502.03251	null
2025-02-05	Exploring the Security Threats of Knowledge Base Poisoning in Retrieval-Augmented Code Generation	Bo Lin et.al.	2502.03233	null
2025-02-05	Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models	Jialiang Wu et.al.	2502.03199	null
2025-02-05	MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding	Pengyi Li et.al.	2502.03183	null
2025-02-05	PICBench: Benchmarking LLMs for Photonic Integrated Circuits Design	Yuchao Wu et.al.	2502.03159	link
2025-02-05	Strategizing with AI: Insights from a Beauty Contest Experiment	Iuliia Alekseenko et.al.	2502.03158	null
2025-02-05	Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models	Xumeng Wen et.al.	2502.03147	null
2025-02-05	Symmetry-Aware Bayesian Flow Networks for Crystal Generation	Laura Ruple et.al.	2502.03146	null
2025-02-05	Teaching Large Language Models Number-Focused Headline Generation With Key Element Rationales	Zhen Qian et.al.	2502.03129	null
2025-02-05	Metis: A Foundation Speech Generation Model with Masked Generative Pre-training	Yuancheng Wang et.al.	2502.03128	link
2025-02-05	Structured Token Retention and Computational Memory Paths in Large Language Models	Jonathan Delena et.al.	2502.03102	null
2025-02-05	Reveal the Mystery of DPO: The Connection between DPO and RL Algorithms	Xuerui Su et.al.	2502.03095	null
2025-02-05	Implementing Large Quantum Boltzmann Machines as Generative AI Models for Dataset Balancing	Salvatore Sinno et.al.	2502.03086	null
2025-02-05	IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates	Aissatou Diallo et.al.	2502.03080	null
2025-02-05	Poisson Flow Joint Model for Multiphase contrast-enhanced CT	Rongjun Ge et.al.	2502.03079	null
2025-02-05	Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation	Nina Freise et.al.	2502.03078	null
2025-02-05	Optimizing Electric Vehicles Charging using Large Language Models and Graph Neural Networks	Stavros Orfanoudakis et.al.	2502.03067	null
2025-02-05	Understanding and Enhancing the Transferability of Jailbreaking Attacks	Runqi Lin et.al.	2502.03052	link
2025-02-05	RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts	Tuan Truong et.al.	2502.03044	null
2025-02-05	Large Language Models Are Universal Recommendation Learners	Junguang Jiang et.al.	2502.03041	null
2025-02-05	FuXi-$α$: Scaling Recommendation Model with Feature Interaction Enhanced Transformer	Yufei Ye et.al.	2502.03036	null
2025-02-05	Knowledge Distillation from Large Language Models for Household Energy Modeling	Mohannad Takrouri et.al.	2502.03034	null
2025-02-05	Analyze Feature Flow to Enhance Interpretation and Steering in Language Models	Daniil Laptev et.al.	2502.03032	null
2025-02-05	Scaling Laws for Upcycling Mixture-of-Experts Language Models	Seng Pei Liew et.al.	2502.03009	null
2025-02-05	MedBioLM: Optimizing Medical and Biological QA with Fine-Tuned Large Language Models and Retrieval-Augmented Generation	Seonok Kim et.al.	2502.03004	null
2025-02-05	Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons	Renjun Hu et.al.	2502.02988	null
2025-02-05	Membership Inference Attack Should Move On to Distributional Statistics for Distilled Generative Models	Muxing Li et.al.	2502.02970	null
2025-02-05	The Labeled Coupon Collector Problem with Random Sample Sizes and Partial Recovery	Shoham Shimon Berrebi et.al.	2502.02968	null
2025-02-05	Large Language Model Adversarial Landscape Through the Lens of Attack Objectives	Nan Wang et.al.	2502.02960	null
2025-02-05	Position: Editing Large Language Models Poses Serious Safety Risks	Paul Youssef et.al.	2502.02958	null
2025-02-05	Control Search Rankings, Control the World: What is a Good Search Engine?	Simon Coghlan et.al.	2502.02957	null
2025-02-05	LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction	Ziwei Wang et.al.	2502.02945	null
2025-02-05	Large Language Model Guided Self-Debugging Code Generation	Muntasir Adnan et.al.	2502.02928	null
2025-02-05	SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs	Dinithi Jayasuriya et.al.	2502.02909	null
2025-02-05	AI-driven materials design: a mini-review	Mouyang Cheng et.al.	2502.02905	null
2025-02-05	A Benchmark for the Detection of Metalinguistic Disagreements between LLMs and Knowledge Graphs	Bradley P. Allen et.al.	2502.02896	null
2025-02-05	Lowering the Barrier of Machine Learning: Achieving Zero Manual Labeling in Review Classification Using LLMs	Yejian Zhang et.al.	2502.02893	null
2025-02-05	Expertized Caption Auto-Enhancement for Video-Text Retrieval	Junxiang Chen et.al.	2502.02885	link
2025-02-05	SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions	Xiaofan Yu et.al.	2502.02883	null
2025-02-05	Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning	Yibo Yan et.al.	2502.02871	null
2025-02-05	A Systematic Approach for Assessing Large Language Models’ Test Case Generation Capability	Hung-Fu Chang et.al.	2502.02866	null
2025-02-05	OceanChat: The Effect of Virtual Conversational AI Agents on Sustainable Attitude and Behavior Change	Pat Pataranutaporn et.al.	2502.02863	null
2025-02-05	A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges	Lei Ding et.al.	2502.02835	null
2025-02-05	COFFE: A Code Efficiency Benchmark for Code Generation	Yun Peng et.al.	2502.02827	link
2025-02-05	Accessible and Portable LLM Inference by Compiling Computational Graphs into SQL	Wenbo Sun et.al.	2502.02818	null
2025-02-05	Mol-LLM: Generalist Molecular LLM with Improved Graph Utilization	Chanhui Lee et.al.	2502.02810	null
2025-02-05	CAMI: A Counselor Agent Supporting Motivational Interviewing through State Inference and Topic Exploration	Yizhe Yang et.al.	2502.02807	null
2025-02-05	Leveraging the true depth of LLMs	Ramón Calvo González et.al.	2502.02790	null
2025-02-05	Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation	Jingyu Liu et.al.	2502.02789	link
2025-02-05	SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models	Amirhossein Dabiriaghdam et.al.	2502.02787	link
2025-02-04	Classroom Simulacra: Building Contextual Student Generative Agents in Online Education for Learning Behavioral Simulation	Songlin Xu et.al.	2502.02780	link
2025-02-04	3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography	Weicheng Zhu et.al.	2502.02779	null
2025-02-04	Twilight: Adaptive Attention Sparsity with Hierarchical Top-$p$ Pruning	Chaofan Lin et.al.	2502.02770	null
2025-02-04	LLM-USO: Large Language Model-based Universal Sizing Optimizer	Karthik Somayaji N. S et.al.	2502.02764	null
2025-02-04	Rethinking Vision Transformer for Object Centric Foundation Models	Manuel Traub et.al.	2502.02763	null
2025-02-04	Too Noisy To Learn: Enhancing Data Quality for Code Review C	Chunhua Liu et.al.	2502.02757	null
2025-02-04	PatchPilot: A Stable and Cost-Efficient Agentic Patching Framework	Hongwei Li et.al.	2502.02747	link
2025-02-04	LLM Bandit: Cost-Efficient LLM Generation via Preference-Conditioned Dynamic Routing	Yang Li et.al.	2502.02743	null
2025-02-04	RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2	Bin Xie et.al.	2502.02741	null
2025-02-04	SmolLM2: When Smol Goes Big – Data-Centric Training of a Small Language Model	Loubna Ben Allal et.al.	2502.02737	null
2025-02-04	Peri-LN: Revisiting Layer Normalization in the Transformer Architecture	Jeonghoon Kim et.al.	2502.02732	null
2025-02-04	Cross-Lingual Transfer for Low-Resource Natural Language Processing	Iker García-Ferrero et.al.	2502.02722	null
2025-02-04	Astromer 2	Cristobal Donoso-Oliva et.al.	2502.02717	null
2025-02-04	A Unified Understanding and Evaluation of Steering Methods	Shawn Im et.al.	2502.02716	null
2025-02-04	An Analysis of LLM Fine-Tuning and Few-Shot Learning for Flaky Test Detection and Classification	Riddhi More et.al.	2502.02715	null
2025-02-04	Exploring LLMs Impact on Student-Created User Stories and Acceptance Testing in Software Development	Allan Brockenbrough et.al.	2502.02675	null
2025-02-04	MedRAX: Medical Reasoning Agent for Chest X-ray	Adibvafa Fallahpour et.al.	2502.02673	link
2025-02-04	Transformers Boost the Performance of Decision Trees on Tabular Data across Sample Sizes	Mayuka Jayawardhana et.al.	2502.02672	link
2025-02-04	Machine-learning approaches to accelerating lattice simulations	Scott Lawrence et.al.	2502.02670	null
2025-02-04	A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI)	Yan Li et.al.	2502.02659	link
2025-02-04	Introducing the Rhea simulations of Milky-Way-like galaxies I: Effect of gravitational potential on morphology and star formation	Junia Göller et.al.	2502.02646	null
2025-02-04	COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation	Xueqing Deng et.al.	2502.02589	null
2025-02-04	Open Materials Generation with Stochastic Interpolants	Philipp Hoellmer et.al.	2502.02582	null
2025-02-04	A comparison of translation performance between DeepL and Supertext	Alex Flückiger et.al.	2502.02577	link
2025-02-04	Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement	Soheil Abbasloo et.al.	2502.02573	null
2025-02-04	Learning the RoPEs: Better 2D and 3D Position Encodings with STRING	Connor Schenck et.al.	2502.02562	null
2025-02-04	Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation	Junha Lee et.al.	2502.02548	null
2025-02-04	LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World	Shrikara Arun et.al.	2502.02539	null
2025-02-04	Adaptive Self-improvement LLM Agentic System for ML Library Development	Genghan Zhang et.al.	2502.02534	link
2025-02-04	Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies	Han Zhou et.al.	2502.02533	null
2025-02-04	Generative Modeling on Lie Groups via Euclidean Generalized Score Matching	Marco Bertolini et.al.	2502.02513	null
2025-02-04	Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search	Maohao Shen et.al.	2502.02508	null
2025-02-04	Learning to generate physical ocean states: Towards hybrid climate modeling	Etienne Meunier et.al.	2502.02499	null
2025-02-04	EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization	Yize Wu et.al.	2502.02493	null
2025-02-04	Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study	Menglong Cui et.al.	2502.02481	null
2025-02-04	Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography	Emir Ahmed et.al.	2502.02475	null
2025-02-04	Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification	Valentina Vadori et.al.	2502.02471	link
2025-02-04	SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency	Qianhao Yuan et.al.	2502.02458	link
2025-02-04	Personalization Toolkit: Training Free Personalization of Large Vision Language Models	Soroush Seifi et.al.	2502.02452	null
2025-02-04	Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study	Calvin Yixiang Cheng et.al.	2502.02451	link
2025-02-04	Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models	Haoran Ye et.al.	2502.02444	null
2025-02-04	LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models	Jiangong Chen et.al.	2502.02441	link
2025-02-04	Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment	Yaling Shen et.al.	2502.02438	null
2025-02-04	TransformDAS: Mapping Φ-OTDR Signals to Riemannian Manifold for Robust Classification	Jiaju Kang et.al.	2502.02428	null
2025-02-04	Activation-Informed Merging of Large Language Models	Amin Heyrani Nobari et.al.	2502.02421	link
2025-02-04	Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling	Markus Krimmel et.al.	2502.02415	link
2025-02-04	AI-Powered, But Power-Hungry? Energy Efficiency of LLM-Generated Code	Lola Solovyeva et.al.	2502.02412	null
2025-02-04	Avoiding spurious sharpness minimization broadens applicability of SAM	Sidak Pal Singh et.al.	2502.02407	null
2025-02-04	LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models	Tzu-Tao Chang et.al.	2502.02406	null
2025-02-04	CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning	Jianfeng Pan et.al.	2502.02390	null
2025-02-04	Hypergraph Link Prediction via Hyperedge Copying	Xie He et.al.	2502.02386	link
2025-02-04	STAIR: Improving Safety Alignment with Introspective Reasoning	Yichi Zhang et.al.	2502.02384	link
2025-02-04	Evaluating the Effectiveness of LLMs in Fixing Maintainability Issues in Real-World Projects	Henrique Nunes et.al.	2502.02368	null
2025-02-04	Field Matching: an Electrostatic Paradigm to Generate and Transfer Data	Alexander Kolesov et.al.	2502.02367	null
2025-02-04	Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs	Sagnik Mukherjee et.al.	2502.02362	null
2025-02-04	SHIELD: APT Detection and Intelligent Explanation Using LLM	Parth Atulbhai Gandhi et.al.	2502.02342	null
2025-02-04	Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking	Jinyang Wu et.al.	2502.02339	null
2025-02-04	ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs	Yuan Tian et.al.	2502.02329	null
2025-02-04	Information-Theoretic Proofs for Diffusion Sampling	Galen Reeves et.al.	2502.02305	null
2025-02-04	Density Ratio Estimation with Conditional Probability Paths	Hanlin Yu et.al.	2502.02300	null
2025-02-04	Evalita-LLM: Benchmarking Large Language Models on Italian	Bernardo Magnini et.al.	2502.02289	null
2025-02-04	Adaptive Resource Allocation Optimization Using Large Language Models in Dynamic Wireless Environments	Hyeonho Noh et.al.	2502.02287	null
2025-02-04	Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation	Atharva Mangeshkumar Agrawal et.al.	2502.02249	null
2025-02-04	Flatten Graphs as Sequences: Transformers are Scalable Graph Generators	Dexiong Chen et.al.	2502.02216	null
2025-02-04	When Dimensionality Hurts: The Role of LLM Embedding Compression for Noisy Regression Tasks	Felix Drinkall et.al.	2502.02199	link
2025-02-04	Large language models in climate and sustainability policy: limits and opportunities	Francesca Larosa et.al.	2502.02191	null
2025-02-04	ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion	Nissim Maruani et.al.	2502.02187	null
2025-02-04	Generative Kernel Spectral Clustering	David Winant et.al.	2502.02185	null
2025-02-04	Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge	Daniel Tamayo et.al.	2502.02173	link
2025-02-04	EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues	Rohit Girmaji et.al.	2502.02172	null
2025-02-04	Risk-Aware Driving Scenario Analysis with Large Language Models	Yuan Gao et.al.	2502.02145	link
2025-02-04	IPO: Iterative Preference Optimization for Text-to-Video Generation	Xiaomeng Yang et.al.	2502.02088	null
2025-02-04	Position Paper: Building Trust in Synthetic Data for Clinical AI	Krishan Agyakari Raja Babu et.al.	2502.02076	null
2025-02-04	Rethinking stance detection: A theoretically-informed research agenda for user-level inference using language models	Prasanta Bhattacharya et.al.	2502.02074	null
2025-02-04	ASCenD-BDS: Adaptable, Stochastic and Context-aware framework for Detection of Bias, Discrimination and Stereotyping	Rajiv Bahl et.al.	2502.02072	null
2025-02-04	Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign	Ruisi Zhang et.al.	2502.02068	null
2025-02-04	AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement	Shivam Singh et.al.	2502.02067	link
2025-02-04	Anticipate & Act: Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments	Raghav Arora et.al.	2502.02066	null
2025-02-04	CASIM: Composite Aware Semantic Injection for Text to Motion Generation	Che-Jui Chang et.al.	2502.02063	null
2025-02-04	Large Language Models for Recommendation with Deliberative User Preference Alignment	Yi Fang et.al.	2502.02061	null
2025-02-04	Efficient Domain Adaptation of Multimodal Embeddings using Constrastive Learning	Georgios Margaritis et.al.	2502.02048	null
2025-02-04	Contextual Memory Reweaving in Large Language Models Using Layered Latent State Reconstruction	Frederick Dillon et.al.	2502.02046	null
2025-02-04	M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference	Nikhil Bhendawade et.al.	2502.02040	null
2025-02-04	ContinuouSP: Generative Model for Crystal Structure Prediction with Invariance and Continuity	Yuji Tone et.al.	2502.02026	link
2025-02-04	From Accidents to Insights: Leveraging Multimodal Data for Scenario-Driven ADS Testing	Siwei Luo et.al.	2502.02025	null
2025-02-04	ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling	Yi-Chiao Wu et.al.	2502.02019	null
2025-02-04	Multi-Domain Graph Foundation Models: Robust Knowledge Transfer via Topology Alignment	Shuo Wang et.al.	2502.02017	null
2025-02-04	A Periodic Bayesian Flow for Material Generation	Hanlin Wu et.al.	2502.02016	link
2025-02-04	Layer by Layer: Uncovering Hidden Representations in Language Models	Oscar Skean et.al.	2502.02013	null
2025-02-04	LLMSecConfig: An LLM-Based Approach for Fixing Software Container Misconfigurations	Ziyang Ye et.al.	2502.02009	null
2025-02-04	Reasoning Bias of Next Token Prediction Training	Pengxiao Lin et.al.	2502.02007	null
2025-02-04	FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024	Arnav Grover et.al.	2502.01992	null
2025-02-04	Can LLMs Assist Annotators in Identifying Morality Frames? – Case Study on Vaccination Debate on Social Media	Tunazzina Islam et.al.	2502.01991	null
2025-02-04	Generative Data Mining with Longtail-Guided Diffusion	David S. Hayden et.al.	2502.01980	null
2025-02-04	Gradient-Regularized Latent Space Modulation in Large Language Models for Structured Contextual Synthesis	Derek Yotheringhay et.al.	2502.01979	null
2025-02-04	AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs	Hongxin Li et.al.	2502.01977	null
2025-02-04	CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing	Wenhao Zheng et.al.	2502.01976	null
2025-02-04	Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning	Jinlong Pang et.al.	2502.01968	null
2025-02-04	MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving	Shiju Zhao et.al.	2502.01960	null
2025-02-04	Local minima of the empirical risk in high dimension: General theorems and convex examples	Kiana Asgari et.al.	2502.01953	null
2025-02-04	DAMO: Data- and Model-aware Alignment of Multi-modal LLMs	Jinda Lu et.al.	2502.01943	link
2025-02-04	Can LLMs Maintain Fundamental Abilities under KV Cache Compression?	Xiang Liu et.al.	2502.01941	null
2025-02-04	Toward a Low-Cost Perception System in Autonomous Vehicles: A Spectrum Learning Approach	Mohammed Alsakabi et.al.	2502.01940	null
2025-02-04	Distributionally Robust Direct Preference Optimization	Zaiyan Xu et.al.	2502.01930	null
2025-02-04	PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling	Avery Ma et.al.	2502.01925	link
2025-02-04	LAST SToP For Modeling Asynchronous Time Series	Shubham Gupta et.al.	2502.01922	null
2025-02-04	Anomaly Detection via Autoencoder Composite Features and NCE	Yalin Liao et.al.	2502.01920	null
2025-02-04	Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales	Arian Eamaz et.al.	2502.01908	null
2025-02-04	Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models	Chia-Wen Kuo et.al.	2502.01906	null
2025-02-04	Conceptual Metaphor Theory as a Prompting Paradigm for Large Language Models	Oliver Kramer et.al.	2502.01901	null
2025-02-03	Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement	Ziad Shaker et.al.	2502.01882	null
2025-02-03	SE Arena: Benchmarking Software Engineering Chatbots with Iterative Interactions	Zhimin Zhao et.al.	2502.01860	null
2025-02-03	Security and Quality in LLM-Generated Code: A Multi-Language, Multi-Model Analysis	Mohammed Kharma et.al.	2502.01853	null
2025-02-03	Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting	Keyi Zhu et.al.	2502.01850	link
2025-02-03	Relatively-Secure LLM-Based Steganography via Constrained Markov Decision Processes	Yu-Shin Huang et.al.	2502.01827	link
2025-02-03	Agentic Bug Reproduction for Effective Automated Program Repair at Google	Runxiang Cheng et.al.	2502.01821	null
2025-02-03	Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning	Hanyang Zhao et.al.	2502.01819	null
2025-02-03	SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models	Diyana Muhammed et.al.	2502.01812	null
2025-02-03	Toward Neurosymbolic Program Comprehension	Alejandro Velasco et.al.	2502.01806	null
2025-02-03	Discovering Chunks in Neural Embeddings for Interpretability	Shuchen Wu et.al.	2502.01803	null
2025-02-03	Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale	Elisa Tsai et.al.	2502.01798	link
2025-01-31	Vintix: Action Model via In-Context Reinforcement Learning	Andrey Polubarov et.al.	2501.19400	link
2025-01-31	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Mustafa O. Karabag et.al.	2501.19398	link
2025-01-31	Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models	Alina Shutova et.al.	2501.19392	link
2025-01-31	Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models	Wenzhi Fang et.al.	2501.19389	link
2025-02-03	SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions	Dominik Wagner et.al.	2501.19377	null
2025-01-31	Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions	Sören Christensen et.al.	2501.19373	null
2025-01-31	We’re Different, We’re the Same: Creative Homogeneity Across LLMs	Emily Wenger et.al.	2501.19361	null
2025-01-31	Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies	Brandon P. Chelstrom et.al.	2501.19359	null
2025-01-31	The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking	Yuchun Miao et.al.	2501.19358	null
2025-01-31	Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters	Adrián Juan-Delgado et.al.	2501.19356	null
2025-01-31	Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023	Ting-Yao E. Hsu et.al.	2501.19353	null
2025-01-31	Towards Adaptive Self-Improvement for Smarter Energy Systems	Alexander Sommer et.al.	2501.19340	null
2025-01-31	PixelWorld: Towards Perceiving Everything as Pixels	Zhiheng Lyu et.al.	2501.19339	null
2025-01-31	Homogeneity Bias as Differential Sampling Uncertainty in Language Models	Messi H. J. Lee et.al.	2501.19337	null
2025-01-31	Reward-Guided Speculative Decoding for Efficient LLM Reasoning	Baohao Liao et.al.	2501.19324	null
2025-01-31	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Anirudh Chari et.al.	2501.19318	null
2025-01-31	LLM-based Affective Text Generation Quality Based on Different Quantization Values	Yarik Menchaca Resendiz et.al.	2501.19317	null
2025-01-31	Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment	Gregor Bachmann et.al.	2501.19309	null
2025-02-03	SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling	Jiefeng Chen et.al.	2501.19306	null
2025-01-31	Beyond checkmate: exploring the creative chokepoints in AI text	Nafis Irtiza Tripto et.al.	2501.19301	link
2025-01-31	Offline Learning for Combinatorial Multi-armed Bandits	Xutong Liu et.al.	2501.19300	null
2025-01-31	Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes	Zhiyao Xu et.al.	2501.19298	null
2025-01-31	Analysis of LLMs vs Human Experts in Requirements Engineering	Cory Hymel et.al.	2501.19297	null
2025-01-31	Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators	Kunpeng Zhang et.al.	2501.19282	null
2025-01-31	Pheromone-based Learning of Optimal Reasoning Paths	Anirudh Chari et.al.	2501.19278	null
2025-01-31	From Assistance to Autonomy – A Researcher Study on the Potential of AI Support for Qualitative Data Analysis	Elisabeth Kirsten et.al.	2501.19275	null
2025-01-31	Jackpot! Alignment as a Maximal Lottery	Roberto-Rafael Maura-Rivero et.al.	2501.19266	null
2025-01-31	Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge	Amogh Joshi et.al.	2501.19259	null
2025-01-31	A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation	Yunzhe Li et.al.	2501.19232	null
2025-01-31	Autonomous Legacy Web Application Upgrades Using a Multi-Agent System	Valtteri Ala-Salmi et.al.	2501.19204	link
2025-02-03	Improving the Robustness of Representation Misdirection for Large Language Model Unlearning	Dang Huu-Tien et.al.	2501.19202	link
2025-01-31	Efficient Reasoning with Hidden Thinking	Xuan Shen et.al.	2501.19201	link
2025-01-31	Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning	Xianglin Yang et.al.	2501.19180	null
2025-01-31	No Foundations without Foundations – Why semi-mechanistic models are essential for regulatory biology	Luka Kovačević et.al.	2501.19178	null
2025-01-31	Position: Contextual Integrity Washing for Language Models	Yan Shvartzshnaider et.al.	2501.19173	null
2025-01-31	Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs	Kejia Zhang et.al.	2501.19164	null
2025-01-31	A theoretical framework for overfitting in energy-based modeling	Giovanni Catania et.al.	2501.19158	null
2025-01-31	A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator	Sixiao Huang et.al.	2501.19135	null
2025-01-31	Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations	Sihwan Park et.al.	2501.19099	null
2025-01-31	Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data	Xichen Xu et.al.	2501.19094	null
2025-01-31	Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models	Jialin Zhao et.al.	2501.19090	null
2025-01-31	Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification	Xiangyu Sun et.al.	2501.19086	null
2025-01-31	Enhancing Code Generation for Low-Resource Languages: No Silver Bullet	Alessandro Giagnorio et.al.	2501.19085	null
2025-01-31	Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations	Dahye Kim et.al.	2501.19066	link
2025-01-31	TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs	Yan Sun et.al.	2501.19057	null
2025-01-31	Enabling Autonomic Microservice Management through Self-Learning Agents	Fenglin Yu et.al.	2501.19056	null
2025-01-31	Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models	Ruiyu Wang et.al.	2501.19054	null
2025-01-31	Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors	Simon Idoko et.al.	2501.19042	link
2025-01-31	Towards the Worst-case Robustness of Large Language Models	Huanran Chen et.al.	2501.19040	null
2025-01-31	Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs	Hongliang Li et.al.	2501.19036	null
2025-01-31	XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses	Bo Lan et.al.	2501.19034	link
2025-01-31	Multilayer Networks in Neuroimaging	Vesna Vuksanovic et.al.	2501.19024	null
2025-01-31	Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation	Bin Zhu et.al.	2501.19017	null
2025-01-31	Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities	Arjun Krishna et.al.	2501.19012	null
2025-01-31	Visual Autoregressive Modeling for Image Super-Resolution	Yunpeng Qu et.al.	2501.18993	link
2025-01-31	Symmetric Pruning of Large Language Models	Kai Yi et.al.	2501.18980	null
2025-01-31	BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics	Yuxuan Liu et.al.	2501.18972	null
2025-01-31	Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping	Pu Yang et.al.	2501.18962	link
2025-01-31	Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow	Alfred Bexley et.al.	2501.18957	null
2025-01-31	LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models	Shenghao Fu et.al.	2501.18954	link
2025-01-31	TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment	Zi-Jian Cheng et.al.	2501.18935	link
2025-01-31	Language Games as the Pathway to Artificial Superhuman Intelligence	Ying Wen et.al.	2501.18924	null
2025-01-31	KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search	Haoran Luo et.al.	2501.18922	link
2025-01-31	LLM Program Optimization via Retrieval Augmented Search	Sagnik Anupam et.al.	2501.18916	null
2025-01-31	Scaling Laws for Differentially Private Language Models	Ryan McKenna et.al.	2501.18914	null
2025-01-31	Streamlining Security Vulnerability Triage with Large Language Models	Mohammad Jalili Torkamani et.al.	2501.18908	null
2025-01-31	Trustworthy Evaluation of Generative AI Models	Zijun Gao et.al.	2501.18897	null
2025-01-31	Can We Predict the Effect of Prompts?	Jae Yong Lee et.al.	2501.18883	null
2025-01-31	Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models	Jiaqi Tang et.al.	2501.18863	null
2025-01-31	BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning	Han Zhong et.al.	2501.18858	null
2025-01-31	Equivariant Hypergraph Diffusion for Crystal Structure Prediction	Yang Liu et.al.	2501.18850	null
2025-01-31	Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities	Yaping Chai et.al.	2501.18845	null
2025-01-31	Trading Inference-Time Compute for Adversarial Robustness	Wojciech Zaremba et.al.	2501.18841	null
2025-01-31	Partially Rewriting a Transformer in Natural Language	Gonçalo Paulo et.al.	2501.18838	link
2025-01-31	Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming	Mrinank Sharma et.al.	2501.18837	null
2025-01-31	Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential	Chenyu Gao et.al.	2501.18834	null
2025-01-31	Structural Embedding Projection for Contextual Large Language Model Inference	Vincent Enoasmo et.al.	2501.18826	null
2025-01-31	Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies	Andrey Borro et.al.	2501.18817	link
2025-01-31	Large Language Models as Common-Sense Heuristics	Andrey Borro et.al.	2501.18816	null
2025-01-30	Compositional Generalization Requires More Than Disentangled Representations	Qiyao Liang et.al.	2501.18797	null
2025-01-30	Rope to Nope and Back Again: A New Hybrid Attention Strategy	Bowen Yang et.al.	2501.18795	null
2025-01-30	Survey and Improvement Strategies for Gene Prioritization with Large Language Models	Matthew Neeley et.al.	2501.18794	null
2025-01-30	LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore?	Alexander Tuisov et.al.	2501.18784	null
2025-01-30	Navigating the Fragrance space Via Graph Generative Models And Predicting Odors	Mrityunjay Sharma et.al.	2501.18777	link
2025-01-30	Probabilistic Joint Recovery Method for CO$_2$ Plume Monitoring	Zijun Deng et.al.	2501.18761	null
2025-01-30	Synthetic Data Generation for Augmenting Small Samples	Dan Liu et.al.	2501.18741	null
2025-01-30	Examining the Robustness of Large Language Models across Language Complexity	Jiayi Zhang et.al.	2501.18738	null
2025-01-30	Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks	Mohd. Farhan Israk Soumik et.al.	2501.18727	null
2025-01-30	Strong and Controllable 3D Motion Generation	Canxuan Gang et.al.	2501.18726	null
2025-01-30	Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning	Maya Kruse et.al.	2501.18724	null
2025-02-03	Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps	Devansh Bhardwaj et.al.	2501.18712	null
2025-01-30	Regularized second-order optimization of tensor-network Born machines	Matan Ben-Dov et.al.	2501.18691	null
2025-01-30	Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting	Yansong Qu et.al.	2501.18672	null
2025-01-30	Foundational Models for 3D Point Clouds: A Survey and Outlook	Vishal Thengane et.al.	2501.18594	null
2025-01-30	Diffusion Autoencoders are Scalable Image Tokenizers	Yinbo Chen et.al.	2501.18593	null
2025-02-03	Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models	Hao Dong et.al.	2501.18592	link
2025-01-30	Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs	Yue Wang et.al.	2501.18585	null
2025-01-30	Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH	Evgenii Evstafev et.al.	2501.18576	null
2025-01-30	BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos	Lehao Lin et.al.	2501.18565	null
2025-01-30	SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation	Haoquan Fang et.al.	2501.18564	link
2025-01-30	Semantic Web and Creative AI – A Technical Report from ISWS 2023	Raia Abu Ahmad et.al.	2501.18542	null
2025-01-30	Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges	Manveer Singh Tamber et.al.	2501.18536	link
2025-01-30	Differentially Private Steering for Large Language Model Alignment	Anmol Goel et.al.	2501.18532	link
2025-01-30	Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models	Guanqun Cao et.al.	2501.18516	null
2025-01-30	Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch	Arthur Douillard et.al.	2501.18512	null
2025-01-30	WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training	Benjamin Feuer et.al.	2501.18511	link
2025-01-30	CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction	Peter J. Bentley et.al.	2501.18504	null
2025-01-30	Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline	Shivani Kapania et.al.	2501.18493	null
2025-01-30	A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models	Changshu Liu et.al.	2501.18482	null
2025-01-30	CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization	Yanxia Deng et.al.	2501.18475	null
2025-01-30	Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations	Chengxi Zeng et.al.	2501.18474	null
2025-01-30	ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation	Minghua He et.al.	2501.18460	null
2025-01-30	CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering	Yumeng Wang et.al.	2501.18457	null
2025-01-30	GENIE: Generative Note Information Extraction model for structuring EHR data	Huaiyuan Ying et.al.	2501.18435	null
2025-01-30	Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation	Youngjoon Lee et.al.	2501.18416	null
2025-01-30	RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects	Yiteng Tu et.al.	2501.18365	link
2025-01-30	A Video-grounded Dialogue Dataset and Metric for Event-driven Activities	Wiradee Imrattanatrai et.al.	2501.18324	link
2025-01-30	Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach	Tianpeng Pan et.al.	2501.18320	null
2025-01-30	Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models	Jennifer D’Souza et.al.	2501.18287	null
2025-01-30	Jailbreaking LLMs’ Safeguard with Universal Magic Words for Text Embedding Models	Haoyu Liang et.al.	2501.18280	null
2025-01-30	Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence	Kevin Roitero et.al.	2501.18265	null
2025-01-30	How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Vilém Zouhar et.al.	2501.18251	link
2025-01-30	Statistical multi-metric evaluation and visualization of LLM system predictive performance	Samuel Ackerman et.al.	2501.18243	null
2025-01-30	Contextually Structured Token Dependency Encoding for Large Language Models	James Blades et.al.	2501.18205	null
2025-01-30	Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents	ShuiDe Wen et.al.	2501.18190	null
2025-01-30	Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation	Teddy Lazebnik et.al.	2501.18177	null
2025-01-30	Continually Evolved Multimodal Foundation Models for Cancer Prognosis	Jie Peng et.al.	2501.18170	null
2025-01-30	RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing	Jinyao Guo et.al.	2501.18160	link
2025-01-30	Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study	Yuchen Lei et.al.	2501.18158	null
2025-01-30	Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models	Wanlong Liu et.al.	2501.18154	null
2025-01-30	Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models	Qika Lin et.al.	2501.18119	null
2025-01-30	Scaling Inference-Efficient Language Models	Song Bian et.al.	2501.18107	null
2025-01-30	Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation	Yibo Wang et.al.	2501.18100	link
2025-01-30	AlphaAdam: Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates	Da Chang et.al.	2501.18094	null
2025-01-30	Normative Evaluation of Large Language Models with Everyday Moral Dilemmas	Pratik S. Sachdeva et.al.	2501.18081	null
2025-01-30	FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models	Spencer Mateega et.al.	2501.18062	null
2025-01-29	RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems	Duy A. Nguyen et.al.	2501.18056	null
2025-01-29	Current Pathology Foundation Models are unrobust to Medical Center Differences	Edwin D. de Jong et.al.	2501.18055	null
2025-01-29	A Proximal Operator for Inducing 2:4-Sparsity	Jonas M Kübler et.al.	2501.18015	null
2025-01-29	Large Language Models Think Too Fast To Explore Effectively	Lan Pan et.al.	2501.18009	null
2025-01-29	Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces	Neetha Jambigi et.al.	2501.18005	null
2025-01-29	InnerThoughts: Disentangling Representations and Predictions in Large Language Models	Didier Chételat et.al.	2501.17994	null
2025-01-29	Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study	Marwah Alaofi et.al.	2501.17981	link
2025-01-29	Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization	Zishun Yu et.al.	2501.17974	null
2025-01-29	“I Would Never Trust Anything Western”: Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools	Manas Mhasakar et.al.	2501.17942	null
2025-01-29	DReSS: Data-driven Regularized Structured Streamlining for Large Language Models	Mingkuan Feng et.al.	2501.17905	null
2025-01-29	Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning?	Pouya Pezeshkpour et.al.	2501.17840	link
2025-01-29	Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology	Sobhan Hemati et.al.	2501.17822	null
2025-01-30	Leveraging Multimodal LLM for Inspirational User Interface Search	Seokhyeon Park et.al.	2501.17799	link
2025-01-29	BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights	Chan-Jan Hsu et.al.	2501.17790	null
2025-01-29	AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing	Peter Pak et.al.	2501.17784	null
2025-01-29	2SSP: A Two-Stage Framework for Structured Pruning of LLMs	Fabrizio Sandri et.al.	2501.17771	link
2025-01-29	Generative Unordered Flow for Set-Structured Data Generation	Yangming Li et.al.	2501.17770	null
2025-01-29	Hybrid Graphs for Table-and-Text based Question Answering using LLMs	Ankush Agarwal et.al.	2501.17767	null
2025-01-29	On the Partitioning of GPU Power among Multi-Instances	Tirth Vamja et.al.	2501.17752	null
2025-01-29	Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation	Aitor Arrieta et.al.	2501.17749	null
2025-01-29	A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches	Ana R. Baião et.al.	2501.17729	null
2025-01-29	Using Code Generation to Solve Open Instances of Combinatorial Design Problems	Christopher D. Rosin et.al.	2501.17725	link
2025-01-29	RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts	Eujeong Choi et.al.	2501.17715	link
2025-01-29	Source-Channel Separation Theorems for Distortion Perception Coding	Chao Tian et.al.	2501.17706	null
2025-01-29	Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching	Xuzhe Dang et.al.	2501.17665	null
2025-01-30	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-29	Uncertainty Quantification and Decomposition for LLM-based Recommendation	Wonbin Kweon et.al.	2501.17630	link
2025-01-29	The Imitation Game According To Turing	Sharon Temtsin et.al.	2501.17629	null
2025-01-29	Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment	Jonathan Teel et.al.	2501.17617	null
2025-01-29	Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis	Kunrong Li et.al.	2501.17598	null
2025-01-30	Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models	Behraj Khan et.al.	2501.17595	null
2025-01-29	GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback	Mohamed Abdelaal et.al.	2501.17584	null
2025-01-29	CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs	Amey Hengle et.al.	2501.17581	null
2025-01-29	Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding	Marco Pasini et.al.	2501.17578	null
2025-01-29	Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models	Wooyoung Kim et.al.	2501.17549	null
2025-01-29	Towards Training-Free Open-World Classification with 3D Generative Models	Xinzhe Xia et.al.	2501.17547	null
2025-01-29	Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant	Gaole He et.al.	2501.17546	link
2025-01-29	Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison	Martin Nizon-Deladoeuille et.al.	2501.17539	null
2025-01-29	Neural Spelling: A Spell-Based BCI System for Language Neural Decoding	Xiaowei Jiang et.al.	2501.17489	null
2025-01-29	DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance	Seffi Cohen et.al.	2501.17479	link
2025-01-29	AugmenTest: Enhancing Tests with LLM-Driven Oracles	Shaker Mahmud Khandaker et.al.	2501.17461	link
2025-01-29	Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction	Kaiwei Luo et.al.	2501.17459	null
2025-01-29	Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation	Tiansheng Huang et.al.	2501.17433	link
2025-01-29	Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models	Yuxuan Li et.al.	2501.17420	null
2025-01-29	MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs	Ved Sirdeshmukh et.al.	2501.17399	link
2025-01-29	Learning Free Token Reduction for Multi-Modal LLM	Zihui Zhao et.al.	2501.17391	null
2025-01-29	Context-Aware Semantic Recomposition Mechanism for Large Language Models	Richard Katrix et.al.	2501.17386	null
2025-01-28	Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations	Md Tauhidul Islam et.al.	2501.17347	null
2025-01-28	Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction	Mingyu Derek Ma et.al.	2501.17326	null
2025-01-28	CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data	Lee Carlin et.al.	2501.17324	null
2025-01-30	Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding	Yun-Shiuan Chuang et.al.	2501.17310	null
2025-01-28	“Ownership, Not Just Happy Talk”: Co-Designing a Participatory Large Language Model for Journalism	Emily Tseng et.al.	2501.17299	null
2025-01-28	Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization	Zilu Tang et.al.	2501.17295	null
2025-01-28	Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology	Peilong Wang et.al.	2501.17286	null
2025-01-30	From Natural Language to Extensive-Form Game Representations	Shilong Deng et.al.	2501.17282	link
2025-01-28	Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models	Abdalaziz Al-Maeeni et.al.	2501.17279	null
2025-01-28	Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics	Jasper Timm et.al.	2501.17273	link
2025-01-28	Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care	Fengpei Yuan et.al.	2501.17206	null
2025-01-28	SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training	Tianzhe Chu et.al.	2501.17161	null
2025-01-28	FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data	Deren Lei et.al.	2501.17144	link
2025-01-28	ASTRAL: Automated Safety Testing of Large Language Models	Miriam Ugarte et.al.	2501.17132	null
2025-01-28	Optimizing Large Language Model Training Using FP4 Quantization	Ruizhe Wang et.al.	2501.17116	null
2025-01-28	Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction	Carl-Leander Henneking et.al.	2501.17112	null
2025-01-28	Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics	Guillaume Le Mailloux et.al.	2501.17107	link
2025-01-28	Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Evgenii Evstafev et.al.	2501.17084	null
2025-01-28	Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Akash Kumar et.al.	2501.17053	null
2025-01-28	Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models	Minghan Li et.al.	2501.17039	null
2025-01-28	Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies	Manojkumar Parmar et.al.	2501.17030	null
2025-01-28	Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs	Alessandro Midolo et.al.	2501.17024	link
2025-01-28	Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement	Kei Katsumata et.al.	2501.17022	link
2025-01-28	MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition	Philippe Pasquier et.al.	2501.17011	null
2025-01-28	Large Language Models for Code Generation: The Practitioners Perspective	Zeeshan Rasheed et.al.	2501.16998	link
2025-01-28	Artificial Intelligence Clones	Annie Liang et.al.	2501.16996	null
2025-01-28	FedEFM: Federated Endovascular Foundation Model with Unseen Data	Tuong Do et.al.	2501.16992	null
2025-01-28	Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver	Shunya Minami et.al.	2501.16986	null
2025-01-28	Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling	Hongzhi Huang et.al.	2501.16975	null
2025-01-28	Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers	Mohammad Raza et.al.	2501.16961	null
2025-01-28	Multiple Abstraction Level Retrieve Augment Generation	Zheng Zheng et.al.	2501.16952	null
2025-01-29	TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models	Makoto Shing et.al.	2501.16937	null
2025-01-28	Detecting harassment and defamation in cyberbullying with emotion-adaptive training	Peiling Yi et.al.	2501.16925	link
2025-01-28	RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains	Shady Nasrat et.al.	2501.16899	link
2025-01-28	Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model	Eoghan Cronin et.al.	2501.16893	link
2025-01-28	Irony Detection, Reasoning and Understanding in Zero-shot Learning	Peiling Yi et.al.	2501.16884	null
2025-01-28	Comparing Human and LLM Generated Code: The Jury is Still Out!	Sherlock A. Licorish et.al.	2501.16857	null
2025-01-28	Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis	Tiao Tan et.al.	2501.16842	null
2025-01-28	Misspellings in Natural Language Processing: A survey	Gianluca Sperduti et.al.	2501.16836	null
2025-01-28	DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model	Josua Spisak et.al.	2501.16800	null
2025-01-28	Algorithm for Automatic Legislative Text Consolidation	Matias Etcheverry et.al.	2501.16794	null
2025-01-28	Exponential Family Attention	Kevin Christian Wibisono et.al.	2501.16790	link
2025-01-28	Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding	Yun Li et.al.	2501.16786	null
2025-01-28	TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network	Yumingzhi Pan et.al.	2501.16784	null
2025-01-28	A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process	Jack David Carson et.al.	2501.16783	null
2025-01-29	Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models	Muhammad Atta ur Rahman et.al.	2501.16769	null
2025-01-28	DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Chenguo Lin et.al.	2501.16764	null
2025-01-28	HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns	Xinyue Shen et.al.	2501.16750	link
2025-01-28	Through the Prism of Culture: Evaluating LLMs’ Understanding of Indian Subcultures and Traditions	Garima Chhikara et.al.	2501.16748	null
2025-01-28	LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience	Nimesh Jha et.al.	2501.16744	null
2025-01-28	Distilling Large Language Models for Network Active Queue Management	Deol Satish et.al.	2501.16734	null
2025-01-28	xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking	Sunbowen Lee et.al.	2501.16727	link
2025-01-28	One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning	Chunpeng Zhou et.al.	2501.16720	null
2025-01-28	Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection	Hengzhuang Li et.al.	2501.16718	link
2025-01-28	3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow	Yueen Ma et.al.	2501.16698	null
2025-01-28	MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark	Dongyi Yi et.al.	2501.16688	null
2025-01-28	Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting	Li Yin et.al.	2501.16673	link
2025-01-28	VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records	Philip Chung et.al.	2501.16672	link
2025-01-28	Contextual Reinforcement in Multimodal Token Compression for Large Language Models	Naderdel Piero et.al.	2501.16658	null
2025-01-28	Large Language Model Critics for Execution-Free Evaluation of Code Changes	Aashish Yadavally et.al.	2501.16655	link
2025-01-28	Molecular-driven Foundation Model for Oncologic Pathology	Anurag Vaidya et.al.	2501.16652	link
2025-01-28	DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models	Zeping Min et.al.	2501.16650	null
2025-01-28	An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue	Koji Inoue et.al.	2501.16643	null
2025-01-28	CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs	Jinlan Fu et.al.	2501.16629	link
2025-01-28	Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems	Baraa Hikal et.al.	2501.16616	null
2025-01-28	Sparse Autoencoders Trained on the Same Data Learn Different Features	Gonçalo Paulo et.al.	2501.16615	null
2025-01-28	Fine-Tuned Language Models as Space Systems Controllers	Enrico M. Zucchelli et.al.	2501.16588	null
2025-01-27	AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models	Zheng Lian et.al.	2501.16566	link
2025-01-27	LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation	Farzad Farhadzadeh et.al.	2501.16559	null
2025-01-27	Distributional Information Embedding: A Framework for Multi-bit Watermarking	Haiyun He et.al.	2501.16558	null
2025-01-27	PackDiT: Joint Human Motion and Text Generation via Mutual Prompting	Zhongyu Jiang et.al.	2501.16551	null
2025-01-27	PhysAnimator: Physics-Guided Generative Cartoon Animation	Tianyi Xie et.al.	2501.16550	null
2025-01-27	Sample-Efficient Behavior Cloning Using General Domain Knowledge	Feiyu Zhu et.al.	2501.16546	null
2025-01-27	Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees	Piyush Gupta et.al.	2501.16539	null
2025-01-27	Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs	Jean-Charles Noirot Ferrand et.al.	2501.16534	null
2025-01-27	A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain	Jorge del Pozo Lérida et.al.	2501.16533	null
2025-01-27	Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction	Atharva Naik et.al.	2501.16524	null
2025-01-27	How well can LLMs Grade Essays in Arabic?	Rayed Ghazawi et.al.	2501.16516	null
2025-01-27	Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models	Sudarshan Kamath Barkur et.al.	2501.16513	null
2025-01-27	Smoothed Embeddings for Robust Language Models	Ryo Hase et.al.	2501.16497	null
2025-01-27	Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations	Pablo Valenzuela-Toledo et.al.	2501.16495	null
2025-01-27	Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM	Payal Kamboj et.al.	2501.16481	link
2025-01-27	Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation	Philip Hughes et.al.	2501.16467	null
2025-01-27	CoCoNUT: Structural Code Understanding does not fall out of a tree	Claas Beger et.al.	2501.16456	link
2025-01-27	Detecting Zero-Day Attacks in Digital Substations via In-Context Learning	Faizan Manzoor et.al.	2501.16453	null
2025-01-27	360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation	Hamed Firooz et.al.	2501.16450	null
2025-01-27	DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation	Han Sun et.al.	2501.16410	null
2025-01-27	Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology	Meiyun Cao et.al.	2501.16309	null
2025-01-27	RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval	Long Nguyen et.al.	2501.16303	null
2025-01-27	Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width	Zheng Liu et.al.	2501.16302	null
2025-01-27	Large Models in Dialogue for Active Perception and Anomaly Detection	Tzoulio Chamiti et.al.	2501.16300	link
2025-01-27	FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers	Renshan Zhang et.al.	2501.16297	null
2025-01-27	Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models	Jing Zhang et.al.	2501.16282	null
2025-01-27	Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation	Jiayi Hong et.al.	2501.16277	link
2025-01-27	URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT	Long Nguyen et.al.	2501.16276	null
2025-01-27	A foundation model for human-AI collaboration in medical literature mining	Zifeng Wang et.al.	2501.16255	null
2025-01-27	Multi-Agent Geospatial Copilots for Remote Sensing Workflows	Chaehong Lee et.al.	2501.16254	null
2025-01-27	Zero-Shot Decision Tree Construction via Large Language Models	Lucas Carrasco et.al.	2501.16247	null
2025-01-27	CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation	Xiaochuan Ma et.al.	2501.16246	null
2025-01-27	Phase Transitions in Large Language Models and the $O(N)$ Model	Youran Sun et.al.	2501.16241	null
2025-01-27	AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses	Runze Cai et.al.	2501.16240	link
2025-01-28	Distilling foundation models for robust and efficient models in digital pathology	Alexandre Filiot et.al.	2501.16239	null
2025-01-27	Language-Based Bayesian Optimization Research Assistant (BORA)	Abdoulatif Cissé et.al.	2501.16224	null
2025-01-27	Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models	Huayu Li et.al.	2501.16215	link
2025-01-27	Provence: efficient and robust context pruning for retrieval-augmented generation	Nadezhda Chirkova et.al.	2501.16214	null
2025-01-27	Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs	Antony Bartlett et.al.	2501.16191	null
2025-01-27	SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting	Wenxuan Xie et.al.	2501.16178	link
2025-01-27	BAG: Body-Aligned 3D Wearable Asset Generation	Zhongjin Luo et.al.	2501.16177	null
2025-01-27	Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma	Richard Willis et.al.	2501.16173	link
2025-01-27	MetaDecorator: Generating Immersive Virtual Tours through Multimodality	Shuang Xie et.al.	2501.16164	null
2025-01-27	CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge	Yuwei Zhang et.al.	2501.16155	null
2025-01-27	AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought	Xin Huang et.al.	2501.16154	null
2025-01-27	AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants	Pascal J. Sager et.al.	2501.16150	null
2025-01-27	PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing	Yuwei Zhang et.al.	2501.16149	null
2025-01-27	SampleLLM: Optimizing Tabular Data Synthesis in Recommendations	Jingtong Gao et.al.	2501.16125	null
2025-01-27	Using Generative Models to Produce Realistic Populations of UK Windstorms	Yee Chun Tsoi et.al.	2501.16110	null
2025-01-27	Integration of LLM Quality Assurance into an NLG System	Ching-Yi Chen et.al.	2501.16078	null
2025-01-27	PISCO: Pretty Simple Compression for Retrieval-Augmented Generation	Maxime Louis et.al.	2501.16075	null
2025-01-27	A generative material transformer using Wyckoff representation	Pierre-Paul De Breuck et.al.	2501.16051	null
2025-01-27	Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation	Xing Zhang et.al.	2501.16050	null
2025-01-27	PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment	Vincent Freiberger et.al.	2501.16033	null
2025-01-27	FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments	Zhiyuan Fu et.al.	2501.16029	null
2025-01-27	Transformability reveals the interplay of dynamics across different network orders	Ming Xie et.al.	2501.16016	null
2025-01-27	TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference	Jack Min Ong et.al.	2501.16007	null
2025-01-27	EDSep: An Effective Diffusion-Based Method for Speech Source Separation	Jinwei Dong et.al.	2501.15965	null
2025-01-27	Rethinking the Bias of Foundation Model under Long-tailed Distribution	Jiahao Chen et.al.	2501.15955	null
2025-01-27	Understanding Long Videos via LLM-Powered Entity Relation Graphs	Meng Chu et.al.	2501.15953	null
2025-01-27	TimeHF: Billion-Scale Time Series Models Guided by Human Feedback	Yongzhi Qi et.al.	2501.15942	null
2025-01-27	SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub	Benjamin C. Carter et.al.	2501.15922	null
2025-01-27	Parametric Retrieval Augmented Generation	Weihang Su et.al.	2501.15915	link
2025-01-27	Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation	Muhammad Taha Tariq et.al.	2501.15901	null
2025-01-27	Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects	Victor Deng et.al.	2501.15900	null
2025-01-27	Adaptive Width Neural Networks	Federico Errica et.al.	2501.15889	null
2025-01-27	LCTG Bench: LLM Controlled Text Generation Benchmark	Kentaro Kurihara et.al.	2501.15875	link
2025-01-27	LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models	Yuewen Mei et.al.	2501.15850	null
2025-01-27	SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model	Delin Qu et.al.	2501.15830	null
2025-01-27	Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference	Tharindu B. Hewage et.al.	2501.15829	link
2025-01-27	MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer	Qi Chen et.al.	2501.15826	null
2025-01-27	LemmaHead: RAG Assisted Proof Generation Using Large Language Models	Tianbo Yang et.al.	2501.15797	null
2025-01-27	Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?	Zhiling Chen et.al.	2501.15795	null
2025-01-27	Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs	Yu Li et.al.	2501.15791	link
2025-01-27	Memorization and Regularization in Generative Diffusion Models	Ricardo Baptista et.al.	2501.15785	link
2025-01-27	Large Language Models to Diffusion Finetuning	Edoardo Cetin et.al.	2501.15781	null
2025-01-27	Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages	Ivory Yang et.al.	2501.15773	link
2025-01-27	GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design	Yuanfu Sun et.al.	2501.15755	null
2025-01-27	IndicMMLU-Pro: Benchmarking the Indic Large Language Models	Sankalp KJ et.al.	2501.15747	null
2025-01-27	Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning	Michael Xieyang Liu et.al.	2501.15727	null
2025-01-27	A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks	Dong Li et.al.	2501.15724	null
2025-01-27	On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models	Tadahiro Taniguchi et.al.	2501.15721	null
2025-01-26	Adapting Biomedical Abstracts into Plain language using Large Language Models	Haritha Gangavarapu et.al.	2501.15700	null
2025-01-26	TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs	Yuxuan Gu et.al.	2501.15674	link
2025-01-26	Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting	Yuxin Zhang et.al.	2501.15641	link
2025-01-26	BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation	Ali Khodabandeh Yalabadi et.al.	2501.15631	link
2025-01-26	Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets	Eduard Barbu et.al.	2501.15624	null
2025-01-26	Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning	Zeyu Gan et.al.	2501.15602	link
2025-01-26	Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals	Yinzhou Wang et.al.	2501.15599	null
2025-01-26	Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images	Sichen Zhu et.al.	2501.15598	link
2025-01-26	SedarEval: Automated Evaluation using Self-Adaptive Rubrics	Zhiyuan Fan et.al.	2501.15595	link
2025-01-26	SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain	Dakuan Lu et.al.	2501.15587	link
2025-01-26	Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Yuhong Sun et.al.	2501.15581	null
2025-01-26	Instruction Tuning for Story Understanding and Generation with Weak Supervision	Yangshu Yuan et.al.	2501.15574	null
2025-01-26	Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models	Spencer Ramsey et.al.	2501.15571	null
2025-01-26	ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer	Lin Yueyu et.al.	2501.15570	link
2025-01-26	Ocean-OCR: Towards General OCR Application via a Vision-Language Model	Song Chen et.al.	2501.15558	link
2025-01-26	Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles	Hanwen Zhang et.al.	2501.15544	null
2025-01-26	Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths	Yueyang Wang et.al.	2501.15522	null
2025-01-26	Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification	Dan Song et.al.	2501.15503	null
2025-01-26	Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning	Xiaohan Yu et.al.	2501.15470	null
2025-01-26	Data-adaptive Safety Rules for Training Reward Models	Xiaomin Li et.al.	2501.15453	null
2025-01-26	OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas	Xiaoyang Wang et.al.	2501.15427	null
2025-01-26	Visual Generation Without Guidance	Huayu Chen et.al.	2501.15420	link
2025-01-26	AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement	Junan Zhang et.al.	2501.15417	null
2025-01-26	The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation	Raha Aghaei et.al.	2501.15411	null
2025-01-26	Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency	Irin Kabakum et.al.	2501.15405	null
2025-01-26	How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning	Tohida Rehman et.al.	2501.15398	null
2025-01-26	Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations	Zijun Long et.al.	2501.15379	null
2025-01-26	How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback	Manzong Huang et.al.	2501.15378	null
2025-01-26	Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models	Melkamu Abay Mersha et.al.	2501.15374	null
2025-01-26	Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis	Robinson Umeike et.al.	2501.15370	null
2025-01-26	Decentralized Low-Rank Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2501.15361	null
2025-01-26	Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection	Bo Yang et.al.	2501.15355	null
2025-01-25	Fairness in LLM-Generated Surveys	Andrés Abeliuk et.al.	2501.15351	null
2025-01-25	Between Puppet and Actor: Reframing Authorship in this Age of AI Agents	Yuqian Sun et.al.	2501.15346	null
2025-01-25	Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data	Jiajie Li et.al.	2501.15326	null
2025-01-25	ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning	Shangqian Gao et.al.	2501.15316	null
2025-01-25	The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?	Ayo Adedeji et.al.	2501.15310	null
2025-01-25	You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning	Ayan Sengupta et.al.	2501.15296	null
2025-01-24	HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation	Xin Zhou et.al.	2501.14729	link
2025-01-24	Do LLMs Provide Consistent Answers to Health-Related Questions across Languages?	Ipek Baris Schlicht et.al.	2501.14719	null
2025-01-24	Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models	Naihao Deng et.al.	2501.14717	null
2025-01-24	FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing	James Seale Smith et.al.	2501.14713	null
2025-01-24	The Karp Dataset	Mason DiCicco et.al.	2501.14705	null
2025-01-24	Rethinking Table Instruction Tuning	Naihao Deng et.al.	2501.14693	null
2025-01-24	Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST	Fuping Wu et.al.	2501.14685	null
2025-01-24	An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations	Shabnam Hassani et.al.	2501.14683	null
2025-01-24	Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning	Jisi Zhang et.al.	2501.14680	null
2025-01-24	MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications	Yixing Jiang et.al.	2501.14654	link
2025-01-24	Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion	Ziyao Xu et.al.	2501.14649	link
2025-01-24	Towards Scalable Topological Regularizers	Hiu-Tung Wong et.al.	2501.14641	null
2025-01-24	Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics	Renato Ghisellini et.al.	2501.14634	null
2025-01-24	Extracting Problem Structure with LLMs for Optimized SAT Local Search	André Schilder et.al.	2501.14630	null
2025-01-24	Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data	Jordi Abante et.al.	2501.14615	null
2025-01-24	ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations	Tianming Liang et.al.	2501.14607	null
2025-01-24	Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research	Hamid Sarmadi et.al.	2501.14546	null
2025-01-24	VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning	Benjamin Callewaert et.al.	2501.14540	null
2025-01-24	Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models	Zhenguang Zhong et.al.	2501.14530	link
2025-01-24	Scene Understanding Enabled Semantic Communication with Open Channel Coding	Zhe Xiang et.al.	2501.14520	null
2025-01-24	Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel	Zhuoran Liu et.al.	2501.14512	null
2025-01-24	Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course	Pavlin G. Poličar et.al.	2501.14499	null
2025-01-24	Evaluating and Improving Graph to Text Generation with Large Language Models	Jie He et.al.	2501.14497	link
2025-01-24	RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques	Zhengyang Tang et.al.	2501.14492	link
2025-01-24	Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design	Taehan Kim et.al.	2501.14469	null
2025-01-24	Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis	Xiujing Guo et.al.	2501.14465	null
2025-01-24	Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing	Zeping Yu et.al.	2501.14457	null
2025-01-24	Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains	Xu Chu et.al.	2501.14431	null
2025-01-24	GraphBC: Improving LLMs for Better Graph Data Processing	Xu Chu et.al.	2501.14427	null
2025-01-24	CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios	Michael Fuest et.al.	2501.14426	null
2025-01-24	DeepFlow: Serverless Large Language Model Serving at Scale	Junhao Hu et.al.	2501.14417	null
2025-01-24	SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation	Shengjie Wang et.al.	2501.14400	null
2025-01-24	ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer	Yoni Schirris et.al.	2501.14379	link
2025-01-24	DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing	Xinyu Ma et.al.	2501.14371	link
2025-01-24	Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches	Ziad Sakr et.al.	2501.14366	null
2025-01-24	FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration	Kai-Tuo Xu et.al.	2501.14350	link
2025-01-24	Chain-of-Retrieval Augmented Generation	Liang Wang et.al.	2501.14342	null
2025-01-24	Exploring the sustainable scaling of AI dilemma: A projective study of corporations’ AI environmental impacts	Clément Desroches et.al.	2501.14334	null
2025-01-24	Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models	Ridhi Jain et.al.	2501.14326	null
2025-01-24	PAID: A Framework of Product-Centric Advertising Image Design	Hongyu Chen et.al.	2501.14316	null
2025-01-24	Locality-aware Fair Scheduling in LLM Serving	Shiyi Cao et.al.	2501.14312	null
2025-01-24	A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education	Calvin Yeung et.al.	2501.14305	link
2025-01-24	MASTER: A Multi-Agent System with LLM Specialized MCTS	Bingzheng Gan et.al.	2501.14304	null
2025-01-24	Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph	Xujian Liang et.al.	2501.14300	link
2025-01-24	Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment	Julian A. Schnabel et.al.	2501.14296	null
2025-01-24	Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes	Sullam Jeoung et.al.	2501.14294	link
2025-01-24	Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches	Feng Zhou et.al.	2501.14291	null
2025-01-24	Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation	Sadegh Mahdavi et.al.	2501.14275	link
2025-01-24	Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors	Yi Zhao et.al.	2501.14250	link
2025-01-24	Humanity’s Last Exam	Long Phan et.al.	2501.14249	null
2025-01-24	Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game	Rong Ye et.al.	2501.14225	null
2025-01-24	Top Ten Challenges Towards Agentic Neural Graph Databases	Jiaxin Bai et.al.	2501.14224	null
2025-01-24	TFG-Flow: Training-free Guidance in Multimodal Generative Flow	Haowei Lin et.al.	2501.14216	link
2025-01-24	Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading	Minrui Xu et.al.	2501.14205	null
2025-01-24	VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking	Runyi Hu et.al.	2501.14195	link
2025-01-24	Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models	Saaduddin Mahmud et.al.	2501.14189	null
2025-01-24	GeoSim.AI: AI assistants for numerical simulations in geomechanics	Yared W. Bekele et.al.	2501.14186	null
2025-01-24	AI Chatbots as Professional Service Agents: Developing a Professional Identity	Wenwen Li et.al.	2501.14179	null
2025-01-24	Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models	Yile Gu et.al.	2501.14170	null
2025-01-24	Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction	Dongming Sheng et.al.	2501.14144	null
2025-01-23	Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation	Derek Yotheringhay et.al.	2501.14119	null
2025-01-23	Domain-Factored Untrained Deep Prior for Spectrum Cartography	Subash Timilsina et.al.	2501.14116	null
2025-01-23	MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning	Joshua Davis et.al.	2501.14105	link
2025-01-23	StreamingRAG: Real-time Contextual Retrieval and Generation Framework	Murugan Sankaradas et.al.	2501.14101	null
2025-01-23	Enhancing Biomedical Relation Extraction with Directionality	Po-Ting Lai et.al.	2501.14079	link
2025-01-23	LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language	Yubin Ge et.al.	2501.14073	null
2025-01-23	Efficient 2D CT Foundation Model for Contrast Phase Classification	Benjamin Hou et.al.	2501.14066	null
2025-01-23	Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models	Jakob Krogh Petersen et.al.	2501.14051	link
2025-01-23	LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps	Andrey Palaev et.al.	2501.14046	link
2025-01-23	Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions	Jianfeng Zhu et.al.	2501.14037	null
2025-01-23	CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation	Guofeng Cui et.al.	2501.13927	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	Binary Diffusion Probabilistic Model	Vitaliy Kinakh et.al.	2501.13915	null
2025-01-23	Analysis of Indic Language Capabilities in LLMs	Aatman Vaidya et.al.	2501.13912	null
2025-01-23	Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models	Linh Tran et.al.	2501.13904	null
2025-01-23	Exploring Finetuned Audio-LLM on Heart Murmur Features	Adrian Florea et.al.	2501.13884	null
2025-01-23	The machine learning platform for developers of large systems	Alexey Naikov et.al.	2501.13881	null
2025-01-23	A RAG-Based Institutional Assistant	Gustavo Kuratomi et.al.	2501.13880	null
2025-01-23	On the Reasoning Capacity of AI Models and How to Quantify It	Santosh Kumar Radha et.al.	2501.13833	null
2025-01-23	Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing	Hao Zhang et.al.	2501.13831	null
2025-01-23	Hallucinations Can Improve Large Language Models in Drug Discovery	Shuzhou Yuan et.al.	2501.13824	null
2025-01-23	Large Language Model driven Policy Exploration for Recommender Systems	Jie Wang et.al.	2501.13816	null
2025-01-23	Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change	Mowafak Allaham et.al.	2501.13802	null
2025-01-23	Parameter-Efficient Fine-Tuning for Foundation Models	Dan Zhang et.al.	2501.13787	link
2025-01-23	Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling	Tanya Rodchenko et.al.	2501.13779	null
2025-01-23	Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework	Yoonsang Kim et.al.	2501.13778	link
2025-01-23	Do Large Language Models Truly Understand Geometric Structures?	Xiaofeng Wang et.al.	2501.13773	link
2025-01-23	Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak	Erjia Xiao et.al.	2501.13772	null
2025-01-23	UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models	Xin Xu et.al.	2501.13766	null
2025-01-23	EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents	Yuhui Yun et.al.	2501.13746	null
2025-01-23	GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification	Te Pei et.al.	2501.13743	null
2025-01-23	An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities	Zezhou Yang et.al.	2501.13742	link
2025-01-23	Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks	Chang Gong et.al.	2501.13731	null
2025-01-23	RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation	Shi-Qi Yan et.al.	2501.13726	null
2025-01-23	Musical ethnocentrism in Large Language Models	Anna Kruspe et.al.	2501.13720	null
2025-01-23	A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation	Dario Serez et.al.	2501.13718	null
2025-01-23	EventVL: Understand Event Streams via Multimodal Large Language Model	Pengteng Li et.al.	2501.13707	null
2025-01-23	DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale	Linghao Zhang et.al.	2501.13699	null
2025-01-23	Question Answering on Patient Medical Records with Private Fine-Tuned LLMs	Sara Kothari et.al.	2501.13687	null
2025-01-23	HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor	Zihui Wu et.al.	2501.13677	link
2025-01-23	How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization	Shezheng Song et.al.	2501.13669	null
2025-01-23	LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models	Yizheng Sun et.al.	2501.13652	null
2025-01-23	Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models	Zhenghao Lin et.al.	2501.13629	null
2025-01-23	Text-to-SQL based on Large Language Models and Database Keyword Search	Eduardo R. Nascimento et.al.	2501.13594	null
2025-01-23	Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization	Lei Huang et.al.	2501.13573	null
2025-01-23	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	link
2025-01-23	LLMs Can Plan Only If We Tell Them	Bilgehan Sel et.al.	2501.13545	null
2025-01-23	ReasVQA: Advancing VideoQA with Imperfect Reasoning Process	Jianxin Liang et.al.	2501.13536	null
2025-01-23	RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles	Munachiso Nwadike et.al.	2501.13491	link
2025-01-23	Adaptive Testing for LLM-Based Applications: A Diversity-based Approach	Juyeon Yoon et.al.	2501.13480	null
2025-01-23	LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation	JiaXin Chen et.al.	2501.13475	null
2025-01-23	Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge	Haomiao Xiong et.al.	2501.13468	link
2025-01-23	Spurious Forgetting in Continual Learning of Language Models	Junhao Zheng et.al.	2501.13453	link
2025-01-23	Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models	Bo Gao et.al.	2501.13428	null
2025-01-23	Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling	Tomek Jaroslawski et.al.	2501.13415	null
2025-01-23	VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework	He Kong et.al.	2501.13411	link
2025-01-23	Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures	Minglong Dong et.al.	2501.13396	null
2025-01-23	Can Large Language Models Understand Preferences in Personalized Recommendation?	Zhaoxuan Tan et.al.	2501.13391	link
2025-01-23	Do as We Do, Not as You Think: the Conformity of Large Language Models	Zhiyuan Weng et.al.	2501.13381	link
2025-01-23	Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility	Gabrielle Hoyer et.al.	2501.13376	link
2025-01-23	Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement	Jae-Sung Bae et.al.	2501.13372	null
2025-01-23	Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification	Yuzhuo Li et.al.	2501.13368	null
2025-01-23	50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications	Zewei Shi et.al.	2501.13351	link
2025-01-23	MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize	Haohang Xu et.al.	2501.13349	null
2025-01-23	Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation	Rong Shan et.al.	2501.13344	link
2025-01-23	Multi-aspect Knowledge Distillation with Large Language Model	Taegyeong Lee et.al.	2501.13341	link
2025-01-23	Generative Multi-Form Bayesian Optimization	Zhendong Guo et.al.	2501.13337	null
2025-01-23	SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network	Songge Zhang et.al.	2501.13318	null
2025-01-23	Representing Visualization Insights as a Dense Insight Network	Jane Hoffswell et.al.	2501.13309	null
2025-01-23	OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia	Xuelong Geng et.al.	2501.13306	link
2025-01-23	Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers	Akshit Achara et.al.	2501.13302	link
2025-01-23	Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents	Shrinidhi Kumbhar et.al.	2501.13299	null
2025-01-23	RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering	Yang Bai et.al.	2501.13297	link
2025-01-23	Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols	John Joon Young Chung et.al.	2501.13284	null
2025-01-22	MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis	Daeun Jung et.al.	2501.13277	link
2025-01-22	RAG-Reward: Optimizing RAG with Reward Modeling and RLHF	Hanning Zhang et.al.	2501.13264	null
2025-01-22	Exploring GPT’s Ability as a Judge in Music Understanding	Kun Fang et.al.	2501.13261	link
2025-01-22	Bypassing Array Canaries via Autonomous Function Call Resolution	Nathaniel Oh et.al.	2501.13256	link
2025-01-22	S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning	Yichen Wu et.al.	2501.13198	link
2025-01-22	Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century	Axel Loewe et.al.	2501.13142	null
2025-01-23	VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding	Boqiang Zhang et.al.	2501.13106	link
2025-01-22	Robust Representation Consistency Model via Contrastive Denoising	Jiachen Lei et.al.	2501.13094	link
2025-01-22	Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment	Melissa Kazemi Rad et.al.	2501.13080	null
2025-01-22	Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning	Bohao Yang et.al.	2501.13042	link
2025-01-22	Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament	Yantao Liu et.al.	2501.13007	link
2025-01-22	Neural network enhanced cross entropy benchmark for monitored circuits	Yangrui Hu et.al.	2501.13005	null
2025-01-22	Large Language Model-Based Semantic Communication System for Image Transmission	Soheyb Ribouh et.al.	2501.12988	null
2025-01-22	LLM4WM: Adapting LLM for Wireless Multi-Tasking	Xuanyu Liu et.al.	2501.12983	null
2025-01-22	Low-dimensional adaptation of diffusion models: Convergence in total variation	Jiadong Liang et.al.	2501.12982	null
2025-01-22	OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models	Chongren Sun et.al.	2501.12975	link
2025-01-22	Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs	Jan Corazza et.al.	2501.12972	link
2025-01-22	It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act	Kristof Meding et.al.	2501.12962	null
2025-01-22	Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference	Weizhi Fei et.al.	2501.12959	null
2025-01-22	GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models	Pengxiang Zhao et.al.	2501.12956	null
2025-01-22	3D Object Manipulation in a Single Image using Generative Models	Ruisi Zhao et.al.	2501.12935	null
2025-01-22	Correctness Assessment of Code Generated by Large Language Models Using Internal Representations	Tuan-Dung Bui et.al.	2501.12934	link
2025-01-22	DynamicEarth: How Far are We from Open-Vocabulary Change Detection?	Kaiyu Li et.al.	2501.12931	null
2025-01-22	A Functional Software Reference Architecture for LLM-Integrated Systems	Alessio Bucaioni et.al.	2501.12904	null
2025-01-22	Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration	Offa Kingsleigh et.al.	2501.12901	null
2025-01-22	Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback	Yafu Li et.al.	2501.12895	link
2025-01-23	Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program	Carlton Shepherd et.al.	2501.12883	null
2025-01-22	WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge	Jingyuan Chen et.al.	2501.12877	null
2025-01-22	ACEBench: Who Wins the Match Point in Tool Learning?	Chen Chen et.al.	2501.12851	null
2025-01-22	AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation	Aghiles Kebaili et.al.	2501.12840	null
2025-01-22	Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home	Viktor Moskvoretskii et.al.	2501.12835	null
2025-01-22	Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek	John Pavlopoulos et.al.	2501.12826	link
2025-01-22	Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks	Alessio Quercia et.al.	2501.12824	link
2025-01-22	Certified Guidance for Planning with Deep Generative Models	Francesco Giacomarra et.al.	2501.12815	null
2025-01-22	Revisit Self-Debugging with Self-Generated Tests for Code Generation	Xiancai Chen et.al.	2501.12793	null
2025-01-22	LLMs as Repositories of Factual Knowledge: Limitations and Solutions	Seyed Mahed Mousavi et.al.	2501.12774	null
2025-01-22	NExtLong: Toward Effective Long-Context Training without Long Documents	Chaochen Gao et.al.	2501.12766	link
2025-01-22	Online Preference Alignment for Language Models via Count-based Exploration	Chenjia Bai et.al.	2501.12735	link
2025-01-22	Paradigm-Based Automatic HDL Code Generation Using LLMs	Wenhao Sun et.al.	2501.12702	null
2025-01-22	Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression	Kai Yoshida et.al.	2501.12698	null
2025-01-22	Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering	Qian Tao et.al.	2501.12697	null
2025-01-22	SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling	Shengshi Yao et.al.	2501.12696	null
2025-01-22	EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation	Yifan Yu et.al.	2501.12689	null
2025-01-22	Distillation Quantification for Large Language Models	Sunbowen Lee et.al.	2501.12619	link
2025-01-22	Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?	Taiming Wang et.al.	2501.12617	null
2025-01-22	Kimi k1.5: Scaling Reinforcement Learning with LLMs	Kimi Team et.al.	2501.12599	null
2025-01-22	Leveraging LLMs to Create a Haptic Devices’ Recommendation System	Yang Liu et.al.	2501.12573	null
2025-01-22	Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review	Rock Yuren Pang et.al.	2501.12557	link
2025-01-21	Human-like conceptual representations emerge from language prediction	Ningyu Xu et.al.	2501.12547	null
2025-01-21	How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models?	Mirali Purohit et.al.	2501.12535	null
2025-01-21	An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts	Dhia Elhaq Rzig et.al.	2501.12521	null
2025-01-21	A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data	Minh Tran et.al.	2501.12501	null
2025-01-21	The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws	Tian Jin et.al.	2501.12486	null
2025-01-21	An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models	Xiaoyu Chu et.al.	2501.12469	link
2025-01-21	Adaptive PII Mitigation Framework for Large Language Models	Shubhi Asthana et.al.	2501.12465	null
2025-01-21	Empowering AIOps: Leveraging Large Language Models for IT Operations Management	Arthur Vitui et.al.	2501.12461	link
2025-01-21	Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications	Shubhi Asthana et.al.	2501.12456	null
2025-01-21	Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation	Dongsheng Zhu et.al.	2501.12432	null
2025-01-21	FREYR: A Framework for Recognizing and Executing Your Requests	Roberto Gallotta et.al.	2501.12423	link
2025-01-21	CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning	Eunjee Choi et.al.	2501.12422	null
2025-01-22	InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling	Yi Wang et.al.	2501.12386	link
2025-01-21	Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks	Greg Olmschenk et.al.	2501.12383	null
2025-01-21	MMVU: Measuring Expert-Level Multi-Discipline Video Understanding	Yilun Zhao et.al.	2501.12380	link
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists	Thomas F. Eisenmann et.al.	2501.12374	link
2025-01-21	Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL	Yeounoh Chung et.al.	2501.12372	link
2025-01-21	Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration	Thomas Walshe et.al.	2501.12332	null
2025-01-21	Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops	Mohamed Harmanani et.al.	2501.12331	link
2025-01-21	VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Xianwei Zhuang et.al.	2501.12327	link
2025-01-21	LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations	Hasan Abu-Rasheed et.al.	2501.12300	null
2025-01-21	MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks	Qishen Zhou et.al.	2501.12281	link
2025-01-21	Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement	Maosong Cao et.al.	2501.12273	link
2025-01-21	FOCUS: First Order Concentrated Updating Scheme	Yizhou Liu et.al.	2501.12243	null
2025-01-21	InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models	Pha Nguyen et.al.	2501.12231	null
2025-01-21	CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning	Yuanheng Fang et.al.	2501.12226	null
2025-01-21	Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces	Allard Oelen et.al.	2501.12221	null
2025-01-21	You Can’t Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense	Wuyuao Mai et.al.	2501.12210	null
2025-01-21	Explainability for Vision Foundation Models: A Survey	Rémi Kazmierczak et.al.	2501.12203	null
2025-01-22	Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Zibo Zhao et.al.	2501.12202	link
2025-01-21	BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks	Zhuang Li et.al.	2501.12174	null
2025-01-21	Contextualizing Recommendation Explanations with LLMs: A User Study	Yuanjun Feng et.al.	2501.12152	null
2025-01-21	Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities	Qirun Dai et.al.	2501.12147	null
2025-01-21	Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot	Daniele Bifolco et.al.	2501.12134	null
2025-01-21	Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions	Tim Schreiter et.al.	2501.12128	null
2025-01-21	Can open source large language models be used for tumor documentation in Germany? – An evaluation on urological doctors’ notes	Stefan Lenz et.al.	2501.12106	link
2025-01-21	Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis	Weile Luo et.al.	2501.12084	null
2025-01-21	Phishing Awareness via Game-Based Learning	Argianto Rahartomo et.al.	2501.12077	link
2025-01-21	PINNsAgent: Automated PDE Surrogation with Large Language Models	Qingpo Wuwu et.al.	2501.12053	null
2025-01-21	Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation	Chen Griner et.al.	2501.12033	null
2025-01-21	Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing’s Syndrome Diagnosis in Facial Analysis	Hongjun Liu et.al.	2501.12023	null
2025-01-21	Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection?	Samantha Min Er Yew et.al.	2501.12016	null
2025-01-21	Rate-Aware Learned Speech Compression	Jun Xu et.al.	2501.11999	null
2025-01-21	Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models	Rupesh Raj Karn et.al.	2501.11979	null
2025-01-21	Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues	Maya Medjad et.al.	2501.11977	link
2025-01-21	Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization	Jie Zhao et.al.	2501.11968	null
2025-01-21	A Hybrid Attention Framework for Fake News Detection with Large Language Models	Xiaochuan Xu et.al.	2501.11967	null
2025-01-21	TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection	Yang Cao et.al.	2501.11960	null
2025-01-21	Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model	Minghan Wang et.al.	2501.11953	null
2025-01-21	ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation	Peter Devine et.al.	2501.11929	link
2025-01-21	Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model	He Chang et.al.	2501.11911	link
2025-01-21	Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation	Junhong Lian et.al.	2501.11900	link
2025-01-22	Med-R$^2$: Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine	Keer Lu et.al.	2501.11885	link
2025-01-21	From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning	Yafu Li et.al.	2501.11877	link
2025-01-21	LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems	Venkata Sai Aswath Duvvuru et.al.	2501.11864	null
2025-01-21	EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents	Zhili Cheng et.al.	2501.11858	link
2025-01-21	Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance	Nikos Kanakaris et.al.	2501.11849	link
2025-01-21	A Survey on Memory-Efficient Large-Scale Model Training in AI for Science	Kaiyuan Tian et.al.	2501.11847	null
2025-01-21	Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction	Noah L. Schroeder et.al.	2501.11840	null
2025-01-21	PXGen: A Post-hoc Explainable Method for Generative Models	Yen-Lung Huang et.al.	2501.11827	null
2025-01-21	CogMorph: Cognitive Morphing Attacks for Text-to-Image Models	Zonglei Jing et.al.	2501.11815	null
2025-01-20	Benchmarking Large Language Models via Random Variables	Zijin Hong et.al.	2501.11790	null
2025-01-20	Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection	Ali Naseh et.al.	2501.11786	null
2025-01-20	Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference	Pouya Hamadanian et.al.	2501.11779	link
2025-01-20	The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers	Alina Starovolsky-Shitrit et.al.	2501.11770	null
2025-01-20	Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems	Fatemeh Nazary et.al.	2501.11759	link
2025-01-20	A generalizable 3D framework and model for self-supervised learning in medical imaging	Tony Xu et.al.	2501.11755	link
2025-01-20	Are generative models fair? A study of racial bias in dermatological image generation	Miguel López-Pérez et.al.	2501.11752	null
2025-01-20	Optimizing Pretraining Data Mixtures with LLM-Estimated Utility	William Held et.al.	2501.11747	null
2025-01-20	MedicoSAM: Towards foundation models for medical image segmentation	Anwai Archit et.al.	2501.11734	link
2025-01-20	Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks	Zhenhailong Wang et.al.	2501.11733	null
2025-01-20	Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy	Saeid Asgari Taghanaki et.al.	2501.11721	link
2025-01-20	YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners’ Perspectives	Nong Ming et.al.	2501.11712	link
2025-01-20	Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution	Ramtin Ehsani et.al.	2501.11709	link
2025-01-20	Trustformer: A Trusted Federated Transformer	Ali Abbasi Tadi et.al.	2501.11706	null
2025-01-20	Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s)	Brian E. Perron et.al.	2501.11705	null
2025-01-20	Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling	Zhenyu Hou et.al.	2501.11651	link
2025-01-20	Trojan Detection Through Pattern Recognition for Large Language Models	Vedant Bhasin et.al.	2501.11621	null
2025-01-20	Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems	Giorgio Robino et.al.	2501.11613	null
2025-01-20	SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks	Wentao Wan et.al.	2501.11599	link
2025-01-20	Recurrent Diffusion for Large-Scale Parameter Generation	Kai Wang et.al.	2501.11587	link
2025-01-20	Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models	Mahyar Habibi et.al.	2501.11581	null
2025-01-20	Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution	Zhiyuan You et.al.	2501.11561	null
2025-01-20	PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation	Jinyu Wang et.al.	2501.11551	link
2025-01-20	UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion	Zixuan Chen et.al.	2501.11515	null
2025-01-20	Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges	Vincent Koc et.al.	2501.11496	null
2025-01-20	Graph-defined Language Learning with LLMs	Huachi Zhou et.al.	2501.11478	null
2025-01-20	Curiosity-Driven Reinforcement Learning from Human Feedback	Haoran Sun et.al.	2501.11463	link
2025-01-20	Ontology Matching with Large Language Models and Prioritized Depth-First Search	Maria Taboada et.al.	2501.11441	null
2025-01-20	One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor	Zhikun Wu et.al.	2501.11433	null
2025-01-20	A Survey on Diffusion Models for Anomaly Detection	Jing Liu et.al.	2501.11430	link
2025-01-20	Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training	Siyu Yuan et.al.	2501.11425	link
2025-01-20	Neural Contextual Reinforcement Framework for Logical Structure Language Generation	Marcus Irvin et.al.	2501.11417	null
2025-01-20	Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing	Kevin Sim et.al.	2501.11411	null
2025-01-20	Revisiting Language Models in Neural News Recommender Systems	Yuyue Zhao et.al.	2501.11391	link
2025-01-20	Towards Advancing Code Generation with Large Language Models: A Research Roadmap	Haolin Jin et.al.	2501.11354	null
2025-01-20	EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery	Guankun Wang et.al.	2501.11347	link
2025-01-20	GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video	Zhenliang Ni et.al.	2501.11340	null
2025-01-20	Few-shot Policy (de)composition in Conversational Question Answering	Kyle Erwin et.al.	2501.11335	null
2025-01-20	Nested Annealed Training Scheme for Generative Adversarial Networks	Chang Wan et.al.	2501.11318	null
2025-01-20	Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning	Zhongtian Hu et.al.	2501.11292	null
2025-01-20	Large Language Model Agents for Radio Map Generation and Wireless Network Planning	Hongye Quan et.al.	2501.11283	null
2025-01-20	Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries	Yi-Hui Lee et.al.	2501.11273	null
2025-01-20	Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios	Zhongtian Hu et.al.	2501.11269	null
2025-01-20	Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian	Wannita Takerngsaksiri et.al.	2501.11264	link
2025-01-20	Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models	Zhuangzhuang Yan et.al.	2501.11247	null
2025-01-20	Irony in Emojis: A Comparative Study of Human and LLM Interpretation	Yawen Zheng et.al.	2501.11241	null
2025-01-20	KPL: Training-Free Medical Knowledge Mining of Vision-Language Models	Jiaxiang Liu et.al.	2501.11231	link
2025-01-20	Reasoning Language Models: A Blueprint	Maciej Besta et.al.	2501.11223	link
2025-01-20	Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation	Ivan Lopez et.al.	2501.11199	null
2025-01-19	Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests	Kristin Blesch et.al.	2501.11178	link
2025-01-17	FaceXBench: Evaluating Multimodal LLMs on Face Understanding	Kartik Narayan et.al.	2501.10360	link
2025-01-17	Zero-Shot Monocular Scene Flow Estimation in the Wild	Yiqing Liang et.al.	2501.10357	null
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	link
2025-01-17	Large language models for automated scholarly paper review: A survey	Zhenzhen Zhuang et.al.	2501.10326	null
2025-01-17	HiMix: Reducing Computational Complexity in Large Vision-Language Models	Xuange Zhang et.al.	2501.10318	null
2025-01-17	Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs	Claudio Di Sipio et.al.	2501.10313	null
2025-01-17	Computational Protein Science in the Era of Large Language Models (LLMs)	Wenqi Fan et.al.	2501.10282	null
2025-01-17	Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation	Azat Abdullin et.al.	2501.10200	null
2025-01-17	Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education	William Hersh et.al.	2501.10186	null
2025-01-17	Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval	Vera Pavlova et.al.	2501.10175	null
2025-01-17	Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis	Abhishek Kaushik et.al.	2501.10134	null
2025-01-17	ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario	Lucen Zhong et.al.	2501.10132	link
2025-01-17	PaSa: An LLM Agent for Comprehensive Academic Paper Search	Yichen He et.al.	2501.10120	link
2025-01-17	AI-Generated Music Detection and its Challenges	Darius Afchar et.al.	2501.10111	link
2025-01-17	LLM Reasoner and Automated Planner: A new NPC approach	Israel Puerta-Merino et.al.	2501.10106	null
2025-01-17	Universal Actions for Enhanced Embodied Foundation Models	Jinliang Zheng et.al.	2501.10105	link
2025-01-17	Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Michael Schwingshackl et.al.	2501.10080	link
2025-01-17	FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization	Zhaopeng Gu et.al.	2501.10067	link
2025-01-17	Accelerating Large Language Models through Partially Linear Feed-Forward Network	Gansen Hu et.al.	2501.10054	null
2025-01-17	AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search	Wenfeng Feng et.al.	2501.10053	null
2025-01-17	Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists	Alyssia Chen et.al.	2501.10037	null
2025-01-17	Mapping scientific communities at scale	Victor Barbier et.al.	2501.10035	link
2025-01-17	Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions	Zhijie Tan et.al.	2501.10011	null
2025-01-17	Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models	Qiang Liu et.al.	2501.09997	null
2025-01-17	Agent-as-Judge for Factual Summarization of Long Narratives	Yeonseok Jeong et.al.	2501.09993	link
2025-01-17	RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation	Yuefan Cao et.al.	2501.09982	null
2025-01-17	GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions	Heda Zuo et.al.	2501.09972	null
2025-01-17	Explainable artificial intelligence (XAI): from inherent explainability to large language models	Fuseini Mumuni et.al.	2501.09967	null
2025-01-17	A Survey on Multi-Turn Interaction Capabilities of Large Language Models	Chen Zhang et.al.	2501.09959	null
2025-01-17	FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs	Zengyi Gao et.al.	2501.09957	null
2025-01-17	AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations	Jamin Seo et.al.	2501.09954	link
2025-01-17	Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt	Qingcheng Zeng et.al.	2501.09950	null
2025-01-17	MultiPruner: Balanced Structure Removal in Foundation Models	J. Pablo Muñoz et.al.	2501.09949	link
2025-01-17	Steering Large Language Models with Feature Guided Activation Additions	Samuel Soo et.al.	2501.09929	null
2025-01-17	Towards A Litmus Test for Common Sense	Hugo Latapie et.al.	2501.09913	null
2025-01-17	Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project’s Talent Knowledge Graph	Jiawei Xu et.al.	2501.09909	null
2025-01-17	Position: Open and Closed Large Language Models in Healthcare	Jiawei Xu et.al.	2501.09906	null
2025-01-17	FoundationStereo: Zero-Shot Stereo Matching	Bowen Wen et.al.	2501.09898	link
2025-01-17	Evolving Deeper LLM Thinking	Kuang-Huei Lee et.al.	2501.09891	null
2025-01-17	Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment	Mohammad Sadegh Sheikhaei et.al.	2501.09888	link
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887	null
2025-01-16	ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction	Izzeddin Teeti et.al.	2501.09878	null
2025-01-16	Geometry-Preserving Encoder/Decoder in Latent Generative Models	Wonjun Lee et.al.	2501.09876	null
2025-01-16	An LLM-Guided Tutoring System for Social Skills Training	Michael Guevarra et.al.	2501.09870	null
2025-01-16	Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing	Wenhan Wang et.al.	2501.09866	null
2025-01-16	Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message	Jiawei Li et.al.	2501.09861	null
2025-01-16	PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery	Shristi Das Biswas et.al.	2501.09826	link
2025-01-16	Bridging Language Barriers in Healthcare: A Study on Arabic LLMs	Nada Saadi et.al.	2501.09825	null
2025-01-16	BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling	Daniele Castellana et.al.	2501.09821	link
2025-01-16	Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems	Soham Roy et.al.	2501.09801	null
2025-01-16	Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API	Andrey Labunets et.al.	2501.09798	null
2025-01-16	GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation	Weiliang Tang et.al.	2501.09783	null
2025-01-16	SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation	Wanqi Yin et.al.	2501.09782	link
2025-01-16	VideoWorld: Exploring Knowledge Learning from Unlabeled Videos	Zhongwei Ren et.al.	2501.09781	null
2025-01-16	Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong	Tairan Fu et.al.	2501.09775	null
2025-01-16	Distilling Multi-modal Large Language Models for Autonomous Driving	Deepti Hegde et.al.	2501.09757	null
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	null
2025-01-16	Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues	Youngjoon Jang et.al.	2501.09754	null
2025-01-16	OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking	Zekun Xi et.al.	2501.09751	link
2025-01-16	Enhancing Lexicon-Based Text Embeddings with Large Language Models	Yibin Lei et.al.	2501.09749	null
2025-01-16	Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models	Bihui Jin et.al.	2501.09745	null
2025-01-16	KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports	Hajung Kim et.al.	2501.09744	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	A Simple Aerial Detection Baseline of Multimodal Language Models	Qingyun Li et.al.	2501.09720	link
2025-01-16	Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text	Jihed Ncib et.al.	2501.09719	null
2025-01-16	CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education	Tianyu Wang et.al.	2501.09709	link
2025-01-16	Domain Adaptation of Foundation LLMs for e-Commerce	Christian Herold et.al.	2501.09706	null
2025-01-16	Cueless EEG imagined speech for subject identification: dataset and benchmarks	Ali Derakhshesh et.al.	2501.09700	link
2025-01-16	Simulated Interactive Debugging	Yannic Noller et.al.	2501.09694	null
2025-01-17	Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities	Fengli Xu et.al.	2501.09686	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark	Alexis Roger et.al.	2501.09672	null
2025-01-16	A Survey of Research in Large Language Models for Electronic Design Automation	Jingyu Pan et.al.	2501.09655	null
2025-01-16	The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models	Jonathan Katzy et.al.	2501.09653	null
2025-01-16	CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding	Johannes Kirmayr et.al.	2501.09645	link
2025-01-17	LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading	Kuan-Ming Liu et.al.	2501.09636	null
2025-01-16	Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework	Yushen Lin et.al.	2501.09631	null
2025-01-16	Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment	Chaoqi Wang et.al.	2501.09620	link
2025-01-16	From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs	Hrithik Majumdar Shibu et.al.	2501.09604	link
2025-01-16	Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures	Pratyush Dhingra et.al.	2501.09588	null
2025-01-16	Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis	Tingxuan Chen et.al.	2501.09555	link
2025-01-16	AI in Support of Diversity and Inclusion	Çiçek Güven et.al.	2501.09534	null
2025-01-16	Confidence Estimation for Error Detection in Text-to-SQL Systems	Oleg Somov et.al.	2501.09527	link
2025-01-16	Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data	Omar Mena et.al.	2501.09521	null
2025-01-16	AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation	Junjie He et.al.	2501.09503	link
2025-01-16	Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis	Qize Yang et.al.	2501.09502	null
2025-01-16	Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework	Nuo Chen et.al.	2501.09493	null
2025-01-16	Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators	Zhaocheng Liu et.al.	2501.09484	link
2025-01-16	Guided Debugging of Auto-Translated Code Using Differential Testing	Shengnan Wu et.al.	2501.09475	null
2025-01-16	DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Hualie Jiang et.al.	2501.09466	link
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	“A Great Start, But…”: Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design	Tianhao He et.al.	2501.09457	null
2025-01-16	Solving the unsolvable: Translating case law in Hong Kong	King-kui Sin et.al.	2501.09444	null
2025-01-16	Scaling up self-supervised learning for improved surgical foundation models	Tim J. M. Jaspers et.al.	2501.09436	link
2025-01-16	CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation	Hwan Heo et.al.	2501.09433	link
2025-01-16	A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy	Huandong Wang et.al.	2501.09431	null
2025-01-16	AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring	Xinyi Wang et.al.	2501.09428	null
2025-01-16	AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling	Ancheng Xu et.al.	2501.09426	null
2025-01-16	FASP: Fast and Accurate Structured Pruning of Large Language Models	Hanyu Hu et.al.	2501.09412	null
2025-01-16	MoE$^2$: Optimizing Collaborative Inference for Edge Large Language Models	Lyudong Jin et.al.	2501.09410	null
2025-01-16	Adaptive Contextual Caching for Mobile Edge Large Language Model Service	Guangyuan Liu et.al.	2501.09383	null
2025-01-16	Aligning Instruction Tuning with Pre-training	Yiming Liang et.al.	2501.09368	null
2025-01-16	PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks	Huiyou Zhan et.al.	2501.09367	null
2025-01-16	YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks	Saptarashmi Bandyopadhyay et.al.	2501.09355	null
2025-01-16	UVRM: A Scalable 3D Reconstruction Model from Unposed Videos	Shiu-hong Kao et.al.	2501.09347	null
2025-01-16	Rational Tuning of LLM Cascades via Probabilistic Modeling	Michael J. Zellinger et.al.	2501.09345	null
2025-01-16	SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs	Anbang Ye et.al.	2501.09316	null
2025-01-16	A Study of In-Context-Learning-Based Text-to-SQL Errors	Jiawei Shen et.al.	2501.09310	link
2025-01-16	To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation	Kaustubh D. Dhole et.al.	2501.09292	null
2025-01-16	LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport	Kyeongha Rho et.al.	2501.09291	link
2025-01-16	Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding	Kohei Torimi et.al.	2501.09278	null
2025-01-16	Large Language Model is Secretly a Protein Sequence Optimizer	Yinkai Wang et.al.	2501.09274	null
2025-01-16	Perspective Transition of Large Language Models for Solving Subjective Tasks	Xiaolong Wang et.al.	2501.09265	null
2025-01-16	Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition	Takaaki Hori et.al.	2501.09258	null
2025-01-16	Clone-Robust AI Alignment	Ariel D. Procaccia et.al.	2501.09254	null
2025-01-16	Split Fine-Tuning for Large Language Models in Wireless Networks	Songge Zhang et.al.	2501.09237	null
2025-01-16	Foundations of Large Language Models	Tong Xiao et.al.	2501.09223	link
2025-01-16	Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs	Sanchit Sinha et.al.	2501.09221	null
2025-01-16	A Simple Graph Contrastive Learning Framework for Short Text Classification	Yonghao Liu et.al.	2501.09219	link
2025-01-16	Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics	Yuanyuan Wei et.al.	2501.09218	null
2025-01-16	Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning	Yonghao Liu et.al.	2501.09214	link
2025-01-16	FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training	Hongzhou Yu et.al.	2501.09213	link
2025-01-15	Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures	Pengru Deng et.al.	2501.09203	null
2025-01-15	Towards Semantics Lifting for Scientific Computing: A Case Study on FFT	Naifeng Zhang et.al.	2501.09201	null
2025-01-15	Guiding Retrieval using LLM-based Listwise Rankers	Mandeep Rathee et.al.	2501.09186	link
2025-01-15	The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching	Yevhen Kostiuk et.al.	2501.09164	null
2025-01-15	Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability	Stephanie L. Day et.al.	2501.09158	null
2025-01-15	Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History	Yevhen Kostiuk et.al.	2501.09154	null
2025-01-15	Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation	Xingxin He et.al.	2501.09138	null
2025-01-15	Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG	Aditi Singh et.al.	2501.09136	link
2025-01-15	HAFix: History-Augmented Large Language Models for Bug Fixing	Yu Shi et.al.	2501.09135	link
2025-01-15	Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing	Eshaan Tanwar et.al.	2501.09127	link
2025-01-15	Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment	Conrad Borchers et.al.	2501.09126	null
2025-01-15	Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach	Alireza Ghaffari et.al.	2501.09107	null
2025-01-15	Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites	Hans W. A. Hanley et.al.	2501.09102	link
2025-01-15	Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative	Yuqian Sun et.al.	2501.09099	null
2025-01-15	SteLLA: A Structured Grading System Using LLMs with RAG	Hefei Qiu et.al.	2501.09092	null
2025-01-15	Generative diffusion model with inverse renormalization group flows	Kanta Masuki et.al.	2501.09064	link
2025-01-15	Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition	Sneheel Sarangi et.al.	2501.09056	link
2025-01-15	How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias	Tosin Fadahunsi et.al.	2501.09014	link
2025-01-15	Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians	Ishan Amin et.al.	2501.09009	link
2025-01-15	Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails	Shaona Ghosh et.al.	2501.09004	null
2025-01-15	Vision Foundation Models for Computed Tomography	Suraj Pai et.al.	2501.09001	link
2025-01-15	CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks	Krit Tangsongcharoen et.al.	2501.08998	link
2025-01-15	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities	Haozhe Xie et.al.	2501.08983	link
2025-01-15	Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models	Emma Croxford et.al.	2501.08977	null
2025-01-15	Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models	Karukriti Kaushik Ghosh et.al.	2501.08974	null
2025-01-15	Analyzing the Ethical Logic of Six Large Language Models	W. Russell Neuman et.al.	2501.08951	null
2025-01-15	Applying General Turn-taking Models to Conversational Human-Robot Interaction	Gabriel Skantze et.al.	2501.08946	null
2025-01-15	Disentangling Exploration of Large Language Models by Optimal Exploitation	Tim Grams et.al.	2501.08925	null
2025-01-15	GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge	Liam Dugan et.al.	2501.08913	link
2025-01-15	Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning	Qinyu Ma et.al.	2501.08897	link
2025-01-15	Connecting SPDE to SGMs	Junsu Seo et.al.	2501.08877	null
2025-01-15	Exploring Task-Level Optimal Prompts for Visual In-Context Learning	Yan Zhu et.al.	2501.08841	null
2025-01-15	How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering	Christoph Treude et.al.	2501.08774	null
2025-01-15	Admitting Ignorance Helps the Video Question Answering Models to Answer	Haopeng Li et.al.	2501.08771	null
2025-01-15	Enhanced Large Language Models for Effective Screening of Depression and Anxiety	June M. Liu et.al.	2501.08769	null
2025-01-15	Few-Shot Learner Generalizes Across AI-Generated Image Detection	Shiyu Wu et.al.	2501.08763	null
2025-01-15	Leveraging LLM Agents for Translating Network Configurations	Yunze Wei et.al.	2501.08760	null
2025-01-15	The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities	Irina Bigoulaeva et.al.	2501.08716	link
2025-01-15	Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching	Chuangtao Ma et.al.	2501.08686	link
2025-01-15	RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency	Siqi Li et.al.	2501.08682	null
2025-01-15	Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery	Zeqin Liao et.al.	2501.08670	null
2025-01-15	MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities	Savya Khosla et.al.	2501.08648	null
2025-01-15	Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations	Kaiyuan Zheng et.al.	2501.08641	null
2025-01-15	SWSC: Shared Weight for Similar Channel in LLM	Binrui Zeng et.al.	2501.08631	null
2025-01-15	Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models	Aruna Sankaranarayanan et.al.	2501.08618	link
2025-01-15	RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation	Kaiqu Liang et.al.	2501.08617	null
2025-01-15	Assessing the Alignment of FOL Closeness Metrics with Human Judgement	Ramya Keerthy Thatikonda et.al.	2501.08613	link
2025-01-15	Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design	Zhi Zheng et.al.	2501.08603	link
2025-01-15	AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL	Tyler Stennett et.al.	2501.08600	null
2025-01-15	LlamaRestTest: Effective REST API Testing with Small Language Models	Myeongsoo Kim et.al.	2501.08598	null
2025-01-15	Sound Scene Synthesis at the DCASE 2024 Challenge	Mathieu Lagrange et.al.	2501.08587	null
2025-01-15	LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model	Yuxuan Hu et.al.	2501.08582	null
2025-01-15	Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation	Jiaqi Huang et.al.	2501.08580	link
2025-01-15	Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms	Kewei Li et.al.	2501.08570	link
2025-01-15	Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications	Jin Chen et.al.	2501.08563	link
2025-01-15	LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation	Yiran Tao et.al.	2501.08558	null
2025-01-15	The Devil is in Temporal Token: High Quality Video Reasoning Segmentation	Sitong Gong et.al.	2501.08549	link
2025-01-15	Comprehensive Subjective and Objective Evaluation Method for Text-generated Video	Zelu Qi et.al.	2501.08545	null
2025-01-15	Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation	Jiaxin Guo et.al.	2501.08523	null
2025-01-14	Quantifying the Importance of Data Alignment in Downstream Model Performance	Krrish Chawla et.al.	2501.08496	null
2025-01-14	Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition	Md Meem Hossain et.al.	2501.08471	null
2025-01-14	Selective Attention Merging for low resource tasks: A case study of Child ASR	Natarajan Balaji Shankar et.al.	2501.08468	link
2025-01-14	Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin	Joao Carmo de Almeida Neto et.al.	2501.08464	null
2025-01-14	Large Language Models For Text Classification: Case Study And Comprehensive Review	Arina Kostina et.al.	2501.08457	null
2025-01-14	Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack	Sagiv Antebi et.al.	2501.08454	null
2025-01-14	Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies	Ajwad Abrar et.al.	2501.08441	link
2025-01-14	SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models	Anurag Kumar et.al.	2501.08421	null
2025-01-14	Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation	Seyed Mehdi Rakhtala et.al.	2501.08420	null
2025-01-14	Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data	Jiaxing Qiu et.al.	2501.08413	link
2025-01-14	OptiChat: Bridging Optimization Models and Practitioners with Large Language Models	Hao Chen et.al.	2501.08406	link
2025-01-14	Towards Best Practices for Open Datasets for LLM Training	Stefan Baack et.al.	2501.08365	null
2025-01-14	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-14	PokerBench: Training Large Language Models to become Professional Poker Players	Richard Zhuang et.al.	2501.08328	link
2025-01-14	Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks	Miran Heo et.al.	2501.08326	null
2025-01-14	ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations	Ziyuan Huang et.al.	2501.08324	null
2025-01-14	Exploring Robustness of Multilingual LLMs on Real-World Noisy Data	Amirhossein Aliakbarzadeh et.al.	2501.08322	link
2025-01-14	Enhancing Automated Interpretability with Output-Centric Feature Descriptions	Yoav Gur-Arieh et.al.	2501.08319	link
2025-01-14	MiniMax-01: Scaling Foundation Models with Lightning Attention	MiniMax et.al.	2501.08313	null
2025-01-14	HALoGEN: Fantastic LLM Hallucinations and Where to Find Them	Abhilasha Ravichander et.al.	2501.08292	null
2025-01-14	LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding	Hongyu Li et.al.	2501.08282	link
2025-01-14	Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing	Pulkit Arora et.al.	2501.08276	null
2025-01-14	Addressing the sustainable AI trilemma: a case study on LLM agents and RAG	Hui Wu et.al.	2501.08262	link
2025-01-14	Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models	Yifu Qiu et.al.	2501.08248	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset	Jiawei Du et.al.	2501.08238	null
2025-01-14	Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings	Paul Joe Maliakel et.al.	2501.08219	null
2025-01-14	ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems	Mohita Chowdhury et.al.	2501.08208	null
2025-01-14	ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving	Zain Ul Abedin et.al.	2501.08203	null
2025-01-14	CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation	Jinjun Peng et.al.	2501.08200	link
2025-01-14	OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training	Yijiong Yu et.al.	2501.08197	link
2025-01-14	PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving	Ahmet Caner Yüzügüler et.al.	2501.08192	null
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-15	A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following	Yin Fang et.al.	2501.08187	link
2025-01-14	Potential and Perils of Large Language Models as Judges of Unstructured Textual Data	Rewina Bedemariam et.al.	2501.08167	null
2025-01-14	I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution	Soohyeon Choi et.al.	2501.08165	null
2025-01-14	Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data	Phai Vu Dinh et.al.	2501.08149	null
2025-01-14	Refusal Behavior in Large Language Models: A Nonlinear Perspective	Fabian Hildebrandt et.al.	2501.08145	link
2025-01-14	Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying	Jonathan Lyhs et.al.	2501.08142	null
2025-01-14	Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2	Seamie Hayes et.al.	2501.08118	null
2025-01-15	Consistency of Responses and Continuations Generated by Large Language Models on Social Media	Wenlu Fan et.al.	2501.08102	null
2025-01-14	Hierarchical Autoscaling for Large Language Model Serving with Chiron	Archit Patke et.al.	2501.08090	null
2025-01-14	Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving	Nert Keser et.al.	2501.08083	null
2025-01-14	CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning	Guoliang He et.al.	2501.08071	link
2025-01-14	A Roadmap to Guide the Integration of LLMs in Hierarchical Planning	Israel Puerta-Merino et.al.	2501.08068	null
2025-01-14	Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT	Awritrojit Banerjee et.al.	2501.08053	null
2025-01-14	TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yao Liang et.al.	2501.08008	null
2025-01-14	LLM-Enhanced Holonic Architecture for Ad-Hoc Scalable SoS	Muhammad Ashfaq et.al.	2501.07992	null
2025-01-14	Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness	Jiaxing Zhao et.al.	2501.07978	link
2025-01-14	Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models	Yifang Xu et.al.	2501.07972	null
2025-01-14	Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning	Jiaqi Hua et.al.	2501.07959	link
2025-01-14	AI Guide Dog: Egocentric Path Prediction on Smartphone	Aishwarya Jadhav et.al.	2501.07957	null
2025-01-14	Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations	Waqar Hussain et.al.	2501.07931	null
2025-01-14	Gandalf the Red: Adaptive Security for LLMs	Niklas Pfister et.al.	2501.07927	link
2025-01-14	VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models	Hui Kuurila-Zhang et.al.	2501.07922	link
2025-01-14	Large Language Model Interface for Home Energy Management Systems	François Michelon et.al.	2501.07919	null
2025-01-14	Bridge-SR: Schrödinger Bridge for Efficient SR	Chang Li et.al.	2501.07897	null
2025-01-14	Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs	Shuai Wang et.al.	2501.07892	null
2025-01-14	ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding	Zhongxiang Sun et.al.	2501.07861	null
2025-01-14	Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques	Shobhit Ratan et.al.	2501.07853	null
2025-01-14	Unveiling Provider Bias in Large Language Models for Code Generation	Xiaoyu Zhang et.al.	2501.07849	null
2025-01-14	Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning	Haoyu Han et.al.	2501.07845	null
2025-01-14	A Driver Advisory System Based on Large Language Model for High-speed Train	Y. C. Luo et.al.	2501.07837	null
2025-01-14	Flow: A Modular Approach to Automated Agentic Workflow Generation	Boye Niu et.al.	2501.07834	link
2025-01-14	Real-time Verification and Refinement of Language Model Text Generation	Joonho Ko et.al.	2501.07824	null
2025-01-14	3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding	Haomiao Xiong et.al.	2501.07819	link
2025-01-14	A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models	Kaustubh D. Dhole et.al.	2501.07818	null
2025-01-14	Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models	Dhruv Dhamani et.al.	2501.07815	null
2025-01-14	Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering	Feijie Wu et.al.	2501.07813	null
2025-01-14	CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation	Ruwei Pan et.al.	2501.07811	null
2025-01-14	Visual Language Models as Operator Agents in the Space Domain	Alejandro Carrasco et.al.	2501.07802	null
2025-01-14	Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding	Zhaokai Wang et.al.	2501.07783	link
2025-01-14	Symmetry-Aware Generative Modeling through Learned Canonicalization	Kusha Sareen et.al.	2501.07773	null
2025-01-14	Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey	Bingchen Liu et.al.	2501.07766	null
2025-01-14	On the Statistical Capacity of Deep Generative Models	Edric Tam et.al.	2501.07763	link
2025-01-13	Advancing Student Writing Through Automated Syntax Feedback	Kamyar Zeinalipour et.al.	2501.07740	null
2025-01-13	Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens	Dongwon Kim et.al.	2501.07730	null
2025-01-13	LLMic: Romanian Foundation Language Model	Vlad-Andrei Bădoiu et.al.	2501.07721	null
2025-01-13	CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory	Haokun Zhao et.al.	2501.07674	null
2025-01-13	Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning	Karishma Thakrar et.al.	2501.07663	null
2025-01-13	Large Language Models for Interpretable Mental Health Diagnosis	Brian Hyeongseok Kim et.al.	2501.07653	null
2025-01-13	BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations	Weixi Feng et.al.	2501.07647	null
2025-01-13	GPT as a Monte Carlo Language Tree: A Probabilistic Perspective	Kun-Peng Ning et.al.	2501.07641	null
2025-01-13	SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models	Fabien Bernier et.al.	2501.07639	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	Imagine while Reasoning in Space: Multimodal Visualization-of-Thought	Chengzu Li et.al.	2501.07542	null
2025-01-13	ML Mule: Mobile-Driven Context-Aware Collaborative Learning	Haoxiang Yu et.al.	2501.07536	null
2025-01-13	Investigating Large Language Models in Inferring Personality Traits from User Conversations	Jianfeng Zhu et.al.	2501.07532	null
2025-01-13	RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment	Difei Gu et.al.	2501.07525	link
2025-01-13	Parallel Key-Value Cache Fusion for Position Invariant RAG	Philhoon Oh et.al.	2501.07523	null
2025-01-13	Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards	Yangsibo Huang et.al.	2501.07493	null
2025-01-13	TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models	Thales Sales Almeida et.al.	2501.07482	link
2025-01-13	A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities	Yihao Liu et.al.	2501.07468	null
2025-01-13	Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI	Rolf Pfister et.al.	2501.07458	null
2025-01-13	Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection	Xin Yin et.al.	2501.07425	null
2025-01-13	Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion	Lala Shakti Swarup Ray et.al.	2501.07408	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Simulating the Hubbard Model with Equivariant Normalizing Flows	Dominic Schuh et.al.	2501.07371	null
2025-01-13	Emergent effects of scaling on the functional hierarchies within large language models	Paul C. Bogdan et.al.	2501.07359	null
2025-01-13	Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring	Buse Sibel Korkmaz et.al.	2501.07324	link
2025-01-13	FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering	Erik Henriksson et.al.	2501.07314	link
2025-01-13	The Lessons of Developing Process Reward Models in Mathematical Reasoning	Zhenru Zhang et.al.	2501.07301	null
2025-01-13	GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction	Oleg Kobzarev et.al.	2501.07295	null
2025-01-13	LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks	Zan-Kai Chong et.al.	2501.07288	null
2025-01-13	Lifelong Learning of Large Language Model based Agents: A Roadmap	Junhao Zheng et.al.	2501.07278	link
2025-01-13	Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation	Amir Sartipi et.al.	2501.07276	null
2025-01-13	Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics	Wonduk Seo et.al.	2501.07267	null
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-13	EdgeTAM: On-Device Track Anything Model	Chong Zhou et.al.	2501.07256	link
2025-01-13	Large Language Models: New Opportunities for Access to Science	Jutta Schnabel et.al.	2501.07250	null
2025-01-13	Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training	Ziqing Wen et.al.	2501.07237	link
2025-01-13	Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction	Qiaoqiao Ren et.al.	2501.07224	link
2025-01-13	Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing	Laifa Tao et.al.	2501.07191	null
2025-01-13	Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study	Huashan Chen et.al.	2501.07165	null
2025-01-13	AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model	Bangchen Yin et.al.	2501.07155	link
2025-01-13	LLM360 K2: Scaling Up 360-Open-Source Large Language Models	Zhengzhong Liu et.al.	2501.07124	null
2025-01-13	How GPT learns layer by layer	Jason Du et.al.	2501.07108	link
2025-01-13	ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training	Jiayang Wu et.al.	2501.07078	link
2025-01-13	D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation	Zhejun Zhang et.al.	2501.07077	link
2025-01-13	Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values	Jing Yao et.al.	2501.07071	null
2025-01-13	Enhancing Image Generation Fidelity via Progressive Prompts	Zhen Xiong et.al.	2501.07070	link
2025-01-13	Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities	ZeKe Xiao et.al.	2501.07058	null
2025-01-13	SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation	Yee-Fan Tan et.al.	2501.07055	null
2025-01-13	PoAct: Policy and Action Dual-Control Agent for Generalized Applications	Guozhi Yuan et.al.	2501.07054	null
2025-01-13	ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction	Yan Zhang et.al.	2501.07051	link
2025-01-13	Unveiling the Potential of Text in High-Dimensional Time Series Forecasting	Xin Zhou et.al.	2501.07048	link
2025-01-13	Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis	Luwei Zeng et.al.	2501.07034	null
2025-01-13	A Proposed Large Language Model-Based Smart Search for Archive System	Ha Dung Nguyen et.al.	2501.07024	null
2025-01-13	Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps	Henry Li et.al.	2501.06999	link
2025-01-13	LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models	Mozhgan Nasr Azadani et.al.	2501.06986	link
2025-01-13	Combining LLM decision and RL action selection to improve RL policy for adaptive interventions	Karine Karine et.al.	2501.06980	null
2025-01-12	How is Google using AI for internal code migrations?	Stoyan Nikolov et.al.	2501.06972	null
2025-01-12	Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives	Xinyao Ma et.al.	2501.06964	null
2025-01-12	Comparison of Autoencoders for tokenization of ASL datasets	Vouk Praun-Petrovic et.al.	2501.06942	null
2025-01-12	Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy	Evgeny Ugolkov et.al.	2501.06939	link
2025-01-12	Harnessing Large Language Models for Disaster Management: A Survey	Zhenyu Lei et.al.	2501.06932	null
2025-01-12	Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories	Faaiq Waqar et.al.	2501.06921	null
2025-01-12	Risk-Averse Finetuning of Large Language Models	Sapana Chaudhary et.al.	2501.06911	link
2025-01-12	Deep Learning and Foundation Models for Weather Prediction: A Survey	Jimeng Shi et.al.	2501.06907	link
2025-01-12	A Foundational Generative Model for Breast Ultrasound Image Analysis	Haojun Yu et.al.	2501.06869	null
2025-01-12	Transfer Learning of Tabular Data by Finetuning Large Language Models	Shourav B. Rabbani et.al.	2501.06863	null
2025-01-12	A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context	Noureldin Zahran et.al.	2501.06859	null
2025-01-12	SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training	Tianjin Huang et.al.	2501.06842	link
2025-01-12	An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering	Zaber Al Hassan Ayon et.al.	2501.06837	null
2025-01-12	X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding	Wenqi Zhou et.al.	2501.06835	null
2025-01-12	LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents	Augusto Gonzalez-Bonorino et.al.	2501.06834	link
2025-01-12	GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing	Ruizhe Ou et.al.	2501.06828	null
2025-01-12	Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification	Shijing Chen et.al.	2501.06827	null
2025-01-12	Event Argument Extraction with Enriched Prompts	Chen Liang et.al.	2501.06825	link
2025-01-12	A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT	Yizhou Zhou et.al.	2501.06819	null
2025-01-12	RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models	Keyan Chen et.al.	2501.06809	link
2025-01-12	Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting	Yongshuo Zhu et.al.	2501.06808	null
2025-01-12	MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference	Wenxuan Zeng et.al.	2501.06807	null
2025-01-12	Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences	Liu Yu et.al.	2501.06795	null
2025-01-12	3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes	Mahmoud Ahmed et.al.	2501.06785	link
2025-01-12	Cost-Effective Robotic Handwriting System with AI Integration	Tianyi Huang et.al.	2501.06783	null
2025-01-12	Eliza: A Web3 friendly AI Agent Operating System	Shaw Walters et.al.	2501.06781	link
2025-01-12	VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning	Ji Soo Lee et.al.	2501.06761	link
2025-01-12	Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation	Shunfan Zheng et.al.	2501.06741	null
2025-01-12	ZOQO: Zero-Order Quantized Optimization	Noga Bar et.al.	2501.06736	null
2025-01-12	Better Prompt Compression Without Multi-Layer Perceptrons	Edouardo Honig et.al.	2501.06730	null
2025-01-12	Measuring the Robustness of Reference-Free Dialogue Evaluation Systems	Justin Vasselli et.al.	2501.06728	link
2025-01-12	Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G	Zhiyan Liu et.al.	2501.06726	null
2025-01-12	DRDT3: Diffusion-Refined Decision Test-Time Training Model	Xingshuai Huang et.al.	2501.06718	null
2025-01-12	ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian	Mykyta Syromiatnikov et.al.	2501.06715	link
2025-01-12	Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management	Liu Qianli et.al.	2501.06709	null
2025-01-12	Evaluating Sample Utility for Data Selection by Mimicking Model Weights	Tzu-Heng Huang et.al.	2501.06708	null
2025-01-12	AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds	Yinfang Chen et.al.	2501.06706	null
2025-01-12	Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese	Jie Yang et.al.	2501.06704	null
2025-01-12	Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users’ Questions	Aidan Hogan et.al.	2501.06699	null
2025-01-12	DVM: Towards Controllable LLM Agents in Social Deduction Games	Zheng Zhang et.al.	2501.06695	null
2025-01-12	TAPO: Task-Referenced Adaptation for Prompt Optimization	Wenxin Luo et.al.	2501.06689	link
2025-01-12	Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning	Xiangen Hu et.al.	2501.06682	null
2025-01-12	Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving	Haoxiang Gao et.al.	2501.06680	null
2025-01-11	Challenging reaction prediction models to generalize to novel chemistry	John Bradshaw et.al.	2501.06669	link
2025-01-11	Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training	Sanjit Kakarla et.al.	2501.06658	link
2025-01-11	FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings	Tong Liu et.al.	2501.06645	null
2025-01-11	Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models	Veronika Smilga et.al.	2501.06638	link
2025-01-11	Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach	Mohammed Maree et.al.	2501.06628	null
2025-01-11	Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks	Amr Almorsi et.al.	2501.06625	null
2025-01-11	Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks	Xuanhao Luo et.al.	2501.06604	null
2025-01-11	ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation	Xuanle Zhao et.al.	2501.06598	link
2025-01-11	ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning	Xiangru Tang et.al.	2501.06590	link
2025-01-11	Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping	Muru Zhang et.al.	2501.06589	link
2025-01-10	LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs	Omkar Thawakar et.al.	2501.06186	link
2025-01-10	PEACE: Empowering Geologic Map Holistic Understanding with MLLMs	Yangyu Huang et.al.	2501.06184	null
2025-01-10	VideoAuteur: Towards Long Narrative Video Generation	Junfei Xiao et.al.	2501.06173	null
2025-01-10	GenMol: A Drug Discovery Generalist with Discrete Diffusion	Seul Lee et.al.	2501.06158	null
2025-01-10	Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories	Gerd Kortemeyer et.al.	2501.06143	null
2025-01-10	Supervision policies can shape long-term risk management in general-purpose AI models	Manuel Cebrian et.al.	2501.06137	link
2025-01-10	Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI	Yuya Asano et.al.	2501.06129	null
2025-01-10	Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding	Fabian David Schmidt et.al.	2501.06117	link
2025-01-10	From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy	Elham Aghakhani et.al.	2501.06101	null
2025-01-10	Photokinetics of Photothermal Reactions	Mounir Maafi et.al.	2501.06057	null
2025-01-10	AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery	Johann Wenckstern et.al.	2501.06039	link
2025-01-10	Addressing speaker gender bias in large scale speech translation systems	Shubham Bansal et.al.	2501.05989	null
2025-01-10	Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing	Eklavya Sarkar et.al.	2501.05987	link
2025-01-10	Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys	Divya Mani Adhikari et.al.	2501.05985	null
2025-01-10	Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea	Eunjung Cho et.al.	2501.05981	null
2025-01-10	Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory	Yunmeng Shu et.al.	2501.05965	null
2025-01-10	Effective faking of verbal deception detection with target-aligned adversarial attacks	Bennett Kleinberg et.al.	2501.05962	null
2025-01-10	Reusable specimen-level inference in computational pathology	Jakub R. Kaczmarzyk et.al.	2501.05945	link
2025-01-10	DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information	Yongfan Lai et.al.	2501.05932	link
2025-01-10	LLMs Reproduce Stereotypes of Sexual and Gender Minorities	Ruby Ostrow et.al.	2501.05926	null
2025-01-10	Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction	Petraq Nako et.al.	2501.05925	null
2025-01-10	Valley2: Exploring Multimodal Models with Scalable Vision-Language Design	Ziheng Wu et.al.	2501.05901	link
2025-01-10	Prompt engineering and its implications on the energy consumption of Large Language Models	Riccardo Rubei et.al.	2501.05899	link
2025-01-10	Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs	Bianca Raimondi et.al.	2501.05891	link
2025-01-10	Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs	Dabing Cheng et.al.	2501.05884	null
2025-01-10	VideoRAG: Retrieval-Augmented Generation over Video Corpus	Soyeong Jeong et.al.	2501.05874	link
2025-01-10	ConSim: Measuring Concept-Based Explanations’ Effectiveness with Automated Simulatability	Antonin Poché et.al.	2501.05855	link
2025-01-10	Understanding Impact of Human Feedback via Influence Functions	Taywon Min et.al.	2501.05790	link
2025-01-10	Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models	You Li et.al.	2501.05767	null
2025-01-10	Controlling Large Language Models Through Concept Activation Vectors	Hanyu Zhang et.al.	2501.05764	null
2025-01-10	StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation	Shangjin Zhai et.al.	2501.05763	null
2025-01-10	CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech	Madhurananda Pahar et.al.	2501.05755	null
2025-01-10	Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models	Sungjae Lee et.al.	2501.05752	null
2025-01-10	TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos	Korawat Charoenpitaks et.al.	2501.05733	link
2025-01-10	Enabling Scalable Oversight via Self-Evolving Critic	Zhengyang Tang et.al.	2501.05727	null
2025-01-10	I Can’t Share Code, but I need Translation – An Empirical Study on Code Translation through Federated LLM	Jahnavi Kumar et.al.	2501.05724	null
2025-01-10	How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond	Chen Huang et.al.	2501.05714	null
2025-01-10	Multi-Step Reasoning in Korean and the Emergent Mirage	Guijin Son et.al.	2501.05712	null
2025-01-10	EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model	Yi He et.al.	2501.05710	null
2025-01-10	Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains	Vighnesh Subramaniam et.al.	2501.05707	null
2025-01-10	Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness	Audrey Salmon et.al.	2501.05706	null
2025-01-10	Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection	Feiyi Chen et.al.	2501.05675	null
2025-01-10	Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration	Zuyuan Zhang et.al.	2501.05673	null
2025-01-10	Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models	Zheqi Lv et.al.	2501.05662	null
2025-01-10	Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation	Zheqi Lv et.al.	2501.05647	null
2025-01-10	Iconicity in Large Language Models	Anna Marklová et.al.	2501.05643	null
2025-01-10	HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection	Anant Mehta et.al.	2501.05631	link
2025-01-10	The Impact of Model Scaling on Seen and Unseen Language Performance	Rhitabrat Pokharel et.al.	2501.05629	null
2025-01-09	Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study	Zhenyu Qi et.al.	2501.05625	null
2025-01-09	Exploring Large Language Models for Translating Romanian Computational Problems into English	Adrian Marius Dumitran et.al.	2501.05601	null
2025-01-09	Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics	Gert Aarts et.al.	2501.05580	null
2025-01-09	Exploring Large Language Models (LLMs) through interactive Python activities	Eugenio Tufino et.al.	2501.05577	link
2025-01-09	LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts	Yuri Facanha Bezerra et.al.	2501.05554	link
2025-01-09	The dynamics of meaning through time: Assessment of Large Language Models	Mohamed Taher Alrefaie et.al.	2501.05552	null
2025-01-09	Infecting Generative AI With Viruses	David Noever et.al.	2501.05542	null
2025-01-09	NSChat: A Chatbot System To Rule Them All	Zenon Lamprou et.al.	2501.05541	null
2025-01-09	ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding	Xingyu Fu et.al.	2501.05452	null
2025-01-09	Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Yifan Yu et.al.	2501.05446	link
2025-01-09	Consistent Flow Distillation for Text-to-3D Generation	Runjie Yan et.al.	2501.05445	null
2025-01-09	Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark	Yunzhuo Hao et.al.	2501.05444	link
2025-01-09	A survey of textual cyber abuse detection using cutting-edge language models and large language models	Jose A. Diaz-Garcia et.al.	2501.05443	null
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers	Jerry Chongyi Hu et.al.	2501.05423	null
2025-01-09	Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation	Darius Petermann et.al.	2501.05413	null
2025-01-10	Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics	Maximilian Alber et.al.	2501.05409	null
2025-01-09	TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts	Yu-Hao Huang et.al.	2501.05403	link
2025-01-09	Mechanistic understanding and validation of large AI models with SemanticLens	Maximilian Dreyer et.al.	2501.05398	link
2025-01-09	FairCode: Evaluating Social Bias of LLMs in Code Generation	Yongkang Du et.al.	2501.05396	link
2025-01-09	Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models	Kristian G. Barman et.al.	2501.05382	null
2025-01-09	Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance	Dimitrios Gerogiannis et.al.	2501.05379	null
2025-01-09	Accelerated Diffusion Models via Speculative Sampling	Valentin De Bortoli et.al.	2501.05370	null
2025-01-09	Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction	Hantao Lou et.al.	2501.05336	link
2025-01-09	“What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles	Xuewen Luo et.al.	2501.05322	null
2025-01-09	Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning	Nora Gourmelon et.al.	2501.05281	link
2025-01-09	CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models	Fabian Hörst et.al.	2501.05269	link
2025-01-09	Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal	Wanli Ma et.al.	2501.05265	null
2025-01-09	CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models	Yewei Song et.al.	2501.05255	null
2025-01-09	From Scientific Texts to Verifiable Code: Automating the Process with Transformers	Changjie Wang et.al.	2501.05252	null
2025-01-09	RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models	Peizhuo Lv et.al.	2501.05249	null
2025-01-09	Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning	Laura Puccioni et.al.	2501.05248	null
2025-01-09	Online Prompt and Solver Selection for Program Synthesis	Yixuan Li et.al.	2501.05247	null
2025-01-09	Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs	Artem Fedorchenko et.al.	2501.05234	null
2025-01-09	Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection	Pei-Kang Lee et.al.	2501.05228	null
2025-01-09	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	link
2025-01-09	Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond	Tomas Goldsack et.al.	2501.05224	null
2025-01-09	A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education	Ziqing Li et.al.	2501.05220	null
2025-01-09	Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration	Xuyang Liu et.al.	2501.05179	link
2025-01-09	Emergence of human-like polarization among large language model agents	Jinghua Piao et.al.	2501.05171	null
2025-01-09	Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering	Matteo Esposito et.al.	2501.05165	null
2025-01-09	Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier	Yufei Shang et.al.	2501.05155	null
2025-01-09	DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving	Xuran Zheng et.al.	2501.05081	null
2025-01-09	Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization	Harshith Manjunath et.al.	2501.05079	null
2025-01-09	Analyzing Memorization in Large Language Models through the Lens of Model Attribution	Tarun Ram Menta et.al.	2501.05078	link
2025-01-09	A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model	Shuo Tong et.al.	2501.05075	null
2025-01-09	Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning	Huabin Liu et.al.	2501.05069	null
2025-01-09	LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding	Jiaxing Zhao et.al.	2501.05067	null
2025-01-09	Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators	Leonard Lupin-Jimenez et.al.	2501.05058	null
2025-01-09	LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models	Zengqi Peng et.al.	2501.05057	null
2025-01-09	On the Generalizability of Transformer Models to Code Completions of Different Lengths	Nathan Cooper et.al.	2501.05051	null
2025-01-09	SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution	Chengxing Xie et.al.	2501.05040	link
2025-01-09	Enhancing Human-Like Responses in Large Language Models	Ethem Yağız Çalık et.al.	2501.05032	null
2025-01-09	ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark	Ronghao Dang et.al.	2501.05031	link
2025-01-09	A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications	Ofir Marom et.al.	2501.05030	null
2025-01-09	TreeKV: Smooth Key-Value Cache Compression with Tree Structures	Ziwei He et.al.	2501.04987	null
2025-01-09	SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs	Muhammad Salman et.al.	2501.04985	null
2025-01-09	V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer	Hangzhou He et.al.	2501.04975	link
2025-01-09	Demystifying Domain-adaptive Post-training for Financial LLMs	Zixuan Ke et.al.	2501.04961	link
2025-01-09	Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments	Yifan Xu et.al.	2501.04947	null
2025-01-09	Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models	Qingyu Ren et.al.	2501.04945	link
2025-01-09	Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency	Shiji Zhao et.al.	2501.04931	null
2025-01-09	Investigating Numerical Translation with Large Language Models	Wei Tang et.al.	2501.04927	null
2025-01-09	FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching	Jun-Hak Yun et.al.	2501.04926	link
2025-01-09	HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers	Yiyao Yang et.al.	2501.04908	link
2025-01-09	JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis	Jun-Hyeok Cha et.al.	2501.04904	null
2025-01-09	ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries	Keke Huang et.al.	2501.04901	null
2025-01-09	SUGAR: Leveraging Contextual Confidence for Smarter Retrieval	Hanna Zubkova et.al.	2501.04899	null
2025-01-08	Leveraging Log Probabilities in Language Models to Forecast Future Events	Tommaso Soru et.al.	2501.04880	null
2025-01-08	Real-Time Textless Dialogue Generation	Long Mai et.al.	2501.04877	link
2025-01-08	Modelling complex proton transport phenomena – Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields	Malte Grunert et.al.	2501.04876	null
2025-01-08	Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware	Brandon J Walton et.al.	2501.04848	null
2025-01-08	Do Code LLMs Understand Design Patterns?	Zhenyu Pan et.al.	2501.04835	null
2025-01-08	On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability	Andreas Vogelsang et.al.	2501.04810	null
2025-01-08	IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX	Erik Recio-Armengol et.al.	2501.04776	link
2025-01-08	Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations	Kirandeep Kaur et.al.	2501.04762	null
2025-01-08	Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch	Phillip Richter et.al.	2501.04755	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	null
2025-01-08	Re-ranking the Context for Multimodal Retrieval Augmented Generation	Matin Mortaheb et.al.	2501.04695	null
2025-01-08	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	Zixuan Huang et.al.	2501.04689	null
2025-01-08	URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics	Ruilin Luo et.al.	2501.04686	link
2025-01-08	Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations	Archita Srivastava et.al.	2501.04675	null
2025-01-08	Assessing Language Comprehension in Large Language Models Using Construction Grammar	Wesley Scivetti et.al.	2501.04661	null
2025-01-08	Multi-task retriever fine-tuning for domain-specific and efficient RAG	Patrice Béchard et.al.	2501.04652	null
2025-01-08	FlairGPT: Repurposing LLMs for Interior Designs	Gabrielle Littlefair et.al.	2501.04648	null
2025-01-08	Knowledge Retrieval Based on Generative AI	Te-Lun Yang et.al.	2501.04635	null
2025-01-08	“Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era	Giulio Antonio Abbo et.al.	2501.04633	null
2025-01-09	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-08	Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning	Ivan Kankeu et.al.	2501.04591	link
2025-01-08	Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models	Miaoyang He et.al.	2501.04582	null
2025-01-08	InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection	Yuhang Liu et.al.	2501.04575	link
2025-01-09	OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis	Run Luo et.al.	2501.04561	link
2025-01-08	The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas?	Christopher Lazik et.al.	2501.04543	null
2025-01-08	Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time	Uri Berger et.al.	2501.04513	null
2025-01-08	CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection	Ruijun Feng et.al.	2501.04510	null
2025-01-08	Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction	Guofeng Yang et.al.	2501.04487	null
2025-01-08	When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages	Archchana Sindhujan et.al.	2501.04473	null
2025-01-08	Hidden Entity Detection from GitHub Leveraging Large Language Models	Lu Gan et.al.	2501.04455	link
2025-01-08	Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions	Doaa Mahmud et.al.	2501.04437	null
2025-01-08	Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions	Na Yan et.al.	2501.04436	null
2025-01-08	End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach	H. M. Shadman Tabib et.al.	2501.04425	null
2025-01-08	SEO: Stochastic Experience Optimization for Large Language Models	Jitao Xu et.al.	2501.04393	null
2025-01-08	iFADIT: Invertible Face Anonymization via Disentangled Identity Transform	Lin Yuan et.al.	2501.04390	null
2025-01-08	DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications	Feng Liu et.al.	2501.04366	link
2025-01-08	Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting	Dong-Hai Zhu et.al.	2501.04341	link
2025-01-09	Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models	Haonan Shi et.al.	2501.04323	null
2025-01-08	Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts	Preethi Seshadri et.al.	2501.04316	link
2025-01-08	RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation	Jun Liu et.al.	2501.04315	null
2025-01-08	Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models	Kunpeng Zhang et.al.	2501.04312	null
2025-01-08	LLM4SR: A Survey on Large Language Models for Scientific Research	Ziming Luo et.al.	2501.04306	link
2025-01-08	Multimodal Graph Constrastive Learning and Prompt for ChartQA	Yue Dai et.al.	2501.04303	null
2025-01-08	H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving	Siran Chen et.al.	2501.04302	null
2025-01-08	An Analysis of Model Robustness across Concurrent Distribution Shifts	Myeongho Jeon et.al.	2501.04288	null
2025-01-08	Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models	Bahman Torkamandi et.al.	2501.04286	link
2025-01-08	Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking	Tianqi Ren et.al.	2501.04285	null
2025-01-08	OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments	Yujie Tang et.al.	2501.04279	null
2025-01-08	Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering	Christophe Bajan et.al.	2501.04277	link
2025-01-08	Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation	Senwei Xie et.al.	2501.04268	null
2025-01-08	Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning	Lang Xu et.al.	2501.04266	null
2025-01-08	IOLBENCH: Benchmarking LLMs on Linguistic Reasoning	Satyam Goyal et.al.	2501.04249	link
2025-01-08	TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events	Jian-Hua Fang et.al.	2501.04247	null
2025-01-08	Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks	Rachel Longjohn et.al.	2501.04234	null
2025-01-07	Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation	Alireza Salemi et.al.	2501.04167	null
2025-01-07	AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots	Shaoyue Wen et.al.	2501.04156	link
2025-01-07	Multilingual Open QA on the MIA Shared Task	Navya Yarrabelly et.al.	2501.04153	null
2025-01-07	The angular momentum spiral of the Milky Way disc in Gaia	Rashid Yaaqib et.al.	2501.04095	null
2025-01-07	More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives	Xiaoqing Zhang et.al.	2501.04070	link
2025-01-07	ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono	Jingquan Wang et.al.	2501.04062	null
2025-01-07	LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	Lingdong Kong et.al.	2501.04005	null
2025-01-07	Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos	Haobo Yuan et.al.	2501.04001	link
2025-01-07	RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance	Matin Mortaheb et.al.	2501.03995	null
2025-01-07	Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance	Adil Rengim Cetingoz et.al.	2501.03993	null
2025-01-07	Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles	Yuxi Xia et.al.	2501.03991	null
2025-01-07	(De)-Indexing and the Right to be Forgotten	Salvatore Vilella et.al.	2501.03989	null
2025-01-07	VLM-driven Behavior Tree for Context-aware Task Planning	Naoki Wake et.al.	2501.03968	link
2025-01-07	Vision Language Models as Values Detectors	Giulio Antonio Abbo et.al.	2501.03957	null
2025-01-07	Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States	Jurgita Kapočiūtė-Dzikienė et.al.	2501.03952	null
2025-01-07	Synthetic Data Privacy Metrics	Amy Steier et.al.	2501.03941	null
2025-01-07	Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection	Pablo Miralles-González et.al.	2501.03940	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study	Ramya Jonnala et.al.	2501.03904	null
2025-01-07	LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token	Shaolei Zhang et.al.	2501.03895	link
2025-01-07	AlphaPO – Reward shape matters for LLM alignment	Aman Gupta et.al.	2501.03884	null
2025-01-07	CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds	Keonwoo Kim et.al.	2501.03879	null
2025-01-07	Progressive Document-level Text Simplification via Large Language Models	Dengzhao Fang et.al.	2501.03857	null
2025-01-07	MedFocusCLIP: Improving few shot classification in medical datasets using pixel wise attention	Aadya Arora et.al.	2501.03839	null
2025-01-07	Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging	Simon W. Penninga et.al.	2501.03825	null
2025-01-08	MADation: Face Morphing Attack Detection with Foundation Models	Eduarda Caldeira et.al.	2501.03800	link
2025-01-07	KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration	Chengyuan Li et.al.	2501.03786	null
2025-01-07	Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series	Yuxiao Hu et.al.	2501.03747	null
2025-01-07	Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein	Xiaotong Guo et.al.	2501.03722	null
2025-01-07	Motion-Aware Generative Frame Interpolation	Guozhen Zhang et.al.	2501.03699	null
2025-01-07	SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment	Yuchun Fan et.al.	2501.03681	link
2025-01-07	Effective and Efficient Mixed Precision Quantization of Speech Foundation Models	Haoning Xu et.al.	2501.03643	null
2025-01-07	CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems	Zhaonan Wu et.al.	2501.03626	link
2025-01-07	LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment	Gaoussou Youssouf Kebe et.al.	2501.03624	null
2025-01-07	Cosmos World Foundation Model Platform for Physical AI	NVIDIA et.al.	2501.03575	link
2025-01-07	From Code to Compliance: Assessing ChatGPT’s Utility in Designing an Accessible Webpage – A Case Study	Ammar Ahmed et.al.	2501.03572	null
2025-01-07	What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs	Muneera Bano et.al.	2501.03569	null
2025-01-07	Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities	Benedikt Reitemeyer et.al.	2501.03566	null
2025-01-07	Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis	Haoran Lai et.al.	2501.03565	null
2025-01-07	PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models	Lingzhi Yuan et.al.	2501.03544	null
2025-01-07	Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions	Weijieying Ren et.al.	2501.03540	null
2025-01-07	Deep Learning for Pathological Speech: A Survey	Shakeel A. Sheikh et.al.	2501.03536	null
2025-01-08	SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving	Xuewen Luo et.al.	2501.03535	null
2025-01-07	A generative approach for lensless imaging in low-light conditions	Ziyang Liu et.al.	2501.03511	null
2025-01-07	A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models	Shuyang Wang et.al.	2501.03508	null
2025-01-07	Textualize Visual Prompt for Image Editing via Diffusion Bridge	Pengcheng Xu et.al.	2501.03495	null
2025-01-07	Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment	Prashant Trivedi et.al.	2501.03486	null
2025-01-07	Reading with Intent – Neutralizing Intent	Benjamin Reichman et.al.	2501.03475	null
2025-01-07	Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning	Chuang Niu et.al.	2501.03469	link
2025-01-07	MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems	Yannis Katsis et.al.	2501.03468	link
2025-01-07	ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation	Yu-Cheng Liu et.al.	2501.03462	null
2025-01-07	Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation	Xiao Wang et.al.	2501.03458	link
2025-01-07	CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering	Jialiang Chen et.al.	2501.03447	null
2025-01-07	LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models	Mohamad Fakih et.al.	2501.03446	null
2025-01-07	Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology	Sarah E. Finch et.al.	2501.03441	link
2025-01-06	SALT: Sales Autocompletion Linked Business Tables Dataset	Tassilo Klein et.al.	2501.03413	link
2025-01-06	BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations	Simone Giovannini et.al.	2501.03403	null
2025-01-06	DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes	Xuyang Wang et.al.	2501.03397	link
2025-01-06	Evolved Quantum Boltzmann Machines	Michele Minervini et.al.	2501.03367	null
2025-01-06	CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets	Tanay Agrawal et.al.	2501.03332	null
2025-01-06	LiLMaps: Learnable Implicit Language Maps	Evgenii Kruzhkov et.al.	2501.03304	null
2025-01-06	A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval	Shuo Tong et.al.	2501.03295	null
2025-01-06	Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model	Naibo Wang et.al.	2501.03292	null
2025-01-06	ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning	Pengwei Tang et.al.	2501.03291	link
2025-01-06	CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models	Zhenyu Xu et.al.	2501.03288	null
2025-01-06	BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning	Beichen Zhang et.al.	2501.03226	link
2025-01-06	Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text	Ayat Najjar et.al.	2501.03212	null
2025-01-06	Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity	Ayat A. Najjar et.al.	2501.03203	null
2025-01-06	CLIX: Cross-Lingual Explanations of Idiomatic Expressions	Aaron Gluck et.al.	2501.03191	null
2025-01-06	Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text	Ali Al-Lawati et.al.	2501.03166	link
2025-01-06	Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy	Risha Goel et.al.	2501.03153	link
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity	Yerong Li et.al.	2501.03139	null
2025-01-07	PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models	Mingyang Song et.al.	2501.03124	link
2025-01-06	CAT: Content-Adaptive Image Tokenization	Junhong Shen et.al.	2501.03120	null
2025-01-06	LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases	Dylan Bouchard et.al.	2501.03112	link
2025-01-06	Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling	Aseem Srivastava et.al.	2501.03088	null
2025-01-06	Retrieval-Augmented TLAPS Proof Generation with Large Language Models	Yuhao Zhou et.al.	2501.03073	null
2025-01-06	ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events	Duygu Sezen Islakoglu et.al.	2501.03040	null
2025-01-06	Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning	Zhen Li et.al.	2501.03035	null
2025-01-06	TransPixar: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	CALM: Curiosity-Driven Auditing for Large Language Models	Xiang Zheng et.al.	2501.02997	link
2025-01-06	Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation	Zhi Qu et.al.	2501.02979	link
2025-01-06	FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models	Zhuo Chen et.al.	2501.02968	null
2025-01-07	Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild	Wanpeng Hu et.al.	2501.02964	link
2025-01-07	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	null
2025-01-06	The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features	Shi Bin Hoo et.al.	2501.02945	link
2025-01-07	Inhibition of bacterial growth by antibiotics	Barnabe Ledoux et.al.	2501.02944	null
2025-01-06	Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions	Jianhua Pei et.al.	2501.02928	null
2025-01-06	DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model	Hao Yu et.al.	2501.02901	link
2025-01-06	FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection	Guray Ozgur et.al.	2501.02892	link
2025-01-06	MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs	Hui Sun et.al.	2501.02885	null
2025-01-06	IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment	Yiming Zhang et.al.	2501.02869	null
2025-01-06	Large Language Models for Video Surveillance Applications	Ulindu De Silva et.al.	2501.02850	null
2025-01-06	Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification	Yubo Wang et.al.	2501.02844	null
2025-01-06	Foundations of GenIR	Qingyao Ai et.al.	2501.02842	null
2025-01-06	An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models	Dezhi Ran et.al.	2501.02829	null
2025-01-06	InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion	Zhaoyi Yan et.al.	2501.02795	null
2025-01-06	CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation	Yuanhong Chen et.al.	2501.02786	null
2025-01-06	GeAR: Generation Augmented Retrieval	Haoyu Liu et.al.	2501.02772	null
2025-01-06	Visual Large Language Models for Generalized and Specialized Applications	Yifan Li et.al.	2501.02765	link
2025-01-06	Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging?	Hongyi Miao et.al.	2501.02751	null
2025-01-06	Artificial Intelligence in Creative Industries: Advances Prior to 2025	Nantheera Anantrasirichai et.al.	2501.02725	null
2025-01-06	KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models	Zaiyi Zheng et.al.	2501.02711	null
2025-01-06	QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance	Binita Saha et.al.	2501.02702	null
2025-01-06	EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models	Andrés Villa et.al.	2501.02699	null
2025-01-05	GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Weikang Bian et.al.	2501.02690	null
2025-01-05	Decoding specialised feature neurons in LLMs with the final projection layer	Harry J Davies et.al.	2501.02688	null
2025-01-05	From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering	Wen-ran Li et.al.	2501.02680	null
2025-01-05	A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model	Shivaram Kalyanakrishnan et.al.	2501.02652	null
2025-01-05	Representation Learning of Lab Values via Masked AutoEncoder	David Restrepo et.al.	2501.02648	link
2025-01-05	Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense	Yang Ouyang et.al.	2501.02629	link
2025-01-05	Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets	Mahmoud Jahanshahi et.al.	2501.02628	null
2025-01-05	HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning	Saleh Ashkboos et.al.	2501.02625	link
2025-01-05	LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment	Yifei Liu et.al.	2501.02621	null
2025-01-05	TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms	Jovan Stojkovic et.al.	2501.02600	null
2025-01-05	LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations	Jiaping Wang et.al.	2501.02573	link
2025-01-05	Multi-LLM Collaborative Caption Generation in Scientific Documents	Jaeyoung Kim et.al.	2501.02552	link
2025-01-05	Transformers Simulate MLE for Sequence Generation in Bayesian Networks	Yuan Cao et.al.	2501.02547	null
2025-01-05	Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm	Ljubisa Bojic et.al.	2501.02532	null
2025-01-05	Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI	Ljubisa Bojic et.al.	2501.02531	null
2025-01-05	Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks	Leo Franklin et.al.	2501.02527	null
2025-01-05	Unified Guidance for Geometry-Conditioned Molecular Generation	Sirine Ayadi et.al.	2501.02526	null
2025-01-05	Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors	Minglin Chen et.al.	2501.02519	null
2025-01-05	CHAIR-Classifier of Hallucination as Improver	Ao Sun et.al.	2501.02518	link
2025-01-05	ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use	Junjie Ye et.al.	2501.02506	null
2025-01-05	Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons	Sebastian Morel-Balbi et.al.	2501.02505	null
2025-01-05	ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling	Chaojie Mao et.al.	2501.02487	null
2025-01-05	LLMPC: Large Language Model Predictive Control	Gabriel Maher et.al.	2501.02486	link
2025-01-05	Decoding News Bias: Multi Bias Detection in News Articles	Bhushan Santosh Shah et.al.	2501.02482	null
2025-01-05	Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine	Yishen Liu et.al.	2501.02471	null
2025-01-05	Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera	Yuliang Guo et.al.	2501.02464	link
2025-01-05	Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications	Zhe Chen et.al.	2501.02460	null
2025-01-05	Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap	Hyunwoo Ko et.al.	2501.02448	null
2025-01-05	RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework	Kun Wang et.al.	2501.02446	null
2025-01-05	A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models	Yinpeng Cai et.al.	2501.02441	null
2025-01-05	Efficient Deployment of Large Language Models on Resource-constrained Devices	Zhiwei Yao et.al.	2501.02438	null
2025-01-05	FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance	Haicheng Wang et.al.	2501.02430	link
2025-01-05	GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems	Mehmet Deniz Türkmen et.al.	2501.02408	null
2025-01-04	Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities	Tara Radvand et.al.	2501.02406	link
2025-01-04	Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers	Markus J. Buehler et.al.	2501.02393	link
2025-01-04	Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations	Kangyu Zhu et.al.	2501.02385	null
2025-01-04	Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison	Tsz Kin Lam et.al.	2501.02370	null
2025-01-04	Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving	Sanghyun Park et.al.	2501.02348	null
2025-01-04	Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support	Florian Putz et.al.	2501.02346	null
2025-01-04	UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility	Yonglin Tian et.al.	2501.02341	link
2025-01-04	AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference	Zhuomin He et.al.	2501.02336	link
2025-01-04	Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications	Jodi M. Casabianca et.al.	2501.02334	null
2025-01-04	Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance	Marta Gentiloni-Silveri et.al.	2501.02298	null
2025-01-04	Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection	Yachao Zhao et.al.	2501.02295	null
2025-01-04	Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control	Yongjeong Oh et.al.	2501.02273	null
2025-01-04	What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph	Yutao Jiang et.al.	2501.02268	link
2025-01-04	Unsupervised Class Generation to Expand Semantic Segmentation Datasets	Javier Montalvo et.al.	2501.02264	null
2025-01-04	Financial Named Entity Recognition: How Far Can LLM Go?	Yi-Te Lu et.al.	2501.02237	link
2025-01-04	Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends	Camille Barboule et.al.	2501.02235	null
2025-01-04	Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection	S M Mostaq Hossain et.al.	2501.02229	null
2025-01-04	Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation	Shijie Wang et.al.	2501.02226	null
2025-01-04	Can ChatGPT implement finite element models for geotechnical engineering applications?	Taegu Kim et.al.	2501.02199	null
2025-01-04	EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks	Shixuan Liu et.al.	2501.02192	null
2025-01-04	On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing	Jianwei Wang et.al.	2501.02191	link
2025-01-04	The Application of Large Language Models in Recommendation Systems	Peiyang Yu et.al.	2501.02178	null
2025-01-04	The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit	Huixue Zhou et.al.	2501.02173	null
2025-01-04	Personalized Graph-Based Retrieval for Large Language Models	Steven Au et.al.	2501.02157	link
2025-01-04	Table as Thought: Exploring Structured Thoughts in LLM Reasoning	Zhenjie Sun et.al.	2501.02152	null
2025-01-04	Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN	Yanxi Chen et.al.	2501.02146	null
2025-01-03	VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction	Chaoyou Fu et.al.	2501.01957	link
2025-01-03	Metadata Conditioning Accelerates Language Model Pre-training	Tianyu Gao et.al.	2501.01956	link
2025-01-03	MADGEN – Mass-Spec attends to De Novo Molecular generation	Yinkai Wang et.al.	2501.01950	link
2025-01-03	Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap	Weizhi Zhang et.al.	2501.01945	link
2025-01-03	Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models	Manh Duong Nguyen et.al.	2501.01932	link
2025-01-03	Virgo: A Preliminary Exploration on Reproducing o1-like MLLM	Yifan Du et.al.	2501.01904	link
2025-01-03	EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation	Siyuan Huang et.al.	2501.01895	null
2025-01-03	Turning Logic Against Itself: Probing Model Defenses Through Contrastive Questions	Rachneet Sachdeva et.al.	2501.01872	link
2025-01-03	Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification	Xiangxiang Dai et.al.	2501.01849	link
2025-01-03	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	Time Series Language Model for Descriptive Caption Generation	Mohamed Trabelsi et.al.	2501.01832	null
2025-01-03	Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models	Yanjiang Liu et.al.	2501.01830	null
2025-01-03	SDPO: Segment-Level Direct Preference Optimization for Social Agents	Aobo Kong et.al.	2501.01821	link
2025-01-03	BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction	Ferhat Ozgur Catak et.al.	2501.01802	link
2025-01-03	Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation	Mohammad Khalil et.al.	2501.01793	link
2025-01-03	Efficient LLM Inference with Activation Checkpointing and Hybrid Caching	Sanghyeon Lee et.al.	2501.01792	null
2025-01-03	Nonparametric estimation of a factorizable density using diffusion models	Hyeok Kyu Kwon et.al.	2501.01783	null
2025-01-03	SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation	Mingjie Li et.al.	2501.01765	null
2025-01-03	Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models	Andrea Matteazzi et.al.	2501.01761	null
2025-01-03	MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling	Simon Rouard et.al.	2501.01757	null
2025-01-03	Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation	Kangcheng Luo et.al.	2501.01743	null
2025-01-03	How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models	Simone Corbo et.al.	2501.01741	null
2025-01-03	AR4D: Autoregressive 4D Generation from Monocular Videos	Hanxin Zhu et.al.	2501.01722	null
2025-01-03	Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models	Guosheng Zhang et.al.	2501.01720	null
2025-01-03	LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries	Michal Kuk et.al.	2501.01711	null
2025-01-03	MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders	Jiajun Cao et.al.	2501.01709	null
2025-01-03	AgentRefine: Enhancing Agent Generalization through Refinement Tuning	Dayuan Fu et.al.	2501.01702	null
2025-01-03	Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models	Lei Tang et.al.	2501.01679	null
2025-01-03	Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption	Zhang Ruoyan et.al.	2501.01672	null
2025-01-03	BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction	Alaeddine Diaf et.al.	2501.01664	null
2025-01-03	Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning	Danni Peng et.al.	2501.01653	null
2025-01-03	MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments	Cai Yin et.al.	2501.01652	link
2025-01-03	HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding	Heqing Zou et.al.	2501.01645	link
2025-01-03	iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings	Shuhei Tomoshige et.al.	2501.01642	null
2025-01-03	Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation	Rini Smita Thakur et.al.	2501.01640	null
2025-01-03	A non-ergodic framework for understanding emergent capabilities in Large Language Models	Javier Marin et.al.	2501.01638	null
2025-01-03	Revisiting Data Analysis with Pre-trained Foundation Models	Chen Liang et.al.	2501.01631	null
2025-01-03	ICPC: In-context Prompt Compression with Faster Inference	Ziyang Yu et.al.	2501.01625	null
2025-01-03	PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents	Jingoo Lee et.al.	2501.01594	null
2025-01-03	(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges	Mohamed Hisham Abdellatif et.al.	2501.01588	null
2025-01-02	Predicting the Performance of Black-box LLMs through Self-Queries	Dylan Sam et.al.	2501.01558	link
2025-01-02	Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs)	Jumana Almahmoud et.al.	2501.01545	null
2025-01-02	Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information	Rasul Tutnov et.al.	2501.01544	null
2025-01-02	Denoising Diffused Embeddings: a Generative Approach for Hypergraphs	Shihao Wu et.al.	2501.01541	null
2025-01-02	BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery	Kanishk Gandhi et.al.	2501.01540	link
2025-01-02	SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers	Bhavna Gopal et.al.	2501.01529	null
2025-01-02	Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search	Shuangtao Li et.al.	2501.01478	null
2025-01-02	Unifying Specialized Visual Encoders for Video Language Models	Jihoon Chung et.al.	2501.01426	link
2025-01-02	Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models	Jingfeng Yao et.al.	2501.01423	link
2025-01-02	Multi-Modal Video Feature Extraction for Popularity Prediction	Haixu Liu et.al.	2501.01422	null
2025-01-02	Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers	Seunghyun Lee et.al.	2501.01414	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios	Xize Cheng et.al.	2501.01384	null
2025-01-02	ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI	Neda Tavakoli et.al.	2501.01372	link
2025-01-02	Aligning Large Language Models for Faithful Integrity Against Opposing Argument	Yong Zhao et.al.	2501.01336	link
2025-01-02	CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models	Johan Wahréus et.al.	2501.01335	link
2025-01-02	Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension	Yanbo Fang et.al.	2501.01332	null
2025-01-02	The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation	Shuzheng Gao et.al.	2501.01329	null
2025-01-03	Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking	Xiaoxue Cheng et.al.	2501.01306	null
2025-01-02	Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments – The Depression and Anxiety Case	Kaushik Roy et.al.	2501.01305	null
2025-01-02	Does a Large Language Model Really Speak in Human-Like Language?	Mose Park et.al.	2501.01273	null
2025-01-02	ProgCo: Program Helps Self-Correction of Large Language Models	Xiaoshuai Song et.al.	2501.01264	link
2025-01-03	CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings	Shanghaoran Quan et.al.	2501.01257	null
2025-01-02	Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?	Manuel Weber et.al.	2501.01256	null
2025-01-02	Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion	Qiyuan He et.al.	2501.01246	null
2025-01-02	SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization	Yongle Huang et.al.	2501.01245	link
2025-01-02	Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants	Lixiong Qin et.al.	2501.01243	null
2025-01-02	Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction	Alexander Brinkmann et.al.	2501.01237	link
2025-01-03	TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jiayu Li et.al.	2501.01216	null
2025-01-02	Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects	Abdullah Mushtaq et.al.	2501.01205	null
2025-01-02	HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation	Runsong Jia et.al.	2501.01203	null
2025-01-02	LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge	Kyoungkook Kang et.al.	2501.01197	null
2025-01-02	Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education	Annika Bush et.al.	2501.01192	null
2025-01-02	Towards Interactive Deepfake Analysis	Lixiong Qin et.al.	2501.01164	link
2025-01-02	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2025-01-02	A3: Android Agent Arena for Mobile GUI Agents	Yuxiang Chai et.al.	2501.01149	null
2025-01-03	BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference	Wonsuk Jang et.al.	2501.01144	link
2025-01-02	Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method	Ruichen Zhang et.al.	2501.01141	null
2025-01-02	Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning	Shuo Yu et.al.	2501.01124	null
2025-01-02	MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification	Jimin Park et.al.	2501.01110	link
2025-01-03	MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization	Haina Zhu et.al.	2501.01108	link
2025-01-02	Graph Generative Pre-trained Transformer	Xiaohui Chen et.al.	2501.01073	null
2025-01-02	Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models	Yanwen Huang et.al.	2501.01059	null
2025-01-02	Risks of Cultural Erasure in Large Language Models	Rida Qadri et.al.	2501.01056	null
2025-01-02	Dynamic Scaling of Unit Tests for Code Reward Modeling	Zeyao Ma et.al.	2501.01054	null
2025-01-02	Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs	Linhao Huang et.al.	2501.01042	null
2025-01-02	Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models	Bin Wang et.al.	2501.01034	link
2025-01-02	ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning	Wonduk Seo et.al.	2501.01031	null
2025-01-03	KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model	Xinshuo Hu et.al.	2501.01028	link
2025-01-02	MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model	Chengze Zhang et.al.	2501.01014	null
2025-01-02	FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving	Zihao Ye et.al.	2501.01005	link
2025-01-02	Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory	Zhou Yang et.al.	2501.00999	null
2025-01-02	Optimizing Noise Schedules of Generative Models in High Dimensions	Santiago Aranguri et.al.	2501.00988	null
2025-01-02	Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice	Federico Ravenda et.al.	2501.00982	link
2025-01-01	IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs	Junfeng Jiao et.al.	2501.00959	null
2025-01-01	Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors	Junfeng Jiao et.al.	2501.00957	null
2025-01-01	Incremental Dialogue Management: Survey, Discussion, and Implications for HRI	Casey Kennington et.al.	2501.00953	null
2025-01-01	SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering	Shihab Ahmed et.al.	2501.00940	null
2025-01-01	Diffusion Policies for Generative Modeling of Spacecraft Trajectories	Julia Briden et.al.	2501.00915	null
2025-01-01	Aligning LLMs with Domain Invariant Reward Models	David Wu et.al.	2501.00911	link
2025-01-01	Population Aware Diffusion for Time Series Generation	Yang Li et.al.	2501.00910	link
2025-01-01	Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things	Talha Zeeshan et.al.	2501.00906	null
2025-01-01	Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model	Chenyang Liu et.al.	2501.00895	null
2025-01-01	Evaluating Time Series Foundation Models on Noisy Periodic Time Series	Syamantak Datta Gupta et.al.	2501.00889	null
2025-01-01	Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization	Weiqi Wu et.al.	2501.00888	link
2025-01-01	Representation in large language models	Cameron C. Yetman et.al.	2501.00885	null
2025-01-01	Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents	Fouad Bousetouane et.al.	2501.00881	null
2025-01-01	Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction	Teng Hu et.al.	2501.00880	null
2025-01-01	TrustRAG: Enhancing Robustness and Trustworthiness in RAG	Huichi Zhou et.al.	2501.00879	link
2025-01-01	LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models	Hieu Man et.al.	2501.00874	link
2025-01-01	Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation	Mingjia Li et.al.	2501.00873	link
2025-01-01	Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation	Shoutao Guo et.al.	2501.00868	link
2025-01-01	Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era	Mihnea C. Moldoveanu et.al.	2501.00867	null
2025-01-01	Alzheimer’s disease detection based on large language model prompt engineering	Tian Zheng et.al.	2501.00861	null
2025-01-01	LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions	Adam Ishay et.al.	2501.00830	null
2025-01-01	An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems	Haoxiang Tian et.al.	2501.00829	null
2025-01-01	LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management	Yichen Luo et.al.	2501.00826	null
2025-01-01	Multimodal Large Models Are Effective Action Anticipators	Binglu Wang et.al.	2501.00795	link
2025-01-01	Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models	Minhao Bai et.al.	2501.00786	null
2025-01-01	NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model	Yuzhi Lai et.al.	2501.00785	null
2025-01-01	REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization	Huyen Nguyen et.al.	2501.00779	null
2025-01-01	FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation	Qianli Wang et.al.	2501.00777	link
2025-01-01	Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis	Jie Gao et.al.	2501.00775	null
2025-01-01	An AI-powered Bayesian generative modeling approach for causal inference in observational studies	Qiao Liu et.al.	2501.00755	null
2025-01-01	Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform	Cheonsu Jeong et.al.	2501.00750	null
2025-01-01	DIVE: Diversified Iterative Self-Improvement	Yiwei Qin et.al.	2501.00747	link
2025-01-01	Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines	Xiyang Hu et.al.	2501.00745	null
2025-01-01	A Distributional Evaluation of Generative Image Models	Edric Tam et.al.	2501.00744	null
2025-01-01	New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory	Madiha Ajmal et.al.	2501.00721	null
2025-01-01	Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection	Hao Wang et.al.	2501.00700	null
2025-01-01	Adjoint sharding for very long context training of state space models	Xingzi Xu et.al.	2501.00692	null
2025-01-01	Labels Generated by Large Language Model Helps Measuring People’s Empathy in Vitro	Md Rakibul Hasan et.al.	2501.00691	null
2025-01-01	IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently	Florian Dietz et.al.	2501.00684	null
2024-12-31	Grade Inflation in Generative Models	Phuc Nguyen et.al.	2501.00664	null
2024-12-31	Finding Missed Code Size Optimizations in Compilers using LLMs	Davide Italiano et.al.	2501.00655	null
2024-12-31	Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models	Suttisak Wizadwongsa et.al.	2501.00651	null
2024-12-31	Efficient Standardization of Clinical Notes using Large Language Models	Daniel B. Hier et.al.	2501.00644	null
2024-12-31	Enabling New HDLs with Agents	Mark Zakharov et.al.	2501.00642	null
2024-12-31	DreamDrive: Generative 4D Scene Modeling from Street View Images	Jiageng Mao et.al.	2501.00601	null
2024-12-31	VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM	Yuqian Yuan et.al.	2501.00599	link
2024-12-31	Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation	M. Ali Bayram et.al.	2501.00593	null
2024-12-31	Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method	Zhenpeng Huang et.al.	2501.00584	null
2024-12-31	Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders	Yipeng Kang et.al.	2501.00581	null
2024-12-31	AI and Quantum Computing in Binary Photocatalytic Hydrogen Production	Dennis Delali Kwesi Wayo et.al.	2501.00575	null
2024-12-31	VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling	Xinhao Li et.al.	2501.00574	link
2024-12-31	Probing Visual Language Priors in VLMs	Tiange Luo et.al.	2501.00569	null
2024-12-31	Robust and Adaptive Optimization under a Large Language Model Lens	Dimitris Bertsimas et.al.	2501.00568	null
2024-12-30	Distributed Mixture-of-Agents for Edge Inference with Large Language Models	Purbesh Mitra et.al.	2412.21200	link
2024-12-31	HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation	Zhaojian Yu et.al.	2412.21199	link
2024-12-30	The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick	Jonathan Berkheim et.al.	2412.21186	null
2024-12-30	Facilitating large language model Russian adaptation with Learned Embedding Propagation	Mikhail Tikhomirov et.al.	2412.21140	link
2024-12-30	ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation	Ruixuan Liu et.al.	2412.21123	null
2025-01-02	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	null
2024-12-30	Varformer: Adapting VAR’s Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-30	VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Jiazheng Xu et.al.	2412.21059	link
2024-12-30	Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense	Yuyang Zhou et.al.	2412.21051	link
2024-12-30	E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models	Zhiyu Tan et.al.	2412.21044	null
2024-12-30	Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration	Wanglong Lu et.al.	2412.21042	link
2024-12-30	TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization	Chia-Yu Hung et.al.	2412.21037	link
2024-12-30	GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models	Shangyu Xing et.al.	2412.21036	null
2024-12-30	MapQaTor: A System for Efficient Annotation of Map Query Datasets	Mahir Labib Dihan et.al.	2412.21015	link
2024-12-31	Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria	Joonwon Jang et.al.	2412.21006	null
2024-12-30	Plug-and-Play Training Framework for Preference Optimization	Jingyuan Ma et.al.	2412.20996	null
2024-12-30	KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation	Siyuan Fang et.al.	2412.20995	null
2024-12-30	Efficiently Serving LLM Reasoning Programs with Certaindex	Yichao Fu et.al.	2412.20993	null
2024-12-30	QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing	Shlomo Kashani et.al.	2412.20956	null
2024-12-30	AGON: Automated Design Framework for Customizing Processors from ISA Documents	Chongxiao Li et.al.	2412.20954	null
2024-12-30	Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema	Xiaohan Feng et.al.	2412.20942	null
2024-12-30	Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering	Junxiao Xue et.al.	2412.20927	null
2024-12-30	ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation	Ting Zhang et.al.	2412.20901	null
2024-12-30	Towards Compatible Fine-tuning for Vision-Language Model Updates	Zhengbo Wang et.al.	2412.20895	null
2024-12-30	DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models	Xiaolin Hu et.al.	2412.20891	null
2024-12-30	Enhancing Annotated Bibliography Generation with LLM Ensembles	Sergio Bermejo et.al.	2412.20864	null
2024-12-30	Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs’ Memory	Xingjian Tao et.al.	2412.20846	null
2024-12-30	Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment	Jianfei Zhang et.al.	2412.20834	link
2024-12-30	Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model	Runtao Ren et.al.	2412.20820	null
2024-12-30	TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting	Huanyu Zhang et.al.	2412.20810	null
2024-12-30	Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves	Chayan Chatterjee et.al.	2412.20789	null
2024-12-31	SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity	Pengfei Jing et.al.	2412.20787	null
2024-12-30	Large Language Model Enabled Multi-Task Physical Layer Network	Tianyue Zheng et.al.	2412.20772	null
2024-12-30	Attributing Culture-Conditioned Generations to Pretraining Corpora	Huihan Li et.al.	2412.20760	link
2024-12-30	M$^3$oralBench: A MultiModal Moral Benchmark for LVLMs	Bei Yan et.al.	2412.20718	link
2024-12-30	HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images	Sungik Choi et.al.	2412.20704	null
2024-12-30	UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design	Zijie Chen et.al.	2412.20694	link
2024-12-30	Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks	Yuhe Ding et.al.	2412.20682	null
2024-12-30	Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA	Qingyun Jin et.al.	2412.20677	null
2024-12-30	Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner	Yitong Zhou et.al.	2412.20662	link
2024-12-30	Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis	Yousef Yeganeh et.al.	2412.20651	null
2024-12-30	SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy	Md Mahadi Hasan Nahid et.al.	2412.20641	null
2024-12-30	Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble	Yongchang Li et.al.	2412.20637	null
2024-12-30	EVOLVE: Emotion and Visual Output Learning via LLM Evaluation	Jordan Sinclair et.al.	2412.20632	null
2024-12-29	Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study	Yulin Fei et.al.	2412.20613	link
2024-12-29	NLP-based Regulatory Compliance – Using GPT 4.0 to Decode Regulatory Documents	Bimal Kumar et.al.	2412.20602	null
2024-12-29	MATEY: multiscale adaptive foundation models for spatiotemporal physical systems	Pei Zhang et.al.	2412.20601	null
2024-12-29	Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection	Dmitri Roussinov et.al.	2412.20595	link
2024-12-29	Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches	Madhavendra Thakur et.al.	2412.20584	null
2024-12-29	Counterfactual Samples Constructing and Training for Commonsense Statements Estimation	Chong Liu et.al.	2412.20563	null
2024-12-29	Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces	Linglingzhi Zhu et.al.	2412.20556	null
2024-12-29	The Impact of Prompt Programming on Function-Level Code Generation	Ranim Khojah et.al.	2412.20545	link
2024-12-29	Goal-Conditioned Data Augmentation for Offline Reinforcement Learning	Xingshuai Huang et.al.	2412.20519	null
2024-12-29	Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning	Hang Ni et.al.	2412.20505	null
2024-12-29	ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding	Xiao Wang et.al.	2412.20504	link
2024-12-29	TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication	Zongwu Wang et.al.	2412.20501	link
2024-12-29	Multimodal Variational Autoencoder: a Barycentric View	Peijie Qiu et.al.	2412.20487	null
2024-12-29	JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling	Haorui Ji et.al.	2412.20470	null
2024-12-29	Improving Vision-Language-Action Models via Chain-of-Affordance	Jinming Li et.al.	2412.20451	null
2024-12-29	Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs	Pratik Rakesh Singh et.al.	2412.20440	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-29	Unlocking adaptive digital pathology through dynamic feature learning	Jiawen Li et.al.	2412.20430	null
2024-12-29	AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models	Mansi et.al.	2412.20427	null
2024-12-29	Bringing Objects to Life: 4D generation from 3D objects	Ohad Rahamim et.al.	2412.20422	null
2024-12-29	Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection	Kalin Kopanov et.al.	2412.20414	null
2024-12-29	Multi-Objective Large Language Model Unlearning	Zibin Pan et.al.	2412.20412	link
2024-12-29	Open-Sora: Democratizing Efficient Video Production for All	Zangwei Zheng et.al.	2412.20404	link
2024-12-29	Natural Language Fine-Tuning	Jia Liu et.al.	2412.20382	link
2024-12-29	Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs)	Jia Wei Sii et.al.	2412.20381	null
2024-12-29	FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation	Yan Luo et.al.	2412.20374	link
2024-12-29	LLM2: Let Large Language Models Harness System 2 Reasoning	Cheng Yang et.al.	2412.20372	link
2025-01-02	Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey	Junqiao Wang et.al.	2412.20367	null
2024-12-29	HindiLLM: Large Language Model for Hindi	Sanjay Chouhan et.al.	2412.20357	null
2024-12-29	Distilling Desired Comments for Enhanced Code Review with Large Language Models	Yongda Yu et.al.	2412.20340	null
2024-12-29	Mind the Data Gap: Bridging LLMs to Enterprise Data Integration	Moe Kayali et.al.	2412.20331	null
2024-12-29	GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions	Tianyao Shi et.al.	2412.20322	null
2024-12-29	Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain	Shintaro Ozaki et.al.	2412.20309	link
2024-12-28	FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration	Jia Liu et.al.	2412.20297	null
2024-12-28	Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games	Guan-Horng Liu et.al.	2412.20279	null
2024-12-28	Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues	Henry J. Xie et.al.	2412.20264	link
2024-12-28	Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception	Athanasios Karagounis et.al.	2412.20230	null
2024-12-28	LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning	Shuguang Chen et.al.	2412.20227	null
2024-12-28	Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation	Yeonhong Park et.al.	2412.20185	null
2024-12-28	LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System	Hyucksung Kwon et.al.	2412.20166	null
2024-12-28	StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN	Andrzej Bedychaj et.al.	2412.20164	null
2024-12-28	Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems	Minhye Jeon et.al.	2412.20163	null
2024-12-28	Multi-Modality Driven LoRA for Adverse Condition Depth Estimation	Guanglei Yang et.al.	2412.20162	null
2024-12-28	Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses	Xinru Wen et.al.	2412.20154	null
2024-12-28	Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering	Wei Zhou et.al.	2412.20145	null
2024-12-28	TradingAgents: Multi-Agents LLM Financial Trading Framework	Yijia Xiao et.al.	2412.20138	link
2024-12-28	M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation	Zhaopeng Feng et.al.	2412.20127	link
2024-12-28	Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers	Tuomas Hakoniemi et.al.	2412.20114	null
2024-12-28	ST$^3$: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming	Jiedong Zhuang et.al.	2412.20105	null
2024-12-28	On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs	Atmane Ayoub Mansour Bahar et.al.	2412.20087	null
2024-12-31	Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset	Chongjian Yue et.al.	2412.20072	null
2024-12-28	On the Compositional Generalization of Multimodal LLMs for Medical Imaging	Zhenyang Cai et.al.	2412.20070	link
2024-12-28	VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition	Lan Chen et.al.	2412.20064	link
2024-12-28	MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion	Zechao Zhan et.al.	2412.20062	null
2024-12-28	Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts	Yanxin Shen et.al.	2412.20061	null
2024-12-28	“My life is miserable, have to sign 500 autographs everyday”: Exposing Humblebragging, the Brags in Disguise	Sharath Naganna et.al.	2412.20057	null
2024-12-27	Enhancing Whisper’s Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization	Kumud Tripathi et.al.	2412.19785	null
2024-12-27	Can AI Help with Your Personal Finances?	Oudom Hean et.al.	2412.19784	null
2024-12-27	Tensor Network Estimation of Distribution Algorithms	John Gardiner et.al.	2412.19780	null
2024-12-27	Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration	Le Chen et.al.	2412.19770	link
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-27	On dual-projectively equivalent connections associated to second order superintegrable systems	Andreas Vollmer et.al.	2412.19739	null
2024-12-27	Can Large Language Models Adapt to Other Agents In-Context?	Matthew Riemer et.al.	2412.19726	null
2024-12-27	From Elements to Design: A Layered Approach for Automatic Graphic Design Composition	Jiawei Lin et.al.	2412.19712	null
2024-12-27	Toward Adaptive Reasoning in Large Language Models with Thought Rollback	Sijia Chen et.al.	2412.19707	link
2024-12-27	A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization	Jingchun Lian et.al.	2412.19685	null
2024-12-27	Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework	Jiang Liu et.al.	2412.19684	null
2024-12-27	CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs	Siyu Wang et.al.	2412.19663	null
2024-12-27	Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis	Jiaqi Wang et.al.	2412.19654	link
2024-12-27	FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios	Kaiyi Pang et.al.	2412.19652	null
2024-12-27	Xmodel-2 Technical Report	Wang Qun et.al.	2412.19638	link
2024-12-27	IMTP: Search-based Code Generation for In-memory Tensor Programs	Yongwon Shin et.al.	2412.19630	null
2024-12-27	Signatures of prediction during natural listening in MEG data?	Sahel Azizpour et.al.	2412.19622	null
2024-12-27	Gradient Weight-normalized Low-rank Projection for Efficient LLM Training	Jia-Hong Huang et.al.	2412.19616	link
2024-12-27	SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms	Shashank Rao Marpally et.al.	2412.19595	null
2024-12-27	Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following	Yuxiao Yang et.al.	2412.19562	null
2024-12-27	Diverse Rare Sample Generation with Pretrained GANs	Subeen Lee et.al.	2412.19543	link
2024-12-27	Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations	Yuanfei Huang et.al.	2412.19520	link
2024-12-27	Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model	Hyunwoo Cho et.al.	2412.19517	null
2024-12-27	Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs	Zhe Yang et.al.	2412.19513	link
2024-12-27	Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging	Hua Farn et.al.	2412.19512	null
2024-12-27	Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion	Koustav Ghosal et.al.	2412.19510	null
2024-12-27	MBQ: Modality-Balanced Quantization for Large Vision-Language Models	Shiyao Li et.al.	2412.19509	link
2024-12-27	DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT	Xiaotao Hu et.al.	2412.19505	link
2024-12-27	Casevo: A Cognitive Agents and Social Evolution Simulator	Zexun Jiang et.al.	2412.19498	link
2024-12-27	Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation	Chengyang Ye et.al.	2412.19492	link
2024-12-27	Focusing Image Generation to Mitigate Spurious Correlations	Xuewei Li et.al.	2412.19457	null
2024-12-27	Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models	Hyeonseok Moon et.al.	2412.19450	link
2024-12-27	Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models	Shuo Wang et.al.	2412.19449	null
2024-12-27	A Survey on Large Language Model Acceleration based on KV Cache Management	Haoyang Li et.al.	2412.19442	link
2024-12-27	Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback	Seong Jin Lee et.al.	2412.19436	null
2024-12-27	Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints	Alberto Maté et.al.	2412.19424	null
2024-12-27	Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning	Chen Li et.al.	2412.19422	link
2024-12-27	MINIMA: Modality Invariant Image Matching	Xingyu Jiang et.al.	2412.19412	link
2024-12-27	MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios	Jiaqi Fan et.al.	2412.19406	link
2024-12-27	An Engorgio Prompt Makes Large Language Model Babble on	Jianshuo Dong et.al.	2412.19394	link
2024-12-26	Large Language Models for Market Research: A Data-augmentation Approach	Mengxin Wang et.al.	2412.19363	null
2024-12-26	Dynamic Skill Adaptation for Large Language Models	Jiaao Chen et.al.	2412.19361	null
2024-12-26	Identifying Split Vacancies with Foundation Models and Electrostatics	Seán R. Kavanagh et.al.	2412.19330	null
2024-12-26	Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment	Ziang Yan et.al.	2412.19326	link
2024-12-26	Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones	Mehrnaz Mofakhami et.al.	2412.19325	null
2024-12-26	From Interests to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries	Hugh Van Deventer et.al.	2412.19312	link
2024-12-26	Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries	Roberto Amoroso et.al.	2412.19304	null
2024-12-26	RecLM: Recommendation Instruction Tuning	Yangqin Jiang et.al.	2412.19302	link
2024-12-26	RAG with Differential Privacy	Nicolas Grislain et.al.	2412.19291	link
2024-12-26	Time Series Foundational Models: Their Role in Anomaly Detection and Prediction	Chathurangi Shyalika et.al.	2412.19286	link
2024-12-26	PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing	Michael Bezick et.al.	2412.19284	null
2024-12-26	MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes	Asma Ben Abacha et.al.	2412.19260	link
2024-12-26	VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis	Jaemin Jung et.al.	2412.19259	null
2024-12-26	Sentiment trading with large language models	Kemal Kirtac et.al.	2412.19245	null
2024-12-26	SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model	Xuyang Li et.al.	2412.19237	null
2024-12-26	Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining	Yuxin You et.al.	2412.19211	null
2024-12-26	Multi-Attribute Constraint Satisfaction via Language Model Rewriting	Ashutosh Baheti et.al.	2412.19198	null
2024-12-26	Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models	Haonan He et.al.	2412.19191	null
2024-12-26	Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue	Shuzhi Xu et.al.	2412.19154	null
2024-12-26	AskChart: Universal Chart Understanding through Textual Enhancement	Xudong Yang et.al.	2412.19146	link
2024-12-26	SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis	Senbin Zhu et.al.	2412.19140	link
2024-12-26	PlanLLM: Video Procedure Planning with Refinable Large Language Models	Dejie Yang et.al.	2412.19139	link
2024-12-26	Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing	Inpyo Hong et.al.	2412.19125	link
2024-12-26	Discrete vs. Continuous Trade-offs for Generative Models	Jathin Korrapati et.al.	2412.19114	null
2024-12-26	SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values	Yunfan Zhang et.al.	2412.19113	null
2024-12-26	Stochastic normalizing flows for Effective String Theory	Michele Caselle et.al.	2412.19109	null
2024-12-26	“I’ve Heard of You!”: Generate Spoken Named Entity Recognition Data for Unseen Entities	Jiawei Yu et.al.	2412.19102	null
2024-12-26	Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security	Vasileios Alevizos et.al.	2412.19088	null
2024-12-26	Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation	Haotian Qian et.al.	2412.19080	null
2024-12-26	CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers	Jingyi Zheng et.al.	2412.19037	link
2024-12-26	Repository Structure-Aware Training Makes SLMs Better Issue Resolver	Zexiong Ma et.al.	2412.19031	null
2024-12-26	Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation	Yixin Chen et.al.	2412.19026	link
2024-12-26	Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication	Xiqiang Qu et.al.	2412.19025	null
2024-12-26	Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation	Tao Liu et.al.	2412.19021	null
2024-12-26	Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability	Ruixi Lin et.al.	2412.19018	null
2024-12-25	How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study	Alejandro Velasco et.al.	2412.18989	null
2024-12-25	ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement	Zhefan Rao et.al.	2412.18966	null
2024-12-25	Musings About the Future of Search: A Return to the Past?	Jimmy Lin et.al.	2412.18956	null
2024-12-25	A Power-Efficient Hardware Implementation of L-Mul	Ruiqi Chen et.al.	2412.18948	null
2024-12-25	MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models	Kaiwen Zuo et.al.	2412.18947	null
2024-12-25	Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations	Yewon Kim et.al.	2412.18940	null
2024-12-25	Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference	Libo Zhang et.al.	2412.18934	null
2024-12-25	UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation	Lunhao Duan et.al.	2412.18928	null
2024-12-25	Exemplar-condensed Federated Class-incremental Learning	Rui Sun et.al.	2412.18926	null
2024-12-25	Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model	Yi-Chia Chen et.al.	2412.18917	link
2024-12-25	AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures	Situo Zhang et.al.	2412.18910	null
2024-12-25	CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models	Ping Guo et.al.	2412.18890	link
2024-12-25	MotionMap: Representing Multimodality in Human Pose Forecasting	Reyhaneh Hosseininejad et.al.	2412.18883	link
2024-12-25	Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models	Meltem Aksoy et.al.	2412.18863	null
2024-12-25	Improving the Readability of Automatically Generated Tests using Large Language Models	Matteo Biagiola et.al.	2412.18843	null
2024-12-25	LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements	Hao Zhang et.al.	2412.18835	null
2024-12-25	Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition	Shujie Hu et.al.	2412.18832	null
2024-12-25	RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting	Yilei Jiang et.al.	2412.18826	null
2024-12-25	CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection	Wenbin Li et.al.	2412.18820	link
2024-12-25	LLM-assisted vector similarity search	Md Riyadh et.al.	2412.18819	null
2024-12-25	DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search	Lei Yang et.al.	2412.18811	link
2024-12-25	Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation	Xinkai Du et.al.	2412.18800	null
2024-12-25	Torque-Aware Momentum	Pranshu Malviya et.al.	2412.18790	null
2024-12-25	Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models	Yu-An Liu et.al.	2412.18770	link
2024-12-25	The Impact of Input Order Bias on Large Language Models for Software Fault Localization	Md Nakhla Rafi et.al.	2412.18750	null
2024-12-24	Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models	Zehan Wang et.al.	2412.18605	link
2024-12-24	Long-Form Speech Generation with Spoken Language Models	Se Jin Park et.al.	2412.18603	link
2024-12-24	Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems	Fernando Jia et.al.	2412.18601	link
2024-12-24	ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Hongjie Li et.al.	2412.18600	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-24	A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs	OpenMind et.al.	2412.18588	null
2024-12-24	Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control	Sergey Sedov et.al.	2412.18582	null
2024-12-24	Zero-resource Speech Translation and Recognition with LLMs	Karel Mundnich et.al.	2412.18566	null
2024-12-24	Distilling Fine-grained Sentiment Understanding from Large Language Models	Yice Zhang et.al.	2412.18552	link
2024-12-24	Token-Budget-Aware LLM Reasoning	Tingxu Han et.al.	2412.18547	link
2024-12-24	PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction	Xingjian Xu et.al.	2412.18541	null
2024-12-24	Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation	Derong Xu et.al.	2412.18537	link
2024-12-24	Automated Code Review In Practice	Umut Cihan et.al.	2412.18531	null
2024-12-24	Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving	Hao Pang et.al.	2412.18511	null
2024-12-24	Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization	Yi-Fu Fu et.al.	2412.18497	null
2024-12-24	GeFL: Model-Agnostic Federated Learning with Generative Models	Honggu Kang et.al.	2412.18460	null
2024-12-24	3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding	Tatiana Zemskova et.al.	2412.18450	link
2024-12-24	Is Large Language Model Good at Triple Set Prediction? An Empirical Study	Yuan Yuan et.al.	2412.18443	null
2024-12-24	Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm	O. Deniz Akyildiz et.al.	2412.18432	null
2024-12-24	GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent	Kangjia Zhao et.al.	2412.18426	null
2024-12-24	Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models	Zihan Zhou et.al.	2412.18419	null
2024-12-24	Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles	Zihan Wang et.al.	2412.18416	null
2024-12-24	Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English	Avinash Anand et.al.	2412.18415	link
2024-12-24	Discovery of 2D Materials via Symmetry-Constrained Diffusion Model	Shihang Xu et.al.	2412.18414	null
2024-12-24	A Statistical Framework for Ranking LLM-Based Chatbots	Siavash Ameli et.al.	2412.18407	link
2024-12-24	Extract Free Dense Misalignment from CLIP	JeongYeon Nam et.al.	2412.18404	link
2024-12-24	RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction	Wu Xiaoping et.al.	2412.18390	null
2024-12-24	MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs	Qiuyi Gu et.al.	2412.18381	link
2024-12-24	Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents	Kaiwen Ning et.al.	2412.18371	link
2024-12-24	Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering	Zhongjian Hu et.al.	2412.18351	null
2024-12-24	M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models	Jiaxin Guo et.al.	2412.18299	null
2024-12-24	Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight	Xi Ding et.al.	2412.18298	link
2024-12-24	Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases	Christian Di Maio et.al.	2412.18295	null
2024-12-24	DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation	Junyi Lu et.al.	2412.18291	null
2024-12-24	Improved Feature Generating Framework for Transductive Zero-shot Learning	Zihan Ye et.al.	2412.18282	null
2024-12-24	GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications	Zhenzhou Jin et.al.	2412.18281	null
2024-12-24	Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization	Jiacai Liu et.al.	2412.18279	null
2024-12-24	GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge	Shammur Absar Chowdhury et.al.	2412.18274	null
2024-12-24	Annotating References to Mythological Entities in French Literature	Thierry Poibeau et.al.	2412.18270	null
2024-12-24	Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study	Xuefeng Jiang et.al.	2412.18260	link
2024-12-24	AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction	Pufan Zou et.al.	2412.18255	null
2024-12-24	An Automatic Graph Construction Framework based on Large Language Models for Recommendation	Rong Shan et.al.	2412.18241	link
2024-12-24	Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection	Jango Zhang et.al.	2412.18225	null
2024-12-24	Expand VSR Benchmark for VLLM to Expertize in Spatial Rules	Peijin Xie et.al.	2412.18224	link
2024-12-24	ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation	Mengyang Wu et.al.	2412.18216	link
2024-12-24	Adapting Large Language Models for Improving TCP Fairness over WiFi	Shyam Kumar Shrestha et.al.	2412.18200	null
2024-12-24	Robustness-aware Automatic Prompt Optimization	Zeru Shi et.al.	2412.18196	link
2024-12-24	VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks	Shiduo Zhang et.al.	2412.18194	null
2024-12-24	TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization	Yucong Luo et.al.	2412.18185	null
2024-12-24	Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation	Yucong Luo et.al.	2412.18176	null
2024-12-24	INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent	Haohang Li et.al.	2412.18174	null
2024-12-24	Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models	Xiaomeng Hu et.al.	2412.18171	null
2024-12-24	KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management	Rongxin Cheng et.al.	2412.18169	null
2024-12-24	Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence	Yinbin Han et.al.	2412.18164	null
2024-12-24	VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities	Shray Mathur et.al.	2412.18161	null
2024-12-24	Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task	Jinming Liu et.al.	2412.18158	null
2024-12-24	Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance	Yaoyun Zhang et.al.	2412.18157	null
2024-12-24	scReader: Prompting Large Language Models to Interpret scRNA-seq Data	Cong Li et.al.	2412.18156	null
2024-12-24	GeneSUM: Large Language Model-based Gene Summary Extraction	Zhijian Chen et.al.	2412.18154	null
2024-12-24	CoAM: Corpus of All-Type Multiword Expressions	Yusuke Ide et.al.	2412.18151	null
2024-12-24	EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation	Shuhao Han et.al.	2412.18150	link
2024-12-24	Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction	Xiao Guo et.al.	2412.18149	null
2024-12-24	Ensuring Consistency for In-Image Translation	Chengpeng Fu et.al.	2412.18139	null
2024-12-24	LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment	Binrui Zeng et.al.	2412.18135	null
2024-12-24	VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection	Zhaohui Jin et.al.	2412.18124	null
2024-12-24	AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation	Hao Wen et.al.	2412.18116	link
2024-12-24	AIGT: AI Generative Table Based on Prompt	Mingming Zhang et.al.	2412.18111	null
2024-12-24	SlimGPT: Layer-wise Structured Pruning for Large Language Models	Gui Ling et.al.	2412.18110	null
2024-12-24	Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach	Jing Bi et.al.	2412.18108	null
2024-12-24	Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels	Mingcong Song et.al.	2412.18106	null
2024-12-24	EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent	Suyuan Wang et.al.	2412.18100	null
2024-12-24	Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) – a Large Language Model Chatbot for Perioperative Medicine	Yu He Ke et.al.	2412.18096	null
2024-12-24	Molly: Making Large Language Model Agents Solve Python Problem More Logically	Rui Xiao et.al.	2412.18093	null
2024-12-24	Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner	Aizierjiang Aiersilan et.al.	2412.18086	link
2024-12-24	Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models	Xuan Lin et.al.	2412.18084	link
2024-12-24	Improving Factuality with Explicit Working Memory	Mingda Chen et.al.	2412.18069	null
2024-12-24	LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR	Osama Hosam Abdellaif et.al.	2412.18063	link
2024-12-24	Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction	Hyunbae Jeon et.al.	2412.18061	null
2024-12-24	An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM	Wen Wen et.al.	2412.18060	null
2024-12-23	Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations	Maya Patel et.al.	2412.18051	null
2024-12-23	AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data	Mirko Zaffaroni et.al.	2412.18038	link
2024-12-23	Generating refactored code accurately using reinforcement learning	Indranil Palit et.al.	2412.18035	null
2024-12-23	More than Chit-Chat: Developing Robots for Small-Talk Interactions	Rebecca Ramnauth et.al.	2412.18023	null
2024-12-23	Trustworthy and Efficient LLMs Meet Databases	Kyoungmin Kim et.al.	2412.18022	null
2024-12-23	StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs	Hailin Chen et.al.	2412.18011	null
2024-12-23	CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models	Ruibo Tu et.al.	2412.17970	link
2024-12-23	LMV-RPA: Large Model Voting-based Robotic Process Automation	Osama Abdellatif et.al.	2412.17965	link
2024-12-23	Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models	Antony Seabra et.al.	2412.17964	null
2024-12-23	Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models	Ge Zhang et.al.	2412.17963	null
2024-12-23	Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents	Antony Seabra et.al.	2412.17942	null
2024-12-23	BenCzechMark: A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism	Martin Fajcik et.al.	2412.17933	null
2024-12-23	Causal Composition Diffusion Model for Closed-loop Traffic Generation	Haohong Lin et.al.	2412.17920	null
2024-12-23	Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning	Orson Mengara et.al.	2412.17908	null
2024-12-23	LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses	Sara Riazi et.al.	2412.17892	null
2024-12-23	ChatGarment: Garment Estimation, Generation and Editing via Large Language Models	Siyuan Bian et.al.	2412.17811	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806	link
2024-12-23	Automating the Search for Artificial Life with Foundation Models	Akarsh Kumar et.al.	2412.17799	link
2024-12-23	ResearchTown: Simulator of Human Research Community	Haofei Yu et.al.	2412.17767	link
2024-12-23	ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback	Wei Zhang et.al.	2412.17754	null
2024-12-23	Deliberation in Latent Space via Differentiable Cache Augmentation	Luyang Liu et.al.	2412.17747	null
2024-12-23	YuLan-Mini: An Open Data-efficient Language Model	Yiwen Hu et.al.	2412.17743	link
2024-12-23	Reasoning to Attend: Try to Understand How Token Works	Rui Qian et.al.	2412.17741	link
2024-12-23	Knowledge Editing through Chain-of-Thought	Changyue Wang et.al.	2412.17727	link
2024-12-23	Understanding the Logic of Direct Preference Alignment through Logic	Kyle Richardson et.al.	2412.17696	null
2024-12-23	Large Language Model Safety: A Holistic Survey	Dan Shi et.al.	2412.17686	link
2024-12-23	A Bias-Free Training Paradigm for More General AI-generated Image Detection	Fabrizio Guillaro et.al.	2412.17671	null
2024-12-23	Generating Completions for Fragmented Broca’s Aphasic Sentences Using Large Language Models	Sijbren van Vaals et.al.	2412.17669	link
2024-12-23	Detecting anxiety and depression in dialogues: a multi-label and explainable approach	Francisco de Arriba-Pérez et.al.	2412.17651	null
2024-12-23	SCBench: A Sports Commentary Benchmark for Video LLMs	Kuangzhi Ge et.al.	2412.17637	null
2024-12-23	ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance	Renyang Liu et.al.	2412.17632	link
2024-12-23	Tracking the Feature Dynamics in LLM Training: A Mechanistic Study	Yang Xu et.al.	2412.17626	null
2024-12-23	Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models	Parham Rezaei et.al.	2412.17622	link
2024-12-23	Emerging Security Challenges of Large Language Models	Herve Debar et.al.	2412.17614	null
2024-12-23	Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs	Fabrizio Frasca et.al.	2412.17609	null
2024-12-23	EasyTime: Time Series Forecasting Made Easy	Xiangfei Qiu et.al.	2412.17603	null
2024-12-23	LiveIdeaBench: Evaluating LLMs’ Scientific Creativity and Idea Generation with Minimal Context	Kai Ruan et.al.	2412.17596	link
2024-12-23	Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation	Chengbing Wang et.al.	2412.17593	null
2024-12-23	HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data	Ting Zhou et.al.	2412.17574	link
2024-12-23	S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field	Zixi Liang et.al.	2412.17561	link
2024-12-23	GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference	Chao Zeng et.al.	2412.17560	null
2024-12-23	A Survey of Query Optimization in Large Language Models	Mingyang Song et.al.	2412.17558	null
2024-12-23	Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing	Prakash Aryan et.al.	2412.17548	link
2024-12-23	Retention Score: Quantifying Jailbreak Risks for Vision Language Models	Zaitang Li et.al.	2412.17544	null
2024-12-23	Constructing Fair Latent Space for Intersection of Fairness and Explainability	Hyungjun Joo et.al.	2412.17523	null
2024-12-23	DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak	Hao Wang et.al.	2412.17522	null
2024-12-23	Improving the Noise Estimation of Latent Neural Stochastic Differential Equations	Linus Heck et.al.	2412.17499	null
2024-12-23	Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings	Jérémie Sublime et.al.	2412.17486	null
2024-12-23	Power- and Fragmentation-aware Online Scheduling for GPU Datacenters	Francesco Lettich et.al.	2412.17484	link
2024-12-23	A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression	Chenlong Deng et.al.	2412.17483	null
2024-12-23	A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers	Shuaihang Chen et.al.	2412.17481	link
2024-12-23	CALLIC: Content Adaptive Learning for Lossless Image Compression	Daxin Li et.al.	2412.17464	null
2024-12-23	Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning	Xiaodan Chen et.al.	2412.17456	null
2024-12-23	Applying LLM and Topic Modelling in Psychotherapeutic Contexts	Alexander Vanin et.al.	2412.17449	null
2024-12-23	Measuring Contextual Informativeness in Child-Directed Text	Maria Valentini et.al.	2412.17427	link
2024-12-23	Multimodal Preference Data Synthetic Alignment with Reward Model	Robert Wijaya et.al.	2412.17417	link
2024-12-23	VidCtx: Context-aware Video Question Answering with Image Models	Andreas Goulas et.al.	2412.17415	link
2024-12-23	Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance	Muhammad Reza Qorib et.al.	2412.17408	link
2024-12-23	Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning	Huchen Jiang et.al.	2412.17397	null
2024-12-23	WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models	Huawen Feng et.al.	2412.17395	null
2024-12-23	Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement	Hyeonjin Kim et.al.	2412.17387	link
2024-12-23	Interweaving Memories of a Siamese Large Language Model	Xin Song et.al.	2412.17383	link
2024-12-23	MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models	Beibei Yu et.al.	2412.17339	null
2024-12-23	A Dual-Perspective Metaphor Detection Framework Using Large Language Models	Yujie Lin et.al.	2412.17332	link
2024-12-23	Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance	Nicolas Devatine et.al.	2412.17321	null
2024-12-23	CodeV: Issue Resolving with Visual Data	Linhao Zhang et.al.	2412.17315	link
2024-12-23	Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories	Mahan Tafreshipour et.al.	2412.17298	null
2024-12-23	Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples	Taewoong Kim et.al.	2412.17288	link
2024-12-23	LLM4AD: A Platform for Algorithm Design with Large Language Model	Fei Liu et.al.	2412.17287	link
2024-12-23	Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning	Rui Liang et.al.	2412.17285	null
2024-12-23	Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach	Rafid Ishrak Jahan et.al.	2412.17255	link
2024-12-23	SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval	Xiaopeng Li et.al.	2412.17250	null
2024-12-23	EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling	Zichen Song et.al.	2412.17249	null
2024-12-23	On the Generalization Ability of Machine-Generated Text Detectors	Yule Liu et.al.	2412.17242	link
2024-12-23	Brain-to-Text Benchmark ’24: Lessons Learned	Francis R. Willett et.al.	2412.17227	link
2024-12-23	CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder	Lichen Ma et.al.	2412.17225	null
2024-12-22	Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension	Jio Oh et.al.	2412.17189	null
2024-12-22	Foundation Model for Lossy Compression of Spatiotemporal Scientific Data	Xiao Li et.al.	2412.17184	null
2024-12-22	Enhancing Item Tokenization for Generative Recommendation through Self-Improvement	Runjin Chen et.al.	2412.17171	null
2024-12-22	Generative Diffusion Modeling: A Practical Handbook	Zihan Ding et.al.	2412.17162	null
2024-12-22	LLM-based relevance assessment still can’t replace human relevance assessment	Charles L. A. Clarke et.al.	2412.17156	null
2024-12-22	LLM Agent for Fire Dynamics Simulations	Leidong Xu et.al.	2412.17146	null
2024-12-22	Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs	Rushendra Sidibomma et.al.	2412.17131	link
2024-12-22	Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models	Cameron R. Jones et.al.	2412.17128	null
2024-12-22	Learning to Adapt to Low-Resource Paraphrase Generation	Zhigen Li et.al.	2412.17111	null
2024-12-22	DreamOmni: Unified Image Generation and Editing	Bin Xia et.al.	2412.17098	null
2024-12-22	Analysis on LLMs Performance for Code Summarization	Md. Ahnaf Akib et.al.	2412.17094	null
2024-12-22	SAIL: Sample-Centric In-Context Learning for Document Information Extraction	Jinyu Zhang et.al.	2412.17092	link
2024-12-22	SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults	Jinzhi Wang et.al.	2412.17077	null
2024-12-22	The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM’s Internal States	Fabian Ridder et.al.	2412.17056	link
2024-12-22	DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately	Huiwen Wu et.al.	2412.17053	null
2024-12-22	ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues	Shaina Raza et.al.	2412.17052	link
2024-12-22	Modular Conversational Agents for Surveys and Interviews	Jiangbo Yu et.al.	2412.17049	null
2024-12-22	Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective	Hankun Wang et.al.	2412.17048	null
2024-12-22	Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation	Luoxu Jin et.al.	2412.17042	null
2024-12-22	HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories	Eric Hedlin et.al.	2412.17040	null
2024-12-22	Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model	Kevin Shen et.al.	2412.17039	null
2024-12-22	Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models	Lang Gao et.al.	2412.17034	null
2024-12-22	MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge	Jie He et.al.	2412.17032	link
2024-12-22	FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos	Zhengqian Wu et.al.	2412.17022	link
2024-12-22	GAS: Generative Auto-bidding with Post-training Search	Yewen Li et.al.	2412.17018	null
2024-12-22	Robustness of Large Language Models Against Adversarial Attacks	Yiyi Tao et.al.	2412.17011	null
2024-12-22	InterDance: Reactive 3D Dance Generation with Realistic Duet Interactions	Ronghui Li et.al.	2412.16982	null
2024-12-22	On Fusing ChatGPT and Ensemble Learning in Discontinuous Named Entity Recognition in Health Corpora	Tzu-Chieh Chen et.al.	2412.16976	null
2024-12-22	Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs	Alexander von Recum et.al.	2412.16974	null
2024-12-22	Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach	Chunxu Zhang et.al.	2412.16969	link
2024-12-22	System-2 Mathematical Reasoning via Enriched Instruction Tuning	Huanqia Cai et.al.	2412.16964	null
2024-12-22	Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework	Jundong Xu et.al.	2412.16953	null
2024-12-22	A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation	Ekai Hashimoto et.al.	2412.16943	null
2024-12-22	Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering	Zhongjian Hu et.al.	2412.16936	null
2024-12-22	Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models	Kai Zheng et.al.	2412.16933	null
2024-12-22	Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs	Bohan Jin et.al.	2412.16922	null
2024-12-22	Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection	Yuhang Gan et.al.	2412.16918	null
2024-12-22	Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation	Quan Dao et.al.	2412.16906	null
2024-12-22	Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model	Songjun Tu et.al.	2412.16878	link
2024-12-20	HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding	Chenxin Tao et.al.	2412.16158	null
2024-12-20	Can Generative Video Models Help Pose Estimation?	Ruojin Cai et.al.	2412.16155	null
2024-12-20	Offline Reinforcement Learning for LLM Multi-Step Reasoning	Huaijie Wang et.al.	2412.16145	link
2024-12-20	Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation	Seyedreza Mohseni et.al.	2412.16135	null
2024-12-20	Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information	Dirk Bergemann et.al.	2412.16132	null
2024-12-20	PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics	Daniil Larionov et.al.	2412.16120	null
2024-12-20	Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts	Muhammad Abdullah Sohail et.al.	2412.16119	link
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	link
2024-12-20	The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse	Mahyar Habibi et.al.	2412.16114	null
2024-12-20	Logical Consistency of Large Language Models in Fact-checking	Bishwamittra Ghosh et.al.	2412.16100	null
2024-12-20	The Evolution of LLM Adoption in Industry Data Curation Practices	Crystal Qian et.al.	2412.16089	null
2024-12-20	Efficient MedSAMs: Segment Anything in Medical Images on Laptop	Jun Ma et.al.	2412.16085	link
2024-12-20	Formal Mathematical Reasoning: A New Frontier in AI	Kaiyu Yang et.al.	2412.16075	null
2024-12-20	The Only Way is Ethics: A Guide to Ethical Research with Large Language Models	Eddie L. Ungless et.al.	2412.16022	link
2024-12-20	Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support	Qijiong Liu et.al.	2412.15973	link
2024-12-20	From General to Specific: Tailoring Large Language Models for Personalized Healthcare	Ruize Shi et.al.	2412.15957	null
2024-12-20	Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring	Markus Borg et.al.	2412.15948	null
2024-12-20	Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation	Gautier Evennou et.al.	2412.15939	link
2024-12-20	Large Language Model assisted Hybrid Fuzzing	Ruijie Meng et.al.	2412.15931	null
2024-12-20	MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection	Andrea Moglia et.al.	2412.15925	link
2024-12-20	RiTTA: Modeling Event Relations in Text-to-Audio Generation	Yuhang He et.al.	2412.15922	link
2024-12-20	Less is More: Towards Green Code Large Language Models via Unified Structural Pruning	Guang Yang et.al.	2412.15921	null
2024-12-20	Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model	Yosuke Yamagishi et.al.	2412.15907	null
2024-12-20	Evaluation of Reliability Criteria for News Publishers with Large Language Models	Manuel Pratelli et.al.	2412.15896	null
2024-12-20	TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain	Camille Barboule et.al.	2412.15891	null
2024-12-20	AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI	Katja Bühler et.al.	2412.15876	null
2024-12-20	Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback	Jiaming Ji et.al.	2412.15838	link
2024-12-20	WebLLM: A High-Performance In-Browser LLM Inference Engine	Charlie F. Ruan et.al.	2412.15803	link
2024-12-20	Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning	Sungjin Park et.al.	2412.15797	null
2024-12-20	GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning	Heming Zhang et.al.	2412.15790	link
2024-12-20	Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech	Jonathan Heitz et.al.	2412.15772	link
2024-12-20	Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference	Jorge García-Carrasco et.al.	2412.15750	link
2024-12-20	Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models	Shamus Sim et.al.	2412.15748	null
2024-12-20	VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models	Dexter Neo et.al.	2412.15739	null
2024-12-20	AutoLife: Automatic Life Journaling with Smartphones and LLMs	Huatao Xu et.al.	2412.15714	null
2024-12-20	Contrastive Learning for Task-Independent SpeechLLM-Pretraining	Maike Züfle et.al.	2412.15712	link
2024-12-20	Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback	Niklas Ippisch et.al.	2412.15702	null
2024-12-20	Code Review Automation Via Multi-task Federated LLM – An Empirical Study	Jahnavi Kumar et.al.	2412.15676	null
2024-12-20	Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline	Guancheng Zeng et.al.	2412.15660	null
2024-12-20	Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class	Annie D’souza et.al.	2412.15657	link
2024-12-20	MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula	Sieun Hyeon et.al.	2412.15655	link
2024-12-20	Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution	Wentao Tan et.al.	2412.15650	link
2024-12-20	Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model	Xin Du et.al.	2412.15634	link
2024-12-20	Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning?	Mengyu Ye et.al.	2412.15628	null
2024-12-20	JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs	Hongyi Li et.al.	2412.15623	null
2024-12-20	Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage	Zhi Gao et.al.	2412.15606	null
2024-12-20	Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks	Brian J Chan et.al.	2412.15605	link
2024-12-20	Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification	Gyutae Park et.al.	2412.15603	null
2024-12-20	Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation	Xiaoqiang Kang et.al.	2412.15594	link
2024-12-20	NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization	Danial Kamali et.al.	2412.15588	link
2024-12-20	To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models	Jessica Y. Bo et.al.	2412.15584	null
2024-12-20	A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation	Ryien Hosseini et.al.	2412.15582	link
2024-12-20	Score-based Generative Diffusion Models for Social Recommendations	Chengyi Liu et.al.	2412.15579	link
2024-12-20	QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning	Xinyang Tong et.al.	2412.15576	null
2024-12-20	J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM	Takero Yoshida et.al.	2412.15574	null
2024-12-20	Continual Learning Using a Kernel-Based Method Over Foundation Models	Saleh Momeni et.al.	2412.15571	link
2024-12-20	DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation	Yichun Tai et.al.	2412.15570	link
2024-12-20	In-context Continual Learning Assisted by an External Continual Learner	Saleh Momeni et.al.	2412.15563	null
2024-12-20	NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning	Zheyuan Zhang et.al.	2412.15547	null
2024-12-20	MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering	Zhang Siyue et.al.	2412.15540	null
2024-12-20	XRAG: eXamining the Core – Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation	Qianren Mao et.al.	2412.15529	link
2024-12-20	HREF: Human Response-Guided Evaluation of Instruction Following in Language Models	Xinxi Lyu et.al.	2412.15524	link
2024-12-20	PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time	Alireza Pourali et.al.	2412.15519	link
2024-12-20	Stylish and Functional: Guided Interpolation Subject to Physical Constraints	Yan-Ying Chen et.al.	2412.15507	null
2024-12-20	Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework	Zhenjie Xu et.al.	2412.15504	link
2024-12-20	Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models	Zhisheng Tang et.al.	2412.15501	null
2024-12-20	TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use	Junjie Ye et.al.	2412.15495	link
2024-12-20	PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search	Jiaxin Wu et.al.	2412.15494	null
2024-12-20	GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators	Hengjia Li et.al.	2412.15491	null
2024-12-20	Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage	Saehyung Lee et.al.	2412.15484	null
2024-12-20	Continual Learning Using Only Large Language Model Prompting	Jiabao Qiu et.al.	2412.15479	null
2024-12-19	TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models	Ammar N. Abbas et.al.	2412.15462	null
2024-12-19	Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization	Sahil Wadhwa et.al.	2412.15453	null
2024-12-19	AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals	Angela Mastrianni et.al.	2412.15444	null
2024-12-19	SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval	Aakash Mahalingam et.al.	2412.15443	null
2024-12-19	Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models	Tianchen Zhang et.al.	2412.15431	null
2024-12-19	MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale	Swapnil Gandhi et.al.	2412.15411	null
2024-12-19	Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification	Edoardo Allegrini et.al.	2412.15410	null
2024-12-19	Systematic Evaluation of Long-Context LLMs on Financial Concepts	Lavanya Gupta et.al.	2412.15386	null
2024-12-19	Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation	Joanne Boisson et.al.	2412.15375	link
2024-12-19	Automated Root Cause Analysis System for Complex Data Products	Mathieu Demarne et.al.	2412.15374	null
2024-12-19	Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs	Liam Seymour et.al.	2412.15352	link
2024-12-19	Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models	Reza Shirkavand et.al.	2412.15341	link
2024-12-19	Complete background cosmology of parity-even quadratic metric-affine gravity	Thomas Dyer et.al.	2412.15329	null
2024-12-19	OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Shuo Xing et.al.	2412.15208	link
2024-12-19	MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark	Qihao Zhao et.al.	2412.15194	link
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning	Simon Frieder et.al.	2412.15184	null
2024-12-19	STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning	Marius Memmel et.al.	2412.15182	null
2024-12-19	HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages	Aman Chaturvedi et.al.	2412.15178	null
2024-12-19	Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Federico Castagna et.al.	2412.15177	link
2024-12-19	Rethinking Uncertainty Estimation in Natural Language Generation	Lukas Aichberger et.al.	2412.15176	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Language Models as Continuous Self-Evolving Data Engineers	Peidong Wang et.al.	2412.15151	null
2024-12-19	Jet: A Modern Transformer-Based Normalizing Flow	Alexander Kolesnikov et.al.	2412.15129	null
2024-12-19	Adaptive Pruning for Large Language Models with Structural Importance Awareness	Haotian Zheng et.al.	2412.15127	null
2024-12-19	Outcome-Refining Process Supervision for Code Generation	Zhuohao Yu et.al.	2412.15118	link
2024-12-19	Qwen2.5 Technical Report	Qwen et.al.	2412.15115	link
2024-12-19	Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture	Thomas F Burns et.al.	2412.15113	link
2024-12-19	Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation	Yang Tian et.al.	2412.15109	link
2024-12-19	Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability	Xiangsen Chen et.al.	2412.15101	null
2024-12-19	Nano-ESG: Extracting Corporate Sustainability Information from News Articles	Fabian Billert et.al.	2412.15093	link
2024-12-19	Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation	Haoran Liu et.al.	2412.15086	null
2024-12-19	ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots	Bhupendra Acharya et.al.	2412.15072	null
2024-12-19	ConfliBERT: A Language Model for Political Conflict	Patrick T. Brandt et.al.	2412.15060	link
2024-12-19	LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps	Felix Friedrich et.al.	2412.15035	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	link
2024-12-19	Large Language Models and Code Security: A Systematic Literature Review	Enna Basic et.al.	2412.15004	null
2024-12-19	HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs	Pham Vu Tuan Dat et.al.	2412.14995	link
2024-12-19	RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning	Raphael Memmesheimer et.al.	2412.14989	null
2024-12-19	Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts	Ioana Buhnila et.al.	2412.14986	null
2024-12-19	AI and Cultural Context: An Empirical Investigation of Large Language Models’ Performance on Chinese Social Work Professional Standards	Zia Qi et.al.	2412.14971	null
2024-12-19	Movie2Story: A framework for understanding videos and telling stories in the form of novel text	Kangning Li et.al.	2412.14965	null
2024-12-19	Knowledge Injection via Prompt Distillation	Kalle Kujanpää et.al.	2412.14964	null
2024-12-19	Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities	Daniil Medyakov et.al.	2412.14935	null
2024-12-19	RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response	Junyu Luo et.al.	2412.14922	link
2024-12-19	Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation	Zexiong Ma et.al.	2412.14905	null
2024-12-19	Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering	Peize Li et.al.	2412.14880	null
2024-12-19	Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering	Imed Keraghel et.al.	2412.14867	null
2024-12-19	Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling	Junyi Li et.al.	2412.14860	null
2024-12-19	DS$^2$-ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis	Hongling Xu et.al.	2412.14849	link
2024-12-19	Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas	Pietro Bernardelle et.al.	2412.14843	null
2024-12-19	Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis	Greta Dolcetti et.al.	2412.14841	null
2024-12-19	Progressive Multimodal Reasoning via Active Retrieval	Guanting Dong et.al.	2412.14835	null
2024-12-19	Answer Set Networks: Casting Answer Set Programming into Deep Learning	Arseny Skryagin et.al.	2412.14814	link
2024-12-19	ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis	Zeao Tu et.al.	2412.14809	link
2024-12-19	Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning	Ziang Ye et.al.	2412.14780	null
2024-12-19	ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine	Rabee Qasem et.al.	2412.14771	null
2024-12-19	PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children	Yiqun Zhang et.al.	2412.14769	link
2024-12-19	CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering	Ruida Hu et.al.	2412.14764	link
2024-12-19	Query pipeline optimization for cancer patient question answering systems	Maolin He et.al.	2412.14751	null
2024-12-19	Active Inference and Human–Computer Interaction	Roderick Murray-Smith et.al.	2412.14741	null
2024-12-19	On Verbalized Confidence Scores for LLMs	Daniel Yang et.al.	2412.14737	link
2024-12-19	Creation of AI-driven Smart Spaces for Enhanced Indoor Environments – A Survey	Aygün Varol et.al.	2412.14708	null
2024-12-19	LLMs as mediators: Can they diagnose conflicts accurately?	Özgecan Koçak et.al.	2412.14675	null
2024-12-19	Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT	Hassane Kissane et.al.	2412.14670	null
2024-12-19	IOHunter: Graph Foundation Model to Uncover Online Information Operations	Marco Minici et.al.	2412.14663	link
2024-12-19	Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models	Zijun Chen et.al.	2412.14660	link
2024-12-19	Length Controlled Generation for Black-box LLMs	Yuxuan Gu et.al.	2412.14656	null
2024-12-19	Learning to Generate Research Idea with Dynamic Control	Ruochen Li et.al.	2412.14626	null
2024-12-19	How good is GPT at writing political speeches for the White House?	Jacques Savoy et.al.	2412.14617	null
2024-12-19	Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning	Kepu Zhang et.al.	2412.14588	null
2024-12-19	HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning	Minkuk Kim et.al.	2412.14585	null
2024-12-19	Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues	Tao He et.al.	2412.14584	null
2024-12-19	CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation	Youngwon Lee et.al.	2412.14581	null
2024-12-19	DiffSim: Taming Diffusion Models for Evaluating Visual Similarity	Yiren Song et.al.	2412.14580	link
2024-12-19	Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models	Wenhan Liu et.al.	2412.14574	link
2024-12-19	ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model	Shunlin Lu et.al.	2412.14559	null
2024-12-19	The Current Challenges of Software Engineering in the Era of Large Language Models	Cuiyun Gao et.al.	2412.14554	null
2024-12-19	Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models	Xiao Cui et.al.	2412.14528	link
2024-12-19	Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment	Teng Xiao et.al.	2412.14516	link
2024-12-19	Relational Programming with Foundation Models	Ziyang Li et.al.	2412.14515	null
2024-12-19	PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization	Jiayi Wu et.al.	2412.14510	link
2024-12-19	Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs	Yuzuki Arai et.al.	2412.14501	null
2024-12-19	Guided Diffusion Model for Sensor Data Obfuscation	Xin Yang et.al.	2412.14499	null
2024-12-19	FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis	Abdullah Khan et.al.	2412.14492	link
2024-12-19	Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities	Amandeep Kaur et.al.	2412.14486	null
2024-12-19	DirectorLLM for Human-Centric Video Generation	Kunpeng Song et.al.	2412.14484	null
2024-12-19	Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs	Koshiro Saito et.al.	2412.14471	null
2024-12-19	Agent-SafetyBench: Evaluating the Safety of LLM Agents	Zhexin Zhang et.al.	2412.14470	link
2024-12-19	From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research	Xiang Cheng et.al.	2412.14461	null
2024-12-19	LEDiff: Latent Exposure Diffusion for HDR Generation	Chao Wang et.al.	2412.14456	null
2024-12-19	Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems	Genki Kusano et.al.	2412.14454	null
2024-12-19	Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation	Shengqi Liu et.al.	2412.14453	null
2024-12-19	ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study	Eric Modesitt et.al.	2412.14436	link
2024-12-19	All-in-One Tuning and Structural Pruning for Domain-Specific LLMs	Lei Lu et.al.	2412.14426	null
2024-12-19	FedPIA – Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning	Pramit Saha et.al.	2412.14424	null
2024-12-19	Enhancing Diffusion Models for High-Quality Image Generation	Jaineet Shah et.al.	2412.14422	null
2024-12-18	ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers	Haowei Liu et.al.	2412.14405	null
2024-12-18	Clinical Trials Ontology Engineering with Large Language Models	Berkan Çakır et.al.	2412.14387	null
2024-12-18	ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling	William Han et.al.	2412.14373	link
2024-12-18	Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models’ Character Understanding Evaluation	Yuxuan Jiang et.al.	2412.14368	null
2024-12-18	Surrealistic-like Image Generation with Vision-Language Models	Elif Ayten et.al.	2412.14366	link
2024-12-18	ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals	Utkarsh Saxena et.al.	2412.14363	link
2024-12-18	A Unifying Information-theoretic Perspective on Evaluating Generative Models	Alexis Fox et.al.	2412.14340	null
2024-12-18	Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation	Benjamin Steenhoek et.al.	2412.14308	null
2024-12-18	Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs	David Restrepo et.al.	2412.14304	null
2024-12-18	Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data	Shaina Raza et.al.	2412.14276	link
2024-12-18	Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces	Jihan Yang et.al.	2412.14171	link
2024-12-18	MetaMorph: Multimodal Understanding and Generation via Instruction Tuning	Shengbang Tong et.al.	2412.14164	null
2024-12-18	TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks	Frank F. Xu et.al.	2412.14161	link
2024-12-18	Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models	Atin Sakkeer Hussain et.al.	2412.14146	null
2024-12-18	LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research	Tianyang Gu et.al.	2412.14141	null

Video Understanding

Publish Date	Title	Authors	PDF	Code
2025-07-23	Yume: An Interactive World Generation Model	Xiaofeng Mao et.al.	2507.17744	null
2025-07-23	HLFormer: Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning	Li Jun et.al.	2507.17402	null
2025-07-23	EndoGen: Conditional Autoregressive Endoscopic Video Generation	Xinyu Liu et.al.	2507.17388	null
2025-07-22	Toward Scalable Video Narration: A Training-free Approach Using Multimodal Large Language Models	Tz-Ying Wu et.al.	2507.17050	null
2025-07-22	Controllable Hybrid Captioner for Improved Long-form Video Understanding	Kuleen Sasse et.al.	2507.17047	null
2025-07-22	Controllable Video Generation: A Survey	Yue Ma et.al.	2507.16869	null
2025-07-22	MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation	Yanchen Liu et.al.	2507.16310	null
2025-07-22	SPACT18: Spiking Human Action Recognition Benchmark Dataset with Complementary RGB and Thermal Modalities	Yasser Ashraf et.al.	2507.16151	null
2025-07-22	PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation	Yaofang Liu et.al.	2507.16116	null
2025-07-21	Can Your Model Separate Yolks with a Water Bottle? Benchmarking Physical Commonsense Understanding in Video Generation Models	Enes Sanli et.al.	2507.15824	null
2025-07-21	TokensGen: Harnessing Condensed Tokens for Long Video Generation	Wenqi Ouyang et.al.	2507.15728	null
2025-07-21	DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding	Xiaoyi Bao et.al.	2507.15569	null
2025-07-21	Quantifying and Narrowing the Unknown: Interactive Text-to-Video Retrieval via Uncertainty Minimization	Bingqing Zhang et.al.	2507.15504	null
2025-07-21	Prompt-aware of Frame Sampling for Efficient Text-Video Retrieval	Deyu Zhang et.al.	2507.15491	null
2025-07-21	Conditional Video Generation for High-Efficiency Video Compression	Fangqiu Yi et.al.	2507.15269	null
2025-07-20	Enhancing Visual Planning with Auxiliary Tasks and Multi-token Prediction	Ce Zhang et.al.	2507.15130	null
2025-07-20	Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding	Yuanhan Zhang et.al.	2507.15028	null
2025-07-20	U-MARVEL: Unveiling Key Factors for Universal Multimodal Retrieval via Embedding Learning with MLLMs	Xiaojie Li et.al.	2507.14902	null
2025-07-20	LeAdQA: LLM-Driven Context-Aware Temporal Grounding for Video Question Answering	Xinxin Dong et.al.	2507.14784	null
2025-07-19	InterAct-Video: Reasoning-Rich Video QA for Urban Traffic	Joseph Raj Vishal et.al.	2507.14743	null
2025-07-19	BusterX++: Towards Unified Cross-Modal AI-Generated Content Detection and Explanation with MLLM	Haiquan Wen et.al.	2507.14632	null
2025-07-18	Generalist Forecasting with Frozen Video Models via Latent Diffusion	Jacob C Walker et.al.	2507.13942	null
2025-07-18	Team of One: Cracking Complex Video QA with Model Synergy	Jun Xie et.al.	2507.13820	null
2025-07-18	CoTasks: Chain-of-Thought based Video Instruction Tuning Tasks	Yanan Wang et.al.	2507.13609	null
2025-07-17	$\nabla$NABLA: Neighborhood Adaptive Block-Level Attention	Dmitrii Mikhailov et.al.	2507.13546	null
2025-07-17	“PhyWorldBench”: A Comprehensive Evaluation of Physical Realism in Text-to-Video Models	Jing Gu et.al.	2507.13428	null
2025-07-12	Smart Routing for Multimodal Video Retrieval: When to Search What	Kevin Dela Rosa et.al.	2507.13374	null
2025-07-17	VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding	Shihao Wang et.al.	2507.13353	null
2025-07-17	Taming Diffusion Transformer for Real-Time Mobile Video Generation	Yushu Wu et.al.	2507.13343	null
2025-07-17	Leveraging Pre-Trained Visual Models for AI-Generated Video Detection	Keerthi Veeramachaneni et.al.	2507.13224	null
2025-07-17	LoViC: Efficient Long Video Generation with Context Compression	Jiaxiu Jiang et.al.	2507.12952	null
2025-07-17	FIQ: Fundamental Question Generation with the Integration of Question Embeddings for Video Question Answering	Ju-Young Oh et.al.	2507.12816	null
2025-07-17	World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving	Yanchen Guan et.al.	2507.12762	null
2025-07-18	DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition	Hayat Ullah et.al.	2507.12426	null
2025-07-16	Calisthenics Skills Temporal Video Segmentation	Antonio Finocchiaro et.al.	2507.12245	null
2025-07-15	UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks	Peiran Wu et.al.	2507.11336	null
2025-07-15	NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models	X. Feng et.al.	2507.11245	null
2025-07-14	Flows and Diffusions on the Neural Manifold	Daniel Saragih et.al.	2507.10623	null
2025-07-14	EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Mingxian Lin et.al.	2507.10548	null
2025-07-14	Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI	Jiangkai Wu et.al.	2507.10510	null
2025-07-14	DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs	Jiahe Zhao et.al.	2507.10302	null
2025-07-14	ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models	Yongheng Zhang et.al.	2507.09876	null
2025-07-22	VRU-Accident: A Vision-Language Benchmark for Video Question Answering and Dense Captioning for Accident Scene Understanding	Younggun Kim et.al.	2507.09815	null
2025-07-13	ExpStar: Towards Automatic Commentary Generation for Multi-discipline Scientific Experiments	Jiali Chen et.al.	2507.09693	null
2025-07-13	GLIMPSE: Do Large Vision-Language Models Truly Think With Videos or Just Glimpse at Them?	Yiyang Zhou et.al.	2507.09491	null
2025-07-12	$I^{2}$-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting	Zhimin Liao et.al.	2507.09144	null
2025-07-11	Taming generative video models for zero-shot optical flow extraction	Seungwoo Kim et.al.	2507.09082	null
2025-07-23	Infinite Video Understanding	Dell Zhang et.al.	2507.09068	null
2025-07-11	Detecting Deepfake Talking Heads from Facial Biometric Anomalies	Justin D. Norman et.al.	2507.08917	null
2025-07-11	Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective	Hangjie Yuan et.al.	2507.08801	null
2025-07-11	Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers	Wongi Jeong et.al.	2507.08422	null
2025-07-14	M2DAO-Talker: Harmonizing Multi-granular Motion Decoupling and Alternating Optimization for Talking-head Generation	Kui Jiang et.al.	2507.08307	null
2025-07-10	Multi-Granular Spatio-Temporal Token Merging for Training-Free Acceleration of Video LLMs	Jeongseok Hyun et.al.	2507.07990	null
2025-07-10	Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling	Haoyu Wu et.al.	2507.07982	null
2025-07-10	Martian World Models: Controllable Video Synthesis with Physically Accurate 3D Reconstructions	Longfei Li et.al.	2507.07978	null
2025-07-10	Scaling RL to Long Videos	Yukang Chen et.al.	2507.07966	null
2025-07-11	T-GVC: Trajectory-Guided Generative Video Coding at Ultra-Low Bitrates	Zhitao Wang et.al.	2507.07633	null
2025-07-09	A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality	Mohamed Elmoghany et.al.	2507.07202	null
2025-07-02	Discrete Diffusion Models for Language Generation	Ashen Weligalle et.al.	2507.07050	null
2025-07-09	Physics-Grounded Motion Forecasting via Equation Discovery for Trajectory-Guided Image-to-Video Generation	Tao Feng et.al.	2507.06830	null
2025-07-14	Democratizing High-Fidelity Co-Speech Gesture Video Generation	Xu Yang et.al.	2507.06812	null
2025-07-09	PromptTea: Let Prompts Tell TeaCache the Optimal Threshold	Zishen Huang et.al.	2507.06739	null
2025-07-09	FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation	Liqiang Jing et.al.	2507.06523	null
2025-07-18	Hierarchical Multi-Stage Transformer Architecture for Context-Aware Temporal Action Localization	Hayat Ullah et.al.	2507.06411	null
2025-07-08	Bridging Sequential Deep Operator Network and Video Diffusion: Residual Refinement of Spatio-Temporal PDE Solutions	Jaewan Park et.al.	2507.06133	null
2025-07-09	Omni-Video: Democratizing Unified Video Understanding and Generation	Zhiyu Tan et.al.	2507.06119	null
2025-07-08	MCAM: Multimodal Causal Analysis Model for Ego-Vehicle-Level Driving Video Understanding	Tongtong Cheng et.al.	2507.06072	null
2025-07-09	Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation	Zhenghao Zhang et.al.	2507.05963	null
2025-07-14	Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation	Quanzhu Niu et.al.	2507.05948	null
2025-07-08	Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models	Léa Dubois et.al.	2507.05822	null
2025-07-08	MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos	Rongsheng Wang et.al.	2507.05675	null
2025-07-07	Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models	Eunseop Yoon et.al.	2507.04976	null
2025-07-07	HV-MMBench: Benchmarking MLLMs for Human-Centric Video Understanding	Yuxuan Cai et.al.	2507.04909	null
2025-07-07	From Vision To Language through Graph of Events in Space and Time: An Explainable Self-supervised Approach	Mihai Masala et.al.	2507.04815	null
2025-07-07	Music2Palette: Emotion-aligned Color Palette Generation via Cross-Modal Representation Learning	Jiayun Hu et.al.	2507.04758	null
2025-07-07	Identity-Preserving Text-to-Video Generation Guided by Simple yet Effective Spatial-Temporal Decoupled Representations	Yuji Wang et.al.	2507.04705	null
2025-07-07	Tempo-R0: A Video-MLLM for Temporal Video Grounding through Efficient Temporal Sensing Reinforcement Learning	Feng Yue et.al.	2507.04702	null
2025-07-07	VLM2Vec-V2: Advancing Multimodal Embedding for Videos, Images, and Visual Documents	Rui Meng et.al.	2507.04590	null
2025-07-06	MambaVideo for Discrete Video Tokenization with Channel-Split Quantization	Dawit Mureja Argaw et.al.	2507.04559	null
2025-07-06	CLIP-RL: Surgical Scene Segmentation Using Contrastive Language-Vision Pretraining & Reinforcement Learning	Fatmaelzahraa Ali Ahmed et.al.	2507.04317	null
2025-07-06	M$^3$-Med: A Benchmark for Multi-lingual, Multi-modal, and Multi-hop Reasoning in Medical Instructional Video Understanding	Shenxi Liu et.al.	2507.04289	null
2025-07-05	PresentAgent: Multimodal Agent for Presentation Video Generation	Jingwei Shi et.al.	2507.04036	null
2025-07-05	EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation	Rang Meng et.al.	2507.03905	null
2025-07-08	StreamDiT: Real-Time Streaming Text-to-Video Generation	Akio Kodaira et.al.	2507.03745	null
2025-07-04	Multimodal Alignment with Cross-Attentive GRUs for Fine-Grained Video Understanding	Namho Kim et.al.	2507.03531	null
2025-06-28	Iterative Zoom-In: Temporal Interval Exploration for Long Video Understanding	Chenglin Li et.al.	2507.02946	null
2025-07-03	RefTok: Reference-Based Tokenization for Video Generation	Xiang Fan et.al.	2507.02862	null
2025-07-03	Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching	Xin Zhou et.al.	2507.02860	null
2025-07-03	AnyI2V: Animating Any Conditional Image with Motion Control	Ziye Li et.al.	2507.02857	null
2025-07-03	Lost in Latent Space: An Empirical Study of Latent Diffusion Models for Physics Emulation	François Rozet et.al.	2507.02608	null
2025-07-23	AuroraLong: Bringing RNNs Back to Efficient Open-Ended Video Understanding	Weili Xu et.al.	2507.02591	null
2025-07-03	Are Synthetic Videos Useful? A Benchmark for Retrieval-Centric Evaluation of Synthetic Videos	Zecheng Zhao et.al.	2507.02316	null
2025-07-02	Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges	Sanjeda Akter et.al.	2507.02074	null
2025-07-01	Temporal Chain of Thought: Long-Video Understanding by Thinking in Frames	Anurag Arnab et.al.	2507.02001	null
2025-07-02	Kwai Keye-VL Technical Report	Kwai Keye Team et.al.	2507.01949	null
2025-07-09	LongAnimation: Long Animation Generation with Dynamic Global-Local Memory	Nan Chen et.al.	2507.01945	null
2025-07-02	AVC-DPO: Aligned Video Captioning via Direct Preference Optimization	Jiyang Tang et.al.	2507.01492	null
2025-07-02	SD-Acc: Accelerating Stable Diffusion through Phase-aware Sampling and Hardware Co-Optimizations	Zhican Wang et.al.	2507.01309	null
2025-07-02	LLM-based Realistic Safety-Critical Driving Video Generation	Yongjie Fu et.al.	2507.01264	null
2025-07-02	AIGVE-MACS: Unified Multi-Aspect Commenting and Scoring Model for AI-Generated Video Evaluation	Xiao Liu et.al.	2507.01255	null
2025-07-01	Geometry-aware 4D Video Generation for Robot Manipulation	Zeyi Liu et.al.	2507.01099	null
2025-06-30	Embedding-based Retrieval in Multimodal Content Moderation	Hanzhong Liang et.al.	2507.01066	null
2025-07-02	GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning	GLM-V Team et.al.	2507.01006	null
2025-07-01	CAVALRY-V: A Large-Scale Generator Framework for Adversarial Attacks on Video MLLMs	Jiaming Zhang et.al.	2507.00817	null
2025-07-01	Bisecle: Binding and Separation in Continual Learning for Video Language Understanding	Yue Tan et.al.	2507.00469	null
2025-07-01	Populate-A-Scene: Affordance-Aware Human Video Generation	Mengyi Shan et.al.	2507.00334	null
2025-06-30	FreeLong++: Training-Free Long Video Generation via Multi-band SpectralFusion	Yu Lu et.al.	2507.00162	null
2025-06-28	MANTA: Cross-Modal Semantic Alignment and Information-Theoretic Optimization for Long-form Multimodal Understanding	Ziqi Zhong et.al.	2507.00068	null
2025-06-18	Moment Sampling in Video LLMs for Long-Form Video QA	Mustafa Chasmai et.al.	2507.00033	null
2025-06-30	Epona: Autoregressive Diffusion World Model for Autonomous Driving	Kaiwen Zhang et.al.	2506.24113	null
2025-06-30	VMoBA: Mixture-of-Block Attention for Video Diffusion Models	Jianzong Wu et.al.	2506.23858	null
2025-07-03	RGC-VQA: An Exploration Database for Robotic-Generated Video Quality Assessment	Jianing Jin et.al.	2506.23852	null
2025-06-30	Flash-VStream: Efficient Real-Time Understanding for Long Video Streams	Haoji Zhang et.al.	2506.23825	null
2025-06-30	SynMotion: Semantic-Visual Adaptation for Motion Customized Video Generation	Shuai Tan et.al.	2506.23690	null
2025-06-30	ViewPoint: Panoramic Video Generation with Pretrained Diffusion Models	Zixun Fang et.al.	2506.23513	null
2025-06-29	MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition	Yuhuan Yang et.al.	2506.23283	null
2025-06-29	Causal-Entity Reflected Egocentric Traffic Accident Video Synthesis	Lei-lei Li et.al.	2506.23263	null
2025-06-29	RoboScape: Physics-informed Embodied World Model	Yu Shang et.al.	2506.23135	null
2025-06-28	ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment	Amir Aghdam et.al.	2506.22967	null
2025-06-28	Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder	Dang Jisheng et.al.	2506.22880	null
2025-07-01	Listener-Rewarded Thinking in VLMs for Image Preferences	Alexander Gambashidze et.al.	2506.22832	null
2025-07-07	Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs	Shaojie Zhang et.al.	2506.22139	null
2025-06-27	RoboEnvision: A Long-Horizon Video Generation Model for Multi-Task Robot Manipulation	Liudi Yang et.al.	2506.22007	null
2025-06-27	DIVE: Deep-search Iterative Video Exploration A Technical Report for the CVRR Challenge at CVPR 2025	Umihiro Kamoto et.al.	2506.21891	null
2025-06-27	LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs	Boyuan Sun et.al.	2506.21862	null
2025-06-26	SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture	Kehan Sui et.al.	2506.21478	null
2025-06-27	ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models	Hongbo Liu et.al.	2506.21356	null
2025-06-26	HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation	Diego Biagini et.al.	2506.21287	null
2025-06-26	Video Virtual Try-on with Conditional Diffusion Transformer Inpainter	Cheng Zou et.al.	2506.21270	null
2025-06-26	Task-Aware KV Compression For Cost-Effective Long Video Understanding	Minghao Qin et.al.	2506.21184	null
2025-07-08	IPFormer-VideoLLM: Enhancing Multi-modal Video Understanding for Multi-shot Scenes	Yujia Liang et.al.	2506.21116	null
2025-06-27	DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing	Lingling Cai et.al.	2506.20967	null
2025-06-26	Consistent Zero-shot 3D Texture Synthesis Using Geometry-aware Diffusion and Temporal Video Models	Donggoo Kang et.al.	2506.20946	null
2025-06-25	Video Perception Models for 3D Scene Synthesis	Rui Huang et.al.	2506.20601	null
2025-06-25	Dense Video Captioning using Graph-based Sentence Summarization	Zhiwang Zhang et.al.	2506.20583	null
2025-06-25	Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization	Zhiwang Zhang et.al.	2506.20567	null
2025-06-25	BrokenVideos: A Benchmark Dataset for Fine-Grained Artifact Localization in AI-Generated Videos	Jiahao Lin et.al.	2506.20103	null
2025-06-24	Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation	Xingyang Li et.al.	2506.19852	null
2025-06-24	GenHSI: Controllable Generation of Human-Scene Interaction Videos	Zekun Li et.al.	2506.19840	null
2025-06-24	SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution	Liangbin Xie et.al.	2506.19838	null
2025-06-24	Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Router	Yubo Huang et.al.	2506.19833	null
2025-07-07	PEVLM: Parallel Encoding for Vision-Language Models	Letian Kang et.al.	2506.19651	null
2025-06-24	Training-Free Motion Customization for Distilled Video Generators with Adaptive Test-Time Distillation	Jintao Rong et.al.	2506.19348	null
2025-06-24	Video-XL-2: Towards Very Long-Video Understanding Through Task-Aware KV Sparsification	Minghao Qin et.al.	2506.19225	null
2025-07-10	VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory	Runjia Li et.al.	2506.18903	null
2025-06-23	From Virtual Games to Real-World Play	Wenqiang Sun et.al.	2506.18901	null
2025-06-23	FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation	Kaiyi Huang et.al.	2506.18899	null
2025-06-23	MinD: Unified Visual Imagination and Control via Hierarchical World Models	Xiaowei Chi et.al.	2506.18897	null
2025-06-23	Universal Video Temporal Grounding with Generative Multi-modal Large Language Models	Zeqian Li et.al.	2506.18883	null
2025-06-23	OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation	Qijun Gan et.al.	2506.18866	null
2025-06-23	Phantom-Data: Towards a General Subject-Consistent Video Generation Dataset	Zhuowei Chen et.al.	2506.18851	null
2025-06-23	Matrix-Game: Interactive World Foundation Model	Yifan Zhang et.al.	2506.18701	null
2025-06-23	RDPO: Real Data Preference Optimization for Physics Consistency Video Generation	Wenxu Qian et.al.	2506.18655	null
2025-06-23	BulletGen: Improving 4D Reconstruction with Bullet-Time Generation	Denys Rozumnyi et.al.	2506.18601	null
2025-06-23	VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning	Xuanyu Zhang et.al.	2506.18564	null
2025-06-27	MUPA: Towards Multi-Path Agentic Reasoning for Grounded Video Question Answering	Jisheng Dang et.al.	2506.18071	null
2025-06-22	SurgVidLM: Towards Multi-grained Surgical Video Understanding with Large Language Model	Guankun Wang et.al.	2506.17873	null
2025-06-21	CLiViS: Unleashing Cognitive Map through Linguistic-Visual Synergy for Embodied Visual Reasoning	Kailing Li et.al.	2506.17629	null
2025-06-23	Emergent Temporal Correspondences from Video Diffusion Transformers	Jisu Nam et.al.	2506.17220	link
2025-06-20	Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition	Jiaqi Li et.al.	2506.17201	null
2025-06-20	Seeing What Matters: Generalizable AI-generated Video Detection with Forensic-Oriented Augmentation	Riccardo Corvi et.al.	2506.16802	null
2025-06-19	How Far Can Off-the-Shelf Multimodal Large Language Models Go in Online Episodic Memory Question Answering?	Giuseppe Lando et.al.	2506.16450	null
2025-06-19	ViFusion: In-Network Tensor Fusion for Scalable Video Feature Indexing	Yisu Wang et.al.	2506.16258	null
2025-06-19	VideoGAN-based Trajectory Proposal for Automated Vehicles	Annajoyce Mariani et.al.	2506.16209	link
2025-06-19	GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning	Yi Chen et.al.	2506.16141	link
2025-06-19	FastInit: Fast Noise Initialization for Temporally Consistent Video Generation	Chengyu Bai et.al.	2506.16119	null
2025-06-19	PR-DETR: Injecting Position and Relation Prior for Dense Video Captioning	Yizhe Li et.al.	2506.16082	null
2025-06-19	PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models	Tianchen Zhao et.al.	2506.16054	null
2025-06-19	Advanced Sign Language Video Generation with Compressed and Quantized Multi-Condition Tokenization	Cong Wang et.al.	2506.15980	link
2025-06-16	EchoShot: Multi-Shot Portrait Video Generation	Jiahao Wang et.al.	2506.15838	null
2025-06-18	Weakly-supervised VLM-guided Partial Contrastive Learning for Visual Language Navigation	Ruoyu Wang et.al.	2506.15757	null
2025-06-18	InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding	Minsoo Kim et.al.	2506.15745	null
2025-06-20	Sekai: A Video Dataset towards World Exploration	Zhen Li et.al.	2506.15675	null
2025-06-20	Show-o2: Improved Native Unified Multimodal Models	Jinheng Xie et.al.	2506.15564	link
2025-06-18	video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models	Changli Tang et.al.	2506.15220	null
2025-06-17	SIRI-Bench: Challenging VLMs’ Spatial Intelligence through Complex Reasoning Tasks	Zijian Song et.al.	2506.14512	null
2025-06-17	Causally Steered Diffusion for Automated Video Counterfactual Generation	Nikos Spyrou et.al.	2506.14404	link
2025-06-17	EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization	Xiaoqi Wang et.al.	2506.14356	link
2025-06-17	GHz spiking neuromorphic photonic chip with in-situ training	Jinlong Xiang et.al.	2506.14272	null
2025-06-17	CausalDiffTab: Mixed-Type Causal-Aware Diffusion for Tabular Data Generation	Jia-Chen Zhang et.al.	2506.14206	null
2025-06-18	VideoMAR: Autoregressive Video Generation with Continuous Tokens	Hu Yu et.al.	2506.14168	null
2025-06-16	UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions	Zhucun Xue et.al.	2506.13691	null
2025-06-18	AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding	Zhucun Xue et.al.	2506.13589	null
2025-06-16	MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models	Geewook Kim et.al.	2506.13564	null
2025-06-21	STAGE: A Stream-Centric Generative World Model for Long-Horizon Driving-Scene Simulation	Jiamin Wang et.al.	2506.13138	null
2025-06-15	iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer	Zhelun Shen et.al.	2506.12847	null
2025-06-14	Understanding and Benchmarking the Trustworthiness in Multimodal LLMs for Video Understanding	Youze Wang et.al.	2506.12336	null
2025-06-13	Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation	Divyanshu Mishra et.al.	2506.11777	link
2025-06-13	SignAligner: Harmonizing Complementary Pose Modalities for Coherent Sign Language Generation	Xu Wang et.al.	2506.11621	null
2025-06-11	Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search	Linhao Yu et.al.	2506.11155	null
2025-06-11	AlignHuman: Improving Motion and Fidelity via Timestep-Segment Preference Optimization for Audio-Driven Human Animation	Chao Liang et.al.	2506.11144	null
2025-06-12	GenWorld: Towards Detecting AI-generated Real-world Simulation Videos	Weiliang Chen et.al.	2506.10975	null
2025-06-12	M4V: Multi-Modal Mamba for Text-to-Video Generation	Jiancheng Huang et.al.	2506.10915	null
2025-06-15	VideoDeepResearch: Long Video Understanding With Agentic Tool Using	Huaying Yuan et.al.	2506.10821	link
2025-06-12	GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning	Xiaoyi Bao et.al.	2506.10639	null
2025-06-12	DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers	Lizhen Wang et.al.	2506.10568	null
2025-06-12	AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation	Haoyuan Shi et.al.	2506.10540	null
2025-06-12	CogStream: Context-guided Streaming Video Question Answering	Zicheng Zhao et.al.	2506.10516	null
2025-06-12	Can Sound Replace Vision in LLaVA With Token Substitution?	Ali Vosoughi et.al.	2506.10416	null
2025-06-11	Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval	Shubhashis Roy Dipta et.al.	2506.10202	null
2025-06-11	PlayerOne: Egocentric World Simulator	Yuanpeng Tu et.al.	2506.09995	null
2025-06-11	V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning	Mido Assran et.al.	2506.09985	link
2025-06-11	InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions	Zhenzhi Wang et.al.	2506.09984	null
2025-06-11	ReSim: Reliable World Simulation for Autonomous Driving	Jiazhi Yang et.al.	2506.09981	null
2025-06-11	CausalVQA: A Physically Grounded Causal Reasoning Benchmark for Video Models	Aaron Foss et.al.	2506.09943	link
2025-06-11	HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios	Kunyu Peng et.al.	2506.09650	link
2025-06-11	DGAE: Diffusion-Guided Autoencoder for Efficient Latent Representation Learning	Dongxu Liu et.al.	2506.09644	null
2025-06-11	TOGA: Temporally Grounded Open-Ended Video QA with Weak Supervision	Ayush Gupta et.al.	2506.09445	null
2025-06-11	Synthetic Human Action Video Data Generation with Pose Transfer	Vaclav Knapp et.al.	2506.09411	null
2025-06-11	MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed	Jiaqi Samantha Zhan et.al.	2506.09409	null
2025-06-11	Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation	Shanchuan Lin et.al.	2506.09350	null
2025-06-10	Seedance 1.0: Exploring the Boundaries of Video Generation Models	Yu Gao et.al.	2506.09113	null
2025-06-17	FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation	Zheqi He et.al.	2506.09081	link
2025-06-10	VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks	Xinlong Chen et.al.	2506.09079	null
2025-06-10	MagCache: Fast Video Generation with Magnitude-Aware Cache	Zehong Ma et.al.	2506.09045	link
2025-06-18	Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Xuanchi Ren et.al.	2506.09042	link
2025-06-10	HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation	Ziyao Huang et.al.	2506.08797	null
2025-06-10	MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding	Zhiyi Zhu et.al.	2506.08512	null
2025-06-10	How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models	Huixuan Zhang et.al.	2506.08351	null
2025-06-09	Seeing Voices: Generating A-Roll Video from Audio with Mirage	Aditi Sundararaman et.al.	2506.08279	null
2025-06-09	Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion	Xun Huang et.al.	2506.08009	null
2025-06-09	Dreamland: Controllable World Creation with Simulator and Generative Models	Sicheng Mo et.al.	2506.08006	null
2025-06-09	Audio-Sync Video Generation with Multi-Stream Temporal Control	Shuchen Weng et.al.	2506.08003	null
2025-06-09	Generative Modeling of Weights: Generalization or Memorization?	Boya Zeng et.al.	2506.07998	link
2025-06-09	CyberV: Cybernetics for Test-time Scaling in Video Understanding	Jiahao Meng et.al.	2506.07971	link
2025-06-09	Video Unlearning via Low-Rank Refusal Vector	Simone Facchiano et.al.	2506.07891	null
2025-06-09	PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement	Teng Hu et.al.	2506.07848	null
2025-06-09	Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning	Tieyuan Chen et.al.	2506.07811	link
2025-06-13	Consistent Video Editing as Flow-Driven Image-to-Video Generation	Ge Wang et.al.	2506.07713	null
2025-06-16	SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis	Jianhui Wei et.al.	2506.07603	null
2025-06-09	SceneRAG: Scene-level Retrieval-Augmented Generation for Video Understanding	Nianbo Zeng et.al.	2506.07600	null
2025-06-09	Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding	Boyu Chen et.al.	2506.07576	null
2025-06-09	Ambiguity-Restrained Text-Video Representation Learning for Partially Relevant Video Retrieval	CH Cho et.al.	2506.07471	null
2025-06-10	ARGUS: Hallucination and Omission Evaluation in Video-LLMs	Ruchit Rawal et.al.	2506.07371	null
2025-06-10	From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models	Pablo Acuaviva et.al.	2506.07280	null
2025-06-08	TV-LiVE: Training-Free, Text-Guided Video Editing via Layer Informed Vitality Exploitation	Min-Jung Kim et.al.	2506.07205	null
2025-06-08	Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models	Sangwon Jang et.al.	2506.07177	null
2025-06-08	Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion	Huaize Liu et.al.	2506.07136	null
2025-06-08	A Culturally-diverse Multilingual Multimodal Video Benchmark & Model	Bhuiyan Sanjid Shafique et.al.	2506.07032	null
2025-06-13	MAGNET: A Multi-agent Framework for Finding Audio-Visual Needles by Reasoning over Multi-Video Haystacks	Sanjoy Chowdhury et.al.	2506.07016	null
2025-06-07	How Important are Videos for Training Video LLMs?	George Lydakis et.al.	2506.06928	null
2025-06-07	Self-Adapting Improvement Loops for Robotic Learning	Calvin Luo et.al.	2506.06658	null
2025-06-06	Bridging Audio and Vision: Zero-Shot Audiovisual Segmentation by Connecting Pretrained Models	Seung-jae Lee et.al.	2506.06537	null
2025-06-06	ExAct: A Video-Language Benchmark for Expert Action Analysis	Han Yi et.al.	2506.06277	null
2025-06-06	Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision	Yuping He et.al.	2506.06253	null
2025-06-06	VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning	Zikang Wang et.al.	2506.06097	null
2025-06-06	Restereo: Diffusion stereo video generation and restoration	Xingchang Huang et.al.	2506.06023	null
2025-06-06	LLIA – Enabling Low-Latency Interactive Avatars: Real-Time Audio-Driven Portrait Video Generation with Diffusion Models	Haojie Yu et.al.	2506.05806	null
2025-06-06	EASG-Bench: Video Q&A Benchmark with Egocentric Action Scene Graphs	Ivan Rodin et.al.	2506.05787	null
2025-06-05	EX-4D: EXtreme Viewpoint 4D Video Synthesis via Depth Watertight Mesh	Tao Hu et.al.	2506.05554	null
2025-06-05	FRAME: Pre-Training Video Feature Representations via Anticipation and Memory	Sethuraman TV et.al.	2506.05543	null
2025-06-11	ContentV: Efficient Training of Video Generation Models with Limited Compute	Wenfeng Lin et.al.	2506.05343	null
2025-06-05	VideoMolmo: Spatio-Temporal Grounding Meets Pointing	Ghazi Shazan Ahmad et.al.	2506.05336	link
2025-06-05	AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs	Lidong Lu et.al.	2506.05328	null
2025-06-05	Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos	Weifeng Lin et.al.	2506.05302	null
2025-06-05	From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos	Animesh Gupta et.al.	2506.05274	link
2025-06-09	Astraea: A GPU-Oriented Token-wise Acceleration Framework for Video Diffusion Transformers	Haosong Liu et.al.	2506.05096	null
2025-06-05	TextVidBench: A Benchmark for Long Video Scene Text Understanding	Yangyang Zhong et.al.	2506.04983	null
2025-06-05	FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation	Huihan Wang et.al.	2506.04956	link
2025-06-05	APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval	Hong Gao et.al.	2506.04953	null
2025-06-13	DualX-VSR: Dual Axial Spatial $\times$ Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation	Shuo Cao et.al.	2506.04830	null
2025-06-06	FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion	Akide Liu et.al.	2506.04648	null
2025-06-05	Follow-Your-Creation: Empowering 4D Creation through Video Inpainting	Yue Ma et.al.	2506.04590	null
2025-06-04	LayerFlow: A Unified Model for Layer-aware Video Generation	Sihui Ji et.al.	2506.04228	null
2025-06-04	UNIC: Unified In-Context Video Editing	Zixuan Ye et.al.	2506.04216	null
2025-06-05	FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers	Xuanhua He et.al.	2506.04213	null
2025-06-04	DynTok: Dynamic Compression of Visual Tokens for Efficient and Effective Video Understanding	Hongzhi Zhang et.al.	2506.03990	null
2025-06-04	Video, How Do Your Tokens Merge?	Sam Pollard et.al.	2506.03885	null
2025-06-09	BiMa: Towards Biases Mitigation for Text-Video Retrieval via Scene Element Guidance	Huy Le et.al.	2506.03589	null
2025-06-04	Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning	Daeun Lee et.al.	2506.03525	null
2025-06-04	DenseDPO: Fine-Grained Temporal Preference Optimization for Video Diffusion Models	Ziyi Wu et.al.	2506.03517	null
2025-06-04	MamFusion: Multi-Mamba with Temporal Fusion for Partially Relevant Video Retrieval	Xinru Ying et.al.	2506.03473	null
2025-06-03	Seeing the Arrow of Time in Large Multimodal Models	Zihui Xue et.al.	2506.03340	null
2025-06-03	Chipmunk: Training-Free Acceleration of Diffusion Transformers with Dynamic Column-Sparse Deltas	Austin Silveria et.al.	2506.03275	null
2025-05-29	Vid-SME: Membership Inference Attacks against Large Video Understanding Models	Qi Li et.al.	2506.03179	null
2025-06-03	IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation	Yuanze Lin et.al.	2506.03150	null
2025-06-03	Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval	Jiwen Yu et.al.	2506.03141	null
2025-06-03	CamCloneMaster: Enabling Reference-based Camera Control for Video Generation	Yawen Luo et.al.	2506.03140	null
2025-06-03	AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation	Lu Qiu et.al.	2506.03126	null
2025-06-03	DCM: Dual-Expert Consistency Model for Efficient and High-Quality Video Generation	Zhengyao Lv et.al.	2506.03123	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	EgoVLM: Policy Optimization for Egocentric Video Understanding	Ashwin Vinod et.al.	2506.03097	link
2025-06-03	ORV: 4D Occupancy-centric Robot Video Generation	Xiuyu Yang et.al.	2506.03079	link
2025-06-03	Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers	Pengtao Chen et.al.	2506.03065	null
2025-06-03	HaploOmni: Unified Single Transformer for Multimodal Video Understanding and Generation	Yicheng Xiao et.al.	2506.02975	null
2025-06-03	METok: Multi-Stage Event-based Token Compression for Efficient Long Video Understanding	Mengyue Wang et.al.	2506.02850	link
2025-06-04	Go Beyond Earth: Understanding Human Actions and Scenes in Microgravity Environments	Di Wen et.al.	2506.02845	null
2025-06-03	LinkTo-Anime: A 2D Animation Optical Flow Dataset from 3D Model Rendering	Xiaoyi Feng et.al.	2506.02733	null
2025-06-03	LumosFlow: Motion-Guided Long Video Generation	Jiahao Chen et.al.	2506.02497	null
2025-06-05	SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios	Lingwei Dang et.al.	2506.02444	link
2025-06-04	InterRVOS: Interaction-aware Referring Video Object Segmentation	Woojeong Jin et.al.	2506.02356	null
2025-06-02	Motion aware video generative model	Bowen Xue et.al.	2506.02244	null
2025-06-02	Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control	Xiao Fu et.al.	2506.01943	null
2025-06-02	Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency	Hongyu Li et.al.	2506.01908	link
2025-06-02	OmniV2V: Versatile Video Generation and Editing via Dynamic Content Manipulation	Sen Liang et.al.	2506.01801	null
2025-06-02	Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks	Tao Yang et.al.	2506.01758	null
2025-06-02	VideoCap-R1: Enhancing MLLMs for Video Captioning via Structured Thinking	Desen Meng et.al.	2506.01725	null
2025-06-02	Respond Beyond Language: A Benchmark for Video Generation in Response to Realistic User Intents	Shuting Wang et.al.	2506.01689	null
2025-06-02	LongDWM: Cross-Granularity Distillation for Building a Long-Term Driving World Model	Xiaodong Wang et.al.	2506.01546	null
2025-06-02	Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark	Shuyu Yang et.al.	2506.01466	null
2025-06-02	DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion	Geunmin Hwang et.al.	2506.01454	null
2025-06-02	Playing with Transformer at 30+ FPS via Next-Frame Diffusion	Xinle Cheng et.al.	2506.01380	null
2025-06-02	ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding	Yiyang Zhou et.al.	2506.01300	null
2025-06-04	FlowMo: Variance-Based Flow Guidance for Coherent Motion in Video Generation	Ariel Shaulov et.al.	2506.01144	null
2025-06-01	MOOSE: Pay Attention to Temporal Dynamics for Video Understanding via Optical Flows	Hong Nguyen et.al.	2506.01119	null
2025-06-01	DeepVerse: 4D Autoregressive Video Generation as a World Model	Junyi Chen et.al.	2506.01103	null
2025-06-01	Keystep Recognition using Graph Neural Networks	Julia Lee Romero et.al.	2506.01102	null
2025-06-01	Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models	Kinam Kim et.al.	2506.00996	null
2025-06-01	FlexSelect: Flexible Token Selection for Efficient Long Video Understanding	Yunzhu Zhang et.al.	2506.00993	null
2025-06-03	Uneven Event Modeling for Partially Relevant Video Retrieval	Sa Zhu et.al.	2506.00891	null
2025-06-01	SynPO: Synergizing Descriptiveness and Preference Optimization for Video Detailed Captioning	Jisheng Dang et.al.	2506.00835	null
2025-05-31	Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis	Vasilii Korolkov et.al.	2506.00667	null
2025-05-31	Video Signature: In-generation Watermarking for Latent Video Diffusion Models	Yu Huang et.al.	2506.00652	null
2025-05-31	Evaluating Robot Policies in a World Model	Julian Quevedo et.al.	2506.00613	null
2025-05-31	SatDreamer360: Geometry Consistent Street-View Video Generation from Satellite Imagery	Xianghui Ze et.al.	2506.00600	null
2025-05-31	Foresight: Adaptive Layer Reuse for Accelerated and High-Quality Text-to-Video Generation	Muhammad Adnan et.al.	2506.00329	null
2025-05-31	Chain-of-Frames: Advancing Video Understanding in Multimodal LLMs via Frame-Aware Reasoning	Sara Ghazanfari et.al.	2506.00318	null
2025-05-30	PerFormer: A Permutation Based Vision Transformer for Remaining Useful Life Prediction	Zhengyang Fan et.al.	2506.00259	null
2025-05-30	Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes	Anthony Gosselin et.al.	2506.00227	null
2025-05-30	MiniMax-Remover: Taming Bad Noise Helps Video Object Removal	Bojia Zi et.al.	2505.24873	null
2025-05-30	SiLVR: A Simple Language-based Video Reasoning Framework	Ce Zhang et.al.	2505.24869	link
2025-05-30	Time Blindness: Why Video-Language Models Can’t See What Humans Can?	Ujjwal Upadhyay et.al.	2505.24867	null
2025-05-30	VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software	Brandon Man et.al.	2505.24838	link
2025-05-30	DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds	Jiaxu Zhang et.al.	2505.24733	null
2025-05-30	Learning reusable concepts across different egocentric video understanding tasks	Simone Alberto Peirone et.al.	2505.24690	null
2025-05-30	UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation	Yang-Tian Sun et.al.	2505.24521	null
2025-06-04	Grid-LOGAT: Grid Based Local and Global Area Transcription for Video Question Answering	Md Intisar Chowdhury et.al.	2505.24371	null
2025-05-30	VUDG: A Dataset for Video Understanding Domain Generalization	Ziyi Wang et.al.	2505.24346	null
2025-05-30	DisTime: Distribution-based Time Representation for Video Large Language Models	Yingsen Zeng et.al.	2505.24329	link
2025-05-30	Interactive Video Generation via Domain Adaptation	Ishaan Rawal et.al.	2505.24253	null
2025-05-30	STORK: Improving the Fidelity of Mid-NFE Sampling for Diffusion and Flow Matching Models	Zheng Tan et.al.	2505.24210	link
2025-05-30	Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders	Bo Fang et.al.	2505.24158	null
2025-05-29	Multi-RAG: A Multimodal Retrieval-Augmented Generation System for Adaptive Video Understanding	Mingyang Mao et.al.	2505.23990	null
2025-05-29	Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review	Adriano Fragomeni et.al.	2505.23952	null
2025-05-29	ScaleLong: A Multi-Timescale Benchmark for Long Video Understanding	David Ma et.al.	2505.23922	link
2025-05-29	MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection	Yixian Shen et.al.	2505.23870	null
2025-05-29	MAGREF: Masked Guidance for Any-Reference Video Generation	Yufan Deng et.al.	2505.23742	link
2025-05-29	VF-Eval: Evaluating Multimodal LLMs for Generating Feedback on AIGC Videos	Tingyu Song et.al.	2505.23693	link
2025-05-29	VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models	Xiangdong Zhang et.al.	2505.23656	link
2025-05-29	One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory	Chenhao Zheng et.al.	2505.23617	null
2025-05-29	VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation	Shi-Xue Zhang et.al.	2505.23484	link
2025-05-29	VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?	Yuanxin Liu et.al.	2505.23359	link
2025-05-29	Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis	Hengyuan Cao et.al.	2505.23325	null
2025-05-29	RoboTransfer: Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer	Liu Liu et.al.	2505.23171	null
2025-05-29	PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling	Xiao Yu et.al.	2505.23155	link
2025-05-29	Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing	Tongtong Su et.al.	2505.23134	link
2025-05-29	MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation	Siyuan Wang et.al.	2505.23120	link
2025-05-29	GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion	Gwanghyun Kim et.al.	2505.23085	null
2025-05-29	MOVi: Training-free Text-conditioned Multi-Object Video Generation	Aimon Rahman et.al.	2505.22980	null
2025-05-29	HyperMotion: DiT-Based Pose-Guided Human Image Animation of Complex Motions	Shuolin Xu et.al.	2505.22977	link
2025-06-03	ATI: Any Trajectory Instruction for Controllable Video Generation	Angtian Wang et.al.	2505.22944	null
2025-05-28	VidText: Towards Comprehensive Evaluation for Video Text Understanding	Zhoufaran Yang et.al.	2505.22810	link
2025-05-28	Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation	Zhe Kong et.al.	2505.22647	link
2025-05-28	Universal Visuo-Tactile Video Understanding for Embodied Interaction	Yifan Xie et.al.	2505.22566	null
2025-05-28	Fostering Video Reasoning via Next-Event Prediction	Haonan Wang et.al.	2505.22457	null
2025-05-28	Q-VDiT: Towards Accurate Quantization and Distillation of Video-Generation Diffusion Transformers	Weilun Feng et.al.	2505.22167	null
2025-05-28	FaceEditTalker: Interactive Talking Head Generation with Facial Attribute Editing	Guanwen Feng et.al.	2505.22141	null
2025-05-28	LatentMove: Towards Complex Human Movement Video Generation	Ashkan Taghipour et.al.	2505.22046	null
2025-05-28	PanoWan: Lifting Diffusion Video Generation Models to 360° with Latitude/Longitude-aware Mechanisms	Yifei Xia et.al.	2505.22016	null
2025-05-28	Learning World Models for Interactive Video Generation	Taiye Chen et.al.	2505.21996	null
2025-05-27	HDRSDR-VQA: A Subjective Video Quality Dataset for HDR and SDR Comparative Evaluation	Bowen Chen et.al.	2505.21831	null
2025-05-27	Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation	Ke Zhang et.al.	2505.21653	null
2025-05-27	VideoMarkBench: Benchmarking Robustness of Video Watermarking	Zhengyuan Jiang et.al.	2505.21620	link
2025-05-24	Corruption-Aware Training of Latent Video Diffusion Models for Robust Text-to-Video Generation	Chika Maduabuchi et.al.	2505.21545	null
2025-05-27	Frame In-N-Out: Unbounded Controllable Image-to-Video Generation	Boyang Wang et.al.	2505.21491	null
2025-05-27	Dynamic Vision from EEG Brain Recordings: How much does EEG know?	Prajwal Singh et.al.	2505.21385	null
2025-05-28	SageAttention2++: A More Efficient Implementation of SageAttention2	Jintao Zhang et.al.	2505.21136	link
2025-05-29	Minute-Long Videos with Dual Parallelisms	Zeqing Wang et.al.	2505.21070	link
2025-05-27	RainFusion: Adaptive Video Generation Acceleration via Multi-Dimensional Visual Redundancy	Aiyue Chen et.al.	2505.21036	null
2025-05-27	HuMoCon: Concept Discovery for Human Motion Understanding	Qihang Fang et.al.	2505.20920	null
2025-05-27	Frame-Level Captions for Long Video Generation with Complex Multi Scenes	Guangcong Zheng et.al.	2505.20827	null
2025-05-27	Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt	Xiang Zhu et.al.	2505.20795	null
2025-05-27	MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding	Fuwen Luo et.al.	2505.20715	link
2025-05-27	Photography Perspective Composition: Towards Aesthetic Perspective Recommendation	Lujian Yao et.al.	2505.20655	null
2025-05-27	HCQA-1.5 @ Ego4D EgoSchema Challenge 2025	Haoyu Zhang et.al.	2505.20644	null
2025-05-27	Incorporating Flexible Image Conditioning into Text-to-Video Diffusion Models without Training	Bolin Lai et.al.	2505.20629	null
2025-05-26	HoPE: Hybrid of Position Embedding for Length Generalization in Vision-Language Models	Haoran Li et.al.	2505.20444	null
2025-06-03	OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation	Shenghai Yuan et.al.	2505.20292	link
2025-05-26	MotionPro: A Precise Motion Controller for Image-to-Video Generation	Zhongwei Zhang et.al.	2505.20287	null
2025-05-26	TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos	Fanheng Kong et.al.	2505.20124	link
2025-05-26	AdaTP: Attention-Debiased Token Pruning for Video Large Language Models	Fengyuan Sun et.al.	2505.20100	null
2025-06-03	Dynamic-I2V: Exploring Image-to-Video Generation Models via Multimodal LLM	Peng Liu et.al.	2505.19901	null
2025-05-26	Two Causally Related Needles in a Video Haystack	Miaoyu Li et.al.	2505.19853	null
2025-05-26	DriveCamSim: Generalizable Camera Simulation via Explicit Camera Modeling for Autonomous Driving	Wenchao Sun et.al.	2505.19692	link
2025-05-26	TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs	Juntong Wang et.al.	2505.19535	null
2025-05-26	The Role of Video Generation in Enhancing Data-Limited Action Understanding	Wei Li et.al.	2505.19495	null
2025-05-26	Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals	Nate Gillman et.al.	2505.19386	null
2025-05-25	From Single Images to Motion Policies via Video-Generation Environment Representations	Weiming Zhi et.al.	2505.19306	null
2025-05-25	Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs	Xuan Zhang et.al.	2505.19155	null
2025-05-25	SRDiffusion: Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation	Shenggan Cheng et.al.	2505.19151	null
2025-05-25	WorldEval: World Model as Real-World Robot Policies Evaluator	Yaxuan Li et.al.	2505.19017	null
2025-05-24	Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation	Shuo Yang et.al.	2505.18875	null
2025-05-24	VORTA: Efficient Video Diffusion via Routing Sparse Attention	Wenhao Sun et.al.	2505.18809	link
2025-05-24	DVD-Quant: Data-free Video Diffusion Transformers Quantization	Zhiteng Li et.al.	2505.18663	link
2025-05-24	ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos	Xiaodong Wang et.al.	2505.18650	null
2025-05-23	WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions	Zizhang Li et.al.	2505.18151	null
2025-05-28	Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding	Xiaoyi Zhang et.al.	2505.18079	null
2025-05-23	DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation	Junhao Chen et.al.	2505.18078	null
2025-05-23	SafeMVDrive: Multi-view Safety-Critical Driving Video Synthesis in the Real World Domain	Jiawei Zhou et.al.	2505.17727	null
2025-05-23	Scaling Image and Video Generation via Test-Time Evolutionary Search	Haoran He et.al.	2505.17618	null
2025-05-23	InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO	Xueji Fang et.al.	2505.17574	link
2025-05-23	VIBE: Video-to-Text Information Bottleneck Evaluation for TL;DR	Shenghui Chen et.al.	2505.17423	link
2025-05-22	Training-Free Efficient Video Generation via Dynamic Token Carving	Yuechen Zhang et.al.	2505.16864	link
2025-05-22	Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning	Fanrui Zhang et.al.	2505.16836	link
2025-05-22	Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts	Taewon Kang et.al.	2505.16819	null
2025-05-22	Four Eyes Are Better Than Two: Harnessing the Collaborative Potential of Large Models via Differentiated Thinking and Complementary Ensembles	Jun Xie et.al.	2505.16784	null
2025-05-22	SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding	Sushant Gautam et.al.	2505.16630	null
2025-05-22	Temporal Object Captioning for Street Scene Videos from LiDAR Tracks	Vignesh Gopinathan et.al.	2505.16594	null
2025-05-22	MAGIC: Motion-Aware Generative Inference via Confidence-Guided LLM	Siwei Meng et.al.	2505.16456	null
2025-05-31	QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design	Benjamin Schneider et.al.	2505.16175	link
2025-05-21	ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation	Tony Montes et.al.	2505.15928	link
2025-05-23	Challenger: Affordable Adversarial Driving Video Generation	Zhiyuan Xu et.al.	2505.15880	null
2025-05-21	Generative AI for Autonomous Driving: A Review	Katharina Winter et.al.	2505.15863	null
2025-05-25	Interspatial Attention for Efficient 4D Human Video Generation	Ruizhi Shao et.al.	2505.15800	null
2025-05-21	Clapper: Compact Learning and Video Representation in VLMs	Lingyu Kong et.al.	2505.15529	null
2025-05-21	ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning	Ziqiang Xu et.al.	2505.15447	null
2025-05-21	LiveVLM: Efficient Online Video Understanding via Streaming-Oriented KV Cache and Retrieval	Zhenyu Ning et.al.	2505.15269	null
2025-05-21	Leveraging Foundation Models for Multimodal Graph-Based Action Recognition	Fatemeh Ziaeetabar et.al.	2505.15192	null
2025-06-03	AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection	Zhipei Xu et.al.	2505.15173	null
2025-05-21	CineTechBench: A Benchmark for Cinematographic Technique Understanding and Generation	Xinran Wang et.al.	2505.15145	link
2025-05-20	Programmatic Video Prediction Using Large Language Models	Hao Tang et.al.	2505.14948	link
2025-05-17	FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the Edge	Xuan Shen et.al.	2505.14709	link
2025-05-17	DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance	Xuan Shen et.al.	2505.14708	link
2025-05-20	Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers	Sucheng Ren et.al.	2505.14687	link
2025-05-20	VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation	Wentao Ma et.al.	2505.14640	null
2025-05-20	Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models	Xuyang Liu et.al.	2505.14454	link
2025-05-20	Breaking Down Video LLM Benchmarks: Knowledge, Spatial Perception, or True Temporal Understanding?	Bo Feng et.al.	2505.14321	null
2025-05-20	LMP: Leveraging Motion Prior in Zero-Shot Video Generation with Diffusion Transformer	Changgu Chen et.al.	2505.14167	null
2025-05-28	Hunyuan-Game: Industrial-grade Intelligent Game Creation Model	Ruihuang Li et.al.	2505.14135	null
2025-05-20	LoVR: A Benchmark for Long Video Retrieval in Multimodal Contexts	Qifeng Cai et.al.	2505.13928	link
2025-05-20	Domain Adaptation of VLM for Soccer Video Understanding	Tiancheng Jiang et.al.	2505.13860	null
2025-05-20	A Challenge to Build Neuro-Symbolic Video Agents	Sahil Shah et.al.	2505.13851	link
2025-05-19	FinePhys: Fine-grained Human Action Generation by Explicitly Incorporating Physical Laws for Effective Skeletal Guidance	Dian Shao et.al.	2505.13437	null
2025-05-19	Understanding Complexity in VideoQA via Visual Program Generation	Cristobal Eyzaguirre et.al.	2505.13429	null
2025-05-19	MAGI-1: Autoregressive Video Generation at Scale	Sand.ai et.al.	2505.13211	link
2025-05-25	FLASH: Latent-Aware Semi-Autoregressive Speculative Decoding for Multimodal Tasks	Zihua Wang et.al.	2505.12728	link
2025-05-19	DreamGen: Unlocking Generalization in Robot Learning through Neural Trajectories	Joel Jang et.al.	2505.12705	link
2025-05-19	Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking	Zihan Su et.al.	2505.12667	null
2025-05-21	BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation	Haiquan Wen et.al.	2505.12620	link
2025-05-19	Temporal-Oriented Recipe for Transferring Large Vision-Language Model to Video Understanding	Thong Nguyen et.al.	2505.12605	null
2025-05-19	SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models	Bo Liu et.al.	2505.12589	null
2025-05-27	Rebalancing Contrastive Alignment with Learnable Semantic Gaps in Text-Video Retrieval	Jian Xiao et.al.	2505.12499	link
2025-05-21	Video-GPT via Next Clip Diffusion	Shaobin Zhuang et.al.	2505.12489	null
2025-05-18	From Shots to Stories: LLM-Assisted Video Editing with Unified Language Representations	Yuzhi Li et.al.	2505.12237	null
2025-05-17	LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation	Jiarui Wang et.al.	2505.12098	link
2025-05-17	VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption	Tianxiong Zhong et.al.	2505.12053	null
2025-05-23	QVGen: Pushing the Limit of Quantized Video Generative Models	Yushi Huang et.al.	2505.11497	null
2025-05-16	Face Consistency Benchmark for GenAI Video	Michal Podstawski et.al.	2505.11425	null
2025-05-16	Temporally-Grounded Language Generation: A Benchmark for Real-Time Vision-Language Models	Keunwoo Peter Yu et.al.	2505.11326	link
2025-05-14	Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios	Huafeng Shi et.al.	2505.10584	null
2025-05-20	MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation	Yanbo Ding et.al.	2505.10238	link
2025-05-15	ToonifyGB: StyleGAN-based Gaussian Blendshapes for 3D Stylized Head Avatars	Rui-Yang Ju et.al.	2505.10072	null
2025-05-18	EWMBench: Evaluating Scene, Motion, and Semantic Quality in Embodied World Models	Hu Yue et.al.	2505.09694	link
2025-05-15	Generating time-consistent dynamics with discriminator-guided image diffusion models	Philipp Hess et.al.	2505.09089	null
2025-05-13	Generative AI for Autonomous Driving: Frontiers and Opportunities	Yuping Wang et.al.	2505.08854	link
2025-05-13	SkillFormer: Unified Multi-View Video Understanding for Proficiency Estimation	Edoardo Bianchi et.al.	2505.08665	null
2025-05-13	VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models	Pritam Sarkar et.al.	2505.08455	link
2025-05-13	Symbolically-Guided Visual Plan Inference from Uncurated Video Data	Wenyan Yang et.al.	2505.08444	null
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Pixel Motion as Universal Representation for Robot Control	Kanchana Ranasinghe et.al.	2505.07817	null
2025-05-12	Gameplay Highlights Generation	Vignesh Edithal et.al.	2505.07721	null
2025-05-12	ShotAdapter: Text-to-Multi-Shot Video Generation with Diffusion Models	Ozgur Kara et.al.	2505.07652	null
2025-05-16	Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model	Wei Li et.al.	2505.07449	link
2025-05-22	Generative Pre-trained Autoregressive Diffusion Transformer	Yuan Zhang et.al.	2505.07344	null
2025-05-11	Seed1.5-VL Technical Report	Dong Guo et.al.	2505.07062	null
2025-05-11	DAPE: Dual-Stage Parameter-Efficient Fine-Tuning for Consistent Video Editing with Diffusion Models	Junhao Xia et.al.	2505.07057	null
2025-05-11	BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation	Panwen Hu et.al.	2505.06985	null
2025-05-11	Overview of the NLPCC 2025 Shared Task 4: Multi-modal, Multilingual, and Multi-hop Medical Instructional Video Question Answering Challenge	Bin Li et.al.	2505.06814	null
2025-05-10	Jailbreaking the Text-to-Video Generative Models	Jiayang Liu et.al.	2505.06679	null
2025-05-10	ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images	Xianghao Kong et.al.	2505.06537	null
2025-05-08	StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant	Haibo Wang et.al.	2505.05467	null
2025-05-08	T2VTextBench: A Human Evaluation Benchmark for Textual Control in Video Generation Models	Xuyang Guo et.al.	2505.04946	null
2025-05-08	HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation	Teng Hu et.al.	2505.04512	null
2025-05-03	VideoLLM Benchmarks and Evaluation: A Survey	Yogesh Kumar et.al.	2505.03829	null
2025-05-06	Real-Time Person Image Synthesis Using a Flow Matching Model	Jiwoo Jeong et.al.	2505.03562	link
2025-05-06	Transformers for Learning on Noisy and Task-Level Manifolds: Approximation and Generalization Insights	Zhaiming Shen et.al.	2505.03205	null
2025-05-06	RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph	Sameer Malik et.al.	2505.03173	null
2025-05-08	Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection	Sungheon Jeong et.al.	2505.02393	link
2025-05-04	DualReal: Adaptive Joint Training for Lossless Identity-Motion Fusion in Video Customization	Wenchuan Wang et.al.	2505.02192	null
2025-05-03	CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture	Vladimir Frants et.al.	2505.01882	null
2025-05-03	An LLM-Empowered Low-Resolution Vision System for On-Device Human Behavior Understanding	Siyang Jiang et.al.	2505.01743	null
2025-05-03	PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth	Bu Jin et.al.	2505.01729	null
2025-05-02	TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action	Jen-Hao Cheng et.al.	2505.01583	link
2025-05-16	VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding	Zongxia Li et.al.	2505.01481	link
2025-05-02	FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis	Jiangtong Tan et.al.	2505.01172	link
2025-05-01	Controllable Weather Synthesis and Removal with Video Diffusion Models	Chih-Hao Lin et.al.	2505.00704	null
2025-05-01	T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation	Xuyang Guo et.al.	2505.00337	null
2025-05-16	Empowering Agentic Video Analytics Systems with Video Language Models	Yuxuan Yan et.al.	2505.00254	null
2025-04-30	Direct Motion Models for Assessing Generated Videos	Kelsey Allen et.al.	2505.00209	null
2025-04-30	Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis	Michal Geyer et.al.	2505.00135	null
2025-04-30	ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction	Qihao Liu et.al.	2504.21855	null
2025-05-13	HoloTime: Taming Video Diffusion Models for Panoramic 4D Scene Generation	Haiyang Zhou et.al.	2504.21650	link
2025-05-13	SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding	Chenkai Zhang et.al.	2504.21435	link
2025-04-30	Static or Dynamic: Towards Query-Adaptive Token Selection for Video Question Answering	Yumeng Shi et.al.	2504.21403	null
2025-04-30	Simple Visual Artifact Detection in Sora-Generated Videos	Misora Sugiyama et.al.	2504.21334	null
2025-04-30	Capturing Conditional Dependence via Auto-regressive Diffusion Models	Xunpeng Huang et.al.	2504.21314	null
2025-04-29	TesserAct: Learning 4D Embodied World Models	Haoyu Zhen et.al.	2504.20995	null
2025-04-29	DDPS: Discrete Diffusion Posterior Sampling for Paths in Layered Graphs	Hao Luan et.al.	2504.20754	null
2025-04-29	Advance Fake Video Detection via Vision Transformers	Joy Battocchio et.al.	2504.20669	null
2025-04-29	FiLA-Video: Spatio-Temporal Compression for Fine-Grained Long Video Understanding	Yanan Guo et.al.	2504.20384	null
2025-04-30	VideoMultiAgents: A Multi-Agent Framework for Video Question Answering	Noriyuki Kugo et.al.	2504.20091	link
2025-04-28	Learning Streaming Video Representation via Multitask Training	Yibin Yan et.al.	2504.20041	null
2025-04-28	Exploiting Inter-Sample Correlation and Intra-Sample Redundancy for Partially Relevant Video Retrieval	Junlong Ren et.al.	2504.19637	null
2025-04-28	DiVE: Efficient Multi-View Driving Scenes Generation Based on Video Diffusion Transformer	Junpeng Jiang et.al.	2504.19614	null
2025-04-26	Audio-Driven Talking Face Video Generation with Joint Uncertainty Learning	Yifan Xie et.al.	2504.18810	null
2025-04-26	Stealing Creator’s Workflow: A Creator-Inspired Agentic Framework with Iterative Feedback Loop for Improved Scientific Short-form Generation	Jong Inn Park et.al.	2504.18805	null
2025-04-22	DriVerse: Navigation World Model for Driving Simulation via Multimodal Trajectory Prompting and Motion Alignment	Xiaofan Li et.al.	2504.18576	link
2025-04-25	NoiseController: Towards Consistent Multi-view Video Generation via Noise Decomposition and Collaboration	Haotian Dong et.al.	2504.18448	null
2025-04-25	ActionArt: Advancing Multimodal Large Models for Fine-Grained Human-Centric Video Understanding	Yi-Xing Peng et.al.	2504.18152	null
2025-04-24	VEU-Bench: Towards Comprehensive Understanding of Video Editing	Bozheng Li et.al.	2504.17828	null
2025-04-23	Subject-driven Video Generation via Disentangled Identity and Motion	Daneul Kim et.al.	2504.17816	null
2025-04-24	Dynamic Camera Poses and Where to Find Them	Chris Rockwell et.al.	2504.17788	null
2025-04-29	TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation	Ling You et.al.	2504.17365	null
2025-04-24	TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos	Linli Yao et.al.	2504.17343	link
2025-04-24	MV-Crafter: An Intelligent System for Music-guided Video Generation	Chuer Chen et.al.	2504.17267	null
2025-04-24	DIVE: Inverting Conditional Diffusion Models for Discriminative Tasks	Yinqi Li et.al.	2504.17253	link
2025-04-28	MASR: Self-Reflective Reasoning through Multimodal Hierarchical Attention Focusing for Agent-based Video Understanding	Shiwen Cao et.al.	2504.17213	null
2025-04-25	We’ll Fix it in Post: Improving Text-to-Video Generation with Neuro-Symbolic Feedback	Minkyu Choi et.al.	2504.17180	null
2025-05-10	DyMU: Dynamic Merging and Virtual Unmerging for Efficient VLMs	Zhenhailong Wang et.al.	2504.17040	null
2025-04-23	BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation	Ruotong Wang et.al.	2504.16907	null
2025-04-23	ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance	Ying Li et.al.	2504.16464	null
2025-04-23	VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models	Xuming Hu et.al.	2504.16359	null
2025-04-22	MR. Video: “MapReduce” is the Principle for Long Video Understanding	Ziqi Pang et.al.	2504.16082	null
2025-04-22	Survey of Video Diffusion Models: Foundations, Implementations, and Applications	Yimu Wang et.al.	2504.16081	link
2025-04-22	Describe Anything: Detailed Localized Image and Video Captioning	Long Lian et.al.	2504.16072	null
2025-04-22	Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework	Xinyuan Song et.al.	2504.16016	null
2025-04-22	Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning	Wang Lin et.al.	2504.15932	null
2025-04-22	ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting	Jian Hu et.al.	2504.15921	null
2025-04-22	Satellite to GroundScape – Large-scale Consistent Ground View Generation from Satellite Views	Ningli Xu et.al.	2504.15786	null
2025-04-24	Vidi: Large Multimodal Models for Video Understanding and Editing	Vidi Team et.al.	2504.15681	null
2025-05-19	DiTPainter: Efficient Video Inpainting with Diffusion Transformers	Xian Wu et.al.	2504.15661	null
2025-04-21	IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs	David Ma et.al.	2504.15415	link
2025-04-21	Towards Understanding Camera Motions in Any Video	Zhiqiu Lin et.al.	2504.15376	null
2025-04-21	Solving New Tasks by Adapting Internet Video Knowledge	Calvin Luo et.al.	2504.15369	null
2025-04-21	Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models	Guo Chen et.al.	2504.15271	null
2025-04-21	An LMM for Efficient Video Understanding via Reinforced Compression of Video Cubes	Ji Qi et.al.	2504.15270	null
2025-04-21	Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform	Xianpan Zhou et.al.	2504.15182	null
2025-04-30	DyST-XL: Dynamic Layout Planning and Content Control for Compositional Text-to-Video Generation	Weijie He et.al.	2504.15032	null
2025-04-23	Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos	Songping Wang et.al.	2504.14921	null
2025-04-21	Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation	Chenjie Cao et.al.	2504.14899	link
2025-04-21	ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams	Chris Dongjoo Kim et.al.	2504.14875	link
2025-04-20	OmniV-Med: Scaling Medical Vision-Language Model for Universal Visual Understanding	Songtao Jiang et.al.	2504.14692	null
2025-04-20	Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection	Weijun Zhuang et.al.	2504.14553	null
2025-04-20	Are Vision LLMs Road-Ready? A Comprehensive Benchmark for Safety-Critical Driving Video Understanding	Tong Zeng et.al.	2504.14526	link
2025-04-20	Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis	Jingjing Ren et.al.	2504.14470	null
2025-04-20	ResNetVLLM – Multi-modal Vision LLM for the Video Understanding Task	Ahmad Khalil et.al.	2504.14432	null
2025-04-19	SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation	Minho Park et.al.	2504.14396	link
2025-04-19	How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?	Rahul Thapa et.al.	2504.14391	null
2025-04-10	Memory-efficient Streaming VideoLLMs for Real-time Procedural Video Understanding	Dibyadip Chatterjee et.al.	2504.13915	null
2025-04-28	Perception Encoder: The best visual embeddings are not at the output of the network	Daniel Bolya et.al.	2504.13181	null
2025-04-17	PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding	Jang Hyun Cho et.al.	2504.13180	link
2025-04-17	VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models	Haojian Huang et.al.	2504.13122	link
2025-04-21	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-17	Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval	WonJun Moon et.al.	2504.13035	null
2025-04-21	Packing Input Frame Context in Next-Frame Prediction Models for Video Generation	Lvmin Zhang et.al.	2504.12626	link
2025-04-16	VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate	Zhihang Yuan et.al.	2504.12259	link
2025-05-10	FocusedAD: Character-centric Movie Audio Description	Xiaojun Ye et.al.	2504.12157	link
2025-04-16	Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization	Pritam Sarkar et.al.	2504.12083	null
2025-04-16	Modular-Cam: Modular Dynamic Camera-view Video Generation with LLM	Zirui Pan et.al.	2504.12048	null
2025-05-06	The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation	Bingjie Gao et.al.	2504.11739	null
2025-04-17	VideoPanda: Video Panoramic Diffusion with Multi-view Attention	Kevin Xie et.al.	2504.11389	null
2025-04-21	PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild	Henghui Ding et.al.	2504.11326	null
2025-04-15	Video Summarization with Large Language Models	Min Jung Lee et.al.	2504.11199	null
2025-04-15	Towards Efficient Partially Relevant Video Retrieval with Active Moment Discovering	Peipei Song et.al.	2504.10920	link
2025-04-15	InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation	Yukang Lin et.al.	2504.10905	null
2025-04-15	OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding	Dianbing Xi et.al.	2504.10825	null
2025-04-14	H-MoRe: Learning Human-centric Motion Representation for Action Analysis	Zhanbo Huang et.al.	2504.10676	link
2025-04-14	H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models	Yushu Wu et.al.	2504.10567	null
2025-04-14	Multimodal Long Video Modeling Based on Temporal Dynamic Context	Haoran Hao et.al.	2504.10443	link
2025-04-14	FingER: Content Aware Fine-grained Evaluation with Reasoning for AI-Generated Videos	Rui Chen et.al.	2504.10358	null
2025-04-14	Mavors: Multi-granularity Video Representation for Multimodal Large Language Model	Yang Shi et.al.	2504.10068	null
2025-04-14	Aligning Anime Video Generation with Human Feedback	Bingwen Zhu et.al.	2504.10044	null
2025-04-14	EquiVDM: Equivariant Video Diffusion Models with Temporally Consistent Noise	Chao Liu et.al.	2504.09789	null
2025-04-13	TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning	Xingjian Zhang et.al.	2504.09641	link
2025-04-13	CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models	Pooja Guhan et.al.	2504.09472	null
2025-04-12	A Lightweight Moment Retrieval System with Global Re-Ranking and Robust Adaptive Bidirectional Temporal Search	Tinh-Anh Nguyen-Nhu et.al.	2504.09298	null
2025-04-12	VideoAds for Fast-Paced Video Understanding: Where Opensource Foundation Models Beat GPT-4o & Gemini-1.5 Pro	Zheyuan Zhang et.al.	2504.09282	null
2025-05-05	Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Team Seawead et.al.	2504.08685	null
2025-04-11	Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization	Jialu Li et.al.	2504.08641	null
2025-04-11	Diffusion Models for Robotic Manipulation: A Survey	Rosa Wolf et.al.	2504.08438	null
2025-04-11	Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking	Huu-Loc Tran et.al.	2504.08384	null
2025-04-11	EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model	Renda Li et.al.	2504.08344	null
2025-04-15	F$^3$Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos	Zhaoyu Liu et.al.	2504.08222	link
2025-04-11	RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements	Guangcong Zheng et.al.	2504.08212	link
2025-04-11	TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation	Ruineng Li et.al.	2504.08181	null
2025-04-10	Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction	Zeren Jiang et.al.	2504.07961	link
2025-04-17	Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos	Rundong Luo et.al.	2504.07940	null
2025-04-10	SF2T: Self-supervised Fragment Finetuning of Video-LLMs for Fine-Grained Understanding	Yangliu Hu et.al.	2504.07745	null
2025-04-18	Diffusion Transformers for Tabular Data Time Series Generation	Fabrizio Garuti et.al.	2504.07566	link
2025-04-10	VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding	Henghao Zhao et.al.	2504.07519	null
2025-04-10	How Can Objects Help Video-Language Understanding?	Zitian Tang et.al.	2504.07454	null
2025-04-13	VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning	Xinhao Li et.al.	2504.06958	null
2025-04-09	EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation	Diljeet Jagpal et.al.	2504.06861	null
2025-04-09	LVC: A Lightweight Compression Framework for Enhancing VLMs in Long Video Understanding	Ziyi Wang et.al.	2504.06835	null
2025-04-16	DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Wangbo Zhao et.al.	2504.06803	link
2025-04-09	RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism	Elia Peruzzo et.al.	2504.06672	null
2025-04-09	Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception	Ruotian Peng et.al.	2504.06666	null
2025-04-08	From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction	Vladimir Golovkin et.al.	2504.06357	null
2025-04-08	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models	Chejian Xu et.al.	2504.06214	null
2025-04-08	CamContextI2V: Context-aware Controllable Video Generation	Luis Denninger et.al.	2504.06022	link
2025-04-08	Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA	Zijie Song et.al.	2504.05783	null
2025-04-09	Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting	Yunlong Tang et.al.	2504.05541	link
2025-04-07	REEF: Relevance-Aware and Efficient LLM Adapter for Video Understanding	Sakib Reza et.al.	2504.05491	null
2025-04-07	REVEAL: Relation-based Video Representation Learning for Video-Question-Answering	Sofian Chaybouti et.al.	2504.05463	null
2025-04-07	One-Minute Video Generation with Test-Time Training	Karan Dalal et.al.	2504.05298	null
2025-04-07	PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition	Jie Wang et.al.	2504.05075	null
2025-04-07	InstructionBench: An Instructional Video Understanding Benchmark	Haiwan Wei et.al.	2504.05040	null
2025-04-07	Video-Bench: Human-Aligned Video Generation Benchmark	Hui Han et.al.	2504.04907	null
2025-04-12	Unsupervised Ego- and Exo-centric Dense Procedural Activity Captioning via Gaze Consensus Adaptation	Zhaofeng Shi et.al.	2504.04840	null
2025-04-07	TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval	Xiaolun Jing et.al.	2504.04707	link
2025-04-06	Multimodal Lengthy Videos Retrieval Framework and Evaluation Metric	Mohamed Eltahir et.al.	2504.04572	null
2025-04-06	Advancing Egocentric Video Question Answering with Multimodal Large Language Models	Alkesh Patel et.al.	2504.04550	null
2025-04-06	VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT	Zhuo Zhi et.al.	2504.04471	null
2025-04-05	Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization	Yikai Wang et.al.	2504.04153	link
2025-04-05	Multi-identity Human Image Animation with Structural Video Diffusion	Zhenzhi Wang et.al.	2504.04126	null
2025-04-05	Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models	Xuyang Guo et.al.	2504.04051	null
2025-04-05	DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion	Maksim Siniukov et.al.	2504.04010	null
2025-04-10	VideoComp: Advancing Fine-Grained Compositional and Temporal Alignment in Video-Text Models	Dahun Kim et.al.	2504.03970	link
2025-04-02	Proof of Humanity: A Multi-Layer Network Framework for Certifying Human-Originated Content in an AI-Dominated Internet	Sebastian Barros et.al.	2504.03752	null
2025-04-04	Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models	Xuran Ma et.al.	2504.03140	link
2025-04-03	How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models	Pascal Chang et.al.	2504.03072	null
2025-04-03	Morpheus: Benchmarking Physical Reasoning of Video Generative Models with Real Physical Experiments	Chenyu Zhang et.al.	2504.02918	null
2025-04-16	Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets	Chuning Zhu et.al.	2504.02792	null
2025-04-03	Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model	Shengjun Zhang et.al.	2504.02764	null
2025-04-07	Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation	Fa-Ting Hong et.al.	2504.02542	link
2025-04-03	ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer	Jiayi Gao et.al.	2504.02451	link
2025-04-21	Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation	Chuanqi Cheng et.al.	2504.02438	link
2025-04-03	SkyReels-A2: Compose Anything in Video Diffusion Transformers	Zhengcong Fei et.al.	2504.02436	link
2025-04-03	Leveraging Static Relationships for Intra-Type and Inter-Type Message Passing in Video Question Answering	Lili Liang et.al.	2504.02417	null
2025-04-03	Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval	Boseung Jeong et.al.	2504.02397	null
2025-04-04	MG-Gen: Single Image to Motion Graphics Generation with Layer Decomposition	Takahiro Shirakawa et.al.	2504.02361	null
2025-04-03	OmniCam: Unified Multimodal Video Generation via Camera Control	Xiaoda Yang et.al.	2504.02312	null
2025-04-03	Moment Quantization for Video Temporal Grounding	Xiaolong Sun et.al.	2504.02286	null
2025-04-06	Re-thinking Temporal Search for Long-Form Video Understanding	Jinhui Ye et.al.	2504.02259	link
2025-04-02	Aligned Better, Listen Better for Audio-Visual Large Language Models	Yuxin Guo et.al.	2504.02061	null
2025-04-02	WorldPrompter: Traversable Text-to-Scene Generation	Zhaoyang Zhang et.al.	2504.02045	null
2025-04-03	VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step	Hanyang Wang et.al.	2504.01956	null
2025-04-07	Is Temporal Prompting All We Need For Limited Labeled Action Recognition?	Shreyank N Gowda et.al.	2504.01890	null
2025-04-02	Spatial-R1: Enhancing MLLMs in Video Spatial Reasoning	Kun Ouyang et.al.	2504.01805	link
2025-04-03	Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval	Adriano Fragomeni et.al.	2504.01591	null
2025-04-02	TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding	Junwen Pan et.al.	2504.01407	null
2025-04-02	Slow-Fast Architecture for Video Multi-Modal Large Language Models	Min Shi et.al.	2504.01328	link
2025-04-01	Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation	Junyu Xie et.al.	2504.01020	null
2025-04-01	WorldScore: A Unified Evaluation Benchmark for World Generation	Haoyi Duan et.al.	2504.00983	null
2025-04-01	DecoFuse: Decomposing and Fusing the “What”, “Where”, and “How” for Brain-Inspired fMRI-to-Video Decoding	Chong Li et.al.	2504.00432	null
2025-03-31	GazeLLM: Multimodal LLMs incorporating Human Visual Attention	Jun Rekimoto et.al.	2504.00221	null
2025-03-31	Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation	Shengqiong Wu et.al.	2503.24379	null
2025-03-31	Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1	Yi Chen et.al.	2503.24376	link
2025-03-31	DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description	Adrienne Deganutti et.al.	2503.24096	null
2025-04-01	HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation	Boyuan Wang et.al.	2503.24026	null
2025-03-31	H2VU-Benchmark: A Comprehensive Benchmark for Hierarchical Holistic Video Understanding	Qi Wu et.al.	2503.24008	null
2025-03-31	JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation	Fangda Chen et.al.	2503.23951	null
2025-03-31	A SAT-centered XAI method for Deep Learning based Video Understanding	Hojer Key et.al.	2503.23870	null
2025-04-01	On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices	Bosung Kim et.al.	2503.23796	link
2025-03-31	HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation	Kun Liu et.al.	2503.23715	null
2025-03-31	The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning	Mingkai Tian et.al.	2503.23679	null
2025-03-30	VideoGen-Eval: Agent-based System for Video Generation Evaluation	Yuhang Yang et.al.	2503.23452	link
2025-03-30	CA$^2$ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition	Jongseo Lee et.al.	2503.23447	null
2025-03-30	JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization	Kai Liu et.al.	2503.23377	null
2025-04-04	VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior	Xindi Yang et.al.	2503.23368	null
2025-03-30	MoCha: Towards Movie-Grade Talking Character Synthesis	Cong Wei et.al.	2503.23307	null
2025-03-30	SketchVideo: Sketch-based Video Generation and Editing	Feng-Lin Liu et.al.	2503.23284	null
2025-03-29	OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts	Yuxuan Wang et.al.	2503.22952	null
2025-03-28	Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model	Jangho Park et.al.	2503.22622	null
2025-03-28	EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation	Hadrien Reynaud et.al.	2503.22357	null
2025-04-05	CoGen: 3D Consistent Video Generation via Adaptive Conditioning for Autonomous Driving	Yishen Ji et.al.	2503.22231	null
2025-03-28	EgoToM: Benchmarking Theory of Mind Reasoning from Egocentric Videos	Yuxuan Li et.al.	2503.22152	link
2025-03-26	Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations	Haitong Liu et.al.	2503.21824	link
2025-03-27	Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model	Abdelrahman Shaker et.al.	2503.21782	link
2025-03-27	VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models	Chi-Pin Huang et.al.	2503.21781	null
2025-03-27	Exploring the Evolution of Physics Cognition in Video Generation: A Survey	Minghui Lin et.al.	2503.21765	link
2025-03-27	VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness	Dian Zheng et.al.	2503.21755	link
2025-03-27	Audio-driven Gesture Generation via Deviation Feature in the Latent Space	Jiahui Chen et.al.	2503.21616	null
2025-03-27	BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding	Shuming Liu et.al.	2503.21483	link
2025-03-27	Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering	Erika Mori et.al.	2503.21190	null
2025-03-27	ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model	Jinwei Qi et.al.	2503.21144	null
2025-03-26	Synthetic Video Enhances Physical Fidelity in Video Synthesis	Qi Zhao et.al.	2503.20822	null
2025-03-26	BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation	Yulu Pan et.al.	2503.20781	link
2025-03-26	RecTable: Fast Modeling Tabular Data with Rectified Flow	Masane Fuchi et.al.	2503.20731	link
2025-03-26	AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports	Xiangwen Zhang et.al.	2503.20654	null
2025-03-26	GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving	Lloyd Russell et.al.	2503.20523	null
2025-03-26	VPO: Aligning Text-to-Video Generation Models with Prompt Optimization	Jiale Cheng et.al.	2503.20491	link
2025-03-26	From Trial to Triumph: Advancing Long Video Understanding via Visual Context Sample Scaling and Self-reward Alignment	Yucheng Suo et.al.	2503.20472	null
2025-03-26	Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding	Joao Pereira et.al.	2503.20362	null
2025-03-26	Wan: Open and Advanced Large-Scale Video Generative Models	WanTeam et.al.	2503.20314	link
2025-03-29	Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models	Prin Phunyaphibarn et.al.	2503.20240	null
2025-03-26	Video Motion Graphs	Haiyang Liu et.al.	2503.20218	null
2025-03-25	Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors	Yuke Lou et.al.	2503.20118	null
2025-03-25	Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals	Stefan Stojanov et.al.	2503.19953	null
2025-03-25	ACVUBench: Audio-Centric Video Understanding Benchmark	Yudong Yang et.al.	2503.19951	link
2025-03-25	FuXi-RTM: A Physics-Guided Prediction Framework with Radiative Transfer Modeling	Qiusheng Huang et.al.	2503.19940	null
2025-03-25	FullDiT: Multi-Task Video Generative Foundation Model with Full Attention	Xuan Ju et.al.	2503.19907	null
2025-03-25	Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation	Tianhao Qi et.al.	2503.19881	null
2025-03-25	AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers	Jiazhi Guan et.al.	2503.19824	null
2025-03-25	PAVE: Patching and Adapting Video Large Language Models	Zhuoming Liu et.al.	2503.19794	link
2025-03-31	Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations	Jungin Park et.al.	2503.19706	link
2025-03-25	Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation	Hongcheng Gao et.al.	2503.19622	link
2025-03-25	AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset	Haiyu Zhang et.al.	2503.19462	null
2025-03-26	Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing	Jaihoon Kim et.al.	2503.19385	null
2025-03-25	MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation	Yukang Lin et.al.	2503.19383	null
2025-03-26	EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models	Yufei Cai et.al.	2503.19369	link
2025-03-25	Long-Context Autoregressive Video Modeling with Next-Frame Prediction	Yuchao Gu et.al.	2503.19325	link
2025-03-24	Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval	Arun Reddy et.al.	2503.19009	null
2025-03-25	Aether: Geometric-Aware Unified World Modeling	Aether Team et.al.	2503.18945	null
2025-03-27	SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding	Mingze Xu et.al.	2503.18943	null
2025-04-01	Video-T1: Test-Time Scaling for Video Generation	Fangfu Liu et.al.	2503.18942	null
2025-03-27	Training-free Diffusion Acceleration with Bottleneck Sampling	Ye Tian et.al.	2503.18940	null
2025-03-24	CRCL: Causal Representation Consistency Learning for Anomaly Detection in Surveillance Videos	Yang Liu et.al.	2503.18808	null
2025-03-24	Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks	Nina Shvetsova et.al.	2503.18637	null
2025-03-25	AMD-Hummingbird: Towards an Efficient Text-to-Video Model	Takashi Isobe et.al.	2503.18559	link
2025-03-24	EvAnimate: Event-conditioned Image-to-Video Generation for Human Animation	Qiang Qu et.al.	2503.18552	null
2025-03-24	Can Text-to-Video Generation help Video-Language Alignment?	Luca Zanella et.al.	2503.18507	null
2025-03-24	Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding	Xiangrui Liu et.al.	2503.18478	null
2025-03-24	Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation	Dingcheng Zhen et.al.	2503.18429	null
2025-03-24	Breaking the Encoder Barrier for Seamless Video-Language Understanding	Handong Li et.al.	2503.18422	null
2025-03-25	VTD-CLIP: Video-to-Text Discretization via Prompting CLIP	Wencheng Zhu et.al.	2503.18407	null
2025-03-24	Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance	Sicong Feng et.al.	2503.18386	null
2025-03-23	MammAlps: A multi-view video behavior monitoring dataset of wild mammals in the Swiss Alps	Valentin Gabeff et.al.	2503.18223	link
2025-03-23	LongDiff: Training-Free Long Video Generation in One Go	Zhuoling Li et.al.	2503.18150	null
2025-03-23	TransAnimate: Taming Layer Diffusion to Generate RGBA Video	Xuewei Chen et.al.	2503.17934	null
2025-03-22	4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding	Wenxuan Zhu et.al.	2503.17827	link
2025-03-22	V2P-Bench: Evaluating Video-Language Understanding with Visual Prompts for Better Human-Model Interaction	Yiming Zhao et.al.	2503.17736	link
2025-03-22	RDTF: Resource-efficient Dual-mask Training Framework for Multi-frame Animated Sticker Generation	Zhiqiang Yuan et.al.	2503.17735	null
2025-03-22	Collaborative Temporal Consistency Learning for Point-supervised Natural Language Video Localization	Zhuo Tao et.al.	2503.17651	null
2025-03-21	Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks	Bhishma Dedhia et.al.	2503.17539	null
2025-03-21	Enhancing Subsequent Video Retrieval via Vision-Language Models (VLMs)	Yicheng Duan et.al.	2503.17415	link
2025-03-21	Position: Interactive Generative Video as Next-Generation Game Engine	Jiwen Yu et.al.	2503.17359	null
2025-03-21	PVChat: Personalized Video Chat with One-Shot Learning	Yufei Shi et.al.	2503.17069	null
2025-03-21	AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process	Junjie Hu et.al.	2503.17029	null
2025-03-21	Enabling Versatile Controls for Video Diffusion Models	Xu Zhang et.al.	2503.16983	link
2025-03-25	Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model	Yingying Fan et.al.	2503.16942	null
2025-03-21	Temporal Action Detection Model Compression by Progressive Block Drop	Xiaoyong Chen et.al.	2503.16916	null
2025-03-17	Adams Bashforth Moulton Solver for Inversion and Editing in Rectified Flow	Yongjia Ma et.al.	2503.16522	null
2025-03-20	XAttention: Block Sparse Attention with Antidiagonal Scoring	Ruyi Xu et.al.	2503.16428	link
2025-03-20	MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance	Quanhao Li et.al.	2503.16421	null
2025-03-27	ScalingNoise: Scaling Inference-Time Search for Generating Infinite Videos	Haolin Yang et.al.	2503.16400	null
2025-03-20	PoseTraj: Pose-Aware Trajectory Control in Video Diffusion	Longbin Ji et.al.	2503.16068	null
2025-03-20	Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models	Zhihang Liu et.al.	2503.16036	link
2025-03-20	Agentic Keyframe Search for Video Question Answering	Sunqi Fan et.al.	2503.16032	link
2025-03-20	Animating the Uncaptured: Humanoid Mesh Animation with Video Diffusion Models	Marc Benedí San Millán et.al.	2503.15996	null
2025-03-25	STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding	Zichen Liu et.al.	2503.15973	link
2025-03-20	DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering	Haochen Wang et.al.	2503.15887	null
2025-03-20	MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving	Haiguang Wang et.al.	2503.15875	link
2025-03-20	MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations	Kyungho Bae et.al.	2503.15871	null
2025-03-20	VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling	Hyojun Go et.al.	2503.15855	null
2025-03-20	What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation?	Xuanming Cui et.al.	2503.15846	null
2025-03-19	Temporal Regularization Makes Your Video Generator Stronger	Harold Haodong Chen et.al.	2503.15417	null
2025-03-20	VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention	Mingzhe Zheng et.al.	2503.15138	null
2025-03-19	Neuro Symbolic Knowledge Reasoning for Procedural Video Question Answering	Thanh-Son Nguyen et.al.	2503.14957	null
2025-03-19	FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding	Chongjun Tu et.al.	2503.14935	null
2025-03-18	MusicInfuser: Making Video Diffusion Listen and Dance	Susung Hong et.al.	2503.14505	null
2025-03-18	MagicComp: Training-free Dual-Phase Refinement for Compositional Video Generation	Hongyu Zhang et.al.	2503.14428	null
2025-03-18	Impossible Videos	Zechen Bai et.al.	2503.14378	null
2025-03-18	LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models	Yu Cheng et.al.	2503.14325	link
2025-03-18	Concat-ID: Towards Universal Identity-Preserving Video Synthesis	Yong Zhong et.al.	2503.14151	null
2025-03-18	Fast Autoregressive Video Generation with Diagonal Decoding	Yang Ye et.al.	2503.14070	null
2025-03-18	AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark	Xinhao Xiang et.al.	2503.14064	link
2025-03-18	SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability	Jiankang Wang et.al.	2503.13983	null
2025-03-18	Improving LLM Video Understanding with 16 Frames Per Second	Yixuan Li et.al.	2503.13956	null
2025-03-17	Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition	Shristi Das Biswas et.al.	2503.13724	null
2025-03-17	Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory	Saket Gurukar et.al.	2503.13707	null
2025-03-17	Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos	Chiara Plizzari et.al.	2503.13646	link
2025-03-17	VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning	Ye Liu et.al.	2503.13444	link
2025-03-17	Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding	Weiyu Guo et.al.	2503.13139	null
2025-03-17	Efficient Motion-Aware Video MLLM	Zijia Zhao et.al.	2503.13016	null
2025-03-17	Frame-wise Conditioning Adaptation for Fine-Tuning Diffusion Models in Text-to-Video Prediction	Zheyuan Liu et.al.	2503.12953	null
2025-03-17	VITED: Video Temporal Evidence Distillation	Yujie Lu et.al.	2503.12855	null
2025-03-17	AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations	Quang Trung Truong et.al.	2503.12828	null
2025-03-17	ViSpeak: Visual Instruction Feedback in Streaming Videos	Shenghao Fu et.al.	2503.12769	null
2025-03-16	AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding	Xiao Wang et.al.	2503.12559	link
2025-03-16	SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs	Guibiao Liao et.al.	2503.12535	null
2025-03-16	Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?	Tianyuan Qu et.al.	2503.12496	link
2025-03-16	Causality Model for Semantic Understanding on Videos	Li Yicong et.al.	2503.12447	null
2025-03-16	VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining	Yunze Liu et.al.	2503.12332	null
2025-03-15	A Speech-to-Video Synthesis Approach Using Spatio-Temporal Diffusion for Vocal Tract MRI	Paula Andrea Pérez-Toro et.al.	2503.12102	null
2025-03-15	SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering	Byeongjun Park et.al.	2503.12024	link
2025-03-14	ReCamMaster: Camera-Controlled Generative Rendering from A Single Video	Jianhong Bai et.al.	2503.11647	null
2025-03-14	Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers	Weiming Ren et.al.	2503.11579	null
2025-03-14	HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models	Ziqin Zhou et.al.	2503.11513	null
2025-03-14	V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning	Zixu Cheng et.al.	2503.11495	null
2025-03-14	TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation	Hongxiang Zhao et.al.	2503.11423	null
2025-03-14	Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding	David Gastager et.al.	2503.11392	null
2025-03-14	Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model	Haoyang Huang et.al.	2503.11251	link
2025-03-14	LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs	Leqi Shen et.al.	2503.11205	null
2025-03-14	Cross-Modal Learning for Music-to-Music-Video Description Generation	Zhuoyuan Mao et.al.	2503.11190	null
2025-03-24	Large-scale Pre-training for Grounded Video Caption Generation	Evangelos Kazakos et.al.	2503.10781	link
2025-03-13	Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing	Yudong Liu et.al.	2503.10742	link
2025-03-12	Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework	Jing Wang et.al.	2503.10704	null
2025-03-12	Neighboring Autoregressive Modeling for Efficient Visual Generation	Yefei He et.al.	2503.10696	link
2025-03-12	Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation	Qiji Zhou et.al.	2503.10691	null
2025-03-11	VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion	Lehan Yang et.al.	2503.10678	link
2025-03-13	CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models	Hao He et.al.	2503.10592	null
2025-03-13	Long Context Tuning for Video Generation	Yuwei Guo et.al.	2503.10589	null
2025-03-13	4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models	Wanhua Li et.al.	2503.10437	link
2025-03-13	CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance	Yufan Deng et.al.	2503.10391	null
2025-03-13	LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents	Boyu Chen et.al.	2503.10200	null
2025-03-13	StableFusion: Continual Video Retrieval via Frame Adaptation	Zecheng Zhao et.al.	2503.10111	link
2025-03-13	Semantic Latent Motion for Portrait Video Generation	Qiyuan Zhang et.al.	2503.10096	null
2025-03-16	VMBench: A Benchmark for Perception-Aligned Video Motion Generation	Xinran Ling et.al.	2503.10076	link
2025-03-13	TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs	Yunxiao Wang et.al.	2503.09994	null
2025-03-21	UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?	Yuanxin Liu et.al.	2503.09949	link
2025-03-13	VideoMerge: Towards Training-free Long Video Generation	Siyang Zhang et.al.	2503.09926	null
2025-03-12	LuciBot: Automated Robot Policy Learning from Generated Videos	Xiaowen Qiu et.al.	2503.09871	null
2025-03-14	On the Limitations of Vision-Language Models in Understanding Image Transforms	Ahmad Mustafa Anis et.al.	2503.09837	null
2025-03-12	I2V3D: Controllable image-to-video generation with 3D guidance	Zhiyuan Zhang et.al.	2503.09733	null
2025-03-12	Accelerating Diffusion Sampling via Exploiting Local Transition Coherence	Shangwen Zhu et.al.	2503.09675	null
2025-03-12	Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k	Xiangyu Peng et.al.	2503.09642	link
2025-03-11	V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video	Jianqi Chen et.al.	2503.09631	null
2025-03-12	PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop	Chenyu Li et.al.	2503.09595	link
2025-03-13	BIMBA: Selective-Scan Compression for Long-Range Video Question Answering	Md Mohaiminul Islam et.al.	2503.09590	link
2025-03-12	VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary	Kevin Qinghong Lin et.al.	2503.09402	link
2025-03-17	VideoScan: Enabling Efficient Streaming Video Understanding via Frame-level Semantic Carriers	Ruanjun Li et.al.	2503.09387	null
2025-03-12	Unified Dense Prediction of Video Diffusion	Lehan Yang et.al.	2503.09344	null
2025-03-12	Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption	Luozheng Qin et.al.	2503.09279	null
2025-03-17	Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space	Jian Zhu et.al.	2503.09215	null
2025-03-13	Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model	Ali Vosoughi et.al.	2503.09205	null
2025-03-15	WonderVerse: Extendable 3D Scene Generation with Video Generative Models	Hao Feng et.al.	2503.09160	null
2025-03-13	FaVChat: Unlocking Fine-Grained Facial Video Understanding with Multimodal Large Language Models	Fufangchen Zhao et.al.	2503.09158	null
2025-03-17	Reangle-A-Video: 4D Video Generation as Video-to-Video Translation	Hyeonho Jeong et.al.	2503.09151	null
2025-03-12	Memory-enhanced Retrieval Augmentation for Long Video Understanding	Huaying Yuan et.al.	2503.09149	null
2025-03-12	Generative Frame Sampler for Long Video Understanding	Linli Yao et.al.	2503.09146	null
2025-03-12	Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding	Haoyu Zhang et.al.	2503.09143	null
2025-03-12	Everything Can Be Described in Words: A Simple Unified Multi-Modal Framework with Semantic and Temporal Alignment	Xiaowei Bi et.al.	2503.09081	null
2025-03-12	Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization	Zongshang Pang et.al.	2503.09027	null
2025-03-11	QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension	Yongdong Luo et.al.	2503.08689	link
2025-03-11	REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder	Yitian Zhang et.al.	2503.08665	null
2025-03-11	Tuning-Free Multi-Event Long Video Generation via Synchronized Coupled Sampling	Subin Kim et.al.	2503.08605	null
2025-03-11	HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding	Shehreen Azad et.al.	2503.08585	null
2025-03-11	RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding	Xichen Tan et.al.	2503.08576	null
2025-03-11	Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos	Soumya Shamarao Jahagirdar et.al.	2503.08335	null
2025-03-12	$^R$FLAV: Rolling Flow matching for infinite Audio Video generation	Alex Ergasti et.al.	2503.08307	link
2025-03-11	WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation	Jing Wang et.al.	2503.08153	null
2025-03-11	ObjectMover: Generative Object Movement with Video Prior	Xin Yu et.al.	2503.08037	null
2025-03-11	How Can Video Generative AI Transform K-12 Education? Examining Teachers’ Perspectives through TPACK and TAM	Unggi Lee et.al.	2503.08003	null
2025-03-10	BEARCUBS: A benchmark for computer-using web agents	Yixiao Song et.al.	2503.07919	null
2025-03-10	DreamRelation: Relation-Centric Video Customization	Yujie Wei et.al.	2503.07602	null
2025-03-11	VACE: All-in-One Video Creation and Editing	Zeyinzi Jiang et.al.	2503.07598	null
2025-03-10	AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion	Mingzhen Sun et.al.	2503.07418	null
2025-03-10	Automated Movie Generation via Multi-Agent CoT Planning	Weijia Wu et.al.	2503.07314	link
2025-03-10	ALLVB: All-in-One Long Video Understanding Benchmark	Xichen Tan et.al.	2503.07298	null
2025-03-10	Towards Fine-Grained Video Question Answering	Wei Dai et.al.	2503.06820	null
2025-03-09	VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation	Hritik Bansal et.al.	2503.06800	null
2025-03-09	TR-DQ: Time-Rotation Diffusion Quantization	Yihua Shao et.al.	2503.06564	null
2025-03-09	QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation	Junyi Wu et.al.	2503.06545	link
2025-03-09	TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos	Chen-Lin Zhang et.al.	2503.06526	link
2025-03-11	LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation	Quanjian Song et.al.	2503.06508	link
2025-03-09	Generative Video Bi-flow	Chen Liu et.al.	2503.06364	null
2025-03-08	Text2Story: Advancing Video Storytelling with Text Guidance	Taewon Kang et.al.	2503.06310	null
2025-03-08	Get In Video: Add Anything You Want to the Video	Shaobin Zhuang et.al.	2503.06268	null
2025-03-12	Object-Centric World Model for Language-Guided Manipulation	Youngjoon Jeong et.al.	2503.06170	null
2025-03-08	VACT: A Video Automatic Causal Testing System and a Benchmark	Haotong Yang et.al.	2503.06163	null
2025-03-08	GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation	Ye Tao et.al.	2503.06136	null
2025-03-08	DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation	Runze Zhang et.al.	2503.06053	null
2025-03-07	MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice	Hongwei Yi et.al.	2503.05978	null
2025-03-07	MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio	Xuenan Xu et.al.	2503.05242	link
2025-03-07	Unified Reward Model for Multimodal Understanding and Generation	Yibin Wang et.al.	2503.05236	null
2025-03-13	Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions	Chan Hur et.al.	2503.05186	null
2025-03-06	Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation	Alexey Buzovkin et.al.	2503.04871	link
2025-03-05	ProReflow: Progressive Reflow with Decomposed Velocity	Lei Ke et.al.	2503.04824	null
2025-03-04	LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning	Zhibin Lan et.al.	2503.04812	null
2025-03-06	FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video	Yue Gao et.al.	2503.04720	null
2025-03-06	What Are You Doing? A Closer Look at Controllable Human Video Generation	Emanuele Bugliarello et.al.	2503.04666	null
2025-03-08	The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation	Aoxiong Yin et.al.	2503.04606	link
2025-03-06	Token-Efficient Long Video Understanding for Multimodal LLMs	Jindong Jiang et.al.	2503.04130	null
2025-03-06	EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models	Haiyang Yu et.al.	2503.04058	null
2025-03-05	EgoLife: Towards Egocentric Life Assistant	Jingkang Yang et.al.	2503.03803	link
2025-03-05	GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control	Xuanchi Ren et.al.	2503.03751	link
2025-03-08	Rethinking Video Tokenization: A Conditioned Diffusion-based Approach	Nianzu Yang et.al.	2503.03708	link
2025-03-05	DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance	Zhao Yang et.al.	2503.03689	link
2025-03-06	Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection	Wenqiao Li et.al.	2503.03562	null
2025-03-05	High-Quality Virtual Single-Viewpoint Surgical Video: Geometric Autocalibration of Multiple Cameras in Surgical Lights	Yuna Kato et.al.	2503.03558	link
2025-03-13	Video Super-Resolution: All You Need is a Video Diffusion Model	Zhihao Zhan et.al.	2503.03355	null
2025-03-04	GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning	Zhun Mou et.al.	2503.02341	null
2025-02-26	Online Pseudo-average Shifting Attention (PASA) for Robust Low-precision LLM Inference: Algorithms and Numerical Analysis	Long Cheng et.al.	2503.01873	null
2025-03-03	VideoUFO: A Million-Scale User-Focused Dataset for Text-to-Video Generation	Wenhao Wang et.al.	2503.01739	link
2025-03-03	Learning to Generate Long-term Future Narrations Describing Activities of Daily Living	Ramanathan Rajendiran et.al.	2503.01416	null
2025-03-03	Composed Multi-modal Retrieval: A Survey of Approaches and Applications	Kun Zhang et.al.	2503.01334	link
2025-03-03	Parameter-free Video Segmentation for Vision and Language Understanding	Louis Mahon et.al.	2503.01201	null
2025-03-03	VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors	Juil Koo et.al.	2503.01107	null
2025-03-02	Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning	Baoqi Pei et.al.	2503.00986	link
2025-03-02	Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think	Jie Tian et.al.	2503.00948	link
2025-03-04	An Efficient 3D Convolutional Neural Network with Channel-wise, Spatial-grouped, and Temporal Convolutions	Zhe Wang et.al.	2503.00796	null
2025-03-01	Streaming Video Question-Answering with In-context Video KV-Cache Retrieval	Shangzhe Di et.al.	2503.00540	link
2025-03-10	Learning to Animate Images from A Few Videos to Portray Delicate Human Actions	Haoxin Li et.al.	2503.00276	null
2025-03-04	Unified Video Action Model	Shuang Li et.al.	2503.00200	null
2025-02-28	PreMind: Multi-Agent Video Understanding for Advanced Indexing of Presentation-style Videos	Kangda Wei et.al.	2503.00162	null
2025-02-26	Glad: A Streaming Scene Generator for Autonomous Driving	Bin Xie et.al.	2503.00045	null
2025-02-25	An Analysis of Segment Anything 2	Clayton Bromley et.al.	2503.00042	null
2025-03-07	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null
2025-02-28	Adaptive Keyframe Sampling for Long Video Understanding	Xi Tang et.al.	2502.21271	null
2025-02-28	Training-free and Adaptive Sparse Attention for Efficient Long Video Generation	Yifei Xia et.al.	2502.21079	null
2025-02-28	HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models	Xiao Wang et.al.	2502.20811	null
2025-02-28	WorldModelBench: Judging Video Generation Models As World Models	Dacheng Li et.al.	2502.20694	null
2025-02-27	OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection	Shuming Liu et.al.	2502.20361	link
2025-02-27	Mobius: Text to Seamless Looping Video Generation via Latent Shift	Xiuli Bi et.al.	2502.20307	link
2025-02-27	FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute	Sotiris Anagnostidis et.al.	2502.20126	null
2025-02-27	C-Drag: Chain-of-Thought Driven Motion Controller for Video Generation	Yuhao Li et.al.	2502.19868	link
2025-02-27	M-LLM Based Video Frame Selection for Efficient Video Understanding	Kai Hu et.al.	2502.19680	null
2025-02-26	FLAP: Fully-controllable Audio-driven Portrait Video Generation through 3D head conditioned diffusion model	Lingzhou Mu et.al.	2502.19455	null
2025-03-03	TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis	Menghao Li et.al.	2502.19454	null
2025-02-26	InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model	Fengbin Guan et.al.	2502.19026	null
2025-02-25	SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference	Jintao Zhang et.al.	2502.18137	link
2025-02-25	A Survey: Spatiotemporal Consistency in Video Generation	Zhiyu Yin et.al.	2502.17863	null
2025-02-26	Task Graph Maximum Likelihood Estimation for Procedural Activity Understanding in Egocentric Videos	Luigi Seminara et.al.	2502.17753	link
2025-02-24	X-Dancer: Expressive Music to Human Dance Video Generation	Zeyuan Chen et.al.	2502.17414	null
2025-02-24	VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing	Xiangpeng Yang et.al.	2502.17258	null
2025-02-24	Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions	Zhong Li et.al.	2502.17119	link
2025-02-23	Fine-Grained Video Captioning through Scene Graph Consolidation	Sanghyeok Chu et.al.	2502.16427	null
2025-02-21	RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers	Min Zhao et.al.	2502.15894	null
2025-02-21	VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Florent Bartoccioni et.al.	2502.15672	link
2025-03-01	LongCaptioning: Unlocking the Power of Long Video Caption Generation in Large Multimodal Models	Hongchen Wei et.al.	2502.15393	null
2025-02-21	Weakly Supervised Video Scene Graph Generation via Natural Language Supervision	Kibum Kim et.al.	2502.15370	link
2025-02-21	TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba	Xiuwei Chen et.al.	2502.15130	null
2025-02-20	Can Hallucination Correction Improve Video-Language Alignment?	Lingjun Zhao et.al.	2502.15079	null
2025-02-20	Hardware-Friendly Static Quantization Method for Video Diffusion Transformers	Sanghyun Yi et.al.	2502.15077	null
2025-02-20	LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection	Qingyuan Liu et.al.	2502.14994	null
2025-02-20	Improving the Diffusability of Autoencoders	Ivan Skorokhodov et.al.	2502.14831	null
2025-03-04	AVD2: Accident Video Diffusion for Accident Video Description	Cheng Li et.al.	2502.14801	null
2025-02-28	RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers	Ke Cao et.al.	2502.14377	null
2025-02-20	Designing Parameter and Compute Efficient Diffusion Transformers using Distillation	Vignesh Sundaresha et.al.	2502.14226	null
2025-02-19	FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation	Yunpeng Zhang et.al.	2502.13995	link
2025-02-19	Capturing Rich Behavior Representations: A Dynamic Action Semantic-Aware Graph Transformer for Video Captioning	Caihua Liu et.al.	2502.13754	null
2025-02-19	Pretrained Image-Text Models are Secretly Video Captioners	Chunhui Zhang et.al.	2502.13363	link
2025-02-19	LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation	Junchen Fu et.al.	2502.12945	null
2025-02-18	VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation	Xinlong Chen et.al.	2502.12782	link
2025-02-18	MALT Diffusion: Memory-Augmented Latent Transformers for Any-Length Video Generation	Sihyun Yu et.al.	2502.12632	null
2025-03-10	MomentSeeker: A Comprehensive Benchmark and A Strong Baseline For Moment Retrieval Within Long Videos	Huaying Yuan et.al.	2502.12558	null
2025-02-21	LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities	Florian Sestak et.al.	2502.12128	link
2025-02-17	DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation	Zhihang Yuan et.al.	2502.11897	link
2025-02-17	video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model	Guangzhi Sun et.al.	2502.11775	link
2025-02-18	Open-Ended and Knowledge-Intensive Video Question Answering	Md Zarif Ul Alam et.al.	2502.11747	null
2025-02-17	VRoPE: Rotary Position Embedding for Video Large Language Models	Zikang Liu et.al.	2502.11664	link
2025-02-17	Object-Centric Image to Video Generation with Language Guidance	Angel Villar-Corrales et.al.	2502.11655	null
2025-02-18	iMOVE: Instance-Motion-Aware Video Understanding	Jiaze Li et.al.	2502.11594	null
2025-02-16	MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation	Michael Fuest et.al.	2502.11234	null
2025-02-16	Phantom: Subject-consistent video generation via cross-modal alignment	Lijie Liu et.al.	2502.11079	null
2025-02-15	SVBench: A Benchmark with Temporal Multi-Turn Dialogues for Streaming Video Understanding	Zhenyu Yang et.al.	2502.10810	null
2025-02-15	Semantics-aware Test-time Adaptation for 3D Human Pose Estimation	Qiuxia Lin et.al.	2502.10724	null
2025-02-24	Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model	Guoqing Ma et.al.	2502.10248	link
2025-02-14	RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control	Teng Li et.al.	2502.10059	null
2025-02-14	Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering	Mark Beliaev et.al.	2502.09573	null
2025-02-14	GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation	Hongyin Zhang et.al.	2502.09268	null
2025-02-12	CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation	Qinghe Wang et.al.	2502.08639	null
2025-02-12	FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis	Wonjoon Jin et.al.	2502.08244	null
2025-02-12	Learning Human Skill Generators at Key-Step Levels	Yilu Wu et.al.	2502.08234	null
2025-02-12	AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance	Zhao Wang et.al.	2502.08189	null
2025-02-10	Pre-Trained Video Generative Models as World Simulators	Haoran He et.al.	2502.07825	null
2025-02-12	Next Block Prediction: Video Generation via Semi-Autoregressive Modeling	Shuhuai Ren et.al.	2502.07737	null
2025-02-17	Magic 1-For-1: Generating One Minute Video Clips within One Minute	Hongwei Yi et.al.	2502.07701	link
2025-02-12	VidCRAFT3: Camera, Object, and Lighting Control for Image-to-Video Generation	Sixiao Zheng et.al.	2502.07531	null
2025-02-27	Enhance-A-Video: Better Generated Video for Free	Yang Luo et.al.	2502.07508	link
2025-02-11	EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering	Sheng Zhou et.al.	2502.07411	link
2025-02-11	Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos	Haowen Gao et.al.	2502.07327	null
2025-02-11	Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization	Aditya Vora et.al.	2502.07278	null
2025-02-11	Enhancing Video Understanding: Deep Neural Networks for Spatiotemporal Analysis	Amir Hosein Fadaei et.al.	2502.07277	null
2025-02-11	Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation	Pinxin Liu et.al.	2502.07239	null
2025-02-11	A Survey on Mamba Architecture for Vision Applications	Fady Ibrahim et.al.	2502.07161	null
2025-02-10	Lotus: Creating Short Videos From Long Videos With Abstractive and Extractive Summarization	Aadit Barua et.al.	2502.07096	null
2025-02-19	Conditional diffusion model with spatial attention and latent embedding for medical image segmentation	Behzad Hejrati et.al.	2502.06997	link
2025-02-06	TorchResist: Open-Source Differentiable Resist Simulator	Zixiao Wang et.al.	2502.06838	link
2025-02-17	Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models	Shuting Wang et.al.	2502.06812	null
2025-02-12	Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT	Dongyang Liu et.al.	2502.06782	null
2025-02-10	History-Guided Video Diffusion	Kiwhan Song et.al.	2502.06764	null
2025-02-10	Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists	Bojia Zi et.al.	2502.06734	null
2025-02-27	TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models	Yangguang Li et.al.	2502.06608	link
2025-02-27	A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems	Linxiao Gong et.al.	2502.06581	null
2025-02-20	CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers	D. She et.al.	2502.06527	null
2025-02-11	CoS: Chain-of-Shot Prompting for Long Video Understanding	Jian Hu et.al.	2502.06428	null
2025-02-17	Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile	Hangliang Ding et.al.	2502.06155	null
2025-02-09	Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding	Xingjian Diao et.al.	2502.06020	link
2025-02-09	Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search	Hengzhu Tang et.al.	2502.05924	null
2025-02-08	Towards AI-driven Sign Language Generation with Non-manual Markers	Han Zhang et.al.	2502.05661	null
2025-02-08	Training-Free Constrained Generation With Stable Diffusion Models	Stefano Zampini et.al.	2502.05625	null
2025-02-18	A Physical Coherence Benchmark for Evaluating Video Generation Models via Optical Flow-guided Frame Prediction	Yongfan Chen et.al.	2502.05503	link
2025-02-08	Content-based Video Retrieval in Traffic Videos using Latent Dirichlet Allocation Topic Model	Mohammad Kianpisheh et.al.	2502.05457	null
2025-02-28	FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation	Shilong Zhang et.al.	2502.05179	link
2025-02-07	Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy	Yunhang Shen et.al.	2502.05177	link
2025-02-07	VideoRoPE: What Makes for Good Video Rotary Position Embedding?	Xilin Wei et.al.	2502.05173	link
2025-02-10	Goku: Flow Based Video Generative Foundation Models	Shoufa Chen et.al.	2502.04896	null
2025-02-10	HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation	Qijun Gan et.al.	2502.04847	null
2025-02-06	Fast Video Generation with Sliding Tile Attention	Peiyuan Zhang et.al.	2502.04507	null
2025-02-06	UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation	Wenzhang Sun et.al.	2502.04393	null
2025-02-05	On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices	Bosung Kim et.al.	2502.04363	link
2025-02-06	WorldSense: Evaluating Real-world Omnimodal Understanding for Multimodal LLMs	Jack Hong et.al.	2502.04326	null
2025-02-06	MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation	Jinbo Xing et.al.	2502.04299	null
2025-02-06	Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression	Lirui Wang et.al.	2502.04296	null
2025-02-06	Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency	Shangkun Sun et.al.	2502.04076	link
2025-02-08	UniForm: A Unified Diffusion Transformer for Audio-Video Generation	Lei Zhao et.al.	2502.03897	null
2025-02-05	Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach	Yunuo Chen et.al.	2502.03639	null
2025-02-19	FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise	Yunlong Yuan et.al.	2502.03496	null
2025-02-05	SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living	Arkaprava Sinha et.al.	2502.03459	null
2025-02-05	MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent	Xinyao Liao et.al.	2502.03207	null
2025-02-27	MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding	Pengyi Li et.al.	2502.03183	null
2025-02-05	A Decade of Action Quality Assessment: Largest Systematic Survey of Trends, Challenges, and Future Directions	Hao Yin et.al.	2502.02817	null
2025-02-04	Controllable Video Generation with Provable Disentanglement	Yifan Shen et.al.	2502.02690	null
2025-02-03	Secure & Personalized Music-to-Video Generation via CHARCHA	Mehul Agarwal et.al.	2502.02610	null
2025-02-04	VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models	Hila Chefer et.al.	2502.02492	null
2025-02-04	Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task Perspectives	Simone Alberto Peirone et.al.	2502.02487	link
2025-02-04	TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic Scenes	Xingcheng Zhou et.al.	2502.02449	null
2025-02-06	LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models	Tzu-Tao Chang et.al.	2502.02406	null
2025-02-05	IPO: Iterative Preference Optimization for Text-to-Video Generation	Xiaomeng Yang et.al.	2502.02088	null
2025-02-03	VILP: Imitation Learning with Latent Video Planning	Zhengtong Xu et.al.	2502.01784	link
2025-02-03	Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity	Haocheng Xi et.al.	2502.01776	null
2025-02-07	MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation	Haibo Tong et.al.	2502.01719	null
2025-02-02	HuViDPO: Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment	Lifan Jiang et.al.	2502.01690	null
2025-02-03	VideoRAG: Retrieval-Augmented Generation with Extreme Long-Context Videos	Xubin Ren et.al.	2502.01549	link
2025-02-03	Improved Training Technique for Latent Consistency Models	Quan Dao et.al.	2502.01441	link
2025-02-17	VidSketch: Hand-drawn Sketch-Driven Video Generation with Diffusion Control	Lifan Jiang et.al.	2502.01101	link
2025-02-13	OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models	Gaojie Lin et.al.	2502.01061	null
2025-02-03	Pushing the Boundaries of State Space Models for Image and Video Generation	Yicong Hong et.al.	2502.00972	null
2025-02-02	Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer	Tao Ren et.al.	2502.00639	null
2025-02-04	Video Latent Flow Matching: Optimal Polynomial Projections for Video Interpolation and Extrapolation	Yang Cao et.al.	2502.00500	null
2025-02-01	Masked Generative Nested Transformers with Decode Time Scaling	Sahil Goyal et.al.	2502.00382	null
2025-02-01	Shape from Semantics: 3D Shape Generation from Multi-View Semantics	Liangchen Li et.al.	2502.00360	null
2025-02-04	AIN: The Arabic INclusive Large Multimodal Model	Ahmed Heakl et.al.	2502.00094	link
2025-01-31	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-31	$\infty$-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation	Saul Santos et.al.	2501.19098	link
2025-01-30	Every Image Listens, Every Image Dances: Music-Driven Image Animation	Zhikang Dong et.al.	2501.18801	null
2025-01-30	MAMS: Model-Agnostic Module Selection Framework for Video Captioning	Sangho Lee et.al.	2501.18269	null
2025-01-28	Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding	Yun Li et.al.	2501.16786	null
2025-01-28	CascadeV: An Implementation of Wurstchen Architecture for Video Generation	Wenfeng Lin et.al.	2501.16612	link
2025-01-27	AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models	Zheng Lian et.al.	2501.16566	link
2025-01-27	Understanding Long Videos via LLM-Powered Entity Relation Graphs	Meng Chu et.al.	2501.15953	null
2025-01-26	TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding	Xingjian Zhang et.al.	2501.15513	link
2025-01-26	“See What I Imagine, Imagine What I See”: Human-AI Co-Creation System for 360$^\circ$ Panoramic Video Generation in VR	Yunge Wen et.al.	2501.15456	null
2025-01-25	HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding	Jiaxing Zhao et.al.	2501.15111	null
2025-01-25	VideoPure: Diffusion-based Adversarial Purification for Video Recognition	Kaixun Jiang et.al.	2501.14999	link
2025-01-11	HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators	Le Chen et.al.	2501.14794	null
2025-01-24	VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking	Runyi Hu et.al.	2501.14195	link
2025-01-24	ENTER: Event Based Interpretable Reasoning for VideoQA	Hammad Ayyubi et.al.	2501.14194	null
2025-01-30	Temporal Preference Optimization for Long-Form Video Understanding	Rui Li et.al.	2501.13919	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	ReasVQA: Advancing VideoQA with Imperfect Reasoning Process	Jianxin Liang et.al.	2501.13536	null
2025-01-23	Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge	Haomiao Xiong et.al.	2501.13468	link
2025-01-23	EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion	Jiangchuan Wei et.al.	2501.13452	null
2025-01-28	VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding	Boqiang Zhang et.al.	2501.13106	link
2025-01-21	Taming Teacher Forcing for Masked Autoregressive Video Generation	Deyu Zhou et.al.	2501.12389	null
2025-01-22	InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling	Yi Wang et.al.	2501.12386	link
2025-01-21	MMVU: Measuring Expert-Level Multi-Discipline Video Understanding	Yilun Zhao et.al.	2501.12380	link
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model	Yuhang Zang et.al.	2501.12368	link
2025-01-20	GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video	Zhenliang Ni et.al.	2501.11340	null
2025-01-20	CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation	Zheng Chong et.al.	2501.11325	link
2025-02-03	HFGCN: Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition	Pengcheng Dong et.al.	2501.11007	null
2025-01-18	EMO2: End-Effector Guided Audio-Driven Avatar Video Generation	Linrui Tian et.al.	2501.10687	null
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-02-02	RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation	Yuefan Cao et.al.	2501.09982	null
2025-01-16	VideoWorld: Exploring Knowledge Learning from Unlabeled Videos	Zhongwei Ren et.al.	2501.09781	null
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	null
2025-02-10	Do generative video models learn physical principles from watching videos?	Saman Motamed et.al.	2501.09038	link
2025-01-15	Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion	Jingyuan Chen et.al.	2501.09019	null
2025-01-15	RepVideo: Rethinking Cross-Layer Representation for Video Generation	Chenyang Si et.al.	2501.08994	null
2025-01-15	Admitting Ignorance Helps the Video Question Answering Models to Answer	Haopeng Li et.al.	2501.08771	null
2025-01-31	Comprehensive Subjective and Objective Evaluation Method for Text-generated Video	Zelu Qi et.al.	2501.08545	null
2025-01-14	Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models	Weichen Fan et.al.	2501.08453	null
2025-01-14	3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering	Meenakshi Krishnan et.al.	2501.08370	null
2025-01-14	Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks	Miran Heo et.al.	2501.08326	null
2025-01-14	GameFactory: Creating New Games with Generative Interactive Videos	Jiwen Yu et.al.	2501.08325	null
2025-01-14	Diffusion Adversarial Post-Training for One-Step Video Generation	Shanchuan Lin et.al.	2501.08316	null
2025-01-17	LayerAnimate: Layer-specific Control for Animation	Yuxue Yang et.al.	2501.08295	null
2025-01-14	FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Yabo Zhang et.al.	2501.08225	link
2025-01-14	Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness	Jiaxing Zhao et.al.	2501.07978	link
2025-01-24	Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding	Liping Yuan et.al.	2501.07888	link
2025-01-14	AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation	Sitong Gong et.al.	2501.07810	link
2025-01-13	BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations	Weixi Feng et.al.	2501.07647	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-17	MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning	Tieyuan Chen et.al.	2501.07227	null
2025-01-13	TimeLogic: A Temporal Logic Benchmark for Video QA	Sirnam Swetha et.al.	2501.07214	null
2025-01-13	Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling	Jiebin Yan et.al.	2501.07087	null
2025-01-12	X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding	Wenqi Zhou et.al.	2501.06835	null
2025-01-12	VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning	Ji Soo Lee et.al.	2501.06761	link
2025-01-11	Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning	Maomao Li et.al.	2501.06438	null
2025-01-10	MEt3R: Measuring Multi-View Consistency in Generated Images	Mohammad Asim et.al.	2501.06336	null
2025-01-10	Multi-subject Open-set Personalization in Video Generation	Tsai-Shien Chen et.al.	2501.06187	null
2025-01-10	VideoAuteur: Towards Long Narrative Video Generation	Junfei Xiao et.al.	2501.06173	null
2025-01-13	Valley2: Exploring Multimodal Models with Scalable Vision-Language Design	Ziheng Wu et.al.	2501.05901	link
2025-01-10	Zero-shot Shark Tracking and Biometrics from Aerial Imagery	Chinmay K Lalgudi et.al.	2501.05717	null
2025-01-10	From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities	Dominick Reilly et.al.	2501.05711	link
2025-01-09	OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?	Yifei Li et.al.	2501.05510	link
2025-01-08	Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion	Yongjia Ma et.al.	2501.05484	null
2025-01-09	Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces	Aniruddha Mahapatra et.al.	2501.05442	null
2025-01-09	Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning	Huabin Liu et.al.	2501.05069	null
2025-01-09	LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding	Jiaxing Zhao et.al.	2501.05067	null
2025-01-09	LongViTU: Instruction Tuning for Long-Form Video Understanding	Rujie Wu et.al.	2501.05037	null
2025-01-09	ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark	Ronghao Dang et.al.	2501.05031	link
2025-01-08	ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning	Yuzhou Huang et.al.	2501.04698	null
2025-01-08	Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs	Zeyi Huang et.al.	2501.04336	null
2025-01-08	H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving	Siran Chen et.al.	2501.04302	null
2025-01-08	LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition	Bowen Hao et.al.	2501.04204	null
2024-12-18	FlexCache: Flexible Approximate Cache System for Video Diffusion	Desen Sun et.al.	2501.04012	null
2025-01-07	Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers	Yuechen Zhang et.al.	2501.03931	link
2025-01-09	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	link
2025-01-07	Motion-Aware Generative Frame Interpolation	Guozhen Zhang et.al.	2501.03699	null
2025-01-06	License Plate Images Generation with Diffusion Models	Mariia Shpir et.al.	2501.03374	null
2025-01-03	Classifier-Guided Captioning Across Modalities	Ariel Shaulov et.al.	2501.03183	null
2025-01-06	Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation	Guy Yariv et.al.	2501.03059	null
2025-01-20	TransPixeler: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models	Wenyi Hong et.al.	2501.02955	null
2025-01-06	Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising	Yunlong Yuan et.al.	2501.02741	null
2025-01-05	GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Weikang Bian et.al.	2501.02690	null
2025-01-29	Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey	Zongxia Li et.al.	2501.02189	link
2025-01-10	Gender Bias in Text-to-Video Generation Models: A case study of Sora	Mohammad Nadeem et.al.	2501.01987	null
2024-12-30	FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models	Tianyu Fu et.al.	2501.01986	link
2025-01-03	JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing	Qili Wang et.al.	2501.01798	link
2025-01-03	HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding	Heqing Zou et.al.	2501.01645	link
2025-01-07	VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control	Yuanpeng Tu et.al.	2501.01427	null
2025-01-02	Unifying Specialized Visual Encoders for Video Language Models	Jihoon Chung et.al.	2501.01426	link
2025-01-03	Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions	Xincheng Shuai et.al.	2501.01425	null
2025-01-02	Multi-Modal Video Feature Extraction for Popularity Prediction	Haixu Liu et.al.	2501.01422	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-29	Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform	Cheonsu Jeong et.al.	2501.00750	null
2025-01-03	DreamDrive: Generative 4D Scene Modeling from Street View Images	Jiageng Mao et.al.	2501.00601	null
2025-01-08	VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM	Yuqian Yuan et.al.	2501.00599	link
2024-12-31	Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method	Zhenpeng Huang et.al.	2501.00584	null
2024-12-31	Fine-grained Video-Text Retrieval: A New Benchmark and Method	Yifan Xu et.al.	2501.00513	null
2024-12-31	OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models	Lala Shakti Swarup Ray et.al.	2501.00432	null
2025-01-09	Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding	Yue Fan et.al.	2501.00358	null
2024-12-30	Detection-Fusion for Knowledge Graph Extraction from Videos	Taniya Das et.al.	2501.00136	link
2024-12-30	LTX-Video: Realtime Video Latent Diffusion	Yoav HaCohen et.al.	2501.00103	link
2024-12-30	Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model	Yifei Huang et.al.	2412.21080	link
2024-12-30	VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Jiazheng Xu et.al.	2412.21059	link
2024-12-30	Hierarchical Banzhaf Interaction for General Video-Language Representation Learning	Peng Jin et.al.	2412.20964	link
2024-12-30	ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation	Ting Zhang et.al.	2412.20901	null
2024-12-30	Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling	Min Zhang et.al.	2412.20725	null
2025-01-05	ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding	Xiao Wang et.al.	2412.20504	link
2024-12-29	Open-Sora: Democratizing Efficient Video Production for All	Zangwei Zheng et.al.	2412.20404	link
2024-12-28	DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments	Xijun Wang et.al.	2412.20042	null
2025-01-17	MVTamperBench: Evaluating Robustness of Vision-Language Models	Amit Agarwal et.al.	2412.19794	null
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-30	VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models	Tao Wu et.al.	2412.19645	null
2024-12-30	DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT	Xiaotao Hu et.al.	2412.19505	link
2024-12-26	Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries	Roberto Amoroso et.al.	2412.19304	null
2024-12-25	Accelerating Diffusion Transformers with Dual Feature Caching	Chang Zou et.al.	2412.18911	link
2024-12-24	Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation	Faraz Waseem et.al.	2412.18688	null
2024-12-24	Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models	Jinhui Yi et.al.	2412.18609	link
2024-12-24	DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers	Yuntao Chen et.al.	2412.18607	null
2024-12-24	ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Hongjie Li et.al.	2412.18600	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-23	Large Motion Video Autoencoding with Cross-modal Video VAE	Yazhou Xing et.al.	2412.17805	null
2024-12-23	VidTwin: Video VAE with Decoupled Structure and Dynamics	Yuchi Wang et.al.	2412.17726	link
2024-12-23	HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data	Ting Zhou et.al.	2412.17574	link
2024-12-23	VidCtx: Context-aware Video Question Answering with Image Models	Andreas Goulas et.al.	2412.17415	link
2024-12-23	FFA Sora, video generation as fundus fluorescein angiography simulator	Xinyuan Wu et.al.	2412.17346	null
2024-12-23	Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory	Xingyao Li et.al.	2412.17254	null
2024-12-22	SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults	Jinzhi Wang et.al.	2412.17077	null
2025-01-08	Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation	Luoxu Jin et.al.	2412.17042	null
2024-12-22	FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos	Zhengqian Wu et.al.	2412.17022	link
2024-12-22	Video Domain Incremental Learning for Human Action Recognition in Home Environments	Yuanda Hu et.al.	2412.16946	null
2024-12-21	GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space	Souhaib Attaiki et.al.	2412.16717	null
2024-12-21	TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models	Haocheng Huang et.al.	2412.16700	null
2024-12-21	VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation	Chi Zhang et.al.	2412.16677	null
2024-12-25	Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance	Beiyuan Zhang et.al.	2412.16495	null
2024-12-18	ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping	Youxin Pang et.al.	2412.16212	null
2024-12-17	Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation	Yiping Wang et.al.	2412.16211	null
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	link
2024-12-20	DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization	Zihan Ding et.al.	2412.15689	null
2024-12-23	CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training	Xiuli Bi et.al.	2412.15646	link
2024-12-20	PolySmart @ TRECVid 2024 Medical Video Question Answering	Jiaxin Wu et.al.	2412.15514	null
2024-12-19	AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation	Moayed Haji-Ali et.al.	2412.15191	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Parallelized Autoregressive Visual Generation	Yuqing Wang et.al.	2412.15119	null
2024-12-19	Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations	Yucheng Hu et.al.	2412.14803	null
2024-12-19	HiCM$^2$: Hierarchical Compact Memory Modeling for Dense Video Captioning	Minkuk Kim et.al.	2412.14585	null
2024-12-19	Consistent Human Image and Video Generation with Spatially Conditioned Diffusion	Mingdeng Cao et.al.	2412.14531	link
2024-12-19	DirectorLLM for Human-Centric Video Generation	Kunpeng Song et.al.	2412.14484	null
2024-12-18	Learning from Massive Human Videos for Universal Humanoid Pose Control	Jiageng Mao et.al.	2412.14172	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	null
2024-12-29	AKiRa: Augmentation Kit on Rays for optical video generation	Xi Wang et.al.	2412.14158	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-18	InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models	Cong Wei et.al.	2412.14006	link
2024-12-18	Do Language Models Understand Time?	Xi Ding et.al.	2412.13845	link
2024-12-19	G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o	Tony Cheng Tong et.al.	2412.13647	link
2024-12-18	Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning	Yunbin Tu et.al.	2412.13543	null
2024-12-18	Real-time One-Step Diffusion-based Expressive Portrait Videos Generation	Hanzhong Guo et.al.	2412.13479	link
2024-12-18	SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation	Kazuki Shimada et.al.	2412.13462	null
2024-12-17	CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices	Andrei Znobishchev et.al.	2412.13273	null
2025-01-07	MotionBridge: Dynamic Video Inbetweening with Flexible Controls	Maham Tanveer et.al.	2412.13190	null
2024-12-17	VidTok: A Versatile and Open-Source Video Tokenizer	Anni Tang et.al.	2412.13061	link
2024-12-17	FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering	Zheng Cheng et.al.	2412.12833	null
2024-12-17	Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning	Shiping Ge et.al.	2412.12791	link
2024-12-17	ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries	Wangyu Xue et.al.	2412.12675	null
2024-12-16	Can video generation replace cinematographers? Research on the cinematic language of generated video	Xiaozhe Li et.al.	2412.12223	null
2024-12-16	CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding	Guo Chen et.al.	2412.12075	null
2024-12-16	InterDyn: Controllable Interactive Dynamics with Video Diffusion Models	Rick Akkerman et.al.	2412.11785	null
2024-12-16	Generative Inbetweening through Frame-wise Conditions-Driven Video Generation	Tianyi Zhu et.al.	2412.11755	link
2024-12-16	VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting	Muhammet Furkan Ilaslan et.al.	2412.11621	link
2024-12-16	Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning	Zhuyang Xie et.al.	2412.11467	null
2024-12-15	Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition	Yulin Wang et.al.	2412.11228	link
2024-12-15	GenLit: Reformulating Single-Image Relighting as Video Generation	Shrisha Bharadwaj et.al.	2412.11224	null
2024-12-15	DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes	Jinxiu Liu et.al.	2412.11100	null
2024-12-15	Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track	Deepak Gupta et.al.	2412.11056	null
2024-12-20	Video Diffusion Transformers are In-Context Learners	Zhengcong Fei et.al.	2412.10783	link
2024-12-14	Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives	Ji-jun Park et.al.	2412.10720	null
2024-12-13	SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device	Yushu Wu et.al.	2412.10494	null
2024-12-12	VCA: Video Curious Agent for Long Video Understanding	Zeyuan Yang et.al.	2412.10471	null
2024-12-17	SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization	Zhentao Tan et.al.	2412.10443	null
2024-12-11	COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework	Xin Dong et.al.	2412.10435	null
2024-12-13	Apollo: An Exploration of Video Understanding in Large Multimodal Models	Orr Zohar et.al.	2412.10360	null
2024-12-16	TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation	Xingrui Wang et.al.	2412.10275	null
2024-12-19	AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era	Yudong Jiang et.al.	2412.10255	link
2024-12-13	B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens	Zhuqiang Lu et.al.	2412.09919	link
2024-12-16	IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs	Sosuke Yamao et.al.	2412.09907	null
2024-12-13	LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity	Hongjie Wang et.al.	2412.09856	null
2024-12-13	MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion	Xunnong Xu et.al.	2412.09828	null
2024-12-17	ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation	Ali Athar et.al.	2412.09754	null
2024-12-11	Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model	Junqi You et.al.	2412.09647	null
2024-12-16	Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models	Fan Zhang et.al.	2412.09645	link
2024-12-12	Doe-1: Closed-Loop Autonomous Driving with Large World Model	Wenzhao Zheng et.al.	2412.09627	link
2024-12-12	OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Weiqi Li et.al.	2412.09623	null
2024-12-12	PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models	Chenyu Yang et.al.	2412.09613	null
2024-12-12	Owl-1: Omni World Model for Consistent Long Video Generation	Yuanhui Huang et.al.	2412.09600	link
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	Neptune: The Long Orbit to Benchmarking Long Video Understanding	Arsha Nagrani et.al.	2412.09582	link
2024-12-12	Video Creation by Demonstration	Yihong Sun et.al.	2412.09551	null
2024-12-12	Agent-based Video Trimming	Lingfeng Yang et.al.	2412.09513	null
2024-12-12	UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer	Delong Liu et.al.	2412.09389	link
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323	null
2024-12-12	InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption	Tiehan Fan et.al.	2412.09283	null
2024-12-12	Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering	Sai Bhargav Rongali et.al.	2412.09230	null
2024-12-12	LVMark: Robust Watermark for latent video diffusion models	MinHyuk Jang et.al.	2412.09122	null
2024-12-12	Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation	Lianrui Mu et.al.	2412.08976	null
2024-12-12	Mojito: Motion Trajectory and Intensity Control for Video Generation	Xuehai He et.al.	2412.08948	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-13	Physical Informed Driving World Model	Zhuoran Yang et.al.	2412.08410	null
2024-12-11	FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks	Chongkai Gao et.al.	2412.08261	null
2024-12-11	VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation	Zhiqiang Yuan et.al.	2412.08259	null
2024-12-10	3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark	Wufei Ma et.al.	2412.07825	null
2024-12-11	UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics	Xi Chen et.al.	2412.07774	null
2024-12-10	From Slow Bidirectional to Fast Causal Video Generators	Tianwei Yin et.al.	2412.07772	null
2024-12-10	SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Jianhong Bai et.al.	2412.07760	link
2024-12-10	3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation	Xiao Fu et.al.	2412.07759	null
2024-12-10	Multi-Shot Character Consistency for Text-to-Video Generation	Yuval Atzmon et.al.	2412.07750	null
2024-12-10	StyleMaster: Stylize Your Video with Artistic Generation and Translation	Zixuan Ye et.al.	2412.07744	null
2024-12-10	STIV: Scalable Text and Image Conditioned Video Generation	Zongyu Lin et.al.	2412.07730	null
2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link
2024-12-10	GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning	Yicheng Wang et.al.	2412.07704	null
2024-12-10	Multimodal Contextualized Support for Enhancing Video Retrieval System	Quoc-Bao Nguyen-Le et.al.	2412.07584	null
2024-12-19	Multi-Scale Contrastive Learning for Video Temporal Grounding	Thong Thanh Nguyen et.al.	2412.07157	null
2024-12-09	SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations	Zhaorun Chen et.al.	2412.06878	null
2024-12-09	VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features	Sifei Li et.al.	2412.06296	null
2024-12-11	Towards Long Video Understanding via Fine-detailed Video Story Generation	Zeng You et.al.	2412.06182	null
2024-12-08	Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training	Zhenghong Zhou et.al.	2412.06029	null
2024-12-08	FlexDiT: Dynamic Token Density Control for Diffusion Transformer	Shuning Chang et.al.	2412.06028	link
2024-12-10	Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation	Hyeonho Jeong et.al.	2412.06016	null
2024-12-08	Accelerating Video Diffusion Models via Distribution Matching	Yuanzhi Zhu et.al.	2412.05899	null
2024-12-08	MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation	Shuwei Shi et.al.	2412.05848	null
2024-12-08	Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval	Shanti Stewart et.al.	2412.05831	null
2024-12-08	Self-Guidance: Boosting Flow and Diffusion Generation on Their Own	Tiancheng Li et.al.	2412.05827	null
2024-12-07	Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation	Leonardo Pina et.al.	2412.05694	null
2024-12-11	Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model	Lening Wang et.al.	2412.05280	link
2024-12-17	Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling	Zhe Chen et.al.	2412.05271	link
2024-12-06	Mind the Time: Temporally-Controlled Multi-Event Video Generation	Ziyi Wu et.al.	2412.05263	null
2024-12-11	LinVT: Empower Your Image-level Large Language Model to Understand Videos	Lishuai Gao et.al.	2412.05185	link
2024-12-06	Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection	Khurram Azeem Hashmi et.al.	2412.04915	null
2024-12-06	UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving	Rui Chen et.al.	2412.04842	link
2024-12-12	Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model	Keunwoo Peter Yu et.al.	2412.04729	null
2024-12-05	Using Diffusion Priors for Video Amodal Segmentation	Kaihua Chen et.al.	2412.04623	null