2025-01-31 |
Vintix: Action Model via In-Context Reinforcement Learning |
Andrey Polubarov et.al. |
2501.19400 |
link |
2025-01-31 |
Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game |
Mustafa O. Karabag et.al. |
2501.19398 |
null |
2025-01-31 |
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models |
Alina Shutova et.al. |
2501.19392 |
null |
2025-01-31 |
Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models |
Wenzhi Fang et.al. |
2501.19389 |
null |
2025-02-03 |
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions |
Dominik Wagner et.al. |
2501.19377 |
null |
2025-01-31 |
Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions |
Sören Christensen et.al. |
2501.19373 |
null |
2025-01-31 |
We’re Different, We’re the Same: Creative Homogeneity Across LLMs |
Emily Wenger et.al. |
2501.19361 |
null |
2025-01-31 |
Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies |
Brandon P. Chelstrom et.al. |
2501.19359 |
null |
2025-01-31 |
The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking |
Yuchun Miao et.al. |
2501.19358 |
null |
2025-01-31 |
Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters |
Adrián Juan-Delgado et.al. |
2501.19356 |
null |
2025-01-31 |
Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 |
Ting-Yao E. Hsu et.al. |
2501.19353 |
null |
2025-01-31 |
Towards Adaptive Self-Improvement for Smarter Energy Systems |
Alexander Sommer et.al. |
2501.19340 |
null |
2025-01-31 |
PixelWorld: Towards Perceiving Everything as Pixels |
Zhiheng Lyu et.al. |
2501.19339 |
null |
2025-01-31 |
Homogeneity Bias as Differential Sampling Uncertainty in Language Models |
Messi H. J. Lee et.al. |
2501.19337 |
null |
2025-01-31 |
Reward-Guided Speculative Decoding for Efficient LLM Reasoning |
Baohao Liao et.al. |
2501.19324 |
null |
2025-01-31 |
MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems |
Anirudh Chari et.al. |
2501.19318 |
null |
2025-01-31 |
LLM-based Affective Text Generation Quality Based on Different Quantization Values |
Yarik Menchaca Resendiz et.al. |
2501.19317 |
null |
2025-01-31 |
Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment |
Gregor Bachmann et.al. |
2501.19309 |
null |
2025-02-03 |
SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling |
Jiefeng Chen et.al. |
2501.19306 |
null |
2025-01-31 |
Beyond checkmate: exploring the creative chokepoints in AI text |
Nafis Irtiza Tripto et.al. |
2501.19301 |
link |
2025-01-31 |
Offline Learning for Combinatorial Multi-armed Bandits |
Xutong Liu et.al. |
2501.19300 |
null |
2025-01-31 |
Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes |
Zhiyao Xu et.al. |
2501.19298 |
null |
2025-01-31 |
Analysis of LLMs vs Human Experts in Requirements Engineering |
Cory Hymel et.al. |
2501.19297 |
null |
2025-01-31 |
Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators |
Kunpeng Zhang et.al. |
2501.19282 |
null |
2025-01-31 |
Pheromone-based Learning of Optimal Reasoning Paths |
Anirudh Chari et.al. |
2501.19278 |
null |
2025-01-31 |
From Assistance to Autonomy – A Researcher Study on the Potential of AI Support for Qualitative Data Analysis |
Elisabeth Kirsten et.al. |
2501.19275 |
null |
2025-01-31 |
Jackpot! Alignment as a Maximal Lottery |
Roberto-Rafael Maura-Rivero et.al. |
2501.19266 |
null |
2025-01-31 |
Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge |
Amogh Joshi et.al. |
2501.19259 |
null |
2025-01-31 |
A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation |
Yunzhe Li et.al. |
2501.19232 |
null |
2025-01-31 |
Autonomous Legacy Web Application Upgrades Using a Multi-Agent System |
Valtteri Ala-Salmi et.al. |
2501.19204 |
null |
2025-02-03 |
Improving the Robustness of Representation Misdirection for Large Language Model Unlearning |
Dang Huu-Tien et.al. |
2501.19202 |
link |
2025-01-31 |
Efficient Reasoning with Hidden Thinking |
Xuan Shen et.al. |
2501.19201 |
link |
2025-01-31 |
Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning |
Xianglin Yang et.al. |
2501.19180 |
null |
2025-01-31 |
No Foundations without Foundations – Why semi-mechanistic models are essential for regulatory biology |
Luka Kovačević et.al. |
2501.19178 |
null |
2025-01-31 |
Position: Contextual Integrity Washing for Language Models |
Yan Shvartzshnaider et.al. |
2501.19173 |
null |
2025-01-31 |
Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs |
Kejia Zhang et.al. |
2501.19164 |
null |
2025-01-31 |
A theoretical framework for overfitting in energy-based modeling |
Giovanni Catania et.al. |
2501.19158 |
null |
2025-01-31 |
A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator |
Sixiao Huang et.al. |
2501.19135 |
null |
2025-01-31 |
Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations |
Sihwan Park et.al. |
2501.19099 |
null |
2025-01-31 |
Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data |
Xichen Xu et.al. |
2501.19094 |
null |
2025-01-31 |
Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models |
Jialin Zhao et.al. |
2501.19090 |
null |
2025-01-31 |
Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification |
Xiangyu Sun et.al. |
2501.19086 |
null |
2025-01-31 |
Enhancing Code Generation for Low-Resource Languages: No Silver Bullet |
Alessandro Giagnorio et.al. |
2501.19085 |
null |
2025-01-31 |
Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations |
Dahye Kim et.al. |
2501.19066 |
link |
2025-01-31 |
TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs |
Yan Sun et.al. |
2501.19057 |
null |
2025-01-31 |
Enabling Autonomic Microservice Management through Self-Learning Agents |
Fenglin Yu et.al. |
2501.19056 |
null |
2025-01-31 |
Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models |
Ruiyu Wang et.al. |
2501.19054 |
null |
2025-01-31 |
Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors |
Simon Idoko et.al. |
2501.19042 |
link |
2025-01-31 |
Towards the Worst-case Robustness of Large Language Models |
Huanran Chen et.al. |
2501.19040 |
null |
2025-01-31 |
Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs |
Hongliang Li et.al. |
2501.19036 |
null |
2025-01-31 |
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses |
Bo Lan et.al. |
2501.19034 |
link |
2025-01-31 |
Multilayer Networks in Neuroimaging |
Vesna Vuksanovic et.al. |
2501.19024 |
null |
2025-01-31 |
Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation |
Bin Zhu et.al. |
2501.19017 |
null |
2025-01-31 |
Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities |
Arjun Krishna et.al. |
2501.19012 |
null |
2025-01-31 |
Visual Autoregressive Modeling for Image Super-Resolution |
Yunpeng Qu et.al. |
2501.18993 |
null |
2025-01-31 |
Symmetric Pruning of Large Language Models |
Kai Yi et.al. |
2501.18980 |
null |
2025-01-31 |
BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics |
Yuxuan Liu et.al. |
2501.18972 |
null |
2025-01-31 |
Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping |
Pu Yang et.al. |
2501.18962 |
null |
2025-01-31 |
Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow |
Alfred Bexley et.al. |
2501.18957 |
null |
2025-01-31 |
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models |
Shenghao Fu et.al. |
2501.18954 |
link |
2025-01-31 |
TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment |
Zi-Jian Cheng et.al. |
2501.18935 |
link |
2025-01-31 |
Language Games as the Pathway to Artificial Superhuman Intelligence |
Ying Wen et.al. |
2501.18924 |
null |
2025-01-31 |
KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search |
Haoran Luo et.al. |
2501.18922 |
link |
2025-01-31 |
LLM Program Optimization via Retrieval Augmented Search |
Sagnik Anupam et.al. |
2501.18916 |
null |
2025-01-31 |
Scaling Laws for Differentially Private Language Models |
Ryan McKenna et.al. |
2501.18914 |
null |
2025-01-31 |
Streamlining Security Vulnerability Triage with Large Language Models |
Mohammad Jalili Torkamani et.al. |
2501.18908 |
null |
2025-01-31 |
Trustworthy Evaluation of Generative AI Models |
Zijun Gao et.al. |
2501.18897 |
null |
2025-01-31 |
Can We Predict the Effect of Prompts? |
Jae Yong Lee et.al. |
2501.18883 |
null |
2025-01-31 |
Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models |
Jiaqi Tang et.al. |
2501.18863 |
null |
2025-01-31 |
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning |
Han Zhong et.al. |
2501.18858 |
null |
2025-01-31 |
Equivariant Hypergraph Diffusion for Crystal Structure Prediction |
Yang Liu et.al. |
2501.18850 |
null |
2025-01-31 |
Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities |
Yaping Chai et.al. |
2501.18845 |
null |
2025-01-31 |
Trading Inference-Time Compute for Adversarial Robustness |
Wojciech Zaremba et.al. |
2501.18841 |
null |
2025-01-31 |
Partially Rewriting a Transformer in Natural Language |
Gonçalo Paulo et.al. |
2501.18838 |
null |
2025-01-31 |
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming |
Mrinank Sharma et.al. |
2501.18837 |
null |
2025-01-31 |
Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential |
Chenyu Gao et.al. |
2501.18834 |
null |
2025-01-31 |
Structural Embedding Projection for Contextual Large Language Model Inference |
Vincent Enoasmo et.al. |
2501.18826 |
null |
2025-01-31 |
Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies |
Andrey Borro et.al. |
2501.18817 |
link |
2025-01-31 |
Large Language Models as Common-Sense Heuristics |
Andrey Borro et.al. |
2501.18816 |
null |
2025-01-30 |
Compositional Generalization Requires More Than Disentangled Representations |
Qiyao Liang et.al. |
2501.18797 |
null |
2025-01-30 |
Rope to Nope and Back Again: A New Hybrid Attention Strategy |
Bowen Yang et.al. |
2501.18795 |
null |
2025-01-30 |
Survey and Improvement Strategies for Gene Prioritization with Large Language Models |
Matthew Neeley et.al. |
2501.18794 |
null |
2025-01-30 |
LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore? |
Alexander Tuisov et.al. |
2501.18784 |
null |
2025-01-30 |
Navigating the Fragrance space Via Graph Generative Models And Predicting Odors |
Mrityunjay Sharma et.al. |
2501.18777 |
link |
2025-01-30 |
Probabilistic Joint Recovery Method for CO $_2$ Plume Monitoring |
Zijun Deng et.al. |
2501.18761 |
null |
2025-01-30 |
Synthetic Data Generation for Augmenting Small Samples |
Dan Liu et.al. |
2501.18741 |
null |
2025-01-30 |
Examining the Robustness of Large Language Models across Language Complexity |
Jiayi Zhang et.al. |
2501.18738 |
null |
2025-01-30 |
Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks |
Mohd. Farhan Israk Soumik et.al. |
2501.18727 |
null |
2025-01-30 |
Strong and Controllable 3D Motion Generation |
Canxuan Gang et.al. |
2501.18726 |
null |
2025-01-30 |
Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning |
Maya Kruse et.al. |
2501.18724 |
null |
2025-02-03 |
Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps |
Devansh Bhardwaj et.al. |
2501.18712 |
null |
2025-01-30 |
Regularized second-order optimization of tensor-network Born machines |
Matan Ben-Dov et.al. |
2501.18691 |
null |
2025-01-30 |
Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting |
Yansong Qu et.al. |
2501.18672 |
null |
2025-01-30 |
Foundational Models for 3D Point Clouds: A Survey and Outlook |
Vishal Thengane et.al. |
2501.18594 |
null |
2025-01-30 |
Diffusion Autoencoders are Scalable Image Tokenizers |
Yinbo Chen et.al. |
2501.18593 |
null |
2025-02-03 |
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models |
Hao Dong et.al. |
2501.18592 |
link |
2025-01-30 |
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs |
Yue Wang et.al. |
2501.18585 |
null |
2025-01-30 |
Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH |
Evgenii Evstafev et.al. |
2501.18576 |
null |
2025-01-30 |
BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos |
Lehao Lin et.al. |
2501.18565 |
null |
2025-01-30 |
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation |
Haoquan Fang et.al. |
2501.18564 |
null |
2025-01-30 |
Semantic Web and Creative AI – A Technical Report from ISWS 2023 |
Raia Abu Ahmad et.al. |
2501.18542 |
null |
2025-01-30 |
Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges |
Manveer Singh Tamber et.al. |
2501.18536 |
link |
2025-01-30 |
Differentially Private Steering for Large Language Model Alignment |
Anmol Goel et.al. |
2501.18532 |
link |
2025-01-30 |
Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models |
Guanqun Cao et.al. |
2501.18516 |
null |
2025-01-30 |
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch |
Arthur Douillard et.al. |
2501.18512 |
null |
2025-01-30 |
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training |
Benjamin Feuer et.al. |
2501.18511 |
link |
2025-01-30 |
CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction |
Peter J. Bentley et.al. |
2501.18504 |
null |
2025-01-30 |
Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline |
Shivani Kapania et.al. |
2501.18493 |
null |
2025-01-30 |
A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models |
Changshu Liu et.al. |
2501.18482 |
null |
2025-01-30 |
CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization |
Yanxia Deng et.al. |
2501.18475 |
null |
2025-01-30 |
Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations |
Chengxi Zeng et.al. |
2501.18474 |
null |
2025-01-30 |
ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation |
Minghua He et.al. |
2501.18460 |
null |
2025-01-30 |
CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering |
Yumeng Wang et.al. |
2501.18457 |
null |
2025-01-30 |
GENIE: Generative Note Information Extraction model for structuring EHR data |
Huaiyuan Ying et.al. |
2501.18435 |
null |
2025-01-30 |
Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation |
Youngjoon Lee et.al. |
2501.18416 |
null |
2025-01-30 |
RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects |
Yiteng Tu et.al. |
2501.18365 |
link |
2025-01-30 |
A Video-grounded Dialogue Dataset and Metric for Event-driven Activities |
Wiradee Imrattanatrai et.al. |
2501.18324 |
link |
2025-01-30 |
Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach |
Tianpeng Pan et.al. |
2501.18320 |
null |
2025-01-30 |
Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models |
Jennifer D’Souza et.al. |
2501.18287 |
null |
2025-01-30 |
Jailbreaking LLMs’ Safeguard with Universal Magic Words for Text Embedding Models |
Haoyu Liang et.al. |
2501.18280 |
null |
2025-01-30 |
Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence |
Kevin Roitero et.al. |
2501.18265 |
null |
2025-01-30 |
How to Select Datapoints for Efficient Human Evaluation of NLG Models? |
Vilém Zouhar et.al. |
2501.18251 |
link |
2025-01-30 |
Statistical multi-metric evaluation and visualization of LLM system predictive performance |
Samuel Ackerman et.al. |
2501.18243 |
null |
2025-01-30 |
Contextually Structured Token Dependency Encoding for Large Language Models |
James Blades et.al. |
2501.18205 |
null |
2025-01-30 |
Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents |
ShuiDe Wen et.al. |
2501.18190 |
null |
2025-01-30 |
Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation |
Teddy Lazebnik et.al. |
2501.18177 |
null |
2025-01-30 |
Continually Evolved Multimodal Foundation Models for Cancer Prognosis |
Jie Peng et.al. |
2501.18170 |
null |
2025-01-30 |
RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing |
Jinyao Guo et.al. |
2501.18160 |
null |
2025-01-30 |
Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study |
Yuchen Lei et.al. |
2501.18158 |
null |
2025-01-30 |
Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models |
Wanlong Liu et.al. |
2501.18154 |
null |
2025-01-30 |
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models |
Qika Lin et.al. |
2501.18119 |
null |
2025-01-30 |
Scaling Inference-Efficient Language Models |
Song Bian et.al. |
2501.18107 |
null |
2025-01-30 |
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation |
Yibo Wang et.al. |
2501.18100 |
link |
2025-01-30 |
AlphaAdam:Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates |
Da Chang et.al. |
2501.18094 |
null |
2025-01-30 |
Normative Evaluation of Large Language Models with Everyday Moral Dilemmas |
Pratik S. Sachdeva et.al. |
2501.18081 |
null |
2025-01-30 |
FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models |
Spencer Mateega et.al. |
2501.18062 |
null |
2025-01-29 |
RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems |
Duy A. Nguyen et.al. |
2501.18056 |
null |
2025-01-29 |
Current Pathology Foundation Models are unrobust to Medical Center Differences |
Edwin D. de Jong et.al. |
2501.18055 |
null |
2025-01-29 |
A Proximal Operator for Inducing 2:4-Sparsity |
Jonas M Kübler et.al. |
2501.18015 |
null |
2025-01-29 |
Large Language Models Think Too Fast To Explore Effectively |
Lan Pan et.al. |
2501.18009 |
null |
2025-01-29 |
Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces |
Neetha Jambigi et.al. |
2501.18005 |
null |
2025-01-29 |
InnerThoughts: Disentangling Representations and Predictions in Large Language Models |
Didier Chételat et.al. |
2501.17994 |
null |
2025-01-29 |
Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study |
Marwah Alaofi et.al. |
2501.17981 |
link |
2025-01-29 |
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization |
Zishun Yu et.al. |
2501.17974 |
null |
2025-01-29 |
“I Would Never Trust Anything Western”: Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools |
Manas Mhasakar et.al. |
2501.17942 |
null |
2025-01-29 |
DReSS: Data-driven Regularized Structured Streamlining for Large Language Models |
Mingkuan Feng et.al. |
2501.17905 |
null |
2025-01-29 |
Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning? |
Pouya Pezeshkpour et.al. |
2501.17840 |
link |
2025-01-29 |
Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology |
Sobhan Hemati et.al. |
2501.17822 |
null |
2025-01-30 |
Leveraging Multimodal LLM for Inspirational User Interface Search |
Seokhyeon Park et.al. |
2501.17799 |
link |
2025-01-29 |
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights |
Chan-Jan Hsu et.al. |
2501.17790 |
null |
2025-01-29 |
AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing |
Peter Pak et.al. |
2501.17784 |
null |
2025-01-29 |
2SSP: A Two-Stage Framework for Structured Pruning of LLMs |
Fabrizio Sandri et.al. |
2501.17771 |
link |
2025-01-29 |
Generative Unordered Flow for Set-Structured Data Generation |
Yangming Li et.al. |
2501.17770 |
null |
2025-01-29 |
Hybrid Graphs for Table-and-Text based Question Answering using LLMs |
Ankush Agarwal et.al. |
2501.17767 |
null |
2025-01-29 |
On the Partitioning of GPU Power among Multi-Instances |
Tirth Vamja et.al. |
2501.17752 |
null |
2025-01-29 |
Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation |
Aitor Arrieta et.al. |
2501.17749 |
null |
2025-01-29 |
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches |
Ana R. Baião et.al. |
2501.17729 |
null |
2025-01-29 |
Using Code Generation to Solve Open Instances of Combinatorial Design Problems |
Christopher D. Rosin et.al. |
2501.17725 |
link |
2025-01-29 |
RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts |
Eujeong Choi et.al. |
2501.17715 |
link |
2025-01-29 |
Source-Channel Separation Theorems for Distortion Perception Coding |
Chao Tian et.al. |
2501.17706 |
null |
2025-01-29 |
Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching |
Xuzhe Dang et.al. |
2501.17665 |
null |
2025-01-30 |
In-Context Meta LoRA Generation |
Yihua Shao et.al. |
2501.17635 |
null |
2025-01-29 |
Uncertainty Quantification and Decomposition for LLM-based Recommendation |
Wonbin Kweon et.al. |
2501.17630 |
link |
2025-01-29 |
The Imitation Game According To Turing |
Sharon Temtsin et.al. |
2501.17629 |
null |
2025-01-29 |
Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment |
Jonathan Teel et.al. |
2501.17617 |
null |
2025-01-29 |
Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis |
Kunrong Li et.al. |
2501.17598 |
null |
2025-01-30 |
Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models |
Behraj Khan et.al. |
2501.17595 |
null |
2025-01-29 |
GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback |
Mohamed Abdelaal et.al. |
2501.17584 |
null |
2025-01-29 |
CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs |
Amey Hengle et.al. |
2501.17581 |
null |
2025-01-29 |
Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding |
Marco Pasini et.al. |
2501.17578 |
null |
2025-01-29 |
Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models |
Wooyoung Kim et.al. |
2501.17549 |
null |
2025-01-29 |
Towards Training-Free Open-World Classification with 3D Generative Models |
Xinzhe Xia et.al. |
2501.17547 |
null |
2025-01-29 |
Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant |
Gaole He et.al. |
2501.17546 |
link |
2025-01-29 |
Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison |
Martin Nizon-Deladoeuille et.al. |
2501.17539 |
null |
2025-01-29 |
Neural Spelling: A Spell-Based BCI System for Language Neural Decoding |
Xiaowei Jiang et.al. |
2501.17489 |
null |
2025-01-29 |
DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance |
Seffi Cohen et.al. |
2501.17479 |
link |
2025-01-29 |
AugmenTest: Enhancing Tests with LLM-Driven Oracles |
Shaker Mahmud Khandaker et.al. |
2501.17461 |
null |
2025-01-29 |
Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction |
Kaiwei Luo et.al. |
2501.17459 |
null |
2025-01-29 |
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation |
Tiansheng Huang et.al. |
2501.17433 |
link |
2025-01-29 |
Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models |
Yuxuan Li et.al. |
2501.17420 |
null |
2025-01-29 |
MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs |
Ved Sirdeshmukh et.al. |
2501.17399 |
link |
2025-01-29 |
Learning Free Token Reduction for Multi-Modal LLM |
Zihui Zhao et.al. |
2501.17391 |
null |
2025-01-29 |
Context-Aware Semantic Recomposition Mechanism for Large Language Models |
Richard Katrix et.al. |
2501.17386 |
null |
2025-01-28 |
Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations |
Md Tauhidul Islam et.al. |
2501.17347 |
null |
2025-01-28 |
Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction |
Mingyu Derek Ma et.al. |
2501.17326 |
null |
2025-01-28 |
CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data |
Lee Carlin et.al. |
2501.17324 |
null |
2025-01-30 |
Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding |
Yun-Shiuan Chuang et.al. |
2501.17310 |
null |
2025-01-28 |
“Ownership, Not Just Happy Talk”: Co-Designing a Participatory Large Language Model for Journalism |
Emily Tseng et.al. |
2501.17299 |
null |
2025-01-28 |
Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization |
Zilu Tang et.al. |
2501.17295 |
null |
2025-01-28 |
Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology |
Peilong Wang et.al. |
2501.17286 |
null |
2025-01-30 |
From Natural Language to Extensive-Form Game Representations |
Shilong Deng et.al. |
2501.17282 |
link |
2025-01-28 |
Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models |
Abdalaziz Al-Maeeni et.al. |
2501.17279 |
null |
2025-01-28 |
Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics |
Jasper Timm et.al. |
2501.17273 |
link |
2025-01-28 |
Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care |
Fengpei Yuan et.al. |
2501.17206 |
null |
2025-01-28 |
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training |
Tianzhe Chu et.al. |
2501.17161 |
null |
2025-01-28 |
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data |
Deren Lei et.al. |
2501.17144 |
link |
2025-01-28 |
ASTRAL: Automated Safety Testing of Large Language Models |
Miriam Ugarte et.al. |
2501.17132 |
null |
2025-01-28 |
Optimizing Large Language Model Training Using FP4 Quantization |
Ruizhe Wang et.al. |
2501.17116 |
null |
2025-01-28 |
Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction |
Carl-Leander Henneking et.al. |
2501.17112 |
null |
2025-01-28 |
Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics |
Guillaume Le Mailloux et.al. |
2501.17107 |
link |
2025-01-28 |
Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving |
Evgenii Evstafev et.al. |
2501.17084 |
null |
2025-01-28 |
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding |
Akash Kumar et.al. |
2501.17053 |
null |
2025-01-28 |
Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models |
Minghan Li et.al. |
2501.17039 |
null |
2025-01-28 |
Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies |
Manojkumar Parmar et.al. |
2501.17030 |
null |
2025-01-28 |
Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs |
Alessandro Midolo et.al. |
2501.17024 |
link |
2025-01-28 |
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement |
Kei Katsumata et.al. |
2501.17022 |
link |
2025-01-28 |
MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition |
Philippe Pasquier et.al. |
2501.17011 |
null |
2025-01-28 |
Large Language Models for Code Generation: The Practitioners Perspective |
Zeeshan Rasheed et.al. |
2501.16998 |
link |
2025-01-28 |
Artificial Intelligence Clones |
Annie Liang et.al. |
2501.16996 |
null |
2025-01-28 |
FedEFM: Federated Endovascular Foundation Model with Unseen Data |
Tuong Do et.al. |
2501.16992 |
null |
2025-01-28 |
Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver |
Shunya Minami et.al. |
2501.16986 |
null |
2025-01-28 |
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling |
Hongzhi Huang et.al. |
2501.16975 |
null |
2025-01-28 |
Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers |
Mohammad Raza et.al. |
2501.16961 |
null |
2025-01-28 |
Multiple Abstraction Level Retrieve Augment Generation |
Zheng Zheng et.al. |
2501.16952 |
null |
2025-01-29 |
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models |
Makoto Shing et.al. |
2501.16937 |
null |
2025-01-28 |
Detecting harassment and defamation in cyberbullying with emotion-adaptive training |
Peiling Yi et.al. |
2501.16925 |
link |
2025-01-28 |
RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains |
Shady Nasrat et.al. |
2501.16899 |
link |
2025-01-28 |
Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model |
Eoghan Cronin et.al. |
2501.16893 |
null |
2025-01-28 |
Irony Detection, Reasoning and Understanding in Zero-shot Learning |
Peiling Yi et.al. |
2501.16884 |
null |
2025-01-28 |
Comparing Human and LLM Generated Code: The Jury is Still Out! |
Sherlock A. Licorish et.al. |
2501.16857 |
null |
2025-01-28 |
Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis |
Tiao Tan et.al. |
2501.16842 |
null |
2025-01-28 |
Misspellings in Natural Language Processing: A survey |
Gianluca Sperduti et.al. |
2501.16836 |
null |
2025-01-28 |
DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model |
Josua Spisak et.al. |
2501.16800 |
null |
2025-01-28 |
Algorithm for Automatic Legislative Text Consolidation |
Matias Etcheverry et.al. |
2501.16794 |
null |
2025-01-28 |
Exponential Family Attention |
Kevin Christian Wibisono et.al. |
2501.16790 |
link |
2025-01-28 |
Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding |
Yun Li et.al. |
2501.16786 |
null |
2025-01-28 |
TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network |
Yumingzhi Pan et.al. |
2501.16784 |
null |
2025-01-28 |
A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process |
Jack David Carson et.al. |
2501.16783 |
null |
2025-01-29 |
Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models |
Muhammad Atta ur Rahman et.al. |
2501.16769 |
null |
2025-01-28 |
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation |
Chenguo Lin et.al. |
2501.16764 |
null |
2025-01-28 |
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns |
Xinyue Shen et.al. |
2501.16750 |
link |
2025-01-28 |
Through the Prism of Culture: Evaluating LLMs’ Understanding of Indian Subcultures and Traditions |
Garima Chhikara et.al. |
2501.16748 |
null |
2025-01-28 |
LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience |
Nimesh Jha et.al. |
2501.16744 |
null |
2025-01-28 |
Distilling Large Language Models for Network Active Queue Management |
Deol Satish et.al. |
2501.16734 |
null |
2025-01-28 |
xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking |
Sunbowen Lee et.al. |
2501.16727 |
link |
2025-01-28 |
One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning |
Chunpeng Zhou et.al. |
2501.16720 |
null |
2025-01-28 |
Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection |
Hengzhuang Li et.al. |
2501.16718 |
link |
2025-01-28 |
3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow |
Yueen Ma et.al. |
2501.16698 |
null |
2025-01-28 |
MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark |
Dongyi Yi et.al. |
2501.16688 |
null |
2025-01-28 |
Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting |
Li Yin et.al. |
2501.16673 |
link |
2025-01-28 |
VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records |
Philip Chung et.al. |
2501.16672 |
link |
2025-01-28 |
Contextual Reinforcement in Multimodal Token Compression for Large Language Models |
Naderdel Piero et.al. |
2501.16658 |
null |
2025-01-28 |
Large Language Model Critics for Execution-Free Evaluation of Code Changes |
Aashish Yadavally et.al. |
2501.16655 |
link |
2025-01-28 |
Molecular-driven Foundation Model for Oncologic Pathology |
Anurag Vaidya et.al. |
2501.16652 |
null |
2025-01-28 |
DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models |
Zeping Min et.al. |
2501.16650 |
null |
2025-01-28 |
An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue |
Koji Inoue et.al. |
2501.16643 |
null |
2025-01-28 |
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs |
Jinlan Fu et.al. |
2501.16629 |
link |
2025-01-28 |
Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems |
Baraa Hikal et.al. |
2501.16616 |
null |
2025-01-28 |
Sparse Autoencoders Trained on the Same Data Learn Different Features |
Gonçalo Paulo et.al. |
2501.16615 |
null |
2025-01-28 |
Fine-Tuned Language Models as Space Systems Controllers |
Enrico M. Zucchelli et.al. |
2501.16588 |
null |
2025-01-27 |
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models |
Zheng Lian et.al. |
2501.16566 |
null |
2025-01-27 |
LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation |
Farzad Farhadzadeh et.al. |
2501.16559 |
null |
2025-01-27 |
Distributional Information Embedding: A Framework for Multi-bit Watermarking |
Haiyun He et.al. |
2501.16558 |
null |
2025-01-27 |
PackDiT: Joint Human Motion and Text Generation via Mutual Prompting |
Zhongyu Jiang et.al. |
2501.16551 |
null |
2025-01-27 |
PhysAnimator: Physics-Guided Generative Cartoon Animation |
Tianyi Xie et.al. |
2501.16550 |
null |
2025-01-27 |
Sample-Efficient Behavior Cloning Using General Domain Knowledge |
Feiyu Zhu et.al. |
2501.16546 |
null |
2025-01-27 |
Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees |
Piyush Gupta et.al. |
2501.16539 |
null |
2025-01-27 |
Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs |
Jean-Charles Noirot Ferrand et.al. |
2501.16534 |
null |
2025-01-27 |
A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain |
Jorge del Pozo Lérida et.al. |
2501.16533 |
null |
2025-01-27 |
Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction |
Atharva Naik et.al. |
2501.16524 |
null |
2025-01-27 |
How well can LLMs Grade Essays in Arabic? |
Rayed Ghazawi et.al. |
2501.16516 |
null |
2025-01-27 |
Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models |
Sudarshan Kamath Barkur et.al. |
2501.16513 |
null |
2025-01-27 |
Smoothed Embeddings for Robust Language Models |
Ryo Hase et.al. |
2501.16497 |
null |
2025-01-27 |
Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations |
Pablo Valenzuela-Toledo et.al. |
2501.16495 |
null |
2025-01-27 |
Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM |
Payal Kamboj et.al. |
2501.16481 |
link |
2025-01-27 |
Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation |
Philip Hughes et.al. |
2501.16467 |
null |
2025-01-27 |
CoCoNUT: Structural Code Understanding does not fall out of a tree |
Claas Beger et.al. |
2501.16456 |
link |
2025-01-27 |
Detecting Zero-Day Attacks in Digital Substations via In-Context Learning |
Faizan Manzoor et.al. |
2501.16453 |
null |
2025-01-27 |
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation |
Hamed Firooz et.al. |
2501.16450 |
null |
2025-01-27 |
DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation |
Han Sun et.al. |
2501.16410 |
null |
2025-01-27 |
Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology |
Meiyun Cao et.al. |
2501.16309 |
null |
2025-01-27 |
RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval |
Long Nguyen et.al. |
2501.16303 |
null |
2025-01-27 |
Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width |
Zheng Liu et.al. |
2501.16302 |
null |
2025-01-27 |
Large Models in Dialogue for Active Perception and Anomaly Detection |
Tzoulio Chamiti et.al. |
2501.16300 |
link |
2025-01-27 |
FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers |
Renshan Zhang et.al. |
2501.16297 |
null |
2025-01-27 |
Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models |
Jing Zhang et.al. |
2501.16282 |
null |
2025-01-27 |
Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation |
Jiayi Hong et.al. |
2501.16277 |
link |
2025-01-27 |
URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT |
Long Nguyen et.al. |
2501.16276 |
null |
2025-01-27 |
A foundation model for human-AI collaboration in medical literature mining |
Zifeng Wang et.al. |
2501.16255 |
null |
2025-01-27 |
Multi-Agent Geospatial Copilots for Remote Sensing Workflows |
Chaehong Lee et.al. |
2501.16254 |
null |
2025-01-27 |
Zero-Shot Decision Tree Construction via Large Language Models |
Lucas Carrasco et.al. |
2501.16247 |
null |
2025-01-27 |
CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation |
Xiaochuan Ma et.al. |
2501.16246 |
null |
2025-01-27 |
Phase Transitions in Large Language Models and the $O(N)$ Model |
Youran Sun et.al. |
2501.16241 |
null |
2025-01-27 |
AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses |
Runze Cai et.al. |
2501.16240 |
null |
2025-01-28 |
Distilling foundation models for robust and efficient models in digital pathology |
Alexandre Filiot et.al. |
2501.16239 |
null |
2025-01-27 |
Language-Based Bayesian Optimization Research Assistant (BORA) |
Abdoulatif Cissé et.al. |
2501.16224 |
null |
2025-01-27 |
Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models |
Huayu Li et.al. |
2501.16215 |
link |
2025-01-27 |
Provence: efficient and robust context pruning for retrieval-augmented generation |
Nadezhda Chirkova et.al. |
2501.16214 |
null |
2025-01-27 |
Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs |
Antony Bartlett et.al. |
2501.16191 |
null |
2025-01-27 |
SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting |
Wenxuan Xie et.al. |
2501.16178 |
link |
2025-01-27 |
BAG: Body-Aligned 3D Wearable Asset Generation |
Zhongjin Luo et.al. |
2501.16177 |
null |
2025-01-27 |
Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma |
Richard Willis et.al. |
2501.16173 |
link |
2025-01-27 |
MetaDecorator: Generating Immersive Virtual Tours through Multimodality |
Shuang Xie et.al. |
2501.16164 |
null |
2025-01-27 |
CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge |
Yuwei Zhang et.al. |
2501.16155 |
null |
2025-01-27 |
AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought |
Xin Huang et.al. |
2501.16154 |
null |
2025-01-27 |
AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants |
Pascal J. Sager et.al. |
2501.16150 |
null |
2025-01-27 |
PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing |
Yuwei Zhang et.al. |
2501.16149 |
null |
2025-01-27 |
SampleLLM: Optimizing Tabular Data Synthesis in Recommendations |
Jingtong Gao et.al. |
2501.16125 |
null |
2025-01-27 |
Using Generative Models to Produce Realistic Populations of UK Windstorms |
Yee Chun Tsoi et.al. |
2501.16110 |
null |
2025-01-27 |
Integration of LLM Quality Assurance into an NLG System |
Ching-Yi Chen et.al. |
2501.16078 |
null |
2025-01-27 |
PISCO: Pretty Simple Compression for Retrieval-Augmented Generation |
Maxime Louis et.al. |
2501.16075 |
null |
2025-01-27 |
A generative material transformer using Wyckoff representation |
Pierre-Paul De Breuck et.al. |
2501.16051 |
null |
2025-01-27 |
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation |
Xing Zhang et.al. |
2501.16050 |
null |
2025-01-27 |
PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment |
Vincent Freiberger et.al. |
2501.16033 |
null |
2025-01-27 |
FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments |
Zhiyuan Fu et.al. |
2501.16029 |
null |
2025-01-27 |
Transformability reveals the interplay of dynamics across different network orders |
Ming Xie et.al. |
2501.16016 |
null |
2025-01-27 |
TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference |
Jack Min Ong et.al. |
2501.16007 |
null |
2025-01-27 |
EDSep: An Effective Diffusion-Based Method for Speech Source Separation |
Jinwei Dong et.al. |
2501.15965 |
null |
2025-01-27 |
Rethinking the Bias of Foundation Model under Long-tailed Distribution |
Jiahao Chen et.al. |
2501.15955 |
null |
2025-01-27 |
Understanding Long Videos via LLM-Powered Entity Relation Graphs |
Meng Chu et.al. |
2501.15953 |
null |
2025-01-27 |
TimeHF: Billion-Scale Time Series Models Guided by Human Feedback |
Yongzhi Qi et.al. |
2501.15942 |
null |
2025-01-27 |
SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub |
Benjamin C. Carter et.al. |
2501.15922 |
null |
2025-01-27 |
Parametric Retrieval Augmented Generation |
Weihang Su et.al. |
2501.15915 |
link |
2025-01-27 |
Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation |
Muhammad Taha Tariq et.al. |
2501.15901 |
null |
2025-01-27 |
Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects |
Victor Deng et.al. |
2501.15900 |
null |
2025-01-27 |
Adaptive Width Neural Networks |
Federico Errica et.al. |
2501.15889 |
null |
2025-01-27 |
LCTG Bench: LLM Controlled Text Generation Benchmark |
Kentaro Kurihara et.al. |
2501.15875 |
link |
2025-01-27 |
LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models |
Yuewen Mei et.al. |
2501.15850 |
null |
2025-01-27 |
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model |
Delin Qu et.al. |
2501.15830 |
null |
2025-01-27 |
Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference |
Tharindu B. Hewage et.al. |
2501.15829 |
link |
2025-01-27 |
MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer |
Qi Chen et.al. |
2501.15826 |
null |
2025-01-27 |
LemmaHead: RAG Assisted Proof Generation Using Large Language Models |
Tianbo Yang et.al. |
2501.15797 |
null |
2025-01-27 |
Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? |
Zhiling Chen et.al. |
2501.15795 |
null |
2025-01-27 |
Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs |
Yu Li et.al. |
2501.15791 |
link |
2025-01-27 |
Memorization and Regularization in Generative Diffusion Models |
Ricardo Baptista et.al. |
2501.15785 |
link |
2025-01-27 |
Large Language Models to Diffusion Finetuning |
Edoardo Cetin et.al. |
2501.15781 |
null |
2025-01-27 |
Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages |
Ivory Yang et.al. |
2501.15773 |
link |
2025-01-27 |
GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design |
Yuanfu Sun et.al. |
2501.15755 |
null |
2025-01-27 |
IndicMMLU-Pro: Benchmarking the Indic Large Language Models |
Sankalp KJ et.al. |
2501.15747 |
null |
2025-01-27 |
Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning |
Michael Xieyang Liu et.al. |
2501.15727 |
null |
2025-01-27 |
A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks |
Dong Li et.al. |
2501.15724 |
null |
2025-01-27 |
On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models |
Tadahiro Taniguchi et.al. |
2501.15721 |
null |
2025-01-26 |
Adapting Biomedical Abstracts into Plain language using Large Language Models |
Haritha Gangavarapu et.al. |
2501.15700 |
null |
2025-01-26 |
TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs |
Yuxuan Gu et.al. |
2501.15674 |
null |
2025-01-26 |
Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting |
Yuxin Zhang et.al. |
2501.15641 |
null |
2025-01-26 |
BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation |
Ali Khodabandeh Yalabadi et.al. |
2501.15631 |
link |
2025-01-26 |
Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets |
Eduard Barbu et.al. |
2501.15624 |
null |
2025-01-26 |
Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning |
Zeyu Gan et.al. |
2501.15602 |
link |
2025-01-26 |
Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals |
Yinzhou Wang et.al. |
2501.15599 |
null |
2025-01-26 |
Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images |
Sichen Zhu et.al. |
2501.15598 |
link |
2025-01-26 |
SedarEval: Automated Evaluation using Self-Adaptive Rubrics |
Zhiyuan Fan et.al. |
2501.15595 |
link |
2025-01-26 |
SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain |
Dakuan Lu et.al. |
2501.15587 |
link |
2025-01-26 |
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework |
Yuhong Sun et.al. |
2501.15581 |
null |
2025-01-26 |
Instruction Tuning for Story Understanding and Generation with Weak Supervision |
Yangshu Yuan et.al. |
2501.15574 |
null |
2025-01-26 |
Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models |
Spencer Ramsey et.al. |
2501.15571 |
null |
2025-01-26 |
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer |
Lin Yueyu et.al. |
2501.15570 |
link |
2025-01-26 |
Ocean-OCR: Towards General OCR Application via a Vision-Language Model |
Song Chen et.al. |
2501.15558 |
null |
2025-01-26 |
Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles |
Hanwen Zhang et.al. |
2501.15544 |
null |
2025-01-26 |
Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths |
Yueyang Wang et.al. |
2501.15522 |
null |
2025-01-26 |
Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification |
Dan Song et.al. |
2501.15503 |
null |
2025-01-26 |
Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning |
Xiaohan Yu et.al. |
2501.15470 |
null |
2025-01-26 |
Data-adaptive Safety Rules for Training Reward Models |
Xiaomin Li et.al. |
2501.15453 |
null |
2025-01-26 |
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas |
Xiaoyang Wang et.al. |
2501.15427 |
null |
2025-01-26 |
Visual Generation Without Guidance |
Huayu Chen et.al. |
2501.15420 |
link |
2025-01-26 |
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement |
Junan Zhang et.al. |
2501.15417 |
null |
2025-01-26 |
The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation |
Raha Aghaei et.al. |
2501.15411 |
null |
2025-01-26 |
Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency |
Irin Kabakum et.al. |
2501.15405 |
null |
2025-01-26 |
How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning |
Tohida Rehman et.al. |
2501.15398 |
null |
2025-01-26 |
Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations |
Zijun Long et.al. |
2501.15379 |
null |
2025-01-26 |
How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback |
Manzong Huang et.al. |
2501.15378 |
null |
2025-01-26 |
Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models |
Melkamu Abay Mersha et.al. |
2501.15374 |
null |
2025-01-26 |
Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis |
Robinson Umeike et.al. |
2501.15370 |
null |
2025-01-26 |
Decentralized Low-Rank Fine-Tuning of Large Language Models |
Sajjad Ghiasvand et.al. |
2501.15361 |
null |
2025-01-26 |
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection |
Bo Yang et.al. |
2501.15355 |
null |
2025-01-25 |
Fairness in LLM-Generated Surveys |
Andrés Abeliuk et.al. |
2501.15351 |
null |
2025-01-25 |
Between Puppet and Actor: Reframing Authorship in this Age of AI Agents |
Yuqian Sun et.al. |
2501.15346 |
null |
2025-01-25 |
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data |
Jiajie Li et.al. |
2501.15326 |
null |
2025-01-25 |
ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning |
Shangqian Gao et.al. |
2501.15316 |
null |
2025-01-25 |
The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? |
Ayo Adedeji et.al. |
2501.15310 |
null |
2025-01-25 |
You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning |
Ayan Sengupta et.al. |
2501.15296 |
null |
2025-01-24 |
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation |
Xin Zhou et.al. |
2501.14729 |
link |
2025-01-24 |
Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? |
Ipek Baris Schlicht et.al. |
2501.14719 |
null |
2025-01-24 |
Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models |
Naihao Deng et.al. |
2501.14717 |
null |
2025-01-24 |
FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing |
James Seale Smith et.al. |
2501.14713 |
null |
2025-01-24 |
The Karp Dataset |
Mason DiCicco et.al. |
2501.14705 |
null |
2025-01-24 |
Rethinking Table Instruction Tuning |
Naihao Deng et.al. |
2501.14693 |
null |
2025-01-24 |
Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST |
Fuping Wu et.al. |
2501.14685 |
null |
2025-01-24 |
An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations |
Shabnam Hassani et.al. |
2501.14683 |
null |
2025-01-24 |
Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning |
Jisi Zhang et.al. |
2501.14680 |
null |
2025-01-24 |
MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications |
Yixing Jiang et.al. |
2501.14654 |
link |
2025-01-24 |
Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion |
Ziyao Xu et.al. |
2501.14649 |
link |
2025-01-24 |
Towards Scalable Topological Regularizers |
Hiu-Tung Wong et.al. |
2501.14641 |
null |
2025-01-24 |
Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics |
Renato Ghisellini et.al. |
2501.14634 |
null |
2025-01-24 |
Extracting Problem Structure with LLMs for Optimized SAT Local Search |
André Schilder et.al. |
2501.14630 |
null |
2025-01-24 |
Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data |
Jordi Abante et.al. |
2501.14615 |
null |
2025-01-24 |
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations |
Tianming Liang et.al. |
2501.14607 |
null |
2025-01-24 |
Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research |
Hamid Sarmadi et.al. |
2501.14546 |
null |
2025-01-24 |
VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning |
Benjamin Callewaert et.al. |
2501.14540 |
null |
2025-01-24 |
Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models |
Zhenguang Zhong et.al. |
2501.14530 |
link |
2025-01-24 |
Scene Understanding Enabled Semantic Communication with Open Channel Coding |
Zhe Xiang et.al. |
2501.14520 |
null |
2025-01-24 |
Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel |
Zhuoran Liu et.al. |
2501.14512 |
null |
2025-01-24 |
Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course |
Pavlin G. Poličar et.al. |
2501.14499 |
null |
2025-01-24 |
Evaluating and Improving Graph to Text Generation with Large Language Models |
Jie He et.al. |
2501.14497 |
link |
2025-01-24 |
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques |
Zhengyang Tang et.al. |
2501.14492 |
link |
2025-01-24 |
Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design |
Taehan Kim et.al. |
2501.14469 |
null |
2025-01-24 |
Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis |
Xiujing Guo et.al. |
2501.14465 |
null |
2025-01-24 |
Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing |
Zeping Yu et.al. |
2501.14457 |
null |
2025-01-24 |
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains |
Xu Chu et.al. |
2501.14431 |
null |
2025-01-24 |
GraphBC: Improving LLMs for Better Graph Data Processing |
Xu Chu et.al. |
2501.14427 |
null |
2025-01-24 |
CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios |
Michael Fuest et.al. |
2501.14426 |
null |
2025-01-24 |
DeepFlow: Serverless Large Language Model Serving at Scale |
Junhao Hu et.al. |
2501.14417 |
null |
2025-01-24 |
SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation |
Shengjie Wang et.al. |
2501.14400 |
null |
2025-01-24 |
ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer |
Yoni Schirris et.al. |
2501.14379 |
link |
2025-01-24 |
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing |
Xinyu Ma et.al. |
2501.14371 |
link |
2025-01-24 |
Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches |
Ziad Sakr et.al. |
2501.14366 |
null |
2025-01-24 |
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration |
Kai-Tuo Xu et.al. |
2501.14350 |
link |
2025-01-24 |
Chain-of-Retrieval Augmented Generation |
Liang Wang et.al. |
2501.14342 |
null |
2025-01-24 |
Exploring the sustainable scaling of AI dilemma: A projective study of corporations’ AI environmental impacts |
Clément Desroches et.al. |
2501.14334 |
null |
2025-01-24 |
Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models |
Ridhi Jain et.al. |
2501.14326 |
null |
2025-01-24 |
PAID: A Framework of Product-Centric Advertising Image Design |
Hongyu Chen et.al. |
2501.14316 |
null |
2025-01-24 |
Locality-aware Fair Scheduling in LLM Serving |
Shiyi Cao et.al. |
2501.14312 |
null |
2025-01-24 |
A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education |
Calvin Yeung et.al. |
2501.14305 |
link |
2025-01-24 |
MASTER: A Multi-Agent System with LLM Specialized MCTS |
Bingzheng Gan et.al. |
2501.14304 |
null |
2025-01-24 |
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph |
Xujian Liang et.al. |
2501.14300 |
link |
2025-01-24 |
Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment |
Julian A. Schnabel et.al. |
2501.14296 |
null |
2025-01-24 |
Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes |
Sullam Jeoung et.al. |
2501.14294 |
link |
2025-01-24 |
Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches |
Feng Zhou et.al. |
2501.14291 |
null |
2025-01-24 |
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation |
Sadegh Mahdavi et.al. |
2501.14275 |
link |
2025-01-24 |
Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors |
Yi Zhao et.al. |
2501.14250 |
link |
2025-01-24 |
Humanity’s Last Exam |
Long Phan et.al. |
2501.14249 |
null |
2025-01-24 |
Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game |
Rong Ye et.al. |
2501.14225 |
null |
2025-01-24 |
Top Ten Challenges Towards Agentic Neural Graph Databases |
Jiaxin Bai et.al. |
2501.14224 |
null |
2025-01-24 |
TFG-Flow: Training-free Guidance in Multimodal Generative Flow |
Haowei Lin et.al. |
2501.14216 |
null |
2025-01-24 |
Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading |
Minrui Xu et.al. |
2501.14205 |
null |
2025-01-24 |
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking |
Runyi Hu et.al. |
2501.14195 |
link |
2025-01-24 |
Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models |
Saaduddin Mahmud et.al. |
2501.14189 |
null |
2025-01-24 |
GeoSim.AI: AI assistants for numerical simulations in geomechanics |
Yared W. Bekele et.al. |
2501.14186 |
null |
2025-01-24 |
AI Chatbots as Professional Service Agents: Developing a Professional Identity |
Wenwen Li et.al. |
2501.14179 |
null |
2025-01-24 |
Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models |
Yile Gu et.al. |
2501.14170 |
null |
2025-01-24 |
Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction |
Dongming Sheng et.al. |
2501.14144 |
null |
2025-01-23 |
Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation |
Derek Yotheringhay et.al. |
2501.14119 |
null |
2025-01-23 |
Domain-Factored Untrained Deep Prior for Spectrum Cartography |
Subash Timilsina et.al. |
2501.14116 |
null |
2025-01-23 |
MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning |
Joshua Davis et.al. |
2501.14105 |
link |
2025-01-23 |
StreamingRAG: Real-time Contextual Retrieval and Generation Framework |
Murugan Sankaradas et.al. |
2501.14101 |
null |
2025-01-23 |
Enhancing Biomedical Relation Extraction with Directionality |
Po-Ting Lai et.al. |
2501.14079 |
link |
2025-01-23 |
LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language |
Yubin Ge et.al. |
2501.14073 |
null |
2025-01-23 |
Efficient 2D CT Foundation Model for Contrast Phase Classification |
Benjamin Hou et.al. |
2501.14066 |
null |
2025-01-23 |
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models |
Jakob Krogh Petersen et.al. |
2501.14051 |
link |
2025-01-23 |
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps |
Andrey Palaev et.al. |
2501.14046 |
link |
2025-01-23 |
Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions |
Jianfeng Zhu et.al. |
2501.14037 |
null |
2025-01-23 |
CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation |
Guofeng Cui et.al. |
2501.13927 |
null |
2025-01-23 |
Improving Video Generation with Human Feedback |
Jie Liu et.al. |
2501.13918 |
null |
2025-01-23 |
Binary Diffusion Probabilistic Model |
Vitaliy Kinakh et.al. |
2501.13915 |
null |
2025-01-23 |
Analysis of Indic Language Capabilities in LLMs |
Aatman Vaidya et.al. |
2501.13912 |
null |
2025-01-23 |
Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models |
Linh Tran et.al. |
2501.13904 |
null |
2025-01-23 |
Exploring Finetuned Audio-LLM on Heart Murmur Features |
Adrian Florea et.al. |
2501.13884 |
null |
2025-01-23 |
The machine learning platform for developers of large systems |
Alexey Naikov et.al. |
2501.13881 |
null |
2025-01-23 |
A RAG-Based Institutional Assistant |
Gustavo Kuratomi et.al. |
2501.13880 |
null |
2025-01-23 |
On the Reasoning Capacity of AI Models and How to Quantify It |
Santosh Kumar Radha et.al. |
2501.13833 |
null |
2025-01-23 |
Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing |
Hao Zhang et.al. |
2501.13831 |
null |
2025-01-23 |
Hallucinations Can Improve Large Language Models in Drug Discovery |
Shuzhou Yuan et.al. |
2501.13824 |
null |
2025-01-23 |
Large Language Model driven Policy Exploration for Recommender Systems |
Jie Wang et.al. |
2501.13816 |
null |
2025-01-23 |
Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change |
Mowafak Allaham et.al. |
2501.13802 |
null |
2025-01-23 |
Parameter-Efficient Fine-Tuning for Foundation Models |
Dan Zhang et.al. |
2501.13787 |
link |
2025-01-23 |
Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling |
Tanya Rodchenko et.al. |
2501.13779 |
null |
2025-01-23 |
Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework |
Yoonsang Kim et.al. |
2501.13778 |
link |
2025-01-23 |
Do Large Language Models Truly Understand Geometric Structures? |
Xiaofeng Wang et.al. |
2501.13773 |
link |
2025-01-23 |
Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak |
Erjia Xiao et.al. |
2501.13772 |
null |
2025-01-23 |
UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models |
Xin Xu et.al. |
2501.13766 |
null |
2025-01-23 |
EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents |
Yuhui Yun et.al. |
2501.13746 |
null |
2025-01-23 |
GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification |
Te Pei et.al. |
2501.13743 |
null |
2025-01-23 |
An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities |
Zezhou Yang et.al. |
2501.13742 |
link |
2025-01-23 |
Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks |
Chang Gong et.al. |
2501.13731 |
null |
2025-01-23 |
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation |
Shi-Qi Yan et.al. |
2501.13726 |
null |
2025-01-23 |
Musical ethnocentrism in Large Language Models |
Anna Kruspe et.al. |
2501.13720 |
null |
2025-01-23 |
A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation |
Dario Serez et.al. |
2501.13718 |
null |
2025-01-23 |
EventVL: Understand Event Streams via Multimodal Large Language Model |
Pengteng Li et.al. |
2501.13707 |
null |
2025-01-23 |
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale |
Linghao Zhang et.al. |
2501.13699 |
null |
2025-01-23 |
Question Answering on Patient Medical Records with Private Fine-Tuned LLMs |
Sara Kothari et.al. |
2501.13687 |
null |
2025-01-23 |
HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor |
Zihui Wu et.al. |
2501.13677 |
link |
2025-01-23 |
How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization |
Shezheng Song et.al. |
2501.13669 |
null |
2025-01-23 |
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models |
Yizheng Sun et.al. |
2501.13652 |
null |
2025-01-23 |
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models |
Zhenghao Lin et.al. |
2501.13629 |
null |
2025-01-23 |
Text-to-SQL based on Large Language Models and Database Keyword Search |
Eduardo R. Nascimento et.al. |
2501.13594 |
null |
2025-01-23 |
Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization |
Lei Huang et.al. |
2501.13573 |
null |
2025-01-23 |
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt |
Tao Liu et.al. |
2501.13554 |
link |
2025-01-23 |
LLMs Can Plan Only If We Tell Them |
Bilgehan Sel et.al. |
2501.13545 |
null |
2025-01-23 |
ReasVQA: Advancing VideoQA with Imperfect Reasoning Process |
Jianxin Liang et.al. |
2501.13536 |
null |
2025-01-23 |
RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles |
Munachiso Nwadike et.al. |
2501.13491 |
null |
2025-01-23 |
Adaptive Testing for LLM-Based Applications: A Diversity-based Approach |
Juyeon Yoon et.al. |
2501.13480 |
null |
2025-01-23 |
LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation |
JiaXin Chen et.al. |
2501.13475 |
null |
2025-01-23 |
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge |
Haomiao Xiong et.al. |
2501.13468 |
link |
2025-01-23 |
Spurious Forgetting in Continual Learning of Language Models |
Junhao Zheng et.al. |
2501.13453 |
link |
2025-01-23 |
Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models |
Bo Gao et.al. |
2501.13428 |
null |
2025-01-23 |
Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling |
Tomek Jaroslawski et.al. |
2501.13415 |
null |
2025-01-23 |
VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework |
He Kong et.al. |
2501.13411 |
link |
2025-01-23 |
Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures |
Minglong Dong et.al. |
2501.13396 |
null |
2025-01-23 |
Can Large Language Models Understand Preferences in Personalized Recommendation? |
Zhaoxuan Tan et.al. |
2501.13391 |
link |
2025-01-23 |
Do as We Do, Not as You Think: the Conformity of Large Language Models |
Zhiyuan Weng et.al. |
2501.13381 |
link |
2025-01-23 |
Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility |
Gabrielle Hoyer et.al. |
2501.13376 |
null |
2025-01-23 |
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement |
Jae-Sung Bae et.al. |
2501.13372 |
null |
2025-01-23 |
Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification |
Yuzhuo Li et.al. |
2501.13368 |
null |
2025-01-23 |
50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications |
Zewei Shi et.al. |
2501.13351 |
null |
2025-01-23 |
MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize |
Haohang Xu et.al. |
2501.13349 |
null |
2025-01-23 |
Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation |
Rong Shan et.al. |
2501.13344 |
null |
2025-01-23 |
Multi-aspect Knowledge Distillation with Large Language Model |
Taegyeong Lee et.al. |
2501.13341 |
link |
2025-01-23 |
Generative Multi-Form Bayesian Optimization |
Zhendong Guo et.al. |
2501.13337 |
null |
2025-01-23 |
SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network |
Songge Zhang et.al. |
2501.13318 |
null |
2025-01-23 |
Representing Visualization Insights as a Dense Insight Network |
Jane Hoffswell et.al. |
2501.13309 |
null |
2025-01-23 |
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia |
Xuelong Geng et.al. |
2501.13306 |
link |
2025-01-23 |
Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers |
Akshit Achara et.al. |
2501.13302 |
link |
2025-01-23 |
Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents |
Shrinidhi Kumbhar et.al. |
2501.13299 |
null |
2025-01-23 |
RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering |
Yang Bai et.al. |
2501.13297 |
link |
2025-01-23 |
Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols |
John Joon Young Chung et.al. |
2501.13284 |
null |
2025-01-22 |
MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis |
Daeun Jung et.al. |
2501.13277 |
link |
2025-01-22 |
RAG-Reward: Optimizing RAG with Reward Modeling and RLHF |
Hanning Zhang et.al. |
2501.13264 |
null |
2025-01-22 |
Exploring GPT’s Ability as a Judge in Music Understanding |
Kun Fang et.al. |
2501.13261 |
link |
2025-01-22 |
Bypassing Array Canaries via Autonomous Function Call Resolution |
Nathaniel Oh et.al. |
2501.13256 |
link |
2025-01-22 |
S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning |
Yichen Wu et.al. |
2501.13198 |
null |
2025-01-22 |
Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century |
Axel Loewe et.al. |
2501.13142 |
null |
2025-01-23 |
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding |
Boqiang Zhang et.al. |
2501.13106 |
link |
2025-01-22 |
Robust Representation Consistency Model via Contrastive Denoising |
Jiachen Lei et.al. |
2501.13094 |
link |
2025-01-22 |
Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment |
Melissa Kazemi Rad et.al. |
2501.13080 |
null |
2025-01-22 |
Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning |
Bohao Yang et.al. |
2501.13042 |
link |
2025-01-22 |
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament |
Yantao Liu et.al. |
2501.13007 |
link |
2025-01-22 |
Neural network enhanced cross entropy benchmark for monitored circuits |
Yangrui Hu et.al. |
2501.13005 |
null |
2025-01-22 |
Large Language Model-Based Semantic Communication System for Image Transmission |
Soheyb Ribouh et.al. |
2501.12988 |
null |
2025-01-22 |
LLM4WM: Adapting LLM for Wireless Multi-Tasking |
Xuanyu Liu et.al. |
2501.12983 |
null |
2025-01-22 |
Low-dimensional adaptation of diffusion models: Convergence in total variation |
Jiadong Liang et.al. |
2501.12982 |
null |
2025-01-22 |
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models |
Chongren Sun et.al. |
2501.12975 |
link |
2025-01-22 |
Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs |
Jan Corazza et.al. |
2501.12972 |
null |
2025-01-22 |
It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act |
Kristof Meding et.al. |
2501.12962 |
null |
2025-01-22 |
Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference |
Weizhi Fei et.al. |
2501.12959 |
null |
2025-01-22 |
GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models |
Pengxiang Zhao et.al. |
2501.12956 |
null |
2025-01-22 |
3D Object Manipulation in a Single Image using Generative Models |
Ruisi Zhao et.al. |
2501.12935 |
null |
2025-01-22 |
Correctness Assessment of Code Generated by Large Language Models Using Internal Representations |
Tuan-Dung Bui et.al. |
2501.12934 |
null |
2025-01-22 |
DynamicEarth: How Far are We from Open-Vocabulary Change Detection? |
Kaiyu Li et.al. |
2501.12931 |
null |
2025-01-22 |
A Functional Software Reference Architecture for LLM-Integrated Systems |
Alessio Bucaioni et.al. |
2501.12904 |
null |
2025-01-22 |
Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration |
Offa Kingsleigh et.al. |
2501.12901 |
null |
2025-01-22 |
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback |
Yafu Li et.al. |
2501.12895 |
link |
2025-01-23 |
Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program |
Carlton Shepherd et.al. |
2501.12883 |
null |
2025-01-22 |
WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge |
Jingyuan Chen et.al. |
2501.12877 |
null |
2025-01-22 |
ACEBench: Who Wins the Match Point in Tool Learning? |
Chen Chen et.al. |
2501.12851 |
null |
2025-01-22 |
AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation |
Aghiles Kebaili et.al. |
2501.12840 |
null |
2025-01-22 |
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home |
Viktor Moskvoretskii et.al. |
2501.12835 |
null |
2025-01-22 |
Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek |
John Pavlopoulos et.al. |
2501.12826 |
link |
2025-01-22 |
Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks |
Alessio Quercia et.al. |
2501.12824 |
null |
2025-01-22 |
Certified Guidance for Planning with Deep Generative Models |
Francesco Giacomarra et.al. |
2501.12815 |
null |
2025-01-22 |
Revisit Self-Debugging with Self-Generated Tests for Code Generation |
Xiancai Chen et.al. |
2501.12793 |
null |
2025-01-22 |
LLMs as Repositories of Factual Knowledge: Limitations and Solutions |
Seyed Mahed Mousavi et.al. |
2501.12774 |
null |
2025-01-22 |
NExtLong: Toward Effective Long-Context Training without Long Documents |
Chaochen Gao et.al. |
2501.12766 |
link |
2025-01-22 |
Online Preference Alignment for Language Models via Count-based Exploration |
Chenjia Bai et.al. |
2501.12735 |
link |
2025-01-22 |
Paradigm-Based Automatic HDL Code Generation Using LLMs |
Wenhao Sun et.al. |
2501.12702 |
null |
2025-01-22 |
Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression |
Kai Yoshida et.al. |
2501.12698 |
null |
2025-01-22 |
Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering |
Qian Tao et.al. |
2501.12697 |
null |
2025-01-22 |
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling |
Shengshi Yao et.al. |
2501.12696 |
null |
2025-01-22 |
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation |
Yifan Yu et.al. |
2501.12689 |
null |
2025-01-22 |
Distillation Quantification for Large Language Models |
Sunbowen Lee et.al. |
2501.12619 |
link |
2025-01-22 |
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We? |
Taiming Wang et.al. |
2501.12617 |
null |
2025-01-22 |
Kimi k1.5: Scaling Reinforcement Learning with LLMs |
Kimi Team et.al. |
2501.12599 |
null |
2025-01-22 |
Leveraging LLMs to Create a Haptic Devices’ Recommendation System |
Yang Liu et.al. |
2501.12573 |
null |
2025-01-22 |
Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review |
Rock Yuren Pang et.al. |
2501.12557 |
link |
2025-01-21 |
Human-like conceptual representations emerge from language prediction |
Ningyu Xu et.al. |
2501.12547 |
null |
2025-01-21 |
How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models? |
Mirali Purohit et.al. |
2501.12535 |
null |
2025-01-21 |
An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts |
Dhia Elhaq Rzig et.al. |
2501.12521 |
null |
2025-01-21 |
A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data |
Minh Tran et.al. |
2501.12501 |
null |
2025-01-21 |
The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws |
Tian Jin et.al. |
2501.12486 |
null |
2025-01-21 |
An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models |
Xiaoyu Chu et.al. |
2501.12469 |
link |
2025-01-21 |
Adaptive PII Mitigation Framework for Large Language Models |
Shubhi Asthana et.al. |
2501.12465 |
null |
2025-01-21 |
Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management |
Arthur Vitui et.al. |
2501.12461 |
link |
2025-01-21 |
Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications |
Shubhi Asthana et.al. |
2501.12456 |
null |
2025-01-21 |
Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation |
Dongsheng Zhu et.al. |
2501.12432 |
null |
2025-01-21 |
FREYR: A Framework for Recognizing and Executing Your Requests |
Roberto Gallotta et.al. |
2501.12423 |
link |
2025-01-21 |
CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning |
Eunjee Choi et.al. |
2501.12422 |
null |
2025-01-22 |
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling |
Yi Wang et.al. |
2501.12386 |
link |
2025-01-21 |
Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks |
Greg Olmschenk et.al. |
2501.12383 |
null |
2025-01-21 |
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding |
Yilun Zhao et.al. |
2501.12380 |
link |
2025-01-22 |
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos |
Sili Chen et.al. |
2501.12375 |
null |
2025-01-21 |
Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists |
Thomas F. Eisenmann et.al. |
2501.12374 |
link |
2025-01-21 |
Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL |
Yeounoh Chung et.al. |
2501.12372 |
null |
2025-01-21 |
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration |
Thomas Walshe et.al. |
2501.12332 |
null |
2025-01-21 |
Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops |
Mohamed Harmanani et.al. |
2501.12331 |
link |
2025-01-21 |
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model |
Xianwei Zhuang et.al. |
2501.12327 |
link |
2025-01-21 |
LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations |
Hasan Abu-Rasheed et.al. |
2501.12300 |
null |
2025-01-21 |
MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks |
Qishen Zhou et.al. |
2501.12281 |
link |
2025-01-21 |
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement |
Maosong Cao et.al. |
2501.12273 |
link |
2025-01-21 |
FOCUS: First Order Concentrated Updating Scheme |
Yizhou Liu et.al. |
2501.12243 |
null |
2025-01-21 |
InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models |
Pha Nguyen et.al. |
2501.12231 |
null |
2025-01-21 |
CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning |
Yuanheng Fang et.al. |
2501.12226 |
null |
2025-01-21 |
Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces |
Allard Oelen et.al. |
2501.12221 |
null |
2025-01-21 |
You Can’t Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense |
Wuyuao Mai et.al. |
2501.12210 |
null |
2025-01-21 |
Explainability for Vision Foundation Models: A Survey |
Rémi Kazmierczak et.al. |
2501.12203 |
null |
2025-01-22 |
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation |
Zibo Zhao et.al. |
2501.12202 |
link |
2025-01-21 |
BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks |
Zhuang Li et.al. |
2501.12174 |
null |
2025-01-21 |
Contextualizing Recommendation Explanations with LLMs: A User Study |
Yuanjun Feng et.al. |
2501.12152 |
null |
2025-01-21 |
Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities |
Qirun Dai et.al. |
2501.12147 |
null |
2025-01-21 |
Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot |
Daniele Bifolco et.al. |
2501.12134 |
null |
2025-01-21 |
Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions |
Tim Schreiter et.al. |
2501.12128 |
null |
2025-01-21 |
Can open source large language models be used for tumor documentation in Germany? – An evaluation on urological doctors’ notes |
Stefan Lenz et.al. |
2501.12106 |
link |
2025-01-21 |
Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis |
Weile Luo et.al. |
2501.12084 |
null |
2025-01-21 |
Phishing Awareness via Game-Based Learning |
Argianto Rahartomo et.al. |
2501.12077 |
link |
2025-01-21 |
PINNsAgent: Automated PDE Surrogation with Large Language Models |
Qingpo Wuwu et.al. |
2501.12053 |
null |
2025-01-21 |
Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation |
Chen Griner et.al. |
2501.12033 |
null |
2025-01-21 |
Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing’s Syndrome Diagnosis in Facial Analysis |
Hongjun Liu et.al. |
2501.12023 |
null |
2025-01-21 |
Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection? |
Samantha Min Er Yew et.al. |
2501.12016 |
null |
2025-01-21 |
Rate-Aware Learned Speech Compression |
Jun Xu et.al. |
2501.11999 |
null |
2025-01-21 |
Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models |
Rupesh Raj Karn et.al. |
2501.11979 |
null |
2025-01-21 |
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues |
Maya Medjad et.al. |
2501.11977 |
link |
2025-01-21 |
Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization |
Jie Zhao et.al. |
2501.11968 |
null |
2025-01-21 |
A Hybrid Attention Framework for Fake News Detection with Large Language Models |
Xiaochuan Xu et.al. |
2501.11967 |
null |
2025-01-21 |
TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection |
Yang Cao et.al. |
2501.11960 |
null |
2025-01-21 |
Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model |
Minghan Wang et.al. |
2501.11953 |
null |
2025-01-21 |
ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation |
Peter Devine et.al. |
2501.11929 |
link |
2025-01-21 |
Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model |
He Chang et.al. |
2501.11911 |
null |
2025-01-21 |
Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation |
Junhong Lian et.al. |
2501.11900 |
link |
2025-01-22 |
Med-R $^2$ : Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine |
Keer Lu et.al. |
2501.11885 |
null |
2025-01-21 |
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning |
Yafu Li et.al. |
2501.11877 |
link |
2025-01-21 |
LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems |
Venkata Sai Aswath Duvvuru et.al. |
2501.11864 |
null |
2025-01-21 |
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents |
Zhili Cheng et.al. |
2501.11858 |
link |
2025-01-21 |
Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance |
Nikos Kanakaris et.al. |
2501.11849 |
link |
2025-01-21 |
A Survey on Memory-Efficient Large-Scale Model Training in AI for Science |
Kaiyuan Tian et.al. |
2501.11847 |
null |
2025-01-21 |
Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction |
Noah L. Schroeder et.al. |
2501.11840 |
null |
2025-01-21 |
PXGen: A Post-hoc Explainable Method for Generative Models |
Yen-Lung Huang et.al. |
2501.11827 |
null |
2025-01-21 |
CogMorph: Cognitive Morphing Attacks for Text-to-Image Models |
Zonglei Jing et.al. |
2501.11815 |
null |
2025-01-20 |
Benchmarking Large Language Models via Random Variables |
Zijin Hong et.al. |
2501.11790 |
null |
2025-01-20 |
Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection |
Ali Naseh et.al. |
2501.11786 |
null |
2025-01-20 |
Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference |
Pouya Hamadanian et.al. |
2501.11779 |
link |
2025-01-20 |
The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers |
Alina Starovolsky-Shitrit et.al. |
2501.11770 |
null |
2025-01-20 |
Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems |
Fatemeh Nazary et.al. |
2501.11759 |
link |
2025-01-20 |
A generalizable 3D framework and model for self-supervised learning in medical imaging |
Tony Xu et.al. |
2501.11755 |
null |
2025-01-20 |
Are generative models fair? A study of racial bias in dermatological image generation |
Miguel López-Pérez et.al. |
2501.11752 |
null |
2025-01-20 |
Optimizing Pretraining Data Mixtures with LLM-Estimated Utility |
William Held et.al. |
2501.11747 |
null |
2025-01-20 |
MedicoSAM: Towards foundation models for medical image segmentation |
Anwai Archit et.al. |
2501.11734 |
link |
2025-01-20 |
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks |
Zhenhailong Wang et.al. |
2501.11733 |
null |
2025-01-20 |
Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy |
Saeid Asgari Taghanaki et.al. |
2501.11721 |
link |
2025-01-20 |
YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners’ Perspectives |
Nong Ming et.al. |
2501.11712 |
link |
2025-01-20 |
Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution |
Ramtin Ehsani et.al. |
2501.11709 |
null |
2025-01-20 |
Trustformer: A Trusted Federated Transformer |
Ali Abbasi Tadi et.al. |
2501.11706 |
null |
2025-01-20 |
Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) |
Brian E. Perron et.al. |
2501.11705 |
null |
2025-01-20 |
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling |
Zhenyu Hou et.al. |
2501.11651 |
link |
2025-01-20 |
Trojan Detection Through Pattern Recognition for Large Language Models |
Vedant Bhasin et.al. |
2501.11621 |
null |
2025-01-20 |
Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems |
Giorgio Robino et.al. |
2501.11613 |
null |
2025-01-20 |
SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks |
Wentao Wan et.al. |
2501.11599 |
link |
2025-01-20 |
Recurrent Diffusion for Large-Scale Parameter Generation |
Kai Wang et.al. |
2501.11587 |
link |
2025-01-20 |
Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models |
Mahyar Habibi et.al. |
2501.11581 |
null |
2025-01-20 |
Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution |
Zhiyuan You et.al. |
2501.11561 |
null |
2025-01-20 |
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation |
Jinyu Wang et.al. |
2501.11551 |
link |
2025-01-20 |
UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion |
Zixuan Chen et.al. |
2501.11515 |
null |
2025-01-20 |
Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges |
Vincent Koc et.al. |
2501.11496 |
null |
2025-01-20 |
Graph-defined Language Learning with LLMs |
Huachi Zhou et.al. |
2501.11478 |
null |
2025-01-20 |
Curiosity-Driven Reinforcement Learning from Human Feedback |
Haoran Sun et.al. |
2501.11463 |
link |
2025-01-20 |
Ontology Matching with Large Language Models and Prioritized Depth-First Search |
Maria Taboada et.al. |
2501.11441 |
null |
2025-01-20 |
One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor |
Zhikun Wu et.al. |
2501.11433 |
null |
2025-01-20 |
A Survey on Diffusion Models for Anomaly Detection |
Jing Liu et.al. |
2501.11430 |
link |
2025-01-20 |
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training |
Siyu Yuan et.al. |
2501.11425 |
link |
2025-01-20 |
Neural Contextual Reinforcement Framework for Logical Structure Language Generation |
Marcus Irvin et.al. |
2501.11417 |
null |
2025-01-20 |
Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing |
Kevin Sim et.al. |
2501.11411 |
null |
2025-01-20 |
Revisiting Language Models in Neural News Recommender Systems |
Yuyue Zhao et.al. |
2501.11391 |
link |
2025-01-20 |
Towards Advancing Code Generation with Large Language Models: A Research Roadmap |
Haolin Jin et.al. |
2501.11354 |
null |
2025-01-20 |
EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery |
Guankun Wang et.al. |
2501.11347 |
link |
2025-01-20 |
GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video |
Zhenliang Ni et.al. |
2501.11340 |
null |
2025-01-20 |
Few-shot Policy (de)composition in Conversational Question Answering |
Kyle Erwin et.al. |
2501.11335 |
null |
2025-01-20 |
Nested Annealed Training Scheme for Generative Adversarial Networks |
Chang Wan et.al. |
2501.11318 |
null |
2025-01-20 |
Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning |
Zhongtian Hu et.al. |
2501.11292 |
null |
2025-01-20 |
Large Language Model Agents for Radio Map Generation and Wireless Network Planning |
Hongye Quan et.al. |
2501.11283 |
null |
2025-01-20 |
Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries |
Yi-Hui Lee et.al. |
2501.11273 |
null |
2025-01-20 |
Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios |
Zhongtian Hu et.al. |
2501.11269 |
null |
2025-01-20 |
Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian |
Wannita Takerngsaksiri et.al. |
2501.11264 |
link |
2025-01-20 |
Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models |
Zhuangzhuang Yan et.al. |
2501.11247 |
null |
2025-01-20 |
Irony in Emojis: A Comparative Study of Human and LLM Interpretation |
Yawen Zheng et.al. |
2501.11241 |
null |
2025-01-20 |
KPL: Training-Free Medical Knowledge Mining of Vision-Language Models |
Jiaxiang Liu et.al. |
2501.11231 |
link |
2025-01-20 |
Reasoning Language Models: A Blueprint |
Maciej Besta et.al. |
2501.11223 |
link |
2025-01-20 |
Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation |
Ivan Lopez et.al. |
2501.11199 |
null |
2025-01-19 |
Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests |
Kristin Blesch et.al. |
2501.11178 |
link |
2025-01-17 |
FaceXBench: Evaluating Multimodal LLMs on Face Understanding |
Kartik Narayan et.al. |
2501.10360 |
link |
2025-01-17 |
Zero-Shot Monocular Scene Flow Estimation in the Wild |
Yiqing Liang et.al. |
2501.10357 |
null |
2025-01-17 |
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems |
Weibo Gao et.al. |
2501.10332 |
null |
2025-01-17 |
Large language models for automated scholarly paper review: A survey |
Zhenzhen Zhuang et.al. |
2501.10326 |
null |
2025-01-17 |
HiMix: Reducing Computational Complexity in Large Vision-Language Models |
Xuange Zhang et.al. |
2501.10318 |
null |
2025-01-17 |
Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs |
Claudio Di Sipio et.al. |
2501.10313 |
null |
2025-01-17 |
Computational Protein Science in the Era of Large Language Models (LLMs) |
Wenqi Fan et.al. |
2501.10282 |
null |
2025-01-17 |
Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation |
Azat Abdullin et.al. |
2501.10200 |
null |
2025-01-17 |
Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education |
William Hersh et.al. |
2501.10186 |
null |
2025-01-17 |
Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval |
Vera Pavlova et.al. |
2501.10175 |
null |
2025-01-17 |
Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis |
Abhishek Kaushik et.al. |
2501.10134 |
null |
2025-01-17 |
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario |
Lucen Zhong et.al. |
2501.10132 |
link |
2025-01-17 |
PaSa: An LLM Agent for Comprehensive Academic Paper Search |
Yichen He et.al. |
2501.10120 |
link |
2025-01-17 |
AI-Generated Music Detection and its Challenges |
Darius Afchar et.al. |
2501.10111 |
link |
2025-01-17 |
LLM Reasoner and Automated Planner: A new NPC approach |
Israel Puerta-Merino et.al. |
2501.10106 |
null |
2025-01-17 |
Universal Actions for Enhanced Embodied Foundation Models |
Jinliang Zheng et.al. |
2501.10105 |
link |
2025-01-17 |
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks |
Michael Schwingshackl et.al. |
2501.10080 |
link |
2025-01-17 |
FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization |
Zhaopeng Gu et.al. |
2501.10067 |
link |
2025-01-17 |
Accelerating Large Language Models through Partially Linear Feed-Forward Network |
Gansen Hu et.al. |
2501.10054 |
null |
2025-01-17 |
AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search |
Wenfeng Feng et.al. |
2501.10053 |
null |
2025-01-17 |
Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists |
Alyssia Chen et.al. |
2501.10037 |
null |
2025-01-17 |
Mapping scientific communities at scale |
Victor Barbier et.al. |
2501.10035 |
link |
2025-01-17 |
Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions |
Zhijie Tan et.al. |
2501.10011 |
null |
2025-01-17 |
Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models |
Qiang Liu et.al. |
2501.09997 |
null |
2025-01-17 |
Agent-as-Judge for Factual Summarization of Long Narratives |
Yeonseok Jeong et.al. |
2501.09993 |
link |
2025-01-17 |
RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation |
Yuefan Cao et.al. |
2501.09982 |
null |
2025-01-17 |
GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions |
Heda Zuo et.al. |
2501.09972 |
null |
2025-01-17 |
Explainable artificial intelligence (XAI): from inherent explainability to large language models |
Fuseini Mumuni et.al. |
2501.09967 |
null |
2025-01-17 |
A Survey on Multi-Turn Interaction Capabilities of Large Language Models |
Chen Zhang et.al. |
2501.09959 |
null |
2025-01-17 |
FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs |
Zengyi Gao et.al. |
2501.09957 |
null |
2025-01-17 |
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations |
Jamin Seo et.al. |
2501.09954 |
link |
2025-01-17 |
Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt |
Qingcheng Zeng et.al. |
2501.09950 |
null |
2025-01-17 |
MultiPruner: Balanced Structure Removal in Foundation Models |
J. Pablo Muñoz et.al. |
2501.09949 |
link |
2025-01-17 |
Steering Large Language Models with Feature Guided Activation Additions |
Samuel Soo et.al. |
2501.09929 |
null |
2025-01-17 |
Towards A Litmus Test for Common Sense |
Hugo Latapie et.al. |
2501.09913 |
null |
2025-01-17 |
Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project’s Talent Knowledge Graph |
Jiawei Xu et.al. |
2501.09909 |
null |
2025-01-17 |
Position: Open and Closed Large Language Models in Healthcare |
Jiawei Xu et.al. |
2501.09906 |
null |
2025-01-17 |
FoundationStereo: Zero-Shot Stereo Matching |
Bowen Wen et.al. |
2501.09898 |
null |
2025-01-17 |
Evolving Deeper LLM Thinking |
Kuang-Huei Lee et.al. |
2501.09891 |
null |
2025-01-17 |
Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment |
Mohammad Sadegh Sheikhaei et.al. |
2501.09888 |
link |
2025-01-17 |
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis |
Zhe Chen et.al. |
2501.09887 |
null |
2025-01-16 |
ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction |
Izzeddin Teeti et.al. |
2501.09878 |
null |
2025-01-16 |
Geometry-Preserving Encoder/Decoder in Latent Generative Models |
Wonjun Lee et.al. |
2501.09876 |
null |
2025-01-16 |
An LLM-Guided Tutoring System for Social Skills Training |
Michael Guevarra et.al. |
2501.09870 |
null |
2025-01-16 |
Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing |
Wenhan Wang et.al. |
2501.09866 |
null |
2025-01-16 |
Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message |
Jiawei Li et.al. |
2501.09861 |
null |
2025-01-16 |
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery |
Shristi Das Biswas et.al. |
2501.09826 |
link |
2025-01-16 |
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs |
Nada Saadi et.al. |
2501.09825 |
null |
2025-01-16 |
BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling |
Daniele Castellana et.al. |
2501.09821 |
link |
2025-01-16 |
Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems |
Soham Roy et.al. |
2501.09801 |
null |
2025-01-16 |
Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API |
Andrey Labunets et.al. |
2501.09798 |
null |
2025-01-16 |
GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation |
Weiliang Tang et.al. |
2501.09783 |
null |
2025-01-16 |
SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation |
Wanqi Yin et.al. |
2501.09782 |
link |
2025-01-16 |
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos |
Zhongwei Ren et.al. |
2501.09781 |
null |
2025-01-16 |
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong |
Tairan Fu et.al. |
2501.09775 |
null |
2025-01-16 |
Distilling Multi-modal Large Language Models for Autonomous Driving |
Deepti Hegde et.al. |
2501.09757 |
null |
2025-01-16 |
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation |
Philippe Hansen-Estruch et.al. |
2501.09755 |
null |
2025-01-16 |
Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues |
Youngjoon Jang et.al. |
2501.09754 |
null |
2025-01-16 |
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking |
Zekun Xi et.al. |
2501.09751 |
null |
2025-01-16 |
Enhancing Lexicon-Based Text Embeddings with Large Language Models |
Yibin Lei et.al. |
2501.09749 |
null |
2025-01-16 |
Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models |
Bihui Jin et.al. |
2501.09745 |
null |
2025-01-16 |
KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports |
Hajung Kim et.al. |
2501.09744 |
null |
2025-01-16 |
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps |
Nanye Ma et.al. |
2501.09732 |
null |
2025-01-16 |
A Simple Aerial Detection Baseline of Multimodal Language Models |
Qingyun Li et.al. |
2501.09720 |
link |
2025-01-16 |
Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text |
Jihed Ncib et.al. |
2501.09719 |
null |
2025-01-16 |
CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education |
Tianyu Wang et.al. |
2501.09709 |
link |
2025-01-16 |
Domain Adaptation of Foundation LLMs for e-Commerce |
Christian Herold et.al. |
2501.09706 |
null |
2025-01-16 |
Cueless EEG imagined speech for subject identification: dataset and benchmarks |
Ali Derakhshesh et.al. |
2501.09700 |
link |
2025-01-16 |
Simulated Interactive Debugging |
Yannic Noller et.al. |
2501.09694 |
null |
2025-01-17 |
Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities |
Fengli Xu et.al. |
2501.09686 |
null |
2025-01-16 |
Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review |
Masatoshi Uehara et.al. |
2501.09685 |
null |
2025-01-16 |
Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark |
Alexis Roger et.al. |
2501.09672 |
null |
2025-01-16 |
A Survey of Research in Large Language Models for Electronic Design Automation |
Jingyu Pan et.al. |
2501.09655 |
null |
2025-01-16 |
The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models |
Jonathan Katzy et.al. |
2501.09653 |
null |
2025-01-16 |
CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding |
Johannes Kirmayr et.al. |
2501.09645 |
link |
2025-01-17 |
LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading |
Kuan-Ming Liu et.al. |
2501.09636 |
null |
2025-01-16 |
Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework |
Yushen Lin et.al. |
2501.09631 |
null |
2025-01-16 |
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment |
Chaoqi Wang et.al. |
2501.09620 |
link |
2025-01-16 |
From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs |
Hrithik Majumdar Shibu et.al. |
2501.09604 |
link |
2025-01-16 |
Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures |
Pratyush Dhingra et.al. |
2501.09588 |
null |
2025-01-16 |
Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis |
Tingxuan Chen et.al. |
2501.09555 |
null |
2025-01-16 |
AI in Support of Diversity and Inclusion |
Çiçek Güven et.al. |
2501.09534 |
null |
2025-01-16 |
Confidence Estimation for Error Detection in Text-to-SQL Systems |
Oleg Somov et.al. |
2501.09527 |
null |
2025-01-16 |
Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data |
Omar Mena et.al. |
2501.09521 |
null |
2025-01-16 |
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation |
Junjie He et.al. |
2501.09503 |
null |
2025-01-16 |
Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis |
Qize Yang et.al. |
2501.09502 |
null |
2025-01-16 |
Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework |
Nuo Chen et.al. |
2501.09493 |
null |
2025-01-16 |
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators |
Zhaocheng Liu et.al. |
2501.09484 |
link |
2025-01-16 |
Guided Debugging of Auto-Translated Code Using Differential Testing |
Shengnan Wu et.al. |
2501.09475 |
null |
2025-01-16 |
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching |
Hualie Jiang et.al. |
2501.09466 |
link |
2025-01-16 |
Pruning for Sparse Diffusion Models based on Gradient Flow |
Ben Wan et.al. |
2501.09464 |
null |
2025-01-16 |
“A Great Start, But…”: Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design |
Tianhao He et.al. |
2501.09457 |
null |
2025-01-16 |
Solving the unsolvable: Translating case law in Hong Kong |
King-kui Sin et.al. |
2501.09444 |
null |
2025-01-16 |
Scaling up self-supervised learning for improved surgical foundation models |
Tim J. M. Jaspers et.al. |
2501.09436 |
link |
2025-01-16 |
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation |
Hwan Heo et.al. |
2501.09433 |
link |
2025-01-16 |
A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy |
Huandong Wang et.al. |
2501.09431 |
null |
2025-01-16 |
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring |
Xinyi Wang et.al. |
2501.09428 |
null |
2025-01-16 |
AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling |
Ancheng Xu et.al. |
2501.09426 |
null |
2025-01-16 |
FASP: Fast and Accurate Structured Pruning of Large Language Models |
Hanyu Hu et.al. |
2501.09412 |
null |
2025-01-16 |
MoE $^2$ : Optimizing Collaborative Inference for Edge Large Language Models |
Lyudong Jin et.al. |
2501.09410 |
null |
2025-01-16 |
Adaptive Contextual Caching for Mobile Edge Large Language Model Service |
Guangyuan Liu et.al. |
2501.09383 |
null |
2025-01-16 |
Aligning Instruction Tuning with Pre-training |
Yiming Liang et.al. |
2501.09368 |
null |
2025-01-16 |
PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks |
Huiyou Zhan et.al. |
2501.09367 |
null |
2025-01-16 |
YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks |
Saptarashmi Bandyopadhyay et.al. |
2501.09355 |
null |
2025-01-16 |
UVRM: A Scalable 3D Reconstruction Model from Unposed Videos |
Shiu-hong Kao et.al. |
2501.09347 |
null |
2025-01-16 |
Rational Tuning of LLM Cascades via Probabilistic Modeling |
Michael J. Zellinger et.al. |
2501.09345 |
null |
2025-01-16 |
SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs |
Anbang Ye et.al. |
2501.09316 |
null |
2025-01-16 |
A Study of In-Context-Learning-Based Text-to-SQL Errors |
Jiawei Shen et.al. |
2501.09310 |
link |
2025-01-16 |
To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation |
Kaustubh D. Dhole et.al. |
2501.09292 |
null |
2025-01-16 |
LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport |
Kyeongha Rho et.al. |
2501.09291 |
link |
2025-01-16 |
Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding |
Kohei Torimi et.al. |
2501.09278 |
null |
2025-01-16 |
Large Language Model is Secretly a Protein Sequence Optimizer |
Yinkai Wang et.al. |
2501.09274 |
null |
2025-01-16 |
Perspective Transition of Large Language Models for Solving Subjective Tasks |
Xiaolong Wang et.al. |
2501.09265 |
null |
2025-01-16 |
Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition |
Takaaki Hori et.al. |
2501.09258 |
null |
2025-01-16 |
Clone-Robust AI Alignment |
Ariel D. Procaccia et.al. |
2501.09254 |
null |
2025-01-16 |
Split Fine-Tuning for Large Language Models in Wireless Networks |
Songge Zhang et.al. |
2501.09237 |
null |
2025-01-16 |
Foundations of Large Language Models |
Tong Xiao et.al. |
2501.09223 |
null |
2025-01-16 |
Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs |
Sanchit Sinha et.al. |
2501.09221 |
null |
2025-01-16 |
A Simple Graph Contrastive Learning Framework for Short Text Classification |
Yonghao Liu et.al. |
2501.09219 |
link |
2025-01-16 |
Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics |
Yuanyuan Wei et.al. |
2501.09218 |
null |
2025-01-16 |
Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning |
Yonghao Liu et.al. |
2501.09214 |
link |
2025-01-16 |
FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training |
Hongzhou Yu et.al. |
2501.09213 |
link |
2025-01-15 |
Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures |
Pengru Deng et.al. |
2501.09203 |
null |
2025-01-15 |
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT |
Naifeng Zhang et.al. |
2501.09201 |
null |
2025-01-15 |
Guiding Retrieval using LLM-based Listwise Rankers |
Mandeep Rathee et.al. |
2501.09186 |
link |
2025-01-15 |
The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching |
Yevhen Kostiuk et.al. |
2501.09164 |
null |
2025-01-15 |
Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability |
Stephanie L. Day et.al. |
2501.09158 |
null |
2025-01-15 |
Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History |
Yevhen Kostiuk et.al. |
2501.09154 |
null |
2025-01-15 |
Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation |
Xingxin He et.al. |
2501.09138 |
null |
2025-01-15 |
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG |
Aditi Singh et.al. |
2501.09136 |
link |
2025-01-15 |
HAFix: History-Augmented Large Language Models for Bug Fixing |
Yu Shi et.al. |
2501.09135 |
link |
2025-01-15 |
Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing |
Eshaan Tanwar et.al. |
2501.09127 |
link |
2025-01-15 |
Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment |
Conrad Borchers et.al. |
2501.09126 |
null |
2025-01-15 |
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach |
Alireza Ghaffari et.al. |
2501.09107 |
null |
2025-01-15 |
Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites |
Hans W. A. Hanley et.al. |
2501.09102 |
link |
2025-01-15 |
Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative |
Yuqian Sun et.al. |
2501.09099 |
null |
2025-01-15 |
SteLLA: A Structured Grading System Using LLMs with RAG |
Hefei Qiu et.al. |
2501.09092 |
null |
2025-01-15 |
Generative diffusion model with inverse renormalization group flows |
Kanta Masuki et.al. |
2501.09064 |
link |
2025-01-15 |
Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition |
Sneheel Sarangi et.al. |
2501.09056 |
link |
2025-01-15 |
How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias |
Tosin Fadahunsi et.al. |
2501.09014 |
link |
2025-01-15 |
Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians |
Ishan Amin et.al. |
2501.09009 |
link |
2025-01-15 |
Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails |
Shaona Ghosh et.al. |
2501.09004 |
null |
2025-01-15 |
Vision Foundation Models for Computed Tomography |
Suraj Pai et.al. |
2501.09001 |
null |
2025-01-15 |
CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks |
Krit Tangsongcharoen et.al. |
2501.08998 |
link |
2025-01-15 |
VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science |
Youssef Abdalla et.al. |
2501.08995 |
link |
2025-01-15 |
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities |
Haozhe Xie et.al. |
2501.08983 |
link |
2025-01-15 |
Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models |
Emma Croxford et.al. |
2501.08977 |
null |
2025-01-15 |
Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models |
Karukriti Kaushik Ghosh et.al. |
2501.08974 |
null |
2025-01-15 |
Analyzing the Ethical Logic of Six Large Language Models |
W. Russell Neuman et.al. |
2501.08951 |
null |
2025-01-15 |
Applying General Turn-taking Models to Conversational Human-Robot Interaction |
Gabriel Skantze et.al. |
2501.08946 |
null |
2025-01-15 |
Disentangling Exploration of Large Language Models by Optimal Exploitation |
Tim Grams et.al. |
2501.08925 |
null |
2025-01-15 |
GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge |
Liam Dugan et.al. |
2501.08913 |
link |
2025-01-15 |
Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning |
Qinyu Ma et.al. |
2501.08897 |
link |
2025-01-15 |
Connecting SPDE to SGMs |
Junsu Seo et.al. |
2501.08877 |
null |
2025-01-15 |
Exploring Task-Level Optimal Prompts for Visual In-Context Learning |
Yan Zhu et.al. |
2501.08841 |
null |
2025-01-15 |
How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering |
Christoph Treude et.al. |
2501.08774 |
null |
2025-01-15 |
Admitting Ignorance Helps the Video Question Answering Models to Answer |
Haopeng Li et.al. |
2501.08771 |
null |
2025-01-15 |
Enhanced Large Language Models for Effective Screening of Depression and Anxiety |
June M. Liu et.al. |
2501.08769 |
null |
2025-01-15 |
Few-Shot Learner Generalizes Across AI-Generated Image Detection |
Shiyu Wu et.al. |
2501.08763 |
null |
2025-01-15 |
Leveraging LLM Agents for Translating Network Configurations |
Yunze Wei et.al. |
2501.08760 |
null |
2025-01-15 |
The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities |
Irina Bigoulaeva et.al. |
2501.08716 |
link |
2025-01-15 |
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching |
Chuangtao Ma et.al. |
2501.08686 |
link |
2025-01-15 |
RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency |
Siqi Li et.al. |
2501.08682 |
null |
2025-01-15 |
Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery |
Zeqin Liao et.al. |
2501.08670 |
null |
2025-01-15 |
MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities |
Savya Khosla et.al. |
2501.08648 |
null |
2025-01-15 |
Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations |
Kaiyuan Zheng et.al. |
2501.08641 |
null |
2025-01-15 |
SWSC: Shared Weight for Similar Channel in LLM |
Binrui Zeng et.al. |
2501.08631 |
null |
2025-01-15 |
Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models |
Aruna Sankaranarayanan et.al. |
2501.08618 |
link |
2025-01-15 |
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation |
Kaiqu Liang et.al. |
2501.08617 |
null |
2025-01-15 |
Assessing the Alignment of FOL Closeness Metrics with Human Judgement |
Ramya Keerthy Thatikonda et.al. |
2501.08613 |
link |
2025-01-15 |
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design |
Zhi Zheng et.al. |
2501.08603 |
link |
2025-01-15 |
AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL |
Tyler Stennett et.al. |
2501.08600 |
null |
2025-01-15 |
LlamaRestTest: Effective REST API Testing with Small Language Models |
Myeongsoo Kim et.al. |
2501.08598 |
null |
2025-01-15 |
Sound Scene Synthesis at the DCASE 2024 Challenge |
Mathieu Lagrange et.al. |
2501.08587 |
null |
2025-01-15 |
LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model |
Yuxuan Hu et.al. |
2501.08582 |
null |
2025-01-15 |
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation |
Jiaqi Huang et.al. |
2501.08580 |
link |
2025-01-15 |
Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms |
Kewei Li et.al. |
2501.08570 |
link |
2025-01-15 |
Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications |
Jin Chen et.al. |
2501.08563 |
link |
2025-01-15 |
LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation |
Yiran Tao et.al. |
2501.08558 |
null |
2025-01-15 |
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation |
Sitong Gong et.al. |
2501.08549 |
null |
2025-01-15 |
Comprehensive Subjective and Objective Evaluation Method for Text-generated Video |
Zelu Qi et.al. |
2501.08545 |
null |
2025-01-15 |
Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation |
Jiaxin Guo et.al. |
2501.08523 |
null |
2025-01-14 |
Quantifying the Importance of Data Alignment in Downstream Model Performance |
Krrish Chawla et.al. |
2501.08496 |
null |
2025-01-14 |
Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition |
Md Meem Hossain et.al. |
2501.08471 |
null |
2025-01-14 |
Selective Attention Merging for low resource tasks: A case study of Child ASR |
Natarajan Balaji Shankar et.al. |
2501.08468 |
link |
2025-01-14 |
Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin |
Joao Carmo de Almeida Neto et.al. |
2501.08464 |
null |
2025-01-14 |
Large Language Models For Text Classification: Case Study And Comprehensive Review |
Arina Kostina et.al. |
2501.08457 |
null |
2025-01-14 |
Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack |
Sagiv Antebi et.al. |
2501.08454 |
null |
2025-01-14 |
Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies |
Ajwad Abrar et.al. |
2501.08441 |
null |
2025-01-14 |
SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models |
Anurag Kumar et.al. |
2501.08421 |
null |
2025-01-14 |
Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation |
Seyed Mehdi Rakhtala et.al. |
2501.08420 |
null |
2025-01-14 |
Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data |
Jiaxing Qiu et.al. |
2501.08413 |
link |
2025-01-14 |
OptiChat: Bridging Optimization Models and Practitioners with Large Language Models |
Hao Chen et.al. |
2501.08406 |
link |
2025-01-14 |
Towards Best Practices for Open Datasets for LLM Training |
Stefan Baack et.al. |
2501.08365 |
null |
2025-01-14 |
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise |
Ryan Burgert et.al. |
2501.08331 |
link |
2025-01-14 |
PokerBench: Training Large Language Models to become Professional Poker Players |
Richard Zhuang et.al. |
2501.08328 |
link |
2025-01-14 |
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks |
Miran Heo et.al. |
2501.08326 |
null |
2025-01-14 |
ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations |
Ziyuan Huang et.al. |
2501.08324 |
null |
2025-01-14 |
Exploring Robustness of Multilingual LLMs on Real-World Noisy Data |
Amirhossein Aliakbarzadeh et.al. |
2501.08322 |
link |
2025-01-14 |
Enhancing Automated Interpretability with Output-Centric Feature Descriptions |
Yoav Gur-Arieh et.al. |
2501.08319 |
link |
2025-01-14 |
MiniMax-01: Scaling Foundation Models with Lightning Attention |
MiniMax et.al. |
2501.08313 |
null |
2025-01-14 |
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them |
Abhilasha Ravichander et.al. |
2501.08292 |
null |
2025-01-14 |
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding |
Hongyu Li et.al. |
2501.08282 |
link |
2025-01-14 |
Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing |
Pulkit Arora et.al. |
2501.08276 |
null |
2025-01-14 |
Addressing the sustainable AI trilemma: a case study on LLM agents and RAG |
Hui Wu et.al. |
2501.08262 |
null |
2025-01-14 |
Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models |
Yifu Qiu et.al. |
2501.08248 |
null |
2025-01-14 |
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints |
Jonathan Nöther et.al. |
2501.08246 |
null |
2025-01-14 |
CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset |
Jiawei Du et.al. |
2501.08238 |
null |
2025-01-14 |
Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings |
Paul Joe Maliakel et.al. |
2501.08219 |
null |
2025-01-14 |
ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems |
Mohita Chowdhury et.al. |
2501.08208 |
null |
2025-01-14 |
ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving |
Zain Ul Abedin et.al. |
2501.08203 |
null |
2025-01-14 |
CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation |
Jinjun Peng et.al. |
2501.08200 |
link |
2025-01-14 |
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training |
Yijiong Yu et.al. |
2501.08197 |
link |
2025-01-14 |
PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving |
Ahmet Caner Yüzügüler et.al. |
2501.08192 |
null |
2025-01-14 |
A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation |
Steven Landgraf et.al. |
2501.08188 |
null |
2025-01-15 |
A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following |
Yin Fang et.al. |
2501.08187 |
link |
2025-01-14 |
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data |
Rewina Bedemariam et.al. |
2501.08167 |
null |
2025-01-14 |
I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution |
Soohyeon Choi et.al. |
2501.08165 |
null |
2025-01-14 |
Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data |
Phai Vu Dinh et.al. |
2501.08149 |
null |
2025-01-14 |
Refusal Behavior in Large Language Models: A Nonlinear Perspective |
Fabian Hildebrandt et.al. |
2501.08145 |
link |
2025-01-14 |
Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying |
Jonathan Lyhs et.al. |
2501.08142 |
null |
2025-01-14 |
Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 |
Seamie Hayes et.al. |
2501.08118 |
null |
2025-01-15 |
Consistency of Responses and Continuations Generated by Large Language Models on Social Media |
Wenlu Fan et.al. |
2501.08102 |
null |
2025-01-14 |
Hierarchical Autoscaling for Large Language Model Serving with Chiron |
Archit Patke et.al. |
2501.08090 |
null |
2025-01-14 |
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving |
Nert Keser et.al. |
2501.08083 |
null |
2025-01-14 |
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning |
Guoliang He et.al. |
2501.08071 |
link |
2025-01-14 |
A Roadmap to Guide the Integration of LLMs in Hierarchical Planning |
Israel Puerta-Merino et.al. |
2501.08068 |
null |
2025-01-14 |
Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT |
Awritrojit Banerjee et.al. |
2501.08053 |
null |
2025-01-14 |
TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning |
Yao Liang et.al. |
2501.08008 |
null |
2025-01-14 |
LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS |
Muhammad Ashfaq et.al. |
2501.07992 |
null |
2025-01-14 |
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness |
Jiaxing Zhao et.al. |
2501.07978 |
null |
2025-01-14 |
Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models |
Yifang Xu et.al. |
2501.07972 |
null |
2025-01-14 |
Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning |
Jiaqi Hua et.al. |
2501.07959 |
link |
2025-01-14 |
AI Guide Dog: Egocentric Path Prediction on Smartphone |
Aishwarya Jadhav et.al. |
2501.07957 |
null |
2025-01-14 |
Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations |
Waqar Hussain et.al. |
2501.07931 |
null |
2025-01-14 |
Gandalf the Red: Adaptive Security for LLMs |
Niklas Pfister et.al. |
2501.07927 |
link |
2025-01-14 |
VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models |
Hui Kuurila-Zhang et.al. |
2501.07922 |
link |
2025-01-14 |
Large Language Model Interface for Home Energy Management Systems |
François Michelon et.al. |
2501.07919 |
null |
2025-01-14 |
Bridge-SR: Schrödinger Bridge for Efficient SR |
Chang Li et.al. |
2501.07897 |
null |
2025-01-14 |
Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs |
Shuai Wang et.al. |
2501.07892 |
null |
2025-01-14 |
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding |
Zhongxiang Sun et.al. |
2501.07861 |
null |
2025-01-14 |
Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques |
Shobhit Ratan et.al. |
2501.07853 |
null |
2025-01-14 |
Unveiling Provider Bias in Large Language Models for Code Generation |
Xiaoyu Zhang et.al. |
2501.07849 |
null |
2025-01-14 |
Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning |
Haoyu Han et.al. |
2501.07845 |
null |
2025-01-14 |
A Driver Advisory System Based on Large Language Model for High-speed Train |
Y. C. Luo et.al. |
2501.07837 |
null |
2025-01-14 |
Flow: A Modular Approach to Automated Agentic Workflow Generation |
Boye Niu et.al. |
2501.07834 |
null |
2025-01-14 |
Real-time Verification and Refinement of Language Model Text Generation |
Joonho Ko et.al. |
2501.07824 |
null |
2025-01-14 |
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding |
Haomiao Xiong et.al. |
2501.07819 |
link |
2025-01-14 |
A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models |
Kaustubh D. Dhole et.al. |
2501.07818 |
null |
2025-01-14 |
Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models |
Dhruv Dhamani et.al. |
2501.07815 |
null |
2025-01-14 |
Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering |
Feijie Wu et.al. |
2501.07813 |
null |
2025-01-14 |
CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation |
Ruwei Pan et.al. |
2501.07811 |
null |
2025-01-14 |
Visual Language Models as Operator Agents in the Space Domain |
Alejandro Carrasco et.al. |
2501.07802 |
null |
2025-01-14 |
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding |
Zhaokai Wang et.al. |
2501.07783 |
link |
2025-01-14 |
Symmetry-Aware Generative Modeling through Learned Canonicalization |
Kusha Sareen et.al. |
2501.07773 |
null |
2025-01-14 |
Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey |
Bingchen Liu et.al. |
2501.07766 |
null |
2025-01-14 |
On the Statistical Capacity of Deep Generative Models |
Edric Tam et.al. |
2501.07763 |
link |
2025-01-13 |
Advancing Student Writing Through Automated Syntax Feedback |
Kamyar Zeinalipour et.al. |
2501.07740 |
null |
2025-01-13 |
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens |
Dongwon Kim et.al. |
2501.07730 |
null |
2025-01-13 |
LLMic: Romanian Foundation Language Model |
Vlad-Andrei Bădoiu et.al. |
2501.07721 |
null |
2025-01-13 |
CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory |
Haokun Zhao et.al. |
2501.07674 |
null |
2025-01-13 |
Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning |
Karishma Thakrar et.al. |
2501.07663 |
null |
2025-01-13 |
Large Language Models for Interpretable Mental Health Diagnosis |
Brian Hyeongseok Kim et.al. |
2501.07653 |
null |
2025-01-13 |
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations |
Weixi Feng et.al. |
2501.07647 |
null |
2025-01-13 |
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective |
Kun-Peng Ning et.al. |
2501.07641 |
null |
2025-01-13 |
SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models |
Fabien Bernier et.al. |
2501.07639 |
null |
2025-01-13 |
Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss |
Xinyu Zhang et.al. |
2501.07563 |
null |
2025-01-13 |
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought |
Chengzu Li et.al. |
2501.07542 |
null |
2025-01-13 |
ML Mule: Mobile-Driven Context-Aware Collaborative Learning |
Haoxiang Yu et.al. |
2501.07536 |
null |
2025-01-13 |
Investigating Large Language Models in Inferring Personality Traits from User Conversations |
Jianfeng Zhu et.al. |
2501.07532 |
null |
2025-01-13 |
RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment |
Difei Gu et.al. |
2501.07525 |
link |
2025-01-13 |
Parallel Key-Value Cache Fusion for Position Invariant RAG |
Philhoon Oh et.al. |
2501.07523 |
null |
2025-01-13 |
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards |
Yangsibo Huang et.al. |
2501.07493 |
null |
2025-01-13 |
TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models |
Thales Sales Almeida et.al. |
2501.07482 |
null |
2025-01-13 |
A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities |
Yihao Liu et.al. |
2501.07468 |
null |
2025-01-13 |
Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI |
Rolf Pfister et.al. |
2501.07458 |
null |
2025-01-13 |
Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection |
Xin Yin et.al. |
2501.07425 |
null |
2025-01-13 |
Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion |
Lala Shakti Swarup Ray et.al. |
2501.07408 |
null |
2025-01-13 |
OCORD: Open-Campus Object Removal Dataset |
Shuo Zhang et.al. |
2501.07397 |
null |
2025-01-13 |
Simulating the Hubbard Model with Equivariant Normalizing Flows |
Dominic Schuh et.al. |
2501.07371 |
null |
2025-01-13 |
Emergent effects of scaling on the functional hierarchies within large language models |
Paul C. Bogdan et.al. |
2501.07359 |
null |
2025-01-13 |
Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring |
Buse Sibel Korkmaz et.al. |
2501.07324 |
link |
2025-01-13 |
FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering |
Erik Henriksson et.al. |
2501.07314 |
link |
2025-01-13 |
The Lessons of Developing Process Reward Models in Mathematical Reasoning |
Zhenru Zhang et.al. |
2501.07301 |
null |
2025-01-13 |
GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction |
Oleg Kobzarev et.al. |
2501.07295 |
null |
2025-01-13 |
LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks |
Zan-Kai Chong et.al. |
2501.07288 |
null |
2025-01-13 |
Lifelong Learning of Large Language Model based Agents: A Roadmap |
Junhao Zheng et.al. |
2501.07278 |
link |
2025-01-13 |
Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation |
Amir Sartipi et.al. |
2501.07276 |
null |
2025-01-13 |
Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics |
Wonduk Seo et.al. |
2501.07267 |
null |
2025-01-13 |
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion |
Li Liang et.al. |
2501.07260 |
link |
2025-01-13 |
EdgeTAM: On-Device Track Anything Model |
Chong Zhou et.al. |
2501.07256 |
null |
2025-01-13 |
Large Language Models: New Opportunities for Access to Science |
Jutta Schnabel et.al. |
2501.07250 |
null |
2025-01-13 |
Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training |
Ziqing Wen et.al. |
2501.07237 |
link |
2025-01-13 |
Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction |
Qiaoqiao Ren et.al. |
2501.07224 |
link |
2025-01-13 |
Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing |
Laifa Tao et.al. |
2501.07191 |
null |
2025-01-13 |
Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study |
Huashan Chen et.al. |
2501.07165 |
null |
2025-01-13 |
AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model |
Bangchen Yin et.al. |
2501.07155 |
link |
2025-01-13 |
LLM360 K2: Scaling Up 360-Open-Source Large Language Models |
Zhengzhong Liu et.al. |
2501.07124 |
null |
2025-01-13 |
How GPT learns layer by layer |
Jason Du et.al. |
2501.07108 |
link |
2025-01-13 |
ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training |
Jiayang Wu et.al. |
2501.07078 |
link |
2025-01-13 |
D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation |
Zhejun Zhang et.al. |
2501.07077 |
link |
2025-01-13 |
Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values |
Jing Yao et.al. |
2501.07071 |
null |
2025-01-13 |
Enhancing Image Generation Fidelity via Progressive Prompts |
Zhen Xiong et.al. |
2501.07070 |
link |
2025-01-13 |
Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities |
ZeKe Xiao et.al. |
2501.07058 |
null |
2025-01-13 |
SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation |
Yee-Fan Tan et.al. |
2501.07055 |
null |
2025-01-13 |
PoAct: Policy and Action Dual-Control Agent for Generalized Applications |
Guozhi Yuan et.al. |
2501.07054 |
null |
2025-01-13 |
ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction |
Yan Zhang et.al. |
2501.07051 |
link |
2025-01-13 |
Unveiling the Potential of Text in High-Dimensional Time Series Forecasting |
Xin Zhou et.al. |
2501.07048 |
link |
2025-01-13 |
Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis |
Luwei Zeng et.al. |
2501.07034 |
null |
2025-01-13 |
A Proposed Large Language Model-Based Smart Search for Archive System |
Ha Dung Nguyen et.al. |
2501.07024 |
null |
2025-01-13 |
Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps |
Henry Li et.al. |
2501.06999 |
link |
2025-01-13 |
LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models |
Mozhgan Nasr Azadani et.al. |
2501.06986 |
link |
2025-01-13 |
Combining LLM decision and RL action selection to improve RL policy for adaptive interventions |
Karine Karine et.al. |
2501.06980 |
null |
2025-01-12 |
How is Google using AI for internal code migrations? |
Stoyan Nikolov et.al. |
2501.06972 |
null |
2025-01-12 |
Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives |
Xinyao Ma et.al. |
2501.06964 |
null |
2025-01-12 |
Comparison of Autoencoders for tokenization of ASL datasets |
Vouk Praun-Petrovic et.al. |
2501.06942 |
null |
2025-01-12 |
Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy |
Evgeny Ugolkov et.al. |
2501.06939 |
link |
2025-01-12 |
Harnessing Large Language Models for Disaster Management: A Survey |
Zhenyu Lei et.al. |
2501.06932 |
null |
2025-01-12 |
Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories |
Faaiq Waqar et.al. |
2501.06921 |
null |
2025-01-12 |
Risk-Averse Finetuning of Large Language Models |
Sapana Chaudhary et.al. |
2501.06911 |
link |
2025-01-12 |
Deep Learning and Foundation Models for Weather Prediction: A Survey |
Jimeng Shi et.al. |
2501.06907 |
null |
2025-01-12 |
A Foundational Generative Model for Breast Ultrasound Image Analysis |
Haojun Yu et.al. |
2501.06869 |
null |
2025-01-12 |
Transfer Learning of Tabular Data by Finetuning Large Language Models |
Shourav B. Rabbani et.al. |
2501.06863 |
null |
2025-01-12 |
A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context |
Noureldin Zahran et.al. |
2501.06859 |
null |
2025-01-12 |
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training |
Tianjin Huang et.al. |
2501.06842 |
link |
2025-01-12 |
An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering |
Zaber Al Hassan Ayon et.al. |
2501.06837 |
null |
2025-01-12 |
X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding |
Wenqi Zhou et.al. |
2501.06835 |
null |
2025-01-12 |
LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents |
Augusto Gonzalez-Bonorino et.al. |
2501.06834 |
link |
2025-01-12 |
GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing |
Ruizhe Ou et.al. |
2501.06828 |
null |
2025-01-12 |
Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification |
Shijing Chen et.al. |
2501.06827 |
null |
2025-01-12 |
Event Argument Extraction with Enriched Prompts |
Chen Liang et.al. |
2501.06825 |
link |
2025-01-12 |
A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT |
Yizhou Zhou et.al. |
2501.06819 |
null |
2025-01-12 |
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models |
Keyan Chen et.al. |
2501.06809 |
link |
2025-01-12 |
Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting |
Yongshuo Zhu et.al. |
2501.06808 |
null |
2025-01-12 |
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference |
Wenxuan Zeng et.al. |
2501.06807 |
null |
2025-01-12 |
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences |
Liu Yu et.al. |
2501.06795 |
null |
2025-01-12 |
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes |
Mahmoud Ahmed et.al. |
2501.06785 |
link |
2025-01-12 |
Cost-Effective Robotic Handwriting System with AI Integration |
Tianyi Huang et.al. |
2501.06783 |
null |
2025-01-12 |
Eliza: A Web3 friendly AI Agent Operating System |
Shaw Walters et.al. |
2501.06781 |
link |
2025-01-12 |
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning |
Ji Soo Lee et.al. |
2501.06761 |
link |
2025-01-12 |
Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation |
Shunfan Zheng et.al. |
2501.06741 |
null |
2025-01-12 |
ZOQO: Zero-Order Quantized Optimization |
Noga Bar et.al. |
2501.06736 |
null |
2025-01-12 |
Better Prompt Compression Without Multi-Layer Perceptrons |
Edouardo Honig et.al. |
2501.06730 |
null |
2025-01-12 |
Measuring the Robustness of Reference-Free Dialogue Evaluation Systems |
Justin Vasselli et.al. |
2501.06728 |
link |
2025-01-12 |
Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G |
Zhiyan Liu et.al. |
2501.06726 |
null |
2025-01-12 |
DRDT3: Diffusion-Refined Decision Test-Time Training Model |
Xingshuai Huang et.al. |
2501.06718 |
null |
2025-01-12 |
ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian |
Mykyta Syromiatnikov et.al. |
2501.06715 |
link |
2025-01-12 |
Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management |
Liu Qianli et.al. |
2501.06709 |
null |
2025-01-12 |
Evaluating Sample Utility for Data Selection by Mimicking Model Weights |
Tzu-Heng Huang et.al. |
2501.06708 |
null |
2025-01-12 |
AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds |
Yinfang Chen et.al. |
2501.06706 |
null |
2025-01-12 |
Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese |
Jie Yang et.al. |
2501.06704 |
null |
2025-01-12 |
Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users’ Questions |
Aidan Hogan et.al. |
2501.06699 |
null |
2025-01-12 |
DVM: Towards Controllable LLM Agents in Social Deduction Games |
Zheng Zhang et.al. |
2501.06695 |
null |
2025-01-12 |
TAPO: Task-Referenced Adaptation for Prompt Optimization |
Wenxin Luo et.al. |
2501.06689 |
link |
2025-01-12 |
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning |
Xiangen Hu et.al. |
2501.06682 |
null |
2025-01-12 |
Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving |
Haoxiang Gao et.al. |
2501.06680 |
null |
2025-01-11 |
Challenging reaction prediction models to generalize to novel chemistry |
John Bradshaw et.al. |
2501.06669 |
link |
2025-01-11 |
Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training |
Sanjit Kakarla et.al. |
2501.06658 |
link |
2025-01-11 |
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings |
Tong Liu et.al. |
2501.06645 |
null |
2025-01-11 |
Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models |
Veronika Smilga et.al. |
2501.06638 |
link |
2025-01-11 |
Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach |
Mohammed Maree et.al. |
2501.06628 |
null |
2025-01-11 |
Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks |
Amr Almorsi et.al. |
2501.06625 |
null |
2025-01-11 |
Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks |
Xuanhao Luo et.al. |
2501.06604 |
null |
2025-01-11 |
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation |
Xuanle Zhao et.al. |
2501.06598 |
link |
2025-01-11 |
ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning |
Xiangru Tang et.al. |
2501.06590 |
link |
2025-01-11 |
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping |
Muru Zhang et.al. |
2501.06589 |
link |
2025-01-10 |
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs |
Omkar Thawakar et.al. |
2501.06186 |
link |
2025-01-10 |
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs |
Yangyu Huang et.al. |
2501.06184 |
null |
2025-01-10 |
VideoAuteur: Towards Long Narrative Video Generation |
Junfei Xiao et.al. |
2501.06173 |
null |
2025-01-10 |
GenMol: A Drug Discovery Generalist with Discrete Diffusion |
Seul Lee et.al. |
2501.06158 |
null |
2025-01-10 |
Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories |
Gerd Kortemeyer et.al. |
2501.06143 |
null |
2025-01-10 |
Supervision policies can shape long-term risk management in general-purpose AI models |
Manuel Cebrian et.al. |
2501.06137 |
link |
2025-01-10 |
Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI |
Yuya Asano et.al. |
2501.06129 |
null |
2025-01-10 |
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding |
Fabian David Schmidt et.al. |
2501.06117 |
link |
2025-01-10 |
From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy |
Elham Aghakhani et.al. |
2501.06101 |
null |
2025-01-10 |
Photokinetics of Photothermal Reactions |
Mounir Maafi et.al. |
2501.06057 |
null |
2025-01-10 |
AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery |
Johann Wenckstern et.al. |
2501.06039 |
link |
2025-01-10 |
Addressing speaker gender bias in large scale speech translation systems |
Shubham Bansal et.al. |
2501.05989 |
null |
2025-01-10 |
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing |
Eklavya Sarkar et.al. |
2501.05987 |
link |
2025-01-10 |
Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys |
Divya Mani Adhikari et.al. |
2501.05985 |
null |
2025-01-10 |
Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea |
Eunjung Cho et.al. |
2501.05981 |
null |
2025-01-10 |
Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory |
Yunmeng Shu et.al. |
2501.05965 |
null |
2025-01-10 |
Effective faking of verbal deception detection with target-aligned adversarial attacks |
Bennett Kleinberg et.al. |
2501.05962 |
null |
2025-01-10 |
Reusable specimen-level inference in computational pathology |
Jakub R. Kaczmarzyk et.al. |
2501.05945 |
link |
2025-01-10 |
DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information |
Yongfan Lai et.al. |
2501.05932 |
link |
2025-01-10 |
LLMs Reproduce Stereotypes of Sexual and Gender Minorities |
Ruby Ostrow et.al. |
2501.05926 |
null |
2025-01-10 |
Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction |
Petraq Nako et.al. |
2501.05925 |
null |
2025-01-10 |
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design |
Ziheng Wu et.al. |
2501.05901 |
link |
2025-01-10 |
Prompt engineering and its implications on the energy consumption of Large Language Models |
Riccardo Rubei et.al. |
2501.05899 |
link |
2025-01-10 |
Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs |
Bianca Raimondi et.al. |
2501.05891 |
link |
2025-01-10 |
Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs |
Dabing Cheng et.al. |
2501.05884 |
null |
2025-01-10 |
VideoRAG: Retrieval-Augmented Generation over Video Corpus |
Soyeong Jeong et.al. |
2501.05874 |
null |
2025-01-10 |
ConSim: Measuring Concept-Based Explanations’ Effectiveness with Automated Simulatability |
Antonin Poché et.al. |
2501.05855 |
link |
2025-01-10 |
Understanding Impact of Human Feedback via Influence Functions |
Taywon Min et.al. |
2501.05790 |
link |
2025-01-10 |
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models |
You Li et.al. |
2501.05767 |
null |
2025-01-10 |
Controlling Large Language Models Through Concept Activation Vectors |
Hanyu Zhang et.al. |
2501.05764 |
null |
2025-01-10 |
StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation |
Shangjin Zhai et.al. |
2501.05763 |
null |
2025-01-10 |
CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech |
Madhurananda Pahar et.al. |
2501.05755 |
null |
2025-01-10 |
Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models |
Sungjae Lee et.al. |
2501.05752 |
null |
2025-01-10 |
TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos |
Korawat Charoenpitaks et.al. |
2501.05733 |
link |
2025-01-10 |
Enabling Scalable Oversight via Self-Evolving Critic |
Zhengyang Tang et.al. |
2501.05727 |
null |
2025-01-10 |
I Can’t Share Code, but I need Translation – An Empirical Study on Code Translation through Federated LLM |
Jahnavi Kumar et.al. |
2501.05724 |
null |
2025-01-10 |
How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond |
Chen Huang et.al. |
2501.05714 |
null |
2025-01-10 |
Multi-Step Reasoning in Korean and the Emergent Mirage |
Guijin Son et.al. |
2501.05712 |
null |
2025-01-10 |
EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model |
Yi He et.al. |
2501.05710 |
null |
2025-01-10 |
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains |
Vighnesh Subramaniam et.al. |
2501.05707 |
null |
2025-01-10 |
Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness |
Audrey Salmon et.al. |
2501.05706 |
null |
2025-01-10 |
Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection |
Feiyi Chen et.al. |
2501.05675 |
null |
2025-01-10 |
Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration |
Zuyuan Zhang et.al. |
2501.05673 |
null |
2025-01-10 |
Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models |
Zheqi Lv et.al. |
2501.05662 |
null |
2025-01-10 |
Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation |
Zheqi Lv et.al. |
2501.05647 |
null |
2025-01-10 |
Iconicity in Large Language Models |
Anna Marklová et.al. |
2501.05643 |
null |
2025-01-10 |
HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection |
Anant Mehta et.al. |
2501.05631 |
link |
2025-01-10 |
The Impact of Model Scaling on Seen and Unseen Language Performance |
Rhitabrat Pokharel et.al. |
2501.05629 |
null |
2025-01-09 |
Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study |
Zhenyu Qi et.al. |
2501.05625 |
null |
2025-01-09 |
Exploring Large Language Models for Translating Romanian Computational Problems into English |
Adrian Marius Dumitran et.al. |
2501.05601 |
null |
2025-01-09 |
Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics |
Gert Aarts et.al. |
2501.05580 |
null |
2025-01-09 |
Exploring Large Language Models (LLMs) through interactive Python activities |
Eugenio Tufino et.al. |
2501.05577 |
link |
2025-01-09 |
LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts |
Yuri Facanha Bezerra et.al. |
2501.05554 |
link |
2025-01-09 |
The dynamics of meaning through time: Assessment of Large Language Models |
Mohamed Taher Alrefaie et.al. |
2501.05552 |
null |
2025-01-09 |
Infecting Generative AI With Viruses |
David Noever et.al. |
2501.05542 |
null |
2025-01-09 |
NSChat: A Chatbot System To Rule Them All |
Zenon Lamprou et.al. |
2501.05541 |
null |
2025-01-09 |
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding |
Xingyu Fu et.al. |
2501.05452 |
null |
2025-01-09 |
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors |
Yifan Yu et.al. |
2501.05446 |
link |
2025-01-09 |
Consistent Flow Distillation for Text-to-3D Generation |
Runjie Yan et.al. |
2501.05445 |
null |
2025-01-09 |
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark |
Yunzhuo Hao et.al. |
2501.05444 |
null |
2025-01-09 |
A survey of textual cyber abuse detection using cutting-edge language models and large language models |
Jose A. Diaz-Garcia et.al. |
2501.05443 |
null |
2025-01-09 |
Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation |
Xuyi Meng et.al. |
2501.05427 |
null |
2025-01-09 |
Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers |
Jerry Chongyi Hu et.al. |
2501.05423 |
null |
2025-01-09 |
Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation |
Darius Petermann et.al. |
2501.05413 |
null |
2025-01-10 |
Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics |
Maximilian Alber et.al. |
2501.05409 |
null |
2025-01-09 |
TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts |
Yu-Hao Huang et.al. |
2501.05403 |
null |
2025-01-09 |
Mechanistic understanding and validation of large AI models with SemanticLens |
Maximilian Dreyer et.al. |
2501.05398 |
null |
2025-01-09 |
FairCode: Evaluating Social Bias of LLMs in Code Generation |
Yongkang Du et.al. |
2501.05396 |
link |
2025-01-09 |
Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models |
Kristian G. Barman et.al. |
2501.05382 |
null |
2025-01-09 |
Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance |
Dimitrios Gerogiannis et.al. |
2501.05379 |
null |
2025-01-09 |
Accelerated Diffusion Models via Speculative Sampling |
Valentin De Bortoli et.al. |
2501.05370 |
null |
2025-01-09 |
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction |
Hantao Lou et.al. |
2501.05336 |
link |
2025-01-09 |
“What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles |
Xuewen Luo et.al. |
2501.05322 |
null |
2025-01-09 |
Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning |
Nora Gourmelon et.al. |
2501.05281 |
link |
2025-01-09 |
CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models |
Fabian Hörst et.al. |
2501.05269 |
link |
2025-01-09 |
Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal |
Wanli Ma et.al. |
2501.05265 |
null |
2025-01-09 |
CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models |
Yewei Song et.al. |
2501.05255 |
null |
2025-01-09 |
From Scientific Texts to Verifiable Code: Automating the Process with Transformers |
Changjie Wang et.al. |
2501.05252 |
null |
2025-01-09 |
RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models |
Peizhuo Lv et.al. |
2501.05249 |
null |
2025-01-09 |
Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning |
Laura Puccioni et.al. |
2501.05248 |
null |
2025-01-09 |
Online Prompt and Solver Selection for Program Synthesis |
Yixuan Li et.al. |
2501.05247 |
null |
2025-01-09 |
Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs |
Artem Fedorchenko et.al. |
2501.05234 |
null |
2025-01-09 |
Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection |
Pei-Kang Lee et.al. |
2501.05228 |
null |
2025-01-09 |
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes |
Ludwic Leonard et.al. |
2501.05226 |
null |
2025-01-09 |
Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond |
Tomas Goldsack et.al. |
2501.05224 |
null |
2025-01-09 |
A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education |
Ziqing Li et.al. |
2501.05220 |
null |
2025-01-09 |
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration |
Xuyang Liu et.al. |
2501.05179 |
link |
2025-01-09 |
Emergence of human-like polarization among large language model agents |
Jinghua Piao et.al. |
2501.05171 |
null |
2025-01-09 |
Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering |
Matteo Esposito et.al. |
2501.05165 |
null |
2025-01-09 |
Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier |
Yufei Shang et.al. |
2501.05155 |
null |
2025-01-09 |
DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving |
Xuran Zheng et.al. |
2501.05081 |
null |
2025-01-09 |
Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization |
Harshith Manjunath et.al. |
2501.05079 |
null |
2025-01-09 |
Analyzing Memorization in Large Language Models through the Lens of Model Attribution |
Tarun Ram Menta et.al. |
2501.05078 |
link |
2025-01-09 |
A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model |
Shuo Tong et.al. |
2501.05075 |
null |
2025-01-09 |
Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning |
Huabin Liu et.al. |
2501.05069 |
null |
2025-01-09 |
LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding |
Jiaxing Zhao et.al. |
2501.05067 |
null |
2025-01-09 |
Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators |
Leonard Lupin-Jimenez et.al. |
2501.05058 |
null |
2025-01-09 |
LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models |
Zengqi Peng et.al. |
2501.05057 |
null |
2025-01-09 |
On the Generalizability of Transformer Models to Code Completions of Different Lengths |
Nathan Cooper et.al. |
2501.05051 |
null |
2025-01-09 |
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution |
Chengxing Xie et.al. |
2501.05040 |
link |
2025-01-09 |
Enhancing Human-Like Responses in Large Language Models |
Ethem Yağız Çalık et.al. |
2501.05032 |
null |
2025-01-09 |
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark |
Ronghao Dang et.al. |
2501.05031 |
link |
2025-01-09 |
A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications |
Ofir Marom et.al. |
2501.05030 |
null |
2025-01-09 |
TreeKV: Smooth Key-Value Cache Compression with Tree Structures |
Ziwei He et.al. |
2501.04987 |
null |
2025-01-09 |
SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs |
Muhammad Salman et.al. |
2501.04985 |
null |
2025-01-09 |
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer |
Hangzhou He et.al. |
2501.04975 |
link |
2025-01-09 |
Demystifying Domain-adaptive Post-training for Financial LLMs |
Zixuan Ke et.al. |
2501.04961 |
link |
2025-01-09 |
Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments |
Yifan Xu et.al. |
2501.04947 |
null |
2025-01-09 |
Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models |
Qingyu Ren et.al. |
2501.04945 |
link |
2025-01-09 |
Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency |
Shiji Zhao et.al. |
2501.04931 |
null |
2025-01-09 |
Investigating Numerical Translation with Large Language Models |
Wei Tang et.al. |
2501.04927 |
null |
2025-01-09 |
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching |
Jun-Hak Yun et.al. |
2501.04926 |
null |
2025-01-09 |
HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers |
Yiyao Yang et.al. |
2501.04908 |
link |
2025-01-09 |
JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis |
Jun-Hyeok Cha et.al. |
2501.04904 |
null |
2025-01-09 |
ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries |
Keke Huang et.al. |
2501.04901 |
null |
2025-01-09 |
SUGAR: Leveraging Contextual Confidence for Smarter Retrieval |
Hanna Zubkova et.al. |
2501.04899 |
null |
2025-01-08 |
Leveraging Log Probabilities in Language Models to Forecast Future Events |
Tommaso Soru et.al. |
2501.04880 |
null |
2025-01-08 |
Real-Time Textless Dialogue Generation |
Long Mai et.al. |
2501.04877 |
link |
2025-01-08 |
Modelling complex proton transport phenomena – Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields |
Malte Grunert et.al. |
2501.04876 |
null |
2025-01-08 |
Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware |
Brandon J Walton et.al. |
2501.04848 |
null |
2025-01-08 |
Do Code LLMs Understand Design Patterns? |
Zhenyu Pan et.al. |
2501.04835 |
null |
2025-01-08 |
On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability |
Andreas Vogelsang et.al. |
2501.04810 |
null |
2025-01-08 |
IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX |
Erik Recio-Armengol et.al. |
2501.04776 |
link |
2025-01-08 |
Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations |
Kirandeep Kaur et.al. |
2501.04762 |
null |
2025-01-08 |
Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch |
Phillip Richter et.al. |
2501.04755 |
null |
2025-01-08 |
EditAR: Unified Conditional Generation with Autoregressive Models |
Jiteng Mu et.al. |
2501.04699 |
null |
2025-01-08 |
Re-ranking the Context for Multimodal Retrieval Augmented Generation |
Matin Mortaheb et.al. |
2501.04695 |
null |
2025-01-08 |
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images |
Zixuan Huang et.al. |
2501.04689 |
null |
2025-01-08 |
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics |
Ruilin Luo et.al. |
2501.04686 |
link |
2025-01-08 |
Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations |
Archita Srivastava et.al. |
2501.04675 |
null |
2025-01-08 |
Assessing Language Comprehension in Large Language Models Using Construction Grammar |
Wesley Scivetti et.al. |
2501.04661 |
null |
2025-01-08 |
Multi-task retriever fine-tuning for domain-specific and efficient RAG |
Patrice Béchard et.al. |
2501.04652 |
null |
2025-01-08 |
FlairGPT: Repurposing LLMs for Interior Designs |
Gabrielle Littlefair et.al. |
2501.04648 |
null |
2025-01-08 |
Knowledge Retrieval Based on Generative AI |
Te-Lun Yang et.al. |
2501.04635 |
null |
2025-01-08 |
“Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era |
Giulio Antonio Abbo et.al. |
2501.04633 |
null |
2025-01-09 |
MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation |
Daniele Molino et.al. |
2501.04614 |
null |
2025-01-08 |
Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning |
Ivan Kankeu et.al. |
2501.04591 |
link |
2025-01-08 |
Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models |
Miaoyang He et.al. |
2501.04582 |
null |
2025-01-08 |
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection |
Yuhang Liu et.al. |
2501.04575 |
link |
2025-01-09 |
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis |
Run Luo et.al. |
2501.04561 |
link |
2025-01-08 |
The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? |
Christopher Lazik et.al. |
2501.04543 |
null |
2025-01-08 |
Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time |
Uri Berger et.al. |
2501.04513 |
null |
2025-01-08 |
CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection |
Ruijun Feng et.al. |
2501.04510 |
null |
2025-01-08 |
Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction |
Guofeng Yang et.al. |
2501.04487 |
null |
2025-01-08 |
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages |
Archchana Sindhujan et.al. |
2501.04473 |
null |
2025-01-08 |
Hidden Entity Detection from GitHub Leveraging Large Language Models |
Lu Gan et.al. |
2501.04455 |
link |
2025-01-08 |
Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions |
Doaa Mahmud et.al. |
2501.04437 |
null |
2025-01-08 |
Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions |
Na Yan et.al. |
2501.04436 |
null |
2025-01-08 |
End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach |
H. M. Shadman Tabib et.al. |
2501.04425 |
null |
2025-01-08 |
SEO: Stochastic Experience Optimization for Large Language Models |
Jitao Xu et.al. |
2501.04393 |
null |
2025-01-08 |
iFADIT: Invertible Face Anonymization via Disentangled Identity Transform |
Lin Yuan et.al. |
2501.04390 |
null |
2025-01-08 |
DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications |
Feng Liu et.al. |
2501.04366 |
link |
2025-01-08 |
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting |
Dong-Hai Zhu et.al. |
2501.04341 |
link |
2025-01-09 |
Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models |
Haonan Shi et.al. |
2501.04323 |
null |
2025-01-08 |
Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts |
Preethi Seshadri et.al. |
2501.04316 |
link |
2025-01-08 |
RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation |
Jun Liu et.al. |
2501.04315 |
null |
2025-01-08 |
Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models |
Kunpeng Zhang et.al. |
2501.04312 |
null |
2025-01-08 |
LLM4SR: A Survey on Large Language Models for Scientific Research |
Ziming Luo et.al. |
2501.04306 |
link |
2025-01-08 |
Multimodal Graph Constrastive Learning and Prompt for ChartQA |
Yue Dai et.al. |
2501.04303 |
null |
2025-01-08 |
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving |
Siran Chen et.al. |
2501.04302 |
null |
2025-01-08 |
An Analysis of Model Robustness across Concurrent Distribution Shifts |
Myeongho Jeon et.al. |
2501.04288 |
null |
2025-01-08 |
Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models |
Bahman Torkamandi et.al. |
2501.04286 |
null |
2025-01-08 |
Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking |
Tianqi Ren et.al. |
2501.04285 |
null |
2025-01-08 |
OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments |
Yujie Tang et.al. |
2501.04279 |
null |
2025-01-08 |
Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering |
Christophe Bajan et.al. |
2501.04277 |
link |
2025-01-08 |
Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation |
Senwei Xie et.al. |
2501.04268 |
null |
2025-01-08 |
Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning |
Lang Xu et.al. |
2501.04266 |
null |
2025-01-08 |
IOLBENCH: Benchmarking LLMs on Linguistic Reasoning |
Satyam Goyal et.al. |
2501.04249 |
link |
2025-01-08 |
TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events |
Jian-Hua Fang et.al. |
2501.04247 |
null |
2025-01-08 |
Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks |
Rachel Longjohn et.al. |
2501.04234 |
null |
2025-01-07 |
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation |
Alireza Salemi et.al. |
2501.04167 |
null |
2025-01-07 |
AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots |
Shaoyue Wen et.al. |
2501.04156 |
link |
2025-01-07 |
Multilingual Open QA on the MIA Shared Task |
Navya Yarrabelly et.al. |
2501.04153 |
null |
2025-01-07 |
The angular momentum spiral of the Milky Way disc in Gaia |
Rashid Yaaqib et.al. |
2501.04095 |
null |
2025-01-07 |
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives |
Xiaoqing Zhang et.al. |
2501.04070 |
link |
2025-01-07 |
ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono |
Jingquan Wang et.al. |
2501.04062 |
null |
2025-01-07 |
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving |
Lingdong Kong et.al. |
2501.04005 |
null |
2025-01-07 |
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos |
Haobo Yuan et.al. |
2501.04001 |
link |
2025-01-07 |
RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance |
Matin Mortaheb et.al. |
2501.03995 |
null |
2025-01-07 |
Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance |
Adil Rengim Cetingoz et.al. |
2501.03993 |
null |
2025-01-07 |
Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles |
Yuxi Xia et.al. |
2501.03991 |
null |
2025-01-07 |
(De)-Indexing and the Right to be Forgotten |
Salvatore Vilella et.al. |
2501.03989 |
null |
2025-01-07 |
VLM-driven Behavior Tree for Context-aware Task Planning |
Naoki Wake et.al. |
2501.03968 |
link |
2025-01-07 |
Vision Language Models as Values Detectors |
Giulio Antonio Abbo et.al. |
2501.03957 |
null |
2025-01-07 |
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States |
Jurgita Kapočiūtė-Dzikienė et.al. |
2501.03952 |
null |
2025-01-07 |
Synthetic Data Privacy Metrics |
Amy Steier et.al. |
2501.03941 |
null |
2025-01-07 |
Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection |
Pablo Miralles-González et.al. |
2501.03940 |
null |
2025-01-07 |
A precise asymptotic analysis of learning diffusion models: theory and insights |
Hugo Cui et.al. |
2501.03937 |
link |
2025-01-07 |
Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study |
Ramya Jonnala et.al. |
2501.03904 |
null |
2025-01-07 |
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token |
Shaolei Zhang et.al. |
2501.03895 |
link |
2025-01-07 |
AlphaPO – Reward shape matters for LLM alignment |
Aman Gupta et.al. |
2501.03884 |
null |
2025-01-07 |
CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds |
Keonwoo Kim et.al. |
2501.03879 |
null |
2025-01-07 |
Progressive Document-level Text Simplification via Large Language Models |
Dengzhao Fang et.al. |
2501.03857 |
null |
2025-01-07 |
MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention |
Aadya Arora et.al. |
2501.03839 |
null |
2025-01-07 |
Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging |
Simon W. Penninga et.al. |
2501.03825 |
null |
2025-01-08 |
MADation: Face Morphing Attack Detection with Foundation Models |
Eduarda Caldeira et.al. |
2501.03800 |
link |
2025-01-07 |
KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration |
Chengyuan Li et.al. |
2501.03786 |
null |
2025-01-07 |
Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series |
Yuxiao Hu et.al. |
2501.03747 |
null |
2025-01-07 |
Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein |
Xiaotong Guo et.al. |
2501.03722 |
null |
2025-01-07 |
Motion-Aware Generative Frame Interpolation |
Guozhen Zhang et.al. |
2501.03699 |
null |
2025-01-07 |
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment |
Yuchun Fan et.al. |
2501.03681 |
link |
2025-01-07 |
Effective and Efficient Mixed Precision Quantization of Speech Foundation Models |
Haoning Xu et.al. |
2501.03643 |
null |
2025-01-07 |
CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems |
Zhaonan Wu et.al. |
2501.03626 |
link |
2025-01-07 |
LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment |
Gaoussou Youssouf Kebe et.al. |
2501.03624 |
null |
2025-01-07 |
Cosmos World Foundation Model Platform for Physical AI |
NVIDIA et.al. |
2501.03575 |
link |
2025-01-07 |
From Code to Compliance: Assessing ChatGPT’s Utility in Designing an Accessible Webpage – A Case Study |
Ammar Ahmed et.al. |
2501.03572 |
null |
2025-01-07 |
What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs |
Muneera Bano et.al. |
2501.03569 |
null |
2025-01-07 |
Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities |
Benedikt Reitemeyer et.al. |
2501.03566 |
null |
2025-01-07 |
Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis |
Haoran Lai et.al. |
2501.03565 |
null |
2025-01-07 |
PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models |
Lingzhi Yuan et.al. |
2501.03544 |
null |
2025-01-07 |
Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions |
Weijieying Ren et.al. |
2501.03540 |
null |
2025-01-07 |
Deep Learning for Pathological Speech: A Survey |
Shakeel A. Sheikh et.al. |
2501.03536 |
null |
2025-01-08 |
SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving |
Xuewen Luo et.al. |
2501.03535 |
null |
2025-01-07 |
A generative approach for lensless imaging in low-light conditions |
Ziyang Liu et.al. |
2501.03511 |
null |
2025-01-07 |
A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models |
Shuyang Wang et.al. |
2501.03508 |
null |
2025-01-07 |
Textualize Visual Prompt for Image Editing via Diffusion Bridge |
Pengcheng Xu et.al. |
2501.03495 |
null |
2025-01-07 |
Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment |
Prashant Trivedi et.al. |
2501.03486 |
null |
2025-01-07 |
Reading with Intent – Neutralizing Intent |
Benjamin Reichman et.al. |
2501.03475 |
null |
2025-01-07 |
Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning |
Chuang Niu et.al. |
2501.03469 |
link |
2025-01-07 |
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems |
Yannis Katsis et.al. |
2501.03468 |
link |
2025-01-07 |
ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation |
Yu-Cheng Liu et.al. |
2501.03462 |
null |
2025-01-07 |
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation |
Xiao Wang et.al. |
2501.03458 |
link |
2025-01-07 |
CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering |
Jialiang Chen et.al. |
2501.03447 |
null |
2025-01-07 |
LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models |
Mohamad Fakih et.al. |
2501.03446 |
null |
2025-01-07 |
Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology |
Sarah E. Finch et.al. |
2501.03441 |
link |
2025-01-06 |
SALT: Sales Autocompletion Linked Business Tables Dataset |
Tassilo Klein et.al. |
2501.03413 |
link |
2025-01-06 |
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations |
Simone Giovannini et.al. |
2501.03403 |
null |
2025-01-06 |
DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes |
Xuyang Wang et.al. |
2501.03397 |
link |
2025-01-06 |
Evolved Quantum Boltzmann Machines |
Michele Minervini et.al. |
2501.03367 |
null |
2025-01-06 |
CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets |
Tanay Agrawal et.al. |
2501.03332 |
null |
2025-01-06 |
LiLMaps: Learnable Implicit Language Maps |
Evgenii Kruzhkov et.al. |
2501.03304 |
null |
2025-01-06 |
A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval |
Shuo Tong et.al. |
2501.03295 |
null |
2025-01-06 |
Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model |
Naibo Wang et.al. |
2501.03292 |
null |
2025-01-06 |
ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning |
Pengwei Tang et.al. |
2501.03291 |
null |
2025-01-06 |
CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models |
Zhenyu Xu et.al. |
2501.03288 |
null |
2025-01-06 |
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning |
Beichen Zhang et.al. |
2501.03226 |
link |
2025-01-06 |
Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text |
Ayat Najjar et.al. |
2501.03212 |
null |
2025-01-06 |
Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity |
Ayat A. Najjar et.al. |
2501.03203 |
null |
2025-01-06 |
CLIX: Cross-Lingual Explanations of Idiomatic Expressions |
Aaron Gluck et.al. |
2501.03191 |
null |
2025-01-06 |
Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text |
Ali Al-Lawati et.al. |
2501.03166 |
link |
2025-01-06 |
Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy |
Risha Goel et.al. |
2501.03153 |
link |
2025-01-06 |
Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches |
Alhassan Mumuni et.al. |
2501.03151 |
null |
2025-01-06 |
VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity |
Yerong Li et.al. |
2501.03139 |
null |
2025-01-07 |
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models |
Mingyang Song et.al. |
2501.03124 |
link |
2025-01-06 |
CAT: Content-Adaptive Image Tokenization |
Junhong Shen et.al. |
2501.03120 |
null |
2025-01-06 |
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases |
Dylan Bouchard et.al. |
2501.03112 |
link |
2025-01-06 |
Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling |
Aseem Srivastava et.al. |
2501.03088 |
null |
2025-01-06 |
Retrieval-Augmented TLAPS Proof Generation with Large Language Models |
Yuhao Zhou et.al. |
2501.03073 |
null |
2025-01-06 |
ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events |
Duygu Sezen Islakoglu et.al. |
2501.03040 |
null |
2025-01-06 |
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning |
Zhen Li et.al. |
2501.03035 |
null |
2025-01-06 |
TransPixar: Advancing Text-to-Video Generation with Transparency |
Luozhou Wang et.al. |
2501.03006 |
link |
2025-01-06 |
CALM: Curiosity-Driven Auditing for Large Language Models |
Xiang Zheng et.al. |
2501.02997 |
link |
2025-01-06 |
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation |
Zhi Qu et.al. |
2501.02979 |
link |
2025-01-06 |
FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models |
Zhuo Chen et.al. |
2501.02968 |
null |
2025-01-07 |
Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild |
Wanpeng Hu et.al. |
2501.02964 |
link |
2025-01-07 |
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild |
Jiawei Liu et.al. |
2501.02962 |
null |
2025-01-06 |
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features |
Shi Bin Hoo et.al. |
2501.02945 |
link |
2025-01-07 |
Inhibition of bacterial growth by antibiotics |
Barnabe Ledoux et.al. |
2501.02944 |
null |
2025-01-06 |
Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions |
Jianhua Pei et.al. |
2501.02928 |
null |
2025-01-06 |
DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model |
Hao Yu et.al. |
2501.02901 |
link |
2025-01-06 |
FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection |
Guray Ozgur et.al. |
2501.02892 |
link |
2025-01-06 |
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs |
Hui Sun et.al. |
2501.02885 |
null |
2025-01-06 |
IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment |
Yiming Zhang et.al. |
2501.02869 |
null |
2025-01-06 |
Large Language Models for Video Surveillance Applications |
Ulindu De Silva et.al. |
2501.02850 |
null |
2025-01-06 |
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification |
Yubo Wang et.al. |
2501.02844 |
null |
2025-01-06 |
Foundations of GenIR |
Qingyao Ai et.al. |
2501.02842 |
null |
2025-01-06 |
An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models |
Dezhi Ran et.al. |
2501.02829 |
null |
2025-01-06 |
InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion |
Zhaoyi Yan et.al. |
2501.02795 |
null |
2025-01-06 |
CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation |
Yuanhong Chen et.al. |
2501.02786 |
null |
2025-01-06 |
GeAR: Generation Augmented Retrieval |
Haoyu Liu et.al. |
2501.02772 |
null |
2025-01-06 |
Visual Large Language Models for Generalized and Specialized Applications |
Yifan Li et.al. |
2501.02765 |
link |
2025-01-06 |
Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? |
Hongyi Miao et.al. |
2501.02751 |
null |
2025-01-06 |
Artificial Intelligence in Creative Industries: Advances Prior to 2025 |
Nantheera Anantrasirichai et.al. |
2501.02725 |
null |
2025-01-06 |
KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models |
Zaiyi Zheng et.al. |
2501.02711 |
null |
2025-01-06 |
QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance |
Binita Saha et.al. |
2501.02702 |
null |
2025-01-06 |
EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models |
Andrés Villa et.al. |
2501.02699 |
null |
2025-01-05 |
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking |
Weikang Bian et.al. |
2501.02690 |
null |
2025-01-05 |
Decoding specialised feature neurons in LLMs with the final projection layer |
Harry J Davies et.al. |
2501.02688 |
null |
2025-01-05 |
From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering |
Wen-ran Li et.al. |
2501.02680 |
null |
2025-01-05 |
A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model |
Shivaram Kalyanakrishnan et.al. |
2501.02652 |
null |
2025-01-05 |
Representation Learning of Lab Values via Masked AutoEncoder |
David Restrepo et.al. |
2501.02648 |
link |
2025-01-05 |
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense |
Yang Ouyang et.al. |
2501.02629 |
link |
2025-01-05 |
Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets |
Mahmoud Jahanshahi et.al. |
2501.02628 |
null |
2025-01-05 |
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning |
Saleh Ashkboos et.al. |
2501.02625 |
null |
2025-01-05 |
LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment |
Yifei Liu et.al. |
2501.02621 |
null |
2025-01-05 |
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms |
Jovan Stojkovic et.al. |
2501.02600 |
null |
2025-01-05 |
LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations |
Jiaping Wang et.al. |
2501.02573 |
link |
2025-01-05 |
Multi-LLM Collaborative Caption Generation in Scientific Documents |
Jaeyoung Kim et.al. |
2501.02552 |
link |
2025-01-05 |
Transformers Simulate MLE for Sequence Generation in Bayesian Networks |
Yuan Cao et.al. |
2501.02547 |
null |
2025-01-05 |
Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm |
Ljubisa Bojic et.al. |
2501.02532 |
null |
2025-01-05 |
Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI |
Ljubisa Bojic et.al. |
2501.02531 |
null |
2025-01-05 |
Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks |
Leo Franklin et.al. |
2501.02527 |
null |
2025-01-05 |
Unified Guidance for Geometry-Conditioned Molecular Generation |
Sirine Ayadi et.al. |
2501.02526 |
null |
2025-01-05 |
Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors |
Minglin Chen et.al. |
2501.02519 |
null |
2025-01-05 |
CHAIR-Classifier of Hallucination as Improver |
Ao Sun et.al. |
2501.02518 |
link |
2025-01-05 |
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use |
Junjie Ye et.al. |
2501.02506 |
null |
2025-01-05 |
Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons |
Sebastian Morel-Balbi et.al. |
2501.02505 |
null |
2025-01-05 |
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling |
Chaojie Mao et.al. |
2501.02487 |
null |
2025-01-05 |
LLMPC: Large Language Model Predictive Control |
Gabriel Maher et.al. |
2501.02486 |
link |
2025-01-05 |
Decoding News Bias: Multi Bias Detection in News Articles |
Bhushan Santosh Shah et.al. |
2501.02482 |
null |
2025-01-05 |
Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine |
Yishen Liu et.al. |
2501.02471 |
null |
2025-01-05 |
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera |
Yuliang Guo et.al. |
2501.02464 |
null |
2025-01-05 |
Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications |
Zhe Chen et.al. |
2501.02460 |
null |
2025-01-05 |
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap |
Hyunwoo Ko et.al. |
2501.02448 |
null |
2025-01-05 |
RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework |
Kun Wang et.al. |
2501.02446 |
null |
2025-01-05 |
A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models |
Yinpeng Cai et.al. |
2501.02441 |
null |
2025-01-05 |
Efficient Deployment of Large Language Models on Resource-constrained Devices |
Zhiwei Yao et.al. |
2501.02438 |
null |
2025-01-05 |
FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance |
Haicheng Wang et.al. |
2501.02430 |
link |
2025-01-05 |
GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems |
Mehmet Deniz Türkmen et.al. |
2501.02408 |
null |
2025-01-04 |
Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities |
Tara Radvand et.al. |
2501.02406 |
null |
2025-01-04 |
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers |
Markus J. Buehler et.al. |
2501.02393 |
link |
2025-01-04 |
Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations |
Kangyu Zhu et.al. |
2501.02385 |
null |
2025-01-04 |
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison |
Tsz Kin Lam et.al. |
2501.02370 |
null |
2025-01-04 |
Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving |
Sanghyun Park et.al. |
2501.02348 |
null |
2025-01-04 |
Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support |
Florian Putz et.al. |
2501.02346 |
null |
2025-01-04 |
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility |
Yonglin Tian et.al. |
2501.02341 |
link |
2025-01-04 |
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference |
Zhuomin He et.al. |
2501.02336 |
link |
2025-01-04 |
Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications |
Jodi M. Casabianca et.al. |
2501.02334 |
null |
2025-01-04 |
Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance |
Marta Gentiloni-Silveri et.al. |
2501.02298 |
null |
2025-01-04 |
Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection |
Yachao Zhao et.al. |
2501.02295 |
null |
2025-01-04 |
Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control |
Yongjeong Oh et.al. |
2501.02273 |
null |
2025-01-04 |
What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph |
Yutao Jiang et.al. |
2501.02268 |
link |
2025-01-04 |
Unsupervised Class Generation to Expand Semantic Segmentation Datasets |
Javier Montalvo et.al. |
2501.02264 |
null |
2025-01-04 |
Financial Named Entity Recognition: How Far Can LLM Go? |
Yi-Te Lu et.al. |
2501.02237 |
link |
2025-01-04 |
Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends |
Camille Barboule et.al. |
2501.02235 |
null |
2025-01-04 |
Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection |
S M Mostaq Hossain et.al. |
2501.02229 |
null |
2025-01-04 |
Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation |
Shijie Wang et.al. |
2501.02226 |
null |
2025-01-04 |
Can ChatGPT implement finite element models for geotechnical engineering applications? |
Taegu Kim et.al. |
2501.02199 |
null |
2025-01-04 |
EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks |
Shixuan Liu et.al. |
2501.02192 |
null |
2025-01-04 |
On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing |
Jianwei Wang et.al. |
2501.02191 |
link |
2025-01-04 |
The Application of Large Language Models in Recommendation Systems |
Peiyang Yu et.al. |
2501.02178 |
null |
2025-01-04 |
The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit |
Huixue Zhou et.al. |
2501.02173 |
null |
2025-01-04 |
Personalized Graph-Based Retrieval for Large Language Models |
Steven Au et.al. |
2501.02157 |
link |
2025-01-04 |
Table as Thought: Exploring Structured Thoughts in LLM Reasoning |
Zhenjie Sun et.al. |
2501.02152 |
null |
2025-01-04 |
Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN |
Yanxi Chen et.al. |
2501.02146 |
null |
2025-01-03 |
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction |
Chaoyou Fu et.al. |
2501.01957 |
link |
2025-01-03 |
Metadata Conditioning Accelerates Language Model Pre-training |
Tianyu Gao et.al. |
2501.01956 |
link |
2025-01-03 |
MADGEN – Mass-Spec attends to De Novo Molecular generation |
Yinkai Wang et.al. |
2501.01950 |
null |
2025-01-03 |
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap |
Weizhi Zhang et.al. |
2501.01945 |
link |
2025-01-03 |
Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models |
Manh Duong Nguyen et.al. |
2501.01932 |
link |
2025-01-03 |
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM |
Yifan Du et.al. |
2501.01904 |
link |
2025-01-03 |
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation |
Siyuan Huang et.al. |
2501.01895 |
null |
2025-01-03 |
Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions |
Rachneet Sachdeva et.al. |
2501.01872 |
link |
2025-01-03 |
Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification |
Xiangxiang Dai et.al. |
2501.01849 |
link |
2025-01-03 |
MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning |
Pu Yang et.al. |
2501.01834 |
null |
2025-01-03 |
Time Series Language Model for Descriptive Caption Generation |
Mohamed Trabelsi et.al. |
2501.01832 |
null |
2025-01-03 |
Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models |
Yanjiang Liu et.al. |
2501.01830 |
null |
2025-01-03 |
SDPO: Segment-Level Direct Preference Optimization for Social Agents |
Aobo Kong et.al. |
2501.01821 |
link |
2025-01-03 |
BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction |
Ferhat Ozgur Catak et.al. |
2501.01802 |
link |
2025-01-03 |
Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation |
Mohammad Khalil et.al. |
2501.01793 |
link |
2025-01-03 |
Efficient LLM Inference with Activation Checkpointing and Hybrid Caching |
Sanghyeon Lee et.al. |
2501.01792 |
null |
2025-01-03 |
Nonparametric estimation of a factorizable density using diffusion models |
Hyeok Kyu Kwon et.al. |
2501.01783 |
null |
2025-01-03 |
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation |
Mingjie Li et.al. |
2501.01765 |
null |
2025-01-03 |
Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models |
Andrea Matteazzi et.al. |
2501.01761 |
null |
2025-01-03 |
MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling |
Simon Rouard et.al. |
2501.01757 |
null |
2025-01-03 |
Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation |
Kangcheng Luo et.al. |
2501.01743 |
null |
2025-01-03 |
How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models |
Simone Corbo et.al. |
2501.01741 |
null |
2025-01-03 |
AR4D: Autoregressive 4D Generation from Monocular Videos |
Hanxin Zhu et.al. |
2501.01722 |
null |
2025-01-03 |
Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models |
Guosheng Zhang et.al. |
2501.01720 |
null |
2025-01-03 |
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries |
Michal Kuk et.al. |
2501.01711 |
null |
2025-01-03 |
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders |
Jiajun Cao et.al. |
2501.01709 |
null |
2025-01-03 |
AgentRefine: Enhancing Agent Generalization through Refinement Tuning |
Dayuan Fu et.al. |
2501.01702 |
null |
2025-01-03 |
Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models |
Lei Tang et.al. |
2501.01679 |
null |
2025-01-03 |
Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption |
Zhang Ruoyan et.al. |
2501.01672 |
null |
2025-01-03 |
BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction |
Alaeddine Diaf et.al. |
2501.01664 |
null |
2025-01-03 |
Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning |
Danni Peng et.al. |
2501.01653 |
null |
2025-01-03 |
MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments |
Cai Yin et.al. |
2501.01652 |
link |
2025-01-03 |
HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding |
Heqing Zou et.al. |
2501.01645 |
null |
2025-01-03 |
iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings |
Shuhei Tomoshige et.al. |
2501.01642 |
null |
2025-01-03 |
Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation |
Rini Smita Thakur et.al. |
2501.01640 |
null |
2025-01-03 |
A non-ergodic framework for understanding emergent capabilities in Large Language Models |
Javier Marin et.al. |
2501.01638 |
null |
2025-01-03 |
Revisiting Data Analysis with Pre-trained Foundation Models |
Chen Liang et.al. |
2501.01631 |
null |
2025-01-03 |
ICPC: In-context Prompt Compression with Faster Inference |
Ziyang Yu et.al. |
2501.01625 |
null |
2025-01-03 |
PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents |
Jingoo Lee et.al. |
2501.01594 |
null |
2025-01-03 |
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges |
Mohamed Hisham Abdellatif et.al. |
2501.01588 |
null |
2025-01-02 |
Predicting the Performance of Black-box LLMs through Self-Queries |
Dylan Sam et.al. |
2501.01558 |
link |
2025-01-02 |
Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs) |
Jumana Almahmoud et.al. |
2501.01545 |
null |
2025-01-02 |
Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information |
Rasul Tutnov et.al. |
2501.01544 |
null |
2025-01-02 |
Denoising Diffused Embeddings: a Generative Approach for Hypergraphs |
Shihao Wu et.al. |
2501.01541 |
null |
2025-01-02 |
BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery |
Kanishk Gandhi et.al. |
2501.01540 |
link |
2025-01-02 |
SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers |
Bhavna Gopal et.al. |
2501.01529 |
null |
2025-01-02 |
Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search |
Shuangtao Li et.al. |
2501.01478 |
null |
2025-01-02 |
Unifying Specialized Visual Encoders for Video Language Models |
Jihoon Chung et.al. |
2501.01426 |
link |
2025-01-02 |
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models |
Jingfeng Yao et.al. |
2501.01423 |
link |
2025-01-02 |
Multi-Modal Video Feature Extraction for Popularity Prediction |
Haixu Liu et.al. |
2501.01422 |
null |
2025-01-02 |
Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers |
Seunghyun Lee et.al. |
2501.01414 |
null |
2025-01-02 |
On Unifying Video Generation and Camera Pose Estimation |
Chun-Hao Paul Huang et.al. |
2501.01409 |
null |
2025-01-02 |
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios |
Xize Cheng et.al. |
2501.01384 |
null |
2025-01-02 |
ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI |
Neda Tavakoli et.al. |
2501.01372 |
link |
2025-01-02 |
Aligning Large Language Models for Faithful Integrity Against Opposing Argument |
Yong Zhao et.al. |
2501.01336 |
link |
2025-01-02 |
CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models |
Johan Wahréus et.al. |
2501.01335 |
link |
2025-01-02 |
Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension |
Yanbo Fang et.al. |
2501.01332 |
null |
2025-01-02 |
The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation |
Shuzheng Gao et.al. |
2501.01329 |
null |
2025-01-03 |
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking |
Xiaoxue Cheng et.al. |
2501.01306 |
null |
2025-01-02 |
Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments – The Depression and Anxiety Case |
Kaushik Roy et.al. |
2501.01305 |
null |
2025-01-02 |
Does a Large Language Model Really Speak in Human-Like Language? |
Mose Park et.al. |
2501.01273 |
null |
2025-01-02 |
ProgCo: Program Helps Self-Correction of Large Language Models |
Xiaoshuai Song et.al. |
2501.01264 |
null |
2025-01-03 |
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings |
Shanghaoran Quan et.al. |
2501.01257 |
null |
2025-01-02 |
Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers? |
Manuel Weber et.al. |
2501.01256 |
null |
2025-01-02 |
Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion |
Qiyuan He et.al. |
2501.01246 |
null |
2025-01-02 |
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization |
Yongle Huang et.al. |
2501.01245 |
link |
2025-01-02 |
Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants |
Lixiong Qin et.al. |
2501.01243 |
null |
2025-01-02 |
Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction |
Alexander Brinkmann et.al. |
2501.01237 |
link |
2025-01-03 |
TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer |
Jiayu Li et.al. |
2501.01216 |
null |
2025-01-02 |
Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects |
Abdullah Mushtaq et.al. |
2501.01205 |
null |
2025-01-02 |
HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation |
Runsong Jia et.al. |
2501.01203 |
null |
2025-01-02 |
LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge |
Kyoungkook Kang et.al. |
2501.01197 |
null |
2025-01-02 |
Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education |
Annika Bush et.al. |
2501.01192 |
null |
2025-01-02 |
Towards Interactive Deepfake Analysis |
Lixiong Qin et.al. |
2501.01164 |
link |
2025-01-02 |
TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions |
Vriksha Srihari et.al. |
2501.01156 |
null |
2025-01-02 |
A3: Android Agent Arena for Mobile GUI Agents |
Yuxiang Chai et.al. |
2501.01149 |
null |
2025-01-03 |
BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference |
Wonsuk Jang et.al. |
2501.01144 |
link |
2025-01-02 |
Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method |
Ruichen Zhang et.al. |
2501.01141 |
null |
2025-01-02 |
Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning |
Shuo Yu et.al. |
2501.01124 |
null |
2025-01-02 |
MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification |
Jimin Park et.al. |
2501.01110 |
null |
2025-01-03 |
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization |
Haina Zhu et.al. |
2501.01108 |
link |
2025-01-02 |
Graph Generative Pre-trained Transformer |
Xiaohui Chen et.al. |
2501.01073 |
null |
2025-01-02 |
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models |
Yanwen Huang et.al. |
2501.01059 |
null |
2025-01-02 |
Risks of Cultural Erasure in Large Language Models |
Rida Qadri et.al. |
2501.01056 |
null |
2025-01-02 |
Dynamic Scaling of Unit Tests for Code Reward Modeling |
Zeyao Ma et.al. |
2501.01054 |
null |
2025-01-02 |
Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs |
Linhao Huang et.al. |
2501.01042 |
null |
2025-01-02 |
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models |
Bin Wang et.al. |
2501.01034 |
link |
2025-01-02 |
ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning |
Wonduk Seo et.al. |
2501.01031 |
null |
2025-01-03 |
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model |
Xinshuo Hu et.al. |
2501.01028 |
link |
2025-01-02 |
MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model |
Chengze Zhang et.al. |
2501.01014 |
null |
2025-01-02 |
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving |
Zihao Ye et.al. |
2501.01005 |
link |
2025-01-02 |
Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory |
Zhou Yang et.al. |
2501.00999 |
null |
2025-01-02 |
Optimizing Noise Schedules of Generative Models in High Dimensionss |
Santiago Aranguri et.al. |
2501.00988 |
null |
2025-01-02 |
Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice |
Federico Ravenda et.al. |
2501.00982 |
link |
2025-01-01 |
IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs |
Junfeng Jiao et.al. |
2501.00959 |
null |
2025-01-01 |
Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors |
Junfeng Jiao et.al. |
2501.00957 |
null |
2025-01-01 |
Incremental Dialogue Management: Survey, Discussion, and Implications for HRI |
Casey Kennington et.al. |
2501.00953 |
null |
2025-01-01 |
SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering |
Shihab Ahmed et.al. |
2501.00940 |
null |
2025-01-01 |
Diffusion Policies for Generative Modeling of Spacecraft Trajectories |
Julia Briden et.al. |
2501.00915 |
null |
2025-01-01 |
Aligning LLMs with Domain Invariant Reward Models |
David Wu et.al. |
2501.00911 |
link |
2025-01-01 |
Population Aware Diffusion for Time Series Generation |
Yang Li et.al. |
2501.00910 |
link |
2025-01-01 |
Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things |
Talha Zeeshan et.al. |
2501.00906 |
null |
2025-01-01 |
Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model |
Chenyang Liu et.al. |
2501.00895 |
null |
2025-01-01 |
Evaluating Time Series Foundation Models on Noisy Periodic Time Series |
Syamantak Datta Gupta et.al. |
2501.00889 |
null |
2025-01-01 |
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization |
Weiqi Wu et.al. |
2501.00888 |
link |
2025-01-01 |
Representation in large language models |
Cameron C. Yetman et.al. |
2501.00885 |
null |
2025-01-01 |
Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents |
Fouad Bousetouane et.al. |
2501.00881 |
null |
2025-01-01 |
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction |
Teng Hu et.al. |
2501.00880 |
null |
2025-01-01 |
TrustRAG: Enhancing Robustness and Trustworthiness in RAG |
Huichi Zhou et.al. |
2501.00879 |
link |
2025-01-01 |
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models |
Hieu Man et.al. |
2501.00874 |
link |
2025-01-01 |
Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation |
Mingjia Li et.al. |
2501.00873 |
link |
2025-01-01 |
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation |
Shoutao Guo et.al. |
2501.00868 |
link |
2025-01-01 |
Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era |
Mihnea C. Moldoveanu et.al. |
2501.00867 |
null |
2025-01-01 |
Alzheimer’s disease detection based on large language model prompt engineering |
Tian Zheng et.al. |
2501.00861 |
null |
2025-01-01 |
LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions |
Adam Ishay et.al. |
2501.00830 |
null |
2025-01-01 |
An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems |
Haoxiang Tian et.al. |
2501.00829 |
null |
2025-01-01 |
LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management |
Yichen Luo et.al. |
2501.00826 |
null |
2025-01-01 |
Multimodal Large Models Are Effective Action Anticipators |
Binglu Wang et.al. |
2501.00795 |
link |
2025-01-01 |
Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models |
Minhao Bai et.al. |
2501.00786 |
null |
2025-01-01 |
NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model |
Yuzhi Lai et.al. |
2501.00785 |
null |
2025-01-01 |
REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization |
Huyen Nguyen et.al. |
2501.00779 |
null |
2025-01-01 |
FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation |
Qianli Wang et.al. |
2501.00777 |
null |
2025-01-01 |
Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis |
Jie Gao et.al. |
2501.00775 |
null |
2025-01-01 |
An AI-powered Bayesian generative modeling approach for causal inference in observational studies |
Qiao Liu et.al. |
2501.00755 |
null |
2025-01-01 |
Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform |
Cheonsu Jeong et.al. |
2501.00750 |
null |
2025-01-01 |
DIVE: Diversified Iterative Self-Improvement |
Yiwei Qin et.al. |
2501.00747 |
link |
2025-01-01 |
Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines |
Xiyang Hu et.al. |
2501.00745 |
null |
2025-01-01 |
A Distributional Evaluation of Generative Image Models |
Edric Tam et.al. |
2501.00744 |
null |
2025-01-01 |
New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory |
Madiha Ajmal et.al. |
2501.00721 |
null |
2025-01-01 |
Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection |
Hao Wang et.al. |
2501.00700 |
null |
2025-01-01 |
Adjoint sharding for very long context training of state space models |
Xingzi Xu et.al. |
2501.00692 |
null |
2025-01-01 |
Labels Generated by Large Language Model Helps Measuring People’s Empathy in Vitro |
Md Rakibul Hasan et.al. |
2501.00691 |
null |
2025-01-01 |
IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently |
Florian Dietz et.al. |
2501.00684 |
null |
2024-12-31 |
Grade Inflation in Generative Models |
Phuc Nguyen et.al. |
2501.00664 |
null |
2024-12-31 |
Finding Missed Code Size Optimizations in Compilers using LLMs |
Davide Italiano et.al. |
2501.00655 |
null |
2024-12-31 |
Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models |
Suttisak Wizadwongsa et.al. |
2501.00651 |
null |
2024-12-31 |
Efficient Standardization of Clinical Notes using Large Language Models |
Daniel B. Hier et.al. |
2501.00644 |
null |
2024-12-31 |
Enabling New HDLs with Agents |
Mark Zakharov et.al. |
2501.00642 |
null |
2024-12-31 |
DreamDrive: Generative 4D Scene Modeling from Street View Images |
Jiageng Mao et.al. |
2501.00601 |
null |
2024-12-31 |
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM |
Yuqian Yuan et.al. |
2501.00599 |
link |
2024-12-31 |
Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation |
M. Ali Bayram et.al. |
2501.00593 |
null |
2024-12-31 |
Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method |
Zhenpeng Huang et.al. |
2501.00584 |
null |
2024-12-31 |
Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders |
Yipeng Kang et.al. |
2501.00581 |
null |
2024-12-31 |
AI and Quantum Computing in Binary Photocatalytic Hydrogen Production |
Dennis Delali Kwesi Wayo et.al. |
2501.00575 |
null |
2024-12-31 |
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling |
Xinhao Li et.al. |
2501.00574 |
link |
2024-12-31 |
Probing Visual Language Priors in VLMs |
Tiange Luo et.al. |
2501.00569 |
null |
2024-12-31 |
Robust and Adaptive Optimization under a Large Language Model Lens |
Dimitris Bertsimas et.al. |
2501.00568 |
null |
2024-12-30 |
Distributed Mixture-of-Agents for Edge Inference with Large Language Models |
Purbesh Mitra et.al. |
2412.21200 |
link |
2024-12-31 |
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation |
Zhaojian Yu et.al. |
2412.21199 |
link |
2024-12-30 |
The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick |
Jonathan Berkheim et.al. |
2412.21186 |
null |
2024-12-30 |
Facilitating large language model Russian adaptation with Learned Embedding Propagation |
Mikhail Tikhomirov et.al. |
2412.21140 |
link |
2024-12-30 |
ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation |
Ruixuan Liu et.al. |
2412.21123 |
null |
2025-01-02 |
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation |
Yuanbo Yang et.al. |
2412.21117 |
null |
2024-12-30 |
Varformer: Adapting VAR’s Generative Prior for Image Restoration |
Siyang Wang et.al. |
2412.21063 |
link |
2024-12-30 |
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation |
Jiazheng Xu et.al. |
2412.21059 |
link |
2024-12-30 |
Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense |
Yuyang Zhou et.al. |
2412.21051 |
link |
2024-12-30 |
E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models |
Zhiyu Tan et.al. |
2412.21044 |
null |
2024-12-30 |
Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration |
Wanglong Lu et.al. |
2412.21042 |
link |
2024-12-30 |
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization |
Chia-Yu Hung et.al. |
2412.21037 |
link |
2024-12-30 |
GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models |
Shangyu Xing et.al. |
2412.21036 |
null |
2024-12-30 |
MapQaTor: A System for Efficient Annotation of Map Query Datasets |
Mahir Labib Dihan et.al. |
2412.21015 |
link |
2024-12-31 |
Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria |
Joonwon Jang et.al. |
2412.21006 |
null |
2024-12-30 |
Plug-and-Play Training Framework for Preference Optimization |
Jingyuan Ma et.al. |
2412.20996 |
null |
2024-12-30 |
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation |
Siyuan Fang et.al. |
2412.20995 |
null |
2024-12-30 |
Efficiently Serving LLM Reasoning Programs with Certaindex |
Yichao Fu et.al. |
2412.20993 |
null |
2024-12-30 |
QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing |
Shlomo Kashani et.al. |
2412.20956 |
null |
2024-12-30 |
AGON: Automated Design Framework for Customizing Processors from ISA Documents |
Chongxiao Li et.al. |
2412.20954 |
null |
2024-12-30 |
Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema |
Xiaohan Feng et.al. |
2412.20942 |
null |
2024-12-30 |
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering |
Junxiao Xue et.al. |
2412.20927 |
null |
2024-12-30 |
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation |
Ting Zhang et.al. |
2412.20901 |
null |
2024-12-30 |
Towards Compatible Fine-tuning for Vision-Language Model Updates |
Zhengbo Wang et.al. |
2412.20895 |
null |
2024-12-30 |
DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models |
Xiaolin Hu et.al. |
2412.20891 |
null |
2024-12-30 |
Enhancing Annotated Bibliography Generation with LLM Ensembles |
Sergio Bermejo et.al. |
2412.20864 |
null |
2024-12-30 |
Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs’ Memory |
Xingjian Tao et.al. |
2412.20846 |
null |
2024-12-30 |
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment |
Jianfei Zhang et.al. |
2412.20834 |
link |
2024-12-30 |
Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model |
Runtao Ren et.al. |
2412.20820 |
null |
2024-12-30 |
TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting |
Huanyu Zhang et.al. |
2412.20810 |
null |
2024-12-30 |
Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves |
Chayan Chatterjee et.al. |
2412.20789 |
null |
2024-12-31 |
SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity |
Pengfei Jing et.al. |
2412.20787 |
null |
2024-12-30 |
Large Language Model Enabled Multi-Task Physical Layer Network |
Tianyue Zheng et.al. |
2412.20772 |
null |
2024-12-30 |
Attributing Culture-Conditioned Generations to Pretraining Corpora |
Huihan Li et.al. |
2412.20760 |
link |
2024-12-30 |
M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs |
Bei Yan et.al. |
2412.20718 |
link |
2024-12-30 |
HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images |
Sungik Choi et.al. |
2412.20704 |
null |
2024-12-30 |
UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design |
Zijie Chen et.al. |
2412.20694 |
null |
2024-12-30 |
Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks |
Yuhe Ding et.al. |
2412.20682 |
null |
2024-12-30 |
Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA |
Qingyun Jin et.al. |
2412.20677 |
null |
2024-12-30 |
Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner |
Yitong Zhou et.al. |
2412.20662 |
link |
2024-12-30 |
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis |
Yousef Yeganeh et.al. |
2412.20651 |
null |
2024-12-30 |
SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy |
Md Mahadi Hasan Nahid et.al. |
2412.20641 |
null |
2024-12-30 |
Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble |
Yongchang Li et.al. |
2412.20637 |
null |
2024-12-30 |
EVOLVE: Emotion and Visual Output Learning via LLM Evaluation |
Jordan Sinclair et.al. |
2412.20632 |
null |
2024-12-29 |
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study |
Yulin Fei et.al. |
2412.20613 |
link |
2024-12-29 |
NLP-based Regulatory Compliance – Using GPT 4.0 to Decode Regulatory Documents |
Bimal Kumar et.al. |
2412.20602 |
null |
2024-12-29 |
MATEY: multiscale adaptive foundation models for spatiotemporal physical systems |
Pei Zhang et.al. |
2412.20601 |
null |
2024-12-29 |
Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection |
Dmitri Roussinov et.al. |
2412.20595 |
link |
2024-12-29 |
Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches |
Madhavendra Thakur et.al. |
2412.20584 |
null |
2024-12-29 |
Counterfactual Samples Constructing and Training for Commonsense Statements Estimation |
Chong Liu et.al. |
2412.20563 |
null |
2024-12-29 |
Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces |
Linglingzhi Zhu et.al. |
2412.20556 |
null |
2024-12-29 |
The Impact of Prompt Programming on Function-Level Code Generation |
Ranim Khojah et.al. |
2412.20545 |
link |
2024-12-29 |
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning |
Xingshuai Huang et.al. |
2412.20519 |
null |
2024-12-29 |
Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning |
Hang Ni et.al. |
2412.20505 |
null |
2024-12-29 |
ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding |
Xiao Wang et.al. |
2412.20504 |
link |
2024-12-29 |
TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication |
Zongwu Wang et.al. |
2412.20501 |
link |
2024-12-29 |
Multimodal Variational Autoencoder: a Barycentric View |
Peijie Qiu et.al. |
2412.20487 |
null |
2024-12-29 |
JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling |
Haorui Ji et.al. |
2412.20470 |
null |
2024-12-29 |
Improving Vision-Language-Action Models via Chain-of-Affordance |
Jinming Li et.al. |
2412.20451 |
null |
2024-12-29 |
Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs |
Pratik Rakesh Singh et.al. |
2412.20440 |
null |
2024-12-29 |
Image Augmentation Agent for Weakly Supervised Semantic Segmentation |
Wangyu Wu et.al. |
2412.20439 |
null |
2024-12-29 |
Unlocking adaptive digital pathology through dynamic feature learning |
Jiawen Li et.al. |
2412.20430 |
null |
2024-12-29 |
AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models |
Mansi et.al. |
2412.20427 |
null |
2024-12-29 |
Bringing Objects to Life: 4D generation from 3D objects |
Ohad Rahamim et.al. |
2412.20422 |
null |
2024-12-29 |
Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection |
Kalin Kopanov et.al. |
2412.20414 |
null |
2024-12-29 |
Multi-Objective Large Language Model Unlearning |
Zibin Pan et.al. |
2412.20412 |
link |
2024-12-29 |
Open-Sora: Democratizing Efficient Video Production for All |
Zangwei Zheng et.al. |
2412.20404 |
link |
2024-12-29 |
Natural Language Fine-Tuning |
Jia Liu et.al. |
2412.20382 |
link |
2024-12-29 |
Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs) |
Jia Wei Sii et.al. |
2412.20381 |
null |
2024-12-29 |
FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation |
Yan Luo et.al. |
2412.20374 |
link |
2024-12-29 |
LLM2: Let Large Language Models Harness System 2 Reasoning |
Cheng Yang et.al. |
2412.20372 |
link |
2025-01-02 |
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey |
Junqiao Wang et.al. |
2412.20367 |
null |
2024-12-29 |
HindiLLM: Large Language Model for Hindi |
Sanjay Chouhan et.al. |
2412.20357 |
null |
2024-12-29 |
Distilling Desired Comments for Enhanced Code Review with Large Language Models |
Yongda Yu et.al. |
2412.20340 |
null |
2024-12-29 |
Mind the Data Gap: Bridging LLMs to Enterprise Data Integration |
Moe Kayali et.al. |
2412.20331 |
null |
2024-12-29 |
GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions |
Tianyao Shi et.al. |
2412.20322 |
null |
2024-12-29 |
Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain |
Shintaro Ozaki et.al. |
2412.20309 |
null |
2024-12-28 |
FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration |
Jia Liu et.al. |
2412.20297 |
null |
2024-12-28 |
Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games |
Guan-Horng Liu et.al. |
2412.20279 |
null |
2024-12-28 |
Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues |
Henry J. Xie et.al. |
2412.20264 |
link |
2024-12-28 |
Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception |
Athanasios Karagounis et.al. |
2412.20230 |
null |
2024-12-28 |
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning |
Shuguang Chen et.al. |
2412.20227 |
null |
2024-12-28 |
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation |
Yeonhong Park et.al. |
2412.20185 |
null |
2024-12-28 |
LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System |
Hyucksung Kwon et.al. |
2412.20166 |
null |
2024-12-28 |
StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN |
Andrzej Bedychaj et.al. |
2412.20164 |
null |
2024-12-28 |
Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems |
Minhye Jeon et.al. |
2412.20163 |
null |
2024-12-28 |
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation |
Guanglei Yang et.al. |
2412.20162 |
null |
2024-12-28 |
Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses |
Xinru Wen et.al. |
2412.20154 |
null |
2024-12-28 |
Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering |
Wei Zhou et.al. |
2412.20145 |
null |
2024-12-28 |
TradingAgents: Multi-Agents LLM Financial Trading Framework |
Yijia Xiao et.al. |
2412.20138 |
null |
2024-12-28 |
M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation |
Zhaopeng Feng et.al. |
2412.20127 |
link |
2024-12-28 |
Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers |
Tuomas Hakoniemi et.al. |
2412.20114 |
null |
2024-12-28 |
ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming |
Jiedong Zhuang et.al. |
2412.20105 |
null |
2024-12-28 |
On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs |
Atmane Ayoub Mansour Bahar et.al. |
2412.20087 |
null |
2024-12-31 |
Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset |
Chongjian Yue et.al. |
2412.20072 |
null |
2024-12-28 |
On the Compositional Generalization of Multimodal LLMs for Medical Imaging |
Zhenyang Cai et.al. |
2412.20070 |
link |
2024-12-28 |
VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition |
Lan Chen et.al. |
2412.20064 |
link |
2024-12-28 |
MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion |
Zechao Zhan et.al. |
2412.20062 |
null |
2024-12-28 |
Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts |
Yanxin Shen et.al. |
2412.20061 |
null |
2024-12-28 |
“My life is miserable, have to sign 500 autographs everyday”: Exposing Humblebragging, the Brags in Disguise |
Sharath Naganna et.al. |
2412.20057 |
null |
2024-12-27 |
Enhancing Whisper’s Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization |
Kumud Tripathi et.al. |
2412.19785 |
null |
2024-12-27 |
Can AI Help with Your Personal Finances? |
Oudom Hean et.al. |
2412.19784 |
null |
2024-12-27 |
Tensor Network Estimation of Distribution Algorithms |
John Gardiner et.al. |
2412.19780 |
null |
2024-12-27 |
Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration |
Le Chen et.al. |
2412.19770 |
link |
2024-12-27 |
Generative Video Propagation |
Shaoteng Liu et.al. |
2412.19761 |
null |
2024-12-27 |
On dual-projectively equivalent connections associated to second order superintegrable systems |
Andreas Vollmer et.al. |
2412.19739 |
null |
2024-12-27 |
Can Large Language Models Adapt to Other Agents In-Context? |
Matthew Riemer et.al. |
2412.19726 |
null |
2024-12-27 |
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition |
Jiawei Lin et.al. |
2412.19712 |
null |
2024-12-27 |
Toward Adaptive Reasoning in Large Language Models with Thought Rollback |
Sijia Chen et.al. |
2412.19707 |
link |
2024-12-27 |
A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization |
Jingchun Lian et.al. |
2412.19685 |
null |
2024-12-27 |
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework |
Jiang Liu et.al. |
2412.19684 |
null |
2024-12-27 |
CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs |
Siyu Wang et.al. |
2412.19663 |
null |
2024-12-27 |
Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis |
Jiaqi Wang et.al. |
2412.19654 |
link |
2024-12-27 |
FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios |
Kaiyi Pang et.al. |
2412.19652 |
null |
2024-12-27 |
Xmodel-2 Technical Report |
Wang Qun et.al. |
2412.19638 |
null |
2024-12-27 |
IMTP: Search-based Code Generation for In-memory Tensor Programs |
Yongwon Shin et.al. |
2412.19630 |
null |
2024-12-27 |
Signatures of prediction during natural listening in MEG data? |
Sahel Azizpour et.al. |
2412.19622 |
null |
2024-12-27 |
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training |
Jia-Hong Huang et.al. |
2412.19616 |
link |
2024-12-27 |
SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms |
Shashank Rao Marpally et.al. |
2412.19595 |
null |
2024-12-27 |
Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following |
Yuxiao Yang et.al. |
2412.19562 |
null |
2024-12-27 |
Diverse Rare Sample Generation with Pretrained GANs |
Subeen Lee et.al. |
2412.19543 |
link |
2024-12-27 |
Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations |
Yuanfei Huang et.al. |
2412.19520 |
null |
2024-12-27 |
Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model |
Hyunwoo Cho et.al. |
2412.19517 |
null |
2024-12-27 |
Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs |
Zhe Yang et.al. |
2412.19513 |
link |
2024-12-27 |
Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging |
Hua Farn et.al. |
2412.19512 |
null |
2024-12-27 |
Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion |
Koustav Ghosal et.al. |
2412.19510 |
null |
2024-12-27 |
MBQ: Modality-Balanced Quantization for Large Vision-Language Models |
Shiyao Li et.al. |
2412.19509 |
link |
2024-12-27 |
DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT |
Xiaotao Hu et.al. |
2412.19505 |
link |
2024-12-27 |
Casevo: A Cognitive Agents and Social Evolution Simulator |
Zexun Jiang et.al. |
2412.19498 |
link |
2024-12-27 |
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation |
Chengyang Ye et.al. |
2412.19492 |
link |
2024-12-27 |
Focusing Image Generation to Mitigate Spurious Correlations |
Xuewei Li et.al. |
2412.19457 |
null |
2024-12-27 |
Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models |
Hyeonseok Moon et.al. |
2412.19450 |
link |
2024-12-27 |
Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models |
Shuo Wang et.al. |
2412.19449 |
null |
2024-12-27 |
A Survey on Large Language Model Acceleration based on KV Cache Management |
Haoyang Li et.al. |
2412.19442 |
link |
2024-12-27 |
Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback |
Seong Jin Lee et.al. |
2412.19436 |
null |
2024-12-27 |
Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints |
Alberto Maté et.al. |
2412.19424 |
null |
2024-12-27 |
Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning |
Chen Li et.al. |
2412.19422 |
link |
2024-12-27 |
MINIMA: Modality Invariant Image Matching |
Xingyu Jiang et.al. |
2412.19412 |
link |
2024-12-27 |
MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios |
Jiaqi Fan et.al. |
2412.19406 |
null |
2024-12-27 |
An Engorgio Prompt Makes Large Language Model Babble on |
Jianshuo Dong et.al. |
2412.19394 |
link |
2024-12-26 |
Large Language Models for Market Research: A Data-augmentation Approach |
Mengxin Wang et.al. |
2412.19363 |
null |
2024-12-26 |
Dynamic Skill Adaptation for Large Language Models |
Jiaao Chen et.al. |
2412.19361 |
null |
2024-12-26 |
Identifying Split Vacancies with Foundation Models and Electrostatics |
Seán R. Kavanagh et.al. |
2412.19330 |
null |
2024-12-26 |
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment |
Ziang Yan et.al. |
2412.19326 |
link |
2024-12-26 |
Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones |
Mehrnaz Mofakhami et.al. |
2412.19325 |
null |
2024-12-26 |
From Interets to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries |
Hugh Van Deventer et.al. |
2412.19312 |
link |
2024-12-26 |
Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries |
Roberto Amoroso et.al. |
2412.19304 |
null |
2024-12-26 |
RecLM: Recommendation Instruction Tuning |
Yangqin Jiang et.al. |
2412.19302 |
link |
2024-12-26 |
RAG with Differential Privacy |
Nicolas Grislain et.al. |
2412.19291 |
link |
2024-12-26 |
Time Series Foundational Models: Their Role in Anomaly Detection and Prediction |
Chathurangi Shyalika et.al. |
2412.19286 |
link |
2024-12-26 |
PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing |
Michael Bezick et.al. |
2412.19284 |
null |
2024-12-26 |
MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes |
Asma Ben Abacha et.al. |
2412.19260 |
link |
2024-12-26 |
VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis |
Jaemin Jung et.al. |
2412.19259 |
null |
2024-12-26 |
Sentiment trading with large language models |
Kemal Kirtac et.al. |
2412.19245 |
null |
2024-12-26 |
SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model |
Xuyang Li et.al. |
2412.19237 |
null |
2024-12-26 |
Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining |
Yuxin You et.al. |
2412.19211 |
null |
2024-12-26 |
Multi-Attribute Constraint Satisfaction via Language Model Rewriting |
Ashutosh Baheti et.al. |
2412.19198 |
null |
2024-12-26 |
Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models |
Haonan He et.al. |
2412.19191 |
null |
2024-12-26 |
Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue |
Shuzhi Xu et.al. |
2412.19154 |
null |
2024-12-26 |
AskChart: Universal Chart Understanding through Textual Enhancement |
Xudong Yang et.al. |
2412.19146 |
link |
2024-12-26 |
SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis |
Senbin Zhu et.al. |
2412.19140 |
link |
2024-12-26 |
PlanLLM: Video Procedure Planning with Refinable Large Language Models |
Dejie Yang et.al. |
2412.19139 |
link |
2024-12-26 |
Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing |
Inpyo Hong et.al. |
2412.19125 |
link |
2024-12-26 |
Discrete vs. Continuous Trade-offs for Generative Models |
Jathin Korrapati et.al. |
2412.19114 |
null |
2024-12-26 |
SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values |
Yunfan Zhang et.al. |
2412.19113 |
null |
2024-12-26 |
Stochastic normalizing flows for Effective String Theory |
Michele Caselle et.al. |
2412.19109 |
null |
2024-12-26 |
“I’ve Heard of You!”: Generate Spoken Named Entity Recognition Data for Unseen Entities |
Jiawei Yu et.al. |
2412.19102 |
null |
2024-12-26 |
Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security |
Vasileios Alevizos et.al. |
2412.19088 |
null |
2024-12-26 |
Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation |
Haotian Qian et.al. |
2412.19080 |
null |
2024-12-26 |
CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers |
Jingyi Zheng et.al. |
2412.19037 |
link |
2024-12-26 |
Repository Structure-Aware Training Makes SLMs Better Issue Resolver |
Zexiong Ma et.al. |
2412.19031 |
null |
2024-12-26 |
Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation |
Yixin Chen et.al. |
2412.19026 |
link |
2024-12-26 |
Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication |
Xiqiang Qu et.al. |
2412.19025 |
null |
2024-12-26 |
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation |
Tao Liu et.al. |
2412.19021 |
null |
2024-12-26 |
Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability |
Ruixi Lin et.al. |
2412.19018 |
null |
2024-12-25 |
How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study |
Alejandro Velasco et.al. |
2412.18989 |
null |
2024-12-25 |
ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement |
Zhefan Rao et.al. |
2412.18966 |
null |
2024-12-25 |
Musings About the Future of Search: A Return to the Past? |
Jimmy Lin et.al. |
2412.18956 |
null |
2024-12-25 |
A Power-Efficient Hardware Implementation of L-Mul |
Ruiqi Chen et.al. |
2412.18948 |
null |
2024-12-25 |
MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models |
Kaiwen Zuo et.al. |
2412.18947 |
null |
2024-12-25 |
Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations |
Yewon Kim et.al. |
2412.18940 |
null |
2024-12-25 |
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference |
Libo Zhang et.al. |
2412.18934 |
null |
2024-12-25 |
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation |
Lunhao Duan et.al. |
2412.18928 |
null |
2024-12-25 |
Exemplar-condensed Federated Class-incremental Learning |
Rui Sun et.al. |
2412.18926 |
null |
2024-12-25 |
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model |
Yi-Chia Chen et.al. |
2412.18917 |
link |
2024-12-25 |
AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures |
Situo Zhang et.al. |
2412.18910 |
null |
2024-12-25 |
CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models |
Ping Guo et.al. |
2412.18890 |
link |
2024-12-25 |
MotionMap: Representing Multimodality in Human Pose Forecasting |
Reyhaneh Hosseininejad et.al. |
2412.18883 |
null |
2024-12-25 |
Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models |
Meltem Aksoy et.al. |
2412.18863 |
null |
2024-12-25 |
Improving the Readability of Automatically Generated Tests using Large Language Models |
Matteo Biagiola et.al. |
2412.18843 |
null |
2024-12-25 |
LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements |
Hao Zhang et.al. |
2412.18835 |
null |
2024-12-25 |
Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition |
Shujie Hu et.al. |
2412.18832 |
null |
2024-12-25 |
RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting |
Yilei Jiang et.al. |
2412.18826 |
null |
2024-12-25 |
CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection |
Wenbin Li et.al. |
2412.18820 |
link |
2024-12-25 |
LLM-assisted vector similarity search |
Md Riyadh et.al. |
2412.18819 |
null |
2024-12-25 |
DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search |
Lei Yang et.al. |
2412.18811 |
null |
2024-12-25 |
Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation |
Xinkai Du et.al. |
2412.18800 |
null |
2024-12-25 |
Torque-Aware Momentum |
Pranshu Malviya et.al. |
2412.18790 |
null |
2024-12-25 |
Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models |
Yu-An Liu et.al. |
2412.18770 |
link |
2024-12-25 |
The Impact of Input Order Bias on Large Language Models for Software Fault Localization |
Md Nakhla Rafi et.al. |
2412.18750 |
null |
2024-12-24 |
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models |
Zehan Wang et.al. |
2412.18605 |
link |
2024-12-24 |
Long-Form Speech Generation with Spoken Language Models |
Se Jin Park et.al. |
2412.18603 |
link |
2024-12-24 |
Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems |
Fernando Jia et.al. |
2412.18601 |
link |
2024-12-24 |
ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation |
Hongjie Li et.al. |
2412.18600 |
null |
2024-12-24 |
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation |
Minghong Cai et.al. |
2412.18597 |
link |
2024-12-24 |
A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs |
OpenMind et.al. |
2412.18588 |
null |
2024-12-24 |
Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control |
Sergey Sedov et.al. |
2412.18582 |
null |
2024-12-24 |
Zero-resource Speech Translation and Recognition with LLMs |
Karel Mundnich et.al. |
2412.18566 |
null |
2024-12-24 |
Distilling Fine-grained Sentiment Understanding from Large Language Models |
Yice Zhang et.al. |
2412.18552 |
link |
2024-12-24 |
Token-Budget-Aware LLM Reasoning |
Tingxu Han et.al. |
2412.18547 |
link |
2024-12-24 |
PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction |
Xingjian Xu et.al. |
2412.18541 |
null |
2024-12-24 |
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation |
Derong Xu Xinhang Li et.al. |
2412.18537 |
link |
2024-12-24 |
Automated Code Review In Practice |
Umut Cihan et.al. |
2412.18531 |
null |
2024-12-24 |
Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving |
Hao Pang et.al. |
2412.18511 |
null |
2024-12-24 |
Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization |
Yi-Fu Fu et.al. |
2412.18497 |
null |
2024-12-24 |
GeFL: Model-Agnostic Federated Learning with Generative Models |
Honggu Kang et.al. |
2412.18460 |
null |
2024-12-24 |
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding |
Tatiana Zemskova et.al. |
2412.18450 |
link |
2024-12-24 |
Is Large Language Model Good at Triple Set Prediction? An Empirical Study |
Yuan Yuan et.al. |
2412.18443 |
null |
2024-12-24 |
Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm |
O. Deniz Akyildiz et.al. |
2412.18432 |
null |
2024-12-24 |
GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent |
Kangjia Zhao et.al. |
2412.18426 |
null |
2024-12-24 |
Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models |
Zihan Zhou et.al. |
2412.18419 |
null |
2024-12-24 |
Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles |
Zihan Wang et.al. |
2412.18416 |
null |
2024-12-24 |
Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English |
Avinash Anand et.al. |
2412.18415 |
link |
2024-12-24 |
Discovery of 2D Materials via Symmetry-Constrained Diffusion Model |
Shihang Xu et.al. |
2412.18414 |
null |
2024-12-24 |
A Statistical Framework for Ranking LLM-Based Chatbots |
Siavash Ameli et.al. |
2412.18407 |
link |
2024-12-24 |
Extract Free Dense Misalignment from CLIP |
JeongYeon Nam et.al. |
2412.18404 |
link |
2024-12-24 |
RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction |
Wu Xiaoping et.al. |
2412.18390 |
null |
2024-12-24 |
MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs |
Qiuyi Gu et.al. |
2412.18381 |
null |
2024-12-24 |
Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents |
Kaiwen Ning et.al. |
2412.18371 |
link |
2024-12-24 |
Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering |
Zhongjian Hu et.al. |
2412.18351 |
null |
2024-12-24 |
M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models |
Jiaxin Guo et.al. |
2412.18299 |
null |
2024-12-24 |
Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight |
Xi Ding et.al. |
2412.18298 |
link |
2024-12-24 |
Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases |
Christian Di Maio et.al. |
2412.18295 |
null |
2024-12-24 |
DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation |
Junyi Lu et.al. |
2412.18291 |
null |
2024-12-24 |
Improved Feature Generating Framework for Transductive Zero-shot Learning |
Zihan Ye et.al. |
2412.18282 |
null |
2024-12-24 |
GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications |
Zhenzhou Jin et.al. |
2412.18281 |
null |
2024-12-24 |
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization |
Jiacai Liu et.al. |
2412.18279 |
null |
2024-12-24 |
GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge |
Shammur Absar Chowdhury et.al. |
2412.18274 |
null |
2024-12-24 |
Annotating References to Mythological Entities in French Literature |
Thierry Poibeau et.al. |
2412.18270 |
null |
2024-12-24 |
Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study |
Xuefeng Jiang et.al. |
2412.18260 |
link |
2024-12-24 |
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction |
Pufan Zou et.al. |
2412.18255 |
null |
2024-12-24 |
An Automatic Graph Construction Framework based on Large Language Models for Recommendation |
Rong Shan et.al. |
2412.18241 |
link |
2024-12-24 |
Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection |
Jango Zhang et.al. |
2412.18225 |
null |
2024-12-24 |
Expand VSR Benchmark for VLLM to Expertize in Spatial Rules |
Peijin Xie et.al. |
2412.18224 |
link |
2024-12-24 |
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation |
Mengyang Wu et.al. |
2412.18216 |
link |
2024-12-24 |
Adapting Large Language Models for Improving TCP Fairness over WiFi |
Shyam Kumar Shrestha et.al. |
2412.18200 |
null |
2024-12-24 |
Robustness-aware Automatic Prompt Optimization |
Zeru Shi et.al. |
2412.18196 |
link |
2024-12-24 |
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks |
Shiduo Zhang et.al. |
2412.18194 |
null |
2024-12-24 |
TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization |
Yucong Luo et.al. |
2412.18185 |
null |
2024-12-24 |
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation |
Yucong Luo et.al. |
2412.18176 |
null |
2024-12-24 |
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent |
Haohang Li et.al. |
2412.18174 |
null |
2024-12-24 |
Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models |
Xiaomeng Hu et.al. |
2412.18171 |
null |
2024-12-24 |
KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management |
Rongxin Cheng et.al. |
2412.18169 |
null |
2024-12-24 |
Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence |
Yinbin Han et.al. |
2412.18164 |
null |
2024-12-24 |
VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities |
Shray Mathur et.al. |
2412.18161 |
null |
2024-12-24 |
Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task |
Jinming Liu et.al. |
2412.18158 |
null |
2024-12-24 |
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance |
Yaoyun Zhang et.al. |
2412.18157 |
null |
2024-12-24 |
scReader: Prompting Large Language Models to Interpret scRNA-seq Data |
Cong Li et.al. |
2412.18156 |
null |
2024-12-24 |
GeneSUM: Large Language Model-based Gene Summary Extraction |
Zhijian Chen et.al. |
2412.18154 |
null |
2024-12-24 |
CoAM: Corpus of All-Type Multiword Expressions |
Yusuke Ide et.al. |
2412.18151 |
null |
2024-12-24 |
EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation |
Shuhao Han et.al. |
2412.18150 |
link |
2024-12-24 |
Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction |
Xiao Guo et.al. |
2412.18149 |
null |
2024-12-24 |
Ensuring Consistency for In-Image Translation |
Chengpeng Fu et.al. |
2412.18139 |
null |
2024-12-24 |
LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment |
Binrui Zeng et.al. |
2412.18135 |
null |
2024-12-24 |
VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection |
Zhaohui Jin et.al. |
2412.18124 |
null |
2024-12-24 |
AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation |
Hao Wen et.al. |
2412.18116 |
null |
2024-12-24 |
AIGT: AI Generative Table Based on Prompt |
Mingming Zhang et.al. |
2412.18111 |
null |
2024-12-24 |
SlimGPT: Layer-wise Structured Pruning for Large Language Models |
Gui Ling et.al. |
2412.18110 |
null |
2024-12-24 |
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach |
Jing Bi et.al. |
2412.18108 |
null |
2024-12-24 |
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels |
Mingcong Song et.al. |
2412.18106 |
null |
2024-12-24 |
EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent |
Suyuan Wang et.al. |
2412.18100 |
null |
2024-12-24 |
Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) – a Large Language Model Chatbot for Perioperative Medicine |
Yu He Ke et.al. |
2412.18096 |
null |
2024-12-24 |
Molly: Making Large Language Model Agents Solve Python Problem More Logically |
Rui Xiao et.al. |
2412.18093 |
null |
2024-12-24 |
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner |
Aizierjiang Aiersilan et.al. |
2412.18086 |
link |
2024-12-24 |
Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models |
Xuan Lin et.al. |
2412.18084 |
link |
2024-12-24 |
Improving Factuality with Explicit Working Memory |
Mingda Chen et.al. |
2412.18069 |
null |
2024-12-24 |
LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR |
Osama Hosam Abdellaif et.al. |
2412.18063 |
link |
2024-12-24 |
Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction |
Hyunbae Jeon et.al. |
2412.18061 |
null |
2024-12-24 |
An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM |
Wen Wen et.al. |
2412.18060 |
null |
2024-12-23 |
Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations |
Maya Patel et.al. |
2412.18051 |
null |
2024-12-23 |
AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data |
Mirko Zaffaroni et.al. |
2412.18038 |
link |
2024-12-23 |
Generating refactored code accurately using reinforcement learning |
Indranil Palit et.al. |
2412.18035 |
null |
2024-12-23 |
More than Chit-Chat: Developing Robots for Small-Talk Interactions |
Rebecca Ramnauth et.al. |
2412.18023 |
null |
2024-12-23 |
Trustworthy and Efficient LLMs Meet Databases |
Kyoungmin Kim et.al. |
2412.18022 |
null |
2024-12-23 |
StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs |
Hailin Chen et.al. |
2412.18011 |
null |
2024-12-23 |
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models |
Ruibo Tu et.al. |
2412.17970 |
link |
2024-12-23 |
LMV-RPA: Large Model Voting-based Robotic Process Automation |
Osama Abdellatif et.al. |
2412.17965 |
link |
2024-12-23 |
Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models |
Antony Seabra et.al. |
2412.17964 |
null |
2024-12-23 |
Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models |
Ge Zhang et.al. |
2412.17963 |
null |
2024-12-23 |
Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents |
Antony Seabra et.al. |
2412.17942 |
null |
2024-12-23 |
BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism |
Martin Fajcik et.al. |
2412.17933 |
null |
2024-12-23 |
Causal Composition Diffusion Model for Closed-loop Traffic Generation |
Haohong Lin et.al. |
2412.17920 |
null |
2024-12-23 |
Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning |
Orson Mengara et.al. |
2412.17908 |
null |
2024-12-23 |
LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses |
Sara Riazi et.al. |
2412.17892 |
null |
2024-12-23 |
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models |
Siyuan Bian et.al. |
2412.17811 |
null |
2024-12-23 |
Reconstructing People, Places, and Cameras |
Lea Müller et.al. |
2412.17806 |
null |
2024-12-23 |
Automating the Search for Artificial Life with Foundation Models |
Akarsh Kumar et.al. |
2412.17799 |
link |
2024-12-23 |
ResearchTown: Simulator of Human Research Community |
Haofei Yu et.al. |
2412.17767 |
link |
2024-12-23 |
ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback |
Wei Zhang et.al. |
2412.17754 |
null |
2024-12-23 |
Deliberation in Latent Space via Differentiable Cache Augmentation |
Luyang Liu et.al. |
2412.17747 |
null |
2024-12-23 |
YuLan-Mini: An Open Data-efficient Language Model |
Yiwen Hu et.al. |
2412.17743 |
link |
2024-12-23 |
**Reasoning to Attend: Try to Understand How Token Works** |
Rui Qian et.al. |
2412.17741 |
link |
2024-12-23 |
Knowledge Editing through Chain-of-Thought |
Changyue Wang et.al. |
2412.17727 |
link |
2024-12-23 |
Understanding the Logic of Direct Preference Alignment through Logic |
Kyle Richardson et.al. |
2412.17696 |
null |
2024-12-23 |
Large Language Model Safety: A Holistic Survey |
Dan Shi et.al. |
2412.17686 |
link |
2024-12-23 |
A Bias-Free Training Paradigm for More General AI-generated Image Detection |
Fabrizio Guillaro et.al. |
2412.17671 |
null |
2024-12-23 |
Generating Completions for Fragmented Broca’s Aphasic Sentences Using Large Language Models |
Sijbren van Vaals et.al. |
2412.17669 |
link |
2024-12-23 |
Detecting anxiety and depression in dialogues: a multi-label and explainable approach |
Francisco de Arriba-Pérez et.al. |
2412.17651 |
null |
2024-12-23 |
SCBench: A Sports Commentary Benchmark for Video LLMs |
Kuangzhi Ge et.al. |
2412.17637 |
null |
2024-12-23 |
ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance |
Renyang Liu et.al. |
2412.17632 |
link |
2024-12-23 |
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study |
Yang Xu et.al. |
2412.17626 |
null |
2024-12-23 |
Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models |
Parham Rezaei et.al. |
2412.17622 |
link |
2024-12-23 |
Emerging Security Challenges of Large Language Models |
Herve Debar et.al. |
2412.17614 |
null |
2024-12-23 |
Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs |
Fabrizio Frasca et.al. |
2412.17609 |
null |
2024-12-23 |
EasyTime: Time Series Forecasting Made Easy |
Xiangfei Qiu et.al. |
2412.17603 |
null |
2024-12-23 |
LiveIdeaBench: Evaluating LLMs’ Scientific Creativity and Idea Generation with Minimal Context |
Kai Ruan et.al. |
2412.17596 |
link |
2024-12-23 |
Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation |
Chengbing Wang et.al. |
2412.17593 |
null |
2024-12-23 |
HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data |
Ting Zhou et.al. |
2412.17574 |
link |
2024-12-23 |
S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field |
Zixi Liang et.al. |
2412.17561 |
link |
2024-12-23 |
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference |
Chao Zeng et.al. |
2412.17560 |
null |
2024-12-23 |
A Survey of Query Optimization in Large Language Models |
Mingyang Song et.al. |
2412.17558 |
null |
2024-12-23 |
Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing |
Prakash Aryan et.al. |
2412.17548 |
link |
2024-12-23 |
Retention Score: Quantifying Jailbreak Risks for Vision Language Models |
Zaitang Li et.al. |
2412.17544 |
null |
2024-12-23 |
Constructing Fair Latent Space for Intersection of Fairness and Explainability |
Hyungjun Joo et.al. |
2412.17523 |
null |
2024-12-23 |
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak |
Hao Wang et.al. |
2412.17522 |
null |
2024-12-23 |
Improving the Noise Estimation of Latent Neural Stochastic Differential Equations |
Linus Heck et.al. |
2412.17499 |
null |
2024-12-23 |
Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings |
Jérémie Sublime et.al. |
2412.17486 |
null |
2024-12-23 |
Power- and Fragmentation-aware Online Scheduling for GPU Datacenters |
Francesco Lettich et.al. |
2412.17484 |
link |
2024-12-23 |
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression |
Chenlong Deng et.al. |
2412.17483 |
null |
2024-12-23 |
A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers |
Shuaihang Chen et.al. |
2412.17481 |
link |
2024-12-23 |
CALLIC: Content Adaptive Learning for Lossless Image Compression |
Daxin Li et.al. |
2412.17464 |
null |
2024-12-23 |
Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning |
Xiaodan Chen et.al. |
2412.17456 |
null |
2024-12-23 |
Applying LLM and Topic Modelling in Psychotherapeutic Contexts |
Alexander Vanin et.al. |
2412.17449 |
null |
2024-12-23 |
Measuring Contextual Informativeness in Child-Directed Text |
Maria Valentini et.al. |
2412.17427 |
link |
2024-12-23 |
Multimodal Preference Data Synthetic Alignment with Reward Model |
Robert Wijaya et.al. |
2412.17417 |
link |
2024-12-23 |
VidCtx: Context-aware Video Question Answering with Image Models |
Andreas Goulas et.al. |
2412.17415 |
null |
2024-12-23 |
Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance |
Muhammad Reza Qorib et.al. |
2412.17408 |
link |
2024-12-23 |
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning |
Huchen Jiang et.al. |
2412.17397 |
null |
2024-12-23 |
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models |
Huawen Feng et.al. |
2412.17395 |
null |
2024-12-23 |
Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement |
Hyeonjin Kim et.al. |
2412.17387 |
link |
2024-12-23 |
Interweaving Memories of a Siamese Large Language Model |
Xin Song et.al. |
2412.17383 |
link |
2024-12-23 |
MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models |
Beibei Yu et.al. |
2412.17339 |
null |
2024-12-23 |
A Dual-Perspective Metaphor Detection Framework Using Large Language Models |
Yujie Lin et.al. |
2412.17332 |
link |
2024-12-23 |
Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance |
Nicolas Devatine et.al. |
2412.17321 |
null |
2024-12-23 |
CodeV: Issue Resolving with Visual Data |
Linhao Zhang et.al. |
2412.17315 |
link |
2024-12-23 |
Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories |
Mahan Tafreshipour et.al. |
2412.17298 |
null |
2024-12-23 |
Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples |
Taewoong Kim et.al. |
2412.17288 |
link |
2024-12-23 |
LLM4AD: A Platform for Algorithm Design with Large Language Model |
Fei Liu et.al. |
2412.17287 |
link |
2024-12-23 |
Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning |
Rui Liang et.al. |
2412.17285 |
null |
2024-12-23 |
Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach |
Rafid Ishrak Jahan et.al. |
2412.17255 |
link |
2024-12-23 |
SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval |
Xiaopeng Li et.al. |
2412.17250 |
null |
2024-12-23 |
EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling |
Zichen Song et.al. |
2412.17249 |
null |
2024-12-23 |
On the Generalization Ability of Machine-Generated Text Detectors |
Yule Liu et.al. |
2412.17242 |
link |
2024-12-23 |
Brain-to-Text Benchmark ‘24: Lessons Learned |
Francis R. Willett et.al. |
2412.17227 |
link |
2024-12-23 |
CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder |
Lichen Ma et.al. |
2412.17225 |
null |
2024-12-22 |
Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension |
Jio Oh et.al. |
2412.17189 |
null |
2024-12-22 |
Foundation Model for Lossy Compression of Spatiotemporal Scientific Data |
Xiao Li et.al. |
2412.17184 |
null |
2024-12-22 |
Enhancing Item Tokenization for Generative Recommendation through Self-Improvement |
Runjin Chen et.al. |
2412.17171 |
null |
2024-12-22 |
Generative Diffusion Modeling: A Practical Handbook |
Zihan Ding et.al. |
2412.17162 |
null |
2024-12-22 |
LLM-based relevance assessment still can’t replace human relevance assessment |
Charles L. A. Clarke et.al. |
2412.17156 |
null |
2024-12-22 |
LLM Agent for Fire Dynamics Simulations |
Leidong Xu et.al. |
2412.17146 |
null |
2024-12-22 |
Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs |
Rushendra Sidibomma et.al. |
2412.17131 |
null |
2024-12-22 |
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models |
Cameron R. Jones et.al. |
2412.17128 |
null |
2024-12-22 |
Learning to Adapt to Low-Resource Paraphrase Generation |
Zhigen Li et.al. |
2412.17111 |
null |
2024-12-22 |
DreamOmni: Unified Image Generation and Editing |
Bin Xia et.al. |
2412.17098 |
null |
2024-12-22 |
Analysis on LLMs Performance for Code Summarization |
Md. Ahnaf Akib et.al. |
2412.17094 |
null |
2024-12-22 |
SAIL: Sample-Centric In-Context Learning for Document Information Extraction |
Jinyu Zhang et.al. |
2412.17092 |
link |
2024-12-22 |
SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults |
Jinzhi Wang et.al. |
2412.17077 |
null |
2024-12-22 |
The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM’s Internal States |
Fabian Ridder et.al. |
2412.17056 |
link |
2024-12-22 |
DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately |
Huiwen Wu et.al. |
2412.17053 |
null |
2024-12-22 |
ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues |
Shaina Raza et.al. |
2412.17052 |
link |
2024-12-22 |
Modular Conversational Agents for Surveys and Interviews |
Jiangbo Yu et.al. |
2412.17049 |
null |
2024-12-22 |
Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective |
Hankun Wang et.al. |
2412.17048 |
null |
2024-12-22 |
Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation |
Luoxu Jin et.al. |
2412.17042 |
null |
2024-12-22 |
HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories |
Eric Hedlin et.al. |
2412.17040 |
null |
2024-12-22 |
Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model |
Kevin Shen et.al. |
2412.17039 |
null |
2024-12-22 |
Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models |
Lang Gao et.al. |
2412.17034 |
null |
2024-12-22 |
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge |
Jie He et.al. |
2412.17032 |
null |
2024-12-22 |
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos |
Zhengqian Wu et.al. |
2412.17022 |
link |
2024-12-22 |
GAS: Generative Auto-bidding with Post-training Search |
Yewen Li et.al. |
2412.17018 |
null |
2024-12-22 |
Robustness of Large Language Models Against Adversarial Attacks |
Yiyi Tao et.al. |
2412.17011 |
null |
2024-12-22 |
InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions |
Ronghui Li et.al. |
2412.16982 |
null |
2024-12-22 |
On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora |
Tzu-Chieh Chen et.al. |
2412.16976 |
null |
2024-12-22 |
Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs |
Alexander von Recum et.al. |
2412.16974 |
null |
2024-12-22 |
Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach |
Chunxu Zhang et.al. |
2412.16969 |
link |
2024-12-22 |
System-2 Mathematical Reasoning via Enriched Instruction Tuning |
Huanqia Cai et.al. |
2412.16964 |
null |
2024-12-22 |
Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework |
Jundong Xu et.al. |
2412.16953 |
null |
2024-12-22 |
A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation |
Ekai Hashimoto et.al. |
2412.16943 |
null |
2024-12-22 |
Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering |
Zhongjian Hu et.al. |
2412.16936 |
null |
2024-12-22 |
Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models |
Kai Zheng et.al. |
2412.16933 |
null |
2024-12-22 |
Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs |
Bohan Jin et.al. |
2412.16922 |
null |
2024-12-22 |
Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection |
Yuhang Gan et.al. |
2412.16918 |
null |
2024-12-22 |
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation |
Quan Dao et.al. |
2412.16906 |
null |
2024-12-22 |
Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model |
Songjun Tu et.al. |
2412.16878 |
link |
2024-12-20 |
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding |
Chenxin Tao et.al. |
2412.16158 |
null |
2024-12-20 |
Can Generative Video Models Help Pose Estimation? |
Ruojin Cai et.al. |
2412.16155 |
null |
2024-12-20 |
Offline Reinforcement Learning for LLM Multi-Step Reasoning |
Huaijie Wang et.al. |
2412.16145 |
link |
2024-12-20 |
Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation |
Seyedreza Mohseni et.al. |
2412.16135 |
null |
2024-12-20 |
Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information |
Dirk Bergemann et.al. |
2412.16132 |
null |
2024-12-20 |
PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics |
Daniil Larionov et.al. |
2412.16120 |
null |
2024-12-20 |
Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts |
Muhammad Abdullah Sohail et.al. |
2412.16119 |
link |
2024-12-20 |
PruneVid: Visual Token Pruning for Efficient Video Large Language Models |
Xiaohu Huang et.al. |
2412.16117 |
link |
2024-12-20 |
The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse |
Mahyar Habibi et.al. |
2412.16114 |
null |
2024-12-20 |
Logical Consistency of Large Language Models in Fact-checking |
Bishwamittra Ghosh et.al. |
2412.16100 |
null |
2024-12-20 |
The Evolution of LLM Adoption in Industry Data Curation Practices |
Crystal Qian et.al. |
2412.16089 |
null |
2024-12-20 |
Efficient MedSAMs: Segment Anything in Medical Images on Laptop |
Jun Ma et.al. |
2412.16085 |
link |
2024-12-20 |
Formal Mathematical Reasoning: A New Frontier in AI |
Kaiyu Yang et.al. |
2412.16075 |
null |
2024-12-20 |
The Only Way is Ethics: A Guide to Ethical Research with Large Language Models |
Eddie L. Ungless et.al. |
2412.16022 |
link |
2024-12-20 |
Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support |
Qijiong Liu et.al. |
2412.15973 |
link |
2024-12-20 |
From General to Specific: Tailoring Large Language Models for Personalized Healthcare |
Ruize Shi et.al. |
2412.15957 |
null |
2024-12-20 |
Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring |
Markus Borg et.al. |
2412.15948 |
null |
2024-12-20 |
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation |
Gautier Evennou et.al. |
2412.15939 |
link |
2024-12-20 |
Large Language Model assisted Hybrid Fuzzing |
Ruijie Meng et.al. |
2412.15931 |
null |
2024-12-20 |
MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection |
Andrea Moglia et.al. |
2412.15925 |
link |
2024-12-20 |
RiTTA: Modeling Event Relations in Text-to-Audio Generation |
Yuhang He et.al. |
2412.15922 |
link |
2024-12-20 |
Less is More: Towards Green Code Large Language Models via Unified Structural Pruning |
Guang Yang et.al. |
2412.15921 |
null |
2024-12-20 |
Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model |
Yosuke Yamagishi et.al. |
2412.15907 |
null |
2024-12-20 |
Evaluation of Reliability Criteria for News Publishers with Large Language Models |
Manuel Pratelli et.al. |
2412.15896 |
null |
2024-12-20 |
TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain |
Camille Barboule et.al. |
2412.15891 |
null |
2024-12-20 |
AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI |
Katja Bühler et.al. |
2412.15876 |
null |
2024-12-20 |
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback |
Jiaming Ji et.al. |
2412.15838 |
link |
2024-12-20 |
WebLLM: A High-Performance In-Browser LLM Inference Engine |
Charlie F. Ruan et.al. |
2412.15803 |
link |
2024-12-20 |
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning |
Sungjin Park et.al. |
2412.15797 |
null |
2024-12-20 |
GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning |
Heming Zhang et.al. |
2412.15790 |
null |
2024-12-20 |
Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech |
Jonathan Heitz et.al. |
2412.15772 |
link |
2024-12-20 |
Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference |
Jorge García-Carrasco et.al. |
2412.15750 |
link |
2024-12-20 |
Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models |
Shamus Sim et.al. |
2412.15748 |
null |
2024-12-20 |
VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models |
Dexter Neo et.al. |
2412.15739 |
null |
2024-12-20 |
AutoLife: Automatic Life Journaling with Smartphones and LLMs |
Huatao Xu et.al. |
2412.15714 |
null |
2024-12-20 |
Contrastive Learning for Task-Independent SpeechLLM-Pretraining |
Maike Züfle et.al. |
2412.15712 |
link |
2024-12-20 |
Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback |
Niklas Ippisch et.al. |
2412.15702 |
null |
2024-12-20 |
Code Review Automation Via Multi-task Federated LLM – An Empirical Study |
Jahnavi Kumar et.al. |
2412.15676 |
null |
2024-12-20 |
Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline |
Guancheng Zeng et.al. |
2412.15660 |
null |
2024-12-20 |
Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class |
Annie D’souza et.al. |
2412.15657 |
null |
2024-12-20 |
MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula |
Sieun Hyeon et.al. |
2412.15655 |
link |
2024-12-20 |
Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution |
Wentao Tan et.al. |
2412.15650 |
null |
2024-12-20 |
Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model |
Xin Du et.al. |
2412.15634 |
link |
2024-12-20 |
Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning? |
Mengyu Ye et.al. |
2412.15628 |
null |
2024-12-20 |
JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs |
Hongyi Li et.al. |
2412.15623 |
null |
2024-12-20 |
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage |
Zhi Gao et.al. |
2412.15606 |
null |
2024-12-20 |
Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks |
Brian J Chan et.al. |
2412.15605 |
link |
2024-12-20 |
Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification |
Gyutae Park et.al. |
2412.15603 |
null |
2024-12-20 |
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation |
Xiaoqiang Kang et.al. |
2412.15594 |
link |
2024-12-20 |
NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization |
Danial Kamali et.al. |
2412.15588 |
link |
2024-12-20 |
To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models |
Jessica Y. Bo et.al. |
2412.15584 |
null |
2024-12-20 |
A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation |
Ryien Hosseini et.al. |
2412.15582 |
null |
2024-12-20 |
Score-based Generative Diffusion Models for Social Recommendations |
Chengyi Liu et.al. |
2412.15579 |
link |
2024-12-20 |
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning |
Xinyang Tong et.al. |
2412.15576 |
null |
2024-12-20 |
J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM |
Takero Yoshida et.al. |
2412.15574 |
null |
2024-12-20 |
Continual Learning Using a Kernel-Based Method Over Foundation Models |
Saleh Momeni et.al. |
2412.15571 |
link |
2024-12-20 |
DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation |
Yichun Tai et.al. |
2412.15570 |
link |
2024-12-20 |
In-context Continual Learning Assisted by an External Continual Learner |
Saleh Momeni et.al. |
2412.15563 |
null |
2024-12-20 |
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning |
Zheyuan Zhang et.al. |
2412.15547 |
null |
2024-12-20 |
MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering |
Zhang Siyue et.al. |
2412.15540 |
null |
2024-12-20 |
XRAG: eXamining the Core – Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation |
Qianren Mao et.al. |
2412.15529 |
link |
2024-12-20 |
HREF: Human Response-Guided Evaluation of Instruction Following in Language Models |
Xinxi Lyu et.al. |
2412.15524 |
link |
2024-12-20 |
PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time |
Alireza Pourali et.al. |
2412.15519 |
link |
2024-12-20 |
Stylish and Functional: Guided Interpolation Subject to Physical Constraints |
Yan-Ying Chen et.al. |
2412.15507 |
null |
2024-12-20 |
Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework |
Zhenjie Xu et.al. |
2412.15504 |
link |
2024-12-20 |
Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models |
Zhisheng Tang et.al. |
2412.15501 |
null |
2024-12-20 |
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use |
Junjie Ye et.al. |
2412.15495 |
link |
2024-12-20 |
PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search |
Jiaxin Wu et.al. |
2412.15494 |
null |
2024-12-20 |
GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators |
Hengjia Li et.al. |
2412.15491 |
null |
2024-12-20 |
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage |
Saehyung Lee et.al. |
2412.15484 |
null |
2024-12-20 |
Continual Learning Using Only Large Language Model Prompting |
Jiabao Qiu et.al. |
2412.15479 |
null |
2024-12-19 |
TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models |
Ammar N. Abbas et.al. |
2412.15462 |
null |
2024-12-19 |
Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization |
Sahil Wadhwa et.al. |
2412.15453 |
null |
2024-12-19 |
AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals |
Angela Mastrianni et.al. |
2412.15444 |
null |
2024-12-19 |
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval |
Aakash Mahalingam et.al. |
2412.15443 |
null |
2024-12-19 |
Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models |
Tianchen Zhang et.al. |
2412.15431 |
null |
2024-12-19 |
MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale |
Swapnil Gandhi et.al. |
2412.15411 |
null |
2024-12-19 |
Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification |
Edoardo Allegrini et.al. |
2412.15410 |
null |
2024-12-19 |
Systematic Evaluation of Long-Context LLMs on Financial Concepts |
Lavanya Gupta et.al. |
2412.15386 |
null |
2024-12-19 |
Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation |
Joanne Boisson et.al. |
2412.15375 |
link |
2024-12-19 |
Automated Root Cause Analysis System for Complex Data Products |
Mathieu Demarne et.al. |
2412.15374 |
null |
2024-12-19 |
Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs |
Liam Seymour et.al. |
2412.15352 |
link |
2024-12-19 |
Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models |
Reza Shirkavand et.al. |
2412.15341 |
null |
2024-12-19 |
Complete background cosmology of parity-even quadratic metric-affine gravity |
Thomas Dyer et.al. |
2412.15329 |
null |
2024-12-19 |
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving |
Shuo Xing et.al. |
2412.15208 |
link |
2024-12-19 |
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark |
Qihao Zhao et.al. |
2412.15194 |
link |
2024-12-19 |
LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation |
Weijia Shi et.al. |
2412.15188 |
null |
2024-12-19 |
Tiled Diffusion |
Or Madar et.al. |
2412.15185 |
null |
2024-12-19 |
Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning |
Simon Frieder et.al. |
2412.15184 |
null |
2024-12-19 |
STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning |
Marius Memmel et.al. |
2412.15182 |
null |
2024-12-19 |
HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages |
Aman Chaturvedi et.al. |
2412.15178 |
null |
2024-12-19 |
Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying |
Federico Castagna et.al. |
2412.15177 |
link |
2024-12-19 |
Rethinking Uncertainty Estimation in Natural Language Generation |
Lukas Aichberger et.al. |
2412.15176 |
null |
2024-12-19 |
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM |
Yatai Ji et.al. |
2412.15156 |
link |
2024-12-19 |
Language Models as Continuous Self-Evolving Data Engineers |
Peidong Wang et.al. |
2412.15151 |
null |
2024-12-19 |
Jet: A Modern Transformer-Based Normalizing Flow |
Alexander Kolesnikov et.al. |
2412.15129 |
null |
2024-12-19 |
Adaptive Pruning for Large Language Models with Structural Importance Awareness |
Haotian Zheng et.al. |
2412.15127 |
null |
2024-12-19 |
Outcome-Refining Process Supervision for Code Generation |
Zhuohao Yu et.al. |
2412.15118 |
link |
2024-12-19 |
Qwen2.5 Technical Report |
Qwen et.al. |
2412.15115 |
link |
2024-12-19 |
Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture |
Thomas F Burns et.al. |
2412.15113 |
link |
2024-12-19 |
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation |
Yang Tian et.al. |
2412.15109 |
link |
2024-12-19 |
Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability |
Xiangsen Chen et.al. |
2412.15101 |
null |
2024-12-19 |
Nano-ESG: Extracting Corporate Sustainability Information from News Articles |
Fabian Billert et.al. |
2412.15093 |
link |
2024-12-19 |
Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation |
Haoran Liu et.al. |
2412.15086 |
null |
2024-12-19 |
ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots |
Bhupendra Acharya et.al. |
2412.15072 |
null |
2024-12-19 |
ConfliBERT: A Language Model for Political Conflict |
Patrick T. Brandt et.al. |
2412.15060 |
link |
2024-12-19 |
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps |
Felix Friedrich et.al. |
2412.15035 |
null |
2024-12-19 |
DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space |
Mang Ning et.al. |
2412.15032 |
link |
2024-12-19 |
Large Language Models and Code Security: A Systematic Literature Review |
Enna Basic et.al. |
2412.15004 |
null |
2024-12-19 |
HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs |
Pham Vu Tuan Dat et.al. |
2412.14995 |
link |
2024-12-19 |
RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning |
Raphael Memmesheimer et.al. |
2412.14989 |
null |
2024-12-19 |
Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts |
Ioana Buhnila et.al. |
2412.14986 |
null |
2024-12-19 |
AI and Cultural Context: An Empirical Investigation of Large Language Models’ Performance on Chinese Social Work Professional Standards |
Zia Qi et.al. |
2412.14971 |
null |
2024-12-19 |
Movie2Story: A framework for understanding videos and telling stories in the form of novel text |
Kangning Li et.al. |
2412.14965 |
null |
2024-12-19 |
Knowledge Injection via Prompt Distillation |
Kalle Kujanpää et.al. |
2412.14964 |
null |
2024-12-19 |
Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities |
Daniil Medyakov et.al. |
2412.14935 |
null |
2024-12-19 |
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response |
Junyu Luo et.al. |
2412.14922 |
link |
2024-12-19 |
Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation |
Zexiong Ma et.al. |
2412.14905 |
null |
2024-12-19 |
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering |
Peize Li et.al. |
2412.14880 |
null |
2024-12-19 |
Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering |
Imed Keraghel et.al. |
2412.14867 |
null |
2024-12-19 |
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling |
Junyi Li et.al. |
2412.14860 |
null |
2024-12-19 |
DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis |
Hongling Xu et.al. |
2412.14849 |
link |
2024-12-19 |
Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas |
Pietro Bernardelle et.al. |
2412.14843 |
null |
2024-12-19 |
Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis |
Greta Dolcetti et.al. |
2412.14841 |
null |
2024-12-19 |
Progressive Multimodal Reasoning via Active Retrieval |
Guanting Dong et.al. |
2412.14835 |
null |
2024-12-19 |
Answer Set Networks: Casting Answer Set Programming into Deep Learning |
Arseny Skryagin et.al. |
2412.14814 |
link |
2024-12-19 |
ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis |
Zeao Tu et.al. |
2412.14809 |
link |
2024-12-19 |
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning |
Ziang Ye et.al. |
2412.14780 |
null |
2024-12-19 |
ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine |
Rabee Qasem et.al. |
2412.14771 |
null |
2024-12-19 |
PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children |
Yiqun Zhang et.al. |
2412.14769 |
link |
2024-12-19 |
CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering |
Ruida Hu et.al. |
2412.14764 |
link |
2024-12-19 |
Query pipeline optimization for cancer patient question answering systems |
Maolin He et.al. |
2412.14751 |
null |
2024-12-19 |
Active Inference and Human–Computer Interaction |
Roderick Murray-Smith et.al. |
2412.14741 |
null |
2024-12-19 |
On Verbalized Confidence Scores for LLMs |
Daniel Yang et.al. |
2412.14737 |
link |
2024-12-19 |
Creation of AI-driven Smart Spaces for Enhanced Indoor Environments – A Survey |
Aygün Varol et.al. |
2412.14708 |
null |
2024-12-19 |
LLMs as mediators: Can they diagnose conflicts accurately? |
Özgecan Koçak et.al. |
2412.14675 |
null |
2024-12-19 |
Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT |
Hassane Kissane et.al. |
2412.14670 |
null |
2024-12-19 |
IOHunter: Graph Foundation Model to Uncover Online Information Operations |
Marco Minici et.al. |
2412.14663 |
link |
2024-12-19 |
Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models |
Zijun Chen et.al. |
2412.14660 |
link |
2024-12-19 |
Length Controlled Generation for Black-box LLMs |
Yuxuan Gu et.al. |
2412.14656 |
null |
2024-12-19 |
Learning to Generate Research Idea with Dynamic Control |
Ruochen Li et.al. |
2412.14626 |
null |
2024-12-19 |
How good is GPT at writing political speeches for the White House? |
Jacques Savoy et.al. |
2412.14617 |
null |
2024-12-19 |
Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning |
Kepu Zhang et.al. |
2412.14588 |
null |
2024-12-19 |
HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning |
Minkuk Kim et.al. |
2412.14585 |
null |
2024-12-19 |
Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues |
Tao He et.al. |
2412.14584 |
null |
2024-12-19 |
CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation |
Youngwon Lee et.al. |
2412.14581 |
null |
2024-12-19 |
DiffSim: Taming Diffusion Models for Evaluating Visual Similarity |
Yiren Song et.al. |
2412.14580 |
link |
2024-12-19 |
Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models |
Wenhan Liu et.al. |
2412.14574 |
link |
2024-12-19 |
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model |
Shunlin Lu et.al. |
2412.14559 |
null |
2024-12-19 |
The Current Challenges of Software Engineering in the Era of Large Language Models |
Cuiyun Gao et.al. |
2412.14554 |
null |
2024-12-19 |
Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models |
Xiao Cui et.al. |
2412.14528 |
link |
2024-12-19 |
Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment |
Teng Xiao et.al. |
2412.14516 |
link |
2024-12-19 |
Relational Programming with Foundation Models |
Ziyang Li et.al. |
2412.14515 |
null |
2024-12-19 |
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization |
Jiayi Wu et.al. |
2412.14510 |
link |
2024-12-19 |
Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs |
Yuzuki Arai et.al. |
2412.14501 |
null |
2024-12-19 |
Guided Diffusion Model for Sensor Data Obfuscation |
Xin Yang et.al. |
2412.14499 |
null |
2024-12-19 |
FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis |
Abdullah Khan et.al. |
2412.14492 |
link |
2024-12-19 |
Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities |
Amandeep Kaur et.al. |
2412.14486 |
null |
2024-12-19 |
DirectorLLM for Human-Centric Video Generation |
Kunpeng Song et.al. |
2412.14484 |
null |
2024-12-19 |
Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs |
Koshiro Saito et.al. |
2412.14471 |
null |
2024-12-19 |
Agent-SafetyBench: Evaluating the Safety of LLM Agents |
Zhexin Zhang et.al. |
2412.14470 |
link |
2024-12-19 |
From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research |
Xiang Cheng et.al. |
2412.14461 |
null |
2024-12-19 |
LEDiff: Latent Exposure Diffusion for HDR Generation |
Chao Wang et.al. |
2412.14456 |
null |
2024-12-19 |
Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems |
Genki Kusano et.al. |
2412.14454 |
null |
2024-12-19 |
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation |
Shengqi Liu et.al. |
2412.14453 |
null |
2024-12-19 |
ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study |
Eric Modesitt et.al. |
2412.14436 |
link |
2024-12-19 |
All-in-One Tuning and Structural Pruning for Domain-Specific LLMs |
Lei Lu et.al. |
2412.14426 |
null |
2024-12-19 |
FedPIA – Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning |
Pramit Saha et.al. |
2412.14424 |
null |
2024-12-19 |
Enhancing Diffusion Models for High-Quality Image Generation |
Jaineet Shah et.al. |
2412.14422 |
null |
2024-12-18 |
ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers |
Haowei Liu et.al. |
2412.14405 |
null |
2024-12-18 |
Clinical Trials Ontology Engineering with Large Language Models |
Berkan Çakır et.al. |
2412.14387 |
null |
2024-12-18 |
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling |
William Han et.al. |
2412.14373 |
link |
2024-12-18 |
Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models’ Character Understanding Evaluation |
Yuxuan Jiang et.al. |
2412.14368 |
null |
2024-12-18 |
Surrealistic-like Image Generation with Vision-Language Models |
Elif Ayten et.al. |
2412.14366 |
link |
2024-12-18 |
ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals |
Utkarsh Saxena et.al. |
2412.14363 |
link |
2024-12-18 |
A Unifying Information-theoretic Perspective on Evaluating Generative Models |
Alexis Fox et.al. |
2412.14340 |
null |
2024-12-18 |
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation |
Benjamin Steenhoek et.al. |
2412.14308 |
null |
2024-12-18 |
Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs |
David Restrepo et.al. |
2412.14304 |
null |
2024-12-18 |
Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data |
haina Raza et.al. |
2412.14276 |
link |
2024-12-18 |
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces |
Jihan Yang et.al. |
2412.14171 |
link |
2024-12-18 |
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning |
Shengbang Tong et.al. |
2412.14164 |
null |
2024-12-18 |
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks |
Frank F. Xu et.al. |
2412.14161 |
link |
2024-12-18 |
Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models |
Atin Sakkeer Hussain et.al. |
2412.14146 |
null |
2024-12-18 |
LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research |
Tianyang Gu et.al. |
2412.14141 |
null |