2025-03-24 |
Equivariant Image Modeling |
Ruixiao Dong et.al. |
2503.18948 |
link |
2025-03-24 |
Aether: Geometric-Aware Unified World Modeling |
Aether Team et.al. |
2503.18945 |
null |
2025-03-24 |
DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation |
Karim Abou Zeid et.al. |
2503.18944 |
link |
2025-03-24 |
SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding |
Mingze Xu et.al. |
2503.18943 |
null |
2025-03-24 |
Video-T1: Test-Time Scaling for Video Generation |
Fangfu Liu et.al. |
2503.18942 |
null |
2025-03-24 |
Exploring Training and Inference Scaling Laws in Generative Retrieval |
Hongru Cai et.al. |
2503.18941 |
null |
2025-03-24 |
CoMP: Continual Multimodal Pre-training for Vision Foundation Models |
Yitong Chen et.al. |
2503.18931 |
link |
2025-03-24 |
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training |
Brian R. Bartoldson et.al. |
2503.18929 |
null |
2025-03-24 |
FFN Fusion: Rethinking Sequential Computation in Large Language Models |
Akhiad Bercovich et.al. |
2503.18908 |
null |
2025-03-24 |
xKV: Cross-Layer SVD for KV-Cache Compression |
Chi-Chih Chang et.al. |
2503.18893 |
null |
2025-03-24 |
AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration |
Zhexuan Wang et.al. |
2503.18891 |
null |
2025-03-24 |
Toward building next-generation Geocoding systems: a systematic review |
Zhengcong Yin et.al. |
2503.18888 |
null |
2025-03-24 |
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders |
Andrey Galichin et.al. |
2503.18878 |
null |
2025-03-24 |
Efficient Self-Supervised Adaptation for Medical Image Analysis |
Moein Sorkhei et.al. |
2503.18873 |
null |
2025-03-24 |
Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design |
Rui Xie et.al. |
2503.18869 |
null |
2025-03-24 |
Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations |
Junlan Chen et.al. |
2503.18865 |
null |
2025-03-24 |
3DSwapping: Texture Swapping For 3D Object From Single Reference Image |
Xiao Cao et.al. |
2503.18853 |
null |
2025-03-24 |
Defeating Prompt Injections by Design |
Edoardo Debenedetti et.al. |
2503.18813 |
null |
2025-03-24 |
Classical Planning with LLM-Generated Heuristics: Challenging the State of the Art with Python Code |
Augusto B. Corrêa et.al. |
2503.18809 |
null |
2025-03-24 |
REALM: A Dataset of Real-World LLM Use Cases |
Jingwen Cheng et.al. |
2503.18792 |
null |
2025-03-24 |
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache |
Dayou Du et.al. |
2503.18773 |
null |
2025-03-24 |
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning |
Alan Dao et.al. |
2503.18769 |
null |
2025-03-24 |
RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation |
Chengbo Yuan et.al. |
2503.18738 |
null |
2025-03-24 |
Predicting the Road Ahead: A Knowledge Graph based Foundation Model for Scene Understanding in Autonomous Driving |
Hongkuan Zhou et.al. |
2503.18730 |
null |
2025-03-24 |
LLaVAction: evaluating and training multi-modal large language models for action recognition |
Shaokai Ye et.al. |
2503.18712 |
null |
2025-03-24 |
Revisiting Automatic Data Curation for Vision Foundation Models in Digital Pathology |
Boqi Chen et.al. |
2503.18709 |
null |
2025-03-24 |
OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad |
Luyao Tang et.al. |
2503.18695 |
null |
2025-03-25 |
Commander-GPT: Fully Unleashing the Sarcasm Detection Capability of Multi-Modal Large Language Models |
Yazhou Zhang et.al. |
2503.18681 |
null |
2025-03-24 |
NullSwap: Proactive Identity Cloaking Against Deepfake Face Swapping |
Tianyi Wang et.al. |
2503.18678 |
null |
2025-03-24 |
Boosting Virtual Agent Learning and Reasoning: A Step-wise, Multi-dimensional, and Generalist Reward Model with Benchmark |
Bingchen Miao et.al. |
2503.18665 |
null |
2025-03-24 |
From Fragment to One Piece: A Survey on AI-Driven Graphic Design |
Xingxing Zou et.al. |
2503.18641 |
null |
2025-03-24 |
Adaptive Machine Learning for Resource-Constrained Environments |
Sebastián A. Cajas Ordóñez et.al. |
2503.18634 |
null |
2025-03-24 |
Generative Dataset Distillation using Min-Max Diffusion Model |
Junqiao Fan et.al. |
2503.18626 |
null |
2025-03-24 |
Scaling Laws for Emulation of Stellar Spectra |
Tomasz Różański et.al. |
2503.18617 |
null |
2025-03-24 |
LANGALIGN: Enhancing Non-English Language Models via Cross-Lingual Embedding Alignment |
Jong Myoung Kim et.al. |
2503.18603 |
null |
2025-03-24 |
Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization |
Minsu Kim et.al. |
2503.18599 |
null |
2025-03-24 |
A Universal Model Combining Differential Equations and Neural Networks for Ball Trajectory Prediction |
Zhiwei Shi et.al. |
2503.18584 |
null |
2025-03-24 |
Anchor-based oversampling for imbalanced tabular data via contrastive and adversarial learning |
Hadi Mohammadi et.al. |
2503.18569 |
null |
2025-03-24 |
Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures |
Abdoul Majid O. Thiombiano et.al. |
2503.18565 |
null |
2025-03-24 |
Power-fractional distributions and branching processes |
Gerold Alsmeyer et.al. |
2503.18563 |
null |
2025-03-24 |
Self-Reported Confidence of Large Language Models in Gastroenterology: Analysis of Commercial, Open-Source, and Quantized Models |
Nariman Naderi et.al. |
2503.18562 |
null |
2025-03-25 |
AMD-Hummingbird: Towards an Efficient Text-to-Video Model |
Takashi Isobe et.al. |
2503.18559 |
null |
2025-03-24 |
HiRes-FusedMIM: A High-Resolution RGB-DSM Pre-trained Model for Building-Level Remote Sensing Applications |
Guneet Mutreja et.al. |
2503.18540 |
null |
2025-03-24 |
SciClaims: An End-to-End Generative System for Biomedical Claim Analysis |
Raúl Ortega et.al. |
2503.18526 |
null |
2025-03-24 |
P3Nav: A Unified Framework for Embodied Navigation Integrating Perception, Planning, and Prediction |
Yufeng Zhong et.al. |
2503.18525 |
null |
2025-03-24 |
Can Text-to-Video Generation help Video-Language Alignment? |
Luca Zanella et.al. |
2503.18507 |
null |
2025-03-24 |
Autoregressive Language Models for Knowledge Base Population: A case study in the space mission domain |
Andrés García-Silva et.al. |
2503.18502 |
null |
2025-03-24 |
Verbal Process Supervision Elicits Better Coding Agents |
Hao-Yuan Chen et.al. |
2503.18494 |
null |
2025-03-24 |
Safeguarding Mobile GUI Agent via Logic-based Action Verification |
Jungjae Lee et.al. |
2503.18492 |
null |
2025-03-24 |
Large Language Models powered Network Attack Detection: Architecture, Opportunities and Case Study |
Xinggong Zhang et.al. |
2503.18487 |
null |
2025-03-24 |
Explaining Domain Shifts in Language: Concept erasing for Interpretable Image Classification |
Zequn Zeng et.al. |
2503.18483 |
null |
2025-03-24 |
Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding |
Xiangrui Liu et.al. |
2503.18478 |
null |
2025-03-24 |
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models |
Tadeusz Dziarmaga et.al. |
2503.18462 |
null |
2025-03-24 |
MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing |
Lingting Zhu et.al. |
2503.18461 |
null |
2025-03-24 |
ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation |
Jiahui Xiang et.al. |
2503.18460 |
null |
2025-03-24 |
SEAlign: Alignment Training for Software Engineering Agent |
Kechi Zhang et.al. |
2503.18455 |
null |
2025-03-24 |
InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment |
Yunhong Lu et.al. |
2503.18454 |
null |
2025-03-24 |
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation |
Guosheng Zhao et.al. |
2503.18438 |
null |
2025-03-24 |
A Simple yet Effective Layout Token in Large Language Models for Document Understanding |
Zhaoqing Zhu et.al. |
2503.18434 |
null |
2025-03-24 |
Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning |
Junsong Li et.al. |
2503.18432 |
null |
2025-03-24 |
Breaking the Encoder Barrier for Seamless Video-Language Understanding |
Handong Li et.al. |
2503.18422 |
null |
2025-03-25 |
Instruct-CLIP: Improving Instruction-Guided Image Editing with Automated Data Refinement Using Contrastive Learning |
Sherry X. Chen et.al. |
2503.18406 |
null |
2025-03-24 |
Solving Situation Puzzles with Large Language Model and External Reformulation |
Kun Li et.al. |
2503.18394 |
null |
2025-03-24 |
Manipulation and the AI Act: Large Language Model Chatbots and the Danger of Mirrors |
Joshua Krook et.al. |
2503.18387 |
null |
2025-03-24 |
Resource-Efficient Motion Control for Video Generation via Dynamic Mask Guidance |
Sicong Feng et.al. |
2503.18386 |
null |
2025-03-24 |
Maximum Redundancy Pruning: A Principle-Driven Layerwise Sparsity Allocation for LLMs |
Chang Gao et.al. |
2503.18377 |
null |
2025-03-24 |
J&H: Evaluating the Robustness of Large Language Models Under Knowledge-Injection Attacks in Legal Domain |
Yiran Hu et.al. |
2503.18360 |
null |
2025-03-24 |
Mitigating Cache Noise in Test-Time Adaptation for Large Vision-Language Models |
Haotian Zhai et.al. |
2503.18334 |
null |
2025-03-24 |
Optimizing Influence Campaigns: Nudging under Bounded Confidence |
Yen-Shao Chen et.al. |
2503.18331 |
null |
2025-03-24 |
Towards Training-free Anomaly Detection with Vision and Language Foundation Models |
Jinjin Zhang et.al. |
2503.18325 |
null |
2025-03-24 |
Bridging Writing Manner Gap in Visual Instruction Tuning by Creating LLM-aligned Instructions |
Dong Jing et.al. |
2503.18320 |
null |
2025-03-24 |
Knowledge Transfer from LLMs to Provenance Analysis: A Semantic-Augmented Method for APT Detection |
Fei Zuo et.al. |
2503.18316 |
null |
2025-03-24 |
DeepFund: Will LLM be Professional at Fund Investment? A Live Arena Perspective |
Changlun Li et.al. |
2503.18313 |
null |
2025-03-24 |
Enhancing LLM-based Code Translation in Repository Context via Triple Knowledge-Augmented |
Guangsheng Ou et.al. |
2503.18305 |
null |
2025-03-24 |
How to Capture and Study Conversations Between Research Participants and ChatGPT: GPT for Researchers (g4r.org) |
Jin Kim et.al. |
2503.18303 |
null |
2025-03-24 |
Image-to-Text for Medical Reports Using Adaptive Co-Attention and Triple-LSTM Module |
Yishen Liu et.al. |
2503.18297 |
null |
2025-03-24 |
Surgical Action Planning with Large Language Models |
Mengya Xu et.al. |
2503.18296 |
null |
2025-03-24 |
Fact-checking AI-generated news reports: Can LLMs catch their own lies? |
Jiayi Yao et.al. |
2503.18293 |
null |
2025-03-24 |
Jenga: Effective Memory Management for Serving LLM with Heterogeneity |
Chen Zhang et.al. |
2503.18292 |
null |
2025-03-24 |
Sun-Shine: A Large Language Model for Tibetan Culture |
Cheng Huang et.al. |
2503.18288 |
null |
2025-03-24 |
CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI |
Siyuan Cheng et.al. |
2503.18286 |
null |
2025-03-24 |
Analyzing Islamophobic Discourse Using Semi-Coded Terms and LLMs |
Raza Ul Mustafa et.al. |
2503.18273 |
null |
2025-03-24 |
Efficient Inference for Covariate-adjusted Bradley-Terry Model with Covariate Shift |
Xiudi Li et.al. |
2503.18256 |
null |
2025-03-24 |
Surface-Aware Distilled 3D Semantic Features |
Lukas Uzolas et.al. |
2503.18254 |
null |
2025-03-24 |
Enhancing Multi-Label Emotion Analysis and Corresponding Intensities for Ethiopian Languages |
Tadesse Destaw Belay et.al. |
2503.18253 |
null |
2025-03-23 |
CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation |
Jungsoo Lee et.al. |
2503.18244 |
null |
2025-03-23 |
ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices |
Aneesh Vathul et.al. |
2503.18242 |
null |
2025-03-23 |
Adaptive Rank Allocation: Speeding Up Modern Transformers with RaNA Adapters |
Roberto Garcia et.al. |
2503.18216 |
null |
2025-03-23 |
LakotaBERT: A Transformer-based Model for Low Resource Lakota Language |
Kanishka Parankusham et.al. |
2503.18212 |
null |
2025-03-23 |
The Power of Small LLMs in Geometry Generation for Physical Simulations |
Ossama Shafiq et.al. |
2503.18178 |
null |
2025-03-23 |
Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering |
Zixin Chen et.al. |
2503.18172 |
null |
2025-03-23 |
Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes |
Kelly O. Marshall et.al. |
2503.18155 |
null |
2025-03-23 |
LocDiffusion: Identifying Locations on Earth by Diffusing in the Hilbert Space |
Zhangyu Wang et.al. |
2503.18142 |
null |
2025-03-23 |
AGIR: Assessing 3D Gait Impairment with Reasoning based on LLMs |
Diwei Wang et.al. |
2503.18141 |
null |
2025-03-23 |
MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation |
Jiaxin Huang et.al. |
2503.18135 |
null |
2025-03-23 |
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection |
Yibo Yan et.al. |
2503.18132 |
null |
2025-03-23 |
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization |
Juntao Dai et.al. |
2503.18130 |
null |
2025-03-23 |
GeoBenchX: Benchmarking LLMs for Multistep Geospatial Tasks |
Varvara Krechetova et.al. |
2503.18129 |
null |
2025-03-23 |
$D^2LoRA$ : Data-Driven LoRA Initialization for Low Resource Tasks |
Javad SeraJ et.al. |
2503.18089 |
null |
2025-03-23 |
Vehicular Road Crack Detection with Deep Learning: A New Online Benchmark for Comprehensive Evaluation of Existing Algorithms |
Nachuan Ma et.al. |
2503.18082 |
null |
2025-03-21 |
Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique |
Yansi Li et.al. |
2503.17363 |
null |
2025-03-21 |
Position: Interactive Generative Video as Next-Generation Game Engine |
Jiwen Yu et.al. |
2503.17359 |
null |
2025-03-21 |
HCAST: Human-Calibrated Autonomy Software Tasks |
David Rein et.al. |
2503.17354 |
null |
2025-03-21 |
NdLinear Is All You Need for Representation Learning |
Alex Reneau et.al. |
2503.17353 |
null |
2025-03-21 |
OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement |
Yihe Deng et.al. |
2503.17352 |
null |
2025-03-21 |
Capturing Individual Human Preferences with Reward Features |
André Barreto et.al. |
2503.17338 |
null |
2025-03-21 |
Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs |
Reem Gody et.al. |
2503.17336 |
null |
2025-03-21 |
CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities |
Yuxuan Zhu et.al. |
2503.17332 |
link |
2025-03-21 |
LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language |
Kun Chu et.al. |
2503.17309 |
null |
2025-03-21 |
Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests |
John Naulty et.al. |
2503.17302 |
null |
2025-03-21 |
Offline Model-Based Optimization: Comprehensive Review |
Minsu Kim et.al. |
2503.17286 |
null |
2025-03-21 |
CASE – Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement |
Gaifan Zhang et.al. |
2503.17279 |
null |
2025-03-21 |
Unsupervised Joint Learning of Optical Flow and Intensity with Event Cameras |
Shuang Guo et.al. |
2503.17262 |
null |
2025-03-21 |
SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging |
Aladin Djuhera et.al. |
2503.17239 |
null |
2025-03-21 |
FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs |
Albert Sawczyn et.al. |
2503.17229 |
null |
2025-03-21 |
Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation |
Giacomo Savazzi et.al. |
2503.17224 |
null |
2025-03-21 |
Automating Adjudication of Cardiovascular Events Using Large Language Models |
Sonish Sivarajkumar et.al. |
2503.17222 |
null |
2025-03-21 |
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning |
Sheng Wang et.al. |
2503.17195 |
null |
2025-03-21 |
LLMs Love Python: A Study of LLMs’ Bias for Programming Languages and Libraries |
Lukas Twist et.al. |
2503.17181 |
null |
2025-03-21 |
D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens |
Panpan Wang et.al. |
2503.17155 |
null |
2025-03-21 |
Modifying Large Language Model Post-Training for Diverse Creative Writing |
John Joon Young Chung et.al. |
2503.17126 |
null |
2025-03-21 |
Large Language Model Compression via the Nested Activation-Aware Decomposition |
Jun Lu et.al. |
2503.17101 |
null |
2025-03-21 |
Deterministic AI Agent Personality Expression through Standard Psychological Diagnostics |
J. M. Diederik Kruijssen et.al. |
2503.17085 |
null |
2025-03-21 |
A Study into Investigating Temporal Robustness of LLMs |
Jonas Wallat et.al. |
2503.17073 |
null |
2025-03-21 |
PVChat: Personalized Video Chat with One-Shot Learning |
Yufei Shi et.al. |
2503.17069 |
null |
2025-03-21 |
Problem Framing in the AI era: a new model |
Matteo Tuveri et.al. |
2503.17040 |
null |
2025-03-21 |
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process |
Junjie Hu et.al. |
2503.17029 |
null |
2025-03-21 |
RiboFlow: Conditional De Novo RNA Sequence-Structure Co-Design via Synergistic Flow Matching |
Runze Ma et.al. |
2503.17007 |
null |
2025-03-21 |
Text2Model: Generating dynamic chemical reactor models using large language models (LLMs) |
Sophia Rupprecht et.al. |
2503.17004 |
null |
2025-03-21 |
A Survey on Personalized Alignment – The Missing Piece for Large Language Models in Real-World Applications |
Jian Guan et.al. |
2503.17003 |
null |
2025-03-21 |
Steady Progress Beats Stagnation: Mutual Aid of Foundation and Conventional Models in Mixed Domain Semi-Supervised Medical Image Segmentation |
Qinghe Ma et.al. |
2503.16997 |
null |
2025-03-21 |
TRACE: Time SeRies PArameter EffiCient FinE-tuning |
Yuze Li et.al. |
2503.16991 |
null |
2025-03-21 |
Token Dynamics: Towards Efficient and Dynamic Video Token Representation for Video Large Language Models |
Haichao Zhang et.al. |
2503.16980 |
null |
2025-03-21 |
Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks |
Julian Junyan Wang et.al. |
2503.16974 |
null |
2025-03-21 |
Distilling Monocular Foundation Model for Fine-grained Depth Completion |
Yingping Liang et.al. |
2503.16970 |
null |
2025-03-21 |
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis |
Mengtian Li et.al. |
2503.16944 |
null |
2025-03-21 |
TEMPO: Temporal Preference Optimization of Video LLMs via Difficulty Scheduling and Pre-SFT Alignment |
Shicheng Li et.al. |
2503.16929 |
null |
2025-03-21 |
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation |
Linxi Liang et.al. |
2503.16922 |
null |
2025-03-21 |
Malliavin-Bismut Score-based Diffusion Models |
Ehsan Mirafzali et.al. |
2503.16917 |
null |
2025-03-21 |
FAIT: Fault-Aware Fine-Tuning for Better Code Generation |
Lishui Fan et.al. |
2503.16913 |
null |
2025-03-21 |
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation |
Jingzhi Fang et.al. |
2503.16893 |
null |
2025-03-21 |
Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation |
Jiangcheng Qin et.al. |
2503.16875 |
null |
2025-03-21 |
MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization |
Jian Zhang et.al. |
2503.16874 |
null |
2025-03-21 |
Lie Detector: Unified Backdoor Detection via Cross-Examination Framework |
Xuan Wang et.al. |
2503.16872 |
null |
2025-03-21 |
Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMs |
Anshumann et.al. |
2503.16870 |
null |
2025-03-21 |
Nonparametric Factor Analysis and Beyond |
Yujia Zheng et.al. |
2503.16865 |
null |
2025-03-21 |
MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering |
Jialin Chen et.al. |
2503.16858 |
null |
2025-03-21 |
Generative Compositor for Few-Shot Visual Information Extraction |
Zhibo Yang et.al. |
2503.16854 |
null |
2025-03-21 |
Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models |
Suho Yoo et.al. |
2503.16853 |
null |
2025-03-21 |
Towards LLM Guardrails via Sparse Representation Steering |
Zeqing He et.al. |
2503.16851 |
null |
2025-03-21 |
LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models |
Jian Liang et.al. |
2503.16843 |
null |
2025-03-21 |
Downstream Analysis of Foundational Medical Vision Models for Disease Progression |
Basar Demir et.al. |
2503.16842 |
null |
2025-03-21 |
When Tom Eats Kimchi: Evaluating Cultural Bias of Multimodal Large Language Models in Cultural Mixture Contexts |
Jun Seong Kim et.al. |
2503.16826 |
null |
2025-03-21 |
When Debate Fails: Bias Reinforcement in Large Language Models |
Jihwan Oh et.al. |
2503.16814 |
null |
2025-03-21 |
Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models |
Mengsong Wu et.al. |
2503.16779 |
null |
2025-03-21 |
Current and Future Use of Large Language Models for Knowledge Work |
Michelle Brachman et.al. |
2503.16774 |
null |
2025-03-21 |
On Explaining (Large) Language Models For Code Using Global Code-Based Explanations |
David N. Palacio et.al. |
2503.16771 |
null |
2025-03-20 |
Automated Harmfulness Testing for Code Large Language Models |
Honghao Tan et.al. |
2503.16740 |
null |
2025-03-20 |
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models |
Chengkai Huang et.al. |
2503.16734 |
null |
2025-03-20 |
Natural Language Generation |
Emiel van Miltenburg et.al. |
2503.16728 |
null |
2025-03-20 |
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding |
Jinlong Li et.al. |
2503.16707 |
null |
2025-03-20 |
APPA : Agentic Preformulation Pathway Assistant |
Julius Lange et.al. |
2503.16698 |
null |
2025-03-20 |
GAIR: Improving Multimodal Geo-Foundation Model with Geo-Aligned Implicit Representations |
Zeping Liu et.al. |
2503.16683 |
null |
2025-03-20 |
Echoes of Power: Investigating Geopolitical Bias in US and China Large Language Models |
Andre G. C. Pacheco et.al. |
2503.16679 |
null |
2025-03-20 |
Accelerating Transformer Inference and Training with 2:4 Activation Sparsity |
Daniel Haziza et.al. |
2503.16672 |
null |
2025-03-20 |
Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms |
Niki van Stein et.al. |
2503.16668 |
null |
2025-03-20 |
A preliminary data fusion study to assess the feasibility of Foundation Process-Property Models in Laser Powder Bed Fusion |
Oriol Vendrell-Gallart et.al. |
2503.16667 |
null |
2025-03-20 |
Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs |
Maxime Delmas et.al. |
2503.16655 |
null |
2025-03-20 |
Leveraging Large Language Models for Explainable Activity Recognition in Smart Homes: A Critical Evaluation |
Michele Fiori et.al. |
2503.16622 |
null |
2025-03-20 |
A Recipe for Generating 3D Worlds From a Single Image |
Katja Schwarz et.al. |
2503.16611 |
null |
2025-03-20 |
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions |
Hadi Amini et.al. |
2503.16585 |
null |
2025-03-22 |
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation |
Yuqing Wang et.al. |
2503.16430 |
null |
2025-03-20 |
DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding |
Keyan Chen et.al. |
2503.16426 |
link |
2025-03-20 |
SynCity: Training-Free Generation of 3D Worlds |
Paul Engstler et.al. |
2503.16420 |
null |
2025-03-20 |
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models |
Yang Sui et.al. |
2503.16419 |
link |
2025-03-20 |
M3: 3D-Spatial MultiModal Memory |
Xueyan Zou et.al. |
2503.16413 |
link |
2025-03-20 |
DreamTexture: Shape from Virtual Texture with Analysis by Augmentation |
Ananta R. Bhattarai et.al. |
2503.16412 |
null |
2025-03-20 |
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness |
SeungJu Cha et.al. |
2503.16406 |
link |
2025-03-20 |
The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination |
Yifan Sun et.al. |
2503.16402 |
link |
2025-03-20 |
Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them |
Guanyu Chen et.al. |
2503.16401 |
null |
2025-03-20 |
Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation |
Yijia Luo et.al. |
2503.16385 |
link |
2025-03-20 |
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images |
Leyang Wang et.al. |
2503.16376 |
null |
2025-03-20 |
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse |
Muyao Li et.al. |
2503.16365 |
null |
2025-03-20 |
CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners |
Yunzhi Yao et.al. |
2503.16356 |
link |
2025-03-20 |
Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences |
Krithik Ramesh et.al. |
2503.16351 |
null |
2025-03-20 |
LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates |
Ying Shen et.al. |
2503.16334 |
null |
2025-03-20 |
OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence |
Long Yuan et.al. |
2503.16326 |
null |
2025-03-20 |
Issue2Test: Generating Reproducing Test Cases from Issue Reports |
Noor Nashid et.al. |
2503.16320 |
null |
2025-03-21 |
Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 |
Peiran Gu et.al. |
2503.16304 |
null |
2025-03-20 |
SceneMI: Motion In-betweening for Modeling Human-Scene Interactions |
Inwoo Hwang et.al. |
2503.16289 |
null |
2025-03-21 |
Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens |
Shuqi Lu et.al. |
2503.16278 |
link |
2025-03-20 |
Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data |
Zijian Li et.al. |
2503.16260 |
null |
2025-03-20 |
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models |
Keda Tao et.al. |
2503.16257 |
null |
2025-03-21 |
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning |
Zhaowei Liu et.al. |
2503.16252 |
null |
2025-03-20 |
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t |
Quy-Anh Dang et.al. |
2503.16219 |
link |
2025-03-20 |
MathFusion: Enhancing Mathematic Problem-solving of LLM through Instruction Fusion |
Qizhi Pei et.al. |
2503.16212 |
link |
2025-03-20 |
VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis |
Chia-Yi Hsu et.al. |
2503.16195 |
null |
2025-03-21 |
Affective Polarization Amongst Swedish Politicians |
François t’Serstevens et.al. |
2503.16193 |
link |
2025-03-20 |
Large Language Models for Water Distribution Systems Modeling and Decision-Making |
Yinon Goldshtein et.al. |
2503.16191 |
null |
2025-03-20 |
CLS-RL: Image Classification with Rule-Based Reinforcement Learning |
Ming Li et.al. |
2503.16188 |
null |
2025-03-20 |
Narrowing Class-Wise Robustness Gaps in Adversarial Training |
Fatemeh Amerehi et.al. |
2503.16179 |
null |
2025-03-20 |
CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models |
Hong Yi Lin et.al. |
2503.16167 |
null |
2025-03-20 |
SpeCache: Speculative Key-Value Caching for Efficient Generation of LLMs |
Shibo Jie et.al. |
2503.16163 |
null |
2025-03-20 |
Towards Lighter and Robust Evaluation for Retrieval Augmented Generation |
Alex-Razvan Ispas et.al. |
2503.16161 |
null |
2025-03-20 |
Automatically Generating Chinese Homophone Words to Probe Machine Translation Estimation Systems |
Shenbin Qian et.al. |
2503.16158 |
null |
2025-03-20 |
Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models |
Mats Faulborn et.al. |
2503.16148 |
null |
2025-03-20 |
Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs |
Djamel Eddine Khelladi et.al. |
2503.16144 |
null |
2025-03-21 |
MKG-Rank: Enhancing Large Language Models with Knowledge Graph for Multilingual Medical Question Answering |
Feiyang Li et.al. |
2503.16131 |
null |
2025-03-20 |
The Impact of Revealing Large Language Model Stochasticity on Trust, Reliability, and Anthropomorphization |
Chelse Swoopes et.al. |
2503.16114 |
null |
2025-03-20 |
OSLoPrompt: Bridging Low-Supervision Challenges and Open-Set Domain Generalization in CLIP |
Mohamad Hassan N C et.al. |
2503.16106 |
null |
2025-03-20 |
Cultural Alignment in Large Language Models Using Soft Prompt Tuning |
Reem I. Masoud et.al. |
2503.16094 |
null |
2025-03-20 |
Quantum Chebyshev Probabilistic Models for Fragmentation Functions |
Jorge J. Martínez de Lejarza et.al. |
2503.16073 |
null |
2025-03-20 |
Tuning LLMs by RAG Principles: Towards LLM-native Memory |
Jiale Wei et.al. |
2503.16071 |
null |
2025-03-20 |
SALT: Singular Value Adaptation with Low-Rank Transformation |
Abdelrahman Elsayed et.al. |
2503.16055 |
null |
2025-03-20 |
Meta-Learning Neural Mechanisms rather than Bayesian Priors |
Michael Goodale et.al. |
2503.16048 |
null |
2025-03-20 |
Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation |
Zhiyu Cao et.al. |
2503.16043 |
null |
2025-03-20 |
GreenIQ: A Deep Search Platform for Comprehensive Carbon Market Analysis and Automated Report Generation |
Bisola Faith Kayode et.al. |
2503.16041 |
null |
2025-03-20 |
Evaluating Test-Time Scaling LLMs for Legal Reasoning: OpenAI o1, DeepSeek-R1, and Beyond |
Yaoyao Yu et.al. |
2503.16040 |
null |
2025-03-20 |
Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models |
Zhihang Liu et.al. |
2503.16036 |
null |
2025-03-20 |
The Lighthouse of Language: Enhancing LLM Agents via Critique-Guided Improvement |
Ruihan Yang et.al. |
2503.16024 |
null |
2025-03-20 |
BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models |
Zenghui Yuan et.al. |
2503.16023 |
null |
2025-03-20 |
Corrective In-Context Learning: Evaluating Self-Correction in Large Language Models |
Mario Sanz-Guerrero et.al. |
2503.16022 |
null |
2025-03-21 |
Autonomous AI imitators increase diversity in homogeneous information ecosystems |
Emil Bakkensen Johansen et.al. |
2503.16021 |
null |
2025-03-20 |
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions |
Xiaomeng Chu et.al. |
2503.16013 |
null |
2025-03-20 |
“This could save us months of work” – Use Cases of AI and Automation Support in Investigative Journalism |
Besjon Cifliku et.al. |
2503.16011 |
null |
2025-03-20 |
ECKGBench: Benchmarking Large Language Models in E-commerce Leveraging Knowledge Graph |
Langming Liu et.al. |
2503.15990 |
null |
2025-03-20 |
A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli |
Pengyu Liu et.al. |
2503.15978 |
null |
2025-03-20 |
Stability of Schrödinger bridges and Sinkhorn semigroups for log-concave models |
Pierre Del Moral et.al. |
2503.15963 |
null |
2025-03-20 |
GAN-enhanced Simulation-driven DNN Testing in Absence of Ground Truth |
Mohammed Attaoui et.al. |
2503.15953 |
null |
2025-03-20 |
From Chaos to Order: The Atomic Reasoner Framework for Fine-grained Reasoning in Large Language Models |
Jinyi Liu et.al. |
2503.15944 |
null |
2025-03-21 |
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment |
Gaole Dai et.al. |
2503.15937 |
null |
2025-03-20 |
Towards Automatic Continual Learning: A Self-Adaptive Framework for Continual Instruction Tuning |
Peiyi Lin et.al. |
2503.15924 |
null |
2025-03-20 |
SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models |
Fahao Chen et.al. |
2503.15921 |
null |
2025-03-20 |
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras |
Beilei Cui et.al. |
2503.15917 |
null |
2025-03-20 |
From Structured Prompts to Open Narratives: Measuring Gender Bias in LLMs Through Open-Ended Storytelling |
Evan Chen et.al. |
2503.15904 |
null |
2025-03-20 |
Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models |
Baolong Bi et.al. |
2503.15888 |
null |
2025-03-21 |
Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance |
Hui Liu et.al. |
2503.15886 |
null |
2025-03-20 |
DeepPsy-Agent: A Stage-Aware and Deep-Thinking Emotional Support Agent System |
Kai Chen et.al. |
2503.15876 |
null |
2025-03-20 |
MASH-VLM: Mitigating Action-Scene Hallucination in Video-LLMs through Disentangled Spatial-Temporal Representations |
Kyungho Bae et.al. |
2503.15871 |
null |
2025-03-20 |
TruthLens: Explainable DeepFake Detection for Face Manipulated and Fully Synthetic Data |
Rohit Kundu et.al. |
2503.15867 |
null |
2025-03-20 |
DroidTTP: Mapping Android Applications with TTP for Cyber Threat Intelligence |
Dincy R Arikkat et.al. |
2503.15866 |
null |
2025-03-20 |
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling |
Hyojun Go et.al. |
2503.15855 |
null |
2025-03-20 |
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey |
Xiaoou Liu et.al. |
2503.15850 |
null |
2025-03-20 |
Entropy-based Exploration Conduction for Multi-step Reasoning |
Jinghan Zhang et.al. |
2503.15848 |
null |
2025-03-20 |
Automatic Generation of Safety-compliant Linear Temporal Logic via Large Language Model: A Self-supervised Framework |
Junle Li et.al. |
2503.15840 |
null |
2025-03-20 |
Enhancing LLM Code Generation with Ensembles: A Similarity-Based Selection Approach |
Tarek Mahmud et.al. |
2503.15838 |
null |
2025-03-20 |
Fùxì: A Benchmark for Evaluating Language Models on Ancient Chinese Text Understanding and Generation |
Shangqing Zhao et.al. |
2503.15837 |
null |
2025-03-20 |
Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection |
Haotian Ma et.al. |
2503.15818 |
null |
2025-03-20 |
A Vision Centric Remote Sensing Benchmark |
Abduljaleel Adejumo et.al. |
2503.15816 |
null |
2025-03-20 |
Attention Pruning: Automated Fairness Repair of Language Models via Surrogate Simulated Annealing |
Vishnu Asutosh Dasu et.al. |
2503.15815 |
null |
2025-03-20 |
ChatGPT and U(X): A Rapid Review on Measuring the User Experience |
Katie Seaborn et.al. |
2503.15808 |
null |
2025-03-20 |
Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture |
Cheng Li et.al. |
2503.15807 |
null |
2025-03-20 |
DNA Bench: When Silence is Smarter – Benchmarking Over-Reasoning in Reasoning LLMs |
Masoud Hashemi et.al. |
2503.15793 |
null |
2025-03-20 |
RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models |
Parham Saremi et.al. |
2503.15784 |
null |
2025-03-20 |
Grammar and Gameplay-aligned RL for Game Description Generation with LLMs |
Tsunehiko Tanaka et.al. |
2503.15783 |
null |
2025-03-20 |
AutoDrive-QA- Automated Generation of Multiple-Choice Questions for Autonomous Driving Datasets Using Large Vision-Language Models |
Boshra Khalili et.al. |
2503.15778 |
null |
2025-03-20 |
Detecting LLM-Written Peer Reviews |
Vishisht Rao et.al. |
2503.15772 |
null |
2025-03-20 |
Towards Agentic AI Networking in 6G: A Generative Foundation Model-as-Agent Approach |
Yong Xiao et.al. |
2503.15764 |
null |
2025-03-20 |
Dialogic Learning in Child-Robot Interaction: A Hybrid Approach to Personalized Educational Content Generation |
Elena Malnatsky et.al. |
2503.15762 |
null |
2025-03-20 |
GraPLUS: Graph-based Placement Using Semantics for Image Composition |
Mir Mohammad Khaleghi et.al. |
2503.15761 |
null |
2025-03-20 |
AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration |
Andy Zhou et.al. |
2503.15754 |
null |
2025-03-20 |
Using Language Models to Decipher the Motivation Behind Human Behaviors |
Yutong Xie et.al. |
2503.15752 |
null |
2025-03-19 |
Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat |
Joseph Emmanuel DL Dayo et.al. |
2503.15726 |
null |
2025-03-21 |
Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication |
Sin-Yu Huang et.al. |
2503.15722 |
null |
2025-03-19 |
Am I eligible? Natural Language Inference for Clinical Trial Patient Recruitment: the Patient’s Point of View |
Mathilde Aguiar et.al. |
2503.15718 |
null |
2025-03-19 |
Safety Aware Task Planning via Large Language Models in Robotics |
Azal Ahmad Khan et.al. |
2503.15707 |
null |
2025-03-19 |
GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving |
William Ljungbergh et.al. |
2503.15672 |
null |
2025-03-19 |
Enhancing Pancreatic Cancer Staging with Large Language Models: The Role of Retrieval-Augmented Generation |
Hisashi Johno et.al. |
2503.15664 |
null |
2025-03-19 |
R $^2$ : A LLM Based Novel-to-Screenplay Generation Framework with Causal Plot Graphs |
Zefeng Lin et.al. |
2503.15655 |
null |
2025-03-19 |
LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning |
Federico Cocchi et.al. |
2503.15621 |
link |
2025-03-19 |
Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings |
Austin Xu et.al. |
2503.15620 |
null |
2025-03-19 |
SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks |
Yifei Zhou et.al. |
2503.15478 |
null |
2025-03-19 |
Cube: A Roblox View of 3D Intelligence |
Foundation AI Team et.al. |
2503.15475 |
null |
2025-03-19 |
EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining |
Boshen Xu et.al. |
2503.15470 |
null |
2025-03-19 |
From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment |
Jia-Nan Li et.al. |
2503.15463 |
null |
2025-03-19 |
Di $\mathtt{[M]}$ O: Distilling Masked Diffusion Models into One-step Generator |
Yuanzhi Zhu et.al. |
2503.15457 |
null |
2025-03-19 |
SkyLadder: Better and Faster Pretraining via Context Window Scheduling |
Tongyao Zhu et.al. |
2503.15450 |
null |
2025-03-19 |
Visual Position Prompt for MLLM based Visual Grounding |
Wei Tang et.al. |
2503.15426 |
null |
2025-03-19 |
Probing the topology of the space of tokens with structured prompts |
Michael Robinson et.al. |
2503.15421 |
null |
2025-03-19 |
LIFT: Latent Implicit Functions for Task- and Data-Agnostic Encoding |
Amirhossein Kazerouni et.al. |
2503.15420 |
null |
2025-03-19 |
Temporal Regularization Makes Your Video Generator Stronger |
Harold Haodong Chen et.al. |
2503.15417 |
null |
2025-03-19 |
Visual Persona: Foundation Model for Full-Body Human Customization |
Jisu Nam et.al. |
2503.15406 |
null |
2025-03-19 |
FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation |
Yumin Zhang et.al. |
2503.15390 |
null |
2025-03-19 |
Material Decomposition in Photon-Counting Computed Tomography with Diffusion Models: Comparative Study and Hybridization with Variational Regularizers |
Corentin Vazia et.al. |
2503.15383 |
null |
2025-03-19 |
EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models |
Yinan Liang et.al. |
2503.15369 |
null |
2025-03-19 |
SemEval-2025 Task 1: AdMIRe – Advancing Multimodal Idiomaticity Representation |
Thomas Pickard et.al. |
2503.15358 |
null |
2025-03-19 |
SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models |
I-Fan Lin et.al. |
2503.15351 |
null |
2025-03-19 |
TruthLens:A Training-Free Paradigm for DeepFake Detection |
Ritabrata Chakraborty et.al. |
2503.15342 |
null |
2025-03-19 |
Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs |
Yuqi Zhu et.al. |
2503.15341 |
null |
2025-03-19 |
Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context |
Junyi Ao et.al. |
2503.15338 |
null |
2025-03-19 |
Euclid Quick Data Release (Q1) Exploring galaxy properties with a multi-modal foundation model |
Euclid Collaboration et.al. |
2503.15312 |
null |
2025-03-19 |
Euclid Quick Data Release (Q1): First visual morphology catalogue |
Euclid Collaboration et.al. |
2503.15310 |
null |
2025-03-19 |
aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion |
Jia Li et.al. |
2503.15301 |
null |
2025-03-19 |
Inside-Out: Hidden Factual Knowledge in LLMs |
Zorik Gekhman et.al. |
2503.15299 |
null |
2025-03-19 |
SENAI: Towards Software Engineering Native Generative Artificial Intelligence |
Mootez Saad et.al. |
2503.15282 |
null |
2025-03-19 |
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration |
David Wan et.al. |
2503.15272 |
null |
2025-03-19 |
Do Chains-of-Thoughts of Large Language Models Suffer from Hallucinations, Cognitive Biases, or Phobias in Bayesian Reasoning? |
Roberto Araya et.al. |
2503.15268 |
null |
2025-03-19 |
LEGION: Learning to Ground and Explain for Synthetic Image Detection |
Hengrui Kang et.al. |
2503.15264 |
null |
2025-03-19 |
Efficient allocation of image recognition and LLM tasks on multi-GPU system |
Marcin Lawenda et.al. |
2503.15252 |
null |
2025-03-19 |
Automated Non-Functional Requirements Generation in Software Engineering with Large Language Models: A Comparative Study |
Jomar Thomas Almonte et.al. |
2503.15248 |
null |
2025-03-19 |
Exploring Large Language Models for Word Games:Who is the Spy? |
Chentian Wei et.al. |
2503.15235 |
null |
2025-03-19 |
When LLMs Meet API Documentation: Can Retrieval Augmentation Aid Code Generation Just as It Helps Developers? |
Jingyi Chen et.al. |
2503.15231 |
null |
2025-03-19 |
A Personalized Data-Driven Generative Model of Human Motion |
Angelo Di Porzio et.al. |
2503.15225 |
null |
2025-03-19 |
A Foundation Model for Patient Behavior Monitoring and Suicide Detection |
Rodrigo Oliver et.al. |
2503.15221 |
null |
2025-03-19 |
Context-Aware Vision Language Foundation Models for Ocular Disease Screening in Retinal Images |
Lucie Berger et.al. |
2503.15212 |
null |
2025-03-19 |
DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation |
Jiazhe Guo et.al. |
2503.15208 |
null |
2025-03-19 |
Benchmarking Large Language Models for Handwritten Text Recognition |
Giorgia Crosilla et.al. |
2503.15195 |
null |
2025-03-19 |
Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems |
Sejong Kim et.al. |
2503.15191 |
null |
2025-03-19 |
Foundation models may exhibit staged progression in novel CBRN threat disclosure |
Kevin M Esvelt et.al. |
2503.15182 |
null |
2025-03-19 |
A Review on Large Language Models for Visual Analytics |
Navya Sonal Agarwal et.al. |
2503.15176 |
null |
2025-03-19 |
Comparing Llama3 and DeepSeekR1 on Biomedical Text Classification Tasks |
Yuting Guo et.al. |
2503.15169 |
null |
2025-03-19 |
Object-Centric Pretraining via Target Encoder Bootstrapping |
Nikola Đukić et.al. |
2503.15141 |
null |
2025-03-19 |
VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention |
Mingzhe Zheng et.al. |
2503.15138 |
null |
2025-03-19 |
Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models |
Man Fai Wong et.al. |
2503.15129 |
null |
2025-03-19 |
Text-Derived Relational Graph-Enhanced Network for Skeleton-Based Action Segmentation |
Haoyu Ji et.al. |
2503.15126 |
null |
2025-03-19 |
Exploring Model Editing for LLM-based Aspect-Based Sentiment Classification |
Shichen Li et.al. |
2503.15117 |
null |
2025-03-19 |
DeCaFlow: A Deconfounding Causal Generative Model |
Alejandro Almodóvar et.al. |
2503.15114 |
null |
2025-03-19 |
Reasoning Effort and Problem Complexity: A Scaling Analysis in LLMs |
Benjamin Estermann et.al. |
2503.15113 |
null |
2025-03-19 |
OpenLLM-RTL: Open Dataset and Benchmark for LLM-Aided Design RTL Generation |
Shang Liu et.al. |
2503.15112 |
null |
2025-03-19 |
VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making |
Mohamed Salim Aissi et.al. |
2503.15108 |
null |
2025-03-19 |
Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings |
Zonghao Ying et.al. |
2503.15092 |
null |
2025-03-19 |
Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs |
Yao Cheng et.al. |
2503.15091 |
null |
2025-03-19 |
LogiAgent: Automated Logical Testing for REST Systems with LLM-Based Multi-Agents |
Ke Zhang et.al. |
2503.15079 |
null |
2025-03-19 |
Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis |
Imanol G. Estepa et.al. |
2503.15060 |
null |
2025-03-19 |
ELTEX: A Framework for Domain-Driven Synthetic Data Generation |
Arina Razmyslovich et.al. |
2503.15055 |
link |
2025-03-19 |
Studying and Understanding the Effectiveness and Failures of Conversational LLM-Based Repair |
Aolin Chen et.al. |
2503.15050 |
null |
2025-03-19 |
SPADE: Systematic Prompt Framework for Automated Dialogue Expansion in Machine-Generated Text Detection |
Haoyi Li et.al. |
2503.15044 |
null |
2025-03-19 |
DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling |
Jianbo Zhao et.al. |
2503.15029 |
null |
2025-03-19 |
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene |
Shengqiong Wu et.al. |
2503.15019 |
null |
2025-03-19 |
LLM Alignment for the Arabs: A Homogenous Culture or Diverse Ones? |
Amr Keleg et.al. |
2503.15003 |
null |
2025-03-19 |
Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering |
Francesco Maria Molfese et.al. |
2503.14996 |
null |
2025-03-19 |
ChatStitch: Visualizing Through Structures via Surround-View Unsupervised Deep Image Stitching with Collaborative LLM-Agents |
Hao Liang et.al. |
2503.14948 |
null |
2025-03-19 |
Generating Multimodal Driving Scenes via Next-Scene Prediction |
Yanhao Wu et.al. |
2503.14945 |
null |
2025-03-19 |
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation |
Qihui Zhang et.al. |
2503.14941 |
null |
2025-03-19 |
VisNumBench: Evaluating Number Sense of Multimodal Large Language Models |
Tengjin Weng et.al. |
2503.14939 |
null |
2025-03-19 |
Proceedings of the 3rd Italian Conference on Big Data and Data Science (ITADATA2024) |
Nicola Bena et.al. |
2503.14937 |
null |
2025-03-19 |
FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding |
Chongjun Tu et.al. |
2503.14935 |
null |
2025-03-19 |
Prada: Black-Box LLM Adaptation with Private Data on Resource-Constrained Devices |
Ziyao Wang et.al. |
2503.14932 |
null |
2025-03-19 |
GenM $^3$ : Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation |
Junyu Shi et.al. |
2503.14919 |
null |
2025-03-19 |
MASS: Mathematical Data Selection via Skill Graphs for Pretraining Large Language Models |
Jiazheng Li et.al. |
2503.14917 |
null |
2025-03-19 |
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology |
Siyuan Yan et.al. |
2503.14911 |
null |
2025-03-19 |
POSTA: A Go-to Framework for Customized Artistic Poster Generation |
Haoyu Chen et.al. |
2503.14908 |
null |
2025-03-19 |
Deep Contrastive Unlearning for Language Models |
Estrid He et.al. |
2503.14900 |
null |
2025-03-19 |
When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach |
Vaibhav Rathore et.al. |
2503.14897 |
null |
2025-03-19 |
Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations |
Shuo Li et.al. |
2503.14895 |
null |
2025-03-19 |
MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer |
Honglin Lin et.al. |
2503.14891 |
null |
2025-03-19 |
Pseudo-Relevance Feedback Can Improve Zero-Shot LLM-Based Dense Retrieval |
Hang Li et.al. |
2503.14887 |
null |
2025-03-19 |
Envisioning an AI-Enhanced Mental Health Ecosystem |
Kellie Yu Hui Sim et.al. |
2503.14883 |
null |
2025-03-19 |
Communication-Efficient Distributed On-Device LLM Inference Over Wireless Networks |
Kai Zhang et.al. |
2503.14882 |
null |
2025-03-19 |
Chemical Foundation Model Guided Design of High Ionic Conductivity Electrolyte Formulations |
Murtaza Zohair et.al. |
2503.14878 |
null |
2025-03-19 |
Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection |
Peipeng Yu et.al. |
2503.14853 |
null |
2025-03-19 |
LogLLaMA: Transformer-based log anomaly detection with LLaMA |
Zhuoyi Yang et.al. |
2503.14849 |
null |
2025-03-19 |
Think Like Human Developers: Harnessing Community Knowledge for Structured Code Reasoning |
Chengran Yang et.al. |
2503.14838 |
null |
2025-03-19 |
Robust Transmission of Punctured Text with Large Language Model-based Recovery |
Sojeong Park et.al. |
2503.14831 |
null |
2025-03-19 |
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models |
Chejian Xu et.al. |
2503.14827 |
null |
2025-03-18 |
Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection |
Matt Franchi et.al. |
2503.14754 |
null |
2025-03-18 |
Uncertainty Distillation: Teaching Language Models to Express Semantic Confidence |
Sophia Hager et.al. |
2503.14749 |
null |
2025-03-18 |
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots |
NVIDIA et.al. |
2503.14734 |
null |
2025-03-18 |
CodingGenie: A Proactive LLM-Powered Programming Assistant |
Sebastian Zhao et.al. |
2503.14724 |
null |
2025-03-18 |
Generating Medically-Informed Explanations for Depression Detection using LLMs |
Xiangyong Chen et.al. |
2503.14671 |
null |
2025-03-18 |
RAGO: Systematic Performance Optimization for Retrieval-Augmented Generation Serving |
Wenqi Jiang et.al. |
2503.14649 |
null |
2025-03-18 |
Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache |
Hanchen Li et.al. |
2503.14647 |
null |
2025-03-18 |
Reinforcement learning-based motion imitation for physiologically plausible musculoskeletal motor control |
Merkourios Simos et.al. |
2503.14637 |
null |
2025-03-18 |
Assessing Large Language Models for Automated Feedback Generation in Learning Programming Problem Solving |
Priscylla Silva et.al. |
2503.14630 |
null |
2025-03-18 |
Image Captioning Evaluation in the Age of Multimodal LLMs: Challenges and Future Perspectives |
Sara Sarto et.al. |
2503.14604 |
null |
2025-03-18 |
Command R7B Arabic: A Small, Enterprise Focused, Multilingual, and Culturally Aware Arabic LLM |
Yazeed Alnumay et.al. |
2503.14603 |
null |
2025-03-18 |
Aligning Multimodal LLM with Human Preference: A Survey |
Tao Yu et.al. |
2503.14504 |
null |
2025-03-18 |
Deeply Supervised Flow-Based Generative Models |
Inkyu Shin et.al. |
2503.14494 |
null |
2025-03-18 |
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control |
NVIDIA et.al. |
2503.14492 |
link |
2025-03-18 |
Engineering Scientific Assistants using Interactive Structured Induction of Programs |
Shraddha Surana et.al. |
2503.14488 |
null |
2025-03-18 |
Gricean Norms as a Basis for Effective Collaboration |
Fardin Saad et.al. |
2503.14484 |
link |
2025-03-18 |
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing |
Yulin Pan et.al. |
2503.14482 |
null |
2025-03-18 |
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM |
Xinyu Fang et.al. |
2503.14478 |
link |
2025-03-18 |
The Atacama Cosmology Telescope: DR6 Constraints on Extended Cosmological Models |
Erminia Calabrese et.al. |
2503.14454 |
null |
2025-03-18 |
Bolt3D: Generating 3D Scenes in Seconds |
Stanislaw Szymanowicz et.al. |
2503.14445 |
null |
2025-03-18 |
EnvBench: A Benchmark for Automated Environment Setup |
Aleksandra Eliseeva et.al. |
2503.14443 |
link |
2025-03-18 |
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers |
Nikhil Abhyankar et.al. |
2503.14434 |
link |
2025-03-18 |
PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play |
Wei Fang et.al. |
2503.14432 |
null |
2025-03-18 |
Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models |
Siwei Zhang et.al. |
2503.14411 |
null |
2025-03-18 |
Large Language Models for Virtual Human Gesture Selection |
Parisa Ghanad Torshizi et.al. |
2503.14408 |
null |
2025-03-18 |
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers |
Mert Bulent Sariyildiz et.al. |
2503.14405 |
null |
2025-03-18 |
Diffusion-based Facial Aesthetics Enhancement with 3D Structure Guidance |
Lisha Li et.al. |
2503.14402 |
null |
2025-03-18 |
From “Hallucination” to “Suture”: Insights from Language Philosophy to Enhance Large Language Models |
Qiantong Wang et.al. |
2503.14392 |
null |
2025-03-18 |
How much do LLMs learn from negative examples? |
Shadi Hamdan et.al. |
2503.14391 |
null |
2025-03-18 |
Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation |
Rikuto Tsuchida et.al. |
2503.14382 |
null |
2025-03-18 |
On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller? |
Pouria Sarhadi et.al. |
2503.14379 |
link |
2025-03-18 |
Impossible Videos |
Zechen Bai et.al. |
2503.14378 |
null |
2025-03-18 |
RFMI: Estimating Mutual Information on Rectified Flow for Text-to-Image Alignment |
Chao Wang et.al. |
2503.14358 |
null |
2025-03-18 |
MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts |
Runqi Meng et.al. |
2503.14355 |
null |
2025-03-18 |
MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration |
Yisen Xu et.al. |
2503.14340 |
null |
2025-03-18 |
DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies |
Wei Song et.al. |
2503.14324 |
link |
2025-03-18 |
COPA: Comparing the Incomparable to Explore the Pareto Front |
Adrián Javaloy et.al. |
2503.14321 |
null |
2025-03-18 |
RoMedFormer: A Rotary-Embedding Transformer Foundation Model for 3D Genito-Pelvic Structure Segmentation in MRI and CT |
Yuheng Li et.al. |
2503.14304 |
null |
2025-03-18 |
Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs |
Nicolas Le Roux et.al. |
2503.14286 |
null |
2025-03-18 |
DARS: Dynamic Action Re-Sampling to Enhance Coding Agent Performance by Adaptive Tree Traversal |
Vaibhav Aggarwal et.al. |
2503.14269 |
link |
2025-03-18 |
Quantization-Free Autoregressive Action Transformer |
Ziyad Sheebaelhamd et.al. |
2503.14259 |
link |
2025-03-18 |
InnerSelf: Designing Self-Deepfaked Voice for Emotional Well-being |
Guang Dai et.al. |
2503.14257 |
null |
2025-03-18 |
Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search |
Yu Feng et.al. |
2503.14251 |
null |
2025-03-19 |
KG-IRAG: A Knowledge Graph-Based Iterative Retrieval-Augmented Generation Framework for Temporal Reasoning |
Ruiyi Yang et.al. |
2503.14234 |
null |
2025-03-18 |
CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models |
Yuyang Xue et.al. |
2503.14232 |
null |
2025-03-18 |
Decision Tree Induction Through LLMs via Semantically-Aware Evolution |
Tennison Liu et.al. |
2503.14217 |
null |
2025-03-18 |
Inferring Event Descriptions from Time Series with Language Models |
Mingtian Tan et.al. |
2503.14190 |
link |
2025-03-18 |
Towards Harmless Multimodal Assistants with Blind Preference Optimization |
Yongqi Li et.al. |
2503.14189 |
null |
2025-03-18 |
Can LLMs Enable Verification in Mainstream Programming? |
Aleksandr Shefer et.al. |
2503.14183 |
null |
2025-03-18 |
EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models |
Zongyun Zhang et.al. |
2503.14162 |
null |
2025-03-18 |
Speculative Decoding for Verilog: Speed and Quality, All in One |
Changran Xu et.al. |
2503.14153 |
null |
2025-03-18 |
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding |
Zining Wang et.al. |
2503.14140 |
null |
2025-03-18 |
CARE: A QLoRA-Fine Tuned Multi-Domain Chatbot With Fast Learning On Minimal Hardware |
Ankit Dutta et.al. |
2503.14136 |
null |
2025-03-18 |
Inference-Time Intervention in Large Language Models for Reliable Requirement Verification |
Paul Darm et.al. |
2503.14130 |
null |
2025-03-18 |
SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models |
Subhadeep Koley et.al. |
2503.14129 |
null |
2025-03-18 |
PET-MAD, a universal interatomic potential for advanced materials modeling |
Arslan Mazitov et.al. |
2503.14118 |
link |
2025-03-18 |
DangerMaps: Personalized Safety Advice for Travel in Urban Environments using a Retrieval-Augmented Language Model |
Jonas Oppenlaender et.al. |
2503.14103 |
null |
2025-03-18 |
Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency |
Jiangxuan Long et.al. |
2503.14076 |
null |
2025-03-18 |
Fast Autoregressive Video Generation with Diagonal Decoding |
Yang Ye et.al. |
2503.14070 |
null |
2025-03-18 |
AIGVE-Tool: AI-Generated Video Evaluation Toolkit with Multifaceted Benchmark |
Xinhao Xiang et.al. |
2503.14064 |
link |
2025-03-18 |
Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach |
Tianshu Wu et.al. |
2503.14051 |
null |
2025-03-18 |
Learning on LLM Output Signatures for gray-box LLM Behavior Analysis |
Guy Bar-Shalom et.al. |
2503.14043 |
link |
2025-03-18 |
Intra and Inter Parser-Prompted Transformers for Effective Image Restoration |
Cong Wang et.al. |
2503.14037 |
link |
2025-03-18 |
Synthetic Data Generation Using Large Language Models: Advances in Text and Code |
Mihai Nadas et.al. |
2503.14023 |
null |
2025-03-18 |
MP-GUI: Modality Perception with MLLMs for GUI Understanding |
Ziwei Wang et.al. |
2503.14021 |
link |
2025-03-18 |
Predicting Human Choice Between Textually Described Lotteries |
Eyal Marantz et.al. |
2503.14004 |
null |
2025-03-18 |
MeshFleet: Filtered and Annotated 3D Vehicle Dataset for Domain Specific Generative Modeling |
Damian Boborzi et.al. |
2503.14002 |
link |
2025-03-18 |
The KoLMogorov Test: Compression by Code Generation |
Ori Yoran et.al. |
2503.13992 |
null |
2025-03-18 |
Empowering Smaller Models: Tuning LLaMA and Gemma with Chain-of-Thought for Ukrainian Exam Tasks |
Mykyta Syromiatnikov et.al. |
2503.13988 |
link |
2025-03-18 |
DefectFill: Realistic Defect Generation with Inpainting Diffusion Model for Visual Inspection |
Jaewoo Song et.al. |
2503.13985 |
null |
2025-03-18 |
SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability |
Jiankang Wang et.al. |
2503.13983 |
null |
2025-03-18 |
Empowering LLMs in Decision Games through Algorithmic Data Synthesis |
Haolin Wang et.al. |
2503.13980 |
null |
2025-03-18 |
FlexVLN: Flexible Adaptation for Diverse Vision-and-Language Navigation Tasks |
Siqi Zhang et.al. |
2503.13966 |
null |
2025-03-18 |
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding |
Siwei Han et.al. |
2503.13964 |
link |
2025-03-18 |
Survey of Adversarial Robustness in Multimodal Large Language Models |
Chengze Jiang et.al. |
2503.13962 |
null |
2025-03-18 |
Improving LLM Video Understanding with 16 Frames Per Second |
Yixuan Li et.al. |
2503.13956 |
null |
2025-03-18 |
ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models |
Alexey Karev et.al. |
2503.13923 |
null |
2025-03-18 |
MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments |
Zhengsheng Guo et.al. |
2503.13882 |
null |
2025-03-18 |
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation |
Donggon Jang et.al. |
2503.13881 |
link |
2025-03-18 |
Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment |
Wei Chen et.al. |
2503.13879 |
null |
2025-03-18 |
Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations |
Rui Yang et.al. |
2503.13857 |
null |
2025-03-18 |
MDTeamGPT: A Self-Evolving LLM-based Multi-Agent Framework for Multi-Disciplinary Team Medical Consultation |
Kai Chen et.al. |
2503.13856 |
null |
2025-03-18 |
Causal Discovery from Data Assisted by Large Language Models |
Kamyar Barakati et.al. |
2503.13833 |
null |
2025-03-18 |
Scale-Aware Contrastive Reverse Distillation for Unsupervised Medical Anomaly Detection |
Chunlei Li et.al. |
2503.13828 |
link |
2025-03-18 |
LLM-Empowered IoT for 6G Networks: Architecture, Challenges, and Solutions |
Xiaopei Chen et.al. |
2503.13819 |
null |
2025-03-18 |
Automatic MILP Model Construction for Multi-Robot Task Allocation and Scheduling Based on Large Language Models |
Mingming Peng et.al. |
2503.13813 |
null |
2025-03-18 |
The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations |
Suyash Fulay et.al. |
2503.13812 |
null |
2025-03-18 |
Empowering GraphRAG with Knowledge Filtering and Integration |
Kai Guo et.al. |
2503.13804 |
null |
2025-03-18 |
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation |
Yang Zhou et.al. |
2503.13794 |
null |
2025-03-18 |
Mapping the Trust Terrain: LLMs in Software Engineering – Insights and Perspectives |
Dipin Khati et.al. |
2503.13793 |
null |
2025-03-17 |
Mitigating KV Cache Competition to Enhance User Experience in LLM Inference |
Haiying Shen et.al. |
2503.13773 |
null |
2025-03-17 |
Do Large Language Models Understand Performance Optimization? |
Bowen Cui et.al. |
2503.13772 |
null |
2025-03-17 |
Continual Unlearning for Foundational Text-to-Image Models without Generalization Erosion |
Kartik Thakral et.al. |
2503.13769 |
null |
2025-03-17 |
AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications |
Haiying Shen et.al. |
2503.13737 |
null |
2025-03-17 |
CoDet-M4: Detecting Machine-Generated Code in Multi-Lingual, Multi-Generator and Multi-Domain Settings |
Daniil Orel et.al. |
2503.13733 |
null |
2025-03-17 |
FiVE: A Fine-grained Video Editing Benchmark for Evaluating Emerging Diffusion and Rectified Flow Models |
Minghan Li et.al. |
2503.13684 |
null |
2025-03-17 |
Pensez: Less Data, Better Reasoning – Rethinking French LLM |
Huy Hoang Ha et.al. |
2503.13661 |
null |
2025-03-17 |
INPROVF: Leveraging Large Language Models to Repair High-level Robot Controllers from Assumption Violations |
Qian Meng et.al. |
2503.13660 |
null |
2025-03-17 |
SOSecure: Safer Code Generation with RAG and StackOverflow Discussions |
Manisha Mukherjee et.al. |
2503.13654 |
null |
2025-03-17 |
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos |
Chiara Plizzari et.al. |
2503.13646 |
link |
2025-03-17 |
Plasmon-Plasmon Interaction in Nanoparticle Assemblies: Role of the Dipole-Quadrupole Coupling |
Olivier Masset et.al. |
2503.13645 |
null |
2025-03-17 |
Evaluating Programming Language Confusion |
Micheline Bénédicte Moumoula et.al. |
2503.13620 |
null |
2025-03-17 |
MetaScale: Test-Time Scaling with Evolving Meta-Thoughts |
Qin Liu et.al. |
2503.13447 |
null |
2025-03-17 |
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation |
Zhenyu Wu et.al. |
2503.13446 |
null |
2025-03-17 |
Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance |
Noah Y. Siegel et.al. |
2503.13445 |
null |
2025-03-17 |
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning |
Ye Liu et.al. |
2503.13444 |
null |
2025-03-17 |
Amodal3R: Amodal 3D Reconstruction from Occluded 2D Images |
Tianhao Wu et.al. |
2503.13439 |
null |
2025-03-17 |
xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference |
Maximilian Beck et.al. |
2503.13427 |
link |
2025-03-17 |
Infinite Mobility: Scalable High-Fidelity Synthesis of Articulated Objects via Procedural Generation |
Xinyu Lian et.al. |
2503.13424 |
null |
2025-03-17 |
A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives |
Weiqiang Jin et.al. |
2503.13415 |
null |
2025-03-18 |
DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective |
Dengyun Peng et.al. |
2503.13413 |
link |
2025-03-17 |
Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis |
Alexander Ku et.al. |
2503.13401 |
null |
2025-03-17 |
MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research |
James Burgess et.al. |
2503.13399 |
link |
2025-03-17 |
Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning |
Mengyao Lyu et.al. |
2503.13383 |
null |
2025-03-17 |
Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning |
Hai-Long Sun et.al. |
2503.13360 |
null |
2025-03-17 |
Agents Play Thousands of 3D Video Games |
Zhongwen Xu et.al. |
2503.13356 |
null |
2025-03-17 |
Valid Text-to-SQL Generation with Unification-based DeepStochLog |
Ying Jiao et.al. |
2503.13342 |
link |
2025-03-17 |
LearnMate: Enhancing Online Education with LLM-Powered Personalized Learning Plans and Support |
Xinyu Jessica Wang et.al. |
2503.13340 |
null |
2025-03-17 |
LEAVS: An LLM-based Labeler for Abdominal CT Supervision |
Ricardo Bigolin Lanfredi et.al. |
2503.13330 |
link |
2025-03-17 |
Edit Transfer: Learning Image Editing via Vision In-Context Relations |
Lan Chen et.al. |
2503.13327 |
null |
2025-03-17 |
Computation Mechanism Behind LLM Position Generalization |
Chi Han et.al. |
2503.13305 |
null |
2025-03-17 |
LIMCA: LLM for Automating Analog In-Memory Computing Architecture Design Exploration |
Deepak Vungarala et.al. |
2503.13301 |
null |
2025-03-17 |
A Survey on Transformer Context Extension: Approaches and Evaluation |
Yijun Liu et.al. |
2503.13299 |
null |
2025-03-17 |
LLM-Match: An Open-Sourced Patient Matching Model Based on Large Language Models and Retrieval-Augmented Generation |
Xiaodi Li et.al. |
2503.13281 |
null |
2025-03-17 |
Knowledge-Aware Iterative Retrieval for Multi-Agent Systems |
Seyoung Song et.al. |
2503.13275 |
null |
2025-03-17 |
Graph Generative Models Evaluation with Masked Autoencoder |
Chengen Wang et.al. |
2503.13271 |
null |
2025-03-17 |
TablePilot; Recommending Human-Preferred Tabular Data Analysis with Large Language Models |
Deyin Yi et.al. |
2503.13262 |
null |
2025-03-17 |
MindEye-OmniAssist: A Gaze-Driven LLM-Enhanced Assistive Robot System for Implicit Intention Recognition and Task Execution |
Zejia Zhang et.al. |
2503.13250 |
null |
2025-03-17 |
Can Language Models Follow Multiple Turns of Entangled Instructions? |
Chi Han et.al. |
2503.13222 |
null |
2025-03-17 |
Dense Policy: Bidirectional Autoregressive Learning of Actions |
Yue Su et.al. |
2503.13217 |
null |
2025-03-17 |
MedLoRD: A Medical Low-Resource Diffusion Model for High-Resolution 3D CT Image Synthesis |
Marvin Seyfarth et.al. |
2503.13211 |
null |
2025-03-17 |
Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach |
Sinan Fan et.al. |
2503.13208 |
null |
2025-03-17 |
MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways |
Zhen Chen et.al. |
2503.13205 |
null |
2025-03-17 |
3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o |
Dingning Liu et.al. |
2503.13185 |
null |
2025-03-17 |
Are LLMs (Really) Ideological? An IRT-based Analysis and Alignment Tool for Perceived Socio-Economic Bias in LLMs |
Jasmin Wachter et.al. |
2503.13149 |
null |
2025-03-17 |
Patient-specific radiomic feature selection with reconstructed healthy persona of knee MR images |
Yaxi Chen et.al. |
2503.13131 |
null |
2025-03-17 |
3D Human Interaction Generation: A Survey |
Siyuan Fan et.al. |
2503.13120 |
null |
2025-03-17 |
VeriLeaky: Navigating IP Protection vs Utility in Fine-Tuning for LLM-Driven Verilog Coding |
Zeng Wang et.al. |
2503.13116 |
null |
2025-03-17 |
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs |
Erik Daxberger et.al. |
2503.13111 |
null |
2025-03-17 |
DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry |
Jing Li et.al. |
2503.13110 |
null |
2025-03-17 |
Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences |
Kedi Chen et.al. |
2503.13109 |
null |
2025-03-17 |
Lifting the Veil on Visual Information Flow in MLLMs: Unlocking Pathways to Faster Inference |
Hao Yin et.al. |
2503.13108 |
link |
2025-03-17 |
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models |
Hao Yin et.al. |
2503.13107 |
link |
2025-03-17 |
Managing Hybrid Solid-State Drives Using Large Language Models |
Qian Wei et.al. |
2503.13105 |
null |
2025-03-17 |
REPA: Russian Error Types Annotation for Evaluating Text Generation and Judgment Capabilities |
Alexander Pugachev et.al. |
2503.13102 |
null |
2025-03-17 |
Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa |
Babangida Sani et.al. |
2503.13101 |
null |
2025-03-17 |
ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning |
Baohao Liao et.al. |
2503.13089 |
null |
2025-03-17 |
A Framework to Assess Multilingual Vulnerabilities of LLMs |
Likai Tang et.al. |
2503.13081 |
null |
2025-03-17 |
Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation |
Yihong Luo et.al. |
2503.13070 |
null |
2025-03-17 |
Do Vision Models Develop Human-Like Progressive Difficulty Understanding? |
Zeyi Huang et.al. |
2503.13058 |
null |
2025-03-17 |
MaskSDM with Shapley values to improve flexibility, robustness, and explainability in species distribution modeling |
Robin Zbinden et.al. |
2503.13057 |
null |
2025-03-17 |
Mitigating Cross-Modal Distraction and Ensuring Geometric Feasibility via Affordance-Guided, Self-Consistent MLLMs for Food Preparation Task Planning |
Yu-Hong Shen et.al. |
2503.13055 |
null |
2025-03-17 |
InsightDrive: Insight Scene Representation for End-to-End Autonomous Driving |
Ruiqi Song et.al. |
2503.13047 |
null |
2025-03-17 |
Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task |
Junjie Chen et.al. |
2503.13038 |
null |
2025-03-17 |
How Good is my Histopathology Vision-Language Foundation Model? A Holistic Benchmark |
Roba Al Majzoub et.al. |
2503.12990 |
link |
2025-03-17 |
A Multi-Stage Framework with Taxonomy-Guided Reasoning for Occupation Classification Using Large Language Models |
Palakorn Achananuparp et.al. |
2503.12989 |
null |
2025-03-17 |
ROMA: a Read-Only-Memory-based Accelerator for QLoRA-based On-Device LLM |
Wenqiang Wang et.al. |
2503.12988 |
null |
2025-03-17 |
Aligning Vision to Language: Text-Free Multimodal Knowledge Graph Construction for Enhanced LLMs Reasoning |
Junming Liu et.al. |
2503.12972 |
null |
2025-03-17 |
Optimal Denoising in Score-Based Generative Models: The Role of Data Regularity |
Eliot Beyler et.al. |
2503.12966 |
null |
2025-03-17 |
Training Video Foundation Models with NVIDIA NeMo |
Zeeshan Patel et.al. |
2503.12964 |
null |
2025-03-17 |
HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding |
Jiahe Zhao et.al. |
2503.12955 |
null |
2025-03-17 |
HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model |
Haiyang Guo et.al. |
2503.12941 |
null |
2025-03-17 |
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization |
Jingyi Zhang et.al. |
2503.12937 |
null |
2025-03-17 |
Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs |
Wei Hung et.al. |
2503.12932 |
null |
2025-03-17 |
MirrorGuard: Adaptive Defense Against Jailbreaks via Entropy-Guided Mirror Crafting |
Rui Pu et.al. |
2503.12931 |
null |
2025-03-17 |
Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models |
Zhiyi Huang et.al. |
2503.12923 |
null |
2025-03-17 |
ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs |
Pengcheng Wen et.al. |
2503.12918 |
null |
2025-03-17 |
HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models |
Xinyan Jiang et.al. |
2503.12908 |
null |
2025-03-17 |
Optimizing Ansatz Design in Quantum Generative Adversarial Networks Using Large Language Models |
Kento Ueda et.al. |
2503.12884 |
null |
2025-03-17 |
nvBench 2.0: A Benchmark for Natural Language to Visualization under Ambiguity |
Tianqi Luo et.al. |
2503.12880 |
null |
2025-03-17 |
An interpretable approach to automating the assessment of biofouling in video footage |
Evelyn J. Mannix et.al. |
2503.12875 |
null |
2025-03-17 |
UniReg: Foundation Model for Controllable Medical Image Registration |
Zi Li et.al. |
2503.12868 |
null |
2025-03-17 |
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English |
Duke Nguyen et.al. |
2503.12858 |
null |
2025-03-17 |
Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation |
Songjun Tu et.al. |
2503.12854 |
null |
2025-03-17 |
ACT360: An Efficient 360-Degree Action Detection and Summarization Framework for Mission-Critical Training and Debriefing |
Aditi Tiwari et.al. |
2503.12852 |
null |
2025-03-17 |
GuideDog: A Real-World Egocentric Multimodal Dataset for Blind and Low-Vision Accessibility-Aware Guidance |
Junhyeok Kim et.al. |
2503.12844 |
null |
2025-03-18 |
Towards Scalable Foundation Model for Multi-modal and Hyperspectral Geospatial Data |
Haozhe Si et.al. |
2503.12843 |
null |
2025-03-17 |
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules |
Kairong Luo et.al. |
2503.12811 |
null |
2025-03-17 |
Grounded Chain-of-Thought for Multimodal Large Language Models |
Qiong Wu et.al. |
2503.12799 |
null |
2025-03-18 |
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding |
Xinyu Ma et.al. |
2503.12797 |
link |
2025-03-17 |
Quantum-Enhanced LLM Efficient Fine Tuning |
Xiaofei Kong et.al. |
2503.12790 |
null |
2025-03-17 |
SAM2 for Image and Video Segmentation: A Comprehensive Survey |
Zhang Jiaxing et.al. |
2503.12781 |
null |
2025-03-17 |
NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models |
Sung-Yeon Park et.al. |
2503.12772 |
null |
2025-03-17 |
A Survey on Human Interaction Motion Generation |
Kewei Sui et.al. |
2503.12763 |
link |
2025-03-17 |
RAG-RL: Advancing Retrieval-Augmented Generation via RL and Curriculum Learning |
Jerry Huang et.al. |
2503.12759 |
null |
2025-03-17 |
VasTSD: Learning 3D Vascular Tree-state Space Diffusion Model for Angiography Synthesis |
Zhifeng Wang et.al. |
2503.12758 |
null |
2025-03-17 |
MAP: Multi-user Personalization with Collaborative LLM-powered Agents |
Christine Lee et.al. |
2503.12757 |
link |
2025-03-17 |
Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering |
Kenneth J. K. Ong et.al. |
2503.12722 |
null |
2025-03-17 |
Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective |
Luca Collini et.al. |
2503.12721 |
null |
2025-03-16 |
AnyCalib: On-Manifold Learning for Model-Agnostic Single-View Camera Calibration |
Javier Tirado-Garín et.al. |
2503.12701 |
null |
2025-03-16 |
A Continual Learning-driven Model for Accurate and Generalizable Segmentation of Clinically Comprehensive and Fine-grained Whole-body Anatomies in CT |
Dazhou Guo et.al. |
2503.12698 |
null |
2025-03-16 |
AI Agents: Evolution, Architecture, and Real-World Applications |
Naveen Krishnan et.al. |
2503.12687 |
null |
2025-03-16 |
ZO2: Scalable Zeroth-Order Fine-Tuning for Extremely Large Language Models with Limited GPU Memory |
Liangyu Wang et.al. |
2503.12668 |
link |
2025-03-16 |
Plausibility Vaccine: Injecting LLM Knowledge for Event Plausibility |
Jacob Chmura et.al. |
2503.12667 |
null |
2025-03-16 |
Quantum Chemistry Driven Molecular Inverse Design with Data-free Reinforcement Learning |
Francesco Calcagno et.al. |
2503.12653 |
null |
2025-03-16 |
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing |
Tsu-Jui Fu et.al. |
2503.12652 |
null |
2025-03-16 |
VeriLA: A Human-Centered Evaluation Framework for Interpretable Verification of LLM Agent Failures |
Yoo Yeon Sung et.al. |
2503.12651 |
null |
2025-03-16 |
FW-Merging: Scaling Model Merging with Frank-Wolfe Optimization |
Hao Mark Chen et.al. |
2503.12649 |
null |
2025-03-16 |
LATINO-PRO: LAtent consisTency INverse sOlver with PRompt Optimization |
Alessio Spagnoletti et.al. |
2503.12615 |
null |
2025-03-16 |
VISO-Grasp: Vision-Language Informed Spatial Object-centric 6-DoF Active View Planning and Grasping in Clutter and Invisibility |
Yitian Shi et.al. |
2503.12609 |
null |
2025-03-16 |
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey |
Yaoting Wang et.al. |
2503.12605 |
link |
2025-03-16 |
SynLlama: Generating Synthesizable Molecules and Their Analogs with Large Language Models |
Kunyang Sun et.al. |
2503.12602 |
null |
2025-03-14 |
From few to many maps: A fast map-level emulator for extreme augmentation of CMB systematics datasets |
P. Campeti et.al. |
2503.11643 |
link |
2025-03-14 |
Gradient-bridged Posterior: Bayesian Inference for Models with Implicit Functions |
Cheng Zeng et.al. |
2503.11637 |
null |
2025-03-14 |
ASMA-Tune: Unlocking LLMs’ Assembly Code Comprehension via Structural-Semantic Instruction Tuning |
Xinyi Wang et.al. |
2503.11617 |
link |
2025-03-14 |
Pathology Image Compression with Pre-trained Autoencoders |
Srikar Yellapragada et.al. |
2503.11591 |
null |
2025-03-14 |
Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space |
Zhiliang Chen et.al. |
2503.11586 |
link |
2025-03-14 |
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion |
Ahmed Nassar et.al. |
2503.11576 |
null |
2025-03-14 |
Synthesizing Access Control Policies using Large Language Models |
Adarsh Vatsa et.al. |
2503.11573 |
null |
2025-03-14 |
Implicit Bias-Like Patterns in Reasoning Models |
Messi H. J. Lee et.al. |
2503.11572 |
null |
2025-03-14 |
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity |
Jing Bi et.al. |
2503.11557 |
null |
2025-03-14 |
AugGen: Synthetic Augmentation Can Improve Discriminative Models |
Parsa Rahimi et.al. |
2503.11544 |
null |
2025-03-14 |
Potential of large language model-powered nudges for promoting daily water and energy conservation |
Zonghan Li et.al. |
2503.11531 |
null |
2025-03-14 |
Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models |
Hao Cheng et.al. |
2503.11519 |
null |
2025-03-14 |
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models |
Ziqin Zhou et.al. |
2503.11513 |
null |
2025-03-14 |
Perfect Stabilization of Biomolecular Adhesions under Load |
Anton F. Burnet et.al. |
2503.11510 |
null |
2025-03-14 |
V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning |
Zixu Cheng et.al. |
2503.11495 |
null |
2025-03-14 |
A Review of DeepSeek Models’ Key Innovative Techniques |
Chengen Wang et.al. |
2503.11486 |
null |
2025-03-14 |
Exponential Quantum Advantage for Simulating Open Classical Systems |
Agi Villanyi et.al. |
2503.11483 |
null |
2025-03-14 |
T2I-FineEval: Fine-Grained Compositional Metric for Text-to-Image Evaluation |
Seyed Mohammad Hadi Hosseini et.al. |
2503.11481 |
null |
2025-03-14 |
Integrating LLMs in Gamified Systems |
Carlos J. Costa et.al. |
2503.11458 |
null |
2025-03-14 |
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning |
Jia Zhang et.al. |
2503.11441 |
null |
2025-03-14 |
Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models |
Xu Liu et.al. |
2503.11411 |
null |
2025-03-14 |
A Framework for a Capability-driven Evaluation of Scenario Understanding for Multimodal Large Language Models in Autonomous Driving |
Tin Stribor Sohn et.al. |
2503.11400 |
null |
2025-03-14 |
Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages |
Jiyeong Kim et.al. |
2503.11384 |
null |
2025-03-14 |
Modeling Subjectivity in Cognitive Appraisal with Language Models |
Yuxiang Zhou et.al. |
2503.11381 |
null |
2025-03-14 |
Annotating Scientific Uncertainty: A comprehensive model using linguistic patterns and comparison with existing approaches |
Panggih Kusuma Ningrum et.al. |
2503.11376 |
null |
2025-03-14 |
Cornstarch: Distributed Multimodal Training Must Be Multimodality-Aware |
Insu Jang et.al. |
2503.11367 |
link |
2025-03-14 |
PARIC: Probabilistic Attention Regularization for Language Guided Image Classification from Pre-trained Vison Language Models |
Mayank Nautiyal et.al. |
2503.11360 |
null |
2025-03-14 |
Integrating Dynamical Systems Modeling with Spatiotemporal scRNA-seq Data Analysis |
Zhenyi Zhang et.al. |
2503.11347 |
null |
2025-03-14 |
AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation |
Fengyu Li et.al. |
2503.11346 |
link |
2025-03-14 |
Rule-Guided Feedback: Enhancing Reasoning by Enforcing Rule Adherence in Large Language Models |
Aissatou Diallo et.al. |
2503.11336 |
null |
2025-03-14 |
Safe-VAR: Safe Visual Autoregressive Model for Text-to-Image Generative Watermarking |
Ziyi Wang et.al. |
2503.11324 |
null |
2025-03-14 |
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens |
Jeong Hun Yeo et.al. |
2503.11315 |
link |
2025-03-14 |
Unlocking General Long Chain-of-Thought Reasoning Capabilities of Large Language Models via Representation Engineering |
Xinyu Tang et.al. |
2503.11314 |
link |
2025-03-14 |
Are formal and functional linguistic mechanisms dissociated? |
Michael Hanna et.al. |
2503.11302 |
link |
2025-03-14 |
GNNs as Predictors of Agentic Workflow Performances |
Yuanshuo Zhang et.al. |
2503.11301 |
link |
2025-03-14 |
BriLLM: Brain-inspired Large Language Model |
Hai Zhao et.al. |
2503.11299 |
null |
2025-03-14 |
High-Dimensional Interlingual Representations of Large Language Models |
Bryan Wilie et.al. |
2503.11280 |
null |
2025-03-14 |
When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective |
Alireza Mousavi-Hosseini et.al. |
2503.11272 |
link |
2025-03-14 |
CyclePose – Leveraging Cycle-Consistency for Annotation-Free Nuclei Segmentation in Fluorescence Microscopy |
Jonas Utz et.al. |
2503.11266 |
null |
2025-03-14 |
Step-Video-TI2V Technical Report: A State-of-the-Art Text-Driven Image-to-Video Generation Model |
Haoyang Huang et.al. |
2503.11251 |
link |
2025-03-14 |
Reasoning-Grounded Natural Language Explanations for Language Models |
Vojtech Cahlik et.al. |
2503.11248 |
link |
2025-03-14 |
LLMPerf: GPU Performance Modeling meets Large Language Models |
Khoi N. M. Nguyen et.al. |
2503.11244 |
link |
2025-03-14 |
PrivacyScalpel: Enhancing LLM Privacy via Interpretable Feature Intervention with Sparse Autoencoders |
Ahmed Frikha et.al. |
2503.11232 |
null |
2025-03-14 |
Exploring the Potential of Large Multimodal Models as Effective Alternatives for Pronunciation Assessment |
Ke Wang et.al. |
2503.11229 |
null |
2025-03-14 |
GKG-LLM: A Unified Framework for Generalized Knowledge Graph Construction |
Jian Zhang et.al. |
2503.11227 |
null |
2025-03-14 |
Heterogeneously structured compartmental models of epidemiological systems: from individual-level processes to population-scale dynamics |
Emanuele Bernardi et.al. |
2503.11225 |
null |
2025-03-14 |
Can Large Reasoning Models do Analogical Reasoning under Perceptual Uncertainty? |
Giacomo Camposampiero et.al. |
2503.11207 |
link |
2025-03-14 |
LLaVA-MLB: Mitigating and Leveraging Attention Bias for Training-Free Video LLMs |
Leqi Shen et.al. |
2503.11205 |
null |
2025-03-14 |
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering |
Gang Li et.al. |
2503.11197 |
link |
2025-03-14 |
FastVID: Dynamic Density Pruning for Fast Video Large Language Models |
Leqi Shen et.al. |
2503.11187 |
link |
2025-03-14 |
Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification |
Yingjie Zhang et.al. |
2503.11185 |
null |
2025-03-14 |
Palette of Language Models: A Solver for Controlled Text Generation |
Zhe Yang et.al. |
2503.11182 |
null |
2025-03-14 |
Towards Extreme Pruning of LLMs with Plug-and-Play Mixed Sparsity |
Chi Xu et.al. |
2503.11164 |
null |
2025-03-14 |
Don’t Take Things Out of Context: Attention Intervention for Enhancing Chain-of-Thought Reasoning in Large Language Models |
Shaotian Yan et.al. |
2503.11154 |
null |
2025-03-14 |
SpaceSeg: A High-Precision Intelligent Perception Segmentation Method for Multi-Spacecraft On-Orbit Targets |
Hao Liu et.al. |
2503.11133 |
null |
2025-03-14 |
Don’t Forget It! Conditional Sparse Autoencoder Clamping Works for Unlearning |
Matthew Khoriaty et.al. |
2503.11127 |
null |
2025-03-14 |
Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers |
Yifang Chen et.al. |
2503.11108 |
null |
2025-03-14 |
Quantifying Interpretability in CLIP Models with Concept Consistency |
Avinash Madasu et.al. |
2503.11103 |
null |
2025-03-14 |
Open3DVQA: A Benchmark for Comprehensive Spatial Reasoning with Multimodal Large Language Model in Open Space |
Weichen Zhan et.al. |
2503.11094 |
link |
2025-03-14 |
OmniDiff: A Comprehensive Benchmark for Fine-grained Image Difference Captioning |
Yuan Liu et.al. |
2503.11093 |
null |
2025-03-14 |
EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks |
Yi Zhang et.al. |
2503.11089 |
null |
2025-03-14 |
A Survey of Cross-domain Graph Learning: Progress and Future Directions |
Haihong Zhao et.al. |
2503.11086 |
link |
2025-03-14 |
Prompt Alchemy: Automatic Prompt Refinement for Enhancing Code Generation |
Sixiang Ye et.al. |
2503.11085 |
link |
2025-03-14 |
LLMs are Bug Replicators: An Empirical Study on LLMs’ Capability in Completing Bug-prone Code |
Liwei Guo et.al. |
2503.11082 |
link |
2025-03-14 |
Understanding Flatness in Generative Models: Its Role and Benefits |
Taehwan Lee et.al. |
2503.11078 |
null |
2025-03-14 |
Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities |
Xueyang Zhou et.al. |
2503.11074 |
null |
2025-03-14 |
Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models |
Hongyang Wei et.al. |
2503.11073 |
link |
2025-03-14 |
Falcon: A Remote Sensing Vision-Language Foundation Model |
Kelu Yao et.al. |
2503.11070 |
link |
2025-03-14 |
API Agents vs. GUI Agents: Divergence and Convergence |
Chaoyun Zhang et.al. |
2503.11069 |
null |
2025-03-14 |
DeepSeek Powered Solid Dosage Formulation Design and Development |
Leqi Lin et.al. |
2503.11068 |
null |
2025-03-14 |
Generative Modelling for Mathematical Discovery |
Jordan S. Ellenberg et.al. |
2503.11061 |
link |
2025-03-14 |
BannerAgency: Advertising Banner Design with Multimodal LLM Agents |
Heng Wang et.al. |
2503.11060 |
null |
2025-03-14 |
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization |
Kyle Sargent et.al. |
2503.11056 |
null |
2025-03-14 |
Towards Privacy-preserved Pre-training of Remote Sensing Foundation Models with Federated Mutual-guidance Learning |
Jieyi Tan et.al. |
2503.11051 |
null |
2025-03-14 |
PSF-4D: A Progressive Sampling Framework for View Consistent 4D Editing |
Hasan Iqbal et.al. |
2503.11044 |
null |
2025-03-14 |
Beyond A Single AI Cluster: A Survey of Decentralized LLM Training |
Haotian Dong et.al. |
2503.11023 |
null |
2025-03-14 |
An LLM’s Attempts to Adapt to Diverse Software Engineers’ Problem-Solving Styles: More Inclusive & Equitable? |
Andrew Anderson et.al. |
2503.11018 |
null |
2025-03-14 |
RONA: Pragmatically Diverse Image Captioning with Coherence Relations |
Aashish Anantha Ramakrishnan et.al. |
2503.10997 |
link |
2025-03-14 |
TigerLLM – A Family of Bangla Large Language Models |
Nishat Raihan et.al. |
2503.10995 |
null |
2025-03-14 |
Statistical Impossibility and Possibility of Aligning LLMs with Human Preferences: From Condorcet Paradox to Nash Equilibrium |
Kaizhao Liu et.al. |
2503.10990 |
link |
2025-03-14 |
From Dionysius Emerges Apollo – Learning Patterns and Abstractions from Perceptual Sequences |
Shuchen Wu et.al. |
2503.10973 |
null |
2025-03-14 |
Combinatorial Optimization for All: Using LLMs to Aid Non-Experts in Improving Optimization Algorithms |
Camilo Chacón Sartori et.al. |
2503.10968 |
null |
2025-03-13 |
Empirical Computation |
Eric Tang et.al. |
2503.10954 |
null |
2025-03-13 |
Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations |
Piyush Gupta et.al. |
2503.10941 |
null |
2025-03-13 |
ChatGPT Encounters Morphing Attack Detection: Zero-Shot MAD with Multi-Modal Large Language Models and General Vision Models |
Haoyu Zhang et.al. |
2503.10937 |
null |
2025-03-13 |
OASST-ETC Dataset: Alignment Signals from Eye-tracking Analysis of LLM Responses |
Angela Lopez-Cardona et.al. |
2503.10927 |
link |
2025-03-13 |
Learning to Inference Adaptively for Multimodal Large Language Models |
Zhuoyan Xu et.al. |
2503.10905 |
null |
2025-03-13 |
Taxonomic Reasoning for Rare Arthropods: Combining Dense Image Captioning and RAG for Interpretable Classification |
Nathaniel Lesperance et.al. |
2503.10886 |
null |
2025-03-13 |
Chat-TS: Enhancing Multi-Modal Reasoning Over Time-Series and Natural Language Data |
Paul Quinlan et.al. |
2503.10883 |
null |
2025-03-13 |
SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable |
Jiaxin Zhang et.al. |
2503.10881 |
null |
2025-03-13 |
Teamwork makes the dream work: LLMs-Based Agents for GitHub README.MD Summarization |
Duc S. H. Nguyen et.al. |
2503.10876 |
null |
2025-03-13 |
Panopticon: Advancing Any-Sensor Foundation Models for Earth Observation |
Leonard Waldmann et.al. |
2503.10845 |
link |
2025-03-13 |
Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs? |
So Young Lee et.al. |
2503.10838 |
null |
2025-03-13 |
Exploiting Concavity Information in Gaussian Process Contextual Bandit Optimization |
Kevin Li et.al. |
2503.10836 |
null |
2025-03-13 |
Thinking Machines: A Survey of LLM based Reasoning Strategies |
Dibyanayan Bandyopadhyay et.al. |
2503.10814 |
null |
2025-03-13 |
HALURust: Exploiting Hallucinations of Large Language Models to Detect Vulnerabilities in Rust |
Yu Luo et.al. |
2503.10793 |
null |
2025-03-13 |
Vulnerability Detection: From Formal Verification to Large Language Models and Hybrid Approaches: A Comprehensive Overview |
Norbert Tihanyi et.al. |
2503.10784 |
null |
2025-03-13 |
Large-scale Pre-training for Grounded Video Caption Generation |
Evangelos Kazakos et.al. |
2503.10781 |
link |
2025-03-13 |
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing |
Rongyao Fang et.al. |
2503.10639 |
link |
2025-03-13 |
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model |
Jiaming Liu et.al. |
2503.10631 |
null |
2025-03-13 |
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation |
Hang Yin et.al. |
2503.10630 |
null |
2025-03-13 |
From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM |
Kshitij Ambilduke et.al. |
2503.10620 |
link |
2025-03-13 |
Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search |
Andy Zhou et.al. |
2503.10619 |
null |
2025-03-13 |
Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models |
Andy Zhou et.al. |
2503.10617 |
null |
2025-03-13 |
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization |
Yi Yang et.al. |
2503.10615 |
link |
2025-03-13 |
CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing |
Advait Gupta et.al. |
2503.10613 |
link |
2025-03-13 |
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention |
Jinhao Duan et.al. |
2503.10602 |
link |
2025-03-13 |
CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models |
Hao He et.al. |
2503.10592 |
null |
2025-03-13 |
Unlock the Power of Unlabeled Data in Language Driving Model |
Chaoqun Wang et.al. |
2503.10586 |
null |
2025-03-13 |
Sample and Map from a Single Convex Potential: Generation using Conjugate Moment Measures |
Nina Vesseron et.al. |
2503.10576 |
null |
2025-03-13 |
Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models |
Afrar Jahin et.al. |
2503.10573 |
null |
2025-03-13 |
ASIDE: Architectural Separation of Instructions and Data in Language Models |
Egor Zverev et.al. |
2503.10566 |
null |
2025-03-13 |
Short-term AI literacy intervention does not reduce over-reliance on incorrect ChatGPT recommendations |
Brett Puppart et.al. |
2503.10556 |
null |
2025-03-13 |
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation |
Zixian Liu et.al. |
2503.10546 |
null |
2025-03-13 |
DP-GPL: Differentially Private Graph Prompt Learning |
Jing Xu et.al. |
2503.10544 |
null |
2025-03-13 |
Foundation Models for Atomistic Simulation of Chemistry and Materials |
Eric C. -Y. Yuan et.al. |
2503.10538 |
null |
2025-03-13 |
PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models |
Zilu Guo et.al. |
2503.10529 |
null |
2025-03-13 |
Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set |
Florian Eichin et.al. |
2503.10515 |
link |
2025-03-13 |
Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression |
Hooman Shahrokhi et.al. |
2503.10512 |
null |
2025-03-13 |
SySLLM: Generating Synthesized Policy Summaries for Reinforcement Learning Agents Using Large Language Models |
Sahar Admoni et.al. |
2503.10509 |
null |
2025-03-13 |
TokenCarve: Information-Preserving Visual Token Compression in Multimodal Large Language Models |
Xudong Tan et.al. |
2503.10501 |
link |
2025-03-13 |
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation |
Weihao Xuan et.al. |
2503.10497 |
null |
2025-03-13 |
Source-primed Multi-turn Conversation Helps Large Language Models Translate Documents |
Hanxu Hu et.al. |
2503.10494 |
link |
2025-03-13 |
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion |
Evgeniia Vu et.al. |
2503.10488 |
null |
2025-03-13 |
LLMs in Disease Diagnosis: A Comparative Study of DeepSeek-R1 and O3 Mini Across Chronic Health Conditions |
Gaurav Kumar Gupta et.al. |
2503.10486 |
null |
2025-03-13 |
Siamese Foundation Models for Crystal Structure Prediction |
Liming Wu et.al. |
2503.10471 |
null |
2025-03-13 |
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation |
Wenhao Hu et.al. |
2503.10452 |
null |
2025-03-13 |
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models |
Wanhua Li et.al. |
2503.10437 |
null |
2025-03-13 |
Finetuning Generative Trajectory Model with Reinforcement Learning from Human Feedback |
Derun Li et.al. |
2503.10434 |
null |
2025-03-13 |
BeamLLM: Vision-Empowered mmWave Beam Prediction with Large Language Models |
Can Zheng et.al. |
2503.10432 |
null |
2025-03-13 |
Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning |
Jonathan Shaki et.al. |
2503.10408 |
null |
2025-03-13 |
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models |
Yijing Lin et.al. |
2503.10406 |
null |
2025-03-13 |
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing |
Fengxiang Wang et.al. |
2503.10392 |
link |
2025-03-13 |
CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance |
Yufan Deng et.al. |
2503.10391 |
null |
2025-03-13 |
SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading |
Qiaoling Chen et.al. |
2503.10377 |
null |
2025-03-13 |
Probabilistic Forecasting via Autoregressive Flow Matching |
Ahmed El-Gazzar et.al. |
2503.10375 |
null |
2025-03-13 |
G-Boost: Boosting Private SLMs with General LLMs |
Yijiang Fan et.al. |
2503.10367 |
null |
2025-03-13 |
Piece it Together: Part-Based Concepting with IP-Priors |
Elad Richardson et.al. |
2503.10365 |
null |
2025-03-13 |
BioSerenity-E1: a self-supervised EEG model for medical applications |
Ruggero G. Bettinardi et.al. |
2503.10362 |
null |
2025-03-13 |
Collaborative Speculative Inference for Efficient LLM Inference Serving |
Luyao Gao et.al. |
2503.10325 |
null |
2025-03-13 |
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification |
Yuhao Wang et.al. |
2503.10324 |
null |
2025-03-13 |
Towards Fast, Memory-based and Data-Efficient Vision-Language Policy |
Haoxuan Li et.al. |
2503.10322 |
null |
2025-03-13 |
Capturing Semantic Flow of ML-based Systems |
Shin Yoo et.al. |
2503.10310 |
null |
2025-03-13 |
Test Amplification for REST APIs Using “Out-of-the-box” Large Language Models |
Tolgahan Bardakci et.al. |
2503.10306 |
null |
2025-03-13 |
CoDiPhy: A General Framework for Applying Denoising Diffusion Models to the Physical Layer of Wireless Communication Systems |
Peyman Neshaastegaran et.al. |
2503.10297 |
null |
2025-03-13 |
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning |
Weiyun Wang et.al. |
2503.10291 |
null |
2025-03-13 |
MACS: Multi-source Audio-to-image Generation with Contextual Significance and Semantic Alignment |
Hao Zhou et.al. |
2503.10287 |
null |
2025-03-13 |
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies |
Laurie Burchell et.al. |
2503.10267 |
null |
2025-03-13 |
Numerical Error Analysis of Large Language Models |
Stanislav Budzinskiy et.al. |
2503.10251 |
null |
2025-03-13 |
LLM Agents Display Human Biases but Exhibit Distinct Learning Patterns |
Idan Horowitz et.al. |
2503.10248 |
null |
2025-03-13 |
MinorBench: A hand-built benchmark for content-based risks for children |
Shaun Khoo et.al. |
2503.10242 |
null |
2025-03-13 |
SCOOP: A Framework for Proactive Collaboration and Social Continual Learning through Natural Language Interaction andCausal Reasoning |
Dimitri Ognibene et.al. |
2503.10241 |
null |
2025-03-13 |
Unveiling the Invisible: Reasoning Complex Occlusions Amodally with AURA |
Zhixuan Li et.al. |
2503.10225 |
null |
2025-03-13 |
Efficient Federated Fine-Tuning of Large Language Models with Layer Dropout |
Shilong Wang et.al. |
2503.10217 |
null |
2025-03-13 |
Adaptive Preference Aggregation |
Benjamin Heymann et.al. |
2503.10215 |
null |
2025-03-13 |
Singular Value Fine-tuning for Few-Shot Class-Incremental Learning |
Zhiwu Wang et.al. |
2503.10214 |
null |
2025-03-13 |
Adaptive Inner Speech-Text Alignment for LLM-based Speech Translation |
Henglyu Liu et.al. |
2503.10211 |
null |
2025-03-13 |
LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents |
Boyu Chen et.al. |
2503.10200 |
null |
2025-03-13 |
Robustness Tokens: Towards Adversarial Robustness of Transformers |
Brian Pulfer et.al. |
2503.10191 |
link |
2025-03-13 |
“Well, Keep Thinking”: Enhancing LLM Reasoning with Adaptive Injection Decoding |
Hyunbin Jin et.al. |
2503.10167 |
null |
2025-03-13 |
Retrieval-Augmented Generation with Hierarchical Knowledge |
Haoyu Huang et.al. |
2503.10150 |
link |
2025-03-13 |
Gumiho: A Hybrid Architecture to Prioritize Early Tokens in Speculative Decoding |
Jinze Li et.al. |
2503.10135 |
null |
2025-03-13 |
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models |
Runze He et.al. |
2503.10127 |
null |
2025-03-13 |
Hybrid Agents for Image Restoration |
Bingchen Li et.al. |
2503.10120 |
null |
2025-03-13 |
StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error |
Shu-Xun Yang et.al. |
2503.10105 |
link |
2025-03-13 |
AgentDAO: Synthesis of Proposal Transactions Via Abstract DAO Semantics |
Lin Ao et.al. |
2503.10099 |
null |
2025-03-13 |
Semantic Latent Motion for Portrait Video Generation |
Qiyuan Zhang et.al. |
2503.10096 |
null |
2025-03-13 |
Cognitive-Mental-LLM: Leveraging Reasoning in Large Language Models for Mental Health Prediction via Online Text |
Avinash Patil et.al. |
2503.10095 |
null |
2025-03-13 |
Representation-based Reward Modeling for Efficient Safety Alignment of Large Language Model |
Qiyuan Deng et.al. |
2503.10093 |
null |
2025-03-13 |
Light-weighted foundation model for seismic data processing based on representative and non-redundant pre-training dataset |
Xintong Dong et.al. |
2503.10092 |
null |
2025-03-13 |
Why Does Your CoT Prompt (Not) Work? Theoretical Analysis of Prompt Space Complexity, its Interaction with Answer Space During CoT Reasoning with LLMs: A Recurrent Perspective |
Xiang Zhang et.al. |
2503.10084 |
null |
2025-03-13 |
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption |
Joonsung Jeon et.al. |
2503.10081 |
link |
2025-03-13 |
Information Density Principle for MLLM Benchmarks |
Chunyi Li et.al. |
2503.10079 |
link |
2025-03-13 |
VMBench: A Benchmark for Perception-Aligned Video Motion Generation |
Xinrang Ling et.al. |
2503.10076 |
link |
2025-03-13 |
SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation |
Xiangyu Shi et.al. |
2503.10069 |
null |
2025-03-13 |
Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations |
Ho Hin Lee et.al. |
2503.10057 |
link |
2025-03-13 |
Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy |
Ziqi Jia et.al. |
2503.10049 |
null |
2025-03-13 |
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game |
Ziyue Wang et.al. |
2503.10042 |
link |
2025-03-13 |
NumScout: Unveiling Numerical Defects in Smart Contracts using LLM-Pruning Symbolic Execution |
Jiachi Chen et.al. |
2503.10041 |
link |
2025-03-13 |
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model |
Bowen Zhang et.al. |
2503.10009 |
link |
2025-03-13 |
TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs |
Yunxiao Wang et.al. |
2503.09994 |
null |
2025-03-13 |
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes |
JunYong Choi et.al. |
2503.09993 |
null |
2025-03-13 |
From Equations to Insights: Unraveling Symbolic Structures in PDEs with LLMs |
Rohan Bhatnagar et.al. |
2503.09986 |
null |
2025-03-13 |
ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content |
Bhavik Chandna et.al. |
2503.09964 |
null |
2025-03-13 |
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification |
Jiayu Jiang et.al. |
2503.09962 |
link |
2025-03-13 |
RMG: Real-Time Expressive Motion Generation with Self-collision Avoidance for 6-DOF Companion Robotic Arms |
Jiansheng Li et.al. |
2503.09959 |
null |
2025-03-13 |
Exploring Mutual Empowerment Between Wireless Networks and RL-based LLMs: A Survey |
Yu Qiao et.al. |
2503.09956 |
null |
2025-03-13 |
UVE: Are MLLMs Unified Evaluators for AI-Generated Videos? |
Yuanxin Liu et.al. |
2503.09949 |
link |
2025-03-13 |
PluralLLM: Pluralistic Alignment in LLMs via Federated Learning |
Mahmoud Srewa et.al. |
2503.09925 |
null |
2025-03-13 |
Inter-environmental world modeling for continuous and compositional dynamics |
Kohei Hayashi et.al. |
2503.09911 |
null |
2025-03-12 |
Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets |
Zahra Abbasiantaeb et.al. |
2503.09902 |
link |
2025-03-12 |
Improving the Reusability of Conversational Search Test Collections |
Zahra Abbasiantaeb et.al. |
2503.09899 |
link |
2025-03-12 |
What’s In Your Field? Mapping Scientific Research with Knowledge Graphs and Large Language Models |
Abhipsha Das et.al. |
2503.09894 |
link |
2025-03-12 |
On the contraction properties of Sinkhorn semigroups |
O. Deniz Akyildiz et.al. |
2503.09887 |
null |
2025-03-12 |
CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation |
Hariprasath Govindarajan et.al. |
2503.09878 |
null |
2025-03-12 |
LuciBot: Automated Robot Policy Learning from Generated Videos |
Xiaowen Qiu et.al. |
2503.09871 |
null |
2025-03-12 |
Foundation X: Integrating Classification, Localization, and Segmentation through Lock-Release Pretraining Strategy for Chest X-ray Analysis |
Nahid Ul Islam et.al. |
2503.09860 |
null |
2025-03-12 |
Media and responsible AI governance: a game-theoretic and LLM analysis |
Nataliya Balabanova et.al. |
2503.09858 |
null |
2025-03-12 |
MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System |
Jihao Zhao et.al. |
2503.09600 |
link |
2025-03-12 |
How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation |
Ruohao Guo et.al. |
2503.09598 |
link |
2025-03-12 |
PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop |
Chenyu Li et.al. |
2503.09595 |
link |
2025-03-12 |
SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment |
Katrin Renz et.al. |
2503.09594 |
null |
2025-03-12 |
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering |
Md Mohaiminul Islam et.al. |
2503.09590 |
link |
2025-03-12 |
Minimax Optimality of the Probability Flow ODE for Diffusion Models |
Changxiao Cai et.al. |
2503.09583 |
null |
2025-03-12 |
Cost-Optimal Grouped-Query Attention for Long-Context LLMs |
Yingfa Chen et.al. |
2503.09579 |
link |
2025-03-12 |
Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks |
Lutfi Eren Erdogan et.al. |
2503.09572 |
null |
2025-03-13 |
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models |
Qiguang Chen et.al. |
2503.09567 |
null |
2025-03-12 |
GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals |
Shuokang Huang et.al. |
2503.09537 |
null |
2025-03-13 |
Large Language Models for Multi-Facility Location Mechanism Design |
Nguyen Thach et.al. |
2503.09533 |
null |
2025-03-12 |
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning |
Bowen Jin et.al. |
2503.09516 |
link |
2025-03-12 |
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning |
Ziyu Wan et.al. |
2503.09501 |
null |
2025-03-12 |
Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection |
Romain Thoreau et.al. |
2503.09493 |
null |
2025-03-12 |
DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction |
Junjie Zhou et.al. |
2503.09491 |
null |
2025-03-12 |
Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness |
Beier Zhu et.al. |
2503.09487 |
null |
2025-03-12 |
BAMBI: Developing Baby Language Models for Italian |
Alice Suozzi et.al. |
2503.09481 |
null |
2025-03-12 |
Explicit Learning and the LLM in Machine Translation |
Malik Marmonier et.al. |
2503.09454 |
link |
2025-03-12 |
How Well Does Your Tabular Generator Learn the Structure of Tabular Data? |
Xiangjian Jiang et.al. |
2503.09453 |
link |
2025-03-12 |
Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models |
Julian Spravil et.al. |
2503.09443 |
null |
2025-03-12 |
CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection |
Richard A. Dubniczky et.al. |
2503.09433 |
null |
2025-03-12 |
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter |
Kechun Xu et.al. |
2503.09423 |
null |
2025-03-12 |
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary |
Kevin Qinghong Lin et.al. |
2503.09402 |
link |
2025-03-12 |
ForAug: Recombining Foregrounds and Backgrounds to Improve Vision Transformer Training with Bias Mitigation |
Tobias Christian Nauen et.al. |
2503.09399 |
link |
2025-03-12 |
Close-up-GS: Enhancing Close-Up View Synthesis in 3D Gaussian Splatting with Progressive Self-Training |
Jiatong Xia et.al. |
2503.09396 |
null |
2025-03-12 |
Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with LLMs |
Jiani Huang et.al. |
2503.09382 |
link |
2025-03-12 |
Towards Graph Foundation Models: A Transferability Perspective |
Yuxiang Wang et.al. |
2503.09363 |
null |
2025-03-12 |
Deep Learning for Climate Action: Computer Vision Analysis of Visual Narratives on X |
Katharina Prasse et.al. |
2503.09361 |
null |
2025-03-12 |
RetSTA: An LLM-Based Approach for Standardizing Clinical Fundus Image Reports |
Jiushen Cai et.al. |
2503.09358 |
null |
2025-03-12 |
Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts |
Hongyu Chen et.al. |
2503.09347 |
null |
2025-03-12 |
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language Model |
Yuzhi Lai et.al. |
2503.09335 |
null |
2025-03-12 |
CyberLLMInstruct: A New Dataset for Analysing Safety of Fine-Tuned LLMs Using Cyber Security Data |
Adel ElZemity et.al. |
2503.09334 |
null |
2025-03-12 |
A Survey on Enhancing Causal Reasoning Ability of Large Language Models |
Xin Li et.al. |
2503.09326 |
null |
2025-03-12 |
Revealing the Implicit Noise-based Imprint of Generative Models |
Xinghan Li et.al. |
2503.09314 |
null |
2025-03-12 |
xVLM2Vec: Adapting LVLM-based embedding models to multilinguality using Self-Knowledge Distillation |
Elio Musacchio et.al. |
2503.09313 |
null |
2025-03-12 |
Adaptive political surveys and GPT-4: Tackling the cold start problem with simulated user interactions |
Fynn Bachmann et.al. |
2503.09311 |
link |
2025-03-12 |
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference |
Mohammad Siavashi et.al. |
2503.09304 |
null |
2025-03-12 |
Prompt Inference Attack on Distributed Large Language Model Inference Frameworks |
Xinjian Luo et.al. |
2503.09291 |
null |
2025-03-12 |
Crowdsourced Homophily Ties Based Graph Annotation Via Large Language Model |
Yu Bu et.al. |
2503.09281 |
null |
2025-03-12 |
Fine-Tuning Large Language Models for Educational Support: Leveraging Gagne’s Nine Events of Instruction for Lesson Planning |
Linzhao Jia et.al. |
2503.09276 |
null |
2025-03-12 |
COLA: A Scalable Multi-Agent Framework For Windows UI Task Automation |
Di Zhao et.al. |
2503.09263 |
link |
2025-03-13 |
DeepInnovation AI: A Global Dataset Mapping the AI innovation from Academic Research to Industrial Patents |
Haixing Gong et.al. |
2503.09257 |
null |
2025-03-12 |
City Models: Past, Present and Future Prospects |
Helge Ritter et.al. |
2503.09237 |
null |
2025-03-12 |
LREF: A Novel LLM-based Relevance Framework for E-commerce |
Tian Tang et.al. |
2503.09223 |
null |
2025-03-12 |
Rethinking Prompt-based Debiasing in Large Language Models |
Xinyi Yang et.al. |
2503.09219 |
null |
2025-03-12 |
Why LLMs Cannot Think and How to Fix It |
Marius Jahrens et.al. |
2503.09211 |
null |
2025-03-12 |
Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model |
Ali Vosoughi et.al. |
2503.09205 |
null |
2025-03-12 |
Token Weighting for Long-Range Language Modeling |
Falko Helm et.al. |
2503.09202 |
link |
2025-03-12 |
WonderVerse: Extendable 3D Scene Generation with Video Generative Models |
Hao Feng et.al. |
2503.09160 |
null |
2025-03-12 |
FaVChat: Unlocking Fine-Grained Facail Video Understanding with Multimodal Large Language Models |
Fufangchen Zhao et.al. |
2503.09158 |
null |
2025-03-12 |
AdaptAI: A Personalized Solution to Sense Your Stress, Fix Your Mess, and Boost Productivity |
Rushiraj Gadhvi et.al. |
2503.09150 |
link |
2025-03-12 |
Generative Frame Sampler for Long Video Understanding |
Linli Yao et.al. |
2503.09146 |
null |
2025-03-12 |
Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding |
Haoyu Zhang et.al. |
2503.09143 |
null |
2025-03-12 |
AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks |
Jin Li et.al. |
2503.09124 |
null |
2025-03-12 |
Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training? |
Yuechen Xie et.al. |
2503.09122 |
link |
2025-03-12 |
GRU: Mitigating the Trade-off between Unlearning and Retention for Large Language Models |
Yue Wang et.al. |
2503.09117 |
null |
2025-03-12 |
VaxGuard: A Multi-Generator, Multi-Type, and Multi-Role Dataset for Detecting LLM-Generated Vaccine Misinformation |
Syed Talal Ahmad et.al. |
2503.09103 |
null |
2025-03-12 |
Multi-Modal Foundation Models for Computational Pathology: A Survey |
Dong Li et.al. |
2503.09091 |
null |
2025-03-12 |
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows |
Chengyue Gong et.al. |
2503.09069 |
null |
2025-03-12 |
Probing Latent Subspaces in LLM for AI Security: Identifying and Manipulating Adversarial States |
Xin Wei Chia et.al. |
2503.09066 |
null |
2025-03-12 |
Discovering Influential Neuron Path in Vision Transformers |
Yifan Wang et.al. |
2503.09046 |
null |
2025-03-12 |
ManeuverGPT Agentic Control for Safe Autonomous Stunt Maneuvers |
Shawn Azdam et.al. |
2503.09035 |
link |
2025-03-12 |
Teaching LLMs How to Learn with Contextual Fine-Tuning |
Younwoo Choi et.al. |
2503.09032 |
null |
2025-03-12 |
DAST: Difficulty-Aware Self-Training on Large Language Models |
Boyang Xue et.al. |
2503.09029 |
link |
2025-03-12 |
Aligning to What? Limits to RLHF Based Alignment |
Logan Barnhart et.al. |
2503.09025 |
null |
2025-03-13 |
Prompt Inversion Attack against Collaborative Inference of Large Language Models |
Wenjie Qu et.al. |
2503.09022 |
null |
2025-03-12 |
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning |
Yuan Jiang et.al. |
2503.09020 |
link |
2025-03-12 |
Natural Humanoid Robot Locomotion with Generative Motion Prior |
Haodong Zhang et.al. |
2503.09015 |
null |
2025-03-12 |
Leveraging Retrieval Augmented Generative LLMs For Automated Metadata Description Generation to Enhance Data Catalogs |
Mayank Singh et.al. |
2503.09003 |
null |
2025-03-12 |
KNighter: Transforming Static Analysis with LLM-Synthesized Checkers |
Chenyuan Yang et.al. |
2503.09002 |
link |
2025-03-12 |
JBFuzz: Jailbreaking LLMs Efficiently and Effectively Using Fuzzing |
Vasudev Gohil et.al. |
2503.08990 |
null |
2025-03-12 |
I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data? |
Yuhang Liu et.al. |
2503.08980 |
null |
2025-03-12 |
Large Language Models-Aided Program Debloating |
Bo Lin et.al. |
2503.08969 |
null |
2025-03-11 |
Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation |
Yu Wang et.al. |
2503.08963 |
null |
2025-03-11 |
FP3: A 3D Foundation Policy for Robotic Manipulation |
Rujia Yang et.al. |
2503.08950 |
null |
2025-03-11 |
Near-Optimal Sample Complexity for Iterated CVaR Reinforcement Learning with a Generative Model |
Zilong Deng et.al. |
2503.08934 |
null |
2025-03-11 |
ARCHED: A Human-Centered Framework for Transparent, Responsible, and Collaborative AI-Assisted Instructional Design |
Hongming Li et.al. |
2503.08931 |
null |
2025-03-11 |
Enhancing Large Language Models for Hardware Verification: A Novel SystemVerilog Assertion Dataset |
Anand Menon et.al. |
2503.08923 |
null |
2025-03-11 |
Backtracking for Safety |
Bilgehan Sel et.al. |
2503.08919 |
null |
2025-03-11 |
Multilevel Generative Samplers for Investigating Critical Phenomena |
Ankur Singha et.al. |
2503.08918 |
link |
2025-03-11 |
Reconstruct Anything Model: a lightweight foundation model for computational imaging |
Matthieu Terris et.al. |
2503.08915 |
null |
2025-03-11 |
Interpreting the Repeated Token Phenomenon in Large Language Models |
Itay Yona et.al. |
2503.08908 |
link |
2025-03-11 |
A Deep Bayesian Nonparametric Framework for Robust Mutual Information Estimation |
Forough Fazeliasl et.al. |
2503.08902 |
null |
2025-03-11 |
Seeing What’s Not There: Spurious Correlation in Multimodal LLMs |
Parsa Hosseini et.al. |
2503.08884 |
null |
2025-03-11 |
LLMs Know What to Drop: Self-Attention Guided KV Cache Eviction for Efficient Long-Context Inference |
Guangtao Wang et.al. |
2503.08879 |
null |
2025-03-11 |
Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs |
Rafael Carranza et.al. |
2503.08857 |
null |
2025-03-11 |
Contrastive Speaker-Aware Learning for Multi-party Dialogue Generation with LLMs |
Tianyu Sun et.al. |
2503.08842 |
null |
2025-03-11 |
ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness |
Ce Guo et.al. |
2503.08823 |
null |
2025-03-11 |
Cross-Examiner: Evaluating Consistency of Large Language Model-Generated Explanations |
Danielle Villa et.al. |
2503.08815 |
null |
2025-03-11 |
Robust Multi-Objective Controlled Decoding of Large Language Models |
Seongho Son et.al. |
2503.08796 |
link |
2025-03-11 |
Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs |
Ariba Khan et.al. |
2503.08688 |
null |
2025-03-11 |
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models |
Jialv Zou et.al. |
2503.08686 |
link |
2025-03-11 |
Self-Taught Self-Correction for Small Language Models |
Viktor Moskvoretskii et.al. |
2503.08681 |
null |
2025-03-11 |
GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing |
Yuanhao Wang et.al. |
2503.08678 |
null |
2025-03-12 |
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting |
Yongsheng Yu et.al. |
2503.08677 |
null |
2025-03-11 |
Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields |
Tobias Kreiman et.al. |
2503.08674 |
null |
2025-03-11 |
REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder |
Yitian Zhang et.al. |
2503.08665 |
null |
2025-03-11 |
Generating Robot Constitutions & Benchmarks for Semantic Safety |
Pierre Sermanet et.al. |
2503.08663 |
null |
2025-03-11 |
Exploring the Word Sense Disambiguation Capabilities of Large Language Models |
Pierpaolo Basile et.al. |
2503.08662 |
null |
2025-03-11 |
YuE: Scaling Open Foundation Models for Long-Form Music Generation |
Ruibin Yuan et.al. |
2503.08638 |
link |
2025-03-11 |
LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization |
Xianfeng Wu et.al. |
2503.08619 |
link |
2025-03-11 |
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments |
Dongping Li et.al. |
2503.08604 |
null |
2025-03-11 |
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims |
Delip Rao et.al. |
2503.08600 |
null |
2025-03-11 |
3D Point Cloud Generation via Autoregressive Up-sampling |
Ziqiao Meng et.al. |
2503.08594 |
null |
2025-03-11 |
Proc4Gem: Foundation models for physical agency through procedural generation |
Yixin Lin et.al. |
2503.08593 |
null |
2025-03-11 |
HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding |
Shehreen Azad et.al. |
2503.08585 |
null |
2025-03-11 |
RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding |
Xichen Tan et.al. |
2503.08576 |
null |
2025-03-11 |
DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process |
Minjun Zhu et.al. |
2503.08569 |
null |
2025-03-11 |
Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies |
Chen Xu et.al. |
2503.08558 |
null |
2025-03-11 |
Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs |
Wanyong Feng et.al. |
2503.08551 |
null |
2025-03-11 |
Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling |
Craig Messner et.al. |
2503.08550 |
null |
2025-03-11 |
Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation |
Xian Gao et.al. |
2503.08549 |
null |
2025-03-11 |
DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering |
Sher Badshah et.al. |
2503.08542 |
null |
2025-03-11 |
Mellow: a small audio language model for reasoning |
Soham Deshmukh et.al. |
2503.08540 |
link |
2025-03-11 |
Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation |
Andres M Bran et.al. |
2503.08537 |
link |
2025-03-11 |
ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems |
Siddhant Arora et.al. |
2503.08533 |
null |
2025-03-11 |
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training |
Tong Wei et.al. |
2503.08525 |
null |
2025-03-11 |
Position-Aware Depth Decay Decoding ( $D^3$ ): Boosting Large Language Model Inference Efficiency |
Siqi Fan et.al. |
2503.08524 |
null |
2025-03-11 |
High-Quality 3D Head Reconstruction from Any Single Portrait Image |
Jianfu Zhang et.al. |
2503.08516 |
null |
2025-03-11 |
LightPlanner: Unleashing the Reasoning Capabilities of Lightweight Large Language Models in Task Planning |
Weijie Zhou et.al. |
2503.08508 |
link |
2025-03-11 |
Referring to Any Person |
Qing Jiang et.al. |
2503.08507 |
link |
2025-03-11 |
ReviewAgents: Bridging the Gap Between Human and AI-Generated Paper Reviews |
Xian Gao et.al. |
2503.08506 |
null |
2025-03-11 |
Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models |
Han Cao et.al. |
2503.08495 |
null |
2025-03-11 |
TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting |
Fengyi Zhang et.al. |
2503.08485 |
null |
2025-03-11 |
Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum |
Shengpeng Xiao et.al. |
2503.08484 |
null |
2025-03-11 |
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability |
Weijie Zhou et.al. |
2503.08481 |
link |
2025-03-11 |
FastCache: Optimizing Multimodal LLM Serving through Lightweight KV-Cache Compression Framework |
Jianian Zhu et.al. |
2503.08461 |
null |
2025-03-11 |
KAP: MLLM-assisted OCR Text Enhancement for Hybrid Retrieval in Chinese Non-Narrative Documents |
Hsin-Ling Hsu et.al. |
2503.08452 |
null |
2025-03-11 |
LLM-Pack: Intuitive Grocery Handling for Logistics Applications |
Yannik Blei et.al. |
2503.08445 |
null |
2025-03-11 |
TokenSim: Enabling Hardware and Software Exploration for Large Language Model Inference Systems |
Feiyang Wu et.al. |
2503.08415 |
null |
2025-03-11 |
Fact-checking with Generative AI: A Systematic Cross-Topic Examination of LLMs Capacity to Detect Veracity of Political Information |
Elizaveta Kuznetsova et.al. |
2503.08404 |
null |
2025-03-11 |
OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning |
Jiawei Zhou et.al. |
2503.08398 |
null |
2025-03-11 |
Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens |
Qingsong Xie et.al. |
2503.08377 |
null |
2025-03-11 |
nnInteractive: Redefining 3D Promptable Segmentation |
Fabian Isensee et.al. |
2503.08373 |
link |
2025-03-11 |
MetaFold: Language-Guided Multi-Category Garment Folding Framework via Trajectory Generation and Foundation Model |
Haonan Chen et.al. |
2503.08372 |
null |
2025-03-11 |
Robust Latent Matters: Boosting Image Generation with Sampling Error |
Kai Qiu et.al. |
2503.08354 |
link |
2025-03-12 |
Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs |
Chongjun Tu et.al. |
2503.08342 |
null |
2025-03-11 |
Trinity: A Modular Humanoid Robot AI System |
Jingkai Sun et.al. |
2503.08338 |
null |
2025-03-11 |
Prompt2LVideos: Exploring Prompts for Understanding Long-Form Multimodal Videos |
Soumya Shamarao Jahagirdar et.al. |
2503.08335 |
null |
2025-03-11 |
KiteRunner: Language-Driven Cooperative Local-Global Navigation Policy with UAV Mapping in Outdoor Environments |
Shibo Huang et.al. |
2503.08330 |
null |
2025-03-11 |
Towards Scalable and Cross-Lingual Specialist Language Models for Oncology |
Morteza Rohanian et.al. |
2503.08323 |
null |
2025-03-11 |
Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference |
Pol G. Recasens et.al. |
2503.08311 |
null |
2025-03-11 |
Seeing and Reasoning with Confidence: Supercharging Multimodal LLMs with an Uncertainty-Aware Agentic Framework |
Zhuo Zhi et.al. |
2503.08308 |
null |
2025-03-11 |
General-Purpose Aerial Intelligent Agents Empowered by Large Language Models |
Ji Zhao et.al. |
2503.08302 |
null |
2025-03-12 |
Large Language Model as Meta-Surrogate for Data-Driven Many-Task Optimization: A Proof-of-Principle Study |
Xian-Rong Zhang et.al. |
2503.08301 |
null |
2025-03-11 |
Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges |
Xiaoxiao Liu et.al. |
2503.08292 |
null |
2025-03-11 |
PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net |
Jun Yin et.al. |
2503.08276 |
null |
2025-03-11 |
LangTime: A Language-Guided Unified Model for Time Series Forecasting with Proximal Policy Optimization |
Wenzhe Niu et.al. |
2503.08271 |
null |
2025-03-11 |
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness |
Yiming Zhong et.al. |
2503.08257 |
link |
2025-03-11 |
Aligning Text to Image in Diffusion Models is Easier Than You Think |
Jaa-Yeon Lee et.al. |
2503.08250 |
null |
2025-03-11 |
Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices |
Tao Shen et.al. |
2503.08223 |
null |
2025-03-11 |
EgoBlind: Towards Egocentric Visual Assistance for the Blind People |
Junbin Xiao et.al. |
2503.08221 |
null |
2025-03-11 |
S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction |
Guangting Zheng et.al. |
2503.08217 |
null |
2025-03-11 |
To Use or Not to Use a Universal Force Field |
Denan Li et.al. |
2503.08207 |
null |
2025-03-11 |
Route Sparse Autoencoder to Interpret Large Language Models |
Wei Shi et.al. |
2503.08200 |
null |
2025-03-11 |
A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models |
Miao Zhang et.al. |
2503.08199 |
null |
2025-03-11 |
Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation |
Wenlong Meng et.al. |
2503.08195 |
link |
2025-03-11 |
Automating Violence Detection and Categorization from Ancient Texts |
Alhassan Abdelhalim et.al. |
2503.08192 |
null |
2025-03-11 |
RigoChat 2: an adapted language model to Spanish using a bounded dataset and reduced hardware |
Gonzalo Santamaría Gómez et.al. |
2503.08188 |
null |
2025-03-11 |
Mutation Testing via Iterative Large Language Model-Driven Scientific Debugging |
Philipp Straubinger et.al. |
2503.08182 |
null |
2025-03-12 |
ProtTeX: Structure-In-Context Reasoning and Editing of Proteins with Large Language Models |
Zicheng Ma et.al. |
2503.08179 |
null |
2025-03-11 |
Investigating the Effectiveness of a Socratic Chain-of-Thoughts Reasoning Method for Task Planning in Robotics, A Case Study |
Veronica Bot et.al. |
2503.08174 |
null |
2025-03-11 |
Towards All-in-One Medical Image Re-Identification |
Yuan Tian et.al. |
2503.08173 |
link |
2025-03-11 |
TSCnet: A Text-driven Semantic-level Controllable Framework for Customized Low-Light Image Enhancement |
Miao Zhang et.al. |
2503.08168 |
null |
2025-03-11 |
FASIONAD++ : Integrating High-Level Instruction and Information Bottleneck in FAt-Slow fusION Systems for Enhanced Safety in Autonomous Driving with Adaptive Feedback |
Kangan Qian et.al. |
2503.08162 |
null |
2025-03-12 |
OASIS: Order-Augmented Strategy for Improved Code Search |
Zuchen Gao et.al. |
2503.08161 |
null |
2025-03-11 |
Towards Large-scale Chemical Reaction Image Parsing via a Multimodal Large Language Model |
Yufan Chen et.al. |
2503.08156 |
null |
2025-03-11 |
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation |
Jing Wang et.al. |
2503.08153 |
null |
2025-03-11 |
Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features |
Hanbyul Lee et.al. |
2503.08148 |
null |
2025-03-11 |
FilmComposer: LLM-Driven Music Production for Silent Film Clips |
Zhifeng Xie et.al. |
2503.08147 |
null |
2025-03-11 |
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method |
Fei Wang et.al. |
2503.08144 |
null |
2025-03-11 |
FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems |
Jeongsol Kim et.al. |
2503.08136 |
null |
2025-03-11 |
Large Scale Multi-Task Bayesian Optimization with Large Language Models |
Yimeng Zeng et.al. |
2503.08131 |
null |
2025-03-11 |
LLM4MAC: An LLM-Driven Reinforcement Learning Framework for MAC Protocol Emergence |
Renxuan Tan et.al. |
2503.08123 |
null |
2025-03-11 |
Toward Stable World Models: Measuring and Addressing World Instability in Generative Environments |
Soonwoo Kwon et.al. |
2503.08122 |
null |
2025-03-11 |
Uni $\textbf{F}^2$ ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models |
Junzhe Li et.al. |
2503.08120 |
null |
2025-03-11 |
Convergence Dynamics and Stabilization Strategies of Co-Evolving Generative Models |
Weiguo Gao et.al. |
2503.08117 |
null |
2025-03-11 |
AI-native Memory 2.0: Second Me |
Jiale Wei et.al. |
2503.08102 |
null |
2025-03-12 |
PRISM: Privacy-Preserving Improved Stochastic Masking for Federated Generative Models |
Kyeongkook Seo et.al. |
2503.08085 |
link |
2025-03-11 |
Instruction-Augmented Long-Horizon Planning: Embedding Grounding Mechanisms in Embodied Mobile Manipulation |
Fangyuan Wang et.al. |
2503.08084 |
null |
2025-03-11 |
Seeing Beyond Haze: Generative Nighttime Image Dehazing |
Beibei Lin et.al. |
2503.08073 |
null |
2025-03-11 |
Flow Matching for Discrete Systems: Efficient Free Energy Sampling Across Lattice Sizes and Temperatures |
Ping Tuo et.al. |
2503.08063 |
null |
2025-03-11 |
Odysseus Navigates the Sirens’ Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation |
Wen Luo et.al. |
2503.08057 |
null |
2025-03-11 |
Counterfactual Language Reasoning for Explainable Recommendation Systems |
Guanrong Li et.al. |
2503.08051 |
null |
2025-03-11 |
SphOR: A Representation Learning Perspective on Open-set Recognition for Identifying Unknown Classes in Deep Learning Models |
Nadarasar Bahavan et.al. |
2503.08049 |
null |
2025-03-11 |
LongProLIP: A Probabilistic Vision-Language Model with Long Context Text |
Sanghyuk Chun et.al. |
2503.08048 |
link |
2025-03-11 |
Adapting Large Language Models for Parameter-Efficient Log Anomaly Detection |
Ying Fu Lim et.al. |
2503.08045 |
null |
2025-03-11 |
ObjectMover: Generative Object Movement with Video Prior |
Xin Yu et.al. |
2503.08037 |
null |
2025-03-11 |
Learning to Search Effective Example Sequences for In-Context Learning |
Xiang Gao et.al. |
2503.08030 |
null |
2025-03-11 |
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents |
Zhen Tan et.al. |
2503.08026 |
null |
2025-03-10 |
V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation |
Guiwei Zhang et.al. |
2503.07493 |
link |
2025-03-10 |
LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? |
Bangyan Li et.al. |
2503.07487 |
null |
2025-03-10 |
Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction |
Zongzheng Zhang et.al. |
2503.07485 |
link |
2025-03-10 |
GenAIReading: Augmenting Human Cognition with Interactive Digital Textbooks Using Large Language Models and Image Generation Models |
Ryugo Morita et.al. |
2503.07463 |
null |
2025-03-10 |
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning |
Xiangru Tang et.al. |
2503.07459 |
link |
2025-03-10 |
LLMs syntactically adapt their language use to their conversational partner |
Florian Kandra et.al. |
2503.07457 |
null |
2025-03-10 |
Is a Good Foundation Necessary for Efficient Reinforcement Learning? The Computational Role of the Base Model in Exploration |
Dylan J. Foster et.al. |
2503.07453 |
null |
2025-03-10 |
From Idea to Implementation: Evaluating the Influence of Large Language Models in Software Development – An Opinion Paper |
Sargam Yadav et.al. |
2503.07450 |
null |
2025-03-10 |
From Text to Visuals: Using LLMs to Generate Math Diagrams with Vector Graphics |
Jaewook Lee et.al. |
2503.07429 |
null |
2025-03-10 |
RePO: ReLU-based Preference Optimization |
Junkang Wu et.al. |
2503.07426 |
link |
2025-03-10 |
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding |
Yan Tai et.al. |
2503.07413 |
link |
2025-03-10 |
Towards Safe Robot Foundation Models |
Maximilian Tölle et.al. |
2503.07404 |
null |
2025-03-10 |
Keeping Representation Similarity in Finetuning for Medical Image Analysis |
Wenqiang Zu et.al. |
2503.07399 |
null |
2025-03-10 |
Revisiting Noise in Natural Language Processing for Computational Social Science |
Nadav Borenstein et.al. |
2503.07395 |
null |
2025-03-10 |
Process-Supervised LLM Recommenders via Flow-guided Tuning |
Chongming Gao et.al. |
2503.07377 |
link |
2025-03-10 |
Artificial Utopia: Simulation and Intelligent Agents for a Democratised Future |
Yannick Oswald et.al. |
2503.07364 |
null |
2025-03-10 |
RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing |
Yiqing Xie et.al. |
2503.07358 |
link |
2025-03-10 |
Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment |
Xing Xie et.al. |
2503.07334 |
null |
2025-03-10 |
Assessing the Macro and Micro Effects of Random Seeds on Fine-Tuning Large Language Models |
Hao Zhou et.al. |
2503.07329 |
null |
2025-03-10 |
Dynamic Path Navigation for Motion Agents with LLM Reasoning |
Yubo Zhao et.al. |
2503.07323 |
null |
2025-03-10 |
Experimental Exploration: Investigating Cooperative Interaction Behavior Between Humans and Large Language Model Agents |
Guanxuan Jiang et.al. |
2503.07320 |
null |
2025-03-10 |
Self-Corrective Task Planning by Inverse Prompting with Large Language Models |
Jiho Lee et.al. |
2503.07317 |
null |
2025-03-10 |
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies |
Luyi Jiang et.al. |
2503.07306 |
null |
2025-03-10 |
A Graph-based Verification Framework for Fact-Checking |
Yani Huang et.al. |
2503.07282 |
null |
2025-03-10 |
COMODO: Cross-Modal Video-to-IMU Distillation for Efficient Egocentric Human Activity Recognition |
Baiyu Chen et.al. |
2503.07259 |
link |
2025-03-10 |
CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting |
Haicheng Liao et.al. |
2503.07234 |
null |
2025-03-10 |
Control Flow-Augmented Decompiler based on Large Language Model |
Peipei Liu et.al. |
2503.07215 |
null |
2025-03-10 |
Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion |
Mona Sheikh Zeinoddin et.al. |
2503.07204 |
null |
2025-03-10 |
A Zero-shot Learning Method Based on Large Language Models for Multi-modal Knowledge Graph Embedding |
Bingchen Liu et.al. |
2503.07202 |
null |
2025-03-10 |
Effective and Efficient Masked Image Generation Models |
Zebin You et.al. |
2503.07197 |
link |
2025-03-10 |
Contextual Cues in Machine Translation: Investigating the Potential of Multi-Source Input Strategies in LLMs and NMT Systems |
Lia Shahnazaryan et.al. |
2503.07195 |
null |
2025-03-10 |
Ideas in Inference-time Scaling can Benefit Generative Pre-training Algorithms |
Jiaming Song et.al. |
2503.07154 |
null |
2025-03-10 |
MRCEval: A Comprehensive, Challenging and Accessible Machine Reading Comprehension Benchmark |
Shengkun Ma et.al. |
2503.07144 |
link |
2025-03-10 |
Application of Multiple Chain-of-Thought in Contrastive Reasoning for Implicit Sentiment Analysis |
Liwei Yang et.al. |
2503.07140 |
null |
2025-03-10 |
VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation |
Hanzhi Chen et.al. |
2503.07135 |
null |
2025-03-10 |
Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation |
Sihao Lin et.al. |
2503.07125 |
null |
2025-03-10 |
Quantizing Large Language Models for Code Generation: A Differentiated Replication |
Alessandro Giagnorio et.al. |
2503.07103 |
null |
2025-03-10 |
A Novel Ophthalmic Benchmark for Evaluating Multimodal Large Language Models with Fundus Photographs and OCT Images |
Xiaoyi Liang et.al. |
2503.07094 |
null |
2025-03-10 |
Linguistic Knowledge Transfer Learning for Speech Enhancement |
Kuo-Hsuan Hung et.al. |
2503.07078 |
null |
2025-03-10 |
DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs |
Jongwoo Ko et.al. |
2503.07067 |
null |
2025-03-10 |
Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning |
Huilin Deng et.al. |
2503.07065 |
link |
2025-03-10 |
TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation |
Victor Shea-Jay Huang et.al. |
2503.07050 |
null |
2025-03-10 |
Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion |
Yongle Zhang et.al. |
2503.07047 |
null |
2025-03-10 |
Conditional Generative Modeling for Amorphous Multi-Element Materials |
Honglin Li et.al. |
2503.07043 |
link |
2025-03-10 |
TCM-3CEval: A Triaxial Benchmark for Assessing Responses from Large Language Models in Traditional Chinese Medicine |
Tianai Huang et.al. |
2503.07041 |
null |
2025-03-10 |
Bot Wars Evolved: Orchestrating Competing LLMs in a Counterstrike Against Phone Scams |
Nardine Basta et.al. |
2503.07036 |
null |
2025-03-10 |
Multimodal Human-AI Synergy for Medical Imaging Quality Control: A Hybrid Intelligence Framework with Adaptive Dataset Curation and Closed-Loop Evaluation |
Zhi Qin et.al. |
2503.07032 |
null |
2025-03-10 |
Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense |
Yuting Hu et.al. |
2503.07020 |
null |
2025-03-10 |
Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning |
Xintong Li et.al. |
2503.07018 |
null |
2025-03-10 |
HELM: Human-Preferred Exploration with Language Models |
Shuhao Liao et.al. |
2503.07006 |
null |
2025-03-10 |
Large Language Models Often Say One Thing and Do Another |
Ruoxi Xu et.al. |
2503.07003 |
link |
2025-03-10 |
Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning |
Jiazheng Liu et.al. |
2503.07002 |
null |
2025-03-10 |
Utilizing Jailbreak Probability to Attack and Safeguard Multimodal LLMs |
Wenzhuo Xu et.al. |
2503.06989 |
null |
2025-03-10 |
Social Bias Benchmark for Generation: A Comparison of Generation and QA-Based Evaluations |
Jiho Jin et.al. |
2503.06987 |
null |
2025-03-10 |
Learning Decision Trees as Amortized Structure Inference |
Mohammed Mahfoud et.al. |
2503.06985 |
link |
2025-03-10 |
Exploring Multimodal Perception in Large Language Models Through Perceptual Strength Ratings |
Jonghyun Lee et.al. |
2503.06980 |
null |
2025-03-10 |
Lightweight Multimodal Artificial Intelligence Framework for Maritime Multi-Scene Recognition |
Xinyu Xi et.al. |
2503.06978 |
null |
2025-03-10 |
Task-Specific Knowledge Distillation from the Vision Foundation Model for Enhanced Medical Image Segmentation |
Pengchen Liang et.al. |
2503.06976 |
null |
2025-03-10 |
ReAgent: Reversible Multi-Agent Reasoning for Knowledge-Enhanced Multi-Hop QA |
Zhao Xinjie et.al. |
2503.06951 |
null |
2025-03-10 |
CtrlRAG: Black-box Adversarial Attacks Based on Masked Language Models in Retrieval-Augmented Language Generation |
Runqi Sui et.al. |
2503.06950 |
null |
2025-03-11 |
LexPro-1.0 Technical Report |
Haotian Chen et.al. |
2503.06949 |
link |
2025-03-10 |
Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection |
Wentao Wu et.al. |
2503.06948 |
null |
2025-03-10 |
Handle Object Navigation as Weighted Traveling Repairman Problem |
Ruimeng Liu et.al. |
2503.06937 |
link |
2025-03-10 |
Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping |
Ning Ding et.al. |
2503.06930 |
null |
2025-03-10 |
Effect of Selection Format on LLM Performance |
Yuchen Han et.al. |
2503.06926 |
null |
2025-03-10 |
Combinatorial Optimization via LLM-driven Iterated Fine-tuning |
Pranjal Awasthi et.al. |
2503.06917 |
null |
2025-03-10 |
Beyond Code Generation: LLM-supported Exploration of the Program Design Space |
J. D. Zamfirescu-Pereira et.al. |
2503.06911 |
null |
2025-03-10 |
A Query Optimization Method Utilizing Large Language Models |
Zhiming Yao et.al. |
2503.06902 |
null |
2025-03-10 |
DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation |
Xiaoliang Ju et.al. |
2503.06900 |
null |
2025-03-10 |
SafePlan: Leveraging Formal Logic and Chain-of-Thought Reasoning for Enhanced Safety in LLM-based Robotic Task Planning |
Ike Obi et.al. |
2503.06892 |
null |
2025-03-10 |
ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks |
Yan Yang et.al. |
2503.06885 |
null |
2025-03-10 |
Text-to-Image Diffusion Models Cannot Count, and Prompt Refinement Cannot Help |
Yuefan Cao et.al. |
2503.06884 |
null |
2025-03-10 |
ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration |
Mengting Ai et.al. |
2503.06881 |
link |
2025-03-10 |
Graphormer-Guided Task Planning: Beyond Static Rules with LLM Safety Perception |
Wanjing Huang et.al. |
2503.06866 |
link |
2025-03-10 |
FIGLUT: An Energy-Efficient Accelerator Design for FP-INT GEMM Using Look-Up Tables |
Gunho Park et.al. |
2503.06862 |
null |
2025-03-10 |
Enhanced Multi-Tuple Extraction for Alloys: Integrating Pointer Networks and Augmented Attention |
Mengzhe Hei et.al. |
2503.06861 |
null |
2025-03-10 |
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification |
Xiangyan Qu et.al. |
2503.06847 |
null |
2025-03-10 |
GUIDE-CoT: Goal-driven and User-Informed Dynamic Estimation for Pedestrian Trajectory using Chain-of-Thought |
Sungsik Kim et.al. |
2503.06832 |
null |
2025-03-10 |
Towards a Multimodal MRI-Based Foundation Model for Multi-Level Feature Exploration in Segmentation, Molecular Subtyping, and Grading of Glioma |
Somayeh Farahani et.al. |
2503.06828 |
null |
2025-03-10 |
eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference |
Suraiya Tairin et.al. |
2503.06823 |
null |
2025-03-10 |
HierDAMap: Towards Universal Domain Adaptive BEV Mapping via Hierarchical Perspective Priors |
Siyu Li et.al. |
2503.06821 |
link |
2025-03-10 |
Towards Fine-Grained Video Question Answering |
Wei Dai et.al. |
2503.06820 |
null |
2025-03-09 |
Privacy Auditing of Large Language Models |
Ashwinee Panda et.al. |
2503.06808 |
null |
2025-03-09 |
VideoPhy-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation |
Hritik Bansal et.al. |
2503.06800 |
null |
2025-03-09 |
Multimodal AI-driven Biomarker for Early Detection of Cancer Cachexia |
Sabeen Ahmed et.al. |
2503.06797 |
null |
2025-03-09 |
RoboDesign1M: A Large-scale Dataset for Robot Design Understanding |
Tri Le et.al. |
2503.06796 |
null |
2025-03-09 |
AutoMisty: A Multi-Agent LLM Framework for Automated Code Generation in the Misty Social Robot |
Xiao Wang et.al. |
2503.06791 |
null |
2025-03-09 |
Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models |
Tianyi Zhang et.al. |
2503.06784 |
null |
2025-03-09 |
Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting |
Yufei Li et.al. |
2503.06781 |
null |
2025-03-09 |
Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators |
Feng Gu et.al. |
2503.06778 |
null |
2025-03-09 |
Primal-Dual Sample Complexity Bounds for Constrained Markov Decision Processes with Multiple Constraints |
Max Buckley et.al. |
2503.06751 |
null |
2025-03-09 |
Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models |
Wenxuan Huang et.al. |
2503.06749 |
link |
2025-03-09 |
CoDa-4DGS: Dynamic Gaussian Splatting with Context and Deformation Awareness for Autonomous Driving |
Rui Song et.al. |
2503.06744 |
null |
2025-03-09 |
Delusions of Large Language Models |
Hongshen Xu et.al. |
2503.06709 |
null |
2025-03-09 |
Alignment for Efficient Tool Calling of Large Language Models |
Hongshen Xu et.al. |
2503.06708 |
null |
2025-03-09 |
PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts |
Ming Zhang et.al. |
2503.06706 |
link |
2025-03-09 |
InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models |
Yuchen Yan et.al. |
2503.06692 |
null |
2025-03-09 |
DependEval: Benchmarking LLMs for Repository Dependency Understanding |
Junjia Du et.al. |
2503.06689 |
link |
2025-03-09 |
UniGenX: Unified Generation of Sequence and Structure with Autoregressive Diffusion |
Gongbo Zhang et.al. |
2503.06687 |
null |
2025-03-09 |
FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation |
Wei Li et.al. |
2503.06680 |
null |
2025-03-09 |
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets |
Tommaso Bendinelli et.al. |
2503.06664 |
null |
2025-03-07 |
Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints |
Parameswaran Kamalaruban et.al. |
2503.05684 |
null |
2025-03-07 |
Understanding the Limits of Lifelong Knowledge Editing in LLMs |
Lukas Thede et.al. |
2503.05683 |
null |
2025-03-07 |
AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data |
Zengqun Zhao et.al. |
2503.05665 |
link |
2025-03-07 |
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval |
Yu Zhang et.al. |
2503.05659 |
link |
2025-03-07 |
A functional approach for curve alignment and shape analysis |
Issam-Ali Moindjié et.al. |
2503.05632 |
null |
2025-03-07 |
Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings |
Xuanqing Liu et.al. |
2503.05620 |
null |
2025-03-07 |
A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models |
Dong Shu et.al. |
2503.05613 |
null |
2025-03-07 |
From Theory to Application: A Practical Introduction to Neural Operators in Scientific Computing |
Prashant K. Jha et.al. |
2503.05598 |
link |
2025-03-07 |
R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning |
Huatong Song et.al. |
2503.05592 |
null |
2025-03-07 |
Evaluating open-source Large Language Models for automated fact-checking |
Nicolo’ Fontana et.al. |
2503.05565 |
null |
2025-03-07 |
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance |
Bryan Etzine et.al. |
2503.05551 |
null |
2025-03-07 |
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation |
Shai Bergman et.al. |
2503.05530 |
null |
2025-03-07 |
PoSSUM: A Protocol for Surveying Social-media Users with Multimodal LLMs |
Roberto Cerina et.al. |
2503.05529 |
null |
2025-03-07 |
Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations |
Eren Erogullari et.al. |
2503.05522 |
link |
2025-03-07 |
Cognitive Bias Detection Using Advanced Prompt Engineering |
Frederic Lemieux et.al. |
2503.05516 |
null |
2025-03-07 |
Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering |
Yusong Ke et.al. |
2503.05505 |
null |
2025-03-07 |
Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders |
Qijiong Liu et.al. |
2503.05493 |
null |
2025-03-07 |
Statistical Deficiency for Task Inclusion Estimation |
Loïc Fosse et.al. |
2503.05491 |
null |
2025-03-07 |
Maximum Hallucination Standards for Domain-Specific Large Language Models |
Tingmingke Lu et.al. |
2503.05481 |
null |
2025-03-07 |
The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence |
Noah Mamie et.al. |
2503.05473 |
null |
2025-03-07 |
De Novo Design of Protein-Binding Peptides by Quantum Computing |
Lars Meuser et.al. |
2503.05458 |
null |
2025-03-07 |
LLM-based Iterative Approach to Metamodeling in Automotive |
Nenad Petrovic et.al. |
2503.05449 |
null |
2025-03-07 |
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts |
Weigao Sun et.al. |
2503.05447 |
link |
2025-03-07 |
Are Your LLM-based Text-to-SQL Models Secure? Exploring SQL Injection via Backdoor Attacks |
Meiyu Lin et.al. |
2503.05445 |
null |
2025-03-07 |
Static Program Analysis Guided LLM Based Unit Test Generation |
Sujoy Roychowdhury et.al. |
2503.05394 |
null |
2025-03-07 |
Ontology Generation using Large Language Models |
Anna Sofia Lippolis et.al. |
2503.05388 |
link |
2025-03-07 |
VLMs Play StarCraft II: A Benchmark and Multimodal Decision Method |
Weiyu Ma et.al. |
2503.05383 |
link |
2025-03-07 |
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning |
Jiaxing Zhao et.al. |
2503.05379 |
null |
2025-03-07 |
Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs |
Zara Siddique et.al. |
2503.05371 |
null |
2025-03-07 |
Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter |
Weixiang Zhao et.al. |
2503.05362 |
null |
2025-03-07 |
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation |
Zhenxuan Zhang et.al. |
2503.05347 |
link |
2025-03-07 |
AutoIOT: LLM-Driven Automated Natural Language Programming for AIoT Applications |
Leming Shen et.al. |
2503.05346 |
link |
2025-03-07 |
PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations? |
Martin Spitznagel et.al. |
2503.05333 |
null |
2025-03-07 |
Dynamic Knowledge Integration for Evidence-Driven Counter-Argument Generation with Large Language Models |
Anar Yeginbergen et.al. |
2503.05328 |
null |
2025-03-07 |
Routing for Large ML Models |
Ofir Cohen et.al. |
2503.05324 |
link |
2025-03-07 |
Riemannian Metric Learning: Closer to You than You Imagine |
Samuel Gruffaz et.al. |
2503.05321 |
null |
2025-03-07 |
Disentangling Task Interference within Neurons: Model Merging in Alignment with Neuronal Mechanisms |
Zitao Fang et.al. |
2503.05320 |
null |
2025-03-07 |
Escaping Plato’s Cave: Towards the Alignment of 3D and Text Latent Spaces |
Souhail Hadgi et.al. |
2503.05283 |
null |
2025-03-07 |
Similarity-Based Domain Adaptation with LLMs |
Jie He et.al. |
2503.05281 |
null |
2025-03-07 |
Optimizing LLM Inference Throughput via Memory-aware and SLA-constrained Dynamic Batching |
Bowen Pang et.al. |
2503.05248 |
link |
2025-03-07 |
L-FUSION: Laplacian Fetal Ultrasound Segmentation & Uncertainty Estimation |
Johanna P. Müller et.al. |
2503.05245 |
null |
2025-03-07 |
WritingBench: A Comprehensive Benchmark for Generative Writing |
Yuning Wu et.al. |
2503.05244 |
link |
2025-03-07 |
MM-StoryAgent: Immersive Narrated Storybook Video Generation with a Multi-Agent Paradigm across Text, Image and Audio |
Xuenan Xu et.al. |
2503.05242 |
link |
2025-03-07 |
Unveiling Biases in AI: ChatGPT’s Political Economy Perspectives and Human Comparisons |
Leonardo Becchetti et.al. |
2503.05234 |
null |
2025-03-07 |
Kaiwu: A Multimodal Manipulation Dataset and Framework for Robot Learning and Human-Robot Interaction |
Shuo Jiang et.al. |
2503.05231 |
null |
2025-03-07 |
ARbiter: Generating Dialogue Options and Communication Support in Augmented Reality |
Julián Méndez et.al. |
2503.05220 |
null |
2025-03-07 |
Knowledge Updating? No More Model Editing! Just Selective Contextual Reasoning |
Guoxiu He et.al. |
2503.05212 |
null |
2025-03-07 |
Path Pooling: Train-Free Structure Enhancement for Efficient Knowledge Graph Retrieval-Augmented Generation |
Hairu Wang et.al. |
2503.05203 |
null |
2025-03-07 |
ORANSight-2.0: Foundational LLMs for O-RAN |
Pranshav Gajjar et.al. |
2503.05200 |
null |
2025-03-07 |
Memory-augmented Query Reconstruction for LLM-based Knowledge Graph Reasoning |
Mufan Xu et.al. |
2503.05193 |
null |
2025-03-07 |
Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions |
Chan hur et.al. |
2503.05186 |
null |
2025-03-07 |
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching |
Simon A. Aytes et.al. |
2503.05179 |
null |
2025-03-07 |
Development and Enhancement of Text-to-Image Diffusion Models |
Rajdeep Roshan Sahu et.al. |
2503.05149 |
null |
2025-03-07 |
RocketEval: Efficient Automated LLM Evaluation via Grading Checklist |
Tianjun Wei et.al. |
2503.05142 |
null |
2025-03-07 |
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs |
Ling Team et.al. |
2503.05139 |
null |
2025-03-07 |
R1-Zero’s “Aha Moment” in Visual Reasoning on a 2B Non-SFT Model |
Hengguang Zhou et.al. |
2503.05132 |
link |
2025-03-07 |
Dilu: Enabling GPU Resourcing-on-Demand for Serverless DL Serving via Introspective Elasticity |
Cunchi Lv et.al. |
2503.05130 |
null |
2025-03-07 |
Can Large Language Models Grasp Concepts in Visual Content? A Case Study on YouTube Shorts about Depression |
Jiaying “Lizzy” Liu et.al. |
2503.05109 |
null |
2025-03-07 |
AutoTestForge: A Multidimensional Automated Testing Framework for Natural Language Processing Models |
Hengrui Xing et.al. |
2503.05102 |
null |
2025-03-07 |
SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding |
Kaiyu Huang et.al. |
2503.05096 |
null |
2025-03-07 |
S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information |
Feng Jiang et.al. |
2503.05085 |
null |
2025-03-07 |
On a Connection Between Imitation Learning and RLHF |
Teng Xiao et.al. |
2503.05079 |
null |
2025-03-07 |
PromptPex: Automatic Test Generation for Language Model Prompts |
Reshabh K Sharma et.al. |
2503.05070 |
link |
2025-03-07 |
Capacity-Aware Inference: Mitigating the Straggler Effect in Mixture of Experts |
Shwai He et.al. |
2503.05066 |
null |
2025-03-07 |
No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding |
Michael Krumdick et.al. |
2503.05061 |
null |
2025-03-06 |
Dynamic-KGQA: A Scalable Framework for Generating Adaptive Question Answering Datasets |
Preetam Prabhu Srikar Dammu et.al. |
2503.05049 |
null |
2025-03-06 |
Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference |
Grace Proebsting et.al. |
2503.05047 |
null |
2025-03-06 |
Continual Pre-training of MoEs: How robust is your router? |
Benjamin Thérien et.al. |
2503.05029 |
null |
2025-03-06 |
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids |
Hannes Stark et.al. |
2503.05025 |
link |
2025-03-06 |
Safety is Not Only About Refusal: Reasoning-Enhanced Fine-tuning for Interpretable LLM Safety |
Yuyou Zhang et.al. |
2503.05021 |
null |
2025-03-06 |
LLMs’ Reshaping of People, Processes, Products, and Society in Software Development: A Comprehensive Exploration with Early Adopters |
Benyamin Tabarsi et.al. |
2503.05012 |
null |
2025-03-06 |
Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation |
Bryan Li et.al. |
2503.05010 |
null |
2025-03-06 |
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models |
Benyamin Jamialahmadi et.al. |
2503.05005 |
link |
2025-03-06 |
Wanda++: Pruning Large Language Models via Regional Gradients |
Yifan Yang et.al. |
2503.04992 |
null |
2025-03-06 |
DP-GTR: Differentially Private Prompt Protection via Group Text Rewriting |
Mingchen Li et.al. |
2503.04990 |
null |
2025-03-06 |
Leveraging Large Language Models For Scalable Vector Graphics Processing: A Review |
Boris Malashenko et.al. |
2503.04983 |
null |
2025-03-06 |
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression |
Souvik Kundu et.al. |
2503.04982 |
null |
2025-03-06 |
Quantifying the Relevance of Youth Research Cited in the US Policy Documents |
Miftahul Jannat Mokarrama et.al. |
2503.04977 |
link |
2025-03-06 |
Energy-Weighted Flow Matching for Offline Reinforcement Learning |
Shiyuan Zhang et.al. |
2503.04975 |
null |
2025-03-06 |
Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning |
Giulio Corallo et.al. |
2503.04973 |
null |
2025-03-06 |
Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge |
Songyuan Li et.al. |
2503.04971 |
null |
2025-03-06 |
DB-Explore: Automated Database Exploration and Instruction Synthesis for Text-to-SQL |
Haoyuan Ma et.al. |
2503.04959 |
null |
2025-03-06 |
Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems |
Jooyoung Lee et.al. |
2503.04945 |
null |
2025-03-06 |
HILGEN: Hierarchically-Informed Data Generation for Biomedical NER Using Knowledgebases and Large Language Models |
Yao Ge et.al. |
2503.04930 |
null |
2025-03-06 |
Metadata-free Georegistration of Ground and Airborne Imagery |
Adam Bredvik et.al. |
2503.04927 |
null |
2025-03-06 |
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement |
Ian Huang et.al. |
2503.04919 |
null |
2025-03-06 |
L $^2$ M: Mutual Information Scaling Law for Long-Context Language Modeling |
Zhuo Chen et.al. |
2503.04725 |
link |
2025-03-07 |
Shifting Long-Context LLMs Research from Input to Output |
Yuhao Wu et.al. |
2503.04723 |
null |
2025-03-06 |
Enough Coin Flips Can Make LLMs Act Bayesian |
Ritwik Gupta et.al. |
2503.04722 |
null |
2025-03-06 |
Predictable Scale: Part I – Optimal Hyperparameter Scaling Law in Large Language Model Pretraining |
Houyi Li et.al. |
2503.04715 |
null |
2025-03-07 |
Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size |
Alireza Behtash et.al. |
2503.04704 |
null |
2025-03-06 |
UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets |
Wenyu Wang et.al. |
2503.04693 |
null |
2025-03-06 |
Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases |
Pengcheng Qiu et.al. |
2503.04691 |
null |
2025-03-06 |
LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue |
Sangyeop Kim et.al. |
2503.04675 |
null |
2025-03-06 |
What Are You Doing? A Closer Look at Controllable Human Video Generation |
Emanuele Bugliarello et.al. |
2503.04666 |
null |
2025-03-06 |
CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models |
Shengzhuang Chen et.al. |
2503.04655 |
null |
2025-03-06 |
Transferable Foundation Models for Geometric Tasks on Point Cloud Representations: Geometric Neural Operators |
Blaine Quackenbush et.al. |
2503.04649 |
link |
2025-03-06 |
Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment |
Wen Yang et.al. |
2503.04647 |
null |
2025-03-06 |
Simulating the Real World: A Unified Survey of Multimodal Generative Models |
Yuqi Hu et.al. |
2503.04641 |
link |
2025-03-06 |
Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation |
Aishik Konwer et.al. |
2503.04639 |
null |
2025-03-06 |
Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking |
Yijie Xu et.al. |
2503.04636 |
null |
2025-03-06 |
3HANDS Dataset: Learning from Humans for Generating Naturalistic Handovers with Supernumerary Robotic Limbs |
Artin Saberpour Abadian et.al. |
2503.04635 |
null |
2025-03-06 |
Better Process Supervision with Bi-directional Rewarding Signals |
Wenxiang Chen et.al. |
2503.04618 |
null |
2025-03-06 |
Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning |
Mohammad Amin Ghanizadeh et.al. |
2503.04611 |
null |
2025-03-06 |
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization |
Zhijian Zhuo et.al. |
2503.04598 |
null |
2025-03-06 |
The Next Frontier of LLM Applications: Open Ecosystems and Hardware Synergy |
Xinyi Hou et.al. |
2503.04596 |
null |
2025-03-06 |
Learning Generalizable Language-Conditioned Cloth Manipulation from Long Demonstrations |
Hanyi Zhao et.al. |
2503.04557 |
null |
2025-03-06 |
Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation |
Armel Zebaze et.al. |
2503.04554 |
null |
2025-03-06 |
Benchmarking Reasoning Robustness in Large Language Models |
Tong Yu et.al. |
2503.04550 |
null |
2025-03-06 |
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model |
Wenke Huang et.al. |
2503.04543 |
null |
2025-03-06 |
SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning |
Chen Li et.al. |
2503.04530 |
null |
2025-03-06 |
Multi-modal Summarization in Model-Based Engineering: Automotive Software Development Case Study |
Nenad Petrovic et.al. |
2503.04506 |
null |
2025-03-06 |
Learning Object Placement Programs for Indoor Scene Synthesis with Iterative Self Training |
Adrian Chang et.al. |
2503.04496 |
null |
2025-03-06 |
Large Language Models in Bioinformatics: A Survey |
Zhenyu Wang et.al. |
2503.04490 |
null |
2025-03-06 |
InfoSEM: A Deep Generative Model with Informative Priors for Gene Regulatory Network Inference |
Tianyu Cui et.al. |
2503.04483 |
null |
2025-03-06 |
ToolFuzz – Automated Agent Tool Testing |
Ivan Milev et.al. |
2503.04479 |
null |
2025-03-06 |
Semantic Alignment of Unimodal Medical Text and Vision Representations |
Maxime Di Folco et.al. |
2503.04478 |
null |
2025-03-06 |
Know Thy Judge: On the Robustness Meta-Evaluation of LLM Safety Judges |
Francisco Eiras et.al. |
2503.04474 |
null |
2025-03-06 |
Guiding LLMs to Generate High-Fidelity and High-Quality Counterfactual Explanations for Text Classification |
Van Bach Nguyen et.al. |
2503.04463 |
null |
2025-03-06 |
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction |
Chao Wang et.al. |
2503.04457 |
null |
2025-03-06 |
Activation Space Interventions Can Be Transferred Between Large Language Models |
Narmeen Oozeer et.al. |
2503.04429 |
null |
2025-03-06 |
AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services |
Xiaoqi Wang et.al. |
2503.04418 |
null |
2025-03-06 |
Can Large Language Models Predict Antimicrobial Resistance Gene? |
Hyunwoo Yoo et.al. |
2503.04413 |
null |
2025-03-06 |
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search |
Kou Misaki et.al. |
2503.04412 |
null |
2025-03-06 |
Speculative MoE: Communication Efficient Parallel MoE Inference with Speculative Token and Expert Pre-scheduling |
Yan Li et.al. |
2503.04398 |
null |
2025-03-06 |
TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models |
Xinyi He et.al. |
2503.04396 |
null |
2025-03-06 |
Shaping Shared Languages: Human and Large Language Models’ Inductive Biases in Emergent Communication |
Tom Kouwenhoven et.al. |
2503.04395 |
null |
2025-03-06 |
AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management |
Junyuan Mao et.al. |
2503.04392 |
null |
2025-03-06 |
TRACT: Regression-Aware Fine-tuning Meets Chain-of-Thought Reasoning for LLM-as-a-Judge |
Cheng-Han Chiang et.al. |
2503.04381 |
null |
2025-03-06 |
Lost in Literalism: How Supervised Training Shapes Translationese in LLMs |
Yafu Li et.al. |
2503.04369 |
null |
2025-03-06 |
A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery |
Yiheng Zhu et.al. |
2503.04362 |
null |
2025-03-06 |
LONGCODEU: Benchmarking Long-Context Language Models on Long Code Understanding |
Jia Li et.al. |
2503.04359 |
null |
2025-03-06 |
scDD: Latent Codes Based scRNA-seq Dataset Distillation with Foundation Model Knowledge |
Zhen Yu et.al. |
2503.04357 |
null |
2025-03-06 |
Layer-Specific Scaling of Positional Encodings for Superior Long-Context Modeling |
Zhenghua Wang et.al. |
2503.04355 |
null |
2025-03-06 |
Large Language Models for Zero-shot Inference of Causal Structures in Biology |
Izzy Newsham et.al. |
2503.04347 |
null |
2025-03-06 |
TRANSIT your events into a new mass: Fast background interpolation for weakly-supervised anomaly searches |
Ivan Oleksiyuk et.al. |
2503.04342 |
link |
2025-03-06 |
In-depth Analysis of Graph-based RAG in a Unified Framework |
Yingli Zhou et.al. |
2503.04338 |
null |
2025-03-06 |
The Challenge of Identifying the Origin of Black-Box Large Language Models |
Ziqing Yang et.al. |
2503.04332 |
null |
2025-03-06 |
Solving Word-Sense Disambiguation and Word-Sense Induction with Dictionary Examples |
Tadej Škvorc et.al. |
2503.04328 |
null |
2025-03-06 |
Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation |
Christian Rondanini et.al. |
2503.04302 |
null |
2025-03-06 |
Mapping AI Benchmark Data to Quantitative Risk Estimates Through Expert Elicitation |
Malcolm Murray et.al. |
2503.04299 |
null |
2025-03-06 |
MathMistake Checker: A Comprehensive Demonstration for Step-by-Step Math Problem Mistake Finding by Prompt-Guided LLMs |
Tianyang Zhang et.al. |
2503.04291 |
null |
2025-03-06 |
How Do Hackathons Foster Creativity? Towards AI Collaborative Evaluation of Creativity at Scale |
Jeanette Falk et.al. |
2503.04290 |
null |
2025-03-06 |
Towards Autonomous Reinforcement Learning for Real-World Robotic Manipulation with Large Language Models |
Niccolò Turcato et.al. |
2503.04280 |
null |
2025-03-06 |
VirtualXAI: A User-Centric Framework for Explainability Assessment Leveraging GPT-Generated Personas |
Georgios Makridis et.al. |
2503.04261 |
null |
2025-03-06 |
Knowledge Retention for Continual Model-Based Reinforcement Learning |
Yixiang Sun et.al. |
2503.04256 |
null |
2025-03-06 |
ADOR: A Design Exploration Framework for LLM Serving with Enhanced Latency and Throughput |
Junsoo Kim et.al. |
2503.04253 |
null |
2025-03-06 |
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant |
Yifei Huang et.al. |
2503.04250 |
link |
2025-03-06 |
How to Mitigate Overfitting in Weak-to-strong Generalization? |
Junhao Shi et.al. |
2503.04249 |
null |
2025-03-06 |
ThrowBench: Benchmarking LLMs by Predicting Runtime Exceptions |
Julian Aron Prenner et.al. |
2503.04241 |
link |
2025-03-06 |
DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models |
Ruizhe Chen et.al. |
2503.04240 |
null |
2025-03-06 |
SemaSK: Answering Semantics-aware Spatial Keyword Queries with Large Language Models |
Zesong Zhang et.al. |
2503.04234 |
null |
2025-03-06 |
FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion |
Ziyi Yang et.al. |
2503.04222 |
null |
2025-03-06 |
Knowledge-Decoupled Synergetic Learning: An MLLM based Collaborative Approach to Few-shot Multimodal Dialogue Intention Recognition |
Bin Chen et.al. |
2503.04201 |
null |
2025-03-06 |
MASTER: Multimodal Segmentation with Text Prompts |
Fuyang Liu et.al. |
2503.04199 |
null |
2025-03-06 |
Measuring temporal effects of agent knowledge by date-controlled tool use |
R. Patrick Xian et.al. |
2503.04188 |
null |
2025-03-06 |
TIMER: Temporal Instruction Modeling and Evaluation for Longitudinal Clinical Records |
Hejie Cui et.al. |
2503.04176 |
null |
2025-03-06 |
DuCos: Duality Constrained Depth Super-Resolution via Foundation Model |
Zhiqiang Yan et.al. |
2503.04171 |
null |
2025-03-06 |
CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation |
Yuki Tanaka et.al. |
2503.04164 |
null |
2025-03-06 |
VLA Model-Expert Collaboration for Bi-directional Manipulation Learning |
Tian-Yu Xiang et.al. |
2503.04163 |
null |
2025-03-06 |
Semantic Retrieval Augmented Contrastive Learning for Sequential Recommendation |
Ziqiang Cui et.al. |
2503.04162 |
null |
2025-03-06 |
KidneyTalk-open: No-code Deployment of a Private Large Language Model with Medical Documentation-Enhanced Knowledge Database for Kidney Disease |
Yongchao Long et.al. |
2503.04153 |
link |
2025-03-06 |
Ticktack : Long Span Temporal Alignment of Large Language Models Leveraging Sexagenary Cycle Time Expression |
Xue Han et.al. |
2503.04150 |
null |
2025-03-06 |
Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination |
Simin Chen et.al. |
2503.04149 |
null |
2025-03-06 |
Biological Sequence with Language Model Prompting: A Survey |
Jiyue Jiang et.al. |
2503.04135 |
null |
2025-03-06 |
Token-Efficient Long Video Understanding for Multimodal LLMs |
Jindong Jiang et.al. |
2503.04130 |
null |
2025-03-06 |
TimeFound: A Foundation Model for Time Series Forecasting |
Congxi Xiao et.al. |
2503.04118 |
null |
2025-03-06 |
InterChat: Enhancing Generative Visual Analytics using Multimodal Interactions |
Juntong Chen et.al. |
2503.04110 |
null |
2025-03-06 |
WeakMedSAM: Weakly-Supervised Medical Image Segmentation via SAM with Sub-Class Exploration and Prompt Affinity Mining |
Haoran Wang et.al. |
2503.04106 |
link |
2025-03-06 |
LLMs Can Generate a Better Answer by Aggregating Their Own Responses |
Zichong Li et.al. |
2503.04104 |
null |
2025-03-06 |
Disparities in LLM Reasoning Accuracy and Explanations: A Case Study on African American English |
Runtao Zhou et.al. |
2503.04099 |
null |
2025-03-07 |
Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts |
Xiangnan Chen et.al. |
2503.04095 |
null |
2025-03-06 |
PokéChamp: an Expert-level Minimax Language Agent |
Seth Karten et.al. |
2503.04094 |
null |
2025-03-06 |
Beyond Memorization: Evaluating the True Type Inference Capabilities of LLMs for Java Code Snippets |
Yiwen Dong et.al. |
2503.04076 |
null |
2025-03-06 |
PP-DocBee: Improving Multimodal Document Understanding Through a Bag of Tricks |
Feng Ni et.al. |
2503.04065 |
link |
2025-03-06 |
Uncovering inequalities in new knowledge learning by large language models across different languages |
Chenglong Wang et.al. |
2503.04064 |
link |
2025-03-06 |
EVE: Towards End-to-End Video Subtitle Extraction with Vision-Language Models |
Haiyang Yu et.al. |
2503.04058 |
null |
2025-03-06 |
Insights from Rights and Wrongs: A Large Language Model for Solving Assertion Failures in RTL Design |
Jie Zhou et.al. |
2503.04057 |
link |
2025-03-06 |
GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding |
Xihan Wang et.al. |
2503.04034 |
null |
2025-03-06 |
Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting |
Jiyue Jiang et.al. |
2503.04013 |
null |
2025-03-06 |
DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation |
Amin Karimi et.al. |
2503.04006 |
null |
2025-03-06 |
Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows |
Xiangxin Zhou et.al. |
2503.03989 |
null |
2025-03-06 |
RetinalGPT: A Retinal Clinical Preference Conversational Assistant Powered by Large Vision-Language Models |
Wenhui Zhu et.al. |
2503.03987 |
null |
2025-03-06 |
ReasonGraph: Visualisation of Reasoning Paths |
Zongqian Li et.al. |
2503.03979 |
link |
2025-03-05 |
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge |
Fanwen Wang et.al. |
2503.03971 |
link |
2025-03-05 |
Model Behavior Specification by Leveraging LLM Self-Playing and Self-Improving |
Soya Park et.al. |
2503.03967 |
null |
2025-03-05 |
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems |
Richard Ren et.al. |
2503.03750 |
null |
2025-03-05 |
Process-based Self-Rewarding Language Models |
Shimao Zhang et.al. |
2503.03746 |
null |
2025-03-05 |
Towards Understanding Distilled Reasoning Models: A Representational Approach |
David D. Baek et.al. |
2503.03730 |
null |
2025-03-05 |
Improving LLM Safety Alignment with Dual-Objective Optimization |
Xuandong Zhao et.al. |
2503.03710 |
link |
2025-03-05 |
Rethinking Video Tokenization: A Conditioned Diffusion-based Approach |
Nianzu Yang et.al. |
2503.03708 |
null |
2025-03-05 |
Effective LLM Knowledge Learning via Model Generalization |
Mingkang Zhu et.al. |
2503.03705 |
null |
2025-03-05 |
A Practical Memory Injection Attack against LLM Agents |
Shen Dong et.al. |
2503.03704 |
null |
2025-03-05 |
Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models |
Jiyue Jiang et.al. |
2503.03702 |
null |
2025-03-05 |
Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks |
Zihao Zhao et.al. |
2503.03687 |
link |
2025-03-05 |
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models |
Bar Karov et.al. |
2503.03669 |
link |
2025-03-05 |
Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction |
Gustaw Opiełka et.al. |
2503.03666 |
link |
2025-03-05 |
A Generative Approach to High Fidelity 3D Reconstruction from Text Data |
Venkat Kumar R et.al. |
2503.03664 |
null |
2025-03-05 |
Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset |
Jessica Hoffmann et.al. |
2503.03654 |
null |
2025-03-05 |
Token-Level Privacy in Large Language Models |
Re’em Harel et.al. |
2503.03652 |
null |
2025-03-05 |
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles |
Rui Zhao et.al. |
2503.03651 |
link |
2025-03-05 |
Psy-Copilot: Visual Chain of Thought for Counseling |
Keqi Chen et.al. |
2503.03645 |
null |
2025-03-05 |
Large language models in finance: estimating financial sentiment for stock prediction |
Kemal Kirtac et.al. |
2503.03612 |
null |
2025-03-05 |
Enhancing the Accuracy and Comprehensibility in Architectural Tactics Detection via Small Model-Augmented Prompt Engineering |
Lingli Cao et.al. |
2503.03609 |
link |
2025-03-05 |
Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling |
Keqi Chen et.al. |
2503.03607 |
null |
2025-03-05 |
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders |
Kristian Kuznetsov et.al. |
2503.03601 |
null |
2025-03-05 |
PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention |
Lida Chen et.al. |
2503.03588 |
null |
2025-03-05 |
“You don’t need a university degree to comprehend data protection this way”: LLM-Powered Interactive Privacy Policy Assessment |
Vincent Freiberger et.al. |
2503.03587 |
null |
2025-03-05 |
Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories |
Alperen Yildiz et.al. |
2503.03586 |
null |
2025-03-05 |
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection |
Wenqiao Li et.al. |
2503.03562 |
null |
2025-03-05 |
Afford-X: Generalizable and Slim Affordance Reasoning for Task-oriented Manipulation |
Xiaomeng Zhu et.al. |
2503.03556 |
null |
2025-03-05 |
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems |
Yaoru Li et.al. |
2503.03505 |
link |
2025-03-05 |
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization |
Jiajun Yu et.al. |
2503.03503 |
link |
2025-03-05 |
CURVALID: Geometrically-guided Adversarial Prompt Detection |
Canaan Yung et.al. |
2503.03502 |
link |
2025-03-05 |
TEDDY: A Family Of Foundation Models For Understanding Single Cell Biology |
Alexis Chevalier et.al. |
2503.03485 |
null |
2025-03-05 |
Generative Artificial Intelligence in Robotic Manipulation: A Survey |
Kun Zhang et.al. |
2503.03464 |
null |
2025-03-05 |
Open-Source Large Language Models as Multilingual Crowdworkers: Synthesizing Open-Domain Dialogues in Several Languages With No Examples in Targets and No Machine Translation |
Ahmed Njifenjou et.al. |
2503.03462 |
null |
2025-03-05 |
Visualising Policy-Reward Interplay to Inform Zeroth-Order Preference Optimisation of Large Language Models |
Alessio Galatolo et.al. |
2503.03460 |
link |
2025-03-05 |
Unified Mind Model: Reimagining Autonomous Agents in the LLM Era |
Pengbo Hu et.al. |
2503.03459 |
null |
2025-03-05 |
Taxation Perspectives from Large Language Models: A Case Study on Additional Tax Penalties |
Eunkyung Choi et.al. |
2503.03444 |
null |
2025-03-05 |
RASD: Retrieval-Augmented Speculative Decoding |
Guofeng Quan et.al. |
2503.03434 |
null |
2025-03-05 |
Video Super-Resolution: All You Need is a Video Diffusion Model |
Zhihao Zhan et.al. |
2503.03355 |
null |
2025-03-05 |
Leveraging Large Language Models to Develop Heuristics for Emerging Optimization Problems |
Thomas Bömer et.al. |
2503.03350 |
null |
2025-03-05 |
EnigmaToM: Improve LLMs’ Theory-of-Mind Reasoning Capabilities with Neural Knowledge Base of Entity States |
Hainiu Xu et.al. |
2503.03340 |
link |
2025-03-05 |
LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models |
Xi Zhu et.al. |
2503.03313 |
null |
2025-03-05 |
SEOE: A Scalable and Reliable Semantic Evaluation Framework for Open Domain Event Detection |
Yi-Fan Lu et.al. |
2503.03303 |
null |
2025-03-05 |
Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters |
Julia Hindel et.al. |
2503.03299 |
null |
2025-03-05 |
A 262 TOPS Hyperdimensional Photonic AI Accelerator powered by a Si3N4 microcomb laser |
Christos Pappas et.al. |
2503.03263 |
null |
2025-03-05 |
Can Frontier LLMs Replace Annotators in Biomedical Text Mining? Analyzing Challenges and Exploring Solutions |
Yichong Zhao et.al. |
2503.03261 |
null |
2025-03-05 |
Exploring the Potential of Large Language Models as Predictors in Dynamic Text-Attributed Graphs |
Runlin Lei et.al. |
2503.03258 |
null |
2025-03-05 |
PAIR: A Novel Large Language Model-Guided Selection Strategy for Evolutionary Algorithms |
Shady Ali et.al. |
2503.03239 |
link |
2025-03-05 |
FANS – Formal Answer Selection for Natural Language Math Reasoning Using Lean4 |
Jiarui Yao et.al. |
2503.03238 |
null |
2025-03-05 |
Targeted Distillation for Sentiment Analysis |
Yice Zhang et.al. |
2503.03225 |
null |
2025-03-05 |
Mocap-2-to-3: Lifting 2D Diffusion-Based Pretrained Models for 3D Motion Capture |
Zhumei Wang et.al. |
2503.03222 |
null |
2025-03-05 |
COSINT-Agent: A Knowledge-Driven Multimodal Agent for Chinese Open Source Intelligence |
Wentao Li et.al. |
2503.03215 |
null |
2025-03-05 |
PolyVer: A Compositional Approach for Polyglot System Modeling and Verification |
Pei-Wei Chen et.al. |
2503.03207 |
null |
2025-03-05 |
An Analytical Theory of Power Law Spectral Bias in the Learning Dynamics of Diffusion Models |
Binxu Wang et.al. |
2503.03206 |
null |
2025-03-05 |
MA-LoT: Multi-Agent Lean-based Long Chain-of-Thought Reasoning enhances Formal Theorem Proving |
Ruida Wang et.al. |
2503.03205 |
link |
2025-03-05 |
Find Matching Faces Based On Face Parameters |
Setu A. Bhatt et.al. |
2503.03204 |
null |
2025-03-05 |
Towards Robust Universal Information Extraction: Benchmark, Evaluation, and Solution |
Jizhao Zhu et.al. |
2503.03201 |
null |
2025-03-05 |
Structured Outputs Enable General-Purpose LLMs to be Medical Experts |
Guangfu Guo et.al. |
2503.03194 |
null |
2025-03-05 |
Enhancing Memory Efficiency in Large Language Model Training Through Chronos-aware Pipeline Parallelism |
Xinyuan Lin et.al. |
2503.03182 |
null |
2025-03-05 |
Enhancing Cybersecurity in Critical Infrastructure with LLM-Assisted Explainable IoT Systems |
Ashutosh Ghimire et.al. |
2503.03180 |
null |
2025-03-05 |
AttackSeqBench: Benchmarking Large Language Models’ Understanding of Sequential Patterns in Cyber Attacks |
Javier Yong et.al. |
2503.03170 |
link |
2025-03-05 |
Dango: A Mixed-Initiative Data Wrangling System using Large Language Model |
Wei-Hao Chen et.al. |
2503.03154 |
null |
2025-03-05 |
Position: Model Collapse Does Not Mean What You Think |
Rylan Schaeffer et.al. |
2503.03150 |
null |
2025-03-05 |
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models |
YiQiu Guo et.al. |
2503.03149 |
null |
2025-03-05 |
PriFFT: Privacy-preserving Federated Fine-tuning of Large Language Models via Function Secret Sharing |
Zhichao You et.al. |
2503.03146 |
null |
2025-03-05 |
A Survey of Foundation Models for Environmental Science |
Runlong Yu et.al. |
2503.03142 |
null |
2025-03-05 |
StarFlow: Leveraging Normalizing Flows for Stellar Age Estimation in SDSS-V DR19 |
Alexander Stone-Martinez et.al. |
2503.03138 |
null |
2025-03-05 |
Bridging Molecular Graphs and Large Language Models |
Runze Wang et.al. |
2503.03135 |
link |
2025-03-05 |
Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability |
Chenhui Xu et.al. |
2503.03128 |
null |
2025-03-05 |
The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models |
Zichao Li et.al. |
2503.03122 |
null |
2025-03-05 |
PromAssistant: Leveraging Large Language Models for Text-to-PromQL |
Chenxi Zhang et.al. |
2503.03114 |
null |
2025-03-05 |
SoK: Knowledge is All You Need: Last Mile Delivery for Automated Provenance-based Intrusion Detection with LLMs |
Wenrui Cheng et.al. |
2503.03108 |
null |
2025-03-05 |
Monitoring Decoding: Mitigating Hallucination via Evaluating the Factuality of Partial Response during Generation |
Yurui Chang et.al. |
2503.03106 |
null |
2025-03-05 |
BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving |
Katharina Winter et.al. |
2503.03074 |
link |
2025-03-04 |
Unification of Stochastic and Quantum Thermodynamics in Scalar Field Theory via a Model with Brownian Thermostat |
T. Koide et.al. |
2503.03059 |
null |
2025-03-04 |
SAGE: Steering and Refining Dialog Generation with State-Action Augmentation |
Yizhe Zhang et.al. |
2503.03040 |
link |
2025-03-04 |
SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs |
Samir Abdaljalil et.al. |
2503.03032 |
null |
2025-03-04 |
Generative Active Adaptation for Drifting and Imbalanced Network Intrusion Detection |
Ragini Gupta et.al. |
2503.03022 |
null |
2025-03-04 |
Can Diffusion Models Provide Rigorous Uncertainty Quantification for Bayesian Inverse Problems? |
Evan Scope Crafts et.al. |
2503.03007 |
link |
2025-03-04 |
Teaching AI to Handle Exceptions: Supervised Fine-Tuning with Human-Aligned Judgment |
Matthew DosSantos DiSorbo et.al. |
2503.02976 |
null |
2025-03-04 |
LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation |
Jude Khouja et.al. |
2503.02972 |
null |
2025-03-04 |
Multilingual Relative Clause Attachment Ambiguity Resolution in Large Language Models |
So Young Lee et.al. |
2503.02971 |
link |
2025-03-04 |
InfiniSST: Simultaneous Translation of Unbounded Speech with Large Language Model |
Siqi Ouyang et.al. |
2503.02969 |
link |
2025-03-04 |
Privacy-Preserving Fair Synthetic Tabular Data |
Fatima J. Sarmin et.al. |
2503.02968 |
null |
2025-03-04 |
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding |
Zhangchen Xu et.al. |
2503.02951 |
link |
2025-03-04 |
Train on classical, deploy on quantum: scaling generative quantum machine learning to a thousand qubits |
Erik Recio-Armengol et.al. |
2503.02934 |
link |
2025-03-04 |
Optimizing open-domain question answering with graph-based retrieval augmented generation |
Joyce Cahoon et.al. |
2503.02922 |
null |
2025-03-04 |
ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models |
Qinyu Zhao et.al. |
2503.02883 |
link |
2025-03-04 |
Wikipedia in the Era of LLMs: Evolution and Risks |
Siming Huang et.al. |
2503.02879 |
link |
2025-03-04 |
SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models |
Dmitry Nechaev et.al. |
2503.02876 |
link |
2025-03-04 |
The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models |
Ke Ji et.al. |
2503.02875 |
null |
2025-03-04 |
Prompting Generative AI with Interaction-Augmented Instructions |
Leixian Shen et.al. |
2503.02874 |
null |
2025-03-05 |
FairSense-AI: Responsible AI Meets Sustainability |
Shaina Raza et.al. |
2503.02865 |
null |
2025-03-04 |
Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework |
Ziang Zhou et.al. |
2503.02863 |
null |
2025-03-04 |
Privacy and Accuracy-Aware AI/ML Model Deduplication |
Hong Guan et.al. |
2503.02862 |
null |
2025-03-04 |
Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs’ Decoding Layers |
Zicong He et.al. |
2503.02851 |
link |
2025-03-04 |
Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs |
Yuzhe Gu et.al. |
2503.02846 |
link |
2025-03-04 |
SeqFusion: Sequential Fusion of Pre-Trained Models for Zero-Shot Time-Series Forecasting |
Ting-Ji Huang et.al. |
2503.02836 |
link |
2025-03-04 |
AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation |
Songming Zhang et.al. |
2503.02832 |
null |
2025-03-04 |
Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging |
Yujin Oh et.al. |
2503.02824 |
null |
2025-03-04 |
A Multimodal Symphony: Integrating Taste and Sound through Generative AI |
Matteo Spanio et.al. |
2503.02823 |
null |
2025-03-04 |
Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts |
Marta Skreta et.al. |
2503.02819 |
link |
2025-03-04 |
RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration |
Alicia Russell-Gilbert et.al. |
2503.02800 |
null |
2025-03-04 |
Multimodal AI predicts clinical outcomes of drug combinations from preclinical data |
Yepeng Huang et.al. |
2503.02781 |
link |
2025-03-04 |
Implicit Bias in LLMs: A Survey |
Xinru Lin et.al. |
2503.02776 |
null |
2025-03-04 |
InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training |
Dingdong Wang et.al. |
2503.02769 |
null |
2025-03-04 |
BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression |
Daniil Larionov et.al. |
2503.02756 |
null |
2025-03-04 |
Large Language Models for Multilingual Previously Fact-Checked Claim Detection |
Ivan Vykopal et.al. |
2503.02737 |
link |
2025-03-04 |
RedChronos: A Large Language Model-Based Log Analysis System for Insider Threat Detection in Enterprises |
Chenyu Li et.al. |
2503.02702 |
null |
2025-03-04 |
MindBridge: Scalable and Cross-Model Knowledge Editing via Memory-Augmented Modality |
Shuaike Li et.al. |
2503.02701 |
link |
2025-03-04 |
Zero-Shot Complex Question-Answering on Long Scientific Documents |
Wanting Wang et.al. |
2503.02695 |
link |
2025-03-04 |
FinArena: A Human-Agent Collaboration Framework for Financial Market Analysis and Forecasting |
Congluo Xu et.al. |
2503.02692 |
null |
2025-03-04 |
Generative Modeling of Microweather Wind Velocities for Urban Air Mobility |
Tristan A. Shah et.al. |
2503.02690 |
link |
2025-03-04 |
MPO: Boosting LLM Agents with Meta Plan Optimization |
Weimin Xiong et.al. |
2503.02682 |
link |
2025-03-04 |
Multidimensional Consistency Improves Reasoning in Language Models |
Huiyuan Lai et.al. |
2503.02670 |
null |
2025-03-04 |
LoRA-Null: Low-Rank Adaptation via Null Space for Large Language Models |
Pengwei Tang et.al. |
2503.02659 |
null |
2025-03-04 |
The Effectiveness of Large Language Models in Transforming Unstructured Text to Standardized Formats |
William Brach et.al. |
2503.02650 |
link |
2025-03-04 |
YARE-GAN: Yet Another Resting State EEG-GAN |
Yeganeh Farahzadi et.al. |
2503.02636 |
link |
2025-03-04 |
Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective |
Haotian Li et.al. |
2503.02631 |
null |
2025-03-04 |
Towards Event Extraction with Massive Types: LLM-based Collaborative Annotation and Partitioning Extraction |
Wenxuan Liu et.al. |
2503.02628 |
null |
2025-03-04 |
Rewarding Doubt: A Reinforcement Learning Approach to Confidence Calibration of Large Language Models |
Paul Stangel et.al. |
2503.02623 |
null |
2025-03-04 |
OkraLong: A Flexible Retrieval-Augmented Framework for Long-Text Query Processing |
Yulong Hui et.al. |
2503.02603 |
null |
2025-03-04 |
Seeing is Understanding: Unlocking Causal Attention into Modality-Mutual Attention for Multimodal LLMs |
Wei-Yao Wang et.al. |
2503.02597 |
link |
2025-03-04 |
StageDesigner: Artistic Stage Generation for Scenography via Theater Scripts |
Zhaoxing Gan et.al. |
2503.02595 |
null |
2025-03-04 |
MciteBench: A Benchmark for Multimodal Citation Text Generation in MLLMs |
Caiyu Hu et.al. |
2503.02589 |
link |
2025-03-04 |
Playing games with Large language models: Randomness and strategy |
Alicia Vidler et.al. |
2503.02582 |
null |
2025-03-04 |
LLM-Safety Evaluations Lack Robustness |
Tim Beyer et.al. |
2503.02574 |
null |
2025-03-04 |
SpecInF: Exploiting Idle GPU Resources in Distributed DL Training via Speculative Inference Filling |
Cunchi Lv et.al. |
2503.02550 |
null |
2025-03-04 |
PVTree: Realistic and Controllable Palm Vein Generation for Recognition Tasks |
Sheng Shang et.al. |
2503.02547 |
null |
2025-03-04 |
SAGE-Amine: Generative Amine Design with Multi-Property Optimization for Efficient CO2 Capture |
Hocheol Lim et.al. |
2503.02534 |
null |
2025-03-04 |
Use Me Wisely: AI-Driven Assessment for LLM Prompting Skills Development |
Dimitri Ognibene et.al. |
2503.02532 |
null |
2025-03-04 |
Generator-Assistant Stepwise Rollback Framework for Large Language Model Agent |
Xingzuo Li et.al. |
2503.02519 |
link |
2025-03-04 |
Deepfake Detection via Knowledge Injection |
Tonghui Li et.al. |
2503.02503 |
null |
2025-03-04 |
LADM: Long-context Training Data Selection with Attention-based Dependency Measurement for LLMs |
Jianghao Chen et.al. |
2503.02502 |
null |
2025-03-04 |
PennyLang: Pioneering LLM-Based Quantum Code Generation with a Novel PennyLane-Centric Dataset |
Haider Asif et.al. |
2503.02497 |
null |
2025-03-04 |
BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA |
Zhengyang Ji et.al. |
2503.02476 |
link |
2025-03-04 |
It Helps to Take a Second Opinion: Teaching Smaller LLMs to Deliberate Mutually via Selective Rationale Optimisation |
Sohan Patnaik et.al. |
2503.02463 |
null |
2025-03-04 |
Don’t Get Too Excited – Eliciting Emotions in LLMs |
Gino Franco Fazzi et.al. |
2503.02457 |
null |
2025-03-04 |
Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations |
Yuhao Yang et.al. |
2503.02453 |
null |
2025-03-04 |
Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization |
Yilun Qiu et.al. |
2503.02450 |
link |
2025-03-04 |
AILS-NTUA at SemEval-2025 Task 4: Parameter-Efficient Unlearning for Large Language Models using Data Chunking |
Iraklis Premptis et.al. |
2503.02443 |
null |
2025-03-04 |
AILS-NTUA at SemEval-2025 Task 3: Leveraging Large Language Models and Translation Strategies for Multilingual Hallucination Detection |
Dimitra Karkani et.al. |
2503.02442 |
null |
2025-03-04 |
Artificial Intelligence in Reactor Physics: Current Status and Future Prospects |
Ruizhi Zhang et.al. |
2503.02440 |
null |
2025-03-04 |
Beyond the Leland strategies |
Emmanuel Lepinette et.al. |
2503.02419 |
null |
2025-03-04 |
Building 3D In-Context Learning Universal Model in Neuroimaging |
Jiesi Hu et.al. |
2503.02410 |
link |
2025-03-04 |
Wyckoff Transformer: Generation of Symmetric Crystals |
Nikita Kazeev et.al. |
2503.02407 |
link |
2025-03-04 |
Hierarchical Re-ranker Retriever (HRR) |
Ashish Singh et.al. |
2503.02401 |
null |
2025-03-04 |
Promptware Engineering: Software Engineering for LLM Prompt Development |
Zhenpeng Chen et.al. |
2503.02400 |
null |
2025-03-04 |
PersonaX: A Recommendation Agent Oriented User Modeling Framework for Long Behavior Sequence |
Yunxiao Shi et.al. |
2503.02398 |
link |
2025-03-04 |
ReSo: A Reward-driven Self-organizing LLM-based Multi-Agent System for Reasoning Tasks |
Heng Zhou et.al. |
2503.02390 |
link |
2025-03-04 |
RGBSQGrasp: Inferring Local Superquadric Primitives from Single RGB Image for Graspability-Aware Bin Picking |
Yifeng Xu et.al. |
2503.02387 |
null |
2025-03-04 |
An Efficient and Precise Training Data Construction Framework for Process-supervised Reward Model in Mathematical Reasoning |
Wei Sun et.al. |
2503.02382 |
null |
2025-03-04 |
Teaching Metric Distance to Autoregressive Multimodal Foundational Models |
Jiwan Chung et.al. |
2503.02379 |
null |
2025-03-04 |
MedEthicEval: Evaluating Large Language Models Based on Chinese Medical Ethics |
Haoan Jin et.al. |
2503.02374 |
null |
2025-03-04 |
EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports |
Lama Moukheiber et.al. |
2503.02365 |
null |
2025-03-04 |
Add-One-In: Incremental Sample Selection for Large Language Models via a Choice-Based Greedy Paradigm |
Zhuo Li et.al. |
2503.02359 |
null |
2025-03-04 |
Efficient Long Context Fine-tuning with Chunk Flow |
Xiulong Yuan et.al. |
2503.02356 |
null |
2025-03-04 |
CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory |
Jiashun Suo et.al. |
2503.02354 |
null |
2025-03-04 |
DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability |
Yunzhen He et.al. |
2503.02343 |
link |
2025-03-04 |
GRADEO: Towards Human-Like Evaluation for Text-to-Video Generation via Multi-Step Reasoning |
Zhun Mou et.al. |
2503.02341 |
null |
2025-03-04 |
Limited Effectiveness of LLM-based Data Augmentation for COVID-19 Misinformation Stance Detection |
Eun Cheol Choi et.al. |
2503.02328 |
null |
2025-03-04 |
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models |
Xueliang Zhao et.al. |
2503.02324 |
link |
2025-03-04 |
Generative Model-Assisted Demosaicing for Cross-multispectral Cameras |
Jiahui Luo et.al. |
2503.02322 |
null |
2025-03-04 |
Semantic Prior Distillation with Vision Foundation Model for Enhanced Rapid Bone Scintigraphy Image Restoration |
Pengchen Liang et.al. |
2503.02321 |
null |
2025-03-04 |
A Token-level Text Image Foundation Model for Document Understanding |
Tongkun Guan et.al. |
2503.02304 |
null |
2025-03-04 |
Towards Large Language Model Guided Kernel Direct Fuzzing |
Xie Li et.al. |
2503.02301 |
null |
2025-03-04 |
Towards Explainable Doctor Recommendation with Large Language Models |
Ziyang Zeng et.al. |
2503.02298 |
null |
2025-03-04 |
Memorize or Generalize? Evaluating LLM Code Generation with Evolved Questions |
Wentao Chen et.al. |
2503.02296 |
null |
2025-03-04 |
spike: A tool to drizzle HST, JWST, and Roman PSFs for improved analyses |
Ava Polzin et.al. |
2503.02288 |
link |
2025-03-04 |
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users |
Wenjia Jiang et.al. |
2503.02268 |
null |
2025-03-04 |
Large Language Models as Natural Selector for Embodied Soft Robot Design |
Changhe Chen et.al. |
2503.02249 |
null |
2025-03-04 |
Making Better Mistakes in CLIP-Based Zero-Shot Classification with Hierarchy-Aware Language Prompts |
Tong Liang et.al. |
2503.02248 |
null |
2025-03-04 |
From Code to Courtroom: LLMs as the New Software Judges |
Junda He et.al. |
2503.02246 |
null |
2025-03-04 |
OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale |
Haoyang Li et.al. |
2503.02240 |
link |
2025-03-04 |
V2X-LLM: Enhancing V2X Integration and Understanding in Connected Vehicle Corridors |
Keshu Wu et.al. |
2503.02239 |
null |
2025-03-04 |
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions |
Zirui Wu et.al. |
2503.02238 |
link |
2025-03-04 |
Enhancing LLM Reliability via Explicit Knowledge Boundary Modeling |
Hang Zheng et.al. |
2503.02233 |
null |
2025-03-04 |
ATLaS: Agent Tuning via Learning Critical Steps |
Zhixun Chen et.al. |
2503.02197 |
null |
2025-03-04 |
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models |
Saeed Ranjbar Alvar et.al. |
2503.02175 |
link |
2025-03-04 |
Leveraging Large Language Models for Enhanced Digital Twin Modeling: Trends, Methods, and Challenges |
Linyao Yang et.al. |
2503.02167 |
null |
2025-03-04 |
X2CT-CLIP: Enable Multi-Abnormality Detection in Computed Tomography from Chest Radiography via Tri-Modal Contrastive Learning |
Jianzhong You et.al. |
2503.02162 |
null |
2025-03-04 |
LLM-TabFlow: Synthetic Tabular Data Generation with Inter-column Logical Relationship Preservation |
Yunbo Long et.al. |
2503.02161 |
null |
2025-03-04 |
Tabby: Tabular Data Synthesis with Language Models |
Sonia Cromp et.al. |
2503.02152 |
null |
2025-03-04 |
Malware Classification from Memory Dumps Using Machine Learning, Transformers, and Large Language Models |
Areej Dweib et.al. |
2503.02144 |
null |
2025-03-04 |
Measuring Intrinsic Dimension of Token Embeddings |
Takuya Kataiwa et.al. |
2503.02142 |
null |
2025-03-04 |
Network Traffic Classification Using Machine Learning, Transformer, and Large Language Models |
Ahmad Antari et.al. |
2503.02141 |
null |
2025-03-03 |
TMIQ: Quantifying Test and Measurement Domain Intelligence in Large Language Models |
Emmanuel A. Olowe et.al. |
2503.02123 |
null |
2025-02-28 |
LLM Post-Training: A Deep Dive into Reasoning Large Language Models |
Komal Kumar et.al. |
2502.21321 |
null |
2025-02-28 |
How far can we go with ImageNet for Text-to-Image generation? |
L. Degeorge et.al. |
2502.21318 |
null |
2025-02-28 |
FANformer: Improving Large Language Models Through Effective Periodicity Modeling |
Yihong Dong et.al. |
2502.21309 |
null |
2025-02-28 |
Contextualizing biological perturbation experiments through language |
Menghua Wu et.al. |
2502.21290 |
link |
2025-02-28 |
Does Generation Require Memorization? Creative Diffusion Models using Ambient Diffusion |
Kulin Shah et.al. |
2502.21278 |
null |
2025-02-28 |
Adaptive Keyframe Sampling for Long Video Understanding |
Xi Tang et.al. |
2502.21271 |
null |
2025-03-03 |
Foundation Models – A Panacea for Artificial Intelligence in Pathology? |
Nita Mulliqi et.al. |
2502.21264 |
null |
2025-02-28 |
Modeling Human Beliefs about AI Behavior for Scalable Oversight |
Leon Lang et.al. |
2502.21262 |
null |
2025-02-28 |
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete |
Yuheng Ji et.al. |
2502.21257 |
null |
2025-02-28 |
TimesBERT: A BERT-Style Foundation Model for Time Series Understanding |
Haoran Zhang et.al. |
2502.21245 |
null |
2025-03-04 |
Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs |
Xiaomin Li et.al. |
2502.21239 |
null |
2025-02-28 |
Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication |
Daniil Filienko et.al. |
2502.21236 |
null |
2025-02-28 |
ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs |
Hao Ge et.al. |
2502.21231 |
null |
2025-03-03 |
ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer |
Omer Goldman et.al. |
2502.21228 |
null |
2025-02-28 |
Dynamic Markov Blanket Detection for Macroscopic Physics Discovery |
Jeff Beck et.al. |
2502.21217 |
link |
2025-02-28 |
Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought |
Jianhao Huang et.al. |
2502.21212 |
null |
2025-02-28 |
Chronologically Consistent Large Language Models |
Songrun He et.al. |
2502.21206 |
null |
2025-03-04 |
SYN-LUNGS: Towards Simulating Lung Nodules with Anatomy-Informed Digital Twins for AI Training |
Fakrul Islam Tushar et.al. |
2502.21187 |
null |
2025-02-28 |
$Δ$ -model correction of Foundation Model based on the models own understanding |
Mads-Peter Verner Christiansen et.al. |
2502.21179 |
null |
2025-03-03 |
Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models |
Ruta Binkyte et.al. |
2502.21123 |
null |
2025-02-28 |
Optimizing Large Language Models for ESG Activity Detection in Financial Texts |
Mattia Birti et.al. |
2502.21112 |
link |
2025-02-28 |
Rare event modeling with self-regularized normalizing flows: what can we learn from a single failure? |
Charles Dawson et.al. |
2502.21110 |
null |
2025-02-28 |
Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization |
Lie Meng Pang et.al. |
2502.21108 |
null |
2025-02-28 |
Generating patient cohorts from electronic health records using two-step retrieval-augmented text-to-SQL generation |
Angelo Ziletti et.al. |
2502.21107 |
null |
2025-02-28 |
A Non-contrast Head CT Foundation Model for Comprehensive Neuro-Trauma Triage |
Youngjin Yoo et.al. |
2502.21106 |
null |
2025-02-28 |
Re-evaluating Theory of Mind evaluation in large language models |
Jennifer Hu et.al. |
2502.21098 |
null |
2025-02-28 |
An LLM-based Delphi Study to Predict GenAI Evolution |
Francesco Bertolotti et.al. |
2502.21092 |
null |
2025-02-28 |
PASemiQA: Plan-Assisted Agent for Question Answering on Semi-Structured Data with Text and Relational Information |
Hansi Yang et.al. |
2502.21087 |
null |
2025-02-28 |
Are foundation models useful feature extractors for electroencephalography analysis? |
Özgün Turgut et.al. |
2502.21086 |
null |
2025-02-28 |
Spatial Reasoning with Denoising Models |
Christopher Wewer et.al. |
2502.21075 |
null |
2025-02-28 |
CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation |
Zhenyi Shen et.al. |
2502.21074 |
null |
2025-02-28 |
GUIDE: LLM-Driven GUI Generation Decomposition for Automated Prototyping |
Kristian Kolthoff et.al. |
2502.21068 |
null |
2025-02-28 |
Synthesizing Individualized Aging Brains in Health and Disease with Generative Models and Parallel Transport |
Jingru Fu et.al. |
2502.21049 |
link |
2025-02-28 |
Incorporating Long-Range Interactions via the Multipole Expansion into Ground and Excited-State Molecular Simulations |
Rhyan Barrett et.al. |
2502.21045 |
null |
2025-02-28 |
The amplifier effect of artificial agents in social contagion |
Eric Hitz et.al. |
2502.21037 |
null |
2025-02-28 |
Beyond Words: A Latent Memory Approach to Internal Reasoning in LLMs |
José I. Orlicki et.al. |
2502.21030 |
null |
2025-02-28 |
Measuring and identifying factors of individuals’ trust in Large Language Models |
Edoardo Sebastiano De Duro et.al. |
2502.21028 |
null |
2025-02-28 |
PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues |
Fangxu Yu et.al. |
2502.21017 |
null |
2025-02-28 |
Explainable Biomedical Claim Verification with Large Language Models |
Siting Liang et.al. |
2502.21014 |
null |
2025-02-28 |
Merging Clinical Knowledge into Large Language Models for Medical Research and Applications: A Survey |
Qiyuan Li et.al. |
2502.20988 |
null |
2025-02-28 |
UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation |
Thanet Markchom et.al. |
2502.20984 |
null |
2025-02-28 |
Set-Theoretic Compositionality of Sentence Embeddings |
Naman Bansal et.al. |
2502.20975 |
null |
2025-02-28 |
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval |
Chien-Yu Lin et.al. |
2502.20969 |
null |
2025-02-28 |
Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs |
Weixiang Zhao et.al. |
2502.20968 |
null |
2025-02-28 |
Fine-Grained Retrieval-Augmented Generation for Visual Question Answering |
Zhengxuan Zhang et.al. |
2502.20964 |
null |
2025-02-28 |
Efficient Jailbreaking of Large Models by Freeze Training: Lower Layers Exhibit Greater Sensitivity to Harmful Content |
Hongyuan Shen et.al. |
2502.20952 |
null |
2025-02-28 |
Generative Uncertainty in Diffusion Models |
Metod Jazbec et.al. |
2502.20946 |
null |
2025-02-28 |
A Deep User Interface for Exploring LLaMa |
Divya Perumal et.al. |
2502.20938 |
null |
2025-02-28 |
Large Language Models Are Innate Crystal Structure Generators |
Jingru Gan et.al. |
2502.20933 |
null |
2025-02-28 |
Automated Evaluation of Meter and Rhyme in Russian Generative and Human-Authored Poetry |
Ilya Koziev et.al. |
2502.20931 |
null |
2025-02-28 |
A database to support the evaluation of gender biases in GPT-4o output |
Luise Mehner et.al. |
2502.20898 |
null |
2025-02-28 |
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals’ Subjective Text Perceptions |
Matthias Orlikowski et.al. |
2502.20897 |
null |
2025-02-28 |
PathVG: A New Benchmark and Dataset for Pathology Visual Grounding |
Chunlin Zhong et.al. |
2502.20869 |
null |
2025-02-28 |
ProBench: Benchmarking Large Language Models in Competitive Programming |
Lei Yang et.al. |
2502.20868 |
null |
2025-02-28 |
The Power of Personality: A Human Simulation Perspective to Investigate Large Language Model Agents |
Yifan Duan et.al. |
2502.20859 |
null |
2025-02-28 |
Learning to Substitute Components for Compositional Generalization |
Zhaoyi Li et.al. |
2502.20834 |
null |
2025-02-28 |
CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval |
Zelong Sun et.al. |
2502.20826 |
null |
2025-02-28 |
Can We Simplify Slide-level Fine-tuning of Pathology Foundation Models? |
Jiawen Li et.al. |
2502.20823 |
null |
2025-02-28 |
Towards Reliable Vector Database Management Systems: A Software Testing Roadmap for 2030 |
Shenao Wang et.al. |
2502.20812 |
null |
2025-02-28 |
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models |
Xiao Wang et.al. |
2502.20811 |
null |
2025-02-28 |
PFD: Automatically Generating Machine Learning Force Fields from Universal Models |
Ruoyu Wang et.al. |
2502.20809 |
link |
2025-03-03 |
MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts |
Peijie Wang et.al. |
2502.20808 |
null |
2025-02-28 |
Digital Player: Evaluating Large Language Models based Human-like Agent in Games |
Jiawei Wang et.al. |
2502.20807 |
link |
2025-02-28 |
Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation |
Kuang-Da Wang et.al. |
2502.20795 |
null |
2025-02-28 |
Cyber Defense Reinvented: Large Language Models as Threat Intelligence Copilots |
Xiaoqun Liu et.al. |
2502.20791 |
null |
2025-02-28 |
Chain-of-Thought Matters: Improving Long-Context Language Models with Reasoning Path Supervision |
Dawei Zhu et.al. |
2502.20790 |
null |
2025-02-28 |
Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective |
Yuko Nakagi et.al. |
2502.20779 |
null |
2025-02-28 |
FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference |
Xunhao Lai et.al. |
2502.20766 |
link |
2025-02-28 |
Collective Reasoning Among LLMs A Framework for Answer Validation Without Ground Truth |
Seyed Pouyan Mousavi Davoudi et.al. |
2502.20758 |
null |
2025-02-28 |
The Rise of Darkness: Safety-Utility Trade-Offs in Role-Playing Dialogue Agents |
Yihong Tang et.al. |
2502.20757 |
null |
2025-02-28 |
SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models |
Yichi Zhang et.al. |
2502.20749 |
link |
2025-02-28 |
Teach-to-Reason with Scoring: Self-Explainable Rationale-Driven Multi-Trait Essay Scoring |
Heejin Do et.al. |
2502.20748 |
null |
2025-02-28 |
Measuring Determinism in Large Language Models for Software Code Review |
Eugene Klishevich et.al. |
2502.20747 |
null |
2025-02-28 |
CADDreamer: CAD object Generation from Single-view Images |
Yuan Li et.al. |
2502.20732 |
null |
2025-02-28 |
SPD: Sync-Point Drop for efficient tensor parallelism of Large Language Models |
Han-Byul Kim et.al. |
2502.20727 |
null |
2025-02-28 |
Retrieval Backward Attention without Additional Training: Enhance Embeddings of Large Language Models via Repetition |
Yifei Duan et.al. |
2502.20726 |
link |
2025-02-28 |
Generating Clinically Realistic EHR Data via a Hierarchy- and Semantics-Guided Transformer |
Guanglin Zhou et.al. |
2502.20719 |
null |
2025-02-28 |
Why Trust in AI May Be Inevitable |
Nghi Truong et.al. |
2502.20701 |
null |
2025-02-28 |
Towards General Visual-Linguistic Face Forgery Detection(V2) |
Ke Sun et.al. |
2502.20698 |
link |
2025-02-28 |
WorldModelBench: Judging Video Generation Models As World Models |
Dacheng Li et.al. |
2502.20694 |
null |
2025-02-28 |
Unleashing the Potential of Two-Tower Models: Diffusion-Based Cross-Interaction for Large-Scale Matching |
Yihan Wang et.al. |
2502.20687 |
null |
2025-02-28 |
JAM: Controllable and Responsible Text Generation via Causal Reasoning and Latent Vector Manipulation |
Yingbing Huang et.al. |
2502.20684 |
null |
2025-02-28 |
STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding |
Aaryan Garg et.al. |
2502.20678 |
null |
2025-02-28 |
SciceVPR: Stable Cross-Image Correlation Enhanced Model for Visual Place Recognition |
Shanshan Wan et.al. |
2502.20676 |
null |
2025-02-28 |
Advancing AI-Powered Medical Image Synthesis: Insights from MedVQA-GI Challenge Using CLIP, Fine-Tuned Stable Diffusion, and Dream-Booth + LoRA |
Ojonugwa Oluwafemi Ejiga Peter et.al. |
2502.20667 |
null |
2025-02-28 |
Consistency Evaluation of News Article Summaries Generated by Large (and Small) Language Models |
Colleen Gilhuly et.al. |
2502.20647 |
null |
2025-02-28 |
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation |
Haitao Li et.al. |
2502.20640 |
null |
2025-02-28 |
Can LLM Assist in the Evaluation of the Quality of Machine Learning Explanations? |
Bo Wang et.al. |
2502.20635 |
null |
2025-02-28 |
Are LLMs Ready for Practical Adoption for Assertion Generation? |
Vaishnavi Pulavarthi et.al. |
2502.20633 |
null |
2025-02-28 |
Rectifying Belief Space via Unlearning to Harness LLMs’ Reasoning |
Ayana Niwa et.al. |
2502.20620 |
null |
2025-02-28 |
Leveraging Large Language Models for Building Interpretable Rule-Based Data-to-Text Systems |
Jędrzej Warczyński et.al. |
2502.20609 |
null |
2025-02-28 |
NutriGen: Personalized Meal Plan Generator Leveraging Large Language Models to Enhance Dietary and Nutritional Adherence |
Saman Khamesian et.al. |
2502.20601 |
link |
2025-02-27 |
Few-Shot, No Problem: Descriptive Continual Relation Extraction |
Nguyen Xuan Thanh et.al. |
2502.20596 |
null |
2025-02-27 |
Multi $^2$ : Multi-Agent Test-Time Scalable Framework for Multi-Document Processing |
Juntai Cao et.al. |
2502.20592 |
null |
2025-02-27 |
LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis |
Saeif Alhazbi et.al. |
2502.20589 |
null |
2025-03-04 |
InstaFace: Identity-Preserving Facial Editing with Single Image Inference |
MD Wahiduzzaman Khan et.al. |
2502.20577 |
null |
2025-02-27 |
ECCOS: Efficient Capability and Cost Coordinated Scheduling for Multi-LLM Serving |
Kai Mei et.al. |
2502.20576 |
link |
2025-02-27 |
Visual Reasoning at Urban Intersections: FineTuning GPT-4o for Traffic Conflict Detection |
Sari Masri et.al. |
2502.20573 |
null |
2025-02-27 |
Stochastic Rounding for LLM Training: Theory and Practice |
Kaan Ozkara et.al. |
2502.20566 |
null |
2025-02-27 |
LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks |
Joana C. Costa et.al. |
2502.20562 |
null |
2025-02-27 |
R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts |
Zhongyang Li et.al. |
2502.20395 |
link |
2025-02-27 |
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions |
Sirui Xu et.al. |
2502.20390 |
null |
2025-02-27 |
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation |
Sucheng Ren et.al. |
2502.20388 |
link |
2025-02-27 |
Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis |
Jeffrey Yang Fan Chiang et.al. |
2502.20383 |
null |
2025-02-27 |
Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers |
Shalev Lifshitz et.al. |
2502.20379 |
null |
2025-02-27 |
PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation |
Albert Gong et.al. |
2502.20377 |
link |
2025-02-27 |
Constrained Generative Modeling with Manually Bridged Diffusion Models |
Saeid Naderiparizi et.al. |
2502.20371 |
null |
2025-02-27 |
Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization |
Ryan C. Barron et.al. |
2502.20364 |
null |
2025-02-27 |
Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs |
Kuan Lok Zhou et.al. |
2502.20356 |
null |
2025-02-27 |
KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model |
Kai Zhang et.al. |
2502.20350 |
null |
2025-02-27 |
Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models |
Yi Jing et.al. |
2502.20344 |
null |
2025-02-27 |
Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners |
Daniele Paliotta et.al. |
2502.20339 |
null |
2025-02-27 |
Expertise Is What We Want |
Alan Ashworth et.al. |
2502.20335 |
null |
2025-02-27 |
Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models |
Yukang Yang et.al. |
2502.20332 |
null |
2025-02-27 |
Long-Context Inference with Retrieval-Augmented Speculative Decoding |
Guanzheng Chen et.al. |
2502.20330 |
link |
2025-02-27 |
EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants |
Franck Cappello et.al. |
2502.20309 |
null |
2025-02-27 |
M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging |
Jinghao Feng et.al. |
2502.20301 |
null |
2025-02-27 |
An exploration of features to improve the generalisability of fake news detection models |
Nathaniel Hoy et.al. |
2502.20299 |
null |
2025-02-27 |
Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription |
Benjamin Gutteridge et.al. |
2502.20295 |
link |
2025-02-27 |
Conformal Tail Risk Control for Large Language Model Alignment |
Catherine Yu-Chi Chen et.al. |
2502.20285 |
null |
2025-02-27 |
Evaluating Human Trust in LLM-Based Planners: A Preliminary Study |
Shenghui Chen et.al. |
2502.20284 |
null |
2025-02-27 |
Large Language Models as Attribution Regularizers for Efficient Model Training |
Davor Vukadin et.al. |
2502.20268 |
link |
2025-02-27 |
Vector-Quantized Vision Foundation Models for Object-Centric Learning |
Rongzhen Zhao et.al. |
2502.20263 |
null |
2025-02-27 |
LLM as a Broken Telephone: Iterative Generation Distorts Information |
Amr Mohamed et.al. |
2502.20258 |
link |
2025-02-27 |
Do computer vision foundation models learn the low-level characteristics of the human visual system? |
Yancheng Cai et.al. |
2502.20256 |
null |
2025-02-27 |
Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets |
Chichien Tsai et.al. |
2502.20246 |
null |
2025-02-27 |
From Retrieval to Generation: Comparing Different Approaches |
Abdelrahman Abdallah et.al. |
2502.20245 |
null |
2025-02-27 |
FINEREASON: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving |
Guizhen Chen et.al. |
2502.20238 |
link |
2025-02-27 |
AI Will Always Love You: Studying Implicit Biases in Romantic AI Companions |
Clare Grogan et.al. |
2502.20231 |
link |
2025-02-27 |
Avat3r: Large Animatable Gaussian Reconstruction Model for High-fidelity 3D Head Avatars |
Tobias Kirschstein et.al. |
2502.20220 |
null |
2025-02-27 |
ChineseEcomQA: A Scalable E-commerce Concept Evaluation Benchmark for Large Language Models |
Haibin Chen et.al. |
2502.20196 |
link |
2025-02-27 |
Model Checking Linear Temporal Logic with Standpoint Modalities |
Rajab Aghamov et.al. |
2502.20193 |
null |
2025-02-27 |
Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge |
Yan-Lun Chen et.al. |
2502.20186 |
null |
2025-02-27 |
DGFM: Full Body Dance Generation Driven by Music Foundation Models |
Xinran Liu et.al. |
2502.20176 |
null |
2025-02-27 |
An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs |
Kaustubh Vyas et.al. |
2502.20175 |
null |
2025-02-27 |
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think |
Liang Chen et.al. |
2502.20172 |
link |
2025-02-27 |
Re-evaluating Open-ended Evaluation of Large Language Models |
Siqi Liu et.al. |
2502.20170 |
null |
2025-02-27 |
Adaptive H&E-IHC information fusion staining framework based on feature extra |
Yifan Jia et.al. |
2502.20156 |
link |
2025-02-27 |
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale |
Max M. Lang et.al. |
2502.20140 |
null |
2025-02-27 |
Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking |
Yifan Zhang et.al. |
2502.20129 |
null |
2025-02-27 |
Self-Training Elicits Concise Reasoning in Large Language Models |
Tergel Munkhbat et.al. |
2502.20122 |
link |
2025-02-27 |
LongRoPE2: Near-Lossless LLM Context Window Scaling |
Ning Shang et.al. |
2502.20082 |
link |
2025-02-27 |
Collab-Overcooked: Benchmarking and Evaluating Large Language Models as Collaborative Agents |
Haochen Sun et.al. |
2502.20073 |
link |
2025-02-27 |
A Generative Model Enhanced Multi-Agent Reinforcement Learning Method for Electric Vehicle Charging Navigation |
Tianyang Qi et.al. |
2502.20068 |
null |
2025-02-27 |
Polish-ASTE: Aspect-Sentiment Triplet Extraction Datasets for Polish |
Marta Lango et.al. |
2502.20046 |
null |
2025-02-27 |
3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds |
Hengshuo Chu et.al. |
2502.20041 |
null |
2025-02-27 |
AsymLoRA: Harmonizing Data Conflicts and Commonalities in MLLMs |
Xuyang Wei et.al. |
2502.20035 |
link |
2025-02-27 |
Erasing Without Remembering: Safeguarding Knowledge Forgetting in Large Language Models |
Huazheng Wang et.al. |
2502.19982 |
link |
2025-02-27 |
The Lookahead Limitation: Why Multi-Operand Addition is Hard for LLMs |
Tanja Baeumel et.al. |
2502.19981 |
null |
2025-02-27 |
Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios |
Chao Wang et.al. |
2502.19973 |
null |
2025-02-27 |
Deterministic or probabilistic? The psychology of LLMs as random number generators |
Javier Coronado-Blázquez et.al. |
2502.19965 |
null |
2025-02-27 |
SeisMoLLM: Advancing Seismic Monitoring via Cross-modal Transfer with Pre-trained Large Language Model |
Xinghao Wang et.al. |
2502.19960 |
link |
2025-02-27 |
Collaborative Stance Detection via Small-Large Language Model Consistency Verification |
Yu Yan et.al. |
2502.19954 |
link |
2025-02-27 |
GeoEdit: Geometric Knowledge Editing for Large Language Models |
Yujie Feng et.al. |
2502.19953 |
null |
2025-02-27 |
Algebraic Machine Learning: Learning as computing an algebraic decomposition of a task |
Fernando Martin-Maroto et.al. |
2502.19944 |
link |
2025-02-27 |
Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation |
Xiang Geng et.al. |
2502.19941 |
null |
2025-02-27 |
Playing Pokémon Red via Deep Reinforcement Learning |
Marco Pleines et.al. |
2502.19920 |
link |
2025-02-27 |
Meta-Reasoner: Dynamic Guidance for Optimized Inference-time Reasoning in Large Language Models |
Yuan Sui et.al. |
2502.19918 |
null |
2025-02-27 |
Picking the Cream of the Crop: Visual-Centric Data Selection with Collaborative Agents |
Zhenyu Liu et.al. |
2502.19917 |
link |
2025-02-27 |
LLM-driven Effective Knowledge Tracing by Integrating Dual-channel Difficulty |
Jiahui Cen et.al. |
2502.19915 |
null |
2025-02-27 |
SkipPipe: Partial and Reordered Pipelining Framework for Training LLMs in Heterogeneous Networks |
Nikolay Blagoev et.al. |
2502.19913 |
link |
2025-02-27 |
Order Doesn’t Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation |
Qianxi He et.al. |
2502.19907 |
null |
2025-02-27 |
Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy |
Zaijing Li et.al. |
2502.19902 |
null |
2025-02-27 |
GenPC: Zero-shot Point Cloud Completion via 3D Generative Priors |
An Li et.al. |
2502.19896 |
null |
2025-02-27 |
Beyond the Tip of Efficiency: Uncovering the Submerged Threats of Jailbreak Attacks in Small Language Models |
Sibo Yi et.al. |
2502.19883 |
null |
2025-02-27 |
Towards Multimodal Large-Language Models for Parent-Child Interaction: A Focus on Joint Attention |
Weiyan Shi et.al. |
2502.19877 |
null |
2025-02-27 |
MMKE-Bench: A Multimodal Editing Benchmark for Diverse Visual Knowledge |
Yuntao Du et.al. |
2502.19870 |
link |
2025-02-27 |
MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue |
Yujia Chen et.al. |
2502.19860 |
null |
2025-02-27 |
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments |
Hojae Han et.al. |
2502.19852 |
null |
2025-02-27 |
One-for-More: Continual Diffusion Model for Anomaly Detection |
Xiaofan Li et.al. |
2502.19848 |
link |
2025-02-27 |
ProAPO: Progressively Automatic Prompt Optimization for Visual Classification |
Xiangyan Qu et.al. |
2502.19844 |
link |
2025-02-27 |
Shared Stochastic Gaussian Process Latent Variable Models: A Multi-modal Generative Model for Quasar Spectra |
Vidhi Lalchand et.al. |
2502.19824 |
link |
2025-02-27 |
Foot-In-The-Door: A Multi-turn Jailbreak for LLMs |
Zixuan Weng et.al. |
2502.19820 |
link |
2025-02-27 |
Comet: Fine-grained Computation-communication Overlapping for Mixture-of-Experts |
Shulai Zhang et.al. |
2502.19811 |
link |
2025-02-27 |
Implicit Search via Discrete Diffusion: A Study on Chess |
Jiacheng Ye et.al. |
2502.19805 |
link |
2025-02-27 |
Developmental Support Approach to AI’s Autonomous Growth: Toward the Realization of a Mutually Beneficial Stage Through Experiential Learning |
Taichiro Endo et.al. |
2502.19798 |
null |
2025-02-27 |
ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model |
Chuanliu Fan et.al. |
2502.19794 |
null |
2025-02-27 |
Mixtera: A Data Plane for Foundation Model Training |
Maximilian Böther et.al. |
2502.19790 |
link |
2025-02-27 |
Advancements in Natural Language Processing for Automatic Text Summarization |
Nevidu Jayatilleke et.al. |
2502.19773 |
null |
2025-02-27 |
Does Your Voice Assistant Remember? Analyzing Conversational Context Recall and Utilization in Voice Interaction Models |
Heeseung Kim et.al. |
2502.19759 |
null |
2025-02-27 |
PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation |
Nathan Roll et.al. |
2502.19756 |
null |
2025-02-27 |
Beneath the Surface: How Large Language Models Reflect Hidden Bias |
Jinhao Pan et.al. |
2502.19749 |
link |
2025-02-27 |
HaLoRA: Hardware-aware Low-Rank Adaptation for Large Language Models Based on Hybrid Compute-in-Memory Architecture |
Taiqiang Wu et.al. |
2502.19747 |
null |
2025-02-27 |
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning |
Minggui He et.al. |
2502.19735 |
null |
2025-02-27 |
Preference Learning Unlocks LLMs’ Psycho-Counseling Skills |
Mian Zhang et.al. |
2502.19731 |
null |
2025-02-27 |
Do Expressions Change Decisions? Exploring the Impact of AI’s Explanation Tone on Decision-Making |
Ayano Okoso et.al. |
2502.19730 |
null |
2025-02-27 |
Tokens for Learning, Tokens for Unlearning: Mitigating Membership Inference Attacks in Large Language Models via Dual-Purpose Training |
Toan Tran et.al. |
2502.19726 |
null |
2025-02-27 |
Few-Shot Multilingual Open-Domain QA from 5 Examples |
Fan Jiang et.al. |
2502.19722 |
link |
2025-02-27 |
Sensing and Steering Stereotypes: Extracting and Applying Gender Representation Vectors in LLMs |
Hannah Cyberey et.al. |
2502.19721 |
link |
2025-02-27 |
Teaching Dense Retrieval Models to Specialize with Listwise Distillation and LLM Data Augmentation |
Manveer Singh Tamber et.al. |
2502.19712 |
null |
2025-02-27 |
AoECR: AI-ization of Elderly Care Robot |
Linkun Zhou et.al. |
2502.19706 |
null |
2025-02-27 |
You Only Click Once: Single Point Weakly Supervised 3D Instance Segmentation for Autonomous Driving |
Guangfeng Jiang et.al. |
2502.19698 |
null |
2025-02-27 |
M-LLM Based Video Frame Selection for Efficient Video Understanding |
Kai Hu et.al. |
2502.19680 |
null |
2025-02-27 |
Old Experience Helps: Leveraging Survey Methodology to Improve AI Text Annotation Reliability in Social Sciences |
Linzhuo li et.al. |
2502.19679 |
null |
2025-02-27 |
Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack |
Chenhe Gu et.al. |
2502.19672 |
null |
2025-02-27 |
SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning |
Mingsheng Cai et.al. |
2502.19668 |
null |
2025-02-27 |
Taxonomy, Opportunities, and Challenges of Representation Engineering for Large Language Models |
Jan Wehner et.al. |
2502.19649 |
null |
2025-02-27 |
cMIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning |
Micha Livne et.al. |
2502.19642 |
null |
2025-02-26 |
Agentic Mixture-of-Workflows for Multi-Modal Chemical Search |
Tiffany J. Callahan et.al. |
2502.19629 |
null |
2025-02-26 |
Treatment Non-Adherence Bias in Clinical Machine Learning: A Real-World Study on Hypertension Medication |
Zhongyuan Liang et.al. |
2502.19625 |
null |
2025-02-26 |
Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing |
Akshat Gupta et.al. |
2502.19416 |
null |
2025-02-26 |
Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs |
Dayu Yang et.al. |
2502.19411 |
link |
2025-02-26 |
Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices |
Xinru Wang et.al. |
2502.19410 |
null |
2025-02-26 |
ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models |
Danae Sánchez Villegas et.al. |
2502.19409 |
null |
2025-02-26 |
Learning Code-Edit Embedding to Model Student Debugging Behavior |
Hasnain Heickal et.al. |
2502.19407 |
null |
2025-02-26 |
General Reasoning Requires Learning to Reason from the Get-go |
Seungwook Han et.al. |
2502.19402 |
null |
2025-02-26 |
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding |
Max Ku et.al. |
2502.19400 |
null |
2025-02-26 |
Multi-modal Contrastive Learning for Tumor-specific Missing Modality Synthesis |
Minjoo Lim et.al. |
2502.19390 |
null |
2025-02-26 |
LiDAR Registration with Visual Foundation Models |
Niclas Vödisch et.al. |
2502.19374 |
null |
2025-02-26 |
Deep Learning For Time Series Analysis With Application On Human Motion |
Ali Ismail-Fawaz et.al. |
2502.19364 |
null |
2025-02-26 |
DataMan: Data Manager for Pre-training Large Language Models |
Ru Peng et.al. |
2502.19363 |
null |
2025-02-26 |
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? |
Yancheng He et.al. |
2502.19361 |
link |
2025-02-26 |
Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets |
Tohida Rehman et.al. |
2502.19339 |
null |
2025-02-26 |
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems |
Hao Peng et.al. |
2502.19328 |
link |
2025-02-26 |
Shh, don’t say that! Domain Certification in LLMs |
Cornelius Emde et.al. |
2502.19320 |
null |
2025-02-26 |
Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond |
Qizhou Wang et.al. |
2502.19301 |
null |
2025-02-26 |
Agent-centric Information Access |
Evangelos Kanoulas et.al. |
2502.19298 |
null |
2025-02-26 |
Complex LLM Planning via Automated Heuristics Discovery |
Hongyi Ling et.al. |
2502.19295 |
null |
2025-02-26 |
Efficient Federated Search for Retrieval-Augmented Generation |
Rachid Guerraoui et.al. |
2502.19280 |
null |
2025-02-26 |
ArtInsight: Enabling AI-Powered Artwork Engagement for Mixed Visual-Ability Families |
Arnavi Chheda-Kothary et.al. |
2502.19263 |
null |
2025-02-26 |
AI-Powered Bayesian Inference |
Veronika Ročková et.al. |
2502.19231 |
null |
2025-02-26 |
Two Heads Are Better Than One: Dual-Model Verbal Reflection at Inference-Time |
Jiazheng Li et.al. |
2502.19230 |
null |
2025-02-26 |
A Lightweight and Extensible Cell Segmentation and Classification Model for Whole Slide Images |
Nikita Shvetsov et.al. |
2502.19217 |
null |
2025-02-26 |
A Hybrid Transformer Architecture with a Quantized Self-Attention Mechanism Applied to Molecular Generation |
Anthony M. Smaldone et.al. |
2502.19214 |
link |
2025-02-26 |
Negation-Induced Forgetting in LLMs |
Francesca Capuano et.al. |
2502.19211 |
null |
2025-02-26 |
Bi’an: A Bilingual Benchmark and Model for Hallucination Detection in Retrieval-Augmented Generation |
Zhouyu Jiang et.al. |
2502.19209 |
null |
2025-02-26 |
Simulation of Language Evolution under Regulated Social Media Platforms: A Synergistic Approach of Large Language Models and Genetic Algorithms |
Jinyu Cai et.al. |
2502.19193 |
null |
2025-02-26 |
BIG-Bench Extra Hard |
Mehran Kazemi et.al. |
2502.19187 |
link |
2025-02-26 |
INFO-SEDD: Continuous Time Markov Chains as Scalable Information Metrics Estimators |
Alberto Foresti et.al. |
2502.19183 |
null |
2025-02-26 |
UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering |
Langming Liu et.al. |
2502.19178 |
null |
2025-02-26 |
MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis |
Daniel Rose et.al. |
2502.19175 |
null |
2025-02-26 |
A Model-Centric Review of Deep Learning for Protein Design |
Gregory W. Kyro et.al. |
2502.19173 |
null |
2025-02-26 |
CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation |
Kaiwen Yan et.al. |
2502.19166 |
link |
2025-02-26 |
TestNUC: Enhancing Test-Time Computing Approaches through Neighboring Unlabeled Data Consistency |
Henry Peng Zou et.al. |
2502.19163 |
null |
2025-02-26 |
Detecting Linguistic Indicators for Stereotype Assessment with Large Language Models |
Rebekka Görge et.al. |
2502.19160 |
null |
2025-02-26 |
A Sliding Layer Merging Method for Efficient Depth-Wise Pruning in LLMs |
Xuan Ding et.al. |
2502.19159 |
null |
2025-02-26 |
When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning |
Yijiang River Dong et.al. |
2502.19158 |
null |
2025-02-26 |
Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval |
Jiarong Wu et.al. |
2502.19149 |
null |
2025-02-26 |
Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs |
Zhaowei Zhang et.al. |
2502.19148 |
null |
2025-02-26 |
Identification Under the Semantic Effective Secrecy Constraint |
Abdalla Ibrahim et.al. |
2502.19142 |
null |
2025-02-26 |
A Temporal Planning Framework for Multi-Agent Systems via LLM-Aided Knowledge Base Management |
Enrico Saccon et.al. |
2502.19135 |
null |
2025-02-26 |
Self-Memory Alignment: Mitigating Factual Hallucinations with Generalized Improvement |
Siyuan Zhang et.al. |
2502.19127 |
null |
2025-02-26 |
A Survey on Foundation-Model-Based Industrial Defect Detection |
Tianle Yang et.al. |
2502.19106 |
null |
2025-02-26 |
Evaluating Gender Bias in German Machine Translation |
Michelle Kappl et.al. |
2502.19104 |
link |
2025-02-26 |
LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based Paradigm |
Siwei Wu et.al. |
2502.19103 |
null |
2025-02-26 |
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation |
Humza Sami et.al. |
2502.19091 |
link |
2025-02-26 |
EndoMamba: An Efficient Foundation Model for Endoscopic Videos |
Qingyao Tian et.al. |
2502.19090 |
null |
2025-02-26 |
Sparse Brains are Also Adaptive Brains: Cognitive-Load-Aware Dynamic Activation for LLMs |
Yiheng Yang et.al. |
2502.19078 |
null |
2025-02-26 |
IndicEval-XL: Bridging Linguistic Diversity in Code Generation Across Indic Languages |
Ujjwal Singh et.al. |
2502.19067 |
link |
2025-02-26 |
Can Large Language Models Outperform Non-Experts in Poetry Evaluation? A Comparative Study Using the Consensual Assessment Technique |
Piotr Sawicki et.al. |
2502.19064 |
null |
2025-02-26 |
MathClean: A Benchmark for Synthetic Mathematical Data Cleaning |
Hao Liang et.al. |
2502.19058 |
null |
2025-02-26 |
Beyond Surface-Level Patterns: An Essence-Driven Defense Framework Against Jailbreak Attacks in LLMs |
Shiyu Xiang et.al. |
2502.19041 |
null |
2025-02-26 |
FungalZSL: Zero-Shot Fungal Classification with Image Captioning Using a Synthetic Data Approach |
Anju Rani et.al. |
2502.19038 |
null |
2025-02-26 |
InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model |
Fengbin Guan et.al. |
2502.19026 |
null |
2025-02-26 |
Binary Neural Networks for Large Language Model: A Survey |
Liangdong Liu et.al. |
2502.19008 |
null |
2025-02-26 |
The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training |
Jinbo Wang et.al. |
2502.19002 |
null |
2025-02-26 |
MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering |
Teng Lin et.al. |
2502.18993 |
null |
2025-02-26 |
OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language Models |
Hui Feng et.al. |
2502.18992 |
null |
2025-02-26 |
GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation |
Jie He et.al. |
2502.18990 |
null |
2025-02-26 |
PEToolLLM: Towards Personalized Tool Learning in Large Language Models |
Qiancheng Xu et.al. |
2502.18980 |
null |
2025-02-26 |
Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning |
Hongyi Cal et.al. |
2502.18978 |
null |
2025-02-26 |
(Mis)Fitting: A Survey of Scaling Laws |
Margaret Li et.al. |
2502.18969 |
link |
2025-02-26 |
Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles |
Kuang Wang et.al. |
2502.18968 |
link |
2025-02-26 |
OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment |
Jiaxin Deng et.al. |
2502.18965 |
null |
2025-02-26 |
DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model |
Lei Zhao et.al. |
2502.18952 |
null |
2025-02-26 |
Towards Label-Only Membership Inference Attack against Pre-trained Large Language Models |
Yu He et.al. |
2502.18943 |
null |
2025-02-26 |
JailBench: A Comprehensive Chinese Security Assessment Benchmark for Large Language Models |
Shuyi Liu et.al. |
2502.18935 |
null |
2025-02-26 |
Talking like Piping and Instrumentation Diagrams (P&IDs) |
Achmad Anggawirya Alimin et.al. |
2502.18928 |
null |
2025-02-26 |
ClassInvGen: Class Invariant Synthesis using Large Language Models |
Chuyue Sun et.al. |
2502.18917 |
null |
2025-02-26 |
END: Early Noise Dropping for Efficient and Effective Context Denoising |
Hongye Jin et.al. |
2502.18915 |
null |
2025-02-26 |
CLLoRA: An Approach to Measure the Effects of the Context Length for LLM Fine-Tuning |
Ping Zhang et.al. |
2502.18910 |
null |
2025-02-26 |
An Empirical Study on Commit Message Generation using LLMs via In-Context Learning |
Yifan Wu et.al. |
2502.18904 |
link |
2025-02-26 |
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens |
Tong Wu et.al. |
2502.18890 |
link |
2025-02-26 |
Letters from Future Self: Augmenting the Letter-Exchange Exercise with LLM-based Agents to Enhance Young Adults’ Career Exploration |
Hayeon Jeon et.al. |
2502.18881 |
null |
2025-02-26 |
Learning to Generate Structured Output with Schema Reinforcement Learning |
Yaxi Lu et.al. |
2502.18878 |
null |
2025-02-26 |
Learning to Align Multi-Faceted Evaluation: A Unified and Robust Framework |
Kaishuai Xu et.al. |
2502.18874 |
null |
2025-02-26 |
Multi-LLM Collaborative Search for Complex Problem Solving |
Sen Yang et.al. |
2502.18873 |
null |
2025-02-26 |
A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops |
Shi Fu et.al. |
2502.18865 |
null |
2025-02-26 |
Sherlock: Towards Multi-scene Video Abnormal Event Extraction and Localization via a Global-local Spatial-sensitive LLM |
Junxiao Ma et.al. |
2502.18863 |
null |
2025-02-26 |
A Causal Lens for Evaluating Faithfulness Metrics |
Kerem Zaman et.al. |
2502.18848 |
null |
2025-02-26 |
Sliding Window Attention Training for Efficient Large Language Models |
Zichuan Fu et.al. |
2502.18845 |
null |
2025-02-26 |
Evidence-Driven Marker Extraction for Social Media Suicide Risk Detection |
Carter Adams et.al. |
2502.18823 |
null |
2025-02-26 |
Data-Efficient Multi-Agent Spatial Planning with LLMs |
Huangyuan Su et.al. |
2502.18822 |
null |
2025-02-26 |
CAMEx: Curvature-aware Merging of Experts |
Dung V. Nguyen et.al. |
2502.18821 |
link |
2025-02-26 |
Judge as A Judge: Improving the Evaluation of Retrieval-Augmented Generation through the Judge-Consistency of Large Language Models |
Shuliang Liu et.al. |
2502.18817 |
null |
2025-02-26 |
Holistic Audit Dataset Generation for LLM Unlearning via Knowledge Graph Traversal and Redundancy Removal |
Weipeng Jiang et.al. |
2502.18810 |
null |
2025-02-26 |
Optimal Stochastic Trace Estimation in Generative Modeling |
Xinyang Liu et.al. |
2502.18808 |
null |
2025-02-26 |
SolEval: Benchmarking Large Language Models for Repository-level Solidity Code Generation |
Zhiyuan Peng et.al. |
2502.18793 |
null |
2025-02-26 |
Active Few-Shot Learning for Text Classification |
Saeed Ahmadnia et.al. |
2502.18782 |
null |
2025-02-26 |
Towards Optimal Multi-draft Speculative Decoding |
Zhengmian Hu et.al. |
2502.18779 |
null |
2025-02-26 |
M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance |
Qingpei Guo et.al. |
2502.18778 |
null |
2025-02-26 |
Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance |
Xueqing Peng et.al. |
2502.18772 |
null |
2025-02-26 |
Exploring Graph Tasks with Pure LLMs: A Comprehensive Benchmark and Investigation |
Yuxiang Wang et.al. |
2502.18771 |
link |
2025-02-26 |
Reward Shaping to Mitigate Reward Hacking in RLHF |
Jiayi Fu et.al. |
2502.18770 |
link |
2025-02-26 |
CommGPT: A Graph and Retrieval-Augmented Multimodal Communication Foundation Model |
Feibo Jiang et.al. |
2502.18763 |
null |
2025-02-26 |
Training Large Recommendation Models via Graph-Language Token Alignment |
Mingdai Yang et.al. |
2502.18757 |
null |
2025-02-26 |
M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type |
Weiming Hu et.al. |
2502.18755 |
null |
2025-02-26 |
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web Platforms |
Yuwei Yan et.al. |
2502.18754 |
link |
2025-02-26 |
Spectral-Enhanced Transformers: Leveraging Large-Scale Pretrained Models for Hyperspectral Object Tracking |
Shaheer Mohamed et.al. |
2502.18748 |
null |
2025-02-26 |
Automatic Prompt Optimization via Heuristic Search: A Survey |
Wendi Cui et.al. |
2502.18746 |
null |
2025-02-25 |
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers |
Xueguang Ma et.al. |
2502.18460 |
link |
2025-02-25 |
LLM-Based Design Pattern Detection |
Christian Schindler et.al. |
2502.18458 |
null |
2025-02-25 |
FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response |
Mollie Shichman et.al. |
2502.18452 |
null |
2025-02-25 |
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution |
Yuxiang Wei et.al. |
2502.18449 |
null |
2025-02-25 |
MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning |
Chanwoo Park et.al. |
2502.18439 |
null |
2025-02-25 |
TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning |
Frederikus Hudi et.al. |
2502.18431 |
link |
2025-02-25 |
OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference |
Xiangyu Zhao et.al. |
2502.18411 |
link |
2025-02-25 |
Enhancing DNA Foundation Models to Address Masking Inefficiencies |
Monireh Safari et.al. |
2502.18405 |
null |
2025-02-25 |
Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods |
Nicola Cecere et.al. |
2502.18389 |
null |
2025-02-25 |
How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities |
Minhua Lin et.al. |
2502.18387 |
null |
2025-02-25 |
MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning |
Sepehr Asgarian et.al. |
2502.18371 |
null |
2025-02-25 |
Sparse Bayesian Generative Modeling for Joint Parameter and Channel Estimation |
Benedikt Böck et.al. |
2502.18369 |
null |
2025-02-25 |
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation |
Yifan Pu et.al. |
2502.18364 |
null |
2025-02-25 |
Responsible AI Agents |
Deven R. Desai et.al. |
2502.18359 |
null |
2025-02-25 |
Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation |
Jessica He et.al. |
2502.18357 |
null |
2025-02-25 |
BRIDO: Bringing Democratic Order to Abstractive Summarization |
Junhyun Lee et.al. |
2502.18342 |
null |
2025-02-25 |
Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology |
Romy Beauté et.al. |
2502.18318 |
null |
2025-02-25 |
GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music |
Xinran Liu et.al. |
2502.18309 |
null |
2025-02-25 |
RefuteBench 2.0 – Agentic Benchmark for Dynamic Evaluation of LLM Responses to Refutation Instruction |
Jianhao Yan et.al. |
2502.18308 |
null |
2025-02-25 |
LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation |
Pengzhi Li et.al. |
2502.18302 |
null |
2025-02-25 |
Bayesian Computation in Deep Learning |
Wenlong Chen et.al. |
2502.18300 |
null |
2025-02-25 |
DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis |
Zeju Li et.al. |
2502.18297 |
null |
2025-02-25 |
AMPO: Active Multi-Preference Optimization |
Taneesh Gupta et.al. |
2502.18293 |
null |
2025-02-25 |
Better Aligned with Survey Respondents or Training Data? Unveiling Political Leanings of LLMs on U.S. Supreme Court Cases |
Shanshan Xu et.al. |
2502.18282 |
null |
2025-02-25 |
Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support |
Guoxin Wang et.al. |
2502.18274 |
link |
2025-02-25 |
Imperfect Knowledge Management (IKM) in GEFRED (GENeralized model for Fuzzy RElational Databases) |
Leoncio Jimenez et.al. |
2502.18255 |
null |
2025-02-25 |
Iterative Counterfactual Data Augmentation |
Mitchell Plyler et.al. |
2502.18249 |
link |
2025-02-25 |
Unveiling and Causalizing CoT: A Causal Pespective |
Jiarun Fu et.al. |
2502.18239 |
null |
2025-02-25 |
Beyond the convexity assumption: Realistic tabular data generation under quantifier-free real linear constraints |
Mihaela Cătălina Stoian et.al. |
2502.18237 |
link |
2025-02-25 |
Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent |
Xiaofeng Wang et.al. |
2502.18228 |
null |
2025-02-25 |
From ChatGPT to DeepSeek: Can LLMs Simulate Humanity? |
Qian Wang et.al. |
2502.18210 |
null |
2025-02-25 |
LAG: LLM agents for Leaderboard Auto Generation on Demanding |
Jian Wu et.al. |
2502.18209 |
null |
2025-02-25 |
Grandes modelos de lenguaje: de la predicción de palabras a la comprensión? |
Carlos Gómez-Rodríguez et.al. |
2502.18205 |
null |
2025-02-25 |
Intersubjective Model of AI-mediated Communication: Augmenting Human-Human Text Chat through LLM-based Adaptive Agent Pair |
Shutaro Aoyama et.al. |
2502.18201 |
null |
2025-02-25 |
Task-Agnostic Semantic Communication with Multimodal Foundation Models |
Jiangjing Hu et.al. |
2502.18200 |
null |
2025-02-25 |
Agnostic calculation of atomic free energies with the descriptor density of states |
Thomas D Swinburne et.al. |
2502.18191 |
link |
2025-02-25 |
ChatMotion: A Multimodal Multi-Agent for Human Motion Analysis |
Li Lei et.al. |
2502.18180 |
null |
2025-02-25 |
Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs |
Gaye Colakoglu et.al. |
2502.18179 |
link |
2025-02-25 |
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification |
Mingkun Zhang et.al. |
2502.18176 |
link |
2025-02-25 |
SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models |
Zhang Yuxuan et.al. |
2502.18168 |
null |
2025-02-25 |
Can LLMs Explain Themselves Counterfactually? |
Zahra Dehghanighobadi et.al. |
2502.18156 |
null |
2025-02-25 |
Carbon and Silicon, Coexist or Compete? A Survey on Human-AI Interactions in Agent-based Modeling and Simulation |
Ziyue Lin et.al. |
2502.18145 |
null |
2025-02-25 |
LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented Searchers |
Zhuocheng Zhang et.al. |
2502.18139 |
link |
2025-02-25 |
Large Language Model Driven Agents for Simulating Echo Chamber Formation |
Chenhao Gu et.al. |
2502.18138 |
null |
2025-02-25 |
Inverse Materials Design by Large Language Model-Assisted Generative Framework |
Yun Hao et.al. |
2502.18127 |
link |
2025-02-25 |
HyperG: Hypergraph-Enhanced LLMs for Structured Knowledge |
Sirui Huang et.al. |
2502.18125 |
null |
2025-02-25 |
Bayesian Optimization for Controlled Image Editing via LLMs |
Chengkun Cai et.al. |
2502.18116 |
null |
2025-02-25 |
PromptMID: Modal Invariant Descriptors Based on Diffusion and Vision Foundation Models for Optical-SAR Image Matching |
Han Nie et.al. |
2502.18104 |
link |
2025-02-25 |
Detecting Offensive Memes with Social Biases in Singapore Context Using Multimodal Large Language Models |
Cao Yuxuan et.al. |
2502.18101 |
link |
2025-02-25 |
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning |
Wenkai Yang et.al. |
2502.18080 |
null |
2025-02-25 |
Examining the Threat Landscape: Foundation Models and Model Stealing |
Ankita Raj et.al. |
2502.18077 |
null |
2025-02-25 |
MRBTP: Efficient Multi-Robot Behavior Tree Planning and Collaboration |
Yishuai Cai et.al. |
2502.18072 |
link |
2025-02-25 |
Golden Ratio Mixing of Real and Synthetic Data for Stabilizing Generative Model Training |
Hengzhi He et.al. |
2502.18049 |
null |
2025-02-25 |
AutoCas: Autoregressive Cascade Predictor in Social Networks via Large Language Models |
Yuhao Zheng et.al. |
2502.18040 |
null |
2025-02-25 |
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble |
Zhijun Chen et.al. |
2502.18036 |
link |
2025-02-25 |
Detecting Knowledge Boundary of Vision Large Language Models by Sampling-Based Inference |
Zhuo Chen et.al. |
2502.18023 |
null |
2025-02-25 |
AfroXLMR-Comet: Multilingual Knowledge Distillation with Attention Matching for Low-Resource languages |
Joshua Sakthivel Raju et.al. |
2502.18020 |
null |
2025-02-25 |
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms |
Yashan Wang et.al. |
2502.18008 |
null |
2025-02-25 |
Unveiling the Key Factors for Distilling Chain-of-Thought Reasoning |
Xinghao Chen et.al. |
2502.18001 |
link |
2025-02-25 |
Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation |
Guang Lin et.al. |
2502.17972 |
null |
2025-02-25 |
LLM Knows Geometry Better than Algebra: Numerical Understanding of LLM-Based Agents in A Trading Arena |
Tianmi Ma et.al. |
2502.17967 |
link |
2025-02-25 |
Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments |
Patomporn Payoungkhamdee et.al. |
2502.17956 |
null |
2025-02-25 |
DeepSeek-R1 Outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in Bilingual Complex Ophthalmology Reasoning |
Pusheng Xu et.al. |
2502.17947 |
null |
2025-02-25 |
Assessing Large Language Models in Agentic Multilingual National Bias |
Qianying Liu et.al. |
2502.17945 |
null |
2025-02-25 |
CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation |
Haitao Li et.al. |
2502.17943 |
link |
2025-02-25 |
Advantage-Guided Distillation for Preference Alignment in Small Language Models |
Shiping Gao et.al. |
2502.17927 |
link |
2025-02-25 |
LeanProgress: Guiding Search for Neural Theorem Proving via Proof Progress Prediction |
Suozhi Huang et.al. |
2502.17925 |
null |
2025-02-25 |
FACT-AUDIT: An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models |
Hongzhan Lin et.al. |
2502.17924 |
link |
2025-02-25 |
Towards Sustainable Web Agents: A Plea for Transparency and Dedicated Metrics for Energy Consumption |
Lars Krupp et.al. |
2502.17903 |
null |
2025-02-25 |
Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs |
Che Liu et.al. |
2502.17900 |
null |
2025-02-25 |
Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation |
Tong Li et.al. |
2502.17899 |
null |
2025-02-25 |
FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real |
Weiheng Liu et.al. |
2502.17894 |
null |
2025-02-25 |
RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts |
Mingyan Wu et.al. |
2502.17888 |
link |
2025-02-25 |
Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers |
Hannah Calzi Kleidermacher et.al. |
2502.17882 |
null |
2025-02-25 |
EEGM2: An Efficient Mamba-2-Based Self-Supervised Framework for Long-Sequence EEG Modeling |
Jiazhen Hong et.al. |
2502.17873 |
link |
2025-02-25 |
ASurvey: Spatiotemporal Consistency in Video Generation |
Zhiyu Yin et.al. |
2502.17863 |
null |
2025-02-25 |
HRR: Hierarchical Retrospection Refinement for Generated Image Detection |
Peipei Yuan et.al. |
2502.17862 |
null |
2025-02-25 |
LR ${}^{2}$ Bench: Evaluating Long-chain Reflective Reasoning Capabilities of Large Language Models via Constraint Satisfaction Problems |
Jianghao Chen et.al. |
2502.17848 |
null |
2025-02-25 |
Quantifying interdisciplinary synergy in higher STEM education |
Gahyoun Gim et.al. |
2502.17841 |
null |
2025-02-25 |
A Combinatorial Identities Benchmark for Theorem Proving via Automated Theorem Generation |
Beibei Xiong et.al. |
2502.17840 |
null |
2025-02-25 |
TagGAN: A Generative Model for Data Tagging |
Muhammad Nawaz et.al. |
2502.17836 |
null |
2025-02-25 |
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks |
Hyeonjeong Ha et.al. |
2502.17832 |
link |
2025-02-25 |
A General Framework to Enhance Fine-tuning-based LLM Unlearning |
Jie Ren et.al. |
2502.17823 |
link |
2025-02-25 |
An Overview of Large Language Models for Statisticians |
Wenlong Ji et.al. |
2502.17814 |
null |
2025-02-25 |
Can Multimodal LLMs Perform Time Series Anomaly Detection? |
Xiongxiao Xu et.al. |
2502.17812 |
link |
2025-02-25 |
URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models |
Ruiqi Yan et.al. |
2502.17810 |
null |
2025-02-25 |
DocPuzzle: A Process-Aware Benchmark for Evaluating Realistic Long-Context Reasoning Capabilities |
Tianyi Zhuang et.al. |
2502.17807 |
null |
2025-02-25 |
Your Language Model May Think Too Rigidly: Achieving Reasoning Consistency with Symmetry-Enhanced Training |
Yihang Yao et.al. |
2502.17800 |
null |
2025-02-25 |
AIR: Complex Instruction Generation via Automatic Iterative Refinement |
Wei Liu et.al. |
2502.17787 |
link |
2025-02-25 |
Exploring the Potential of Large Language Models for Estimating the Reading Comprehension Question Difficulty |
Yoshee Jain et.al. |
2502.17785 |
null |
2025-02-25 |
Tip of the Tongue Query Elicitation for Simulated Evaluation |
Yifan He et.al. |
2502.17776 |
link |
2025-02-25 |
FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks |
Tanawan Premsri et.al. |
2502.17775 |
link |
2025-02-25 |
Uncertainty Quantification for LLM-Based Survey Simulations |
Chengpiao Huang et.al. |
2502.17773 |
null |
2025-02-25 |
DeepSeek vs. ChatGPT: A Comparative Study for Scientific Computing and Scientific Machine Learning Tasks |
Qile Jiang et.al. |
2502.17764 |
null |
2025-02-25 |
Design and implementation of a distributed security threat detection system integrating federated learning and multimodal LLM |
Yuqing Wang et.al. |
2502.17763 |
null |
2025-02-25 |
Detection of LLM-Paraphrased Code and Identification of the Responsible LLM Using Coding Style Features |
Shinwoo Park et.al. |
2502.17749 |
null |
2025-02-24 |
LLM Inference Acceleration via Efficient Operation Fusion |
Mahsa Salmani et.al. |
2502.17728 |
null |
2025-02-24 |
Can Score-Based Generative Modeling Effectively Handle Medical Image Classification? |
Sushmita Sarker et.al. |
2502.17727 |
link |
2025-02-24 |
Spontaneous Giving and Calculated Greed in Language Models |
Yuxuan Li et.al. |
2502.17720 |
null |
2025-02-24 |
Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures |
Akhila Yerukola et.al. |
2502.17710 |
link |
2025-02-24 |
Fractal Generative Models |
Tianhong Li et.al. |
2502.17437 |
link |
2025-02-24 |
Introducing Visual Perception Token into Multimodal Large Language Model |
Runpeng Yu et.al. |
2502.17425 |
link |
2025-02-24 |
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs |
Jiarui Zhang et.al. |
2502.17422 |
link |
2025-02-24 |
LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification |
Penghui Yang et.al. |
2502.17421 |
link |
2025-02-24 |
The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence |
Tom Wollschläger et.al. |
2502.17420 |
null |
2025-02-24 |
From System 1 to System 2: A Survey of Reasoning Large Language Models |
Zhong-Zhi Li et.al. |
2502.17419 |
link |
2025-02-24 |
Reasoning with Latent Thoughts: On the Power of Looped Transformers |
Nikunj Saunshi et.al. |
2502.17416 |
null |
2025-02-24 |
COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs |
Liming Liu et.al. |
2502.17410 |
link |
2025-02-24 |
Large Language Models are Powerful EHR Encoders |
Stefan Hegselmann et.al. |
2502.17403 |
link |
2025-02-24 |
What is a Good Question? Utility Estimation with LLM-based Simulations |
Dong-Ho Lee et.al. |
2502.17383 |
null |
2025-02-24 |
KV-Edit: Training-Free Image Editing for Precise Background Preservation |
Tianrui Zhu et.al. |
2502.17363 |
link |
2025-02-24 |
A Closer Look at TabPFN v2: Strength, Limitation, and Extension |
Han-Jia Ye et.al. |
2502.17361 |
null |
2025-02-24 |
RELICT: A Replica Detection Framework for Medical Image Generation |
Orhun Utku Aydin et.al. |
2502.17360 |
link |
2025-02-24 |
On Relation-Specific Neurons in Large Language Models |
Yihong Liu et.al. |
2502.17355 |
link |
2025-02-24 |
How Scientists Use Large Language Models to Program |
Gabrielle O’Brien et.al. |
2502.17348 |
null |
2025-02-24 |
Time series forecasting based on optimized LLM for fault prediction in distribution power grid insulators |
João Pedro Matos-Carvalho et.al. |
2502.17341 |
null |
2025-02-24 |
HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization |
Zhenghao Liu et.al. |
2502.17315 |
link |
2025-02-24 |
Delta Decompression for MoE-based LLMs Compression |
Hao Gu et.al. |
2502.17298 |
link |
2025-02-24 |
Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts |
Zhenghao Liu et.al. |
2502.17297 |
link |
2025-02-24 |
Integrating protein sequence embeddings with structure via graph-based deep learning for the prediction of single-residue properties |
Kevin Michalewicz et.al. |
2502.17294 |
link |
2025-02-24 |
Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing |
Yi-Kai Zhang et.al. |
2502.17282 |
link |
2025-02-24 |
MonoTODia: Translating Monologue Requests to Task-Oriented Dialogues |
Sebastian Steindl et.al. |
2502.17268 |
null |
2025-02-24 |
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective |
Chengyin Xu et.al. |
2502.17262 |
null |
2025-02-24 |
Detecting Benchmark Contamination Through Watermarking |
Tom Sander et.al. |
2502.17259 |
null |
2025-02-24 |
REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective |
Simon Geisler et.al. |
2502.17254 |
null |
2025-02-24 |
Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search |
Boyan Li et.al. |
2502.17248 |
null |
2025-02-24 |
Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction |
Tianpeng Li et.al. |
2502.17239 |
link |
2025-02-24 |
Making LLMs Reason? The Intermediate Language Problem in Neurosymbolic Approaches |
Alexander Beiser et.al. |
2502.17216 |
null |
2025-02-24 |
CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought |
Boxuan Zhang et.al. |
2502.17214 |
link |
2025-02-24 |
Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following |
Jie Zeng et.al. |
2502.17204 |
link |
2025-02-24 |
IGDA: Interactive Graph Discovery through Large Language Model Agents |
Alex Havrilla et.al. |
2502.17189 |
null |
2025-02-24 |
Evaluating Expert Contributions in a MoE LLM for Quiz-Based Tasks |
Andrei Chernov et.al. |
2502.17187 |
null |
2025-02-24 |
Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric |
Yuming Yang et.al. |
2502.17184 |
link |
2025-02-24 |
Unsupervised Accelerated MRI Reconstruction via Ground-Truth-Free Flow Matching |
Xinzhe Luo et.al. |
2502.17174 |
null |
2025-02-24 |
Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch |
Xueru Wen et.al. |
2502.17173 |
null |
2025-02-24 |
Logic Haystacks: Probing LLMs Long-Context Logical Reasoning (Without Easily Identifiable Unrelated Padding) |
Damien Sileo et.al. |
2502.17169 |
null |
2025-02-24 |
JUREX-4E: Juridical Expert-Annotated Four-Element Knowledge Base for Legal Reasoning |
Huanghai Liu et.al. |
2502.17166 |
link |
2025-02-24 |
MEMERAG: A Multilingual End-to-End Meta-Evaluation Benchmark for Retrieval Augmented Generation |
María Andrea Cruz Blandón et.al. |
2502.17163 |
null |
2025-02-24 |
Real-time Monitoring of Economic Shocks using Company Websites |
Michael Koenig et.al. |
2502.17161 |
null |
2025-02-24 |
A Pragmatic Note on Evaluating Generative Models with Fréchet Inception Distance for Retinal Image Synthesis |
Yuli Wu et.al. |
2502.17160 |
null |
2025-02-24 |
Parameter Efficient Merging for Multimodal Large Language Models with Complementary Parameter Adaptation |
Fanhu Zeng et.al. |
2502.17159 |
null |
2025-02-24 |
CodeSwift: Accelerating LLM Inference for Efficient Code Generation |
Qianhui Zhao et.al. |
2502.17139 |
null |
2025-02-24 |
Evaluating the Effectiveness of Large Language Models in Automated News Article Summarization |
Lionel Richy Panlap Houamegni et.al. |
2502.17136 |
null |
2025-02-24 |
Applications of Large Models in Medicine |
YunHe Su et.al. |
2502.17132 |
null |
2025-02-24 |
Thus Spake Long-Context Large Language Model |
Xiaoran Liu et.al. |
2502.17129 |
null |
2025-02-24 |
Adversarial Training for Defense Against Label Poisoning Attacks |
Melis Ilayda Bal et.al. |
2502.17121 |
link |
2025-02-24 |
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions |
Zhong Li et.al. |
2502.17119 |
link |
2025-02-24 |
SFLD: Reducing the content bias for AI-generated Image Detection |
Seoyeon Gye et.al. |
2502.17105 |
null |
2025-02-24 |
Generative Models in Decision Making: A Survey |
Yinchuan Li et.al. |
2502.17100 |
null |
2025-02-24 |
Improved Diffusion-based Generative Model with Better Adversarial Robustness |
Zekun Wang et.al. |
2502.17099 |
link |
2025-02-24 |
Conditional Diffusion-Flow models for generating 3D cosmic density fields: applications to f(R) cosmologies |
Julieth Katherine Riveros et.al. |
2502.17087 |
null |
2025-02-24 |
Automatically Evaluating the Paper Reviewing Capability of Large Language Models |
Hyungyu Shin et.al. |
2502.17086 |
null |
2025-02-24 |
Pleno-Generation: A Scalable Generative Face Video Compression Framework with Bandwidth Intelligence |
Bolin Chen et.al. |
2502.17085 |
null |
2025-02-24 |
Systematic Weight Evaluation for Pruning Large Language Models: Enhancing Performance and Sustainability |
Ashhadul Islam et.al. |
2502.17071 |
null |
2025-02-24 |
LLM-QE: Improving Query Expansion by Aligning Large Language Models with Ranking Preferences |
Sijia Yao et.al. |
2502.17057 |
link |
2025-02-24 |
PrivaCI-Bench: Evaluating Privacy with Contextual Integrity and Legal Compliance |
Haoran Li et.al. |
2502.17041 |
link |
2025-02-24 |
Evolution 6.0: Evolving Robotic Capabilities Through Generative Design |
Muhammad Haris Khan et.al. |
2502.17034 |
null |
2025-02-24 |
Understanding the Uncertainty of LLM Explanations: A Perspective Based on Reasoning Topology |
Longchao Da et.al. |
2502.17026 |
null |
2025-02-24 |
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization |
Zixuan Gong et.al. |
2502.17024 |
null |
2025-02-24 |
Quantifying Logical Consistency in Transformers via Query-Key Alignment |
Eduard Tulchinskii et.al. |
2502.17017 |
null |
2025-02-24 |
Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation |
Jaskaran Singh Walia et.al. |
2502.17011 |
null |
2025-02-24 |
Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators |
Shixin Zhao et.al. |
2502.17006 |
null |
2025-02-24 |
An Enhanced Large Language Model For Cross Modal Query Understanding System Using DL-KeyBERT Based CAZSSCL-MPGPT |
Shreya Singh et.al. |
2502.17000 |
null |
2025-02-24 |
Active Learning for Conditional Inverse Design with Crystal Generation and Foundation Atomic Models |
Zhuoyuan Li et.al. |
2502.16984 |
null |
2025-02-24 |
LongSafety: Evaluating Long-Context Safety of Large Language Models |
Yida Lu et.al. |
2502.16971 |
link |
2025-02-24 |
Autoregressive Image Generation Guided by Chains of Thought |
Miaomiao Cai et.al. |
2502.16965 |
null |
2025-02-24 |
Make LLM Inference Affordable to Everyone: Augmenting GPU Memory with NDP-DIMM |
Lian Liu et.al. |
2502.16963 |
null |
2025-02-24 |
UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings |
Layba Fiaz et.al. |
2502.16961 |
null |
2025-02-24 |
Lean and Mean: Decoupled Value Policy Optimization with Global Value Guidance |
Chenghua Huang et.al. |
2502.16944 |
null |
2025-02-24 |
Reasoning Does Not Necessarily Improve Role-Playing Ability |
Xiachong Feng et.al. |
2502.16940 |
null |
2025-02-24 |
BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference |
Zewen Jin et.al. |
2502.16927 |
null |
2025-02-24 |
FilterLLM: Text-To-Distribution LLM for Billion-Scale Cold-Start Recommendation |
Ruochen Liu et.al. |
2502.16924 |
null |
2025-02-24 |
A Systematic Survey of Automatic Prompt Optimization Techniques |
Kiran Ramnath et.al. |
2502.16923 |
null |
2025-02-24 |
Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties |
Zhenglin Wang et.al. |
2502.16922 |
link |
2025-02-24 |
SS-MPC: A Sequence-Structured Multi-Party Conversation System |
Yoonjin Jang et.al. |
2502.16920 |
null |
2025-02-24 |
Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model |
Kang Fu et.al. |
2502.16915 |
null |
2025-02-24 |
SPARC: Score Prompting and Adaptive Fusion for Zero-Shot Multi-Label Recognition in Vision-Language Models |
Kevin Miller et.al. |
2502.16911 |
null |
2025-02-24 |
AutoLogi: Automated Generation of Logic Puzzles for Evaluating Reasoning Abilities of Large Language Models |
Qin Zhu et.al. |
2502.16906 |
link |
2025-02-24 |
GuidedBench: Equipping Jailbreak Evaluation with Guidelines |
Ruixuan Huang et.al. |
2502.16903 |
null |
2025-02-24 |
Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinment |
Suchae Jeong et.al. |
2502.16902 |
null |
2025-02-24 |
Char-mander Use mBackdoor! A Study of Cross-lingual Backdoor Attacks in Multilingual LLMs |
Himanshu Beniwal et.al. |
2502.16901 |
link |
2025-02-24 |
Zero-shot Load Forecasting for Integrated Energy Systems: A Large Language Model-based Framework with Multi-task Learning |
Jiaheng Li et.al. |
2502.16896 |
null |
2025-02-24 |
Unlocking Scientific Concepts: How Effective Are LLM-Generated Analogies for Student Understanding and Classroom Practice? |
Zekai Shao et.al. |
2502.16895 |
null |
2025-02-24 |
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment |
Chenghao Fan et.al. |
2502.16894 |
null |
2025-02-24 |
Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data |
Yejian Zhang et.al. |
2502.16892 |
null |
2025-02-24 |
Unveiling Institution-Specific Bias in Pathology Foundation Models: Detriments, Causes, and Potential Solutions |
Weiping Lin et.al. |
2502.16889 |
null |
2025-02-24 |
DBudgetKV: Dynamic Budget in KV Cache Compression for Ensuring Optimal Performance |
Xuanfan Ni et.al. |
2502.16886 |
null |
2025-02-24 |
CORAL: Learning Consistent Representations across Multi-step Training with Lighter Speculative Drafter |
Yepeng Weng et.al. |
2502.16880 |
null |
2025-02-24 |
A Multi-LLM-Agent-Based Framework for Economic and Public Policy Analysis |
Yuzhi Hao et.al. |
2502.16879 |
null |
2025-02-24 |
Graphy’our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data |
Longbin Lai et.al. |
2502.16868 |
null |
2025-02-24 |
Leveraging Large Language Models for Effective and Explainable Multi-Agent Credit Assignment |
Kartik Nagpal et.al. |
2502.16863 |
null |
2025-02-24 |
LongAttn: Selecting Long-context Training Data via Token-level Attention |
Longyun Wu et.al. |
2502.16860 |
link |
2025-02-24 |
Sarang at DEFACTIFY 4.0: Detecting AI-Generated Text Using Noised Data and an Ensemble of DeBERTa Models |
Avinash Trivedi et.al. |
2502.16857 |
null |
2025-02-24 |
Improving LLM General Preference Alignment via Optimistic Online Mirror Descent |
Yuheng Zhang et.al. |
2502.16852 |
null |
2025-02-24 |
Exploring Causes and Mitigation of Hallucinations in Large Vision Language Models |
Yaqi Sun et.al. |
2502.16842 |
null |
2025-02-24 |
Fair Foundation Models for Medical Image Analysis: Challenges and Perspectives |
Dilermando Queiroz et.al. |
2502.16841 |
null |
2025-02-24 |
In-context learning of evolving data streams with tabular foundational models |
Afonso Lourenço et.al. |
2502.16840 |
null |
2025-02-24 |
“Actionable Help” in Crises: A Novel Dataset and Resource-Efficient Models for Identifying Request and Offer Social Media Posts |
Rabindra Lamsal et.al. |
2502.16839 |
null |
2025-02-24 |
REGen: A Reliable Evaluation Framework for Generative Event Argument Extraction |
Omar Sharif et.al. |
2502.16838 |
null |
2025-02-24 |
Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization |
Yao Xiao et.al. |
2502.16825 |
null |
2025-02-21 |
ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval |
Guanqi Zhan et.al. |
2502.15682 |
null |
2025-02-21 |
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training |
Jaydeep Borkar et.al. |
2502.15680 |
link |
2025-02-21 |
FLEKE: Federated Locate-then-Edit Knowledge Editing |
Zongkai Zhao et.al. |
2502.15677 |
link |
2025-02-21 |
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind |
Zhining Zhang et.al. |
2502.15676 |
link |
2025-02-21 |
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling |
Florent Bartoccioni et.al. |
2502.15672 |
link |
2025-02-21 |
Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing |
Shoumik Saha et.al. |
2502.15666 |
link |
2025-02-21 |
Machine-generated text detection prevents language model collapse |
George Drayson et.al. |
2502.15654 |
null |
2025-02-21 |
Empowering LLMs with Logical Reasoning: A Comprehensive Survey |
Fengxiang Cheng et.al. |
2502.15652 |
null |
2025-02-21 |
Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models |
Anirudh Sundar et.al. |
2502.15639 |
null |
2025-02-21 |
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification |
Vasilii Feofanov et.al. |
2502.15637 |
link |
2025-02-21 |
The Relationship Between Reasoning and Performance in Large Language Models – o3 (mini) Thinks Harder, Not Longer |
Marthe Ballon et.al. |
2502.15631 |
link |
2025-02-21 |
Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing |
Qi Le et.al. |
2502.15618 |
link |
2025-02-21 |
On the Robustness of Transformers against Context Hijacking for Linear Classification |
Tianle Li et.al. |
2502.15609 |
null |
2025-02-21 |
Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance |
Akos Nagy et.al. |
2502.15604 |
null |
2025-02-21 |
Do Multilingual LLMs Think In English? |
Lisa Schut et.al. |
2502.15603 |
null |
2025-02-21 |
WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents |
Xinhang Liu et.al. |
2502.15601 |
null |
2025-02-21 |
SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention |
Jiaqi Wu et.al. |
2502.15594 |
null |
2025-02-21 |
Generalizing From Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning |
Wenhao Zhu et.al. |
2502.15592 |
link |
2025-02-21 |
LightThinker: Thinking Step-by-Step Compression |
Jintian Zhang et.al. |
2502.15589 |
null |
2025-02-21 |
Chats-Grid: An Iterative Retrieval Q&A Optimization Scheme Leveraging Large Model and Retrieval Enhancement Generation in smart grid |
Yunfeng Li et.al. |
2502.15583 |
null |
2025-02-21 |
Fine-tuning foundation models of materials interatomic potentials with frozen transfer learning |
Mariia Radova et.al. |
2502.15582 |
null |
2025-02-21 |
Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders |
Xuansheng Wu et.al. |
2502.15576 |
null |
2025-02-21 |
DReSD: Dense Retrieval for Speculative Decoding |
Milan Gritta et.al. |
2502.15572 |
link |
2025-02-21 |
A Cautionary Tale About “Neutrally” Informative AI Tools Ahead of the 2025 Federal Elections in Germany |
Ina Dormuth et.al. |
2502.15568 |
null |
2025-02-21 |
PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning |
Pengcheng Huang et.al. |
2502.15543 |
link |
2025-02-21 |
Accurate and efficient machine learning interatomic potentials for finite temperature modeling of molecular crystals |
Flaviano Della Pia et.al. |
2502.15530 |
null |
2025-02-21 |
Scaling Sparse and Dense Retrieval in Decoder-Only LLMs |
Hansi Zeng et.al. |
2502.15526 |
link |
2025-02-21 |
Towards Swift Serverless LLM Cold Starts with ParaServe |
Chiheng Lou et.al. |
2502.15524 |
null |
2025-02-21 |
Activation Steering in Neural Theorem Provers |
Shashank Kirtania et.al. |
2502.15507 |
null |
2025-02-21 |
Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing |
Masaya Kobayashi et.al. |
2502.15506 |
null |
2025-02-21 |
Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models |
Ya Wang et.al. |
2502.15499 |
link |
2025-02-21 |
Programmers Aren’t Obsolete Yet: A Syllabus for Teaching CS Students to Responsibly Use Large Language Models for Code Generation |
Bruno Pereira Cipriano et.al. |
2502.15493 |
null |
2025-02-21 |
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models |
Martina Miliani et.al. |
2502.15487 |
null |
2025-02-21 |
Enhancing RWKV-based Language Models for Long-Sequence Text Generation |
Xinghan Pan et.al. |
2502.15485 |
link |
2025-02-21 |
FaultGPT: Industrial Fault Diagnosis Question Answering System by Vision Language Models |
Jiao Chen et.al. |
2502.15481 |
null |
2025-02-21 |
PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System |
Yintao He et.al. |
2502.15470 |
null |
2025-02-21 |
Mitigating Data Scarcity in Time Series Analysis: A Foundation Model with Series-Symbol Data Generation |
Wenxuan Wang et.al. |
2502.15466 |
null |
2025-02-21 |
Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs |
Gengyuan Zhang et.al. |
2502.15457 |
null |
2025-02-21 |
R-LoRA: Random Initialization of Multi-Head LoRA for Multi-Task Learning |
Jinda Liu et.al. |
2502.15455 |
link |
2025-02-21 |
A fast convergence algorithm based on binary integer programming for expert load balancing in MoE LLMs |
Yuan Sun et.al. |
2502.15451 |
null |
2025-02-21 |
When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models |
Weilan Wang et.al. |
2502.15443 |
null |
2025-02-21 |
On the Effectiveness of Large Language Models in Writing Alloy Formulas |
Yang Hong et.al. |
2502.15441 |
null |
2025-02-21 |
Fed-SB: A Silver Bullet for Extreme Communication Efficiency and Performance in (Private) Federated LoRA Fine-Tuning |
Raghav Singhal et.al. |
2502.15436 |
link |
2025-02-21 |
Single-pass Detection of Jailbreaking Input in Large Language Models |
Leyla Naz Candogan et.al. |
2502.15435 |
null |
2025-02-21 |
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear Interpolation |
Yue Zhou et.al. |
2502.15434 |
link |
2025-02-21 |
Pub-Guard-LLM: Detecting Fraudulent Biomedical Articles with Reliable Explanations |
Lihu Chen et.al. |
2502.15429 |
link |
2025-02-21 |
Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs |
Giulio Zizzo et.al. |
2502.15427 |
link |
2025-02-21 |
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking |
Yi-Ling Chung et.al. |
2502.15419 |
link |
2025-02-21 |
MHQA: A Diverse, Knowledge Intensive Mental Health Question Answering Challenge for Language Models |
Suraj Racha et.al. |
2502.15418 |
link |
2025-02-21 |
HiFi-KPI: A Dataset for Hierarchical KPI Extraction from Earnings Filings |
Rasmus Aavang et.al. |
2502.15411 |
link |
2025-02-21 |
Problem-Solving Logic Guided Curriculum In-Context Learning for LLMs Complex Reasoning |
Xuetao Ma et.al. |
2502.15401 |
null |
2025-02-21 |
Beyond Tools: Understanding How Heavy Users Integrate LLMs into Everyday Tasks and Decision-Making |
Eunhye Kim et.al. |
2502.15395 |
null |
2025-02-21 |
Chitrarth: Bridging Vision and Language for a Billion People |
Shaharukh Khan et.al. |
2502.15392 |
null |
2025-02-21 |
MOVE: A Mixture-of-Vision-Encoders Approach for Domain-Focused Vision-Language Processing |
Matvey Skripkin et.al. |
2502.15381 |
null |
2025-02-21 |
Weakly Supervised Video Scene Graph Generation via Natural Language Supervision |
Kibum Kim et.al. |
2502.15370 |
link |
2025-02-21 |
Identifying Features that Shape Perceived Consciousness in Large Language Model-based AI: A Quantitative Study of Human Responses |
Kang Bongsu et.al. |
2502.15365 |
null |
2025-02-21 |
Evaluating Social Biases in LLM Reasoning |
Xuyang Wu et.al. |
2502.15361 |
null |
2025-02-21 |
ARS: Automatic Routing Solver with Large Language Models |
Kai Li et.al. |
2502.15359 |
link |
2025-02-21 |
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms |
Feiyang Chen et.al. |
2502.15349 |
link |
2025-02-21 |
Constructing a Norm for Children’s Scientific Drawing: Distribution Features Based on Semantic Similarity of Large Language Models |
Yi Zhang et.al. |
2502.15348 |
null |
2025-02-21 |
Efficiently Solving Discounted MDPs with Predictions on Transition Matrices |
Lixing Lyu et.al. |
2502.15345 |
null |
2025-02-21 |
Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions |
Shoubin Chen et.al. |
2502.15336 |
null |
2025-02-21 |
Stepwise Informativeness Search for Improving LLM Reasoning |
Siyuan Wang et.al. |
2502.15335 |
null |
2025-02-21 |
Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment |
Pedram Zaree et.al. |
2502.15334 |
null |
2025-02-21 |
Detecting Future-related Contexts of Entity Mentions |
Puneet Prashar et.al. |
2502.15332 |
null |
2025-02-21 |
DynamicGSG: Dynamic 3D Gaussian Scene Graphs for Environment Adaptation |
Luzhou Ge et.al. |
2502.15309 |
link |
2025-02-21 |
SVDq: 1.25-bit and 410x Key Cache Compression for LLM Attention |
Hong Yankun et.al. |
2502.15304 |
null |
2025-02-21 |
Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference |
Yaohua Tang et.al. |
2502.15294 |
null |
2025-02-21 |
Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models |
Jianming Chang et.al. |
2502.15292 |
null |
2025-02-21 |
BundleFlow: Deep Menus for Combinatorial Auctions by Diffusion-Based Optimization |
Tonghan Wang et.al. |
2502.15283 |
null |
2025-02-21 |
A Training-free LLM-based Approach to General Chinese Character Error Correction |
Houquan Zhou et.al. |
2502.15266 |
link |
2025-02-21 |
Retrieval-Augmented Speech Recognition Approach for Domain Challenges |
Peng Shen et.al. |
2502.15264 |
null |
2025-02-21 |
LightMamba: Efficient Mamba Acceleration on FPGA with Quantization and Hardware Co-design |
Renjie Wei et.al. |
2502.15260 |
null |
2025-02-21 |
An approach for API synthesis using large language models |
Hua Zhong et.al. |
2502.15246 |
null |
2025-02-21 |
Comparative Analysis of Large Language Models for Context-Aware Code Completion using SAFIM Framework |
Hang Zhang et.al. |
2502.15243 |
null |
2025-02-21 |
From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants |
Manisha Mukherjee et.al. |
2502.15237 |
null |
2025-02-21 |
A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation |
Shilong Hou et.al. |
2502.15233 |
link |
2025-02-21 |
User Experience with LLM-powered Conversational Recommendation Systems: A Case of Music Recommendation |
Sojeong Yun et.al. |
2502.15229 |
null |
2025-02-21 |
Understand User Opinions of Large Language Models via LLM-Powered In-the-Moment User Experience Interviews |
Mengqiao Liu et.al. |
2502.15226 |
null |
2025-02-21 |
Auto-Bench: An Automated Benchmark for Scientific Discovery in LLMs |
Tingting Chen et.al. |
2502.15224 |
null |
2025-02-21 |
FormalSpecCpp: A Dataset of C++ Formal Specifications created using LLMs |
Madhurima Chakraborty et.al. |
2502.15217 |
link |
2025-02-21 |
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning |
Sheila Schoepp et.al. |
2502.15214 |
null |
2025-02-21 |
Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing |
Zhilin Wang et.al. |
2502.15208 |
null |
2025-02-21 |
Lung-DDPM: Semantic Layout-guided Diffusion Models for Thoracic CT Image Synthesis |
Yifan Jiang et.al. |
2502.15204 |
link |
2025-02-21 |
TETRIS: Optimal Draft Token Selection for Batch Speculative Decoding |
Zhaoxuan Wu et.al. |
2502.15197 |
null |
2025-02-21 |
LEDD: Large Language Model-Empowered Data Discovery in Data Lakes |
Qi An et.al. |
2502.15182 |
null |
2025-02-21 |
Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders |
Weiqiao Shan et.al. |
2502.15178 |
null |
2025-02-21 |
Methods and Trends in Detecting Generated Images: A Comprehensive Review |
Arpan Mahara et.al. |
2502.15176 |
null |
2025-02-21 |
M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment |
Chuan Cui et.al. |
2502.15167 |
null |
2025-02-21 |
Extreme Speech Classification in the Era of LLMs: Exploring Open-Source and Proprietary Models |
Sarthak Mahajan et.al. |
2502.15155 |
null |
2025-02-21 |
Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems |
Tianjie Ju et.al. |
2502.15153 |
link |
2025-02-21 |
Do LLMs Make Mistakes Like Students? Exploring Natural Alignment between Language Models and Human Error Patterns |
Naiming Liu et.al. |
2502.15140 |
null |
2025-02-21 |
Chain-of-Rank: Enhancing Large Language Models for Domain-Specific RAG in Edge Device |
Juntae Lee et.al. |
2502.15134 |
null |
2025-02-21 |
TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba |
Xiuwei Chen et.al. |
2502.15130 |
null |
2025-02-20 |
LUME: LLM Unlearning with Multitask Evaluations |
Anil Ramakrishna et.al. |
2502.15097 |
null |
2025-02-20 |
Detecting Student Intent for Chat-Based Intelligent Tutoring Systems |
Ella Cutler et.al. |
2502.15096 |
null |
2025-02-20 |
Judging It, Washing It: Scoring and Greenwashing Corporate Climate Disclosures using Large Language Models |
Marianne Chuang et.al. |
2502.15094 |
null |
2025-02-20 |
Optimizing Singular Spectrum for Large Language Model Compression |
Dengjie Li et.al. |
2502.15092 |
null |
2025-02-20 |
Analyze the Neurons, not the Embeddings: Understanding When and Where LLM Representations Align with Humans |
Masha Fedzechkina et.al. |
2502.15090 |
null |
2025-02-20 |
Is Safety Standard Same for Everyone? User-Specific Safety Evaluation of Large Language Models |
Yeonjun In et.al. |
2502.15086 |
link |
2025-02-20 |
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention |
Shang Yang et.al. |
2502.14866 |
link |
2025-02-20 |
Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning |
Shuyue Stella Li et.al. |
2502.14860 |
link |
2025-02-20 |
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling |
Weilin Zhao et.al. |
2502.14856 |
null |
2025-02-20 |
Prompt-to-Leaderboard |
Evan Frick et.al. |
2502.14855 |
link |
2025-02-20 |
GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks |
Jianwen Luo et.al. |
2502.14848 |
link |
2025-02-20 |
Red-Teaming LLM Multi-Agent Systems via Communication Attacks |
Pengfei He et.al. |
2502.14847 |
null |
2025-02-20 |
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation |
Yue Yang et.al. |
2502.14846 |
null |
2025-02-20 |
Revealing and Mitigating Over-Attention in Knowledge Editing |
Pinzheng Wang et.al. |
2502.14838 |
link |
2025-02-20 |
Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs |
Danni Liu et.al. |
2502.14830 |
link |
2025-02-20 |
A Survey of Model Architectures in Information Retrieval |
Zhichao Xu et.al. |
2502.14822 |
null |
2025-02-20 |
eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables |
Luis Antonio Gutiérrez Guanilo et.al. |
2502.14820 |
null |
2025-02-20 |
Dynamic Low-Rank Sparse Adaptation for Large Language Models |
Weizhong Huang et.al. |
2502.14816 |
link |
2025-02-20 |
FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis |
Fadillah Maani et.al. |
2502.14807 |
link |
2025-02-20 |
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models |
Bernal Jiménez Gutiérrez et.al. |
2502.14802 |
link |
2025-02-20 |
A Multi-Agent Perspective on Modern Information Retrieval |
Haya Nachimovsky et.al. |
2502.14796 |
null |
2025-02-20 |
Rapid Word Learning Through Meta In-Context Learning |
Wentao Wang et.al. |
2502.14791 |
null |
2025-02-20 |
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models |
Hongji Yang et.al. |
2502.14779 |
null |
2025-02-20 |
SurveyX: Academic Survey Automation via Large Language Models |
Xun Liang et.al. |
2502.14776 |
null |
2025-02-20 |
Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective |
Weizhong Huang et.al. |
2502.14770 |
null |
2025-02-20 |
Tree-of-Debate: Multi-Persona Debate Trees Elicit Critical Thinking for Scientific Comparative Analysis |
Priyanka Kargupta et.al. |
2502.14767 |
link |
2025-02-20 |
EquivaMap: Leveraging LLMs for Automatic Equivalence Checking of Optimization Formulations |
Haotian Zhai et.al. |
2502.14760 |
link |
2025-02-20 |
On the Influence of Context Size and Model Choice in Retrieval-Augmented Generation Systems |
Juraj Vladika et.al. |
2502.14759 |
link |
2025-02-20 |
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators |
Jianling Li et.al. |
2502.14752 |
link |
2025-02-20 |
Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs |
Zongxia Li et.al. |
2502.14748 |
null |
2025-02-20 |
Multi-Agent Coordination across Diverse Applications: A Survey |
Lijun Sun et.al. |
2502.14743 |
null |
2025-02-20 |
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines |
M-A-P Team et.al. |
2502.14739 |
null |
2025-02-20 |
EAGER-LLM: Enhancing Large Language Models as Recommenders through Exogenous Behavior-Semantic Integration |
Minjie Hong et.al. |
2502.14735 |
null |
2025-02-20 |
WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models |
Yifu Chen et.al. |
2502.14727 |
null |
2025-02-20 |
I-MCTS: Enhancing Agentic AutoML via Introspective Monte Carlo Tree Search |
Zujie Liang et.al. |
2502.14693 |
null |
2025-02-20 |
Bridging the Gap: Transforming Natural Language Questions into SQL Queries via Abstract Query Pattern and Contextual Schema Markup |
Yonghui Kong et.al. |
2502.14682 |
null |
2025-02-20 |
How to Get Your LLM to Generate Challenging Problems for Evaluation |
Arkil Patel et.al. |
2502.14678 |
link |
2025-02-20 |
Data-Constrained Synthesis of Training Data for De-Identification |
Thomas Vakili et.al. |
2502.14677 |
null |
2025-02-20 |
Explanations of Deep Language Models Explain Language Representations in the Brain |
Maryam Rahimi et.al. |
2502.14671 |
null |
2025-02-20 |
AlphaMaze: Enhancing Large Language Models’ Spatial Intelligence via GRPO |
Alan Dao et.al. |
2502.14669 |
link |
2025-02-20 |
Beyond the Surface: Uncovering Implicit Locations with LLMs for Personalized Local News |
Gali Katz et.al. |
2502.14660 |
null |
2025-02-20 |
Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs |
Yuchen Wu et.al. |
2502.14645 |
null |
2025-02-20 |
LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning |
Yansheng Mao et.al. |
2502.14644 |
null |
2025-02-20 |
Length-Controlled Margin-Based Preference Optimization without Reference Model |
Gengxu Li et.al. |
2502.14643 |
link |
2025-02-20 |
ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation |
Angxiao Yue et.al. |
2502.14637 |
link |
2025-02-20 |
CER: Confidence Enhanced Reasoning in LLMs |
Ali Razghandi et.al. |
2502.14634 |
link |
2025-02-20 |
Augmenting Coaching with GenAI: Insights into Use, Effectiveness, and Future Potential |
Jennifer Haase et.al. |
2502.14632 |
null |
2025-02-20 |
Synergistic Fusion of Multi-Source Knowledge via Evidence Theory for High-Entropy Alloy Discovery |
Minh-Quyet Ha et.al. |
2502.14631 |
null |
2025-02-20 |
PEARL: Towards Permutation-Resilient LLMs |
Liang Chen et.al. |
2502.14628 |
link |
2025-02-20 |
Reward Models Identify Consistency, Not Causality |
Yuhui Xu et.al. |
2502.14619 |
null |
2025-02-20 |
Serving Models, Fast and Slow:Optimizing Heterogeneous LLM Inferencing Workloads at Scale |
Shashwat Jaiswal et.al. |
2502.14617 |
null |
2025-02-20 |
FIND: Fine-grained Information Density Guided Adaptive Retrieval-Augmented Generation for Disease Diagnosis |
Mingyi Jia et.al. |
2502.14614 |
null |
2025-02-20 |
Behavioral Analysis of Information Salience in Large Language Models |
Jan Trienes et.al. |
2502.14613 |
link |
2025-02-20 |
“Don’t Forget the Teachers”: Towards an Educator-Centered Understanding of Harms from Large Language Models in Education |
Emma Harvey et.al. |
2502.14592 |
null |
2025-02-20 |
Vision Foundation Models in Medical Image Analysis: Advances and Challenges |
Pengchen Liang et.al. |
2502.14584 |
null |
2025-02-20 |
A Theory for Conditional Generative Modeling on Multiple Data Sources |
Rongzhen Wang et.al. |
2502.14583 |
link |
2025-02-20 |
ReVISE: Learning to Refine at Test-Time via Intrinsic Self-Verification |
Hyunseok Lee et.al. |
2502.14565 |
null |
2025-02-20 |
Plan-over-Graph: Towards Parallelable LLM Agent Schedule |
Shiqi Zhang et.al. |
2502.14563 |
link |
2025-02-20 |
Can LLMs Predict Citation Intent? An Experimental Analysis of In-context Learning and Fine-tuning on Open LLMs |
Paris Koloveas et.al. |
2502.14561 |
link |
2025-02-20 |
Less is More: Improving LLM Alignment via Preference Data Selection |
Xun Deng et.al. |
2502.14560 |
null |
2025-02-20 |
Multiscale Byte Language Models – A Hierarchical Architecture for Causal Million-Length Sequence Modeling |
Eric Egli et.al. |
2502.14553 |
link |
2025-02-20 |
Position: Graph Learning Will Lose Relevance Due To Poor Benchmarks |
Maya Bechler-Speicher et.al. |
2502.14546 |
null |
2025-02-20 |
LLM-based User Profile Management for Recommender System |
Seunghwan Bang et.al. |
2502.14541 |
null |
2025-02-20 |
LoRA-GGPO: Mitigating Double Descent in LoRA Fine-Tuning via Gradient-Guided Perturbation Optimization |
Yupeng Chang et.al. |
2502.14538 |
link |
2025-02-20 |
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models |
Zhenhong Zhou et.al. |
2502.14529 |
link |
2025-02-20 |
Generative adversarial networks vs large language models: a comparative study on synthetic tabular data generation |
Austin A. Barr et.al. |
2502.14523 |
link |
2025-02-20 |
Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases |
Rena Gao et.al. |
2502.14507 |
link |
2025-02-20 |
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? |
Sergey Pletenev et.al. |
2502.14502 |
link |
2025-02-20 |
MLGym: A New Framework and Benchmark for Advancing AI Research Agents |
Deepak Nathani et.al. |
2502.14499 |
null |
2025-02-20 |
StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following |
Jinnan Li et.al. |
2502.14494 |
link |
2025-02-20 |
How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation |
Zhuohang Long et.al. |
2502.14486 |
null |
2025-02-20 |
NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models |
Chenlu Guo et.al. |
2502.14482 |
link |
2025-02-20 |
Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression |
Haoyu Wang et.al. |
2502.14477 |
null |
2025-02-20 |
Argument-Based Comparative Question Answering Evaluation Benchmark |
Irina Nikishina et.al. |
2502.14476 |
null |
2025-02-20 |
Enhancing Smart Environments with Context-Aware Chatbots using Large Language Models |
Aurora Polo-Rodríguez et.al. |
2502.14469 |
null |
2025-02-20 |
Narrative-Driven Travel Planning: Geoculturally-Grounded Script Generation with Evolutionary Itinerary Optimization |
Ran Ding et.al. |
2502.14456 |
link |
2025-02-20 |
Optimal word order for non-causal text generation with Large Language Models: the Spanish case |
Andrea Busto-Castiñeira et.al. |
2502.14451 |
null |
2025-02-20 |
LLM4FaaS: No-Code Application Development using LLMs and FaaS |
Minghe Wang et.al. |
2502.14450 |
null |
2025-02-20 |
PredictaBoard: Benchmarking LLM Score Predictability |
Lorenzo Pacchiardi et.al. |
2502.14445 |
link |
2025-02-20 |
Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models |
Artem Vazhentsev et.al. |
2502.14427 |
link |
2025-02-20 |
A Survey on Data Contamination for Large Language Models |
Yuxing Cheng et.al. |
2502.14425 |
link |
2025-02-20 |
ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model |
Zhongyi Zhou et.al. |
2502.14420 |
null |
2025-02-20 |
Towards Efficient Automatic Self-Pruning of Large Language Models |
Weizhong Huang et.al. |
2502.14413 |
null |
2025-02-20 |
Evaluating Precise Geolocation Inference Capabilities of Vision Language Models |
Neel Jay et.al. |
2502.14412 |
link |
2025-02-20 |
Unstructured Evidence Attribution for Long Context Query Focused Summarization |
Dustin Wright et.al. |
2502.14409 |
link |
2025-02-20 |
HPS: Hard Preference Sampling for Human Preference Alignment |
Xiandong Zou et.al. |
2502.14400 |
null |
2025-02-20 |
Enhancing Portuguese Variety Identification with Cross-Domain Approaches |
Hugo Sousa et.al. |
2502.14394 |
null |
2025-02-20 |
Leveraging Small LLMs for Argument Mining in Education: Argument Component Identification, Classification, and Assessment |
Lucile Favero et.al. |
2502.14389 |
null |
2025-02-20 |
S*: Test Time Scaling for Code Generation |
Dacheng Li et.al. |
2502.14382 |
link |
2025-02-20 |
PPO-MI: Efficient Black-Box Model Inversion via Proximal Policy Optimization |
Xinpeng Shou et.al. |
2502.14370 |
null |
2025-02-20 |
Entropy-UID: A Method for Optimizing Information Density |
Xinpeng Shou et.al. |
2502.14366 |
null |
2025-02-20 |
Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning |
Jiachen Zhu et.al. |
2502.14361 |
null |
2025-02-20 |
SR-LLM: Rethinking the Structured Representation in Large Language Model |
Jiahuan Zhang et.al. |
2502.14352 |
null |
2025-02-20 |
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images |
Yichi Zhang et.al. |
2502.14351 |
link |
2025-02-20 |
FlowAgent: Achieving Compliance and Flexibility for Workflow Agents |
Yuchen Shi et.al. |
2502.14345 |
link |
2025-02-20 |
Earlier Tokens Contribute More: Learning Direct Preference Optimization From Temporal Decay Perspective |
Ruichen Shao et.al. |
2502.14340 |
null |
2025-02-20 |
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics |
Ting-Ruen Wei et.al. |
2502.14333 |
null |
2025-02-20 |
SolSearch: An LLM-Driven Framework for Efficient SAT-Solving Code Generation |
Junjie Sheng et.al. |
2502.14328 |
null |
2025-02-20 |
ChemHTS: Hierarchical Tool Stacking for Enhancing Chemical Agents |
Zhucong Li et.al. |
2502.14327 |
link |
2025-02-20 |
Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems |
Bingyu Yan et.al. |
2502.14321 |
null |
2025-02-20 |
Line Goes Up? Inherent Limitations of Benchmarks for Evaluating Large Language Models |
James Fodor et.al. |
2502.14318 |
null |
2025-02-20 |
ParallelComp: Parallel Long-Context Compressor for Length Extrapolation |
Jing Xiong et.al. |
2502.14317 |
null |
2025-02-20 |
Unveiling Cultural Blind Spots: Analyzing the Limitations of mLLMs in Procedural Text Comprehension |
Amir Hossein Yari et.al. |
2502.14315 |
null |
2025-02-20 |
Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications |
Kayhan Behdin et.al. |
2502.14305 |
null |
2025-02-20 |
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models |
Shrey Pandit et.al. |
2502.14302 |
null |
2025-02-20 |
SEA-HELM: Southeast Asian Holistic Evaluation of Language Models |
Yosephine Susanto et.al. |
2502.14301 |
null |
2025-02-19 |
Where’s the Bug? Attention Probing for Scalable Fault Localization |
Adam Stein et.al. |
2502.13966 |
null |
2025-02-19 |
Autellix: An Efficient Serving Engine for LLM Agents as General Programs |
Michael Luo et.al. |
2502.13965 |
null |
2025-02-19 |
MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads |
Weihao Liu et.al. |
2502.13963 |
link |
2025-02-19 |
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering |
William Jurayj et.al. |
2502.13962 |
null |
2025-02-19 |
LIDDIA: Language-based Intelligent Drug Discovery Agent |
Reza Averly et.al. |
2502.13959 |
null |
2025-02-19 |
Neurosymbolic artificial intelligence via large language models and coherence-driven inference |
Steve Huntsman et.al. |
2502.13953 |
null |
2025-02-19 |
Why Safeguarded Ships Run Aground? Aligned Large Language Models’ Safety Mechanisms Tend to Be Anchored in The Template Region |
Chak Tou Leong et.al. |
2502.13946 |
null |
2025-02-19 |
Image compositing is all you need for data augmentation |
Ang Jia Ning Shermaine et.al. |
2502.13936 |
null |
2025-02-19 |
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization |
Guanzheng Chen et.al. |
2502.13922 |
link |
2025-02-19 |
Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis |
Jiahao Gai et.al. |
2502.13921 |
null |
2025-02-19 |
Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health |
Xingbo Wang et.al. |
2502.13920 |
null |
2025-02-19 |
How Do LLMs Perform Two-Hop Reasoning in Context? |
Tianyu Guo et.al. |
2502.13913 |
null |
2025-02-19 |
Lost in Sequence: Do Large Language Models Understand Sequential Recommendation? |
Sein Kim et.al. |
2502.13909 |
link |
2025-02-19 |
Judging the Judges: A Collection of LLM-Generated Relevance Judgements |
Hossein A. Rahmani et.al. |
2502.13908 |
link |
2025-02-19 |
DataSciBench: An LLM Agent Benchmark for Data Science |
Dan Zhang et.al. |
2502.13897 |
link |
2025-02-19 |
NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants |
Yiran Qin et.al. |
2502.13894 |
null |
2025-02-19 |
Refining embeddings with fill-tuning: data-efficient generalised performance improvements for materials foundation models |
Matthew P. Wilson et.al. |
2502.13886 |
link |
2025-02-19 |
SPEX: Scaling Feature Interaction Explanations for LLMs |
Justin Singh Kang et.al. |
2502.13870 |
link |
2025-02-19 |
MagicGeo: Training-Free Text-Guided Geometric Diagram Generation |
Junxiao Wang et.al. |
2502.13855 |
null |
2025-02-19 |
Enhancing LLM-Based Recommendations Through Personalized Reasoning |
Jiahao Liu et.al. |
2502.13845 |
null |
2025-02-19 |
Enhancing Cross-Domain Recommendations with Memory-Optimized LLM-Based User Agents |
Jiahao Liu et.al. |
2502.13843 |
null |
2025-02-19 |
Inner Thinking Transformer: Leveraging Dynamic Depth Scaling to Foster Adaptive Internal Thinking |
Yilong Chen et.al. |
2502.13842 |
null |
2025-02-19 |
Quantifying Memorization and Retriever Performance in Retrieval-Augmented Vision-Language Models |
Peter Carragher et.al. |
2502.13836 |
null |
2025-02-19 |
Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning |
Zenan Li et.al. |
2502.13834 |
link |
2025-02-19 |
ArtMentor: AI-Assisted Evaluation of Artworks to Explore Multimodal Large Language Models Capabilities |
Chanjin Zheng et.al. |
2502.13832 |
link |
2025-02-19 |
LESA: Learnable LLM Layer Scaling-Up |
Yifei Yang et.al. |
2502.13794 |
link |
2025-02-19 |
From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions |
Nathanaël Carraz Rakotonirina et.al. |
2502.13791 |
link |
2025-02-19 |
From Correctness to Comprehension: AI Agents for Personalized Error Diagnosis in Education |
Yi-Fan Zhang et.al. |
2502.13789 |
null |
2025-02-19 |
Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics |
Matthew Wood et.al. |
2502.13785 |
link |
2025-02-19 |
Generative Large Recommendation Models: Emerging Trends in LLMs for Recommendation |
Hao Wang et.al. |
2502.13783 |
null |
2025-02-19 |
Translation in the Hands of Many:Centering Lay Users in Machine Translation Interactions |
Beatrice Savoldi et.al. |
2502.13780 |
null |
2025-02-19 |
VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare |
Anudeex Shetty et.al. |
2502.13775 |
null |
2025-02-19 |
AI Software Engineer: Programming with Trust |
Abhik Roychoudhury et.al. |
2502.13767 |
null |
2025-02-19 |
SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning |
Renxi Wang et.al. |
2502.13753 |
link |
2025-02-19 |
Reverse Markov Learning: Multi-Step Generative Models for Complex Distributions |
Xinwei Shen et.al. |
2502.13747 |
null |
2025-02-19 |
Enhancing Input-Label Mapping in In-Context Learning with Contrastive Decoding |
Keqin Peng et.al. |
2502.13738 |
null |
2025-02-19 |
CARE: Confidence-Aware Regression Estimation of building density fine-tuning EO Foundation Models |
Nikolaos Dionelis et.al. |
2502.13734 |
null |
2025-02-19 |
Adapting Large Language Models for Time Series Modeling via a Novel Parameter-efficient Adaptation Method |
Juyuan Zhang et.al. |
2502.13725 |
null |
2025-02-19 |
Direct Value Optimization: Improving Chain-of-Thought Reasoning in LLMs with Refined Values |
Hongbo Zhang et.al. |
2502.13723 |
null |
2025-02-19 |
TALKPLAY: Multimodal Music Recommendation with Large Language Models |
Seungheon Doh et.al. |
2502.13713 |
null |
2025-02-19 |
Is This Collection Worth My LLM’s Time? Automatically Measuring Information Potential in Text Corpora |
Tristan Karch et.al. |
2502.13691 |
null |
2025-02-19 |
An LLM-based Agent for Reliable Docker Environment Configuration |
Ruida Hu et.al. |
2502.13681 |
link |
2025-02-19 |
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation |
Song Duong et.al. |
2502.13674 |
null |
2025-02-19 |
Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models |
Liyang He et.al. |
2502.13656 |
link |
2025-02-19 |
C2T: A Classifier-Based Tree Construction Method in Speculative Decoding |
Feiye Huo et.al. |
2502.13652 |
null |
2025-02-19 |
Reliability Across Parametric and External Knowledge: Understanding Knowledge Handling in LLMs |
Youna Kim et.al. |
2502.13648 |
null |
2025-02-19 |
D.Va: Validate Your Demonstration First Before You Use It |
Qi Zhang et.al. |
2502.13646 |
null |
2025-02-19 |
Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts |
Maiya Goloburda et.al. |
2502.13640 |
null |
2025-02-19 |
Concept Layers: Enhancing Interpretability and Intervenability via LLM Conceptualization |
Or Raphael Bidusa et.al. |
2502.13632 |
null |
2025-02-19 |
AI-Empowered Catalyst Discovery: A Survey from Classical Machine Learning Approaches to Large Language Models |
Yuanyuan Xu et.al. |
2502.13626 |
null |
2025-02-19 |
REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models |
DongGeon Lee et.al. |
2502.13622 |
null |
2025-02-19 |
Complex Ontology Matching with Large Language Model Embeddings |
Guilherme Sousa et.al. |
2502.13619 |
null |
2025-02-19 |
LaVCa: LLM-assisted Visual Cortex Captioning |
Takuya Matsuyama et.al. |
2502.13606 |
null |
2025-02-19 |
BeamLoRA: Beam-Constraint Low-Rank Adaptation |
Naibin Gu et.al. |
2502.13604 |
null |
2025-02-19 |
MMTEB: Massive Multilingual Text Embedding Benchmark |
Kenneth Enevoldsen et.al. |
2502.13595 |
null |
2025-02-19 |
Don’t Stop the Multi-Party! On Generating Synthetic Multi-Party Conversations with Constraints |
Nicolò Penzo et.al. |
2502.13592 |
link |
2025-02-19 |
Unraveling the Localized Latents: Learning Stratified Manifold Structures in LLM Embedding Space with Sparse Mixture-of-Experts |
Xin Li et.al. |
2502.13577 |
null |
2025-02-19 |
LSR-Adapt: Ultra-Efficient Parameter Tuning with Matrix Low Separation Rank Kernel Adaptation |
Xin Li et.al. |
2502.13568 |
null |
2025-02-19 |
Extracting Social Connections from Finnish Karelian Refugee Interviews Using LLMs |
Joonatan Laato et.al. |
2502.13566 |
null |
2025-02-19 |
PRIV-QA: Privacy-Preserving Question Answering for Cloud Large Language Models |
Guangwei Li et.al. |
2502.13564 |
link |
2025-02-19 |
Are Large Language Models In-Context Graph Learners? |
Jintang Li et.al. |
2502.13562 |
null |
2025-02-19 |
Democratizing Large Language Model-Based Graph Data Augmentation via Latent Knowledge Graphs |
Yushi Feng et.al. |
2502.13555 |
link |
2025-02-19 |
STaR-SQL: Self-Taught Reasoner for Text-to-SQL |
Mingqian He et.al. |
2502.13550 |
null |
2025-02-19 |
Detecting Linguistic Bias in Government Documents Using Large language Models |
Milena de Swart et.al. |
2502.13548 |
null |
2025-02-19 |
From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN |
Peiwen Yuan et.al. |
2502.13544 |
null |
2025-02-19 |
Activation-aware Probe-Query: Effective Key-Value Retrieval for Long-Context LLMs Inference |
Qingfa Xiao et.al. |
2502.13542 |
null |
2025-02-19 |
Bursting Filter Bubble: Enhancing Serendipity Recommendations with Aligned Large Language Models |
Yunjia Xi et.al. |
2502.13539 |
null |
2025-02-19 |
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models |
Jun Zhang et.al. |
2502.13533 |
link |
2025-02-19 |
Exploiting Prefix-Tree in Structured Output Interfaces for Enhancing Jailbreak Attacking |
Yanzeng Li et.al. |
2502.13527 |
link |
2025-02-19 |
SPPD: Self-training with Process Preference Learning Using Dynamic Value Margin |
Hao Yi et.al. |
2502.13516 |
null |
2025-02-19 |
Unlocking Multimodal Integration in EHRs: A Prompt Learning Framework for Language and Time Series Fusion |
Shuai Niu et.al. |
2502.13509 |
null |
2025-02-19 |
Reproducing NevIR: Negation in Neural Information Retrieval |
Coen van Elsen et.al. |
2502.13506 |
link |
2025-02-19 |
PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference |
Burc Gokden et.al. |
2502.13502 |
link |
2025-02-19 |
Towards Geo-Culturally Grounded LLM Generations |
Piyawat Lertvittayakumjorn et.al. |
2502.13497 |
null |
2025-02-19 |
What are Models Thinking about? Understanding Large Language Model Hallucinations “Psychology” through Model Inner State Analysis |
Peiran Wang et.al. |
2502.13490 |
null |
2025-02-19 |
LLM4Tag: Automatic Tagging System for Information Retrieval via Large Language Models |
Ruiming Tang et.al. |
2502.13481 |
null |
2025-02-19 |
Integration of Agentic AI with 6G Networks for Mission-Critical Applications: Use-case and Challenges |
Sunder Ali Khowaja et.al. |
2502.13476 |
null |
2025-02-19 |
LLM should think and action as a human |
Haun Leung et.al. |
2502.13475 |
null |
2025-02-19 |
Towards Lightweight, Adaptive and Attribute-Aware Multi-Aspect Controllable Text Generation with Large Language Models |
Chenyu Zhu et.al. |
2502.13474 |
null |
2025-02-19 |
ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails |
Xiaofei Wen et.al. |
2502.13458 |
link |
2025-02-19 |
Interleaved Gibbs Diffusion for Constrained Generation |
Gautham Govind Anil et.al. |
2502.13450 |
null |
2025-02-19 |
Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning |
Yang Yan et.al. |
2502.13447 |
null |
2025-02-19 |
TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation |
Jialin Ouyang et.al. |
2502.13442 |
link |
2025-02-19 |
The Self-Improvement Paradox: Can Language Models Bootstrap Reasoning Capabilities without External Scaffolding? |
Yutao Sun et.al. |
2502.13441 |
null |
2025-02-19 |
MATS: An Audio Language Model under Text-only Supervision |
Wen Wang et.al. |
2502.13433 |
null |
2025-02-19 |
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning |
Hao Ma et.al. |
2502.13430 |
null |
2025-02-19 |
MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering |
Guanming Xiong et.al. |
2502.13428 |
null |
2025-02-19 |
TabSD: Large Free-Form Table Question Answering with SQL-Based Table Decomposition |
Yuxiang Wang et.al. |
2502.13422 |
null |
2025-02-19 |
RLTHF: Targeted Human Feedback for LLM Alignment |
Yifei Xu et.al. |
2502.13417 |
null |
2025-02-19 |
Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning |
Ningke Li et.al. |
2502.13416 |
null |
2025-02-19 |
Explore-Construct-Filter: An Automated Framework for Rich and Reliable API Knowledge Graph Construction |
Yanbang Sun et.al. |
2502.13412 |
null |
2025-02-19 |
Generative Predictive Control: Flow Matching Policies for Dynamic and Difficult-to-Demonstrate Tasks |
Vince Kurtz et.al. |
2502.13406 |
null |
2025-02-19 |
$\mathtt{GeLLM^3O}$ : Generalizing Large Language Models for Multi-property Molecule Optimization |
Vishal Dey et.al. |
2502.13398 |
link |
2025-02-19 |
Prompting a Weighting Mechanism into LLM-as-a-Judge in Two-Step: A Case Study |
Wenwen Xie et.al. |
2502.13396 |
null |
2025-02-19 |
Flow-based generative models as iterative algorithms in probability space |
Yao Xie et.al. |
2502.13394 |
null |
2025-02-19 |
Reasoning with Reinforced Functional Token Tuning |
Kongcheng Zhang et.al. |
2502.13389 |
link |
2025-02-19 |
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences |
Xiaojie Xu et.al. |
2502.13388 |
null |
2025-02-19 |
MM-Verify: Enhancing Multimodal Reasoning with Chain-of-Thought Verification |
Linzhuang Sun et.al. |
2502.13383 |
link |
2025-02-19 |
AutoTEE: Automated Migration and Protection of Programs in Trusted Execution Environments |
Ruidong Han et.al. |
2502.13379 |
link |
2025-02-19 |
Task-agnostic Prompt Compression with Context-aware Sentence Embedding and Reward-guided Task Descriptor |
Barys Liskavets et.al. |
2502.13374 |
null |
2025-02-18 |
Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization |
Shuo Xing et.al. |
2502.13146 |
link |
2025-02-18 |
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation |
Bencheng Liao et.al. |
2502.13145 |
link |
2025-02-18 |
Pre-training Auto-regressive Robotic Models with 4D Representations |
Dantong Niu et.al. |
2502.13142 |
null |
2025-02-18 |
UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models |
Huawei Lin et.al. |
2502.13141 |
link |
2025-02-18 |
AIDE: AI-Driven Exploration in the Space of Code |
Zhengyao Jiang et.al. |
2502.13138 |
link |
2025-02-18 |
Theorem Prover as a Judge for Synthetic Data Generation |
Joshua Ong Jun Leang et.al. |
2502.13137 |
null |
2025-02-18 |
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions |
Aggelina Chatziagapi et.al. |
2502.13133 |
null |
2025-02-18 |
Learning to Defer for Causal Discovery with Imperfect Experts |
Oscar Clivio et.al. |
2502.13132 |
null |
2025-02-18 |
Rethinking Diverse Human Preference Learning through Principal Component Analysis |
Feng Luo et.al. |
2502.13131 |
null |
2025-02-18 |
Magma: A Foundation Model for Multimodal AI Agents |
Jianwei Yang et.al. |
2502.13130 |
link |
2025-02-18 |
Is Noise Conditioning Necessary for Denoising Generative Models? |
Qiao Sun et.al. |
2502.13129 |
null |
2025-02-18 |
Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning |
Jingyang Lin et.al. |
2502.13127 |
null |
2025-02-18 |
RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises |
Zenan Zhai et.al. |
2502.13125 |
link |
2025-02-18 |
Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context |
Marion Bartl et.al. |
2502.13120 |
null |
2025-02-18 |
STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models |
Narun Raman et.al. |
2502.13119 |
null |
2025-02-18 |
Performance Evaluation of Large Language Models in Statistical Programming |
Xinyi Song et.al. |
2502.13117 |
link |
2025-02-18 |
MatterChat: A Multi-Modal LLM for Material Science |
Yingheng Tang et.al. |
2502.13107 |
null |
2025-02-18 |
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation |
Mengkang Hu et.al. |
2502.13092 |
null |
2025-02-18 |
A Neural Difference-of-Entropies Estimator for Mutual Information |
Haoran Ni et.al. |
2502.13085 |
null |
2025-02-18 |
Personalized Image Generation with Deep Generative Models: A Decade Survey |
Yuxiang Wei et.al. |
2502.13081 |
link |
2025-02-18 |
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models |
Xianfu Cheng et.al. |
2502.13059 |
null |
2025-02-18 |
LAMD: Context-driven Android Malware Detection and Classification with LLMs |
Xingzhi Qian et.al. |
2502.13055 |
null |
2025-02-18 |
Do we still need Human Annotators? Prompting Large Language Models for Aspect Sentiment Quad Prediction |
Nils Constantin Hellwig et.al. |
2502.13044 |
null |
2025-02-18 |
HPSS: Heuristic Prompting Strategy Search for LLM Evaluators |
Bosi Wen et.al. |
2502.13031 |
null |
2025-02-18 |
A deep learning framework for efficient pathology image analysis |
Peter Neidlinger et.al. |
2502.13027 |
null |
2025-02-18 |
Agentic Deep Graph Reasoning Yields Self-Organizing Knowledge Networks |
Markus J. Buehler et.al. |
2502.13025 |
link |
2025-02-18 |
Oreo: A Plug-in Context Reconstructor to Enhance Retrieval-Augmented Generation |
Sha Li et.al. |
2502.13019 |
null |
2025-02-18 |
Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents |
Chaoran Chen et.al. |
2502.13012 |
null |
2025-02-18 |
Adaptive Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge |
Mohammad Reza Rezaei et.al. |
2502.13010 |
null |
2025-02-18 |
You need to MIMIC to get FAME: Solving Meeting Transcript Scarcity with a Multi-Agent Conversations |
Frederic Kirstein et.al. |
2502.13001 |
null |
2025-02-18 |
Personalized Top-k Set Queries Over Predicted Scores |
Sohrab Namazi Nia et.al. |
2502.12998 |
null |
2025-02-18 |
Beyond Profile: From Surface-Level Facts to Deep Persona Simulation in LLMs |
Zixiao Wang et.al. |
2502.12988 |
null |
2025-02-18 |
Towards Variational Flow Matching on General Geometries |
Olga Zaghen et.al. |
2502.12981 |
null |
2025-02-18 |
Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search |
Yifan Ji et.al. |
2502.12974 |
link |
2025-02-18 |
Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking |
Junda Zhu et.al. |
2502.12970 |
link |
2025-02-18 |
Trust Me, I’m Wrong: High-Certainty Hallucinations in LLMs |
Adi Simhi et.al. |
2502.12964 |
null |
2025-02-18 |
Infinite Retrieval: Attention Enhanced LLMs in Long-Context Processing |
Xiaoju Ye et.al. |
2502.12962 |
null |
2025-02-18 |
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger |
Wenjun Li et.al. |
2502.12961 |
null |
2025-02-18 |
Guaranteed Conditional Diffusion: 3D Block-based Models for Scientific Data Compression |
Jaemoon Lee et.al. |
2502.12951 |
null |
2025-02-18 |
Fake It Till You Make It: Using Synthetic Data and Domain Knowledge for Improved Text-Based Learning for LGE Detection |
Athira J Jacob et.al. |
2502.12948 |
null |
2025-02-18 |
Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models |
Gyeongman Kim et.al. |
2502.12947 |
null |
2025-02-18 |
LLMPopcorn: An Empirical Study of LLMs as Assistants for Popular Micro-video Generation |
Junchen Fu et.al. |
2502.12945 |
null |
2025-02-18 |
Performance of Zero-Shot Time Series Foundation Models on Cloud Data |
William Toner et.al. |
2502.12944 |
null |
2025-02-18 |
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options |
Lakshmi Nair et.al. |
2502.12929 |
link |
2025-02-18 |
Finedeep: Mitigating Sparse Activation in Dense LLMs via Multi-Layer Fine-Grained Experts |
Leiyu Pan et.al. |
2502.12928 |
null |
2025-02-18 |
SEFL: Harnessing Large Language Model Agents to Improve Educational Feedback Systems |
Mike Zhang et.al. |
2502.12927 |
link |
2025-02-18 |
Towards more Contextual Agents: An extractor-Generator Optimization Framework |
Mourad Aouini et.al. |
2502.12926 |
null |
2025-02-18 |
Keep what you need : extracting efficient subnetworks from large audio representation models |
David Genova et.al. |
2502.12925 |
link |
2025-02-18 |
Conditioning LLMs to Generate Code-Switched Text: A Methodology Grounded in Naturally Occurring Data |
Maite Heredia et.al. |
2502.12924 |
link |
2025-02-18 |
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation |
Rune Birkmose et.al. |
2502.12923 |
link |
2025-02-18 |
Q-STRUM Debate: Query-Driven Contrastive Summarization for Recommendation Comparison |
George-Kirollos Saad et.al. |
2502.12921 |
null |
2025-02-18 |
Lightweight Online Adaption for Time Series Foundation Model Forecasts |
Thomas L. Lee et.al. |
2502.12920 |
null |
2025-02-18 |
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning |
Sifan Zhou et.al. |
2502.12913 |
null |
2025-02-18 |
Probabilistic neural operators for functional uncertainty quantification |
Christopher Bülte et.al. |
2502.12902 |
link |
2025-02-18 |
Soundwave: Less is More for Speech-Text Alignment in LLMs |
Yuhao Zhang et.al. |
2502.12900 |
link |
2025-02-18 |
Multilingual European Language Models: Benchmarking Approaches and Challenges |
Fabio Barth et.al. |
2502.12895 |
null |
2025-02-18 |
CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image |
Kaixin Yao et.al. |
2502.12894 |
null |
2025-02-18 |
Are Multilingual Language Models an Off-ramp for Under-resourced Languages? Will we arrive at Digital Language Equality in Europe in 2030? |
Georg Rehm et.al. |
2502.12886 |
null |
2025-02-18 |
How desirable is alignment between LLMs and linguistically diverse human users? |
Pia Knoeferle et.al. |
2502.12884 |
null |
2025-02-18 |
Continuous Learning Conversational AI: A Personalized Agent Framework via A2C Reinforcement Learning |
Nandakishor M et.al. |
2502.12876 |
null |
2025-02-18 |
RobotIQ: Empowering Mobile Robots with Human-Level Planning for Real-World Execution |
Emmanuel K. Raptis et.al. |
2502.12862 |
link |
2025-02-18 |
PAFT: Prompt-Agnostic Fine-Tuning |
Chenxing Wei et.al. |
2502.12859 |
null |
2025-02-18 |
Rejected Dialects: Biases Against African American Language in Reward Models |
Joel Mire et.al. |
2502.12858 |
link |
2025-02-18 |
MeMo: Towards Language Models with Associative Memory Mechanisms |
Fabio Massimo Zanzotto et.al. |
2502.12851 |
null |
2025-02-18 |
MOLLM: Multi-Objective Large Language Model for Molecular Design – Optimizing with Experts |
Nian Ran et.al. |
2502.12845 |
null |
2025-02-18 |
Towards Adaptive Feedback with AI: Comparing the Feedback Quality of LLMs and Teachers on Experimentation Protocols |
Kathrin Seßler et.al. |
2502.12842 |
null |
2025-02-18 |
Towards Equitable AI: Detecting Bias in Using Large Language Models for Marketing |
Berk Yilmaz et.al. |
2502.12838 |
null |
2025-02-18 |
An LLM-Powered Agent for Physiological Data Analysis: A Case Study on PPG-based Heart Rate Estimation |
Mohammad Feli et.al. |
2502.12836 |
null |
2025-02-18 |
KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan |
Mukhammed Togmanov et.al. |
2502.12829 |
null |
2025-02-18 |
Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models |
Rubing Lu et.al. |
2502.12825 |
null |
2025-02-18 |
Pitfalls of Scale: Investigating the Inverse Task of Redefinition in Large Language Models |
Elena Stringli et.al. |
2502.12821 |
null |
2025-02-18 |
Simulating User Diversity in Task-Oriented Dialogue Systems using Large Language Models |
Adnan Ahmad et.al. |
2502.12813 |
null |
2025-02-18 |
Towards Text-Image Interleaved Retrieval |
Xin Zhang et.al. |
2502.12799 |
link |
2025-02-18 |
RAPID: Retrieval Augmented Training of Differentially Private Diffusion Models |
Tanqiu Jiang et.al. |
2502.12794 |
link |
2025-02-18 |
Commonsense Reasoning in Arab Culture |
Abdelrahman Sadallah et.al. |
2502.12788 |
null |
2025-02-18 |
Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models |
Daiki Chijiwa et.al. |
2502.12776 |
null |
2025-02-18 |
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild |
Saad Obaid ul Islam et.al. |
2502.12769 |
link |
2025-02-18 |
R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs |
Sumin Jo et.al. |
2502.12767 |
null |
2025-02-18 |
One-bit Compressed Sensing using Generative Models |
Swatantra Kafle et.al. |
2502.12762 |
null |
2025-02-18 |
Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models |
Kamer Ali Yuksel et.al. |
2502.12755 |
link |
2025-02-18 |
Architect of the Bits World: Masked Autoregressive Modeling for Circuit Generation Guided by Truth Table |
Haoyuan Wu et.al. |
2502.12751 |
null |
2025-02-18 |
Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation |
Yong Zhang et.al. |
2502.12744 |
null |
2025-02-18 |
“I know myself better, but not really greatly”: Using LLMs to Detect and Explain LLM-Generated Texts |
Jiazhou Ji et.al. |
2502.12743 |
null |
2025-02-18 |
Circuit Representation Learning with Masked Gate Modeling and Verilog-AIG Alignment |
Haoyuan Wu et.al. |
2502.12732 |
null |
2025-02-18 |
TREND: A Whitespace Replacement Information Hiding Method |
Malte Hellmeier et.al. |
2502.12710 |
null |
2025-02-18 |
Multi-Novelty: Improve the Diversity and Novelty of Contents Generated by Large Language Models via inference-time Multi-Views Brainstorming |
Arash Lagzian et.al. |
2502.12700 |
null |
2025-02-18 |
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees |
Yongtao Wu et.al. |
2502.12678 |
null |
2025-02-18 |
Baichuan-M1: Pushing the Medical Capability of Large Language Models |
Bingning Wang et.al. |
2502.12671 |
null |
2025-02-18 |
Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research |
Xiang Liu et.al. |
2502.12669 |
null |
2025-02-18 |
Evaluation of Best-of-N Sampling Strategies for Language Model Alignment |
Yuki Ichihara et.al. |
2502.12668 |
null |
2025-02-18 |
A $^2$ ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization |
Junhui He et.al. |
2502.12665 |
null |
2025-02-18 |
Demystifying Multilingual Chain-of-Thought in Process Reward Modeling |
Weixuan Wang et.al. |
2502.12663 |
null |
2025-02-18 |
The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1 |
Kaiwen Zhou et.al. |
2502.12659 |
null |
2025-02-18 |
R.R.: Unveiling LLM Training Privacy through Recollection and Ranking |
Wenlong Meng et.al. |
2502.12658 |
link |
2025-02-18 |
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation |
Zhiyuan Liu et.al. |
2502.12638 |
link |
2025-02-18 |
Corrupted but Not Broken: Rethinking the Impact of Corrupted Data in Visual Instruction Tuning |
Yunhao Gou et.al. |
2502.12635 |
null |
2025-02-18 |
\textit{One Size doesn’t Fit All}: A Personalized Conversational Tutoring Agent for Mathematics Instruction |
Ben Liu et.al. |
2502.12633 |
null |
2025-02-18 |
Automating Prompt Leakage Attacks on Large Language Models Using Agentic Approach |
Tvrtko Sternak et.al. |
2502.12630 |
link |
2025-02-18 |
DeepResonance: Enhancing Multimodal Music Understanding via Music-centric Multi-way Instruction Tuning |
Zhuoyuan Mao et.al. |
2502.12623 |
null |
2025-02-18 |
Improving Chain-of-Thought Reasoning via Quasi-Symbolic Abstractions |
Leonardo Ranaldi et.al. |
2502.12616 |
null |
2025-02-17 |
Idiosyncrasies in Large Language Models |
Mingjie Sun et.al. |
2502.12150 |
link |
2025-02-17 |
HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation |
Ling Yang et.al. |
2502.12148 |
link |
2025-02-17 |
Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control |
Jinyan Su et.al. |
2502.12145 |
link |
2025-02-17 |
Small Models Struggle to Learn from Strong Reasoners |
Yuetai Li et.al. |
2502.12143 |
null |
2025-02-17 |
SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs |
Yige Xu et.al. |
2502.12134 |
null |
2025-02-17 |
Transformer Dynamics: A neuroscientific approach to interpretability of large language models |
Jesseba Fernando et.al. |
2502.12131 |
null |
2025-02-17 |
Scaling Autonomous Agents via Automatic Reward Modeling And Planning |
Zhenfang Chen et.al. |
2502.12130 |
null |
2025-02-17 |
LaM-SLidE: Latent Space Modeling of Spatial Dynamical Systems via Linked Entities |
Florian Sestak et.al. |
2502.12128 |
link |
2025-02-17 |
Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA |
Patryk Marszałek et.al. |
2502.12122 |
link |
2025-02-17 |
LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws |
Prasanna Mayilvahanan et.al. |
2502.12120 |
null |
2025-02-17 |
PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection |
Jinhe Bi et.al. |
2502.12119 |
null |
2025-02-17 |
A-MEM: Agentic Memory for LLM Agents |
Wujiang Xu et.al. |
2502.12110 |
link |
2025-02-17 |
Personality Structured Interview for Large Language Model Simulation in Personality Research |
Pengda Wang et.al. |
2502.12109 |
null |
2025-02-17 |
Relational Norms for Human-AI Cooperation |
Brian D. Earp et.al. |
2502.12102 |
null |
2025-02-17 |
Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications |
Li Qiao et.al. |
2502.12096 |
null |
2025-02-17 |
How compositional generalization and creativity improve as diffusion models are trained |
Alessandro Favero et.al. |
2502.12089 |
null |
2025-02-17 |
Meta-Statistical Learning: Supervised Learning of Statistical Inference |
Maxime Peyrard et.al. |
2502.12088 |
null |
2025-02-17 |
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs |
Yuxiang Huang et.al. |
2502.12085 |
link |
2025-02-17 |
Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation |
Zhongyi Qiu et.al. |
2502.12073 |
null |
2025-02-17 |
TokenSkip: Controllable Chain-of-Thought Compression in LLMs |
Heming Xia et.al. |
2502.12067 |
link |
2025-02-17 |
CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models |
Yifan Zhang et.al. |
2502.12066 |
null |
2025-02-17 |
AI-generated Text Detection with a GLTR-based Approach |
Lucía Yan Wu et.al. |
2502.12064 |
null |
2025-02-17 |
Designing Role Vectors to Improve LLM Inference Behaviour |
Daniele Potertì et.al. |
2502.12055 |
null |
2025-02-17 |
PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning |
Xinyu Zhang et.al. |
2502.12054 |
null |
2025-02-17 |
A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond |
Shreya Shukla et.al. |
2502.12048 |
null |
2025-02-17 |
KnowPath: Knowledge-enhanced Reasoning via LLM-generated Inference Paths over Knowledge Graphs |
Qi Zhao et.al. |
2502.12029 |
null |
2025-02-17 |
SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities |
Fengqing Jiang et.al. |
2502.12025 |
null |
2025-02-17 |
Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem Solving |
Xin Xu et.al. |
2502.12022 |
null |
2025-02-17 |
Atom of Thoughts for Markov LLM Test-Time Scaling |
Fengwei Teng et.al. |
2502.12018 |
link |
2025-02-17 |
Unsupervised Structural-Counterfactual Generation under Domain Shift |
Krishn Vishwas Kher et.al. |
2502.12013 |
null |
2025-02-17 |
Design Considerations Based on Stability for a Class of TCP Algorithms |
Sreekanth Prabhakar et.al. |
2502.11983 |
null |
2025-02-17 |
Image Inversion: A Survey from GANs to Diffusion and Beyond |
Yinan Chen et.al. |
2502.11974 |
link |
2025-02-17 |
Generating Text from Uniform Meaning Representation |
Emma Markle et.al. |
2502.11973 |
link |
2025-02-17 |
A MIMO Wireless Channel Foundation Model via CIR-CSI Consistency |
Jun Jiang et.al. |
2502.11965 |
null |
2025-02-17 |
Navigating the Helpfulness-Truthfulness Trade-Off with Uncertainty-Aware Instruction Fine-Tuning |
Tianyi Wu et.al. |
2502.11962 |
null |
2025-02-17 |
On Representational Dissociation of Language and Arithmetic in Large Language Models |
Riku Kisako et.al. |
2502.11932 |
null |
2025-02-17 |
GRAPHGPT-O: Synergistic Multimodal Comprehension and Generation on Graphs |
Yi Fang et.al. |
2502.11925 |
null |
2025-02-17 |
From Text to Trust: Empowering AI-assisted Decision Making with Adaptive LLM-powered Analysis |
Zhuoyan Li et.al. |
2502.11919 |
null |
2025-02-17 |
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models |
Jiamin Su et.al. |
2502.11916 |
null |
2025-02-17 |
Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives |
Leo Schwinn et.al. |
2502.11910 |
null |
2025-02-17 |
MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation |
Haochen Xue et.al. |
2502.11903 |
null |
2025-02-17 |
DLFR-VAE: Dynamic Latent Frame Rate VAE for Video Generation |
Zhihang Yuan et.al. |
2502.11897 |
link |
2025-02-17 |
CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning |
Yanxiao Zhao et.al. |
2502.11896 |
null |
2025-02-17 |
Continual Quantization-Aware Pre-Training: When to transition from 16-bit to 1.58-bit pre-training for BitNet language models? |
Jacob Nielsen et.al. |
2502.11895 |
null |
2025-02-17 |
Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration |
Shao Zhang et.al. |
2502.11882 |
link |
2025-02-17 |
Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models |
Hyunwoo Kim et.al. |
2502.11881 |
null |
2025-02-17 |
Bitnet.cpp: Efficient Edge Inference for Ternary LLMs |
Jinheng Wang et.al. |
2502.11880 |
link |
2025-02-17 |
JoLT: Joint Probabilistic Predictions on Tabular Data Using LLMs |
Aliaksandra Shysheya et.al. |
2502.11877 |
link |
2025-02-17 |
FedEAT: A Robustness Optimization Framework for Federated LLMs |
Yahao Pang et.al. |
2502.11863 |
null |
2025-02-17 |
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu |
Renhao Pei et.al. |
2502.11862 |
link |
2025-02-17 |
Exploring Large Language Models in Healthcare: Insights into Corpora Sources, Customization Strategies, and Evaluation Metrics |
Shuqi Yang et.al. |
2502.11861 |
null |
2025-02-17 |
StructTransform: A Scalable Attack Surface for Safety-Aligned Large Language Models |
Shehel Yoosuf et.al. |
2502.11853 |
link |
2025-02-17 |
BaxBench: Can LLMs Generate Correct and Secure Backends? |
Mark Vero et.al. |
2502.11844 |
null |
2025-02-17 |
Can LLM Agents Maintain a Persona in Discourse? |
Pranav Bhandari et.al. |
2502.11843 |
null |
2025-02-17 |
Model Generalization on Text Attribute Graphs: Principles with Large Language Models |
Haoyu Wang et.al. |
2502.11836 |
link |
2025-02-17 |
HAAN: A Holistic Approach for Accelerating Normalization Operations in Large Language Models |
Tianfan Peng et.al. |
2502.11832 |
null |
2025-02-17 |
Intuitive physics understanding emerges from self-supervised pretraining on natural videos |
Quentin Garrido et.al. |
2502.11831 |
link |
2025-02-17 |
Text Classification in the LLM Era - Where do we stand? |
Sowmya Vajjala et.al. |
2502.11830 |
null |
2025-02-17 |
Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities |
Hanbin Wang et.al. |
2502.11829 |
link |
2025-02-17 |
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis |
Chengyan Wu et.al. |
2502.11824 |
link |
2025-02-17 |
Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis |
Xu Wang et.al. |
2502.11812 |
null |
2025-02-17 |
FineFilter: A Fine-grained Noise Filtering Mechanism for Retrieval-Augmented Large Language Models |
Qianchi Zhang et.al. |
2502.11811 |
null |
2025-02-17 |
Exploring Translation Mechanism of Large Language Models |
Hongbin Zhang et.al. |
2502.11806 |
null |
2025-02-17 |
Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning |
Peiying Yu et.al. |
2502.11799 |
null |
2025-02-17 |
Personality Editing for Language Models through Relevant Knowledge Editing |
Seojin Hwang et.al. |
2502.11789 |
null |
2025-02-17 |
Efficient Response Generation Method Selection for Fine-Tuning Large Language Models |
Xuan Ren et.al. |
2502.11779 |
null |
2025-02-17 |
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model |
Guangzhi Sun et.al. |
2502.11775 |
null |
2025-02-17 |
The Validation Gap: A Mechanistic Analysis of How Language Models Compute Arithmetic but Fail to Validate It |
Leonardo Bertolazzi et.al. |
2502.11771 |
link |
2025-02-17 |
Cognitive-Aligned Document Selection for Retrieval-augmented Generation |
Bingyu Wan et.al. |
2502.11770 |
null |
2025-02-17 |
From Selection to Generation: A Survey of LLM-based Active Learning |
Yu Xia et.al. |
2502.11767 |
null |
2025-02-17 |
Warmup-Distill: Bridge the Distribution Mismatch between Teacher and Student before Knowledge Distillation |
Zengkui Sun et.al. |
2502.11766 |
link |
2025-02-17 |
HintsOfTruth: A Multimodal Checkworthiness Detection Dataset with Real and Synthetic Claims |
Michiel van der Meer et.al. |
2502.11753 |
null |
2025-02-17 |
Language Models Can See Better: Visual Contrastive Decoding For LLM Multimodal Reasoning |
Yuqi Pang et.al. |
2502.11751 |
link |
2025-02-17 |
ILIAS: Instance-Level Image retrieval At Scale |
Giorgos Kordopatis-Zilos et.al. |
2502.11748 |
null |
2025-02-17 |
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL |
Shuai Lyu et.al. |
2502.11741 |
link |
2025-02-17 |
ReviewEval: An Evaluation Framework for AI-Generated Reviews |
Chavvi Kirtani et.al. |
2502.11736 |
null |
2025-02-17 |
Plant in Cupboard, Orange on Table, Book on Shelf. Benchmarking Practical Reasoning and Situation Modelling in a Text-Simulated Situated Environment |
Jonathan Jordan et.al. |
2502.11733 |
null |
2025-02-17 |
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption |
Alireza Nik et.al. |
2502.11723 |
null |
2025-02-17 |
Enhancing Recommendation Explanations through User-Centric Refinement |
Jingsen Zhang et.al. |
2502.11721 |
null |
2025-02-17 |
Can you pass that tool?: Implications of Indirect Speech in Physical Human-Robot Collaboration |
Yan Zhang et.al. |
2502.11720 |
null |
2025-02-17 |
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection |
Xuan Tong et.al. |
2502.11712 |
null |
2025-02-17 |
Ad-hoc Concept Forming in the Game Codenames as a Means for Evaluating Large Language Models |
Sherzod Hakimov et.al. |
2502.11707 |
null |
2025-02-17 |
LLM Agents Making Agent Tools |
Georg Wölflein et.al. |
2502.11705 |
null |
2025-02-17 |
CMQCIC-Bench: A Chinese Benchmark for Evaluating Large Language Models in Medical Quality Control Indicator Calculation |
Guangya Yu et.al. |
2502.11703 |
null |
2025-02-17 |
MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow |
Hanzhuo Huang et.al. |
2502.11697 |
null |
2025-02-17 |
Improve LLM-as-a-Judge Ability as a General Ability |
Jiachen Yu et.al. |
2502.11689 |
null |
2025-02-17 |
MathFimer: Enhancing Mathematical Reasoning by Expanding Reasoning Steps through Fill-in-the-Middle Task |
Yuchen Yan et.al. |
2502.11684 |
null |
2025-02-17 |
RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars |
Yuncheng Hua et.al. |
2502.11681 |
link |
2025-02-17 |
Exploring LLM-based Student Simulation for Metacognitive Cultivation |
Haoxuan Li et.al. |
2502.11678 |
null |
2025-02-17 |
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception |
Shiyu Ni et.al. |
2502.11677 |
null |
2025-02-17 |
Diversity-Oriented Data Augmentation with Large Language Models |
Zaitian Wang et.al. |
2502.11671 |
null |
2025-02-17 |
VRoPE: Rotary Position Embedding for Video Large Language Models |
Zikang Liu et.al. |
2502.11664 |
link |
2025-02-17 |
An Innovative Brain-Computer Interface Interaction System Based on the Large Language Model |
Jing Jina et.al. |
2502.11659 |
null |
2025-02-17 |
Competing LLM Agents in a Non-Cooperative Game of Opinion Polarisation |
Amin Qasmi et.al. |
2502.11649 |
null |
2025-02-17 |
DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing |
Yi Wang et.al. |
2502.11647 |
null |
2025-02-17 |
Hyperspherical Energy Transformer with Recurrent Depth |
Yunzhe Hu et.al. |
2502.11646 |
null |
2025-02-17 |
Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI |
Yuxia Wang et.al. |
2502.11614 |
null |
2025-02-17 |
Maximum Entropy Reinforcement Learning with Diffusion Policy |
Xiaoyi Dong et.al. |
2502.11612 |
link |
2025-02-17 |
Accuracy Assessment of OpenAlex and Clarivate Scholar ID with an LLM-Assisted Benchmark |
Renyu Zhao et.al. |
2502.11610 |
null |
2025-02-17 |
GraphThought: Graph Combinatorial Optimization with Thought Generation |
Zixiao Huang et.al. |
2502.11607 |
null |
2025-02-14 |
MM-RLHF: The Next Step Forward in Multimodal LLM Alignment |
Yi-Fan Zhang et.al. |
2502.10391 |
null |
2025-02-14 |
Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction |
WonJin Yoon et.al. |
2502.10388 |
null |
2025-02-14 |
Robustness tests for biomedical foundation models should tailor to specification |
R. Patrick Xian et.al. |
2502.10374 |
link |
2025-02-14 |
AffinityFlow: Guided Flows for Antibody Affinity Maturation |
Can Chen et.al. |
2502.10365 |
null |
2025-02-14 |
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection |
Bettina Messmer et.al. |
2502.10361 |
null |
2025-02-14 |
Dimension-free Score Matching and Time Bootstrapping for Diffusion Models |
Syamantak Kumar et.al. |
2502.10354 |
null |
2025-02-14 |
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation |
Alexander Wettig et.al. |
2502.10341 |
null |
2025-02-14 |
Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering |
Nick Ferguson et.al. |
2502.10338 |
null |
2025-02-14 |
Generalised Parallel Tempering: Flexible Replica Exchange via Flows and Diffusions |
Leo Zhang et.al. |
2502.10328 |
null |
2025-02-14 |
LLM-Powered Preference Elicitation in Combinatorial Assignment |
Ermis Soumalias et.al. |
2502.10308 |
null |
2025-02-14 |
SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models |
Aditya Mishra et.al. |
2502.10307 |
null |
2025-02-14 |
Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2 |
Saem Hasan et.al. |
2502.10299 |
null |
2025-02-14 |
Probabilistic Super-Resolution for High-Fidelity Physical System Simulations with Uncertainty Quantification |
Pengyu Zhang et.al. |
2502.10280 |
null |
2025-02-14 |
Are Large Language Models the future crowd workers of Linguistics? |
Iris Ferrazzo et.al. |
2502.10266 |
null |
2025-02-14 |
Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers |
Aivin V. Solatorio et.al. |
2502.10263 |
null |
2025-02-14 |
VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models |
Gokul Karthik Kumar et.al. |
2502.10250 |
null |
2025-02-14 |
Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model |
Guoqing Ma et.al. |
2502.10248 |
link |
2025-02-14 |
Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices |
Mohamed Aboelenien Ahmed et.al. |
2502.10239 |
null |
2025-02-14 |
Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control |
Thomas Jiralerspong et.al. |
2502.10236 |
null |
2025-02-14 |
AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting |
Abdelhakim Benechehab et.al. |
2502.10235 |
link |
2025-02-14 |
Do Large Language Models Reason Causally Like Us? Even Better? |
Hanna M. Dettki et.al. |
2502.10215 |
null |
2025-02-14 |
Can Post-Training Quantization Benefit from an Additional QLoRA Integration? |
Xiliang Zhu et.al. |
2502.10202 |
null |
2025-02-14 |
Prediction hubs are context-informed frequent tokens in LLMs |
Beatrix M. G. Nielsen et.al. |
2502.10201 |
null |
2025-02-14 |
MathConstruct: Challenging LLM Reasoning with Constructive Proofs |
Mislav Balunović et.al. |
2502.10197 |
null |
2025-02-14 |
Translating Common Security Assertions Across Processor Designs: A RISC-V Case Study |
Sharjeel Imtiaz et.al. |
2502.10194 |
null |
2025-02-14 |
VideoDiff: Human-AI Video Co-Creation with Alternatives |
Mina Huh et.al. |
2502.10190 |
null |
2025-02-14 |
Modeling biases in binary decision-making within the generalized nonlinear q-voter model |
Maciej Doniec et.al. |
2502.10172 |
null |
2025-02-14 |
Video Soundtrack Generation by Aligning Emotions and Temporal Boundaries |
Serkan Sulun et.al. |
2502.10154 |
null |
2025-02-14 |
Semantica: Decentralized Search using a LLM-Guided Semantic Tree Overlay |
Petru Neague et.al. |
2502.10151 |
link |
2025-02-14 |
Cooperative Multi-Agent Planning with Adaptive Skill Synthesis |
Zhiyuan Li et.al. |
2502.10148 |
null |
2025-02-14 |
Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of Small Multilingual Language Models for Low-Resource Languages |
Daniil Gurgurov et.al. |
2502.10140 |
null |
2025-02-14 |
Physics-Informed Generative Modeling of Wireless Channels |
Benedikt Böck et.al. |
2502.10137 |
null |
2025-02-14 |
ScamFerret: Detecting Scam Websites Autonomously with Large Language Models |
Hiroki Nakano et.al. |
2502.10110 |
link |
2025-02-14 |
NeuroXVocal: Detection and Explanation of Alzheimer’s Disease through Non-invasive Analysis of Picture-prompted Speech |
Nikolaos Ntampakis et.al. |
2502.10108 |
null |
2025-02-14 |
A novel approach to data generation in generative model |
JaeHong Kim et.al. |
2502.10092 |
null |
2025-02-14 |
Enhancing Patient Acceptance of Robotic Ultrasound through Conversational Virtual Agent and Immersive Visualizations |
Tianyu Song et.al. |
2502.10088 |
link |
2025-02-14 |
DiSciPLE: Learning Interpretable Programs for Scientific Visual Discovery |
Utkarsh Mall et.al. |
2502.10060 |
null |
2025-02-14 |
A Generalized Modeling Approach to Liquid-driven Ballooning Membranes |
Mirroyal Ismayilov et.al. |
2502.10057 |
null |
2025-02-14 |
ORI: O Routing Intelligence |
Ahmad Shadid et.al. |
2502.10051 |
null |
2025-02-14 |
A Survey on LLM-powered Agents for Recommender Systems |
Qiyao Peng et.al. |
2502.10050 |
null |
2025-02-14 |
ViRAC: A Vision-Reasoning Agent Head Movement Control Framework in Arbitrary Virtual Environments |
Juyeong Hwang et.al. |
2502.10046 |
null |
2025-02-14 |
POI-Enhancer: An LLM-based Semantic Enhancement Framework for POI Representation Learning |
Jiawei Cheng et.al. |
2502.10038 |
null |
2025-02-14 |
Probabilistic Lexical Manifold Construction in Large Language Models via Hierarchical Vector Field Interpolation |
Clive Pendleton et.al. |
2502.10013 |
null |
2025-02-14 |
ChatGPT and Deepseek: Can They Predict the Stock Market and Macroeconomy? |
Jian Chen et.al. |
2502.10008 |
null |
2025-02-14 |
EmbBERT-Q: Breaking Memory Barriers in Embedded NLP |
Riccardo Bravin et.al. |
2502.10001 |
null |
2025-02-14 |
Decision Information Meets Large Language Models: The Future of Explainable Operations Research |
Yansen Zhang et.al. |
2502.09994 |
null |
2025-02-14 |
Large Language Diffusion Models |
Shen Nie et.al. |
2502.09992 |
null |
2025-02-14 |
V2V-LLM: Vehicle-to-Vehicle Cooperative Autonomous Driving with Multi-Modal Large Language Models |
Hsu-kuang Chiu et.al. |
2502.09980 |
null |
2025-02-14 |
LaRA: Benchmarking Retrieval-Augmented Generation and Long-Context LLMs - No Silver Bullet for LC or RAG Routing |
Kuan Li et.al. |
2502.09977 |
null |
2025-02-14 |
Has My System Prompt Been Used? Large Language Model Prompt Membership Inference |
Roman Levin et.al. |
2502.09974 |
null |
2025-02-14 |
KGGen: Extracting Knowledge Graphs from Plain Text with Language Models |
Belinda Mo et.al. |
2502.09956 |
null |
2025-02-14 |
A Preliminary Exploration with GPT-4o Voice Mode |
Yu-Xiang Lin et.al. |
2502.09940 |
null |
2025-02-14 |
Precise Parameter Localization for Textual Generation in Diffusion Models |
Łukasz Staniszewski et.al. |
2502.09935 |
null |
2025-02-14 |
MIR-Bench: Benchmarking LLM’s Long-Context Intelligence via Many-Shot In-Context Inductive Reasoning |
Kai Yan et.al. |
2502.09933 |
null |
2025-02-14 |
Granite Vision: a lightweight, open-source multimodal model for enterprise Intelligence |
Granite Vision Team et.al. |
2502.09927 |
null |
2025-02-14 |
λScale: Enabling Fast Scaling for Serverless Large Language Model Inference |
Minchen Yu et.al. |
2502.09922 |
null |
2025-02-14 |
INF^2: High-Throughput Generative Inference of Large Language Models using Near-Storage Processing |
Hongsun Jang et.al. |
2502.09921 |
null |
2025-02-14 |
AutoS $^2$ earch: Unlocking the Reasoning Potential of Large Models for Web-based Source Search |
Zhengqiu Zhu et.al. |
2502.09913 |
null |
2025-02-14 |
Insect-Foundation: A Foundation Model and Large Multimodal Dataset for Vision-Language Insect Understanding |
Thanh-Dat Truong et.al. |
2502.09906 |
null |
2025-02-14 |
The Ann Arbor Architecture for Agent-Oriented Programming |
Wei Dong et.al. |
2502.09903 |
link |
2025-02-14 |
Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond |
Kehan Guo et.al. |
2502.09897 |
null |
2025-02-14 |
ChatIoT: Large Language Model-based Security Assistant for Internet of Things with Retrieval-Augmented Generation |
Ye Dong et.al. |
2502.09896 |
null |
2025-02-14 |
ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation |
Shu Wang et.al. |
2502.09891 |
null |
2025-02-14 |
Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos |
Weirui Ye et.al. |
2502.09886 |
null |
2025-02-14 |
Solvable Dynamics of Self-Supervised Word Embeddings and the Emergence of Analogical Reasoning |
Dhruva Karkada et.al. |
2502.09863 |
null |
2025-02-14 |
Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge |
Naoyuki Kamo et.al. |
2502.09859 |
null |
2025-02-14 |
Automated Hypothesis Validation with Agentic Sequential Falsifications |
Kexin Huang et.al. |
2502.09858 |
link |
2025-02-14 |
Port-LLM: A Port Prediction Method for Fluid Antenna based on Large Language Models |
Yali Zhang et.al. |
2502.09857 |
null |
2025-02-14 |
Efficient Multitask Learning in Small Language Models Through Upside-Down Reinforcement Learning |
Yu-Chen Lin et.al. |
2502.09854 |
null |
2025-02-14 |
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation |
Tianwei Lin et.al. |
2502.09838 |
link |
2025-02-13 |
A Solver-Aided Hierarchical Language for LLM-Driven CAD Design |
Benjamin T. Jones et.al. |
2502.09819 |
null |
2025-02-13 |
Statistical Coherence Alignment for Large Language Model Representation Learning Through Tensor Field Convergence |
Jonathan Gale et.al. |
2502.09815 |
null |
2025-02-13 |
INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages |
Hao Yu et.al. |
2502.09814 |
null |
2025-02-13 |
AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration |
Jizhou Chen et.al. |
2502.09809 |
null |
2025-02-13 |
Unit Testing Past vs. Present: Examining LLMs’ Impact on Defect Detection and Efficiency |
Rudolf Ramler et.al. |
2502.09801 |
null |
2025-02-13 |
Co-designing Large Language Model Tools for Project-Based Learning with K12 Educators |
Prerna Ravi et.al. |
2502.09799 |
null |
2025-02-13 |
A Survey on LLM-based News Recommender Systems |
Rongyao Wang et.al. |
2502.09797 |
null |
2025-02-13 |
TableTalk: Scaffolding Spreadsheet Development with a Language Agent |
Jenny T. Liang et.al. |
2502.09787 |
null |
2025-02-13 |
Improving Acoustic Side-Channel Attacks on Keyboards Using Transformers and Large Language Models |
Jin Hyun Park et.al. |
2502.09782 |
null |
2025-02-13 |
CellFlow: Simulating Cellular Morphology Changes via Flow Matching |
Yuhui Zhang et.al. |
2502.09775 |
null |
2025-02-13 |
Non-Markovian Discrete Diffusion with Causal Language Models |
Yangtian Zhang et.al. |
2502.09767 |
null |
2025-02-13 |
LLM-Generated Microservice Implementations from RESTful API Definitions |
Saurabh Chauhan et.al. |
2502.09766 |
link |
2025-02-13 |
Enhancing Jailbreak Attacks via Compliance-Refusal-Based Initialization |
Amit Levi et.al. |
2502.09755 |
null |
2025-02-13 |
Vote-Tree-Planner: Optimizing Execution Order in LLM-based Task Planning Pipeline via Voting |
Chaoyuan Zhang et.al. |
2502.09749 |
null |
2025-02-13 |
The Widespread Adoption of Large Language Model-Assisted Writing Across Society |
Weixin Liang et.al. |
2502.09747 |
null |
2025-02-13 |
Fine-Tuning Foundation Models with Federated Learning for Privacy Preserving Medical Time Series Forecasting |
Mahad Ali et.al. |
2502.09744 |
null |
2025-02-13 |
FoNE: Precise Single-Token Number Embeddings via Fourier Features |
Tianyi Zhou et.al. |
2502.09741 |
null |
2025-02-13 |
Making Them a Malicious Database: Exploiting Query Code to Jailbreak Aligned Large Language Models |
Qingsong Zou et.al. |
2502.09723 |
link |
2025-02-13 |
NestQuant: Nested Lattice Quantization for Matrix Products and LLMs |
Semyon Savkin et.al. |
2502.09720 |
null |
2025-02-13 |
Genetic Data Governance in Crisis: Policy Recommendations for Safeguarding Privacy and Preventing Discrimination |
Vivek Ramanan et.al. |
2502.09716 |
null |
2025-02-13 |
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency |
Dongzhi Jiang et.al. |
2502.09621 |
null |
2025-02-13 |
Exploring the Potential of Encoder-free Architectures in 3D LMMs |
Yiwen Tang et.al. |
2502.09620 |
link |
2025-02-13 |
Designing a Conditional Prior Distribution for Flow-Based Generative Models |
Noam Issachar et.al. |
2502.09611 |
null |
2025-02-14 |
Score-of-Mixture Training: Training One-Step Generative Models Made Simple via Score Estimation of Mixture Distributions |
Tejas Jayashankar et.al. |
2502.09609 |
null |
2025-02-13 |
Human-LLM Coevolution: Evidence from Academic Writing |
Mingmeng Geng et.al. |
2502.09606 |
null |
2025-02-13 |
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models |
Yung-Sung Chuang et.al. |
2502.09604 |
link |
2025-02-13 |
Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs |
Siyan Zhao et.al. |
2502.09597 |
link |
2025-02-13 |
KIMAs: A Configurable Knowledge Integrated Multi-Agent System |
Zitao Li et.al. |
2502.09596 |
null |
2025-02-13 |
Logical forms complement probability in understanding language model (and human) performance |
Yixuan Wang et.al. |
2502.09589 |
null |
2025-02-13 |
Rolling Ahead Diffusion for Traffic Scene Simulation |
Yunpeng Liu et.al. |
2502.09587 |
null |
2025-02-13 |
Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks |
Qian Wan et.al. |
2502.09577 |
null |
2025-02-13 |
Zero-shot generation of synthetic neurosurgical data with large language models |
Austin A. Barr et.al. |
2502.09566 |
link |
2025-02-13 |
MDCrow: Automating Molecular Dynamics Workflows with Large Language Models |
Quintina Campbell et.al. |
2502.09565 |
link |
2025-02-13 |
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents |
Rui Yang et.al. |
2502.09560 |
null |
2025-02-13 |
Explainable AI-assisted Optimization for Feynman Integral Reduction |
Zhuo-Yang Song et.al. |
2502.09544 |
null |
2025-02-13 |
Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages |
Shreyan Biswas et.al. |
2502.09532 |
null |
2025-02-13 |
SQ-GAN: Semantic Image Communications Using Masked Vector Quantization |
Francesco Pezone et.al. |
2502.09520 |
link |
2025-02-13 |
Diffusion Models for Molecules: A Survey of Methods and Tasks |
Liang Wang et.al. |
2502.09511 |
link |
2025-02-13 |
EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling |
Theodoros Kouzelis et.al. |
2502.09509 |
null |
2025-02-13 |
Improve LLM-based Automatic Essay Scoring with Linguistic Features |
Zhaoyi Joey Hou et.al. |
2502.09497 |
null |
2025-02-13 |
Foundation Neural-Network Quantum States |
Riccardo Rende et.al. |
2502.09488 |
null |
2025-02-13 |
Objective quantification of mood states using large language models |
Jakub Onysk et.al. |
2502.09487 |
null |
2025-02-13 |
DiffRenderGAN: Addressing Training Data Scarcity in Deep Segmentation Networks for Quantitative Nanomaterial Analysis through Differentiable Rendering and Generative Modelling |
Dennis Possart et.al. |
2502.09477 |
null |
2025-02-13 |
Transformer-Enhanced Variational Autoencoder for Crystal Structure Prediction |
Ziyi Chen et.al. |
2502.09423 |
null |
2025-02-13 |
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation |
Rotem Shalev-Arkushin et.al. |
2502.09411 |
null |
2025-02-13 |
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models |
Daniel Fleischer et.al. |
2502.09390 |
link |
2025-02-13 |
Truth Knows No Language: Evaluating Truthfulness Beyond English |
Blanca Calvo Figueras et.al. |
2502.09387 |
null |
2025-02-13 |
APT-LLM: Embedding-Based Anomaly Detection of Cyber Advanced Persistent Threats Using Large Language Models |
Sidahmed Benabderrahmane et.al. |
2502.09385 |
null |
2025-02-13 |
LoRA Training Provably Converges to a Low-Rank Global Minimum or It Fails Loudly (But it Probably Won’t Fail) |
Junsu Kim et.al. |
2502.09376 |
null |
2025-02-13 |
Inverse problems with experiment-guided AlphaFold |
Advaith Maddipatla et.al. |
2502.09372 |
null |
2025-02-13 |
Language Agents as Digital Representatives in Collective Decision-Making |
Daniel Jarrett et.al. |
2502.09369 |
null |
2025-02-13 |
Machine learning for modelling unstructured grid data in computational physics: a review |
Sibo Cheng et.al. |
2502.09346 |
null |
2025-02-13 |
ThunderServe: High-performance and Cost-efficient LLM Serving in Cloud Environments |
Youhe Jiang et.al. |
2502.09334 |
null |
2025-02-13 |
Beyond English: The Impact of Prompt Translation Strategies across Languages and Tasks in Multilingual LLMs |
Itai Mondshine et.al. |
2502.09331 |
null |
2025-02-13 |
Copilot Arena: A Platform for Code LLM Evaluation in the Wild |
Wayne Chi et.al. |
2502.09328 |
null |
2025-02-13 |
A Benchmark for Crime Surveillance Video Analysis with Large Models |
Haoran Chen et.al. |
2502.09325 |
null |
2025-02-13 |
A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis |
Kentaro Imajo et.al. |
2502.09316 |
link |
2025-02-13 |
When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models |
Samuel Joseph Amouyal et.al. |
2502.09307 |
null |
2025-02-13 |
Non-asymptotic Analysis of Diffusion Annealed Langevin Monte Carlo for Generative Modelling |
Paula Cordero-Encinar et.al. |
2502.09306 |
null |
2025-02-13 |
KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG |
Yiqian Huang et.al. |
2502.09304 |
link |
2025-02-13 |
When do neural networks learn world models? |
Tianren Zhang et.al. |
2502.09297 |
null |
2025-02-13 |
SparQLe: Speech Queries to Text Translation Through LLMs |
Amirbek Djanibekov et.al. |
2502.09284 |
link |
2025-02-13 |
GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation |
Hongyin Zhang et.al. |
2502.09268 |
null |
2025-02-13 |
AnomalyGFM: Graph Foundation Model for Zero/Few-shot Anomaly Detection |
Hezhe Qiao et.al. |
2502.09254 |
null |
2025-02-13 |
From large language models to multimodal AI: A scoping review on the potential of generative AI in medicine |
Lukas Buess et.al. |
2502.09242 |
null |
2025-02-13 |
OpenBench: A New Benchmark and Baseline for Semantic Navigation in Smart Logistics |
Junhui Wang et.al. |
2502.09238 |
null |
2025-02-13 |
Reliable Conversational Agents under ASP Control that Understand Natural Language |
Yankai Zeng et.al. |
2502.09237 |
null |
2025-02-13 |
Data2Concept2Text: An Explainable Multilingual Framework for Data Analysis Narration |
Flavio Bertini et.al. |
2502.09218 |
null |
2025-02-13 |
LP-LM: No Hallucinations in Question Answering with Logic Programming |
Katherine Wu et.al. |
2502.09212 |
link |
2025-02-13 |
Visual Graph Question Answering with ASP and LLMs for Language Parsing |
Jakob Johannes Bauer et.al. |
2502.09211 |
null |
2025-02-13 |
On LLM-generated Logic Programs and their Inference Execution Methods |
Paul Tarau et.al. |
2502.09209 |
null |
2025-02-13 |
Logical Lease Litigation: Prolog and LLMs for Rental Law Compliance in New York |
Sanskar Sehgal et.al. |
2502.09204 |
null |
2025-02-13 |
XAInomaly: Explainable and Interpretable Deep Contractive Autoencoder for O-RAN Traffic Anomaly Detection |
Osman Tugay Basaran et.al. |
2502.09194 |
null |
2025-02-13 |
Thinking beyond the anthropomorphic paradigm benefits LLM research |
Lujain Ibrahim et.al. |
2502.09192 |
null |
2025-02-13 |
Matina: A Large-Scale 73B Token Persian Text Corpus |
Sara Bourbour Hosseinbeigi et.al. |
2502.09188 |
null |
2025-02-13 |
RefineCoder: Iterative Improving of Large Language Models via Adaptive Critique Refinement for Code Generation |
Changzhi Zhou et.al. |
2502.09183 |
null |
2025-02-13 |
FLAME: Flexible LLM-Assisted Moderation Engine |
Ivan Bakulin et.al. |
2502.09175 |
null |
2025-02-13 |
Two-Stage Representation Learning for Analyzing Movement Behavior Dynamics in People Living with Dementia |
Jin Cui et.al. |
2502.09173 |
null |
2025-02-13 |
Improving TCM Question Answering through Tree-Organized Self-Reflective Retrieval with LLMs |
Chang Liu et.al. |
2502.09156 |
null |
2025-02-13 |
Finite-Time Analysis of Discrete-Time Stochastic Interpolants |
Yuhao Liu et.al. |
2502.09130 |
null |
2025-02-13 |
One-shot Federated Learning Methods: A Practical Guide |
Xiang Liu et.al. |
2502.09104 |
null |
2025-02-13 |
Bridging the Gap Between LLMs and Human Intentions: Progresses and Challenges in Instruction Understanding, Intention Reasoning, and Reliable Generation |
Zongyu Chang et.al. |
2502.09101 |
null |
2025-02-13 |
Logical Reasoning in Large Language Models: A Survey |
Hanmeng Liu et.al. |
2502.09100 |
null |
2025-02-13 |
Show Me the Work: Fact-Checkers’ Requirements for Explainable Automated Fact-Checking |
Greta Warren et.al. |
2502.09083 |
null |
2025-02-13 |
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles |
Xintao Wang et.al. |
2502.09082 |
link |
2025-02-13 |
Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables |
Xuzhao Geng et.al. |
2502.09073 |
null |
2025-02-13 |
Unleashing the Power of Large Language Model for Denoising Recommendation |
Shuyao Wang et.al. |
2502.09058 |
null |
2025-02-13 |
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging |
Kunat Pipatanakul et.al. |
2502.09056 |
null |
2025-02-13 |
Game Theory Meets Large Language Models: A Systematic Survey |
Haoran Sun et.al. |
2502.09053 |
null |
2025-02-13 |
Typhoon T1: An Open Thai Reasoning Model |
Pittawat Taveekitworachai et.al. |
2502.09042 |
null |
2025-02-13 |
Implementation of a Fuzzy Relational Database. Case Study: Chilean Cardboard Industry in the Maule Region |
Leoncio Jimenez et.al. |
2502.09035 |
null |
2025-02-13 |
MTDP: Modulated Transformer Diffusion Policy Model |
Qianhao Wang et.al. |
2502.09029 |
null |
2025-02-13 |
EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition |
Xiao Wang et.al. |
2502.09020 |
link |
2025-02-13 |
Diversity Enhances an LLM’s Performance in RAG and Long-context Task |
Zhchao Wang et.al. |
2502.09017 |
null |
2025-02-13 |
Hope vs. Hate: Understanding User Interactions with LGBTQ+ News Content in Mainstream US News Media through the Lens of Hope Speech |
Jonathan Pofcher et.al. |
2502.09004 |
null |
2025-02-13 |
RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models |
Quan Wei et.al. |
2502.09003 |
null |
2025-02-13 |
End-to-End triplet loss based fine-tuning for network embedding in effective PII detection |
Rishika Kohli et.al. |
2502.09002 |
null |
2025-02-13 |
Task Generalization With AutoRegressive Compositional Structure: Can Learning From $\d$ Tasks Generalize to $\d^{T}$ Tasks? |
Amirhesam Abedsoltan et.al. |
2502.08991 |
null |
2025-02-13 |
Prophet Inequalities for Bandits, Cabinets, and DAGs |
Robin Bowers et.al. |
2502.08976 |
null |
2025-02-13 |
Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning |
Leon Nissen et.al. |
2502.08954 |
link |
2025-02-13 |
Structured Convergence in Large Language Model Representations via Hierarchical Latent Space Folding |
Fenella Harcourt et.al. |
2502.08947 |
null |
2025-02-13 |
Beyond the Singular: The Essential Role of Multiple Generations in Effective Benchmark Evaluation and Analysis |
Wenbo Zhang et.al. |
2502.08943 |
null |
2025-02-13 |
Escaping Collapse: The Strength of Weak Data for Large Language Model Training |
Kareem Amin et.al. |
2502.08924 |
null |
2025-02-13 |
Self-Consistency of the Internal Reward Models Improves Self-Rewarding Language Models |
Xin Zhou et.al. |
2502.08922 |
null |
2025-02-13 |
Detecting Malicious Concepts Without Image Generation in AIGC |
Kun Xu et.al. |
2502.08921 |
null |
2025-02-13 |
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU |
Heejun Lee et.al. |
2502.08910 |
null |
2025-02-13 |
Towards Automated Fact-Checking of Real-World Claims: Exploring Task Formulation and Assessment with LLMs |
Premtim Sahitaj et.al. |
2502.08909 |
null |
2025-02-13 |
Reinforced Large Language Model is a formal theorem prover |
Zhiling Luo et.al. |
2502.08908 |
link |
2025-02-13 |
DiffoRA: Enabling Parameter-Efficient LLM Fine-Tuning via Differential Low-Rank Matrix Adaptation |
Tangyu Jiang et.al. |
2502.08905 |
null |
2025-02-13 |
MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training |
Xinxin You et.al. |
2502.08904 |
null |
2025-02-13 |
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning |
Guoqin Tang et.al. |
2502.08903 |
null |
2025-02-13 |
Communication is All You Need: Persuasion Dataset Construction via Multi-LLM Communication |
Weicheng Ma et.al. |
2502.08896 |
null |
2025-02-13 |
ShapeLib: designing a library of procedural 3D shape abstractions with Large Language Models |
R. Kenny Jones et.al. |
2502.08884 |
null |
2025-02-13 |
Utilizing Pre-trained and Large Language Models for 10-K Items Segmentation |
Hsin-Min Lu et.al. |
2502.08875 |
null |
2025-02-13 |
Harnessing Vision Models for Time Series Analysis: A Survey |
Jingchao Ni et.al. |
2502.08869 |
link |
2025-02-13 |
A Systematic Evaluation of Generative Models on Tabular Transportation Data |
Chengen Wang et.al. |
2502.08856 |
link |
2025-02-12 |
Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation |
Mohammad Mahdi Abootorabi et.al. |
2502.08826 |
link |
2025-02-12 |
DejAIvu: Identifying and Explaining AI Art on the Web in Real-Time with Saliency Maps |
Jocelyn Dzuong et.al. |
2502.08821 |
link |
2025-02-12 |
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model |
Emre Can Acikgoz et.al. |
2502.08820 |
null |
2025-02-12 |
Lexical Manifold Reconfiguration in Large Language Models: A Novel Architectural Approach for Contextual Modulation |
Koinis Vassilis et.al. |
2502.08818 |
null |
2025-02-12 |
Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples |
Andrianos Michail et.al. |
2502.08638 |
null |
2025-02-12 |
Ensemble based approach to quantifying uncertainty of LLM based classifications |
Srijith Rajamohan et.al. |
2502.08631 |
null |
2025-02-12 |
Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model |
Saurabh Kataria et.al. |
2502.08612 |
null |
2025-02-12 |
Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors |
Vishwanath Pratap Singh et.al. |
2502.08587 |
null |
2025-02-12 |
Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks |
Ang Li et.al. |
2502.08586 |
null |
2025-02-12 |
Statistically validated projection of bipartite signed networks |
Anna Gallo et.al. |
2502.08567 |
null |
2025-02-12 |
QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval |
Wonduk Seo et.al. |
2502.08557 |
null |
2025-02-12 |
Human-Centric Foundation Models: Perception, Generation and Agentic Modeling |
Shixiang Tang et.al. |
2502.08556 |
link |
2025-02-12 |
Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies |
Sunnie S. Y. Kim et.al. |
2502.08554 |
null |
2025-02-12 |
LLMs can implicitly learn from mistakes in-context |
Lisa Alazraki et.al. |
2502.08550 |
null |
2025-02-12 |
LLM Pretraining with Continuous Concepts |
Jihoon Tack et.al. |
2502.08524 |
null |
2025-02-12 |
FedMHO: Heterogeneous One-Shot Federated Learning Towards Resource-Constrained Edge Devices |
Dezhong Yao et.al. |
2502.08518 |
link |
2025-02-12 |
The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data |
Evgenii Evstafev et.al. |
2502.08515 |
null |
2025-02-12 |
Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation |
Mahnaz Koupaee et.al. |
2502.08514 |
link |
2025-02-12 |
Measuring Diversity in Synthetic Datasets |
Yuchang Zhu et.al. |
2502.08512 |
link |
2025-02-12 |
Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction |
Wei Li et.al. |
2502.08507 |
link |
2025-02-12 |
Salamandra Technical Report |
Aitor Gonzalez-Agirre et.al. |
2502.08489 |
link |
2025-02-12 |
One-Shot Federated Learning with Classifier-Free Diffusion Models |
Obaidullah Zaland et.al. |
2502.08488 |
null |
2025-02-12 |
Computed fingertip touch for the instrumental control of musical sound with an excursion on the computed retinal afterimage |
Staas de Jong et.al. |
2502.08471 |
null |
2025-02-12 |
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data |
Haonan Chen et.al. |
2502.08468 |
link |
2025-02-12 |
From Haystack to Needle: Label Space Reduction for Zero-shot Classification |
Nathan Vandemoortele et.al. |
2502.08436 |
null |
2025-02-12 |
IssueBench: Millions of Realistic Prompts for Measuring Issue Bias in LLM Writing Assistance |
Paul Röttger et.al. |
2502.08395 |
null |
2025-02-12 |
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification |
Jiangbo Shi et.al. |
2502.08391 |
link |
2025-02-12 |
Top-Theta Attention: Sparsifying Transformers by Compensated Thresholding |
Konstantin Berestizshevsky et.al. |
2502.08363 |
link |
2025-02-12 |
Systematic Knowledge Injection into Large Language Models via Diverse Augmentation for Domain-Specific RAG |
Kushagra Bhushan et.al. |
2502.08356 |
null |
2025-02-12 |
Trustworthy GNNs with LLMs: A Systematic Review and Taxonomy |
Ruizhan Xue et.al. |
2502.08353 |
null |
2025-02-12 |
Graph Foundation Models for Recommendation: A Comprehensive Survey |
Bin Wu et.al. |
2502.08346 |
null |
2025-02-12 |
Foundation Models in Computational Pathology: A Review of Challenges, Opportunities, and Impact |
Mohsin Bilal et.al. |
2502.08333 |
null |
2025-02-12 |
Modification and Generated-Text Detection: Achieving Dual Detection Capabilities for the Outputs of LLM by Watermark |
Yuhang Cai et.al. |
2502.08332 |
null |
2025-02-12 |
Contextual Compression Encoding for Large Language Models: A Novel Framework for Multi-Layered Parameter Space Pruning |
Barnaby Schmitt et.al. |
2502.08323 |
null |
2025-02-12 |
MultiProSE: A Multi-label Arabic Dataset for Propaganda, Sentiment, and Emotion Detection |
Lubna Al-Henaki et.al. |
2502.08319 |
null |
2025-02-12 |
Word Synchronization Challenge: A Benchmark for Word Association Responses for LLMs |
Tanguy Cazalets et.al. |
2502.08312 |
null |
2025-02-12 |
Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model |
Bencheng Yan et.al. |
2502.08309 |
null |
2025-02-12 |
HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting |
Shibo Feng et.al. |
2502.08302 |
link |
2025-02-12 |
Compromising Honesty and Harmlessness in Language Models via Deception Attacks |
Laurène Vaugrante et.al. |
2502.08301 |
null |
2025-02-12 |
Improving Existing Optimization Algorithms with LLMs |
Camilo Chacón Sartori et.al. |
2502.08298 |
null |
2025-02-12 |
Redefining Simplicity: Benchmarking Large Language Models from Lexical to Document Simplification |
Jipeng Qiang et.al. |
2502.08281 |
null |
2025-02-12 |
MoLoRec: A Generalizable and Efficient Framework for LLM-Based Recommendation |
Min Hou et.al. |
2502.08271 |
null |
2025-02-12 |
Exploring the Potential of Large Language Models to Simulate Personality |
Maria Molchanova et.al. |
2502.08265 |
link |
2025-02-12 |
GenIAS: Generator for Instantiating Anomalies in time Series |
Zahra Zamanzadeh Darban et.al. |
2502.08262 |
null |
2025-02-12 |
FixDrive: Automatically Repairing Autonomous Vehicle Driving Behaviour for $0.08 per Violation |
Yang Sun et.al. |
2502.08260 |
link |
2025-02-12 |
Learning Human Skill Generators at Key-Step Levels |
Yilu Wu et.al. |
2502.08234 |
null |
2025-02-12 |
Flow-of-Action: SOP Enhanced LLM-Based Multi-Agent System for Root Cause Analysis |
Changhua Pei et.al. |
2502.08224 |
null |
2025-02-12 |
Memory Offloading for Large Language Model Inference with Latency SLO Guarantees |
Chenxiang Ma et.al. |
2502.08182 |
null |
2025-02-12 |
Enhancing LLM Character-Level Manipulation via Divide and Conquer |
Zhen Xiong et.al. |
2502.08180 |
null |
2025-02-12 |
ParetoRAG: Leveraging Sentence-Context Attention for Robust and Efficient Retrieval-Augmented Generation |
Ruobing Yao et.al. |
2502.08178 |
null |
2025-02-12 |
SycEval: Evaluating LLM Sycophancy |
Aaron Fanous et.al. |
2502.08177 |
null |
2025-02-12 |
Intention is All You Need: Refining Your Code from Your Intention |
Qi Guo et.al. |
2502.08172 |
null |
2025-02-12 |
Force Matching with Relativistic Constraints: A Physics-Inspired Approach to Stable and Efficient Generative Modeling |
Yang Cao et.al. |
2502.08150 |
null |
2025-02-12 |
ACCESS : A Benchmark for Abstract Causal Event Discovery and Reasoning |
Vy Vo et.al. |
2502.08148 |
null |
2025-02-12 |
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers |
Siddharth Singh et.al. |
2502.08145 |
null |
2025-02-12 |
Bridging the Safety Gap: A Guardrail Pipeline for Trustworthy LLM Inferences |
Shanshan Han et.al. |
2502.08142 |
null |
2025-02-12 |
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits |
Zikai Zhou et.al. |
2502.08141 |
null |
2025-02-12 |
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models |
Sonam Gupta et.al. |
2502.08130 |
null |
2025-02-12 |
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance |
Lingfei Qian et.al. |
2502.08127 |
link |
2025-02-12 |
HuDEx: Integrating Hallucination Detection and Explainability for Enhancing the Reliability of LLM responses |
Sujeong Lee et.al. |
2502.08109 |
null |
2025-02-12 |
Large language models perpetuate bias in palliative care: development and analysis of the Palliative Care Adversarial Dataset (PCAD) |
Naomi Akhras et.al. |
2502.08073 |
null |
2025-02-12 |
On Mechanistic Circuits for Extractive Question-Answering |
Samyadeep Basu et.al. |
2502.08059 |
null |
2025-02-12 |
Break the Checkbox: Challenging Closed-Style Evaluations of Cultural Alignment in LLMs |
Mohsinul Kabir et.al. |
2502.08045 |
null |
2025-02-12 |
Franken-Adapter: Cross-Lingual Adaptation of LLMs by Embedding Surgery |
Fan Jiang et.al. |
2502.08037 |
null |
2025-02-12 |
Stochastic Kinetics of Transcription: Analysis and Computation |
Yuntao Lu et.al. |
2502.08028 |
null |
2025-02-12 |
Contextual Subspace Manifold Projection for Structural Refinement of Large Language Model Representations |
Alistair Wren et.al. |
2502.08026 |
null |
2025-02-11 |
Speculate, then Collaborate: Fusing Knowledge of Language Models during Decoding |
Ziyao Wang et.al. |
2502.08020 |
null |
2025-02-11 |
The Geometry of Prompting: Unveiling Distinct Mechanisms of Task Adaptation in Language Models |
Artem Kirsanov et.al. |
2502.08009 |
null |
2025-02-11 |
An Interactive Framework for Implementing Privacy-Preserving Federated Learning: Experiments on Large Language Models |
Kasra Ahmadi et.al. |
2502.08008 |
link |
2025-02-11 |
Towards Training One-Step Diffusion Models Without Distillation |
Mingtian Zhang et.al. |
2502.08005 |
null |
2025-02-11 |
Universal Adversarial Attack on Aligned Multimodal LLMs |
Temurbek Rahmatullaev et.al. |
2502.07987 |
null |
2025-02-11 |
Deep Semantic Graph Learning via LLM based Node Enhancement |
Chuanqi Shi et.al. |
2502.07982 |
null |
2025-02-11 |
CIRCUIT: A Benchmark for Circuit Interpretation and Reasoning Capabilities of LLMs |
Lejla Skelic et.al. |
2502.07980 |
null |
2025-02-11 |
From Hazard Identification to Controller Design: Proactive and LLM-Supported Safety Engineering for ML-Powered Systems |
Yining Hong et.al. |
2502.07974 |
null |
2025-02-11 |
Caught in the Web of Words: Do LLMs Fall for Spin in Medical Literature? |
Hye Sun Yun et.al. |
2502.07963 |
null |
2025-02-11 |
Accelerating Scientific Research Through a Multi-LLM Framework |
Joaquin Ramirez-Medina et.al. |
2502.07960 |
null |
2025-02-11 |
Bridging HCI and AI Research for the Evaluation of Conversational SE Assistants |
Jonan Richards et.al. |
2502.07956 |
null |
2025-02-11 |
Symbiotic Cooperation for Web Agents: Harnessing Complementary Strengths of Large and Small LLMs |
Ruichen Zhang et.al. |
2502.07942 |
null |
2025-02-11 |
Discrete Markov Probabilistic Models |
Le-Tuyet-Nhi Pham et.al. |
2502.07939 |
null |
2025-02-11 |
Distributed Approach to Haskell Based Applications Refactoring with LLMs Based Multi-Agent Systems |
Shahbaz Siddeeq et.al. |
2502.07928 |
null |
2025-02-11 |
Sign Operator for Coping with Heavy-Tailed Noise: High Probability Convergence Bounds with Extensions to Distributed Optimization and Comparison Oracle |
Nikita Kornilov et.al. |
2502.07923 |
null |
2025-02-11 |
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning |
Rujing Yao et.al. |
2502.07912 |
link |
2025-02-11 |
DeepSeek on a Trip: Inducing Targeted Visual Hallucinations via Representation Vulnerabilities |
Chashi Mahiul Islam et.al. |
2502.07905 |
null |
2025-02-11 |
Intelligent Legal Assistant: An Interactive Clarification System for Legal Question Answering |
Rujing Yao et.al. |
2502.07904 |
null |
2025-02-11 |
HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment |
Youhe Jiang et.al. |
2502.07903 |
null |
2025-02-11 |
TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation |
Alex Jinpeng Wang et.al. |
2502.07870 |
link |
2025-02-11 |
TransMLA: Multi-head Latent Attention Is All You Need |
Fanxu Meng et.al. |
2502.07864 |
link |
2025-02-11 |
BalanceKV: KV Cache Compression through Discrepancy Theory |
Insu Han et.al. |
2502.07861 |
null |
2025-02-11 |
Pippo: High-Resolution Multi-View Humans from a Single Image |
Yash Kant et.al. |
2502.07785 |
null |
2025-02-11 |
DarwinLM: Evolutionary Structured Pruning of Large Language Models |
Shengkun Tang et.al. |
2502.07780 |
link |
2025-02-11 |
Stay-Positive: A Case for Ignoring Real Image Features in Fake Image Detection |
Anirudh Sundara Rajan et.al. |
2502.07778 |
null |
2025-02-11 |
Auditing Prompt Caching in Language Model APIs |
Chenchen Gu et.al. |
2502.07776 |
link |
2025-02-11 |
Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming |
Azizjon Kobilov et.al. |
2502.07772 |
null |
2025-02-11 |
Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers |
Italo Santos et.al. |
2502.07763 |
null |
2025-02-11 |
Scalable Fingerprinting of Large Language Models |
Anshul Nasery et.al. |
2502.07760 |
null |
2025-02-11 |
Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension |
Wenbo Gong et.al. |
2502.07752 |
null |
2025-02-11 |
WHODUNIT: Evaluation benchmark for culprit detection in mystery stories |
Kshitij Gupta et.al. |
2502.07747 |
link |
2025-02-11 |
The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing |
Dirk Bergemann et.al. |
2502.07736 |
null |
2025-02-11 |
Revisiting Non-Acyclic GFlowNets in Discrete Environments |
Nikita Morozov et.al. |
2502.07735 |
link |
2025-02-11 |
Economics of Sourcing Human Data |
Sebastin Santy et.al. |
2502.07732 |
null |
2025-02-11 |
Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK |
Marcos Cramer et.al. |
2502.07728 |
null |
2025-02-11 |
Near-Optimal Sample Complexity in Reward-Free Kernel-Based Reinforcement Learning |
Aya Kayal et.al. |
2502.07715 |
null |
2025-02-11 |
Magic 1-For-1: Generating One Minute Video Clips within One Minute |
Hongwei Yi et.al. |
2502.07701 |
link |
2025-02-11 |
A Framework for LLM-powered Design Assistants |
Swaroop Panda et.al. |
2502.07698 |
null |
2025-02-11 |
Large Language Models as Proxies for Theories of Human Linguistic Cognition |
Imry Ziv et.al. |
2502.07687 |
null |
2025-02-11 |
Steering Protein Family Design through Profile Bayesian Flow |
Jingjing Gong et.al. |
2502.07671 |
null |
2025-02-11 |
Guiding Time-Varying Generative Models with Natural Gradients on Exponential Family Manifold |
Song Liu et.al. |
2502.07650 |
null |
2025-02-11 |
SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models |
Shihao Xia et.al. |
2502.07644 |
null |
2025-02-11 |
FoQA: A Faroese Question-Answering Dataset |
Annika Simonsen et.al. |
2502.07642 |
null |
2025-02-11 |
Distributional Instrumental Variable Method |
Anastasiia Holovchak et.al. |
2502.07641 |
link |
2025-02-11 |
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving |
Yong Lin et.al. |
2502.07640 |
link |
2025-02-11 |
Consistency Training with Physical Constraints |
Che-Chia Chang et.al. |
2502.07636 |
null |
2025-02-11 |
Exploring Mobile Touch Interaction with Large Language Models |
Tim Zindulka et.al. |
2502.07629 |
null |
2025-02-11 |
Tractable Transformers for Flexible Conditional Generation |
Anji Liu et.al. |
2502.07616 |
null |
2025-02-11 |
Beyond Prompting: Time2Lang – Bridging Time-Series Foundation Models and Large Language Models for Health Sensing |
Arvind Pillai et.al. |
2502.07608 |
null |
2025-02-11 |
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models |
Jiacong Xu et.al. |
2502.07601 |
null |
2025-02-11 |
Towards spatial computing: recent advances in multimodal natural interaction for XR headsets |
Zhimin Wang et.al. |
2502.07598 |
null |
2025-02-11 |
SEMU: Singular Value Decomposition for Efficient Machine Unlearning |
Marcin Sendera et.al. |
2502.07587 |
null |
2025-02-11 |
Generative Modeling with Bayesian Sample Inference |
Marten Lienen et.al. |
2502.07580 |
link |
2025-02-11 |
PIM Is All You Need: A CXL-Enabled GPU-Free System for Large Language Model Inference |
Yufeng Gu et.al. |
2502.07578 |
link |
2025-02-11 |
Automated Capability Discovery via Model Self-Exploration |
Cong Lu et.al. |
2502.07577 |
link |
2025-02-11 |
JBShield: Defending Large Language Models from Jailbreak Attacks through Activated Concept Analysis and Manipulation |
Shenyi Zhang et.al. |
2502.07557 |
link |
2025-02-11 |
O1 Embedder: Let Retrievers Think Before Action |
Ruin Yan et.al. |
2502.07555 |
null |
2025-02-11 |
Grammar Control in Dialogue Response Generation for Language Learning Chatbots |
Dominik Glandorf et.al. |
2502.07544 |
link |
2025-02-11 |
NatureLM: Deciphering the Language of Nature for Scientific Discovery |
Yingce Xia et.al. |
2502.07527 |
null |
2025-02-11 |
The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray Generation |
Raman Dutt et.al. |
2502.07516 |
link |
2025-02-11 |
Enhance-A-Video: Better Generated Video for Free |
Yang Luo et.al. |
2502.07508 |
link |
2025-02-11 |
Towards THz-based Obstacle Sensing: A Generative Radio Environment Awareness Framework |
Tianyu Hu et.al. |
2502.07504 |
null |
2025-02-11 |
Unified Graph Networks (UGN): A Deep Neural Framework for Solving Graph Problems |
Rudrajit Dawn et.al. |
2502.07500 |
null |
2025-02-11 |
LLM-Sketch: Enhancing Network Sketches with LLM |
Yuanpeng Li et.al. |
2502.07495 |
link |
2025-02-11 |
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More |
Xialie Zhuang et.al. |
2502.07490 |
link |
2025-02-11 |
Improving Adaptive Moment Optimization via Preconditioner Diagonalization |
Son Nguyen et.al. |
2502.07488 |
null |
2025-02-11 |
ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model |
Xiaochen Liu et.al. |
2502.07474 |
null |
2025-02-11 |
JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata |
Abhinaba Roy et.al. |
2502.07461 |
link |
2025-02-11 |
Logarithmic Regret for Online KL-Regularized Reinforcement Learning |
Heyang Zhao et.al. |
2502.07460 |
null |
2025-02-11 |
PerCul: A Story-Driven Cultural Evaluation of LLMs in Persian |
Erfan Moosavi Monazzah et.al. |
2502.07459 |
null |
2025-02-11 |
RusCode: Russian Cultural Code Benchmark for Text-to-Image Generation |
Viacheslav Vasilev et.al. |
2502.07455 |
link |
2025-02-11 |
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon |
Nurit Cohen-Inger et.al. |
2502.07445 |
link |
2025-02-11 |
Towards a Foundation Model for Physics-Informed Neural Networks: Multi-PDE Learning with Active Sampling |
Keon Vin Park et.al. |
2502.07425 |
null |
2025-02-11 |
RomanLens: Latent Romanization and its role in Multilinguality in LLMs |
Alan Saji et.al. |
2502.07424 |
null |
2025-02-11 |
Entity Linking using LLMs for Automated Product Carbon Footprint Estimation |
Steffen Castle et.al. |
2502.07418 |
null |
2025-02-11 |
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering |
Sheng Zhou et.al. |
2502.07411 |
link |
2025-02-11 |
MGPATH: Vision-Language Model with Multi-Granular Prompt Learning for Few-Shot WSI Classification |
Anh-Tien Nguyen et.al. |
2502.07409 |
link |
2025-02-11 |
On Iterative Evaluation and Enhancement of Code Quality Using GPT-4o |
Rundong Liu et.al. |
2502.07399 |
link |
2025-02-11 |
FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents |
Mostapha Benhenda et.al. |
2502.07393 |
link |
2025-02-11 |
LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! |
Dacheng Li et.al. |
2502.07374 |
link |
2025-02-11 |
EvoFlow: Evolving Diverse Agentic Workflows On The Fly |
Guibin Zhang et.al. |
2502.07373 |
null |
2025-02-11 |
LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation |
Zican Dong et.al. |
2502.07365 |
null |
2025-02-11 |
Bridging the Evaluation Gap: Leveraging Large Language Models for Topic Model Evaluation |
Zhiyin Tan et.al. |
2502.07352 |
link |
2025-02-11 |
KABB: Knowledge-Aware Bayesian Bandits for Dynamic Expert Coordination in Multi-Agent Systems |
Jusheng Zhang et.al. |
2502.07350 |
null |
2025-02-11 |
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models |
Xu Huang et.al. |
2502.07346 |
link |
2025-02-11 |
Aligning Large Language Models to Follow Instructions and Hallucinate Less via Effective Data Filtering |
Shuzheng Si et.al. |
2502.07340 |
link |
2025-02-11 |
Music for All: Exploring Multicultural Representations in Music Generation Models (Camera Ready) |
Atharva Mehta et.al. |
2502.07328 |
link |
2025-02-11 |
Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos |
Haowen Gao et.al. |
2502.07327 |
null |
2025-02-11 |
MEMIT-Merge: Addressing MEMIT’s Key-Value Conflicts in Same-Subject Batch Editing for LLMs |
Zilu Dong et.al. |
2502.07322 |
null |
2025-02-11 |
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction |
Junlong Li et.al. |
2502.07316 |
link |
2025-02-11 |
Prompt-Based Document Modifications In Ranking Competitions |
Niv Bardas et.al. |
2502.07315 |
null |
2025-02-11 |
CreAgent: Towards Long-Term Evaluation of Recommender System under Platform-Creator Information Asymmetry |
Xiaopeng Ye et.al. |
2502.07307 |
link |
2025-02-11 |
TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation |
Navid Rajabi et.al. |
2502.07306 |
null |
2025-02-11 |
Flow Matching for Collaborative Filtering |
Chengkai Liu et.al. |
2502.07303 |
link |
2025-02-11 |
Generation of Drug-Induced Cardiac Reactions towards Virtual Clinical Trials |
Qian Shao et.al. |
2502.07297 |
null |
2025-02-11 |
Small Language Model Makes an Effective Long Text Extractor |
Yelin Chen et.al. |
2502.07286 |
link |
2025-02-11 |
Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization |
Aditya Vora et.al. |
2502.07278 |
null |
2025-02-11 |
Cost-Efficient Continual Learning with Sufficient Exemplar Memory |
Dongkyu Cho et.al. |
2502.07274 |
null |
2025-02-11 |
GENERator: A Long-Context Generative Genomic Foundation Model |
Wei Wu et.al. |
2502.07272 |
link |
2025-02-11 |
When More is Less: Understanding Chain-of-Thought Length in LLMs |
Yuyang Wu et.al. |
2502.07266 |
null |
2025-02-11 |
DrugImproverGPT: A Large Language Model for Drug Optimization with Fine-Tuning via Structured Policy Optimization |
Xuefeng Liu et.al. |
2502.07237 |
null |
2025-02-11 |
A Memory Efficient Randomized Subspace Optimization Method for Training Large Language Models |
Yiming Chen et.al. |
2502.07222 |
null |
2025-02-11 |
MLLM4PUE: Toward Universal Embeddings in Computational Pathology through Multimodal LLMs |
Qifeng Zhou et.al. |
2502.07221 |
null |
2025-02-11 |
LUNAR: LLM Unlearning via Neural Activation Redirection |
William F. Shen et.al. |
2502.07218 |
null |
2025-02-11 |
Playmate: Flexible Control of Portrait Animation via 3D-Implicit Space Guided Diffusion |
Xingpei Ma et.al. |
2502.07203 |
null |
2025-02-11 |
Provably Efficient RLHF Pipeline: A Unified View from Contextual Bandits |
Long-Fei Li et.al. |
2502.07193 |
link |
2025-02-11 |
Bag of Tricks for Inference-time Computation of LLM Reasoning |
Fan Liu et.al. |
2502.07191 |
link |
2025-02-11 |
A Large-Scale Benchmark for Vietnamese Sentence Paraphrases |
Sang Quang Nguyen et.al. |
2502.07188 |
link |
2025-02-11 |
Refine Knowledge of Large Language Models via Adaptive Contrastive Learning |
Yinghui Li et.al. |
2502.07184 |
null |
2025-02-11 |
Does Training on Synthetic Data Make Models Less Robust? |
Lingze Zhang et.al. |
2502.07164 |
null |
2025-02-11 |
Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning |
Feng Chen et.al. |
2502.07154 |
link |
2025-02-11 |
Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning |
Jiayuan Zhu et.al. |
2502.07143 |
null |
2025-02-11 |
Language-TPP: Integrating Temporal Point Processes with Language Models for Event Analysis |
Quyu Kong et.al. |
2502.07139 |
null |
2025-02-10 |
Cardiverse: Harnessing LLMs for Novel Card Game Prototyping |
Danrui Li et.al. |
2502.07128 |
null |
2025-02-10 |
Structural Reformation of Large Language Model Neuron Encapsulation for Divergent Information Aggregation |
Denis Bakushev et.al. |
2502.07124 |
null |
2025-02-10 |
Online Scheduling for LLM Inference with KV Cache Constraints |
Patrick Jaillet et.al. |
2502.07115 |
null |
2025-02-10 |
Generative Distribution Prediction: A Unified Approach to Multimodal Learning |
Xinyu Tian et.al. |
2502.07090 |
null |
2025-02-10 |
Evaluating the Systematic Reasoning Abilities of Large Language Models through Graph Coloring |
Alex Heyman et.al. |
2502.07087 |
link |
2025-02-10 |
MPFBench: A Large Scale Dataset for SciML of Multi-Phase-Flows: Droplet and Bubble Dynamics |
Mehdi Shadkhah et.al. |
2502.07080 |
null |
2025-02-10 |
Multi-turn Evaluation of Anthropomorphic Behaviours in Large Language Models |
Lujain Ibrahim et.al. |
2502.07077 |
null |
2025-02-10 |
IRepair: An Intent-Aware Approach to Repair Data-Driven Errors in Large Language Models |
Sayem Mohammad Imtiaz et.al. |
2502.07072 |
null |
2025-02-10 |
Specializing Large Language Models to Simulate Survey Response Distributions for Global Populations |
Yong Cao et.al. |
2502.07068 |
link |
2025-02-10 |
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT |
Dongyang Liu et.al. |
2502.06782 |
null |
2025-02-10 |
Enhancing Performance of Explainable AI Models with Constrained Concept Refinement |
Geyu Liang et.al. |
2502.06775 |
null |
2025-02-10 |
Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions |
Jaeyeon Kim et.al. |
2502.06768 |
null |
2025-02-10 |
Rationalization Models for Text-to-SQL |
Gaetano Rossiello et.al. |
2502.06759 |
null |
2025-02-10 |
Accelerating Data Processing and Benchmarking of AI Models for Pathology |
Andrew Zhang et.al. |
2502.06750 |
link |
2025-02-10 |
Gradient Multi-Normalization for Stateless and Scalable LLM Training |
Meyer Scetbon et.al. |
2502.06742 |
null |
2025-02-10 |
VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data |
Thomas Zeng et.al. |
2502.06737 |
null |
2025-02-10 |
Señorita-2M: A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists |
Bojia Zi et.al. |
2502.06734 |
null |
2025-02-10 |
Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining |
Daouda Sow et.al. |
2502.06733 |
null |
2025-02-10 |
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling |
Runze Liu et.al. |
2502.06703 |
link |
2025-02-10 |
No Trick, No Treat: Pursuits and Challenges Towards Simulation-free Training of Neural Samplers |
Jiajun He et.al. |
2502.06685 |
null |
2025-02-10 |
EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks |
Michael Arbel et.al. |
2502.06684 |
null |
2025-02-10 |
Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations |
Rui Chen et.al. |
2502.06669 |
null |
2025-02-10 |
Automatic Evaluation of Healthcare LLMs Beyond Question-Answering |
Anna Arias-Duart et.al. |
2502.06666 |
null |
2025-02-10 |
Evaluation of Deep Audio Representations for Hearables |
Fabian Gröger et.al. |
2502.06664 |
null |
2025-02-10 |
EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models |
Xingrun Xing et.al. |
2502.06663 |
null |
2025-02-10 |
Unbiased Evaluation of Large Language Models from a Causal Perspective |
Meilin Chen et.al. |
2502.06655 |
null |
2025-02-10 |
In-Context Learning (and Unlearning) of Length Biases |
Stephanie Schoch et.al. |
2502.06653 |
null |
2025-02-10 |
Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A |
Anna Leschanowsky et.al. |
2502.06652 |
null |
2025-02-10 |
Automatic Annotation Augmentation Boosts Translation between Molecules and Natural Language |
Zhiqiang Zhong et.al. |
2502.06634 |
null |
2025-02-10 |
Combining Large Language Models with Static Analyzers for Code Review Generation |
Imen Jaoua et.al. |
2502.06633 |
null |
2025-02-10 |
Multi-Scale Feature Fusion with Image-Driven Spatial Integration for Left Atrium Segmentation from Cardiac MRI Images |
Bipasha Kundu et.al. |
2502.06615 |
null |
2025-02-10 |
A Large-scale AI-generated Image Inpainting Benchmark |
Paschalis Giakoumoglou et.al. |
2502.06593 |
null |
2025-02-10 |
Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training |
Yuchen Zhuang et.al. |
2502.06589 |
null |
2025-02-10 |
A Survey on Video Analytics in Cloud-Edge-Terminal Collaborative Systems |
Linxiao Gong et.al. |
2502.06581 |
null |
2025-02-10 |
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM |
Zhi Zhou et.al. |
2502.06572 |
link |
2025-02-10 |
Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation |
Chengwen Qi et.al. |
2502.06563 |
link |
2025-02-10 |
Is API Access to LLMs Useful for Generating Private Synthetic Tabular Data? |
Marika Swanberg et.al. |
2502.06555 |
null |
2025-02-10 |
Efficient Scientific Full Text Classification: The Case of EICAT Impact Assessments |
Marc Felix Brinner et.al. |
2502.06551 |
null |
2025-02-10 |
Ignore the KL Penalty! Boosting Exploration on Critical Tokens to Enhance RL Fine-Tuning |
Jean Vassoyan et.al. |
2502.06533 |
null |
2025-02-10 |
Properties of Wasserstein Gradient Flows for the Sliced-Wasserstein Distance |
Christophe Vauthier et.al. |
2502.06525 |
null |
2025-02-10 |
GuideLLM: Exploring LLM-Guided Conversation with Applications in Autobiography Interviewing |
Jinhao Duan et.al. |
2502.06494 |
null |
2025-02-10 |
Recent Advances in Discrete Speech Tokens: A Review |
Yiwei Guo et.al. |
2502.06490 |
null |
2025-02-10 |
Adaptive Prompting: Ad-hoc Prompt Composition for Social Bias Detection |
Maximilian Spliethöver et.al. |
2502.06487 |
null |
2025-02-10 |
WyckoffDiff - A Generative Diffusion Model for Crystal Symmetry |
Filip Ekström Kelvinius et.al. |
2502.06485 |
null |
2025-02-10 |
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths |
Weijia Mao et.al. |
2502.06474 |
null |
2025-02-10 |
KARMA: Leveraging Multi-Agent LLMs for Automated Knowledge Graph Enrichment |
Yuxing Lu et.al. |
2502.06472 |
link |
2025-02-10 |
A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks |
Hieu Minh “Jord” Nguyen et.al. |
2502.06470 |
null |
2025-02-10 |
MATH-Perturb: Benchmarking LLMs’ Math Reasoning Abilities against Hard Perturbations |
Kaixuan Huang et.al. |
2502.06453 |
null |
2025-02-10 |
FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model |
Anna Tegon et.al. |
2502.06438 |
null |
2025-02-10 |
Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising |
Huaqiu Li et.al. |
2502.06432 |
link |
2025-02-10 |
CoS: Chain-of-Shot Prompting for Long Video Understanding |
Jian Hu et.al. |
2502.06428 |
null |
2025-02-10 |
Generating Privacy-Preserving Personalized Advice with Zero-Knowledge Proofs and LLMs |
Hiroki Watanabe et.al. |
2502.06425 |
null |
2025-02-10 |
Occ-LLM: Enhancing Autonomous Driving with Occupancy-Based Large Language Models |
Tianshuo Xu et.al. |
2502.06419 |
null |
2025-02-10 |
Systematic Outliers in Large Language Models |
Yongqi An et.al. |
2502.06415 |
link |
2025-02-10 |
AppVLM: A Lightweight Vision Language Model for Online App Control |
Georgios Papoudakis et.al. |
2502.06395 |
null |
2025-02-10 |
How Humans Help LLMs: Assessing and Incentivizing Human Preference Annotators |
Shang Liu et.al. |
2502.06387 |
null |
2025-02-10 |
Simulation as Reality? The Effectiveness of LLM-Generated Data in Open-ended Question Assessment |
Long Zhang et.al. |
2502.06371 |
null |
2025-02-10 |
Calibrating LLMs with Information-Theoretic Evidential Deep Learning |
Yawei Li et.al. |
2502.06351 |
link |
2025-02-10 |
Can AI Examine Novelty of Patents?: Novelty Evaluation Based on the Correspondence between Patent Claim and Prior Art |
Hayato Ikoma et.al. |
2502.06316 |
null |
2025-02-10 |
Latent Convergence Modulation in Large Language Models: A Novel Approach to Iterative Contextual Realignment |
Patricia Porretta et.al. |
2502.06302 |
null |
2025-02-10 |
SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia |
Chaoqun Liu et.al. |
2502.06298 |
null |
2025-02-10 |
Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases? |
Qingshan Hou et.al. |
2502.06289 |
null |
2025-02-10 |
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE |
Haiduo Huang et.al. |
2502.06282 |
link |
2025-02-10 |
DebateBench: A Challenging Long Context Reasoning Benchmark For Large Language Models |
Utkarsh Tiwari et.al. |
2502.06279 |
null |
2025-02-10 |
Emergent Response Planning in LLM |
Zhichen Dong et.al. |
2502.06258 |
null |
2025-02-10 |
K-ON: Stacking Knowledge On the Head Layer of Large Language Model |
Lingbing Guo et.al. |
2502.06257 |
null |
2025-02-10 |
Find Central Dogma Again |
Wang Liang et.al. |
2502.06253 |
null |
2025-02-10 |
Amplifying Minority Voices: AI-Mediated Devil’s Advocate System for Inclusive Group Decision-Making |
Soohwan Lee et.al. |
2502.06251 |
null |
2025-02-10 |
PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts |
Zeman Li et.al. |
2502.06244 |
null |
2025-02-10 |
Fully Exploiting Vision Foundation Model’s Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing |
Sicen Guo et.al. |
2502.06219 |
null |
2025-02-10 |
LessLeak-Bench: A First Investigation of Data Leakage in LLMs Across 83 Software Engineering Benchmarks |
Xin Zhou et.al. |
2502.06215 |
null |
2025-02-10 |
Unveiling the Capabilities of Large Language Models in Detecting Offensive Language with Annotation Disagreement |
Junyu Lu et.al. |
2502.06207 |
null |
2025-02-10 |
C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation |
Guoxin Chen et.al. |
2502.06205 |
null |
2025-02-10 |
Non-literal Understanding of Number Words by Language Models |
Polina Tsvilodub et.al. |
2502.06204 |
null |
2025-02-10 |
Timing Matters: How Using LLMs at Different Timings Influences Writers’ Perceptions and Ideation Outcomes in AI-Assisted Ideation |
Peinuan Qin et.al. |
2502.06197 |
null |
2025-02-10 |
Can LLMs Replace Human Evaluators? An Empirical Study of LLM-as-a-Judge in Software Engineering |
Ruiqi Wang et.al. |
2502.06193 |
null |
2025-02-10 |
Uncertainty-Aware Adaptation of Large Language Models for Protein-Protein Interaction Analysis |
Sanket Jantre et.al. |
2502.06173 |
null |
2025-02-10 |
A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation |
Wenhui Lei et.al. |
2502.06171 |
null |
2025-02-10 |
Universal Approximation of Visual Autoregressive Transformers |
Yifang Chen et.al. |
2502.06167 |
null |
2025-02-10 |
Scaling Public Health Text Annotation: Zero-Shot Learning vs. Crowdsourcing for Improved Efficiency and Labeling Accuracy |
Kamyar Kazari et.al. |
2502.06150 |
null |
2025-02-10 |
Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection |
Yan Weng et.al. |
2502.06148 |
null |
2025-02-10 |
LegalViz: Legal Text Visualization by Text To Diagram Generation |
Eri Onami et.al. |
2502.06147 |
null |
2025-02-10 |
LCIRC: A Recurrent Compression Approach for Efficient Long-form Context and Query Dependent Modeling in LLMs |
Sumin An et.al. |
2502.06139 |
null |
2025-02-10 |
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models |
Ce Zhang et.al. |
2502.06130 |
null |
2025-02-10 |
Foundation Model of Electronic Medical Records for Adaptive Risk Estimation |
Pawel Renc et.al. |
2502.06124 |
link |
2025-02-10 |
Task-driven Layerwise Additive Activation Intervention |
Hieu Trung Nguyen et.al. |
2502.06115 |
null |
2025-02-10 |
CSR-Bench: Benchmarking LLM Agents in Deployment of Computer Science Research Repositories |
Yijia Xiao et.al. |
2502.06111 |
null |
2025-02-10 |
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning |
Jian Xu et.al. |
2502.06101 |
link |
2025-02-10 |
ConMeC: A Dataset for Metonymy Resolution with Common Nouns |
Saptarshi Ghosh et.al. |
2502.06087 |
link |
2025-02-10 |
Physics-Guided Foundation Model for Scientific Discovery: An Application to Aquatic Science |
Runlong Yu et.al. |
2502.06084 |
link |
2025-02-10 |
Debiasing Guidance for Discrete Diffusion with Sequential Monte Carlo |
Cheuk Kit Lee et.al. |
2502.06079 |
null |
2025-02-09 |
Deconstructing Depression Stigma: Integrating AI-driven Data Collection and Analysis with Causal Knowledge Graphs |
Han Meng et.al. |
2502.06075 |
null |
2025-02-09 |
Allegro-FM: Towards Equivariant Foundation Model for Exascale Molecular Dynamics Simulations |
Ken-ichi Nomura et.al. |
2502.06073 |
null |
2025-02-09 |
Benchmarking Prompt Sensitivity in Large Language Models |
Amirhossein Razavi et.al. |
2502.06065 |
null |
2025-02-09 |
Online Reward-Weighted Fine-Tuning of Flow Matching with Wasserstein Regularization |
Jiajun Fan et.al. |
2502.06061 |
null |
2025-02-09 |
Benchmarking Prompt Engineering Techniques for Secure Code Generation with GPT Models |
Marc Bruni et.al. |
2502.06039 |
null |
2025-02-09 |
Investigating Compositional Reasoning in Time Series Foundation Models |
Willa Potosnak et.al. |
2502.06037 |
link |
2025-02-09 |
A Multimodal PDE Foundation Model for Prediction and Scientific Text Descriptions |
Elisa Negrini et.al. |
2502.06026 |
link |
2025-02-09 |
Dual Caption Preference Optimization for Diffusion Models |
Amir Saeidi et.al. |
2502.06023 |
null |
2025-02-09 |
Temporal Working Memory: Query-Guided Segment Refinement for Enhanced Multimodal Understanding |
Xingjian Diao et.al. |
2502.06020 |
link |
2025-02-09 |
Media Bias Detector: Designing and Implementing a Tool for Real-Time Selection and Framing Bias Analysis in News Coverage |
Jenny S Wang et.al. |
2502.06009 |
null |
2025-02-09 |
Analysis of LLM as a grammatical feature tagger for African American English |
Rahul Porwal et.al. |
2502.06004 |
null |
2025-02-09 |
HamRaz: A Culture-Based Persian Conversation Dataset for Person-Centered Therapy Using LLM Agents |
Mohammad Amin Abbasi et.al. |
2502.05982 |
null |
2025-02-09 |
$μ$ nit Scaling: Simple and Scalable FP8 LLM Training |
Saaketh Narayan et.al. |
2502.05967 |
null |
2025-02-09 |
Redefining Robot Generalization Through Interactive Intelligence |
Sharmita Dey et.al. |
2502.05963 |
null |
2025-02-09 |
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents |
Jiabin Tang et.al. |
2502.05957 |
null |
2025-02-09 |
Cyri: A Conversational AI-based Assistant for Supporting the Human User in Detecting and Responding to Phishing Attacks |
Antonio La Torre et.al. |
2502.05951 |
null |
2025-02-09 |
Acceleration Multiple Heads Decoding for LLM via Dynamic Tree Attention |
Zhendong Zhang et.al. |
2502.05947 |
null |
2025-02-09 |
“Let the AI conspiracy begin…” Language Model coordination is just one inference-intervention away |
Paul Darm et.al. |
2502.05945 |
null |
2025-02-07 |
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray |
Yunhang Shen et.al. |
2502.05177 |
link |
2025-02-07 |
Fillerbuster: Multi-View Scene Completion for Casual Captures |
Ethan Weber et.al. |
2502.05175 |
null |
2025-02-07 |
NoLiMa: Long-Context Evaluation Beyond Literal Matching |
Ali Modarressi et.al. |
2502.05167 |
link |
2025-02-07 |
Multitwine: Multi-Object Compositing with Text and Layout Control |
Gemma Canet Tarrés et.al. |
2502.05165 |
null |
2025-02-07 |
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails |
Yihe Deng et.al. |
2502.05163 |
link |
2025-02-07 |
A Lightweight Method to Disrupt Memorized Sequences in LLM |
Parjanya Prajakta Prashant et.al. |
2502.05159 |
null |
2025-02-07 |
Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation |
Steffen Eger et.al. |
2502.05151 |
link |
2025-02-07 |
CodeSCM: Causal Analysis for Multi-Modal Code Generation |
Mukur Gupta et.al. |
2502.05150 |
link |
2025-02-07 |
An Annotated Reading of ‘The Singer of Tales’ in the LLM Era |
Kush R. Varshney et.al. |
2502.05148 |
null |
2025-02-07 |
Chest X-ray Foundation Model with Global and Local Representations Integration |
Zefan Yang et.al. |
2502.05142 |
link |
2025-02-07 |
Latent Swap Joint Diffusion for Long-Form Audio Generation |
Yusheng Dai et.al. |
2502.05130 |
null |
2025-02-07 |
Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning |
Matt von Hippel et.al. |
2502.05121 |
null |
2025-02-07 |
Flexible and Efficient Grammar-Constrained Decoding |
Kanghee Park et.al. |
2502.05111 |
null |
2025-02-07 |
Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs |
Rohit Saxena et.al. |
2502.05092 |
null |
2025-02-07 |
Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs |
Thierry Bossy et.al. |
2502.05087 |
link |
2025-02-07 |
Causality can systematically address the monsters under the bench(marks) |
Felix Leeb et.al. |
2502.05085 |
null |
2025-02-07 |
ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework |
Xiaoyu Deng et.al. |
2502.05084 |
null |
2025-02-07 |
Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures |
Tushar Pandey et.al. |
2502.05078 |
link |
2025-02-07 |
Beautiful Images, Toxic Words: Understanding and Addressing Offensive Text in Generated Images |
Aditya Kumar et.al. |
2502.05066 |
link |
2025-02-07 |
nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow |
Geliang Ouyang et.al. |
2502.05036 |
link |
2025-02-07 |
Prospects for detecting generic fast-time features in the neutrino lightcurve of nearby supernovae in neutrino telescopes |
Jakob Beise et.al. |
2502.05024 |
null |
2025-02-07 |
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations |
Andrei Panferov et.al. |
2502.05003 |
link |
2025-02-07 |
Aligning Black-box Language Models with Human Judgments |
Gerrit J. J. van den Burg et.al. |
2502.04997 |
null |
2025-02-07 |
C2GM: Cascading Conditional Generation of Multi-scale Maps from Remote Sensing Images Constrained by Geographic Features |
Chenxing Sun et.al. |
2502.04991 |
null |
2025-02-07 |
MoGraphGPT: Creating Interactive Scenes Using Modular LLM and Graphical Control |
Hui Ye et.al. |
2502.04983 |
null |
2025-02-07 |
Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits |
Finn Rietz et.al. |
2502.04979 |
null |
2025-02-07 |
Towards Multimodal Empathetic Response Generation: A Rich Text-Speech-Vision Avatar-based Benchmark |
Han Zhang et.al. |
2502.04976 |
null |
2025-02-07 |
CoCoA: A Generalized Approach to Uncertainty Quantification by Integrating Confidence and Consistency of LLM Outputs |
Roman Vashurin et.al. |
2502.04964 |
null |
2025-02-07 |
The Rising Threat to Emerging AI-Powered Search Engines |
Zeren Luo et.al. |
2502.04951 |
null |
2025-02-07 |
Mobile Network-specialized Large Language Models for 6G: Architectures, Innovations, Challenges, and Future Trends |
Abdelaali Chaoub et.al. |
2502.04933 |
null |
2025-02-07 |
Generative-enhanced optimization for knapsack problems: an industry-relevant study |
Yelyzaveta Vodovozova et.al. |
2502.04928 |
null |
2025-02-07 |
Classification or Prompting: A Case Study on Legal Requirements Traceability |
Romina Etezadi et.al. |
2502.04916 |
null |
2025-02-07 |
Goku: Flow Based Video Generative Foundation Models |
Shoufa Chen et.al. |
2502.04896 |
null |
2025-02-07 |
A Foundational Brain Dynamics Model via Stochastic Optimal Control |
Joonhyeong Park et.al. |
2502.04892 |
null |
2025-02-07 |
Training-free Task-oriented Grasp Generation |
Jiaming Wang et.al. |
2502.04873 |
null |
2025-02-07 |
Advancing Wasserstein Convergence Analysis of Score-Based Models: Insights from Discretization and Second-Order Acceleration |
Yifeng Yu et.al. |
2502.04849 |
null |
2025-02-07 |
Developmentally-plausible Working Memory Shapes a Critical Period for Language Acquisition |
Masato Mita et.al. |
2502.04795 |
null |
2025-02-07 |
S $^2$ -MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency |
Yuting Zeng et.al. |
2502.04790 |
null |
2025-02-07 |
Probing Internal Representations of Multi-Word Verbs in Large Language Models |
Hassane Kissane et.al. |
2502.04789 |
null |
2025-02-07 |
Enhancing SQL Injection Detection and Prevention Using Generative Models |
Naga Sai Dasari et.al. |
2502.04786 |
null |
2025-02-07 |
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning |
Wanjia Zhao et.al. |
2502.04780 |
link |
2025-02-07 |
SeDi-Instruct: Enhancing Alignment of Language Models through Self-Directed Instruction Generation |
Jungwoo Kim et.al. |
2502.04774 |
null |
2025-02-07 |
Enhancing Phishing Email Identification with Large Language Models |
Catherine Lee et.al. |
2502.04759 |
null |
2025-02-07 |
Concept Navigation and Classification via Open Source Large Language Model Processing |
Maël Kubli et.al. |
2502.04756 |
null |
2025-02-07 |
Every Software as an Agent: Blueprint and Case Study |
Mengwei Xu et.al. |
2502.04747 |
null |
2025-02-07 |
PhyloVAE: Unsupervised Learning of Phylogenetic Trees via Variational Autoencoders |
Tianyu Xie et.al. |
2502.04730 |
link |
2025-02-07 |
Generating Symbolic World Models via Test-time Scaling of Large Language Models |
Zhouliang Yu et.al. |
2502.04728 |
link |
2025-02-07 |
Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics? |
Sourabrata Mukherjee et.al. |
2502.04718 |
null |
2025-02-07 |
Enhancing Impression Change Prediction in Speed Dating Simulations Based on Speakers’ Personalities |
Kazuya Matsuo et.al. |
2502.04706 |
null |
2025-02-07 |
STRIDE: Automating Reward Design, Deep Reinforcement Learning Training and Feedback Optimization in Humanoid Robotics Locomotion |
Zhenwei Wu et.al. |
2502.04692 |
null |
2025-02-07 |
ARR: Question Answering with Large Language Models via Analyzing, Retrieving, and Reasoning |
Yuwei Yin et.al. |
2502.04689 |
link |
2025-02-07 |
M-IFEval: Multilingual Instruction-Following Evaluation |
Antoine Dussolle et.al. |
2502.04688 |
link |
2025-02-07 |
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization |
Zelai Xu et.al. |
2502.04686 |
null |
2025-02-07 |
G2PDiffusion: Genotype-to-Phenotype Prediction with Diffusion Models |
Mengdi Liu et.al. |
2502.04684 |
null |
2025-02-07 |
CALF-SBM: A Covariate-Assisted Latent Factor Stochastic Block Model |
Sydney Louit et.al. |
2502.04681 |
null |
2025-02-07 |
LLM Query Scheduling with Prefix Reuse and Latency Constraints |
Gregory Dexter et.al. |
2502.04677 |
null |
2025-02-07 |
AdParaphrase: Paraphrase Dataset for Analyzing Linguistic Features toward Generating Attractive Ad Texts |
Soichiro Murakami et.al. |
2502.04674 |
link |
2025-02-07 |
Unveiling the Mechanisms of Explicit CoT Training: How Chain-of-Thought Enhances Reasoning Generalization |
Xinhao Yao et.al. |
2502.04667 |
link |
2025-02-07 |
Enhancing Health Information Retrieval with RAG by Prioritizing Topical Relevance and Factual Accuracy |
Rishabh Uapadhyay et.al. |
2502.04666 |
null |
2025-02-07 |
Importance Sampling via Score-based Generative Models |
Heasung Kim et.al. |
2502.04646 |
null |
2025-02-07 |
Agentic Reasoning: Reasoning LLMs with Tools for the Deep Research |
Junde Wu et.al. |
2502.04644 |
link |
2025-02-07 |
Confidence Elicitation: A New Attack Vector for Large Language Models |
Brian Formento et.al. |
2502.04643 |
link |
2025-02-07 |
Contrastive Learning-Enhanced Large Language Models for Monolith-to-Microservice Decomposition |
Khaled Sellami et.al. |
2502.04604 |
null |
2025-02-07 |
Extracting and Understanding the Superficial Knowledge in Alignment |
Runjin Chen et.al. |
2502.04602 |
link |
2025-02-07 |
The $α$ -Alternator: Dynamic Adaptation To Varying Noise Levels In Sequences Using The Vendi Score For Improved Robustness and Performance |
Mohammad Reza Rezaei et.al. |
2502.04593 |
null |
2025-02-07 |
Position-aware Automatic Circuit Discovery |
Tal Haklay et.al. |
2502.04577 |
link |
2025-02-06 |
My LLM might Mimic AAE – But When Should it? |
Sandra C. Sandoval et.al. |
2502.04564 |
link |
2025-02-06 |
Speeding up Speculative Decoding via Approximate Verification |
Meiyu Zhong et.al. |
2502.04557 |
null |
2025-02-06 |
TruthFlow: Truthful LLM Generation via Representation Flow Correction |
Hanyu Wang et.al. |
2502.04556 |
null |
2025-02-06 |
Contextual Gradient Flow Modeling for Large Language Model Generalization in Multi-Scale Feature Spaces |
Daphne Quillington et.al. |
2502.04548 |
null |
2025-02-06 |
Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection |
Minseok Jung et.al. |
2502.04528 |
null |
2025-02-06 |
Safety is Essential for Responsible Open-Ended Systems |
Ivaxi Sheth et.al. |
2502.04512 |
null |
2025-02-06 |
ULPT: Prompt Tuning with Ultra-Low-Dimensional Optimization |
Zijun Wu et.al. |
2502.04501 |
null |
2025-02-06 |
Verifiable Format Control for Large Language Model Generations |
Zhaoyang Wang et.al. |
2502.04498 |
null |
2025-02-06 |
Multi-Agent Reinforcement Learning with Focal Diversity Optimization |
Selim Furkan Tekin et.al. |
2502.04492 |
link |
2025-02-06 |
Building A Unified AI-centric Language System: analysis, framework and future work |
Edward Hong Wang et.al. |
2502.04488 |
null |
2025-02-06 |
Active Task Disambiguation with LLMs |
Katarzyna Kobalczyk et.al. |
2502.04485 |
link |
2025-02-06 |
The ML Supply Chain in the Era of Software 2.0: Lessons Learned from Hugging Face |
Trevor Stalnaker et.al. |
2502.04484 |
null |
2025-02-06 |
Near-Optimal Sample Complexity for MDPs via Anchoring |
Jongmin Lee et.al. |
2502.04477 |
null |
2025-02-06 |
ADIFF: Explaining audio difference using natural language |
Soham Deshmukh et.al. |
2502.04476 |
link |
2025-02-06 |
Augmented Conditioning Is Enough For Effective Training Image Generation |
Jiahui Chen et.al. |
2502.04475 |
null |
2025-02-06 |
Iterative Importance Fine-tuning of Diffusion Models |
Alexander Denker et.al. |
2502.04468 |
null |
2025-02-06 |
FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks |
Luca Della Libera et.al. |
2502.04465 |
null |
2025-02-06 |
Training Language Models to Reason Efficiently |
Daman Arora et.al. |
2502.04463 |
link |
2025-02-06 |
Confident or Seek Stronger: Exploring Uncertainty-Based On-device LLM Routing From Benchmarking to Generalization |
Yu-Neng Chuang et.al. |
2502.04428 |
null |
2025-02-06 |
Decoding AI Judgment: How LLMs Assess News Credibility and Bias |
Edoardo Loru et.al. |
2502.04426 |
null |
2025-02-06 |
EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models |
He Hu et.al. |
2502.04424 |
null |
2025-02-06 |
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment |
Zuyan Liu et.al. |
2502.04328 |
link |
2025-02-06 |
Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness |
Karolina Rudnicka et.al. |
2502.04324 |
null |
2025-02-06 |
Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions |
Yik Siu Chan et.al. |
2502.04322 |
link |
2025-02-06 |
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features |
Alec Helbling et.al. |
2502.04320 |
link |
2025-02-06 |
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views |
Eyvaz Najafli et.al. |
2502.04318 |
null |
2025-02-06 |
ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters |
Kamer Ali Yuksel et.al. |
2502.04315 |
link |
2025-02-06 |
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization |
Yinjie Wang et.al. |
2502.04306 |
link |
2025-02-06 |
MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation |
Jinbo Xing et.al. |
2502.04299 |
null |
2025-02-06 |
Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression |
Lirui Wang et.al. |
2502.04296 |
null |
2025-02-06 |
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization |
Yuanye Liu et.al. |
2502.04295 |
link |
2025-02-06 |
PILAF: Optimal Human Preference Sampling for Reward Modeling |
Yunzhen Feng et.al. |
2502.04270 |
null |
2025-02-06 |
Efficient Randomized Experiments Using Foundation Models |
Piersilvio De Bartolomeis et.al. |
2502.04262 |
link |
2025-02-06 |
Realistic Image-to-Image Machine Unlearning via Decoupling and Knowledge Retention |
Ayush K. Varshney et.al. |
2502.04260 |
null |
2025-02-06 |
MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion |
Xintong Hao et.al. |
2502.04235 |
null |
2025-02-06 |
Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks |
Andreas Happe et.al. |
2502.04227 |
null |
2025-02-06 |
Keep It Light! Simplifying Image Clustering Via Text-Free Adapters |
Yicen Li et.al. |
2502.04226 |
null |
2025-02-06 |
Éclair – Extracting Content and Layout with Integrated Reading Order for Documents |
Ilia Karmanov et.al. |
2502.04223 |
null |
2025-02-06 |
Sports and Women’s Sports: Gender Bias in Text Generation with Olympic Data |
Laura Biester et.al. |
2502.04218 |
null |
2025-02-06 |
Algorithmic causal structure emerging through compression |
Liang Wendong et.al. |
2502.04210 |
null |
2025-02-06 |
“Short-length” Adversarial Training Helps LLMs Defend “Long-length” Jailbreak Attacks: Theoretical and Empirical Evidence |
Shaopeng Fu et.al. |
2502.04204 |
link |
2025-02-06 |
The Best Instruction-Tuning Data are Those That Fit |
Dylan Zhang et.al. |
2502.04194 |
null |
2025-02-06 |
PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models? |
Mennatullah Siam et.al. |
2502.04192 |
link |
2025-02-06 |
Automated Microservice Pattern Instance Detection Using Infrastructure-as-Code Artifacts and Large Language Models |
Carlos Eduardo Duarte et.al. |
2502.04188 |
null |
2025-02-06 |
Multi-agent Architecture Search via Agentic Supernet |
Guibin Zhang et.al. |
2502.04180 |
link |
2025-02-06 |
MRAMG-Bench: A BeyondText Benchmark for Multimodal Retrieval-Augmented Multimodal Generation |
Qinhan Yu et.al. |
2502.04176 |
null |
2025-02-06 |
Diffusion-based mass map reconstruction from weak lensing data |
Supranta S. Boruah et.al. |
2502.04158 |
null |
2025-02-06 |
UltraIF: Advancing Instruction Following from the Wild |
Kaikai An et.al. |
2502.04153 |
link |
2025-02-06 |
The Order Effect: Investigating Prompt Sensitivity in Closed-Source LLMs |
Bryan Guan et.al. |
2502.04134 |
null |
2025-02-06 |
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis |
Zhen Ye et.al. |
2502.04128 |
null |
2025-02-06 |
Generative Adversarial Networks Bridging Art and Machine Intelligence |
Junhao Song et.al. |
2502.04116 |
null |
2025-02-06 |
VTutor: An Open-Source SDK for Generative AI-Powered Animated Pedagogical Agents with Multi-Media Output |
Eason Chen et.al. |
2502.04103 |
null |
2025-02-06 |
LLMs to Support a Domain Specific Knowledge Assistant |
Maria-Flavia Lovin et.al. |
2502.04095 |
null |
2025-02-06 |
AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference |
Qingyue Yang et.al. |
2502.04077 |
link |
2025-02-06 |
Content-Rich AIGC Video Quality Assessment via Intricate Text Alignment and Motion-Aware Consistency |
Shangkun Sun et.al. |
2502.04076 |
link |
2025-02-06 |
Predicting Large Language Model Capabilities on Closed-Book QA Tasks Using Only Information Available Prior to Training |
Changhao Jiang et.al. |
2502.04066 |
null |
2025-02-06 |
TQ-DiT: Efficient Time-Aware Quantization for Diffusion Transformers |
Younghye Hwang et.al. |
2502.04056 |
null |
2025-02-06 |
Exploring Imbalanced Annotations for Effective In-Context Learning |
Hongfu Gao et.al. |
2502.04037 |
null |
2025-02-06 |
Fine, I’ll Merge It Myself: A Multi-Fidelity Framework for Automated Model Merging |
Guinan Su et.al. |
2502.04030 |
null |
2025-02-06 |
Echo-Teddy: Preliminary Design and Development of Large Language Model-based Social Robot for Autistic Students |
Unggi Lee et.al. |
2502.04029 |
null |
2025-02-06 |
Quantification of Biodiversity from Historical Survey Text with LLM-based Best-Worst Scaling |
Thomas Haider et.al. |
2502.04022 |
null |
2025-02-06 |
Automating a Complete Software Test Process Using LLMs: An Automotive Case Study |
Shuai Wang et.al. |
2502.04008 |
null |
2025-02-06 |
CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing |
Yu Yuan et.al. |
2502.03997 |
null |
2025-02-06 |
Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering |
Longquan Jiang et.al. |
2502.03992 |
link |
2025-02-06 |
Tight Bounds on Jensen’s Gap: Novel Approach with Applications in Generative Modeling |
Marcin Mazur et.al. |
2502.03988 |
null |
2025-02-06 |
MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation |
YoonJe Kang et.al. |
2502.03966 |
null |
2025-02-06 |
MAQInstruct: Instruction-based Unified Event Relation Extraction |
Jun Xu et.al. |
2502.03954 |
null |
2025-02-06 |
LR0.FM: Low-Resolution Zero-shot Classification Benchmark For Foundation Models |
Priyank Pathak et.al. |
2502.03950 |
link |
2025-02-06 |
Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond |
Mardhiyah Sanni et.al. |
2502.03945 |
null |
2025-02-06 |
Unravelling Causal Genetic Biomarkers of Alzheimer’s Disease via Neuron to Gene-token Backtracking in Neural Architecture: A Groundbreaking Reverse-Gene-Finder Approach |
Victor OK Li et.al. |
2502.03938 |
null |
2025-02-06 |
Quantifying Correlations of Machine Learning Models |
Yuanyuan Li et.al. |
2502.03937 |
link |
2025-02-06 |
HEP-JEPA: A foundation model for collider physics using joint embedding predictive architecture |
Jai Bardhan et.al. |
2502.03933 |
null |
2025-02-06 |
Experiments with Large Language Models on Retrieval-Augmented Generation for Closed-Source Simulation Software |
Andreas Baumann et.al. |
2502.03916 |
null |
2025-02-06 |
No Free Lunch in Annotation either: An objective evaluation of foundation models for streamlining annotation in animal tracking |
Emil Mededovic et.al. |
2502.03907 |
link |
2025-02-06 |
LeAP: Consistent multi-domain 3D labeling using Foundation Models |
Simon Gebraad et.al. |
2502.03901 |
null |
2025-02-06 |
InfinitePOD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers |
Chenchen Shou et.al. |
2502.03885 |
null |
2025-02-06 |
Rank Also Matters: Hierarchical Configuration for Mixture of Adapter Experts in LLM Fine-Tuning |
Peizhuang Cong et.al. |
2502.03884 |
null |
2025-02-06 |
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation |
Bo Pang et.al. |
2502.03860 |
null |
2025-02-06 |
PAGNet: Pluggable Adaptive Generative Networks for Information Completion in Multi-Agent Communication |
Zhuohui Zhang et.al. |
2502.03845 |
null |
2025-02-06 |
Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis |
Lin Yuan et.al. |
2502.03843 |
null |
2025-02-06 |
FairT2I: Mitigating Social Bias in Text-to-Image Generation via Large Language Model-Assisted Detection and Attribute Rebalancing |
Jinya Sakurai et.al. |
2502.03826 |
null |
2025-02-06 |
Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation |
Tianhao Li et.al. |
2502.03825 |
null |
2025-02-06 |
PsyPlay: Personality-Infused Role-Playing Conversational Agents |
Tao Yang et.al. |
2502.03821 |
null |
2025-02-06 |
Large Language Models for Multi-Robot Systems: A Survey |
Peihan Li et.al. |
2502.03814 |
null |
2025-02-06 |
Identify Critical KV Cache in LLM Inference from an Output Perturbation Perspective |
Yuan Feng et.al. |
2502.03805 |
link |
2025-02-06 |
Understanding and Supporting Formal Email Exchange by Answering AI-Generated Questions |
Yusuke Miura et.al. |
2502.03804 |
null |
2025-02-06 |
Enhancing Hallucination Detection through Noise Injection |
Litian Liu et.al. |
2502.03799 |
null |
2025-02-06 |
Distribution learning via neural differential equations: minimal energy regularization and approximation theory |
Youssef Marzouk et.al. |
2502.03795 |
null |
2025-02-06 |
It’s All in The [MASK]: Simple Instruction-Tuning Enables BERT-like Masked Language Models As Generative Classifiers |
Benjamin Clavié et.al. |
2502.03793 |
null |
2025-02-06 |
Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence |
Jacob Fein-Ashley et.al. |
2502.03787 |
null |
2025-02-06 |
GistVis: Automatic Generation of Word-scale Visualizations from Data-rich Documents |
Ruishi Zou et.al. |
2502.03784 |
link |
2025-02-06 |
Adaptive Semantic Prompt Caching with VectorQ |
Luis Gaspar Schroeder et.al. |
2502.03771 |
null |
2025-02-06 |
Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models |
Meiquan Dong et.al. |
2502.03766 |
null |
2025-02-06 |
Rethinking the Residual Distribution of Locate-then-Editing Methods in Model Editing |
Xiaopeng Li et.al. |
2502.03748 |
null |
2025-02-06 |
Speaking the Language of Teamwork: LLM-Guided Credit Assignment in Multi-Agent Reinforcement Learning |
Muhan Lin et.al. |
2502.03723 |
null |
2025-02-06 |
Boosting Knowledge Graph-based Recommendations through Confidence-Aware Augmentation with Large Language Models |
Rui Cai et.al. |
2502.03715 |
null |
2025-02-06 |
MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers |
Nicole Cho et.al. |
2502.03711 |
null |
2025-02-06 |
Aggregate and conquer: detecting and steering LLM concepts by combining nonlinear predictors over multiple layers |
Daniel Beaglehole et.al. |
2502.03708 |
null |
2025-02-06 |
LLM Alignment as Retriever Optimization: An Information Retrieval Perspective |
Bowen Jin et.al. |
2502.03699 |
null |
2025-02-06 |
A Comparison of DeepSeek and Other LLMs |
Tianchen Gao et.al. |
2502.03688 |
null |
2025-02-06 |
Conditional Diffusion Models are Medical Image Classifiers that Provide Explainability and Uncertainty for Free |
Gian Mario Favero et.al. |
2502.03687 |
null |
2025-02-06 |
Controlled LLM Decoding via Discrete Auto-regressive Biasing |
Patrick Pynadath et.al. |
2502.03685 |
null |
2025-02-05 |
Reflection-Window Decoding: Text Generation with Selective Refinement |
Zeyu Tang et.al. |
2502.03678 |
null |
2025-02-05 |
Advancing Reasoning in Large Language Models: Promising Methods and Approaches |
Avinash Patil et.al. |
2502.03671 |
null |
2025-02-05 |
Unrealized Expectations: Comparing AI Methods vs Classical Algorithms for Maximum Independent Set |
Yikai Wu et.al. |
2502.03669 |
null |
2025-02-05 |
Privacy-Preserving Generative Models: A Comprehensive Survey |
Debalina Padariya et.al. |
2502.03668 |
null |
2025-02-05 |
Context-Preserving Gradient Modulation for Large Language Models: A Novel Approach to Semantic Consistency in Long-Form Text Generation |
Nirola Kobanov et.al. |
2502.03643 |
null |
2025-02-05 |
SymmCD: Symmetry-Preserving Crystal Generation with Diffusion Models |
Daniel Levy et.al. |
2502.03638 |
link |
2025-02-05 |
AdaPhish: AI-Powered Adaptive Defense and Education Resource Against Deceptive Emails |
Rei Meguro et.al. |
2502.03622 |
null |
2025-02-05 |
Bilevel ZOFO: Bridging Parameter-Efficient and Zeroth-Order Techniques for Efficient LLM Fine-Tuning and Meta-Training |
Reza Shirkavand et.al. |
2502.03604 |
null |
2025-02-05 |
HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference |
Zeyu Zhang et.al. |
2502.03589 |
null |
2025-02-05 |
A Mixed-Methods Evaluation of LLM-Based Chatbots for Menopause |
Roshini Deva et.al. |
2502.03579 |
null |
2025-02-05 |
Code Simulation as a Proxy for High-order Tasks in Large Language Models |
Emanuele La Malfa et.al. |
2502.03568 |
null |
2025-02-05 |
Kronecker Mask and Interpretive Prompts are Language-Action Video Learners |
Jingyi Yang et.al. |
2502.03549 |
link |
2025-02-05 |
YINYANG-ALIGN: Benchmarking Contradictory Objectives and Proposing Multi-Objective Optimization based DPO for Text-to-Image Alignment |
Amitava Das et.al. |
2502.03512 |
null |
2025-02-05 |
Do Large Language Model Benchmarks Test Reliability? |
Joshua Vendrow et.al. |
2502.03461 |
link |
2025-02-05 |
Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training |
Boyao Wang et.al. |
2502.03460 |
null |
2025-02-05 |
A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) |
Yiye Chen et.al. |
2502.03450 |
null |
2025-02-05 |
Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics |
Xuan Li et.al. |
2502.03449 |
null |
2025-02-05 |
BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving |
Ran Xin et.al. |
2502.03438 |
null |
2025-02-05 |
Taking a Big Step: Large Learning Rates in Denoising Score Matching Prevent Memorization |
Yu-Han Wu et.al. |
2502.03435 |
null |
2025-02-05 |
On Fairness of Unified Multimodal Large Language Model for Image Generation |
Ming Liu et.al. |
2502.03429 |
null |
2025-02-05 |
Harnessing Large Language Models for Curated Code Reviews |
Oussama Ben Sghaier et.al. |
2502.03425 |
link |
2025-02-05 |
Can Text-to-Image Generative Models Accurately Depict Age? A Comparative Study on Synthetic Portrait Generation and Age Estimation |
Alexey A. Novikov et.al. |
2502.03420 |
null |
2025-02-05 |
Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts |
Nikta Gohari Sadr et.al. |
2502.03418 |
null |
2025-02-05 |
SPRI: Aligning Large Language Models with Context-Situated Principles |
Hongli Zhan et.al. |
2502.03397 |
null |
2025-02-05 |
Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications |
Issar Arab et.al. |
2502.03395 |
null |
2025-02-05 |
LIMO: Less is More for Reasoning |
Yixin Ye et.al. |
2502.03387 |
link |
2025-02-05 |
Transformers and Their Roles as Time Series Foundation Models |
Dennis Wu et.al. |
2502.03383 |
null |
2025-02-05 |
Demystifying Long Chain-of-Thought Reasoning in LLMs |
Edward Yeo et.al. |
2502.03373 |
link |
2025-02-05 |
PalimpChat: Declarative and Interactive AI analytics |
Chunwei Liu et.al. |
2502.03368 |
null |
2025-02-05 |
RadVLM: A Multitask Conversational Vision-Language Model for Radiology |
Nicolas Deperrois et.al. |
2502.03333 |
null |
2025-02-05 |
ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model |
Qiguang Chen et.al. |
2502.03325 |
null |
2025-02-05 |
Out-of-Distribution Detection using Synthetic Data Generation |
Momin Abbas et.al. |
2502.03323 |
null |
2025-02-05 |
Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques |
Sangjun Han et.al. |
2502.03321 |
null |
2025-02-05 |
Intent Representation Learning with Large Language Model for Recommendation |
Yu Wang et.al. |
2502.03307 |
link |
2025-02-05 |
Harmony in Divergence: Towards Fast, Accurate, and Memory-efficient Zeroth-order LLM Fine-tuning |
Qitao Tan et.al. |
2502.03304 |
null |
2025-02-05 |
MeDiSumQA: Patient-Oriented Question-Answer Generation from Discharge Letters |
Amin Dada et.al. |
2502.03298 |
null |
2025-02-05 |
SymAgent: A Neural-Symbolic Self-Learning Agent Framework for Complex Reasoning over Knowledge Graphs |
Ben Liu et.al. |
2502.03283 |
null |
2025-02-05 |
Posterior SBC: Simulation-Based Calibration Checking Conditional on Data |
Teemu Säilynoja et.al. |
2502.03279 |
link |
2025-02-05 |
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning |
DiJia Su et.al. |
2502.03275 |
null |
2025-02-05 |
ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models |
Ying Zhang et.al. |
2502.03266 |
link |
2025-02-05 |
General Time-series Model for Universal Knowledge Representation of Multivariate Time-Series data |
Cheng He et.al. |
2502.03264 |
null |
2025-02-05 |
CARROT: A Cost Aware Rate Optimal Router |
Seamus Somerstep et.al. |
2502.03261 |
null |
2025-02-05 |
RiemannGFM: Learning a Graph Foundation Model from Riemannian Geometry |
Li Sun et.al. |
2502.03251 |
null |
2025-02-05 |
Exploring the Security Threats of Knowledge Base Poisoning in Retrieval-Augmented Code Generation |
Bo Lin et.al. |
2502.03233 |
null |
2025-02-05 |
Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models |
Jialiang Wu et.al. |
2502.03199 |
null |
2025-02-05 |
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding |
Pengyi Li et.al. |
2502.03183 |
null |
2025-02-05 |
PICBench: Benchmarking LLMs for Photonic Integrated Circuits Design |
Yuchao Wu et.al. |
2502.03159 |
null |
2025-02-05 |
Strategizing with AI: Insights from a Beauty Contest Experiment |
Iuliia Alekseenko et.al. |
2502.03158 |
null |
2025-02-05 |
Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models |
Xumeng Wen et.al. |
2502.03147 |
null |
2025-02-05 |
Symmetry-Aware Bayesian Flow Networks for Crystal Generation |
Laura Ruple et.al. |
2502.03146 |
null |
2025-02-05 |
Teaching Large Language Models Number-Focused Headline Generation With Key Element Rationales |
Zhen Qian et.al. |
2502.03129 |
null |
2025-02-05 |
Metis: A Foundation Speech Generation Model with Masked Generative Pre-training |
Yuancheng Wang et.al. |
2502.03128 |
link |
2025-02-05 |
Structured Token Retention and Computational Memory Paths in Large Language Models |
Jonathan Delena et.al. |
2502.03102 |
null |
2025-02-05 |
Reveal the Mystery of DPO: The Connection between DPO and RL Algorithms |
Xuerui Su et.al. |
2502.03095 |
null |
2025-02-05 |
Implementing Large Quantum Boltzmann Machines as Generative AI Models for Dataset Balancing |
Salvatore Sinno et.al. |
2502.03086 |
null |
2025-02-05 |
IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates |
Aissatou Diallo et.al. |
2502.03080 |
null |
2025-02-05 |
Poisson Flow Joint Model for Multiphase contrast-enhanced CT |
Rongjun Ge et.al. |
2502.03079 |
null |
2025-02-05 |
Automatic Prompt Optimization Techniques: Exploring the Potential for Synthetic Data Generation |
Nina Freise et.al. |
2502.03078 |
null |
2025-02-05 |
Optimizing Electric Vehicles Charging using Large Language Models and Graph Neural Networks |
Stavros Orfanoudakis et.al. |
2502.03067 |
null |
2025-02-05 |
Understanding and Enhancing the Transferability of Jailbreaking Attacks |
Runqi Lin et.al. |
2502.03052 |
link |
2025-02-05 |
RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts |
Tuan Truong et.al. |
2502.03044 |
null |
2025-02-05 |
Large Language Models Are Universal Recommendation Learners |
Junguang Jiang et.al. |
2502.03041 |
null |
2025-02-05 |
FuXi- $α$ : Scaling Recommendation Model with Feature Interaction Enhanced Transformer |
Yufei Ye et.al. |
2502.03036 |
null |
2025-02-05 |
Knowledge Distillation from Large Language Models for Household Energy Modeling |
Mohannad Takrouri et.al. |
2502.03034 |
null |
2025-02-05 |
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models |
Daniil Laptev et.al. |
2502.03032 |
null |
2025-02-05 |
Scaling Laws for Upcycling Mixture-of-Experts Language Models |
Seng Pei Liew et.al. |
2502.03009 |
null |
2025-02-05 |
MedBioLM: Optimizing Medical and Biological QA with Fine-Tuned Large Language Models and Retrieval-Augmented Generation |
Seonok Kim et.al. |
2502.03004 |
null |
2025-02-05 |
Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons |
Renjun Hu et.al. |
2502.02988 |
null |
2025-02-05 |
Membership Inference Attack Should Move On to Distributional Statistics for Distilled Generative Models |
Muxing Li et.al. |
2502.02970 |
null |
2025-02-05 |
The Labeled Coupon Collector Problem with Random Sample Sizes and Partial Recovery |
Shoham Shimon Berrebi et.al. |
2502.02968 |
null |
2025-02-05 |
Large Language Model Adversarial Landscape Through the Lens of Attack Objectives |
Nan Wang et.al. |
2502.02960 |
null |
2025-02-05 |
Position: Editing Large Language Models Poses Serious Safety Risks |
Paul Youssef et.al. |
2502.02958 |
null |
2025-02-05 |
Control Search Rankings, Control the World: What is a Good Search Engine? |
Simon Coghlan et.al. |
2502.02957 |
null |
2025-02-05 |
LLM-KT: Aligning Large Language Models with Knowledge Tracing using a Plug-and-Play Instruction |
Ziwei Wang et.al. |
2502.02945 |
null |
2025-02-05 |
Large Language Model Guided Self-Debugging Code Generation |
Muntasir Adnan et.al. |
2502.02928 |
null |
2025-02-05 |
SPARC: Subspace-Aware Prompt Adaptation for Robust Continual Learning in LLMs |
Dinithi Jayasuriya et.al. |
2502.02909 |
null |
2025-02-05 |
AI-driven materials design: a mini-review |
Mouyang Cheng et.al. |
2502.02905 |
null |
2025-02-05 |
A Benchmark for the Detection of Metalinguistic Disagreements between LLMs and Knowledge Graphs |
Bradley P. Allen et.al. |
2502.02896 |
null |
2025-02-05 |
Lowering the Barrier of Machine Learning: Achieving Zero Manual Labeling in Review Classification Using LLMs |
Yejian Zhang et.al. |
2502.02893 |
null |
2025-02-05 |
Expertized Caption Auto-Enhancement for Video-Text Retrieval |
Junxiang Chen et.al. |
2502.02885 |
null |
2025-02-05 |
SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions |
Xiaofan Yu et.al. |
2502.02883 |
null |
2025-02-05 |
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning |
Yibo Yan et.al. |
2502.02871 |
null |
2025-02-05 |
A Systematic Approach for Assessing Large Language Models’ Test Case Generation Capability |
Hung-Fu Chang et.al. |
2502.02866 |
null |
2025-02-05 |
OceanChat: The Effect of Virtual Conversational AI Agents on Sustainable Attitude and Behavior Change |
Pat Pataranutaporn et.al. |
2502.02863 |
null |
2025-02-05 |
A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges |
Lei Ding et.al. |
2502.02835 |
null |
2025-02-05 |
COFFE: A Code Efficiency Benchmark for Code Generation |
Yun Peng et.al. |
2502.02827 |
link |
2025-02-05 |
Accessible and Portable LLM Inference by Compiling Computational Graphs into SQL |
Wenbo Sun et.al. |
2502.02818 |
null |
2025-02-05 |
Mol-LLM: Generalist Molecular LLM with Improved Graph Utilization |
Chanhui Lee et.al. |
2502.02810 |
null |
2025-02-05 |
CAMI: A Counselor Agent Supporting Motivational Interviewing through State Inference and Topic Exploration |
Yizhe Yang et.al. |
2502.02807 |
null |
2025-02-05 |
Leveraging the true depth of LLMs |
Ramón Calvo González et.al. |
2502.02790 |
null |
2025-02-05 |
Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation |
Jingyu Liu et.al. |
2502.02789 |
link |
2025-02-05 |
SimMark: A Robust Sentence-Level Similarity-Based Watermarking Algorithm for Large Language Models |
Amirhossein Dabiriaghdam et.al. |
2502.02787 |
link |
2025-02-04 |
Classroom Simulacra: Building Contextual Student Generative Agents in Online Education for Learning Behavioral Simulation |
Songlin Xu et.al. |
2502.02780 |
link |
2025-02-04 |
3D Foundation AI Model for Generalizable Disease Detection in Head Computed Tomography |
Weicheng Zhu et.al. |
2502.02779 |
null |
2025-02-04 |
Twilight: Adaptive Attention Sparsity with Hierarchical Top- $p$ Pruning |
Chaofan Lin et.al. |
2502.02770 |
null |
2025-02-04 |
LLM-USO: Large Language Model-based Universal Sizing Optimizer |
Karthik Somayaji N. S et.al. |
2502.02764 |
null |
2025-02-04 |
Rethinking Vision Transformer for Object Centric Foundation Models |
Manuel Traub et.al. |
2502.02763 |
null |
2025-02-04 |
Too Noisy To Learn: Enhancing Data Quality for Code Review C |
Chunhua Liu et.al. |
2502.02757 |
null |
2025-02-04 |
PatchPilot: A Stable and Cost-Efficient Agentic Patching Framework |
Hongwei Li et.al. |
2502.02747 |
null |
2025-02-04 |
LLM Bandit: Cost-Efficient LLM Generation via Preference-Conditioned Dynamic Routing |
Yang Li et.al. |
2502.02743 |
null |
2025-02-04 |
RFMedSAM 2: Automatic Prompt Refinement for Enhanced Volumetric Medical Image Segmentation with SAM 2 |
Bin Xie et.al. |
2502.02741 |
null |
2025-02-04 |
SmolLM2: When Smol Goes Big – Data-Centric Training of a Small Language Model |
Loubna Ben Allal et.al. |
2502.02737 |
null |
2025-02-04 |
Peri-LN: Revisiting Layer Normalization in the Transformer Architecture |
Jeonghoon Kim et.al. |
2502.02732 |
null |
2025-02-04 |
Cross-Lingual Transfer for Low-Resource Natural Language Processing |
Iker García-Ferrero et.al. |
2502.02722 |
null |
2025-02-04 |
Astromer 2 |
Cristobal Donoso-Oliva et.al. |
2502.02717 |
null |
2025-02-04 |
A Unified Understanding and Evaluation of Steering Methods |
Shawn Im et.al. |
2502.02716 |
null |
2025-02-04 |
An Analysis of LLM Fine-Tuning and Few-Shot Learning for Flaky Test Detection and Classification |
Riddhi More et.al. |
2502.02715 |
null |
2025-02-04 |
Exploring LLMs Impact on Student-Created User Stories and Acceptance Testing in Software Development |
Allan Brockenbrough et.al. |
2502.02675 |
null |
2025-02-04 |
MedRAX: Medical Reasoning Agent for Chest X-ray |
Adibvafa Fallahpour et.al. |
2502.02673 |
link |
2025-02-04 |
Transformers Boost the Performance of Decision Trees on Tabular Data across Sample Sizes |
Mayuka Jayawardhana et.al. |
2502.02672 |
null |
2025-02-04 |
Machine-learning approaches to accelerating lattice simulations |
Scott Lawrence et.al. |
2502.02670 |
null |
2025-02-04 |
A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI) |
Yan Li et.al. |
2502.02659 |
link |
2025-02-04 |
Introducing the Rhea simulations of Milky-Way-like galaxies I: Effect of gravitational potential on morphology and star formation |
Junia Göller et.al. |
2502.02646 |
null |
2025-02-04 |
COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation |
Xueqing Deng et.al. |
2502.02589 |
null |
2025-02-04 |
Open Materials Generation with Stochastic Interpolants |
Philipp Hoellmer et.al. |
2502.02582 |
null |
2025-02-04 |
A comparison of translation performance between DeepL and Supertext |
Alex Flückiger et.al. |
2502.02577 |
link |
2025-02-04 |
Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement |
Soheil Abbasloo et.al. |
2502.02573 |
null |
2025-02-04 |
Learning the RoPEs: Better 2D and 3D Position Encodings with STRING |
Connor Schenck et.al. |
2502.02562 |
null |
2025-02-04 |
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation |
Junha Lee et.al. |
2502.02548 |
null |
2025-02-04 |
LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World |
Shrikara Arun et.al. |
2502.02539 |
null |
2025-02-04 |
Adaptive Self-improvement LLM Agentic System for ML Library Development |
Genghan Zhang et.al. |
2502.02534 |
link |
2025-02-04 |
Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies |
Han Zhou et.al. |
2502.02533 |
null |
2025-02-04 |
Generative Modeling on Lie Groups via Euclidean Generalized Score Matching |
Marco Bertolini et.al. |
2502.02513 |
null |
2025-02-04 |
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search |
Maohao Shen et.al. |
2502.02508 |
null |
2025-02-04 |
Learning to generate physical ocean states: Towards hybrid climate modeling |
Etienne Meunier et.al. |
2502.02499 |
null |
2025-02-04 |
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization |
Yize Wu et.al. |
2502.02493 |
null |
2025-02-04 |
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study |
Menglong Cui et.al. |
2502.02481 |
null |
2025-02-04 |
Style transfer as data augmentation: evaluating unpaired image-to-image translation models in mammography |
Emir Ahmed et.al. |
2502.02475 |
null |
2025-02-04 |
Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification |
Valentina Vadori et.al. |
2502.02471 |
link |
2025-02-04 |
SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency |
Qianhao Yuan et.al. |
2502.02458 |
null |
2025-02-04 |
Personalization Toolkit: Training Free Personalization of Large Vision Language Models |
Soroush Seifi et.al. |
2502.02452 |
null |
2025-02-04 |
Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study |
Calvin Yixiang Cheng et.al. |
2502.02451 |
link |
2025-02-04 |
Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models |
Haoran Ye et.al. |
2502.02444 |
null |
2025-02-04 |
LLMER: Crafting Interactive Extended Reality Worlds with JSON Data Generated by Large Language Models |
Jiangong Chen et.al. |
2502.02441 |
link |
2025-02-04 |
Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment |
Yaling Shen et.al. |
2502.02438 |
null |
2025-02-04 |
TransformDAS: Mapping Φ-OTDR Signals to Riemannian Manifold for Robust Classification |
Jiaju Kang et.al. |
2502.02428 |
null |
2025-02-04 |
Activation-Informed Merging of Large Language Models |
Amin Heyrani Nobari et.al. |
2502.02421 |
link |
2025-02-04 |
Towards Fast Graph Generation via Autoregressive Noisy Filtration Modeling |
Markus Krimmel et.al. |
2502.02415 |
link |
2025-02-04 |
AI-Powered, But Power-Hungry? Energy Efficiency of LLM-Generated Code |
Lola Solovyeva et.al. |
2502.02412 |
null |
2025-02-04 |
Avoiding spurious sharpness minimization broadens applicability of SAM |
Sidak Pal Singh et.al. |
2502.02407 |
null |
2025-02-04 |
LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models |
Tzu-Tao Chang et.al. |
2502.02406 |
null |
2025-02-04 |
CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning |
Jianfeng Pan et.al. |
2502.02390 |
null |
2025-02-04 |
Hypergraph Link Prediction via Hyperedge Copying |
Xie He et.al. |
2502.02386 |
link |
2025-02-04 |
STAIR: Improving Safety Alignment with Introspective Reasoning |
Yichi Zhang et.al. |
2502.02384 |
link |
2025-02-04 |
Evaluating the Effectiveness of LLMs in Fixing Maintainability Issues in Real-World Projects |
Henrique Nunes et.al. |
2502.02368 |
null |
2025-02-04 |
Field Matching: an Electrostatic Paradigm to Generate and Transfer Data |
Alexander Kolesov et.al. |
2502.02367 |
null |
2025-02-04 |
Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs |
Sagnik Mukherjee et.al. |
2502.02362 |
null |
2025-02-04 |
SHIELD: APT Detection and Intelligent Explanation Using LLM |
Parth Atulbhai Gandhi et.al. |
2502.02342 |
null |
2025-02-04 |
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking |
Jinyang Wu et.al. |
2502.02339 |
null |
2025-02-04 |
ReSpark: Leveraging Previous Data Reports as References to Generate New Reports with LLMs |
Yuan Tian et.al. |
2502.02329 |
null |
2025-02-04 |
Information-Theoretic Proofs for Diffusion Sampling |
Galen Reeves et.al. |
2502.02305 |
null |
2025-02-04 |
Density Ratio Estimation with Conditional Probability Paths |
Hanlin Yu et.al. |
2502.02300 |
null |
2025-02-04 |
Evalita-LLM: Benchmarking Large Language Models on Italian |
Bernardo Magnini et.al. |
2502.02289 |
null |
2025-02-04 |
Adaptive Resource Allocation Optimization Using Large Language Models in Dynamic Wireless Environments |
Hyeonho Noh et.al. |
2502.02287 |
null |
2025-02-04 |
Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation |
Atharva Mangeshkumar Agrawal et.al. |
2502.02249 |
null |
2025-02-04 |
Flatten Graphs as Sequences: Transformers are Scalable Graph Generators |
Dexiong Chen et.al. |
2502.02216 |
null |
2025-02-04 |
When Dimensionality Hurts: The Role of LLM Embedding Compression for Noisy Regression Tasks |
Felix Drinkall et.al. |
2502.02199 |
link |
2025-02-04 |
Large language models in climate and sustainability policy: limits and opportunities |
Francesca Larosa et.al. |
2502.02191 |
null |
2025-02-04 |
ShapeShifter: 3D Variations Using Multiscale and Sparse Point-Voxel Diffusion |
Nissim Maruani et.al. |
2502.02187 |
null |
2025-02-04 |
Generative Kernel Spectral Clustering |
David Winant et.al. |
2502.02185 |
null |
2025-02-04 |
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge |
Daniel Tamayo et.al. |
2502.02173 |
link |
2025-02-04 |
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues |
Rohit Girmaji et.al. |
2502.02172 |
null |
2025-02-04 |
Risk-Aware Driving Scenario Analysis with Large Language Models |
Yuan Gao et.al. |
2502.02145 |
link |
2025-02-04 |
IPO: Iterative Preference Optimization for Text-to-Video Generation |
Xiaomeng Yang et.al. |
2502.02088 |
null |
2025-02-04 |
Position Paper: Building Trust in Synthetic Data for Clinical AI |
Krishan Agyakari Raja Babu et.al. |
2502.02076 |
null |
2025-02-04 |
Rethinking stance detection: A theoretically-informed research agenda for user-level inference using language models |
Prasanta Bhattacharya et.al. |
2502.02074 |
null |
2025-02-04 |
ASCenD-BDS: Adaptable, Stochastic and Context-aware framework for Detection of Bias, Discrimination and Stereotyping |
Rajiv Bahl et.al. |
2502.02072 |
null |
2025-02-04 |
Robust and Secure Code Watermarking for Large Language Models via ML/Crypto Codesign |
Ruisi Zhang et.al. |
2502.02068 |
null |
2025-02-04 |
AdaptBot: Combining LLM with Knowledge Graphs and Human Input for Generic-to-Specific Task Decomposition and Knowledge Refinement |
Shivam Singh et.al. |
2502.02067 |
link |
2025-02-04 |
Anticipate & Act : Integrating LLMs and Classical Planning for Efficient Task Execution in Household Environments |
Raghav Arora et.al. |
2502.02066 |
null |
2025-02-04 |
CASIM: Composite Aware Semantic Injection for Text to Motion Generation |
Che-Jui Chang et.al. |
2502.02063 |
null |
2025-02-04 |
Large Language Models for Recommendation with Deliberative User Preference Alignment |
Yi Fang et.al. |
2502.02061 |
null |
2025-02-04 |
Efficient Domain Adaptation of Multimodal Embeddings using Constrastive Learning |
Georgios Margaritis et.al. |
2502.02048 |
null |
2025-02-04 |
Contextual Memory Reweaving in Large Language Models Using Layered Latent State Reconstruction |
Frederick Dillon et.al. |
2502.02046 |
null |
2025-02-04 |
M2R2: Mixture of Multi-Rate Residuals for Efficient Transformer Inference |
Nikhil Bhendawade et.al. |
2502.02040 |
null |
2025-02-04 |
ContinuouSP: Generative Model for Crystal Structure Prediction with Invariance and Continuity |
Yuji Tone et.al. |
2502.02026 |
link |
2025-02-04 |
From Accidents to Insights: Leveraging Multimodal Data for Scenario-Driven ADS Testing |
Siwei Luo et.al. |
2502.02025 |
null |
2025-02-04 |
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling |
Yi-Chiao Wu et.al. |
2502.02019 |
null |
2025-02-04 |
Multi-Domain Graph Foundation Models: Robust Knowledge Transfer via Topology Alignment |
Shuo Wang et.al. |
2502.02017 |
null |
2025-02-04 |
A Periodic Bayesian Flow for Material Generation |
Hanlin Wu et.al. |
2502.02016 |
link |
2025-02-04 |
Layer by Layer: Uncovering Hidden Representations in Language Models |
Oscar Skean et.al. |
2502.02013 |
null |
2025-02-04 |
LLMSecConfig: An LLM-Based Approach for Fixing Software Container Misconfigurations |
Ziyang Ye et.al. |
2502.02009 |
null |
2025-02-04 |
Reasoning Bias of Next Token Prediction Training |
Pengxiao Lin et.al. |
2502.02007 |
null |
2025-02-04 |
FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024 |
Arnav Grover et.al. |
2502.01992 |
null |
2025-02-04 |
Can LLMs Assist Annotators in Identifying Morality Frames? – Case Study on Vaccination Debate on Social Media |
Tunazzina Islam et.al. |
2502.01991 |
null |
2025-02-04 |
Generative Data Mining with Longtail-Guided Diffusion |
David S. Hayden et.al. |
2502.01980 |
null |
2025-02-04 |
Gradient-Regularized Latent Space Modulation in Large Language Models for Structured Contextual Synthesis |
Derek Yotheringhay et.al. |
2502.01979 |
null |
2025-02-04 |
AutoGUI: Scaling GUI Grounding with Automatic Functionality Annotations from LLMs |
Hongxin Li et.al. |
2502.01977 |
null |
2025-02-04 |
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level Routing |
Wenhao Zheng et.al. |
2502.01976 |
null |
2025-02-04 |
Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning |
Jinlong Pang et.al. |
2502.01968 |
null |
2025-02-04 |
MPIC: Position-Independent Multimodal Context Caching System for Efficient MLLM Serving |
Shiju Zhao et.al. |
2502.01960 |
null |
2025-02-04 |
Local minima of the empirical risk in high dimension: General theorems and convex examples |
Kiana Asgari et.al. |
2502.01953 |
null |
2025-02-04 |
DAMO: Data- and Model-aware Alignment of Multi-modal LLMs |
Jinda Lu et.al. |
2502.01943 |
null |
2025-02-04 |
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? |
Xiang Liu et.al. |
2502.01941 |
null |
2025-02-04 |
Toward a Low-Cost Perception System in Autonomous Vehicles: A Spectrum Learning Approach |
Mohammed Alsakabi et.al. |
2502.01940 |
null |
2025-02-04 |
Distributionally Robust Direct Preference Optimization |
Zaiyan Xu et.al. |
2502.01930 |
null |
2025-02-04 |
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling |
Avery Ma et.al. |
2502.01925 |
null |
2025-02-04 |
LAST SToP For Modeling Asynchronous Time Series |
Shubham Gupta et.al. |
2502.01922 |
null |
2025-02-04 |
Anomaly Detection via Autoencoder Composite Features and NCE |
Yalin Liao et.al. |
2502.01920 |
null |
2025-02-04 |
Unlocking Efficient Large Inference Models: One-Bit Unrolling Tips the Scales |
Arian Eamaz et.al. |
2502.01908 |
null |
2025-02-04 |
Rethinking Homogeneity of Vision and Text Tokens in Large Vision-and-Language Models |
Chia-Wen Kuo et.al. |
2502.01906 |
null |
2025-02-04 |
Conceptual Metaphor Theory as a Prompting Paradigm for Large Language Models |
Oliver Kramer et.al. |
2502.01901 |
null |
2025-02-03 |
Latent Lexical Projection in Large Language Models: A Novel Approach to Implicit Representation Refinement |
Ziad Shaker et.al. |
2502.01882 |
null |
2025-02-03 |
SE Arena: Benchmarking Software Engineering Chatbots with Iterative Interactions |
Zhimin Zhao et.al. |
2502.01860 |
null |
2025-02-03 |
Security and Quality in LLM-Generated Code: A Multi-Language, Multi-Model Analysis |
Mohammed Kharma et.al. |
2502.01853 |
null |
2025-02-03 |
Foundation Model-Based Apple Ripeness and Size Estimation for Selective Harvesting |
Keyi Zhu et.al. |
2502.01850 |
link |
2025-02-03 |
Relatively-Secure LLM-Based Steganography via Constrained Markov Decision Processes |
Yu-Shin Huang et.al. |
2502.01827 |
link |
2025-02-03 |
Agentic Bug Reproduction for Effective Automated Program Repair at Google |
Runxiang Cheng et.al. |
2502.01821 |
null |
2025-02-03 |
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning |
Hanyang Zhao et.al. |
2502.01819 |
null |
2025-02-03 |
SelfCheckAgent: Zero-Resource Hallucination Detection in Generative Large Language Models |
Diyana Muhammed et.al. |
2502.01812 |
null |
2025-02-03 |
Toward Neurosymbolic Program Comprehension |
Alejandro Velasco et.al. |
2502.01806 |
null |
2025-02-03 |
Discovering Chunks in Neural Embeddings for Interpretability |
Shuchen Wu et.al. |
2502.01803 |
null |
2025-02-03 |
Harmful Terms and Where to Find Them: Measuring and Modeling Unfavorable Financial Terms and Conditions in Shopping Websites at Scale |
Elisa Tsai et.al. |
2502.01798 |
link |
2025-01-31 |
Vintix: Action Model via In-Context Reinforcement Learning |
Andrey Polubarov et.al. |
2501.19400 |
link |
2025-01-31 |
Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game |
Mustafa O. Karabag et.al. |
2501.19398 |
link |
2025-01-31 |
Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models |
Alina Shutova et.al. |
2501.19392 |
link |
2025-01-31 |
Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models |
Wenzhi Fang et.al. |
2501.19389 |
link |
2025-02-03 |
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions |
Dominik Wagner et.al. |
2501.19377 |
null |
2025-01-31 |
Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions |
Sören Christensen et.al. |
2501.19373 |
null |
2025-01-31 |
We’re Different, We’re the Same: Creative Homogeneity Across LLMs |
Emily Wenger et.al. |
2501.19361 |
null |
2025-01-31 |
Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies |
Brandon P. Chelstrom et.al. |
2501.19359 |
null |
2025-01-31 |
The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking |
Yuchun Miao et.al. |
2501.19358 |
null |
2025-01-31 |
Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters |
Adrián Juan-Delgado et.al. |
2501.19356 |
null |
2025-01-31 |
Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 |
Ting-Yao E. Hsu et.al. |
2501.19353 |
null |
2025-01-31 |
Towards Adaptive Self-Improvement for Smarter Energy Systems |
Alexander Sommer et.al. |
2501.19340 |
null |
2025-01-31 |
PixelWorld: Towards Perceiving Everything as Pixels |
Zhiheng Lyu et.al. |
2501.19339 |
null |
2025-01-31 |
Homogeneity Bias as Differential Sampling Uncertainty in Language Models |
Messi H. J. Lee et.al. |
2501.19337 |
null |
2025-01-31 |
Reward-Guided Speculative Decoding for Efficient LLM Reasoning |
Baohao Liao et.al. |
2501.19324 |
null |
2025-01-31 |
MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems |
Anirudh Chari et.al. |
2501.19318 |
null |
2025-01-31 |
LLM-based Affective Text Generation Quality Based on Different Quantization Values |
Yarik Menchaca Resendiz et.al. |
2501.19317 |
null |
2025-01-31 |
Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment |
Gregor Bachmann et.al. |
2501.19309 |
null |
2025-02-03 |
SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling |
Jiefeng Chen et.al. |
2501.19306 |
null |
2025-01-31 |
Beyond checkmate: exploring the creative chokepoints in AI text |
Nafis Irtiza Tripto et.al. |
2501.19301 |
link |
2025-01-31 |
Offline Learning for Combinatorial Multi-armed Bandits |
Xutong Liu et.al. |
2501.19300 |
null |
2025-01-31 |
Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes |
Zhiyao Xu et.al. |
2501.19298 |
null |
2025-01-31 |
Analysis of LLMs vs Human Experts in Requirements Engineering |
Cory Hymel et.al. |
2501.19297 |
null |
2025-01-31 |
Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators |
Kunpeng Zhang et.al. |
2501.19282 |
null |
2025-01-31 |
Pheromone-based Learning of Optimal Reasoning Paths |
Anirudh Chari et.al. |
2501.19278 |
null |
2025-01-31 |
From Assistance to Autonomy – A Researcher Study on the Potential of AI Support for Qualitative Data Analysis |
Elisabeth Kirsten et.al. |
2501.19275 |
null |
2025-01-31 |
Jackpot! Alignment as a Maximal Lottery |
Roberto-Rafael Maura-Rivero et.al. |
2501.19266 |
null |
2025-01-31 |
Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge |
Amogh Joshi et.al. |
2501.19259 |
null |
2025-01-31 |
A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation |
Yunzhe Li et.al. |
2501.19232 |
null |
2025-01-31 |
Autonomous Legacy Web Application Upgrades Using a Multi-Agent System |
Valtteri Ala-Salmi et.al. |
2501.19204 |
link |
2025-02-03 |
Improving the Robustness of Representation Misdirection for Large Language Model Unlearning |
Dang Huu-Tien et.al. |
2501.19202 |
link |
2025-01-31 |
Efficient Reasoning with Hidden Thinking |
Xuan Shen et.al. |
2501.19201 |
link |
2025-01-31 |
Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning |
Xianglin Yang et.al. |
2501.19180 |
null |
2025-01-31 |
No Foundations without Foundations – Why semi-mechanistic models are essential for regulatory biology |
Luka Kovačević et.al. |
2501.19178 |
null |
2025-01-31 |
Position: Contextual Integrity Washing for Language Models |
Yan Shvartzshnaider et.al. |
2501.19173 |
null |
2025-01-31 |
Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs |
Kejia Zhang et.al. |
2501.19164 |
null |
2025-01-31 |
A theoretical framework for overfitting in energy-based modeling |
Giovanni Catania et.al. |
2501.19158 |
null |
2025-01-31 |
A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator |
Sixiao Huang et.al. |
2501.19135 |
null |
2025-01-31 |
Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations |
Sihwan Park et.al. |
2501.19099 |
null |
2025-01-31 |
Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data |
Xichen Xu et.al. |
2501.19094 |
null |
2025-01-31 |
Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models |
Jialin Zhao et.al. |
2501.19090 |
null |
2025-01-31 |
Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification |
Xiangyu Sun et.al. |
2501.19086 |
null |
2025-01-31 |
Enhancing Code Generation for Low-Resource Languages: No Silver Bullet |
Alessandro Giagnorio et.al. |
2501.19085 |
null |
2025-01-31 |
Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations |
Dahye Kim et.al. |
2501.19066 |
link |
2025-01-31 |
TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs |
Yan Sun et.al. |
2501.19057 |
null |
2025-01-31 |
Enabling Autonomic Microservice Management through Self-Learning Agents |
Fenglin Yu et.al. |
2501.19056 |
null |
2025-01-31 |
Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models |
Ruiyu Wang et.al. |
2501.19054 |
null |
2025-01-31 |
Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors |
Simon Idoko et.al. |
2501.19042 |
link |
2025-01-31 |
Towards the Worst-case Robustness of Large Language Models |
Huanran Chen et.al. |
2501.19040 |
null |
2025-01-31 |
Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs |
Hongliang Li et.al. |
2501.19036 |
null |
2025-01-31 |
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses |
Bo Lan et.al. |
2501.19034 |
link |
2025-01-31 |
Multilayer Networks in Neuroimaging |
Vesna Vuksanovic et.al. |
2501.19024 |
null |
2025-01-31 |
Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation |
Bin Zhu et.al. |
2501.19017 |
null |
2025-01-31 |
Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities |
Arjun Krishna et.al. |
2501.19012 |
null |
2025-01-31 |
Visual Autoregressive Modeling for Image Super-Resolution |
Yunpeng Qu et.al. |
2501.18993 |
link |
2025-01-31 |
Symmetric Pruning of Large Language Models |
Kai Yi et.al. |
2501.18980 |
null |
2025-01-31 |
BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics |
Yuxuan Liu et.al. |
2501.18972 |
null |
2025-01-31 |
Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping |
Pu Yang et.al. |
2501.18962 |
link |
2025-01-31 |
Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow |
Alfred Bexley et.al. |
2501.18957 |
null |
2025-01-31 |
LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models |
Shenghao Fu et.al. |
2501.18954 |
link |
2025-01-31 |
TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment |
Zi-Jian Cheng et.al. |
2501.18935 |
link |
2025-01-31 |
Language Games as the Pathway to Artificial Superhuman Intelligence |
Ying Wen et.al. |
2501.18924 |
null |
2025-01-31 |
KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search |
Haoran Luo et.al. |
2501.18922 |
link |
2025-01-31 |
LLM Program Optimization via Retrieval Augmented Search |
Sagnik Anupam et.al. |
2501.18916 |
null |
2025-01-31 |
Scaling Laws for Differentially Private Language Models |
Ryan McKenna et.al. |
2501.18914 |
null |
2025-01-31 |
Streamlining Security Vulnerability Triage with Large Language Models |
Mohammad Jalili Torkamani et.al. |
2501.18908 |
null |
2025-01-31 |
Trustworthy Evaluation of Generative AI Models |
Zijun Gao et.al. |
2501.18897 |
null |
2025-01-31 |
Can We Predict the Effect of Prompts? |
Jae Yong Lee et.al. |
2501.18883 |
null |
2025-01-31 |
Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models |
Jiaqi Tang et.al. |
2501.18863 |
null |
2025-01-31 |
BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning |
Han Zhong et.al. |
2501.18858 |
null |
2025-01-31 |
Equivariant Hypergraph Diffusion for Crystal Structure Prediction |
Yang Liu et.al. |
2501.18850 |
null |
2025-01-31 |
Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities |
Yaping Chai et.al. |
2501.18845 |
null |
2025-01-31 |
Trading Inference-Time Compute for Adversarial Robustness |
Wojciech Zaremba et.al. |
2501.18841 |
null |
2025-01-31 |
Partially Rewriting a Transformer in Natural Language |
Gonçalo Paulo et.al. |
2501.18838 |
link |
2025-01-31 |
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming |
Mrinank Sharma et.al. |
2501.18837 |
null |
2025-01-31 |
Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential |
Chenyu Gao et.al. |
2501.18834 |
null |
2025-01-31 |
Structural Embedding Projection for Contextual Large Language Model Inference |
Vincent Enoasmo et.al. |
2501.18826 |
null |
2025-01-31 |
Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies |
Andrey Borro et.al. |
2501.18817 |
link |
2025-01-31 |
Large Language Models as Common-Sense Heuristics |
Andrey Borro et.al. |
2501.18816 |
null |
2025-01-30 |
Compositional Generalization Requires More Than Disentangled Representations |
Qiyao Liang et.al. |
2501.18797 |
null |
2025-01-30 |
Rope to Nope and Back Again: A New Hybrid Attention Strategy |
Bowen Yang et.al. |
2501.18795 |
null |
2025-01-30 |
Survey and Improvement Strategies for Gene Prioritization with Large Language Models |
Matthew Neeley et.al. |
2501.18794 |
null |
2025-01-30 |
LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore? |
Alexander Tuisov et.al. |
2501.18784 |
null |
2025-01-30 |
Navigating the Fragrance space Via Graph Generative Models And Predicting Odors |
Mrityunjay Sharma et.al. |
2501.18777 |
link |
2025-01-30 |
Probabilistic Joint Recovery Method for CO $_2$ Plume Monitoring |
Zijun Deng et.al. |
2501.18761 |
null |
2025-01-30 |
Synthetic Data Generation for Augmenting Small Samples |
Dan Liu et.al. |
2501.18741 |
null |
2025-01-30 |
Examining the Robustness of Large Language Models across Language Complexity |
Jiayi Zhang et.al. |
2501.18738 |
null |
2025-01-30 |
Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks |
Mohd. Farhan Israk Soumik et.al. |
2501.18727 |
null |
2025-01-30 |
Strong and Controllable 3D Motion Generation |
Canxuan Gang et.al. |
2501.18726 |
null |
2025-01-30 |
Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning |
Maya Kruse et.al. |
2501.18724 |
null |
2025-02-03 |
Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps |
Devansh Bhardwaj et.al. |
2501.18712 |
null |
2025-01-30 |
Regularized second-order optimization of tensor-network Born machines |
Matan Ben-Dov et.al. |
2501.18691 |
null |
2025-01-30 |
Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting |
Yansong Qu et.al. |
2501.18672 |
null |
2025-01-30 |
Foundational Models for 3D Point Clouds: A Survey and Outlook |
Vishal Thengane et.al. |
2501.18594 |
null |
2025-01-30 |
Diffusion Autoencoders are Scalable Image Tokenizers |
Yinbo Chen et.al. |
2501.18593 |
null |
2025-02-03 |
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models |
Hao Dong et.al. |
2501.18592 |
link |
2025-01-30 |
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs |
Yue Wang et.al. |
2501.18585 |
null |
2025-01-30 |
Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH |
Evgenii Evstafev et.al. |
2501.18576 |
null |
2025-01-30 |
BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos |
Lehao Lin et.al. |
2501.18565 |
null |
2025-01-30 |
SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation |
Haoquan Fang et.al. |
2501.18564 |
link |
2025-01-30 |
Semantic Web and Creative AI – A Technical Report from ISWS 2023 |
Raia Abu Ahmad et.al. |
2501.18542 |
null |
2025-01-30 |
Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges |
Manveer Singh Tamber et.al. |
2501.18536 |
link |
2025-01-30 |
Differentially Private Steering for Large Language Model Alignment |
Anmol Goel et.al. |
2501.18532 |
link |
2025-01-30 |
Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models |
Guanqun Cao et.al. |
2501.18516 |
null |
2025-01-30 |
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch |
Arthur Douillard et.al. |
2501.18512 |
null |
2025-01-30 |
WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training |
Benjamin Feuer et.al. |
2501.18511 |
link |
2025-01-30 |
CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction |
Peter J. Bentley et.al. |
2501.18504 |
null |
2025-01-30 |
Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline |
Shivani Kapania et.al. |
2501.18493 |
null |
2025-01-30 |
A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models |
Changshu Liu et.al. |
2501.18482 |
null |
2025-01-30 |
CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization |
Yanxia Deng et.al. |
2501.18475 |
null |
2025-01-30 |
Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations |
Chengxi Zeng et.al. |
2501.18474 |
null |
2025-01-30 |
ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation |
Minghua He et.al. |
2501.18460 |
null |
2025-01-30 |
CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering |
Yumeng Wang et.al. |
2501.18457 |
null |
2025-01-30 |
GENIE: Generative Note Information Extraction model for structuring EHR data |
Huaiyuan Ying et.al. |
2501.18435 |
null |
2025-01-30 |
Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation |
Youngjoon Lee et.al. |
2501.18416 |
null |
2025-01-30 |
RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects |
Yiteng Tu et.al. |
2501.18365 |
link |
2025-01-30 |
A Video-grounded Dialogue Dataset and Metric for Event-driven Activities |
Wiradee Imrattanatrai et.al. |
2501.18324 |
link |
2025-01-30 |
Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach |
Tianpeng Pan et.al. |
2501.18320 |
null |
2025-01-30 |
Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models |
Jennifer D’Souza et.al. |
2501.18287 |
null |
2025-01-30 |
Jailbreaking LLMs’ Safeguard with Universal Magic Words for Text Embedding Models |
Haoyu Liang et.al. |
2501.18280 |
null |
2025-01-30 |
Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence |
Kevin Roitero et.al. |
2501.18265 |
null |
2025-01-30 |
How to Select Datapoints for Efficient Human Evaluation of NLG Models? |
Vilém Zouhar et.al. |
2501.18251 |
link |
2025-01-30 |
Statistical multi-metric evaluation and visualization of LLM system predictive performance |
Samuel Ackerman et.al. |
2501.18243 |
null |
2025-01-30 |
Contextually Structured Token Dependency Encoding for Large Language Models |
James Blades et.al. |
2501.18205 |
null |
2025-01-30 |
Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents |
ShuiDe Wen et.al. |
2501.18190 |
null |
2025-01-30 |
Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation |
Teddy Lazebnik et.al. |
2501.18177 |
null |
2025-01-30 |
Continually Evolved Multimodal Foundation Models for Cancer Prognosis |
Jie Peng et.al. |
2501.18170 |
null |
2025-01-30 |
RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing |
Jinyao Guo et.al. |
2501.18160 |
null |
2025-01-30 |
Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study |
Yuchen Lei et.al. |
2501.18158 |
null |
2025-01-30 |
Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models |
Wanlong Liu et.al. |
2501.18154 |
null |
2025-01-30 |
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models |
Qika Lin et.al. |
2501.18119 |
null |
2025-01-30 |
Scaling Inference-Efficient Language Models |
Song Bian et.al. |
2501.18107 |
null |
2025-01-30 |
Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation |
Yibo Wang et.al. |
2501.18100 |
link |
2025-01-30 |
AlphaAdam:Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates |
Da Chang et.al. |
2501.18094 |
null |
2025-01-30 |
Normative Evaluation of Large Language Models with Everyday Moral Dilemmas |
Pratik S. Sachdeva et.al. |
2501.18081 |
null |
2025-01-30 |
FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models |
Spencer Mateega et.al. |
2501.18062 |
null |
2025-01-29 |
RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems |
Duy A. Nguyen et.al. |
2501.18056 |
null |
2025-01-29 |
Current Pathology Foundation Models are unrobust to Medical Center Differences |
Edwin D. de Jong et.al. |
2501.18055 |
null |
2025-01-29 |
A Proximal Operator for Inducing 2:4-Sparsity |
Jonas M Kübler et.al. |
2501.18015 |
null |
2025-01-29 |
Large Language Models Think Too Fast To Explore Effectively |
Lan Pan et.al. |
2501.18009 |
null |
2025-01-29 |
Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces |
Neetha Jambigi et.al. |
2501.18005 |
null |
2025-01-29 |
InnerThoughts: Disentangling Representations and Predictions in Large Language Models |
Didier Chételat et.al. |
2501.17994 |
null |
2025-01-29 |
Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study |
Marwah Alaofi et.al. |
2501.17981 |
link |
2025-01-29 |
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization |
Zishun Yu et.al. |
2501.17974 |
null |
2025-01-29 |
“I Would Never Trust Anything Western”: Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools |
Manas Mhasakar et.al. |
2501.17942 |
null |
2025-01-29 |
DReSS: Data-driven Regularized Structured Streamlining for Large Language Models |
Mingkuan Feng et.al. |
2501.17905 |
null |
2025-01-29 |
Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs’ Domain-Specific Insight Learning? |
Pouya Pezeshkpour et.al. |
2501.17840 |
link |
2025-01-29 |
Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology |
Sobhan Hemati et.al. |
2501.17822 |
null |
2025-01-30 |
Leveraging Multimodal LLM for Inspirational User Interface Search |
Seokhyeon Park et.al. |
2501.17799 |
link |
2025-01-29 |
BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation – Challenges and Insights |
Chan-Jan Hsu et.al. |
2501.17790 |
null |
2025-01-29 |
AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing |
Peter Pak et.al. |
2501.17784 |
null |
2025-01-29 |
2SSP: A Two-Stage Framework for Structured Pruning of LLMs |
Fabrizio Sandri et.al. |
2501.17771 |
link |
2025-01-29 |
Generative Unordered Flow for Set-Structured Data Generation |
Yangming Li et.al. |
2501.17770 |
null |
2025-01-29 |
Hybrid Graphs for Table-and-Text based Question Answering using LLMs |
Ankush Agarwal et.al. |
2501.17767 |
null |
2025-01-29 |
On the Partitioning of GPU Power among Multi-Instances |
Tirth Vamja et.al. |
2501.17752 |
null |
2025-01-29 |
Early External Safety Testing of OpenAI’s o3-mini: Insights from the Pre-Deployment Evaluation |
Aitor Arrieta et.al. |
2501.17749 |
null |
2025-01-29 |
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches |
Ana R. Baião et.al. |
2501.17729 |
null |
2025-01-29 |
Using Code Generation to Solve Open Instances of Combinatorial Design Problems |
Christopher D. Rosin et.al. |
2501.17725 |
link |
2025-01-29 |
RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts |
Eujeong Choi et.al. |
2501.17715 |
link |
2025-01-29 |
Source-Channel Separation Theorems for Distortion Perception Coding |
Chao Tian et.al. |
2501.17706 |
null |
2025-01-29 |
Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching |
Xuzhe Dang et.al. |
2501.17665 |
null |
2025-01-30 |
In-Context Meta LoRA Generation |
Yihua Shao et.al. |
2501.17635 |
null |
2025-01-29 |
Uncertainty Quantification and Decomposition for LLM-based Recommendation |
Wonbin Kweon et.al. |
2501.17630 |
link |
2025-01-29 |
The Imitation Game According To Turing |
Sharon Temtsin et.al. |
2501.17629 |
null |
2025-01-29 |
Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment |
Jonathan Teel et.al. |
2501.17617 |
null |
2025-01-29 |
Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis |
Kunrong Li et.al. |
2501.17598 |
null |
2025-01-30 |
Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models |
Behraj Khan et.al. |
2501.17595 |
null |
2025-01-29 |
GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback |
Mohamed Abdelaal et.al. |
2501.17584 |
null |
2025-01-29 |
CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs |
Amey Hengle et.al. |
2501.17581 |
null |
2025-01-29 |
Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding |
Marco Pasini et.al. |
2501.17578 |
null |
2025-01-29 |
Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models |
Wooyoung Kim et.al. |
2501.17549 |
null |
2025-01-29 |
Towards Training-Free Open-World Classification with 3D Generative Models |
Xinzhe Xia et.al. |
2501.17547 |
null |
2025-01-29 |
Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant |
Gaole He et.al. |
2501.17546 |
link |
2025-01-29 |
Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison |
Martin Nizon-Deladoeuille et.al. |
2501.17539 |
null |
2025-01-29 |
Neural Spelling: A Spell-Based BCI System for Language Neural Decoding |
Xiaowei Jiang et.al. |
2501.17489 |
null |
2025-01-29 |
DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance |
Seffi Cohen et.al. |
2501.17479 |
link |
2025-01-29 |
AugmenTest: Enhancing Tests with LLM-Driven Oracles |
Shaker Mahmud Khandaker et.al. |
2501.17461 |
null |
2025-01-29 |
Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction |
Kaiwei Luo et.al. |
2501.17459 |
null |
2025-01-29 |
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation |
Tiansheng Huang et.al. |
2501.17433 |
link |
2025-01-29 |
Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models |
Yuxuan Li et.al. |
2501.17420 |
null |
2025-01-29 |
MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs |
Ved Sirdeshmukh et.al. |
2501.17399 |
link |
2025-01-29 |
Learning Free Token Reduction for Multi-Modal LLM |
Zihui Zhao et.al. |
2501.17391 |
null |
2025-01-29 |
Context-Aware Semantic Recomposition Mechanism for Large Language Models |
Richard Katrix et.al. |
2501.17386 |
null |
2025-01-28 |
Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations |
Md Tauhidul Islam et.al. |
2501.17347 |
null |
2025-01-28 |
Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction |
Mingyu Derek Ma et.al. |
2501.17326 |
null |
2025-01-28 |
CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data |
Lee Carlin et.al. |
2501.17324 |
null |
2025-01-30 |
Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding |
Yun-Shiuan Chuang et.al. |
2501.17310 |
null |
2025-01-28 |
“Ownership, Not Just Happy Talk”: Co-Designing a Participatory Large Language Model for Journalism |
Emily Tseng et.al. |
2501.17299 |
null |
2025-01-28 |
Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization |
Zilu Tang et.al. |
2501.17295 |
null |
2025-01-28 |
Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology |
Peilong Wang et.al. |
2501.17286 |
null |
2025-01-30 |
From Natural Language to Extensive-Form Game Representations |
Shilong Deng et.al. |
2501.17282 |
link |
2025-01-28 |
Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models |
Abdalaziz Al-Maeeni et.al. |
2501.17279 |
null |
2025-01-28 |
Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics |
Jasper Timm et.al. |
2501.17273 |
link |
2025-01-28 |
Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care |
Fengpei Yuan et.al. |
2501.17206 |
null |
2025-01-28 |
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training |
Tianzhe Chu et.al. |
2501.17161 |
null |
2025-01-28 |
FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data |
Deren Lei et.al. |
2501.17144 |
link |
2025-01-28 |
ASTRAL: Automated Safety Testing of Large Language Models |
Miriam Ugarte et.al. |
2501.17132 |
null |
2025-01-28 |
Optimizing Large Language Model Training Using FP4 Quantization |
Ruizhe Wang et.al. |
2501.17116 |
null |
2025-01-28 |
Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction |
Carl-Leander Henneking et.al. |
2501.17112 |
null |
2025-01-28 |
Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics |
Guillaume Le Mailloux et.al. |
2501.17107 |
link |
2025-01-28 |
Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving |
Evgenii Evstafev et.al. |
2501.17084 |
null |
2025-01-28 |
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding |
Akash Kumar et.al. |
2501.17053 |
null |
2025-01-28 |
Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models |
Minghan Li et.al. |
2501.17039 |
null |
2025-01-28 |
Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies |
Manojkumar Parmar et.al. |
2501.17030 |
null |
2025-01-28 |
Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs |
Alessandro Midolo et.al. |
2501.17024 |
link |
2025-01-28 |
Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement |
Kei Katsumata et.al. |
2501.17022 |
link |
2025-01-28 |
MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition |
Philippe Pasquier et.al. |
2501.17011 |
null |
2025-01-28 |
Large Language Models for Code Generation: The Practitioners Perspective |
Zeeshan Rasheed et.al. |
2501.16998 |
link |
2025-01-28 |
Artificial Intelligence Clones |
Annie Liang et.al. |
2501.16996 |
null |
2025-01-28 |
FedEFM: Federated Endovascular Foundation Model with Unseen Data |
Tuong Do et.al. |
2501.16992 |
null |
2025-01-28 |
Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver |
Shunya Minami et.al. |
2501.16986 |
null |
2025-01-28 |
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling |
Hongzhi Huang et.al. |
2501.16975 |
null |
2025-01-28 |
Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers |
Mohammad Raza et.al. |
2501.16961 |
null |
2025-01-28 |
Multiple Abstraction Level Retrieve Augment Generation |
Zheng Zheng et.al. |
2501.16952 |
null |
2025-01-29 |
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models |
Makoto Shing et.al. |
2501.16937 |
null |
2025-01-28 |
Detecting harassment and defamation in cyberbullying with emotion-adaptive training |
Peiling Yi et.al. |
2501.16925 |
link |
2025-01-28 |
RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains |
Shady Nasrat et.al. |
2501.16899 |
link |
2025-01-28 |
Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model |
Eoghan Cronin et.al. |
2501.16893 |
link |
2025-01-28 |
Irony Detection, Reasoning and Understanding in Zero-shot Learning |
Peiling Yi et.al. |
2501.16884 |
null |
2025-01-28 |
Comparing Human and LLM Generated Code: The Jury is Still Out! |
Sherlock A. Licorish et.al. |
2501.16857 |
null |
2025-01-28 |
Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis |
Tiao Tan et.al. |
2501.16842 |
null |
2025-01-28 |
Misspellings in Natural Language Processing: A survey |
Gianluca Sperduti et.al. |
2501.16836 |
null |
2025-01-28 |
DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model |
Josua Spisak et.al. |
2501.16800 |
null |
2025-01-28 |
Algorithm for Automatic Legislative Text Consolidation |
Matias Etcheverry et.al. |
2501.16794 |
null |
2025-01-28 |
Exponential Family Attention |
Kevin Christian Wibisono et.al. |
2501.16790 |
link |
2025-01-28 |
Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding |
Yun Li et.al. |
2501.16786 |
null |
2025-01-28 |
TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network |
Yumingzhi Pan et.al. |
2501.16784 |
null |
2025-01-28 |
A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process |
Jack David Carson et.al. |
2501.16783 |
null |
2025-01-29 |
Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models |
Muhammad Atta ur Rahman et.al. |
2501.16769 |
null |
2025-01-28 |
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation |
Chenguo Lin et.al. |
2501.16764 |
null |
2025-01-28 |
HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns |
Xinyue Shen et.al. |
2501.16750 |
link |
2025-01-28 |
Through the Prism of Culture: Evaluating LLMs’ Understanding of Indian Subcultures and Traditions |
Garima Chhikara et.al. |
2501.16748 |
null |
2025-01-28 |
LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience |
Nimesh Jha et.al. |
2501.16744 |
null |
2025-01-28 |
Distilling Large Language Models for Network Active Queue Management |
Deol Satish et.al. |
2501.16734 |
null |
2025-01-28 |
xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking |
Sunbowen Lee et.al. |
2501.16727 |
link |
2025-01-28 |
One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning |
Chunpeng Zhou et.al. |
2501.16720 |
null |
2025-01-28 |
Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection |
Hengzhuang Li et.al. |
2501.16718 |
link |
2025-01-28 |
3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow |
Yueen Ma et.al. |
2501.16698 |
null |
2025-01-28 |
MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark |
Dongyi Yi et.al. |
2501.16688 |
null |
2025-01-28 |
Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting |
Li Yin et.al. |
2501.16673 |
link |
2025-01-28 |
VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records |
Philip Chung et.al. |
2501.16672 |
link |
2025-01-28 |
Contextual Reinforcement in Multimodal Token Compression for Large Language Models |
Naderdel Piero et.al. |
2501.16658 |
null |
2025-01-28 |
Large Language Model Critics for Execution-Free Evaluation of Code Changes |
Aashish Yadavally et.al. |
2501.16655 |
link |
2025-01-28 |
Molecular-driven Foundation Model for Oncologic Pathology |
Anurag Vaidya et.al. |
2501.16652 |
link |
2025-01-28 |
DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models |
Zeping Min et.al. |
2501.16650 |
null |
2025-01-28 |
An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue |
Koji Inoue et.al. |
2501.16643 |
null |
2025-01-28 |
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs |
Jinlan Fu et.al. |
2501.16629 |
link |
2025-01-28 |
Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems |
Baraa Hikal et.al. |
2501.16616 |
null |
2025-01-28 |
Sparse Autoencoders Trained on the Same Data Learn Different Features |
Gonçalo Paulo et.al. |
2501.16615 |
null |
2025-01-28 |
Fine-Tuned Language Models as Space Systems Controllers |
Enrico M. Zucchelli et.al. |
2501.16588 |
null |
2025-01-27 |
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models |
Zheng Lian et.al. |
2501.16566 |
null |
2025-01-27 |
LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation |
Farzad Farhadzadeh et.al. |
2501.16559 |
null |
2025-01-27 |
Distributional Information Embedding: A Framework for Multi-bit Watermarking |
Haiyun He et.al. |
2501.16558 |
null |
2025-01-27 |
PackDiT: Joint Human Motion and Text Generation via Mutual Prompting |
Zhongyu Jiang et.al. |
2501.16551 |
null |
2025-01-27 |
PhysAnimator: Physics-Guided Generative Cartoon Animation |
Tianyi Xie et.al. |
2501.16550 |
null |
2025-01-27 |
Sample-Efficient Behavior Cloning Using General Domain Knowledge |
Feiyu Zhu et.al. |
2501.16546 |
null |
2025-01-27 |
Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees |
Piyush Gupta et.al. |
2501.16539 |
null |
2025-01-27 |
Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs |
Jean-Charles Noirot Ferrand et.al. |
2501.16534 |
null |
2025-01-27 |
A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain |
Jorge del Pozo Lérida et.al. |
2501.16533 |
null |
2025-01-27 |
Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction |
Atharva Naik et.al. |
2501.16524 |
null |
2025-01-27 |
How well can LLMs Grade Essays in Arabic? |
Rayed Ghazawi et.al. |
2501.16516 |
null |
2025-01-27 |
Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models |
Sudarshan Kamath Barkur et.al. |
2501.16513 |
null |
2025-01-27 |
Smoothed Embeddings for Robust Language Models |
Ryo Hase et.al. |
2501.16497 |
null |
2025-01-27 |
Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations |
Pablo Valenzuela-Toledo et.al. |
2501.16495 |
null |
2025-01-27 |
Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM |
Payal Kamboj et.al. |
2501.16481 |
link |
2025-01-27 |
Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation |
Philip Hughes et.al. |
2501.16467 |
null |
2025-01-27 |
CoCoNUT: Structural Code Understanding does not fall out of a tree |
Claas Beger et.al. |
2501.16456 |
link |
2025-01-27 |
Detecting Zero-Day Attacks in Digital Substations via In-Context Learning |
Faizan Manzoor et.al. |
2501.16453 |
null |
2025-01-27 |
360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation |
Hamed Firooz et.al. |
2501.16450 |
null |
2025-01-27 |
DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation |
Han Sun et.al. |
2501.16410 |
null |
2025-01-27 |
Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology |
Meiyun Cao et.al. |
2501.16309 |
null |
2025-01-27 |
RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval |
Long Nguyen et.al. |
2501.16303 |
null |
2025-01-27 |
Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width |
Zheng Liu et.al. |
2501.16302 |
null |
2025-01-27 |
Large Models in Dialogue for Active Perception and Anomaly Detection |
Tzoulio Chamiti et.al. |
2501.16300 |
link |
2025-01-27 |
FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers |
Renshan Zhang et.al. |
2501.16297 |
null |
2025-01-27 |
Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models |
Jing Zhang et.al. |
2501.16282 |
null |
2025-01-27 |
Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation |
Jiayi Hong et.al. |
2501.16277 |
link |
2025-01-27 |
URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots – A Case Study at HCMUT |
Long Nguyen et.al. |
2501.16276 |
null |
2025-01-27 |
A foundation model for human-AI collaboration in medical literature mining |
Zifeng Wang et.al. |
2501.16255 |
null |
2025-01-27 |
Multi-Agent Geospatial Copilots for Remote Sensing Workflows |
Chaehong Lee et.al. |
2501.16254 |
null |
2025-01-27 |
Zero-Shot Decision Tree Construction via Large Language Models |
Lucas Carrasco et.al. |
2501.16247 |
null |
2025-01-27 |
CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation |
Xiaochuan Ma et.al. |
2501.16246 |
null |
2025-01-27 |
Phase Transitions in Large Language Models and the $O(N)$ Model |
Youran Sun et.al. |
2501.16241 |
null |
2025-01-27 |
AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses |
Runze Cai et.al. |
2501.16240 |
link |
2025-01-28 |
Distilling foundation models for robust and efficient models in digital pathology |
Alexandre Filiot et.al. |
2501.16239 |
null |
2025-01-27 |
Language-Based Bayesian Optimization Research Assistant (BORA) |
Abdoulatif Cissé et.al. |
2501.16224 |
null |
2025-01-27 |
Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models |
Huayu Li et.al. |
2501.16215 |
link |
2025-01-27 |
Provence: efficient and robust context pruning for retrieval-augmented generation |
Nadezhda Chirkova et.al. |
2501.16214 |
null |
2025-01-27 |
Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs |
Antony Bartlett et.al. |
2501.16191 |
null |
2025-01-27 |
SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting |
Wenxuan Xie et.al. |
2501.16178 |
link |
2025-01-27 |
BAG: Body-Aligned 3D Wearable Asset Generation |
Zhongjin Luo et.al. |
2501.16177 |
null |
2025-01-27 |
Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma |
Richard Willis et.al. |
2501.16173 |
link |
2025-01-27 |
MetaDecorator: Generating Immersive Virtual Tours through Multimodality |
Shuang Xie et.al. |
2501.16164 |
null |
2025-01-27 |
CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge |
Yuwei Zhang et.al. |
2501.16155 |
null |
2025-01-27 |
AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought |
Xin Huang et.al. |
2501.16154 |
null |
2025-01-27 |
AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants |
Pascal J. Sager et.al. |
2501.16150 |
null |
2025-01-27 |
PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing |
Yuwei Zhang et.al. |
2501.16149 |
null |
2025-01-27 |
SampleLLM: Optimizing Tabular Data Synthesis in Recommendations |
Jingtong Gao et.al. |
2501.16125 |
null |
2025-01-27 |
Using Generative Models to Produce Realistic Populations of UK Windstorms |
Yee Chun Tsoi et.al. |
2501.16110 |
null |
2025-01-27 |
Integration of LLM Quality Assurance into an NLG System |
Ching-Yi Chen et.al. |
2501.16078 |
null |
2025-01-27 |
PISCO: Pretty Simple Compression for Retrieval-Augmented Generation |
Maxime Louis et.al. |
2501.16075 |
null |
2025-01-27 |
A generative material transformer using Wyckoff representation |
Pierre-Paul De Breuck et.al. |
2501.16051 |
null |
2025-01-27 |
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation |
Xing Zhang et.al. |
2501.16050 |
null |
2025-01-27 |
PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment |
Vincent Freiberger et.al. |
2501.16033 |
null |
2025-01-27 |
FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments |
Zhiyuan Fu et.al. |
2501.16029 |
null |
2025-01-27 |
Transformability reveals the interplay of dynamics across different network orders |
Ming Xie et.al. |
2501.16016 |
null |
2025-01-27 |
TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference |
Jack Min Ong et.al. |
2501.16007 |
null |
2025-01-27 |
EDSep: An Effective Diffusion-Based Method for Speech Source Separation |
Jinwei Dong et.al. |
2501.15965 |
null |
2025-01-27 |
Rethinking the Bias of Foundation Model under Long-tailed Distribution |
Jiahao Chen et.al. |
2501.15955 |
null |
2025-01-27 |
Understanding Long Videos via LLM-Powered Entity Relation Graphs |
Meng Chu et.al. |
2501.15953 |
null |
2025-01-27 |
TimeHF: Billion-Scale Time Series Models Guided by Human Feedback |
Yongzhi Qi et.al. |
2501.15942 |
null |
2025-01-27 |
SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub |
Benjamin C. Carter et.al. |
2501.15922 |
null |
2025-01-27 |
Parametric Retrieval Augmented Generation |
Weihang Su et.al. |
2501.15915 |
link |
2025-01-27 |
Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation |
Muhammad Taha Tariq et.al. |
2501.15901 |
null |
2025-01-27 |
Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects |
Victor Deng et.al. |
2501.15900 |
null |
2025-01-27 |
Adaptive Width Neural Networks |
Federico Errica et.al. |
2501.15889 |
null |
2025-01-27 |
LCTG Bench: LLM Controlled Text Generation Benchmark |
Kentaro Kurihara et.al. |
2501.15875 |
link |
2025-01-27 |
LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models |
Yuewen Mei et.al. |
2501.15850 |
null |
2025-01-27 |
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model |
Delin Qu et.al. |
2501.15830 |
null |
2025-01-27 |
Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference |
Tharindu B. Hewage et.al. |
2501.15829 |
link |
2025-01-27 |
MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer |
Qi Chen et.al. |
2501.15826 |
null |
2025-01-27 |
LemmaHead: RAG Assisted Proof Generation Using Large Language Models |
Tianbo Yang et.al. |
2501.15797 |
null |
2025-01-27 |
Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? |
Zhiling Chen et.al. |
2501.15795 |
null |
2025-01-27 |
Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs |
Yu Li et.al. |
2501.15791 |
link |
2025-01-27 |
Memorization and Regularization in Generative Diffusion Models |
Ricardo Baptista et.al. |
2501.15785 |
link |
2025-01-27 |
Large Language Models to Diffusion Finetuning |
Edoardo Cetin et.al. |
2501.15781 |
null |
2025-01-27 |
Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages |
Ivory Yang et.al. |
2501.15773 |
link |
2025-01-27 |
GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design |
Yuanfu Sun et.al. |
2501.15755 |
null |
2025-01-27 |
IndicMMLU-Pro: Benchmarking the Indic Large Language Models |
Sankalp KJ et.al. |
2501.15747 |
null |
2025-01-27 |
Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning |
Michael Xieyang Liu et.al. |
2501.15727 |
null |
2025-01-27 |
A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks |
Dong Li et.al. |
2501.15724 |
null |
2025-01-27 |
On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models |
Tadahiro Taniguchi et.al. |
2501.15721 |
null |
2025-01-26 |
Adapting Biomedical Abstracts into Plain language using Large Language Models |
Haritha Gangavarapu et.al. |
2501.15700 |
null |
2025-01-26 |
TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs |
Yuxuan Gu et.al. |
2501.15674 |
null |
2025-01-26 |
Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting |
Yuxin Zhang et.al. |
2501.15641 |
null |
2025-01-26 |
BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation |
Ali Khodabandeh Yalabadi et.al. |
2501.15631 |
link |
2025-01-26 |
Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets |
Eduard Barbu et.al. |
2501.15624 |
null |
2025-01-26 |
Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning |
Zeyu Gan et.al. |
2501.15602 |
link |
2025-01-26 |
Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals |
Yinzhou Wang et.al. |
2501.15599 |
null |
2025-01-26 |
Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images |
Sichen Zhu et.al. |
2501.15598 |
link |
2025-01-26 |
SedarEval: Automated Evaluation using Self-Adaptive Rubrics |
Zhiyuan Fan et.al. |
2501.15595 |
link |
2025-01-26 |
SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain |
Dakuan Lu et.al. |
2501.15587 |
link |
2025-01-26 |
Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework |
Yuhong Sun et.al. |
2501.15581 |
null |
2025-01-26 |
Instruction Tuning for Story Understanding and Generation with Weak Supervision |
Yangshu Yuan et.al. |
2501.15574 |
null |
2025-01-26 |
Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models |
Spencer Ramsey et.al. |
2501.15571 |
null |
2025-01-26 |
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer |
Lin Yueyu et.al. |
2501.15570 |
link |
2025-01-26 |
Ocean-OCR: Towards General OCR Application via a Vision-Language Model |
Song Chen et.al. |
2501.15558 |
link |
2025-01-26 |
Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles |
Hanwen Zhang et.al. |
2501.15544 |
null |
2025-01-26 |
Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths |
Yueyang Wang et.al. |
2501.15522 |
null |
2025-01-26 |
Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification |
Dan Song et.al. |
2501.15503 |
null |
2025-01-26 |
Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning |
Xiaohan Yu et.al. |
2501.15470 |
null |
2025-01-26 |
Data-adaptive Safety Rules for Training Reward Models |
Xiaomin Li et.al. |
2501.15453 |
null |
2025-01-26 |
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas |
Xiaoyang Wang et.al. |
2501.15427 |
null |
2025-01-26 |
Visual Generation Without Guidance |
Huayu Chen et.al. |
2501.15420 |
link |
2025-01-26 |
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement |
Junan Zhang et.al. |
2501.15417 |
null |
2025-01-26 |
The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation |
Raha Aghaei et.al. |
2501.15411 |
null |
2025-01-26 |
Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency |
Irin Kabakum et.al. |
2501.15405 |
null |
2025-01-26 |
How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning |
Tohida Rehman et.al. |
2501.15398 |
null |
2025-01-26 |
Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations |
Zijun Long et.al. |
2501.15379 |
null |
2025-01-26 |
How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback |
Manzong Huang et.al. |
2501.15378 |
null |
2025-01-26 |
Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models |
Melkamu Abay Mersha et.al. |
2501.15374 |
null |
2025-01-26 |
Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis |
Robinson Umeike et.al. |
2501.15370 |
null |
2025-01-26 |
Decentralized Low-Rank Fine-Tuning of Large Language Models |
Sajjad Ghiasvand et.al. |
2501.15361 |
null |
2025-01-26 |
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection |
Bo Yang et.al. |
2501.15355 |
null |
2025-01-25 |
Fairness in LLM-Generated Surveys |
Andrés Abeliuk et.al. |
2501.15351 |
null |
2025-01-25 |
Between Puppet and Actor: Reframing Authorship in this Age of AI Agents |
Yuqian Sun et.al. |
2501.15346 |
null |
2025-01-25 |
Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data |
Jiajie Li et.al. |
2501.15326 |
null |
2025-01-25 |
ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning |
Shangqian Gao et.al. |
2501.15316 |
null |
2025-01-25 |
The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? |
Ayo Adedeji et.al. |
2501.15310 |
null |
2025-01-25 |
You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning |
Ayan Sengupta et.al. |
2501.15296 |
null |
2025-01-24 |
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation |
Xin Zhou et.al. |
2501.14729 |
link |
2025-01-24 |
Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? |
Ipek Baris Schlicht et.al. |
2501.14719 |
null |
2025-01-24 |
Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models |
Naihao Deng et.al. |
2501.14717 |
null |
2025-01-24 |
FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing |
James Seale Smith et.al. |
2501.14713 |
null |
2025-01-24 |
The Karp Dataset |
Mason DiCicco et.al. |
2501.14705 |
null |
2025-01-24 |
Rethinking Table Instruction Tuning |
Naihao Deng et.al. |
2501.14693 |
null |
2025-01-24 |
Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST |
Fuping Wu et.al. |
2501.14685 |
null |
2025-01-24 |
An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations |
Shabnam Hassani et.al. |
2501.14683 |
null |
2025-01-24 |
Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning |
Jisi Zhang et.al. |
2501.14680 |
null |
2025-01-24 |
MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications |
Yixing Jiang et.al. |
2501.14654 |
link |
2025-01-24 |
Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion |
Ziyao Xu et.al. |
2501.14649 |
link |
2025-01-24 |
Towards Scalable Topological Regularizers |
Hiu-Tung Wong et.al. |
2501.14641 |
null |
2025-01-24 |
Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics |
Renato Ghisellini et.al. |
2501.14634 |
null |
2025-01-24 |
Extracting Problem Structure with LLMs for Optimized SAT Local Search |
André Schilder et.al. |
2501.14630 |
null |
2025-01-24 |
Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data |
Jordi Abante et.al. |
2501.14615 |
null |
2025-01-24 |
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations |
Tianming Liang et.al. |
2501.14607 |
null |
2025-01-24 |
Leveraging ChatGPT’s Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research |
Hamid Sarmadi et.al. |
2501.14546 |
null |
2025-01-24 |
VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning |
Benjamin Callewaert et.al. |
2501.14540 |
null |
2025-01-24 |
Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models |
Zhenguang Zhong et.al. |
2501.14530 |
link |
2025-01-24 |
Scene Understanding Enabled Semantic Communication with Open Channel Coding |
Zhe Xiang et.al. |
2501.14520 |
null |
2025-01-24 |
Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel |
Zhuoran Liu et.al. |
2501.14512 |
null |
2025-01-24 |
Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course |
Pavlin G. Poličar et.al. |
2501.14499 |
null |
2025-01-24 |
Evaluating and Improving Graph to Text Generation with Large Language Models |
Jie He et.al. |
2501.14497 |
link |
2025-01-24 |
RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques |
Zhengyang Tang et.al. |
2501.14492 |
link |
2025-01-24 |
Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design |
Taehan Kim et.al. |
2501.14469 |
null |
2025-01-24 |
Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis |
Xiujing Guo et.al. |
2501.14465 |
null |
2025-01-24 |
Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing |
Zeping Yu et.al. |
2501.14457 |
null |
2025-01-24 |
Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains |
Xu Chu et.al. |
2501.14431 |
null |
2025-01-24 |
GraphBC: Improving LLMs for Better Graph Data Processing |
Xu Chu et.al. |
2501.14427 |
null |
2025-01-24 |
CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios |
Michael Fuest et.al. |
2501.14426 |
null |
2025-01-24 |
DeepFlow: Serverless Large Language Model Serving at Scale |
Junhao Hu et.al. |
2501.14417 |
null |
2025-01-24 |
SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation |
Shengjie Wang et.al. |
2501.14400 |
null |
2025-01-24 |
ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer |
Yoni Schirris et.al. |
2501.14379 |
link |
2025-01-24 |
DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing |
Xinyu Ma et.al. |
2501.14371 |
link |
2025-01-24 |
Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches |
Ziad Sakr et.al. |
2501.14366 |
null |
2025-01-24 |
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration |
Kai-Tuo Xu et.al. |
2501.14350 |
link |
2025-01-24 |
Chain-of-Retrieval Augmented Generation |
Liang Wang et.al. |
2501.14342 |
null |
2025-01-24 |
Exploring the sustainable scaling of AI dilemma: A projective study of corporations’ AI environmental impacts |
Clément Desroches et.al. |
2501.14334 |
null |
2025-01-24 |
Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models |
Ridhi Jain et.al. |
2501.14326 |
null |
2025-01-24 |
PAID: A Framework of Product-Centric Advertising Image Design |
Hongyu Chen et.al. |
2501.14316 |
null |
2025-01-24 |
Locality-aware Fair Scheduling in LLM Serving |
Shiyi Cao et.al. |
2501.14312 |
null |
2025-01-24 |
A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education |
Calvin Yeung et.al. |
2501.14305 |
link |
2025-01-24 |
MASTER: A Multi-Agent System with LLM Specialized MCTS |
Bingzheng Gan et.al. |
2501.14304 |
null |
2025-01-24 |
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph |
Xujian Liang et.al. |
2501.14300 |
link |
2025-01-24 |
Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment |
Julian A. Schnabel et.al. |
2501.14296 |
null |
2025-01-24 |
Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes |
Sullam Jeoung et.al. |
2501.14294 |
link |
2025-01-24 |
Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches |
Feng Zhou et.al. |
2501.14291 |
null |
2025-01-24 |
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation |
Sadegh Mahdavi et.al. |
2501.14275 |
link |
2025-01-24 |
Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors |
Yi Zhao et.al. |
2501.14250 |
link |
2025-01-24 |
Humanity’s Last Exam |
Long Phan et.al. |
2501.14249 |
null |
2025-01-24 |
Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game |
Rong Ye et.al. |
2501.14225 |
null |
2025-01-24 |
Top Ten Challenges Towards Agentic Neural Graph Databases |
Jiaxin Bai et.al. |
2501.14224 |
null |
2025-01-24 |
TFG-Flow: Training-free Guidance in Multimodal Generative Flow |
Haowei Lin et.al. |
2501.14216 |
link |
2025-01-24 |
Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading |
Minrui Xu et.al. |
2501.14205 |
null |
2025-01-24 |
VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking |
Runyi Hu et.al. |
2501.14195 |
link |
2025-01-24 |
Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models |
Saaduddin Mahmud et.al. |
2501.14189 |
null |
2025-01-24 |
GeoSim.AI: AI assistants for numerical simulations in geomechanics |
Yared W. Bekele et.al. |
2501.14186 |
null |
2025-01-24 |
AI Chatbots as Professional Service Agents: Developing a Professional Identity |
Wenwen Li et.al. |
2501.14179 |
null |
2025-01-24 |
Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models |
Yile Gu et.al. |
2501.14170 |
null |
2025-01-24 |
Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction |
Dongming Sheng et.al. |
2501.14144 |
null |
2025-01-23 |
Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation |
Derek Yotheringhay et.al. |
2501.14119 |
null |
2025-01-23 |
Domain-Factored Untrained Deep Prior for Spectrum Cartography |
Subash Timilsina et.al. |
2501.14116 |
null |
2025-01-23 |
MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning |
Joshua Davis et.al. |
2501.14105 |
link |
2025-01-23 |
StreamingRAG: Real-time Contextual Retrieval and Generation Framework |
Murugan Sankaradas et.al. |
2501.14101 |
null |
2025-01-23 |
Enhancing Biomedical Relation Extraction with Directionality |
Po-Ting Lai et.al. |
2501.14079 |
link |
2025-01-23 |
LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language |
Yubin Ge et.al. |
2501.14073 |
null |
2025-01-23 |
Efficient 2D CT Foundation Model for Contrast Phase Classification |
Benjamin Hou et.al. |
2501.14066 |
null |
2025-01-23 |
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models |
Jakob Krogh Petersen et.al. |
2501.14051 |
link |
2025-01-23 |
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps |
Andrey Palaev et.al. |
2501.14046 |
link |
2025-01-23 |
Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions |
Jianfeng Zhu et.al. |
2501.14037 |
null |
2025-01-23 |
CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation |
Guofeng Cui et.al. |
2501.13927 |
null |
2025-01-23 |
Improving Video Generation with Human Feedback |
Jie Liu et.al. |
2501.13918 |
null |
2025-01-23 |
Binary Diffusion Probabilistic Model |
Vitaliy Kinakh et.al. |
2501.13915 |
null |
2025-01-23 |
Analysis of Indic Language Capabilities in LLMs |
Aatman Vaidya et.al. |
2501.13912 |
null |
2025-01-23 |
Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models |
Linh Tran et.al. |
2501.13904 |
null |
2025-01-23 |
Exploring Finetuned Audio-LLM on Heart Murmur Features |
Adrian Florea et.al. |
2501.13884 |
null |
2025-01-23 |
The machine learning platform for developers of large systems |
Alexey Naikov et.al. |
2501.13881 |
null |
2025-01-23 |
A RAG-Based Institutional Assistant |
Gustavo Kuratomi et.al. |
2501.13880 |
null |
2025-01-23 |
On the Reasoning Capacity of AI Models and How to Quantify It |
Santosh Kumar Radha et.al. |
2501.13833 |
null |
2025-01-23 |
Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing |
Hao Zhang et.al. |
2501.13831 |
null |
2025-01-23 |
Hallucinations Can Improve Large Language Models in Drug Discovery |
Shuzhou Yuan et.al. |
2501.13824 |
null |
2025-01-23 |
Large Language Model driven Policy Exploration for Recommender Systems |
Jie Wang et.al. |
2501.13816 |
null |
2025-01-23 |
Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change |
Mowafak Allaham et.al. |
2501.13802 |
null |
2025-01-23 |
Parameter-Efficient Fine-Tuning for Foundation Models |
Dan Zhang et.al. |
2501.13787 |
link |
2025-01-23 |
Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling |
Tanya Rodchenko et.al. |
2501.13779 |
null |
2025-01-23 |
Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework |
Yoonsang Kim et.al. |
2501.13778 |
link |
2025-01-23 |
Do Large Language Models Truly Understand Geometric Structures? |
Xiaofeng Wang et.al. |
2501.13773 |
link |
2025-01-23 |
Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak |
Erjia Xiao et.al. |
2501.13772 |
null |
2025-01-23 |
UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models |
Xin Xu et.al. |
2501.13766 |
null |
2025-01-23 |
EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents |
Yuhui Yun et.al. |
2501.13746 |
null |
2025-01-23 |
GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification |
Te Pei et.al. |
2501.13743 |
null |
2025-01-23 |
An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities |
Zezhou Yang et.al. |
2501.13742 |
link |
2025-01-23 |
Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks |
Chang Gong et.al. |
2501.13731 |
null |
2025-01-23 |
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation |
Shi-Qi Yan et.al. |
2501.13726 |
null |
2025-01-23 |
Musical ethnocentrism in Large Language Models |
Anna Kruspe et.al. |
2501.13720 |
null |
2025-01-23 |
A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation |
Dario Serez et.al. |
2501.13718 |
null |
2025-01-23 |
EventVL: Understand Event Streams via Multimodal Large Language Model |
Pengteng Li et.al. |
2501.13707 |
null |
2025-01-23 |
DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale |
Linghao Zhang et.al. |
2501.13699 |
null |
2025-01-23 |
Question Answering on Patient Medical Records with Private Fine-Tuned LLMs |
Sara Kothari et.al. |
2501.13687 |
null |
2025-01-23 |
HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor |
Zihui Wu et.al. |
2501.13677 |
link |
2025-01-23 |
How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization |
Shezheng Song et.al. |
2501.13669 |
null |
2025-01-23 |
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models |
Yizheng Sun et.al. |
2501.13652 |
null |
2025-01-23 |
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models |
Zhenghao Lin et.al. |
2501.13629 |
null |
2025-01-23 |
Text-to-SQL based on Large Language Models and Database Keyword Search |
Eduardo R. Nascimento et.al. |
2501.13594 |
null |
2025-01-23 |
Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization |
Lei Huang et.al. |
2501.13573 |
null |
2025-01-23 |
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt |
Tao Liu et.al. |
2501.13554 |
link |
2025-01-23 |
LLMs Can Plan Only If We Tell Them |
Bilgehan Sel et.al. |
2501.13545 |
null |
2025-01-23 |
ReasVQA: Advancing VideoQA with Imperfect Reasoning Process |
Jianxin Liang et.al. |
2501.13536 |
null |
2025-01-23 |
RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles |
Munachiso Nwadike et.al. |
2501.13491 |
link |
2025-01-23 |
Adaptive Testing for LLM-Based Applications: A Diversity-based Approach |
Juyeon Yoon et.al. |
2501.13480 |
null |
2025-01-23 |
LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation |
JiaXin Chen et.al. |
2501.13475 |
null |
2025-01-23 |
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge |
Haomiao Xiong et.al. |
2501.13468 |
link |
2025-01-23 |
Spurious Forgetting in Continual Learning of Language Models |
Junhao Zheng et.al. |
2501.13453 |
link |
2025-01-23 |
Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models |
Bo Gao et.al. |
2501.13428 |
null |
2025-01-23 |
Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling |
Tomek Jaroslawski et.al. |
2501.13415 |
null |
2025-01-23 |
VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework |
He Kong et.al. |
2501.13411 |
link |
2025-01-23 |
Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures |
Minglong Dong et.al. |
2501.13396 |
null |
2025-01-23 |
Can Large Language Models Understand Preferences in Personalized Recommendation? |
Zhaoxuan Tan et.al. |
2501.13391 |
link |
2025-01-23 |
Do as We Do, Not as You Think: the Conformity of Large Language Models |
Zhiyuan Weng et.al. |
2501.13381 |
link |
2025-01-23 |
Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility |
Gabrielle Hoyer et.al. |
2501.13376 |
link |
2025-01-23 |
Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement |
Jae-Sung Bae et.al. |
2501.13372 |
null |
2025-01-23 |
Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification |
Yuzhuo Li et.al. |
2501.13368 |
null |
2025-01-23 |
50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications |
Zewei Shi et.al. |
2501.13351 |
link |
2025-01-23 |
MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize |
Haohang Xu et.al. |
2501.13349 |
null |
2025-01-23 |
Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation |
Rong Shan et.al. |
2501.13344 |
link |
2025-01-23 |
Multi-aspect Knowledge Distillation with Large Language Model |
Taegyeong Lee et.al. |
2501.13341 |
link |
2025-01-23 |
Generative Multi-Form Bayesian Optimization |
Zhendong Guo et.al. |
2501.13337 |
null |
2025-01-23 |
SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network |
Songge Zhang et.al. |
2501.13318 |
null |
2025-01-23 |
Representing Visualization Insights as a Dense Insight Network |
Jane Hoffswell et.al. |
2501.13309 |
null |
2025-01-23 |
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia |
Xuelong Geng et.al. |
2501.13306 |
link |
2025-01-23 |
Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers |
Akshit Achara et.al. |
2501.13302 |
link |
2025-01-23 |
Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents |
Shrinidhi Kumbhar et.al. |
2501.13299 |
null |
2025-01-23 |
RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering |
Yang Bai et.al. |
2501.13297 |
link |
2025-01-23 |
Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols |
John Joon Young Chung et.al. |
2501.13284 |
null |
2025-01-22 |
MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis |
Daeun Jung et.al. |
2501.13277 |
link |
2025-01-22 |
RAG-Reward: Optimizing RAG with Reward Modeling and RLHF |
Hanning Zhang et.al. |
2501.13264 |
null |
2025-01-22 |
Exploring GPT’s Ability as a Judge in Music Understanding |
Kun Fang et.al. |
2501.13261 |
link |
2025-01-22 |
Bypassing Array Canaries via Autonomous Function Call Resolution |
Nathaniel Oh et.al. |
2501.13256 |
link |
2025-01-22 |
S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning |
Yichen Wu et.al. |
2501.13198 |
null |
2025-01-22 |
Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century |
Axel Loewe et.al. |
2501.13142 |
null |
2025-01-23 |
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding |
Boqiang Zhang et.al. |
2501.13106 |
link |
2025-01-22 |
Robust Representation Consistency Model via Contrastive Denoising |
Jiachen Lei et.al. |
2501.13094 |
link |
2025-01-22 |
Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment |
Melissa Kazemi Rad et.al. |
2501.13080 |
null |
2025-01-22 |
Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning |
Bohao Yang et.al. |
2501.13042 |
link |
2025-01-22 |
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament |
Yantao Liu et.al. |
2501.13007 |
link |
2025-01-22 |
Neural network enhanced cross entropy benchmark for monitored circuits |
Yangrui Hu et.al. |
2501.13005 |
null |
2025-01-22 |
Large Language Model-Based Semantic Communication System for Image Transmission |
Soheyb Ribouh et.al. |
2501.12988 |
null |
2025-01-22 |
LLM4WM: Adapting LLM for Wireless Multi-Tasking |
Xuanyu Liu et.al. |
2501.12983 |
null |
2025-01-22 |
Low-dimensional adaptation of diffusion models: Convergence in total variation |
Jiadong Liang et.al. |
2501.12982 |
null |
2025-01-22 |
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models |
Chongren Sun et.al. |
2501.12975 |
link |
2025-01-22 |
Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs |
Jan Corazza et.al. |
2501.12972 |
null |
2025-01-22 |
It’s complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act |
Kristof Meding et.al. |
2501.12962 |
null |
2025-01-22 |
Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference |
Weizhi Fei et.al. |
2501.12959 |
null |
2025-01-22 |
GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models |
Pengxiang Zhao et.al. |
2501.12956 |
null |
2025-01-22 |
3D Object Manipulation in a Single Image using Generative Models |
Ruisi Zhao et.al. |
2501.12935 |
null |
2025-01-22 |
Correctness Assessment of Code Generated by Large Language Models Using Internal Representations |
Tuan-Dung Bui et.al. |
2501.12934 |
link |
2025-01-22 |
DynamicEarth: How Far are We from Open-Vocabulary Change Detection? |
Kaiyu Li et.al. |
2501.12931 |
null |
2025-01-22 |
A Functional Software Reference Architecture for LLM-Integrated Systems |
Alessio Bucaioni et.al. |
2501.12904 |
null |
2025-01-22 |
Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration |
Offa Kingsleigh et.al. |
2501.12901 |
null |
2025-01-22 |
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback |
Yafu Li et.al. |
2501.12895 |
link |
2025-01-23 |
Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program |
Carlton Shepherd et.al. |
2501.12883 |
null |
2025-01-22 |
WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge |
Jingyuan Chen et.al. |
2501.12877 |
null |
2025-01-22 |
ACEBench: Who Wins the Match Point in Tool Learning? |
Chen Chen et.al. |
2501.12851 |
null |
2025-01-22 |
AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation |
Aghiles Kebaili et.al. |
2501.12840 |
null |
2025-01-22 |
Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home |
Viktor Moskvoretskii et.al. |
2501.12835 |
null |
2025-01-22 |
Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek |
John Pavlopoulos et.al. |
2501.12826 |
link |
2025-01-22 |
Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks |
Alessio Quercia et.al. |
2501.12824 |
link |
2025-01-22 |
Certified Guidance for Planning with Deep Generative Models |
Francesco Giacomarra et.al. |
2501.12815 |
null |
2025-01-22 |
Revisit Self-Debugging with Self-Generated Tests for Code Generation |
Xiancai Chen et.al. |
2501.12793 |
null |
2025-01-22 |
LLMs as Repositories of Factual Knowledge: Limitations and Solutions |
Seyed Mahed Mousavi et.al. |
2501.12774 |
null |
2025-01-22 |
NExtLong: Toward Effective Long-Context Training without Long Documents |
Chaochen Gao et.al. |
2501.12766 |
link |
2025-01-22 |
Online Preference Alignment for Language Models via Count-based Exploration |
Chenjia Bai et.al. |
2501.12735 |
link |
2025-01-22 |
Paradigm-Based Automatic HDL Code Generation Using LLMs |
Wenhao Sun et.al. |
2501.12702 |
null |
2025-01-22 |
Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression |
Kai Yoshida et.al. |
2501.12698 |
null |
2025-01-22 |
Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering |
Qian Tao et.al. |
2501.12697 |
null |
2025-01-22 |
SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling |
Shengshi Yao et.al. |
2501.12696 |
null |
2025-01-22 |
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation |
Yifan Yu et.al. |
2501.12689 |
null |
2025-01-22 |
Distillation Quantification for Large Language Models |
Sunbowen Lee et.al. |
2501.12619 |
link |
2025-01-22 |
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We? |
Taiming Wang et.al. |
2501.12617 |
null |
2025-01-22 |
Kimi k1.5: Scaling Reinforcement Learning with LLMs |
Kimi Team et.al. |
2501.12599 |
null |
2025-01-22 |
Leveraging LLMs to Create a Haptic Devices’ Recommendation System |
Yang Liu et.al. |
2501.12573 |
null |
2025-01-22 |
Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review |
Rock Yuren Pang et.al. |
2501.12557 |
link |
2025-01-21 |
Human-like conceptual representations emerge from language prediction |
Ningyu Xu et.al. |
2501.12547 |
null |
2025-01-21 |
How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models? |
Mirali Purohit et.al. |
2501.12535 |
null |
2025-01-21 |
An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts |
Dhia Elhaq Rzig et.al. |
2501.12521 |
null |
2025-01-21 |
A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data |
Minh Tran et.al. |
2501.12501 |
null |
2025-01-21 |
The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws |
Tian Jin et.al. |
2501.12486 |
null |
2025-01-21 |
An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models |
Xiaoyu Chu et.al. |
2501.12469 |
link |
2025-01-21 |
Adaptive PII Mitigation Framework for Large Language Models |
Shubhi Asthana et.al. |
2501.12465 |
null |
2025-01-21 |
Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management |
Arthur Vitui et.al. |
2501.12461 |
link |
2025-01-21 |
Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications |
Shubhi Asthana et.al. |
2501.12456 |
null |
2025-01-21 |
Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation |
Dongsheng Zhu et.al. |
2501.12432 |
null |
2025-01-21 |
FREYR: A Framework for Recognizing and Executing Your Requests |
Roberto Gallotta et.al. |
2501.12423 |
link |
2025-01-21 |
CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning |
Eunjee Choi et.al. |
2501.12422 |
null |
2025-01-22 |
InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling |
Yi Wang et.al. |
2501.12386 |
link |
2025-01-21 |
Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks |
Greg Olmschenk et.al. |
2501.12383 |
null |
2025-01-21 |
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding |
Yilun Zhao et.al. |
2501.12380 |
link |
2025-01-22 |
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos |
Sili Chen et.al. |
2501.12375 |
null |
2025-01-21 |
Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists |
Thomas F. Eisenmann et.al. |
2501.12374 |
link |
2025-01-21 |
Is Long Context All You Need? Leveraging LLM’s Extended Context for NL2SQL |
Yeounoh Chung et.al. |
2501.12372 |
null |
2025-01-21 |
Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration |
Thomas Walshe et.al. |
2501.12332 |
null |
2025-01-21 |
Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops |
Mohamed Harmanani et.al. |
2501.12331 |
link |
2025-01-21 |
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model |
Xianwei Zhuang et.al. |
2501.12327 |
link |
2025-01-21 |
LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations |
Hasan Abu-Rasheed et.al. |
2501.12300 |
null |
2025-01-21 |
MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks |
Qishen Zhou et.al. |
2501.12281 |
link |
2025-01-21 |
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement |
Maosong Cao et.al. |
2501.12273 |
link |
2025-01-21 |
FOCUS: First Order Concentrated Updating Scheme |
Yizhou Liu et.al. |
2501.12243 |
null |
2025-01-21 |
InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models |
Pha Nguyen et.al. |
2501.12231 |
null |
2025-01-21 |
CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning |
Yuanheng Fang et.al. |
2501.12226 |
null |
2025-01-21 |
Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces |
Allard Oelen et.al. |
2501.12221 |
null |
2025-01-21 |
You Can’t Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense |
Wuyuao Mai et.al. |
2501.12210 |
null |
2025-01-21 |
Explainability for Vision Foundation Models: A Survey |
Rémi Kazmierczak et.al. |
2501.12203 |
null |
2025-01-22 |
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation |
Zibo Zhao et.al. |
2501.12202 |
link |
2025-01-21 |
BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks |
Zhuang Li et.al. |
2501.12174 |
null |
2025-01-21 |
Contextualizing Recommendation Explanations with LLMs: A User Study |
Yuanjun Feng et.al. |
2501.12152 |
null |
2025-01-21 |
Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities |
Qirun Dai et.al. |
2501.12147 |
null |
2025-01-21 |
Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot |
Daniele Bifolco et.al. |
2501.12134 |
null |
2025-01-21 |
Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions |
Tim Schreiter et.al. |
2501.12128 |
null |
2025-01-21 |
Can open source large language models be used for tumor documentation in Germany? – An evaluation on urological doctors’ notes |
Stefan Lenz et.al. |
2501.12106 |
link |
2025-01-21 |
Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis |
Weile Luo et.al. |
2501.12084 |
null |
2025-01-21 |
Phishing Awareness via Game-Based Learning |
Argianto Rahartomo et.al. |
2501.12077 |
link |
2025-01-21 |
PINNsAgent: Automated PDE Surrogation with Large Language Models |
Qingpo Wuwu et.al. |
2501.12053 |
null |
2025-01-21 |
Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation |
Chen Griner et.al. |
2501.12033 |
null |
2025-01-21 |
Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing’s Syndrome Diagnosis in Facial Analysis |
Hongjun Liu et.al. |
2501.12023 |
null |
2025-01-21 |
Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection? |
Samantha Min Er Yew et.al. |
2501.12016 |
null |
2025-01-21 |
Rate-Aware Learned Speech Compression |
Jun Xu et.al. |
2501.11999 |
null |
2025-01-21 |
Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models |
Rupesh Raj Karn et.al. |
2501.11979 |
null |
2025-01-21 |
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues |
Maya Medjad et.al. |
2501.11977 |
link |
2025-01-21 |
Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization |
Jie Zhao et.al. |
2501.11968 |
null |
2025-01-21 |
A Hybrid Attention Framework for Fake News Detection with Large Language Models |
Xiaochuan Xu et.al. |
2501.11967 |
null |
2025-01-21 |
TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection |
Yang Cao et.al. |
2501.11960 |
null |
2025-01-21 |
Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model |
Minghan Wang et.al. |
2501.11953 |
null |
2025-01-21 |
ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation |
Peter Devine et.al. |
2501.11929 |
link |
2025-01-21 |
Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model |
He Chang et.al. |
2501.11911 |
null |
2025-01-21 |
Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation |
Junhong Lian et.al. |
2501.11900 |
link |
2025-01-22 |
Med-R $^2$ : Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine |
Keer Lu et.al. |
2501.11885 |
link |
2025-01-21 |
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning |
Yafu Li et.al. |
2501.11877 |
link |
2025-01-21 |
LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems |
Venkata Sai Aswath Duvvuru et.al. |
2501.11864 |
null |
2025-01-21 |
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents |
Zhili Cheng et.al. |
2501.11858 |
link |
2025-01-21 |
Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance |
Nikos Kanakaris et.al. |
2501.11849 |
link |
2025-01-21 |
A Survey on Memory-Efficient Large-Scale Model Training in AI for Science |
Kaiyuan Tian et.al. |
2501.11847 |
null |
2025-01-21 |
Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction |
Noah L. Schroeder et.al. |
2501.11840 |
null |
2025-01-21 |
PXGen: A Post-hoc Explainable Method for Generative Models |
Yen-Lung Huang et.al. |
2501.11827 |
null |
2025-01-21 |
CogMorph: Cognitive Morphing Attacks for Text-to-Image Models |
Zonglei Jing et.al. |
2501.11815 |
null |
2025-01-20 |
Benchmarking Large Language Models via Random Variables |
Zijin Hong et.al. |
2501.11790 |
null |
2025-01-20 |
Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection |
Ali Naseh et.al. |
2501.11786 |
null |
2025-01-20 |
Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference |
Pouya Hamadanian et.al. |
2501.11779 |
link |
2025-01-20 |
The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers |
Alina Starovolsky-Shitrit et.al. |
2501.11770 |
null |
2025-01-20 |
Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems |
Fatemeh Nazary et.al. |
2501.11759 |
link |
2025-01-20 |
A generalizable 3D framework and model for self-supervised learning in medical imaging |
Tony Xu et.al. |
2501.11755 |
link |
2025-01-20 |
Are generative models fair? A study of racial bias in dermatological image generation |
Miguel López-Pérez et.al. |
2501.11752 |
null |
2025-01-20 |
Optimizing Pretraining Data Mixtures with LLM-Estimated Utility |
William Held et.al. |
2501.11747 |
null |
2025-01-20 |
MedicoSAM: Towards foundation models for medical image segmentation |
Anwai Archit et.al. |
2501.11734 |
link |
2025-01-20 |
Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks |
Zhenhailong Wang et.al. |
2501.11733 |
null |
2025-01-20 |
Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy |
Saeid Asgari Taghanaki et.al. |
2501.11721 |
link |
2025-01-20 |
YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners’ Perspectives |
Nong Ming et.al. |
2501.11712 |
link |
2025-01-20 |
Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution |
Ramtin Ehsani et.al. |
2501.11709 |
link |
2025-01-20 |
Trustformer: A Trusted Federated Transformer |
Ali Abbasi Tadi et.al. |
2501.11706 |
null |
2025-01-20 |
Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) |
Brian E. Perron et.al. |
2501.11705 |
null |
2025-01-20 |
Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling |
Zhenyu Hou et.al. |
2501.11651 |
link |
2025-01-20 |
Trojan Detection Through Pattern Recognition for Large Language Models |
Vedant Bhasin et.al. |
2501.11621 |
null |
2025-01-20 |
Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems |
Giorgio Robino et.al. |
2501.11613 |
null |
2025-01-20 |
SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks |
Wentao Wan et.al. |
2501.11599 |
link |
2025-01-20 |
Recurrent Diffusion for Large-Scale Parameter Generation |
Kai Wang et.al. |
2501.11587 |
link |
2025-01-20 |
Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models |
Mahyar Habibi et.al. |
2501.11581 |
null |
2025-01-20 |
Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution |
Zhiyuan You et.al. |
2501.11561 |
null |
2025-01-20 |
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation |
Jinyu Wang et.al. |
2501.11551 |
link |
2025-01-20 |
UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion |
Zixuan Chen et.al. |
2501.11515 |
null |
2025-01-20 |
Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges |
Vincent Koc et.al. |
2501.11496 |
null |
2025-01-20 |
Graph-defined Language Learning with LLMs |
Huachi Zhou et.al. |
2501.11478 |
null |
2025-01-20 |
Curiosity-Driven Reinforcement Learning from Human Feedback |
Haoran Sun et.al. |
2501.11463 |
link |
2025-01-20 |
Ontology Matching with Large Language Models and Prioritized Depth-First Search |
Maria Taboada et.al. |
2501.11441 |
null |
2025-01-20 |
One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor |
Zhikun Wu et.al. |
2501.11433 |
null |
2025-01-20 |
A Survey on Diffusion Models for Anomaly Detection |
Jing Liu et.al. |
2501.11430 |
link |
2025-01-20 |
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training |
Siyu Yuan et.al. |
2501.11425 |
link |
2025-01-20 |
Neural Contextual Reinforcement Framework for Logical Structure Language Generation |
Marcus Irvin et.al. |
2501.11417 |
null |
2025-01-20 |
Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing |
Kevin Sim et.al. |
2501.11411 |
null |
2025-01-20 |
Revisiting Language Models in Neural News Recommender Systems |
Yuyue Zhao et.al. |
2501.11391 |
link |
2025-01-20 |
Towards Advancing Code Generation with Large Language Models: A Research Roadmap |
Haolin Jin et.al. |
2501.11354 |
null |
2025-01-20 |
EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery |
Guankun Wang et.al. |
2501.11347 |
link |
2025-01-20 |
GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video |
Zhenliang Ni et.al. |
2501.11340 |
null |
2025-01-20 |
Few-shot Policy (de)composition in Conversational Question Answering |
Kyle Erwin et.al. |
2501.11335 |
null |
2025-01-20 |
Nested Annealed Training Scheme for Generative Adversarial Networks |
Chang Wan et.al. |
2501.11318 |
null |
2025-01-20 |
Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning |
Zhongtian Hu et.al. |
2501.11292 |
null |
2025-01-20 |
Large Language Model Agents for Radio Map Generation and Wireless Network Planning |
Hongye Quan et.al. |
2501.11283 |
null |
2025-01-20 |
Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries |
Yi-Hui Lee et.al. |
2501.11273 |
null |
2025-01-20 |
Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios |
Zhongtian Hu et.al. |
2501.11269 |
null |
2025-01-20 |
Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian |
Wannita Takerngsaksiri et.al. |
2501.11264 |
link |
2025-01-20 |
Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models |
Zhuangzhuang Yan et.al. |
2501.11247 |
null |
2025-01-20 |
Irony in Emojis: A Comparative Study of Human and LLM Interpretation |
Yawen Zheng et.al. |
2501.11241 |
null |
2025-01-20 |
KPL: Training-Free Medical Knowledge Mining of Vision-Language Models |
Jiaxiang Liu et.al. |
2501.11231 |
link |
2025-01-20 |
Reasoning Language Models: A Blueprint |
Maciej Besta et.al. |
2501.11223 |
link |
2025-01-20 |
Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation |
Ivan Lopez et.al. |
2501.11199 |
null |
2025-01-19 |
Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests |
Kristin Blesch et.al. |
2501.11178 |
link |
2025-01-17 |
FaceXBench: Evaluating Multimodal LLMs on Face Understanding |
Kartik Narayan et.al. |
2501.10360 |
link |
2025-01-17 |
Zero-Shot Monocular Scene Flow Estimation in the Wild |
Yiqing Liang et.al. |
2501.10357 |
null |
2025-01-17 |
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems |
Weibo Gao et.al. |
2501.10332 |
link |
2025-01-17 |
Large language models for automated scholarly paper review: A survey |
Zhenzhen Zhuang et.al. |
2501.10326 |
null |
2025-01-17 |
HiMix: Reducing Computational Complexity in Large Vision-Language Models |
Xuange Zhang et.al. |
2501.10318 |
null |
2025-01-17 |
Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs |
Claudio Di Sipio et.al. |
2501.10313 |
null |
2025-01-17 |
Computational Protein Science in the Era of Large Language Models (LLMs) |
Wenqi Fan et.al. |
2501.10282 |
null |
2025-01-17 |
Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation |
Azat Abdullin et.al. |
2501.10200 |
null |
2025-01-17 |
Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education |
William Hersh et.al. |
2501.10186 |
null |
2025-01-17 |
Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval |
Vera Pavlova et.al. |
2501.10175 |
null |
2025-01-17 |
Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis |
Abhishek Kaushik et.al. |
2501.10134 |
null |
2025-01-17 |
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario |
Lucen Zhong et.al. |
2501.10132 |
link |
2025-01-17 |
PaSa: An LLM Agent for Comprehensive Academic Paper Search |
Yichen He et.al. |
2501.10120 |
link |
2025-01-17 |
AI-Generated Music Detection and its Challenges |
Darius Afchar et.al. |
2501.10111 |
link |
2025-01-17 |
LLM Reasoner and Automated Planner: A new NPC approach |
Israel Puerta-Merino et.al. |
2501.10106 |
null |
2025-01-17 |
Universal Actions for Enhanced Embodied Foundation Models |
Jinliang Zheng et.al. |
2501.10105 |
link |
2025-01-17 |
Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks |
Michael Schwingshackl et.al. |
2501.10080 |
link |
2025-01-17 |
FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization |
Zhaopeng Gu et.al. |
2501.10067 |
link |
2025-01-17 |
Accelerating Large Language Models through Partially Linear Feed-Forward Network |
Gansen Hu et.al. |
2501.10054 |
null |
2025-01-17 |
AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search |
Wenfeng Feng et.al. |
2501.10053 |
null |
2025-01-17 |
Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists |
Alyssia Chen et.al. |
2501.10037 |
null |
2025-01-17 |
Mapping scientific communities at scale |
Victor Barbier et.al. |
2501.10035 |
link |
2025-01-17 |
Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions |
Zhijie Tan et.al. |
2501.10011 |
null |
2025-01-17 |
Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models |
Qiang Liu et.al. |
2501.09997 |
null |
2025-01-17 |
Agent-as-Judge for Factual Summarization of Long Narratives |
Yeonseok Jeong et.al. |
2501.09993 |
link |
2025-01-17 |
RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation |
Yuefan Cao et.al. |
2501.09982 |
null |
2025-01-17 |
GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions |
Heda Zuo et.al. |
2501.09972 |
null |
2025-01-17 |
Explainable artificial intelligence (XAI): from inherent explainability to large language models |
Fuseini Mumuni et.al. |
2501.09967 |
null |
2025-01-17 |
A Survey on Multi-Turn Interaction Capabilities of Large Language Models |
Chen Zhang et.al. |
2501.09959 |
null |
2025-01-17 |
FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs |
Zengyi Gao et.al. |
2501.09957 |
null |
2025-01-17 |
AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations |
Jamin Seo et.al. |
2501.09954 |
link |
2025-01-17 |
Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt |
Qingcheng Zeng et.al. |
2501.09950 |
null |
2025-01-17 |
MultiPruner: Balanced Structure Removal in Foundation Models |
J. Pablo Muñoz et.al. |
2501.09949 |
link |
2025-01-17 |
Steering Large Language Models with Feature Guided Activation Additions |
Samuel Soo et.al. |
2501.09929 |
null |
2025-01-17 |
Towards A Litmus Test for Common Sense |
Hugo Latapie et.al. |
2501.09913 |
null |
2025-01-17 |
Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project’s Talent Knowledge Graph |
Jiawei Xu et.al. |
2501.09909 |
null |
2025-01-17 |
Position: Open and Closed Large Language Models in Healthcare |
Jiawei Xu et.al. |
2501.09906 |
null |
2025-01-17 |
FoundationStereo: Zero-Shot Stereo Matching |
Bowen Wen et.al. |
2501.09898 |
link |
2025-01-17 |
Evolving Deeper LLM Thinking |
Kuang-Huei Lee et.al. |
2501.09891 |
null |
2025-01-17 |
Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment |
Mohammad Sadegh Sheikhaei et.al. |
2501.09888 |
link |
2025-01-17 |
FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis |
Zhe Chen et.al. |
2501.09887 |
null |
2025-01-16 |
ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction |
Izzeddin Teeti et.al. |
2501.09878 |
null |
2025-01-16 |
Geometry-Preserving Encoder/Decoder in Latent Generative Models |
Wonjun Lee et.al. |
2501.09876 |
null |
2025-01-16 |
An LLM-Guided Tutoring System for Social Skills Training |
Michael Guevarra et.al. |
2501.09870 |
null |
2025-01-16 |
Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing |
Wenhan Wang et.al. |
2501.09866 |
null |
2025-01-16 |
Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message |
Jiawei Li et.al. |
2501.09861 |
null |
2025-01-16 |
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery |
Shristi Das Biswas et.al. |
2501.09826 |
link |
2025-01-16 |
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs |
Nada Saadi et.al. |
2501.09825 |
null |
2025-01-16 |
BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling |
Daniele Castellana et.al. |
2501.09821 |
link |
2025-01-16 |
Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems |
Soham Roy et.al. |
2501.09801 |
null |
2025-01-16 |
Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API |
Andrey Labunets et.al. |
2501.09798 |
null |
2025-01-16 |
GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation |
Weiliang Tang et.al. |
2501.09783 |
null |
2025-01-16 |
SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation |
Wanqi Yin et.al. |
2501.09782 |
link |
2025-01-16 |
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos |
Zhongwei Ren et.al. |
2501.09781 |
null |
2025-01-16 |
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong |
Tairan Fu et.al. |
2501.09775 |
null |
2025-01-16 |
Distilling Multi-modal Large Language Models for Autonomous Driving |
Deepti Hegde et.al. |
2501.09757 |
null |
2025-01-16 |
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation |
Philippe Hansen-Estruch et.al. |
2501.09755 |
null |
2025-01-16 |
Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues |
Youngjoon Jang et.al. |
2501.09754 |
null |
2025-01-16 |
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking |
Zekun Xi et.al. |
2501.09751 |
link |
2025-01-16 |
Enhancing Lexicon-Based Text Embeddings with Large Language Models |
Yibin Lei et.al. |
2501.09749 |
null |
2025-01-16 |
Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models |
Bihui Jin et.al. |
2501.09745 |
null |
2025-01-16 |
KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports |
Hajung Kim et.al. |
2501.09744 |
null |
2025-01-16 |
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps |
Nanye Ma et.al. |
2501.09732 |
null |
2025-01-16 |
A Simple Aerial Detection Baseline of Multimodal Language Models |
Qingyun Li et.al. |
2501.09720 |
link |
2025-01-16 |
Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text |
Jihed Ncib et.al. |
2501.09719 |
null |
2025-01-16 |
CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education |
Tianyu Wang et.al. |
2501.09709 |
link |
2025-01-16 |
Domain Adaptation of Foundation LLMs for e-Commerce |
Christian Herold et.al. |
2501.09706 |
null |
2025-01-16 |
Cueless EEG imagined speech for subject identification: dataset and benchmarks |
Ali Derakhshesh et.al. |
2501.09700 |
link |
2025-01-16 |
Simulated Interactive Debugging |
Yannic Noller et.al. |
2501.09694 |
null |
2025-01-17 |
Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities |
Fengli Xu et.al. |
2501.09686 |
null |
2025-01-16 |
Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review |
Masatoshi Uehara et.al. |
2501.09685 |
null |
2025-01-16 |
Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark |
Alexis Roger et.al. |
2501.09672 |
null |
2025-01-16 |
A Survey of Research in Large Language Models for Electronic Design Automation |
Jingyu Pan et.al. |
2501.09655 |
null |
2025-01-16 |
The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models |
Jonathan Katzy et.al. |
2501.09653 |
null |
2025-01-16 |
CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding |
Johannes Kirmayr et.al. |
2501.09645 |
link |
2025-01-17 |
LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading |
Kuan-Ming Liu et.al. |
2501.09636 |
null |
2025-01-16 |
Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework |
Yushen Lin et.al. |
2501.09631 |
null |
2025-01-16 |
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment |
Chaoqi Wang et.al. |
2501.09620 |
link |
2025-01-16 |
From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs |
Hrithik Majumdar Shibu et.al. |
2501.09604 |
link |
2025-01-16 |
Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures |
Pratyush Dhingra et.al. |
2501.09588 |
null |
2025-01-16 |
Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis |
Tingxuan Chen et.al. |
2501.09555 |
link |
2025-01-16 |
AI in Support of Diversity and Inclusion |
Çiçek Güven et.al. |
2501.09534 |
null |
2025-01-16 |
Confidence Estimation for Error Detection in Text-to-SQL Systems |
Oleg Somov et.al. |
2501.09527 |
link |
2025-01-16 |
Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data |
Omar Mena et.al. |
2501.09521 |
null |
2025-01-16 |
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation |
Junjie He et.al. |
2501.09503 |
null |
2025-01-16 |
Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis |
Qize Yang et.al. |
2501.09502 |
null |
2025-01-16 |
Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework |
Nuo Chen et.al. |
2501.09493 |
null |
2025-01-16 |
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators |
Zhaocheng Liu et.al. |
2501.09484 |
link |
2025-01-16 |
Guided Debugging of Auto-Translated Code Using Differential Testing |
Shengnan Wu et.al. |
2501.09475 |
null |
2025-01-16 |
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching |
Hualie Jiang et.al. |
2501.09466 |
link |
2025-01-16 |
Pruning for Sparse Diffusion Models based on Gradient Flow |
Ben Wan et.al. |
2501.09464 |
null |
2025-01-16 |
“A Great Start, But…”: Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design |
Tianhao He et.al. |
2501.09457 |
null |
2025-01-16 |
Solving the unsolvable: Translating case law in Hong Kong |
King-kui Sin et.al. |
2501.09444 |
null |
2025-01-16 |
Scaling up self-supervised learning for improved surgical foundation models |
Tim J. M. Jaspers et.al. |
2501.09436 |
link |
2025-01-16 |
CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation |
Hwan Heo et.al. |
2501.09433 |
link |
2025-01-16 |
A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy |
Huandong Wang et.al. |
2501.09431 |
null |
2025-01-16 |
AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring |
Xinyi Wang et.al. |
2501.09428 |
null |
2025-01-16 |
AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling |
Ancheng Xu et.al. |
2501.09426 |
null |
2025-01-16 |
FASP: Fast and Accurate Structured Pruning of Large Language Models |
Hanyu Hu et.al. |
2501.09412 |
null |
2025-01-16 |
MoE $^2$ : Optimizing Collaborative Inference for Edge Large Language Models |
Lyudong Jin et.al. |
2501.09410 |
null |
2025-01-16 |
Adaptive Contextual Caching for Mobile Edge Large Language Model Service |
Guangyuan Liu et.al. |
2501.09383 |
null |
2025-01-16 |
Aligning Instruction Tuning with Pre-training |
Yiming Liang et.al. |
2501.09368 |
null |
2025-01-16 |
PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks |
Huiyou Zhan et.al. |
2501.09367 |
null |
2025-01-16 |
YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks |
Saptarashmi Bandyopadhyay et.al. |
2501.09355 |
null |
2025-01-16 |
UVRM: A Scalable 3D Reconstruction Model from Unposed Videos |
Shiu-hong Kao et.al. |
2501.09347 |
null |
2025-01-16 |
Rational Tuning of LLM Cascades via Probabilistic Modeling |
Michael J. Zellinger et.al. |
2501.09345 |
null |
2025-01-16 |
SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs |
Anbang Ye et.al. |
2501.09316 |
null |
2025-01-16 |
A Study of In-Context-Learning-Based Text-to-SQL Errors |
Jiawei Shen et.al. |
2501.09310 |
link |
2025-01-16 |
To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation |
Kaustubh D. Dhole et.al. |
2501.09292 |
null |
2025-01-16 |
LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport |
Kyeongha Rho et.al. |
2501.09291 |
link |
2025-01-16 |
Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding |
Kohei Torimi et.al. |
2501.09278 |
null |
2025-01-16 |
Large Language Model is Secretly a Protein Sequence Optimizer |
Yinkai Wang et.al. |
2501.09274 |
null |
2025-01-16 |
Perspective Transition of Large Language Models for Solving Subjective Tasks |
Xiaolong Wang et.al. |
2501.09265 |
null |
2025-01-16 |
Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition |
Takaaki Hori et.al. |
2501.09258 |
null |
2025-01-16 |
Clone-Robust AI Alignment |
Ariel D. Procaccia et.al. |
2501.09254 |
null |
2025-01-16 |
Split Fine-Tuning for Large Language Models in Wireless Networks |
Songge Zhang et.al. |
2501.09237 |
null |
2025-01-16 |
Foundations of Large Language Models |
Tong Xiao et.al. |
2501.09223 |
link |
2025-01-16 |
Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs |
Sanchit Sinha et.al. |
2501.09221 |
null |
2025-01-16 |
A Simple Graph Contrastive Learning Framework for Short Text Classification |
Yonghao Liu et.al. |
2501.09219 |
link |
2025-01-16 |
Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics |
Yuanyuan Wei et.al. |
2501.09218 |
null |
2025-01-16 |
Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning |
Yonghao Liu et.al. |
2501.09214 |
link |
2025-01-16 |
FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training |
Hongzhou Yu et.al. |
2501.09213 |
link |
2025-01-15 |
Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures |
Pengru Deng et.al. |
2501.09203 |
null |
2025-01-15 |
Towards Semantics Lifting for Scientific Computing: A Case Study on FFT |
Naifeng Zhang et.al. |
2501.09201 |
null |
2025-01-15 |
Guiding Retrieval using LLM-based Listwise Rankers |
Mandeep Rathee et.al. |
2501.09186 |
link |
2025-01-15 |
The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching |
Yevhen Kostiuk et.al. |
2501.09164 |
null |
2025-01-15 |
Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability |
Stephanie L. Day et.al. |
2501.09158 |
null |
2025-01-15 |
Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History |
Yevhen Kostiuk et.al. |
2501.09154 |
null |
2025-01-15 |
Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation |
Xingxin He et.al. |
2501.09138 |
null |
2025-01-15 |
Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG |
Aditi Singh et.al. |
2501.09136 |
link |
2025-01-15 |
HAFix: History-Augmented Large Language Models for Bug Fixing |
Yu Shi et.al. |
2501.09135 |
link |
2025-01-15 |
Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing |
Eshaan Tanwar et.al. |
2501.09127 |
link |
2025-01-15 |
Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment |
Conrad Borchers et.al. |
2501.09126 |
null |
2025-01-15 |
Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach |
Alireza Ghaffari et.al. |
2501.09107 |
null |
2025-01-15 |
Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites |
Hans W. A. Hanley et.al. |
2501.09102 |
link |
2025-01-15 |
Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative |
Yuqian Sun et.al. |
2501.09099 |
null |
2025-01-15 |
SteLLA: A Structured Grading System Using LLMs with RAG |
Hefei Qiu et.al. |
2501.09092 |
null |
2025-01-15 |
Generative diffusion model with inverse renormalization group flows |
Kanta Masuki et.al. |
2501.09064 |
link |
2025-01-15 |
Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition |
Sneheel Sarangi et.al. |
2501.09056 |
link |
2025-01-15 |
How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias |
Tosin Fadahunsi et.al. |
2501.09014 |
link |
2025-01-15 |
Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians |
Ishan Amin et.al. |
2501.09009 |
link |
2025-01-15 |
Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails |
Shaona Ghosh et.al. |
2501.09004 |
null |
2025-01-15 |
Vision Foundation Models for Computed Tomography |
Suraj Pai et.al. |
2501.09001 |
link |
2025-01-15 |
CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks |
Krit Tangsongcharoen et.al. |
2501.08998 |
link |
2025-01-15 |
VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science |
Youssef Abdalla et.al. |
2501.08995 |
link |
2025-01-15 |
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities |
Haozhe Xie et.al. |
2501.08983 |
link |
2025-01-15 |
Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models |
Emma Croxford et.al. |
2501.08977 |
null |
2025-01-15 |
Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models |
Karukriti Kaushik Ghosh et.al. |
2501.08974 |
null |
2025-01-15 |
Analyzing the Ethical Logic of Six Large Language Models |
W. Russell Neuman et.al. |
2501.08951 |
null |
2025-01-15 |
Applying General Turn-taking Models to Conversational Human-Robot Interaction |
Gabriel Skantze et.al. |
2501.08946 |
null |
2025-01-15 |
Disentangling Exploration of Large Language Models by Optimal Exploitation |
Tim Grams et.al. |
2501.08925 |
null |
2025-01-15 |
GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge |
Liam Dugan et.al. |
2501.08913 |
link |
2025-01-15 |
Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning |
Qinyu Ma et.al. |
2501.08897 |
link |
2025-01-15 |
Connecting SPDE to SGMs |
Junsu Seo et.al. |
2501.08877 |
null |
2025-01-15 |
Exploring Task-Level Optimal Prompts for Visual In-Context Learning |
Yan Zhu et.al. |
2501.08841 |
null |
2025-01-15 |
How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering |
Christoph Treude et.al. |
2501.08774 |
null |
2025-01-15 |
Admitting Ignorance Helps the Video Question Answering Models to Answer |
Haopeng Li et.al. |
2501.08771 |
null |
2025-01-15 |
Enhanced Large Language Models for Effective Screening of Depression and Anxiety |
June M. Liu et.al. |
2501.08769 |
null |
2025-01-15 |
Few-Shot Learner Generalizes Across AI-Generated Image Detection |
Shiyu Wu et.al. |
2501.08763 |
null |
2025-01-15 |
Leveraging LLM Agents for Translating Network Configurations |
Yunze Wei et.al. |
2501.08760 |
null |
2025-01-15 |
The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities |
Irina Bigoulaeva et.al. |
2501.08716 |
link |
2025-01-15 |
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching |
Chuangtao Ma et.al. |
2501.08686 |
link |
2025-01-15 |
RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency |
Siqi Li et.al. |
2501.08682 |
null |
2025-01-15 |
Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery |
Zeqin Liao et.al. |
2501.08670 |
null |
2025-01-15 |
MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities |
Savya Khosla et.al. |
2501.08648 |
null |
2025-01-15 |
Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations |
Kaiyuan Zheng et.al. |
2501.08641 |
null |
2025-01-15 |
SWSC: Shared Weight for Similar Channel in LLM |
Binrui Zeng et.al. |
2501.08631 |
null |
2025-01-15 |
Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models |
Aruna Sankaranarayanan et.al. |
2501.08618 |
link |
2025-01-15 |
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation |
Kaiqu Liang et.al. |
2501.08617 |
null |
2025-01-15 |
Assessing the Alignment of FOL Closeness Metrics with Human Judgement |
Ramya Keerthy Thatikonda et.al. |
2501.08613 |
link |
2025-01-15 |
Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design |
Zhi Zheng et.al. |
2501.08603 |
link |
2025-01-15 |
AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL |
Tyler Stennett et.al. |
2501.08600 |
null |
2025-01-15 |
LlamaRestTest: Effective REST API Testing with Small Language Models |
Myeongsoo Kim et.al. |
2501.08598 |
null |
2025-01-15 |
Sound Scene Synthesis at the DCASE 2024 Challenge |
Mathieu Lagrange et.al. |
2501.08587 |
null |
2025-01-15 |
LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model |
Yuxuan Hu et.al. |
2501.08582 |
null |
2025-01-15 |
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation |
Jiaqi Huang et.al. |
2501.08580 |
link |
2025-01-15 |
Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms |
Kewei Li et.al. |
2501.08570 |
link |
2025-01-15 |
Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications |
Jin Chen et.al. |
2501.08563 |
link |
2025-01-15 |
LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation |
Yiran Tao et.al. |
2501.08558 |
null |
2025-01-15 |
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation |
Sitong Gong et.al. |
2501.08549 |
link |
2025-01-15 |
Comprehensive Subjective and Objective Evaluation Method for Text-generated Video |
Zelu Qi et.al. |
2501.08545 |
null |
2025-01-15 |
Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation |
Jiaxin Guo et.al. |
2501.08523 |
null |
2025-01-14 |
Quantifying the Importance of Data Alignment in Downstream Model Performance |
Krrish Chawla et.al. |
2501.08496 |
null |
2025-01-14 |
Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition |
Md Meem Hossain et.al. |
2501.08471 |
null |
2025-01-14 |
Selective Attention Merging for low resource tasks: A case study of Child ASR |
Natarajan Balaji Shankar et.al. |
2501.08468 |
link |
2025-01-14 |
Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin |
Joao Carmo de Almeida Neto et.al. |
2501.08464 |
null |
2025-01-14 |
Large Language Models For Text Classification: Case Study And Comprehensive Review |
Arina Kostina et.al. |
2501.08457 |
null |
2025-01-14 |
Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack |
Sagiv Antebi et.al. |
2501.08454 |
null |
2025-01-14 |
Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies |
Ajwad Abrar et.al. |
2501.08441 |
link |
2025-01-14 |
SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models |
Anurag Kumar et.al. |
2501.08421 |
null |
2025-01-14 |
Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation |
Seyed Mehdi Rakhtala et.al. |
2501.08420 |
null |
2025-01-14 |
Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data |
Jiaxing Qiu et.al. |
2501.08413 |
link |
2025-01-14 |
OptiChat: Bridging Optimization Models and Practitioners with Large Language Models |
Hao Chen et.al. |
2501.08406 |
link |
2025-01-14 |
Towards Best Practices for Open Datasets for LLM Training |
Stefan Baack et.al. |
2501.08365 |
null |
2025-01-14 |
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise |
Ryan Burgert et.al. |
2501.08331 |
link |
2025-01-14 |
PokerBench: Training Large Language Models to become Professional Poker Players |
Richard Zhuang et.al. |
2501.08328 |
link |
2025-01-14 |
Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks |
Miran Heo et.al. |
2501.08326 |
null |
2025-01-14 |
ADAM-1: AI and Bioinformatics for Alzheimer’s Detection and Microbiome-Clinical Data Integrations |
Ziyuan Huang et.al. |
2501.08324 |
null |
2025-01-14 |
Exploring Robustness of Multilingual LLMs on Real-World Noisy Data |
Amirhossein Aliakbarzadeh et.al. |
2501.08322 |
link |
2025-01-14 |
Enhancing Automated Interpretability with Output-Centric Feature Descriptions |
Yoav Gur-Arieh et.al. |
2501.08319 |
link |
2025-01-14 |
MiniMax-01: Scaling Foundation Models with Lightning Attention |
MiniMax et.al. |
2501.08313 |
null |
2025-01-14 |
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them |
Abhilasha Ravichander et.al. |
2501.08292 |
null |
2025-01-14 |
LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding |
Hongyu Li et.al. |
2501.08282 |
link |
2025-01-14 |
Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing |
Pulkit Arora et.al. |
2501.08276 |
null |
2025-01-14 |
Addressing the sustainable AI trilemma: a case study on LLM agents and RAG |
Hui Wu et.al. |
2501.08262 |
null |
2025-01-14 |
Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models |
Yifu Qiu et.al. |
2501.08248 |
null |
2025-01-14 |
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints |
Jonathan Nöther et.al. |
2501.08246 |
null |
2025-01-14 |
CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset |
Jiawei Du et.al. |
2501.08238 |
null |
2025-01-14 |
Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings |
Paul Joe Maliakel et.al. |
2501.08219 |
null |
2025-01-14 |
ASTRID – An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems |
Mohita Chowdhury et.al. |
2501.08208 |
null |
2025-01-14 |
ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving |
Zain Ul Abedin et.al. |
2501.08203 |
null |
2025-01-14 |
CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation |
Jinjun Peng et.al. |
2501.08200 |
link |
2025-01-14 |
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training |
Yijiong Yu et.al. |
2501.08197 |
link |
2025-01-14 |
PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving |
Ahmet Caner Yüzügüler et.al. |
2501.08192 |
null |
2025-01-14 |
A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation |
Steven Landgraf et.al. |
2501.08188 |
null |
2025-01-15 |
A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following |
Yin Fang et.al. |
2501.08187 |
link |
2025-01-14 |
Potential and Perils of Large Language Models as Judges of Unstructured Textual Data |
Rewina Bedemariam et.al. |
2501.08167 |
null |
2025-01-14 |
I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution |
Soohyeon Choi et.al. |
2501.08165 |
null |
2025-01-14 |
Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data |
Phai Vu Dinh et.al. |
2501.08149 |
null |
2025-01-14 |
Refusal Behavior in Large Language Models: A Nonlinear Perspective |
Fabian Hildebrandt et.al. |
2501.08145 |
link |
2025-01-14 |
Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying |
Jonathan Lyhs et.al. |
2501.08142 |
null |
2025-01-14 |
Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 |
Seamie Hayes et.al. |
2501.08118 |
null |
2025-01-15 |
Consistency of Responses and Continuations Generated by Large Language Models on Social Media |
Wenlu Fan et.al. |
2501.08102 |
null |
2025-01-14 |
Hierarchical Autoscaling for Large Language Model Serving with Chiron |
Archit Patke et.al. |
2501.08090 |
null |
2025-01-14 |
Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving |
Nert Keser et.al. |
2501.08083 |
null |
2025-01-14 |
CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning |
Guoliang He et.al. |
2501.08071 |
link |
2025-01-14 |
A Roadmap to Guide the Integration of LLMs in Hierarchical Planning |
Israel Puerta-Merino et.al. |
2501.08068 |
null |
2025-01-14 |
Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT |
Awritrojit Banerjee et.al. |
2501.08053 |
null |
2025-01-14 |
TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning |
Yao Liang et.al. |
2501.08008 |
null |
2025-01-14 |
LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS |
Muhammad Ashfaq et.al. |
2501.07992 |
null |
2025-01-14 |
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness |
Jiaxing Zhao et.al. |
2501.07978 |
link |
2025-01-14 |
Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models |
Yifang Xu et.al. |
2501.07972 |
null |
2025-01-14 |
Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning |
Jiaqi Hua et.al. |
2501.07959 |
link |
2025-01-14 |
AI Guide Dog: Egocentric Path Prediction on Smartphone |
Aishwarya Jadhav et.al. |
2501.07957 |
null |
2025-01-14 |
Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations |
Waqar Hussain et.al. |
2501.07931 |
null |
2025-01-14 |
Gandalf the Red: Adaptive Security for LLMs |
Niklas Pfister et.al. |
2501.07927 |
link |
2025-01-14 |
VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models |
Hui Kuurila-Zhang et.al. |
2501.07922 |
link |
2025-01-14 |
Large Language Model Interface for Home Energy Management Systems |
François Michelon et.al. |
2501.07919 |
null |
2025-01-14 |
Bridge-SR: Schrödinger Bridge for Efficient SR |
Chang Li et.al. |
2501.07897 |
null |
2025-01-14 |
Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs |
Shuai Wang et.al. |
2501.07892 |
null |
2025-01-14 |
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding |
Zhongxiang Sun et.al. |
2501.07861 |
null |
2025-01-14 |
Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques |
Shobhit Ratan et.al. |
2501.07853 |
null |
2025-01-14 |
Unveiling Provider Bias in Large Language Models for Code Generation |
Xiaoyu Zhang et.al. |
2501.07849 |
null |
2025-01-14 |
Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning |
Haoyu Han et.al. |
2501.07845 |
null |
2025-01-14 |
A Driver Advisory System Based on Large Language Model for High-speed Train |
Y. C. Luo et.al. |
2501.07837 |
null |
2025-01-14 |
Flow: A Modular Approach to Automated Agentic Workflow Generation |
Boye Niu et.al. |
2501.07834 |
link |
2025-01-14 |
Real-time Verification and Refinement of Language Model Text Generation |
Joonho Ko et.al. |
2501.07824 |
null |
2025-01-14 |
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding |
Haomiao Xiong et.al. |
2501.07819 |
link |
2025-01-14 |
A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models |
Kaustubh D. Dhole et.al. |
2501.07818 |
null |
2025-01-14 |
Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models |
Dhruv Dhamani et.al. |
2501.07815 |
null |
2025-01-14 |
Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering |
Feijie Wu et.al. |
2501.07813 |
null |
2025-01-14 |
CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation |
Ruwei Pan et.al. |
2501.07811 |
null |
2025-01-14 |
Visual Language Models as Operator Agents in the Space Domain |
Alejandro Carrasco et.al. |
2501.07802 |
null |
2025-01-14 |
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding |
Zhaokai Wang et.al. |
2501.07783 |
link |
2025-01-14 |
Symmetry-Aware Generative Modeling through Learned Canonicalization |
Kusha Sareen et.al. |
2501.07773 |
null |
2025-01-14 |
Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey |
Bingchen Liu et.al. |
2501.07766 |
null |
2025-01-14 |
On the Statistical Capacity of Deep Generative Models |
Edric Tam et.al. |
2501.07763 |
link |
2025-01-13 |
Advancing Student Writing Through Automated Syntax Feedback |
Kamyar Zeinalipour et.al. |
2501.07740 |
null |
2025-01-13 |
Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens |
Dongwon Kim et.al. |
2501.07730 |
null |
2025-01-13 |
LLMic: Romanian Foundation Language Model |
Vlad-Andrei Bădoiu et.al. |
2501.07721 |
null |
2025-01-13 |
CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory |
Haokun Zhao et.al. |
2501.07674 |
null |
2025-01-13 |
Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning |
Karishma Thakrar et.al. |
2501.07663 |
null |
2025-01-13 |
Large Language Models for Interpretable Mental Health Diagnosis |
Brian Hyeongseok Kim et.al. |
2501.07653 |
null |
2025-01-13 |
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations |
Weixi Feng et.al. |
2501.07647 |
null |
2025-01-13 |
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective |
Kun-Peng Ning et.al. |
2501.07641 |
null |
2025-01-13 |
SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models |
Fabien Bernier et.al. |
2501.07639 |
null |
2025-01-13 |
Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss |
Xinyu Zhang et.al. |
2501.07563 |
null |
2025-01-13 |
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought |
Chengzu Li et.al. |
2501.07542 |
null |
2025-01-13 |
ML Mule: Mobile-Driven Context-Aware Collaborative Learning |
Haoxiang Yu et.al. |
2501.07536 |
null |
2025-01-13 |
Investigating Large Language Models in Inferring Personality Traits from User Conversations |
Jianfeng Zhu et.al. |
2501.07532 |
null |
2025-01-13 |
RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment |
Difei Gu et.al. |
2501.07525 |
link |
2025-01-13 |
Parallel Key-Value Cache Fusion for Position Invariant RAG |
Philhoon Oh et.al. |
2501.07523 |
null |
2025-01-13 |
Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards |
Yangsibo Huang et.al. |
2501.07493 |
null |
2025-01-13 |
TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models |
Thales Sales Almeida et.al. |
2501.07482 |
null |
2025-01-13 |
A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities |
Yihao Liu et.al. |
2501.07468 |
null |
2025-01-13 |
Understanding and Benchmarking Artificial Intelligence: OpenAI’s o3 Is Not AGI |
Rolf Pfister et.al. |
2501.07458 |
null |
2025-01-13 |
Enhancing LLM’s Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection |
Xin Yin et.al. |
2501.07425 |
null |
2025-01-13 |
Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion |
Lala Shakti Swarup Ray et.al. |
2501.07408 |
null |
2025-01-13 |
OCORD: Open-Campus Object Removal Dataset |
Shuo Zhang et.al. |
2501.07397 |
null |
2025-01-13 |
Simulating the Hubbard Model with Equivariant Normalizing Flows |
Dominic Schuh et.al. |
2501.07371 |
null |
2025-01-13 |
Emergent effects of scaling on the functional hierarchies within large language models |
Paul C. Bogdan et.al. |
2501.07359 |
null |
2025-01-13 |
Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring |
Buse Sibel Korkmaz et.al. |
2501.07324 |
link |
2025-01-13 |
FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering |
Erik Henriksson et.al. |
2501.07314 |
link |
2025-01-13 |
The Lessons of Developing Process Reward Models in Mathematical Reasoning |
Zhenru Zhang et.al. |
2501.07301 |
null |
2025-01-13 |
GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction |
Oleg Kobzarev et.al. |
2501.07295 |
null |
2025-01-13 |
LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks |
Zan-Kai Chong et.al. |
2501.07288 |
null |
2025-01-13 |
Lifelong Learning of Large Language Model based Agents: A Roadmap |
Junhao Zheng et.al. |
2501.07278 |
link |
2025-01-13 |
Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation |
Amir Sartipi et.al. |
2501.07276 |
null |
2025-01-13 |
Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics |
Wonduk Seo et.al. |
2501.07267 |
null |
2025-01-13 |
Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion |
Li Liang et.al. |
2501.07260 |
link |
2025-01-13 |
EdgeTAM: On-Device Track Anything Model |
Chong Zhou et.al. |
2501.07256 |
null |
2025-01-13 |
Large Language Models: New Opportunities for Access to Science |
Jutta Schnabel et.al. |
2501.07250 |
null |
2025-01-13 |
Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training |
Ziqing Wen et.al. |
2501.07237 |
link |
2025-01-13 |
Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction |
Qiaoqiao Ren et.al. |
2501.07224 |
link |
2025-01-13 |
Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing |
Laifa Tao et.al. |
2501.07191 |
null |
2025-01-13 |
Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study |
Huashan Chen et.al. |
2501.07165 |
null |
2025-01-13 |
AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model |
Bangchen Yin et.al. |
2501.07155 |
link |
2025-01-13 |
LLM360 K2: Scaling Up 360-Open-Source Large Language Models |
Zhengzhong Liu et.al. |
2501.07124 |
null |
2025-01-13 |
How GPT learns layer by layer |
Jason Du et.al. |
2501.07108 |
link |
2025-01-13 |
ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training |
Jiayang Wu et.al. |
2501.07078 |
link |
2025-01-13 |
D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation |
Zhejun Zhang et.al. |
2501.07077 |
link |
2025-01-13 |
Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values |
Jing Yao et.al. |
2501.07071 |
null |
2025-01-13 |
Enhancing Image Generation Fidelity via Progressive Prompts |
Zhen Xiong et.al. |
2501.07070 |
link |
2025-01-13 |
Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities |
ZeKe Xiao et.al. |
2501.07058 |
null |
2025-01-13 |
SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation |
Yee-Fan Tan et.al. |
2501.07055 |
null |
2025-01-13 |
PoAct: Policy and Action Dual-Control Agent for Generalized Applications |
Guozhi Yuan et.al. |
2501.07054 |
null |
2025-01-13 |
ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction |
Yan Zhang et.al. |
2501.07051 |
link |
2025-01-13 |
Unveiling the Potential of Text in High-Dimensional Time Series Forecasting |
Xin Zhou et.al. |
2501.07048 |
link |
2025-01-13 |
Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis |
Luwei Zeng et.al. |
2501.07034 |
null |
2025-01-13 |
A Proposed Large Language Model-Based Smart Search for Archive System |
Ha Dung Nguyen et.al. |
2501.07024 |
null |
2025-01-13 |
Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps |
Henry Li et.al. |
2501.06999 |
link |
2025-01-13 |
LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models |
Mozhgan Nasr Azadani et.al. |
2501.06986 |
link |
2025-01-13 |
Combining LLM decision and RL action selection to improve RL policy for adaptive interventions |
Karine Karine et.al. |
2501.06980 |
null |
2025-01-12 |
How is Google using AI for internal code migrations? |
Stoyan Nikolov et.al. |
2501.06972 |
null |
2025-01-12 |
Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives |
Xinyao Ma et.al. |
2501.06964 |
null |
2025-01-12 |
Comparison of Autoencoders for tokenization of ASL datasets |
Vouk Praun-Petrovic et.al. |
2501.06942 |
null |
2025-01-12 |
Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy |
Evgeny Ugolkov et.al. |
2501.06939 |
link |
2025-01-12 |
Harnessing Large Language Models for Disaster Management: A Survey |
Zhenyu Lei et.al. |
2501.06932 |
null |
2025-01-12 |
Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories |
Faaiq Waqar et.al. |
2501.06921 |
null |
2025-01-12 |
Risk-Averse Finetuning of Large Language Models |
Sapana Chaudhary et.al. |
2501.06911 |
link |
2025-01-12 |
Deep Learning and Foundation Models for Weather Prediction: A Survey |
Jimeng Shi et.al. |
2501.06907 |
null |
2025-01-12 |
A Foundational Generative Model for Breast Ultrasound Image Analysis |
Haojun Yu et.al. |
2501.06869 |
null |
2025-01-12 |
Transfer Learning of Tabular Data by Finetuning Large Language Models |
Shourav B. Rabbani et.al. |
2501.06863 |
null |
2025-01-12 |
A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context |
Noureldin Zahran et.al. |
2501.06859 |
null |
2025-01-12 |
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training |
Tianjin Huang et.al. |
2501.06842 |
link |
2025-01-12 |
An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering |
Zaber Al Hassan Ayon et.al. |
2501.06837 |
null |
2025-01-12 |
X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding |
Wenqi Zhou et.al. |
2501.06835 |
null |
2025-01-12 |
LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents |
Augusto Gonzalez-Bonorino et.al. |
2501.06834 |
link |
2025-01-12 |
GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing |
Ruizhe Ou et.al. |
2501.06828 |
null |
2025-01-12 |
Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification |
Shijing Chen et.al. |
2501.06827 |
null |
2025-01-12 |
Event Argument Extraction with Enriched Prompts |
Chen Liang et.al. |
2501.06825 |
link |
2025-01-12 |
A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT |
Yizhou Zhou et.al. |
2501.06819 |
null |
2025-01-12 |
RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models |
Keyan Chen et.al. |
2501.06809 |
link |
2025-01-12 |
Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting |
Yongshuo Zhu et.al. |
2501.06808 |
null |
2025-01-12 |
MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference |
Wenxuan Zeng et.al. |
2501.06807 |
null |
2025-01-12 |
Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences |
Liu Yu et.al. |
2501.06795 |
null |
2025-01-12 |
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes |
Mahmoud Ahmed et.al. |
2501.06785 |
link |
2025-01-12 |
Cost-Effective Robotic Handwriting System with AI Integration |
Tianyi Huang et.al. |
2501.06783 |
null |
2025-01-12 |
Eliza: A Web3 friendly AI Agent Operating System |
Shaw Walters et.al. |
2501.06781 |
link |
2025-01-12 |
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning |
Ji Soo Lee et.al. |
2501.06761 |
link |
2025-01-12 |
Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation |
Shunfan Zheng et.al. |
2501.06741 |
null |
2025-01-12 |
ZOQO: Zero-Order Quantized Optimization |
Noga Bar et.al. |
2501.06736 |
null |
2025-01-12 |
Better Prompt Compression Without Multi-Layer Perceptrons |
Edouardo Honig et.al. |
2501.06730 |
null |
2025-01-12 |
Measuring the Robustness of Reference-Free Dialogue Evaluation Systems |
Justin Vasselli et.al. |
2501.06728 |
link |
2025-01-12 |
Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G |
Zhiyan Liu et.al. |
2501.06726 |
null |
2025-01-12 |
DRDT3: Diffusion-Refined Decision Test-Time Training Model |
Xingshuai Huang et.al. |
2501.06718 |
null |
2025-01-12 |
ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian |
Mykyta Syromiatnikov et.al. |
2501.06715 |
link |
2025-01-12 |
Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management |
Liu Qianli et.al. |
2501.06709 |
null |
2025-01-12 |
Evaluating Sample Utility for Data Selection by Mimicking Model Weights |
Tzu-Heng Huang et.al. |
2501.06708 |
null |
2025-01-12 |
AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds |
Yinfang Chen et.al. |
2501.06706 |
null |
2025-01-12 |
Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese |
Jie Yang et.al. |
2501.06704 |
null |
2025-01-12 |
Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users’ Questions |
Aidan Hogan et.al. |
2501.06699 |
null |
2025-01-12 |
DVM: Towards Controllable LLM Agents in Social Deduction Games |
Zheng Zhang et.al. |
2501.06695 |
null |
2025-01-12 |
TAPO: Task-Referenced Adaptation for Prompt Optimization |
Wenxin Luo et.al. |
2501.06689 |
link |
2025-01-12 |
Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning |
Xiangen Hu et.al. |
2501.06682 |
null |
2025-01-12 |
Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving |
Haoxiang Gao et.al. |
2501.06680 |
null |
2025-01-11 |
Challenging reaction prediction models to generalize to novel chemistry |
John Bradshaw et.al. |
2501.06669 |
link |
2025-01-11 |
Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training |
Sanjit Kakarla et.al. |
2501.06658 |
link |
2025-01-11 |
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings |
Tong Liu et.al. |
2501.06645 |
null |
2025-01-11 |
Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models |
Veronika Smilga et.al. |
2501.06638 |
link |
2025-01-11 |
Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach |
Mohammed Maree et.al. |
2501.06628 |
null |
2025-01-11 |
Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks |
Amr Almorsi et.al. |
2501.06625 |
null |
2025-01-11 |
Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks |
Xuanhao Luo et.al. |
2501.06604 |
null |
2025-01-11 |
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation |
Xuanle Zhao et.al. |
2501.06598 |
link |
2025-01-11 |
ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning |
Xiangru Tang et.al. |
2501.06590 |
link |
2025-01-11 |
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping |
Muru Zhang et.al. |
2501.06589 |
link |
2025-01-10 |
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs |
Omkar Thawakar et.al. |
2501.06186 |
link |
2025-01-10 |
PEACE: Empowering Geologic Map Holistic Understanding with MLLMs |
Yangyu Huang et.al. |
2501.06184 |
null |
2025-01-10 |
VideoAuteur: Towards Long Narrative Video Generation |
Junfei Xiao et.al. |
2501.06173 |
null |
2025-01-10 |
GenMol: A Drug Discovery Generalist with Discrete Diffusion |
Seul Lee et.al. |
2501.06158 |
null |
2025-01-10 |
Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories |
Gerd Kortemeyer et.al. |
2501.06143 |
null |
2025-01-10 |
Supervision policies can shape long-term risk management in general-purpose AI models |
Manuel Cebrian et.al. |
2501.06137 |
link |
2025-01-10 |
Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI |
Yuya Asano et.al. |
2501.06129 |
null |
2025-01-10 |
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding |
Fabian David Schmidt et.al. |
2501.06117 |
link |
2025-01-10 |
From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy |
Elham Aghakhani et.al. |
2501.06101 |
null |
2025-01-10 |
Photokinetics of Photothermal Reactions |
Mounir Maafi et.al. |
2501.06057 |
null |
2025-01-10 |
AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery |
Johann Wenckstern et.al. |
2501.06039 |
link |
2025-01-10 |
Addressing speaker gender bias in large scale speech translation systems |
Shubham Bansal et.al. |
2501.05989 |
null |
2025-01-10 |
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing |
Eklavya Sarkar et.al. |
2501.05987 |
link |
2025-01-10 |
Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys |
Divya Mani Adhikari et.al. |
2501.05985 |
null |
2025-01-10 |
Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea |
Eunjung Cho et.al. |
2501.05981 |
null |
2025-01-10 |
Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory |
Yunmeng Shu et.al. |
2501.05965 |
null |
2025-01-10 |
Effective faking of verbal deception detection with target-aligned adversarial attacks |
Bennett Kleinberg et.al. |
2501.05962 |
null |
2025-01-10 |
Reusable specimen-level inference in computational pathology |
Jakub R. Kaczmarzyk et.al. |
2501.05945 |
link |
2025-01-10 |
DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information |
Yongfan Lai et.al. |
2501.05932 |
link |
2025-01-10 |
LLMs Reproduce Stereotypes of Sexual and Gender Minorities |
Ruby Ostrow et.al. |
2501.05926 |
null |
2025-01-10 |
Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction |
Petraq Nako et.al. |
2501.05925 |
null |
2025-01-10 |
Valley2: Exploring Multimodal Models with Scalable Vision-Language Design |
Ziheng Wu et.al. |
2501.05901 |
link |
2025-01-10 |
Prompt engineering and its implications on the energy consumption of Large Language Models |
Riccardo Rubei et.al. |
2501.05899 |
link |
2025-01-10 |
Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs |
Bianca Raimondi et.al. |
2501.05891 |
link |
2025-01-10 |
Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs |
Dabing Cheng et.al. |
2501.05884 |
null |
2025-01-10 |
VideoRAG: Retrieval-Augmented Generation over Video Corpus |
Soyeong Jeong et.al. |
2501.05874 |
null |
2025-01-10 |
ConSim: Measuring Concept-Based Explanations’ Effectiveness with Automated Simulatability |
Antonin Poché et.al. |
2501.05855 |
link |
2025-01-10 |
Understanding Impact of Human Feedback via Influence Functions |
Taywon Min et.al. |
2501.05790 |
link |
2025-01-10 |
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models |
You Li et.al. |
2501.05767 |
null |
2025-01-10 |
Controlling Large Language Models Through Concept Activation Vectors |
Hanyu Zhang et.al. |
2501.05764 |
null |
2025-01-10 |
StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation |
Shangjin Zhai et.al. |
2501.05763 |
null |
2025-01-10 |
CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech |
Madhurananda Pahar et.al. |
2501.05755 |
null |
2025-01-10 |
Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models |
Sungjae Lee et.al. |
2501.05752 |
null |
2025-01-10 |
TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos |
Korawat Charoenpitaks et.al. |
2501.05733 |
link |
2025-01-10 |
Enabling Scalable Oversight via Self-Evolving Critic |
Zhengyang Tang et.al. |
2501.05727 |
null |
2025-01-10 |
I Can’t Share Code, but I need Translation – An Empirical Study on Code Translation through Federated LLM |
Jahnavi Kumar et.al. |
2501.05724 |
null |
2025-01-10 |
How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond |
Chen Huang et.al. |
2501.05714 |
null |
2025-01-10 |
Multi-Step Reasoning in Korean and the Emergent Mirage |
Guijin Son et.al. |
2501.05712 |
null |
2025-01-10 |
EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model |
Yi He et.al. |
2501.05710 |
null |
2025-01-10 |
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains |
Vighnesh Subramaniam et.al. |
2501.05707 |
null |
2025-01-10 |
Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness |
Audrey Salmon et.al. |
2501.05706 |
null |
2025-01-10 |
Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection |
Feiyi Chen et.al. |
2501.05675 |
null |
2025-01-10 |
Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration |
Zuyuan Zhang et.al. |
2501.05673 |
null |
2025-01-10 |
Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models |
Zheqi Lv et.al. |
2501.05662 |
null |
2025-01-10 |
Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation |
Zheqi Lv et.al. |
2501.05647 |
null |
2025-01-10 |
Iconicity in Large Language Models |
Anna Marklová et.al. |
2501.05643 |
null |
2025-01-10 |
HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection |
Anant Mehta et.al. |
2501.05631 |
link |
2025-01-10 |
The Impact of Model Scaling on Seen and Unseen Language Performance |
Rhitabrat Pokharel et.al. |
2501.05629 |
null |
2025-01-09 |
Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study |
Zhenyu Qi et.al. |
2501.05625 |
null |
2025-01-09 |
Exploring Large Language Models for Translating Romanian Computational Problems into English |
Adrian Marius Dumitran et.al. |
2501.05601 |
null |
2025-01-09 |
Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics |
Gert Aarts et.al. |
2501.05580 |
null |
2025-01-09 |
Exploring Large Language Models (LLMs) through interactive Python activities |
Eugenio Tufino et.al. |
2501.05577 |
link |
2025-01-09 |
LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts |
Yuri Facanha Bezerra et.al. |
2501.05554 |
link |
2025-01-09 |
The dynamics of meaning through time: Assessment of Large Language Models |
Mohamed Taher Alrefaie et.al. |
2501.05552 |
null |
2025-01-09 |
Infecting Generative AI With Viruses |
David Noever et.al. |
2501.05542 |
null |
2025-01-09 |
NSChat: A Chatbot System To Rule Them All |
Zenon Lamprou et.al. |
2501.05541 |
null |
2025-01-09 |
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding |
Xingyu Fu et.al. |
2501.05452 |
null |
2025-01-09 |
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors |
Yifan Yu et.al. |
2501.05446 |
link |
2025-01-09 |
Consistent Flow Distillation for Text-to-3D Generation |
Runjie Yan et.al. |
2501.05445 |
null |
2025-01-09 |
Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark |
Yunzhuo Hao et.al. |
2501.05444 |
link |
2025-01-09 |
A survey of textual cyber abuse detection using cutting-edge language models and large language models |
Jose A. Diaz-Garcia et.al. |
2501.05443 |
null |
2025-01-09 |
Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation |
Xuyi Meng et.al. |
2501.05427 |
null |
2025-01-09 |
Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers |
Jerry Chongyi Hu et.al. |
2501.05423 |
null |
2025-01-09 |
Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation |
Darius Petermann et.al. |
2501.05413 |
null |
2025-01-10 |
Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics |
Maximilian Alber et.al. |
2501.05409 |
null |
2025-01-09 |
TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts |
Yu-Hao Huang et.al. |
2501.05403 |
link |
2025-01-09 |
Mechanistic understanding and validation of large AI models with SemanticLens |
Maximilian Dreyer et.al. |
2501.05398 |
null |
2025-01-09 |
FairCode: Evaluating Social Bias of LLMs in Code Generation |
Yongkang Du et.al. |
2501.05396 |
link |
2025-01-09 |
Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models |
Kristian G. Barman et.al. |
2501.05382 |
null |
2025-01-09 |
Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance |
Dimitrios Gerogiannis et.al. |
2501.05379 |
null |
2025-01-09 |
Accelerated Diffusion Models via Speculative Sampling |
Valentin De Bortoli et.al. |
2501.05370 |
null |
2025-01-09 |
Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction |
Hantao Lou et.al. |
2501.05336 |
link |
2025-01-09 |
“What’s Happening”- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles |
Xuewen Luo et.al. |
2501.05322 |
null |
2025-01-09 |
Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning |
Nora Gourmelon et.al. |
2501.05281 |
link |
2025-01-09 |
CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models |
Fabian Hörst et.al. |
2501.05269 |
link |
2025-01-09 |
Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal |
Wanli Ma et.al. |
2501.05265 |
null |
2025-01-09 |
CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models |
Yewei Song et.al. |
2501.05255 |
null |
2025-01-09 |
From Scientific Texts to Verifiable Code: Automating the Process with Transformers |
Changjie Wang et.al. |
2501.05252 |
null |
2025-01-09 |
RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models |
Peizhuo Lv et.al. |
2501.05249 |
null |
2025-01-09 |
Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning |
Laura Puccioni et.al. |
2501.05248 |
null |
2025-01-09 |
Online Prompt and Solver Selection for Program Synthesis |
Yixuan Li et.al. |
2501.05247 |
null |
2025-01-09 |
Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs |
Artem Fedorchenko et.al. |
2501.05234 |
null |
2025-01-09 |
Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection |
Pei-Kang Lee et.al. |
2501.05228 |
null |
2025-01-09 |
Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes |
Ludwic Leonard et.al. |
2501.05226 |
null |
2025-01-09 |
Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond |
Tomas Goldsack et.al. |
2501.05224 |
null |
2025-01-09 |
A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education |
Ziqing Li et.al. |
2501.05220 |
null |
2025-01-09 |
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration |
Xuyang Liu et.al. |
2501.05179 |
link |
2025-01-09 |
Emergence of human-like polarization among large language model agents |
Jinghua Piao et.al. |
2501.05171 |
null |
2025-01-09 |
Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering |
Matteo Esposito et.al. |
2501.05165 |
null |
2025-01-09 |
Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier |
Yufei Shang et.al. |
2501.05155 |
null |
2025-01-09 |
DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving |
Xuran Zheng et.al. |
2501.05081 |
null |
2025-01-09 |
Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization |
Harshith Manjunath et.al. |
2501.05079 |
null |
2025-01-09 |
Analyzing Memorization in Large Language Models through the Lens of Model Attribution |
Tarun Ram Menta et.al. |
2501.05078 |
link |
2025-01-09 |
A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model |
Shuo Tong et.al. |
2501.05075 |
null |
2025-01-09 |
Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning |
Huabin Liu et.al. |
2501.05069 |
null |
2025-01-09 |
LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding |
Jiaxing Zhao et.al. |
2501.05067 |
null |
2025-01-09 |
Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators |
Leonard Lupin-Jimenez et.al. |
2501.05058 |
null |
2025-01-09 |
LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models |
Zengqi Peng et.al. |
2501.05057 |
null |
2025-01-09 |
On the Generalizability of Transformer Models to Code Completions of Different Lengths |
Nathan Cooper et.al. |
2501.05051 |
null |
2025-01-09 |
SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution |
Chengxing Xie et.al. |
2501.05040 |
link |
2025-01-09 |
Enhancing Human-Like Responses in Large Language Models |
Ethem Yağız Çalık et.al. |
2501.05032 |
null |
2025-01-09 |
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark |
Ronghao Dang et.al. |
2501.05031 |
link |
2025-01-09 |
A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications |
Ofir Marom et.al. |
2501.05030 |
null |
2025-01-09 |
TreeKV: Smooth Key-Value Cache Compression with Tree Structures |
Ziwei He et.al. |
2501.04987 |
null |
2025-01-09 |
SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs |
Muhammad Salman et.al. |
2501.04985 |
null |
2025-01-09 |
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer |
Hangzhou He et.al. |
2501.04975 |
link |
2025-01-09 |
Demystifying Domain-adaptive Post-training for Financial LLMs |
Zixuan Ke et.al. |
2501.04961 |
link |
2025-01-09 |
Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments |
Yifan Xu et.al. |
2501.04947 |
null |
2025-01-09 |
Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models |
Qingyu Ren et.al. |
2501.04945 |
link |
2025-01-09 |
Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency |
Shiji Zhao et.al. |
2501.04931 |
null |
2025-01-09 |
Investigating Numerical Translation with Large Language Models |
Wei Tang et.al. |
2501.04927 |
null |
2025-01-09 |
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching |
Jun-Hak Yun et.al. |
2501.04926 |
link |
2025-01-09 |
HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers |
Yiyao Yang et.al. |
2501.04908 |
link |
2025-01-09 |
JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis |
Jun-Hyeok Cha et.al. |
2501.04904 |
null |
2025-01-09 |
ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries |
Keke Huang et.al. |
2501.04901 |
null |
2025-01-09 |
SUGAR: Leveraging Contextual Confidence for Smarter Retrieval |
Hanna Zubkova et.al. |
2501.04899 |
null |
2025-01-08 |
Leveraging Log Probabilities in Language Models to Forecast Future Events |
Tommaso Soru et.al. |
2501.04880 |
null |
2025-01-08 |
Real-Time Textless Dialogue Generation |
Long Mai et.al. |
2501.04877 |
link |
2025-01-08 |
Modelling complex proton transport phenomena – Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields |
Malte Grunert et.al. |
2501.04876 |
null |
2025-01-08 |
Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware |
Brandon J Walton et.al. |
2501.04848 |
null |
2025-01-08 |
Do Code LLMs Understand Design Patterns? |
Zhenyu Pan et.al. |
2501.04835 |
null |
2025-01-08 |
On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability |
Andreas Vogelsang et.al. |
2501.04810 |
null |
2025-01-08 |
IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX |
Erik Recio-Armengol et.al. |
2501.04776 |
link |
2025-01-08 |
Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations |
Kirandeep Kaur et.al. |
2501.04762 |
null |
2025-01-08 |
Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch |
Phillip Richter et.al. |
2501.04755 |
null |
2025-01-08 |
EditAR: Unified Conditional Generation with Autoregressive Models |
Jiteng Mu et.al. |
2501.04699 |
null |
2025-01-08 |
Re-ranking the Context for Multimodal Retrieval Augmented Generation |
Matin Mortaheb et.al. |
2501.04695 |
null |
2025-01-08 |
SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images |
Zixuan Huang et.al. |
2501.04689 |
null |
2025-01-08 |
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics |
Ruilin Luo et.al. |
2501.04686 |
link |
2025-01-08 |
Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations |
Archita Srivastava et.al. |
2501.04675 |
null |
2025-01-08 |
Assessing Language Comprehension in Large Language Models Using Construction Grammar |
Wesley Scivetti et.al. |
2501.04661 |
null |
2025-01-08 |
Multi-task retriever fine-tuning for domain-specific and efficient RAG |
Patrice Béchard et.al. |
2501.04652 |
null |
2025-01-08 |
FlairGPT: Repurposing LLMs for Interior Designs |
Gabrielle Littlefair et.al. |
2501.04648 |
null |
2025-01-08 |
Knowledge Retrieval Based on Generative AI |
Te-Lun Yang et.al. |
2501.04635 |
null |
2025-01-08 |
“Can you be my mum?”: Manipulating Social Robots in the Large Language Models Era |
Giulio Antonio Abbo et.al. |
2501.04633 |
null |
2025-01-09 |
MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation |
Daniele Molino et.al. |
2501.04614 |
null |
2025-01-08 |
Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning |
Ivan Kankeu et.al. |
2501.04591 |
link |
2025-01-08 |
Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models |
Miaoyang He et.al. |
2501.04582 |
null |
2025-01-08 |
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection |
Yuhang Liu et.al. |
2501.04575 |
link |
2025-01-09 |
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis |
Run Luo et.al. |
2501.04561 |
link |
2025-01-08 |
The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? |
Christopher Lazik et.al. |
2501.04543 |
null |
2025-01-08 |
Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time |
Uri Berger et.al. |
2501.04513 |
null |
2025-01-08 |
CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection |
Ruijun Feng et.al. |
2501.04510 |
null |
2025-01-08 |
Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction |
Guofeng Yang et.al. |
2501.04487 |
null |
2025-01-08 |
When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages |
Archchana Sindhujan et.al. |
2501.04473 |
null |
2025-01-08 |
Hidden Entity Detection from GitHub Leveraging Large Language Models |
Lu Gan et.al. |
2501.04455 |
link |
2025-01-08 |
Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions |
Doaa Mahmud et.al. |
2501.04437 |
null |
2025-01-08 |
Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions |
Na Yan et.al. |
2501.04436 |
null |
2025-01-08 |
End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach |
H. M. Shadman Tabib et.al. |
2501.04425 |
null |
2025-01-08 |
SEO: Stochastic Experience Optimization for Large Language Models |
Jitao Xu et.al. |
2501.04393 |
null |
2025-01-08 |
iFADIT: Invertible Face Anonymization via Disentangled Identity Transform |
Lin Yuan et.al. |
2501.04390 |
null |
2025-01-08 |
DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications |
Feng Liu et.al. |
2501.04366 |
link |
2025-01-08 |
Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting |
Dong-Hai Zhu et.al. |
2501.04341 |
link |
2025-01-09 |
Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models |
Haonan Shi et.al. |
2501.04323 |
null |
2025-01-08 |
Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts |
Preethi Seshadri et.al. |
2501.04316 |
link |
2025-01-08 |
RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation |
Jun Liu et.al. |
2501.04315 |
null |
2025-01-08 |
Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models |
Kunpeng Zhang et.al. |
2501.04312 |
null |
2025-01-08 |
LLM4SR: A Survey on Large Language Models for Scientific Research |
Ziming Luo et.al. |
2501.04306 |
link |
2025-01-08 |
Multimodal Graph Constrastive Learning and Prompt for ChartQA |
Yue Dai et.al. |
2501.04303 |
null |
2025-01-08 |
H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving |
Siran Chen et.al. |
2501.04302 |
null |
2025-01-08 |
An Analysis of Model Robustness across Concurrent Distribution Shifts |
Myeongho Jeon et.al. |
2501.04288 |
null |
2025-01-08 |
Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models |
Bahman Torkamandi et.al. |
2501.04286 |
link |
2025-01-08 |
Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking |
Tianqi Ren et.al. |
2501.04285 |
null |
2025-01-08 |
OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments |
Yujie Tang et.al. |
2501.04279 |
null |
2025-01-08 |
Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering |
Christophe Bajan et.al. |
2501.04277 |
link |
2025-01-08 |
Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation |
Senwei Xie et.al. |
2501.04268 |
null |
2025-01-08 |
Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning |
Lang Xu et.al. |
2501.04266 |
null |
2025-01-08 |
IOLBENCH: Benchmarking LLMs on Linguistic Reasoning |
Satyam Goyal et.al. |
2501.04249 |
link |
2025-01-08 |
TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events |
Jian-Hua Fang et.al. |
2501.04247 |
null |
2025-01-08 |
Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks |
Rachel Longjohn et.al. |
2501.04234 |
null |
2025-01-07 |
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation |
Alireza Salemi et.al. |
2501.04167 |
null |
2025-01-07 |
AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots |
Shaoyue Wen et.al. |
2501.04156 |
link |
2025-01-07 |
Multilingual Open QA on the MIA Shared Task |
Navya Yarrabelly et.al. |
2501.04153 |
null |
2025-01-07 |
The angular momentum spiral of the Milky Way disc in Gaia |
Rashid Yaaqib et.al. |
2501.04095 |
null |
2025-01-07 |
More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives |
Xiaoqing Zhang et.al. |
2501.04070 |
link |
2025-01-07 |
ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono |
Jingquan Wang et.al. |
2501.04062 |
null |
2025-01-07 |
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving |
Lingdong Kong et.al. |
2501.04005 |
null |
2025-01-07 |
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos |
Haobo Yuan et.al. |
2501.04001 |
link |
2025-01-07 |
RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance |
Matin Mortaheb et.al. |
2501.03995 |
null |
2025-01-07 |
Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance |
Adil Rengim Cetingoz et.al. |
2501.03993 |
null |
2025-01-07 |
Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles |
Yuxi Xia et.al. |
2501.03991 |
null |
2025-01-07 |
(De)-Indexing and the Right to be Forgotten |
Salvatore Vilella et.al. |
2501.03989 |
null |
2025-01-07 |
VLM-driven Behavior Tree for Context-aware Task Planning |
Naoki Wake et.al. |
2501.03968 |
link |
2025-01-07 |
Vision Language Models as Values Detectors |
Giulio Antonio Abbo et.al. |
2501.03957 |
null |
2025-01-07 |
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States |
Jurgita Kapočiūtė-Dzikienė et.al. |
2501.03952 |
null |
2025-01-07 |
Synthetic Data Privacy Metrics |
Amy Steier et.al. |
2501.03941 |
null |
2025-01-07 |
Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection |
Pablo Miralles-González et.al. |
2501.03940 |
null |
2025-01-07 |
A precise asymptotic analysis of learning diffusion models: theory and insights |
Hugo Cui et.al. |
2501.03937 |
link |
2025-01-07 |
Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study |
Ramya Jonnala et.al. |
2501.03904 |
null |
2025-01-07 |
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token |
Shaolei Zhang et.al. |
2501.03895 |
link |
2025-01-07 |
AlphaPO – Reward shape matters for LLM alignment |
Aman Gupta et.al. |
2501.03884 |
null |
2025-01-07 |
CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds |
Keonwoo Kim et.al. |
2501.03879 |
null |
2025-01-07 |
Progressive Document-level Text Simplification via Large Language Models |
Dengzhao Fang et.al. |
2501.03857 |
null |
2025-01-07 |
MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention |
Aadya Arora et.al. |
2501.03839 |
null |
2025-01-07 |
Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging |
Simon W. Penninga et.al. |
2501.03825 |
null |
2025-01-08 |
MADation: Face Morphing Attack Detection with Foundation Models |
Eduarda Caldeira et.al. |
2501.03800 |
link |
2025-01-07 |
KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration |
Chengyuan Li et.al. |
2501.03786 |
null |
2025-01-07 |
Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series |
Yuxiao Hu et.al. |
2501.03747 |
null |
2025-01-07 |
Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein |
Xiaotong Guo et.al. |
2501.03722 |
null |
2025-01-07 |
Motion-Aware Generative Frame Interpolation |
Guozhen Zhang et.al. |
2501.03699 |
null |
2025-01-07 |
SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment |
Yuchun Fan et.al. |
2501.03681 |
link |
2025-01-07 |
Effective and Efficient Mixed Precision Quantization of Speech Foundation Models |
Haoning Xu et.al. |
2501.03643 |
null |
2025-01-07 |
CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems |
Zhaonan Wu et.al. |
2501.03626 |
link |
2025-01-07 |
LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment |
Gaoussou Youssouf Kebe et.al. |
2501.03624 |
null |
2025-01-07 |
Cosmos World Foundation Model Platform for Physical AI |
NVIDIA et.al. |
2501.03575 |
link |
2025-01-07 |
From Code to Compliance: Assessing ChatGPT’s Utility in Designing an Accessible Webpage – A Case Study |
Ammar Ahmed et.al. |
2501.03572 |
null |
2025-01-07 |
What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs |
Muneera Bano et.al. |
2501.03569 |
null |
2025-01-07 |
Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities |
Benedikt Reitemeyer et.al. |
2501.03566 |
null |
2025-01-07 |
Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis |
Haoran Lai et.al. |
2501.03565 |
null |
2025-01-07 |
PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models |
Lingzhi Yuan et.al. |
2501.03544 |
null |
2025-01-07 |
Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions |
Weijieying Ren et.al. |
2501.03540 |
null |
2025-01-07 |
Deep Learning for Pathological Speech: A Survey |
Shakeel A. Sheikh et.al. |
2501.03536 |
null |
2025-01-08 |
SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving |
Xuewen Luo et.al. |
2501.03535 |
null |
2025-01-07 |
A generative approach for lensless imaging in low-light conditions |
Ziyang Liu et.al. |
2501.03511 |
null |
2025-01-07 |
A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models |
Shuyang Wang et.al. |
2501.03508 |
null |
2025-01-07 |
Textualize Visual Prompt for Image Editing via Diffusion Bridge |
Pengcheng Xu et.al. |
2501.03495 |
null |
2025-01-07 |
Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment |
Prashant Trivedi et.al. |
2501.03486 |
null |
2025-01-07 |
Reading with Intent – Neutralizing Intent |
Benjamin Reichman et.al. |
2501.03475 |
null |
2025-01-07 |
Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning |
Chuang Niu et.al. |
2501.03469 |
link |
2025-01-07 |
MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems |
Yannis Katsis et.al. |
2501.03468 |
link |
2025-01-07 |
ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation |
Yu-Cheng Liu et.al. |
2501.03462 |
null |
2025-01-07 |
Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation |
Xiao Wang et.al. |
2501.03458 |
link |
2025-01-07 |
CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering |
Jialiang Chen et.al. |
2501.03447 |
null |
2025-01-07 |
LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models |
Mohamad Fakih et.al. |
2501.03446 |
null |
2025-01-07 |
Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology |
Sarah E. Finch et.al. |
2501.03441 |
link |
2025-01-06 |
SALT: Sales Autocompletion Linked Business Tables Dataset |
Tassilo Klein et.al. |
2501.03413 |
link |
2025-01-06 |
BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations |
Simone Giovannini et.al. |
2501.03403 |
null |
2025-01-06 |
DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes |
Xuyang Wang et.al. |
2501.03397 |
link |
2025-01-06 |
Evolved Quantum Boltzmann Machines |
Michele Minervini et.al. |
2501.03367 |
null |
2025-01-06 |
CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets |
Tanay Agrawal et.al. |
2501.03332 |
null |
2025-01-06 |
LiLMaps: Learnable Implicit Language Maps |
Evgenii Kruzhkov et.al. |
2501.03304 |
null |
2025-01-06 |
A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval |
Shuo Tong et.al. |
2501.03295 |
null |
2025-01-06 |
Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model |
Naibo Wang et.al. |
2501.03292 |
null |
2025-01-06 |
ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning |
Pengwei Tang et.al. |
2501.03291 |
link |
2025-01-06 |
CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models |
Zhenyu Xu et.al. |
2501.03288 |
null |
2025-01-06 |
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning |
Beichen Zhang et.al. |
2501.03226 |
link |
2025-01-06 |
Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text |
Ayat Najjar et.al. |
2501.03212 |
null |
2025-01-06 |
Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity |
Ayat A. Najjar et.al. |
2501.03203 |
null |
2025-01-06 |
CLIX: Cross-Lingual Explanations of Idiomatic Expressions |
Aaron Gluck et.al. |
2501.03191 |
null |
2025-01-06 |
Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text |
Ali Al-Lawati et.al. |
2501.03166 |
link |
2025-01-06 |
Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy |
Risha Goel et.al. |
2501.03153 |
link |
2025-01-06 |
Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches |
Alhassan Mumuni et.al. |
2501.03151 |
null |
2025-01-06 |
VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity |
Yerong Li et.al. |
2501.03139 |
null |
2025-01-07 |
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models |
Mingyang Song et.al. |
2501.03124 |
link |
2025-01-06 |
CAT: Content-Adaptive Image Tokenization |
Junhong Shen et.al. |
2501.03120 |
null |
2025-01-06 |
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases |
Dylan Bouchard et.al. |
2501.03112 |
link |
2025-01-06 |
Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling |
Aseem Srivastava et.al. |
2501.03088 |
null |
2025-01-06 |
Retrieval-Augmented TLAPS Proof Generation with Large Language Models |
Yuhao Zhou et.al. |
2501.03073 |
null |
2025-01-06 |
ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events |
Duygu Sezen Islakoglu et.al. |
2501.03040 |
null |
2025-01-06 |
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning |
Zhen Li et.al. |
2501.03035 |
null |
2025-01-06 |
TransPixar: Advancing Text-to-Video Generation with Transparency |
Luozhou Wang et.al. |
2501.03006 |
link |
2025-01-06 |
CALM: Curiosity-Driven Auditing for Large Language Models |
Xiang Zheng et.al. |
2501.02997 |
link |
2025-01-06 |
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation |
Zhi Qu et.al. |
2501.02979 |
link |
2025-01-06 |
FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models |
Zhuo Chen et.al. |
2501.02968 |
null |
2025-01-07 |
Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild |
Wanpeng Hu et.al. |
2501.02964 |
link |
2025-01-07 |
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild |
Jiawei Liu et.al. |
2501.02962 |
null |
2025-01-06 |
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features |
Shi Bin Hoo et.al. |
2501.02945 |
link |
2025-01-07 |
Inhibition of bacterial growth by antibiotics |
Barnabe Ledoux et.al. |
2501.02944 |
null |
2025-01-06 |
Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions |
Jianhua Pei et.al. |
2501.02928 |
null |
2025-01-06 |
DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model |
Hao Yu et.al. |
2501.02901 |
link |
2025-01-06 |
FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection |
Guray Ozgur et.al. |
2501.02892 |
link |
2025-01-06 |
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs |
Hui Sun et.al. |
2501.02885 |
null |
2025-01-06 |
IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment |
Yiming Zhang et.al. |
2501.02869 |
null |
2025-01-06 |
Large Language Models for Video Surveillance Applications |
Ulindu De Silva et.al. |
2501.02850 |
null |
2025-01-06 |
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification |
Yubo Wang et.al. |
2501.02844 |
null |
2025-01-06 |
Foundations of GenIR |
Qingyao Ai et.al. |
2501.02842 |
null |
2025-01-06 |
An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models |
Dezhi Ran et.al. |
2501.02829 |
null |
2025-01-06 |
InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion |
Zhaoyi Yan et.al. |
2501.02795 |
null |
2025-01-06 |
CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation |
Yuanhong Chen et.al. |
2501.02786 |
null |
2025-01-06 |
GeAR: Generation Augmented Retrieval |
Haoyu Liu et.al. |
2501.02772 |
null |
2025-01-06 |
Visual Large Language Models for Generalized and Specialized Applications |
Yifan Li et.al. |
2501.02765 |
link |
2025-01-06 |
Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? |
Hongyi Miao et.al. |
2501.02751 |
null |
2025-01-06 |
Artificial Intelligence in Creative Industries: Advances Prior to 2025 |
Nantheera Anantrasirichai et.al. |
2501.02725 |
null |
2025-01-06 |
KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models |
Zaiyi Zheng et.al. |
2501.02711 |
null |
2025-01-06 |
QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance |
Binita Saha et.al. |
2501.02702 |
null |
2025-01-06 |
EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models |
Andrés Villa et.al. |
2501.02699 |
null |
2025-01-05 |
GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking |
Weikang Bian et.al. |
2501.02690 |
null |
2025-01-05 |
Decoding specialised feature neurons in LLMs with the final projection layer |
Harry J Davies et.al. |
2501.02688 |
null |
2025-01-05 |
From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering |
Wen-ran Li et.al. |
2501.02680 |
null |
2025-01-05 |
A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model |
Shivaram Kalyanakrishnan et.al. |
2501.02652 |
null |
2025-01-05 |
Representation Learning of Lab Values via Masked AutoEncoder |
David Restrepo et.al. |
2501.02648 |
link |
2025-01-05 |
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense |
Yang Ouyang et.al. |
2501.02629 |
link |
2025-01-05 |
Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets |
Mahmoud Jahanshahi et.al. |
2501.02628 |
null |
2025-01-05 |
HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning |
Saleh Ashkboos et.al. |
2501.02625 |
link |
2025-01-05 |
LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment |
Yifei Liu et.al. |
2501.02621 |
null |
2025-01-05 |
TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms |
Jovan Stojkovic et.al. |
2501.02600 |
null |
2025-01-05 |
LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations |
Jiaping Wang et.al. |
2501.02573 |
link |
2025-01-05 |
Multi-LLM Collaborative Caption Generation in Scientific Documents |
Jaeyoung Kim et.al. |
2501.02552 |
link |
2025-01-05 |
Transformers Simulate MLE for Sequence Generation in Bayesian Networks |
Yuan Cao et.al. |
2501.02547 |
null |
2025-01-05 |
Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm |
Ljubisa Bojic et.al. |
2501.02532 |
null |
2025-01-05 |
Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI |
Ljubisa Bojic et.al. |
2501.02531 |
null |
2025-01-05 |
Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks |
Leo Franklin et.al. |
2501.02527 |
null |
2025-01-05 |
Unified Guidance for Geometry-Conditioned Molecular Generation |
Sirine Ayadi et.al. |
2501.02526 |
null |
2025-01-05 |
Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors |
Minglin Chen et.al. |
2501.02519 |
null |
2025-01-05 |
CHAIR-Classifier of Hallucination as Improver |
Ao Sun et.al. |
2501.02518 |
link |
2025-01-05 |
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use |
Junjie Ye et.al. |
2501.02506 |
null |
2025-01-05 |
Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons |
Sebastian Morel-Balbi et.al. |
2501.02505 |
null |
2025-01-05 |
ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling |
Chaojie Mao et.al. |
2501.02487 |
null |
2025-01-05 |
LLMPC: Large Language Model Predictive Control |
Gabriel Maher et.al. |
2501.02486 |
link |
2025-01-05 |
Decoding News Bias: Multi Bias Detection in News Articles |
Bhushan Santosh Shah et.al. |
2501.02482 |
null |
2025-01-05 |
Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine |
Yishen Liu et.al. |
2501.02471 |
null |
2025-01-05 |
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera |
Yuliang Guo et.al. |
2501.02464 |
link |
2025-01-05 |
Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications |
Zhe Chen et.al. |
2501.02460 |
null |
2025-01-05 |
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap |
Hyunwoo Ko et.al. |
2501.02448 |
null |
2025-01-05 |
RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework |
Kun Wang et.al. |
2501.02446 |
null |
2025-01-05 |
A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models |
Yinpeng Cai et.al. |
2501.02441 |
null |
2025-01-05 |
Efficient Deployment of Large Language Models on Resource-constrained Devices |
Zhiwei Yao et.al. |
2501.02438 |
null |
2025-01-05 |
FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance |
Haicheng Wang et.al. |
2501.02430 |
link |
2025-01-05 |
GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems |
Mehmet Deniz Türkmen et.al. |
2501.02408 |
null |
2025-01-04 |
Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities |
Tara Radvand et.al. |
2501.02406 |
null |
2025-01-04 |
Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers |
Markus J. Buehler et.al. |
2501.02393 |
link |
2025-01-04 |
Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations |
Kangyu Zhu et.al. |
2501.02385 |
null |
2025-01-04 |
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison |
Tsz Kin Lam et.al. |
2501.02370 |
null |
2025-01-04 |
Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving |
Sanghyun Park et.al. |
2501.02348 |
null |
2025-01-04 |
Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support |
Florian Putz et.al. |
2501.02346 |
null |
2025-01-04 |
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility |
Yonglin Tian et.al. |
2501.02341 |
link |
2025-01-04 |
AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference |
Zhuomin He et.al. |
2501.02336 |
link |
2025-01-04 |
Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications |
Jodi M. Casabianca et.al. |
2501.02334 |
null |
2025-01-04 |
Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance |
Marta Gentiloni-Silveri et.al. |
2501.02298 |
null |
2025-01-04 |
Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection |
Yachao Zhao et.al. |
2501.02295 |
null |
2025-01-04 |
Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control |
Yongjeong Oh et.al. |
2501.02273 |
null |
2025-01-04 |
What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph |
Yutao Jiang et.al. |
2501.02268 |
link |
2025-01-04 |
Unsupervised Class Generation to Expand Semantic Segmentation Datasets |
Javier Montalvo et.al. |
2501.02264 |
null |
2025-01-04 |
Financial Named Entity Recognition: How Far Can LLM Go? |
Yi-Te Lu et.al. |
2501.02237 |
link |
2025-01-04 |
Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends |
Camille Barboule et.al. |
2501.02235 |
null |
2025-01-04 |
Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection |
S M Mostaq Hossain et.al. |
2501.02229 |
null |
2025-01-04 |
Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation |
Shijie Wang et.al. |
2501.02226 |
null |
2025-01-04 |
Can ChatGPT implement finite element models for geotechnical engineering applications? |
Taegu Kim et.al. |
2501.02199 |
null |
2025-01-04 |
EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks |
Shixuan Liu et.al. |
2501.02192 |
null |
2025-01-04 |
On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing |
Jianwei Wang et.al. |
2501.02191 |
link |
2025-01-04 |
The Application of Large Language Models in Recommendation Systems |
Peiyang Yu et.al. |
2501.02178 |
null |
2025-01-04 |
The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit |
Huixue Zhou et.al. |
2501.02173 |
null |
2025-01-04 |
Personalized Graph-Based Retrieval for Large Language Models |
Steven Au et.al. |
2501.02157 |
link |
2025-01-04 |
Table as Thought: Exploring Structured Thoughts in LLM Reasoning |
Zhenjie Sun et.al. |
2501.02152 |
null |
2025-01-04 |
Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN |
Yanxi Chen et.al. |
2501.02146 |
null |
2025-01-03 |
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction |
Chaoyou Fu et.al. |
2501.01957 |
link |
2025-01-03 |
Metadata Conditioning Accelerates Language Model Pre-training |
Tianyu Gao et.al. |
2501.01956 |
link |
2025-01-03 |
MADGEN – Mass-Spec attends to De Novo Molecular generation |
Yinkai Wang et.al. |
2501.01950 |
null |
2025-01-03 |
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap |
Weizhi Zhang et.al. |
2501.01945 |
link |
2025-01-03 |
Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models |
Manh Duong Nguyen et.al. |
2501.01932 |
link |
2025-01-03 |
Virgo: A Preliminary Exploration on Reproducing o1-like MLLM |
Yifan Du et.al. |
2501.01904 |
link |
2025-01-03 |
EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation |
Siyuan Huang et.al. |
2501.01895 |
null |
2025-01-03 |
Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions |
Rachneet Sachdeva et.al. |
2501.01872 |
link |
2025-01-03 |
Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification |
Xiangxiang Dai et.al. |
2501.01849 |
link |
2025-01-03 |
MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning |
Pu Yang et.al. |
2501.01834 |
null |
2025-01-03 |
Time Series Language Model for Descriptive Caption Generation |
Mohamed Trabelsi et.al. |
2501.01832 |
null |
2025-01-03 |
Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models |
Yanjiang Liu et.al. |
2501.01830 |
null |
2025-01-03 |
SDPO: Segment-Level Direct Preference Optimization for Social Agents |
Aobo Kong et.al. |
2501.01821 |
link |
2025-01-03 |
BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction |
Ferhat Ozgur Catak et.al. |
2501.01802 |
link |
2025-01-03 |
Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation |
Mohammad Khalil et.al. |
2501.01793 |
link |
2025-01-03 |
Efficient LLM Inference with Activation Checkpointing and Hybrid Caching |
Sanghyeon Lee et.al. |
2501.01792 |
null |
2025-01-03 |
Nonparametric estimation of a factorizable density using diffusion models |
Hyeok Kyu Kwon et.al. |
2501.01783 |
null |
2025-01-03 |
SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation |
Mingjie Li et.al. |
2501.01765 |
null |
2025-01-03 |
Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models |
Andrea Matteazzi et.al. |
2501.01761 |
null |
2025-01-03 |
MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling |
Simon Rouard et.al. |
2501.01757 |
null |
2025-01-03 |
Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation |
Kangcheng Luo et.al. |
2501.01743 |
null |
2025-01-03 |
How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models |
Simone Corbo et.al. |
2501.01741 |
null |
2025-01-03 |
AR4D: Autoregressive 4D Generation from Monocular Videos |
Hanxin Zhu et.al. |
2501.01722 |
null |
2025-01-03 |
Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models |
Guosheng Zhang et.al. |
2501.01720 |
null |
2025-01-03 |
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries |
Michal Kuk et.al. |
2501.01711 |
null |
2025-01-03 |
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders |
Jiajun Cao et.al. |
2501.01709 |
null |
2025-01-03 |
AgentRefine: Enhancing Agent Generalization through Refinement Tuning |
Dayuan Fu et.al. |
2501.01702 |
null |
2025-01-03 |
Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models |
Lei Tang et.al. |
2501.01679 |
null |
2025-01-03 |
Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption |
Zhang Ruoyan et.al. |
2501.01672 |
null |
2025-01-03 |
BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction |
Alaeddine Diaf et.al. |
2501.01664 |
null |
2025-01-03 |
Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning |
Danni Peng et.al. |
2501.01653 |
null |
2025-01-03 |
MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments |
Cai Yin et.al. |
2501.01652 |
link |
2025-01-03 |
HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding |
Heqing Zou et.al. |
2501.01645 |
null |
2025-01-03 |
iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings |
Shuhei Tomoshige et.al. |
2501.01642 |
null |
2025-01-03 |
Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation |
Rini Smita Thakur et.al. |
2501.01640 |
null |
2025-01-03 |
A non-ergodic framework for understanding emergent capabilities in Large Language Models |
Javier Marin et.al. |
2501.01638 |
null |
2025-01-03 |
Revisiting Data Analysis with Pre-trained Foundation Models |
Chen Liang et.al. |
2501.01631 |
null |
2025-01-03 |
ICPC: In-context Prompt Compression with Faster Inference |
Ziyang Yu et.al. |
2501.01625 |
null |
2025-01-03 |
PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents |
Jingoo Lee et.al. |
2501.01594 |
null |
2025-01-03 |
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges |
Mohamed Hisham Abdellatif et.al. |
2501.01588 |
null |
2025-01-02 |
Predicting the Performance of Black-box LLMs through Self-Queries |
Dylan Sam et.al. |
2501.01558 |
link |
2025-01-02 |
Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs) |
Jumana Almahmoud et.al. |
2501.01545 |
null |
2025-01-02 |
Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information |
Rasul Tutnov et.al. |
2501.01544 |
null |
2025-01-02 |
Denoising Diffused Embeddings: a Generative Approach for Hypergraphs |
Shihao Wu et.al. |
2501.01541 |
null |
2025-01-02 |
BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery |
Kanishk Gandhi et.al. |
2501.01540 |
link |
2025-01-02 |
SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers |
Bhavna Gopal et.al. |
2501.01529 |
null |
2025-01-02 |
Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search |
Shuangtao Li et.al. |
2501.01478 |
null |
2025-01-02 |
Unifying Specialized Visual Encoders for Video Language Models |
Jihoon Chung et.al. |
2501.01426 |
link |
2025-01-02 |
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models |
Jingfeng Yao et.al. |
2501.01423 |
link |
2025-01-02 |
Multi-Modal Video Feature Extraction for Popularity Prediction |
Haixu Liu et.al. |
2501.01422 |
null |
2025-01-02 |
Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers |
Seunghyun Lee et.al. |
2501.01414 |
null |
2025-01-02 |
On Unifying Video Generation and Camera Pose Estimation |
Chun-Hao Paul Huang et.al. |
2501.01409 |
null |
2025-01-02 |
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios |
Xize Cheng et.al. |
2501.01384 |
null |
2025-01-02 |
ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI |
Neda Tavakoli et.al. |
2501.01372 |
link |
2025-01-02 |
Aligning Large Language Models for Faithful Integrity Against Opposing Argument |
Yong Zhao et.al. |
2501.01336 |
link |
2025-01-02 |
CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models |
Johan Wahréus et.al. |
2501.01335 |
link |
2025-01-02 |
Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension |
Yanbo Fang et.al. |
2501.01332 |
null |
2025-01-02 |
The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation |
Shuzheng Gao et.al. |
2501.01329 |
null |
2025-01-03 |
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking |
Xiaoxue Cheng et.al. |
2501.01306 |
null |
2025-01-02 |
Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments – The Depression and Anxiety Case |
Kaushik Roy et.al. |
2501.01305 |
null |
2025-01-02 |
Does a Large Language Model Really Speak in Human-Like Language? |
Mose Park et.al. |
2501.01273 |
null |
2025-01-02 |
ProgCo: Program Helps Self-Correction of Large Language Models |
Xiaoshuai Song et.al. |
2501.01264 |
null |
2025-01-03 |
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings |
Shanghaoran Quan et.al. |
2501.01257 |
null |
2025-01-02 |
Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers? |
Manuel Weber et.al. |
2501.01256 |
null |
2025-01-02 |
Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion |
Qiyuan He et.al. |
2501.01246 |
null |
2025-01-02 |
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization |
Yongle Huang et.al. |
2501.01245 |
link |
2025-01-02 |
Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants |
Lixiong Qin et.al. |
2501.01243 |
null |
2025-01-02 |
Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction |
Alexander Brinkmann et.al. |
2501.01237 |
link |
2025-01-03 |
TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer |
Jiayu Li et.al. |
2501.01216 |
null |
2025-01-02 |
Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects |
Abdullah Mushtaq et.al. |
2501.01205 |
null |
2025-01-02 |
HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation |
Runsong Jia et.al. |
2501.01203 |
null |
2025-01-02 |
LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge |
Kyoungkook Kang et.al. |
2501.01197 |
null |
2025-01-02 |
Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education |
Annika Bush et.al. |
2501.01192 |
null |
2025-01-02 |
Towards Interactive Deepfake Analysis |
Lixiong Qin et.al. |
2501.01164 |
link |
2025-01-02 |
TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions |
Vriksha Srihari et.al. |
2501.01156 |
null |
2025-01-02 |
A3: Android Agent Arena for Mobile GUI Agents |
Yuxiang Chai et.al. |
2501.01149 |
null |
2025-01-03 |
BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference |
Wonsuk Jang et.al. |
2501.01144 |
link |
2025-01-02 |
Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method |
Ruichen Zhang et.al. |
2501.01141 |
null |
2025-01-02 |
Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning |
Shuo Yu et.al. |
2501.01124 |
null |
2025-01-02 |
MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification |
Jimin Park et.al. |
2501.01110 |
link |
2025-01-03 |
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization |
Haina Zhu et.al. |
2501.01108 |
link |
2025-01-02 |
Graph Generative Pre-trained Transformer |
Xiaohui Chen et.al. |
2501.01073 |
null |
2025-01-02 |
Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models |
Yanwen Huang et.al. |
2501.01059 |
null |
2025-01-02 |
Risks of Cultural Erasure in Large Language Models |
Rida Qadri et.al. |
2501.01056 |
null |
2025-01-02 |
Dynamic Scaling of Unit Tests for Code Reward Modeling |
Zeyao Ma et.al. |
2501.01054 |
null |
2025-01-02 |
Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs |
Linhao Huang et.al. |
2501.01042 |
null |
2025-01-02 |
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models |
Bin Wang et.al. |
2501.01034 |
link |
2025-01-02 |
ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning |
Wonduk Seo et.al. |
2501.01031 |
null |
2025-01-03 |
KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model |
Xinshuo Hu et.al. |
2501.01028 |
link |
2025-01-02 |
MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model |
Chengze Zhang et.al. |
2501.01014 |
null |
2025-01-02 |
FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving |
Zihao Ye et.al. |
2501.01005 |
link |
2025-01-02 |
Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory |
Zhou Yang et.al. |
2501.00999 |
null |
2025-01-02 |
Optimizing Noise Schedules of Generative Models in High Dimensionss |
Santiago Aranguri et.al. |
2501.00988 |
null |
2025-01-02 |
Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice |
Federico Ravenda et.al. |
2501.00982 |
link |
2025-01-01 |
IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs |
Junfeng Jiao et.al. |
2501.00959 |
null |
2025-01-01 |
Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors |
Junfeng Jiao et.al. |
2501.00957 |
null |
2025-01-01 |
Incremental Dialogue Management: Survey, Discussion, and Implications for HRI |
Casey Kennington et.al. |
2501.00953 |
null |
2025-01-01 |
SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering |
Shihab Ahmed et.al. |
2501.00940 |
null |
2025-01-01 |
Diffusion Policies for Generative Modeling of Spacecraft Trajectories |
Julia Briden et.al. |
2501.00915 |
null |
2025-01-01 |
Aligning LLMs with Domain Invariant Reward Models |
David Wu et.al. |
2501.00911 |
link |
2025-01-01 |
Population Aware Diffusion for Time Series Generation |
Yang Li et.al. |
2501.00910 |
link |
2025-01-01 |
Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things |
Talha Zeeshan et.al. |
2501.00906 |
null |
2025-01-01 |
Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model |
Chenyang Liu et.al. |
2501.00895 |
null |
2025-01-01 |
Evaluating Time Series Foundation Models on Noisy Periodic Time Series |
Syamantak Datta Gupta et.al. |
2501.00889 |
null |
2025-01-01 |
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization |
Weiqi Wu et.al. |
2501.00888 |
link |
2025-01-01 |
Representation in large language models |
Cameron C. Yetman et.al. |
2501.00885 |
null |
2025-01-01 |
Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents |
Fouad Bousetouane et.al. |
2501.00881 |
null |
2025-01-01 |
Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction |
Teng Hu et.al. |
2501.00880 |
null |
2025-01-01 |
TrustRAG: Enhancing Robustness and Trustworthiness in RAG |
Huichi Zhou et.al. |
2501.00879 |
link |
2025-01-01 |
LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models |
Hieu Man et.al. |
2501.00874 |
link |
2025-01-01 |
Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation |
Mingjia Li et.al. |
2501.00873 |
link |
2025-01-01 |
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation |
Shoutao Guo et.al. |
2501.00868 |
link |
2025-01-01 |
Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era |
Mihnea C. Moldoveanu et.al. |
2501.00867 |
null |
2025-01-01 |
Alzheimer’s disease detection based on large language model prompt engineering |
Tian Zheng et.al. |
2501.00861 |
null |
2025-01-01 |
LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions |
Adam Ishay et.al. |
2501.00830 |
null |
2025-01-01 |
An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems |
Haoxiang Tian et.al. |
2501.00829 |
null |
2025-01-01 |
LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management |
Yichen Luo et.al. |
2501.00826 |
null |
2025-01-01 |
Multimodal Large Models Are Effective Action Anticipators |
Binglu Wang et.al. |
2501.00795 |
link |
2025-01-01 |
Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models |
Minhao Bai et.al. |
2501.00786 |
null |
2025-01-01 |
NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model |
Yuzhi Lai et.al. |
2501.00785 |
null |
2025-01-01 |
REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization |
Huyen Nguyen et.al. |
2501.00779 |
null |
2025-01-01 |
FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation |
Qianli Wang et.al. |
2501.00777 |
null |
2025-01-01 |
Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis |
Jie Gao et.al. |
2501.00775 |
null |
2025-01-01 |
An AI-powered Bayesian generative modeling approach for causal inference in observational studies |
Qiao Liu et.al. |
2501.00755 |
null |
2025-01-01 |
Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform |
Cheonsu Jeong et.al. |
2501.00750 |
null |
2025-01-01 |
DIVE: Diversified Iterative Self-Improvement |
Yiwei Qin et.al. |
2501.00747 |
link |
2025-01-01 |
Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines |
Xiyang Hu et.al. |
2501.00745 |
null |
2025-01-01 |
A Distributional Evaluation of Generative Image Models |
Edric Tam et.al. |
2501.00744 |
null |
2025-01-01 |
New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory |
Madiha Ajmal et.al. |
2501.00721 |
null |
2025-01-01 |
Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection |
Hao Wang et.al. |
2501.00700 |
null |
2025-01-01 |
Adjoint sharding for very long context training of state space models |
Xingzi Xu et.al. |
2501.00692 |
null |
2025-01-01 |
Labels Generated by Large Language Model Helps Measuring People’s Empathy in Vitro |
Md Rakibul Hasan et.al. |
2501.00691 |
null |
2025-01-01 |
IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently |
Florian Dietz et.al. |
2501.00684 |
null |
2024-12-31 |
Grade Inflation in Generative Models |
Phuc Nguyen et.al. |
2501.00664 |
null |
2024-12-31 |
Finding Missed Code Size Optimizations in Compilers using LLMs |
Davide Italiano et.al. |
2501.00655 |
null |
2024-12-31 |
Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models |
Suttisak Wizadwongsa et.al. |
2501.00651 |
null |
2024-12-31 |
Efficient Standardization of Clinical Notes using Large Language Models |
Daniel B. Hier et.al. |
2501.00644 |
null |
2024-12-31 |
Enabling New HDLs with Agents |
Mark Zakharov et.al. |
2501.00642 |
null |
2024-12-31 |
DreamDrive: Generative 4D Scene Modeling from Street View Images |
Jiageng Mao et.al. |
2501.00601 |
null |
2024-12-31 |
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM |
Yuqian Yuan et.al. |
2501.00599 |
link |
2024-12-31 |
Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation |
M. Ali Bayram et.al. |
2501.00593 |
null |
2024-12-31 |
Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method |
Zhenpeng Huang et.al. |
2501.00584 |
null |
2024-12-31 |
Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders |
Yipeng Kang et.al. |
2501.00581 |
null |
2024-12-31 |
AI and Quantum Computing in Binary Photocatalytic Hydrogen Production |
Dennis Delali Kwesi Wayo et.al. |
2501.00575 |
null |
2024-12-31 |
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling |
Xinhao Li et.al. |
2501.00574 |
link |
2024-12-31 |
Probing Visual Language Priors in VLMs |
Tiange Luo et.al. |
2501.00569 |
null |
2024-12-31 |
Robust and Adaptive Optimization under a Large Language Model Lens |
Dimitris Bertsimas et.al. |
2501.00568 |
null |
2024-12-30 |
Distributed Mixture-of-Agents for Edge Inference with Large Language Models |
Purbesh Mitra et.al. |
2412.21200 |
link |
2024-12-31 |
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation |
Zhaojian Yu et.al. |
2412.21199 |
link |
2024-12-30 |
The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick |
Jonathan Berkheim et.al. |
2412.21186 |
null |
2024-12-30 |
Facilitating large language model Russian adaptation with Learned Embedding Propagation |
Mikhail Tikhomirov et.al. |
2412.21140 |
link |
2024-12-30 |
ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation |
Ruixuan Liu et.al. |
2412.21123 |
null |
2025-01-02 |
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation |
Yuanbo Yang et.al. |
2412.21117 |
null |
2024-12-30 |
Varformer: Adapting VAR’s Generative Prior for Image Restoration |
Siyang Wang et.al. |
2412.21063 |
link |
2024-12-30 |
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation |
Jiazheng Xu et.al. |
2412.21059 |
link |
2024-12-30 |
Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense |
Yuyang Zhou et.al. |
2412.21051 |
link |
2024-12-30 |
E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models |
Zhiyu Tan et.al. |
2412.21044 |
null |
2024-12-30 |
Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration |
Wanglong Lu et.al. |
2412.21042 |
link |
2024-12-30 |
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization |
Chia-Yu Hung et.al. |
2412.21037 |
link |
2024-12-30 |
GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models |
Shangyu Xing et.al. |
2412.21036 |
null |
2024-12-30 |
MapQaTor: A System for Efficient Annotation of Map Query Datasets |
Mahir Labib Dihan et.al. |
2412.21015 |
link |
2024-12-31 |
Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria |
Joonwon Jang et.al. |
2412.21006 |
null |
2024-12-30 |
Plug-and-Play Training Framework for Preference Optimization |
Jingyuan Ma et.al. |
2412.20996 |
null |
2024-12-30 |
KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model’s Reasoning Path Aggregation |
Siyuan Fang et.al. |
2412.20995 |
null |
2024-12-30 |
Efficiently Serving LLM Reasoning Programs with Certaindex |
Yichao Fu et.al. |
2412.20993 |
null |
2024-12-30 |
QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing |
Shlomo Kashani et.al. |
2412.20956 |
null |
2024-12-30 |
AGON: Automated Design Framework for Customizing Processors from ISA Documents |
Chongxiao Li et.al. |
2412.20954 |
null |
2024-12-30 |
Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema |
Xiaohan Feng et.al. |
2412.20942 |
null |
2024-12-30 |
Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering |
Junxiao Xue et.al. |
2412.20927 |
null |
2024-12-30 |
ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation |
Ting Zhang et.al. |
2412.20901 |
null |
2024-12-30 |
Towards Compatible Fine-tuning for Vision-Language Model Updates |
Zhengbo Wang et.al. |
2412.20895 |
null |
2024-12-30 |
DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models |
Xiaolin Hu et.al. |
2412.20891 |
null |
2024-12-30 |
Enhancing Annotated Bibliography Generation with LLM Ensembles |
Sergio Bermejo et.al. |
2412.20864 |
null |
2024-12-30 |
Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs’ Memory |
Xingjian Tao et.al. |
2412.20846 |
null |
2024-12-30 |
Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment |
Jianfei Zhang et.al. |
2412.20834 |
link |
2024-12-30 |
Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model |
Runtao Ren et.al. |
2412.20820 |
null |
2024-12-30 |
TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting |
Huanyu Zhang et.al. |
2412.20810 |
null |
2024-12-30 |
Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves |
Chayan Chatterjee et.al. |
2412.20789 |
null |
2024-12-31 |
SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity |
Pengfei Jing et.al. |
2412.20787 |
null |
2024-12-30 |
Large Language Model Enabled Multi-Task Physical Layer Network |
Tianyue Zheng et.al. |
2412.20772 |
null |
2024-12-30 |
Attributing Culture-Conditioned Generations to Pretraining Corpora |
Huihan Li et.al. |
2412.20760 |
link |
2024-12-30 |
M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs |
Bei Yan et.al. |
2412.20718 |
link |
2024-12-30 |
HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images |
Sungik Choi et.al. |
2412.20704 |
null |
2024-12-30 |
UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design |
Zijie Chen et.al. |
2412.20694 |
null |
2024-12-30 |
Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks |
Yuhe Ding et.al. |
2412.20682 |
null |
2024-12-30 |
Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA |
Qingyun Jin et.al. |
2412.20677 |
null |
2024-12-30 |
Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner |
Yitong Zhou et.al. |
2412.20662 |
link |
2024-12-30 |
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis |
Yousef Yeganeh et.al. |
2412.20651 |
null |
2024-12-30 |
SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy |
Md Mahadi Hasan Nahid et.al. |
2412.20641 |
null |
2024-12-30 |
Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble |
Yongchang Li et.al. |
2412.20637 |
null |
2024-12-30 |
EVOLVE: Emotion and Visual Output Learning via LLM Evaluation |
Jordan Sinclair et.al. |
2412.20632 |
null |
2024-12-29 |
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study |
Yulin Fei et.al. |
2412.20613 |
link |
2024-12-29 |
NLP-based Regulatory Compliance – Using GPT 4.0 to Decode Regulatory Documents |
Bimal Kumar et.al. |
2412.20602 |
null |
2024-12-29 |
MATEY: multiscale adaptive foundation models for spatiotemporal physical systems |
Pei Zhang et.al. |
2412.20601 |
null |
2024-12-29 |
Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection |
Dmitri Roussinov et.al. |
2412.20595 |
link |
2024-12-29 |
Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches |
Madhavendra Thakur et.al. |
2412.20584 |
null |
2024-12-29 |
Counterfactual Samples Constructing and Training for Commonsense Statements Estimation |
Chong Liu et.al. |
2412.20563 |
null |
2024-12-29 |
Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces |
Linglingzhi Zhu et.al. |
2412.20556 |
null |
2024-12-29 |
The Impact of Prompt Programming on Function-Level Code Generation |
Ranim Khojah et.al. |
2412.20545 |
link |
2024-12-29 |
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning |
Xingshuai Huang et.al. |
2412.20519 |
null |
2024-12-29 |
Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning |
Hang Ni et.al. |
2412.20505 |
null |
2024-12-29 |
ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding |
Xiao Wang et.al. |
2412.20504 |
link |
2024-12-29 |
TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication |
Zongwu Wang et.al. |
2412.20501 |
link |
2024-12-29 |
Multimodal Variational Autoencoder: a Barycentric View |
Peijie Qiu et.al. |
2412.20487 |
null |
2024-12-29 |
JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling |
Haorui Ji et.al. |
2412.20470 |
null |
2024-12-29 |
Improving Vision-Language-Action Models via Chain-of-Affordance |
Jinming Li et.al. |
2412.20451 |
null |
2024-12-29 |
Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs |
Pratik Rakesh Singh et.al. |
2412.20440 |
null |
2024-12-29 |
Image Augmentation Agent for Weakly Supervised Semantic Segmentation |
Wangyu Wu et.al. |
2412.20439 |
null |
2024-12-29 |
Unlocking adaptive digital pathology through dynamic feature learning |
Jiawen Li et.al. |
2412.20430 |
null |
2024-12-29 |
AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models |
Mansi et.al. |
2412.20427 |
null |
2024-12-29 |
Bringing Objects to Life: 4D generation from 3D objects |
Ohad Rahamim et.al. |
2412.20422 |
null |
2024-12-29 |
Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection |
Kalin Kopanov et.al. |
2412.20414 |
null |
2024-12-29 |
Multi-Objective Large Language Model Unlearning |
Zibin Pan et.al. |
2412.20412 |
link |
2024-12-29 |
Open-Sora: Democratizing Efficient Video Production for All |
Zangwei Zheng et.al. |
2412.20404 |
link |
2024-12-29 |
Natural Language Fine-Tuning |
Jia Liu et.al. |
2412.20382 |
link |
2024-12-29 |
Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs) |
Jia Wei Sii et.al. |
2412.20381 |
null |
2024-12-29 |
FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation |
Yan Luo et.al. |
2412.20374 |
link |
2024-12-29 |
LLM2: Let Large Language Models Harness System 2 Reasoning |
Cheng Yang et.al. |
2412.20372 |
link |
2025-01-02 |
Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey |
Junqiao Wang et.al. |
2412.20367 |
null |
2024-12-29 |
HindiLLM: Large Language Model for Hindi |
Sanjay Chouhan et.al. |
2412.20357 |
null |
2024-12-29 |
Distilling Desired Comments for Enhanced Code Review with Large Language Models |
Yongda Yu et.al. |
2412.20340 |
null |
2024-12-29 |
Mind the Data Gap: Bridging LLMs to Enterprise Data Integration |
Moe Kayali et.al. |
2412.20331 |
null |
2024-12-29 |
GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions |
Tianyao Shi et.al. |
2412.20322 |
null |
2024-12-29 |
Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain |
Shintaro Ozaki et.al. |
2412.20309 |
null |
2024-12-28 |
FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration |
Jia Liu et.al. |
2412.20297 |
null |
2024-12-28 |
Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games |
Guan-Horng Liu et.al. |
2412.20279 |
null |
2024-12-28 |
Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues |
Henry J. Xie et.al. |
2412.20264 |
link |
2024-12-28 |
Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception |
Athanasios Karagounis et.al. |
2412.20230 |
null |
2024-12-28 |
LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning |
Shuguang Chen et.al. |
2412.20227 |
null |
2024-12-28 |
Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation |
Yeonhong Park et.al. |
2412.20185 |
null |
2024-12-28 |
LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System |
Hyucksung Kwon et.al. |
2412.20166 |
null |
2024-12-28 |
StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN |
Andrzej Bedychaj et.al. |
2412.20164 |
null |
2024-12-28 |
Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems |
Minhye Jeon et.al. |
2412.20163 |
null |
2024-12-28 |
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation |
Guanglei Yang et.al. |
2412.20162 |
null |
2024-12-28 |
Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses |
Xinru Wen et.al. |
2412.20154 |
null |
2024-12-28 |
Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering |
Wei Zhou et.al. |
2412.20145 |
null |
2024-12-28 |
TradingAgents: Multi-Agents LLM Financial Trading Framework |
Yijia Xiao et.al. |
2412.20138 |
null |
2024-12-28 |
M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation |
Zhaopeng Feng et.al. |
2412.20127 |
link |
2024-12-28 |
Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers |
Tuomas Hakoniemi et.al. |
2412.20114 |
null |
2024-12-28 |
ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming |
Jiedong Zhuang et.al. |
2412.20105 |
null |
2024-12-28 |
On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs |
Atmane Ayoub Mansour Bahar et.al. |
2412.20087 |
null |
2024-12-31 |
Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset |
Chongjian Yue et.al. |
2412.20072 |
null |
2024-12-28 |
On the Compositional Generalization of Multimodal LLMs for Medical Imaging |
Zhenyang Cai et.al. |
2412.20070 |
link |
2024-12-28 |
VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition |
Lan Chen et.al. |
2412.20064 |
link |
2024-12-28 |
MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion |
Zechao Zhan et.al. |
2412.20062 |
null |
2024-12-28 |
Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts |
Yanxin Shen et.al. |
2412.20061 |
null |
2024-12-28 |
“My life is miserable, have to sign 500 autographs everyday”: Exposing Humblebragging, the Brags in Disguise |
Sharath Naganna et.al. |
2412.20057 |
null |
2024-12-27 |
Enhancing Whisper’s Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization |
Kumud Tripathi et.al. |
2412.19785 |
null |
2024-12-27 |
Can AI Help with Your Personal Finances? |
Oudom Hean et.al. |
2412.19784 |
null |
2024-12-27 |
Tensor Network Estimation of Distribution Algorithms |
John Gardiner et.al. |
2412.19780 |
null |
2024-12-27 |
Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration |
Le Chen et.al. |
2412.19770 |
link |
2024-12-27 |
Generative Video Propagation |
Shaoteng Liu et.al. |
2412.19761 |
null |
2024-12-27 |
On dual-projectively equivalent connections associated to second order superintegrable systems |
Andreas Vollmer et.al. |
2412.19739 |
null |
2024-12-27 |
Can Large Language Models Adapt to Other Agents In-Context? |
Matthew Riemer et.al. |
2412.19726 |
null |
2024-12-27 |
From Elements to Design: A Layered Approach for Automatic Graphic Design Composition |
Jiawei Lin et.al. |
2412.19712 |
null |
2024-12-27 |
Toward Adaptive Reasoning in Large Language Models with Thought Rollback |
Sijia Chen et.al. |
2412.19707 |
link |
2024-12-27 |
A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization |
Jingchun Lian et.al. |
2412.19685 |
null |
2024-12-27 |
Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework |
Jiang Liu et.al. |
2412.19684 |
null |
2024-12-27 |
CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs |
Siyu Wang et.al. |
2412.19663 |
null |
2024-12-27 |
Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis |
Jiaqi Wang et.al. |
2412.19654 |
link |
2024-12-27 |
FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios |
Kaiyi Pang et.al. |
2412.19652 |
null |
2024-12-27 |
Xmodel-2 Technical Report |
Wang Qun et.al. |
2412.19638 |
null |
2024-12-27 |
IMTP: Search-based Code Generation for In-memory Tensor Programs |
Yongwon Shin et.al. |
2412.19630 |
null |
2024-12-27 |
Signatures of prediction during natural listening in MEG data? |
Sahel Azizpour et.al. |
2412.19622 |
null |
2024-12-27 |
Gradient Weight-normalized Low-rank Projection for Efficient LLM Training |
Jia-Hong Huang et.al. |
2412.19616 |
link |
2024-12-27 |
SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms |
Shashank Rao Marpally et.al. |
2412.19595 |
null |
2024-12-27 |
Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following |
Yuxiao Yang et.al. |
2412.19562 |
null |
2024-12-27 |
Diverse Rare Sample Generation with Pretrained GANs |
Subeen Lee et.al. |
2412.19543 |
link |
2024-12-27 |
Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations |
Yuanfei Huang et.al. |
2412.19520 |
null |
2024-12-27 |
Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model |
Hyunwoo Cho et.al. |
2412.19517 |
null |
2024-12-27 |
Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs |
Zhe Yang et.al. |
2412.19513 |
link |
2024-12-27 |
Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging |
Hua Farn et.al. |
2412.19512 |
null |
2024-12-27 |
Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion |
Koustav Ghosal et.al. |
2412.19510 |
null |
2024-12-27 |
MBQ: Modality-Balanced Quantization for Large Vision-Language Models |
Shiyao Li et.al. |
2412.19509 |
link |
2024-12-27 |
DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT |
Xiaotao Hu et.al. |
2412.19505 |
link |
2024-12-27 |
Casevo: A Cognitive Agents and Social Evolution Simulator |
Zexun Jiang et.al. |
2412.19498 |
link |
2024-12-27 |
Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation |
Chengyang Ye et.al. |
2412.19492 |
link |
2024-12-27 |
Focusing Image Generation to Mitigate Spurious Correlations |
Xuewei Li et.al. |
2412.19457 |
null |
2024-12-27 |
Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models |
Hyeonseok Moon et.al. |
2412.19450 |
link |
2024-12-27 |
Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models |
Shuo Wang et.al. |
2412.19449 |
null |
2024-12-27 |
A Survey on Large Language Model Acceleration based on KV Cache Management |
Haoyang Li et.al. |
2412.19442 |
link |
2024-12-27 |
Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback |
Seong Jin Lee et.al. |
2412.19436 |
null |
2024-12-27 |
Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints |
Alberto Maté et.al. |
2412.19424 |
null |
2024-12-27 |
Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning |
Chen Li et.al. |
2412.19422 |
link |
2024-12-27 |
MINIMA: Modality Invariant Image Matching |
Xingyu Jiang et.al. |
2412.19412 |
link |
2024-12-27 |
MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios |
Jiaqi Fan et.al. |
2412.19406 |
link |
2024-12-27 |
An Engorgio Prompt Makes Large Language Model Babble on |
Jianshuo Dong et.al. |
2412.19394 |
link |
2024-12-26 |
Large Language Models for Market Research: A Data-augmentation Approach |
Mengxin Wang et.al. |
2412.19363 |
null |
2024-12-26 |
Dynamic Skill Adaptation for Large Language Models |
Jiaao Chen et.al. |
2412.19361 |
null |
2024-12-26 |
Identifying Split Vacancies with Foundation Models and Electrostatics |
Seán R. Kavanagh et.al. |
2412.19330 |
null |
2024-12-26 |
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment |
Ziang Yan et.al. |
2412.19326 |
link |
2024-12-26 |
Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones |
Mehrnaz Mofakhami et.al. |
2412.19325 |
null |
2024-12-26 |
From Interets to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries |
Hugh Van Deventer et.al. |
2412.19312 |
link |
2024-12-26 |
Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries |
Roberto Amoroso et.al. |
2412.19304 |
null |
2024-12-26 |
RecLM: Recommendation Instruction Tuning |
Yangqin Jiang et.al. |
2412.19302 |
link |
2024-12-26 |
RAG with Differential Privacy |
Nicolas Grislain et.al. |
2412.19291 |
link |
2024-12-26 |
Time Series Foundational Models: Their Role in Anomaly Detection and Prediction |
Chathurangi Shyalika et.al. |
2412.19286 |
link |
2024-12-26 |
PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing |
Michael Bezick et.al. |
2412.19284 |
null |
2024-12-26 |
MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes |
Asma Ben Abacha et.al. |
2412.19260 |
link |
2024-12-26 |
VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis |
Jaemin Jung et.al. |
2412.19259 |
null |
2024-12-26 |
Sentiment trading with large language models |
Kemal Kirtac et.al. |
2412.19245 |
null |
2024-12-26 |
SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model |
Xuyang Li et.al. |
2412.19237 |
null |
2024-12-26 |
Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining |
Yuxin You et.al. |
2412.19211 |
null |
2024-12-26 |
Multi-Attribute Constraint Satisfaction via Language Model Rewriting |
Ashutosh Baheti et.al. |
2412.19198 |
null |
2024-12-26 |
Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models |
Haonan He et.al. |
2412.19191 |
null |
2024-12-26 |
Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue |
Shuzhi Xu et.al. |
2412.19154 |
null |
2024-12-26 |
AskChart: Universal Chart Understanding through Textual Enhancement |
Xudong Yang et.al. |
2412.19146 |
link |
2024-12-26 |
SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis |
Senbin Zhu et.al. |
2412.19140 |
link |
2024-12-26 |
PlanLLM: Video Procedure Planning with Refinable Large Language Models |
Dejie Yang et.al. |
2412.19139 |
link |
2024-12-26 |
Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing |
Inpyo Hong et.al. |
2412.19125 |
link |
2024-12-26 |
Discrete vs. Continuous Trade-offs for Generative Models |
Jathin Korrapati et.al. |
2412.19114 |
null |
2024-12-26 |
SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values |
Yunfan Zhang et.al. |
2412.19113 |
null |
2024-12-26 |
Stochastic normalizing flows for Effective String Theory |
Michele Caselle et.al. |
2412.19109 |
null |
2024-12-26 |
“I’ve Heard of You!”: Generate Spoken Named Entity Recognition Data for Unseen Entities |
Jiawei Yu et.al. |
2412.19102 |
null |
2024-12-26 |
Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security |
Vasileios Alevizos et.al. |
2412.19088 |
null |
2024-12-26 |
Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation |
Haotian Qian et.al. |
2412.19080 |
null |
2024-12-26 |
CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers |
Jingyi Zheng et.al. |
2412.19037 |
link |
2024-12-26 |
Repository Structure-Aware Training Makes SLMs Better Issue Resolver |
Zexiong Ma et.al. |
2412.19031 |
null |
2024-12-26 |
Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation |
Yixin Chen et.al. |
2412.19026 |
link |
2024-12-26 |
Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication |
Xiqiang Qu et.al. |
2412.19025 |
null |
2024-12-26 |
Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation |
Tao Liu et.al. |
2412.19021 |
null |
2024-12-26 |
Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability |
Ruixi Lin et.al. |
2412.19018 |
null |
2024-12-25 |
How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study |
Alejandro Velasco et.al. |
2412.18989 |
null |
2024-12-25 |
ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement |
Zhefan Rao et.al. |
2412.18966 |
null |
2024-12-25 |
Musings About the Future of Search: A Return to the Past? |
Jimmy Lin et.al. |
2412.18956 |
null |
2024-12-25 |
A Power-Efficient Hardware Implementation of L-Mul |
Ruiqi Chen et.al. |
2412.18948 |
null |
2024-12-25 |
MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models |
Kaiwen Zuo et.al. |
2412.18947 |
null |
2024-12-25 |
Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations |
Yewon Kim et.al. |
2412.18940 |
null |
2024-12-25 |
Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference |
Libo Zhang et.al. |
2412.18934 |
null |
2024-12-25 |
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation |
Lunhao Duan et.al. |
2412.18928 |
null |
2024-12-25 |
Exemplar-condensed Federated Class-incremental Learning |
Rui Sun et.al. |
2412.18926 |
null |
2024-12-25 |
Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model |
Yi-Chia Chen et.al. |
2412.18917 |
link |
2024-12-25 |
AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures |
Situo Zhang et.al. |
2412.18910 |
null |
2024-12-25 |
CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models |
Ping Guo et.al. |
2412.18890 |
link |
2024-12-25 |
MotionMap: Representing Multimodality in Human Pose Forecasting |
Reyhaneh Hosseininejad et.al. |
2412.18883 |
null |
2024-12-25 |
Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models |
Meltem Aksoy et.al. |
2412.18863 |
null |
2024-12-25 |
Improving the Readability of Automatically Generated Tests using Large Language Models |
Matteo Biagiola et.al. |
2412.18843 |
null |
2024-12-25 |
LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements |
Hao Zhang et.al. |
2412.18835 |
null |
2024-12-25 |
Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition |
Shujie Hu et.al. |
2412.18832 |
null |
2024-12-25 |
RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting |
Yilei Jiang et.al. |
2412.18826 |
null |
2024-12-25 |
CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection |
Wenbin Li et.al. |
2412.18820 |
link |
2024-12-25 |
LLM-assisted vector similarity search |
Md Riyadh et.al. |
2412.18819 |
null |
2024-12-25 |
DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search |
Lei Yang et.al. |
2412.18811 |
link |
2024-12-25 |
Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation |
Xinkai Du et.al. |
2412.18800 |
null |
2024-12-25 |
Torque-Aware Momentum |
Pranshu Malviya et.al. |
2412.18790 |
null |
2024-12-25 |
Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models |
Yu-An Liu et.al. |
2412.18770 |
link |
2024-12-25 |
The Impact of Input Order Bias on Large Language Models for Software Fault Localization |
Md Nakhla Rafi et.al. |
2412.18750 |
null |
2024-12-24 |
Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models |
Zehan Wang et.al. |
2412.18605 |
link |
2024-12-24 |
Long-Form Speech Generation with Spoken Language Models |
Se Jin Park et.al. |
2412.18603 |
link |
2024-12-24 |
Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems |
Fernando Jia et.al. |
2412.18601 |
link |
2024-12-24 |
ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation |
Hongjie Li et.al. |
2412.18600 |
null |
2024-12-24 |
DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation |
Minghong Cai et.al. |
2412.18597 |
link |
2024-12-24 |
A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs |
OpenMind et.al. |
2412.18588 |
null |
2024-12-24 |
Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control |
Sergey Sedov et.al. |
2412.18582 |
null |
2024-12-24 |
Zero-resource Speech Translation and Recognition with LLMs |
Karel Mundnich et.al. |
2412.18566 |
null |
2024-12-24 |
Distilling Fine-grained Sentiment Understanding from Large Language Models |
Yice Zhang et.al. |
2412.18552 |
link |
2024-12-24 |
Token-Budget-Aware LLM Reasoning |
Tingxu Han et.al. |
2412.18547 |
link |
2024-12-24 |
PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction |
Xingjian Xu et.al. |
2412.18541 |
null |
2024-12-24 |
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation |
Derong Xu Xinhang Li et.al. |
2412.18537 |
link |
2024-12-24 |
Automated Code Review In Practice |
Umut Cihan et.al. |
2412.18531 |
null |
2024-12-24 |
Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving |
Hao Pang et.al. |
2412.18511 |
null |
2024-12-24 |
Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization |
Yi-Fu Fu et.al. |
2412.18497 |
null |
2024-12-24 |
GeFL: Model-Agnostic Federated Learning with Generative Models |
Honggu Kang et.al. |
2412.18460 |
null |
2024-12-24 |
3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding |
Tatiana Zemskova et.al. |
2412.18450 |
link |
2024-12-24 |
Is Large Language Model Good at Triple Set Prediction? An Empirical Study |
Yuan Yuan et.al. |
2412.18443 |
null |
2024-12-24 |
Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm |
O. Deniz Akyildiz et.al. |
2412.18432 |
null |
2024-12-24 |
GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent |
Kangjia Zhao et.al. |
2412.18426 |
null |
2024-12-24 |
Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models |
Zihan Zhou et.al. |
2412.18419 |
null |
2024-12-24 |
Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles |
Zihan Wang et.al. |
2412.18416 |
null |
2024-12-24 |
Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English |
Avinash Anand et.al. |
2412.18415 |
link |
2024-12-24 |
Discovery of 2D Materials via Symmetry-Constrained Diffusion Model |
Shihang Xu et.al. |
2412.18414 |
null |
2024-12-24 |
A Statistical Framework for Ranking LLM-Based Chatbots |
Siavash Ameli et.al. |
2412.18407 |
link |
2024-12-24 |
Extract Free Dense Misalignment from CLIP |
JeongYeon Nam et.al. |
2412.18404 |
link |
2024-12-24 |
RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction |
Wu Xiaoping et.al. |
2412.18390 |
null |
2024-12-24 |
MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs |
Qiuyi Gu et.al. |
2412.18381 |
link |
2024-12-24 |
Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents |
Kaiwen Ning et.al. |
2412.18371 |
link |
2024-12-24 |
Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering |
Zhongjian Hu et.al. |
2412.18351 |
null |
2024-12-24 |
M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models |
Jiaxin Guo et.al. |
2412.18299 |
null |
2024-12-24 |
Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight |
Xi Ding et.al. |
2412.18298 |
link |
2024-12-24 |
Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases |
Christian Di Maio et.al. |
2412.18295 |
null |
2024-12-24 |
DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation |
Junyi Lu et.al. |
2412.18291 |
null |
2024-12-24 |
Improved Feature Generating Framework for Transductive Zero-shot Learning |
Zihan Ye et.al. |
2412.18282 |
null |
2024-12-24 |
GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications |
Zhenzhou Jin et.al. |
2412.18281 |
null |
2024-12-24 |
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization |
Jiacai Liu et.al. |
2412.18279 |
null |
2024-12-24 |
GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge |
Shammur Absar Chowdhury et.al. |
2412.18274 |
null |
2024-12-24 |
Annotating References to Mythological Entities in French Literature |
Thierry Poibeau et.al. |
2412.18270 |
null |
2024-12-24 |
Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study |
Xuefeng Jiang et.al. |
2412.18260 |
link |
2024-12-24 |
AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction |
Pufan Zou et.al. |
2412.18255 |
null |
2024-12-24 |
An Automatic Graph Construction Framework based on Large Language Models for Recommendation |
Rong Shan et.al. |
2412.18241 |
link |
2024-12-24 |
Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection |
Jango Zhang et.al. |
2412.18225 |
null |
2024-12-24 |
Expand VSR Benchmark for VLLM to Expertize in Spatial Rules |
Peijin Xie et.al. |
2412.18224 |
link |
2024-12-24 |
ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation |
Mengyang Wu et.al. |
2412.18216 |
link |
2024-12-24 |
Adapting Large Language Models for Improving TCP Fairness over WiFi |
Shyam Kumar Shrestha et.al. |
2412.18200 |
null |
2024-12-24 |
Robustness-aware Automatic Prompt Optimization |
Zeru Shi et.al. |
2412.18196 |
link |
2024-12-24 |
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks |
Shiduo Zhang et.al. |
2412.18194 |
null |
2024-12-24 |
TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization |
Yucong Luo et.al. |
2412.18185 |
null |
2024-12-24 |
Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation |
Yucong Luo et.al. |
2412.18176 |
null |
2024-12-24 |
INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent |
Haohang Li et.al. |
2412.18174 |
null |
2024-12-24 |
Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models |
Xiaomeng Hu et.al. |
2412.18171 |
null |
2024-12-24 |
KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management |
Rongxin Cheng et.al. |
2412.18169 |
null |
2024-12-24 |
Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence |
Yinbin Han et.al. |
2412.18164 |
null |
2024-12-24 |
VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities |
Shray Mathur et.al. |
2412.18161 |
null |
2024-12-24 |
Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task |
Jinming Liu et.al. |
2412.18158 |
null |
2024-12-24 |
Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance |
Yaoyun Zhang et.al. |
2412.18157 |
null |
2024-12-24 |
scReader: Prompting Large Language Models to Interpret scRNA-seq Data |
Cong Li et.al. |
2412.18156 |
null |
2024-12-24 |
GeneSUM: Large Language Model-based Gene Summary Extraction |
Zhijian Chen et.al. |
2412.18154 |
null |
2024-12-24 |
CoAM: Corpus of All-Type Multiword Expressions |
Yusuke Ide et.al. |
2412.18151 |
null |
2024-12-24 |
EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation |
Shuhao Han et.al. |
2412.18150 |
link |
2024-12-24 |
Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction |
Xiao Guo et.al. |
2412.18149 |
null |
2024-12-24 |
Ensuring Consistency for In-Image Translation |
Chengpeng Fu et.al. |
2412.18139 |
null |
2024-12-24 |
LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment |
Binrui Zeng et.al. |
2412.18135 |
null |
2024-12-24 |
VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection |
Zhaohui Jin et.al. |
2412.18124 |
null |
2024-12-24 |
AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation |
Hao Wen et.al. |
2412.18116 |
null |
2024-12-24 |
AIGT: AI Generative Table Based on Prompt |
Mingming Zhang et.al. |
2412.18111 |
null |
2024-12-24 |
SlimGPT: Layer-wise Structured Pruning for Large Language Models |
Gui Ling et.al. |
2412.18110 |
null |
2024-12-24 |
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach |
Jing Bi et.al. |
2412.18108 |
null |
2024-12-24 |
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels |
Mingcong Song et.al. |
2412.18106 |
null |
2024-12-24 |
EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent |
Suyuan Wang et.al. |
2412.18100 |
null |
2024-12-24 |
Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) – a Large Language Model Chatbot for Perioperative Medicine |
Yu He Ke et.al. |
2412.18096 |
null |
2024-12-24 |
Molly: Making Large Language Model Agents Solve Python Problem More Logically |
Rui Xiao et.al. |
2412.18093 |
null |
2024-12-24 |
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner |
Aizierjiang Aiersilan et.al. |
2412.18086 |
link |
2024-12-24 |
Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models |
Xuan Lin et.al. |
2412.18084 |
link |
2024-12-24 |
Improving Factuality with Explicit Working Memory |
Mingda Chen et.al. |
2412.18069 |
null |
2024-12-24 |
LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR |
Osama Hosam Abdellaif et.al. |
2412.18063 |
link |
2024-12-24 |
Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction |
Hyunbae Jeon et.al. |
2412.18061 |
null |
2024-12-24 |
An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM |
Wen Wen et.al. |
2412.18060 |
null |
2024-12-23 |
Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations |
Maya Patel et.al. |
2412.18051 |
null |
2024-12-23 |
AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data |
Mirko Zaffaroni et.al. |
2412.18038 |
link |
2024-12-23 |
Generating refactored code accurately using reinforcement learning |
Indranil Palit et.al. |
2412.18035 |
null |
2024-12-23 |
More than Chit-Chat: Developing Robots for Small-Talk Interactions |
Rebecca Ramnauth et.al. |
2412.18023 |
null |
2024-12-23 |
Trustworthy and Efficient LLMs Meet Databases |
Kyoungmin Kim et.al. |
2412.18022 |
null |
2024-12-23 |
StructTest: Benchmarking LLMs’ Reasoning through Compositional Structured Outputs |
Hailin Chen et.al. |
2412.18011 |
null |
2024-12-23 |
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models |
Ruibo Tu et.al. |
2412.17970 |
link |
2024-12-23 |
LMV-RPA: Large Model Voting-based Robotic Process Automation |
Osama Abdellatif et.al. |
2412.17965 |
link |
2024-12-23 |
Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models |
Antony Seabra et.al. |
2412.17964 |
null |
2024-12-23 |
Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models |
Ge Zhang et.al. |
2412.17963 |
null |
2024-12-23 |
Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents |
Antony Seabra et.al. |
2412.17942 |
null |
2024-12-23 |
BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism |
Martin Fajcik et.al. |
2412.17933 |
null |
2024-12-23 |
Causal Composition Diffusion Model for Closed-loop Traffic Generation |
Haohong Lin et.al. |
2412.17920 |
null |
2024-12-23 |
Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning |
Orson Mengara et.al. |
2412.17908 |
null |
2024-12-23 |
LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses |
Sara Riazi et.al. |
2412.17892 |
null |
2024-12-23 |
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models |
Siyuan Bian et.al. |
2412.17811 |
null |
2024-12-23 |
Reconstructing People, Places, and Cameras |
Lea Müller et.al. |
2412.17806 |
null |
2024-12-23 |
Automating the Search for Artificial Life with Foundation Models |
Akarsh Kumar et.al. |
2412.17799 |
link |
2024-12-23 |
ResearchTown: Simulator of Human Research Community |
Haofei Yu et.al. |
2412.17767 |
link |
2024-12-23 |
ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback |
Wei Zhang et.al. |
2412.17754 |
null |
2024-12-23 |
Deliberation in Latent Space via Differentiable Cache Augmentation |
Luyang Liu et.al. |
2412.17747 |
null |
2024-12-23 |
YuLan-Mini: An Open Data-efficient Language Model |
Yiwen Hu et.al. |
2412.17743 |
link |
2024-12-23 |
**Reasoning to Attend: Try to Understand How Token Works** |
Rui Qian et.al. |
2412.17741 |
link |
2024-12-23 |
Knowledge Editing through Chain-of-Thought |
Changyue Wang et.al. |
2412.17727 |
link |
2024-12-23 |
Understanding the Logic of Direct Preference Alignment through Logic |
Kyle Richardson et.al. |
2412.17696 |
null |
2024-12-23 |
Large Language Model Safety: A Holistic Survey |
Dan Shi et.al. |
2412.17686 |
link |
2024-12-23 |
A Bias-Free Training Paradigm for More General AI-generated Image Detection |
Fabrizio Guillaro et.al. |
2412.17671 |
null |
2024-12-23 |
Generating Completions for Fragmented Broca’s Aphasic Sentences Using Large Language Models |
Sijbren van Vaals et.al. |
2412.17669 |
link |
2024-12-23 |
Detecting anxiety and depression in dialogues: a multi-label and explainable approach |
Francisco de Arriba-Pérez et.al. |
2412.17651 |
null |
2024-12-23 |
SCBench: A Sports Commentary Benchmark for Video LLMs |
Kuangzhi Ge et.al. |
2412.17637 |
null |
2024-12-23 |
ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance |
Renyang Liu et.al. |
2412.17632 |
link |
2024-12-23 |
Tracking the Feature Dynamics in LLM Training: A Mechanistic Study |
Yang Xu et.al. |
2412.17626 |
null |
2024-12-23 |
Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models |
Parham Rezaei et.al. |
2412.17622 |
link |
2024-12-23 |
Emerging Security Challenges of Large Language Models |
Herve Debar et.al. |
2412.17614 |
null |
2024-12-23 |
Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs |
Fabrizio Frasca et.al. |
2412.17609 |
null |
2024-12-23 |
EasyTime: Time Series Forecasting Made Easy |
Xiangfei Qiu et.al. |
2412.17603 |
null |
2024-12-23 |
LiveIdeaBench: Evaluating LLMs’ Scientific Creativity and Idea Generation with Minimal Context |
Kai Ruan et.al. |
2412.17596 |
link |
2024-12-23 |
Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation |
Chengbing Wang et.al. |
2412.17593 |
null |
2024-12-23 |
HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data |
Ting Zhou et.al. |
2412.17574 |
link |
2024-12-23 |
S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field |
Zixi Liang et.al. |
2412.17561 |
link |
2024-12-23 |
GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference |
Chao Zeng et.al. |
2412.17560 |
null |
2024-12-23 |
A Survey of Query Optimization in Large Language Models |
Mingyang Song et.al. |
2412.17558 |
null |
2024-12-23 |
Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing |
Prakash Aryan et.al. |
2412.17548 |
link |
2024-12-23 |
Retention Score: Quantifying Jailbreak Risks for Vision Language Models |
Zaitang Li et.al. |
2412.17544 |
null |
2024-12-23 |
Constructing Fair Latent Space for Intersection of Fairness and Explainability |
Hyungjun Joo et.al. |
2412.17523 |
null |
2024-12-23 |
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak |
Hao Wang et.al. |
2412.17522 |
null |
2024-12-23 |
Improving the Noise Estimation of Latent Neural Stochastic Differential Equations |
Linus Heck et.al. |
2412.17499 |
null |
2024-12-23 |
Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings |
Jérémie Sublime et.al. |
2412.17486 |
null |
2024-12-23 |
Power- and Fragmentation-aware Online Scheduling for GPU Datacenters |
Francesco Lettich et.al. |
2412.17484 |
link |
2024-12-23 |
A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression |
Chenlong Deng et.al. |
2412.17483 |
null |
2024-12-23 |
A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers |
Shuaihang Chen et.al. |
2412.17481 |
link |
2024-12-23 |
CALLIC: Content Adaptive Learning for Lossless Image Compression |
Daxin Li et.al. |
2412.17464 |
null |
2024-12-23 |
Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning |
Xiaodan Chen et.al. |
2412.17456 |
null |
2024-12-23 |
Applying LLM and Topic Modelling in Psychotherapeutic Contexts |
Alexander Vanin et.al. |
2412.17449 |
null |
2024-12-23 |
Measuring Contextual Informativeness in Child-Directed Text |
Maria Valentini et.al. |
2412.17427 |
link |
2024-12-23 |
Multimodal Preference Data Synthetic Alignment with Reward Model |
Robert Wijaya et.al. |
2412.17417 |
link |
2024-12-23 |
VidCtx: Context-aware Video Question Answering with Image Models |
Andreas Goulas et.al. |
2412.17415 |
null |
2024-12-23 |
Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance |
Muhammad Reza Qorib et.al. |
2412.17408 |
link |
2024-12-23 |
Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning |
Huchen Jiang et.al. |
2412.17397 |
null |
2024-12-23 |
WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models |
Huawen Feng et.al. |
2412.17395 |
null |
2024-12-23 |
Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement |
Hyeonjin Kim et.al. |
2412.17387 |
link |
2024-12-23 |
Interweaving Memories of a Siamese Large Language Model |
Xin Song et.al. |
2412.17383 |
link |
2024-12-23 |
MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models |
Beibei Yu et.al. |
2412.17339 |
null |
2024-12-23 |
A Dual-Perspective Metaphor Detection Framework Using Large Language Models |
Yujie Lin et.al. |
2412.17332 |
link |
2024-12-23 |
Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance |
Nicolas Devatine et.al. |
2412.17321 |
null |
2024-12-23 |
CodeV: Issue Resolving with Visual Data |
Linhao Zhang et.al. |
2412.17315 |
link |
2024-12-23 |
Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories |
Mahan Tafreshipour et.al. |
2412.17298 |
null |
2024-12-23 |
Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples |
Taewoong Kim et.al. |
2412.17288 |
link |
2024-12-23 |
LLM4AD: A Platform for Algorithm Design with Large Language Model |
Fei Liu et.al. |
2412.17287 |
link |
2024-12-23 |
Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning |
Rui Liang et.al. |
2412.17285 |
null |
2024-12-23 |
Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach |
Rafid Ishrak Jahan et.al. |
2412.17255 |
link |
2024-12-23 |
SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval |
Xiaopeng Li et.al. |
2412.17250 |
null |
2024-12-23 |
EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling |
Zichen Song et.al. |
2412.17249 |
null |
2024-12-23 |
On the Generalization Ability of Machine-Generated Text Detectors |
Yule Liu et.al. |
2412.17242 |
link |
2024-12-23 |
Brain-to-Text Benchmark ‘24: Lessons Learned |
Francis R. Willett et.al. |
2412.17227 |
link |
2024-12-23 |
CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder |
Lichen Ma et.al. |
2412.17225 |
null |
2024-12-22 |
Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension |
Jio Oh et.al. |
2412.17189 |
null |
2024-12-22 |
Foundation Model for Lossy Compression of Spatiotemporal Scientific Data |
Xiao Li et.al. |
2412.17184 |
null |
2024-12-22 |
Enhancing Item Tokenization for Generative Recommendation through Self-Improvement |
Runjin Chen et.al. |
2412.17171 |
null |
2024-12-22 |
Generative Diffusion Modeling: A Practical Handbook |
Zihan Ding et.al. |
2412.17162 |
null |
2024-12-22 |
LLM-based relevance assessment still can’t replace human relevance assessment |
Charles L. A. Clarke et.al. |
2412.17156 |
null |
2024-12-22 |
LLM Agent for Fire Dynamics Simulations |
Leidong Xu et.al. |
2412.17146 |
null |
2024-12-22 |
Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs |
Rushendra Sidibomma et.al. |
2412.17131 |
link |
2024-12-22 |
Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models |
Cameron R. Jones et.al. |
2412.17128 |
null |
2024-12-22 |
Learning to Adapt to Low-Resource Paraphrase Generation |
Zhigen Li et.al. |
2412.17111 |
null |
2024-12-22 |
DreamOmni: Unified Image Generation and Editing |
Bin Xia et.al. |
2412.17098 |
null |
2024-12-22 |
Analysis on LLMs Performance for Code Summarization |
Md. Ahnaf Akib et.al. |
2412.17094 |
null |
2024-12-22 |
SAIL: Sample-Centric In-Context Learning for Document Information Extraction |
Jinyu Zhang et.al. |
2412.17092 |
link |
2024-12-22 |
SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults |
Jinzhi Wang et.al. |
2412.17077 |
null |
2024-12-22 |
The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM’s Internal States |
Fabian Ridder et.al. |
2412.17056 |
link |
2024-12-22 |
DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately |
Huiwen Wu et.al. |
2412.17053 |
null |
2024-12-22 |
ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues |
Shaina Raza et.al. |
2412.17052 |
link |
2024-12-22 |
Modular Conversational Agents for Surveys and Interviews |
Jiangbo Yu et.al. |
2412.17049 |
null |
2024-12-22 |
Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective |
Hankun Wang et.al. |
2412.17048 |
null |
2024-12-22 |
Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation |
Luoxu Jin et.al. |
2412.17042 |
null |
2024-12-22 |
HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories |
Eric Hedlin et.al. |
2412.17040 |
null |
2024-12-22 |
Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model |
Kevin Shen et.al. |
2412.17039 |
null |
2024-12-22 |
Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models |
Lang Gao et.al. |
2412.17034 |
null |
2024-12-22 |
MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge |
Jie He et.al. |
2412.17032 |
link |
2024-12-22 |
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos |
Zhengqian Wu et.al. |
2412.17022 |
link |
2024-12-22 |
GAS: Generative Auto-bidding with Post-training Search |
Yewen Li et.al. |
2412.17018 |
null |
2024-12-22 |
Robustness of Large Language Models Against Adversarial Attacks |
Yiyi Tao et.al. |
2412.17011 |
null |
2024-12-22 |
InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions |
Ronghui Li et.al. |
2412.16982 |
null |
2024-12-22 |
On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora |
Tzu-Chieh Chen et.al. |
2412.16976 |
null |
2024-12-22 |
Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs |
Alexander von Recum et.al. |
2412.16974 |
null |
2024-12-22 |
Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach |
Chunxu Zhang et.al. |
2412.16969 |
link |
2024-12-22 |
System-2 Mathematical Reasoning via Enriched Instruction Tuning |
Huanqia Cai et.al. |
2412.16964 |
null |
2024-12-22 |
Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework |
Jundong Xu et.al. |
2412.16953 |
null |
2024-12-22 |
A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation |
Ekai Hashimoto et.al. |
2412.16943 |
null |
2024-12-22 |
Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering |
Zhongjian Hu et.al. |
2412.16936 |
null |
2024-12-22 |
Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models |
Kai Zheng et.al. |
2412.16933 |
null |
2024-12-22 |
Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs |
Bohan Jin et.al. |
2412.16922 |
null |
2024-12-22 |
Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection |
Yuhang Gan et.al. |
2412.16918 |
null |
2024-12-22 |
Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation |
Quan Dao et.al. |
2412.16906 |
null |
2024-12-22 |
Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model |
Songjun Tu et.al. |
2412.16878 |
link |
2024-12-20 |
HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding |
Chenxin Tao et.al. |
2412.16158 |
null |
2024-12-20 |
Can Generative Video Models Help Pose Estimation? |
Ruojin Cai et.al. |
2412.16155 |
null |
2024-12-20 |
Offline Reinforcement Learning for LLM Multi-Step Reasoning |
Huaijie Wang et.al. |
2412.16145 |
link |
2024-12-20 |
Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation |
Seyedreza Mohseni et.al. |
2412.16135 |
null |
2024-12-20 |
Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information |
Dirk Bergemann et.al. |
2412.16132 |
null |
2024-12-20 |
PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics |
Daniil Larionov et.al. |
2412.16120 |
null |
2024-12-20 |
Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts |
Muhammad Abdullah Sohail et.al. |
2412.16119 |
link |
2024-12-20 |
PruneVid: Visual Token Pruning for Efficient Video Large Language Models |
Xiaohu Huang et.al. |
2412.16117 |
link |
2024-12-20 |
The Content Moderator’s Dilemma: Removal of Toxic Content and Distortions to Online Discourse |
Mahyar Habibi et.al. |
2412.16114 |
null |
2024-12-20 |
Logical Consistency of Large Language Models in Fact-checking |
Bishwamittra Ghosh et.al. |
2412.16100 |
null |
2024-12-20 |
The Evolution of LLM Adoption in Industry Data Curation Practices |
Crystal Qian et.al. |
2412.16089 |
null |
2024-12-20 |
Efficient MedSAMs: Segment Anything in Medical Images on Laptop |
Jun Ma et.al. |
2412.16085 |
link |
2024-12-20 |
Formal Mathematical Reasoning: A New Frontier in AI |
Kaiyu Yang et.al. |
2412.16075 |
null |
2024-12-20 |
The Only Way is Ethics: A Guide to Ethical Research with Large Language Models |
Eddie L. Ungless et.al. |
2412.16022 |
link |
2024-12-20 |
Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support |
Qijiong Liu et.al. |
2412.15973 |
link |
2024-12-20 |
From General to Specific: Tailoring Large Language Models for Personalized Healthcare |
Ruize Shi et.al. |
2412.15957 |
null |
2024-12-20 |
Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring |
Markus Borg et.al. |
2412.15948 |
null |
2024-12-20 |
Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation |
Gautier Evennou et.al. |
2412.15939 |
link |
2024-12-20 |
Large Language Model assisted Hybrid Fuzzing |
Ruijie Meng et.al. |
2412.15931 |
null |
2024-12-20 |
MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection |
Andrea Moglia et.al. |
2412.15925 |
link |
2024-12-20 |
RiTTA: Modeling Event Relations in Text-to-Audio Generation |
Yuhang He et.al. |
2412.15922 |
link |
2024-12-20 |
Less is More: Towards Green Code Large Language Models via Unified Structural Pruning |
Guang Yang et.al. |
2412.15921 |
null |
2024-12-20 |
Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model |
Yosuke Yamagishi et.al. |
2412.15907 |
null |
2024-12-20 |
Evaluation of Reliability Criteria for News Publishers with Large Language Models |
Manuel Pratelli et.al. |
2412.15896 |
null |
2024-12-20 |
TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain |
Camille Barboule et.al. |
2412.15891 |
null |
2024-12-20 |
AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI |
Katja Bühler et.al. |
2412.15876 |
null |
2024-12-20 |
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback |
Jiaming Ji et.al. |
2412.15838 |
link |
2024-12-20 |
WebLLM: A High-Performance In-Browser LLM Inference Engine |
Charlie F. Ruan et.al. |
2412.15803 |
link |
2024-12-20 |
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning |
Sungjin Park et.al. |
2412.15797 |
null |
2024-12-20 |
GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning |
Heming Zhang et.al. |
2412.15790 |
link |
2024-12-20 |
Linguistic Features Extracted by GPT-4 Improve Alzheimer’s Disease Detection based on Spontaneous Speech |
Jonathan Heitz et.al. |
2412.15772 |
link |
2024-12-20 |
Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference |
Jorge García-Carrasco et.al. |
2412.15750 |
link |
2024-12-20 |
Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models |
Shamus Sim et.al. |
2412.15748 |
null |
2024-12-20 |
VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models |
Dexter Neo et.al. |
2412.15739 |
null |
2024-12-20 |
AutoLife: Automatic Life Journaling with Smartphones and LLMs |
Huatao Xu et.al. |
2412.15714 |
null |
2024-12-20 |
Contrastive Learning for Task-Independent SpeechLLM-Pretraining |
Maike Züfle et.al. |
2412.15712 |
link |
2024-12-20 |
Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback |
Niklas Ippisch et.al. |
2412.15702 |
null |
2024-12-20 |
Code Review Automation Via Multi-task Federated LLM – An Empirical Study |
Jahnavi Kumar et.al. |
2412.15676 |
null |
2024-12-20 |
Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline |
Guancheng Zeng et.al. |
2412.15660 |
null |
2024-12-20 |
Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class |
Annie D’souza et.al. |
2412.15657 |
link |
2024-12-20 |
MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula |
Sieun Hyeon et.al. |
2412.15655 |
link |
2024-12-20 |
Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution |
Wentao Tan et.al. |
2412.15650 |
link |
2024-12-20 |
Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model |
Xin Du et.al. |
2412.15634 |
link |
2024-12-20 |
Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning? |
Mengyu Ye et.al. |
2412.15628 |
null |
2024-12-20 |
JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs |
Hongyi Li et.al. |
2412.15623 |
null |
2024-12-20 |
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage |
Zhi Gao et.al. |
2412.15606 |
null |
2024-12-20 |
Don’t Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks |
Brian J Chan et.al. |
2412.15605 |
link |
2024-12-20 |
Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification |
Gyutae Park et.al. |
2412.15603 |
null |
2024-12-20 |
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation |
Xiaoqiang Kang et.al. |
2412.15594 |
link |
2024-12-20 |
NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization |
Danial Kamali et.al. |
2412.15588 |
link |
2024-12-20 |
To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models |
Jessica Y. Bo et.al. |
2412.15584 |
null |
2024-12-20 |
A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation |
Ryien Hosseini et.al. |
2412.15582 |
link |
2024-12-20 |
Score-based Generative Diffusion Models for Social Recommendations |
Chengyi Liu et.al. |
2412.15579 |
link |
2024-12-20 |
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning |
Xinyang Tong et.al. |
2412.15576 |
null |
2024-12-20 |
J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM |
Takero Yoshida et.al. |
2412.15574 |
null |
2024-12-20 |
Continual Learning Using a Kernel-Based Method Over Foundation Models |
Saleh Momeni et.al. |
2412.15571 |
link |
2024-12-20 |
DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation |
Yichun Tai et.al. |
2412.15570 |
link |
2024-12-20 |
In-context Continual Learning Assisted by an External Continual Learner |
Saleh Momeni et.al. |
2412.15563 |
null |
2024-12-20 |
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning |
Zheyuan Zhang et.al. |
2412.15547 |
null |
2024-12-20 |
MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering |
Zhang Siyue et.al. |
2412.15540 |
null |
2024-12-20 |
XRAG: eXamining the Core – Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation |
Qianren Mao et.al. |
2412.15529 |
link |
2024-12-20 |
HREF: Human Response-Guided Evaluation of Instruction Following in Language Models |
Xinxi Lyu et.al. |
2412.15524 |
link |
2024-12-20 |
PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time |
Alireza Pourali et.al. |
2412.15519 |
link |
2024-12-20 |
Stylish and Functional: Guided Interpolation Subject to Physical Constraints |
Yan-Ying Chen et.al. |
2412.15507 |
null |
2024-12-20 |
Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework |
Zhenjie Xu et.al. |
2412.15504 |
link |
2024-12-20 |
Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models |
Zhisheng Tang et.al. |
2412.15501 |
null |
2024-12-20 |
TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use |
Junjie Ye et.al. |
2412.15495 |
link |
2024-12-20 |
PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search |
Jiaxin Wu et.al. |
2412.15494 |
null |
2024-12-20 |
GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators |
Hengjia Li et.al. |
2412.15491 |
null |
2024-12-20 |
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage |
Saehyung Lee et.al. |
2412.15484 |
null |
2024-12-20 |
Continual Learning Using Only Large Language Model Prompting |
Jiabao Qiu et.al. |
2412.15479 |
null |
2024-12-19 |
TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models |
Ammar N. Abbas et.al. |
2412.15462 |
null |
2024-12-19 |
Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization |
Sahil Wadhwa et.al. |
2412.15453 |
null |
2024-12-19 |
AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals |
Angela Mastrianni et.al. |
2412.15444 |
null |
2024-12-19 |
SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval |
Aakash Mahalingam et.al. |
2412.15443 |
null |
2024-12-19 |
Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models |
Tianchen Zhang et.al. |
2412.15431 |
null |
2024-12-19 |
MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale |
Swapnil Gandhi et.al. |
2412.15411 |
null |
2024-12-19 |
Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification |
Edoardo Allegrini et.al. |
2412.15410 |
null |
2024-12-19 |
Systematic Evaluation of Long-Context LLMs on Financial Concepts |
Lavanya Gupta et.al. |
2412.15386 |
null |
2024-12-19 |
Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation |
Joanne Boisson et.al. |
2412.15375 |
link |
2024-12-19 |
Automated Root Cause Analysis System for Complex Data Products |
Mathieu Demarne et.al. |
2412.15374 |
null |
2024-12-19 |
Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs |
Liam Seymour et.al. |
2412.15352 |
link |
2024-12-19 |
Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models |
Reza Shirkavand et.al. |
2412.15341 |
link |
2024-12-19 |
Complete background cosmology of parity-even quadratic metric-affine gravity |
Thomas Dyer et.al. |
2412.15329 |
null |
2024-12-19 |
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving |
Shuo Xing et.al. |
2412.15208 |
link |
2024-12-19 |
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark |
Qihao Zhao et.al. |
2412.15194 |
link |
2024-12-19 |
LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation |
Weijia Shi et.al. |
2412.15188 |
null |
2024-12-19 |
Tiled Diffusion |
Or Madar et.al. |
2412.15185 |
null |
2024-12-19 |
Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning |
Simon Frieder et.al. |
2412.15184 |
null |
2024-12-19 |
STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning |
Marius Memmel et.al. |
2412.15182 |
null |
2024-12-19 |
HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages |
Aman Chaturvedi et.al. |
2412.15178 |
null |
2024-12-19 |
Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying |
Federico Castagna et.al. |
2412.15177 |
link |
2024-12-19 |
Rethinking Uncertainty Estimation in Natural Language Generation |
Lukas Aichberger et.al. |
2412.15176 |
null |
2024-12-19 |
Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM |
Yatai Ji et.al. |
2412.15156 |
link |
2024-12-19 |
Language Models as Continuous Self-Evolving Data Engineers |
Peidong Wang et.al. |
2412.15151 |
null |
2024-12-19 |
Jet: A Modern Transformer-Based Normalizing Flow |
Alexander Kolesnikov et.al. |
2412.15129 |
null |
2024-12-19 |
Adaptive Pruning for Large Language Models with Structural Importance Awareness |
Haotian Zheng et.al. |
2412.15127 |
null |
2024-12-19 |
Outcome-Refining Process Supervision for Code Generation |
Zhuohao Yu et.al. |
2412.15118 |
link |
2024-12-19 |
Qwen2.5 Technical Report |
Qwen et.al. |
2412.15115 |
link |
2024-12-19 |
Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture |
Thomas F Burns et.al. |
2412.15113 |
link |
2024-12-19 |
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation |
Yang Tian et.al. |
2412.15109 |
link |
2024-12-19 |
Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability |
Xiangsen Chen et.al. |
2412.15101 |
null |
2024-12-19 |
Nano-ESG: Extracting Corporate Sustainability Information from News Articles |
Fabian Billert et.al. |
2412.15093 |
link |
2024-12-19 |
Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation |
Haoran Liu et.al. |
2412.15086 |
null |
2024-12-19 |
ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots |
Bhupendra Acharya et.al. |
2412.15072 |
null |
2024-12-19 |
ConfliBERT: A Language Model for Political Conflict |
Patrick T. Brandt et.al. |
2412.15060 |
link |
2024-12-19 |
LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps |
Felix Friedrich et.al. |
2412.15035 |
null |
2024-12-19 |
DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space |
Mang Ning et.al. |
2412.15032 |
link |
2024-12-19 |
Large Language Models and Code Security: A Systematic Literature Review |
Enna Basic et.al. |
2412.15004 |
null |
2024-12-19 |
HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs |
Pham Vu Tuan Dat et.al. |
2412.14995 |
link |
2024-12-19 |
RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning |
Raphael Memmesheimer et.al. |
2412.14989 |
null |
2024-12-19 |
Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts |
Ioana Buhnila et.al. |
2412.14986 |
null |
2024-12-19 |
AI and Cultural Context: An Empirical Investigation of Large Language Models’ Performance on Chinese Social Work Professional Standards |
Zia Qi et.al. |
2412.14971 |
null |
2024-12-19 |
Movie2Story: A framework for understanding videos and telling stories in the form of novel text |
Kangning Li et.al. |
2412.14965 |
null |
2024-12-19 |
Knowledge Injection via Prompt Distillation |
Kalle Kujanpää et.al. |
2412.14964 |
null |
2024-12-19 |
Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities |
Daniil Medyakov et.al. |
2412.14935 |
null |
2024-12-19 |
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response |
Junyu Luo et.al. |
2412.14922 |
link |
2024-12-19 |
Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation |
Zexiong Ma et.al. |
2412.14905 |
null |
2024-12-19 |
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering |
Peize Li et.al. |
2412.14880 |
null |
2024-12-19 |
Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering |
Imed Keraghel et.al. |
2412.14867 |
null |
2024-12-19 |
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling |
Junyi Li et.al. |
2412.14860 |
null |
2024-12-19 |
DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis |
Hongling Xu et.al. |
2412.14849 |
link |
2024-12-19 |
Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas |
Pietro Bernardelle et.al. |
2412.14843 |
null |
2024-12-19 |
Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis |
Greta Dolcetti et.al. |
2412.14841 |
null |
2024-12-19 |
Progressive Multimodal Reasoning via Active Retrieval |
Guanting Dong et.al. |
2412.14835 |
null |
2024-12-19 |
Answer Set Networks: Casting Answer Set Programming into Deep Learning |
Arseny Skryagin et.al. |
2412.14814 |
link |
2024-12-19 |
ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis |
Zeao Tu et.al. |
2412.14809 |
link |
2024-12-19 |
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning |
Ziang Ye et.al. |
2412.14780 |
null |
2024-12-19 |
ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine |
Rabee Qasem et.al. |
2412.14771 |
null |
2024-12-19 |
PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children |
Yiqun Zhang et.al. |
2412.14769 |
link |
2024-12-19 |
CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering |
Ruida Hu et.al. |
2412.14764 |
link |
2024-12-19 |
Query pipeline optimization for cancer patient question answering systems |
Maolin He et.al. |
2412.14751 |
null |
2024-12-19 |
Active Inference and Human–Computer Interaction |
Roderick Murray-Smith et.al. |
2412.14741 |
null |
2024-12-19 |
On Verbalized Confidence Scores for LLMs |
Daniel Yang et.al. |
2412.14737 |
link |
2024-12-19 |
Creation of AI-driven Smart Spaces for Enhanced Indoor Environments – A Survey |
Aygün Varol et.al. |
2412.14708 |
null |
2024-12-19 |
LLMs as mediators: Can they diagnose conflicts accurately? |
Özgecan Koçak et.al. |
2412.14675 |
null |
2024-12-19 |
Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT |
Hassane Kissane et.al. |
2412.14670 |
null |
2024-12-19 |
IOHunter: Graph Foundation Model to Uncover Online Information Operations |
Marco Minici et.al. |
2412.14663 |
link |
2024-12-19 |
Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models |
Zijun Chen et.al. |
2412.14660 |
link |
2024-12-19 |
Length Controlled Generation for Black-box LLMs |
Yuxuan Gu et.al. |
2412.14656 |
null |
2024-12-19 |
Learning to Generate Research Idea with Dynamic Control |
Ruochen Li et.al. |
2412.14626 |
null |
2024-12-19 |
How good is GPT at writing political speeches for the White House? |
Jacques Savoy et.al. |
2412.14617 |
null |
2024-12-19 |
Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning |
Kepu Zhang et.al. |
2412.14588 |
null |
2024-12-19 |
HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning |
Minkuk Kim et.al. |
2412.14585 |
null |
2024-12-19 |
Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues |
Tao He et.al. |
2412.14584 |
null |
2024-12-19 |
CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation |
Youngwon Lee et.al. |
2412.14581 |
null |
2024-12-19 |
DiffSim: Taming Diffusion Models for Evaluating Visual Similarity |
Yiren Song et.al. |
2412.14580 |
link |
2024-12-19 |
Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models |
Wenhan Liu et.al. |
2412.14574 |
link |
2024-12-19 |
ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model |
Shunlin Lu et.al. |
2412.14559 |
null |
2024-12-19 |
The Current Challenges of Software Engineering in the Era of Large Language Models |
Cuiyun Gao et.al. |
2412.14554 |
null |
2024-12-19 |
Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models |
Xiao Cui et.al. |
2412.14528 |
link |
2024-12-19 |
Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment |
Teng Xiao et.al. |
2412.14516 |
link |
2024-12-19 |
Relational Programming with Foundation Models |
Ziyang Li et.al. |
2412.14515 |
null |
2024-12-19 |
PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization |
Jiayi Wu et.al. |
2412.14510 |
link |
2024-12-19 |
Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs |
Yuzuki Arai et.al. |
2412.14501 |
null |
2024-12-19 |
Guided Diffusion Model for Sensor Data Obfuscation |
Xin Yang et.al. |
2412.14499 |
null |
2024-12-19 |
FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis |
Abdullah Khan et.al. |
2412.14492 |
link |
2024-12-19 |
Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities |
Amandeep Kaur et.al. |
2412.14486 |
null |
2024-12-19 |
DirectorLLM for Human-Centric Video Generation |
Kunpeng Song et.al. |
2412.14484 |
null |
2024-12-19 |
Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs |
Koshiro Saito et.al. |
2412.14471 |
null |
2024-12-19 |
Agent-SafetyBench: Evaluating the Safety of LLM Agents |
Zhexin Zhang et.al. |
2412.14470 |
link |
2024-12-19 |
From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research |
Xiang Cheng et.al. |
2412.14461 |
null |
2024-12-19 |
LEDiff: Latent Exposure Diffusion for HDR Generation |
Chao Wang et.al. |
2412.14456 |
null |
2024-12-19 |
Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems |
Genki Kusano et.al. |
2412.14454 |
null |
2024-12-19 |
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation |
Shengqi Liu et.al. |
2412.14453 |
null |
2024-12-19 |
ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study |
Eric Modesitt et.al. |
2412.14436 |
link |
2024-12-19 |
All-in-One Tuning and Structural Pruning for Domain-Specific LLMs |
Lei Lu et.al. |
2412.14426 |
null |
2024-12-19 |
FedPIA – Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning |
Pramit Saha et.al. |
2412.14424 |
null |
2024-12-19 |
Enhancing Diffusion Models for High-Quality Image Generation |
Jaineet Shah et.al. |
2412.14422 |
null |
2024-12-18 |
ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers |
Haowei Liu et.al. |
2412.14405 |
null |
2024-12-18 |
Clinical Trials Ontology Engineering with Large Language Models |
Berkan Çakır et.al. |
2412.14387 |
null |
2024-12-18 |
ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling |
William Han et.al. |
2412.14373 |
link |
2024-12-18 |
Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models’ Character Understanding Evaluation |
Yuxuan Jiang et.al. |
2412.14368 |
null |
2024-12-18 |
Surrealistic-like Image Generation with Vision-Language Models |
Elif Ayten et.al. |
2412.14366 |
link |
2024-12-18 |
ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals |
Utkarsh Saxena et.al. |
2412.14363 |
link |
2024-12-18 |
A Unifying Information-theoretic Perspective on Evaluating Generative Models |
Alexis Fox et.al. |
2412.14340 |
null |
2024-12-18 |
Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation |
Benjamin Steenhoek et.al. |
2412.14308 |
null |
2024-12-18 |
Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs |
David Restrepo et.al. |
2412.14304 |
null |
2024-12-18 |
Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data |
haina Raza et.al. |
2412.14276 |
link |
2024-12-18 |
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces |
Jihan Yang et.al. |
2412.14171 |
link |
2024-12-18 |
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning |
Shengbang Tong et.al. |
2412.14164 |
null |
2024-12-18 |
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks |
Frank F. Xu et.al. |
2412.14161 |
link |
2024-12-18 |
Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models |
Atin Sakkeer Hussain et.al. |
2412.14146 |
null |
2024-12-18 |
LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research |
Tianyang Gu et.al. |
2412.14141 |
null |