Skip to content

Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

Notifications You must be signed in to change notification settings

Xuchen-Li/llm-arxiv-daily

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Updated on 2025.02.04

Table of Contents
  1. LLM Reasoning
  2. LLM Evaluation
  3. LLM MLLM
  4. Video Understanding

LLM Reasoning

Publish Date Title Authors PDF Code
2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning Han Zhong et.al. 2501.18858 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-27 Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations Pablo Valenzuela-Toledo et.al. 2501.16495 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-26 TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Yuxuan Gu et.al. 2501.15674 null
2025-01-28 Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning Zeyu Gan et.al. 2501.15602 link
2025-01-26 Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework Yuhong Sun et.al. 2501.15581 null
2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null
2025-01-24 GraphBC: Improving LLMs for Better Graph Data Processing Xu Chu et.al. 2501.14427 null
2025-01-23 Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Chang Gong et.al. 2501.13731 null
2025-01-22 EvidenceMap: Unleashing the Power of Small Language Models with Evidence Analysis for Biomedical Question Answering Chang Zong et.al. 2501.12746 null
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-22 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-17 Evolving Deeper LLM Thinking Kuang-Huei Lee et.al. 2501.09891 null
2025-01-23 Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Fengli Xu et.al. 2501.09686 null
2025-01-14 Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data Jiaxing Qiu et.al. 2501.08413 link
2025-01-14 Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning Haoyu Han et.al. 2501.07845 null
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 null
2025-01-08 Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting Dong-Hai Zhu et.al. 2501.04341 link
2025-01-07 Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation Alireza Salemi et.al. 2501.04167 null
2025-01-06 KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models Zaiyi Zheng et.al. 2501.02711 null
2025-01-04 Table as Thought: Exploring Structured Thoughts in LLM Reasoning Zhenjie Sun et.al. 2501.02152 null
2025-01-03 Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models Kaleem Ullah Qasim et.al. 2501.02026 null
2025-01-02 Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search Shuangtao Li et.al. 2501.01478 null
2025-01-02 HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation Runsong Jia et.al. 2501.01203 null
2025-01-03 Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents Chengbo He et.al. 2501.00430 null
2024-12-31 EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta Raymond Bernard et.al. 2501.00257 null
2024-12-30 Efficiently Serving LLM Reasoning Programs with Certaindex Yichao Fu et.al. 2412.20993 null
2024-12-28 LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning Shuguang Chen et.al. 2412.20227 null
2024-12-31 Token-Budget-Aware LLM Reasoning Tingxu Han et.al. 2412.18547 link
2024-12-23 StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs Hailin Chen et.al. 2412.18011 null
2024-12-22 Evaluating LLM Reasoning in the Operations Research Domain with ORQA Mahdi Mostajabdaveh et.al. 2412.17874 link
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-19 Eliciting Causal Abilities in Large Language Models for Reasoning Tasks Yajing Wang et.al. 2412.15314 link
2024-12-19 Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying Federico Castagna et.al. 2412.15177 link
2024-12-19 FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis Abdullah Khan et.al. 2412.14492 link
2024-12-18 Cognition Chain for Explainable Psychological Stress Detection on Social Media Xin Wang et.al. 2412.14009 null
2024-12-18 Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games Wenye Lin et.al. 2412.13602 null
2024-12-17 ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models Yuxi Sun et.al. 2412.12848 null
2024-12-12 A NotSo Simple Way to Beat Simple Bench Soham Sane et.al. 2412.12173 null
2024-12-11 What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis Jiayu Liu et.al. 2412.12157 null
2024-12-24 Stepwise Reasoning Error Disruption Attack of LLMs Jingyu Peng et.al. 2412.11934 null
2024-12-15 SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation Hang Zhang et.al. 2412.11026 null
2024-12-15 Entropy-Regularized Process Reward Model Hanning Zhang et.al. 2412.11006 link
2024-12-14 Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation Sukai Huang et.al. 2412.10675 null
2024-12-14 Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data Xue Wu et.al. 2412.10654 null
2024-12-13 Atomic Learning Objectives Labeling: A High-Resolution Approach for Physics Education Naiming Liu et.al. 2412.09914 null
2024-12-12 Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning Zhenni Bi et.al. 2412.09078 null
2024-12-11 Training Large Language Models to Reason in a Continuous Latent Space Shibo Hao et.al. 2412.06769 null
2025-01-23 GameArena: Evaluating LLM Reasoning through Live Computer Games Lanxiang Hu et.al. 2412.06394 null
2024-12-08 Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt Damien de Mijolla et.al. 2412.05967 null
2024-12-05 SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactions Bufang Yang et.al. 2412.04036 null
2024-12-03 Explainable CTR Prediction via LLM Reasoning Xiaohan Yu et.al. 2412.02588 null
2024-12-02 NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers Angel Yahir Loredo Lopez et.al. 2412.01621 null
2025-01-13 Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Zicheng Lin et.al. 2411.19943 null
2024-11-29 TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension Zipeng Qiu et.al. 2411.19504 link
2024-11-29 COLD: Causal reasOning in cLosed Daily activities Abhinav Joshi et.al. 2411.19500 link
2024-11-25 Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision Zhiheng Xi et.al. 2411.16579 null
2024-11-22 On the Impact of Fine-Tuning on Chain-of-Thought Reasoning Elita Lobo et.al. 2411.15382 null
2024-11-21 Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models Yuhao Dong et.al. 2411.14432 link
2024-11-15 Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination Haojie Zheng et.al. 2411.12591 link
2024-12-23 Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus Terufumi Morishita et.al. 2411.12498 link
2024-11-18 Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation Mingchao Qi et.al. 2411.11714 link
2024-12-31 Enhancing LLM Reasoning with Reward-guided Tree Search Jinhao Jiang et.al. 2411.11694 null
2024-12-15 A dataset of questions on decision-theoretic reasoning in Newcomb-like problems Caspar Oesterheld et.al. 2411.10588 link
2024-11-14 Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering Nghia Trung Ngo et.al. 2411.09213 null
2024-11-13 Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding Deyi Ji et.al. 2411.08516 null
2024-11-18 What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? Katie Kang et.al. 2411.07681 link
2024-11-27 Self-Training Meets Consistency: Improving LLMs' Reasoning With Consistency-Driven Rationale Evaluation Jaehyeok Lee et.al. 2411.06387 link
2024-11-09 A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization Haoxin Liu et.al. 2411.06018 null
2024-11-11 LLMs as Method Actors: A Model for Prompt Engineering and Architecture Colin Doyle et.al. 2411.05778 link
2024-11-12 Kwai-STaR: Transform LLMs into State-Transition Reasoners Xingyu Lu et.al. 2411.04799 null
2024-11-21 Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Haolin Chen et.al. 2411.04282 link
2024-11-05 CrowdGenUI: Enhancing LLM-Based UI Widget Generation with a Crowdsourced Preference Library Yimeng Liu et.al. 2411.03477 null
2025-01-27 MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs Manar Abdelatty et.al. 2411.03471 link
2024-11-04 RuAG: Learned-rule-augmented Generation for Large Language Models Yudi Zhang et.al. 2411.03349 null
2024-10-30 Vision-Language Models Can Self-Improve Reasoning via Reflection Kanzhi Cheng et.al. 2411.00855 null
2024-11-01 Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling Yiwen Ding et.al. 2411.00750 link
2024-11-01 STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing Jiaru Zou et.al. 2411.00387 null
2024-11-08 GRS-QA -- Graph Reasoning-Structured Question Answering Dataset Anish Pahilajani et.al. 2411.00369 null
2024-10-31 Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning Jinghan Zhang et.al. 2410.24155 null
2024-10-31 RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner Fu-Chieh Chang et.al. 2410.23912 null
2024-10-31 OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models Junda Wu et.al. 2410.23703 null
2024-10-30 ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning Millennium Bismay et.al. 2410.23180 link
2024-10-30 On Memorization of Large Language Models in Logical Reasoning Chulin Xie et.al. 2410.23123 null
2024-10-28 Causal Interventions on Causal Paths: Mapping GPT-2's Reasoning From Syntax to Semantics Isabelle Lee et.al. 2410.21353 null
2024-10-28 Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments Sangmim Song et.al. 2410.20666 null
2024-10-25 Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models Danqing Wang et.al. 2410.20007 null
2024-10-25 Can Stories Help LLMs Reason? Curating Information Space Through Narrative Vahid Sadiri Javadi et.al. 2410.19221 null
2024-10-18 Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning Pengfei He et.al. 2410.19000 link
2024-10-25 CLR-Bench: Evaluating Large Language Models in College-level Reasoning Junnan Dong et.al. 2410.17558 null
2024-10-28 Non-myopic Generation of Language Models for Reasoning and Planning Chang Ma et.al. 2410.17195 link
2024-11-06 Improving Causal Reasoning in Large Language Models: A Survey Longxuan Yu et.al. 2410.16676 link
2024-10-22 A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs Ryosuke Sonoda et.al. 2410.16640 null
2024-10-21 Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic Jason Chan et.al. 2410.16502 null
2024-11-27 On Designing Effective RL Reward at Training Time for LLM Reasoning Jiaxuan Gao et.al. 2410.15115 null
2025-01-28 Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning Xingyu Tan et.al. 2410.14211 null
2024-10-21 Unconstrained Model Merging for Enhanced LLM Reasoning Yiming Zhang et.al. 2410.13699 null
2024-10-16 Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models Linhao Luo et.al. 2410.13080 link
2024-10-16 KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs Yongqin Xu et.al. 2410.12480 null
2024-10-17 Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning Qian Wang et.al. 2410.12464 null
2024-10-16 Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up Jiahao Yuan et.al. 2410.12323 link
2024-10-16 Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval Hai-Long Nguyen et.al. 2410.12154 null
2024-10-15 Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming Yilun Hao et.al. 2410.12112 null
2024-10-12 OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models Jun Wang et.al. 2410.09671 null
2024-10-11 P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains Simeng Han et.al. 2410.09207 null
2024-10-11 Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning Yunpeng Gao et.al. 2410.08500 null
2024-10-10 SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation Hang Yin et.al. 2410.08189 null
2024-10-10 Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning Amrith Setlur et.al. 2410.08146 null
2024-10-10 Automatic Curriculum Expert Iteration for Reliable LLM Reasoning Zirui Zhao et.al. 2410.07627 null
2024-10-09 Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis Ahmed Abdullah et.al. 2410.06841 null
2024-10-09 Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning Xiyao Wang et.al. 2410.06508 null
2025-01-02 Filtering Discomforting Recommendations with Large Language Models Jiahao Liu et.al. 2410.05411 null
2024-10-05 Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification Zhenwen Liang et.al. 2410.05318 null
2024-10-06 Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval Pengcheng Jiang et.al. 2410.04585 link
2024-10-03 The Role of Deductive and Inductive Reasoning in Large Language Models Chengkun Cai et.al. 2410.02892 null
2024-10-02 Not All LLM Reasoners Are Created Equal Arian Hosseini et.al. 2410.01748 null
2024-12-25 Interpretable Contrastive Monte Carlo Tree Search Reasoning Zitian Gao et.al. 2410.01707 link
2024-10-02 VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment Amirhossein Kazemnejad et.al. 2410.01679 link
2024-10-02 AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses Xiaotian Lu et.al. 2410.01246 null
2024-10-01 Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness Xiao Peng et.al. 2410.00359 null
2024-10-01 Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis Chun-Hsiao Yeh et.al. 2410.00292 null
2024-10-08 GUNDAM: Aligning Large Language Models with Graph Understanding Sheng Ouyang et.al. 2409.20053 null
2024-09-27 Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs Yanyuan Qiao et.al. 2409.18794 null
2024-10-23 Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning Debargha Ganguly et.al. 2409.17270 null
2024-09-20 CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency Kangsheng Wang et.al. 2409.17174 null
2024-09-20 Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM Zheng Wei Lim et.al. 2409.13949 null
2024-09-19 SituationAdapt: Contextual UI Optimization in Mixed Reality with Situation Awareness via LLM Reasoning Zhipeng Li et.al. 2409.12836 null
2024-10-04 Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning Jiaxin Wen et.al. 2409.12452 link
2024-12-16 Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data Jiaming Zhou et.al. 2409.12437 link
2024-09-18 MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning Justin Chih-Yao Chen et.al. 2409.12147 link
2024-11-05 Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent Fatemeh Haji et.al. 2409.11527 link
2024-09-16 Enhancing RL Safety with Counterfactual LLM Reasoning Dennis Gross et.al. 2409.10188 link
2024-09-11 Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation SeongYeub Chu et.al. 2409.07355 link

(back to top)

LLM Evaluation

Publish Date Title Authors PDF Code
2025-01-30 Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation Muhammed Yusuf Kocyigit et.al. 2501.18771 null
2025-01-31 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation Minghua He et.al. 2501.18460 null
2025-01-25 LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering Beiming Liu et.al. 2501.17183 null
2025-01-28 An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue Koji Inoue et.al. 2501.16643 null
2025-01-26 HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI Tidor-Vlad Pricope et.al. 2501.15627 null
2025-01-23 Question Answering on Patient Medical Records with Private Fine-Tuned LLMs Sara Kothari et.al. 2501.13687 null
2025-01-10 CodEv: An Automated Grading Framework Leveraging Large Language Models for Consistent and Constructive Feedback En-Qi Tseng et.al. 2501.10421 null
2025-01-15 Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History Yevhen Kostiuk et.al. 2501.09154 null
2025-01-13 Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles Samia Touileb et.al. 2501.07718 null
2025-01-03 FLAME: Financial Large-Language Model Assessment and Metrics Evaluation Jiayu Guo et.al. 2501.06211 link
2025-01-07 MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems Yannis Katsis et.al. 2501.03468 link
2025-01-05 Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm Ljubisa Bojic et.al. 2501.02532 null
2025-01-04 LLMzSzŁ: a comprehensive LLM benchmark for Polish Krzysztof Jassem et.al. 2501.02266 null
2025-01-08 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2025-01-04 Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation M. Ali Bayram et.al. 2501.00593 null
2024-12-31 Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs Weijia Xu et.al. 2501.00273 null
2024-12-30 EVOLVE: Emotion and Visual Output Learning via LLM Evaluation Jordan Sinclair et.al. 2412.20632 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli et.al. 2412.18407 link
2025-01-25 DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation Junyi Lu et.al. 2412.18291 null
2024-12-23 CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models Ruibo Tu et.al. 2412.17970 link
2025-01-02 Baichuan4-Finance Technical Report Hanyu Zhang et.al. 2412.15270 null
2024-12-19 ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects Qihang Cao et.al. 2412.14837 null
2024-12-18 AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge Xiaobao Wu et.al. 2412.13670 link
2024-12-18 Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning Eitan Wagner et.al. 2412.13631 null
2024-12-17 OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain Shuting Wang et.al. 2412.13018 link
2024-12-10 How to Choose a Threshold for an Evaluation Metric for Large Language Models Bhaskarjit Sarmah et.al. 2412.12148 null
2024-12-15 Dual Traits in Probabilistic Reasoning of Large Language Models Shenxiong Li et.al. 2412.11009 link
2024-12-30 LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation Eunsu Kim et.al. 2412.10424 null
2024-12-13 Cultural Evolution of Cooperation among LLM Agents Aron Vallinder et.al. 2412.10270 null
2024-12-12 Towards Understanding the Robustness of LLM-based Evaluations under Perturbations Manav Chaudhary et.al. 2412.09269 null
2024-12-10 BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities Sahal Shaji Mullappilly et.al. 2412.07769 link
2024-12-12 PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models Qian Zhang et.al. 2412.06287 link
2024-12-02 AI Benchmarks and Datasets for LLM Evaluation Todor Ivanov et.al. 2412.01020 null
2024-11-30 Evaluating the Consistency of LLM Evaluators Noah Lee et.al. 2412.00543 null
2024-11-29 MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks John Francis et.al. 2411.19689 null
2024-11-29 Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension Ability Yujin Han et.al. 2411.19456 link
2024-11-27 Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator Frederic Kirstein et.al. 2411.18444 null
2025-01-17 CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity Zhengmin Yu et.al. 2411.16239 link
2024-11-25 SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text Reshmi Ghosh et.al. 2411.16077 null
2024-11-26 Do LLMs Agree on the Creativity Evaluation of Alternative Uses? Abdullah Al Rabeyah et.al. 2411.15560 null
2024-11-19 Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat Roland Daynauth et.al. 2411.14483 link
2024-11-21 Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models Lovish Madaan et.al. 2411.14103 null
2024-11-21 An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture Boming Xia et.al. 2411.13768 null
2024-11-21 A Framework for Evaluating LLMs Under Task Indeterminacy Luke Guerdan et.al. 2411.13760 null
2024-11-12 Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning Linyang He et.al. 2411.07533 null
2024-11-13 Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models Yancheng He et.al. 2411.07140 null
2024-11-09 Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Xiaojun Wu et.al. 2411.06272 link
2024-11-16 ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding Israel Abebe Azime et.al. 2411.05049 null
2024-11-07 Bayesian Calibration of Win Rate Estimation with LLM Evaluators Yicheng Gao et.al. 2411.04424 link
2024-11-05 Enhancing LLM Evaluations: The Garbling Trick William F. Bradley et.al. 2411.01533 null
2025-01-31 Mastering the Craft of Data Synthesis for CodeLLMs Meng Chen et.al. 2411.00005 null
2024-10-28 Project MPG: towards a generalized performance benchmark for LLM capabilities Lucas Spangher et.al. 2410.22368 null
2024-10-29 Self-Preference Bias in LLM-as-a-Judge Koki Wataoka et.al. 2410.21819 null
2024-10-28 Unveiling Context-Aware Criteria in Self-Assessing LLMs Taneesh Gupta et.al. 2410.21545 null
2024-10-27 LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization Jui-Nan Yen et.al. 2410.20625 null
2024-10-26 Limitations of the LLM-as-a-Judge Approach for Evaluating LLM Outputs in Expert Knowledge Tasks Annalisa Szymanski et.al. 2410.20266 null
2024-10-23 MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning Jingfan Zhang et.al. 2410.18035 null
2025-01-30 Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements Isamu Isozaki et.al. 2410.17141 link
2024-10-21 CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Maosong Cao et.al. 2410.16256 link
2025-01-26 mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code Generation Nishat Raihan et.al. 2410.15037 link
2024-10-19 CAP: Data Contamination Detection via Consistency Amplification Yi Zhao et.al. 2410.15005 null
2024-10-18 Enabling Scalable Evaluation of Bias Patterns in Medical LLMs Hamed Fayyaz et.al. 2410.14763 link
2024-11-06 Diverging Preferences: When do Annotators Disagree and do Models Know? Michael JQ Zhang et.al. 2410.14632 null
2024-10-18 Combining Entropy and Matrix Nuclear Norm for Enhanced Evaluation of Language Models James Vo et.al. 2410.14480 null
2024-10-21 BenTo: Benchmark Task Reduction with In-Context Transferability Hongyu Zhao et.al. 2410.13804 link
2024-10-16 BenchmarkCards: Large Language Model and Risk Reporting Anna Sokol et.al. 2410.12974 null
2024-12-29 Language Model Preference Evaluation with Multiple Weak Evaluators Zhengyu Hu et.al. 2410.12869 link
2024-10-11 Enterprise Benchmarks for Large Language Model Evaluation Bing Zhang et.al. 2410.12857 link
2024-10-16 An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation Junjie Chen et.al. 2410.12265 null
2024-10-15 Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers Lorenzo Pacchiardi et.al. 2410.11672 link
2024-10-15 Black-box Uncertainty Quantification Method for LLM-as-a-Judge Nico Wagner et.al. 2410.11594 null
2024-10-14 Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting Yifan Luo et.al. 2410.10150 null
2024-12-13 HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics Jingxuan Fan et.al. 2410.09988 link
2024-10-15 LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models Han Qiu et.al. 2410.09962 link
2024-10-17 Towards Multilingual LLM Evaluation for European Languages Klaudia Thellmann et.al. 2410.08928 null
2024-10-11 Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example Marcus Kessel et.al. 2410.08911 null
2024-10-10 Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks Mathis Pink et.al. 2410.08133 null
2024-10-10 COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act Philipp Guldimann et.al. 2410.07959 null
2024-11-06 News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News Tarun Jain et.al. 2410.07520 null
2024-10-09 Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates Xiaosen Zheng et.al. 2410.07137 link
2024-10-09 ReIFE: Re-evaluating Instruction-Following Evaluation Yixin Liu et.al. 2410.07069 link
2024-10-08 Active Evaluation Acquisition for Efficient LLM Benchmarking Yang Li et.al. 2410.05952 null
2024-10-07 TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles Qingchen Yu et.al. 2410.05262 link
2024-10-01 Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model Aidan Gilson et.al. 2410.03740 null
2024-10-04 TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation Jonathan Cook et.al. 2410.03608 null
2024-10-04 Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores Robert E. Blackwell et.al. 2410.03492 null
2024-10-29 AIME: AI System Optimization via Multiple LLM Evaluators Bhrij Patel et.al. 2410.03131 null
2024-10-02 Comparing Criteria Development Across Domain Experts, Lay Users, and Models in Large Language Model Evaluation Annalisa Szymanski et.al. 2410.02054 null
2024-10-02 Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models Joseph Lee et.al. 2410.01795 link
2024-10-03 Extending Context Window of Large Language Models from a Distributional Perspective Yingsheng Wu et.al. 2410.01490 null
2024-10-02 ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving Yifan Qiao et.al. 2410.01228 null
2024-10-01 ViDAS: Vision-based Danger Assessment and Scoring Pranav Gupta et.al. 2410.00477 null
2024-10-01 PclGPT: A Large Language Model for Patronizing and Condescending Language Detection Hongbo Wang et.al. 2410.00361 link
2024-11-26 LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models Haitao Li et.al. 2409.20288 link
2024-09-29 Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems Xuyang Wu et.al. 2409.19804 null
2024-10-19 Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models Xin Li et.al. 2409.19667 link
2024-10-05 IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation Fan Lin et.al. 2409.18892 link
2024-12-13 A Character-Centric Creative Story Generation via Imagination Kyeongman Park et.al. 2409.16667 null
2024-09-25 Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models Sungjune Park et.al. 2409.16635 null
2024-12-18 Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino Jann Railey Montalan et.al. 2409.15380 link
2024-12-16 MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators Qingyu Lu et.al. 2409.14335 link
2024-09-21 ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models Yuqing Huang et.al. 2409.13989 link
2024-12-17 AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs Basel Mousi et.al. 2409.11404 null
2024-10-02 LLM-as-a-Judge & Reward Model: What They Can and Cannot Do Guijin Son et.al. 2409.11239 null
2024-12-08 Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges Vinay Samuel et.al. 2409.09927 link
2024-09-13 Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia Fajri Koto et.al. 2409.08564 null
2024-09-09 Assessing SPARQL capabilities of Large Language Models Lars-Peter Meyer et.al. 2409.05925 link
2024-10-08 LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs Yuhao Wu et.al. 2409.02076 link
2024-10-14 Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation Jasper Dekoninck et.al. 2409.00696 null
2024-08-26 Evaluating ChatGPT on Nuclear Domain-Specific Data Muhammad Anwar et.al. 2409.00090 null
2024-08-28 LLMSecCode: Evaluating Large Language Models for Secure Coding Anton Rydén et.al. 2408.16100 link
2024-08-26 LLM-3D Print: Large Language Models To Monitor and Control 3D Printing Yayati Jadhav et.al. 2408.14307 null
2024-08-26 Epidemic Information Extraction for Event-Based Surveillance using Large Language Models Sergio Consoli et.al. 2408.14277 null
2024-10-04 MobileQuant: Mobile-friendly Quantization for On-device Language Models Fuwen Tan et.al. 2408.13933 link
2024-08-23 LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models Chongyan Sun et.al. 2408.13338 null
2024-08-23 Open Llama2 Model for the Lithuanian Language Artūras Nakvosas et.al. 2408.12963 null
2024-08-23 LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction Songwei Li et.al. 2408.12832 link
2024-12-20 Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts Jiaqing Liu et.al. 2408.09688 null
2024-08-20 Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge Ravi Raju et.al. 2408.08808 null
2024-10-16 The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation Samee Arif et.al. 2408.08688 link
2024-10-19 Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks Junseok Kim et.al. 2408.08631 null

(back to top)

LLM MLLM

Publish Date Title Authors PDF Code
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 link
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 null
2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 null
2025-01-31 Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models Wenzhi Fang et.al. 2501.19389 null
2025-01-31 SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions Dominik Wagner et.al. 2501.19377 null
2025-01-31 Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions Sören Christensen et.al. 2501.19373 null
2025-01-31 We're Different, We're the Same: Creative Homogeneity Across LLMs Emily Wenger et.al. 2501.19361 null
2025-01-31 Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies Brandon P. Chelstrom et.al. 2501.19359 null
2025-01-31 The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking Yuchun Miao et.al. 2501.19358 null
2025-01-31 Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters Adrián Juan-Delgado et.al. 2501.19356 null
2025-01-31 Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 Ting-Yao E. Hsu et.al. 2501.19353 null
2025-01-31 Towards Adaptive Self-Improvement for Smarter Energy Systems Alexander Sommer et.al. 2501.19340 null
2025-01-31 PixelWorld: Towards Perceiving Everything as Pixels Zhiheng Lyu et.al. 2501.19339 null
2025-01-31 Homogeneity Bias as Differential Sampling Uncertainty in Language Models Messi H. J. Lee et.al. 2501.19337 null
2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 LLM-based Affective Text Generation Quality Based on Different Quantization Values Yarik Menchaca Resendiz et.al. 2501.19317 null
2025-01-31 Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment Gregor Bachmann et.al. 2501.19309 null
2025-01-31 SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling Jiefeng Chen et.al. 2501.19306 null
2025-01-31 Beyond checkmate: exploring the creative chokepoints in AI text Nafis Irtiza Tripto et.al. 2501.19301 link
2025-01-31 Offline Learning for Combinatorial Multi-armed Bandits Xutong Liu et.al. 2501.19300 null
2025-01-31 Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes Zhiyao Xu et.al. 2501.19298 null
2025-01-31 Analysis of LLMs vs Human Experts in Requirements Engineering Cory Hymel et.al. 2501.19297 null
2025-01-31 Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators Kunpeng Zhang et.al. 2501.19282 null
2025-01-31 Pheromone-based Learning of Optimal Reasoning Paths Anirudh Chari et.al. 2501.19278 null
2025-01-31 From Assistance to Autonomy -- A Researcher Study on the Potential of AI Support for Qualitative Data Analysis Elisabeth Kirsten et.al. 2501.19275 null
2025-01-31 Jackpot! Alignment as a Maximal Lottery Roberto-Rafael Maura-Rivero et.al. 2501.19266 null
2025-01-31 Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge Amogh Joshi et.al. 2501.19259 null
2025-01-31 A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation Yunzhe Li et.al. 2501.19232 null
2025-01-31 Autonomous Legacy Web Application Upgrades Using a Multi-Agent System Valtteri Ala-Salmi et.al. 2501.19204 null
2025-01-31 Improving the Robustness of Representation Misdirection for Large Language Model Unlearning Dang Huu-Tien et.al. 2501.19202 null
2025-01-31 Efficient Reasoning with Hidden Thinking Xuan Shen et.al. 2501.19201 link
2025-01-31 Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning Xianglin Yang et.al. 2501.19180 null
2025-01-31 No Foundations without Foundations -- Why semi-mechanistic models are essential for regulatory biology Luka Kovačević et.al. 2501.19178 null
2025-01-31 Position: Contextual Integrity Washing for Language Models Yan Shvartzshnaider et.al. 2501.19173 null
2025-01-31 Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs Kejia Zhang et.al. 2501.19164 null
2025-01-31 A theoretical framework for overfitting in energy-based modeling Giovanni Catania et.al. 2501.19158 null
2025-01-31 A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator Sixiao Huang et.al. 2501.19135 null
2025-01-31 Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations Sihwan Park et.al. 2501.19099 null
2025-01-31 Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data Xichen Xu et.al. 2501.19094 null
2025-01-31 Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models Jialin Zhao et.al. 2501.19090 null
2025-01-31 Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification Xiangyu Sun et.al. 2501.19086 null
2025-01-31 Enhancing Code Generation for Low-Resource Languages: No Silver Bullet Alessandro Giagnorio et.al. 2501.19085 null
2025-01-31 Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations Dahye Kim et.al. 2501.19066 link
2025-01-31 TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs Yan Sun et.al. 2501.19057 null
2025-01-31 Enabling Autonomic Microservice Management through Self-Learning Agents Fenglin Yu et.al. 2501.19056 null
2025-01-31 Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models Ruiyu Wang et.al. 2501.19054 null
2025-01-31 Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors Simon Idoko et.al. 2501.19042 link
2025-01-31 Towards the Worst-case Robustness of Large Language Models Huanran Chen et.al. 2501.19040 null
2025-01-31 Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs Hongliang Li et.al. 2501.19036 null
2025-01-31 XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses Bo Lan et.al. 2501.19034 link
2025-01-31 Multilayer Networks in Neuroimaging Vesna Vuksanovic et.al. 2501.19024 null
2025-01-31 Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation Bin Zhu et.al. 2501.19017 null
2025-01-31 Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities Arjun Krishna et.al. 2501.19012 null
2025-01-31 Visual Autoregressive Modeling for Image Super-Resolution Yunpeng Qu et.al. 2501.18993 null
2025-01-31 Symmetric Pruning of Large Language Models Kai Yi et.al. 2501.18980 null
2025-01-31 BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics Yuxuan Liu et.al. 2501.18972 null
2025-01-31 Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping Pu Yang et.al. 2501.18962 null
2025-01-31 Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow Alfred Bexley et.al. 2501.18957 null
2025-01-31 LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models Shenghao Fu et.al. 2501.18954 link
2025-01-31 TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment Zi-Jian Cheng et.al. 2501.18935 link
2025-01-31 Language Games as the Pathway to Artificial Superhuman Intelligence Ying Wen et.al. 2501.18924 null
2025-01-31 KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search Haoran Luo et.al. 2501.18922 link
2025-01-31 LLM Program Optimization via Retrieval Augmented Search Sagnik Anupam et.al. 2501.18916 null
2025-01-31 Scaling Laws for Differentially Private Language Models Ryan McKenna et.al. 2501.18914 null
2025-01-31 Streamlining Security Vulnerability Triage with Large Language Models Mohammad Jalili Torkamani et.al. 2501.18908 null
2025-01-31 Trustworthy Evaluation of Generative AI Models Zijun Gao et.al. 2501.18897 null
2025-01-31 Can We Predict the Effect of Prompts? Jae Yong Lee et.al. 2501.18883 null
2025-01-31 Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models Jiaqi Tang et.al. 2501.18863 null
2025-01-31 BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning Han Zhong et.al. 2501.18858 null
2025-01-31 Equivariant Hypergraph Diffusion for Crystal Structure Prediction Yang Liu et.al. 2501.18850 null
2025-01-31 Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities Yaping Chai et.al. 2501.18845 null
2025-01-31 Trading Inference-Time Compute for Adversarial Robustness Wojciech Zaremba et.al. 2501.18841 null
2025-01-31 Partially Rewriting a Transformer in Natural Language Gonçalo Paulo et.al. 2501.18838 null
2025-01-31 Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming Mrinank Sharma et.al. 2501.18837 null
2025-01-31 Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential Chenyu Gao et.al. 2501.18834 null
2025-01-31 Structural Embedding Projection for Contextual Large Language Model Inference Vincent Enoasmo et.al. 2501.18826 null
2025-01-31 Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies Andrey Borro et.al. 2501.18817 link
2025-01-31 Large Language Models as Common-Sense Heuristics Andrey Borro et.al. 2501.18816 null
2025-01-30 Compositional Generalization Requires More Than Disentangled Representations Qiyao Liang et.al. 2501.18797 null
2025-01-30 Rope to Nope and Back Again: A New Hybrid Attention Strategy Bowen Yang et.al. 2501.18795 null
2025-01-30 Survey and Improvement Strategies for Gene Prioritization with Large Language Models Matthew Neeley et.al. 2501.18794 null
2025-01-30 LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore? Alexander Tuisov et.al. 2501.18784 null
2025-01-30 Navigating the Fragrance space Via Graph Generative Models And Predicting Odors Mrityunjay Sharma et.al. 2501.18777 link
2025-01-30 Probabilistic Joint Recovery Method for CO $_2$ Plume Monitoring Zijun Deng et.al. 2501.18761 null
2025-01-30 Synthetic Data Generation for Augmenting Small Samples Dan Liu et.al. 2501.18741 null
2025-01-30 Examining the Robustness of Large Language Models across Language Complexity Jiayi Zhang et.al. 2501.18738 null
2025-01-30 Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks Mohd. Farhan Israk Soumik et.al. 2501.18727 null
2025-01-30 Strong and Controllable 3D Motion Generation Canxuan Gang et.al. 2501.18726 null
2025-01-30 Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning Maya Kruse et.al. 2501.18724 null
2025-01-30 Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps Devansh Bhardwaj et.al. 2501.18712 null
2025-01-30 Regularized second-order optimization of tensor-network Born machines Matan Ben-Dov et.al. 2501.18691 null
2025-01-30 Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting Yansong Qu et.al. 2501.18672 null
2025-01-30 Foundational Models for 3D Point Clouds: A Survey and Outlook Vishal Thengane et.al. 2501.18594 null
2025-01-30 Diffusion Autoencoders are Scalable Image Tokenizers Yinbo Chen et.al. 2501.18593 null
2025-01-30 Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Hao Dong et.al. 2501.18592 link
2025-01-30 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Yue Wang et.al. 2501.18585 null
2025-01-30 Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH Evgenii Evstafev et.al. 2501.18576 null
2025-01-30 BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos Lehao Lin et.al. 2501.18565 null
2025-01-30 SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Haoquan Fang et.al. 2501.18564 null
2025-01-30 Semantic Web and Creative AI -- A Technical Report from ISWS 2023 Raia Abu Ahmad et.al. 2501.18542 null
2025-01-30 Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges Manveer Singh Tamber et.al. 2501.18536 link
2025-01-30 Differentially Private Steering for Large Language Model Alignment Anmol Goel et.al. 2501.18532 link
2025-01-30 Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models Guanqun Cao et.al. 2501.18516 null
2025-01-30 Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Arthur Douillard et.al. 2501.18512 null
2025-01-30 WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Benjamin Feuer et.al. 2501.18511 link
2025-01-30 CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction Peter J. Bentley et.al. 2501.18504 null
2025-01-30 Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline Shivani Kapania et.al. 2501.18493 null
2025-01-30 A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models Changshu Liu et.al. 2501.18482 null
2025-01-30 CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization Yanxia Deng et.al. 2501.18475 null
2025-01-30 Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations Chengxi Zeng et.al. 2501.18474 null
2025-01-30 ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation Minghua He et.al. 2501.18460 null
2025-01-30 CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering Yumeng Wang et.al. 2501.18457 null
2025-01-30 GENIE: Generative Note Information Extraction model for structuring EHR data Huaiyuan Ying et.al. 2501.18435 null
2025-01-30 Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation Youngjoon Lee et.al. 2501.18416 null
2025-01-30 RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects Yiteng Tu et.al. 2501.18365 link
2025-01-30 A Video-grounded Dialogue Dataset and Metric for Event-driven Activities Wiradee Imrattanatrai et.al. 2501.18324 link
2025-01-30 Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach Tianpeng Pan et.al. 2501.18320 null
2025-01-30 Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models Jennifer D'Souza et.al. 2501.18287 null
2025-01-30 Jailbreaking LLMs' Safeguard with Universal Magic Words for Text Embedding Models Haoyu Liang et.al. 2501.18280 null
2025-01-30 Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence Kevin Roitero et.al. 2501.18265 null
2025-01-30 How to Select Datapoints for Efficient Human Evaluation of NLG Models? Vilém Zouhar et.al. 2501.18251 link
2025-01-30 Statistical multi-metric evaluation and visualization of LLM system predictive performance Samuel Ackerman et.al. 2501.18243 null
2025-01-30 Contextually Structured Token Dependency Encoding for Large Language Models James Blades et.al. 2501.18205 null
2025-01-30 Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents ShuiDe Wen et.al. 2501.18190 null
2025-01-30 Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation Teddy Lazebnik et.al. 2501.18177 null
2025-01-30 Continually Evolved Multimodal Foundation Models for Cancer Prognosis Jie Peng et.al. 2501.18170 null
2025-01-30 RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing Jinyao Guo et.al. 2501.18160 null
2025-01-30 Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study Yuchen Lei et.al. 2501.18158 null
2025-01-30 Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models Wanlong Liu et.al. 2501.18154 null
2025-01-30 Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Qika Lin et.al. 2501.18119 null
2025-01-30 Scaling Inference-Efficient Language Models Song Bian et.al. 2501.18107 null
2025-01-30 Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation Yibo Wang et.al. 2501.18100 link
2025-01-30 AlphaAdam:Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates Da Chang et.al. 2501.18094 null
2025-01-30 Normative Evaluation of Large Language Models with Everyday Moral Dilemmas Pratik S. Sachdeva et.al. 2501.18081 null
2025-01-30 FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models Spencer Mateega et.al. 2501.18062 null
2025-01-29 RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems Duy A. Nguyen et.al. 2501.18056 null
2025-01-29 Current Pathology Foundation Models are unrobust to Medical Center Differences Edwin D. de Jong et.al. 2501.18055 null
2025-01-29 A Proximal Operator for Inducing 2:4-Sparsity Jonas M Kübler et.al. 2501.18015 null
2025-01-29 Large Language Models Think Too Fast To Explore Effectively Lan Pan et.al. 2501.18009 null
2025-01-29 Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces Neetha Jambigi et.al. 2501.18005 null
2025-01-29 InnerThoughts: Disentangling Representations and Predictions in Large Language Models Didier Chételat et.al. 2501.17994 null
2025-01-29 Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study Marwah Alaofi et.al. 2501.17981 link
2025-01-29 Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization Zishun Yu et.al. 2501.17974 null
2025-01-29 "I Would Never Trust Anything Western": Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools Manas Mhasakar et.al. 2501.17942 null
2025-01-29 DReSS: Data-driven Regularized Structured Streamlining for Large Language Models Mingkuan Feng et.al. 2501.17905 null
2025-01-29 Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning? Pouya Pezeshkpour et.al. 2501.17840 link
2025-01-29 Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology Sobhan Hemati et.al. 2501.17822 null
2025-01-30 Leveraging Multimodal LLM for Inspirational User Interface Search Seokhyeon Park et.al. 2501.17799 link
2025-01-29 BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights Chan-Jan Hsu et.al. 2501.17790 null
2025-01-29 AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing Peter Pak et.al. 2501.17784 null
2025-01-29 2SSP: A Two-Stage Framework for Structured Pruning of LLMs Fabrizio Sandri et.al. 2501.17771 link
2025-01-29 Generative Unordered Flow for Set-Structured Data Generation Yangming Li et.al. 2501.17770 null
2025-01-29 Hybrid Graphs for Table-and-Text based Question Answering using LLMs Ankush Agarwal et.al. 2501.17767 null
2025-01-29 On the Partitioning of GPU Power among Multi-Instances Tirth Vamja et.al. 2501.17752 null
2025-01-29 Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Aitor Arrieta et.al. 2501.17749 null
2025-01-29 A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches Ana R. Baião et.al. 2501.17729 null
2025-01-29 Using Code Generation to Solve Open Instances of Combinatorial Design Problems Christopher D. Rosin et.al. 2501.17725 link
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 link
2025-01-29 Source-Channel Separation Theorems for Distortion Perception Coding Chao Tian et.al. 2501.17706 null
2025-01-29 Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching Xuzhe Dang et.al. 2501.17665 null
2025-01-30 In-Context Meta LoRA Generation Yihua Shao et.al. 2501.17635 null
2025-01-29 Uncertainty Quantification and Decomposition for LLM-based Recommendation Wonbin Kweon et.al. 2501.17630 link
2025-01-29 The Imitation Game According To Turing Sharon Temtsin et.al. 2501.17629 null
2025-01-29 Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment Jonathan Teel et.al. 2501.17617 null
2025-01-29 Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis Kunrong Li et.al. 2501.17598 null
2025-01-30 Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models Behraj Khan et.al. 2501.17595 null
2025-01-29 GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback Mohamed Abdelaal et.al. 2501.17584 null
2025-01-29 CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs Amey Hengle et.al. 2501.17581 null
2025-01-29 Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding Marco Pasini et.al. 2501.17578 null
2025-01-29 Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models Wooyoung Kim et.al. 2501.17549 null
2025-01-29 Towards Training-Free Open-World Classification with 3D Generative Models Xinzhe Xia et.al. 2501.17547 null
2025-01-29 Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant Gaole He et.al. 2501.17546 link
2025-01-29 Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison Martin Nizon-Deladoeuille et.al. 2501.17539 null
2025-01-29 Neural Spelling: A Spell-Based BCI System for Language Neural Decoding Xiaowei Jiang et.al. 2501.17489 null
2025-01-29 DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance Seffi Cohen et.al. 2501.17479 link
2025-01-29 AugmenTest: Enhancing Tests with LLM-Driven Oracles Shaker Mahmud Khandaker et.al. 2501.17461 null
2025-01-29 Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction Kaiwei Luo et.al. 2501.17459 null
2025-01-29 Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Tiansheng Huang et.al. 2501.17433 link
2025-01-29 Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models Yuxuan Li et.al. 2501.17420 null
2025-01-29 MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs Ved Sirdeshmukh et.al. 2501.17399 link
2025-01-29 Learning Free Token Reduction for Multi-Modal LLM Zihui Zhao et.al. 2501.17391 null
2025-01-29 Context-Aware Semantic Recomposition Mechanism for Large Language Models Richard Katrix et.al. 2501.17386 null
2025-01-28 Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations Md Tauhidul Islam et.al. 2501.17347 null
2025-01-28 Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction Mingyu Derek Ma et.al. 2501.17326 null
2025-01-28 CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data Lee Carlin et.al. 2501.17324 null
2025-01-30 Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding Yun-Shiuan Chuang et.al. 2501.17310 null
2025-01-28 "Ownership, Not Just Happy Talk": Co-Designing a Participatory Large Language Model for Journalism Emily Tseng et.al. 2501.17299 null
2025-01-28 Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization Zilu Tang et.al. 2501.17295 null
2025-01-28 Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology Peilong Wang et.al. 2501.17286 null
2025-01-30 From Natural Language to Extensive-Form Game Representations Shilong Deng et.al. 2501.17282 link
2025-01-28 Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models Abdalaziz Al-Maeeni et.al. 2501.17279 null
2025-01-28 Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics Jasper Timm et.al. 2501.17273 link
2025-01-28 Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care Fengpei Yuan et.al. 2501.17206 null
2025-01-28 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Tianzhe Chu et.al. 2501.17161 null
2025-01-28 FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data Deren Lei et.al. 2501.17144 link
2025-01-28 ASTRAL: Automated Safety Testing of Large Language Models Miriam Ugarte et.al. 2501.17132 null
2025-01-28 Optimizing Large Language Model Training Using FP4 Quantization Ruizhe Wang et.al. 2501.17116 null
2025-01-28 Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction Carl-Leander Henneking et.al. 2501.17112 null
2025-01-28 Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics Guillaume Le Mailloux et.al. 2501.17107 link
2025-01-28 Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving Evgenii Evstafev et.al. 2501.17084 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models Minghan Li et.al. 2501.17039 null
2025-01-28 Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Manojkumar Parmar et.al. 2501.17030 null
2025-01-28 Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs Alessandro Midolo et.al. 2501.17024 link
2025-01-28 Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement Kei Katsumata et.al. 2501.17022 link
2025-01-28 MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition Philippe Pasquier et.al. 2501.17011 null
2025-01-28 Large Language Models for Code Generation: The Practitioners Perspective Zeeshan Rasheed et.al. 2501.16998 link
2025-01-28 Artificial Intelligence Clones Annie Liang et.al. 2501.16996 null
2025-01-28 FedEFM: Federated Endovascular Foundation Model with Unseen Data Tuong Do et.al. 2501.16992 null
2025-01-28 Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver Shunya Minami et.al. 2501.16986 null
2025-01-28 Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Hongzhi Huang et.al. 2501.16975 null
2025-01-28 Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers Mohammad Raza et.al. 2501.16961 null
2025-01-28 Multiple Abstraction Level Retrieve Augment Generation Zheng Zheng et.al. 2501.16952 null
2025-01-29 TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Makoto Shing et.al. 2501.16937 null
2025-01-28 Detecting harassment and defamation in cyberbullying with emotion-adaptive training Peiling Yi et.al. 2501.16925 link
2025-01-28 RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains Shady Nasrat et.al. 2501.16899 link
2025-01-28 Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model Eoghan Cronin et.al. 2501.16893 null
2025-01-28 Irony Detection, Reasoning and Understanding in Zero-shot Learning Peiling Yi et.al. 2501.16884 null
2025-01-28 Comparing Human and LLM Generated Code: The Jury is Still Out! Sherlock A. Licorish et.al. 2501.16857 null
2025-01-28 Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis Tiao Tan et.al. 2501.16842 null
2025-01-28 Misspellings in Natural Language Processing: A survey Gianluca Sperduti et.al. 2501.16836 null
2025-01-28 DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model Josua Spisak et.al. 2501.16800 null
2025-01-28 Algorithm for Automatic Legislative Text Consolidation Matias Etcheverry et.al. 2501.16794 null
2025-01-28 Exponential Family Attention Kevin Christian Wibisono et.al. 2501.16790 link
2025-01-28 Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding Yun Li et.al. 2501.16786 null
2025-01-28 TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network Yumingzhi Pan et.al. 2501.16784 null
2025-01-28 A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process Jack David Carson et.al. 2501.16783 null
2025-01-29 Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models Muhammad Atta ur Rahman et.al. 2501.16769 null
2025-01-28 DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation Chenguo Lin et.al. 2501.16764 null
2025-01-28 HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns Xinyue Shen et.al. 2501.16750 link
2025-01-28 Through the Prism of Culture: Evaluating LLMs' Understanding of Indian Subcultures and Traditions Garima Chhikara et.al. 2501.16748 null
2025-01-28 LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience Nimesh Jha et.al. 2501.16744 null
2025-01-28 Distilling Large Language Models for Network Active Queue Management Deol Satish et.al. 2501.16734 null
2025-01-28 xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking Sunbowen Lee et.al. 2501.16727 link
2025-01-28 One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning Chunpeng Zhou et.al. 2501.16720 null
2025-01-28 Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection Hengzhuang Li et.al. 2501.16718 link
2025-01-28 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Yueen Ma et.al. 2501.16698 null
2025-01-28 MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark Dongyi Yi et.al. 2501.16688 null
2025-01-28 Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting Li Yin et.al. 2501.16673 link
2025-01-28 VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records Philip Chung et.al. 2501.16672 link
2025-01-28 Contextual Reinforcement in Multimodal Token Compression for Large Language Models Naderdel Piero et.al. 2501.16658 null
2025-01-28 Large Language Model Critics for Execution-Free Evaluation of Code Changes Aashish Yadavally et.al. 2501.16655 link
2025-01-28 Molecular-driven Foundation Model for Oncologic Pathology Anurag Vaidya et.al. 2501.16652 null
2025-01-28 DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models Zeping Min et.al. 2501.16650 null
2025-01-28 An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue Koji Inoue et.al. 2501.16643 null
2025-01-28 CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs Jinlan Fu et.al. 2501.16629 link
2025-01-28 Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems Baraa Hikal et.al. 2501.16616 null
2025-01-28 Sparse Autoencoders Trained on the Same Data Learn Different Features Gonçalo Paulo et.al. 2501.16615 null
2025-01-28 Fine-Tuned Language Models as Space Systems Controllers Enrico M. Zucchelli et.al. 2501.16588 null
2025-01-27 AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models Zheng Lian et.al. 2501.16566 null
2025-01-27 LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation Farzad Farhadzadeh et.al. 2501.16559 null
2025-01-27 Distributional Information Embedding: A Framework for Multi-bit Watermarking Haiyun He et.al. 2501.16558 null
2025-01-27 PackDiT: Joint Human Motion and Text Generation via Mutual Prompting Zhongyu Jiang et.al. 2501.16551 null
2025-01-27 PhysAnimator: Physics-Guided Generative Cartoon Animation Tianyi Xie et.al. 2501.16550 null
2025-01-27 Sample-Efficient Behavior Cloning Using General Domain Knowledge Feiyu Zhu et.al. 2501.16546 null
2025-01-27 Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees Piyush Gupta et.al. 2501.16539 null
2025-01-27 Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs Jean-Charles Noirot Ferrand et.al. 2501.16534 null
2025-01-27 A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain Jorge del Pozo Lérida et.al. 2501.16533 null
2025-01-27 Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction Atharva Naik et.al. 2501.16524 null
2025-01-27 How well can LLMs Grade Essays in Arabic? Rayed Ghazawi et.al. 2501.16516 null
2025-01-27 Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models Sudarshan Kamath Barkur et.al. 2501.16513 null
2025-01-27 Smoothed Embeddings for Robust Language Models Ryo Hase et.al. 2501.16497 null
2025-01-27 Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations Pablo Valenzuela-Toledo et.al. 2501.16495 null
2025-01-27 Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM Payal Kamboj et.al. 2501.16481 link
2025-01-27 Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Philip Hughes et.al. 2501.16467 null
2025-01-27 CoCoNUT: Structural Code Understanding does not fall out of a tree Claas Beger et.al. 2501.16456 link
2025-01-27 Detecting Zero-Day Attacks in Digital Substations via In-Context Learning Faizan Manzoor et.al. 2501.16453 null
2025-01-27 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation Hamed Firooz et.al. 2501.16450 null
2025-01-27 DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Han Sun et.al. 2501.16410 null
2025-01-27 Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology Meiyun Cao et.al. 2501.16309 null
2025-01-27 RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval Long Nguyen et.al. 2501.16303 null
2025-01-27 Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width Zheng Liu et.al. 2501.16302 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-27 FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers Renshan Zhang et.al. 2501.16297 null
2025-01-27 Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models Jing Zhang et.al. 2501.16282 null
2025-01-27 Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation Jiayi Hong et.al. 2501.16277 link
2025-01-27 URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT Long Nguyen et.al. 2501.16276 null
2025-01-27 A foundation model for human-AI collaboration in medical literature mining Zifeng Wang et.al. 2501.16255 null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 null
2025-01-27 Zero-Shot Decision Tree Construction via Large Language Models Lucas Carrasco et.al. 2501.16247 null
2025-01-27 CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation Xiaochuan Ma et.al. 2501.16246 null
2025-01-27 Phase Transitions in Large Language Models and the $O(N)$ Model Youran Sun et.al. 2501.16241 null
2025-01-27 AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses Runze Cai et.al. 2501.16240 null
2025-01-28 Distilling foundation models for robust and efficient models in digital pathology Alexandre Filiot et.al. 2501.16239 null
2025-01-27 Language-Based Bayesian Optimization Research Assistant (BORA) Abdoulatif Cissé et.al. 2501.16224 null
2025-01-27 Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models Huayu Li et.al. 2501.16215 link
2025-01-27 Provence: efficient and robust context pruning for retrieval-augmented generation Nadezhda Chirkova et.al. 2501.16214 null
2025-01-27 Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs Antony Bartlett et.al. 2501.16191 null
2025-01-27 SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting Wenxuan Xie et.al. 2501.16178 link
2025-01-27 BAG: Body-Aligned 3D Wearable Asset Generation Zhongjin Luo et.al. 2501.16177 null
2025-01-27 Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma Richard Willis et.al. 2501.16173 link
2025-01-27 MetaDecorator: Generating Immersive Virtual Tours through Multimodality Shuang Xie et.al. 2501.16164 null
2025-01-27 CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge Yuwei Zhang et.al. 2501.16155 null
2025-01-27 AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought Xin Huang et.al. 2501.16154 null
2025-01-27 AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants Pascal J. Sager et.al. 2501.16150 null
2025-01-27 PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing Yuwei Zhang et.al. 2501.16149 null
2025-01-27 SampleLLM: Optimizing Tabular Data Synthesis in Recommendations Jingtong Gao et.al. 2501.16125 null
2025-01-27 Using Generative Models to Produce Realistic Populations of UK Windstorms Yee Chun Tsoi et.al. 2501.16110 null
2025-01-27 Integration of LLM Quality Assurance into an NLG System Ching-Yi Chen et.al. 2501.16078 null
2025-01-27 PISCO: Pretty Simple Compression for Retrieval-Augmented Generation Maxime Louis et.al. 2501.16075 null
2025-01-27 A generative material transformer using Wyckoff representation Pierre-Paul De Breuck et.al. 2501.16051 null
2025-01-27 Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation Xing Zhang et.al. 2501.16050 null
2025-01-27 PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment Vincent Freiberger et.al. 2501.16033 null
2025-01-27 FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments Zhiyuan Fu et.al. 2501.16029 null
2025-01-27 Transformability reveals the interplay of dynamics across different network orders Ming Xie et.al. 2501.16016 null
2025-01-27 TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference Jack Min Ong et.al. 2501.16007 null
2025-01-27 EDSep: An Effective Diffusion-Based Method for Speech Source Separation Jinwei Dong et.al. 2501.15965 null
2025-01-27 Rethinking the Bias of Foundation Model under Long-tailed Distribution Jiahao Chen et.al. 2501.15955 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-27 TimeHF: Billion-Scale Time Series Models Guided by Human Feedback Yongzhi Qi et.al. 2501.15942 null
2025-01-27 SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub Benjamin C. Carter et.al. 2501.15922 null
2025-01-27 Parametric Retrieval Augmented Generation Weihang Su et.al. 2501.15915 link
2025-01-27 Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation Muhammad Taha Tariq et.al. 2501.15901 null
2025-01-27 Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects Victor Deng et.al. 2501.15900 null
2025-01-27 Adaptive Width Neural Networks Federico Errica et.al. 2501.15889 null
2025-01-27 LCTG Bench: LLM Controlled Text Generation Benchmark Kentaro Kurihara et.al. 2501.15875 link
2025-01-27 LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models Yuewen Mei et.al. 2501.15850 null
2025-01-27 SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model Delin Qu et.al. 2501.15830 null
2025-01-27 Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference Tharindu B. Hewage et.al. 2501.15829 link
2025-01-27 MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer Qi Chen et.al. 2501.15826 null
2025-01-27 LemmaHead: RAG Assisted Proof Generation Using Large Language Models Tianbo Yang et.al. 2501.15797 null
2025-01-27 Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? Zhiling Chen et.al. 2501.15795 null
2025-01-27 Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs Yu Li et.al. 2501.15791 link
2025-01-27 Memorization and Regularization in Generative Diffusion Models Ricardo Baptista et.al. 2501.15785 link
2025-01-27 Large Language Models to Diffusion Finetuning Edoardo Cetin et.al. 2501.15781 null
2025-01-27 Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages Ivory Yang et.al. 2501.15773 link
2025-01-27 GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design Yuanfu Sun et.al. 2501.15755 null
2025-01-27 IndicMMLU-Pro: Benchmarking the Indic Large Language Models Sankalp KJ et.al. 2501.15747 null
2025-01-27 Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning Michael Xieyang Liu et.al. 2501.15727 null
2025-01-27 A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks Dong Li et.al. 2501.15724 null
2025-01-27 On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models Tadahiro Taniguchi et.al. 2501.15721 null
2025-01-26 Adapting Biomedical Abstracts into Plain language using Large Language Models Haritha Gangavarapu et.al. 2501.15700 null
2025-01-26 TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs Yuxuan Gu et.al. 2501.15674 null
2025-01-26 Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting Yuxin Zhang et.al. 2501.15641 null
2025-01-26 BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation Ali Khodabandeh Yalabadi et.al. 2501.15631 link
2025-01-26 Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets Eduard Barbu et.al. 2501.15624 null
2025-01-26 Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning Zeyu Gan et.al. 2501.15602 link
2025-01-26 Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals Yinzhou Wang et.al. 2501.15599 null
2025-01-26 Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images Sichen Zhu et.al. 2501.15598 link
2025-01-26 SedarEval: Automated Evaluation using Self-Adaptive Rubrics Zhiyuan Fan et.al. 2501.15595 link
2025-01-26 SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain Dakuan Lu et.al. 2501.15587 link
2025-01-26 Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework Yuhong Sun et.al. 2501.15581 null
2025-01-26 Instruction Tuning for Story Understanding and Generation with Weak Supervision Yangshu Yuan et.al. 2501.15574 null
2025-01-26 Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models Spencer Ramsey et.al. 2501.15571 null
2025-01-26 ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer Lin Yueyu et.al. 2501.15570 link
2025-01-26 Ocean-OCR: Towards General OCR Application via a Vision-Language Model Song Chen et.al. 2501.15558 null
2025-01-26 Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles Hanwen Zhang et.al. 2501.15544 null
2025-01-26 Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths Yueyang Wang et.al. 2501.15522 null
2025-01-26 Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification Dan Song et.al. 2501.15503 null
2025-01-26 Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning Xiaohan Yu et.al. 2501.15470 null
2025-01-26 Data-adaptive Safety Rules for Training Reward Models Xiaomin Li et.al. 2501.15453 null
2025-01-26 OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas Xiaoyang Wang et.al. 2501.15427 null
2025-01-26 Visual Generation Without Guidance Huayu Chen et.al. 2501.15420 link
2025-01-26 AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement Junan Zhang et.al. 2501.15417 null
2025-01-26 The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation Raha Aghaei et.al. 2501.15411 null
2025-01-26 Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency Irin Kabakum et.al. 2501.15405 null
2025-01-26 How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning Tohida Rehman et.al. 2501.15398 null
2025-01-26 Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations Zijun Long et.al. 2501.15379 null
2025-01-26 How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback Manzong Huang et.al. 2501.15378 null
2025-01-26 Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models Melkamu Abay Mersha et.al. 2501.15374 null
2025-01-26 Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis Robinson Umeike et.al. 2501.15370 null
2025-01-26 Decentralized Low-Rank Fine-Tuning of Large Language Models Sajjad Ghiasvand et.al. 2501.15361 null
2025-01-26 Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection Bo Yang et.al. 2501.15355 null
2025-01-25 Fairness in LLM-Generated Surveys Andrés Abeliuk et.al. 2501.15351 null
2025-01-25 Between Puppet and Actor: Reframing Authorship in this Age of AI Agents Yuqian Sun et.al. 2501.15346 null
2025-01-25 Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data Jiajie Li et.al. 2501.15326 null
2025-01-25 ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning Shangqian Gao et.al. 2501.15316 null
2025-01-25 The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? Ayo Adedeji et.al. 2501.15310 null
2025-01-25 You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning Ayan Sengupta et.al. 2501.15296 null
2025-01-24 HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Xin Zhou et.al. 2501.14729 link
2025-01-24 Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? Ipek Baris Schlicht et.al. 2501.14719 null
2025-01-24 Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models Naihao Deng et.al. 2501.14717 null
2025-01-24 FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing James Seale Smith et.al. 2501.14713 null
2025-01-24 The Karp Dataset Mason DiCicco et.al. 2501.14705 null
2025-01-24 Rethinking Table Instruction Tuning Naihao Deng et.al. 2501.14693 null
2025-01-24 Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST Fuping Wu et.al. 2501.14685 null
2025-01-24 An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations Shabnam Hassani et.al. 2501.14683 null
2025-01-24 Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning Jisi Zhang et.al. 2501.14680 null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 link
2025-01-24 Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion Ziyao Xu et.al. 2501.14649 link
2025-01-24 Towards Scalable Topological Regularizers Hiu-Tung Wong et.al. 2501.14641 null
2025-01-24 Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics Renato Ghisellini et.al. 2501.14634 null
2025-01-24 Extracting Problem Structure with LLMs for Optimized SAT Local Search André Schilder et.al. 2501.14630 null
2025-01-24 Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data Jordi Abante et.al. 2501.14615 null
2025-01-24 ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Tianming Liang et.al. 2501.14607 null
2025-01-24 Leveraging ChatGPT's Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research Hamid Sarmadi et.al. 2501.14546 null
2025-01-24 VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning Benjamin Callewaert et.al. 2501.14540 null
2025-01-24 Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models Zhenguang Zhong et.al. 2501.14530 link
2025-01-24 Scene Understanding Enabled Semantic Communication with Open Channel Coding Zhe Xiang et.al. 2501.14520 null
2025-01-24 Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel Zhuoran Liu et.al. 2501.14512 null
2025-01-24 Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course Pavlin G. Poličar et.al. 2501.14499 null
2025-01-24 Evaluating and Improving Graph to Text Generation with Large Language Models Jie He et.al. 2501.14497 link
2025-01-24 RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques Zhengyang Tang et.al. 2501.14492 link
2025-01-24 Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design Taehan Kim et.al. 2501.14469 null
2025-01-24 Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis Xiujing Guo et.al. 2501.14465 null
2025-01-24 Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing Zeping Yu et.al. 2501.14457 null
2025-01-24 Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains Xu Chu et.al. 2501.14431 null
2025-01-24 GraphBC: Improving LLMs for Better Graph Data Processing Xu Chu et.al. 2501.14427 null
2025-01-24 CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios Michael Fuest et.al. 2501.14426 null
2025-01-24 DeepFlow: Serverless Large Language Model Serving at Scale Junhao Hu et.al. 2501.14417 null
2025-01-24 SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation Shengjie Wang et.al. 2501.14400 null
2025-01-24 ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer Yoni Schirris et.al. 2501.14379 link
2025-01-24 DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing Xinyu Ma et.al. 2501.14371 link
2025-01-24 Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches Ziad Sakr et.al. 2501.14366 null
2025-01-24 FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration Kai-Tuo Xu et.al. 2501.14350 link
2025-01-24 Chain-of-Retrieval Augmented Generation Liang Wang et.al. 2501.14342 null
2025-01-24 Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Clément Desroches et.al. 2501.14334 null
2025-01-24 Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models Ridhi Jain et.al. 2501.14326 null
2025-01-24 PAID: A Framework of Product-Centric Advertising Image Design Hongyu Chen et.al. 2501.14316 null
2025-01-24 Locality-aware Fair Scheduling in LLM Serving Shiyi Cao et.al. 2501.14312 null
2025-01-24 A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education Calvin Yeung et.al. 2501.14305 link
2025-01-24 MASTER: A Multi-Agent System with LLM Specialized MCTS Bingzheng Gan et.al. 2501.14304 null
2025-01-24 Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph Xujian Liang et.al. 2501.14300 link
2025-01-24 Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment Julian A. Schnabel et.al. 2501.14296 null
2025-01-24 Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes Sullam Jeoung et.al. 2501.14294 link
2025-01-24 Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches Feng Zhou et.al. 2501.14291 null
2025-01-24 Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation Sadegh Mahdavi et.al. 2501.14275 link
2025-01-24 Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors Yi Zhao et.al. 2501.14250 link
2025-01-24 Humanity's Last Exam Long Phan et.al. 2501.14249 null
2025-01-24 Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game Rong Ye et.al. 2501.14225 null
2025-01-24 Top Ten Challenges Towards Agentic Neural Graph Databases Jiaxin Bai et.al. 2501.14224 null
2025-01-24 TFG-Flow: Training-free Guidance in Multimodal Generative Flow Haowei Lin et.al. 2501.14216 null
2025-01-24 Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading Minrui Xu et.al. 2501.14205 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-24 Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models Saaduddin Mahmud et.al. 2501.14189 null
2025-01-24 GeoSim.AI: AI assistants for numerical simulations in geomechanics Yared W. Bekele et.al. 2501.14186 null
2025-01-24 AI Chatbots as Professional Service Agents: Developing a Professional Identity Wenwen Li et.al. 2501.14179 null
2025-01-24 Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models Yile Gu et.al. 2501.14170 null
2025-01-24 Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction Dongming Sheng et.al. 2501.14144 null
2025-01-23 Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation Derek Yotheringhay et.al. 2501.14119 null
2025-01-23 Domain-Factored Untrained Deep Prior for Spectrum Cartography Subash Timilsina et.al. 2501.14116 null
2025-01-23 MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning Joshua Davis et.al. 2501.14105 link
2025-01-23 StreamingRAG: Real-time Contextual Retrieval and Generation Framework Murugan Sankaradas et.al. 2501.14101 null
2025-01-23 Enhancing Biomedical Relation Extraction with Directionality Po-Ting Lai et.al. 2501.14079 link
2025-01-23 LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language Yubin Ge et.al. 2501.14073 null
2025-01-23 Efficient 2D CT Foundation Model for Contrast Phase Classification Benjamin Hou et.al. 2501.14066 null
2025-01-23 Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models Jakob Krogh Petersen et.al. 2501.14051 link
2025-01-23 LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps Andrey Palaev et.al. 2501.14046 link
2025-01-23 Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions Jianfeng Zhu et.al. 2501.14037 null
2025-01-23 CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation Guofeng Cui et.al. 2501.13927 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 Binary Diffusion Probabilistic Model Vitaliy Kinakh et.al. 2501.13915 null
2025-01-23 Analysis of Indic Language Capabilities in LLMs Aatman Vaidya et.al. 2501.13912 null
2025-01-23 Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models Linh Tran et.al. 2501.13904 null
2025-01-23 Exploring Finetuned Audio-LLM on Heart Murmur Features Adrian Florea et.al. 2501.13884 null
2025-01-23 The machine learning platform for developers of large systems Alexey Naikov et.al. 2501.13881 null
2025-01-23 A RAG-Based Institutional Assistant Gustavo Kuratomi et.al. 2501.13880 null
2025-01-23 On the Reasoning Capacity of AI Models and How to Quantify It Santosh Kumar Radha et.al. 2501.13833 null
2025-01-23 Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing Hao Zhang et.al. 2501.13831 null
2025-01-23 Hallucinations Can Improve Large Language Models in Drug Discovery Shuzhou Yuan et.al. 2501.13824 null
2025-01-23 Large Language Model driven Policy Exploration for Recommender Systems Jie Wang et.al. 2501.13816 null
2025-01-23 Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change Mowafak Allaham et.al. 2501.13802 null
2025-01-23 Parameter-Efficient Fine-Tuning for Foundation Models Dan Zhang et.al. 2501.13787 link
2025-01-23 Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling Tanya Rodchenko et.al. 2501.13779 null
2025-01-23 Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework Yoonsang Kim et.al. 2501.13778 link
2025-01-23 Do Large Language Models Truly Understand Geometric Structures? Xiaofeng Wang et.al. 2501.13773 link
2025-01-23 Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak Erjia Xiao et.al. 2501.13772 null
2025-01-23 UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models Xin Xu et.al. 2501.13766 null
2025-01-23 EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents Yuhui Yun et.al. 2501.13746 null
2025-01-23 GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification Te Pei et.al. 2501.13743 null
2025-01-23 An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities Zezhou Yang et.al. 2501.13742 link
2025-01-23 Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks Chang Gong et.al. 2501.13731 null
2025-01-23 RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation Shi-Qi Yan et.al. 2501.13726 null
2025-01-23 Musical ethnocentrism in Large Language Models Anna Kruspe et.al. 2501.13720 null
2025-01-23 A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation Dario Serez et.al. 2501.13718 null
2025-01-23 EventVL: Understand Event Streams via Multimodal Large Language Model Pengteng Li et.al. 2501.13707 null
2025-01-23 DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale Linghao Zhang et.al. 2501.13699 null
2025-01-23 Question Answering on Patient Medical Records with Private Fine-Tuned LLMs Sara Kothari et.al. 2501.13687 null
2025-01-23 HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor Zihui Wu et.al. 2501.13677 link
2025-01-23 How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization Shezheng Song et.al. 2501.13669 null
2025-01-23 LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models Yizheng Sun et.al. 2501.13652 null
2025-01-23 Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Zhenghao Lin et.al. 2501.13629 null
2025-01-23 Text-to-SQL based on Large Language Models and Database Keyword Search Eduardo R. Nascimento et.al. 2501.13594 null
2025-01-23 Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization Lei Huang et.al. 2501.13573 null
2025-01-23 One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt Tao Liu et.al. 2501.13554 link
2025-01-23 LLMs Can Plan Only If We Tell Them Bilgehan Sel et.al. 2501.13545 null
2025-01-23 ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Jianxin Liang et.al. 2501.13536 null
2025-01-23 RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles Munachiso Nwadike et.al. 2501.13491 null
2025-01-23 Adaptive Testing for LLM-Based Applications: A Diversity-based Approach Juyeon Yoon et.al. 2501.13480 null
2025-01-23 LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation JiaXin Chen et.al. 2501.13475 null
2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link
2025-01-23 Spurious Forgetting in Continual Learning of Language Models Junhao Zheng et.al. 2501.13453 link
2025-01-23 Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models Bo Gao et.al. 2501.13428 null
2025-01-23 Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling Tomek Jaroslawski et.al. 2501.13415 null
2025-01-23 VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework He Kong et.al. 2501.13411 link
2025-01-23 Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures Minglong Dong et.al. 2501.13396 null
2025-01-23 Can Large Language Models Understand Preferences in Personalized Recommendation? Zhaoxuan Tan et.al. 2501.13391 link
2025-01-23 Do as We Do, Not as You Think: the Conformity of Large Language Models Zhiyuan Weng et.al. 2501.13381 link
2025-01-23 Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility Gabrielle Hoyer et.al. 2501.13376 null
2025-01-23 Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement Jae-Sung Bae et.al. 2501.13372 null
2025-01-23 Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification Yuzhuo Li et.al. 2501.13368 null
2025-01-23 50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications Zewei Shi et.al. 2501.13351 null
2025-01-23 MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize Haohang Xu et.al. 2501.13349 null
2025-01-23 Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation Rong Shan et.al. 2501.13344 null
2025-01-23 Multi-aspect Knowledge Distillation with Large Language Model Taegyeong Lee et.al. 2501.13341 link
2025-01-23 Generative Multi-Form Bayesian Optimization Zhendong Guo et.al. 2501.13337 null
2025-01-23 SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network Songge Zhang et.al. 2501.13318 null
2025-01-23 Representing Visualization Insights as a Dense Insight Network Jane Hoffswell et.al. 2501.13309 null
2025-01-23 OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia Xuelong Geng et.al. 2501.13306 link
2025-01-23 Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers Akshit Achara et.al. 2501.13302 link
2025-01-23 Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents Shrinidhi Kumbhar et.al. 2501.13299 null
2025-01-23 RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering Yang Bai et.al. 2501.13297 link
2025-01-23 Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols John Joon Young Chung et.al. 2501.13284 null
2025-01-22 MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis Daeun Jung et.al. 2501.13277 link
2025-01-22 RAG-Reward: Optimizing RAG with Reward Modeling and RLHF Hanning Zhang et.al. 2501.13264 null
2025-01-22 Exploring GPT's Ability as a Judge in Music Understanding Kun Fang et.al. 2501.13261 link
2025-01-22 Bypassing Array Canaries via Autonomous Function Call Resolution Nathaniel Oh et.al. 2501.13256 link
2025-01-22 S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning Yichen Wu et.al. 2501.13198 null
2025-01-22 Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century Axel Loewe et.al. 2501.13142 null
2025-01-23 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-22 Robust Representation Consistency Model via Contrastive Denoising Jiachen Lei et.al. 2501.13094 link
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null
2025-01-22 Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning Bohao Yang et.al. 2501.13042 link
2025-01-22 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Yantao Liu et.al. 2501.13007 link
2025-01-22 Neural network enhanced cross entropy benchmark for monitored circuits Yangrui Hu et.al. 2501.13005 null
2025-01-22 Large Language Model-Based Semantic Communication System for Image Transmission Soheyb Ribouh et.al. 2501.12988 null
2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null
2025-01-22 Low-dimensional adaptation of diffusion models: Convergence in total variation Jiadong Liang et.al. 2501.12982 null
2025-01-22 OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models Chongren Sun et.al. 2501.12975 link
2025-01-22 Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs Jan Corazza et.al. 2501.12972 null
2025-01-22 It's complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act Kristof Meding et.al. 2501.12962 null
2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 null
2025-01-22 GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models Pengxiang Zhao et.al. 2501.12956 null
2025-01-22 3D Object Manipulation in a Single Image using Generative Models Ruisi Zhao et.al. 2501.12935 null
2025-01-22 Correctness Assessment of Code Generated by Large Language Models Using Internal Representations Tuan-Dung Bui et.al. 2501.12934 null
2025-01-22 DynamicEarth: How Far are We from Open-Vocabulary Change Detection? Kaiyu Li et.al. 2501.12931 null
2025-01-22 A Functional Software Reference Architecture for LLM-Integrated Systems Alessio Bucaioni et.al. 2501.12904 null
2025-01-22 Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration Offa Kingsleigh et.al. 2501.12901 null
2025-01-22 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Yafu Li et.al. 2501.12895 link
2025-01-23 Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program Carlton Shepherd et.al. 2501.12883 null
2025-01-22 WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge Jingyuan Chen et.al. 2501.12877 null
2025-01-22 ACEBench: Who Wins the Match Point in Tool Learning? Chen Chen et.al. 2501.12851 null
2025-01-22 AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation Aghiles Kebaili et.al. 2501.12840 null
2025-01-22 Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home Viktor Moskvoretskii et.al. 2501.12835 null
2025-01-22 Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek John Pavlopoulos et.al. 2501.12826 link
2025-01-22 Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks Alessio Quercia et.al. 2501.12824 null
2025-01-22 Certified Guidance for Planning with Deep Generative Models Francesco Giacomarra et.al. 2501.12815 null
2025-01-22 Revisit Self-Debugging with Self-Generated Tests for Code Generation Xiancai Chen et.al. 2501.12793 null
2025-01-22 LLMs as Repositories of Factual Knowledge: Limitations and Solutions Seyed Mahed Mousavi et.al. 2501.12774 null
2025-01-22 NExtLong: Toward Effective Long-Context Training without Long Documents Chaochen Gao et.al. 2501.12766 link
2025-01-22 Online Preference Alignment for Language Models via Count-based Exploration Chenjia Bai et.al. 2501.12735 link
2025-01-22 Paradigm-Based Automatic HDL Code Generation Using LLMs Wenhao Sun et.al. 2501.12702 null
2025-01-22 Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression Kai Yoshida et.al. 2501.12698 null
2025-01-22 Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering Qian Tao et.al. 2501.12697 null
2025-01-22 SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling Shengshi Yao et.al. 2501.12696 null
2025-01-22 EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation Yifan Yu et.al. 2501.12689 null
2025-01-22 Distillation Quantification for Large Language Models Sunbowen Lee et.al. 2501.12619 link
2025-01-22 Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We? Taiming Wang et.al. 2501.12617 null
2025-01-22 Kimi k1.5: Scaling Reinforcement Learning with LLMs Kimi Team et.al. 2501.12599 null
2025-01-22 Leveraging LLMs to Create a Haptic Devices' Recommendation System Yang Liu et.al. 2501.12573 null
2025-01-22 Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review Rock Yuren Pang et.al. 2501.12557 link
2025-01-21 Human-like conceptual representations emerge from language prediction Ningyu Xu et.al. 2501.12547 null
2025-01-21 How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models? Mirali Purohit et.al. 2501.12535 null
2025-01-21 An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts Dhia Elhaq Rzig et.al. 2501.12521 null
2025-01-21 A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data Minh Tran et.al. 2501.12501 null
2025-01-21 The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws Tian Jin et.al. 2501.12486 null
2025-01-21 An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models Xiaoyu Chu et.al. 2501.12469 link
2025-01-21 Adaptive PII Mitigation Framework for Large Language Models Shubhi Asthana et.al. 2501.12465 null
2025-01-21 Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management Arthur Vitui et.al. 2501.12461 link
2025-01-21 Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications Shubhi Asthana et.al. 2501.12456 null
2025-01-21 Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation Dongsheng Zhu et.al. 2501.12432 null
2025-01-21 FREYR: A Framework for Recognizing and Executing Your Requests Roberto Gallotta et.al. 2501.12423 link
2025-01-21 CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning Eunjee Choi et.al. 2501.12422 null
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks Greg Olmschenk et.al. 2501.12383 null
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 link
2025-01-21 Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 null
2025-01-21 Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration Thomas Walshe et.al. 2501.12332 null
2025-01-21 Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops Mohamed Harmanani et.al. 2501.12331 link
2025-01-21 VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Xianwei Zhuang et.al. 2501.12327 link
2025-01-21 LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations Hasan Abu-Rasheed et.al. 2501.12300 null
2025-01-21 MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks Qishen Zhou et.al. 2501.12281 link
2025-01-21 Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Maosong Cao et.al. 2501.12273 link
2025-01-21 FOCUS: First Order Concentrated Updating Scheme Yizhou Liu et.al. 2501.12243 null
2025-01-21 InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models Pha Nguyen et.al. 2501.12231 null
2025-01-21 CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning Yuanheng Fang et.al. 2501.12226 null
2025-01-21 Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces Allard Oelen et.al. 2501.12221 null
2025-01-21 You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense Wuyuao Mai et.al. 2501.12210 null
2025-01-21 Explainability for Vision Foundation Models: A Survey Rémi Kazmierczak et.al. 2501.12203 null
2025-01-22 Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Zibo Zhao et.al. 2501.12202 link
2025-01-21 BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks Zhuang Li et.al. 2501.12174 null
2025-01-21 Contextualizing Recommendation Explanations with LLMs: A User Study Yuanjun Feng et.al. 2501.12152 null
2025-01-21 Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities Qirun Dai et.al. 2501.12147 null
2025-01-21 Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot Daniele Bifolco et.al. 2501.12134 null
2025-01-21 Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions Tim Schreiter et.al. 2501.12128 null
2025-01-21 Can open source large language models be used for tumor documentation in Germany? -- An evaluation on urological doctors' notes Stefan Lenz et.al. 2501.12106 link
2025-01-21 Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis Weile Luo et.al. 2501.12084 null
2025-01-21 Phishing Awareness via Game-Based Learning Argianto Rahartomo et.al. 2501.12077 link
2025-01-21 PINNsAgent: Automated PDE Surrogation with Large Language Models Qingpo Wuwu et.al. 2501.12053 null
2025-01-21 Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation Chen Griner et.al. 2501.12033 null
2025-01-21 Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis Hongjun Liu et.al. 2501.12023 null
2025-01-21 Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection? Samantha Min Er Yew et.al. 2501.12016 null
2025-01-21 Rate-Aware Learned Speech Compression Jun Xu et.al. 2501.11999 null
2025-01-21 Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models Rupesh Raj Karn et.al. 2501.11979 null
2025-01-21 Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues Maya Medjad et.al. 2501.11977 link
2025-01-21 Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization Jie Zhao et.al. 2501.11968 null
2025-01-21 A Hybrid Attention Framework for Fake News Detection with Large Language Models Xiaochuan Xu et.al. 2501.11967 null
2025-01-21 TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection Yang Cao et.al. 2501.11960 null
2025-01-21 Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model Minghan Wang et.al. 2501.11953 null
2025-01-21 ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation Peter Devine et.al. 2501.11929 link
2025-01-21 Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model He Chang et.al. 2501.11911 null
2025-01-21 Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation Junhong Lian et.al. 2501.11900 link
2025-01-22 Med-R $^2$ : Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine Keer Lu et.al. 2501.11885 null
2025-01-21 From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning Yafu Li et.al. 2501.11877 link
2025-01-21 LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems Venkata Sai Aswath Duvvuru et.al. 2501.11864 null
2025-01-21 EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Zhili Cheng et.al. 2501.11858 link
2025-01-21 Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance Nikos Kanakaris et.al. 2501.11849 link
2025-01-21 A Survey on Memory-Efficient Large-Scale Model Training in AI for Science Kaiyuan Tian et.al. 2501.11847 null
2025-01-21 Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction Noah L. Schroeder et.al. 2501.11840 null
2025-01-21 PXGen: A Post-hoc Explainable Method for Generative Models Yen-Lung Huang et.al. 2501.11827 null
2025-01-21 CogMorph: Cognitive Morphing Attacks for Text-to-Image Models Zonglei Jing et.al. 2501.11815 null
2025-01-20 Benchmarking Large Language Models via Random Variables Zijin Hong et.al. 2501.11790 null
2025-01-20 Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection Ali Naseh et.al. 2501.11786 null
2025-01-20 Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference Pouya Hamadanian et.al. 2501.11779 link
2025-01-20 The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers Alina Starovolsky-Shitrit et.al. 2501.11770 null
2025-01-20 Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems Fatemeh Nazary et.al. 2501.11759 link
2025-01-20 A generalizable 3D framework and model for self-supervised learning in medical imaging Tony Xu et.al. 2501.11755 null
2025-01-20 Are generative models fair? A study of racial bias in dermatological image generation Miguel López-Pérez et.al. 2501.11752 null
2025-01-20 Optimizing Pretraining Data Mixtures with LLM-Estimated Utility William Held et.al. 2501.11747 null
2025-01-20 MedicoSAM: Towards foundation models for medical image segmentation Anwai Archit et.al. 2501.11734 link
2025-01-20 Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks Zhenhailong Wang et.al. 2501.11733 null
2025-01-20 Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy Saeid Asgari Taghanaki et.al. 2501.11721 link
2025-01-20 YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives Nong Ming et.al. 2501.11712 link
2025-01-20 Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution Ramtin Ehsani et.al. 2501.11709 null
2025-01-20 Trustformer: A Trusted Federated Transformer Ali Abbasi Tadi et.al. 2501.11706 null
2025-01-20 Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) Brian E. Perron et.al. 2501.11705 null
2025-01-20 Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling Zhenyu Hou et.al. 2501.11651 link
2025-01-20 Trojan Detection Through Pattern Recognition for Large Language Models Vedant Bhasin et.al. 2501.11621 null
2025-01-20 Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems Giorgio Robino et.al. 2501.11613 null
2025-01-20 SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks Wentao Wan et.al. 2501.11599 link
2025-01-20 Recurrent Diffusion for Large-Scale Parameter Generation Kai Wang et.al. 2501.11587 link
2025-01-20 Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models Mahyar Habibi et.al. 2501.11581 null
2025-01-20 Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution Zhiyuan You et.al. 2501.11561 null
2025-01-20 PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation Jinyu Wang et.al. 2501.11551 link
2025-01-20 UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion Zixuan Chen et.al. 2501.11515 null
2025-01-20 Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges Vincent Koc et.al. 2501.11496 null
2025-01-20 Graph-defined Language Learning with LLMs Huachi Zhou et.al. 2501.11478 null
2025-01-20 Curiosity-Driven Reinforcement Learning from Human Feedback Haoran Sun et.al. 2501.11463 link
2025-01-20 Ontology Matching with Large Language Models and Prioritized Depth-First Search Maria Taboada et.al. 2501.11441 null
2025-01-20 One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor Zhikun Wu et.al. 2501.11433 null
2025-01-20 A Survey on Diffusion Models for Anomaly Detection Jing Liu et.al. 2501.11430 link
2025-01-20 Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Siyu Yuan et.al. 2501.11425 link
2025-01-20 Neural Contextual Reinforcement Framework for Logical Structure Language Generation Marcus Irvin et.al. 2501.11417 null
2025-01-20 Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing Kevin Sim et.al. 2501.11411 null
2025-01-20 Revisiting Language Models in Neural News Recommender Systems Yuyue Zhao et.al. 2501.11391 link
2025-01-20 Towards Advancing Code Generation with Large Language Models: A Research Roadmap Haolin Jin et.al. 2501.11354 null
2025-01-20 EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery Guankun Wang et.al. 2501.11347 link
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 Few-shot Policy (de)composition in Conversational Question Answering Kyle Erwin et.al. 2501.11335 null
2025-01-20 Nested Annealed Training Scheme for Generative Adversarial Networks Chang Wan et.al. 2501.11318 null
2025-01-20 Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning Zhongtian Hu et.al. 2501.11292 null
2025-01-20 Large Language Model Agents for Radio Map Generation and Wireless Network Planning Hongye Quan et.al. 2501.11283 null
2025-01-20 Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries Yi-Hui Lee et.al. 2501.11273 null
2025-01-20 Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios Zhongtian Hu et.al. 2501.11269 null
2025-01-20 Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian Wannita Takerngsaksiri et.al. 2501.11264 link
2025-01-20 Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models Zhuangzhuang Yan et.al. 2501.11247 null
2025-01-20 Irony in Emojis: A Comparative Study of Human and LLM Interpretation Yawen Zheng et.al. 2501.11241 null
2025-01-20 KPL: Training-Free Medical Knowledge Mining of Vision-Language Models Jiaxiang Liu et.al. 2501.11231 link
2025-01-20 Reasoning Language Models: A Blueprint Maciej Besta et.al. 2501.11223 link
2025-01-20 Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation Ivan Lopez et.al. 2501.11199 null
2025-01-19 Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests Kristin Blesch et.al. 2501.11178 link
2025-01-17 FaceXBench: Evaluating Multimodal LLMs on Face Understanding Kartik Narayan et.al. 2501.10360 link
2025-01-17 Zero-Shot Monocular Scene Flow Estimation in the Wild Yiqing Liang et.al. 2501.10357 null
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 null
2025-01-17 Large language models for automated scholarly paper review: A survey Zhenzhen Zhuang et.al. 2501.10326 null
2025-01-17 HiMix: Reducing Computational Complexity in Large Vision-Language Models Xuange Zhang et.al. 2501.10318 null
2025-01-17 Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs Claudio Di Sipio et.al. 2501.10313 null
2025-01-17 Computational Protein Science in the Era of Large Language Models (LLMs) Wenqi Fan et.al. 2501.10282 null
2025-01-17 Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation Azat Abdullin et.al. 2501.10200 null
2025-01-17 Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education William Hersh et.al. 2501.10186 null
2025-01-17 Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval Vera Pavlova et.al. 2501.10175 null
2025-01-17 Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis Abhishek Kaushik et.al. 2501.10134 null
2025-01-17 ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Lucen Zhong et.al. 2501.10132 link
2025-01-17 PaSa: An LLM Agent for Comprehensive Academic Paper Search Yichen He et.al. 2501.10120 link
2025-01-17 AI-Generated Music Detection and its Challenges Darius Afchar et.al. 2501.10111 link
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-17 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng et.al. 2501.10105 link
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-17 FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization Zhaopeng Gu et.al. 2501.10067 link
2025-01-17 Accelerating Large Language Models through Partially Linear Feed-Forward Network Gansen Hu et.al. 2501.10054 null
2025-01-17 AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search Wenfeng Feng et.al. 2501.10053 null
2025-01-17 Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists Alyssia Chen et.al. 2501.10037 null
2025-01-17 Mapping scientific communities at scale Victor Barbier et.al. 2501.10035 link
2025-01-17 Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions Zhijie Tan et.al. 2501.10011 null
2025-01-17 Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models Qiang Liu et.al. 2501.09997 null
2025-01-17 Agent-as-Judge for Factual Summarization of Long Narratives Yeonseok Jeong et.al. 2501.09993 link
2025-01-17 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Yuefan Cao et.al. 2501.09982 null
2025-01-17 GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions Heda Zuo et.al. 2501.09972 null
2025-01-17 Explainable artificial intelligence (XAI): from inherent explainability to large language models Fuseini Mumuni et.al. 2501.09967 null
2025-01-17 A Survey on Multi-Turn Interaction Capabilities of Large Language Models Chen Zhang et.al. 2501.09959 null
2025-01-17 FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs Zengyi Gao et.al. 2501.09957 null
2025-01-17 AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations Jamin Seo et.al. 2501.09954 link
2025-01-17 Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt Qingcheng Zeng et.al. 2501.09950 null
2025-01-17 MultiPruner: Balanced Structure Removal in Foundation Models J. Pablo Muñoz et.al. 2501.09949 link
2025-01-17 Steering Large Language Models with Feature Guided Activation Additions Samuel Soo et.al. 2501.09929 null
2025-01-17 Towards A Litmus Test for Common Sense Hugo Latapie et.al. 2501.09913 null
2025-01-17 Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project's Talent Knowledge Graph Jiawei Xu et.al. 2501.09909 null
2025-01-17 Position: Open and Closed Large Language Models in Healthcare Jiawei Xu et.al. 2501.09906 null
2025-01-17 FoundationStereo: Zero-Shot Stereo Matching Bowen Wen et.al. 2501.09898 null
2025-01-17 Evolving Deeper LLM Thinking Kuang-Huei Lee et.al. 2501.09891 null
2025-01-17 Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment Mohammad Sadegh Sheikhaei et.al. 2501.09888 link
2025-01-17 FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis Zhe Chen et.al. 2501.09887 null
2025-01-16 ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction Izzeddin Teeti et.al. 2501.09878 null
2025-01-16 Geometry-Preserving Encoder/Decoder in Latent Generative Models Wonjun Lee et.al. 2501.09876 null
2025-01-16 An LLM-Guided Tutoring System for Social Skills Training Michael Guevarra et.al. 2501.09870 null
2025-01-16 Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing Wenhan Wang et.al. 2501.09866 null
2025-01-16 Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message Jiawei Li et.al. 2501.09861 null
2025-01-16 PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery Shristi Das Biswas et.al. 2501.09826 link
2025-01-16 Bridging Language Barriers in Healthcare: A Study on Arabic LLMs Nada Saadi et.al. 2501.09825 null
2025-01-16 BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling Daniele Castellana et.al. 2501.09821 link
2025-01-16 Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems Soham Roy et.al. 2501.09801 null
2025-01-16 Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API Andrey Labunets et.al. 2501.09798 null
2025-01-16 GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation Weiliang Tang et.al. 2501.09783 null
2025-01-16 SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation Wanqi Yin et.al. 2501.09782 link
2025-01-16 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Zhongwei Ren et.al. 2501.09781 null
2025-01-16 Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Tairan Fu et.al. 2501.09775 null
2025-01-16 Distilling Multi-modal Large Language Models for Autonomous Driving Deepti Hegde et.al. 2501.09757 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755 null
2025-01-16 Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues Youngjoon Jang et.al. 2501.09754 null
2025-01-16 OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Zekun Xi et.al. 2501.09751 null
2025-01-16 Enhancing Lexicon-Based Text Embeddings with Large Language Models Yibin Lei et.al. 2501.09749 null
2025-01-16 Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models Bihui Jin et.al. 2501.09745 null
2025-01-16 KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports Hajung Kim et.al. 2501.09744 null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text Jihed Ncib et.al. 2501.09719 null
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 link
2025-01-16 Domain Adaptation of Foundation LLMs for e-Commerce Christian Herold et.al. 2501.09706 null
2025-01-16 Cueless EEG imagined speech for subject identification: dataset and benchmarks Ali Derakhshesh et.al. 2501.09700 link
2025-01-16 Simulated Interactive Debugging Yannic Noller et.al. 2501.09694 null
2025-01-17 Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities Fengli Xu et.al. 2501.09686 null
2025-01-16 Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review Masatoshi Uehara et.al. 2501.09685 null
2025-01-16 Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark Alexis Roger et.al. 2501.09672 null
2025-01-16 A Survey of Research in Large Language Models for Electronic Design Automation Jingyu Pan et.al. 2501.09655 null
2025-01-16 The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Jonathan Katzy et.al. 2501.09653 null
2025-01-16 CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding Johannes Kirmayr et.al. 2501.09645 link
2025-01-17 LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading Kuan-Ming Liu et.al. 2501.09636 null
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631 null
2025-01-16 Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment Chaoqi Wang et.al. 2501.09620 link
2025-01-16 From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs Hrithik Majumdar Shibu et.al. 2501.09604 link
2025-01-16 Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures Pratyush Dhingra et.al. 2501.09588 null
2025-01-16 Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis Tingxuan Chen et.al. 2501.09555 null
2025-01-16 AI in Support of Diversity and Inclusion Çiçek Güven et.al. 2501.09534 null
2025-01-16 Confidence Estimation for Error Detection in Text-to-SQL Systems Oleg Somov et.al. 2501.09527 null
2025-01-16 Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data Omar Mena et.al. 2501.09521 null
2025-01-16 AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Junjie He et.al. 2501.09503 null
2025-01-16 Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis Qize Yang et.al. 2501.09502 null
2025-01-16 Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework Nuo Chen et.al. 2501.09493 null
2025-01-16 Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators Zhaocheng Liu et.al. 2501.09484 link
2025-01-16 Guided Debugging of Auto-Translated Code Using Differential Testing Shengnan Wu et.al. 2501.09475 null
2025-01-16 DEFOM-Stereo: Depth Foundation Model Based Stereo Matching Hualie Jiang et.al. 2501.09466 link
2025-01-16 Pruning for Sparse Diffusion Models based on Gradient Flow Ben Wan et.al. 2501.09464 null
2025-01-16 "A Great Start, But...": Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design Tianhao He et.al. 2501.09457 null
2025-01-16 Solving the unsolvable: Translating case law in Hong Kong King-kui Sin et.al. 2501.09444 null
2025-01-16 Scaling up self-supervised learning for improved surgical foundation models Tim J. M. Jaspers et.al. 2501.09436 link
2025-01-16 CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation Hwan Heo et.al. 2501.09433 link
2025-01-16 A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy Huandong Wang et.al. 2501.09431 null
2025-01-16 AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring Xinyi Wang et.al. 2501.09428 null
2025-01-16 AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling Ancheng Xu et.al. 2501.09426 null
2025-01-16 FASP: Fast and Accurate Structured Pruning of Large Language Models Hanyu Hu et.al. 2501.09412 null
2025-01-16 MoE $^2$ : Optimizing Collaborative Inference for Edge Large Language Models Lyudong Jin et.al. 2501.09410 null
2025-01-16 Adaptive Contextual Caching for Mobile Edge Large Language Model Service Guangyuan Liu et.al. 2501.09383 null
2025-01-16 Aligning Instruction Tuning with Pre-training Yiming Liang et.al. 2501.09368 null
2025-01-16 PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks Huiyou Zhan et.al. 2501.09367 null
2025-01-16 YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks Saptarashmi Bandyopadhyay et.al. 2501.09355 null
2025-01-16 UVRM: A Scalable 3D Reconstruction Model from Unposed Videos Shiu-hong Kao et.al. 2501.09347 null
2025-01-16 Rational Tuning of LLM Cascades via Probabilistic Modeling Michael J. Zellinger et.al. 2501.09345 null
2025-01-16 SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs Anbang Ye et.al. 2501.09316 null
2025-01-16 A Study of In-Context-Learning-Based Text-to-SQL Errors Jiawei Shen et.al. 2501.09310 link
2025-01-16 To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation Kaustubh D. Dhole et.al. 2501.09292 null
2025-01-16 LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport Kyeongha Rho et.al. 2501.09291 link
2025-01-16 Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding Kohei Torimi et.al. 2501.09278 null
2025-01-16 Large Language Model is Secretly a Protein Sequence Optimizer Yinkai Wang et.al. 2501.09274 null
2025-01-16 Perspective Transition of Large Language Models for Solving Subjective Tasks Xiaolong Wang et.al. 2501.09265 null
2025-01-16 Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition Takaaki Hori et.al. 2501.09258 null
2025-01-16 Clone-Robust AI Alignment Ariel D. Procaccia et.al. 2501.09254 null
2025-01-16 Split Fine-Tuning for Large Language Models in Wireless Networks Songge Zhang et.al. 2501.09237 null
2025-01-16 Foundations of Large Language Models Tong Xiao et.al. 2501.09223 null
2025-01-16 Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs Sanchit Sinha et.al. 2501.09221 null
2025-01-16 A Simple Graph Contrastive Learning Framework for Short Text Classification Yonghao Liu et.al. 2501.09219 link
2025-01-16 Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics Yuanyuan Wei et.al. 2501.09218 null
2025-01-16 Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning Yonghao Liu et.al. 2501.09214 link
2025-01-16 FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training Hongzhou Yu et.al. 2501.09213 link
2025-01-15 Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures Pengru Deng et.al. 2501.09203 null
2025-01-15 Towards Semantics Lifting for Scientific Computing: A Case Study on FFT Naifeng Zhang et.al. 2501.09201 null
2025-01-15 Guiding Retrieval using LLM-based Listwise Rankers Mandeep Rathee et.al. 2501.09186 link
2025-01-15 The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching Yevhen Kostiuk et.al. 2501.09164 null
2025-01-15 Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability Stephanie L. Day et.al. 2501.09158 null
2025-01-15 Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History Yevhen Kostiuk et.al. 2501.09154 null
2025-01-15 Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation Xingxin He et.al. 2501.09138 null
2025-01-15 Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG Aditi Singh et.al. 2501.09136 link
2025-01-15 HAFix: History-Augmented Large Language Models for Bug Fixing Yu Shi et.al. 2501.09135 link
2025-01-15 Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing Eshaan Tanwar et.al. 2501.09127 link
2025-01-15 Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment Conrad Borchers et.al. 2501.09126 null
2025-01-15 Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach Alireza Ghaffari et.al. 2501.09107 null
2025-01-15 Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites Hans W. A. Hanley et.al. 2501.09102 link
2025-01-15 Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative Yuqian Sun et.al. 2501.09099 null
2025-01-15 SteLLA: A Structured Grading System Using LLMs with RAG Hefei Qiu et.al. 2501.09092 null
2025-01-15 Generative diffusion model with inverse renormalization group flows Kanta Masuki et.al. 2501.09064 link
2025-01-15 Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition Sneheel Sarangi et.al. 2501.09056 link
2025-01-15 How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias Tosin Fadahunsi et.al. 2501.09014 link
2025-01-15 Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians Ishan Amin et.al. 2501.09009 link
2025-01-15 Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails Shaona Ghosh et.al. 2501.09004 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 null
2025-01-15 CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks Krit Tangsongcharoen et.al. 2501.08998 link
2025-01-15 VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science Youssef Abdalla et.al. 2501.08995 link
2025-01-15 CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities Haozhe Xie et.al. 2501.08983 link
2025-01-15 Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models Emma Croxford et.al. 2501.08977 null
2025-01-15 Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models Karukriti Kaushik Ghosh et.al. 2501.08974 null
2025-01-15 Analyzing the Ethical Logic of Six Large Language Models W. Russell Neuman et.al. 2501.08951 null
2025-01-15 Applying General Turn-taking Models to Conversational Human-Robot Interaction Gabriel Skantze et.al. 2501.08946 null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 null
2025-01-15 GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge Liam Dugan et.al. 2501.08913 link
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 link
2025-01-15 Connecting SPDE to SGMs Junsu Seo et.al. 2501.08877 null
2025-01-15 Exploring Task-Level Optimal Prompts for Visual In-Context Learning Yan Zhu et.al. 2501.08841 null
2025-01-15 How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering Christoph Treude et.al. 2501.08774 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-15 Enhanced Large Language Models for Effective Screening of Depression and Anxiety June M. Liu et.al. 2501.08769 null
2025-01-15 Few-Shot Learner Generalizes Across AI-Generated Image Detection Shiyu Wu et.al. 2501.08763 null
2025-01-15 Leveraging LLM Agents for Translating Network Configurations Yunze Wei et.al. 2501.08760 null
2025-01-15 The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities Irina Bigoulaeva et.al. 2501.08716 link
2025-01-15 Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching Chuangtao Ma et.al. 2501.08686 link
2025-01-15 RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency Siqi Li et.al. 2501.08682 null
2025-01-15 Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery Zeqin Liao et.al. 2501.08670 null
2025-01-15 MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities Savya Khosla et.al. 2501.08648 null
2025-01-15 Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations Kaiyuan Zheng et.al. 2501.08641 null
2025-01-15 SWSC: Shared Weight for Similar Channel in LLM Binrui Zeng et.al. 2501.08631 null
2025-01-15 Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models Aruna Sankaranarayanan et.al. 2501.08618 link
2025-01-15 RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Kaiqu Liang et.al. 2501.08617 null
2025-01-15 Assessing the Alignment of FOL Closeness Metrics with Human Judgement Ramya Keerthy Thatikonda et.al. 2501.08613 link
2025-01-15 Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design Zhi Zheng et.al. 2501.08603 link
2025-01-15 AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL Tyler Stennett et.al. 2501.08600 null
2025-01-15 LlamaRestTest: Effective REST API Testing with Small Language Models Myeongsoo Kim et.al. 2501.08598 null
2025-01-15 Sound Scene Synthesis at the DCASE 2024 Challenge Mathieu Lagrange et.al. 2501.08587 null
2025-01-15 LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model Yuxuan Hu et.al. 2501.08582 null
2025-01-15 Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation Jiaqi Huang et.al. 2501.08580 link
2025-01-15 Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms Kewei Li et.al. 2501.08570 link
2025-01-15 Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications Jin Chen et.al. 2501.08563 link
2025-01-15 LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation Yiran Tao et.al. 2501.08558 null
2025-01-15 The Devil is in Temporal Token: High Quality Video Reasoning Segmentation Sitong Gong et.al. 2501.08549 null
2025-01-15 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-15 Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation Jiaxin Guo et.al. 2501.08523 null
2025-01-14 Quantifying the Importance of Data Alignment in Downstream Model Performance Krrish Chawla et.al. 2501.08496 null
2025-01-14 Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition Md Meem Hossain et.al. 2501.08471 null
2025-01-14 Selective Attention Merging for low resource tasks: A case study of Child ASR Natarajan Balaji Shankar et.al. 2501.08468 link
2025-01-14 Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin Joao Carmo de Almeida Neto et.al. 2501.08464 null
2025-01-14 Large Language Models For Text Classification: Case Study And Comprehensive Review Arina Kostina et.al. 2501.08457 null
2025-01-14 Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack Sagiv Antebi et.al. 2501.08454 null
2025-01-14 Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies Ajwad Abrar et.al. 2501.08441 null
2025-01-14 SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models Anurag Kumar et.al. 2501.08421 null
2025-01-14 Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation Seyed Mehdi Rakhtala et.al. 2501.08420 null
2025-01-14 Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data Jiaxing Qiu et.al. 2501.08413 link
2025-01-14 OptiChat: Bridging Optimization Models and Practitioners with Large Language Models Hao Chen et.al. 2501.08406 link
2025-01-14 Towards Best Practices for Open Datasets for LLM Training Stefan Baack et.al. 2501.08365 null
2025-01-14 Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Ryan Burgert et.al. 2501.08331 link
2025-01-14 PokerBench: Training Large Language Models to become Professional Poker Players Richard Zhuang et.al. 2501.08328 link
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 null
2025-01-14 Exploring Robustness of Multilingual LLMs on Real-World Noisy Data Amirhossein Aliakbarzadeh et.al. 2501.08322 link
2025-01-14 Enhancing Automated Interpretability with Output-Centric Feature Descriptions Yoav Gur-Arieh et.al. 2501.08319 link
2025-01-14 MiniMax-01: Scaling Foundation Models with Lightning Attention MiniMax et.al. 2501.08313 null
2025-01-14 HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Abhilasha Ravichander et.al. 2501.08292 null
2025-01-14 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding Hongyu Li et.al. 2501.08282 link
2025-01-14 Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing Pulkit Arora et.al. 2501.08276 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 null
2025-01-14 Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Yifu Qiu et.al. 2501.08248 null
2025-01-14 Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints Jonathan Nöther et.al. 2501.08246 null
2025-01-14 CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset Jiawei Du et.al. 2501.08238 null
2025-01-14 Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings Paul Joe Maliakel et.al. 2501.08219 null
2025-01-14 ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems Mohita Chowdhury et.al. 2501.08208 null
2025-01-14 ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving Zain Ul Abedin et.al. 2501.08203 null
2025-01-14 CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Jinjun Peng et.al. 2501.08200 link
2025-01-14 OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Yijiong Yu et.al. 2501.08197 link
2025-01-14 PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving Ahmet Caner Yüzügüler et.al. 2501.08192 null
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-15 A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following Yin Fang et.al. 2501.08187 link
2025-01-14 Potential and Perils of Large Language Models as Judges of Unstructured Textual Data Rewina Bedemariam et.al. 2501.08167 null
2025-01-14 I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution Soohyeon Choi et.al. 2501.08165 null
2025-01-14 Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data Phai Vu Dinh et.al. 2501.08149 null
2025-01-14 Refusal Behavior in Large Language Models: A Nonlinear Perspective Fabian Hildebrandt et.al. 2501.08145 link
2025-01-14 Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying Jonathan Lyhs et.al. 2501.08142 null
2025-01-14 Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 Seamie Hayes et.al. 2501.08118 null
2025-01-15 Consistency of Responses and Continuations Generated by Large Language Models on Social Media Wenlu Fan et.al. 2501.08102 null
2025-01-14 Hierarchical Autoscaling for Large Language Model Serving with Chiron Archit Patke et.al. 2501.08090 null
2025-01-14 Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving Nert Keser et.al. 2501.08083 null
2025-01-14 CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning Guoliang He et.al. 2501.08071 link
2025-01-14 A Roadmap to Guide the Integration of LLMs in Hierarchical Planning Israel Puerta-Merino et.al. 2501.08068 null
2025-01-14 Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT Awritrojit Banerjee et.al. 2501.08053 null
2025-01-14 TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Yao Liang et.al. 2501.08008 null
2025-01-14 LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS Muhammad Ashfaq et.al. 2501.07992 null
2025-01-14 Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jiaxing Zhao et.al. 2501.07978 null
2025-01-14 Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models Yifang Xu et.al. 2501.07972 null
2025-01-14 Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning Jiaqi Hua et.al. 2501.07959 link
2025-01-14 AI Guide Dog: Egocentric Path Prediction on Smartphone Aishwarya Jadhav et.al. 2501.07957 null
2025-01-14 Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations Waqar Hussain et.al. 2501.07931 null
2025-01-14 Gandalf the Red: Adaptive Security for LLMs Niklas Pfister et.al. 2501.07927 link
2025-01-14 VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models Hui Kuurila-Zhang et.al. 2501.07922 link
2025-01-14 Large Language Model Interface for Home Energy Management Systems François Michelon et.al. 2501.07919 null
2025-01-14 Bridge-SR: Schrödinger Bridge for Efficient SR Chang Li et.al. 2501.07897 null
2025-01-14 Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs Shuai Wang et.al. 2501.07892 null
2025-01-14 ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding Zhongxiang Sun et.al. 2501.07861 null
2025-01-14 Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques Shobhit Ratan et.al. 2501.07853 null
2025-01-14 Unveiling Provider Bias in Large Language Models for Code Generation Xiaoyu Zhang et.al. 2501.07849 null
2025-01-14 Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning Haoyu Han et.al. 2501.07845 null
2025-01-14 A Driver Advisory System Based on Large Language Model for High-speed Train Y. C. Luo et.al. 2501.07837 null
2025-01-14 Flow: A Modular Approach to Automated Agentic Workflow Generation Boye Niu et.al. 2501.07834 null
2025-01-14 Real-time Verification and Refinement of Language Model Text Generation Joonho Ko et.al. 2501.07824 null
2025-01-14 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding Haomiao Xiong et.al. 2501.07819 link
2025-01-14 A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models Kaustubh D. Dhole et.al. 2501.07818 null
2025-01-14 Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models Dhruv Dhamani et.al. 2501.07815 null
2025-01-14 Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering Feijie Wu et.al. 2501.07813 null
2025-01-14 CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation Ruwei Pan et.al. 2501.07811 null
2025-01-14 Visual Language Models as Operator Agents in the Space Domain Alejandro Carrasco et.al. 2501.07802 null
2025-01-14 Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding Zhaokai Wang et.al. 2501.07783 link
2025-01-14 Symmetry-Aware Generative Modeling through Learned Canonicalization Kusha Sareen et.al. 2501.07773 null
2025-01-14 Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey Bingchen Liu et.al. 2501.07766 null
2025-01-14 On the Statistical Capacity of Deep Generative Models Edric Tam et.al. 2501.07763 link
2025-01-13 Advancing Student Writing Through Automated Syntax Feedback Kamyar Zeinalipour et.al. 2501.07740 null
2025-01-13 Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens Dongwon Kim et.al. 2501.07730 null
2025-01-13 LLMic: Romanian Foundation Language Model Vlad-Andrei Bădoiu et.al. 2501.07721 null
2025-01-13 CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory Haokun Zhao et.al. 2501.07674 null
2025-01-13 Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning Karishma Thakrar et.al. 2501.07663 null
2025-01-13 Large Language Models for Interpretable Mental Health Diagnosis Brian Hyeongseok Kim et.al. 2501.07653 null
2025-01-13 BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Weixi Feng et.al. 2501.07647 null
2025-01-13 GPT as a Monte Carlo Language Tree: A Probabilistic Perspective Kun-Peng Ning et.al. 2501.07641 null
2025-01-13 SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models Fabien Bernier et.al. 2501.07639 null
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-13 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Chengzu Li et.al. 2501.07542 null
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 null
2025-01-13 Investigating Large Language Models in Inferring Personality Traits from User Conversations Jianfeng Zhu et.al. 2501.07532 null
2025-01-13 RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment Difei Gu et.al. 2501.07525 link
2025-01-13 Parallel Key-Value Cache Fusion for Position Invariant RAG Philhoon Oh et.al. 2501.07523 null
2025-01-13 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards Yangsibo Huang et.al. 2501.07493 null
2025-01-13 TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models Thales Sales Almeida et.al. 2501.07482 null
2025-01-13 A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities Yihao Liu et.al. 2501.07468 null
2025-01-13 Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI Rolf Pfister et.al. 2501.07458 null
2025-01-13 Enhancing LLM's Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection Xin Yin et.al. 2501.07425 null
2025-01-13 Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion Lala Shakti Swarup Ray et.al. 2501.07408 null
2025-01-13 OCORD: Open-Campus Object Removal Dataset Shuo Zhang et.al. 2501.07397 null
2025-01-13 Simulating the Hubbard Model with Equivariant Normalizing Flows Dominic Schuh et.al. 2501.07371 null
2025-01-13 Emergent effects of scaling on the functional hierarchies within large language models Paul C. Bogdan et.al. 2501.07359 null
2025-01-13 Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring Buse Sibel Korkmaz et.al. 2501.07324 link
2025-01-13 FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering Erik Henriksson et.al. 2501.07314 link
2025-01-13 The Lessons of Developing Process Reward Models in Mathematical Reasoning Zhenru Zhang et.al. 2501.07301 null
2025-01-13 GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction Oleg Kobzarev et.al. 2501.07295 null
2025-01-13 LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks Zan-Kai Chong et.al. 2501.07288 null
2025-01-13 Lifelong Learning of Large Language Model based Agents: A Roadmap Junhao Zheng et.al. 2501.07278 link
2025-01-13 Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation Amir Sartipi et.al. 2501.07276 null
2025-01-13 Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics Wonduk Seo et.al. 2501.07267 null
2025-01-13 Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Li Liang et.al. 2501.07260 link
2025-01-13 EdgeTAM: On-Device Track Anything Model Chong Zhou et.al. 2501.07256 null
2025-01-13 Large Language Models: New Opportunities for Access to Science Jutta Schnabel et.al. 2501.07250 null
2025-01-13 Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training Ziqing Wen et.al. 2501.07237 link
2025-01-13 Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction Qiaoqiao Ren et.al. 2501.07224 link
2025-01-13 Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing Laifa Tao et.al. 2501.07191 null
2025-01-13 Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study Huashan Chen et.al. 2501.07165 null
2025-01-13 AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model Bangchen Yin et.al. 2501.07155 link
2025-01-13 LLM360 K2: Scaling Up 360-Open-Source Large Language Models Zhengzhong Liu et.al. 2501.07124 null
2025-01-13 How GPT learns layer by layer Jason Du et.al. 2501.07108 link
2025-01-13 ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training Jiayang Wu et.al. 2501.07078 link
2025-01-13 D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation Zhejun Zhang et.al. 2501.07077 link
2025-01-13 Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values Jing Yao et.al. 2501.07071 null
2025-01-13 Enhancing Image Generation Fidelity via Progressive Prompts Zhen Xiong et.al. 2501.07070 link
2025-01-13 Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities ZeKe Xiao et.al. 2501.07058 null
2025-01-13 SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation Yee-Fan Tan et.al. 2501.07055 null
2025-01-13 PoAct: Policy and Action Dual-Control Agent for Generalized Applications Guozhi Yuan et.al. 2501.07054 null
2025-01-13 ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction Yan Zhang et.al. 2501.07051 link
2025-01-13 Unveiling the Potential of Text in High-Dimensional Time Series Forecasting Xin Zhou et.al. 2501.07048 link
2025-01-13 Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis Luwei Zeng et.al. 2501.07034 null
2025-01-13 A Proposed Large Language Model-Based Smart Search for Archive System Ha Dung Nguyen et.al. 2501.07024 null
2025-01-13 Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps Henry Li et.al. 2501.06999 link
2025-01-13 LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models Mozhgan Nasr Azadani et.al. 2501.06986 link
2025-01-13 Combining LLM decision and RL action selection to improve RL policy for adaptive interventions Karine Karine et.al. 2501.06980 null
2025-01-12 How is Google using AI for internal code migrations? Stoyan Nikolov et.al. 2501.06972 null
2025-01-12 Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives Xinyao Ma et.al. 2501.06964 null
2025-01-12 Comparison of Autoencoders for tokenization of ASL datasets Vouk Praun-Petrovic et.al. 2501.06942 null
2025-01-12 Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy Evgeny Ugolkov et.al. 2501.06939 link
2025-01-12 Harnessing Large Language Models for Disaster Management: A Survey Zhenyu Lei et.al. 2501.06932 null
2025-01-12 Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories Faaiq Waqar et.al. 2501.06921 null
2025-01-12 Risk-Averse Finetuning of Large Language Models Sapana Chaudhary et.al. 2501.06911 link
2025-01-12 Deep Learning and Foundation Models for Weather Prediction: A Survey Jimeng Shi et.al. 2501.06907 null
2025-01-12 A Foundational Generative Model for Breast Ultrasound Image Analysis Haojun Yu et.al. 2501.06869 null
2025-01-12 Transfer Learning of Tabular Data by Finetuning Large Language Models Shourav B. Rabbani et.al. 2501.06863 null
2025-01-12 A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context Noureldin Zahran et.al. 2501.06859 null
2025-01-12 SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training Tianjin Huang et.al. 2501.06842 link
2025-01-12 An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering Zaber Al Hassan Ayon et.al. 2501.06837 null
2025-01-12 X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding Wenqi Zhou et.al. 2501.06835 null
2025-01-12 LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents Augusto Gonzalez-Bonorino et.al. 2501.06834 link
2025-01-12 GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing Ruizhe Ou et.al. 2501.06828 null
2025-01-12 Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification Shijing Chen et.al. 2501.06827 null
2025-01-12 Event Argument Extraction with Enriched Prompts Chen Liang et.al. 2501.06825 link
2025-01-12 A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT Yizhou Zhou et.al. 2501.06819 null
2025-01-12 RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models Keyan Chen et.al. 2501.06809 link
2025-01-12 Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting Yongshuo Zhu et.al. 2501.06808 null
2025-01-12 MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference Wenxuan Zeng et.al. 2501.06807 null
2025-01-12 Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences Liu Yu et.al. 2501.06795 null
2025-01-12 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes Mahmoud Ahmed et.al. 2501.06785 link
2025-01-12 Cost-Effective Robotic Handwriting System with AI Integration Tianyi Huang et.al. 2501.06783 null
2025-01-12 Eliza: A Web3 friendly AI Agent Operating System Shaw Walters et.al. 2501.06781 link
2025-01-12 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Ji Soo Lee et.al. 2501.06761 link
2025-01-12 Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation Shunfan Zheng et.al. 2501.06741 null
2025-01-12 ZOQO: Zero-Order Quantized Optimization Noga Bar et.al. 2501.06736 null
2025-01-12 Better Prompt Compression Without Multi-Layer Perceptrons Edouardo Honig et.al. 2501.06730 null
2025-01-12 Measuring the Robustness of Reference-Free Dialogue Evaluation Systems Justin Vasselli et.al. 2501.06728 link
2025-01-12 Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G Zhiyan Liu et.al. 2501.06726 null
2025-01-12 DRDT3: Diffusion-Refined Decision Test-Time Training Model Xingshuai Huang et.al. 2501.06718 null
2025-01-12 ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian Mykyta Syromiatnikov et.al. 2501.06715 link
2025-01-12 Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management Liu Qianli et.al. 2501.06709 null
2025-01-12 Evaluating Sample Utility for Data Selection by Mimicking Model Weights Tzu-Heng Huang et.al. 2501.06708 null
2025-01-12 AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds Yinfang Chen et.al. 2501.06706 null
2025-01-12 Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese Jie Yang et.al. 2501.06704 null
2025-01-12 Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users' Questions Aidan Hogan et.al. 2501.06699 null
2025-01-12 DVM: Towards Controllable LLM Agents in Social Deduction Games Zheng Zhang et.al. 2501.06695 null
2025-01-12 TAPO: Task-Referenced Adaptation for Prompt Optimization Wenxin Luo et.al. 2501.06689 link
2025-01-12 Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning Xiangen Hu et.al. 2501.06682 null
2025-01-12 Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving Haoxiang Gao et.al. 2501.06680 null
2025-01-11 Challenging reaction prediction models to generalize to novel chemistry John Bradshaw et.al. 2501.06669 link
2025-01-11 Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training Sanjit Kakarla et.al. 2501.06658 link
2025-01-11 FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings Tong Liu et.al. 2501.06645 null
2025-01-11 Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models Veronika Smilga et.al. 2501.06638 link
2025-01-11 Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach Mohammed Maree et.al. 2501.06628 null
2025-01-11 Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks Amr Almorsi et.al. 2501.06625 null
2025-01-11 Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks Xuanhao Luo et.al. 2501.06604 null
2025-01-11 ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation Xuanle Zhao et.al. 2501.06598 link
2025-01-11 ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning Xiangru Tang et.al. 2501.06590 link
2025-01-11 Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping Muru Zhang et.al. 2501.06589 link
2025-01-10 LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Omkar Thawakar et.al. 2501.06186 link
2025-01-10 PEACE: Empowering Geologic Map Holistic Understanding with MLLMs Yangyu Huang et.al. 2501.06184 null
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-10 GenMol: A Drug Discovery Generalist with Discrete Diffusion Seul Lee et.al. 2501.06158 null
2025-01-10 Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories Gerd Kortemeyer et.al. 2501.06143 null
2025-01-10 Supervision policies can shape long-term risk management in general-purpose AI models Manuel Cebrian et.al. 2501.06137 link
2025-01-10 Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI Yuya Asano et.al. 2501.06129 null
2025-01-10 Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding Fabian David Schmidt et.al. 2501.06117 link
2025-01-10 From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy Elham Aghakhani et.al. 2501.06101 null
2025-01-10 Photokinetics of Photothermal Reactions Mounir Maafi et.al. 2501.06057 null
2025-01-10 AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery Johann Wenckstern et.al. 2501.06039 link
2025-01-10 Addressing speaker gender bias in large scale speech translation systems Shubham Bansal et.al. 2501.05989 null
2025-01-10 Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing Eklavya Sarkar et.al. 2501.05987 link
2025-01-10 Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys Divya Mani Adhikari et.al. 2501.05985 null
2025-01-10 Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea Eunjung Cho et.al. 2501.05981 null
2025-01-10 Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory Yunmeng Shu et.al. 2501.05965 null
2025-01-10 Effective faking of verbal deception detection with target-aligned adversarial attacks Bennett Kleinberg et.al. 2501.05962 null
2025-01-10 Reusable specimen-level inference in computational pathology Jakub R. Kaczmarzyk et.al. 2501.05945 link
2025-01-10 DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information Yongfan Lai et.al. 2501.05932 link
2025-01-10 LLMs Reproduce Stereotypes of Sexual and Gender Minorities Ruby Ostrow et.al. 2501.05926 null
2025-01-10 Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction Petraq Nako et.al. 2501.05925 null
2025-01-10 Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Ziheng Wu et.al. 2501.05901 link
2025-01-10 Prompt engineering and its implications on the energy consumption of Large Language Models Riccardo Rubei et.al. 2501.05899 link
2025-01-10 Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs Bianca Raimondi et.al. 2501.05891 link
2025-01-10 Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs Dabing Cheng et.al. 2501.05884 null
2025-01-10 VideoRAG: Retrieval-Augmented Generation over Video Corpus Soyeong Jeong et.al. 2501.05874 null
2025-01-10 ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability Antonin Poché et.al. 2501.05855 link
2025-01-10 Understanding Impact of Human Feedback via Influence Functions Taywon Min et.al. 2501.05790 link
2025-01-10 Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models You Li et.al. 2501.05767 null
2025-01-10 Controlling Large Language Models Through Concept Activation Vectors Hanyu Zhang et.al. 2501.05764 null
2025-01-10 StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation Shangjin Zhai et.al. 2501.05763 null
2025-01-10 CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech Madhurananda Pahar et.al. 2501.05755 null
2025-01-10 Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models Sungjae Lee et.al. 2501.05752 null
2025-01-10 TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos Korawat Charoenpitaks et.al. 2501.05733 link
2025-01-10 Enabling Scalable Oversight via Self-Evolving Critic Zhengyang Tang et.al. 2501.05727 null
2025-01-10 I Can't Share Code, but I need Translation -- An Empirical Study on Code Translation through Federated LLM Jahnavi Kumar et.al. 2501.05724 null
2025-01-10 How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond Chen Huang et.al. 2501.05714 null
2025-01-10 Multi-Step Reasoning in Korean and the Emergent Mirage Guijin Son et.al. 2501.05712 null
2025-01-10 EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model Yi He et.al. 2501.05710 null
2025-01-10 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Vighnesh Subramaniam et.al. 2501.05707 null
2025-01-10 Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness Audrey Salmon et.al. 2501.05706 null
2025-01-10 Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection Feiyi Chen et.al. 2501.05675 null
2025-01-10 Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration Zuyuan Zhang et.al. 2501.05673 null
2025-01-10 Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models Zheqi Lv et.al. 2501.05662 null
2025-01-10 Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation Zheqi Lv et.al. 2501.05647 null
2025-01-10 Iconicity in Large Language Models Anna Marklová et.al. 2501.05643 null
2025-01-10 HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection Anant Mehta et.al. 2501.05631 link
2025-01-10 The Impact of Model Scaling on Seen and Unseen Language Performance Rhitabrat Pokharel et.al. 2501.05629 null
2025-01-09 Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study Zhenyu Qi et.al. 2501.05625 null
2025-01-09 Exploring Large Language Models for Translating Romanian Computational Problems into English Adrian Marius Dumitran et.al. 2501.05601 null
2025-01-09 Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics Gert Aarts et.al. 2501.05580 null
2025-01-09 Exploring Large Language Models (LLMs) through interactive Python activities Eugenio Tufino et.al. 2501.05577 link
2025-01-09 LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts Yuri Facanha Bezerra et.al. 2501.05554 link
2025-01-09 The dynamics of meaning through time: Assessment of Large Language Models Mohamed Taher Alrefaie et.al. 2501.05552 null
2025-01-09 Infecting Generative AI With Viruses David Noever et.al. 2501.05542 null
2025-01-09 NSChat: A Chatbot System To Rule Them All Zenon Lamprou et.al. 2501.05541 null
2025-01-09 ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Xingyu Fu et.al. 2501.05452 null
2025-01-09 Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Yifan Yu et.al. 2501.05446 link
2025-01-09 Consistent Flow Distillation for Text-to-3D Generation Runjie Yan et.al. 2501.05445 null
2025-01-09 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark Yunzhuo Hao et.al. 2501.05444 null
2025-01-09 A survey of textual cyber abuse detection using cutting-edge language models and large language models Jose A. Diaz-Garcia et.al. 2501.05443 null
2025-01-09 Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation Xuyi Meng et.al. 2501.05427 null
2025-01-09 Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers Jerry Chongyi Hu et.al. 2501.05423 null
2025-01-09 Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation Darius Petermann et.al. 2501.05413 null
2025-01-10 Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics Maximilian Alber et.al. 2501.05409 null
2025-01-09 TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts Yu-Hao Huang et.al. 2501.05403 null
2025-01-09 Mechanistic understanding and validation of large AI models with SemanticLens Maximilian Dreyer et.al. 2501.05398 null
2025-01-09 FairCode: Evaluating Social Bias of LLMs in Code Generation Yongkang Du et.al. 2501.05396 link
2025-01-09 Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models Kristian G. Barman et.al. 2501.05382 null
2025-01-09 Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance Dimitrios Gerogiannis et.al. 2501.05379 null
2025-01-09 Accelerated Diffusion Models via Speculative Sampling Valentin De Bortoli et.al. 2501.05370 null
2025-01-09 Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction Hantao Lou et.al. 2501.05336 link
2025-01-09 "What's Happening"- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles Xuewen Luo et.al. 2501.05322 null
2025-01-09 Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning Nora Gourmelon et.al. 2501.05281 link
2025-01-09 CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models Fabian Hörst et.al. 2501.05269 link
2025-01-09 Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal Wanli Ma et.al. 2501.05265 null
2025-01-09 CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models Yewei Song et.al. 2501.05255 null
2025-01-09 From Scientific Texts to Verifiable Code: Automating the Process with Transformers Changjie Wang et.al. 2501.05252 null
2025-01-09 RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models Peizhuo Lv et.al. 2501.05249 null
2025-01-09 Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning Laura Puccioni et.al. 2501.05248 null
2025-01-09 Online Prompt and Solver Selection for Program Synthesis Yixuan Li et.al. 2501.05247 null
2025-01-09 Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs Artem Fedorchenko et.al. 2501.05234 null
2025-01-09 Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection Pei-Kang Lee et.al. 2501.05228 null
2025-01-09 Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes Ludwic Leonard et.al. 2501.05226 null
2025-01-09 Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond Tomas Goldsack et.al. 2501.05224 null
2025-01-09 A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education Ziqing Li et.al. 2501.05220 null
2025-01-09 Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration Xuyang Liu et.al. 2501.05179 link
2025-01-09 Emergence of human-like polarization among large language model agents Jinghua Piao et.al. 2501.05171 null
2025-01-09 Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering Matteo Esposito et.al. 2501.05165 null
2025-01-09 Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier Yufei Shang et.al. 2501.05155 null
2025-01-09 DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving Xuran Zheng et.al. 2501.05081 null
2025-01-09 Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization Harshith Manjunath et.al. 2501.05079 null
2025-01-09 Analyzing Memorization in Large Language Models through the Lens of Model Attribution Tarun Ram Menta et.al. 2501.05078 link
2025-01-09 A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model Shuo Tong et.al. 2501.05075 null
2025-01-09 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Huabin Liu et.al. 2501.05069 null
2025-01-09 LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding Jiaxing Zhao et.al. 2501.05067 null
2025-01-09 Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators Leonard Lupin-Jimenez et.al. 2501.05058 null
2025-01-09 LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models Zengqi Peng et.al. 2501.05057 null
2025-01-09 On the Generalizability of Transformer Models to Code Completions of Different Lengths Nathan Cooper et.al. 2501.05051 null
2025-01-09 SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution Chengxing Xie et.al. 2501.05040 link
2025-01-09 Enhancing Human-Like Responses in Large Language Models Ethem Yağız Çalık et.al. 2501.05032 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-09 A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications Ofir Marom et.al. 2501.05030 null
2025-01-09 TreeKV: Smooth Key-Value Cache Compression with Tree Structures Ziwei He et.al. 2501.04987 null
2025-01-09 SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs Muhammad Salman et.al. 2501.04985 null
2025-01-09 V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer Hangzhou He et.al. 2501.04975 link
2025-01-09 Demystifying Domain-adaptive Post-training for Financial LLMs Zixuan Ke et.al. 2501.04961 link
2025-01-09 Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments Yifan Xu et.al. 2501.04947 null
2025-01-09 Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models Qingyu Ren et.al. 2501.04945 link
2025-01-09 Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency Shiji Zhao et.al. 2501.04931 null
2025-01-09 Investigating Numerical Translation with Large Language Models Wei Tang et.al. 2501.04927 null
2025-01-09 FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching Jun-Hak Yun et.al. 2501.04926 null
2025-01-09 HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers Yiyao Yang et.al. 2501.04908 link
2025-01-09 JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis Jun-Hyeok Cha et.al. 2501.04904 null
2025-01-09 ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries Keke Huang et.al. 2501.04901 null
2025-01-09 SUGAR: Leveraging Contextual Confidence for Smarter Retrieval Hanna Zubkova et.al. 2501.04899 null
2025-01-08 Leveraging Log Probabilities in Language Models to Forecast Future Events Tommaso Soru et.al. 2501.04880 null
2025-01-08 Real-Time Textless Dialogue Generation Long Mai et.al. 2501.04877 link
2025-01-08 Modelling complex proton transport phenomena -- Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields Malte Grunert et.al. 2501.04876 null
2025-01-08 Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware Brandon J Walton et.al. 2501.04848 null
2025-01-08 Do Code LLMs Understand Design Patterns? Zhenyu Pan et.al. 2501.04835 null
2025-01-08 On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability Andreas Vogelsang et.al. 2501.04810 null
2025-01-08 IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX Erik Recio-Armengol et.al. 2501.04776 link
2025-01-08 Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations Kirandeep Kaur et.al. 2501.04762 null
2025-01-08 Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch Phillip Richter et.al. 2501.04755 null
2025-01-08 EditAR: Unified Conditional Generation with Autoregressive Models Jiteng Mu et.al. 2501.04699 null
2025-01-08 Re-ranking the Context for Multimodal Retrieval Augmented Generation Matin Mortaheb et.al. 2501.04695 null
2025-01-08 SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Zixuan Huang et.al. 2501.04689 null
2025-01-08 URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Ruilin Luo et.al. 2501.04686 link
2025-01-08 Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations Archita Srivastava et.al. 2501.04675 null
2025-01-08 Assessing Language Comprehension in Large Language Models Using Construction Grammar Wesley Scivetti et.al. 2501.04661 null
2025-01-08 Multi-task retriever fine-tuning for domain-specific and efficient RAG Patrice Béchard et.al. 2501.04652 null
2025-01-08 FlairGPT: Repurposing LLMs for Interior Designs Gabrielle Littlefair et.al. 2501.04648 null
2025-01-08 Knowledge Retrieval Based on Generative AI Te-Lun Yang et.al. 2501.04635 null
2025-01-08 "Can you be my mum?": Manipulating Social Robots in the Large Language Models Era Giulio Antonio Abbo et.al. 2501.04633 null
2025-01-09 MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation Daniele Molino et.al. 2501.04614 null
2025-01-08 Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning Ivan Kankeu et.al. 2501.04591 link
2025-01-08 Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models Miaoyang He et.al. 2501.04582 null
2025-01-08 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection Yuhang Liu et.al. 2501.04575 link
2025-01-09 OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis Run Luo et.al. 2501.04561 link
2025-01-08 The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? Christopher Lazik et.al. 2501.04543 null
2025-01-08 Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time Uri Berger et.al. 2501.04513 null
2025-01-08 CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection Ruijun Feng et.al. 2501.04510 null
2025-01-08 Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction Guofeng Yang et.al. 2501.04487 null
2025-01-08 When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages Archchana Sindhujan et.al. 2501.04473 null
2025-01-08 Hidden Entity Detection from GitHub Leveraging Large Language Models Lu Gan et.al. 2501.04455 link
2025-01-08 Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions Doaa Mahmud et.al. 2501.04437 null
2025-01-08 Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions Na Yan et.al. 2501.04436 null
2025-01-08 End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach H. M. Shadman Tabib et.al. 2501.04425 null
2025-01-08 SEO: Stochastic Experience Optimization for Large Language Models Jitao Xu et.al. 2501.04393 null
2025-01-08 iFADIT: Invertible Face Anonymization via Disentangled Identity Transform Lin Yuan et.al. 2501.04390 null
2025-01-08 DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications Feng Liu et.al. 2501.04366 link
2025-01-08 Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting Dong-Hai Zhu et.al. 2501.04341 link
2025-01-09 Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models Haonan Shi et.al. 2501.04323 null
2025-01-08 Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts Preethi Seshadri et.al. 2501.04316 link
2025-01-08 RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation Jun Liu et.al. 2501.04315 null
2025-01-08 Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models Kunpeng Zhang et.al. 2501.04312 null
2025-01-08 LLM4SR: A Survey on Large Language Models for Scientific Research Ziming Luo et.al. 2501.04306 link
2025-01-08 Multimodal Graph Constrastive Learning and Prompt for ChartQA Yue Dai et.al. 2501.04303 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 An Analysis of Model Robustness across Concurrent Distribution Shifts Myeongho Jeon et.al. 2501.04288 null
2025-01-08 Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models Bahman Torkamandi et.al. 2501.04286 null
2025-01-08 Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking Tianqi Ren et.al. 2501.04285 null
2025-01-08 OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments Yujie Tang et.al. 2501.04279 null
2025-01-08 Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering Christophe Bajan et.al. 2501.04277 link
2025-01-08 Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation Senwei Xie et.al. 2501.04268 null
2025-01-08 Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning Lang Xu et.al. 2501.04266 null
2025-01-08 IOLBENCH: Benchmarking LLMs on Linguistic Reasoning Satyam Goyal et.al. 2501.04249 link
2025-01-08 TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events Jian-Hua Fang et.al. 2501.04247 null
2025-01-08 Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks Rachel Longjohn et.al. 2501.04234 null
2025-01-07 Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation Alireza Salemi et.al. 2501.04167 null
2025-01-07 AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots Shaoyue Wen et.al. 2501.04156 link
2025-01-07 Multilingual Open QA on the MIA Shared Task Navya Yarrabelly et.al. 2501.04153 null
2025-01-07 The angular momentum spiral of the Milky Way disc in Gaia Rashid Yaaqib et.al. 2501.04095 null
2025-01-07 More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives Xiaoqing Zhang et.al. 2501.04070 link
2025-01-07 ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono Jingquan Wang et.al. 2501.04062 null
2025-01-07 LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving Lingdong Kong et.al. 2501.04005 null
2025-01-07 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos Haobo Yuan et.al. 2501.04001 link
2025-01-07 RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance Matin Mortaheb et.al. 2501.03995 null
2025-01-07 Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance Adil Rengim Cetingoz et.al. 2501.03993 null
2025-01-07 Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles Yuxi Xia et.al. 2501.03991 null
2025-01-07 (De)-Indexing and the Right to be Forgotten Salvatore Vilella et.al. 2501.03989 null
2025-01-07 VLM-driven Behavior Tree for Context-aware Task Planning Naoki Wake et.al. 2501.03968 link
2025-01-07 Vision Language Models as Values Detectors Giulio Antonio Abbo et.al. 2501.03957 null
2025-01-07 Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States Jurgita Kapočiūtė-Dzikienė et.al. 2501.03952 null
2025-01-07 Synthetic Data Privacy Metrics Amy Steier et.al. 2501.03941 null
2025-01-07 Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection Pablo Miralles-González et.al. 2501.03940 null
2025-01-07 A precise asymptotic analysis of learning diffusion models: theory and insights Hugo Cui et.al. 2501.03937 link
2025-01-07 Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study Ramya Jonnala et.al. 2501.03904 null
2025-01-07 LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Shaolei Zhang et.al. 2501.03895 link
2025-01-07 AlphaPO -- Reward shape matters for LLM alignment Aman Gupta et.al. 2501.03884 null
2025-01-07 CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds Keonwoo Kim et.al. 2501.03879 null
2025-01-07 Progressive Document-level Text Simplification via Large Language Models Dengzhao Fang et.al. 2501.03857 null
2025-01-07 MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention Aadya Arora et.al. 2501.03839 null
2025-01-07 Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging Simon W. Penninga et.al. 2501.03825 null
2025-01-08 MADation: Face Morphing Attack Detection with Foundation Models Eduarda Caldeira et.al. 2501.03800 link
2025-01-07 KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration Chengyuan Li et.al. 2501.03786 null
2025-01-07 Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series Yuxiao Hu et.al. 2501.03747 null
2025-01-07 Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein Xiaotong Guo et.al. 2501.03722 null
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-07 SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment Yuchun Fan et.al. 2501.03681 link
2025-01-07 Effective and Efficient Mixed Precision Quantization of Speech Foundation Models Haoning Xu et.al. 2501.03643 null
2025-01-07 CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems Zhaonan Wu et.al. 2501.03626 link
2025-01-07 LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment Gaoussou Youssouf Kebe et.al. 2501.03624 null
2025-01-07 Cosmos World Foundation Model Platform for Physical AI NVIDIA et.al. 2501.03575 link
2025-01-07 From Code to Compliance: Assessing ChatGPT's Utility in Designing an Accessible Webpage -- A Case Study Ammar Ahmed et.al. 2501.03572 null
2025-01-07 What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs Muneera Bano et.al. 2501.03569 null
2025-01-07 Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities Benedikt Reitemeyer et.al. 2501.03566 null
2025-01-07 Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis Haoran Lai et.al. 2501.03565 null
2025-01-07 PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models Lingzhi Yuan et.al. 2501.03544 null
2025-01-07 Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions Weijieying Ren et.al. 2501.03540 null
2025-01-07 Deep Learning for Pathological Speech: A Survey Shakeel A. Sheikh et.al. 2501.03536 null
2025-01-08 SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving Xuewen Luo et.al. 2501.03535 null
2025-01-07 A generative approach for lensless imaging in low-light conditions Ziyang Liu et.al. 2501.03511 null
2025-01-07 A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models Shuyang Wang et.al. 2501.03508 null
2025-01-07 Textualize Visual Prompt for Image Editing via Diffusion Bridge Pengcheng Xu et.al. 2501.03495 null
2025-01-07 Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment Prashant Trivedi et.al. 2501.03486 null
2025-01-07 Reading with Intent -- Neutralizing Intent Benjamin Reichman et.al. 2501.03475 null
2025-01-07 Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning Chuang Niu et.al. 2501.03469 link
2025-01-07 MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems Yannis Katsis et.al. 2501.03468 link
2025-01-07 ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation Yu-Cheng Liu et.al. 2501.03462 null
2025-01-07 Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation Xiao Wang et.al. 2501.03458 link
2025-01-07 CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering Jialiang Chen et.al. 2501.03447 null
2025-01-07 LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models Mohamad Fakih et.al. 2501.03446 null
2025-01-07 Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology Sarah E. Finch et.al. 2501.03441 link
2025-01-06 SALT: Sales Autocompletion Linked Business Tables Dataset Tassilo Klein et.al. 2501.03413 link
2025-01-06 BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations Simone Giovannini et.al. 2501.03403 null
2025-01-06 DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes Xuyang Wang et.al. 2501.03397 link
2025-01-06 Evolved Quantum Boltzmann Machines Michele Minervini et.al. 2501.03367 null
2025-01-06 CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets Tanay Agrawal et.al. 2501.03332 null
2025-01-06 LiLMaps: Learnable Implicit Language Maps Evgenii Kruzhkov et.al. 2501.03304 null
2025-01-06 A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval Shuo Tong et.al. 2501.03295 null
2025-01-06 Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model Naibo Wang et.al. 2501.03292 null
2025-01-06 ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning Pengwei Tang et.al. 2501.03291 null
2025-01-06 CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models Zhenyu Xu et.al. 2501.03288 null
2025-01-06 BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning Beichen Zhang et.al. 2501.03226 link
2025-01-06 Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text Ayat Najjar et.al. 2501.03212 null
2025-01-06 Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity Ayat A. Najjar et.al. 2501.03203 null
2025-01-06 CLIX: Cross-Lingual Explanations of Idiomatic Expressions Aaron Gluck et.al. 2501.03191 null
2025-01-06 Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text Ali Al-Lawati et.al. 2501.03166 link
2025-01-06 Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy Risha Goel et.al. 2501.03153 link
2025-01-06 Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches Alhassan Mumuni et.al. 2501.03151 null
2025-01-06 VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity Yerong Li et.al. 2501.03139 null
2025-01-07 PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Mingyang Song et.al. 2501.03124 link
2025-01-06 CAT: Content-Adaptive Image Tokenization Junhong Shen et.al. 2501.03120 null
2025-01-06 LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases Dylan Bouchard et.al. 2501.03112 link
2025-01-06 Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling Aseem Srivastava et.al. 2501.03088 null
2025-01-06 Retrieval-Augmented TLAPS Proof Generation with Large Language Models Yuhao Zhou et.al. 2501.03073 null
2025-01-06 ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events Duygu Sezen Islakoglu et.al. 2501.03040 null
2025-01-06 Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning Zhen Li et.al. 2501.03035 null
2025-01-06 TransPixar: Advancing Text-to-Video Generation with Transparency Luozhou Wang et.al. 2501.03006 link
2025-01-06 CALM: Curiosity-Driven Auditing for Large Language Models Xiang Zheng et.al. 2501.02997 link
2025-01-06 Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation Zhi Qu et.al. 2501.02979 link
2025-01-06 FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models Zhuo Chen et.al. 2501.02968 null
2025-01-07 Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild Wanpeng Hu et.al. 2501.02964 link
2025-01-07 SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild Jiawei Liu et.al. 2501.02962 null
2025-01-06 The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features Shi Bin Hoo et.al. 2501.02945 link
2025-01-07 Inhibition of bacterial growth by antibiotics Barnabe Ledoux et.al. 2501.02944 null
2025-01-06 Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions Jianhua Pei et.al. 2501.02928 null
2025-01-06 DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model Hao Yu et.al. 2501.02901 link
2025-01-06 FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection Guray Ozgur et.al. 2501.02892 link
2025-01-06 MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs Hui Sun et.al. 2501.02885 null
2025-01-06 IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment Yiming Zhang et.al. 2501.02869 null
2025-01-06 Large Language Models for Video Surveillance Applications Ulindu De Silva et.al. 2501.02850 null
2025-01-06 Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification Yubo Wang et.al. 2501.02844 null
2025-01-06 Foundations of GenIR Qingyao Ai et.al. 2501.02842 null
2025-01-06 An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models Dezhi Ran et.al. 2501.02829 null
2025-01-06 InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion Zhaoyi Yan et.al. 2501.02795 null
2025-01-06 CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation Yuanhong Chen et.al. 2501.02786 null
2025-01-06 GeAR: Generation Augmented Retrieval Haoyu Liu et.al. 2501.02772 null
2025-01-06 Visual Large Language Models for Generalized and Specialized Applications Yifan Li et.al. 2501.02765 link
2025-01-06 Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? Hongyi Miao et.al. 2501.02751 null
2025-01-06 Artificial Intelligence in Creative Industries: Advances Prior to 2025 Nantheera Anantrasirichai et.al. 2501.02725 null
2025-01-06 KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models Zaiyi Zheng et.al. 2501.02711 null
2025-01-06 QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance Binita Saha et.al. 2501.02702 null
2025-01-06 EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models Andrés Villa et.al. 2501.02699 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-05 Decoding specialised feature neurons in LLMs with the final projection layer Harry J Davies et.al. 2501.02688 null
2025-01-05 From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering Wen-ran Li et.al. 2501.02680 null
2025-01-05 A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model Shivaram Kalyanakrishnan et.al. 2501.02652 null
2025-01-05 Representation Learning of Lab Values via Masked AutoEncoder David Restrepo et.al. 2501.02648 link
2025-01-05 Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense Yang Ouyang et.al. 2501.02629 link
2025-01-05 Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets Mahmoud Jahanshahi et.al. 2501.02628 null
2025-01-05 HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning Saleh Ashkboos et.al. 2501.02625 null
2025-01-05 LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment Yifei Liu et.al. 2501.02621 null
2025-01-05 TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms Jovan Stojkovic et.al. 2501.02600 null
2025-01-05 LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations Jiaping Wang et.al. 2501.02573 link
2025-01-05 Multi-LLM Collaborative Caption Generation in Scientific Documents Jaeyoung Kim et.al. 2501.02552 link
2025-01-05 Transformers Simulate MLE for Sequence Generation in Bayesian Networks Yuan Cao et.al. 2501.02547 null
2025-01-05 Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm Ljubisa Bojic et.al. 2501.02532 null
2025-01-05 Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI Ljubisa Bojic et.al. 2501.02531 null
2025-01-05 Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks Leo Franklin et.al. 2501.02527 null
2025-01-05 Unified Guidance for Geometry-Conditioned Molecular Generation Sirine Ayadi et.al. 2501.02526 null
2025-01-05 Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors Minglin Chen et.al. 2501.02519 null
2025-01-05 CHAIR-Classifier of Hallucination as Improver Ao Sun et.al. 2501.02518 link
2025-01-05 ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use Junjie Ye et.al. 2501.02506 null
2025-01-05 Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons Sebastian Morel-Balbi et.al. 2501.02505 null
2025-01-05 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Chaojie Mao et.al. 2501.02487 null
2025-01-05 LLMPC: Large Language Model Predictive Control Gabriel Maher et.al. 2501.02486 link
2025-01-05 Decoding News Bias: Multi Bias Detection in News Articles Bhushan Santosh Shah et.al. 2501.02482 null
2025-01-05 Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine Yishen Liu et.al. 2501.02471 null
2025-01-05 Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera Yuliang Guo et.al. 2501.02464 null
2025-01-05 Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications Zhe Chen et.al. 2501.02460 null
2025-01-05 Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap Hyunwoo Ko et.al. 2501.02448 null
2025-01-05 RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework Kun Wang et.al. 2501.02446 null
2025-01-05 A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models Yinpeng Cai et.al. 2501.02441 null
2025-01-05 Efficient Deployment of Large Language Models on Resource-constrained Devices Zhiwei Yao et.al. 2501.02438 null
2025-01-05 FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance Haicheng Wang et.al. 2501.02430 link
2025-01-05 GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems Mehmet Deniz Türkmen et.al. 2501.02408 null
2025-01-04 Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities Tara Radvand et.al. 2501.02406 null
2025-01-04 Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers Markus J. Buehler et.al. 2501.02393 link
2025-01-04 Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations Kangyu Zhu et.al. 2501.02385 null
2025-01-04 Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison Tsz Kin Lam et.al. 2501.02370 null
2025-01-04 Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving Sanghyun Park et.al. 2501.02348 null
2025-01-04 Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support Florian Putz et.al. 2501.02346 null
2025-01-04 UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility Yonglin Tian et.al. 2501.02341 link
2025-01-04 AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference Zhuomin He et.al. 2501.02336 link
2025-01-04 Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications Jodi M. Casabianca et.al. 2501.02334 null
2025-01-04 Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance Marta Gentiloni-Silveri et.al. 2501.02298 null
2025-01-04 Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection Yachao Zhao et.al. 2501.02295 null
2025-01-04 Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control Yongjeong Oh et.al. 2501.02273 null
2025-01-04 What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph Yutao Jiang et.al. 2501.02268 link
2025-01-04 Unsupervised Class Generation to Expand Semantic Segmentation Datasets Javier Montalvo et.al. 2501.02264 null
2025-01-04 Financial Named Entity Recognition: How Far Can LLM Go? Yi-Te Lu et.al. 2501.02237 link
2025-01-04 Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends Camille Barboule et.al. 2501.02235 null
2025-01-04 Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection S M Mostaq Hossain et.al. 2501.02229 null
2025-01-04 Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation Shijie Wang et.al. 2501.02226 null
2025-01-04 Can ChatGPT implement finite element models for geotechnical engineering applications? Taegu Kim et.al. 2501.02199 null
2025-01-04 EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks Shixuan Liu et.al. 2501.02192 null
2025-01-04 On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing Jianwei Wang et.al. 2501.02191 link
2025-01-04 The Application of Large Language Models in Recommendation Systems Peiyang Yu et.al. 2501.02178 null
2025-01-04 The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit Huixue Zhou et.al. 2501.02173 null
2025-01-04 Personalized Graph-Based Retrieval for Large Language Models Steven Au et.al. 2501.02157 link
2025-01-04 Table as Thought: Exploring Structured Thoughts in LLM Reasoning Zhenjie Sun et.al. 2501.02152 null
2025-01-04 Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN Yanxi Chen et.al. 2501.02146 null
2025-01-03 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Chaoyou Fu et.al. 2501.01957 link
2025-01-03 Metadata Conditioning Accelerates Language Model Pre-training Tianyu Gao et.al. 2501.01956 link
2025-01-03 MADGEN -- Mass-Spec attends to De Novo Molecular generation Yinkai Wang et.al. 2501.01950 null
2025-01-03 Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap Weizhi Zhang et.al. 2501.01945 link
2025-01-03 Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models Manh Duong Nguyen et.al. 2501.01932 link
2025-01-03 Virgo: A Preliminary Exploration on Reproducing o1-like MLLM Yifan Du et.al. 2501.01904 link
2025-01-03 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation Siyuan Huang et.al. 2501.01895 null
2025-01-03 Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions Rachneet Sachdeva et.al. 2501.01872 link
2025-01-03 Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification Xiangxiang Dai et.al. 2501.01849 link
2025-01-03 MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning Pu Yang et.al. 2501.01834 null
2025-01-03 Time Series Language Model for Descriptive Caption Generation Mohamed Trabelsi et.al. 2501.01832 null
2025-01-03 Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models Yanjiang Liu et.al. 2501.01830 null
2025-01-03 SDPO: Segment-Level Direct Preference Optimization for Social Agents Aobo Kong et.al. 2501.01821 link
2025-01-03 BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction Ferhat Ozgur Catak et.al. 2501.01802 link
2025-01-03 Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation Mohammad Khalil et.al. 2501.01793 link
2025-01-03 Efficient LLM Inference with Activation Checkpointing and Hybrid Caching Sanghyeon Lee et.al. 2501.01792 null
2025-01-03 Nonparametric estimation of a factorizable density using diffusion models Hyeok Kyu Kwon et.al. 2501.01783 null
2025-01-03 SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation Mingjie Li et.al. 2501.01765 null
2025-01-03 Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models Andrea Matteazzi et.al. 2501.01761 null
2025-01-03 MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling Simon Rouard et.al. 2501.01757 null
2025-01-03 Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation Kangcheng Luo et.al. 2501.01743 null
2025-01-03 How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models Simone Corbo et.al. 2501.01741 null
2025-01-03 AR4D: Autoregressive 4D Generation from Monocular Videos Hanxin Zhu et.al. 2501.01722 null
2025-01-03 Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models Guosheng Zhang et.al. 2501.01720 null
2025-01-03 LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries Michal Kuk et.al. 2501.01711 null
2025-01-03 MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders Jiajun Cao et.al. 2501.01709 null
2025-01-03 AgentRefine: Enhancing Agent Generalization through Refinement Tuning Dayuan Fu et.al. 2501.01702 null
2025-01-03 Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models Lei Tang et.al. 2501.01679 null
2025-01-03 Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption Zhang Ruoyan et.al. 2501.01672 null
2025-01-03 BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction Alaeddine Diaf et.al. 2501.01664 null
2025-01-03 Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning Danni Peng et.al. 2501.01653 null
2025-01-03 MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments Cai Yin et.al. 2501.01652 link
2025-01-03 HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding Heqing Zou et.al. 2501.01645 null
2025-01-03 iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings Shuhei Tomoshige et.al. 2501.01642 null
2025-01-03 Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation Rini Smita Thakur et.al. 2501.01640 null
2025-01-03 A non-ergodic framework for understanding emergent capabilities in Large Language Models Javier Marin et.al. 2501.01638 null
2025-01-03 Revisiting Data Analysis with Pre-trained Foundation Models Chen Liang et.al. 2501.01631 null
2025-01-03 ICPC: In-context Prompt Compression with Faster Inference Ziyang Yu et.al. 2501.01625 null
2025-01-03 PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents Jingoo Lee et.al. 2501.01594 null
2025-01-03 (WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges Mohamed Hisham Abdellatif et.al. 2501.01588 null
2025-01-02 Predicting the Performance of Black-box LLMs through Self-Queries Dylan Sam et.al. 2501.01558 link
2025-01-02 Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs) Jumana Almahmoud et.al. 2501.01545 null
2025-01-02 Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information Rasul Tutnov et.al. 2501.01544 null
2025-01-02 Denoising Diffused Embeddings: a Generative Approach for Hypergraphs Shihao Wu et.al. 2501.01541 null
2025-01-02 BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery Kanishk Gandhi et.al. 2501.01540 link
2025-01-02 SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers Bhavna Gopal et.al. 2501.01529 null
2025-01-02 Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search Shuangtao Li et.al. 2501.01478 null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 link
2025-01-02 Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Jingfeng Yao et.al. 2501.01423 link
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers Seunghyun Lee et.al. 2501.01414 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-02 OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios Xize Cheng et.al. 2501.01384 null
2025-01-02 ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI Neda Tavakoli et.al. 2501.01372 link
2025-01-02 Aligning Large Language Models for Faithful Integrity Against Opposing Argument Yong Zhao et.al. 2501.01336 link
2025-01-02 CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models Johan Wahréus et.al. 2501.01335 link
2025-01-02 Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension Yanbo Fang et.al. 2501.01332 null
2025-01-02 The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation Shuzheng Gao et.al. 2501.01329 null
2025-01-03 Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking Xiaoxue Cheng et.al. 2501.01306 null
2025-01-02 Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments -- The Depression and Anxiety Case Kaushik Roy et.al. 2501.01305 null
2025-01-02 Does a Large Language Model Really Speak in Human-Like Language? Mose Park et.al. 2501.01273 null
2025-01-02 ProgCo: Program Helps Self-Correction of Large Language Models Xiaoshuai Song et.al. 2501.01264 null
2025-01-03 CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Shanghaoran Quan et.al. 2501.01257 null
2025-01-02 Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers? Manuel Weber et.al. 2501.01256 null
2025-01-02 Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion Qiyuan He et.al. 2501.01246 null
2025-01-02 SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization Yongle Huang et.al. 2501.01245 link
2025-01-02 Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants Lixiong Qin et.al. 2501.01243 null
2025-01-02 Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction Alexander Brinkmann et.al. 2501.01237 link
2025-01-03 TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer Jiayu Li et.al. 2501.01216 null
2025-01-02 Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects Abdullah Mushtaq et.al. 2501.01205 null
2025-01-02 HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation Runsong Jia et.al. 2501.01203 null
2025-01-02 LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge Kyoungkook Kang et.al. 2501.01197 null
2025-01-02 Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education Annika Bush et.al. 2501.01192 null
2025-01-02 Towards Interactive Deepfake Analysis Lixiong Qin et.al. 2501.01164 link
2025-01-02 TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions Vriksha Srihari et.al. 2501.01156 null
2025-01-02 A3: Android Agent Arena for Mobile GUI Agents Yuxiang Chai et.al. 2501.01149 null
2025-01-03 BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference Wonsuk Jang et.al. 2501.01144 link
2025-01-02 Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method Ruichen Zhang et.al. 2501.01141 null
2025-01-02 Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning Shuo Yu et.al. 2501.01124 null
2025-01-02 MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification Jimin Park et.al. 2501.01110 null
2025-01-03 MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization Haina Zhu et.al. 2501.01108 link
2025-01-02 Graph Generative Pre-trained Transformer Xiaohui Chen et.al. 2501.01073 null
2025-01-02 Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models Yanwen Huang et.al. 2501.01059 null
2025-01-02 Risks of Cultural Erasure in Large Language Models Rida Qadri et.al. 2501.01056 null
2025-01-02 Dynamic Scaling of Unit Tests for Code Reward Modeling Zeyao Ma et.al. 2501.01054 null
2025-01-02 Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs Linhao Huang et.al. 2501.01042 null
2025-01-02 Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models Bin Wang et.al. 2501.01034 link
2025-01-02 ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning Wonduk Seo et.al. 2501.01031 null
2025-01-03 KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model Xinshuo Hu et.al. 2501.01028 link
2025-01-02 MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model Chengze Zhang et.al. 2501.01014 null
2025-01-02 FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving Zihao Ye et.al. 2501.01005 link
2025-01-02 Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory Zhou Yang et.al. 2501.00999 null
2025-01-02 Optimizing Noise Schedules of Generative Models in High Dimensionss Santiago Aranguri et.al. 2501.00988 null
2025-01-02 Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice Federico Ravenda et.al. 2501.00982 link
2025-01-01 IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs Junfeng Jiao et.al. 2501.00959 null
2025-01-01 Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors Junfeng Jiao et.al. 2501.00957 null
2025-01-01 Incremental Dialogue Management: Survey, Discussion, and Implications for HRI Casey Kennington et.al. 2501.00953 null
2025-01-01 SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering Shihab Ahmed et.al. 2501.00940 null
2025-01-01 Diffusion Policies for Generative Modeling of Spacecraft Trajectories Julia Briden et.al. 2501.00915 null
2025-01-01 Aligning LLMs with Domain Invariant Reward Models David Wu et.al. 2501.00911 link
2025-01-01 Population Aware Diffusion for Time Series Generation Yang Li et.al. 2501.00910 link
2025-01-01 Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things Talha Zeeshan et.al. 2501.00906 null
2025-01-01 Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model Chenyang Liu et.al. 2501.00895 null
2025-01-01 Evaluating Time Series Foundation Models on Noisy Periodic Time Series Syamantak Datta Gupta et.al. 2501.00889 null
2025-01-01 Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization Weiqi Wu et.al. 2501.00888 link
2025-01-01 Representation in large language models Cameron C. Yetman et.al. 2501.00885 null
2025-01-01 Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents Fouad Bousetouane et.al. 2501.00881 null
2025-01-01 Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction Teng Hu et.al. 2501.00880 null
2025-01-01 TrustRAG: Enhancing Robustness and Trustworthiness in RAG Huichi Zhou et.al. 2501.00879 link
2025-01-01 LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models Hieu Man et.al. 2501.00874 link
2025-01-01 Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation Mingjia Li et.al. 2501.00873 link
2025-01-01 Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation Shoutao Guo et.al. 2501.00868 link
2025-01-01 Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era Mihnea C. Moldoveanu et.al. 2501.00867 null
2025-01-01 Alzheimer's disease detection based on large language model prompt engineering Tian Zheng et.al. 2501.00861 null
2025-01-01 LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions Adam Ishay et.al. 2501.00830 null
2025-01-01 An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems Haoxiang Tian et.al. 2501.00829 null
2025-01-01 LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management Yichen Luo et.al. 2501.00826 null
2025-01-01 Multimodal Large Models Are Effective Action Anticipators Binglu Wang et.al. 2501.00795 link
2025-01-01 Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models Minhao Bai et.al. 2501.00786 null
2025-01-01 NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model Yuzhi Lai et.al. 2501.00785 null
2025-01-01 REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization Huyen Nguyen et.al. 2501.00779 null
2025-01-01 FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation Qianli Wang et.al. 2501.00777 null
2025-01-01 Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis Jie Gao et.al. 2501.00775 null
2025-01-01 An AI-powered Bayesian generative modeling approach for causal inference in observational studies Qiao Liu et.al. 2501.00755 null
2025-01-01 Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform Cheonsu Jeong et.al. 2501.00750 null
2025-01-01 DIVE: Diversified Iterative Self-Improvement Yiwei Qin et.al. 2501.00747 link
2025-01-01 Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines Xiyang Hu et.al. 2501.00745 null
2025-01-01 A Distributional Evaluation of Generative Image Models Edric Tam et.al. 2501.00744 null
2025-01-01 New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory Madiha Ajmal et.al. 2501.00721 null
2025-01-01 Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection Hao Wang et.al. 2501.00700 null
2025-01-01 Adjoint sharding for very long context training of state space models Xingzi Xu et.al. 2501.00692 null
2025-01-01 Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro Md Rakibul Hasan et.al. 2501.00691 null
2025-01-01 IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently Florian Dietz et.al. 2501.00684 null
2024-12-31 Grade Inflation in Generative Models Phuc Nguyen et.al. 2501.00664 null
2024-12-31 Finding Missed Code Size Optimizations in Compilers using LLMs Davide Italiano et.al. 2501.00655 null
2024-12-31 Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models Suttisak Wizadwongsa et.al. 2501.00651 null
2024-12-31 Efficient Standardization of Clinical Notes using Large Language Models Daniel B. Hier et.al. 2501.00644 null
2024-12-31 Enabling New HDLs with Agents Mark Zakharov et.al. 2501.00642 null
2024-12-31 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2024-12-31 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2024-12-31 Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation M. Ali Bayram et.al. 2501.00593 null
2024-12-31 Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Zhenpeng Huang et.al. 2501.00584 null
2024-12-31 Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders Yipeng Kang et.al. 2501.00581 null
2024-12-31 AI and Quantum Computing in Binary Photocatalytic Hydrogen Production Dennis Delali Kwesi Wayo et.al. 2501.00575 null
2024-12-31 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Xinhao Li et.al. 2501.00574 link
2024-12-31 Probing Visual Language Priors in VLMs Tiange Luo et.al. 2501.00569 null
2024-12-31 Robust and Adaptive Optimization under a Large Language Model Lens Dimitris Bertsimas et.al. 2501.00568 null
2024-12-30 Distributed Mixture-of-Agents for Edge Inference with Large Language Models Purbesh Mitra et.al. 2412.21200 link
2024-12-31 HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Zhaojian Yu et.al. 2412.21199 link
2024-12-30 The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick Jonathan Berkheim et.al. 2412.21186 null
2024-12-30 Facilitating large language model Russian adaptation with Learned Embedding Propagation Mikhail Tikhomirov et.al. 2412.21140 link
2024-12-30 ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation Ruixuan Liu et.al. 2412.21123 null
2025-01-02 Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation Yuanbo Yang et.al. 2412.21117 null
2024-12-30 Varformer: Adapting VAR's Generative Prior for Image Restoration Siyang Wang et.al. 2412.21063 link
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense Yuyang Zhou et.al. 2412.21051 link
2024-12-30 E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models Zhiyu Tan et.al. 2412.21044 null
2024-12-30 Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration Wanglong Lu et.al. 2412.21042 link
2024-12-30 TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization Chia-Yu Hung et.al. 2412.21037 link
2024-12-30 GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models Shangyu Xing et.al. 2412.21036 null
2024-12-30 MapQaTor: A System for Efficient Annotation of Map Query Datasets Mahir Labib Dihan et.al. 2412.21015 link
2024-12-31 Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria Joonwon Jang et.al. 2412.21006 null
2024-12-30 Plug-and-Play Training Framework for Preference Optimization Jingyuan Ma et.al. 2412.20996 null
2024-12-30 KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation Siyuan Fang et.al. 2412.20995 null
2024-12-30 Efficiently Serving LLM Reasoning Programs with Certaindex Yichao Fu et.al. 2412.20993 null
2024-12-30 QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing Shlomo Kashani et.al. 2412.20956 null
2024-12-30 AGON: Automated Design Framework for Customizing Processors from ISA Documents Chongxiao Li et.al. 2412.20954 null
2024-12-30 Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema Xiaohan Feng et.al. 2412.20942 null
2024-12-30 Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering Junxiao Xue et.al. 2412.20927 null
2024-12-30 ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation Ting Zhang et.al. 2412.20901 null
2024-12-30 Towards Compatible Fine-tuning for Vision-Language Model Updates Zhengbo Wang et.al. 2412.20895 null
2024-12-30 DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models Xiaolin Hu et.al. 2412.20891 null
2024-12-30 Enhancing Annotated Bibliography Generation with LLM Ensembles Sergio Bermejo et.al. 2412.20864 null
2024-12-30 Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs' Memory Xingjian Tao et.al. 2412.20846 null
2024-12-30 Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment Jianfei Zhang et.al. 2412.20834 link
2024-12-30 Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model Runtao Ren et.al. 2412.20820 null
2024-12-30 TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting Huanyu Zhang et.al. 2412.20810 null
2024-12-30 Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves Chayan Chatterjee et.al. 2412.20789 null
2024-12-31 SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity Pengfei Jing et.al. 2412.20787 null
2024-12-30 Large Language Model Enabled Multi-Task Physical Layer Network Tianyue Zheng et.al. 2412.20772 null
2024-12-30 Attributing Culture-Conditioned Generations to Pretraining Corpora Huihan Li et.al. 2412.20760 link
2024-12-30 M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs Bei Yan et.al. 2412.20718 link
2024-12-30 HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images Sungik Choi et.al. 2412.20704 null
2024-12-30 UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design Zijie Chen et.al. 2412.20694 null
2024-12-30 Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks Yuhe Ding et.al. 2412.20682 null
2024-12-30 Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA Qingyun Jin et.al. 2412.20677 null
2024-12-30 Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner Yitong Zhou et.al. 2412.20662 link
2024-12-30 Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis Yousef Yeganeh et.al. 2412.20651 null
2024-12-30 SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy Md Mahadi Hasan Nahid et.al. 2412.20641 null
2024-12-30 Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble Yongchang Li et.al. 2412.20637 null
2024-12-30 EVOLVE: Emotion and Visual Output Learning via LLM Evaluation Jordan Sinclair et.al. 2412.20632 null
2024-12-29 Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study Yulin Fei et.al. 2412.20613 link
2024-12-29 NLP-based Regulatory Compliance -- Using GPT 4.0 to Decode Regulatory Documents Bimal Kumar et.al. 2412.20602 null
2024-12-29 MATEY: multiscale adaptive foundation models for spatiotemporal physical systems Pei Zhang et.al. 2412.20601 null
2024-12-29 Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection Dmitri Roussinov et.al. 2412.20595 link
2024-12-29 Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches Madhavendra Thakur et.al. 2412.20584 null
2024-12-29 Counterfactual Samples Constructing and Training for Commonsense Statements Estimation Chong Liu et.al. 2412.20563 null
2024-12-29 Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces Linglingzhi Zhu et.al. 2412.20556 null
2024-12-29 The Impact of Prompt Programming on Function-Level Code Generation Ranim Khojah et.al. 2412.20545 link
2024-12-29 Goal-Conditioned Data Augmentation for Offline Reinforcement Learning Xingshuai Huang et.al. 2412.20519 null
2024-12-29 Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning Hang Ni et.al. 2412.20505 null
2024-12-29 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link
2024-12-29 TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication Zongwu Wang et.al. 2412.20501 link
2024-12-29 Multimodal Variational Autoencoder: a Barycentric View Peijie Qiu et.al. 2412.20487 null
2024-12-29 JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling Haorui Ji et.al. 2412.20470 null
2024-12-29 Improving Vision-Language-Action Models via Chain-of-Affordance Jinming Li et.al. 2412.20451 null
2024-12-29 Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs Pratik Rakesh Singh et.al. 2412.20440 null
2024-12-29 Image Augmentation Agent for Weakly Supervised Semantic Segmentation Wangyu Wu et.al. 2412.20439 null
2024-12-29 Unlocking adaptive digital pathology through dynamic feature learning Jiawen Li et.al. 2412.20430 null
2024-12-29 AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models Mansi et.al. 2412.20427 null
2024-12-29 Bringing Objects to Life: 4D generation from 3D objects Ohad Rahamim et.al. 2412.20422 null
2024-12-29 Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection Kalin Kopanov et.al. 2412.20414 null
2024-12-29 Multi-Objective Large Language Model Unlearning Zibin Pan et.al. 2412.20412 link
2024-12-29 Open-Sora: Democratizing Efficient Video Production for All Zangwei Zheng et.al. 2412.20404 link
2024-12-29 Natural Language Fine-Tuning Jia Liu et.al. 2412.20382 link
2024-12-29 Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs) Jia Wei Sii et.al. 2412.20381 null
2024-12-29 FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation Yan Luo et.al. 2412.20374 link
2024-12-29 LLM2: Let Large Language Models Harness System 2 Reasoning Cheng Yang et.al. 2412.20372 link
2025-01-02 Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey Junqiao Wang et.al. 2412.20367 null
2024-12-29 HindiLLM: Large Language Model for Hindi Sanjay Chouhan et.al. 2412.20357 null
2024-12-29 Distilling Desired Comments for Enhanced Code Review with Large Language Models Yongda Yu et.al. 2412.20340 null
2024-12-29 Mind the Data Gap: Bridging LLMs to Enterprise Data Integration Moe Kayali et.al. 2412.20331 null
2024-12-29 GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions Tianyao Shi et.al. 2412.20322 null
2024-12-29 Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain Shintaro Ozaki et.al. 2412.20309 null
2024-12-28 FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration Jia Liu et.al. 2412.20297 null
2024-12-28 Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games Guan-Horng Liu et.al. 2412.20279 null
2024-12-28 Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues Henry J. Xie et.al. 2412.20264 link
2024-12-28 Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception Athanasios Karagounis et.al. 2412.20230 null
2024-12-28 LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning Shuguang Chen et.al. 2412.20227 null
2024-12-28 Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation Yeonhong Park et.al. 2412.20185 null
2024-12-28 LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System Hyucksung Kwon et.al. 2412.20166 null
2024-12-28 StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN Andrzej Bedychaj et.al. 2412.20164 null
2024-12-28 Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems Minhye Jeon et.al. 2412.20163 null
2024-12-28 Multi-Modality Driven LoRA for Adverse Condition Depth Estimation Guanglei Yang et.al. 2412.20162 null
2024-12-28 Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses Xinru Wen et.al. 2412.20154 null
2024-12-28 Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering Wei Zhou et.al. 2412.20145 null
2024-12-28 TradingAgents: Multi-Agents LLM Financial Trading Framework Yijia Xiao et.al. 2412.20138 null
2024-12-28 M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation Zhaopeng Feng et.al. 2412.20127 link
2024-12-28 Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers Tuomas Hakoniemi et.al. 2412.20114 null
2024-12-28 ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming Jiedong Zhuang et.al. 2412.20105 null
2024-12-28 On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs Atmane Ayoub Mansour Bahar et.al. 2412.20087 null
2024-12-31 Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset Chongjian Yue et.al. 2412.20072 null
2024-12-28 On the Compositional Generalization of Multimodal LLMs for Medical Imaging Zhenyang Cai et.al. 2412.20070 link
2024-12-28 VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition Lan Chen et.al. 2412.20064 link
2024-12-28 MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion Zechao Zhan et.al. 2412.20062 null
2024-12-28 Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts Yanxin Shen et.al. 2412.20061 null
2024-12-28 "My life is miserable, have to sign 500 autographs everyday": Exposing Humblebragging, the Brags in Disguise Sharath Naganna et.al. 2412.20057 null
2024-12-27 Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization Kumud Tripathi et.al. 2412.19785 null
2024-12-27 Can AI Help with Your Personal Finances? Oudom Hean et.al. 2412.19784 null
2024-12-27 Tensor Network Estimation of Distribution Algorithms John Gardiner et.al. 2412.19780 null
2024-12-27 Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration Le Chen et.al. 2412.19770 link
2024-12-27 Generative Video Propagation Shaoteng Liu et.al. 2412.19761 null
2024-12-27 On dual-projectively equivalent connections associated to second order superintegrable systems Andreas Vollmer et.al. 2412.19739 null
2024-12-27 Can Large Language Models Adapt to Other Agents In-Context? Matthew Riemer et.al. 2412.19726 null
2024-12-27 From Elements to Design: A Layered Approach for Automatic Graphic Design Composition Jiawei Lin et.al. 2412.19712 null
2024-12-27 Toward Adaptive Reasoning in Large Language Models with Thought Rollback Sijia Chen et.al. 2412.19707 link
2024-12-27 A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization Jingchun Lian et.al. 2412.19685 null
2024-12-27 Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework Jiang Liu et.al. 2412.19684 null
2024-12-27 CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs Siyu Wang et.al. 2412.19663 null
2024-12-27 Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis Jiaqi Wang et.al. 2412.19654 link
2024-12-27 FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios Kaiyi Pang et.al. 2412.19652 null
2024-12-27 Xmodel-2 Technical Report Wang Qun et.al. 2412.19638 null
2024-12-27 IMTP: Search-based Code Generation for In-memory Tensor Programs Yongwon Shin et.al. 2412.19630 null
2024-12-27 Signatures of prediction during natural listening in MEG data? Sahel Azizpour et.al. 2412.19622 null
2024-12-27 Gradient Weight-normalized Low-rank Projection for Efficient LLM Training Jia-Hong Huang et.al. 2412.19616 link
2024-12-27 SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms Shashank Rao Marpally et.al. 2412.19595 null
2024-12-27 Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following Yuxiao Yang et.al. 2412.19562 null
2024-12-27 Diverse Rare Sample Generation with Pretrained GANs Subeen Lee et.al. 2412.19543 link
2024-12-27 Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy--Fokker--Planck Equations Yuanfei Huang et.al. 2412.19520 null
2024-12-27 Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model Hyunwoo Cho et.al. 2412.19517 null
2024-12-27 Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs Zhe Yang et.al. 2412.19513 link
2024-12-27 Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging Hua Farn et.al. 2412.19512 null
2024-12-27 Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion Koustav Ghosal et.al. 2412.19510 null
2024-12-27 MBQ: Modality-Balanced Quantization for Large Vision-Language Models Shiyao Li et.al. 2412.19509 link
2024-12-27 DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-27 Casevo: A Cognitive Agents and Social Evolution Simulator Zexun Jiang et.al. 2412.19498 link
2024-12-27 Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation Chengyang Ye et.al. 2412.19492 link
2024-12-27 Focusing Image Generation to Mitigate Spurious Correlations Xuewei Li et.al. 2412.19457 null
2024-12-27 Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models Hyeonseok Moon et.al. 2412.19450 link
2024-12-27 Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models Shuo Wang et.al. 2412.19449 null
2024-12-27 A Survey on Large Language Model Acceleration based on KV Cache Management Haoyang Li et.al. 2412.19442 link
2024-12-27 Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback Seong Jin Lee et.al. 2412.19436 null
2024-12-27 Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints Alberto Maté et.al. 2412.19424 null
2024-12-27 Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning Chen Li et.al. 2412.19422 link
2024-12-27 MINIMA: Modality Invariant Image Matching Xingyu Jiang et.al. 2412.19412 link
2024-12-27 MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios Jiaqi Fan et.al. 2412.19406 null
2024-12-27 An Engorgio Prompt Makes Large Language Model Babble on Jianshuo Dong et.al. 2412.19394 link
2024-12-26 Large Language Models for Market Research: A Data-augmentation Approach Mengxin Wang et.al. 2412.19363 null
2024-12-26 Dynamic Skill Adaptation for Large Language Models Jiaao Chen et.al. 2412.19361 null
2024-12-26 Identifying Split Vacancies with Foundation Models and Electrostatics Seán R. Kavanagh et.al. 2412.19330 null
2024-12-26 Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment Ziang Yan et.al. 2412.19326 link
2024-12-26 Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones Mehrnaz Mofakhami et.al. 2412.19325 null
2024-12-26 From Interets to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries Hugh Van Deventer et.al. 2412.19312 link
2024-12-26 Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso et.al. 2412.19304 null
2024-12-26 RecLM: Recommendation Instruction Tuning Yangqin Jiang et.al. 2412.19302 link
2024-12-26 RAG with Differential Privacy Nicolas Grislain et.al. 2412.19291 link
2024-12-26 Time Series Foundational Models: Their Role in Anomaly Detection and Prediction Chathurangi Shyalika et.al. 2412.19286 link
2024-12-26 PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing Michael Bezick et.al. 2412.19284 null
2024-12-26 MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes Asma Ben Abacha et.al. 2412.19260 link
2024-12-26 VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis Jaemin Jung et.al. 2412.19259 null
2024-12-26 Sentiment trading with large language models Kemal Kirtac et.al. 2412.19245 null
2024-12-26 SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model Xuyang Li et.al. 2412.19237 null
2024-12-26 Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining Yuxin You et.al. 2412.19211 null
2024-12-26 Multi-Attribute Constraint Satisfaction via Language Model Rewriting Ashutosh Baheti et.al. 2412.19198 null
2024-12-26 Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models Haonan He et.al. 2412.19191 null
2024-12-26 Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue Shuzhi Xu et.al. 2412.19154 null
2024-12-26 AskChart: Universal Chart Understanding through Textual Enhancement Xudong Yang et.al. 2412.19146 link
2024-12-26 SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis Senbin Zhu et.al. 2412.19140 link
2024-12-26 PlanLLM: Video Procedure Planning with Refinable Large Language Models Dejie Yang et.al. 2412.19139 link
2024-12-26 Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing Inpyo Hong et.al. 2412.19125 link
2024-12-26 Discrete vs. Continuous Trade-offs for Generative Models Jathin Korrapati et.al. 2412.19114 null
2024-12-26 SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values Yunfan Zhang et.al. 2412.19113 null
2024-12-26 Stochastic normalizing flows for Effective String Theory Michele Caselle et.al. 2412.19109 null
2024-12-26 "I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities Jiawei Yu et.al. 2412.19102 null
2024-12-26 Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security Vasileios Alevizos et.al. 2412.19088 null
2024-12-26 Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation Haotian Qian et.al. 2412.19080 null
2024-12-26 CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers Jingyi Zheng et.al. 2412.19037 link
2024-12-26 Repository Structure-Aware Training Makes SLMs Better Issue Resolver Zexiong Ma et.al. 2412.19031 null
2024-12-26 Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation Yixin Chen et.al. 2412.19026 link
2024-12-26 Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication Xiqiang Qu et.al. 2412.19025 null
2024-12-26 Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation Tao Liu et.al. 2412.19021 null
2024-12-26 Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability Ruixi Lin et.al. 2412.19018 null
2024-12-25 How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study Alejandro Velasco et.al. 2412.18989 null
2024-12-25 ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement Zhefan Rao et.al. 2412.18966 null
2024-12-25 Musings About the Future of Search: A Return to the Past? Jimmy Lin et.al. 2412.18956 null
2024-12-25 A Power-Efficient Hardware Implementation of L-Mul Ruiqi Chen et.al. 2412.18948 null
2024-12-25 MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models Kaiwen Zuo et.al. 2412.18947 null
2024-12-25 Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations Yewon Kim et.al. 2412.18940 null
2024-12-25 Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference Libo Zhang et.al. 2412.18934 null
2024-12-25 UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation Lunhao Duan et.al. 2412.18928 null
2024-12-25 Exemplar-condensed Federated Class-incremental Learning Rui Sun et.al. 2412.18926 null
2024-12-25 Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model Yi-Chia Chen et.al. 2412.18917 link
2024-12-25 AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures Situo Zhang et.al. 2412.18910 null
2024-12-25 CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models Ping Guo et.al. 2412.18890 link
2024-12-25 MotionMap: Representing Multimodality in Human Pose Forecasting Reyhaneh Hosseininejad et.al. 2412.18883 null
2024-12-25 Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models Meltem Aksoy et.al. 2412.18863 null
2024-12-25 Improving the Readability of Automatically Generated Tests using Large Language Models Matteo Biagiola et.al. 2412.18843 null
2024-12-25 LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements Hao Zhang et.al. 2412.18835 null
2024-12-25 Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition Shujie Hu et.al. 2412.18832 null
2024-12-25 RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting Yilei Jiang et.al. 2412.18826 null
2024-12-25 CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection Wenbin Li et.al. 2412.18820 link
2024-12-25 LLM-assisted vector similarity search Md Riyadh et.al. 2412.18819 null
2024-12-25 DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search Lei Yang et.al. 2412.18811 null
2024-12-25 Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation Xinkai Du et.al. 2412.18800 null
2024-12-25 Torque-Aware Momentum Pranshu Malviya et.al. 2412.18790 null
2024-12-25 Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models Yu-An Liu et.al. 2412.18770 link
2024-12-25 The Impact of Input Order Bias on Large Language Models for Software Fault Localization Md Nakhla Rafi et.al. 2412.18750 null
2024-12-24 Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models Zehan Wang et.al. 2412.18605 link
2024-12-24 Long-Form Speech Generation with Spoken Language Models Se Jin Park et.al. 2412.18603 link
2024-12-24 Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems Fernando Jia et.al. 2412.18601 link
2024-12-24 ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation Hongjie Li et.al. 2412.18600 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-24 A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs OpenMind et.al. 2412.18588 null
2024-12-24 Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control Sergey Sedov et.al. 2412.18582 null
2024-12-24 Zero-resource Speech Translation and Recognition with LLMs Karel Mundnich et.al. 2412.18566 null
2024-12-24 Distilling Fine-grained Sentiment Understanding from Large Language Models Yice Zhang et.al. 2412.18552 link
2024-12-24 Token-Budget-Aware LLM Reasoning Tingxu Han et.al. 2412.18547 link
2024-12-24 PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction Xingjian Xu et.al. 2412.18541 null
2024-12-24 Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation Derong Xu Xinhang Li et.al. 2412.18537 link
2024-12-24 Automated Code Review In Practice Umut Cihan et.al. 2412.18531 null
2024-12-24 Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving Hao Pang et.al. 2412.18511 null
2024-12-24 Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization Yi-Fu Fu et.al. 2412.18497 null
2024-12-24 GeFL: Model-Agnostic Federated Learning with Generative Models Honggu Kang et.al. 2412.18460 null
2024-12-24 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding Tatiana Zemskova et.al. 2412.18450 link
2024-12-24 Is Large Language Model Good at Triple Set Prediction? An Empirical Study Yuan Yuan et.al. 2412.18443 null
2024-12-24 Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm O. Deniz Akyildiz et.al. 2412.18432 null
2024-12-24 GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent Kangjia Zhao et.al. 2412.18426 null
2024-12-24 Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models Zihan Zhou et.al. 2412.18419 null
2024-12-24 Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles Zihan Wang et.al. 2412.18416 null
2024-12-24 Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English Avinash Anand et.al. 2412.18415 link
2024-12-24 Discovery of 2D Materials via Symmetry-Constrained Diffusion Model Shihang Xu et.al. 2412.18414 null
2024-12-24 A Statistical Framework for Ranking LLM-Based Chatbots Siavash Ameli et.al. 2412.18407 link
2024-12-24 Extract Free Dense Misalignment from CLIP JeongYeon Nam et.al. 2412.18404 link
2024-12-24 RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction Wu Xiaoping et.al. 2412.18390 null
2024-12-24 MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs Qiuyi Gu et.al. 2412.18381 null
2024-12-24 Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents Kaiwen Ning et.al. 2412.18371 link
2024-12-24 Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.18351 null
2024-12-24 M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models Jiaxin Guo et.al. 2412.18299 null
2024-12-24 Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight Xi Ding et.al. 2412.18298 link
2024-12-24 Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases Christian Di Maio et.al. 2412.18295 null
2024-12-24 DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation Junyi Lu et.al. 2412.18291 null
2024-12-24 Improved Feature Generating Framework for Transductive Zero-shot Learning Zihan Ye et.al. 2412.18282 null
2024-12-24 GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications Zhenzhou Jin et.al. 2412.18281 null
2024-12-24 Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization Jiacai Liu et.al. 2412.18279 null
2024-12-24 GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge Shammur Absar Chowdhury et.al. 2412.18274 null
2024-12-24 Annotating References to Mythological Entities in French Literature Thierry Poibeau et.al. 2412.18270 null
2024-12-24 Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study Xuefeng Jiang et.al. 2412.18260 link
2024-12-24 AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction Pufan Zou et.al. 2412.18255 null
2024-12-24 An Automatic Graph Construction Framework based on Large Language Models for Recommendation Rong Shan et.al. 2412.18241 link
2024-12-24 Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection Jango Zhang et.al. 2412.18225 null
2024-12-24 Expand VSR Benchmark for VLLM to Expertize in Spatial Rules Peijin Xie et.al. 2412.18224 link
2024-12-24 ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation Mengyang Wu et.al. 2412.18216 link
2024-12-24 Adapting Large Language Models for Improving TCP Fairness over WiFi Shyam Kumar Shrestha et.al. 2412.18200 null
2024-12-24 Robustness-aware Automatic Prompt Optimization Zeru Shi et.al. 2412.18196 link
2024-12-24 VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks Shiduo Zhang et.al. 2412.18194 null
2024-12-24 TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization Yucong Luo et.al. 2412.18185 null
2024-12-24 Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation Yucong Luo et.al. 2412.18176 null
2024-12-24 INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent Haohang Li et.al. 2412.18174 null
2024-12-24 Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models Xiaomeng Hu et.al. 2412.18171 null
2024-12-24 KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management Rongxin Cheng et.al. 2412.18169 null
2024-12-24 Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence Yinbin Han et.al. 2412.18164 null
2024-12-24 VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities Shray Mathur et.al. 2412.18161 null
2024-12-24 Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task Jinming Liu et.al. 2412.18158 null
2024-12-24 Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance Yaoyun Zhang et.al. 2412.18157 null
2024-12-24 scReader: Prompting Large Language Models to Interpret scRNA-seq Data Cong Li et.al. 2412.18156 null
2024-12-24 GeneSUM: Large Language Model-based Gene Summary Extraction Zhijian Chen et.al. 2412.18154 null
2024-12-24 CoAM: Corpus of All-Type Multiword Expressions Yusuke Ide et.al. 2412.18151 null
2024-12-24 EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation Shuhao Han et.al. 2412.18150 link
2024-12-24 Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction Xiao Guo et.al. 2412.18149 null
2024-12-24 Ensuring Consistency for In-Image Translation Chengpeng Fu et.al. 2412.18139 null
2024-12-24 LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment Binrui Zeng et.al. 2412.18135 null
2024-12-24 VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection Zhaohui Jin et.al. 2412.18124 null
2024-12-24 AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation Hao Wen et.al. 2412.18116 null
2024-12-24 AIGT: AI Generative Table Based on Prompt Mingming Zhang et.al. 2412.18111 null
2024-12-24 SlimGPT: Layer-wise Structured Pruning for Large Language Models Gui Ling et.al. 2412.18110 null
2024-12-24 Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach Jing Bi et.al. 2412.18108 null
2024-12-24 Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels Mingcong Song et.al. 2412.18106 null
2024-12-24 EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent Suyuan Wang et.al. 2412.18100 null
2024-12-24 Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) -- a Large Language Model Chatbot for Perioperative Medicine Yu He Ke et.al. 2412.18096 null
2024-12-24 Molly: Making Large Language Model Agents Solve Python Problem More Logically Rui Xiao et.al. 2412.18093 null
2024-12-24 Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner Aizierjiang Aiersilan et.al. 2412.18086 link
2024-12-24 Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models Xuan Lin et.al. 2412.18084 link
2024-12-24 Improving Factuality with Explicit Working Memory Mingda Chen et.al. 2412.18069 null
2024-12-24 LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR Osama Hosam Abdellaif et.al. 2412.18063 link
2024-12-24 Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction Hyunbae Jeon et.al. 2412.18061 null
2024-12-24 An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM Wen Wen et.al. 2412.18060 null
2024-12-23 Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations Maya Patel et.al. 2412.18051 null
2024-12-23 AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data Mirko Zaffaroni et.al. 2412.18038 link
2024-12-23 Generating refactored code accurately using reinforcement learning Indranil Palit et.al. 2412.18035 null
2024-12-23 More than Chit-Chat: Developing Robots for Small-Talk Interactions Rebecca Ramnauth et.al. 2412.18023 null
2024-12-23 Trustworthy and Efficient LLMs Meet Databases Kyoungmin Kim et.al. 2412.18022 null
2024-12-23 StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs Hailin Chen et.al. 2412.18011 null
2024-12-23 CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models Ruibo Tu et.al. 2412.17970 link
2024-12-23 LMV-RPA: Large Model Voting-based Robotic Process Automation Osama Abdellatif et.al. 2412.17965 link
2024-12-23 Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models Antony Seabra et.al. 2412.17964 null
2024-12-23 Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models Ge Zhang et.al. 2412.17963 null
2024-12-23 Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents Antony Seabra et.al. 2412.17942 null
2024-12-23 BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism Martin Fajcik et.al. 2412.17933 null
2024-12-23 Causal Composition Diffusion Model for Closed-loop Traffic Generation Haohong Lin et.al. 2412.17920 null
2024-12-23 Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning Orson Mengara et.al. 2412.17908 null
2024-12-23 LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses Sara Riazi et.al. 2412.17892 null
2024-12-23 ChatGarment: Garment Estimation, Generation and Editing via Large Language Models Siyuan Bian et.al. 2412.17811 null
2024-12-23 Reconstructing People, Places, and Cameras Lea Müller et.al. 2412.17806 null
2024-12-23 Automating the Search for Artificial Life with Foundation Models Akarsh Kumar et.al. 2412.17799 link
2024-12-23 ResearchTown: Simulator of Human Research Community Haofei Yu et.al. 2412.17767 link
2024-12-23 ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback Wei Zhang et.al. 2412.17754 null
2024-12-23 Deliberation in Latent Space via Differentiable Cache Augmentation Luyang Liu et.al. 2412.17747 null
2024-12-23 YuLan-Mini: An Open Data-efficient Language Model Yiwen Hu et.al. 2412.17743 link
2024-12-23 Reasoning to Attend: Try to Understand How Token Works Rui Qian et.al. 2412.17741 link
2024-12-23 Knowledge Editing through Chain-of-Thought Changyue Wang et.al. 2412.17727 link
2024-12-23 Understanding the Logic of Direct Preference Alignment through Logic Kyle Richardson et.al. 2412.17696 null
2024-12-23 Large Language Model Safety: A Holistic Survey Dan Shi et.al. 2412.17686 link
2024-12-23 A Bias-Free Training Paradigm for More General AI-generated Image Detection Fabrizio Guillaro et.al. 2412.17671 null
2024-12-23 Generating Completions for Fragmented Broca's Aphasic Sentences Using Large Language Models Sijbren van Vaals et.al. 2412.17669 link
2024-12-23 Detecting anxiety and depression in dialogues: a multi-label and explainable approach Francisco de Arriba-Pérez et.al. 2412.17651 null
2024-12-23 SCBench: A Sports Commentary Benchmark for Video LLMs Kuangzhi Ge et.al. 2412.17637 null
2024-12-23 ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance Renyang Liu et.al. 2412.17632 link
2024-12-23 Tracking the Feature Dynamics in LLM Training: A Mechanistic Study Yang Xu et.al. 2412.17626 null
2024-12-23 Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models Parham Rezaei et.al. 2412.17622 link
2024-12-23 Emerging Security Challenges of Large Language Models Herve Debar et.al. 2412.17614 null
2024-12-23 Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs Fabrizio Frasca et.al. 2412.17609 null
2024-12-23 EasyTime: Time Series Forecasting Made Easy Xiangfei Qiu et.al. 2412.17603 null
2024-12-23 LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context Kai Ruan et.al. 2412.17596 link
2024-12-23 Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation Chengbing Wang et.al. 2412.17593 null
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-23 S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field Zixi Liang et.al. 2412.17561 link
2024-12-23 GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Chao Zeng et.al. 2412.17560 null
2024-12-23 A Survey of Query Optimization in Large Language Models Mingyang Song et.al. 2412.17558 null
2024-12-23 Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing Prakash Aryan et.al. 2412.17548 link
2024-12-23 Retention Score: Quantifying Jailbreak Risks for Vision Language Models Zaitang Li et.al. 2412.17544 null
2024-12-23 Constructing Fair Latent Space for Intersection of Fairness and Explainability Hyungjun Joo et.al. 2412.17523 null
2024-12-23 DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak Hao Wang et.al. 2412.17522 null
2024-12-23 Improving the Noise Estimation of Latent Neural Stochastic Differential Equations Linus Heck et.al. 2412.17499 null
2024-12-23 Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings Jérémie Sublime et.al. 2412.17486 null
2024-12-23 Power- and Fragmentation-aware Online Scheduling for GPU Datacenters Francesco Lettich et.al. 2412.17484 link
2024-12-23 A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression Chenlong Deng et.al. 2412.17483 null
2024-12-23 A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers Shuaihang Chen et.al. 2412.17481 link
2024-12-23 CALLIC: Content Adaptive Learning for Lossless Image Compression Daxin Li et.al. 2412.17464 null
2024-12-23 Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning Xiaodan Chen et.al. 2412.17456 null
2024-12-23 Applying LLM and Topic Modelling in Psychotherapeutic Contexts Alexander Vanin et.al. 2412.17449 null
2024-12-23 Measuring Contextual Informativeness in Child-Directed Text Maria Valentini et.al. 2412.17427 link
2024-12-23 Multimodal Preference Data Synthetic Alignment with Reward Model Robert Wijaya et.al. 2412.17417 link
2024-12-23 VidCtx: Context-aware Video Question Answering with Image Models Andreas Goulas et.al. 2412.17415 null
2024-12-23 Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance Muhammad Reza Qorib et.al. 2412.17408 link
2024-12-23 Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning Huchen Jiang et.al. 2412.17397 null
2024-12-23 WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models Huawen Feng et.al. 2412.17395 null
2024-12-23 Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement Hyeonjin Kim et.al. 2412.17387 link
2024-12-23 Interweaving Memories of a Siamese Large Language Model Xin Song et.al. 2412.17383 link
2024-12-23 MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models Beibei Yu et.al. 2412.17339 null
2024-12-23 A Dual-Perspective Metaphor Detection Framework Using Large Language Models Yujie Lin et.al. 2412.17332 link
2024-12-23 Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance Nicolas Devatine et.al. 2412.17321 null
2024-12-23 CodeV: Issue Resolving with Visual Data Linhao Zhang et.al. 2412.17315 link
2024-12-23 Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories Mahan Tafreshipour et.al. 2412.17298 null
2024-12-23 Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples Taewoong Kim et.al. 2412.17288 link
2024-12-23 LLM4AD: A Platform for Algorithm Design with Large Language Model Fei Liu et.al. 2412.17287 link
2024-12-23 Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning Rui Liang et.al. 2412.17285 null
2024-12-23 Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach Rafid Ishrak Jahan et.al. 2412.17255 link
2024-12-23 SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval Xiaopeng Li et.al. 2412.17250 null
2024-12-23 EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling Zichen Song et.al. 2412.17249 null
2024-12-23 On the Generalization Ability of Machine-Generated Text Detectors Yule Liu et.al. 2412.17242 link
2024-12-23 Brain-to-Text Benchmark '24: Lessons Learned Francis R. Willett et.al. 2412.17227 link
2024-12-23 CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder Lichen Ma et.al. 2412.17225 null
2024-12-22 Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension Jio Oh et.al. 2412.17189 null
2024-12-22 Foundation Model for Lossy Compression of Spatiotemporal Scientific Data Xiao Li et.al. 2412.17184 null
2024-12-22 Enhancing Item Tokenization for Generative Recommendation through Self-Improvement Runjin Chen et.al. 2412.17171 null
2024-12-22 Generative Diffusion Modeling: A Practical Handbook Zihan Ding et.al. 2412.17162 null
2024-12-22 LLM-based relevance assessment still can't replace human relevance assessment Charles L. A. Clarke et.al. 2412.17156 null
2024-12-22 LLM Agent for Fire Dynamics Simulations Leidong Xu et.al. 2412.17146 null
2024-12-22 Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs Rushendra Sidibomma et.al. 2412.17131 null
2024-12-22 Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models Cameron R. Jones et.al. 2412.17128 null
2024-12-22 Learning to Adapt to Low-Resource Paraphrase Generation Zhigen Li et.al. 2412.17111 null
2024-12-22 DreamOmni: Unified Image Generation and Editing Bin Xia et.al. 2412.17098 null
2024-12-22 Analysis on LLMs Performance for Code Summarization Md. Ahnaf Akib et.al. 2412.17094 null
2024-12-22 SAIL: Sample-Centric In-Context Learning for Document Information Extraction Jinyu Zhang et.al. 2412.17092 link
2024-12-22 SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults Jinzhi Wang et.al. 2412.17077 null
2024-12-22 The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM's Internal States Fabian Ridder et.al. 2412.17056 link
2024-12-22 DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately Huiwen Wu et.al. 2412.17053 null
2024-12-22 ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues Shaina Raza et.al. 2412.17052 link
2024-12-22 Modular Conversational Agents for Surveys and Interviews Jiangbo Yu et.al. 2412.17049 null
2024-12-22 Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective Hankun Wang et.al. 2412.17048 null
2024-12-22 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-22 HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories Eric Hedlin et.al. 2412.17040 null
2024-12-22 Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model Kevin Shen et.al. 2412.17039 null
2024-12-22 Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models Lang Gao et.al. 2412.17034 null
2024-12-22 MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge Jie He et.al. 2412.17032 null
2024-12-22 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos Zhengqian Wu et.al. 2412.17022 link
2024-12-22 GAS: Generative Auto-bidding with Post-training Search Yewen Li et.al. 2412.17018 null
2024-12-22 Robustness of Large Language Models Against Adversarial Attacks Yiyi Tao et.al. 2412.17011 null
2024-12-22 InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions Ronghui Li et.al. 2412.16982 null
2024-12-22 On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora Tzu-Chieh Chen et.al. 2412.16976 null
2024-12-22 Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs Alexander von Recum et.al. 2412.16974 null
2024-12-22 Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach Chunxu Zhang et.al. 2412.16969 link
2024-12-22 System-2 Mathematical Reasoning via Enriched Instruction Tuning Huanqia Cai et.al. 2412.16964 null
2024-12-22 Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework Jundong Xu et.al. 2412.16953 null
2024-12-22 A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation Ekai Hashimoto et.al. 2412.16943 null
2024-12-22 Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering Zhongjian Hu et.al. 2412.16936 null
2024-12-22 Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models Kai Zheng et.al. 2412.16933 null
2024-12-22 Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs Bohan Jin et.al. 2412.16922 null
2024-12-22 Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection Yuhang Gan et.al. 2412.16918 null
2024-12-22 Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation Quan Dao et.al. 2412.16906 null
2024-12-22 Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model Songjun Tu et.al. 2412.16878 link
2024-12-20 HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding Chenxin Tao et.al. 2412.16158 null
2024-12-20 Can Generative Video Models Help Pose Estimation? Ruojin Cai et.al. 2412.16155 null
2024-12-20 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang et.al. 2412.16145 link
2024-12-20 Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation Seyedreza Mohseni et.al. 2412.16135 null
2024-12-20 Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information Dirk Bergemann et.al. 2412.16132 null
2024-12-20 PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics Daniil Larionov et.al. 2412.16120 null
2024-12-20 Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts Muhammad Abdullah Sohail et.al. 2412.16119 link
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-20 The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse Mahyar Habibi et.al. 2412.16114 null
2024-12-20 Logical Consistency of Large Language Models in Fact-checking Bishwamittra Ghosh et.al. 2412.16100 null
2024-12-20 The Evolution of LLM Adoption in Industry Data Curation Practices Crystal Qian et.al. 2412.16089 null
2024-12-20 Efficient MedSAMs: Segment Anything in Medical Images on Laptop Jun Ma et.al. 2412.16085 link
2024-12-20 Formal Mathematical Reasoning: A New Frontier in AI Kaiyu Yang et.al. 2412.16075 null
2024-12-20 The Only Way is Ethics: A Guide to Ethical Research with Large Language Models Eddie L. Ungless et.al. 2412.16022 link
2024-12-20 Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support Qijiong Liu et.al. 2412.15973 link
2024-12-20 From General to Specific: Tailoring Large Language Models for Personalized Healthcare Ruize Shi et.al. 2412.15957 null
2024-12-20 Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring Markus Borg et.al. 2412.15948 null
2024-12-20 Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation Gautier Evennou et.al. 2412.15939 link
2024-12-20 Large Language Model assisted Hybrid Fuzzing Ruijie Meng et.al. 2412.15931 null
2024-12-20 MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection Andrea Moglia et.al. 2412.15925 link
2024-12-20 RiTTA: Modeling Event Relations in Text-to-Audio Generation Yuhang He et.al. 2412.15922 link
2024-12-20 Less is More: Towards Green Code Large Language Models via Unified Structural Pruning Guang Yang et.al. 2412.15921 null
2024-12-20 Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model Yosuke Yamagishi et.al. 2412.15907 null
2024-12-20 Evaluation of Reliability Criteria for News Publishers with Large Language Models Manuel Pratelli et.al. 2412.15896 null
2024-12-20 TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain Camille Barboule et.al. 2412.15891 null
2024-12-20 AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI Katja Bühler et.al. 2412.15876 null
2024-12-20 Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback Jiaming Ji et.al. 2412.15838 link
2024-12-20 WebLLM: A High-Performance In-Browser LLM Inference Engine Charlie F. Ruan et.al. 2412.15803 link
2024-12-20 Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Sungjin Park et.al. 2412.15797 null
2024-12-20 GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning Heming Zhang et.al. 2412.15790 null
2024-12-20 Linguistic Features Extracted by GPT-4 Improve Alzheimer's Disease Detection based on Spontaneous Speech Jonathan Heitz et.al. 2412.15772 link
2024-12-20 Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference Jorge García-Carrasco et.al. 2412.15750 link
2024-12-20 Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models Shamus Sim et.al. 2412.15748 null
2024-12-20 VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models Dexter Neo et.al. 2412.15739 null
2024-12-20 AutoLife: Automatic Life Journaling with Smartphones and LLMs Huatao Xu et.al. 2412.15714 null
2024-12-20 Contrastive Learning for Task-Independent SpeechLLM-Pretraining Maike Züfle et.al. 2412.15712 link
2024-12-20 Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback Niklas Ippisch et.al. 2412.15702 null
2024-12-20 Code Review Automation Via Multi-task Federated LLM -- An Empirical Study Jahnavi Kumar et.al. 2412.15676 null
2024-12-20 Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline Guancheng Zeng et.al. 2412.15660 null
2024-12-20 Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class Annie D'souza et.al. 2412.15657 null
2024-12-20 MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Sieun Hyeon et.al. 2412.15655 link
2024-12-20 Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution Wentao Tan et.al. 2412.15650 null
2024-12-20 Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model Xin Du et.al. 2412.15634 link
2024-12-20 Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning? Mengyu Ye et.al. 2412.15628 null
2024-12-20 JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs Hongyi Li et.al. 2412.15623 null
2024-12-20 Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage Zhi Gao et.al. 2412.15606 null
2024-12-20 Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks Brian J Chan et.al. 2412.15605 link
2024-12-20 Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification Gyutae Park et.al. 2412.15603 null
2024-12-20 Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation Xiaoqiang Kang et.al. 2412.15594 link
2024-12-20 NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization Danial Kamali et.al. 2412.15588 link
2024-12-20 To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models Jessica Y. Bo et.al. 2412.15584 null
2024-12-20 A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation Ryien Hosseini et.al. 2412.15582 null
2024-12-20 Score-based Generative Diffusion Models for Social Recommendations Chengyi Liu et.al. 2412.15579 link
2024-12-20 QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning Xinyang Tong et.al. 2412.15576 null
2024-12-20 J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM Takero Yoshida et.al. 2412.15574 null
2024-12-20 Continual Learning Using a Kernel-Based Method Over Foundation Models Saleh Momeni et.al. 2412.15571 link
2024-12-20 DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation Yichun Tai et.al. 2412.15570 link
2024-12-20 In-context Continual Learning Assisted by an External Continual Learner Saleh Momeni et.al. 2412.15563 null
2024-12-20 NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning Zheyuan Zhang et.al. 2412.15547 null
2024-12-20 MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering Zhang Siyue et.al. 2412.15540 null
2024-12-20 XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation Qianren Mao et.al. 2412.15529 link
2024-12-20 HREF: Human Response-Guided Evaluation of Instruction Following in Language Models Xinxi Lyu et.al. 2412.15524 link
2024-12-20 PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time Alireza Pourali et.al. 2412.15519 link
2024-12-20 Stylish and Functional: Guided Interpolation Subject to Physical Constraints Yan-Ying Chen et.al. 2412.15507 null
2024-12-20 Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework Zhenjie Xu et.al. 2412.15504 link
2024-12-20 Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models Zhisheng Tang et.al. 2412.15501 null
2024-12-20 TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use Junjie Ye et.al. 2412.15495 link
2024-12-20 PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search Jiaxin Wu et.al. 2412.15494 null
2024-12-20 GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators Hengjia Li et.al. 2412.15491 null
2024-12-20 Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Saehyung Lee et.al. 2412.15484 null
2024-12-20 Continual Learning Using Only Large Language Model Prompting Jiabao Qiu et.al. 2412.15479 null
2024-12-19 TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models Ammar N. Abbas et.al. 2412.15462 null
2024-12-19 Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization Sahil Wadhwa et.al. 2412.15453 null
2024-12-19 AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals Angela Mastrianni et.al. 2412.15444 null
2024-12-19 SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval Aakash Mahalingam et.al. 2412.15443 null
2024-12-19 Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models Tianchen Zhang et.al. 2412.15431 null
2024-12-19 MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale Swapnil Gandhi et.al. 2412.15411 null
2024-12-19 Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification Edoardo Allegrini et.al. 2412.15410 null
2024-12-19 Systematic Evaluation of Long-Context LLMs on Financial Concepts Lavanya Gupta et.al. 2412.15386 null
2024-12-19 Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation Joanne Boisson et.al. 2412.15375 link
2024-12-19 Automated Root Cause Analysis System for Complex Data Products Mathieu Demarne et.al. 2412.15374 null
2024-12-19 Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs Liam Seymour et.al. 2412.15352 link
2024-12-19 Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models Reza Shirkavand et.al. 2412.15341 null
2024-12-19 Complete background cosmology of parity-even quadratic metric-affine gravity Thomas Dyer et.al. 2412.15329 null
2024-12-19 OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving Shuo Xing et.al. 2412.15208 link
2024-12-19 MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark Qihao Zhao et.al. 2412.15194 link
2024-12-19 LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation Weijia Shi et.al. 2412.15188 null
2024-12-19 Tiled Diffusion Or Madar et.al. 2412.15185 null
2024-12-19 Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning Simon Frieder et.al. 2412.15184 null
2024-12-19 STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning Marius Memmel et.al. 2412.15182 null
2024-12-19 HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages Aman Chaturvedi et.al. 2412.15178 null
2024-12-19 Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying Federico Castagna et.al. 2412.15177 link
2024-12-19 Rethinking Uncertainty Estimation in Natural Language Generation Lukas Aichberger et.al. 2412.15176 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Language Models as Continuous Self-Evolving Data Engineers Peidong Wang et.al. 2412.15151 null
2024-12-19 Jet: A Modern Transformer-Based Normalizing Flow Alexander Kolesnikov et.al. 2412.15129 null
2024-12-19 Adaptive Pruning for Large Language Models with Structural Importance Awareness Haotian Zheng et.al. 2412.15127 null
2024-12-19 Outcome-Refining Process Supervision for Code Generation Zhuohao Yu et.al. 2412.15118 link
2024-12-19 Qwen2.5 Technical Report Qwen et.al. 2412.15115 link
2024-12-19 Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture Thomas F Burns et.al. 2412.15113 link
2024-12-19 Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation Yang Tian et.al. 2412.15109 link
2024-12-19 Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability Xiangsen Chen et.al. 2412.15101 null
2024-12-19 Nano-ESG: Extracting Corporate Sustainability Information from News Articles Fabian Billert et.al. 2412.15093 link
2024-12-19 Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation Haoran Liu et.al. 2412.15086 null
2024-12-19 ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots Bhupendra Acharya et.al. 2412.15072 null
2024-12-19 ConfliBERT: A Language Model for Political Conflict Patrick T. Brandt et.al. 2412.15060 link
2024-12-19 LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps Felix Friedrich et.al. 2412.15035 null
2024-12-19 DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space Mang Ning et.al. 2412.15032 link
2024-12-19 Large Language Models and Code Security: A Systematic Literature Review Enna Basic et.al. 2412.15004 null
2024-12-19 HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs Pham Vu Tuan Dat et.al. 2412.14995 link
2024-12-19 RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning Raphael Memmesheimer et.al. 2412.14989 null
2024-12-19 Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts Ioana Buhnila et.al. 2412.14986 null
2024-12-19 AI and Cultural Context: An Empirical Investigation of Large Language Models' Performance on Chinese Social Work Professional Standards Zia Qi et.al. 2412.14971 null
2024-12-19 Movie2Story: A framework for understanding videos and telling stories in the form of novel text Kangning Li et.al. 2412.14965 null
2024-12-19 Knowledge Injection via Prompt Distillation Kalle Kujanpää et.al. 2412.14964 null
2024-12-19 Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities Daniil Medyakov et.al. 2412.14935 null
2024-12-19 RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response Junyu Luo et.al. 2412.14922 link
2024-12-19 Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation Zexiong Ma et.al. 2412.14905 null
2024-12-19 Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering Peize Li et.al. 2412.14880 null
2024-12-19 Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering Imed Keraghel et.al. 2412.14867 null
2024-12-19 Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling Junyi Li et.al. 2412.14860 null
2024-12-19 DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis Hongling Xu et.al. 2412.14849 link
2024-12-19 Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas Pietro Bernardelle et.al. 2412.14843 null
2024-12-19 Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis Greta Dolcetti et.al. 2412.14841 null
2024-12-19 Progressive Multimodal Reasoning via Active Retrieval Guanting Dong et.al. 2412.14835 null
2024-12-19 Answer Set Networks: Casting Answer Set Programming into Deep Learning Arseny Skryagin et.al. 2412.14814 link
2024-12-19 ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis Zeao Tu et.al. 2412.14809 link
2024-12-19 Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning Ziang Ye et.al. 2412.14780 null
2024-12-19 ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine Rabee Qasem et.al. 2412.14771 null
2024-12-19 PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children Yiqun Zhang et.al. 2412.14769 link
2024-12-19 CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering Ruida Hu et.al. 2412.14764 link
2024-12-19 Query pipeline optimization for cancer patient question answering systems Maolin He et.al. 2412.14751 null
2024-12-19 Active Inference and Human--Computer Interaction Roderick Murray-Smith et.al. 2412.14741 null
2024-12-19 On Verbalized Confidence Scores for LLMs Daniel Yang et.al. 2412.14737 link
2024-12-19 Creation of AI-driven Smart Spaces for Enhanced Indoor Environments -- A Survey Aygün Varol et.al. 2412.14708 null
2024-12-19 LLMs as mediators: Can they diagnose conflicts accurately? Özgecan Koçak et.al. 2412.14675 null
2024-12-19 Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT Hassane Kissane et.al. 2412.14670 null
2024-12-19 IOHunter: Graph Foundation Model to Uncover Online Information Operations Marco Minici et.al. 2412.14663 link
2024-12-19 Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models Zijun Chen et.al. 2412.14660 link
2024-12-19 Length Controlled Generation for Black-box LLMs Yuxuan Gu et.al. 2412.14656 null
2024-12-19 Learning to Generate Research Idea with Dynamic Control Ruochen Li et.al. 2412.14626 null
2024-12-19 How good is GPT at writing political speeches for the White House? Jacques Savoy et.al. 2412.14617 null
2024-12-19 Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning Kepu Zhang et.al. 2412.14588 null
2024-12-19 HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning Minkuk Kim et.al. 2412.14585 null
2024-12-19 Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues Tao He et.al. 2412.14584 null
2024-12-19 CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation Youngwon Lee et.al. 2412.14581 null
2024-12-19 DiffSim: Taming Diffusion Models for Evaluating Visual Similarity Yiren Song et.al. 2412.14580 link
2024-12-19 Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models Wenhan Liu et.al. 2412.14574 link
2024-12-19 ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model Shunlin Lu et.al. 2412.14559 null
2024-12-19 The Current Challenges of Software Engineering in the Era of Large Language Models Cuiyun Gao et.al. 2412.14554 null
2024-12-19 Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models Xiao Cui et.al. 2412.14528 link
2024-12-19 Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment Teng Xiao et.al. 2412.14516 link
2024-12-19 Relational Programming with Foundation Models Ziyang Li et.al. 2412.14515 null
2024-12-19 PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization Jiayi Wu et.al. 2412.14510 link
2024-12-19 Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs Yuzuki Arai et.al. 2412.14501 null
2024-12-19 Guided Diffusion Model for Sensor Data Obfuscation Xin Yang et.al. 2412.14499 null
2024-12-19 FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis Abdullah Khan et.al. 2412.14492 link
2024-12-19 Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities Amandeep Kaur et.al. 2412.14486 null
2024-12-19 DirectorLLM for Human-Centric Video Generation Kunpeng Song et.al. 2412.14484 null
2024-12-19 Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs Koshiro Saito et.al. 2412.14471 null
2024-12-19 Agent-SafetyBench: Evaluating the Safety of LLM Agents Zhexin Zhang et.al. 2412.14470 link
2024-12-19 From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research Xiang Cheng et.al. 2412.14461 null
2024-12-19 LEDiff: Latent Exposure Diffusion for HDR Generation Chao Wang et.al. 2412.14456 null
2024-12-19 Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems Genki Kusano et.al. 2412.14454 null
2024-12-19 Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation Shengqi Liu et.al. 2412.14453 null
2024-12-19 ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study Eric Modesitt et.al. 2412.14436 link
2024-12-19 All-in-One Tuning and Structural Pruning for Domain-Specific LLMs Lei Lu et.al. 2412.14426 null
2024-12-19 FedPIA -- Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning Pramit Saha et.al. 2412.14424 null
2024-12-19 Enhancing Diffusion Models for High-Quality Image Generation Jaineet Shah et.al. 2412.14422 null
2024-12-18 ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers Haowei Liu et.al. 2412.14405 null
2024-12-18 Clinical Trials Ontology Engineering with Large Language Models Berkan Çakır et.al. 2412.14387 null
2024-12-18 ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling William Han et.al. 2412.14373 link
2024-12-18 Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation Yuxuan Jiang et.al. 2412.14368 null
2024-12-18 Surrealistic-like Image Generation with Vision-Language Models Elif Ayten et.al. 2412.14366 link
2024-12-18 ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals Utkarsh Saxena et.al. 2412.14363 link
2024-12-18 A Unifying Information-theoretic Perspective on Evaluating Generative Models Alexis Fox et.al. 2412.14340 null
2024-12-18 Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation Benjamin Steenhoek et.al. 2412.14308 null
2024-12-18 Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs David Restrepo et.al. 2412.14304 null
2024-12-18 Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data haina Raza et.al. 2412.14276 link
2024-12-18 Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Jihan Yang et.al. 2412.14171 link
2024-12-18 MetaMorph: Multimodal Understanding and Generation via Instruction Tuning Shengbang Tong et.al. 2412.14164 null
2024-12-18 TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks Frank F. Xu et.al. 2412.14161 link
2024-12-18 Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models Atin Sakkeer Hussain et.al. 2412.14146 null
2024-12-18 LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research Tianyang Gu et.al. 2412.14141 null

(back to top)

Video Understanding

Publish Date Title Authors PDF Code
2025-01-31 Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search Yuta Oshima et.al. 2501.19252 null
2025-01-31 $\infty$ -Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation Saul Santos et.al. 2501.19098 link
2025-01-30 Every Image Listens, Every Image Dances: Music-Driven Image Animation Zhikang Dong et.al. 2501.18801 null
2025-01-30 MAMS: Model-Agnostic Module Selection Framework for Video Captioning Sangho Lee et.al. 2501.18269 null
2025-01-28 Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding Yun Li et.al. 2501.16786 null
2025-01-28 CascadeV: An Implementation of Wurstchen Architecture for Video Generation Wenfeng Lin et.al. 2501.16612 link
2025-01-27 AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models Zheng Lian et.al. 2501.16566 null
2025-01-27 Understanding Long Videos via LLM-Powered Entity Relation Graphs Meng Chu et.al. 2501.15953 null
2025-01-26 TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Xingjian Zhang et.al. 2501.15513 link
2025-01-26 "See What I Imagine, Imagine What I See": Human-AI Co-Creation System for 360 $^\circ$ Panoramic Video Generation in VR Yunge Wen et.al. 2501.15456 null
2025-01-25 HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding Jiaxing Zhao et.al. 2501.15111 null
2025-01-25 VideoPure: Diffusion-based Adversarial Purification for Video Recognition Kaixun Jiang et.al. 2501.14999 link
2025-01-11 HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators Le Chen et.al. 2501.14794 null
2025-01-24 VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking Runyi Hu et.al. 2501.14195 link
2025-01-24 ENTER: Event Based Interpretable Reasoning for VideoQA Hammad Ayyubi et.al. 2501.14194 null
2025-01-30 Temporal Preference Optimization for Long-Form Video Understanding Rui Li et.al. 2501.13919 null
2025-01-23 Improving Video Generation with Human Feedback Jie Liu et.al. 2501.13918 null
2025-01-23 ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Jianxin Liang et.al. 2501.13536 null
2025-01-23 Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge Haomiao Xiong et.al. 2501.13468 link
2025-01-23 EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion Jiangchuan Wei et.al. 2501.13452 null
2025-01-28 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-21 Taming Teacher Forcing for Masked Autoregressive Video Generation Deyu Zhou et.al. 2501.12389 null
2025-01-22 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-22 Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Sili Chen et.al. 2501.12375 null
2025-01-21 InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Yuhang Zang et.al. 2501.12368 link
2025-01-20 GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video Zhenliang Ni et.al. 2501.11340 null
2025-01-20 CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation Zheng Chong et.al. 2501.11325 null
2025-01-23 HFGCN:Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition Pengcheng Dong et.al. 2501.11007 null
2025-01-18 EMO2: End-Effector Guided Audio-Driven Avatar Video Generation Linrui Tian et.al. 2501.10687 null
2025-01-17 DiffuEraser: A Diffusion Model for Video Inpainting Xiaowen Li et.al. 2501.10018 link
2025-01-17 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation Yuefan Cao et.al. 2501.09982 null
2025-01-16 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos Zhongwei Ren et.al. 2501.09781 null
2025-01-16 Learnings from Scaling Visual Tokenizers for Reconstruction and Generation Philippe Hansen-Estruch et.al. 2501.09755 null
2025-01-14 Do generative video models learn physical principles from watching videos? Saman Motamed et.al. 2501.09038 link
2025-01-15 Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion Jingyuan Chen et.al. 2501.09019 null
2025-01-15 RepVideo: Rethinking Cross-Layer Representation for Video Generation Chenyang Si et.al. 2501.08994 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-31 Comprehensive Subjective and Objective Evaluation Method for Text-generated Video Zelu Qi et.al. 2501.08545 null
2025-01-14 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models Weichen Fan et.al. 2501.08453 null
2025-01-14 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering Meenakshi Krishnan et.al. 2501.08370 null
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 GameFactory: Creating New Games with Generative Interactive Videos Jiwen Yu et.al. 2501.08325 null
2025-01-14 Diffusion Adversarial Post-Training for One-Step Video Generation Shanchuan Lin et.al. 2501.08316 null
2025-01-17 LayerAnimate: Layer-specific Control for Animation Yuxue Yang et.al. 2501.08295 null
2025-01-14 FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Yabo Zhang et.al. 2501.08225 link
2025-01-14 Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jiaxing Zhao et.al. 2501.07978 null
2025-01-24 Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding Liping Yuan et.al. 2501.07888 link
2025-01-14 AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation Sitong Gong et.al. 2501.07810 link
2025-01-13 BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations Weixi Feng et.al. 2501.07647 null
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-17 MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning Tieyuan Chen et.al. 2501.07227 null
2025-01-13 TimeLogic: A Temporal Logic Benchmark for Video QA Sirnam Swetha et.al. 2501.07214 null
2025-01-13 Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling Jiebin Yan et.al. 2501.07087 null
2025-01-12 X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding Wenqi Zhou et.al. 2501.06835 null
2025-01-12 VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning Ji Soo Lee et.al. 2501.06761 link
2025-01-11 Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning Maomao Li et.al. 2501.06438 null
2025-01-10 MEt3R: Measuring Multi-View Consistency in Generated Images Mohammad Asim et.al. 2501.06336 null
2025-01-10 Multi-subject Open-set Personalization in Video Generation Tsai-Shien Chen et.al. 2501.06187 null
2025-01-10 VideoAuteur: Towards Long Narrative Video Generation Junfei Xiao et.al. 2501.06173 null
2025-01-13 Valley2: Exploring Multimodal Models with Scalable Vision-Language Design Ziheng Wu et.al. 2501.05901 link
2025-01-10 Zero-shot Shark Tracking and Biometrics from Aerial Imagery Chinmay K Lalgudi et.al. 2501.05717 null
2025-01-10 From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities Dominick Reilly et.al. 2501.05711 link
2025-01-09 OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Yifei Li et.al. 2501.05510 link
2025-01-08 Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion Yongjia Ma et.al. 2501.05484 null
2025-01-09 Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces Aniruddha Mahapatra et.al. 2501.05442 null
2025-01-09 Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning Huabin Liu et.al. 2501.05069 null
2025-01-09 LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding Jiaxing Zhao et.al. 2501.05067 null
2025-01-09 LongViTU: Instruction Tuning for Long-Form Video Understanding Rujie Wu et.al. 2501.05037 null
2025-01-09 ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark Ronghao Dang et.al. 2501.05031 link
2025-01-08 ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Yuzhou Huang et.al. 2501.04698 null
2025-01-08 Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs Zeyi Huang et.al. 2501.04336 null
2025-01-08 H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving Siran Chen et.al. 2501.04302 null
2025-01-08 LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition Bowen Hao et.al. 2501.04204 null
2024-12-18 FlexCache: Flexible Approximate Cache System for Video Diffusion Desen Sun et.al. 2501.04012 null
2025-01-07 Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Yuechen Zhang et.al. 2501.03931 link
2025-01-09 Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Zekai Gu et.al. 2501.03847 link
2025-01-07 Motion-Aware Generative Frame Interpolation Guozhen Zhang et.al. 2501.03699 null
2025-01-06 License Plate Images Generation with Diffusion Models Mariia Shpir et.al. 2501.03374 null
2025-01-03 Classifier-Guided Captioning Across Modalities Ariel Shaulov et.al. 2501.03183 null
2025-01-06 Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation Guy Yariv et.al. 2501.03059 null
2025-01-20 TransPixeler: Advancing Text-to-Video Generation with Transparency Luozhou Wang et.al. 2501.03006 link
2025-01-06 MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models Wenyi Hong et.al. 2501.02955 null
2025-01-06 Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising Yunlong Yuan et.al. 2501.02741 null
2025-01-05 GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking Weikang Bian et.al. 2501.02690 null
2025-01-29 Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey Zongxia Li et.al. 2501.02189 link
2025-01-10 Gender Bias in Text-to-Video Generation Models: A case study of Sora Mohammad Nadeem et.al. 2501.01987 null
2024-12-30 FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models Tianyu Fu et.al. 2501.01986 link
2025-01-03 JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing Qili Wang et.al. 2501.01798 link
2025-01-03 HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding Heqing Zou et.al. 2501.01645 null
2025-01-07 VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control Yuanpeng Tu et.al. 2501.01427 null
2025-01-02 Unifying Specialized Visual Encoders for Video Language Models Jihoon Chung et.al. 2501.01426 link
2025-01-03 Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions Xincheng Shuai et.al. 2501.01425 null
2025-01-02 Multi-Modal Video Feature Extraction for Popularity Prediction Haixu Liu et.al. 2501.01422 null
2025-01-02 On Unifying Video Generation and Camera Pose Estimation Chun-Hao Paul Huang et.al. 2501.01409 null
2025-01-29 Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform Cheonsu Jeong et.al. 2501.00750 null
2025-01-03 DreamDrive: Generative 4D Scene Modeling from Street View Images Jiageng Mao et.al. 2501.00601 null
2025-01-08 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM Yuqian Yuan et.al. 2501.00599 link
2024-12-31 Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method Zhenpeng Huang et.al. 2501.00584 null
2024-12-31 Fine-grained Video-Text Retrieval: A New Benchmark and Method Yifan Xu et.al. 2501.00513 null
2024-12-31 OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models Lala Shakti Swarup Ray et.al. 2501.00432 null
2025-01-09 Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding Yue Fan et.al. 2501.00358 null
2024-12-30 Detection-Fusion for Knowledge Graph Extraction from Videos Taniya Das et.al. 2501.00136 link
2024-12-30 LTX-Video: Realtime Video Latent Diffusion Yoav HaCohen et.al. 2501.00103 link
2024-12-30 Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model Yifei Huang et.al. 2412.21080 link
2024-12-30 VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation Jiazheng Xu et.al. 2412.21059 link
2024-12-30 Hierarchical Banzhaf Interaction for General Video-Language Representation Learning Peng Jin et.al. 2412.20964 link
2024-12-30 ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation Ting Zhang et.al. 2412.20901 null
2024-12-30 Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling Min Zhang et.al. 2412.20725 null
2025-01-05 ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding Xiao Wang et.al. 2412.20504 link
2024-12-29 Open-Sora: Democratizing Efficient Video Production for All Zangwei Zheng et.al. 2412.20404 link
2024-12-28 DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments Xijun Wang et.al. 2412.20042 null
2025-01-17 MVTamperBench: Evaluating Robustness of Vision-Language Models Amit Agarwal et.al. 2412.19794 null
2024-12-27 Generative Video Propagation Shaoteng Liu et.al. 2412.19761 null
2024-12-30 VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models Tao Wu et.al. 2412.19645 null
2024-12-30 DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT Xiaotao Hu et.al. 2412.19505 link
2024-12-26 Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries Roberto Amoroso et.al. 2412.19304 null
2024-12-25 Accelerating Diffusion Transformers with Dual Feature Caching Chang Zou et.al. 2412.18911 link
2024-12-24 Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation Faraz Waseem et.al. 2412.18688 null
2024-12-24 Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models Jinhui Yi et.al. 2412.18609 link
2024-12-24 DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers Yuntao Chen et.al. 2412.18607 null
2024-12-24 ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation Hongjie Li et.al. 2412.18600 null
2024-12-24 DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation Minghong Cai et.al. 2412.18597 link
2024-12-23 Large Motion Video Autoencoding with Cross-modal Video VAE Yazhou Xing et.al. 2412.17805 null
2024-12-23 VidTwin: Video VAE with Decoupled Structure and Dynamics Yuchi Wang et.al. 2412.17726 link
2024-12-23 HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data Ting Zhou et.al. 2412.17574 link
2024-12-23 VidCtx: Context-aware Video Question Answering with Image Models Andreas Goulas et.al. 2412.17415 null
2024-12-23 FFA Sora, video generation as fundus fluorescein angiography simulator Xinyuan Wu et.al. 2412.17346 null
2024-12-23 Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory Xingyao Li et.al. 2412.17254 null
2024-12-22 SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults Jinzhi Wang et.al. 2412.17077 null
2025-01-08 Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation Luoxu Jin et.al. 2412.17042 null
2024-12-22 FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos Zhengqian Wu et.al. 2412.17022 link
2024-12-22 Video Domain Incremental Learning for Human Action Recognition in Home Environments Yuanda Hu et.al. 2412.16946 null
2024-12-21 GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space Souhaib Attaiki et.al. 2412.16717 null
2024-12-21 TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models Haocheng Huang et.al. 2412.16700 null
2024-12-21 VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation Chi Zhang et.al. 2412.16677 null
2024-12-25 Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance Beiyuan Zhang et.al. 2412.16495 null
2024-12-18 ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping Youxin Pang et.al. 2412.16212 null
2024-12-17 Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation Yiping Wang et.al. 2412.16211 null
2024-12-20 PruneVid: Visual Token Pruning for Efficient Video Large Language Models Xiaohu Huang et.al. 2412.16117 link
2024-12-20 DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization Zihan Ding et.al. 2412.15689 null
2024-12-23 CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training Xiuli Bi et.al. 2412.15646 link
2024-12-20 PolySmart @ TRECVid 2024 Medical Video Question Answering Jiaxin Wu et.al. 2412.15514 null
2024-12-19 AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation Moayed Haji-Ali et.al. 2412.15191 null
2024-12-19 Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM Yatai Ji et.al. 2412.15156 link
2024-12-19 Parallelized Autoregressive Visual Generation Yuqing Wang et.al. 2412.15119 null
2024-12-19 Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations Yucheng Hu et.al. 2412.14803 null
2024-12-19 HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning Minkuk Kim et.al. 2412.14585 null
2024-12-19 Consistent Human Image and Video Generation with Spatially Conditioned Diffusion Mingdeng Cao et.al. 2412.14531 link
2024-12-19 DirectorLLM for Human-Centric Video Generation Kunpeng Song et.al. 2412.14484 null
2024-12-18 Learning from Massive Human Videos for Universal Humanoid Pose Control Jiageng Mao et.al. 2412.14172 null
2024-12-18 Autoregressive Video Generation without Vector Quantization Haoge Deng et.al. 2412.14169 link
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167 null
2024-12-29 AKiRa: Augmentation Kit on Rays for optical video generation Xi Wang et.al. 2412.14158 null
2024-12-18 SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation Tong Chen et.al. 2412.14018 null
2024-12-18 InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models Cong Wei et.al. 2412.14006 link
2024-12-18 Do Language Models Understand Time? Xi Ding et.al. 2412.13845 link
2024-12-19 G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o Tony Cheng Tong et.al. 2412.13647 link
2024-12-18 Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning Yunbin Tu et.al. 2412.13543 null
2024-12-18 Real-time One-Step Diffusion-based Expressive Portrait Videos Generation Hanzhong Guo et.al. 2412.13479 link
2024-12-18 SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation Kazuki Shimada et.al. 2412.13462 null
2024-12-17 CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices Andrei Znobishchev et.al. 2412.13273 null
2025-01-07 MotionBridge: Dynamic Video Inbetweening with Flexible Controls Maham Tanveer et.al. 2412.13190 null
2024-12-17 VidTok: A Versatile and Open-Source Video Tokenizer Anni Tang et.al. 2412.13061 link
2024-12-17 FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering Zheng Cheng et.al. 2412.12833 null
2024-12-17 Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning Shiping Ge et.al. 2412.12791 link
2024-12-17 ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries Wangyu Xue et.al. 2412.12675 null
2024-12-16 Can video generation replace cinematographers? Research on the cinematic language of generated video Xiaozhe Li et.al. 2412.12223 null
2024-12-16 CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding Guo Chen et.al. 2412.12075 null
2024-12-16 InterDyn: Controllable Interactive Dynamics with Video Diffusion Models Rick Akkerman et.al. 2412.11785 null
2024-12-16 Generative Inbetweening through Frame-wise Conditions-Driven Video Generation Tianyi Zhu et.al. 2412.11755 link
2024-12-16 VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting Muhammet Furkan Ilaslan et.al. 2412.11621 link
2024-12-16 Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning Zhuyang Xie et.al. 2412.11467 null
2024-12-15 Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition Yulin Wang et.al. 2412.11228 link
2024-12-15 GenLit: Reformulating Single-Image Relighting as Video Generation Shrisha Bharadwaj et.al. 2412.11224 null
2024-12-15 DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes Jinxiu Liu et.al. 2412.11100 null
2024-12-15 Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track Deepak Gupta et.al. 2412.11056 null
2024-12-20 Video Diffusion Transformers are In-Context Learners Zhengcong Fei et.al. 2412.10783 link
2024-12-14 Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives Ji-jun Park et.al. 2412.10720 null
2024-12-13 SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device Yushu Wu et.al. 2412.10494 null
2024-12-12 VCA: Video Curious Agent for Long Video Understanding Zeyuan Yang et.al. 2412.10471 null
2024-12-17 SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization Zhentao Tan et.al. 2412.10443 null
2024-12-11 COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework Xin Dong et.al. 2412.10435 null
2024-12-13 Apollo: An Exploration of Video Understanding in Large Multimodal Models Orr Zohar et.al. 2412.10360 null
2024-12-16 TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation Xingrui Wang et.al. 2412.10275 null
2024-12-19 AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era Yudong Jiang et.al. 2412.10255 link
2024-12-13 B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens Zhuqiang Lu et.al. 2412.09919 link
2024-12-16 IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs Sosuke Yamao et.al. 2412.09907 null
2024-12-13 LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity Hongjie Wang et.al. 2412.09856 null
2024-12-13 MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion Xunnong Xu et.al. 2412.09828 null
2024-12-17 ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation Ali Athar et.al. 2412.09754 null
2024-12-11 Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model Junqi You et.al. 2412.09647 null
2024-12-16 Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models Fan Zhang et.al. 2412.09645 link
2024-12-12 Doe-1: Closed-Loop Autonomous Driving with Large World Model Wenzhao Zheng et.al. 2412.09627 link
2024-12-12 OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation Weiqi Li et.al. 2412.09623 null
2024-12-12 PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models Chenyu Yang et.al. 2412.09613 null
2024-12-12 Owl-1: Omni World Model for Consistent Long Video Generation Yuanhui Huang et.al. 2412.09600 link
2024-12-12 LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors Yabo Chen et.al. 2412.09597 null
2024-12-12 Neptune: The Long Orbit to Benchmarking Long Video Understanding Arsha Nagrani et.al. 2412.09582 link
2024-12-12 Video Creation by Demonstration Yihong Sun et.al. 2412.09551 null
2024-12-12 Agent-based Video Trimming Lingfeng Yang et.al. 2412.09513 null
2024-12-12 UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer Delong Liu et.al. 2412.09389 link
2024-12-12 T-SVG: Text-Driven Stereoscopic Video Generation Qiao Jin et.al. 2412.09323 null
2024-12-12 InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Tiehan Fan et.al. 2412.09283 null
2024-12-12 Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering Sai Bhargav Rongali et.al. 2412.09230 null
2024-12-12 LVMark: Robust Watermark for latent video diffusion models MinHyuk Jang et.al. 2412.09122 null
2024-12-12 Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation Lianrui Mu et.al. 2412.08976 null
2024-12-12 Mojito: Motion Trajectory and Intensity Control for Video Generation Xuehai He et.al. 2412.08948 null
2024-12-11 Generative Semantic Communication: Architectures, Technologies, and Applications Jinke Ren et.al. 2412.08642 null
2024-12-13 Physical Informed Driving World Model Zhuoran Yang et.al. 2412.08410 null
2024-12-11 FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks Chongkai Gao et.al. 2412.08261 null
2024-12-11 VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation Zhiqiang Yuan et.al. 2412.08259 null
2024-12-10 3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark Wufei Ma et.al. 2412.07825 null
2024-12-11 UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Xi Chen et.al. 2412.07774 null
2024-12-10 From Slow Bidirectional to Fast Causal Video Generators Tianwei Yin et.al. 2412.07772 null
2024-12-10 SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Jianhong Bai et.al. 2412.07760 link
2024-12-10 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Xiao Fu et.al. 2412.07759 null
2024-12-10 Multi-Shot Character Consistency for Text-to-Video Generation Yuval Atzmon et.al. 2412.07750 null
2024-12-10 StyleMaster: Stylize Your Video with Artistic Generation and Translation Zixuan Ye et.al. 2412.07744 null
2024-12-10 STIV: Scalable Text and Image Conditioned Video Generation Zongyu Lin et.al. 2412.07730 null
2024-12-10 ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Jinyi Hu et.al. 2412.07720 link
2024-12-10 GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning Yicheng Wang et.al. 2412.07704 null
2024-12-10 Multimodal Contextualized Support for Enhancing Video Retrieval System Quoc-Bao Nguyen-Le et.al. 2412.07584 null
2024-12-19 Multi-Scale Contrastive Learning for Video Temporal Grounding Thong Thanh Nguyen et.al. 2412.07157 null
2024-12-09 SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations Zhaorun Chen et.al. 2412.06878 null
2024-12-09 VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features Sifei Li et.al. 2412.06296 null
2024-12-11 Towards Long Video Understanding via Fine-detailed Video Story Generation Zeng You et.al. 2412.06182 null
2024-12-08 Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training Zhenghong Zhou et.al. 2412.06029 null
2024-12-08 FlexDiT: Dynamic Token Density Control for Diffusion Transformer Shuning Chang et.al. 2412.06028 null
2024-12-10 Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation Hyeonho Jeong et.al. 2412.06016 null
2024-12-08 Accelerating Video Diffusion Models via Distribution Matching Yuanzhi Zhu et.al. 2412.05899 null
2024-12-08 MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation Shuwei Shi et.al. 2412.05848 null
2024-12-08 Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval Shanti Stewart et.al. 2412.05831 null
2024-12-08 Self-Guidance: Boosting Flow and Diffusion Generation on Their Own Tiancheng Li et.al. 2412.05827 null
2024-12-07 Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation Leonardo Pina et.al. 2412.05694 null
2024-12-11 Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model Lening Wang et.al. 2412.05280 link
2024-12-17 Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Zhe Chen et.al. 2412.05271 link
2024-12-06 Mind the Time: Temporally-Controlled Multi-Event Video Generation Ziyi Wu et.al. 2412.05263 null
2024-12-11 LinVT: Empower Your Image-level Large Language Model to Understand Videos Lishuai Gao et.al. 2412.05185 link
2024-12-06 Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection Khurram Azeem Hashmi et.al. 2412.04915 null
2024-12-06 UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving Rui Chen et.al. 2412.04842 link
2024-12-12 Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model Keunwoo Peter Yu et.al. 2412.04729 null
2024-12-05 Using Diffusion Priors for Video Amodal Segmentation Kaihua Chen et.al. 2412.04623 null

(back to top)

About

Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages