GitHub - Xuchen-Li/llm-arxiv-daily: Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

Updated on 2025.02.04

Table of Contents

LLM Reasoning
LLM Evaluation
LLM MLLM
Video Understanding

LLM Reasoning

Publish Date	Title	Authors	PDF	Code
2025-01-31	Reward-Guided Speculative Decoding for Efficient LLM Reasoning	Baohao Liao et.al.	2501.19324	null
2025-01-31	BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning	Han Zhong et.al.	2501.18858	null
2025-01-28	A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process	Jack David Carson et.al.	2501.16783	null
2025-01-27	Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations	Pablo Valenzuela-Toledo et.al.	2501.16495	null
2025-01-27	Large Models in Dialogue for Active Perception and Anomaly Detection	Tzoulio Chamiti et.al.	2501.16300	link
2025-01-26	TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs	Yuxuan Gu et.al.	2501.15674	null
2025-01-28	Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning	Zeyu Gan et.al.	2501.15602	link
2025-01-26	Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Yuhong Sun et.al.	2501.15581	null
2025-01-24	Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains	Xu Chu et.al.	2501.14431	null
2025-01-24	GraphBC: Improving LLMs for Better Graph Data Processing	Xu Chu et.al.	2501.14427	null
2025-01-23	Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks	Chang Gong et.al.	2501.13731	null
2025-01-22	EvidenceMap: Unleashing the Power of Small Language Models with Evidence Analysis for Biomedical Question Answering	Chang Zong et.al.	2501.12746	null
2025-01-17	LLM Reasoner and Automated Planner: A new NPC approach	Israel Puerta-Merino et.al.	2501.10106	null
2025-01-22	FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs	Zengyi Gao et.al.	2501.09957	null
2025-01-17	Evolving Deeper LLM Thinking	Kuang-Huei Lee et.al.	2501.09891	null
2025-01-23	Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models	Fengli Xu et.al.	2501.09686	null
2025-01-14	Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data	Jiaxing Qiu et.al.	2501.08413	link
2025-01-14	Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning	Haoyu Han et.al.	2501.07845	null
2025-01-08	Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations	Archita Srivastava et.al.	2501.04675	null
2025-01-08	Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting	Dong-Hai Zhu et.al.	2501.04341	link
2025-01-07	Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation	Alireza Salemi et.al.	2501.04167	null
2025-01-06	KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models	Zaiyi Zheng et.al.	2501.02711	null
2025-01-04	Table as Thought: Exploring Structured Thoughts in LLM Reasoning	Zhenjie Sun et.al.	2501.02152	null
2025-01-03	Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models	Kaleem Ullah Qasim et.al.	2501.02026	null
2025-01-02	Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search	Shuangtao Li et.al.	2501.01478	null
2025-01-02	HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation	Runsong Jia et.al.	2501.01203	null
2025-01-03	Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents	Chengbo He et.al.	2501.00430	null
2024-12-31	EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta	Raymond Bernard et.al.	2501.00257	null
2024-12-30	Efficiently Serving LLM Reasoning Programs with Certaindex	Yichao Fu et.al.	2412.20993	null
2024-12-28	LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning	Shuguang Chen et.al.	2412.20227	null
2024-12-31	Token-Budget-Aware LLM Reasoning	Tingxu Han et.al.	2412.18547	link
2024-12-23	StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs	Hailin Chen et.al.	2412.18011	null
2024-12-22	Evaluating LLM Reasoning in the Operations Research Domain with ORQA	Mahdi Mostajabdaveh et.al.	2412.17874	link
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	link
2024-12-19	Eliciting Causal Abilities in Large Language Models for Reasoning Tasks	Yajing Wang et.al.	2412.15314	link
2024-12-19	Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Federico Castagna et.al.	2412.15177	link
2024-12-19	FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis	Abdullah Khan et.al.	2412.14492	link
2024-12-18	Cognition Chain for Explainable Psychological Stress Detection on Social Media	Xin Wang et.al.	2412.14009	null
2024-12-18	Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games	Wenye Lin et.al.	2412.13602	null
2024-12-17	ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models	Yuxi Sun et.al.	2412.12848	null
2024-12-12	A NotSo Simple Way to Beat Simple Bench	Soham Sane et.al.	2412.12173	null
2024-12-11	What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis	Jiayu Liu et.al.	2412.12157	null
2024-12-24	Stepwise Reasoning Error Disruption Attack of LLMs	Jingyu Peng et.al.	2412.11934	null
2024-12-15	SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation	Hang Zhang et.al.	2412.11026	null
2024-12-15	Entropy-Regularized Process Reward Model	Hanning Zhang et.al.	2412.11006	link
2024-12-14	Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation	Sukai Huang et.al.	2412.10675	null
2024-12-14	Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data	Xue Wu et.al.	2412.10654	null
2024-12-13	Atomic Learning Objectives Labeling: A High-Resolution Approach for Physics Education	Naiming Liu et.al.	2412.09914	null
2024-12-12	Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning	Zhenni Bi et.al.	2412.09078	null
2024-12-11	Training Large Language Models to Reason in a Continuous Latent Space	Shibo Hao et.al.	2412.06769	null
2025-01-23	GameArena: Evaluating LLM Reasoning through Live Computer Games	Lanxiang Hu et.al.	2412.06394	null
2024-12-08	Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt	Damien de Mijolla et.al.	2412.05967	null
2024-12-05	SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactions	Bufang Yang et.al.	2412.04036	null
2024-12-03	Explainable CTR Prediction via LLM Reasoning	Xiaohan Yu et.al.	2412.02588	null
2024-12-02	NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers	Angel Yahir Loredo Lopez et.al.	2412.01621	null
2025-01-13	Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability	Zicheng Lin et.al.	2411.19943	null
2024-11-29	TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension	Zipeng Qiu et.al.	2411.19504	link
2024-11-29	COLD: Causal reasOning in cLosed Daily activities	Abhinav Joshi et.al.	2411.19500	link
2024-11-25	Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision	Zhiheng Xi et.al.	2411.16579	null
2024-11-22	On the Impact of Fine-Tuning on Chain-of-Thought Reasoning	Elita Lobo et.al.	2411.15382	null
2024-11-21	Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models	Yuhao Dong et.al.	2411.14432	link
2024-11-15	Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination	Haojie Zheng et.al.	2411.12591	link
2024-12-23	Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus	Terufumi Morishita et.al.	2411.12498	link
2024-11-18	Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation	Mingchao Qi et.al.	2411.11714	link
2024-12-31	Enhancing LLM Reasoning with Reward-guided Tree Search	Jinhao Jiang et.al.	2411.11694	null
2024-12-15	A dataset of questions on decision-theoretic reasoning in Newcomb-like problems	Caspar Oesterheld et.al.	2411.10588	link
2024-11-14	Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering	Nghia Trung Ngo et.al.	2411.09213	null
2024-11-13	Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding	Deyi Ji et.al.	2411.08516	null
2024-11-18	What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?	Katie Kang et.al.	2411.07681	link
2024-11-27	Self-Training Meets Consistency: Improving LLMs' Reasoning With Consistency-Driven Rationale Evaluation	Jaehyeok Lee et.al.	2411.06387	link
2024-11-09	A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization	Haoxin Liu et.al.	2411.06018	null
2024-11-11	LLMs as Method Actors: A Model for Prompt Engineering and Architecture	Colin Doyle et.al.	2411.05778	link
2024-11-12	Kwai-STaR: Transform LLMs into State-Transition Reasoners	Xingyu Lu et.al.	2411.04799	null
2024-11-21	Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding	Haolin Chen et.al.	2411.04282	link
2024-11-05	CrowdGenUI: Enhancing LLM-Based UI Widget Generation with a Crowdsourced Preference Library	Yimeng Liu et.al.	2411.03477	null
2025-01-27	MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs	Manar Abdelatty et.al.	2411.03471	link
2024-11-04	RuAG: Learned-rule-augmented Generation for Large Language Models	Yudi Zhang et.al.	2411.03349	null
2024-10-30	Vision-Language Models Can Self-Improve Reasoning via Reflection	Kanzhi Cheng et.al.	2411.00855	null
2024-11-01	Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling	Yiwen Ding et.al.	2411.00750	link
2024-11-01	STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing	Jiaru Zou et.al.	2411.00387	null
2024-11-08	GRS-QA -- Graph Reasoning-Structured Question Answering Dataset	Anish Pahilajani et.al.	2411.00369	null
2024-10-31	Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning	Jinghan Zhang et.al.	2410.24155	null
2024-10-31	RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner	Fu-Chieh Chang et.al.	2410.23912	null
2024-10-31	OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models	Junda Wu et.al.	2410.23703	null
2024-10-30	ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning	Millennium Bismay et.al.	2410.23180	link
2024-10-30	On Memorization of Large Language Models in Logical Reasoning	Chulin Xie et.al.	2410.23123	null
2024-10-28	Causal Interventions on Causal Paths: Mapping GPT-2's Reasoning From Syntax to Semantics	Isabelle Lee et.al.	2410.21353	null
2024-10-28	Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments	Sangmim Song et.al.	2410.20666	null
2024-10-25	Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models	Danqing Wang et.al.	2410.20007	null
2024-10-25	Can Stories Help LLMs Reason? Curating Information Space Through Narrative	Vahid Sadiri Javadi et.al.	2410.19221	null
2024-10-18	Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning	Pengfei He et.al.	2410.19000	link
2024-10-25	CLR-Bench: Evaluating Large Language Models in College-level Reasoning	Junnan Dong et.al.	2410.17558	null
2024-10-28	Non-myopic Generation of Language Models for Reasoning and Planning	Chang Ma et.al.	2410.17195	link
2024-11-06	Improving Causal Reasoning in Large Language Models: A Survey	Longxuan Yu et.al.	2410.16676	link
2024-10-22	A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs	Ryosuke Sonoda et.al.	2410.16640	null
2024-10-21	Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic	Jason Chan et.al.	2410.16502	null
2024-11-27	On Designing Effective RL Reward at Training Time for LLM Reasoning	Jiaxuan Gao et.al.	2410.15115	null
2025-01-28	Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning	Xingyu Tan et.al.	2410.14211	null
2024-10-21	Unconstrained Model Merging for Enhanced LLM Reasoning	Yiming Zhang et.al.	2410.13699	null
2024-10-16	Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models	Linhao Luo et.al.	2410.13080	link
2024-10-16	KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs	Yongqin Xu et.al.	2410.12480	null
2024-10-17	Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning	Qian Wang et.al.	2410.12464	null
2024-10-16	Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up	Jiahao Yuan et.al.	2410.12323	link
2024-10-16	Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval	Hai-Long Nguyen et.al.	2410.12154	null
2024-10-15	Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming	Yilun Hao et.al.	2410.12112	null
2024-10-12	OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models	Jun Wang et.al.	2410.09671	null
2024-10-11	P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains	Simeng Han et.al.	2410.09207	null
2024-10-11	Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning	Yunpeng Gao et.al.	2410.08500	null
2024-10-10	SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation	Hang Yin et.al.	2410.08189	null
2024-10-10	Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning	Amrith Setlur et.al.	2410.08146	null
2024-10-10	Automatic Curriculum Expert Iteration for Reliable LLM Reasoning	Zirui Zhao et.al.	2410.07627	null
2024-10-09	Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis	Ahmed Abdullah et.al.	2410.06841	null
2024-10-09	Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning	Xiyao Wang et.al.	2410.06508	null
2025-01-02	Filtering Discomforting Recommendations with Large Language Models	Jiahao Liu et.al.	2410.05411	null
2024-10-05	Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification	Zhenwen Liang et.al.	2410.05318	null
2024-10-06	Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval	Pengcheng Jiang et.al.	2410.04585	link
2024-10-03	The Role of Deductive and Inductive Reasoning in Large Language Models	Chengkun Cai et.al.	2410.02892	null
2024-10-02	Not All LLM Reasoners Are Created Equal	Arian Hosseini et.al.	2410.01748	null
2024-12-25	Interpretable Contrastive Monte Carlo Tree Search Reasoning	Zitian Gao et.al.	2410.01707	link
2024-10-02	VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment	Amirhossein Kazemnejad et.al.	2410.01679	link
2024-10-02	AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses	Xiaotian Lu et.al.	2410.01246	null
2024-10-01	Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness	Xiao Peng et.al.	2410.00359	null
2024-10-01	Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis	Chun-Hsiao Yeh et.al.	2410.00292	null
2024-10-08	GUNDAM: Aligning Large Language Models with Graph Understanding	Sheng Ouyang et.al.	2409.20053	null
2024-09-27	Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs	Yanyuan Qiao et.al.	2409.18794	null
2024-10-23	Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning	Debargha Ganguly et.al.	2409.17270	null
2024-09-20	CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency	Kangsheng Wang et.al.	2409.17174	null
2024-09-20	Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM	Zheng Wei Lim et.al.	2409.13949	null
2024-09-19	SituationAdapt: Contextual UI Optimization in Mixed Reality with Situation Awareness via LLM Reasoning	Zhipeng Li et.al.	2409.12836	null
2024-10-04	Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning	Jiaxin Wen et.al.	2409.12452	link
2024-12-16	Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data	Jiaming Zhou et.al.	2409.12437	link
2024-09-18	MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning	Justin Chih-Yao Chen et.al.	2409.12147	link
2024-11-05	Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent	Fatemeh Haji et.al.	2409.11527	link
2024-09-16	Enhancing RL Safety with Counterfactual LLM Reasoning	Dennis Gross et.al.	2409.10188	link
2024-09-11	Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation	SeongYeub Chu et.al.	2409.07355	link

(back to top)

LLM Evaluation

Publish Date	Title	Authors	PDF	Code
2025-01-30	Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation	Muhammed Yusuf Kocyigit et.al.	2501.18771	null
2025-01-31	ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation	Minghua He et.al.	2501.18460	null
2025-01-25	LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering	Beiming Liu et.al.	2501.17183	null
2025-01-28	An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue	Koji Inoue et.al.	2501.16643	null
2025-01-26	HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI	Tidor-Vlad Pricope et.al.	2501.15627	null
2025-01-23	Question Answering on Patient Medical Records with Private Fine-Tuned LLMs	Sara Kothari et.al.	2501.13687	null
2025-01-10	CodEv: An Automated Grading Framework Leveraging Large Language Models for Consistent and Constructive Feedback	En-Qi Tseng et.al.	2501.10421	null
2025-01-15	Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History	Yevhen Kostiuk et.al.	2501.09154	null
2025-01-13	Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles	Samia Touileb et.al.	2501.07718	null
2025-01-03	FLAME: Financial Large-Language Model Assessment and Metrics Evaluation	Jiayu Guo et.al.	2501.06211	link
2025-01-07	MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems	Yannis Katsis et.al.	2501.03468	link
2025-01-05	Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm	Ljubisa Bojic et.al.	2501.02532	null
2025-01-04	LLMzSzŁ: a comprehensive LLM benchmark for Polish	Krzysztof Jassem et.al.	2501.02266	null
2025-01-08	VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM	Yuqian Yuan et.al.	2501.00599	link
2025-01-04	Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation	M. Ali Bayram et.al.	2501.00593	null
2024-12-31	Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs	Weijia Xu et.al.	2501.00273	null
2024-12-30	EVOLVE: Emotion and Visual Output Learning via LLM Evaluation	Jordan Sinclair et.al.	2412.20632	null
2024-12-24	Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles	Zihan Wang et.al.	2412.18416	null
2024-12-24	A Statistical Framework for Ranking LLM-Based Chatbots	Siavash Ameli et.al.	2412.18407	link
2025-01-25	DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation	Junyi Lu et.al.	2412.18291	null
2024-12-23	CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models	Ruibo Tu et.al.	2412.17970	link
2025-01-02	Baichuan4-Finance Technical Report	Hanyu Zhang et.al.	2412.15270	null
2024-12-19	ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects	Qihang Cao et.al.	2412.14837	null
2024-12-18	AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge	Xiaobao Wu et.al.	2412.13670	link
2024-12-18	Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning	Eitan Wagner et.al.	2412.13631	null
2024-12-17	OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain	Shuting Wang et.al.	2412.13018	link
2024-12-10	How to Choose a Threshold for an Evaluation Metric for Large Language Models	Bhaskarjit Sarmah et.al.	2412.12148	null
2024-12-15	Dual Traits in Probabilistic Reasoning of Large Language Models	Shenxiong Li et.al.	2412.11009	link
2024-12-30	LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation	Eunsu Kim et.al.	2412.10424	null
2024-12-13	Cultural Evolution of Cooperation among LLM Agents	Aron Vallinder et.al.	2412.10270	null
2024-12-12	Towards Understanding the Robustness of LLM-based Evaluations under Perturbations	Manav Chaudhary et.al.	2412.09269	null
2024-12-10	BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities	Sahal Shaji Mullappilly et.al.	2412.07769	link
2024-12-12	PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models	Qian Zhang et.al.	2412.06287	link
2024-12-02	AI Benchmarks and Datasets for LLM Evaluation	Todor Ivanov et.al.	2412.01020	null
2024-11-30	Evaluating the Consistency of LLM Evaluators	Noah Lee et.al.	2412.00543	null
2024-11-29	MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks	John Francis et.al.	2411.19689	null
2024-11-29	Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension Ability	Yujin Han et.al.	2411.19456	link
2024-11-27	Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator	Frederic Kirstein et.al.	2411.18444	null
2025-01-17	CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity	Zhengmin Yu et.al.	2411.16239	link
2024-11-25	SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text	Reshmi Ghosh et.al.	2411.16077	null
2024-11-26	Do LLMs Agree on the Creativity Evaluation of Alternative Uses?	Abdullah Al Rabeyah et.al.	2411.15560	null
2024-11-19	Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat	Roland Daynauth et.al.	2411.14483	link
2024-11-21	Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models	Lovish Madaan et.al.	2411.14103	null
2024-11-21	An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture	Boming Xia et.al.	2411.13768	null
2024-11-21	A Framework for Evaluating LLMs Under Task Indeterminacy	Luke Guerdan et.al.	2411.13760	null
2024-11-12	Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning	Linyang He et.al.	2411.07533	null
2024-11-13	Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models	Yancheng He et.al.	2411.07140	null
2024-11-09	Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models	Xiaojun Wu et.al.	2411.06272	link
2024-11-16	ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding	Israel Abebe Azime et.al.	2411.05049	null
2024-11-07	Bayesian Calibration of Win Rate Estimation with LLM Evaluators	Yicheng Gao et.al.	2411.04424	link
2024-11-05	Enhancing LLM Evaluations: The Garbling Trick	William F. Bradley et.al.	2411.01533	null
2025-01-31	Mastering the Craft of Data Synthesis for CodeLLMs	Meng Chen et.al.	2411.00005	null
2024-10-28	Project MPG: towards a generalized performance benchmark for LLM capabilities	Lucas Spangher et.al.	2410.22368	null
2024-10-29	Self-Preference Bias in LLM-as-a-Judge	Koki Wataoka et.al.	2410.21819	null
2024-10-28	Unveiling Context-Aware Criteria in Self-Assessing LLMs	Taneesh Gupta et.al.	2410.21545	null
2024-10-27	LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization	Jui-Nan Yen et.al.	2410.20625	null
2024-10-26	Limitations of the LLM-as-a-Judge Approach for Evaluating LLM Outputs in Expert Knowledge Tasks	Annalisa Szymanski et.al.	2410.20266	null
2024-10-23	MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning	Jingfan Zhang et.al.	2410.18035	null
2025-01-30	Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements	Isamu Isozaki et.al.	2410.17141	link
2024-10-21	CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution	Maosong Cao et.al.	2410.16256	link
2025-01-26	mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code Generation	Nishat Raihan et.al.	2410.15037	link
2024-10-19	CAP: Data Contamination Detection via Consistency Amplification	Yi Zhao et.al.	2410.15005	null
2024-10-18	Enabling Scalable Evaluation of Bias Patterns in Medical LLMs	Hamed Fayyaz et.al.	2410.14763	link
2024-11-06	Diverging Preferences: When do Annotators Disagree and do Models Know?	Michael JQ Zhang et.al.	2410.14632	null
2024-10-18	Combining Entropy and Matrix Nuclear Norm for Enhanced Evaluation of Language Models	James Vo et.al.	2410.14480	null
2024-10-21	BenTo: Benchmark Task Reduction with In-Context Transferability	Hongyu Zhao et.al.	2410.13804	link
2024-10-16	BenchmarkCards: Large Language Model and Risk Reporting	Anna Sokol et.al.	2410.12974	null
2024-12-29	Language Model Preference Evaluation with Multiple Weak Evaluators	Zhengyu Hu et.al.	2410.12869	link
2024-10-11	Enterprise Benchmarks for Large Language Model Evaluation	Bing Zhang et.al.	2410.12857	link
2024-10-16	An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation	Junjie Chen et.al.	2410.12265	null
2024-10-15	Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers	Lorenzo Pacchiardi et.al.	2410.11672	link
2024-10-15	Black-box Uncertainty Quantification Method for LLM-as-a-Judge	Nico Wagner et.al.	2410.11594	null
2024-10-14	Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting	Yifan Luo et.al.	2410.10150	null
2024-12-13	HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics	Jingxuan Fan et.al.	2410.09988	link
2024-10-15	LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models	Han Qiu et.al.	2410.09962	link
2024-10-17	Towards Multilingual LLM Evaluation for European Languages	Klaudia Thellmann et.al.	2410.08928	null
2024-10-11	Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example	Marcus Kessel et.al.	2410.08911	null
2024-10-10	Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks	Mathis Pink et.al.	2410.08133	null
2024-10-10	COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act	Philipp Guldimann et.al.	2410.07959	null
2024-11-06	News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News	Tarun Jain et.al.	2410.07520	null
2024-10-09	Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates	Xiaosen Zheng et.al.	2410.07137	link
2024-10-09	ReIFE: Re-evaluating Instruction-Following Evaluation	Yixin Liu et.al.	2410.07069	link
2024-10-08	Active Evaluation Acquisition for Efficient LLM Benchmarking	Yang Li et.al.	2410.05952	null
2024-10-07	TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles	Qingchen Yu et.al.	2410.05262	link
2024-10-01	Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model	Aidan Gilson et.al.	2410.03740	null
2024-10-04	TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation	Jonathan Cook et.al.	2410.03608	null
2024-10-04	Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores	Robert E. Blackwell et.al.	2410.03492	null
2024-10-29	AIME: AI System Optimization via Multiple LLM Evaluators	Bhrij Patel et.al.	2410.03131	null
2024-10-02	Comparing Criteria Development Across Domain Experts, Lay Users, and Models in Large Language Model Evaluation	Annalisa Szymanski et.al.	2410.02054	null
2024-10-02	Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models	Joseph Lee et.al.	2410.01795	link
2024-10-03	Extending Context Window of Large Language Models from a Distributional Perspective	Yingsheng Wu et.al.	2410.01490	null
2024-10-02	ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving	Yifan Qiao et.al.	2410.01228	null
2024-10-01	ViDAS: Vision-based Danger Assessment and Scoring	Pranav Gupta et.al.	2410.00477	null
2024-10-01	PclGPT: A Large Language Model for Patronizing and Condescending Language Detection	Hongbo Wang et.al.	2410.00361	link
2024-11-26	LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models	Haitao Li et.al.	2409.20288	link
2024-09-29	Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems	Xuyang Wu et.al.	2409.19804	null
2024-10-19	Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models	Xin Li et.al.	2409.19667	link
2024-10-05	IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation	Fan Lin et.al.	2409.18892	link
2024-12-13	A Character-Centric Creative Story Generation via Imagination	Kyeongman Park et.al.	2409.16667	null
2024-09-25	Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models	Sungjune Park et.al.	2409.16635	null
2024-12-18	Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino	Jann Railey Montalan et.al.	2409.15380	link
2024-12-16	MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators	Qingyu Lu et.al.	2409.14335	link
2024-09-21	ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models	Yuqing Huang et.al.	2409.13989	link
2024-12-17	AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs	Basel Mousi et.al.	2409.11404	null
2024-10-02	LLM-as-a-Judge & Reward Model: What They Can and Cannot Do	Guijin Son et.al.	2409.11239	null
2024-12-08	Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges	Vinay Samuel et.al.	2409.09927	link
2024-09-13	Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia	Fajri Koto et.al.	2409.08564	null
2024-09-09	Assessing SPARQL capabilities of Large Language Models	Lars-Peter Meyer et.al.	2409.05925	link
2024-10-08	LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs	Yuhao Wu et.al.	2409.02076	link
2024-10-14	Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation	Jasper Dekoninck et.al.	2409.00696	null
2024-08-26	Evaluating ChatGPT on Nuclear Domain-Specific Data	Muhammad Anwar et.al.	2409.00090	null
2024-08-28	LLMSecCode: Evaluating Large Language Models for Secure Coding	Anton Rydén et.al.	2408.16100	link
2024-08-26	LLM-3D Print: Large Language Models To Monitor and Control 3D Printing	Yayati Jadhav et.al.	2408.14307	null
2024-08-26	Epidemic Information Extraction for Event-Based Surveillance using Large Language Models	Sergio Consoli et.al.	2408.14277	null
2024-10-04	MobileQuant: Mobile-friendly Quantization for On-device Language Models	Fuwen Tan et.al.	2408.13933	link
2024-08-23	LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models	Chongyan Sun et.al.	2408.13338	null
2024-08-23	Open Llama2 Model for the Lithuanian Language	Artūras Nakvosas et.al.	2408.12963	null
2024-08-23	LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction	Songwei Li et.al.	2408.12832	link
2024-12-20	Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts	Jiaqing Liu et.al.	2408.09688	null
2024-08-20	Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge	Ravi Raju et.al.	2408.08808	null
2024-10-16	The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation	Samee Arif et.al.	2408.08688	link
2024-10-19	Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks	Junseok Kim et.al.	2408.08631	null

(back to top)

LLM MLLM

Publish Date	Title	Authors	PDF	Code
2025-01-31	Vintix: Action Model via In-Context Reinforcement Learning	Andrey Polubarov et.al.	2501.19400	link
2025-01-31	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Mustafa O. Karabag et.al.	2501.19398	null
2025-01-31	Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models	Alina Shutova et.al.	2501.19392	null
2025-01-31	Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models	Wenzhi Fang et.al.	2501.19389	null
2025-01-31	SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions	Dominik Wagner et.al.	2501.19377	null
2025-01-31	Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions	Sören Christensen et.al.	2501.19373	null
2025-01-31	We're Different, We're the Same: Creative Homogeneity Across LLMs	Emily Wenger et.al.	2501.19361	null
2025-01-31	Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies	Brandon P. Chelstrom et.al.	2501.19359	null
2025-01-31	The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking	Yuchun Miao et.al.	2501.19358	null
2025-01-31	Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters	Adrián Juan-Delgado et.al.	2501.19356	null
2025-01-31	Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023	Ting-Yao E. Hsu et.al.	2501.19353	null
2025-01-31	Towards Adaptive Self-Improvement for Smarter Energy Systems	Alexander Sommer et.al.	2501.19340	null
2025-01-31	PixelWorld: Towards Perceiving Everything as Pixels	Zhiheng Lyu et.al.	2501.19339	null
2025-01-31	Homogeneity Bias as Differential Sampling Uncertainty in Language Models	Messi H. J. Lee et.al.	2501.19337	null
2025-01-31	Reward-Guided Speculative Decoding for Efficient LLM Reasoning	Baohao Liao et.al.	2501.19324	null
2025-01-31	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Anirudh Chari et.al.	2501.19318	null
2025-01-31	LLM-based Affective Text Generation Quality Based on Different Quantization Values	Yarik Menchaca Resendiz et.al.	2501.19317	null
2025-01-31	Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment	Gregor Bachmann et.al.	2501.19309	null
2025-01-31	SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling	Jiefeng Chen et.al.	2501.19306	null
2025-01-31	Beyond checkmate: exploring the creative chokepoints in AI text	Nafis Irtiza Tripto et.al.	2501.19301	link
2025-01-31	Offline Learning for Combinatorial Multi-armed Bandits	Xutong Liu et.al.	2501.19300	null
2025-01-31	Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes	Zhiyao Xu et.al.	2501.19298	null
2025-01-31	Analysis of LLMs vs Human Experts in Requirements Engineering	Cory Hymel et.al.	2501.19297	null
2025-01-31	Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators	Kunpeng Zhang et.al.	2501.19282	null
2025-01-31	Pheromone-based Learning of Optimal Reasoning Paths	Anirudh Chari et.al.	2501.19278	null
2025-01-31	From Assistance to Autonomy -- A Researcher Study on the Potential of AI Support for Qualitative Data Analysis	Elisabeth Kirsten et.al.	2501.19275	null
2025-01-31	Jackpot! Alignment as a Maximal Lottery	Roberto-Rafael Maura-Rivero et.al.	2501.19266	null
2025-01-31	Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge	Amogh Joshi et.al.	2501.19259	null
2025-01-31	A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation	Yunzhe Li et.al.	2501.19232	null
2025-01-31	Autonomous Legacy Web Application Upgrades Using a Multi-Agent System	Valtteri Ala-Salmi et.al.	2501.19204	null
2025-01-31	Improving the Robustness of Representation Misdirection for Large Language Model Unlearning	Dang Huu-Tien et.al.	2501.19202	null
2025-01-31	Efficient Reasoning with Hidden Thinking	Xuan Shen et.al.	2501.19201	link
2025-01-31	Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning	Xianglin Yang et.al.	2501.19180	null
2025-01-31	No Foundations without Foundations -- Why semi-mechanistic models are essential for regulatory biology	Luka Kovačević et.al.	2501.19178	null
2025-01-31	Position: Contextual Integrity Washing for Language Models	Yan Shvartzshnaider et.al.	2501.19173	null
2025-01-31	Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs	Kejia Zhang et.al.	2501.19164	null
2025-01-31	A theoretical framework for overfitting in energy-based modeling	Giovanni Catania et.al.	2501.19158	null
2025-01-31	A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator	Sixiao Huang et.al.	2501.19135	null
2025-01-31	Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations	Sihwan Park et.al.	2501.19099	null
2025-01-31	Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data	Xichen Xu et.al.	2501.19094	null
2025-01-31	Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models	Jialin Zhao et.al.	2501.19090	null
2025-01-31	Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification	Xiangyu Sun et.al.	2501.19086	null
2025-01-31	Enhancing Code Generation for Low-Resource Languages: No Silver Bullet	Alessandro Giagnorio et.al.	2501.19085	null
2025-01-31	Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations	Dahye Kim et.al.	2501.19066	link
2025-01-31	TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs	Yan Sun et.al.	2501.19057	null
2025-01-31	Enabling Autonomic Microservice Management through Self-Learning Agents	Fenglin Yu et.al.	2501.19056	null
2025-01-31	Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models	Ruiyu Wang et.al.	2501.19054	null
2025-01-31	Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors	Simon Idoko et.al.	2501.19042	link
2025-01-31	Towards the Worst-case Robustness of Large Language Models	Huanran Chen et.al.	2501.19040	null
2025-01-31	Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs	Hongliang Li et.al.	2501.19036	null
2025-01-31	XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses	Bo Lan et.al.	2501.19034	link
2025-01-31	Multilayer Networks in Neuroimaging	Vesna Vuksanovic et.al.	2501.19024	null
2025-01-31	Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation	Bin Zhu et.al.	2501.19017	null
2025-01-31	Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities	Arjun Krishna et.al.	2501.19012	null
2025-01-31	Visual Autoregressive Modeling for Image Super-Resolution	Yunpeng Qu et.al.	2501.18993	null
2025-01-31	Symmetric Pruning of Large Language Models	Kai Yi et.al.	2501.18980	null
2025-01-31	BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics	Yuxuan Liu et.al.	2501.18972	null
2025-01-31	Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping	Pu Yang et.al.	2501.18962	null
2025-01-31	Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow	Alfred Bexley et.al.	2501.18957	null
2025-01-31	LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models	Shenghao Fu et.al.	2501.18954	link
2025-01-31	TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment	Zi-Jian Cheng et.al.	2501.18935	link
2025-01-31	Language Games as the Pathway to Artificial Superhuman Intelligence	Ying Wen et.al.	2501.18924	null
2025-01-31	KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search	Haoran Luo et.al.	2501.18922	link
2025-01-31	LLM Program Optimization via Retrieval Augmented Search	Sagnik Anupam et.al.	2501.18916	null
2025-01-31	Scaling Laws for Differentially Private Language Models	Ryan McKenna et.al.	2501.18914	null
2025-01-31	Streamlining Security Vulnerability Triage with Large Language Models	Mohammad Jalili Torkamani et.al.	2501.18908	null
2025-01-31	Trustworthy Evaluation of Generative AI Models	Zijun Gao et.al.	2501.18897	null
2025-01-31	Can We Predict the Effect of Prompts?	Jae Yong Lee et.al.	2501.18883	null
2025-01-31	Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models	Jiaqi Tang et.al.	2501.18863	null
2025-01-31	BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning	Han Zhong et.al.	2501.18858	null
2025-01-31	Equivariant Hypergraph Diffusion for Crystal Structure Prediction	Yang Liu et.al.	2501.18850	null
2025-01-31	Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities	Yaping Chai et.al.	2501.18845	null
2025-01-31	Trading Inference-Time Compute for Adversarial Robustness	Wojciech Zaremba et.al.	2501.18841	null
2025-01-31	Partially Rewriting a Transformer in Natural Language	Gonçalo Paulo et.al.	2501.18838	null
2025-01-31	Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming	Mrinank Sharma et.al.	2501.18837	null
2025-01-31	Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential	Chenyu Gao et.al.	2501.18834	null
2025-01-31	Structural Embedding Projection for Contextual Large Language Model Inference	Vincent Enoasmo et.al.	2501.18826	null
2025-01-31	Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies	Andrey Borro et.al.	2501.18817	link
2025-01-31	Large Language Models as Common-Sense Heuristics	Andrey Borro et.al.	2501.18816	null
2025-01-30	Compositional Generalization Requires More Than Disentangled Representations	Qiyao Liang et.al.	2501.18797	null
2025-01-30	Rope to Nope and Back Again: A New Hybrid Attention Strategy	Bowen Yang et.al.	2501.18795	null
2025-01-30	Survey and Improvement Strategies for Gene Prioritization with Large Language Models	Matthew Neeley et.al.	2501.18794	null
2025-01-30	LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore?	Alexander Tuisov et.al.	2501.18784	null
2025-01-30	Navigating the Fragrance space Via Graph Generative Models And Predicting Odors	Mrityunjay Sharma et.al.	2501.18777	link
2025-01-30	Probabilistic Joint Recovery Method for CO $_2$ Plume Monitoring	Zijun Deng et.al.	2501.18761	null
2025-01-30	Synthetic Data Generation for Augmenting Small Samples	Dan Liu et.al.	2501.18741	null
2025-01-30	Examining the Robustness of Large Language Models across Language Complexity	Jiayi Zhang et.al.	2501.18738	null
2025-01-30	Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks	Mohd. Farhan Israk Soumik et.al.	2501.18727	null
2025-01-30	Strong and Controllable 3D Motion Generation	Canxuan Gang et.al.	2501.18726	null
2025-01-30	Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning	Maya Kruse et.al.	2501.18724	null
2025-01-30	Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps	Devansh Bhardwaj et.al.	2501.18712	null
2025-01-30	Regularized second-order optimization of tensor-network Born machines	Matan Ben-Dov et.al.	2501.18691	null
2025-01-30	Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting	Yansong Qu et.al.	2501.18672	null
2025-01-30	Foundational Models for 3D Point Clouds: A Survey and Outlook	Vishal Thengane et.al.	2501.18594	null
2025-01-30	Diffusion Autoencoders are Scalable Image Tokenizers	Yinbo Chen et.al.	2501.18593	null
2025-01-30	Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models	Hao Dong et.al.	2501.18592	link
2025-01-30	Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs	Yue Wang et.al.	2501.18585	null
2025-01-30	Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH	Evgenii Evstafev et.al.	2501.18576	null
2025-01-30	BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos	Lehao Lin et.al.	2501.18565	null
2025-01-30	SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation	Haoquan Fang et.al.	2501.18564	null
2025-01-30	Semantic Web and Creative AI -- A Technical Report from ISWS 2023	Raia Abu Ahmad et.al.	2501.18542	null
2025-01-30	Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges	Manveer Singh Tamber et.al.	2501.18536	link
2025-01-30	Differentially Private Steering for Large Language Model Alignment	Anmol Goel et.al.	2501.18532	link
2025-01-30	Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models	Guanqun Cao et.al.	2501.18516	null
2025-01-30	Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch	Arthur Douillard et.al.	2501.18512	null
2025-01-30	WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training	Benjamin Feuer et.al.	2501.18511	link
2025-01-30	CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction	Peter J. Bentley et.al.	2501.18504	null
2025-01-30	Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline	Shivani Kapania et.al.	2501.18493	null
2025-01-30	A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models	Changshu Liu et.al.	2501.18482	null
2025-01-30	CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization	Yanxia Deng et.al.	2501.18475	null
2025-01-30	Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations	Chengxi Zeng et.al.	2501.18474	null
2025-01-30	ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation	Minghua He et.al.	2501.18460	null
2025-01-30	CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering	Yumeng Wang et.al.	2501.18457	null
2025-01-30	GENIE: Generative Note Information Extraction model for structuring EHR data	Huaiyuan Ying et.al.	2501.18435	null
2025-01-30	Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation	Youngjoon Lee et.al.	2501.18416	null
2025-01-30	RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects	Yiteng Tu et.al.	2501.18365	link
2025-01-30	A Video-grounded Dialogue Dataset and Metric for Event-driven Activities	Wiradee Imrattanatrai et.al.	2501.18324	link
2025-01-30	Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach	Tianpeng Pan et.al.	2501.18320	null
2025-01-30	Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models	Jennifer D'Souza et.al.	2501.18287	null
2025-01-30	Jailbreaking LLMs' Safeguard with Universal Magic Words for Text Embedding Models	Haoyu Liang et.al.	2501.18280	null
2025-01-30	Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence	Kevin Roitero et.al.	2501.18265	null
2025-01-30	How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Vilém Zouhar et.al.	2501.18251	link
2025-01-30	Statistical multi-metric evaluation and visualization of LLM system predictive performance	Samuel Ackerman et.al.	2501.18243	null
2025-01-30	Contextually Structured Token Dependency Encoding for Large Language Models	James Blades et.al.	2501.18205	null
2025-01-30	Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents	ShuiDe Wen et.al.	2501.18190	null
2025-01-30	Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation	Teddy Lazebnik et.al.	2501.18177	null
2025-01-30	Continually Evolved Multimodal Foundation Models for Cancer Prognosis	Jie Peng et.al.	2501.18170	null
2025-01-30	RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing	Jinyao Guo et.al.	2501.18160	null
2025-01-30	Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study	Yuchen Lei et.al.	2501.18158	null
2025-01-30	Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models	Wanlong Liu et.al.	2501.18154	null
2025-01-30	Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models	Qika Lin et.al.	2501.18119	null
2025-01-30	Scaling Inference-Efficient Language Models	Song Bian et.al.	2501.18107	null
2025-01-30	Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation	Yibo Wang et.al.	2501.18100	link
2025-01-30	AlphaAdam:Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates	Da Chang et.al.	2501.18094	null
2025-01-30	Normative Evaluation of Large Language Models with Everyday Moral Dilemmas	Pratik S. Sachdeva et.al.	2501.18081	null
2025-01-30	FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models	Spencer Mateega et.al.	2501.18062	null
2025-01-29	RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems	Duy A. Nguyen et.al.	2501.18056	null
2025-01-29	Current Pathology Foundation Models are unrobust to Medical Center Differences	Edwin D. de Jong et.al.	2501.18055	null
2025-01-29	A Proximal Operator for Inducing 2:4-Sparsity	Jonas M Kübler et.al.	2501.18015	null
2025-01-29	Large Language Models Think Too Fast To Explore Effectively	Lan Pan et.al.	2501.18009	null
2025-01-29	Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces	Neetha Jambigi et.al.	2501.18005	null
2025-01-29	InnerThoughts: Disentangling Representations and Predictions in Large Language Models	Didier Chételat et.al.	2501.17994	null
2025-01-29	Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study	Marwah Alaofi et.al.	2501.17981	link
2025-01-29	Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization	Zishun Yu et.al.	2501.17974	null
2025-01-29	"I Would Never Trust Anything Western": Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools	Manas Mhasakar et.al.	2501.17942	null
2025-01-29	DReSS: Data-driven Regularized Structured Streamlining for Large Language Models	Mingkuan Feng et.al.	2501.17905	null
2025-01-29	Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning?	Pouya Pezeshkpour et.al.	2501.17840	link
2025-01-29	Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology	Sobhan Hemati et.al.	2501.17822	null
2025-01-30	Leveraging Multimodal LLM for Inspirational User Interface Search	Seokhyeon Park et.al.	2501.17799	link
2025-01-29	BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights	Chan-Jan Hsu et.al.	2501.17790	null
2025-01-29	AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing	Peter Pak et.al.	2501.17784	null
2025-01-29	2SSP: A Two-Stage Framework for Structured Pruning of LLMs	Fabrizio Sandri et.al.	2501.17771	link
2025-01-29	Generative Unordered Flow for Set-Structured Data Generation	Yangming Li et.al.	2501.17770	null
2025-01-29	Hybrid Graphs for Table-and-Text based Question Answering using LLMs	Ankush Agarwal et.al.	2501.17767	null
2025-01-29	On the Partitioning of GPU Power among Multi-Instances	Tirth Vamja et.al.	2501.17752	null
2025-01-29	Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation	Aitor Arrieta et.al.	2501.17749	null
2025-01-29	A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches	Ana R. Baião et.al.	2501.17729	null
2025-01-29	Using Code Generation to Solve Open Instances of Combinatorial Design Problems	Christopher D. Rosin et.al.	2501.17725	link
2025-01-29	RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts	Eujeong Choi et.al.	2501.17715	link
2025-01-29	Source-Channel Separation Theorems for Distortion Perception Coding	Chao Tian et.al.	2501.17706	null
2025-01-29	Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching	Xuzhe Dang et.al.	2501.17665	null
2025-01-30	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-29	Uncertainty Quantification and Decomposition for LLM-based Recommendation	Wonbin Kweon et.al.	2501.17630	link
2025-01-29	The Imitation Game According To Turing	Sharon Temtsin et.al.	2501.17629	null
2025-01-29	Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment	Jonathan Teel et.al.	2501.17617	null
2025-01-29	Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis	Kunrong Li et.al.	2501.17598	null
2025-01-30	Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models	Behraj Khan et.al.	2501.17595	null
2025-01-29	GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback	Mohamed Abdelaal et.al.	2501.17584	null
2025-01-29	CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs	Amey Hengle et.al.	2501.17581	null
2025-01-29	Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding	Marco Pasini et.al.	2501.17578	null
2025-01-29	Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models	Wooyoung Kim et.al.	2501.17549	null
2025-01-29	Towards Training-Free Open-World Classification with 3D Generative Models	Xinzhe Xia et.al.	2501.17547	null
2025-01-29	Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant	Gaole He et.al.	2501.17546	link
2025-01-29	Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison	Martin Nizon-Deladoeuille et.al.	2501.17539	null
2025-01-29	Neural Spelling: A Spell-Based BCI System for Language Neural Decoding	Xiaowei Jiang et.al.	2501.17489	null
2025-01-29	DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance	Seffi Cohen et.al.	2501.17479	link
2025-01-29	AugmenTest: Enhancing Tests with LLM-Driven Oracles	Shaker Mahmud Khandaker et.al.	2501.17461	null
2025-01-29	Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction	Kaiwei Luo et.al.	2501.17459	null
2025-01-29	Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation	Tiansheng Huang et.al.	2501.17433	link
2025-01-29	Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models	Yuxuan Li et.al.	2501.17420	null
2025-01-29	MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs	Ved Sirdeshmukh et.al.	2501.17399	link
2025-01-29	Learning Free Token Reduction for Multi-Modal LLM	Zihui Zhao et.al.	2501.17391	null
2025-01-29	Context-Aware Semantic Recomposition Mechanism for Large Language Models	Richard Katrix et.al.	2501.17386	null
2025-01-28	Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations	Md Tauhidul Islam et.al.	2501.17347	null
2025-01-28	Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction	Mingyu Derek Ma et.al.	2501.17326	null
2025-01-28	CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data	Lee Carlin et.al.	2501.17324	null
2025-01-30	Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding	Yun-Shiuan Chuang et.al.	2501.17310	null
2025-01-28	"Ownership, Not Just Happy Talk": Co-Designing a Participatory Large Language Model for Journalism	Emily Tseng et.al.	2501.17299	null
2025-01-28	Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization	Zilu Tang et.al.	2501.17295	null
2025-01-28	Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology	Peilong Wang et.al.	2501.17286	null
2025-01-30	From Natural Language to Extensive-Form Game Representations	Shilong Deng et.al.	2501.17282	link
2025-01-28	Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models	Abdalaziz Al-Maeeni et.al.	2501.17279	null
2025-01-28	Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics	Jasper Timm et.al.	2501.17273	link
2025-01-28	Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care	Fengpei Yuan et.al.	2501.17206	null
2025-01-28	SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training	Tianzhe Chu et.al.	2501.17161	null
2025-01-28	FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data	Deren Lei et.al.	2501.17144	link
2025-01-28	ASTRAL: Automated Safety Testing of Large Language Models	Miriam Ugarte et.al.	2501.17132	null
2025-01-28	Optimizing Large Language Model Training Using FP4 Quantization	Ruizhe Wang et.al.	2501.17116	null
2025-01-28	Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction	Carl-Leander Henneking et.al.	2501.17112	null
2025-01-28	Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics	Guillaume Le Mailloux et.al.	2501.17107	link
2025-01-28	Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Evgenii Evstafev et.al.	2501.17084	null
2025-01-28	Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Akash Kumar et.al.	2501.17053	null
2025-01-28	Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models	Minghan Li et.al.	2501.17039	null
2025-01-28	Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies	Manojkumar Parmar et.al.	2501.17030	null
2025-01-28	Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs	Alessandro Midolo et.al.	2501.17024	link
2025-01-28	Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement	Kei Katsumata et.al.	2501.17022	link
2025-01-28	MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition	Philippe Pasquier et.al.	2501.17011	null
2025-01-28	Large Language Models for Code Generation: The Practitioners Perspective	Zeeshan Rasheed et.al.	2501.16998	link
2025-01-28	Artificial Intelligence Clones	Annie Liang et.al.	2501.16996	null
2025-01-28	FedEFM: Federated Endovascular Foundation Model with Unseen Data	Tuong Do et.al.	2501.16992	null
2025-01-28	Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver	Shunya Minami et.al.	2501.16986	null
2025-01-28	Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling	Hongzhi Huang et.al.	2501.16975	null
2025-01-28	Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers	Mohammad Raza et.al.	2501.16961	null
2025-01-28	Multiple Abstraction Level Retrieve Augment Generation	Zheng Zheng et.al.	2501.16952	null
2025-01-29	TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models	Makoto Shing et.al.	2501.16937	null
2025-01-28	Detecting harassment and defamation in cyberbullying with emotion-adaptive training	Peiling Yi et.al.	2501.16925	link
2025-01-28	RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains	Shady Nasrat et.al.	2501.16899	link
2025-01-28	Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model	Eoghan Cronin et.al.	2501.16893	null
2025-01-28	Irony Detection, Reasoning and Understanding in Zero-shot Learning	Peiling Yi et.al.	2501.16884	null
2025-01-28	Comparing Human and LLM Generated Code: The Jury is Still Out!	Sherlock A. Licorish et.al.	2501.16857	null
2025-01-28	Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis	Tiao Tan et.al.	2501.16842	null
2025-01-28	Misspellings in Natural Language Processing: A survey	Gianluca Sperduti et.al.	2501.16836	null
2025-01-28	DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model	Josua Spisak et.al.	2501.16800	null
2025-01-28	Algorithm for Automatic Legislative Text Consolidation	Matias Etcheverry et.al.	2501.16794	null
2025-01-28	Exponential Family Attention	Kevin Christian Wibisono et.al.	2501.16790	link
2025-01-28	Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding	Yun Li et.al.	2501.16786	null
2025-01-28	TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network	Yumingzhi Pan et.al.	2501.16784	null
2025-01-28	A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process	Jack David Carson et.al.	2501.16783	null
2025-01-29	Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models	Muhammad Atta ur Rahman et.al.	2501.16769	null
2025-01-28	DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation	Chenguo Lin et.al.	2501.16764	null
2025-01-28	HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns	Xinyue Shen et.al.	2501.16750	link
2025-01-28	Through the Prism of Culture: Evaluating LLMs' Understanding of Indian Subcultures and Traditions	Garima Chhikara et.al.	2501.16748	null
2025-01-28	LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience	Nimesh Jha et.al.	2501.16744	null
2025-01-28	Distilling Large Language Models for Network Active Queue Management	Deol Satish et.al.	2501.16734	null
2025-01-28	xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking	Sunbowen Lee et.al.	2501.16727	link
2025-01-28	One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning	Chunpeng Zhou et.al.	2501.16720	null
2025-01-28	Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection	Hengzhuang Li et.al.	2501.16718	link
2025-01-28	3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow	Yueen Ma et.al.	2501.16698	null
2025-01-28	MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark	Dongyi Yi et.al.	2501.16688	null
2025-01-28	Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting	Li Yin et.al.	2501.16673	link
2025-01-28	VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records	Philip Chung et.al.	2501.16672	link
2025-01-28	Contextual Reinforcement in Multimodal Token Compression for Large Language Models	Naderdel Piero et.al.	2501.16658	null
2025-01-28	Large Language Model Critics for Execution-Free Evaluation of Code Changes	Aashish Yadavally et.al.	2501.16655	link
2025-01-28	Molecular-driven Foundation Model for Oncologic Pathology	Anurag Vaidya et.al.	2501.16652	null
2025-01-28	DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models	Zeping Min et.al.	2501.16650	null
2025-01-28	An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue	Koji Inoue et.al.	2501.16643	null
2025-01-28	CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs	Jinlan Fu et.al.	2501.16629	link
2025-01-28	Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems	Baraa Hikal et.al.	2501.16616	null
2025-01-28	Sparse Autoencoders Trained on the Same Data Learn Different Features	Gonçalo Paulo et.al.	2501.16615	null
2025-01-28	Fine-Tuned Language Models as Space Systems Controllers	Enrico M. Zucchelli et.al.	2501.16588	null
2025-01-27	AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models	Zheng Lian et.al.	2501.16566	null
2025-01-27	LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation	Farzad Farhadzadeh et.al.	2501.16559	null
2025-01-27	Distributional Information Embedding: A Framework for Multi-bit Watermarking	Haiyun He et.al.	2501.16558	null
2025-01-27	PackDiT: Joint Human Motion and Text Generation via Mutual Prompting	Zhongyu Jiang et.al.	2501.16551	null
2025-01-27	PhysAnimator: Physics-Guided Generative Cartoon Animation	Tianyi Xie et.al.	2501.16550	null
2025-01-27	Sample-Efficient Behavior Cloning Using General Domain Knowledge	Feiyu Zhu et.al.	2501.16546	null
2025-01-27	Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees	Piyush Gupta et.al.	2501.16539	null
2025-01-27	Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs	Jean-Charles Noirot Ferrand et.al.	2501.16534	null
2025-01-27	A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain	Jorge del Pozo Lérida et.al.	2501.16533	null
2025-01-27	Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction	Atharva Naik et.al.	2501.16524	null
2025-01-27	How well can LLMs Grade Essays in Arabic?	Rayed Ghazawi et.al.	2501.16516	null
2025-01-27	Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models	Sudarshan Kamath Barkur et.al.	2501.16513	null
2025-01-27	Smoothed Embeddings for Robust Language Models	Ryo Hase et.al.	2501.16497	null
2025-01-27	Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations	Pablo Valenzuela-Toledo et.al.	2501.16495	null
2025-01-27	Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM	Payal Kamboj et.al.	2501.16481	link
2025-01-27	Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation	Philip Hughes et.al.	2501.16467	null
2025-01-27	CoCoNUT: Structural Code Understanding does not fall out of a tree	Claas Beger et.al.	2501.16456	link
2025-01-27	Detecting Zero-Day Attacks in Digital Substations via In-Context Learning	Faizan Manzoor et.al.	2501.16453	null
2025-01-27	360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation	Hamed Firooz et.al.	2501.16450	null
2025-01-27	DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation	Han Sun et.al.	2501.16410	null
2025-01-27	Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology	Meiyun Cao et.al.	2501.16309	null
2025-01-27	RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval	Long Nguyen et.al.	2501.16303	null
2025-01-27	Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width	Zheng Liu et.al.	2501.16302	null
2025-01-27	Large Models in Dialogue for Active Perception and Anomaly Detection	Tzoulio Chamiti et.al.	2501.16300	link
2025-01-27	FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers	Renshan Zhang et.al.	2501.16297	null
2025-01-27	Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models	Jing Zhang et.al.	2501.16282	null
2025-01-27	Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation	Jiayi Hong et.al.	2501.16277	link
2025-01-27	URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT	Long Nguyen et.al.	2501.16276	null
2025-01-27	A foundation model for human-AI collaboration in medical literature mining	Zifeng Wang et.al.	2501.16255	null
2025-01-27	Multi-Agent Geospatial Copilots for Remote Sensing Workflows	Chaehong Lee et.al.	2501.16254	null
2025-01-27	Zero-Shot Decision Tree Construction via Large Language Models	Lucas Carrasco et.al.	2501.16247	null
2025-01-27	CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation	Xiaochuan Ma et.al.	2501.16246	null
2025-01-27	Phase Transitions in Large Language Models and the $O(N)$ Model	Youran Sun et.al.	2501.16241	null
2025-01-27	AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses	Runze Cai et.al.	2501.16240	null
2025-01-28	Distilling foundation models for robust and efficient models in digital pathology	Alexandre Filiot et.al.	2501.16239	null
2025-01-27	Language-Based Bayesian Optimization Research Assistant (BORA)	Abdoulatif Cissé et.al.	2501.16224	null
2025-01-27	Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models	Huayu Li et.al.	2501.16215	link
2025-01-27	Provence: efficient and robust context pruning for retrieval-augmented generation	Nadezhda Chirkova et.al.	2501.16214	null
2025-01-27	Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs	Antony Bartlett et.al.	2501.16191	null
2025-01-27	SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting	Wenxuan Xie et.al.	2501.16178	link
2025-01-27	BAG: Body-Aligned 3D Wearable Asset Generation	Zhongjin Luo et.al.	2501.16177	null
2025-01-27	Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma	Richard Willis et.al.	2501.16173	link
2025-01-27	MetaDecorator: Generating Immersive Virtual Tours through Multimodality	Shuang Xie et.al.	2501.16164	null
2025-01-27	CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge	Yuwei Zhang et.al.	2501.16155	null
2025-01-27	AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought	Xin Huang et.al.	2501.16154	null
2025-01-27	AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants	Pascal J. Sager et.al.	2501.16150	null
2025-01-27	PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing	Yuwei Zhang et.al.	2501.16149	null
2025-01-27	SampleLLM: Optimizing Tabular Data Synthesis in Recommendations	Jingtong Gao et.al.	2501.16125	null
2025-01-27	Using Generative Models to Produce Realistic Populations of UK Windstorms	Yee Chun Tsoi et.al.	2501.16110	null
2025-01-27	Integration of LLM Quality Assurance into an NLG System	Ching-Yi Chen et.al.	2501.16078	null
2025-01-27	PISCO: Pretty Simple Compression for Retrieval-Augmented Generation	Maxime Louis et.al.	2501.16075	null
2025-01-27	A generative material transformer using Wyckoff representation	Pierre-Paul De Breuck et.al.	2501.16051	null
2025-01-27	Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation	Xing Zhang et.al.	2501.16050	null
2025-01-27	PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment	Vincent Freiberger et.al.	2501.16033	null
2025-01-27	FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments	Zhiyuan Fu et.al.	2501.16029	null
2025-01-27	Transformability reveals the interplay of dynamics across different network orders	Ming Xie et.al.	2501.16016	null
2025-01-27	TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference	Jack Min Ong et.al.	2501.16007	null
2025-01-27	EDSep: An Effective Diffusion-Based Method for Speech Source Separation	Jinwei Dong et.al.	2501.15965	null
2025-01-27	Rethinking the Bias of Foundation Model under Long-tailed Distribution	Jiahao Chen et.al.	2501.15955	null
2025-01-27	Understanding Long Videos via LLM-Powered Entity Relation Graphs	Meng Chu et.al.	2501.15953	null
2025-01-27	TimeHF: Billion-Scale Time Series Models Guided by Human Feedback	Yongzhi Qi et.al.	2501.15942	null
2025-01-27	SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub	Benjamin C. Carter et.al.	2501.15922	null
2025-01-27	Parametric Retrieval Augmented Generation	Weihang Su et.al.	2501.15915	link
2025-01-27	Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation	Muhammad Taha Tariq et.al.	2501.15901	null
2025-01-27	Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects	Victor Deng et.al.	2501.15900	null
2025-01-27	Adaptive Width Neural Networks	Federico Errica et.al.	2501.15889	null
2025-01-27	LCTG Bench: LLM Controlled Text Generation Benchmark	Kentaro Kurihara et.al.	2501.15875	link
2025-01-27	LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models	Yuewen Mei et.al.	2501.15850	null
2025-01-27	SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model	Delin Qu et.al.	2501.15830	null
2025-01-27	Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference	Tharindu B. Hewage et.al.	2501.15829	link
2025-01-27	MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer	Qi Chen et.al.	2501.15826	null
2025-01-27	LemmaHead: RAG Assisted Proof Generation Using Large Language Models	Tianbo Yang et.al.	2501.15797	null
2025-01-27	Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?	Zhiling Chen et.al.	2501.15795	null
2025-01-27	Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs	Yu Li et.al.	2501.15791	link
2025-01-27	Memorization and Regularization in Generative Diffusion Models	Ricardo Baptista et.al.	2501.15785	link
2025-01-27	Large Language Models to Diffusion Finetuning	Edoardo Cetin et.al.	2501.15781	null
2025-01-27	Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages	Ivory Yang et.al.	2501.15773	link
2025-01-27	GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design	Yuanfu Sun et.al.	2501.15755	null
2025-01-27	IndicMMLU-Pro: Benchmarking the Indic Large Language Models	Sankalp KJ et.al.	2501.15747	null
2025-01-27	Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning	Michael Xieyang Liu et.al.	2501.15727	null
2025-01-27	A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks	Dong Li et.al.	2501.15724	null
2025-01-27	On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models	Tadahiro Taniguchi et.al.	2501.15721	null
2025-01-26	Adapting Biomedical Abstracts into Plain language using Large Language Models	Haritha Gangavarapu et.al.	2501.15700	null
2025-01-26	TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs	Yuxuan Gu et.al.	2501.15674	null
2025-01-26	Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting	Yuxin Zhang et.al.	2501.15641	null
2025-01-26	BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation	Ali Khodabandeh Yalabadi et.al.	2501.15631	link
2025-01-26	Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets	Eduard Barbu et.al.	2501.15624	null
2025-01-26	Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning	Zeyu Gan et.al.	2501.15602	link
2025-01-26	Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals	Yinzhou Wang et.al.	2501.15599	null
2025-01-26	Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images	Sichen Zhu et.al.	2501.15598	link
2025-01-26	SedarEval: Automated Evaluation using Self-Adaptive Rubrics	Zhiyuan Fan et.al.	2501.15595	link
2025-01-26	SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain	Dakuan Lu et.al.	2501.15587	link
2025-01-26	Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework	Yuhong Sun et.al.	2501.15581	null
2025-01-26	Instruction Tuning for Story Understanding and Generation with Weak Supervision	Yangshu Yuan et.al.	2501.15574	null
2025-01-26	Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models	Spencer Ramsey et.al.	2501.15571	null
2025-01-26	ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer	Lin Yueyu et.al.	2501.15570	link
2025-01-26	Ocean-OCR: Towards General OCR Application via a Vision-Language Model	Song Chen et.al.	2501.15558	null
2025-01-26	Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles	Hanwen Zhang et.al.	2501.15544	null
2025-01-26	Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths	Yueyang Wang et.al.	2501.15522	null
2025-01-26	Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification	Dan Song et.al.	2501.15503	null
2025-01-26	Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning	Xiaohan Yu et.al.	2501.15470	null
2025-01-26	Data-adaptive Safety Rules for Training Reward Models	Xiaomin Li et.al.	2501.15453	null
2025-01-26	OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas	Xiaoyang Wang et.al.	2501.15427	null
2025-01-26	Visual Generation Without Guidance	Huayu Chen et.al.	2501.15420	link
2025-01-26	AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement	Junan Zhang et.al.	2501.15417	null
2025-01-26	The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation	Raha Aghaei et.al.	2501.15411	null
2025-01-26	Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency	Irin Kabakum et.al.	2501.15405	null
2025-01-26	How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning	Tohida Rehman et.al.	2501.15398	null
2025-01-26	Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations	Zijun Long et.al.	2501.15379	null
2025-01-26	How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback	Manzong Huang et.al.	2501.15378	null
2025-01-26	Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models	Melkamu Abay Mersha et.al.	2501.15374	null
2025-01-26	Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis	Robinson Umeike et.al.	2501.15370	null
2025-01-26	Decentralized Low-Rank Fine-Tuning of Large Language Models	Sajjad Ghiasvand et.al.	2501.15361	null
2025-01-26	Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection	Bo Yang et.al.	2501.15355	null
2025-01-25	Fairness in LLM-Generated Surveys	Andrés Abeliuk et.al.	2501.15351	null
2025-01-25	Between Puppet and Actor: Reframing Authorship in this Age of AI Agents	Yuqian Sun et.al.	2501.15346	null
2025-01-25	Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data	Jiajie Li et.al.	2501.15326	null
2025-01-25	ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning	Shangqian Gao et.al.	2501.15316	null
2025-01-25	The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders?	Ayo Adedeji et.al.	2501.15310	null
2025-01-25	You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning	Ayan Sengupta et.al.	2501.15296	null
2025-01-24	HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation	Xin Zhou et.al.	2501.14729	link
2025-01-24	Do LLMs Provide Consistent Answers to Health-Related Questions across Languages?	Ipek Baris Schlicht et.al.	2501.14719	null
2025-01-24	Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models	Naihao Deng et.al.	2501.14717	null
2025-01-24	FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing	James Seale Smith et.al.	2501.14713	null
2025-01-24	The Karp Dataset	Mason DiCicco et.al.	2501.14705	null
2025-01-24	Rethinking Table Instruction Tuning	Naihao Deng et.al.	2501.14693	null
2025-01-24	Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST	Fuping Wu et.al.	2501.14685	null
2025-01-24	An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations	Shabnam Hassani et.al.	2501.14683	null
2025-01-24	Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning	Jisi Zhang et.al.	2501.14680	null
2025-01-24	MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications	Yixing Jiang et.al.	2501.14654	link
2025-01-24	Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion	Ziyao Xu et.al.	2501.14649	link
2025-01-24	Towards Scalable Topological Regularizers	Hiu-Tung Wong et.al.	2501.14641	null
2025-01-24	Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics	Renato Ghisellini et.al.	2501.14634	null
2025-01-24	Extracting Problem Structure with LLMs for Optimized SAT Local Search	André Schilder et.al.	2501.14630	null
2025-01-24	Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data	Jordi Abante et.al.	2501.14615	null
2025-01-24	ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations	Tianming Liang et.al.	2501.14607	null
2025-01-24	Leveraging ChatGPT's Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research	Hamid Sarmadi et.al.	2501.14546	null
2025-01-24	VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning	Benjamin Callewaert et.al.	2501.14540	null
2025-01-24	Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models	Zhenguang Zhong et.al.	2501.14530	link
2025-01-24	Scene Understanding Enabled Semantic Communication with Open Channel Coding	Zhe Xiang et.al.	2501.14520	null
2025-01-24	Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel	Zhuoran Liu et.al.	2501.14512	null
2025-01-24	Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course	Pavlin G. Poličar et.al.	2501.14499	null
2025-01-24	Evaluating and Improving Graph to Text Generation with Large Language Models	Jie He et.al.	2501.14497	link
2025-01-24	RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques	Zhengyang Tang et.al.	2501.14492	link
2025-01-24	Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design	Taehan Kim et.al.	2501.14469	null
2025-01-24	Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis	Xiujing Guo et.al.	2501.14465	null
2025-01-24	Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing	Zeping Yu et.al.	2501.14457	null
2025-01-24	Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains	Xu Chu et.al.	2501.14431	null
2025-01-24	GraphBC: Improving LLMs for Better Graph Data Processing	Xu Chu et.al.	2501.14427	null
2025-01-24	CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios	Michael Fuest et.al.	2501.14426	null
2025-01-24	DeepFlow: Serverless Large Language Model Serving at Scale	Junhao Hu et.al.	2501.14417	null
2025-01-24	SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation	Shengjie Wang et.al.	2501.14400	null
2025-01-24	ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer	Yoni Schirris et.al.	2501.14379	link
2025-01-24	DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing	Xinyu Ma et.al.	2501.14371	link
2025-01-24	Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches	Ziad Sakr et.al.	2501.14366	null
2025-01-24	FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration	Kai-Tuo Xu et.al.	2501.14350	link
2025-01-24	Chain-of-Retrieval Augmented Generation	Liang Wang et.al.	2501.14342	null
2025-01-24	Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts	Clément Desroches et.al.	2501.14334	null
2025-01-24	Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models	Ridhi Jain et.al.	2501.14326	null
2025-01-24	PAID: A Framework of Product-Centric Advertising Image Design	Hongyu Chen et.al.	2501.14316	null
2025-01-24	Locality-aware Fair Scheduling in LLM Serving	Shiyi Cao et.al.	2501.14312	null
2025-01-24	A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education	Calvin Yeung et.al.	2501.14305	link
2025-01-24	MASTER: A Multi-Agent System with LLM Specialized MCTS	Bingzheng Gan et.al.	2501.14304	null
2025-01-24	Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph	Xujian Liang et.al.	2501.14300	link
2025-01-24	Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment	Julian A. Schnabel et.al.	2501.14296	null
2025-01-24	Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes	Sullam Jeoung et.al.	2501.14294	link
2025-01-24	Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches	Feng Zhou et.al.	2501.14291	null
2025-01-24	Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation	Sadegh Mahdavi et.al.	2501.14275	link
2025-01-24	Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors	Yi Zhao et.al.	2501.14250	link
2025-01-24	Humanity's Last Exam	Long Phan et.al.	2501.14249	null
2025-01-24	Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game	Rong Ye et.al.	2501.14225	null
2025-01-24	Top Ten Challenges Towards Agentic Neural Graph Databases	Jiaxin Bai et.al.	2501.14224	null
2025-01-24	TFG-Flow: Training-free Guidance in Multimodal Generative Flow	Haowei Lin et.al.	2501.14216	null
2025-01-24	Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading	Minrui Xu et.al.	2501.14205	null
2025-01-24	VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking	Runyi Hu et.al.	2501.14195	link
2025-01-24	Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models	Saaduddin Mahmud et.al.	2501.14189	null
2025-01-24	GeoSim.AI: AI assistants for numerical simulations in geomechanics	Yared W. Bekele et.al.	2501.14186	null
2025-01-24	AI Chatbots as Professional Service Agents: Developing a Professional Identity	Wenwen Li et.al.	2501.14179	null
2025-01-24	Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models	Yile Gu et.al.	2501.14170	null
2025-01-24	Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction	Dongming Sheng et.al.	2501.14144	null
2025-01-23	Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation	Derek Yotheringhay et.al.	2501.14119	null
2025-01-23	Domain-Factored Untrained Deep Prior for Spectrum Cartography	Subash Timilsina et.al.	2501.14116	null
2025-01-23	MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning	Joshua Davis et.al.	2501.14105	link
2025-01-23	StreamingRAG: Real-time Contextual Retrieval and Generation Framework	Murugan Sankaradas et.al.	2501.14101	null
2025-01-23	Enhancing Biomedical Relation Extraction with Directionality	Po-Ting Lai et.al.	2501.14079	link
2025-01-23	LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language	Yubin Ge et.al.	2501.14073	null
2025-01-23	Efficient 2D CT Foundation Model for Contrast Phase Classification	Benjamin Hou et.al.	2501.14066	null
2025-01-23	Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models	Jakob Krogh Petersen et.al.	2501.14051	link
2025-01-23	LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps	Andrey Palaev et.al.	2501.14046	link
2025-01-23	Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions	Jianfeng Zhu et.al.	2501.14037	null
2025-01-23	CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation	Guofeng Cui et.al.	2501.13927	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	Binary Diffusion Probabilistic Model	Vitaliy Kinakh et.al.	2501.13915	null
2025-01-23	Analysis of Indic Language Capabilities in LLMs	Aatman Vaidya et.al.	2501.13912	null
2025-01-23	Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models	Linh Tran et.al.	2501.13904	null
2025-01-23	Exploring Finetuned Audio-LLM on Heart Murmur Features	Adrian Florea et.al.	2501.13884	null
2025-01-23	The machine learning platform for developers of large systems	Alexey Naikov et.al.	2501.13881	null
2025-01-23	A RAG-Based Institutional Assistant	Gustavo Kuratomi et.al.	2501.13880	null
2025-01-23	On the Reasoning Capacity of AI Models and How to Quantify It	Santosh Kumar Radha et.al.	2501.13833	null
2025-01-23	Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing	Hao Zhang et.al.	2501.13831	null
2025-01-23	Hallucinations Can Improve Large Language Models in Drug Discovery	Shuzhou Yuan et.al.	2501.13824	null
2025-01-23	Large Language Model driven Policy Exploration for Recommender Systems	Jie Wang et.al.	2501.13816	null
2025-01-23	Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change	Mowafak Allaham et.al.	2501.13802	null
2025-01-23	Parameter-Efficient Fine-Tuning for Foundation Models	Dan Zhang et.al.	2501.13787	link
2025-01-23	Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling	Tanya Rodchenko et.al.	2501.13779	null
2025-01-23	Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework	Yoonsang Kim et.al.	2501.13778	link
2025-01-23	Do Large Language Models Truly Understand Geometric Structures?	Xiaofeng Wang et.al.	2501.13773	link
2025-01-23	Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak	Erjia Xiao et.al.	2501.13772	null
2025-01-23	UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models	Xin Xu et.al.	2501.13766	null
2025-01-23	EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents	Yuhui Yun et.al.	2501.13746	null
2025-01-23	GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification	Te Pei et.al.	2501.13743	null
2025-01-23	An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities	Zezhou Yang et.al.	2501.13742	link
2025-01-23	Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks	Chang Gong et.al.	2501.13731	null
2025-01-23	RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation	Shi-Qi Yan et.al.	2501.13726	null
2025-01-23	Musical ethnocentrism in Large Language Models	Anna Kruspe et.al.	2501.13720	null
2025-01-23	A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation	Dario Serez et.al.	2501.13718	null
2025-01-23	EventVL: Understand Event Streams via Multimodal Large Language Model	Pengteng Li et.al.	2501.13707	null
2025-01-23	DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale	Linghao Zhang et.al.	2501.13699	null
2025-01-23	Question Answering on Patient Medical Records with Private Fine-Tuned LLMs	Sara Kothari et.al.	2501.13687	null
2025-01-23	HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor	Zihui Wu et.al.	2501.13677	link
2025-01-23	How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization	Shezheng Song et.al.	2501.13669	null
2025-01-23	LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models	Yizheng Sun et.al.	2501.13652	null
2025-01-23	Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models	Zhenghao Lin et.al.	2501.13629	null
2025-01-23	Text-to-SQL based on Large Language Models and Database Keyword Search	Eduardo R. Nascimento et.al.	2501.13594	null
2025-01-23	Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization	Lei Huang et.al.	2501.13573	null
2025-01-23	One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt	Tao Liu et.al.	2501.13554	link
2025-01-23	LLMs Can Plan Only If We Tell Them	Bilgehan Sel et.al.	2501.13545	null
2025-01-23	ReasVQA: Advancing VideoQA with Imperfect Reasoning Process	Jianxin Liang et.al.	2501.13536	null
2025-01-23	RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles	Munachiso Nwadike et.al.	2501.13491	null
2025-01-23	Adaptive Testing for LLM-Based Applications: A Diversity-based Approach	Juyeon Yoon et.al.	2501.13480	null
2025-01-23	LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation	JiaXin Chen et.al.	2501.13475	null
2025-01-23	Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge	Haomiao Xiong et.al.	2501.13468	link
2025-01-23	Spurious Forgetting in Continual Learning of Language Models	Junhao Zheng et.al.	2501.13453	link
2025-01-23	Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models	Bo Gao et.al.	2501.13428	null
2025-01-23	Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling	Tomek Jaroslawski et.al.	2501.13415	null
2025-01-23	VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework	He Kong et.al.	2501.13411	link
2025-01-23	Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures	Minglong Dong et.al.	2501.13396	null
2025-01-23	Can Large Language Models Understand Preferences in Personalized Recommendation?	Zhaoxuan Tan et.al.	2501.13391	link
2025-01-23	Do as We Do, Not as You Think: the Conformity of Large Language Models	Zhiyuan Weng et.al.	2501.13381	link
2025-01-23	Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility	Gabrielle Hoyer et.al.	2501.13376	null
2025-01-23	Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement	Jae-Sung Bae et.al.	2501.13372	null
2025-01-23	Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification	Yuzhuo Li et.al.	2501.13368	null
2025-01-23	50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications	Zewei Shi et.al.	2501.13351	null
2025-01-23	MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize	Haohang Xu et.al.	2501.13349	null
2025-01-23	Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation	Rong Shan et.al.	2501.13344	null
2025-01-23	Multi-aspect Knowledge Distillation with Large Language Model	Taegyeong Lee et.al.	2501.13341	link
2025-01-23	Generative Multi-Form Bayesian Optimization	Zhendong Guo et.al.	2501.13337	null
2025-01-23	SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network	Songge Zhang et.al.	2501.13318	null
2025-01-23	Representing Visualization Insights as a Dense Insight Network	Jane Hoffswell et.al.	2501.13309	null
2025-01-23	OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia	Xuelong Geng et.al.	2501.13306	link
2025-01-23	Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers	Akshit Achara et.al.	2501.13302	link
2025-01-23	Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents	Shrinidhi Kumbhar et.al.	2501.13299	null
2025-01-23	RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering	Yang Bai et.al.	2501.13297	link
2025-01-23	Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols	John Joon Young Chung et.al.	2501.13284	null
2025-01-22	MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis	Daeun Jung et.al.	2501.13277	link
2025-01-22	RAG-Reward: Optimizing RAG with Reward Modeling and RLHF	Hanning Zhang et.al.	2501.13264	null
2025-01-22	Exploring GPT's Ability as a Judge in Music Understanding	Kun Fang et.al.	2501.13261	link
2025-01-22	Bypassing Array Canaries via Autonomous Function Call Resolution	Nathaniel Oh et.al.	2501.13256	link
2025-01-22	S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning	Yichen Wu et.al.	2501.13198	null
2025-01-22	Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century	Axel Loewe et.al.	2501.13142	null
2025-01-23	VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding	Boqiang Zhang et.al.	2501.13106	link
2025-01-22	Robust Representation Consistency Model via Contrastive Denoising	Jiachen Lei et.al.	2501.13094	link
2025-01-22	Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment	Melissa Kazemi Rad et.al.	2501.13080	null
2025-01-22	Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning	Bohao Yang et.al.	2501.13042	link
2025-01-22	Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament	Yantao Liu et.al.	2501.13007	link
2025-01-22	Neural network enhanced cross entropy benchmark for monitored circuits	Yangrui Hu et.al.	2501.13005	null
2025-01-22	Large Language Model-Based Semantic Communication System for Image Transmission	Soheyb Ribouh et.al.	2501.12988	null
2025-01-22	LLM4WM: Adapting LLM for Wireless Multi-Tasking	Xuanyu Liu et.al.	2501.12983	null
2025-01-22	Low-dimensional adaptation of diffusion models: Convergence in total variation	Jiadong Liang et.al.	2501.12982	null
2025-01-22	OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models	Chongren Sun et.al.	2501.12975	link
2025-01-22	Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs	Jan Corazza et.al.	2501.12972	null
2025-01-22	It's complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act	Kristof Meding et.al.	2501.12962	null
2025-01-22	Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference	Weizhi Fei et.al.	2501.12959	null
2025-01-22	GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models	Pengxiang Zhao et.al.	2501.12956	null
2025-01-22	3D Object Manipulation in a Single Image using Generative Models	Ruisi Zhao et.al.	2501.12935	null
2025-01-22	Correctness Assessment of Code Generated by Large Language Models Using Internal Representations	Tuan-Dung Bui et.al.	2501.12934	null
2025-01-22	DynamicEarth: How Far are We from Open-Vocabulary Change Detection?	Kaiyu Li et.al.	2501.12931	null
2025-01-22	A Functional Software Reference Architecture for LLM-Integrated Systems	Alessio Bucaioni et.al.	2501.12904	null
2025-01-22	Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration	Offa Kingsleigh et.al.	2501.12901	null
2025-01-22	Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback	Yafu Li et.al.	2501.12895	link
2025-01-23	Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program	Carlton Shepherd et.al.	2501.12883	null
2025-01-22	WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge	Jingyuan Chen et.al.	2501.12877	null
2025-01-22	ACEBench: Who Wins the Match Point in Tool Learning?	Chen Chen et.al.	2501.12851	null
2025-01-22	AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation	Aghiles Kebaili et.al.	2501.12840	null
2025-01-22	Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home	Viktor Moskvoretskii et.al.	2501.12835	null
2025-01-22	Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek	John Pavlopoulos et.al.	2501.12826	link
2025-01-22	Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks	Alessio Quercia et.al.	2501.12824	null
2025-01-22	Certified Guidance for Planning with Deep Generative Models	Francesco Giacomarra et.al.	2501.12815	null
2025-01-22	Revisit Self-Debugging with Self-Generated Tests for Code Generation	Xiancai Chen et.al.	2501.12793	null
2025-01-22	LLMs as Repositories of Factual Knowledge: Limitations and Solutions	Seyed Mahed Mousavi et.al.	2501.12774	null
2025-01-22	NExtLong: Toward Effective Long-Context Training without Long Documents	Chaochen Gao et.al.	2501.12766	link
2025-01-22	Online Preference Alignment for Language Models via Count-based Exploration	Chenjia Bai et.al.	2501.12735	link
2025-01-22	Paradigm-Based Automatic HDL Code Generation Using LLMs	Wenhao Sun et.al.	2501.12702	null
2025-01-22	Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression	Kai Yoshida et.al.	2501.12698	null
2025-01-22	Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering	Qian Tao et.al.	2501.12697	null
2025-01-22	SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling	Shengshi Yao et.al.	2501.12696	null
2025-01-22	EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation	Yifan Yu et.al.	2501.12689	null
2025-01-22	Distillation Quantification for Large Language Models	Sunbowen Lee et.al.	2501.12619	link
2025-01-22	Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?	Taiming Wang et.al.	2501.12617	null
2025-01-22	Kimi k1.5: Scaling Reinforcement Learning with LLMs	Kimi Team et.al.	2501.12599	null
2025-01-22	Leveraging LLMs to Create a Haptic Devices' Recommendation System	Yang Liu et.al.	2501.12573	null
2025-01-22	Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review	Rock Yuren Pang et.al.	2501.12557	link
2025-01-21	Human-like conceptual representations emerge from language prediction	Ningyu Xu et.al.	2501.12547	null
2025-01-21	How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models?	Mirali Purohit et.al.	2501.12535	null
2025-01-21	An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts	Dhia Elhaq Rzig et.al.	2501.12521	null
2025-01-21	A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data	Minh Tran et.al.	2501.12501	null
2025-01-21	The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws	Tian Jin et.al.	2501.12486	null
2025-01-21	An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models	Xiaoyu Chu et.al.	2501.12469	link
2025-01-21	Adaptive PII Mitigation Framework for Large Language Models	Shubhi Asthana et.al.	2501.12465	null
2025-01-21	Empowering AIOps: Leveraging Large Language Models for IT Operations ManagementOperations Management	Arthur Vitui et.al.	2501.12461	link
2025-01-21	Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications	Shubhi Asthana et.al.	2501.12456	null
2025-01-21	Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation	Dongsheng Zhu et.al.	2501.12432	null
2025-01-21	FREYR: A Framework for Recognizing and Executing Your Requests	Roberto Gallotta et.al.	2501.12423	link
2025-01-21	CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning	Eunjee Choi et.al.	2501.12422	null
2025-01-22	InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling	Yi Wang et.al.	2501.12386	link
2025-01-21	Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks	Greg Olmschenk et.al.	2501.12383	null
2025-01-21	MMVU: Measuring Expert-Level Multi-Discipline Video Understanding	Yilun Zhao et.al.	2501.12380	link
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists	Thomas F. Eisenmann et.al.	2501.12374	link
2025-01-21	Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL	Yeounoh Chung et.al.	2501.12372	null
2025-01-21	Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration	Thomas Walshe et.al.	2501.12332	null
2025-01-21	Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops	Mohamed Harmanani et.al.	2501.12331	link
2025-01-21	VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Xianwei Zhuang et.al.	2501.12327	link
2025-01-21	LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations	Hasan Abu-Rasheed et.al.	2501.12300	null
2025-01-21	MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks	Qishen Zhou et.al.	2501.12281	link
2025-01-21	Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement	Maosong Cao et.al.	2501.12273	link
2025-01-21	FOCUS: First Order Concentrated Updating Scheme	Yizhou Liu et.al.	2501.12243	null
2025-01-21	InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models	Pha Nguyen et.al.	2501.12231	null
2025-01-21	CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning	Yuanheng Fang et.al.	2501.12226	null
2025-01-21	Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces	Allard Oelen et.al.	2501.12221	null
2025-01-21	You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense	Wuyuao Mai et.al.	2501.12210	null
2025-01-21	Explainability for Vision Foundation Models: A Survey	Rémi Kazmierczak et.al.	2501.12203	null
2025-01-22	Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation	Zibo Zhao et.al.	2501.12202	link
2025-01-21	BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks	Zhuang Li et.al.	2501.12174	null
2025-01-21	Contextualizing Recommendation Explanations with LLMs: A User Study	Yuanjun Feng et.al.	2501.12152	null
2025-01-21	Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities	Qirun Dai et.al.	2501.12147	null
2025-01-21	Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot	Daniele Bifolco et.al.	2501.12134	null
2025-01-21	Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions	Tim Schreiter et.al.	2501.12128	null
2025-01-21	Can open source large language models be used for tumor documentation in Germany? -- An evaluation on urological doctors' notes	Stefan Lenz et.al.	2501.12106	link
2025-01-21	Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis	Weile Luo et.al.	2501.12084	null
2025-01-21	Phishing Awareness via Game-Based Learning	Argianto Rahartomo et.al.	2501.12077	link
2025-01-21	PINNsAgent: Automated PDE Surrogation with Large Language Models	Qingpo Wuwu et.al.	2501.12053	null
2025-01-21	Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation	Chen Griner et.al.	2501.12033	null
2025-01-21	Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis	Hongjun Liu et.al.	2501.12023	null
2025-01-21	Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection?	Samantha Min Er Yew et.al.	2501.12016	null
2025-01-21	Rate-Aware Learned Speech Compression	Jun Xu et.al.	2501.11999	null
2025-01-21	Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models	Rupesh Raj Karn et.al.	2501.11979	null
2025-01-21	Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues	Maya Medjad et.al.	2501.11977	link
2025-01-21	Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization	Jie Zhao et.al.	2501.11968	null
2025-01-21	A Hybrid Attention Framework for Fake News Detection with Large Language Models	Xiaochuan Xu et.al.	2501.11967	null
2025-01-21	TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection	Yang Cao et.al.	2501.11960	null
2025-01-21	Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model	Minghan Wang et.al.	2501.11953	null
2025-01-21	ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation	Peter Devine et.al.	2501.11929	link
2025-01-21	Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model	He Chang et.al.	2501.11911	null
2025-01-21	Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation	Junhong Lian et.al.	2501.11900	link
2025-01-22	Med-R $^2$ : Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine	Keer Lu et.al.	2501.11885	null
2025-01-21	From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning	Yafu Li et.al.	2501.11877	link
2025-01-21	LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems	Venkata Sai Aswath Duvvuru et.al.	2501.11864	null
2025-01-21	EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents	Zhili Cheng et.al.	2501.11858	link
2025-01-21	Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance	Nikos Kanakaris et.al.	2501.11849	link
2025-01-21	A Survey on Memory-Efficient Large-Scale Model Training in AI for Science	Kaiyuan Tian et.al.	2501.11847	null
2025-01-21	Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction	Noah L. Schroeder et.al.	2501.11840	null
2025-01-21	PXGen: A Post-hoc Explainable Method for Generative Models	Yen-Lung Huang et.al.	2501.11827	null
2025-01-21	CogMorph: Cognitive Morphing Attacks for Text-to-Image Models	Zonglei Jing et.al.	2501.11815	null
2025-01-20	Benchmarking Large Language Models via Random Variables	Zijin Hong et.al.	2501.11790	null
2025-01-20	Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection	Ali Naseh et.al.	2501.11786	null
2025-01-20	Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference	Pouya Hamadanian et.al.	2501.11779	link
2025-01-20	The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers	Alina Starovolsky-Shitrit et.al.	2501.11770	null
2025-01-20	Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems	Fatemeh Nazary et.al.	2501.11759	link
2025-01-20	A generalizable 3D framework and model for self-supervised learning in medical imaging	Tony Xu et.al.	2501.11755	null
2025-01-20	Are generative models fair? A study of racial bias in dermatological image generation	Miguel López-Pérez et.al.	2501.11752	null
2025-01-20	Optimizing Pretraining Data Mixtures with LLM-Estimated Utility	William Held et.al.	2501.11747	null
2025-01-20	MedicoSAM: Towards foundation models for medical image segmentation	Anwai Archit et.al.	2501.11734	link
2025-01-20	Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks	Zhenhailong Wang et.al.	2501.11733	null
2025-01-20	Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy	Saeid Asgari Taghanaki et.al.	2501.11721	link
2025-01-20	YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives	Nong Ming et.al.	2501.11712	link
2025-01-20	Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution	Ramtin Ehsani et.al.	2501.11709	null
2025-01-20	Trustformer: A Trusted Federated Transformer	Ali Abbasi Tadi et.al.	2501.11706	null
2025-01-20	Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s)	Brian E. Perron et.al.	2501.11705	null
2025-01-20	Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling	Zhenyu Hou et.al.	2501.11651	link
2025-01-20	Trojan Detection Through Pattern Recognition for Large Language Models	Vedant Bhasin et.al.	2501.11621	null
2025-01-20	Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems	Giorgio Robino et.al.	2501.11613	null
2025-01-20	SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks	Wentao Wan et.al.	2501.11599	link
2025-01-20	Recurrent Diffusion for Large-Scale Parameter Generation	Kai Wang et.al.	2501.11587	link
2025-01-20	Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models	Mahyar Habibi et.al.	2501.11581	null
2025-01-20	Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution	Zhiyuan You et.al.	2501.11561	null
2025-01-20	PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation	Jinyu Wang et.al.	2501.11551	link
2025-01-20	UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion	Zixuan Chen et.al.	2501.11515	null
2025-01-20	Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges	Vincent Koc et.al.	2501.11496	null
2025-01-20	Graph-defined Language Learning with LLMs	Huachi Zhou et.al.	2501.11478	null
2025-01-20	Curiosity-Driven Reinforcement Learning from Human Feedback	Haoran Sun et.al.	2501.11463	link
2025-01-20	Ontology Matching with Large Language Models and Prioritized Depth-First Search	Maria Taboada et.al.	2501.11441	null
2025-01-20	One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor	Zhikun Wu et.al.	2501.11433	null
2025-01-20	A Survey on Diffusion Models for Anomaly Detection	Jing Liu et.al.	2501.11430	link
2025-01-20	Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training	Siyu Yuan et.al.	2501.11425	link
2025-01-20	Neural Contextual Reinforcement Framework for Logical Structure Language Generation	Marcus Irvin et.al.	2501.11417	null
2025-01-20	Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing	Kevin Sim et.al.	2501.11411	null
2025-01-20	Revisiting Language Models in Neural News Recommender Systems	Yuyue Zhao et.al.	2501.11391	link
2025-01-20	Towards Advancing Code Generation with Large Language Models: A Research Roadmap	Haolin Jin et.al.	2501.11354	null
2025-01-20	EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery	Guankun Wang et.al.	2501.11347	link
2025-01-20	GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video	Zhenliang Ni et.al.	2501.11340	null
2025-01-20	Few-shot Policy (de)composition in Conversational Question Answering	Kyle Erwin et.al.	2501.11335	null
2025-01-20	Nested Annealed Training Scheme for Generative Adversarial Networks	Chang Wan et.al.	2501.11318	null
2025-01-20	Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning	Zhongtian Hu et.al.	2501.11292	null
2025-01-20	Large Language Model Agents for Radio Map Generation and Wireless Network Planning	Hongye Quan et.al.	2501.11283	null
2025-01-20	Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries	Yi-Hui Lee et.al.	2501.11273	null
2025-01-20	Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios	Zhongtian Hu et.al.	2501.11269	null
2025-01-20	Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian	Wannita Takerngsaksiri et.al.	2501.11264	link
2025-01-20	Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models	Zhuangzhuang Yan et.al.	2501.11247	null
2025-01-20	Irony in Emojis: A Comparative Study of Human and LLM Interpretation	Yawen Zheng et.al.	2501.11241	null
2025-01-20	KPL: Training-Free Medical Knowledge Mining of Vision-Language Models	Jiaxiang Liu et.al.	2501.11231	link
2025-01-20	Reasoning Language Models: A Blueprint	Maciej Besta et.al.	2501.11223	link
2025-01-20	Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation	Ivan Lopez et.al.	2501.11199	null
2025-01-19	Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests	Kristin Blesch et.al.	2501.11178	link
2025-01-17	FaceXBench: Evaluating Multimodal LLMs on Face Understanding	Kartik Narayan et.al.	2501.10360	link
2025-01-17	Zero-Shot Monocular Scene Flow Estimation in the Wild	Yiqing Liang et.al.	2501.10357	null
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	null
2025-01-17	Large language models for automated scholarly paper review: A survey	Zhenzhen Zhuang et.al.	2501.10326	null
2025-01-17	HiMix: Reducing Computational Complexity in Large Vision-Language Models	Xuange Zhang et.al.	2501.10318	null
2025-01-17	Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs	Claudio Di Sipio et.al.	2501.10313	null
2025-01-17	Computational Protein Science in the Era of Large Language Models (LLMs)	Wenqi Fan et.al.	2501.10282	null
2025-01-17	Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation	Azat Abdullin et.al.	2501.10200	null
2025-01-17	Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education	William Hersh et.al.	2501.10186	null
2025-01-17	Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval	Vera Pavlova et.al.	2501.10175	null
2025-01-17	Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis	Abhishek Kaushik et.al.	2501.10134	null
2025-01-17	ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario	Lucen Zhong et.al.	2501.10132	link
2025-01-17	PaSa: An LLM Agent for Comprehensive Academic Paper Search	Yichen He et.al.	2501.10120	link
2025-01-17	AI-Generated Music Detection and its Challenges	Darius Afchar et.al.	2501.10111	link
2025-01-17	LLM Reasoner and Automated Planner: A new NPC approach	Israel Puerta-Merino et.al.	2501.10106	null
2025-01-17	Universal Actions for Enhanced Embodied Foundation Models	Jinliang Zheng et.al.	2501.10105	link
2025-01-17	Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Michael Schwingshackl et.al.	2501.10080	link
2025-01-17	FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization	Zhaopeng Gu et.al.	2501.10067	link
2025-01-17	Accelerating Large Language Models through Partially Linear Feed-Forward Network	Gansen Hu et.al.	2501.10054	null
2025-01-17	AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search	Wenfeng Feng et.al.	2501.10053	null
2025-01-17	Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists	Alyssia Chen et.al.	2501.10037	null
2025-01-17	Mapping scientific communities at scale	Victor Barbier et.al.	2501.10035	link
2025-01-17	Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions	Zhijie Tan et.al.	2501.10011	null
2025-01-17	Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models	Qiang Liu et.al.	2501.09997	null
2025-01-17	Agent-as-Judge for Factual Summarization of Long Narratives	Yeonseok Jeong et.al.	2501.09993	link
2025-01-17	RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation	Yuefan Cao et.al.	2501.09982	null
2025-01-17	GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions	Heda Zuo et.al.	2501.09972	null
2025-01-17	Explainable artificial intelligence (XAI): from inherent explainability to large language models	Fuseini Mumuni et.al.	2501.09967	null
2025-01-17	A Survey on Multi-Turn Interaction Capabilities of Large Language Models	Chen Zhang et.al.	2501.09959	null
2025-01-17	FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs	Zengyi Gao et.al.	2501.09957	null
2025-01-17	AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations	Jamin Seo et.al.	2501.09954	link
2025-01-17	Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt	Qingcheng Zeng et.al.	2501.09950	null
2025-01-17	MultiPruner: Balanced Structure Removal in Foundation Models	J. Pablo Muñoz et.al.	2501.09949	link
2025-01-17	Steering Large Language Models with Feature Guided Activation Additions	Samuel Soo et.al.	2501.09929	null
2025-01-17	Towards A Litmus Test for Common Sense	Hugo Latapie et.al.	2501.09913	null
2025-01-17	Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project's Talent Knowledge Graph	Jiawei Xu et.al.	2501.09909	null
2025-01-17	Position: Open and Closed Large Language Models in Healthcare	Jiawei Xu et.al.	2501.09906	null
2025-01-17	FoundationStereo: Zero-Shot Stereo Matching	Bowen Wen et.al.	2501.09898	null
2025-01-17	Evolving Deeper LLM Thinking	Kuang-Huei Lee et.al.	2501.09891	null
2025-01-17	Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment	Mohammad Sadegh Sheikhaei et.al.	2501.09888	link
2025-01-17	FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis	Zhe Chen et.al.	2501.09887	null
2025-01-16	ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction	Izzeddin Teeti et.al.	2501.09878	null
2025-01-16	Geometry-Preserving Encoder/Decoder in Latent Generative Models	Wonjun Lee et.al.	2501.09876	null
2025-01-16	An LLM-Guided Tutoring System for Social Skills Training	Michael Guevarra et.al.	2501.09870	null
2025-01-16	Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing	Wenhan Wang et.al.	2501.09866	null
2025-01-16	Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message	Jiawei Li et.al.	2501.09861	null
2025-01-16	PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery	Shristi Das Biswas et.al.	2501.09826	link
2025-01-16	Bridging Language Barriers in Healthcare: A Study on Arabic LLMs	Nada Saadi et.al.	2501.09825	null
2025-01-16	BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling	Daniele Castellana et.al.	2501.09821	link
2025-01-16	Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems	Soham Roy et.al.	2501.09801	null
2025-01-16	Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API	Andrey Labunets et.al.	2501.09798	null
2025-01-16	GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation	Weiliang Tang et.al.	2501.09783	null
2025-01-16	SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation	Wanqi Yin et.al.	2501.09782	link
2025-01-16	VideoWorld: Exploring Knowledge Learning from Unlabeled Videos	Zhongwei Ren et.al.	2501.09781	null
2025-01-16	Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong	Tairan Fu et.al.	2501.09775	null
2025-01-16	Distilling Multi-modal Large Language Models for Autonomous Driving	Deepti Hegde et.al.	2501.09757	null
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	null
2025-01-16	Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues	Youngjoon Jang et.al.	2501.09754	null
2025-01-16	OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking	Zekun Xi et.al.	2501.09751	null
2025-01-16	Enhancing Lexicon-Based Text Embeddings with Large Language Models	Yibin Lei et.al.	2501.09749	null
2025-01-16	Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models	Bihui Jin et.al.	2501.09745	null
2025-01-16	KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports	Hajung Kim et.al.	2501.09744	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	A Simple Aerial Detection Baseline of Multimodal Language Models	Qingyun Li et.al.	2501.09720	link
2025-01-16	Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text	Jihed Ncib et.al.	2501.09719	null
2025-01-16	CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education	Tianyu Wang et.al.	2501.09709	link
2025-01-16	Domain Adaptation of Foundation LLMs for e-Commerce	Christian Herold et.al.	2501.09706	null
2025-01-16	Cueless EEG imagined speech for subject identification: dataset and benchmarks	Ali Derakhshesh et.al.	2501.09700	link
2025-01-16	Simulated Interactive Debugging	Yannic Noller et.al.	2501.09694	null
2025-01-17	Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities	Fengli Xu et.al.	2501.09686	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark	Alexis Roger et.al.	2501.09672	null
2025-01-16	A Survey of Research in Large Language Models for Electronic Design Automation	Jingyu Pan et.al.	2501.09655	null
2025-01-16	The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models	Jonathan Katzy et.al.	2501.09653	null
2025-01-16	CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding	Johannes Kirmayr et.al.	2501.09645	link
2025-01-17	LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading	Kuan-Ming Liu et.al.	2501.09636	null
2025-01-16	Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework	Yushen Lin et.al.	2501.09631	null
2025-01-16	Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment	Chaoqi Wang et.al.	2501.09620	link
2025-01-16	From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs	Hrithik Majumdar Shibu et.al.	2501.09604	link
2025-01-16	Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures	Pratyush Dhingra et.al.	2501.09588	null
2025-01-16	Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis	Tingxuan Chen et.al.	2501.09555	null
2025-01-16	AI in Support of Diversity and Inclusion	Çiçek Güven et.al.	2501.09534	null
2025-01-16	Confidence Estimation for Error Detection in Text-to-SQL Systems	Oleg Somov et.al.	2501.09527	null
2025-01-16	Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data	Omar Mena et.al.	2501.09521	null
2025-01-16	AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation	Junjie He et.al.	2501.09503	null
2025-01-16	Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis	Qize Yang et.al.	2501.09502	null
2025-01-16	Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework	Nuo Chen et.al.	2501.09493	null
2025-01-16	Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators	Zhaocheng Liu et.al.	2501.09484	link
2025-01-16	Guided Debugging of Auto-Translated Code Using Differential Testing	Shengnan Wu et.al.	2501.09475	null
2025-01-16	DEFOM-Stereo: Depth Foundation Model Based Stereo Matching	Hualie Jiang et.al.	2501.09466	link
2025-01-16	Pruning for Sparse Diffusion Models based on Gradient Flow	Ben Wan et.al.	2501.09464	null
2025-01-16	"A Great Start, But...": Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design	Tianhao He et.al.	2501.09457	null
2025-01-16	Solving the unsolvable: Translating case law in Hong Kong	King-kui Sin et.al.	2501.09444	null
2025-01-16	Scaling up self-supervised learning for improved surgical foundation models	Tim J. M. Jaspers et.al.	2501.09436	link
2025-01-16	CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation	Hwan Heo et.al.	2501.09433	link
2025-01-16	A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy	Huandong Wang et.al.	2501.09431	null
2025-01-16	AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring	Xinyi Wang et.al.	2501.09428	null
2025-01-16	AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling	Ancheng Xu et.al.	2501.09426	null
2025-01-16	FASP: Fast and Accurate Structured Pruning of Large Language Models	Hanyu Hu et.al.	2501.09412	null
2025-01-16	MoE $^2$ : Optimizing Collaborative Inference for Edge Large Language Models	Lyudong Jin et.al.	2501.09410	null
2025-01-16	Adaptive Contextual Caching for Mobile Edge Large Language Model Service	Guangyuan Liu et.al.	2501.09383	null
2025-01-16	Aligning Instruction Tuning with Pre-training	Yiming Liang et.al.	2501.09368	null
2025-01-16	PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks	Huiyou Zhan et.al.	2501.09367	null
2025-01-16	YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks	Saptarashmi Bandyopadhyay et.al.	2501.09355	null
2025-01-16	UVRM: A Scalable 3D Reconstruction Model from Unposed Videos	Shiu-hong Kao et.al.	2501.09347	null
2025-01-16	Rational Tuning of LLM Cascades via Probabilistic Modeling	Michael J. Zellinger et.al.	2501.09345	null
2025-01-16	SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs	Anbang Ye et.al.	2501.09316	null
2025-01-16	A Study of In-Context-Learning-Based Text-to-SQL Errors	Jiawei Shen et.al.	2501.09310	link
2025-01-16	To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation	Kaustubh D. Dhole et.al.	2501.09292	null
2025-01-16	LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport	Kyeongha Rho et.al.	2501.09291	link
2025-01-16	Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding	Kohei Torimi et.al.	2501.09278	null
2025-01-16	Large Language Model is Secretly a Protein Sequence Optimizer	Yinkai Wang et.al.	2501.09274	null
2025-01-16	Perspective Transition of Large Language Models for Solving Subjective Tasks	Xiaolong Wang et.al.	2501.09265	null
2025-01-16	Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition	Takaaki Hori et.al.	2501.09258	null
2025-01-16	Clone-Robust AI Alignment	Ariel D. Procaccia et.al.	2501.09254	null
2025-01-16	Split Fine-Tuning for Large Language Models in Wireless Networks	Songge Zhang et.al.	2501.09237	null
2025-01-16	Foundations of Large Language Models	Tong Xiao et.al.	2501.09223	null
2025-01-16	Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs	Sanchit Sinha et.al.	2501.09221	null
2025-01-16	A Simple Graph Contrastive Learning Framework for Short Text Classification	Yonghao Liu et.al.	2501.09219	link
2025-01-16	Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics	Yuanyuan Wei et.al.	2501.09218	null
2025-01-16	Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning	Yonghao Liu et.al.	2501.09214	link
2025-01-16	FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training	Hongzhou Yu et.al.	2501.09213	link
2025-01-15	Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures	Pengru Deng et.al.	2501.09203	null
2025-01-15	Towards Semantics Lifting for Scientific Computing: A Case Study on FFT	Naifeng Zhang et.al.	2501.09201	null
2025-01-15	Guiding Retrieval using LLM-based Listwise Rankers	Mandeep Rathee et.al.	2501.09186	link
2025-01-15	The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching	Yevhen Kostiuk et.al.	2501.09164	null
2025-01-15	Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability	Stephanie L. Day et.al.	2501.09158	null
2025-01-15	Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History	Yevhen Kostiuk et.al.	2501.09154	null
2025-01-15	Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation	Xingxin He et.al.	2501.09138	null
2025-01-15	Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG	Aditi Singh et.al.	2501.09136	link
2025-01-15	HAFix: History-Augmented Large Language Models for Bug Fixing	Yu Shi et.al.	2501.09135	link
2025-01-15	Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing	Eshaan Tanwar et.al.	2501.09127	link
2025-01-15	Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment	Conrad Borchers et.al.	2501.09126	null
2025-01-15	Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach	Alireza Ghaffari et.al.	2501.09107	null
2025-01-15	Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites	Hans W. A. Hanley et.al.	2501.09102	link
2025-01-15	Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative	Yuqian Sun et.al.	2501.09099	null
2025-01-15	SteLLA: A Structured Grading System Using LLMs with RAG	Hefei Qiu et.al.	2501.09092	null
2025-01-15	Generative diffusion model with inverse renormalization group flows	Kanta Masuki et.al.	2501.09064	link
2025-01-15	Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition	Sneheel Sarangi et.al.	2501.09056	link
2025-01-15	How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias	Tosin Fadahunsi et.al.	2501.09014	link
2025-01-15	Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians	Ishan Amin et.al.	2501.09009	link
2025-01-15	Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails	Shaona Ghosh et.al.	2501.09004	null
2025-01-15	Vision Foundation Models for Computed Tomography	Suraj Pai et.al.	2501.09001	null
2025-01-15	CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks	Krit Tangsongcharoen et.al.	2501.08998	link
2025-01-15	VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science	Youssef Abdalla et.al.	2501.08995	link
2025-01-15	CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities	Haozhe Xie et.al.	2501.08983	link
2025-01-15	Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models	Emma Croxford et.al.	2501.08977	null
2025-01-15	Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models	Karukriti Kaushik Ghosh et.al.	2501.08974	null
2025-01-15	Analyzing the Ethical Logic of Six Large Language Models	W. Russell Neuman et.al.	2501.08951	null
2025-01-15	Applying General Turn-taking Models to Conversational Human-Robot Interaction	Gabriel Skantze et.al.	2501.08946	null
2025-01-15	Disentangling Exploration of Large Language Models by Optimal Exploitation	Tim Grams et.al.	2501.08925	null
2025-01-15	GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge	Liam Dugan et.al.	2501.08913	link
2025-01-15	Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning	Qinyu Ma et.al.	2501.08897	link
2025-01-15	Connecting SPDE to SGMs	Junsu Seo et.al.	2501.08877	null
2025-01-15	Exploring Task-Level Optimal Prompts for Visual In-Context Learning	Yan Zhu et.al.	2501.08841	null
2025-01-15	How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering	Christoph Treude et.al.	2501.08774	null
2025-01-15	Admitting Ignorance Helps the Video Question Answering Models to Answer	Haopeng Li et.al.	2501.08771	null
2025-01-15	Enhanced Large Language Models for Effective Screening of Depression and Anxiety	June M. Liu et.al.	2501.08769	null
2025-01-15	Few-Shot Learner Generalizes Across AI-Generated Image Detection	Shiyu Wu et.al.	2501.08763	null
2025-01-15	Leveraging LLM Agents for Translating Network Configurations	Yunze Wei et.al.	2501.08760	null
2025-01-15	The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities	Irina Bigoulaeva et.al.	2501.08716	link
2025-01-15	Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching	Chuangtao Ma et.al.	2501.08686	link
2025-01-15	RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency	Siqi Li et.al.	2501.08682	null
2025-01-15	Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery	Zeqin Liao et.al.	2501.08670	null
2025-01-15	MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities	Savya Khosla et.al.	2501.08648	null
2025-01-15	Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations	Kaiyuan Zheng et.al.	2501.08641	null
2025-01-15	SWSC: Shared Weight for Similar Channel in LLM	Binrui Zeng et.al.	2501.08631	null
2025-01-15	Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models	Aruna Sankaranarayanan et.al.	2501.08618	link
2025-01-15	RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation	Kaiqu Liang et.al.	2501.08617	null
2025-01-15	Assessing the Alignment of FOL Closeness Metrics with Human Judgement	Ramya Keerthy Thatikonda et.al.	2501.08613	link
2025-01-15	Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design	Zhi Zheng et.al.	2501.08603	link
2025-01-15	AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL	Tyler Stennett et.al.	2501.08600	null
2025-01-15	LlamaRestTest: Effective REST API Testing with Small Language Models	Myeongsoo Kim et.al.	2501.08598	null
2025-01-15	Sound Scene Synthesis at the DCASE 2024 Challenge	Mathieu Lagrange et.al.	2501.08587	null
2025-01-15	LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model	Yuxuan Hu et.al.	2501.08582	null
2025-01-15	Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation	Jiaqi Huang et.al.	2501.08580	link
2025-01-15	Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms	Kewei Li et.al.	2501.08570	link
2025-01-15	Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications	Jin Chen et.al.	2501.08563	link
2025-01-15	LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation	Yiran Tao et.al.	2501.08558	null
2025-01-15	The Devil is in Temporal Token: High Quality Video Reasoning Segmentation	Sitong Gong et.al.	2501.08549	null
2025-01-15	Comprehensive Subjective and Objective Evaluation Method for Text-generated Video	Zelu Qi et.al.	2501.08545	null
2025-01-15	Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation	Jiaxin Guo et.al.	2501.08523	null
2025-01-14	Quantifying the Importance of Data Alignment in Downstream Model Performance	Krrish Chawla et.al.	2501.08496	null
2025-01-14	Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition	Md Meem Hossain et.al.	2501.08471	null
2025-01-14	Selective Attention Merging for low resource tasks: A case study of Child ASR	Natarajan Balaji Shankar et.al.	2501.08468	link
2025-01-14	Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin	Joao Carmo de Almeida Neto et.al.	2501.08464	null
2025-01-14	Large Language Models For Text Classification: Case Study And Comprehensive Review	Arina Kostina et.al.	2501.08457	null
2025-01-14	Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack	Sagiv Antebi et.al.	2501.08454	null
2025-01-14	Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies	Ajwad Abrar et.al.	2501.08441	null
2025-01-14	SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models	Anurag Kumar et.al.	2501.08421	null
2025-01-14	Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation	Seyed Mehdi Rakhtala et.al.	2501.08420	null
2025-01-14	Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data	Jiaxing Qiu et.al.	2501.08413	link
2025-01-14	OptiChat: Bridging Optimization Models and Practitioners with Large Language Models	Hao Chen et.al.	2501.08406	link
2025-01-14	Towards Best Practices for Open Datasets for LLM Training	Stefan Baack et.al.	2501.08365	null
2025-01-14	Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise	Ryan Burgert et.al.	2501.08331	link
2025-01-14	PokerBench: Training Large Language Models to become Professional Poker Players	Richard Zhuang et.al.	2501.08328	link
2025-01-14	Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks	Miran Heo et.al.	2501.08326	null
2025-01-14	ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations	Ziyuan Huang et.al.	2501.08324	null
2025-01-14	Exploring Robustness of Multilingual LLMs on Real-World Noisy Data	Amirhossein Aliakbarzadeh et.al.	2501.08322	link
2025-01-14	Enhancing Automated Interpretability with Output-Centric Feature Descriptions	Yoav Gur-Arieh et.al.	2501.08319	link
2025-01-14	MiniMax-01: Scaling Foundation Models with Lightning Attention	MiniMax et.al.	2501.08313	null
2025-01-14	HALoGEN: Fantastic LLM Hallucinations and Where to Find Them	Abhilasha Ravichander et.al.	2501.08292	null
2025-01-14	LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding	Hongyu Li et.al.	2501.08282	link
2025-01-14	Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing	Pulkit Arora et.al.	2501.08276	null
2025-01-14	Addressing the sustainable AI trilemma: a case study on LLM agents and RAG	Hui Wu et.al.	2501.08262	null
2025-01-14	Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models	Yifu Qiu et.al.	2501.08248	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset	Jiawei Du et.al.	2501.08238	null
2025-01-14	Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings	Paul Joe Maliakel et.al.	2501.08219	null
2025-01-14	ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems	Mohita Chowdhury et.al.	2501.08208	null
2025-01-14	ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving	Zain Ul Abedin et.al.	2501.08203	null
2025-01-14	CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation	Jinjun Peng et.al.	2501.08200	link
2025-01-14	OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training	Yijiong Yu et.al.	2501.08197	link
2025-01-14	PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving	Ahmet Caner Yüzügüler et.al.	2501.08192	null
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-15	A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following	Yin Fang et.al.	2501.08187	link
2025-01-14	Potential and Perils of Large Language Models as Judges of Unstructured Textual Data	Rewina Bedemariam et.al.	2501.08167	null
2025-01-14	I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution	Soohyeon Choi et.al.	2501.08165	null
2025-01-14	Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data	Phai Vu Dinh et.al.	2501.08149	null
2025-01-14	Refusal Behavior in Large Language Models: A Nonlinear Perspective	Fabian Hildebrandt et.al.	2501.08145	link
2025-01-14	Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying	Jonathan Lyhs et.al.	2501.08142	null
2025-01-14	Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2	Seamie Hayes et.al.	2501.08118	null
2025-01-15	Consistency of Responses and Continuations Generated by Large Language Models on Social Media	Wenlu Fan et.al.	2501.08102	null
2025-01-14	Hierarchical Autoscaling for Large Language Model Serving with Chiron	Archit Patke et.al.	2501.08090	null
2025-01-14	Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving	Nert Keser et.al.	2501.08083	null
2025-01-14	CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning	Guoliang He et.al.	2501.08071	link
2025-01-14	A Roadmap to Guide the Integration of LLMs in Hierarchical Planning	Israel Puerta-Merino et.al.	2501.08068	null
2025-01-14	Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT	Awritrojit Banerjee et.al.	2501.08053	null
2025-01-14	TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning	Yao Liang et.al.	2501.08008	null
2025-01-14	LLM-Ehnanced Holonic Architecture for Ad-Hoc Scalable SoS	Muhammad Ashfaq et.al.	2501.07992	null
2025-01-14	Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness	Jiaxing Zhao et.al.	2501.07978	null
2025-01-14	Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models	Yifang Xu et.al.	2501.07972	null
2025-01-14	Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning	Jiaqi Hua et.al.	2501.07959	link
2025-01-14	AI Guide Dog: Egocentric Path Prediction on Smartphone	Aishwarya Jadhav et.al.	2501.07957	null
2025-01-14	Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations	Waqar Hussain et.al.	2501.07931	null
2025-01-14	Gandalf the Red: Adaptive Security for LLMs	Niklas Pfister et.al.	2501.07927	link
2025-01-14	VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models	Hui Kuurila-Zhang et.al.	2501.07922	link
2025-01-14	Large Language Model Interface for Home Energy Management Systems	François Michelon et.al.	2501.07919	null
2025-01-14	Bridge-SR: Schrödinger Bridge for Efficient SR	Chang Li et.al.	2501.07897	null
2025-01-14	Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs	Shuai Wang et.al.	2501.07892	null
2025-01-14	ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding	Zhongxiang Sun et.al.	2501.07861	null
2025-01-14	Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques	Shobhit Ratan et.al.	2501.07853	null
2025-01-14	Unveiling Provider Bias in Large Language Models for Code Generation	Xiaoyu Zhang et.al.	2501.07849	null
2025-01-14	Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning	Haoyu Han et.al.	2501.07845	null
2025-01-14	A Driver Advisory System Based on Large Language Model for High-speed Train	Y. C. Luo et.al.	2501.07837	null
2025-01-14	Flow: A Modular Approach to Automated Agentic Workflow Generation	Boye Niu et.al.	2501.07834	null
2025-01-14	Real-time Verification and Refinement of Language Model Text Generation	Joonho Ko et.al.	2501.07824	null
2025-01-14	3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding	Haomiao Xiong et.al.	2501.07819	link
2025-01-14	A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models	Kaustubh D. Dhole et.al.	2501.07818	null
2025-01-14	Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models	Dhruv Dhamani et.al.	2501.07815	null
2025-01-14	Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering	Feijie Wu et.al.	2501.07813	null
2025-01-14	CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation	Ruwei Pan et.al.	2501.07811	null
2025-01-14	Visual Language Models as Operator Agents in the Space Domain	Alejandro Carrasco et.al.	2501.07802	null
2025-01-14	Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding	Zhaokai Wang et.al.	2501.07783	link
2025-01-14	Symmetry-Aware Generative Modeling through Learned Canonicalization	Kusha Sareen et.al.	2501.07773	null
2025-01-14	Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey	Bingchen Liu et.al.	2501.07766	null
2025-01-14	On the Statistical Capacity of Deep Generative Models	Edric Tam et.al.	2501.07763	link
2025-01-13	Advancing Student Writing Through Automated Syntax Feedback	Kamyar Zeinalipour et.al.	2501.07740	null
2025-01-13	Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens	Dongwon Kim et.al.	2501.07730	null
2025-01-13	LLMic: Romanian Foundation Language Model	Vlad-Andrei Bădoiu et.al.	2501.07721	null
2025-01-13	CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory	Haokun Zhao et.al.	2501.07674	null
2025-01-13	Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning	Karishma Thakrar et.al.	2501.07663	null
2025-01-13	Large Language Models for Interpretable Mental Health Diagnosis	Brian Hyeongseok Kim et.al.	2501.07653	null
2025-01-13	BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations	Weixi Feng et.al.	2501.07647	null
2025-01-13	GPT as a Monte Carlo Language Tree: A Probabilistic Perspective	Kun-Peng Ning et.al.	2501.07641	null
2025-01-13	SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models	Fabien Bernier et.al.	2501.07639	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	Imagine while Reasoning in Space: Multimodal Visualization-of-Thought	Chengzu Li et.al.	2501.07542	null
2025-01-13	ML Mule: Mobile-Driven Context-Aware Collaborative Learning	Haoxiang Yu et.al.	2501.07536	null
2025-01-13	Investigating Large Language Models in Inferring Personality Traits from User Conversations	Jianfeng Zhu et.al.	2501.07532	null
2025-01-13	RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment	Difei Gu et.al.	2501.07525	link
2025-01-13	Parallel Key-Value Cache Fusion for Position Invariant RAG	Philhoon Oh et.al.	2501.07523	null
2025-01-13	Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards	Yangsibo Huang et.al.	2501.07493	null
2025-01-13	TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models	Thales Sales Almeida et.al.	2501.07482	null
2025-01-13	A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities	Yihao Liu et.al.	2501.07468	null
2025-01-13	Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI	Rolf Pfister et.al.	2501.07458	null
2025-01-13	Enhancing LLM's Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection	Xin Yin et.al.	2501.07425	null
2025-01-13	Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion	Lala Shakti Swarup Ray et.al.	2501.07408	null
2025-01-13	OCORD: Open-Campus Object Removal Dataset	Shuo Zhang et.al.	2501.07397	null
2025-01-13	Simulating the Hubbard Model with Equivariant Normalizing Flows	Dominic Schuh et.al.	2501.07371	null
2025-01-13	Emergent effects of scaling on the functional hierarchies within large language models	Paul C. Bogdan et.al.	2501.07359	null
2025-01-13	Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring	Buse Sibel Korkmaz et.al.	2501.07324	link
2025-01-13	FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering	Erik Henriksson et.al.	2501.07314	link
2025-01-13	The Lessons of Developing Process Reward Models in Mathematical Reasoning	Zhenru Zhang et.al.	2501.07301	null
2025-01-13	GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction	Oleg Kobzarev et.al.	2501.07295	null
2025-01-13	LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks	Zan-Kai Chong et.al.	2501.07288	null
2025-01-13	Lifelong Learning of Large Language Model based Agents: A Roadmap	Junhao Zheng et.al.	2501.07278	link
2025-01-13	Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation	Amir Sartipi et.al.	2501.07276	null
2025-01-13	Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics	Wonduk Seo et.al.	2501.07267	null
2025-01-13	Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion	Li Liang et.al.	2501.07260	link
2025-01-13	EdgeTAM: On-Device Track Anything Model	Chong Zhou et.al.	2501.07256	null
2025-01-13	Large Language Models: New Opportunities for Access to Science	Jutta Schnabel et.al.	2501.07250	null
2025-01-13	Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training	Ziqing Wen et.al.	2501.07237	link
2025-01-13	Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction	Qiaoqiao Ren et.al.	2501.07224	link
2025-01-13	Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing	Laifa Tao et.al.	2501.07191	null
2025-01-13	Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study	Huashan Chen et.al.	2501.07165	null
2025-01-13	AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model	Bangchen Yin et.al.	2501.07155	link
2025-01-13	LLM360 K2: Scaling Up 360-Open-Source Large Language Models	Zhengzhong Liu et.al.	2501.07124	null
2025-01-13	How GPT learns layer by layer	Jason Du et.al.	2501.07108	link
2025-01-13	ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training	Jiayang Wu et.al.	2501.07078	link
2025-01-13	D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation	Zhejun Zhang et.al.	2501.07077	link
2025-01-13	Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values	Jing Yao et.al.	2501.07071	null
2025-01-13	Enhancing Image Generation Fidelity via Progressive Prompts	Zhen Xiong et.al.	2501.07070	link
2025-01-13	Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities	ZeKe Xiao et.al.	2501.07058	null
2025-01-13	SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation	Yee-Fan Tan et.al.	2501.07055	null
2025-01-13	PoAct: Policy and Action Dual-Control Agent for Generalized Applications	Guozhi Yuan et.al.	2501.07054	null
2025-01-13	ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction	Yan Zhang et.al.	2501.07051	link
2025-01-13	Unveiling the Potential of Text in High-Dimensional Time Series Forecasting	Xin Zhou et.al.	2501.07048	link
2025-01-13	Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis	Luwei Zeng et.al.	2501.07034	null
2025-01-13	A Proposed Large Language Model-Based Smart Search for Archive System	Ha Dung Nguyen et.al.	2501.07024	null
2025-01-13	Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps	Henry Li et.al.	2501.06999	link
2025-01-13	LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models	Mozhgan Nasr Azadani et.al.	2501.06986	link
2025-01-13	Combining LLM decision and RL action selection to improve RL policy for adaptive interventions	Karine Karine et.al.	2501.06980	null
2025-01-12	How is Google using AI for internal code migrations?	Stoyan Nikolov et.al.	2501.06972	null
2025-01-12	Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives	Xinyao Ma et.al.	2501.06964	null
2025-01-12	Comparison of Autoencoders for tokenization of ASL datasets	Vouk Praun-Petrovic et.al.	2501.06942	null
2025-01-12	Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy	Evgeny Ugolkov et.al.	2501.06939	link
2025-01-12	Harnessing Large Language Models for Disaster Management: A Survey	Zhenyu Lei et.al.	2501.06932	null
2025-01-12	Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories	Faaiq Waqar et.al.	2501.06921	null
2025-01-12	Risk-Averse Finetuning of Large Language Models	Sapana Chaudhary et.al.	2501.06911	link
2025-01-12	Deep Learning and Foundation Models for Weather Prediction: A Survey	Jimeng Shi et.al.	2501.06907	null
2025-01-12	A Foundational Generative Model for Breast Ultrasound Image Analysis	Haojun Yu et.al.	2501.06869	null
2025-01-12	Transfer Learning of Tabular Data by Finetuning Large Language Models	Shourav B. Rabbani et.al.	2501.06863	null
2025-01-12	A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context	Noureldin Zahran et.al.	2501.06859	null
2025-01-12	SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training	Tianjin Huang et.al.	2501.06842	link
2025-01-12	An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering	Zaber Al Hassan Ayon et.al.	2501.06837	null
2025-01-12	X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding	Wenqi Zhou et.al.	2501.06835	null
2025-01-12	LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents	Augusto Gonzalez-Bonorino et.al.	2501.06834	link
2025-01-12	GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing	Ruizhe Ou et.al.	2501.06828	null
2025-01-12	Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification	Shijing Chen et.al.	2501.06827	null
2025-01-12	Event Argument Extraction with Enriched Prompts	Chen Liang et.al.	2501.06825	link
2025-01-12	A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT	Yizhou Zhou et.al.	2501.06819	null
2025-01-12	RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models	Keyan Chen et.al.	2501.06809	link
2025-01-12	Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting	Yongshuo Zhu et.al.	2501.06808	null
2025-01-12	MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference	Wenxuan Zeng et.al.	2501.06807	null
2025-01-12	Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences	Liu Yu et.al.	2501.06795	null
2025-01-12	3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes	Mahmoud Ahmed et.al.	2501.06785	link
2025-01-12	Cost-Effective Robotic Handwriting System with AI Integration	Tianyi Huang et.al.	2501.06783	null
2025-01-12	Eliza: A Web3 friendly AI Agent Operating System	Shaw Walters et.al.	2501.06781	link
2025-01-12	VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning	Ji Soo Lee et.al.	2501.06761	link
2025-01-12	Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation	Shunfan Zheng et.al.	2501.06741	null
2025-01-12	ZOQO: Zero-Order Quantized Optimization	Noga Bar et.al.	2501.06736	null
2025-01-12	Better Prompt Compression Without Multi-Layer Perceptrons	Edouardo Honig et.al.	2501.06730	null
2025-01-12	Measuring the Robustness of Reference-Free Dialogue Evaluation Systems	Justin Vasselli et.al.	2501.06728	link
2025-01-12	Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G	Zhiyan Liu et.al.	2501.06726	null
2025-01-12	DRDT3: Diffusion-Refined Decision Test-Time Training Model	Xingshuai Huang et.al.	2501.06718	null
2025-01-12	ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian	Mykyta Syromiatnikov et.al.	2501.06715	link
2025-01-12	Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management	Liu Qianli et.al.	2501.06709	null
2025-01-12	Evaluating Sample Utility for Data Selection by Mimicking Model Weights	Tzu-Heng Huang et.al.	2501.06708	null
2025-01-12	AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds	Yinfang Chen et.al.	2501.06706	null
2025-01-12	Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese	Jie Yang et.al.	2501.06704	null
2025-01-12	Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users' Questions	Aidan Hogan et.al.	2501.06699	null
2025-01-12	DVM: Towards Controllable LLM Agents in Social Deduction Games	Zheng Zhang et.al.	2501.06695	null
2025-01-12	TAPO: Task-Referenced Adaptation for Prompt Optimization	Wenxin Luo et.al.	2501.06689	link
2025-01-12	Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning	Xiangen Hu et.al.	2501.06682	null
2025-01-12	Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving	Haoxiang Gao et.al.	2501.06680	null
2025-01-11	Challenging reaction prediction models to generalize to novel chemistry	John Bradshaw et.al.	2501.06669	link
2025-01-11	Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training	Sanjit Kakarla et.al.	2501.06658	link
2025-01-11	FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings	Tong Liu et.al.	2501.06645	null
2025-01-11	Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models	Veronika Smilga et.al.	2501.06638	link
2025-01-11	Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach	Mohammed Maree et.al.	2501.06628	null
2025-01-11	Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks	Amr Almorsi et.al.	2501.06625	null
2025-01-11	Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks	Xuanhao Luo et.al.	2501.06604	null
2025-01-11	ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation	Xuanle Zhao et.al.	2501.06598	link
2025-01-11	ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning	Xiangru Tang et.al.	2501.06590	link
2025-01-11	Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping	Muru Zhang et.al.	2501.06589	link
2025-01-10	LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs	Omkar Thawakar et.al.	2501.06186	link
2025-01-10	PEACE: Empowering Geologic Map Holistic Understanding with MLLMs	Yangyu Huang et.al.	2501.06184	null
2025-01-10	VideoAuteur: Towards Long Narrative Video Generation	Junfei Xiao et.al.	2501.06173	null
2025-01-10	GenMol: A Drug Discovery Generalist with Discrete Diffusion	Seul Lee et.al.	2501.06158	null
2025-01-10	Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories	Gerd Kortemeyer et.al.	2501.06143	null
2025-01-10	Supervision policies can shape long-term risk management in general-purpose AI models	Manuel Cebrian et.al.	2501.06137	link
2025-01-10	Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI	Yuya Asano et.al.	2501.06129	null
2025-01-10	Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding	Fabian David Schmidt et.al.	2501.06117	link
2025-01-10	From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy	Elham Aghakhani et.al.	2501.06101	null
2025-01-10	Photokinetics of Photothermal Reactions	Mounir Maafi et.al.	2501.06057	null
2025-01-10	AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery	Johann Wenckstern et.al.	2501.06039	link
2025-01-10	Addressing speaker gender bias in large scale speech translation systems	Shubham Bansal et.al.	2501.05989	null
2025-01-10	Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing	Eklavya Sarkar et.al.	2501.05987	link
2025-01-10	Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys	Divya Mani Adhikari et.al.	2501.05985	null
2025-01-10	Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea	Eunjung Cho et.al.	2501.05981	null
2025-01-10	Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory	Yunmeng Shu et.al.	2501.05965	null
2025-01-10	Effective faking of verbal deception detection with target-aligned adversarial attacks	Bennett Kleinberg et.al.	2501.05962	null
2025-01-10	Reusable specimen-level inference in computational pathology	Jakub R. Kaczmarzyk et.al.	2501.05945	link
2025-01-10	DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information	Yongfan Lai et.al.	2501.05932	link
2025-01-10	LLMs Reproduce Stereotypes of Sexual and Gender Minorities	Ruby Ostrow et.al.	2501.05926	null
2025-01-10	Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction	Petraq Nako et.al.	2501.05925	null
2025-01-10	Valley2: Exploring Multimodal Models with Scalable Vision-Language Design	Ziheng Wu et.al.	2501.05901	link
2025-01-10	Prompt engineering and its implications on the energy consumption of Large Language Models	Riccardo Rubei et.al.	2501.05899	link
2025-01-10	Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs	Bianca Raimondi et.al.	2501.05891	link
2025-01-10	Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs	Dabing Cheng et.al.	2501.05884	null
2025-01-10	VideoRAG: Retrieval-Augmented Generation over Video Corpus	Soyeong Jeong et.al.	2501.05874	null
2025-01-10	ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability	Antonin Poché et.al.	2501.05855	link
2025-01-10	Understanding Impact of Human Feedback via Influence Functions	Taywon Min et.al.	2501.05790	link
2025-01-10	Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models	You Li et.al.	2501.05767	null
2025-01-10	Controlling Large Language Models Through Concept Activation Vectors	Hanyu Zhang et.al.	2501.05764	null
2025-01-10	StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation	Shangjin Zhai et.al.	2501.05763	null
2025-01-10	CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech	Madhurananda Pahar et.al.	2501.05755	null
2025-01-10	Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models	Sungjae Lee et.al.	2501.05752	null
2025-01-10	TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos	Korawat Charoenpitaks et.al.	2501.05733	link
2025-01-10	Enabling Scalable Oversight via Self-Evolving Critic	Zhengyang Tang et.al.	2501.05727	null
2025-01-10	I Can't Share Code, but I need Translation -- An Empirical Study on Code Translation through Federated LLM	Jahnavi Kumar et.al.	2501.05724	null
2025-01-10	How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond	Chen Huang et.al.	2501.05714	null
2025-01-10	Multi-Step Reasoning in Korean and the Emergent Mirage	Guijin Son et.al.	2501.05712	null
2025-01-10	EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model	Yi He et.al.	2501.05710	null
2025-01-10	Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains	Vighnesh Subramaniam et.al.	2501.05707	null
2025-01-10	Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness	Audrey Salmon et.al.	2501.05706	null
2025-01-10	Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection	Feiyi Chen et.al.	2501.05675	null
2025-01-10	Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration	Zuyuan Zhang et.al.	2501.05673	null
2025-01-10	Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models	Zheqi Lv et.al.	2501.05662	null
2025-01-10	Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation	Zheqi Lv et.al.	2501.05647	null
2025-01-10	Iconicity in Large Language Models	Anna Marklová et.al.	2501.05643	null
2025-01-10	HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection	Anant Mehta et.al.	2501.05631	link
2025-01-10	The Impact of Model Scaling on Seen and Unseen Language Performance	Rhitabrat Pokharel et.al.	2501.05629	null
2025-01-09	Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study	Zhenyu Qi et.al.	2501.05625	null
2025-01-09	Exploring Large Language Models for Translating Romanian Computational Problems into English	Adrian Marius Dumitran et.al.	2501.05601	null
2025-01-09	Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics	Gert Aarts et.al.	2501.05580	null
2025-01-09	Exploring Large Language Models (LLMs) through interactive Python activities	Eugenio Tufino et.al.	2501.05577	link
2025-01-09	LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts	Yuri Facanha Bezerra et.al.	2501.05554	link
2025-01-09	The dynamics of meaning through time: Assessment of Large Language Models	Mohamed Taher Alrefaie et.al.	2501.05552	null
2025-01-09	Infecting Generative AI With Viruses	David Noever et.al.	2501.05542	null
2025-01-09	NSChat: A Chatbot System To Rule Them All	Zenon Lamprou et.al.	2501.05541	null
2025-01-09	ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding	Xingyu Fu et.al.	2501.05452	null
2025-01-09	Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Yifan Yu et.al.	2501.05446	link
2025-01-09	Consistent Flow Distillation for Text-to-3D Generation	Runjie Yan et.al.	2501.05445	null
2025-01-09	Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark	Yunzhuo Hao et.al.	2501.05444	null
2025-01-09	A survey of textual cyber abuse detection using cutting-edge language models and large language models	Jose A. Diaz-Garcia et.al.	2501.05443	null
2025-01-09	Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation	Xuyi Meng et.al.	2501.05427	null
2025-01-09	Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers	Jerry Chongyi Hu et.al.	2501.05423	null
2025-01-09	Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation	Darius Petermann et.al.	2501.05413	null
2025-01-10	Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics	Maximilian Alber et.al.	2501.05409	null
2025-01-09	TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts	Yu-Hao Huang et.al.	2501.05403	null
2025-01-09	Mechanistic understanding and validation of large AI models with SemanticLens	Maximilian Dreyer et.al.	2501.05398	null
2025-01-09	FairCode: Evaluating Social Bias of LLMs in Code Generation	Yongkang Du et.al.	2501.05396	link
2025-01-09	Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models	Kristian G. Barman et.al.	2501.05382	null
2025-01-09	Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance	Dimitrios Gerogiannis et.al.	2501.05379	null
2025-01-09	Accelerated Diffusion Models via Speculative Sampling	Valentin De Bortoli et.al.	2501.05370	null
2025-01-09	Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction	Hantao Lou et.al.	2501.05336	link
2025-01-09	"What's Happening"- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles	Xuewen Luo et.al.	2501.05322	null
2025-01-09	Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning	Nora Gourmelon et.al.	2501.05281	link
2025-01-09	CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models	Fabian Hörst et.al.	2501.05269	link
2025-01-09	Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal	Wanli Ma et.al.	2501.05265	null
2025-01-09	CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models	Yewei Song et.al.	2501.05255	null
2025-01-09	From Scientific Texts to Verifiable Code: Automating the Process with Transformers	Changjie Wang et.al.	2501.05252	null
2025-01-09	RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models	Peizhuo Lv et.al.	2501.05249	null
2025-01-09	Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning	Laura Puccioni et.al.	2501.05248	null
2025-01-09	Online Prompt and Solver Selection for Program Synthesis	Yixuan Li et.al.	2501.05247	null
2025-01-09	Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs	Artem Fedorchenko et.al.	2501.05234	null
2025-01-09	Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection	Pei-Kang Lee et.al.	2501.05228	null
2025-01-09	Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes	Ludwic Leonard et.al.	2501.05226	null
2025-01-09	Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond	Tomas Goldsack et.al.	2501.05224	null
2025-01-09	A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education	Ziqing Li et.al.	2501.05220	null
2025-01-09	Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration	Xuyang Liu et.al.	2501.05179	link
2025-01-09	Emergence of human-like polarization among large language model agents	Jinghua Piao et.al.	2501.05171	null
2025-01-09	Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering	Matteo Esposito et.al.	2501.05165	null
2025-01-09	Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier	Yufei Shang et.al.	2501.05155	null
2025-01-09	DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving	Xuran Zheng et.al.	2501.05081	null
2025-01-09	Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization	Harshith Manjunath et.al.	2501.05079	null
2025-01-09	Analyzing Memorization in Large Language Models through the Lens of Model Attribution	Tarun Ram Menta et.al.	2501.05078	link
2025-01-09	A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model	Shuo Tong et.al.	2501.05075	null
2025-01-09	Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning	Huabin Liu et.al.	2501.05069	null
2025-01-09	LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding	Jiaxing Zhao et.al.	2501.05067	null
2025-01-09	Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators	Leonard Lupin-Jimenez et.al.	2501.05058	null
2025-01-09	LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models	Zengqi Peng et.al.	2501.05057	null
2025-01-09	On the Generalizability of Transformer Models to Code Completions of Different Lengths	Nathan Cooper et.al.	2501.05051	null
2025-01-09	SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution	Chengxing Xie et.al.	2501.05040	link
2025-01-09	Enhancing Human-Like Responses in Large Language Models	Ethem Yağız Çalık et.al.	2501.05032	null
2025-01-09	ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark	Ronghao Dang et.al.	2501.05031	link
2025-01-09	A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications	Ofir Marom et.al.	2501.05030	null
2025-01-09	TreeKV: Smooth Key-Value Cache Compression with Tree Structures	Ziwei He et.al.	2501.04987	null
2025-01-09	SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs	Muhammad Salman et.al.	2501.04985	null
2025-01-09	V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer	Hangzhou He et.al.	2501.04975	link
2025-01-09	Demystifying Domain-adaptive Post-training for Financial LLMs	Zixuan Ke et.al.	2501.04961	link
2025-01-09	Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments	Yifan Xu et.al.	2501.04947	null
2025-01-09	Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models	Qingyu Ren et.al.	2501.04945	link
2025-01-09	Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency	Shiji Zhao et.al.	2501.04931	null
2025-01-09	Investigating Numerical Translation with Large Language Models	Wei Tang et.al.	2501.04927	null
2025-01-09	FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching	Jun-Hak Yun et.al.	2501.04926	null
2025-01-09	HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers	Yiyao Yang et.al.	2501.04908	link
2025-01-09	JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis	Jun-Hyeok Cha et.al.	2501.04904	null
2025-01-09	ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries	Keke Huang et.al.	2501.04901	null
2025-01-09	SUGAR: Leveraging Contextual Confidence for Smarter Retrieval	Hanna Zubkova et.al.	2501.04899	null
2025-01-08	Leveraging Log Probabilities in Language Models to Forecast Future Events	Tommaso Soru et.al.	2501.04880	null
2025-01-08	Real-Time Textless Dialogue Generation	Long Mai et.al.	2501.04877	link
2025-01-08	Modelling complex proton transport phenomena -- Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields	Malte Grunert et.al.	2501.04876	null
2025-01-08	Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware	Brandon J Walton et.al.	2501.04848	null
2025-01-08	Do Code LLMs Understand Design Patterns?	Zhenyu Pan et.al.	2501.04835	null
2025-01-08	On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability	Andreas Vogelsang et.al.	2501.04810	null
2025-01-08	IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX	Erik Recio-Armengol et.al.	2501.04776	link
2025-01-08	Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations	Kirandeep Kaur et.al.	2501.04762	null
2025-01-08	Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch	Phillip Richter et.al.	2501.04755	null
2025-01-08	EditAR: Unified Conditional Generation with Autoregressive Models	Jiteng Mu et.al.	2501.04699	null
2025-01-08	Re-ranking the Context for Multimodal Retrieval Augmented Generation	Matin Mortaheb et.al.	2501.04695	null
2025-01-08	SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images	Zixuan Huang et.al.	2501.04689	null
2025-01-08	URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics	Ruilin Luo et.al.	2501.04686	link
2025-01-08	Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations	Archita Srivastava et.al.	2501.04675	null
2025-01-08	Assessing Language Comprehension in Large Language Models Using Construction Grammar	Wesley Scivetti et.al.	2501.04661	null
2025-01-08	Multi-task retriever fine-tuning for domain-specific and efficient RAG	Patrice Béchard et.al.	2501.04652	null
2025-01-08	FlairGPT: Repurposing LLMs for Interior Designs	Gabrielle Littlefair et.al.	2501.04648	null
2025-01-08	Knowledge Retrieval Based on Generative AI	Te-Lun Yang et.al.	2501.04635	null
2025-01-08	"Can you be my mum?": Manipulating Social Robots in the Large Language Models Era	Giulio Antonio Abbo et.al.	2501.04633	null
2025-01-09	MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation	Daniele Molino et.al.	2501.04614	null
2025-01-08	Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning	Ivan Kankeu et.al.	2501.04591	link
2025-01-08	Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models	Miaoyang He et.al.	2501.04582	null
2025-01-08	InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection	Yuhang Liu et.al.	2501.04575	link
2025-01-09	OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis	Run Luo et.al.	2501.04561	link
2025-01-08	The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas?	Christopher Lazik et.al.	2501.04543	null
2025-01-08	Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time	Uri Berger et.al.	2501.04513	null
2025-01-08	CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection	Ruijun Feng et.al.	2501.04510	null
2025-01-08	Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction	Guofeng Yang et.al.	2501.04487	null
2025-01-08	When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages	Archchana Sindhujan et.al.	2501.04473	null
2025-01-08	Hidden Entity Detection from GitHub Leveraging Large Language Models	Lu Gan et.al.	2501.04455	link
2025-01-08	Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions	Doaa Mahmud et.al.	2501.04437	null
2025-01-08	Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions	Na Yan et.al.	2501.04436	null
2025-01-08	End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach	H. M. Shadman Tabib et.al.	2501.04425	null
2025-01-08	SEO: Stochastic Experience Optimization for Large Language Models	Jitao Xu et.al.	2501.04393	null
2025-01-08	iFADIT: Invertible Face Anonymization via Disentangled Identity Transform	Lin Yuan et.al.	2501.04390	null
2025-01-08	DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications	Feng Liu et.al.	2501.04366	link
2025-01-08	Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting	Dong-Hai Zhu et.al.	2501.04341	link
2025-01-09	Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models	Haonan Shi et.al.	2501.04323	null
2025-01-08	Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts	Preethi Seshadri et.al.	2501.04316	link
2025-01-08	RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation	Jun Liu et.al.	2501.04315	null
2025-01-08	Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models	Kunpeng Zhang et.al.	2501.04312	null
2025-01-08	LLM4SR: A Survey on Large Language Models for Scientific Research	Ziming Luo et.al.	2501.04306	link
2025-01-08	Multimodal Graph Constrastive Learning and Prompt for ChartQA	Yue Dai et.al.	2501.04303	null
2025-01-08	H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving	Siran Chen et.al.	2501.04302	null
2025-01-08	An Analysis of Model Robustness across Concurrent Distribution Shifts	Myeongho Jeon et.al.	2501.04288	null
2025-01-08	Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models	Bahman Torkamandi et.al.	2501.04286	null
2025-01-08	Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking	Tianqi Ren et.al.	2501.04285	null
2025-01-08	OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments	Yujie Tang et.al.	2501.04279	null
2025-01-08	Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering	Christophe Bajan et.al.	2501.04277	link
2025-01-08	Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation	Senwei Xie et.al.	2501.04268	null
2025-01-08	Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning	Lang Xu et.al.	2501.04266	null
2025-01-08	IOLBENCH: Benchmarking LLMs on Linguistic Reasoning	Satyam Goyal et.al.	2501.04249	link
2025-01-08	TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events	Jian-Hua Fang et.al.	2501.04247	null
2025-01-08	Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks	Rachel Longjohn et.al.	2501.04234	null
2025-01-07	Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation	Alireza Salemi et.al.	2501.04167	null
2025-01-07	AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots	Shaoyue Wen et.al.	2501.04156	link
2025-01-07	Multilingual Open QA on the MIA Shared Task	Navya Yarrabelly et.al.	2501.04153	null
2025-01-07	The angular momentum spiral of the Milky Way disc in Gaia	Rashid Yaaqib et.al.	2501.04095	null
2025-01-07	More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives	Xiaoqing Zhang et.al.	2501.04070	link
2025-01-07	ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono	Jingquan Wang et.al.	2501.04062	null
2025-01-07	LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	Lingdong Kong et.al.	2501.04005	null
2025-01-07	Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos	Haobo Yuan et.al.	2501.04001	link
2025-01-07	RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance	Matin Mortaheb et.al.	2501.03995	null
2025-01-07	Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance	Adil Rengim Cetingoz et.al.	2501.03993	null
2025-01-07	Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles	Yuxi Xia et.al.	2501.03991	null
2025-01-07	(De)-Indexing and the Right to be Forgotten	Salvatore Vilella et.al.	2501.03989	null
2025-01-07	VLM-driven Behavior Tree for Context-aware Task Planning	Naoki Wake et.al.	2501.03968	link
2025-01-07	Vision Language Models as Values Detectors	Giulio Antonio Abbo et.al.	2501.03957	null
2025-01-07	Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States	Jurgita Kapočiūtė-Dzikienė et.al.	2501.03952	null
2025-01-07	Synthetic Data Privacy Metrics	Amy Steier et.al.	2501.03941	null
2025-01-07	Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection	Pablo Miralles-González et.al.	2501.03940	null
2025-01-07	A precise asymptotic analysis of learning diffusion models: theory and insights	Hugo Cui et.al.	2501.03937	link
2025-01-07	Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study	Ramya Jonnala et.al.	2501.03904	null
2025-01-07	LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token	Shaolei Zhang et.al.	2501.03895	link
2025-01-07	AlphaPO -- Reward shape matters for LLM alignment	Aman Gupta et.al.	2501.03884	null
2025-01-07	CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds	Keonwoo Kim et.al.	2501.03879	null
2025-01-07	Progressive Document-level Text Simplification via Large Language Models	Dengzhao Fang et.al.	2501.03857	null
2025-01-07	MedFocusCLIP : Improving few shot classification in medical datasets using pixel wise attention	Aadya Arora et.al.	2501.03839	null
2025-01-07	Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging	Simon W. Penninga et.al.	2501.03825	null
2025-01-08	MADation: Face Morphing Attack Detection with Foundation Models	Eduarda Caldeira et.al.	2501.03800	link
2025-01-07	KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration	Chengyuan Li et.al.	2501.03786	null
2025-01-07	Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series	Yuxiao Hu et.al.	2501.03747	null
2025-01-07	Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein	Xiaotong Guo et.al.	2501.03722	null
2025-01-07	Motion-Aware Generative Frame Interpolation	Guozhen Zhang et.al.	2501.03699	null
2025-01-07	SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment	Yuchun Fan et.al.	2501.03681	link
2025-01-07	Effective and Efficient Mixed Precision Quantization of Speech Foundation Models	Haoning Xu et.al.	2501.03643	null
2025-01-07	CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems	Zhaonan Wu et.al.	2501.03626	link
2025-01-07	LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment	Gaoussou Youssouf Kebe et.al.	2501.03624	null
2025-01-07	Cosmos World Foundation Model Platform for Physical AI	NVIDIA et.al.	2501.03575	link
2025-01-07	From Code to Compliance: Assessing ChatGPT's Utility in Designing an Accessible Webpage -- A Case Study	Ammar Ahmed et.al.	2501.03572	null
2025-01-07	What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs	Muneera Bano et.al.	2501.03569	null
2025-01-07	Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities	Benedikt Reitemeyer et.al.	2501.03566	null
2025-01-07	Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis	Haoran Lai et.al.	2501.03565	null
2025-01-07	PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models	Lingzhi Yuan et.al.	2501.03544	null
2025-01-07	Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions	Weijieying Ren et.al.	2501.03540	null
2025-01-07	Deep Learning for Pathological Speech: A Survey	Shakeel A. Sheikh et.al.	2501.03536	null
2025-01-08	SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving	Xuewen Luo et.al.	2501.03535	null
2025-01-07	A generative approach for lensless imaging in low-light conditions	Ziyang Liu et.al.	2501.03511	null
2025-01-07	A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models	Shuyang Wang et.al.	2501.03508	null
2025-01-07	Textualize Visual Prompt for Image Editing via Diffusion Bridge	Pengcheng Xu et.al.	2501.03495	null
2025-01-07	Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment	Prashant Trivedi et.al.	2501.03486	null
2025-01-07	Reading with Intent -- Neutralizing Intent	Benjamin Reichman et.al.	2501.03475	null
2025-01-07	Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning	Chuang Niu et.al.	2501.03469	link
2025-01-07	MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems	Yannis Katsis et.al.	2501.03468	link
2025-01-07	ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation	Yu-Cheng Liu et.al.	2501.03462	null
2025-01-07	Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation	Xiao Wang et.al.	2501.03458	link
2025-01-07	CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering	Jialiang Chen et.al.	2501.03447	null
2025-01-07	LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models	Mohamad Fakih et.al.	2501.03446	null
2025-01-07	Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology	Sarah E. Finch et.al.	2501.03441	link
2025-01-06	SALT: Sales Autocompletion Linked Business Tables Dataset	Tassilo Klein et.al.	2501.03413	link
2025-01-06	BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations	Simone Giovannini et.al.	2501.03403	null
2025-01-06	DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes	Xuyang Wang et.al.	2501.03397	link
2025-01-06	Evolved Quantum Boltzmann Machines	Michele Minervini et.al.	2501.03367	null
2025-01-06	CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets	Tanay Agrawal et.al.	2501.03332	null
2025-01-06	LiLMaps: Learnable Implicit Language Maps	Evgenii Kruzhkov et.al.	2501.03304	null
2025-01-06	A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval	Shuo Tong et.al.	2501.03295	null
2025-01-06	Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model	Naibo Wang et.al.	2501.03292	null
2025-01-06	ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning	Pengwei Tang et.al.	2501.03291	null
2025-01-06	CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models	Zhenyu Xu et.al.	2501.03288	null
2025-01-06	BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning	Beichen Zhang et.al.	2501.03226	link
2025-01-06	Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text	Ayat Najjar et.al.	2501.03212	null
2025-01-06	Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity	Ayat A. Najjar et.al.	2501.03203	null
2025-01-06	CLIX: Cross-Lingual Explanations of Idiomatic Expressions	Aaron Gluck et.al.	2501.03191	null
2025-01-06	Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text	Ali Al-Lawati et.al.	2501.03166	link
2025-01-06	Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy	Risha Goel et.al.	2501.03153	link
2025-01-06	Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches	Alhassan Mumuni et.al.	2501.03151	null
2025-01-06	VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity	Yerong Li et.al.	2501.03139	null
2025-01-07	PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models	Mingyang Song et.al.	2501.03124	link
2025-01-06	CAT: Content-Adaptive Image Tokenization	Junhong Shen et.al.	2501.03120	null
2025-01-06	LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases	Dylan Bouchard et.al.	2501.03112	link
2025-01-06	Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling	Aseem Srivastava et.al.	2501.03088	null
2025-01-06	Retrieval-Augmented TLAPS Proof Generation with Large Language Models	Yuhao Zhou et.al.	2501.03073	null
2025-01-06	ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events	Duygu Sezen Islakoglu et.al.	2501.03040	null
2025-01-06	Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning	Zhen Li et.al.	2501.03035	null
2025-01-06	TransPixar: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	CALM: Curiosity-Driven Auditing for Large Language Models	Xiang Zheng et.al.	2501.02997	link
2025-01-06	Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation	Zhi Qu et.al.	2501.02979	link
2025-01-06	FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models	Zhuo Chen et.al.	2501.02968	null
2025-01-07	Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild	Wanpeng Hu et.al.	2501.02964	link
2025-01-07	SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild	Jiawei Liu et.al.	2501.02962	null
2025-01-06	The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features	Shi Bin Hoo et.al.	2501.02945	link
2025-01-07	Inhibition of bacterial growth by antibiotics	Barnabe Ledoux et.al.	2501.02944	null
2025-01-06	Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions	Jianhua Pei et.al.	2501.02928	null
2025-01-06	DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model	Hao Yu et.al.	2501.02901	link
2025-01-06	FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection	Guray Ozgur et.al.	2501.02892	link
2025-01-06	MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs	Hui Sun et.al.	2501.02885	null
2025-01-06	IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment	Yiming Zhang et.al.	2501.02869	null
2025-01-06	Large Language Models for Video Surveillance Applications	Ulindu De Silva et.al.	2501.02850	null
2025-01-06	Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification	Yubo Wang et.al.	2501.02844	null
2025-01-06	Foundations of GenIR	Qingyao Ai et.al.	2501.02842	null
2025-01-06	An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models	Dezhi Ran et.al.	2501.02829	null
2025-01-06	InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion	Zhaoyi Yan et.al.	2501.02795	null
2025-01-06	CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation	Yuanhong Chen et.al.	2501.02786	null
2025-01-06	GeAR: Generation Augmented Retrieval	Haoyu Liu et.al.	2501.02772	null
2025-01-06	Visual Large Language Models for Generalized and Specialized Applications	Yifan Li et.al.	2501.02765	link
2025-01-06	Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging?	Hongyi Miao et.al.	2501.02751	null
2025-01-06	Artificial Intelligence in Creative Industries: Advances Prior to 2025	Nantheera Anantrasirichai et.al.	2501.02725	null
2025-01-06	KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models	Zaiyi Zheng et.al.	2501.02711	null
2025-01-06	QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance	Binita Saha et.al.	2501.02702	null
2025-01-06	EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models	Andrés Villa et.al.	2501.02699	null
2025-01-05	GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Weikang Bian et.al.	2501.02690	null
2025-01-05	Decoding specialised feature neurons in LLMs with the final projection layer	Harry J Davies et.al.	2501.02688	null
2025-01-05	From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering	Wen-ran Li et.al.	2501.02680	null
2025-01-05	A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model	Shivaram Kalyanakrishnan et.al.	2501.02652	null
2025-01-05	Representation Learning of Lab Values via Masked AutoEncoder	David Restrepo et.al.	2501.02648	link
2025-01-05	Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense	Yang Ouyang et.al.	2501.02629	link
2025-01-05	Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets	Mahmoud Jahanshahi et.al.	2501.02628	null
2025-01-05	HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning	Saleh Ashkboos et.al.	2501.02625	null
2025-01-05	LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment	Yifei Liu et.al.	2501.02621	null
2025-01-05	TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms	Jovan Stojkovic et.al.	2501.02600	null
2025-01-05	LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations	Jiaping Wang et.al.	2501.02573	link
2025-01-05	Multi-LLM Collaborative Caption Generation in Scientific Documents	Jaeyoung Kim et.al.	2501.02552	link
2025-01-05	Transformers Simulate MLE for Sequence Generation in Bayesian Networks	Yuan Cao et.al.	2501.02547	null
2025-01-05	Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm	Ljubisa Bojic et.al.	2501.02532	null
2025-01-05	Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI	Ljubisa Bojic et.al.	2501.02531	null
2025-01-05	Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks	Leo Franklin et.al.	2501.02527	null
2025-01-05	Unified Guidance for Geometry-Conditioned Molecular Generation	Sirine Ayadi et.al.	2501.02526	null
2025-01-05	Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors	Minglin Chen et.al.	2501.02519	null
2025-01-05	CHAIR-Classifier of Hallucination as Improver	Ao Sun et.al.	2501.02518	link
2025-01-05	ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use	Junjie Ye et.al.	2501.02506	null
2025-01-05	Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons	Sebastian Morel-Balbi et.al.	2501.02505	null
2025-01-05	ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling	Chaojie Mao et.al.	2501.02487	null
2025-01-05	LLMPC: Large Language Model Predictive Control	Gabriel Maher et.al.	2501.02486	link
2025-01-05	Decoding News Bias: Multi Bias Detection in News Articles	Bhushan Santosh Shah et.al.	2501.02482	null
2025-01-05	Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine	Yishen Liu et.al.	2501.02471	null
2025-01-05	Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera	Yuliang Guo et.al.	2501.02464	null
2025-01-05	Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications	Zhe Chen et.al.	2501.02460	null
2025-01-05	Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap	Hyunwoo Ko et.al.	2501.02448	null
2025-01-05	RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework	Kun Wang et.al.	2501.02446	null
2025-01-05	A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models	Yinpeng Cai et.al.	2501.02441	null
2025-01-05	Efficient Deployment of Large Language Models on Resource-constrained Devices	Zhiwei Yao et.al.	2501.02438	null
2025-01-05	FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance	Haicheng Wang et.al.	2501.02430	link
2025-01-05	GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems	Mehmet Deniz Türkmen et.al.	2501.02408	null
2025-01-04	Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities	Tara Radvand et.al.	2501.02406	null
2025-01-04	Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers	Markus J. Buehler et.al.	2501.02393	link
2025-01-04	Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations	Kangyu Zhu et.al.	2501.02385	null
2025-01-04	Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison	Tsz Kin Lam et.al.	2501.02370	null
2025-01-04	Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving	Sanghyun Park et.al.	2501.02348	null
2025-01-04	Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support	Florian Putz et.al.	2501.02346	null
2025-01-04	UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility	Yonglin Tian et.al.	2501.02341	link
2025-01-04	AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference	Zhuomin He et.al.	2501.02336	link
2025-01-04	Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications	Jodi M. Casabianca et.al.	2501.02334	null
2025-01-04	Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance	Marta Gentiloni-Silveri et.al.	2501.02298	null
2025-01-04	Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection	Yachao Zhao et.al.	2501.02295	null
2025-01-04	Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control	Yongjeong Oh et.al.	2501.02273	null
2025-01-04	What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph	Yutao Jiang et.al.	2501.02268	link
2025-01-04	Unsupervised Class Generation to Expand Semantic Segmentation Datasets	Javier Montalvo et.al.	2501.02264	null
2025-01-04	Financial Named Entity Recognition: How Far Can LLM Go?	Yi-Te Lu et.al.	2501.02237	link
2025-01-04	Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends	Camille Barboule et.al.	2501.02235	null
2025-01-04	Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection	S M Mostaq Hossain et.al.	2501.02229	null
2025-01-04	Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation	Shijie Wang et.al.	2501.02226	null
2025-01-04	Can ChatGPT implement finite element models for geotechnical engineering applications?	Taegu Kim et.al.	2501.02199	null
2025-01-04	EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks	Shixuan Liu et.al.	2501.02192	null
2025-01-04	On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing	Jianwei Wang et.al.	2501.02191	link
2025-01-04	The Application of Large Language Models in Recommendation Systems	Peiyang Yu et.al.	2501.02178	null
2025-01-04	The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit	Huixue Zhou et.al.	2501.02173	null
2025-01-04	Personalized Graph-Based Retrieval for Large Language Models	Steven Au et.al.	2501.02157	link
2025-01-04	Table as Thought: Exploring Structured Thoughts in LLM Reasoning	Zhenjie Sun et.al.	2501.02152	null
2025-01-04	Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN	Yanxi Chen et.al.	2501.02146	null
2025-01-03	VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction	Chaoyou Fu et.al.	2501.01957	link
2025-01-03	Metadata Conditioning Accelerates Language Model Pre-training	Tianyu Gao et.al.	2501.01956	link
2025-01-03	MADGEN -- Mass-Spec attends to De Novo Molecular generation	Yinkai Wang et.al.	2501.01950	null
2025-01-03	Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap	Weizhi Zhang et.al.	2501.01945	link
2025-01-03	Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models	Manh Duong Nguyen et.al.	2501.01932	link
2025-01-03	Virgo: A Preliminary Exploration on Reproducing o1-like MLLM	Yifan Du et.al.	2501.01904	link
2025-01-03	EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation	Siyuan Huang et.al.	2501.01895	null
2025-01-03	Turning Logic Against Itself : Probing Model Defenses Through Contrastive Questions	Rachneet Sachdeva et.al.	2501.01872	link
2025-01-03	Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification	Xiangxiang Dai et.al.	2501.01849	link
2025-01-03	MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning	Pu Yang et.al.	2501.01834	null
2025-01-03	Time Series Language Model for Descriptive Caption Generation	Mohamed Trabelsi et.al.	2501.01832	null
2025-01-03	Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models	Yanjiang Liu et.al.	2501.01830	null
2025-01-03	SDPO: Segment-Level Direct Preference Optimization for Social Agents	Aobo Kong et.al.	2501.01821	link
2025-01-03	BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction	Ferhat Ozgur Catak et.al.	2501.01802	link
2025-01-03	Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation	Mohammad Khalil et.al.	2501.01793	link
2025-01-03	Efficient LLM Inference with Activation Checkpointing and Hybrid Caching	Sanghyeon Lee et.al.	2501.01792	null
2025-01-03	Nonparametric estimation of a factorizable density using diffusion models	Hyeok Kyu Kwon et.al.	2501.01783	null
2025-01-03	SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation	Mingjie Li et.al.	2501.01765	null
2025-01-03	Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models	Andrea Matteazzi et.al.	2501.01761	null
2025-01-03	MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling	Simon Rouard et.al.	2501.01757	null
2025-01-03	Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation	Kangcheng Luo et.al.	2501.01743	null
2025-01-03	How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models	Simone Corbo et.al.	2501.01741	null
2025-01-03	AR4D: Autoregressive 4D Generation from Monocular Videos	Hanxin Zhu et.al.	2501.01722	null
2025-01-03	Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models	Guosheng Zhang et.al.	2501.01720	null
2025-01-03	LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries	Michal Kuk et.al.	2501.01711	null
2025-01-03	MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders	Jiajun Cao et.al.	2501.01709	null
2025-01-03	AgentRefine: Enhancing Agent Generalization through Refinement Tuning	Dayuan Fu et.al.	2501.01702	null
2025-01-03	Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models	Lei Tang et.al.	2501.01679	null
2025-01-03	Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption	Zhang Ruoyan et.al.	2501.01672	null
2025-01-03	BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction	Alaeddine Diaf et.al.	2501.01664	null
2025-01-03	Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning	Danni Peng et.al.	2501.01653	null
2025-01-03	MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments	Cai Yin et.al.	2501.01652	link
2025-01-03	HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding	Heqing Zou et.al.	2501.01645	null
2025-01-03	iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings	Shuhei Tomoshige et.al.	2501.01642	null
2025-01-03	Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation	Rini Smita Thakur et.al.	2501.01640	null
2025-01-03	A non-ergodic framework for understanding emergent capabilities in Large Language Models	Javier Marin et.al.	2501.01638	null
2025-01-03	Revisiting Data Analysis with Pre-trained Foundation Models	Chen Liang et.al.	2501.01631	null
2025-01-03	ICPC: In-context Prompt Compression with Faster Inference	Ziyang Yu et.al.	2501.01625	null
2025-01-03	PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents	Jingoo Lee et.al.	2501.01594	null
2025-01-03	(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges	Mohamed Hisham Abdellatif et.al.	2501.01588	null
2025-01-02	Predicting the Performance of Black-box LLMs through Self-Queries	Dylan Sam et.al.	2501.01558	link
2025-01-02	Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs)	Jumana Almahmoud et.al.	2501.01545	null
2025-01-02	Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information	Rasul Tutnov et.al.	2501.01544	null
2025-01-02	Denoising Diffused Embeddings: a Generative Approach for Hypergraphs	Shihao Wu et.al.	2501.01541	null
2025-01-02	BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery	Kanishk Gandhi et.al.	2501.01540	link
2025-01-02	SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers	Bhavna Gopal et.al.	2501.01529	null
2025-01-02	Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search	Shuangtao Li et.al.	2501.01478	null
2025-01-02	Unifying Specialized Visual Encoders for Video Language Models	Jihoon Chung et.al.	2501.01426	link
2025-01-02	Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models	Jingfeng Yao et.al.	2501.01423	link
2025-01-02	Multi-Modal Video Feature Extraction for Popularity Prediction	Haixu Liu et.al.	2501.01422	null
2025-01-02	Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers	Seunghyun Lee et.al.	2501.01414	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-02	OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios	Xize Cheng et.al.	2501.01384	null
2025-01-02	ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI	Neda Tavakoli et.al.	2501.01372	link
2025-01-02	Aligning Large Language Models for Faithful Integrity Against Opposing Argument	Yong Zhao et.al.	2501.01336	link
2025-01-02	CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models	Johan Wahréus et.al.	2501.01335	link
2025-01-02	Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension	Yanbo Fang et.al.	2501.01332	null
2025-01-02	The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation	Shuzheng Gao et.al.	2501.01329	null
2025-01-03	Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking	Xiaoxue Cheng et.al.	2501.01306	null
2025-01-02	Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments -- The Depression and Anxiety Case	Kaushik Roy et.al.	2501.01305	null
2025-01-02	Does a Large Language Model Really Speak in Human-Like Language?	Mose Park et.al.	2501.01273	null
2025-01-02	ProgCo: Program Helps Self-Correction of Large Language Models	Xiaoshuai Song et.al.	2501.01264	null
2025-01-03	CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings	Shanghaoran Quan et.al.	2501.01257	null
2025-01-02	Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers?	Manuel Weber et.al.	2501.01256	null
2025-01-02	Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion	Qiyuan He et.al.	2501.01246	null
2025-01-02	SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization	Yongle Huang et.al.	2501.01245	link
2025-01-02	Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants	Lixiong Qin et.al.	2501.01243	null
2025-01-02	Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction	Alexander Brinkmann et.al.	2501.01237	link
2025-01-03	TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer	Jiayu Li et.al.	2501.01216	null
2025-01-02	Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects	Abdullah Mushtaq et.al.	2501.01205	null
2025-01-02	HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation	Runsong Jia et.al.	2501.01203	null
2025-01-02	LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge	Kyoungkook Kang et.al.	2501.01197	null
2025-01-02	Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education	Annika Bush et.al.	2501.01192	null
2025-01-02	Towards Interactive Deepfake Analysis	Lixiong Qin et.al.	2501.01164	link
2025-01-02	TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions	Vriksha Srihari et.al.	2501.01156	null
2025-01-02	A3: Android Agent Arena for Mobile GUI Agents	Yuxiang Chai et.al.	2501.01149	null
2025-01-03	BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference	Wonsuk Jang et.al.	2501.01144	link
2025-01-02	Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method	Ruichen Zhang et.al.	2501.01141	null
2025-01-02	Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning	Shuo Yu et.al.	2501.01124	null
2025-01-02	MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification	Jimin Park et.al.	2501.01110	null
2025-01-03	MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization	Haina Zhu et.al.	2501.01108	link
2025-01-02	Graph Generative Pre-trained Transformer	Xiaohui Chen et.al.	2501.01073	null
2025-01-02	Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models	Yanwen Huang et.al.	2501.01059	null
2025-01-02	Risks of Cultural Erasure in Large Language Models	Rida Qadri et.al.	2501.01056	null
2025-01-02	Dynamic Scaling of Unit Tests for Code Reward Modeling	Zeyao Ma et.al.	2501.01054	null
2025-01-02	Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs	Linhao Huang et.al.	2501.01042	null
2025-01-02	Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models	Bin Wang et.al.	2501.01034	link
2025-01-02	ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning	Wonduk Seo et.al.	2501.01031	null
2025-01-03	KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model	Xinshuo Hu et.al.	2501.01028	link
2025-01-02	MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model	Chengze Zhang et.al.	2501.01014	null
2025-01-02	FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving	Zihao Ye et.al.	2501.01005	link
2025-01-02	Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory	Zhou Yang et.al.	2501.00999	null
2025-01-02	Optimizing Noise Schedules of Generative Models in High Dimensionss	Santiago Aranguri et.al.	2501.00988	null
2025-01-02	Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice	Federico Ravenda et.al.	2501.00982	link
2025-01-01	IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs	Junfeng Jiao et.al.	2501.00959	null
2025-01-01	Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors	Junfeng Jiao et.al.	2501.00957	null
2025-01-01	Incremental Dialogue Management: Survey, Discussion, and Implications for HRI	Casey Kennington et.al.	2501.00953	null
2025-01-01	SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering	Shihab Ahmed et.al.	2501.00940	null
2025-01-01	Diffusion Policies for Generative Modeling of Spacecraft Trajectories	Julia Briden et.al.	2501.00915	null
2025-01-01	Aligning LLMs with Domain Invariant Reward Models	David Wu et.al.	2501.00911	link
2025-01-01	Population Aware Diffusion for Time Series Generation	Yang Li et.al.	2501.00910	link
2025-01-01	Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things	Talha Zeeshan et.al.	2501.00906	null
2025-01-01	Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model	Chenyang Liu et.al.	2501.00895	null
2025-01-01	Evaluating Time Series Foundation Models on Noisy Periodic Time Series	Syamantak Datta Gupta et.al.	2501.00889	null
2025-01-01	Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization	Weiqi Wu et.al.	2501.00888	link
2025-01-01	Representation in large language models	Cameron C. Yetman et.al.	2501.00885	null
2025-01-01	Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents	Fouad Bousetouane et.al.	2501.00881	null
2025-01-01	Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction	Teng Hu et.al.	2501.00880	null
2025-01-01	TrustRAG: Enhancing Robustness and Trustworthiness in RAG	Huichi Zhou et.al.	2501.00879	link
2025-01-01	LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models	Hieu Man et.al.	2501.00874	link
2025-01-01	Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation	Mingjia Li et.al.	2501.00873	link
2025-01-01	Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation	Shoutao Guo et.al.	2501.00868	link
2025-01-01	Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era	Mihnea C. Moldoveanu et.al.	2501.00867	null
2025-01-01	Alzheimer's disease detection based on large language model prompt engineering	Tian Zheng et.al.	2501.00861	null
2025-01-01	LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions	Adam Ishay et.al.	2501.00830	null
2025-01-01	An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems	Haoxiang Tian et.al.	2501.00829	null
2025-01-01	LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management	Yichen Luo et.al.	2501.00826	null
2025-01-01	Multimodal Large Models Are Effective Action Anticipators	Binglu Wang et.al.	2501.00795	link
2025-01-01	Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models	Minhao Bai et.al.	2501.00786	null
2025-01-01	NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model	Yuzhi Lai et.al.	2501.00785	null
2025-01-01	REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization	Huyen Nguyen et.al.	2501.00779	null
2025-01-01	FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation	Qianli Wang et.al.	2501.00777	null
2025-01-01	Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis	Jie Gao et.al.	2501.00775	null
2025-01-01	An AI-powered Bayesian generative modeling approach for causal inference in observational studies	Qiao Liu et.al.	2501.00755	null
2025-01-01	Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform	Cheonsu Jeong et.al.	2501.00750	null
2025-01-01	DIVE: Diversified Iterative Self-Improvement	Yiwei Qin et.al.	2501.00747	link
2025-01-01	Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines	Xiyang Hu et.al.	2501.00745	null
2025-01-01	A Distributional Evaluation of Generative Image Models	Edric Tam et.al.	2501.00744	null
2025-01-01	New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory	Madiha Ajmal et.al.	2501.00721	null
2025-01-01	Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection	Hao Wang et.al.	2501.00700	null
2025-01-01	Adjoint sharding for very long context training of state space models	Xingzi Xu et.al.	2501.00692	null
2025-01-01	Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro	Md Rakibul Hasan et.al.	2501.00691	null
2025-01-01	IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently	Florian Dietz et.al.	2501.00684	null
2024-12-31	Grade Inflation in Generative Models	Phuc Nguyen et.al.	2501.00664	null
2024-12-31	Finding Missed Code Size Optimizations in Compilers using LLMs	Davide Italiano et.al.	2501.00655	null
2024-12-31	Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models	Suttisak Wizadwongsa et.al.	2501.00651	null
2024-12-31	Efficient Standardization of Clinical Notes using Large Language Models	Daniel B. Hier et.al.	2501.00644	null
2024-12-31	Enabling New HDLs with Agents	Mark Zakharov et.al.	2501.00642	null
2024-12-31	DreamDrive: Generative 4D Scene Modeling from Street View Images	Jiageng Mao et.al.	2501.00601	null
2024-12-31	VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM	Yuqian Yuan et.al.	2501.00599	link
2024-12-31	Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation	M. Ali Bayram et.al.	2501.00593	null
2024-12-31	Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method	Zhenpeng Huang et.al.	2501.00584	null
2024-12-31	Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders	Yipeng Kang et.al.	2501.00581	null
2024-12-31	AI and Quantum Computing in Binary Photocatalytic Hydrogen Production	Dennis Delali Kwesi Wayo et.al.	2501.00575	null
2024-12-31	VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling	Xinhao Li et.al.	2501.00574	link
2024-12-31	Probing Visual Language Priors in VLMs	Tiange Luo et.al.	2501.00569	null
2024-12-31	Robust and Adaptive Optimization under a Large Language Model Lens	Dimitris Bertsimas et.al.	2501.00568	null
2024-12-30	Distributed Mixture-of-Agents for Edge Inference with Large Language Models	Purbesh Mitra et.al.	2412.21200	link
2024-12-31	HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation	Zhaojian Yu et.al.	2412.21199	link
2024-12-30	The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick	Jonathan Berkheim et.al.	2412.21186	null
2024-12-30	Facilitating large language model Russian adaptation with Learned Embedding Propagation	Mikhail Tikhomirov et.al.	2412.21140	link
2024-12-30	ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation	Ruixuan Liu et.al.	2412.21123	null
2025-01-02	Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation	Yuanbo Yang et.al.	2412.21117	null
2024-12-30	Varformer: Adapting VAR's Generative Prior for Image Restoration	Siyang Wang et.al.	2412.21063	link
2024-12-30	VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Jiazheng Xu et.al.	2412.21059	link
2024-12-30	Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense	Yuyang Zhou et.al.	2412.21051	link
2024-12-30	E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models	Zhiyu Tan et.al.	2412.21044	null
2024-12-30	Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration	Wanglong Lu et.al.	2412.21042	link
2024-12-30	TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization	Chia-Yu Hung et.al.	2412.21037	link
2024-12-30	GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models	Shangyu Xing et.al.	2412.21036	null
2024-12-30	MapQaTor: A System for Efficient Annotation of Map Query Datasets	Mahir Labib Dihan et.al.	2412.21015	link
2024-12-31	Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria	Joonwon Jang et.al.	2412.21006	null
2024-12-30	Plug-and-Play Training Framework for Preference Optimization	Jingyuan Ma et.al.	2412.20996	null
2024-12-30	KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation	Siyuan Fang et.al.	2412.20995	null
2024-12-30	Efficiently Serving LLM Reasoning Programs with Certaindex	Yichao Fu et.al.	2412.20993	null
2024-12-30	QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing	Shlomo Kashani et.al.	2412.20956	null
2024-12-30	AGON: Automated Design Framework for Customizing Processors from ISA Documents	Chongxiao Li et.al.	2412.20954	null
2024-12-30	Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema	Xiaohan Feng et.al.	2412.20942	null
2024-12-30	Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering	Junxiao Xue et.al.	2412.20927	null
2024-12-30	ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation	Ting Zhang et.al.	2412.20901	null
2024-12-30	Towards Compatible Fine-tuning for Vision-Language Model Updates	Zhengbo Wang et.al.	2412.20895	null
2024-12-30	DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models	Xiaolin Hu et.al.	2412.20891	null
2024-12-30	Enhancing Annotated Bibliography Generation with LLM Ensembles	Sergio Bermejo et.al.	2412.20864	null
2024-12-30	Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs' Memory	Xingjian Tao et.al.	2412.20846	null
2024-12-30	Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment	Jianfei Zhang et.al.	2412.20834	link
2024-12-30	Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model	Runtao Ren et.al.	2412.20820	null
2024-12-30	TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting	Huanyu Zhang et.al.	2412.20810	null
2024-12-30	Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves	Chayan Chatterjee et.al.	2412.20789	null
2024-12-31	SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity	Pengfei Jing et.al.	2412.20787	null
2024-12-30	Large Language Model Enabled Multi-Task Physical Layer Network	Tianyue Zheng et.al.	2412.20772	null
2024-12-30	Attributing Culture-Conditioned Generations to Pretraining Corpora	Huihan Li et.al.	2412.20760	link
2024-12-30	M $^3$ oralBench: A MultiModal Moral Benchmark for LVLMs	Bei Yan et.al.	2412.20718	link
2024-12-30	HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images	Sungik Choi et.al.	2412.20704	null
2024-12-30	UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design	Zijie Chen et.al.	2412.20694	null
2024-12-30	Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks	Yuhe Ding et.al.	2412.20682	null
2024-12-30	Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA	Qingyun Jin et.al.	2412.20677	null
2024-12-30	Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner	Yitong Zhou et.al.	2412.20662	link
2024-12-30	Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis	Yousef Yeganeh et.al.	2412.20651	null
2024-12-30	SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy	Md Mahadi Hasan Nahid et.al.	2412.20641	null
2024-12-30	Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble	Yongchang Li et.al.	2412.20637	null
2024-12-30	EVOLVE: Emotion and Visual Output Learning via LLM Evaluation	Jordan Sinclair et.al.	2412.20632	null
2024-12-29	Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study	Yulin Fei et.al.	2412.20613	link
2024-12-29	NLP-based Regulatory Compliance -- Using GPT 4.0 to Decode Regulatory Documents	Bimal Kumar et.al.	2412.20602	null
2024-12-29	MATEY: multiscale adaptive foundation models for spatiotemporal physical systems	Pei Zhang et.al.	2412.20601	null
2024-12-29	Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection	Dmitri Roussinov et.al.	2412.20595	link
2024-12-29	Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches	Madhavendra Thakur et.al.	2412.20584	null
2024-12-29	Counterfactual Samples Constructing and Training for Commonsense Statements Estimation	Chong Liu et.al.	2412.20563	null
2024-12-29	Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces	Linglingzhi Zhu et.al.	2412.20556	null
2024-12-29	The Impact of Prompt Programming on Function-Level Code Generation	Ranim Khojah et.al.	2412.20545	link
2024-12-29	Goal-Conditioned Data Augmentation for Offline Reinforcement Learning	Xingshuai Huang et.al.	2412.20519	null
2024-12-29	Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning	Hang Ni et.al.	2412.20505	null
2024-12-29	ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding	Xiao Wang et.al.	2412.20504	link
2024-12-29	TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication	Zongwu Wang et.al.	2412.20501	link
2024-12-29	Multimodal Variational Autoencoder: a Barycentric View	Peijie Qiu et.al.	2412.20487	null
2024-12-29	JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling	Haorui Ji et.al.	2412.20470	null
2024-12-29	Improving Vision-Language-Action Models via Chain-of-Affordance	Jinming Li et.al.	2412.20451	null
2024-12-29	Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs	Pratik Rakesh Singh et.al.	2412.20440	null
2024-12-29	Image Augmentation Agent for Weakly Supervised Semantic Segmentation	Wangyu Wu et.al.	2412.20439	null
2024-12-29	Unlocking adaptive digital pathology through dynamic feature learning	Jiawen Li et.al.	2412.20430	null
2024-12-29	AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models	Mansi et.al.	2412.20427	null
2024-12-29	Bringing Objects to Life: 4D generation from 3D objects	Ohad Rahamim et.al.	2412.20422	null
2024-12-29	Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection	Kalin Kopanov et.al.	2412.20414	null
2024-12-29	Multi-Objective Large Language Model Unlearning	Zibin Pan et.al.	2412.20412	link
2024-12-29	Open-Sora: Democratizing Efficient Video Production for All	Zangwei Zheng et.al.	2412.20404	link
2024-12-29	Natural Language Fine-Tuning	Jia Liu et.al.	2412.20382	link
2024-12-29	Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs)	Jia Wei Sii et.al.	2412.20381	null
2024-12-29	FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation	Yan Luo et.al.	2412.20374	link
2024-12-29	LLM2: Let Large Language Models Harness System 2 Reasoning	Cheng Yang et.al.	2412.20372	link
2025-01-02	Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey	Junqiao Wang et.al.	2412.20367	null
2024-12-29	HindiLLM: Large Language Model for Hindi	Sanjay Chouhan et.al.	2412.20357	null
2024-12-29	Distilling Desired Comments for Enhanced Code Review with Large Language Models	Yongda Yu et.al.	2412.20340	null
2024-12-29	Mind the Data Gap: Bridging LLMs to Enterprise Data Integration	Moe Kayali et.al.	2412.20331	null
2024-12-29	GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions	Tianyao Shi et.al.	2412.20322	null
2024-12-29	Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain	Shintaro Ozaki et.al.	2412.20309	null
2024-12-28	FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration	Jia Liu et.al.	2412.20297	null
2024-12-28	Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games	Guan-Horng Liu et.al.	2412.20279	null
2024-12-28	Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues	Henry J. Xie et.al.	2412.20264	link
2024-12-28	Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception	Athanasios Karagounis et.al.	2412.20230	null
2024-12-28	LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning	Shuguang Chen et.al.	2412.20227	null
2024-12-28	Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation	Yeonhong Park et.al.	2412.20185	null
2024-12-28	LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System	Hyucksung Kwon et.al.	2412.20166	null
2024-12-28	StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN	Andrzej Bedychaj et.al.	2412.20164	null
2024-12-28	Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems	Minhye Jeon et.al.	2412.20163	null
2024-12-28	Multi-Modality Driven LoRA for Adverse Condition Depth Estimation	Guanglei Yang et.al.	2412.20162	null
2024-12-28	Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses	Xinru Wen et.al.	2412.20154	null
2024-12-28	Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering	Wei Zhou et.al.	2412.20145	null
2024-12-28	TradingAgents: Multi-Agents LLM Financial Trading Framework	Yijia Xiao et.al.	2412.20138	null
2024-12-28	M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation	Zhaopeng Feng et.al.	2412.20127	link
2024-12-28	Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers	Tuomas Hakoniemi et.al.	2412.20114	null
2024-12-28	ST $^3$ : Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming	Jiedong Zhuang et.al.	2412.20105	null
2024-12-28	On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs	Atmane Ayoub Mansour Bahar et.al.	2412.20087	null
2024-12-31	Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset	Chongjian Yue et.al.	2412.20072	null
2024-12-28	On the Compositional Generalization of Multimodal LLMs for Medical Imaging	Zhenyang Cai et.al.	2412.20070	link
2024-12-28	VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition	Lan Chen et.al.	2412.20064	link
2024-12-28	MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion	Zechao Zhan et.al.	2412.20062	null
2024-12-28	Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts	Yanxin Shen et.al.	2412.20061	null
2024-12-28	"My life is miserable, have to sign 500 autographs everyday": Exposing Humblebragging, the Brags in Disguise	Sharath Naganna et.al.	2412.20057	null
2024-12-27	Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization	Kumud Tripathi et.al.	2412.19785	null
2024-12-27	Can AI Help with Your Personal Finances?	Oudom Hean et.al.	2412.19784	null
2024-12-27	Tensor Network Estimation of Distribution Algorithms	John Gardiner et.al.	2412.19780	null
2024-12-27	Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration	Le Chen et.al.	2412.19770	link
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-27	On dual-projectively equivalent connections associated to second order superintegrable systems	Andreas Vollmer et.al.	2412.19739	null
2024-12-27	Can Large Language Models Adapt to Other Agents In-Context?	Matthew Riemer et.al.	2412.19726	null
2024-12-27	From Elements to Design: A Layered Approach for Automatic Graphic Design Composition	Jiawei Lin et.al.	2412.19712	null
2024-12-27	Toward Adaptive Reasoning in Large Language Models with Thought Rollback	Sijia Chen et.al.	2412.19707	link
2024-12-27	A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization	Jingchun Lian et.al.	2412.19685	null
2024-12-27	Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework	Jiang Liu et.al.	2412.19684	null
2024-12-27	CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs	Siyu Wang et.al.	2412.19663	null
2024-12-27	Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis	Jiaqi Wang et.al.	2412.19654	link
2024-12-27	FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios	Kaiyi Pang et.al.	2412.19652	null
2024-12-27	Xmodel-2 Technical Report	Wang Qun et.al.	2412.19638	null
2024-12-27	IMTP: Search-based Code Generation for In-memory Tensor Programs	Yongwon Shin et.al.	2412.19630	null
2024-12-27	Signatures of prediction during natural listening in MEG data?	Sahel Azizpour et.al.	2412.19622	null
2024-12-27	Gradient Weight-normalized Low-rank Projection for Efficient LLM Training	Jia-Hong Huang et.al.	2412.19616	link
2024-12-27	SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms	Shashank Rao Marpally et.al.	2412.19595	null
2024-12-27	Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following	Yuxiao Yang et.al.	2412.19562	null
2024-12-27	Diverse Rare Sample Generation with Pretrained GANs	Subeen Lee et.al.	2412.19543	link
2024-12-27	Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy--Fokker--Planck Equations	Yuanfei Huang et.al.	2412.19520	null
2024-12-27	Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model	Hyunwoo Cho et.al.	2412.19517	null
2024-12-27	Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs	Zhe Yang et.al.	2412.19513	link
2024-12-27	Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging	Hua Farn et.al.	2412.19512	null
2024-12-27	Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion	Koustav Ghosal et.al.	2412.19510	null
2024-12-27	MBQ: Modality-Balanced Quantization for Large Vision-Language Models	Shiyao Li et.al.	2412.19509	link
2024-12-27	DrivingWorld: ConstructingWorld Model for Autonomous Driving via Video GPT	Xiaotao Hu et.al.	2412.19505	link
2024-12-27	Casevo: A Cognitive Agents and Social Evolution Simulator	Zexun Jiang et.al.	2412.19498	link
2024-12-27	Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation	Chengyang Ye et.al.	2412.19492	link
2024-12-27	Focusing Image Generation to Mitigate Spurious Correlations	Xuewei Li et.al.	2412.19457	null
2024-12-27	Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models	Hyeonseok Moon et.al.	2412.19450	link
2024-12-27	Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models	Shuo Wang et.al.	2412.19449	null
2024-12-27	A Survey on Large Language Model Acceleration based on KV Cache Management	Haoyang Li et.al.	2412.19442	link
2024-12-27	Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback	Seong Jin Lee et.al.	2412.19436	null
2024-12-27	Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints	Alberto Maté et.al.	2412.19424	null
2024-12-27	Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning	Chen Li et.al.	2412.19422	link
2024-12-27	MINIMA: Modality Invariant Image Matching	Xingyu Jiang et.al.	2412.19412	link
2024-12-27	MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios	Jiaqi Fan et.al.	2412.19406	null
2024-12-27	An Engorgio Prompt Makes Large Language Model Babble on	Jianshuo Dong et.al.	2412.19394	link
2024-12-26	Large Language Models for Market Research: A Data-augmentation Approach	Mengxin Wang et.al.	2412.19363	null
2024-12-26	Dynamic Skill Adaptation for Large Language Models	Jiaao Chen et.al.	2412.19361	null
2024-12-26	Identifying Split Vacancies with Foundation Models and Electrostatics	Seán R. Kavanagh et.al.	2412.19330	null
2024-12-26	Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment	Ziang Yan et.al.	2412.19326	link
2024-12-26	Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones	Mehrnaz Mofakhami et.al.	2412.19325	null
2024-12-26	From Interets to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries	Hugh Van Deventer et.al.	2412.19312	link
2024-12-26	Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries	Roberto Amoroso et.al.	2412.19304	null
2024-12-26	RecLM: Recommendation Instruction Tuning	Yangqin Jiang et.al.	2412.19302	link
2024-12-26	RAG with Differential Privacy	Nicolas Grislain et.al.	2412.19291	link
2024-12-26	Time Series Foundational Models: Their Role in Anomaly Detection and Prediction	Chathurangi Shyalika et.al.	2412.19286	link
2024-12-26	PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing	Michael Bezick et.al.	2412.19284	null
2024-12-26	MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes	Asma Ben Abacha et.al.	2412.19260	link
2024-12-26	VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis	Jaemin Jung et.al.	2412.19259	null
2024-12-26	Sentiment trading with large language models	Kemal Kirtac et.al.	2412.19245	null
2024-12-26	SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model	Xuyang Li et.al.	2412.19237	null
2024-12-26	Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining	Yuxin You et.al.	2412.19211	null
2024-12-26	Multi-Attribute Constraint Satisfaction via Language Model Rewriting	Ashutosh Baheti et.al.	2412.19198	null
2024-12-26	Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models	Haonan He et.al.	2412.19191	null
2024-12-26	Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue	Shuzhi Xu et.al.	2412.19154	null
2024-12-26	AskChart: Universal Chart Understanding through Textual Enhancement	Xudong Yang et.al.	2412.19146	link
2024-12-26	SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis	Senbin Zhu et.al.	2412.19140	link
2024-12-26	PlanLLM: Video Procedure Planning with Refinable Large Language Models	Dejie Yang et.al.	2412.19139	link
2024-12-26	Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing	Inpyo Hong et.al.	2412.19125	link
2024-12-26	Discrete vs. Continuous Trade-offs for Generative Models	Jathin Korrapati et.al.	2412.19114	null
2024-12-26	SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values	Yunfan Zhang et.al.	2412.19113	null
2024-12-26	Stochastic normalizing flows for Effective String Theory	Michele Caselle et.al.	2412.19109	null
2024-12-26	"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities	Jiawei Yu et.al.	2412.19102	null
2024-12-26	Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security	Vasileios Alevizos et.al.	2412.19088	null
2024-12-26	Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation	Haotian Qian et.al.	2412.19080	null
2024-12-26	CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers	Jingyi Zheng et.al.	2412.19037	link
2024-12-26	Repository Structure-Aware Training Makes SLMs Better Issue Resolver	Zexiong Ma et.al.	2412.19031	null
2024-12-26	Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation	Yixin Chen et.al.	2412.19026	link
2024-12-26	Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication	Xiqiang Qu et.al.	2412.19025	null
2024-12-26	Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation	Tao Liu et.al.	2412.19021	null
2024-12-26	Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability	Ruixi Lin et.al.	2412.19018	null
2024-12-25	How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study	Alejandro Velasco et.al.	2412.18989	null
2024-12-25	ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement	Zhefan Rao et.al.	2412.18966	null
2024-12-25	Musings About the Future of Search: A Return to the Past?	Jimmy Lin et.al.	2412.18956	null
2024-12-25	A Power-Efficient Hardware Implementation of L-Mul	Ruiqi Chen et.al.	2412.18948	null
2024-12-25	MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models	Kaiwen Zuo et.al.	2412.18947	null
2024-12-25	Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations	Yewon Kim et.al.	2412.18940	null
2024-12-25	Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference	Libo Zhang et.al.	2412.18934	null
2024-12-25	UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation	Lunhao Duan et.al.	2412.18928	null
2024-12-25	Exemplar-condensed Federated Class-incremental Learning	Rui Sun et.al.	2412.18926	null
2024-12-25	Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model	Yi-Chia Chen et.al.	2412.18917	link
2024-12-25	AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures	Situo Zhang et.al.	2412.18910	null
2024-12-25	CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models	Ping Guo et.al.	2412.18890	link
2024-12-25	MotionMap: Representing Multimodality in Human Pose Forecasting	Reyhaneh Hosseininejad et.al.	2412.18883	null
2024-12-25	Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models	Meltem Aksoy et.al.	2412.18863	null
2024-12-25	Improving the Readability of Automatically Generated Tests using Large Language Models	Matteo Biagiola et.al.	2412.18843	null
2024-12-25	LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements	Hao Zhang et.al.	2412.18835	null
2024-12-25	Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition	Shujie Hu et.al.	2412.18832	null
2024-12-25	RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting	Yilei Jiang et.al.	2412.18826	null
2024-12-25	CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection	Wenbin Li et.al.	2412.18820	link
2024-12-25	LLM-assisted vector similarity search	Md Riyadh et.al.	2412.18819	null
2024-12-25	DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search	Lei Yang et.al.	2412.18811	null
2024-12-25	Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation	Xinkai Du et.al.	2412.18800	null
2024-12-25	Torque-Aware Momentum	Pranshu Malviya et.al.	2412.18790	null
2024-12-25	Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models	Yu-An Liu et.al.	2412.18770	link
2024-12-25	The Impact of Input Order Bias on Large Language Models for Software Fault Localization	Md Nakhla Rafi et.al.	2412.18750	null
2024-12-24	Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models	Zehan Wang et.al.	2412.18605	link
2024-12-24	Long-Form Speech Generation with Spoken Language Models	Se Jin Park et.al.	2412.18603	link
2024-12-24	Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems	Fernando Jia et.al.	2412.18601	link
2024-12-24	ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Hongjie Li et.al.	2412.18600	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-24	A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs	OpenMind et.al.	2412.18588	null
2024-12-24	Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control	Sergey Sedov et.al.	2412.18582	null
2024-12-24	Zero-resource Speech Translation and Recognition with LLMs	Karel Mundnich et.al.	2412.18566	null
2024-12-24	Distilling Fine-grained Sentiment Understanding from Large Language Models	Yice Zhang et.al.	2412.18552	link
2024-12-24	Token-Budget-Aware LLM Reasoning	Tingxu Han et.al.	2412.18547	link
2024-12-24	PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction	Xingjian Xu et.al.	2412.18541	null
2024-12-24	Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation	Derong Xu Xinhang Li et.al.	2412.18537	link
2024-12-24	Automated Code Review In Practice	Umut Cihan et.al.	2412.18531	null
2024-12-24	Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving	Hao Pang et.al.	2412.18511	null
2024-12-24	Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization	Yi-Fu Fu et.al.	2412.18497	null
2024-12-24	GeFL: Model-Agnostic Federated Learning with Generative Models	Honggu Kang et.al.	2412.18460	null
2024-12-24	3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding	Tatiana Zemskova et.al.	2412.18450	link
2024-12-24	Is Large Language Model Good at Triple Set Prediction? An Empirical Study	Yuan Yuan et.al.	2412.18443	null
2024-12-24	Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm	O. Deniz Akyildiz et.al.	2412.18432	null
2024-12-24	GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent	Kangjia Zhao et.al.	2412.18426	null
2024-12-24	Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models	Zihan Zhou et.al.	2412.18419	null
2024-12-24	Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles	Zihan Wang et.al.	2412.18416	null
2024-12-24	Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English	Avinash Anand et.al.	2412.18415	link
2024-12-24	Discovery of 2D Materials via Symmetry-Constrained Diffusion Model	Shihang Xu et.al.	2412.18414	null
2024-12-24	A Statistical Framework for Ranking LLM-Based Chatbots	Siavash Ameli et.al.	2412.18407	link
2024-12-24	Extract Free Dense Misalignment from CLIP	JeongYeon Nam et.al.	2412.18404	link
2024-12-24	RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction	Wu Xiaoping et.al.	2412.18390	null
2024-12-24	MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs	Qiuyi Gu et.al.	2412.18381	null
2024-12-24	Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents	Kaiwen Ning et.al.	2412.18371	link
2024-12-24	Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering	Zhongjian Hu et.al.	2412.18351	null
2024-12-24	M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models	Jiaxin Guo et.al.	2412.18299	null
2024-12-24	Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight	Xi Ding et.al.	2412.18298	link
2024-12-24	Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases	Christian Di Maio et.al.	2412.18295	null
2024-12-24	DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation	Junyi Lu et.al.	2412.18291	null
2024-12-24	Improved Feature Generating Framework for Transductive Zero-shot Learning	Zihan Ye et.al.	2412.18282	null
2024-12-24	GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications	Zhenzhou Jin et.al.	2412.18281	null
2024-12-24	Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization	Jiacai Liu et.al.	2412.18279	null
2024-12-24	GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge	Shammur Absar Chowdhury et.al.	2412.18274	null
2024-12-24	Annotating References to Mythological Entities in French Literature	Thierry Poibeau et.al.	2412.18270	null
2024-12-24	Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study	Xuefeng Jiang et.al.	2412.18260	link
2024-12-24	AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction	Pufan Zou et.al.	2412.18255	null
2024-12-24	An Automatic Graph Construction Framework based on Large Language Models for Recommendation	Rong Shan et.al.	2412.18241	link
2024-12-24	Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection	Jango Zhang et.al.	2412.18225	null
2024-12-24	Expand VSR Benchmark for VLLM to Expertize in Spatial Rules	Peijin Xie et.al.	2412.18224	link
2024-12-24	ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation	Mengyang Wu et.al.	2412.18216	link
2024-12-24	Adapting Large Language Models for Improving TCP Fairness over WiFi	Shyam Kumar Shrestha et.al.	2412.18200	null
2024-12-24	Robustness-aware Automatic Prompt Optimization	Zeru Shi et.al.	2412.18196	link
2024-12-24	VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks	Shiduo Zhang et.al.	2412.18194	null
2024-12-24	TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization	Yucong Luo et.al.	2412.18185	null
2024-12-24	Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation	Yucong Luo et.al.	2412.18176	null
2024-12-24	INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent	Haohang Li et.al.	2412.18174	null
2024-12-24	Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models	Xiaomeng Hu et.al.	2412.18171	null
2024-12-24	KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management	Rongxin Cheng et.al.	2412.18169	null
2024-12-24	Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence	Yinbin Han et.al.	2412.18164	null
2024-12-24	VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities	Shray Mathur et.al.	2412.18161	null
2024-12-24	Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task	Jinming Liu et.al.	2412.18158	null
2024-12-24	Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance	Yaoyun Zhang et.al.	2412.18157	null
2024-12-24	scReader: Prompting Large Language Models to Interpret scRNA-seq Data	Cong Li et.al.	2412.18156	null
2024-12-24	GeneSUM: Large Language Model-based Gene Summary Extraction	Zhijian Chen et.al.	2412.18154	null
2024-12-24	CoAM: Corpus of All-Type Multiword Expressions	Yusuke Ide et.al.	2412.18151	null
2024-12-24	EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation	Shuhao Han et.al.	2412.18150	link
2024-12-24	Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction	Xiao Guo et.al.	2412.18149	null
2024-12-24	Ensuring Consistency for In-Image Translation	Chengpeng Fu et.al.	2412.18139	null
2024-12-24	LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment	Binrui Zeng et.al.	2412.18135	null
2024-12-24	VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection	Zhaohui Jin et.al.	2412.18124	null
2024-12-24	AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation	Hao Wen et.al.	2412.18116	null
2024-12-24	AIGT: AI Generative Table Based on Prompt	Mingming Zhang et.al.	2412.18111	null
2024-12-24	SlimGPT: Layer-wise Structured Pruning for Large Language Models	Gui Ling et.al.	2412.18110	null
2024-12-24	Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach	Jing Bi et.al.	2412.18108	null
2024-12-24	Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels	Mingcong Song et.al.	2412.18106	null
2024-12-24	EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent	Suyuan Wang et.al.	2412.18100	null
2024-12-24	Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) -- a Large Language Model Chatbot for Perioperative Medicine	Yu He Ke et.al.	2412.18096	null
2024-12-24	Molly: Making Large Language Model Agents Solve Python Problem More Logically	Rui Xiao et.al.	2412.18093	null
2024-12-24	Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner	Aizierjiang Aiersilan et.al.	2412.18086	link
2024-12-24	Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models	Xuan Lin et.al.	2412.18084	link
2024-12-24	Improving Factuality with Explicit Working Memory	Mingda Chen et.al.	2412.18069	null
2024-12-24	LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR	Osama Hosam Abdellaif et.al.	2412.18063	link
2024-12-24	Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction	Hyunbae Jeon et.al.	2412.18061	null
2024-12-24	An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM	Wen Wen et.al.	2412.18060	null
2024-12-23	Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations	Maya Patel et.al.	2412.18051	null
2024-12-23	AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data	Mirko Zaffaroni et.al.	2412.18038	link
2024-12-23	Generating refactored code accurately using reinforcement learning	Indranil Palit et.al.	2412.18035	null
2024-12-23	More than Chit-Chat: Developing Robots for Small-Talk Interactions	Rebecca Ramnauth et.al.	2412.18023	null
2024-12-23	Trustworthy and Efficient LLMs Meet Databases	Kyoungmin Kim et.al.	2412.18022	null
2024-12-23	StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs	Hailin Chen et.al.	2412.18011	null
2024-12-23	CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models	Ruibo Tu et.al.	2412.17970	link
2024-12-23	LMV-RPA: Large Model Voting-based Robotic Process Automation	Osama Abdellatif et.al.	2412.17965	link
2024-12-23	Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models	Antony Seabra et.al.	2412.17964	null
2024-12-23	Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models	Ge Zhang et.al.	2412.17963	null
2024-12-23	Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents	Antony Seabra et.al.	2412.17942	null
2024-12-23	BenCzechMark : A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism	Martin Fajcik et.al.	2412.17933	null
2024-12-23	Causal Composition Diffusion Model for Closed-loop Traffic Generation	Haohong Lin et.al.	2412.17920	null
2024-12-23	Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning	Orson Mengara et.al.	2412.17908	null
2024-12-23	LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses	Sara Riazi et.al.	2412.17892	null
2024-12-23	ChatGarment: Garment Estimation, Generation and Editing via Large Language Models	Siyuan Bian et.al.	2412.17811	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806	null
2024-12-23	Automating the Search for Artificial Life with Foundation Models	Akarsh Kumar et.al.	2412.17799	link
2024-12-23	ResearchTown: Simulator of Human Research Community	Haofei Yu et.al.	2412.17767	link
2024-12-23	ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback	Wei Zhang et.al.	2412.17754	null
2024-12-23	Deliberation in Latent Space via Differentiable Cache Augmentation	Luyang Liu et.al.	2412.17747	null
2024-12-23	YuLan-Mini: An Open Data-efficient Language Model	Yiwen Hu et.al.	2412.17743	link
2024-12-23	Reasoning to Attend: Try to Understand How Token Works	Rui Qian et.al.	2412.17741	link
2024-12-23	Knowledge Editing through Chain-of-Thought	Changyue Wang et.al.	2412.17727	link
2024-12-23	Understanding the Logic of Direct Preference Alignment through Logic	Kyle Richardson et.al.	2412.17696	null
2024-12-23	Large Language Model Safety: A Holistic Survey	Dan Shi et.al.	2412.17686	link
2024-12-23	A Bias-Free Training Paradigm for More General AI-generated Image Detection	Fabrizio Guillaro et.al.	2412.17671	null
2024-12-23	Generating Completions for Fragmented Broca's Aphasic Sentences Using Large Language Models	Sijbren van Vaals et.al.	2412.17669	link
2024-12-23	Detecting anxiety and depression in dialogues: a multi-label and explainable approach	Francisco de Arriba-Pérez et.al.	2412.17651	null
2024-12-23	SCBench: A Sports Commentary Benchmark for Video LLMs	Kuangzhi Ge et.al.	2412.17637	null
2024-12-23	ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance	Renyang Liu et.al.	2412.17632	link
2024-12-23	Tracking the Feature Dynamics in LLM Training: A Mechanistic Study	Yang Xu et.al.	2412.17626	null
2024-12-23	Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models	Parham Rezaei et.al.	2412.17622	link
2024-12-23	Emerging Security Challenges of Large Language Models	Herve Debar et.al.	2412.17614	null
2024-12-23	Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs	Fabrizio Frasca et.al.	2412.17609	null
2024-12-23	EasyTime: Time Series Forecasting Made Easy	Xiangfei Qiu et.al.	2412.17603	null
2024-12-23	LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context	Kai Ruan et.al.	2412.17596	link
2024-12-23	Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation	Chengbing Wang et.al.	2412.17593	null
2024-12-23	HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data	Ting Zhou et.al.	2412.17574	link
2024-12-23	S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field	Zixi Liang et.al.	2412.17561	link
2024-12-23	GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference	Chao Zeng et.al.	2412.17560	null
2024-12-23	A Survey of Query Optimization in Large Language Models	Mingyang Song et.al.	2412.17558	null
2024-12-23	Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing	Prakash Aryan et.al.	2412.17548	link
2024-12-23	Retention Score: Quantifying Jailbreak Risks for Vision Language Models	Zaitang Li et.al.	2412.17544	null
2024-12-23	Constructing Fair Latent Space for Intersection of Fairness and Explainability	Hyungjun Joo et.al.	2412.17523	null
2024-12-23	DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak	Hao Wang et.al.	2412.17522	null
2024-12-23	Improving the Noise Estimation of Latent Neural Stochastic Differential Equations	Linus Heck et.al.	2412.17499	null
2024-12-23	Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings	Jérémie Sublime et.al.	2412.17486	null
2024-12-23	Power- and Fragmentation-aware Online Scheduling for GPU Datacenters	Francesco Lettich et.al.	2412.17484	link
2024-12-23	A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression	Chenlong Deng et.al.	2412.17483	null
2024-12-23	A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers	Shuaihang Chen et.al.	2412.17481	link
2024-12-23	CALLIC: Content Adaptive Learning for Lossless Image Compression	Daxin Li et.al.	2412.17464	null
2024-12-23	Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning	Xiaodan Chen et.al.	2412.17456	null
2024-12-23	Applying LLM and Topic Modelling in Psychotherapeutic Contexts	Alexander Vanin et.al.	2412.17449	null
2024-12-23	Measuring Contextual Informativeness in Child-Directed Text	Maria Valentini et.al.	2412.17427	link
2024-12-23	Multimodal Preference Data Synthetic Alignment with Reward Model	Robert Wijaya et.al.	2412.17417	link
2024-12-23	VidCtx: Context-aware Video Question Answering with Image Models	Andreas Goulas et.al.	2412.17415	null
2024-12-23	Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance	Muhammad Reza Qorib et.al.	2412.17408	link
2024-12-23	Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning	Huchen Jiang et.al.	2412.17397	null
2024-12-23	WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models	Huawen Feng et.al.	2412.17395	null
2024-12-23	Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement	Hyeonjin Kim et.al.	2412.17387	link
2024-12-23	Interweaving Memories of a Siamese Large Language Model	Xin Song et.al.	2412.17383	link
2024-12-23	MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models	Beibei Yu et.al.	2412.17339	null
2024-12-23	A Dual-Perspective Metaphor Detection Framework Using Large Language Models	Yujie Lin et.al.	2412.17332	link
2024-12-23	Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance	Nicolas Devatine et.al.	2412.17321	null
2024-12-23	CodeV: Issue Resolving with Visual Data	Linhao Zhang et.al.	2412.17315	link
2024-12-23	Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories	Mahan Tafreshipour et.al.	2412.17298	null
2024-12-23	Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples	Taewoong Kim et.al.	2412.17288	link
2024-12-23	LLM4AD: A Platform for Algorithm Design with Large Language Model	Fei Liu et.al.	2412.17287	link
2024-12-23	Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning	Rui Liang et.al.	2412.17285	null
2024-12-23	Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach	Rafid Ishrak Jahan et.al.	2412.17255	link
2024-12-23	SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval	Xiaopeng Li et.al.	2412.17250	null
2024-12-23	EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling	Zichen Song et.al.	2412.17249	null
2024-12-23	On the Generalization Ability of Machine-Generated Text Detectors	Yule Liu et.al.	2412.17242	link
2024-12-23	Brain-to-Text Benchmark '24: Lessons Learned	Francis R. Willett et.al.	2412.17227	link
2024-12-23	CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder	Lichen Ma et.al.	2412.17225	null
2024-12-22	Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension	Jio Oh et.al.	2412.17189	null
2024-12-22	Foundation Model for Lossy Compression of Spatiotemporal Scientific Data	Xiao Li et.al.	2412.17184	null
2024-12-22	Enhancing Item Tokenization for Generative Recommendation through Self-Improvement	Runjin Chen et.al.	2412.17171	null
2024-12-22	Generative Diffusion Modeling: A Practical Handbook	Zihan Ding et.al.	2412.17162	null
2024-12-22	LLM-based relevance assessment still can't replace human relevance assessment	Charles L. A. Clarke et.al.	2412.17156	null
2024-12-22	LLM Agent for Fire Dynamics Simulations	Leidong Xu et.al.	2412.17146	null
2024-12-22	Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs	Rushendra Sidibomma et.al.	2412.17131	null
2024-12-22	Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models	Cameron R. Jones et.al.	2412.17128	null
2024-12-22	Learning to Adapt to Low-Resource Paraphrase Generation	Zhigen Li et.al.	2412.17111	null
2024-12-22	DreamOmni: Unified Image Generation and Editing	Bin Xia et.al.	2412.17098	null
2024-12-22	Analysis on LLMs Performance for Code Summarization	Md. Ahnaf Akib et.al.	2412.17094	null
2024-12-22	SAIL: Sample-Centric In-Context Learning for Document Information Extraction	Jinyu Zhang et.al.	2412.17092	link
2024-12-22	SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults	Jinzhi Wang et.al.	2412.17077	null
2024-12-22	The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM's Internal States	Fabian Ridder et.al.	2412.17056	link
2024-12-22	DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately	Huiwen Wu et.al.	2412.17053	null
2024-12-22	ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues	Shaina Raza et.al.	2412.17052	link
2024-12-22	Modular Conversational Agents for Surveys and Interviews	Jiangbo Yu et.al.	2412.17049	null
2024-12-22	Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective	Hankun Wang et.al.	2412.17048	null
2024-12-22	Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation	Luoxu Jin et.al.	2412.17042	null
2024-12-22	HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories	Eric Hedlin et.al.	2412.17040	null
2024-12-22	Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model	Kevin Shen et.al.	2412.17039	null
2024-12-22	Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models	Lang Gao et.al.	2412.17034	null
2024-12-22	MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge	Jie He et.al.	2412.17032	null
2024-12-22	FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos	Zhengqian Wu et.al.	2412.17022	link
2024-12-22	GAS: Generative Auto-bidding with Post-training Search	Yewen Li et.al.	2412.17018	null
2024-12-22	Robustness of Large Language Models Against Adversarial Attacks	Yiyi Tao et.al.	2412.17011	null
2024-12-22	InterDance:Reactive 3D Dance Generation with Realistic Duet Interactions	Ronghui Li et.al.	2412.16982	null
2024-12-22	On Fusing ChatGPT and Ensemble Learning in Discon-tinuous Named Entity Recognition in Health Corpora	Tzu-Chieh Chen et.al.	2412.16976	null
2024-12-22	Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs	Alexander von Recum et.al.	2412.16974	null
2024-12-22	Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach	Chunxu Zhang et.al.	2412.16969	link
2024-12-22	System-2 Mathematical Reasoning via Enriched Instruction Tuning	Huanqia Cai et.al.	2412.16964	null
2024-12-22	Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework	Jundong Xu et.al.	2412.16953	null
2024-12-22	A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation	Ekai Hashimoto et.al.	2412.16943	null
2024-12-22	Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering	Zhongjian Hu et.al.	2412.16936	null
2024-12-22	Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models	Kai Zheng et.al.	2412.16933	null
2024-12-22	Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs	Bohan Jin et.al.	2412.16922	null
2024-12-22	Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection	Yuhang Gan et.al.	2412.16918	null
2024-12-22	Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation	Quan Dao et.al.	2412.16906	null
2024-12-22	Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model	Songjun Tu et.al.	2412.16878	link
2024-12-20	HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding	Chenxin Tao et.al.	2412.16158	null
2024-12-20	Can Generative Video Models Help Pose Estimation?	Ruojin Cai et.al.	2412.16155	null
2024-12-20	Offline Reinforcement Learning for LLM Multi-Step Reasoning	Huaijie Wang et.al.	2412.16145	link
2024-12-20	Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation	Seyedreza Mohseni et.al.	2412.16135	null
2024-12-20	Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information	Dirk Bergemann et.al.	2412.16132	null
2024-12-20	PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics	Daniil Larionov et.al.	2412.16120	null
2024-12-20	Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts	Muhammad Abdullah Sohail et.al.	2412.16119	link
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	link
2024-12-20	The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse	Mahyar Habibi et.al.	2412.16114	null
2024-12-20	Logical Consistency of Large Language Models in Fact-checking	Bishwamittra Ghosh et.al.	2412.16100	null
2024-12-20	The Evolution of LLM Adoption in Industry Data Curation Practices	Crystal Qian et.al.	2412.16089	null
2024-12-20	Efficient MedSAMs: Segment Anything in Medical Images on Laptop	Jun Ma et.al.	2412.16085	link
2024-12-20	Formal Mathematical Reasoning: A New Frontier in AI	Kaiyu Yang et.al.	2412.16075	null
2024-12-20	The Only Way is Ethics: A Guide to Ethical Research with Large Language Models	Eddie L. Ungless et.al.	2412.16022	link
2024-12-20	Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support	Qijiong Liu et.al.	2412.15973	link
2024-12-20	From General to Specific: Tailoring Large Language Models for Personalized Healthcare	Ruize Shi et.al.	2412.15957	null
2024-12-20	Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring	Markus Borg et.al.	2412.15948	null
2024-12-20	Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation	Gautier Evennou et.al.	2412.15939	link
2024-12-20	Large Language Model assisted Hybrid Fuzzing	Ruijie Meng et.al.	2412.15931	null
2024-12-20	MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection	Andrea Moglia et.al.	2412.15925	link
2024-12-20	RiTTA: Modeling Event Relations in Text-to-Audio Generation	Yuhang He et.al.	2412.15922	link
2024-12-20	Less is More: Towards Green Code Large Language Models via Unified Structural Pruning	Guang Yang et.al.	2412.15921	null
2024-12-20	Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model	Yosuke Yamagishi et.al.	2412.15907	null
2024-12-20	Evaluation of Reliability Criteria for News Publishers with Large Language Models	Manuel Pratelli et.al.	2412.15896	null
2024-12-20	TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain	Camille Barboule et.al.	2412.15891	null
2024-12-20	AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI	Katja Bühler et.al.	2412.15876	null
2024-12-20	Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback	Jiaming Ji et.al.	2412.15838	link
2024-12-20	WebLLM: A High-Performance In-Browser LLM Inference Engine	Charlie F. Ruan et.al.	2412.15803	link
2024-12-20	Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning	Sungjin Park et.al.	2412.15797	null
2024-12-20	GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning	Heming Zhang et.al.	2412.15790	null
2024-12-20	Linguistic Features Extracted by GPT-4 Improve Alzheimer's Disease Detection based on Spontaneous Speech	Jonathan Heitz et.al.	2412.15772	link
2024-12-20	Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference	Jorge García-Carrasco et.al.	2412.15750	link
2024-12-20	Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models	Shamus Sim et.al.	2412.15748	null
2024-12-20	VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models	Dexter Neo et.al.	2412.15739	null
2024-12-20	AutoLife: Automatic Life Journaling with Smartphones and LLMs	Huatao Xu et.al.	2412.15714	null
2024-12-20	Contrastive Learning for Task-Independent SpeechLLM-Pretraining	Maike Züfle et.al.	2412.15712	link
2024-12-20	Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback	Niklas Ippisch et.al.	2412.15702	null
2024-12-20	Code Review Automation Via Multi-task Federated LLM -- An Empirical Study	Jahnavi Kumar et.al.	2412.15676	null
2024-12-20	Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline	Guancheng Zeng et.al.	2412.15660	null
2024-12-20	Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class	Annie D'souza et.al.	2412.15657	null
2024-12-20	MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula	Sieun Hyeon et.al.	2412.15655	link
2024-12-20	Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution	Wentao Tan et.al.	2412.15650	null
2024-12-20	Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model	Xin Du et.al.	2412.15634	link
2024-12-20	Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning?	Mengyu Ye et.al.	2412.15628	null
2024-12-20	JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs	Hongyi Li et.al.	2412.15623	null
2024-12-20	Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage	Zhi Gao et.al.	2412.15606	null
2024-12-20	Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks	Brian J Chan et.al.	2412.15605	link
2024-12-20	Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification	Gyutae Park et.al.	2412.15603	null
2024-12-20	Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation	Xiaoqiang Kang et.al.	2412.15594	link
2024-12-20	NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization	Danial Kamali et.al.	2412.15588	link
2024-12-20	To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models	Jessica Y. Bo et.al.	2412.15584	null
2024-12-20	A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation	Ryien Hosseini et.al.	2412.15582	null
2024-12-20	Score-based Generative Diffusion Models for Social Recommendations	Chengyi Liu et.al.	2412.15579	link
2024-12-20	QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning	Xinyang Tong et.al.	2412.15576	null
2024-12-20	J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM	Takero Yoshida et.al.	2412.15574	null
2024-12-20	Continual Learning Using a Kernel-Based Method Over Foundation Models	Saleh Momeni et.al.	2412.15571	link
2024-12-20	DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation	Yichun Tai et.al.	2412.15570	link
2024-12-20	In-context Continual Learning Assisted by an External Continual Learner	Saleh Momeni et.al.	2412.15563	null
2024-12-20	NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning	Zheyuan Zhang et.al.	2412.15547	null
2024-12-20	MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering	Zhang Siyue et.al.	2412.15540	null
2024-12-20	XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation	Qianren Mao et.al.	2412.15529	link
2024-12-20	HREF: Human Response-Guided Evaluation of Instruction Following in Language Models	Xinxi Lyu et.al.	2412.15524	link
2024-12-20	PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time	Alireza Pourali et.al.	2412.15519	link
2024-12-20	Stylish and Functional: Guided Interpolation Subject to Physical Constraints	Yan-Ying Chen et.al.	2412.15507	null
2024-12-20	Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework	Zhenjie Xu et.al.	2412.15504	link
2024-12-20	Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models	Zhisheng Tang et.al.	2412.15501	null
2024-12-20	TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use	Junjie Ye et.al.	2412.15495	link
2024-12-20	PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search	Jiaxin Wu et.al.	2412.15494	null
2024-12-20	GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators	Hengjia Li et.al.	2412.15491	null
2024-12-20	Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage	Saehyung Lee et.al.	2412.15484	null
2024-12-20	Continual Learning Using Only Large Language Model Prompting	Jiabao Qiu et.al.	2412.15479	null
2024-12-19	TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models	Ammar N. Abbas et.al.	2412.15462	null
2024-12-19	Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization	Sahil Wadhwa et.al.	2412.15453	null
2024-12-19	AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals	Angela Mastrianni et.al.	2412.15444	null
2024-12-19	SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval	Aakash Mahalingam et.al.	2412.15443	null
2024-12-19	Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models	Tianchen Zhang et.al.	2412.15431	null
2024-12-19	MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale	Swapnil Gandhi et.al.	2412.15411	null
2024-12-19	Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification	Edoardo Allegrini et.al.	2412.15410	null
2024-12-19	Systematic Evaluation of Long-Context LLMs on Financial Concepts	Lavanya Gupta et.al.	2412.15386	null
2024-12-19	Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation	Joanne Boisson et.al.	2412.15375	link
2024-12-19	Automated Root Cause Analysis System for Complex Data Products	Mathieu Demarne et.al.	2412.15374	null
2024-12-19	Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs	Liam Seymour et.al.	2412.15352	link
2024-12-19	Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models	Reza Shirkavand et.al.	2412.15341	null
2024-12-19	Complete background cosmology of parity-even quadratic metric-affine gravity	Thomas Dyer et.al.	2412.15329	null
2024-12-19	OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Shuo Xing et.al.	2412.15208	link
2024-12-19	MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark	Qihao Zhao et.al.	2412.15194	link
2024-12-19	LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation	Weijia Shi et.al.	2412.15188	null
2024-12-19	Tiled Diffusion	Or Madar et.al.	2412.15185	null
2024-12-19	Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning	Simon Frieder et.al.	2412.15184	null
2024-12-19	STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning	Marius Memmel et.al.	2412.15182	null
2024-12-19	HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages	Aman Chaturvedi et.al.	2412.15178	null
2024-12-19	Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Federico Castagna et.al.	2412.15177	link
2024-12-19	Rethinking Uncertainty Estimation in Natural Language Generation	Lukas Aichberger et.al.	2412.15176	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Language Models as Continuous Self-Evolving Data Engineers	Peidong Wang et.al.	2412.15151	null
2024-12-19	Jet: A Modern Transformer-Based Normalizing Flow	Alexander Kolesnikov et.al.	2412.15129	null
2024-12-19	Adaptive Pruning for Large Language Models with Structural Importance Awareness	Haotian Zheng et.al.	2412.15127	null
2024-12-19	Outcome-Refining Process Supervision for Code Generation	Zhuohao Yu et.al.	2412.15118	link
2024-12-19	Qwen2.5 Technical Report	Qwen et.al.	2412.15115	link
2024-12-19	Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture	Thomas F Burns et.al.	2412.15113	link
2024-12-19	Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation	Yang Tian et.al.	2412.15109	link
2024-12-19	Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability	Xiangsen Chen et.al.	2412.15101	null
2024-12-19	Nano-ESG: Extracting Corporate Sustainability Information from News Articles	Fabian Billert et.al.	2412.15093	link
2024-12-19	Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation	Haoran Liu et.al.	2412.15086	null
2024-12-19	ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots	Bhupendra Acharya et.al.	2412.15072	null
2024-12-19	ConfliBERT: A Language Model for Political Conflict	Patrick T. Brandt et.al.	2412.15060	link
2024-12-19	LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps	Felix Friedrich et.al.	2412.15035	null
2024-12-19	DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space	Mang Ning et.al.	2412.15032	link
2024-12-19	Large Language Models and Code Security: A Systematic Literature Review	Enna Basic et.al.	2412.15004	null
2024-12-19	HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs	Pham Vu Tuan Dat et.al.	2412.14995	link
2024-12-19	RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning	Raphael Memmesheimer et.al.	2412.14989	null
2024-12-19	Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts	Ioana Buhnila et.al.	2412.14986	null
2024-12-19	AI and Cultural Context: An Empirical Investigation of Large Language Models' Performance on Chinese Social Work Professional Standards	Zia Qi et.al.	2412.14971	null
2024-12-19	Movie2Story: A framework for understanding videos and telling stories in the form of novel text	Kangning Li et.al.	2412.14965	null
2024-12-19	Knowledge Injection via Prompt Distillation	Kalle Kujanpää et.al.	2412.14964	null
2024-12-19	Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities	Daniil Medyakov et.al.	2412.14935	null
2024-12-19	RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response	Junyu Luo et.al.	2412.14922	link
2024-12-19	Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation	Zexiong Ma et.al.	2412.14905	null
2024-12-19	Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering	Peize Li et.al.	2412.14880	null
2024-12-19	Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering	Imed Keraghel et.al.	2412.14867	null
2024-12-19	Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling	Junyi Li et.al.	2412.14860	null
2024-12-19	DS $^2$ -ABSA: Dual-Stream Data Synthesis with Label Refinement for Few-Shot Aspect-Based Sentiment Analysis	Hongling Xu et.al.	2412.14849	link
2024-12-19	Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas	Pietro Bernardelle et.al.	2412.14843	null
2024-12-19	Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis	Greta Dolcetti et.al.	2412.14841	null
2024-12-19	Progressive Multimodal Reasoning via Active Retrieval	Guanting Dong et.al.	2412.14835	null
2024-12-19	Answer Set Networks: Casting Answer Set Programming into Deep Learning	Arseny Skryagin et.al.	2412.14814	link
2024-12-19	ResoFilter: Rine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis	Zeao Tu et.al.	2412.14809	link
2024-12-19	Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning	Ziang Ye et.al.	2412.14780	null
2024-12-19	ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine	Rabee Qasem et.al.	2412.14771	null
2024-12-19	PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children	Yiqun Zhang et.al.	2412.14769	link
2024-12-19	CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering	Ruida Hu et.al.	2412.14764	link
2024-12-19	Query pipeline optimization for cancer patient question answering systems	Maolin He et.al.	2412.14751	null
2024-12-19	Active Inference and Human--Computer Interaction	Roderick Murray-Smith et.al.	2412.14741	null
2024-12-19	On Verbalized Confidence Scores for LLMs	Daniel Yang et.al.	2412.14737	link
2024-12-19	Creation of AI-driven Smart Spaces for Enhanced Indoor Environments -- A Survey	Aygün Varol et.al.	2412.14708	null
2024-12-19	LLMs as mediators: Can they diagnose conflicts accurately?	Özgecan Koçak et.al.	2412.14675	null
2024-12-19	Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT	Hassane Kissane et.al.	2412.14670	null
2024-12-19	IOHunter: Graph Foundation Model to Uncover Online Information Operations	Marco Minici et.al.	2412.14663	link
2024-12-19	Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models	Zijun Chen et.al.	2412.14660	link
2024-12-19	Length Controlled Generation for Black-box LLMs	Yuxuan Gu et.al.	2412.14656	null
2024-12-19	Learning to Generate Research Idea with Dynamic Control	Ruochen Li et.al.	2412.14626	null
2024-12-19	How good is GPT at writing political speeches for the White House?	Jacques Savoy et.al.	2412.14617	null
2024-12-19	Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning	Kepu Zhang et.al.	2412.14588	null
2024-12-19	HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning	Minkuk Kim et.al.	2412.14585	null
2024-12-19	Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues	Tao He et.al.	2412.14584	null
2024-12-19	CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation	Youngwon Lee et.al.	2412.14581	null
2024-12-19	DiffSim: Taming Diffusion Models for Evaluating Visual Similarity	Yiren Song et.al.	2412.14580	link
2024-12-19	Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models	Wenhan Liu et.al.	2412.14574	link
2024-12-19	ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model	Shunlin Lu et.al.	2412.14559	null
2024-12-19	The Current Challenges of Software Engineering in the Era of Large Language Models	Cuiyun Gao et.al.	2412.14554	null
2024-12-19	Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models	Xiao Cui et.al.	2412.14528	link
2024-12-19	Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment	Teng Xiao et.al.	2412.14516	link
2024-12-19	Relational Programming with Foundation Models	Ziyang Li et.al.	2412.14515	null
2024-12-19	PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization	Jiayi Wu et.al.	2412.14510	link
2024-12-19	Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs	Yuzuki Arai et.al.	2412.14501	null
2024-12-19	Guided Diffusion Model for Sensor Data Obfuscation	Xin Yang et.al.	2412.14499	null
2024-12-19	FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis	Abdullah Khan et.al.	2412.14492	link
2024-12-19	Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities	Amandeep Kaur et.al.	2412.14486	null
2024-12-19	DirectorLLM for Human-Centric Video Generation	Kunpeng Song et.al.	2412.14484	null
2024-12-19	Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs	Koshiro Saito et.al.	2412.14471	null
2024-12-19	Agent-SafetyBench: Evaluating the Safety of LLM Agents	Zhexin Zhang et.al.	2412.14470	link
2024-12-19	From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research	Xiang Cheng et.al.	2412.14461	null
2024-12-19	LEDiff: Latent Exposure Diffusion for HDR Generation	Chao Wang et.al.	2412.14456	null
2024-12-19	Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems	Genki Kusano et.al.	2412.14454	null
2024-12-19	Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation	Shengqi Liu et.al.	2412.14453	null
2024-12-19	ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study	Eric Modesitt et.al.	2412.14436	link
2024-12-19	All-in-One Tuning and Structural Pruning for Domain-Specific LLMs	Lei Lu et.al.	2412.14426	null
2024-12-19	FedPIA -- Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning	Pramit Saha et.al.	2412.14424	null
2024-12-19	Enhancing Diffusion Models for High-Quality Image Generation	Jaineet Shah et.al.	2412.14422	null
2024-12-18	ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers	Haowei Liu et.al.	2412.14405	null
2024-12-18	Clinical Trials Ontology Engineering with Large Language Models	Berkan Çakır et.al.	2412.14387	null
2024-12-18	ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling	William Han et.al.	2412.14373	link
2024-12-18	Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation	Yuxuan Jiang et.al.	2412.14368	null
2024-12-18	Surrealistic-like Image Generation with Vision-Language Models	Elif Ayten et.al.	2412.14366	link
2024-12-18	ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals	Utkarsh Saxena et.al.	2412.14363	link
2024-12-18	A Unifying Information-theoretic Perspective on Evaluating Generative Models	Alexis Fox et.al.	2412.14340	null
2024-12-18	Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation	Benjamin Steenhoek et.al.	2412.14308	null
2024-12-18	Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs	David Restrepo et.al.	2412.14304	null
2024-12-18	Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data	haina Raza et.al.	2412.14276	link
2024-12-18	Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces	Jihan Yang et.al.	2412.14171	link
2024-12-18	MetaMorph: Multimodal Understanding and Generation via Instruction Tuning	Shengbang Tong et.al.	2412.14164	null
2024-12-18	TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks	Frank F. Xu et.al.	2412.14161	link
2024-12-18	Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models	Atin Sakkeer Hussain et.al.	2412.14146	null
2024-12-18	LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research	Tianyang Gu et.al.	2412.14141	null

(back to top)

Video Understanding

Publish Date	Title	Authors	PDF	Code
2025-01-31	Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search	Yuta Oshima et.al.	2501.19252	null
2025-01-31	$\infty$ -Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation	Saul Santos et.al.	2501.19098	link
2025-01-30	Every Image Listens, Every Image Dances: Music-Driven Image Animation	Zhikang Dong et.al.	2501.18801	null
2025-01-30	MAMS: Model-Agnostic Module Selection Framework for Video Captioning	Sangho Lee et.al.	2501.18269	null
2025-01-28	Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding	Yun Li et.al.	2501.16786	null
2025-01-28	CascadeV: An Implementation of Wurstchen Architecture for Video Generation	Wenfeng Lin et.al.	2501.16612	link
2025-01-27	AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models	Zheng Lian et.al.	2501.16566	null
2025-01-27	Understanding Long Videos via LLM-Powered Entity Relation Graphs	Meng Chu et.al.	2501.15953	null
2025-01-26	TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding	Xingjian Zhang et.al.	2501.15513	link
2025-01-26	"See What I Imagine, Imagine What I See": Human-AI Co-Creation System for 360 $^\circ$ Panoramic Video Generation in VR	Yunge Wen et.al.	2501.15456	null
2025-01-25	HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding	Jiaxing Zhao et.al.	2501.15111	null
2025-01-25	VideoPure: Diffusion-based Adversarial Purification for Video Recognition	Kaixun Jiang et.al.	2501.14999	link
2025-01-11	HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators	Le Chen et.al.	2501.14794	null
2025-01-24	VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking	Runyi Hu et.al.	2501.14195	link
2025-01-24	ENTER: Event Based Interpretable Reasoning for VideoQA	Hammad Ayyubi et.al.	2501.14194	null
2025-01-30	Temporal Preference Optimization for Long-Form Video Understanding	Rui Li et.al.	2501.13919	null
2025-01-23	Improving Video Generation with Human Feedback	Jie Liu et.al.	2501.13918	null
2025-01-23	ReasVQA: Advancing VideoQA with Imperfect Reasoning Process	Jianxin Liang et.al.	2501.13536	null
2025-01-23	Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge	Haomiao Xiong et.al.	2501.13468	link
2025-01-23	EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion	Jiangchuan Wei et.al.	2501.13452	null
2025-01-28	VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding	Boqiang Zhang et.al.	2501.13106	link
2025-01-21	Taming Teacher Forcing for Masked Autoregressive Video Generation	Deyu Zhou et.al.	2501.12389	null
2025-01-22	InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling	Yi Wang et.al.	2501.12386	link
2025-01-21	MMVU: Measuring Expert-Level Multi-Discipline Video Understanding	Yilun Zhao et.al.	2501.12380	link
2025-01-22	Video Depth Anything: Consistent Depth Estimation for Super-Long Videos	Sili Chen et.al.	2501.12375	null
2025-01-21	InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model	Yuhang Zang et.al.	2501.12368	link
2025-01-20	GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video	Zhenliang Ni et.al.	2501.11340	null
2025-01-20	CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation	Zheng Chong et.al.	2501.11325	null
2025-01-23	HFGCN:Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition	Pengcheng Dong et.al.	2501.11007	null
2025-01-18	EMO2: End-Effector Guided Audio-Driven Avatar Video Generation	Linrui Tian et.al.	2501.10687	null
2025-01-17	DiffuEraser: A Diffusion Model for Video Inpainting	Xiaowen Li et.al.	2501.10018	link
2025-01-17	RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation	Yuefan Cao et.al.	2501.09982	null
2025-01-16	VideoWorld: Exploring Knowledge Learning from Unlabeled Videos	Zhongwei Ren et.al.	2501.09781	null
2025-01-16	Learnings from Scaling Visual Tokenizers for Reconstruction and Generation	Philippe Hansen-Estruch et.al.	2501.09755	null
2025-01-14	Do generative video models learn physical principles from watching videos?	Saman Motamed et.al.	2501.09038	link
2025-01-15	Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion	Jingyuan Chen et.al.	2501.09019	null
2025-01-15	RepVideo: Rethinking Cross-Layer Representation for Video Generation	Chenyang Si et.al.	2501.08994	null
2025-01-15	Admitting Ignorance Helps the Video Question Answering Models to Answer	Haopeng Li et.al.	2501.08771	null
2025-01-31	Comprehensive Subjective and Objective Evaluation Method for Text-generated Video	Zelu Qi et.al.	2501.08545	null
2025-01-14	Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models	Weichen Fan et.al.	2501.08453	null
2025-01-14	3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering	Meenakshi Krishnan et.al.	2501.08370	null
2025-01-14	Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks	Miran Heo et.al.	2501.08326	null
2025-01-14	GameFactory: Creating New Games with Generative Interactive Videos	Jiwen Yu et.al.	2501.08325	null
2025-01-14	Diffusion Adversarial Post-Training for One-Step Video Generation	Shanchuan Lin et.al.	2501.08316	null
2025-01-17	LayerAnimate: Layer-specific Control for Animation	Yuxue Yang et.al.	2501.08295	null
2025-01-14	FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors	Yabo Zhang et.al.	2501.08225	link
2025-01-14	Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness	Jiaxing Zhao et.al.	2501.07978	null
2025-01-24	Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding	Liping Yuan et.al.	2501.07888	link
2025-01-14	AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation	Sitong Gong et.al.	2501.07810	link
2025-01-13	BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations	Weixi Feng et.al.	2501.07647	null
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-17	MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning	Tieyuan Chen et.al.	2501.07227	null
2025-01-13	TimeLogic: A Temporal Logic Benchmark for Video QA	Sirnam Swetha et.al.	2501.07214	null
2025-01-13	Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling	Jiebin Yan et.al.	2501.07087	null
2025-01-12	X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding	Wenqi Zhou et.al.	2501.06835	null
2025-01-12	VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning	Ji Soo Lee et.al.	2501.06761	link
2025-01-11	Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning	Maomao Li et.al.	2501.06438	null
2025-01-10	MEt3R: Measuring Multi-View Consistency in Generated Images	Mohammad Asim et.al.	2501.06336	null
2025-01-10	Multi-subject Open-set Personalization in Video Generation	Tsai-Shien Chen et.al.	2501.06187	null
2025-01-10	VideoAuteur: Towards Long Narrative Video Generation	Junfei Xiao et.al.	2501.06173	null
2025-01-13	Valley2: Exploring Multimodal Models with Scalable Vision-Language Design	Ziheng Wu et.al.	2501.05901	link
2025-01-10	Zero-shot Shark Tracking and Biometrics from Aerial Imagery	Chinmay K Lalgudi et.al.	2501.05717	null
2025-01-10	From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities	Dominick Reilly et.al.	2501.05711	link
2025-01-09	OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?	Yifei Li et.al.	2501.05510	link
2025-01-08	Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion	Yongjia Ma et.al.	2501.05484	null
2025-01-09	Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces	Aniruddha Mahapatra et.al.	2501.05442	null
2025-01-09	Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning	Huabin Liu et.al.	2501.05069	null
2025-01-09	LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding	Jiaxing Zhao et.al.	2501.05067	null
2025-01-09	LongViTU: Instruction Tuning for Long-Form Video Understanding	Rujie Wu et.al.	2501.05037	null
2025-01-09	ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark	Ronghao Dang et.al.	2501.05031	link
2025-01-08	ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning	Yuzhou Huang et.al.	2501.04698	null
2025-01-08	Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs	Zeyi Huang et.al.	2501.04336	null
2025-01-08	H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving	Siran Chen et.al.	2501.04302	null
2025-01-08	LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition	Bowen Hao et.al.	2501.04204	null
2024-12-18	FlexCache: Flexible Approximate Cache System for Video Diffusion	Desen Sun et.al.	2501.04012	null
2025-01-07	Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers	Yuechen Zhang et.al.	2501.03931	link
2025-01-09	Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control	Zekai Gu et.al.	2501.03847	link
2025-01-07	Motion-Aware Generative Frame Interpolation	Guozhen Zhang et.al.	2501.03699	null
2025-01-06	License Plate Images Generation with Diffusion Models	Mariia Shpir et.al.	2501.03374	null
2025-01-03	Classifier-Guided Captioning Across Modalities	Ariel Shaulov et.al.	2501.03183	null
2025-01-06	Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation	Guy Yariv et.al.	2501.03059	null
2025-01-20	TransPixeler: Advancing Text-to-Video Generation with Transparency	Luozhou Wang et.al.	2501.03006	link
2025-01-06	MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models	Wenyi Hong et.al.	2501.02955	null
2025-01-06	Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising	Yunlong Yuan et.al.	2501.02741	null
2025-01-05	GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking	Weikang Bian et.al.	2501.02690	null
2025-01-29	Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey	Zongxia Li et.al.	2501.02189	link
2025-01-10	Gender Bias in Text-to-Video Generation Models: A case study of Sora	Mohammad Nadeem et.al.	2501.01987	null
2024-12-30	FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models	Tianyu Fu et.al.	2501.01986	link
2025-01-03	JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing	Qili Wang et.al.	2501.01798	link
2025-01-03	HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding	Heqing Zou et.al.	2501.01645	null
2025-01-07	VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control	Yuanpeng Tu et.al.	2501.01427	null
2025-01-02	Unifying Specialized Visual Encoders for Video Language Models	Jihoon Chung et.al.	2501.01426	link
2025-01-03	Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions	Xincheng Shuai et.al.	2501.01425	null
2025-01-02	Multi-Modal Video Feature Extraction for Popularity Prediction	Haixu Liu et.al.	2501.01422	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409	null
2025-01-29	Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform	Cheonsu Jeong et.al.	2501.00750	null
2025-01-03	DreamDrive: Generative 4D Scene Modeling from Street View Images	Jiageng Mao et.al.	2501.00601	null
2025-01-08	VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM	Yuqian Yuan et.al.	2501.00599	link
2024-12-31	Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method	Zhenpeng Huang et.al.	2501.00584	null
2024-12-31	Fine-grained Video-Text Retrieval: A New Benchmark and Method	Yifan Xu et.al.	2501.00513	null
2024-12-31	OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models	Lala Shakti Swarup Ray et.al.	2501.00432	null
2025-01-09	Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding	Yue Fan et.al.	2501.00358	null
2024-12-30	Detection-Fusion for Knowledge Graph Extraction from Videos	Taniya Das et.al.	2501.00136	link
2024-12-30	LTX-Video: Realtime Video Latent Diffusion	Yoav HaCohen et.al.	2501.00103	link
2024-12-30	Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model	Yifei Huang et.al.	2412.21080	link
2024-12-30	VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation	Jiazheng Xu et.al.	2412.21059	link
2024-12-30	Hierarchical Banzhaf Interaction for General Video-Language Representation Learning	Peng Jin et.al.	2412.20964	link
2024-12-30	ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation	Ting Zhang et.al.	2412.20901	null
2024-12-30	Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling	Min Zhang et.al.	2412.20725	null
2025-01-05	ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding	Xiao Wang et.al.	2412.20504	link
2024-12-29	Open-Sora: Democratizing Efficient Video Production for All	Zangwei Zheng et.al.	2412.20404	link
2024-12-28	DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments	Xijun Wang et.al.	2412.20042	null
2025-01-17	MVTamperBench: Evaluating Robustness of Vision-Language Models	Amit Agarwal et.al.	2412.19794	null
2024-12-27	Generative Video Propagation	Shaoteng Liu et.al.	2412.19761	null
2024-12-30	VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models	Tao Wu et.al.	2412.19645	null
2024-12-30	DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT	Xiaotao Hu et.al.	2412.19505	link
2024-12-26	Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries	Roberto Amoroso et.al.	2412.19304	null
2024-12-25	Accelerating Diffusion Transformers with Dual Feature Caching	Chang Zou et.al.	2412.18911	link
2024-12-24	Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation	Faraz Waseem et.al.	2412.18688	null
2024-12-24	Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models	Jinhui Yi et.al.	2412.18609	link
2024-12-24	DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers	Yuntao Chen et.al.	2412.18607	null
2024-12-24	ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation	Hongjie Li et.al.	2412.18600	null
2024-12-24	DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation	Minghong Cai et.al.	2412.18597	link
2024-12-23	Large Motion Video Autoencoding with Cross-modal Video VAE	Yazhou Xing et.al.	2412.17805	null
2024-12-23	VidTwin: Video VAE with Decoupled Structure and Dynamics	Yuchi Wang et.al.	2412.17726	link
2024-12-23	HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data	Ting Zhou et.al.	2412.17574	link
2024-12-23	VidCtx: Context-aware Video Question Answering with Image Models	Andreas Goulas et.al.	2412.17415	null
2024-12-23	FFA Sora, video generation as fundus fluorescein angiography simulator	Xinyuan Wu et.al.	2412.17346	null
2024-12-23	Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory	Xingyao Li et.al.	2412.17254	null
2024-12-22	SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults	Jinzhi Wang et.al.	2412.17077	null
2025-01-08	Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation	Luoxu Jin et.al.	2412.17042	null
2024-12-22	FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos	Zhengqian Wu et.al.	2412.17022	link
2024-12-22	Video Domain Incremental Learning for Human Action Recognition in Home Environments	Yuanda Hu et.al.	2412.16946	null
2024-12-21	GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space	Souhaib Attaiki et.al.	2412.16717	null
2024-12-21	TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models	Haocheng Huang et.al.	2412.16700	null
2024-12-21	VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation	Chi Zhang et.al.	2412.16677	null
2024-12-25	Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance	Beiyuan Zhang et.al.	2412.16495	null
2024-12-18	ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping	Youxin Pang et.al.	2412.16212	null
2024-12-17	Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation	Yiping Wang et.al.	2412.16211	null
2024-12-20	PruneVid: Visual Token Pruning for Efficient Video Large Language Models	Xiaohu Huang et.al.	2412.16117	link
2024-12-20	DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization	Zihan Ding et.al.	2412.15689	null
2024-12-23	CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training	Xiuli Bi et.al.	2412.15646	link
2024-12-20	PolySmart @ TRECVid 2024 Medical Video Question Answering	Jiaxin Wu et.al.	2412.15514	null
2024-12-19	AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation	Moayed Haji-Ali et.al.	2412.15191	null
2024-12-19	Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM	Yatai Ji et.al.	2412.15156	link
2024-12-19	Parallelized Autoregressive Visual Generation	Yuqing Wang et.al.	2412.15119	null
2024-12-19	Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations	Yucheng Hu et.al.	2412.14803	null
2024-12-19	HiCM $^2$ : Hierarchical Compact Memory Modeling for Dense Video Captioning	Minkuk Kim et.al.	2412.14585	null
2024-12-19	Consistent Human Image and Video Generation with Spatially Conditioned Diffusion	Mingdeng Cao et.al.	2412.14531	link
2024-12-19	DirectorLLM for Human-Centric Video Generation	Kunpeng Song et.al.	2412.14484	null
2024-12-18	Learning from Massive Human Videos for Universal Humanoid Pose Control	Jiageng Mao et.al.	2412.14172	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167	null
2024-12-29	AKiRa: Augmentation Kit on Rays for optical video generation	Xi Wang et.al.	2412.14158	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018	null
2024-12-18	InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models	Cong Wei et.al.	2412.14006	link
2024-12-18	Do Language Models Understand Time?	Xi Ding et.al.	2412.13845	link
2024-12-19	G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o	Tony Cheng Tong et.al.	2412.13647	link
2024-12-18	Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning	Yunbin Tu et.al.	2412.13543	null
2024-12-18	Real-time One-Step Diffusion-based Expressive Portrait Videos Generation	Hanzhong Guo et.al.	2412.13479	link
2024-12-18	SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation	Kazuki Shimada et.al.	2412.13462	null
2024-12-17	CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices	Andrei Znobishchev et.al.	2412.13273	null
2025-01-07	MotionBridge: Dynamic Video Inbetweening with Flexible Controls	Maham Tanveer et.al.	2412.13190	null
2024-12-17	VidTok: A Versatile and Open-Source Video Tokenizer	Anni Tang et.al.	2412.13061	link
2024-12-17	FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering	Zheng Cheng et.al.	2412.12833	null
2024-12-17	Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning	Shiping Ge et.al.	2412.12791	link
2024-12-17	ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries	Wangyu Xue et.al.	2412.12675	null
2024-12-16	Can video generation replace cinematographers? Research on the cinematic language of generated video	Xiaozhe Li et.al.	2412.12223	null
2024-12-16	CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding	Guo Chen et.al.	2412.12075	null
2024-12-16	InterDyn: Controllable Interactive Dynamics with Video Diffusion Models	Rick Akkerman et.al.	2412.11785	null
2024-12-16	Generative Inbetweening through Frame-wise Conditions-Driven Video Generation	Tianyi Zhu et.al.	2412.11755	link
2024-12-16	VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting	Muhammet Furkan Ilaslan et.al.	2412.11621	link
2024-12-16	Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning	Zhuyang Xie et.al.	2412.11467	null
2024-12-15	Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition	Yulin Wang et.al.	2412.11228	link
2024-12-15	GenLit: Reformulating Single-Image Relighting as Video Generation	Shrisha Bharadwaj et.al.	2412.11224	null
2024-12-15	DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes	Jinxiu Liu et.al.	2412.11100	null
2024-12-15	Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track	Deepak Gupta et.al.	2412.11056	null
2024-12-20	Video Diffusion Transformers are In-Context Learners	Zhengcong Fei et.al.	2412.10783	link
2024-12-14	Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives	Ji-jun Park et.al.	2412.10720	null
2024-12-13	SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device	Yushu Wu et.al.	2412.10494	null
2024-12-12	VCA: Video Curious Agent for Long Video Understanding	Zeyuan Yang et.al.	2412.10471	null
2024-12-17	SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization	Zhentao Tan et.al.	2412.10443	null
2024-12-11	COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework	Xin Dong et.al.	2412.10435	null
2024-12-13	Apollo: An Exploration of Video Understanding in Large Multimodal Models	Orr Zohar et.al.	2412.10360	null
2024-12-16	TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation	Xingrui Wang et.al.	2412.10275	null
2024-12-19	AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era	Yudong Jiang et.al.	2412.10255	link
2024-12-13	B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens	Zhuqiang Lu et.al.	2412.09919	link
2024-12-16	IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs	Sosuke Yamao et.al.	2412.09907	null
2024-12-13	LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity	Hongjie Wang et.al.	2412.09856	null
2024-12-13	MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion	Xunnong Xu et.al.	2412.09828	null
2024-12-17	ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation	Ali Athar et.al.	2412.09754	null
2024-12-11	Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model	Junqi You et.al.	2412.09647	null
2024-12-16	Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models	Fan Zhang et.al.	2412.09645	link
2024-12-12	Doe-1: Closed-Loop Autonomous Driving with Large World Model	Wenzhao Zheng et.al.	2412.09627	link
2024-12-12	OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation	Weiqi Li et.al.	2412.09623	null
2024-12-12	PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models	Chenyu Yang et.al.	2412.09613	null
2024-12-12	Owl-1: Omni World Model for Consistent Long Video Generation	Yuanhui Huang et.al.	2412.09600	link
2024-12-12	LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors	Yabo Chen et.al.	2412.09597	null
2024-12-12	Neptune: The Long Orbit to Benchmarking Long Video Understanding	Arsha Nagrani et.al.	2412.09582	link
2024-12-12	Video Creation by Demonstration	Yihong Sun et.al.	2412.09551	null
2024-12-12	Agent-based Video Trimming	Lingfeng Yang et.al.	2412.09513	null
2024-12-12	UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer	Delong Liu et.al.	2412.09389	link
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323	null
2024-12-12	InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption	Tiehan Fan et.al.	2412.09283	null
2024-12-12	Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering	Sai Bhargav Rongali et.al.	2412.09230	null
2024-12-12	LVMark: Robust Watermark for latent video diffusion models	MinHyuk Jang et.al.	2412.09122	null
2024-12-12	Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation	Lianrui Mu et.al.	2412.08976	null
2024-12-12	Mojito: Motion Trajectory and Intensity Control for Video Generation	Xuehai He et.al.	2412.08948	null
2024-12-11	Generative Semantic Communication: Architectures, Technologies, and Applications	Jinke Ren et.al.	2412.08642	null
2024-12-13	Physical Informed Driving World Model	Zhuoran Yang et.al.	2412.08410	null
2024-12-11	FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks	Chongkai Gao et.al.	2412.08261	null
2024-12-11	VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation	Zhiqiang Yuan et.al.	2412.08259	null
2024-12-10	3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark	Wufei Ma et.al.	2412.07825	null
2024-12-11	UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics	Xi Chen et.al.	2412.07774	null
2024-12-10	From Slow Bidirectional to Fast Causal Video Generators	Tianwei Yin et.al.	2412.07772	null
2024-12-10	SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Jianhong Bai et.al.	2412.07760	link
2024-12-10	3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation	Xiao Fu et.al.	2412.07759	null
2024-12-10	Multi-Shot Character Consistency for Text-to-Video Generation	Yuval Atzmon et.al.	2412.07750	null
2024-12-10	StyleMaster: Stylize Your Video with Artistic Generation and Translation	Zixuan Ye et.al.	2412.07744	null
2024-12-10	STIV: Scalable Text and Image Conditioned Video Generation	Zongyu Lin et.al.	2412.07730	null
2024-12-10	ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer	Jinyi Hu et.al.	2412.07720	link
2024-12-10	GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning	Yicheng Wang et.al.	2412.07704	null
2024-12-10	Multimodal Contextualized Support for Enhancing Video Retrieval System	Quoc-Bao Nguyen-Le et.al.	2412.07584	null
2024-12-19	Multi-Scale Contrastive Learning for Video Temporal Grounding	Thong Thanh Nguyen et.al.	2412.07157	null
2024-12-09	SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations	Zhaorun Chen et.al.	2412.06878	null
2024-12-09	VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features	Sifei Li et.al.	2412.06296	null
2024-12-11	Towards Long Video Understanding via Fine-detailed Video Story Generation	Zeng You et.al.	2412.06182	null
2024-12-08	Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training	Zhenghong Zhou et.al.	2412.06029	null
2024-12-08	FlexDiT: Dynamic Token Density Control for Diffusion Transformer	Shuning Chang et.al.	2412.06028	null
2024-12-10	Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation	Hyeonho Jeong et.al.	2412.06016	null
2024-12-08	Accelerating Video Diffusion Models via Distribution Matching	Yuanzhi Zhu et.al.	2412.05899	null
2024-12-08	MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation	Shuwei Shi et.al.	2412.05848	null
2024-12-08	Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval	Shanti Stewart et.al.	2412.05831	null
2024-12-08	Self-Guidance: Boosting Flow and Diffusion Generation on Their Own	Tiancheng Li et.al.	2412.05827	null
2024-12-07	Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation	Leonardo Pina et.al.	2412.05694	null
2024-12-11	Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model	Lening Wang et.al.	2412.05280	link
2024-12-17	Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling	Zhe Chen et.al.	2412.05271	link
2024-12-06	Mind the Time: Temporally-Controlled Multi-Event Video Generation	Ziyi Wu et.al.	2412.05263	null
2024-12-11	LinVT: Empower Your Image-level Large Language Model to Understand Videos	Lishuai Gao et.al.	2412.05185	link
2024-12-06	Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection	Khurram Azeem Hashmi et.al.	2412.04915	null
2024-12-06	UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving	Rui Chen et.al.	2412.04842	link
2024-12-12	Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model	Keunwoo Peter Yu et.al.	2412.04729	null
2024-12-05	Using Diffusion Priors for Video Amodal Segmentation	Kaihua Chen et.al.	2412.04623	null

(back to top)

Name		Name	Last commit message	Last commit date
Latest commit History 115 Commits
.github/workflows		.github/workflows
docs		docs
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2025.02.04

LLM Reasoning

LLM Evaluation

LLM MLLM

Video Understanding

About

Releases

Packages

Languages

Xuchen-Li/llm-arxiv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2025.02.04

LLM Reasoning

LLM Evaluation

LLM MLLM

Video Understanding

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages