Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-01-31 | Reward-Guided Speculative Decoding for Efficient LLM Reasoning | Baohao Liao et.al. | 2501.19324 | null |
2025-01-31 | BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Han Zhong et.al. | 2501.18858 | null |
2025-01-28 | A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process | Jack David Carson et.al. | 2501.16783 | null |
2025-01-27 | Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations | Pablo Valenzuela-Toledo et.al. | 2501.16495 | null |
2025-01-27 | Large Models in Dialogue for Active Perception and Anomaly Detection | Tzoulio Chamiti et.al. | 2501.16300 | link |
2025-01-26 | TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs | Yuxuan Gu et.al. | 2501.15674 | null |
2025-01-28 | Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning | Zeyu Gan et.al. | 2501.15602 | link |
2025-01-26 | Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework | Yuhong Sun et.al. | 2501.15581 | null |
2025-01-24 | Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains | Xu Chu et.al. | 2501.14431 | null |
2025-01-24 | GraphBC: Improving LLMs for Better Graph Data Processing | Xu Chu et.al. | 2501.14427 | null |
2025-01-23 | Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks | Chang Gong et.al. | 2501.13731 | null |
2025-01-22 | EvidenceMap: Unleashing the Power of Small Language Models with Evidence Analysis for Biomedical Question Answering | Chang Zong et.al. | 2501.12746 | null |
2025-01-17 | LLM Reasoner and Automated Planner: A new NPC approach | Israel Puerta-Merino et.al. | 2501.10106 | null |
2025-01-22 | FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs | Zengyi Gao et.al. | 2501.09957 | null |
2025-01-17 | Evolving Deeper LLM Thinking | Kuang-Huei Lee et.al. | 2501.09891 | null |
2025-01-23 | Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models | Fengli Xu et.al. | 2501.09686 | null |
2025-01-14 | Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data | Jiaxing Qiu et.al. | 2501.08413 | link |
2025-01-14 | Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning | Haoyu Han et.al. | 2501.07845 | null |
2025-01-08 | Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations | Archita Srivastava et.al. | 2501.04675 | null |
2025-01-08 | Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting | Dong-Hai Zhu et.al. | 2501.04341 | link |
2025-01-07 | Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation | Alireza Salemi et.al. | 2501.04167 | null |
2025-01-06 | KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models | Zaiyi Zheng et.al. | 2501.02711 | null |
2025-01-04 | Table as Thought: Exploring Structured Thoughts in LLM Reasoning | Zhenjie Sun et.al. | 2501.02152 | null |
2025-01-03 | Recursive Decomposition of Logical Thoughts: Framework for Superior Reasoning and Knowledge Propagation in Large Language Models | Kaleem Ullah Qasim et.al. | 2501.02026 | null |
2025-01-02 | Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search | Shuangtao Li et.al. | 2501.01478 | null |
2025-01-02 | HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation | Runsong Jia et.al. | 2501.01203 | null |
2025-01-03 | Enhancing LLM Reasoning with Multi-Path Collaborative Reactive and Reflection agents | Chengbo He et.al. | 2501.00430 | null |
2024-12-31 | EQUATOR: A Deterministic Framework for Evaluating LLM Reasoning with Open-Ended Questions. # v1.0.0-beta | Raymond Bernard et.al. | 2501.00257 | null |
2024-12-30 | Efficiently Serving LLM Reasoning Programs with Certaindex | Yichao Fu et.al. | 2412.20993 | null |
2024-12-28 | LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning | Shuguang Chen et.al. | 2412.20227 | null |
2024-12-31 | Token-Budget-Aware LLM Reasoning | Tingxu Han et.al. | 2412.18547 | link |
2024-12-23 | StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs | Hailin Chen et.al. | 2412.18011 | null |
2024-12-22 | Evaluating LLM Reasoning in the Operations Research Domain with ORQA | Mahdi Mostajabdaveh et.al. | 2412.17874 | link |
2024-12-20 | PruneVid: Visual Token Pruning for Efficient Video Large Language Models | Xiaohu Huang et.al. | 2412.16117 | link |
2024-12-19 | Eliciting Causal Abilities in Large Language Models for Reasoning Tasks | Yajing Wang et.al. | 2412.15314 | link |
2024-12-19 | Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Federico Castagna et.al. | 2412.15177 | link |
2024-12-19 | FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis | Abdullah Khan et.al. | 2412.14492 | link |
2024-12-18 | Cognition Chain for Explainable Psychological Stress Detection on Social Media | Xin Wang et.al. | 2412.14009 | null |
2024-12-18 | Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games | Wenye Lin et.al. | 2412.13602 | null |
2024-12-17 | ClarityEthic: Explainable Moral Judgment Utilizing Contrastive Ethical Insights from Large Language Models | Yuxi Sun et.al. | 2412.12848 | null |
2024-12-12 | A NotSo Simple Way to Beat Simple Bench | Soham Sane et.al. | 2412.12173 | null |
2024-12-11 | What Makes In-context Learning Effective for Mathematical Reasoning: A Theoretical Analysis | Jiayu Liu et.al. | 2412.12157 | null |
2024-12-24 | Stepwise Reasoning Error Disruption Attack of LLMs | Jingyu Peng et.al. | 2412.11934 | null |
2024-12-15 | SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation | Hang Zhang et.al. | 2412.11026 | null |
2024-12-15 | Entropy-Regularized Process Reward Model | Hanning Zhang et.al. | 2412.11006 | link |
2024-12-14 | Chasing Progress, Not Perfection: Revisiting Strategies for End-to-End LLM Plan Generation | Sukai Huang et.al. | 2412.10675 | null |
2024-12-14 | Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data | Xue Wu et.al. | 2412.10654 | null |
2024-12-13 | Atomic Learning Objectives Labeling: A High-Resolution Approach for Physics Education | Naiming Liu et.al. | 2412.09914 | null |
2024-12-12 | Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning | Zhenni Bi et.al. | 2412.09078 | null |
2024-12-11 | Training Large Language Models to Reason in a Continuous Latent Space | Shibo Hao et.al. | 2412.06769 | null |
2025-01-23 | GameArena: Evaluating LLM Reasoning through Live Computer Games | Lanxiang Hu et.al. | 2412.06394 | null |
2024-12-08 | Language hooks: a modular framework for augmenting LLM reasoning that decouples tool usage from the model and its prompt | Damien de Mijolla et.al. | 2412.05967 | null |
2024-12-05 | SocialMind: LLM-based Proactive AR Social Assistive System with Human-like Perception for In-situ Live Interactions | Bufang Yang et.al. | 2412.04036 | null |
2024-12-03 | Explainable CTR Prediction via LLM Reasoning | Xiaohan Yu et.al. | 2412.02588 | null |
2024-12-02 | NYT-Connections: A Deceptively Simple Text Classification Task that Stumps System-1 Thinkers | Angel Yahir Loredo Lopez et.al. | 2412.01621 | null |
2025-01-13 | Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability | Zicheng Lin et.al. | 2411.19943 | null |
2024-11-29 | TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension | Zipeng Qiu et.al. | 2411.19504 | link |
2024-11-29 | COLD: Causal reasOning in cLosed Daily activities | Abhinav Joshi et.al. | 2411.19500 | link |
2024-11-25 | Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision | Zhiheng Xi et.al. | 2411.16579 | null |
2024-11-22 | On the Impact of Fine-Tuning on Chain-of-Thought Reasoning | Elita Lobo et.al. | 2411.15382 | null |
2024-11-21 | Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models | Yuhao Dong et.al. | 2411.14432 | link |
2024-11-15 | Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination | Haojie Zheng et.al. | 2411.12591 | link |
2024-12-23 | Enhancing Reasoning Capabilities of LLMs via Principled Synthetic Logic Corpus | Terufumi Morishita et.al. | 2411.12498 | link |
2024-11-18 | Semantic-Geometric-Physical-Driven Robot Manipulation Skill Transfer via Skill Library and Tactile Representation | Mingchao Qi et.al. | 2411.11714 | link |
2024-12-31 | Enhancing LLM Reasoning with Reward-guided Tree Search | Jinhao Jiang et.al. | 2411.11694 | null |
2024-12-15 | A dataset of questions on decision-theoretic reasoning in Newcomb-like problems | Caspar Oesterheld et.al. | 2411.10588 | link |
2024-11-14 | Comprehensive and Practical Evaluation of Retrieval-Augmented Generation Systems for Medical Question Answering | Nghia Trung Ngo et.al. | 2411.09213 | null |
2024-11-13 | Tree-of-Table: Unleashing the Power of LLMs for Enhanced Large-Scale Table Understanding | Deyi Ji et.al. | 2411.08516 | null |
2024-11-18 | What Do Learning Dynamics Reveal About Generalization in LLM Reasoning? | Katie Kang et.al. | 2411.07681 | link |
2024-11-27 | Self-Training Meets Consistency: Improving LLMs' Reasoning With Consistency-Driven Rationale Evaluation | Jaehyeok Lee et.al. | 2411.06387 | link |
2024-11-09 | A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization | Haoxin Liu et.al. | 2411.06018 | null |
2024-11-11 | LLMs as Method Actors: A Model for Prompt Engineering and Architecture | Colin Doyle et.al. | 2411.05778 | link |
2024-11-12 | Kwai-STaR: Transform LLMs into State-Transition Reasoners | Xingyu Lu et.al. | 2411.04799 | null |
2024-11-21 | Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding | Haolin Chen et.al. | 2411.04282 | link |
2024-11-05 | CrowdGenUI: Enhancing LLM-Based UI Widget Generation with a Crowdsourced Preference Library | Yimeng Liu et.al. | 2411.03477 | null |
2025-01-27 | MetRex: A Benchmark for Verilog Code Metric Reasoning Using LLMs | Manar Abdelatty et.al. | 2411.03471 | link |
2024-11-04 | RuAG: Learned-rule-augmented Generation for Large Language Models | Yudi Zhang et.al. | 2411.03349 | null |
2024-10-30 | Vision-Language Models Can Self-Improve Reasoning via Reflection | Kanzhi Cheng et.al. | 2411.00855 | null |
2024-11-01 | Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling | Yiwen Ding et.al. | 2411.00750 | link |
2024-11-01 | STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing | Jiaru Zou et.al. | 2411.00387 | null |
2024-11-08 | GRS-QA -- Graph Reasoning-Structured Question Answering Dataset | Anish Pahilajani et.al. | 2411.00369 | null |
2024-10-31 | Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | Jinghan Zhang et.al. | 2410.24155 | null |
2024-10-31 | RL-STaR: Theoretical Analysis of Reinforcement Learning Frameworks for Self-Taught Reasoner | Fu-Chieh Chang et.al. | 2410.23912 | null |
2024-10-31 | OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models | Junda Wu et.al. | 2410.23703 | null |
2024-10-30 | ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning | Millennium Bismay et.al. | 2410.23180 | link |
2024-10-30 | On Memorization of Large Language Models in Logical Reasoning | Chulin Xie et.al. | 2410.23123 | null |
2024-10-28 | Causal Interventions on Causal Paths: Mapping GPT-2's Reasoning From Syntax to Semantics | Isabelle Lee et.al. | 2410.21353 | null |
2024-10-28 | Guide-LLM: An Embodied LLM Agent and Text-Based Topological Map for Robotic Guidance of People with Visual Impairments | Sangmim Song et.al. | 2410.20666 | null |
2024-10-25 | Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models | Danqing Wang et.al. | 2410.20007 | null |
2024-10-25 | Can Stories Help LLMs Reason? Curating Information Space Through Narrative | Vahid Sadiri Javadi et.al. | 2410.19221 | null |
2024-10-18 | Make LLMs better zero-shot reasoners: Structure-orientated autonomous reasoning | Pengfei He et.al. | 2410.19000 | link |
2024-10-25 | CLR-Bench: Evaluating Large Language Models in College-level Reasoning | Junnan Dong et.al. | 2410.17558 | null |
2024-10-28 | Non-myopic Generation of Language Models for Reasoning and Planning | Chang Ma et.al. | 2410.17195 | link |
2024-11-06 | Improving Causal Reasoning in Large Language Models: A Survey | Longxuan Yu et.al. | 2410.16676 | link |
2024-10-22 | A Statistical Analysis of LLMs' Self-Evaluation Using Proverbs | Ryosuke Sonoda et.al. | 2410.16640 | null |
2024-10-21 | Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic | Jason Chan et.al. | 2410.16502 | null |
2024-11-27 | On Designing Effective RL Reward at Training Time for LLM Reasoning | Jiaxuan Gao et.al. | 2410.15115 | null |
2025-01-28 | Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning | Xingyu Tan et.al. | 2410.14211 | null |
2024-10-21 | Unconstrained Model Merging for Enhanced LLM Reasoning | Yiming Zhang et.al. | 2410.13699 | null |
2024-10-16 | Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models | Linhao Luo et.al. | 2410.13080 | link |
2024-10-16 | KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs | Yongqin Xu et.al. | 2410.12480 | null |
2024-10-17 | Enhancing LLM Trading Performance with Fact-Subjectivity Aware Reasoning | Qian Wang et.al. | 2410.12464 | null |
2024-10-16 | Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up | Jiahao Yuan et.al. | 2410.12323 | link |
2024-10-16 | Exploiting LLMs' Reasoning Capability to Infer Implicit Concepts in Legal Information Retrieval | Hai-Long Nguyen et.al. | 2410.12154 | null |
2024-10-15 | Planning Anything with Rigor: General-Purpose Zero-Shot Planning with LLM-based Formalized Programming | Yilun Hao et.al. | 2410.12112 | null |
2024-10-12 | OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models | Jun Wang et.al. | 2410.09671 | null |
2024-10-11 | P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains | Simeng Han et.al. | 2410.09207 | null |
2024-10-11 | Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning | Yunpeng Gao et.al. | 2410.08500 | null |
2024-10-10 | SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | Hang Yin et.al. | 2410.08189 | null |
2024-10-10 | Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning | Amrith Setlur et.al. | 2410.08146 | null |
2024-10-10 | Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Zirui Zhao et.al. | 2410.07627 | null |
2024-10-09 | Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis | Ahmed Abdullah et.al. | 2410.06841 | null |
2024-10-09 | Towards Self-Improvement of LLMs via MCTS: Leveraging Stepwise Knowledge with Curriculum Preference Learning | Xiyao Wang et.al. | 2410.06508 | null |
2025-01-02 | Filtering Discomforting Recommendations with Large Language Models | Jiahao Liu et.al. | 2410.05411 | null |
2024-10-05 | Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification | Zhenwen Liang et.al. | 2410.05318 | null |
2024-10-06 | Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval | Pengcheng Jiang et.al. | 2410.04585 | link |
2024-10-03 | The Role of Deductive and Inductive Reasoning in Large Language Models | Chengkun Cai et.al. | 2410.02892 | null |
2024-10-02 | Not All LLM Reasoners Are Created Equal | Arian Hosseini et.al. | 2410.01748 | null |
2024-12-25 | Interpretable Contrastive Monte Carlo Tree Search Reasoning | Zitian Gao et.al. | 2410.01707 | link |
2024-10-02 | VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment | Amirhossein Kazemnejad et.al. | 2410.01679 | link |
2024-10-02 | AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses | Xiaotian Lu et.al. | 2410.01246 | null |
2024-10-01 | Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness | Xiao Peng et.al. | 2410.00359 | null |
2024-10-01 | Insight: A Multi-Modal Diagnostic Pipeline using LLMs for Ocular Surface Disease Diagnosis | Chun-Hsiao Yeh et.al. | 2410.00292 | null |
2024-10-08 | GUNDAM: Aligning Large Language Models with Graph Understanding | Sheng Ouyang et.al. | 2409.20053 | null |
2024-09-27 | Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs | Yanyuan Qiao et.al. | 2409.18794 | null |
2024-10-23 | Proof of Thought : Neurosymbolic Program Synthesis allows Robust and Interpretable Reasoning | Debargha Ganguly et.al. | 2409.17270 | null |
2024-09-20 | CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Casual Significance and Consistency | Kangsheng Wang et.al. | 2409.17174 | null |
2024-09-20 | Mufu: Multilingual Fused Learning for Low-Resource Translation with LLM | Zheng Wei Lim et.al. | 2409.13949 | null |
2024-09-19 | SituationAdapt: Contextual UI Optimization in Mixed Reality with Situation Awareness via LLM Reasoning | Zhipeng Li et.al. | 2409.12836 | null |
2024-10-04 | Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning | Jiaxin Wen et.al. | 2409.12452 | link |
2024-12-16 | Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Jiaming Zhou et.al. | 2409.12437 | link |
2024-09-18 | MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning | Justin Chih-Yao Chen et.al. | 2409.12147 | link |
2024-11-05 | Improving LLM Reasoning with Multi-Agent Tree-of-Thought Validator Agent | Fatemeh Haji et.al. | 2409.11527 | link |
2024-09-16 | Enhancing RL Safety with Counterfactual LLM Reasoning | Dennis Gross et.al. | 2409.10188 | link |
2024-09-11 | Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation | SeongYeub Chu et.al. | 2409.07355 | link |

Publish Date | Title | Authors | arXiv ID | Code |
---|---|---|---|---|
2025-01-30 | Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation | Muhammed Yusuf Kocyigit et.al. | 2501.18771 | null |
2025-01-31 | ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation | Minghua He et.al. | 2501.18460 | null |
2025-01-25 | LLM Evaluation Based on Aerospace Manufacturing Expertise: Automated Generation and Multi-Model Question Answering | Beiming Liu et.al. | 2501.17183 | null |
2025-01-28 | An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Koji Inoue et.al. | 2501.16643 | null |
2025-01-26 | HardML: A Benchmark For Evaluating Data Science And Machine Learning knowledge and reasoning in AI | Tidor-Vlad Pricope et.al. | 2501.15627 | null |
2025-01-23 | Question Answering on Patient Medical Records with Private Fine-Tuned LLMs | Sara Kothari et.al. | 2501.13687 | null |
2025-01-10 | CodEv: An Automated Grading Framework Leveraging Large Language Models for Consistent and Constructive Feedback | En-Qi Tseng et.al. | 2501.10421 | null |
2025-01-15 | Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History | Yevhen Kostiuk et.al. | 2501.09154 | null |
2025-01-13 | Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles | Samia Touileb et.al. | 2501.07718 | null |
2025-01-03 | FLAME: Financial Large-Language Model Assessment and Metrics Evaluation | Jiayu Guo et.al. | 2501.06211 | link |
2025-01-07 | MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems | Yannis Katsis et.al. | 2501.03468 | link |
2025-01-05 | Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm | Ljubisa Bojic et.al. | 2501.02532 | null |
2025-01-04 | LLMzSzŁ: a comprehensive LLM benchmark for Polish | Krzysztof Jassem et.al. | 2501.02266 | null |
2025-01-08 | VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM | Yuqian Yuan et.al. | 2501.00599 | link |
2025-01-04 | Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation | M. Ali Bayram et.al. | 2501.00593 | null |
2024-12-31 | Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs | Weijia Xu et.al. | 2501.00273 | null |
2024-12-30 | EVOLVE: Emotion and Visual Output Learning via LLM Evaluation | Jordan Sinclair et.al. | 2412.20632 | null |
2024-12-24 | Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles | Zihan Wang et.al. | 2412.18416 | null |
2024-12-24 | A Statistical Framework for Ranking LLM-Based Chatbots | Siavash Ameli et.al. | 2412.18407 | link |
2025-01-25 | DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation | Junyi Lu et.al. | 2412.18291 | null |
2024-12-23 | CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models | Ruibo Tu et.al. | 2412.17970 | link |
2025-01-02 | Baichuan4-Finance Technical Report | Hanyu Zhang et.al. | 2412.15270 | null |
2024-12-19 | ObjVariantEnsemble: Advancing Point Cloud LLM Evaluation in Challenging Scenes with Subtly Distinguished Objects | Qihang Cao et.al. | 2412.14837 | null |
2024-12-18 | AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge | Xiaobao Wu et.al. | 2412.13670 | link |
2024-12-18 | Mind Your Theory: Theory of Mind Goes Deeper Than Reasoning | Eitan Wagner et.al. | 2412.13631 | null |
2024-12-17 | OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain | Shuting Wang et.al. | 2412.13018 | link |
2024-12-10 | How to Choose a Threshold for an Evaluation Metric for Large Language Models | Bhaskarjit Sarmah et.al. | 2412.12148 | null |
2024-12-15 | Dual Traits in Probabilistic Reasoning of Large Language Models | Shenxiong Li et.al. | 2412.11009 | link |
2024-12-30 | LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation | Eunsu Kim et.al. | 2412.10424 | null |
2024-12-13 | Cultural Evolution of Cooperation among LLM Agents | Aron Vallinder et.al. | 2412.10270 | null |
2024-12-12 | Towards Understanding the Robustness of LLM-based Evaluations under Perturbations | Manav Chaudhary et.al. | 2412.09269 | null |
2024-12-10 | BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities | Sahal Shaji Mullappilly et.al. | 2412.07769 | link |
2024-12-12 | PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models | Qian Zhang et.al. | 2412.06287 | link |
2024-12-02 | AI Benchmarks and Datasets for LLM Evaluation | Todor Ivanov et.al. | 2412.01020 | null |
2024-11-30 | Evaluating the Consistency of LLM Evaluators | Noah Lee et.al. | 2412.00543 | null |
2024-11-29 | MIMDE: Exploring the Use of Synthetic vs Human Data for Evaluating Multi-Insight Multi-Document Extraction Tasks | John Francis et.al. | 2411.19689 | null |
2024-11-29 | Beyond Surface Structure: A Causal Assessment of LLMs' Comprehension Ability | Yujin Han et.al. | 2411.19456 | link |
2024-11-27 | Is my Meeting Summary Good? Estimating Quality with a Multi-LLM Evaluator | Frederic Kirstein et.al. | 2411.18444 | null |
2025-01-17 | CS-Eval: A Comprehensive Large Language Model Benchmark for CyberSecurity | Zhengmin Yu et.al. | 2411.16239 | link |
2024-11-25 | SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text | Reshmi Ghosh et.al. | 2411.16077 | null |
2024-11-26 | Do LLMs Agree on the Creativity Evaluation of Alternative Uses? | Abdullah Al Rabeyah et.al. | 2411.15560 | null |
2024-11-19 | Ranking Unraveled: Recipes for LLM Rankings in Head-to-Head AI Combat | Roland Daynauth et.al. | 2411.14483 | link |
2024-11-21 | Lost in Inference: Rediscovering the Role of Natural Language Inference for Large Language Models | Lovish Madaan et.al. | 2411.14103 | null |
2024-11-21 | An Evaluation-Driven Approach to Designing LLM Agents: Process and Architecture | Boming Xia et.al. | 2411.13768 | null |
2024-11-21 | A Framework for Evaluating LLMs Under Task Indeterminacy | Luke Guerdan et.al. | 2411.13760 | null |
2024-11-12 | Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning | Linyang He et.al. | 2411.07533 | null |
2024-11-13 | Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models | Yancheng He et.al. | 2411.07140 | null |
2024-11-09 | Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models | Xiaojun Wu et.al. | 2411.06272 | link |
2024-11-16 | ProverbEval: Exploring LLM Evaluation Challenges for Low-resource Language Understanding | Israel Abebe Azime et.al. | 2411.05049 | null |
2024-11-07 | Bayesian Calibration of Win Rate Estimation with LLM Evaluators | Yicheng Gao et.al. | 2411.04424 | link |
2024-11-05 | Enhancing LLM Evaluations: The Garbling Trick | William F. Bradley et.al. | 2411.01533 | null |
2025-01-31 | Mastering the Craft of Data Synthesis for CodeLLMs | Meng Chen et.al. | 2411.00005 | null |
2024-10-28 | Project MPG: towards a generalized performance benchmark for LLM capabilities | Lucas Spangher et.al. | 2410.22368 | null |
2024-10-29 | Self-Preference Bias in LLM-as-a-Judge | Koki Wataoka et.al. | 2410.21819 | null |
2024-10-28 | Unveiling Context-Aware Criteria in Self-Assessing LLMs | Taneesh Gupta et.al. | 2410.21545 | null |
2024-10-27 | LoRA Done RITE: Robust Invariant Transformation Equilibration for LoRA Optimization | Jui-Nan Yen et.al. | 2410.20625 | null |
2024-10-26 | Limitations of the LLM-as-a-Judge Approach for Evaluating LLM Outputs in Expert Knowledge Tasks | Annalisa Szymanski et.al. | 2410.20266 | null |
2024-10-23 | MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning | Jingfan Zhang et.al. | 2410.18035 | null |
2025-01-30 | Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements | Isamu Isozaki et.al. | 2410.17141 | link |
2024-10-21 | CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution | Maosong Cao et.al. | 2410.16256 | link |
2025-01-26 | mHumanEval -- A Multilingual Benchmark to Evaluate Large Language Models for Code Generation | Nishat Raihan et.al. | 2410.15037 | link |
2024-10-19 | CAP: Data Contamination Detection via Consistency Amplification | Yi Zhao et.al. | 2410.15005 | null |
2024-10-18 | Enabling Scalable Evaluation of Bias Patterns in Medical LLMs | Hamed Fayyaz et.al. | 2410.14763 | link |
2024-11-06 | Diverging Preferences: When do Annotators Disagree and do Models Know? | Michael JQ Zhang et.al. | 2410.14632 | null |
2024-10-18 | Combining Entropy and Matrix Nuclear Norm for Enhanced Evaluation of Language Models | James Vo et.al. | 2410.14480 | null |
2024-10-21 | BenTo: Benchmark Task Reduction with In-Context Transferability | Hongyu Zhao et.al. | 2410.13804 | link |
2024-10-16 | BenchmarkCards: Large Language Model and Risk Reporting | Anna Sokol et.al. | 2410.12974 | null |
2024-12-29 | Language Model Preference Evaluation with Multiple Weak Evaluators | Zhengyu Hu et.al. | 2410.12869 | link |
2024-10-11 | Enterprise Benchmarks for Large Language Model Evaluation | Bing Zhang et.al. | 2410.12857 | link |
2024-10-16 | An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation | Junjie Chen et.al. | 2410.12265 | null |
2024-10-15 | Leaving the barn door open for Clever Hans: Simple features predict LLM benchmark answers | Lorenzo Pacchiardi et.al. | 2410.11672 | link |
2024-10-15 | Black-box Uncertainty Quantification Method for LLM-as-a-Judge | Nico Wagner et.al. | 2410.11594 | null |
2024-10-14 | Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting | Yifan Luo et.al. | 2410.10150 | null |
2024-12-13 | HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Jingxuan Fan et.al. | 2410.09988 | link |
2024-10-15 | LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models | Han Qiu et.al. | 2410.09962 | link |
2024-10-17 | Towards Multilingual LLM Evaluation for European Languages | Klaudia Thellmann et.al. | 2410.08928 | null |
2024-10-11 | Test-driven Software Experimentation with LASSO: an LLM Benchmarking Example | Marcus Kessel et.al. | 2410.08911 | null |
2024-10-10 | Assessing Episodic Memory in LLMs with Sequence Order Recall Tasks | Mathis Pink et.al. | 2410.08133 | null |
2024-10-10 | COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act | Philipp Guldimann et.al. | 2410.07959 | null |
2024-11-06 | News Reporter: A Multi-lingual LLM Framework for Broadcast T.V News | Tarun Jain et.al. | 2410.07520 | null |
2024-10-09 | Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates | Xiaosen Zheng et.al. | 2410.07137 | link |
2024-10-09 | ReIFE: Re-evaluating Instruction-Following Evaluation | Yixin Liu et.al. | 2410.07069 | link |
2024-10-08 | Active Evaluation Acquisition for Efficient LLM Benchmarking | Yang Li et.al. | 2410.05952 | null |
2024-10-07 | TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles | Qingchen Yu et.al. | 2410.05262 | link |
2024-10-01 | Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model | Aidan Gilson et.al. | 2410.03740 | null |
2024-10-04 | TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation | Jonathan Cook et.al. | 2410.03608 | null |
2024-10-04 | Towards Reproducible LLM Evaluation: Quantifying Uncertainty in LLM Benchmark Scores | Robert E. Blackwell et.al. | 2410.03492 | null |
2024-10-29 | AIME: AI System Optimization via Multiple LLM Evaluators | Bhrij Patel et.al. | 2410.03131 | null |
2024-10-02 | Comparing Criteria Development Across Domain Experts, Lay Users, and Models in Large Language Model Evaluation | Annalisa Szymanski et.al. | 2410.02054 | null |
2024-10-02 | Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models | Joseph Lee et.al. | 2410.01795 | link |
2024-10-03 | Extending Context Window of Large Language Models from a Distributional Perspective | Yingsheng Wu et.al. | 2410.01490 | null |
2024-10-02 | ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | Yifan Qiao et.al. | 2410.01228 | null |
2024-10-01 | ViDAS: Vision-based Danger Assessment and Scoring | Pranav Gupta et.al. | 2410.00477 | null |
2024-10-01 | PclGPT: A Large Language Model for Patronizing and Condescending Language Detection | Hongbo Wang et.al. | 2410.00361 | link |
2024-11-26 | LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models | Haitao Li et.al. | 2409.20288 | link |
2024-09-29 | Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems | Xuyang Wu et.al. | 2409.19804 | null |
2024-10-19 | Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models | Xin Li et.al. | 2409.19667 | link |
2024-10-05 | IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation | Fan Lin et.al. | 2409.18892 | link |
2024-12-13 | A Character-Centric Creative Story Generation via Imagination | Kyeongman Park et.al. | 2409.16667 | null |
2024-09-25 | Judgment of Thoughts: Courtroom of the Binary Logical Reasoning in Large Language Models | Sungjune Park et.al. | 2409.16635 | null |
2024-12-18 | Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino | Jann Railey Montalan et.al. | 2409.15380 | link |
2024-12-16 | MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators | Qingyu Lu et.al. | 2409.14335 | link |
2024-09-21 | ChemEval: A Comprehensive Multi-Level Chemical Evaluation for Large Language Models | Yuqing Huang et.al. | 2409.13989 | link |
2024-12-17 | AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs | Basel Mousi et.al. | 2409.11404 | null |
2024-10-02 | LLM-as-a-Judge & Reward Model: What They Can and Cannot Do | Guijin Son et.al. | 2409.11239 | null |
2024-12-08 | Towards Data Contamination Detection for Modern Large Language Models: Limitations, Inconsistencies, and Oracle Challenges | Vinay Samuel et.al. | 2409.09927 | link |
2024-09-13 | Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia | Fajri Koto et.al. | 2409.08564 | null |
2024-09-09 | Assessing SPARQL capabilities of Large Language Models | Lars-Peter Meyer et.al. | 2409.05925 | link |
2024-10-08 | LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs | Yuhao Wu et.al. | 2409.02076 | link |
2024-10-14 | Polyrating: A Cost-Effective and Bias-Aware Rating System for LLM Evaluation | Jasper Dekoninck et.al. | 2409.00696 | null |
2024-08-26 | Evaluating ChatGPT on Nuclear Domain-Specific Data | Muhammad Anwar et.al. | 2409.00090 | null |
2024-08-28 | LLMSecCode: Evaluating Large Language Models for Secure Coding | Anton Rydén et.al. | 2408.16100 | link |
2024-08-26 | LLM-3D Print: Large Language Models To Monitor and Control 3D Printing | Yayati Jadhav et.al. | 2408.14307 | null |
2024-08-26 | Epidemic Information Extraction for Event-Based Surveillance using Large Language Models | Sergio Consoli et.al. | 2408.14277 | null |
2024-10-04 | MobileQuant: Mobile-friendly Quantization for On-device Language Models | Fuwen Tan et.al. | 2408.13933 | link |
2024-08-23 | LalaEval: A Holistic Human Evaluation Framework for Domain-Specific Large Language Models | Chongyan Sun et.al. | 2408.13338 | null |
2024-08-23 | Open Llama2 Model for the Lithuanian Language | Artūras Nakvosas et.al. | 2408.12963 | null |
2024-08-23 | LIMP: Large Language Model Enhanced Intent-aware Mobility Prediction | Songwei Li et.al. | 2408.12832 | link |
2024-12-20 | Recording for Eyes, Not Echoing to Ears: Contextualized Spoken-to-Written Conversion of ASR Transcripts | Jiaqing Liu et.al. | 2408.09688 | null |
2024-08-20 | Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge | Ravi Raju et.al. | 2408.08808 | null |
2024-10-16 | The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation | Samee Arif et.al. | 2408.08688 | link |
2024-10-19 | Persona is a Double-edged Sword: Mitigating the Negative Impact of Role-playing Prompts in Zero-shot Reasoning Tasks | Junseok Kim et.al. | 2408.08631 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-31 | Vintix: Action Model via In-Context Reinforcement Learning | Andrey Polubarov et.al. | 2501.19400 | link |
2025-01-31 | Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game | Mustafa O. Karabag et.al. | 2501.19398 | null |
2025-01-31 | Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models | Alina Shutova et.al. | 2501.19392 | null |
2025-01-31 | Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models | Wenzhi Fang et.al. | 2501.19389 | null |
2025-01-31 | SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | Dominik Wagner et.al. | 2501.19377 | null |
2025-01-31 | Beyond Fixed Horizons: A Theoretical Framework for Adaptive Denoising Diffusions | Sören Christensen et.al. | 2501.19373 | null |
2025-01-31 | We're Different, We're the Same: Creative Homogeneity Across LLMs | Emily Wenger et.al. | 2501.19361 | null |
2025-01-31 | Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies | Brandon P. Chelstrom et.al. | 2501.19359 | null |
2025-01-31 | The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking | Yuchun Miao et.al. | 2501.19358 | null |
2025-01-31 | Addressing the correlation of Stokes-shifted photons emitted from two quantum emitters | Adrián Juan-Delgado et.al. | 2501.19356 | null |
2025-01-31 | Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 | Ting-Yao E. Hsu et.al. | 2501.19353 | null |
2025-01-31 | Towards Adaptive Self-Improvement for Smarter Energy Systems | Alexander Sommer et.al. | 2501.19340 | null |
2025-01-31 | PixelWorld: Towards Perceiving Everything as Pixels | Zhiheng Lyu et.al. | 2501.19339 | null |
2025-01-31 | Homogeneity Bias as Differential Sampling Uncertainty in Language Models | Messi H. J. Lee et.al. | 2501.19337 | null |
2025-01-31 | Reward-Guided Speculative Decoding for Efficient LLM Reasoning | Baohao Liao et.al. | 2501.19324 | null |
2025-01-31 | MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems | Anirudh Chari et.al. | 2501.19318 | null |
2025-01-31 | LLM-based Affective Text Generation Quality Based on Different Quantization Values | Yarik Menchaca Resendiz et.al. | 2501.19317 | null |
2025-01-31 | Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment | Gregor Bachmann et.al. | 2501.19309 | null |
2025-01-31 | SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling | Jiefeng Chen et.al. | 2501.19306 | null |
2025-01-31 | Beyond checkmate: exploring the creative chokepoints in AI text | Nafis Irtiza Tripto et.al. | 2501.19301 | link |
2025-01-31 | Offline Learning for Combinatorial Multi-armed Bandits | Xutong Liu et.al. | 2501.19300 | null |
2025-01-31 | Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes | Zhiyao Xu et.al. | 2501.19298 | null |
2025-01-31 | Analysis of LLMs vs Human Experts in Requirements Engineering | Cory Hymel et.al. | 2501.19297 | null |
2025-01-31 | Low-Cost and Comprehensive Non-textual Input Fuzzing with LLM-Synthesized Input Generators | Kunpeng Zhang et.al. | 2501.19282 | null |
2025-01-31 | Pheromone-based Learning of Optimal Reasoning Paths | Anirudh Chari et.al. | 2501.19278 | null |
2025-01-31 | From Assistance to Autonomy -- A Researcher Study on the Potential of AI Support for Qualitative Data Analysis | Elisabeth Kirsten et.al. | 2501.19275 | null |
2025-01-31 | Jackpot! Alignment as a Maximal Lottery | Roberto-Rafael Maura-Rivero et.al. | 2501.19266 | null |
2025-01-31 | Neuro-LIFT: A Neuromorphic, LLM-based Interactive Framework for Autonomous Drone FlighT at the Edge | Amogh Joshi et.al. | 2501.19259 | null |
2025-01-31 | A Zero-Shot Generalization Framework for LLM-Driven Cross-Domain Sequential Recommendation | Yunzhe Li et.al. | 2501.19232 | null |
2025-01-31 | Autonomous Legacy Web Application Upgrades Using a Multi-Agent System | Valtteri Ala-Salmi et.al. | 2501.19204 | null |
2025-01-31 | Improving the Robustness of Representation Misdirection for Large Language Model Unlearning | Dang Huu-Tien et.al. | 2501.19202 | null |
2025-01-31 | Efficient Reasoning with Hidden Thinking | Xuan Shen et.al. | 2501.19201 | link |
2025-01-31 | Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning | Xianglin Yang et.al. | 2501.19180 | null |
2025-01-31 | No Foundations without Foundations -- Why semi-mechanistic models are essential for regulatory biology | Luka Kovačević et.al. | 2501.19178 | null |
2025-01-31 | Position: Contextual Integrity Washing for Language Models | Yan Shvartzshnaider et.al. | 2501.19173 | null |
2025-01-31 | Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs | Kejia Zhang et.al. | 2501.19164 | null |
2025-01-31 | A theoretical framework for overfitting in energy-based modeling | Giovanni Catania et.al. | 2501.19158 | null |
2025-01-31 | A Tensor-Train Decomposition based Compression of LLMs on Group Vector Systolic Accelerator | Sixiao Huang et.al. | 2501.19135 | null |
2025-01-31 | Unraveling Zeroth-Order Optimization through the Lens of Low-Dimensional Structured Perturbations | Sihwan Park et.al. | 2501.19099 | null |
2025-01-31 | Ambient Denoising Diffusion Generative Adversarial Networks for Establishing Stochastic Object Models from Noisy Image Data | Xichen Xu et.al. | 2501.19094 | null |
2025-01-31 | Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models | Jialin Zhao et.al. | 2501.19090 | null |
2025-01-31 | Fairness Analysis of CLIP-Based Foundation Models for X-Ray Image Classification | Xiangyu Sun et.al. | 2501.19086 | null |
2025-01-31 | Enhancing Code Generation for Low-Resource Languages: No Silver Bullet | Alessandro Giagnorio et.al. | 2501.19085 | null |
2025-01-31 | Concept Steerers: Leveraging K-Sparse Autoencoders for Controllable Generations | Dahye Kim et.al. | 2501.19066 | link |
2025-01-31 | TeZO: Empowering the Low-Rankness on the Temporal Dimension in the Zeroth-Order Optimization for Fine-tuning LLMs | Yan Sun et.al. | 2501.19057 | null |
2025-01-31 | Enabling Autonomic Microservice Management through Self-Learning Agents | Fenglin Yu et.al. | 2501.19056 | null |
2025-01-31 | Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models | Ruiyu Wang et.al. | 2501.19054 | null |
2025-01-31 | Swarm-Gen: Fast Generation of Diverse Feasible Swarm Behaviors | Simon Idoko et.al. | 2501.19042 | link |
2025-01-31 | Towards the Worst-case Robustness of Large Language Models | Huanran Chen et.al. | 2501.19040 | null |
2025-01-31 | Beyond Token Compression: A Training-Free Reduction Framework for Efficient Visual Processing in MLLMs | Hongliang Li et.al. | 2501.19036 | null |
2025-01-31 | XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses | Bo Lan et.al. | 2501.19034 | link |
2025-01-31 | Multilayer Networks in Neuroimaging | Vesna Vuksanovic et.al. | 2501.19024 | null |
2025-01-31 | Calling a Spade a Heart: Gaslighting Multimodal Large Language Models via Negation | Bin Zhu et.al. | 2501.19017 | null |
2025-01-31 | Importing Phantoms: Measuring LLM Package Hallucination Vulnerabilities | Arjun Krishna et.al. | 2501.19012 | null |
2025-01-31 | Visual Autoregressive Modeling for Image Super-Resolution | Yunpeng Qu et.al. | 2501.18993 | null |
2025-01-31 | Symmetric Pruning of Large Language Models | Kai Yi et.al. | 2501.18980 | null |
2025-01-31 | BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics | Yuxuan Liu et.al. | 2501.18972 | null |
2025-01-31 | Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Boostrapping | Pu Yang et.al. | 2501.18962 | null |
2025-01-31 | Intrinsic Tensor Field Propagation in Large Language Models: A Novel Approach to Contextual Information Flow | Alfred Bexley et.al. | 2501.18957 | null |
2025-01-31 | LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models | Shenghao Fu et.al. | 2501.18954 | link |
2025-01-31 | TabFSBench: Tabular Benchmark for Feature Shifts in Open Environment | Zi-Jian Cheng et.al. | 2501.18935 | link |
2025-01-31 | Language Games as the Pathway to Artificial Superhuman Intelligence | Ying Wen et.al. | 2501.18924 | null |
2025-01-31 | KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search | Haoran Luo et.al. | 2501.18922 | link |
2025-01-31 | LLM Program Optimization via Retrieval Augmented Search | Sagnik Anupam et.al. | 2501.18916 | null |
2025-01-31 | Scaling Laws for Differentially Private Language Models | Ryan McKenna et.al. | 2501.18914 | null |
2025-01-31 | Streamlining Security Vulnerability Triage with Large Language Models | Mohammad Jalili Torkamani et.al. | 2501.18908 | null |
2025-01-31 | Trustworthy Evaluation of Generative AI Models | Zijun Gao et.al. | 2501.18897 | null |
2025-01-31 | Can We Predict the Effect of Prompts? | Jae Yong Lee et.al. | 2501.18883 | null |
2025-01-31 | Adaptivity and Convergence of Probability Flow ODEs in Diffusion Generative Models | Jiaqi Tang et.al. | 2501.18863 | null |
2025-01-31 | BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning | Han Zhong et.al. | 2501.18858 | null |
2025-01-31 | Equivariant Hypergraph Diffusion for Crystal Structure Prediction | Yang Liu et.al. | 2501.18850 | null |
2025-01-31 | Text Data Augmentation for Large Language Models: A Comprehensive Survey of Methods, Challenges, and Opportunities | Yaping Chai et.al. | 2501.18845 | null |
2025-01-31 | Trading Inference-Time Compute for Adversarial Robustness | Wojciech Zaremba et.al. | 2501.18841 | null |
2025-01-31 | Partially Rewriting a Transformer in Natural Language | Gonçalo Paulo et.al. | 2501.18838 | null |
2025-01-31 | Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming | Mrinank Sharma et.al. | 2501.18837 | null |
2025-01-31 | Pitfalls of defacing whole-head MRI: re-identification risk with diffusion models and compromised research potential | Chenyu Gao et.al. | 2501.18834 | null |
2025-01-31 | Structural Embedding Projection for Contextual Large Language Model Inference | Vincent Enoasmo et.al. | 2501.18826 | null |
2025-01-31 | Bridging the Reasoning Gap: Small LLMs Can Plan with Generalised Strategies | Andrey Borro et.al. | 2501.18817 | link |
2025-01-31 | Large Language Models as Common-Sense Heuristics | Andrey Borro et.al. | 2501.18816 | null |
2025-01-30 | Compositional Generalization Requires More Than Disentangled Representations | Qiyao Liang et.al. | 2501.18797 | null |
2025-01-30 | Rope to Nope and Back Again: A New Hybrid Attention Strategy | Bowen Yang et.al. | 2501.18795 | null |
2025-01-30 | Survey and Improvement Strategies for Gene Prioritization with Large Language Models | Matthew Neeley et.al. | 2501.18794 | null |
2025-01-30 | LLM-Generated Heuristics for AI Planning: Do We Even Need Domain-Independence Anymore? | Alexander Tuisov et.al. | 2501.18784 | null |
2025-01-30 | Navigating the Fragrance space Via Graph Generative Models And Predicting Odors | Mrityunjay Sharma et.al. | 2501.18777 | link |
2025-01-30 | Probabilistic Joint Recovery Method for CO | Zijun Deng et.al. | 2501.18761 | null |
2025-01-30 | Synthetic Data Generation for Augmenting Small Samples | Dan Liu et.al. | 2501.18741 | null |
2025-01-30 | Examining the Robustness of Large Language Models across Language Complexity | Jiayi Zhang et.al. | 2501.18738 | null |
2025-01-30 | Exploring Audio Editing Features as User-Centric Privacy Defenses Against Emotion Inference Attacks | Mohd. Farhan Israk Soumik et.al. | 2501.18727 | null |
2025-01-30 | Strong and Controllable 3D Motion Generation | Canxuan Gang et.al. | 2501.18726 | null |
2025-01-30 | Zero-shot Large Language Models for Long Clinical Text Summarization with Temporal Reasoning | Maya Kruse et.al. | 2501.18724 | null |
2025-01-30 | Invisible Traces: Using Hybrid Fingerprinting to identify underlying LLMs in GenAI Apps | Devansh Bhardwaj et.al. | 2501.18712 | null |
2025-01-30 | Regularized second-order optimization of tensor-network Born machines | Matan Ben-Dov et.al. | 2501.18691 | null |
2025-01-30 | Drag Your Gaussian: Effective Drag-Based Editing with Score Distillation for 3D Gaussian Splatting | Yansong Qu et.al. | 2501.18672 | null |
2025-01-30 | Foundational Models for 3D Point Clouds: A Survey and Outlook | Vishal Thengane et.al. | 2501.18594 | null |
2025-01-30 | Diffusion Autoencoders are Scalable Image Tokenizers | Yinbo Chen et.al. | 2501.18593 | null |
2025-01-30 | Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Hao Dong et.al. | 2501.18592 | link |
2025-01-30 | Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs | Yue Wang et.al. | 2501.18585 | null |
2025-01-30 | Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH | Evgenii Evstafev et.al. | 2501.18576 | null |
2025-01-30 | BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos | Lehao Lin et.al. | 2501.18565 | null |
2025-01-30 | SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation | Haoquan Fang et.al. | 2501.18564 | null |
2025-01-30 | Semantic Web and Creative AI -- A Technical Report from ISWS 2023 | Raia Abu Ahmad et.al. | 2501.18542 | null |
2025-01-30 | Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges | Manveer Singh Tamber et.al. | 2501.18536 | link |
2025-01-30 | Differentially Private Steering for Large Language Model Alignment | Anmol Goel et.al. | 2501.18532 | link |
2025-01-30 | Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models | Guanqun Cao et.al. | 2501.18516 | null |
2025-01-30 | Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch | Arthur Douillard et.al. | 2501.18512 | null |
2025-01-30 | WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training | Benjamin Feuer et.al. | 2501.18511 | link |
2025-01-30 | CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction | Peter J. Bentley et.al. | 2501.18504 | null |
2025-01-30 | Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline | Shivani Kapania et.al. | 2501.18493 | null |
2025-01-30 | A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models | Changshu Liu et.al. | 2501.18482 | null |
2025-01-30 | CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization | Yanxia Deng et.al. | 2501.18475 | null |
2025-01-30 | Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations | Chengxi Zeng et.al. | 2501.18474 | null |
2025-01-30 | ExeCoder: Empowering Large Language Models with Executability Representation for Code Translation | Minghua He et.al. | 2501.18460 | null |
2025-01-30 | CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering | Yumeng Wang et.al. | 2501.18457 | null |
2025-01-30 | GENIE: Generative Note Information Extraction model for structuring EHR data | Huaiyuan Ying et.al. | 2501.18435 | null |
2025-01-30 | Exploring Potential Prompt Injection Attacks in Federated Military LLMs and Their Mitigation | Youngjoon Lee et.al. | 2501.18416 | null |
2025-01-30 | RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects | Yiteng Tu et.al. | 2501.18365 | link |
2025-01-30 | A Video-grounded Dialogue Dataset and Metric for Event-driven Activities | Wiradee Imrattanatrai et.al. | 2501.18324 | link |
2025-01-30 | Leveraging LLM Agents for Automated Optimization Modeling for SASP Problems: A Graph-RAG based Approach | Tianpeng Pan et.al. | 2501.18320 | null |
2025-01-30 | Mining for Species, Locations, Habitats, and Ecosystems from Scientific Papers in Invasion Biology: A Large-Scale Exploratory Study with Large Language Models | Jennifer D'Souza et.al. | 2501.18287 | null |
2025-01-30 | Jailbreaking LLMs' Safeguard with Universal Magic Words for Text Embedding Models | Haoyu Liang et.al. | 2501.18280 | null |
2025-01-30 | Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence | Kevin Roitero et.al. | 2501.18265 | null |
2025-01-30 | How to Select Datapoints for Efficient Human Evaluation of NLG Models? | Vilém Zouhar et.al. | 2501.18251 | link |
2025-01-30 | Statistical multi-metric evaluation and visualization of LLM system predictive performance | Samuel Ackerman et.al. | 2501.18243 | null |
2025-01-30 | Contextually Structured Token Dependency Encoding for Large Language Models | James Blades et.al. | 2501.18205 | null |
2025-01-30 | Economic Rationality under Specialization: Evidence of Decision Bias in AI Agents | ShuiDe Wen et.al. | 2501.18190 | null |
2025-01-30 | Investigating Tax Evasion Emergence Using Dual Large Language Model and Deep Reinforcement Learning Powered Agent-based Simulation | Teddy Lazebnik et.al. | 2501.18177 | null |
2025-01-30 | Continually Evolved Multimodal Foundation Models for Cancer Prognosis | Jie Peng et.al. | 2501.18170 | null |
2025-01-30 | RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing | Jinyao Guo et.al. | 2501.18160 | null |
2025-01-30 | Large Language Models for Cryptocurrency Transaction Analysis: A Bitcoin Case Study | Yuchen Lei et.al. | 2501.18158 | null |
2025-01-30 | Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models | Wanlong Liu et.al. | 2501.18154 | null |
2025-01-30 | Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models | Qika Lin et.al. | 2501.18119 | null |
2025-01-30 | Scaling Inference-Efficient Language Models | Song Bian et.al. | 2501.18107 | null |
2025-01-30 | Panacea: Mitigating Harmful Fine-tuning for Large Language Models via Post-fine-tuning Perturbation | Yibo Wang et.al. | 2501.18100 | link |
2025-01-30 | AlphaAdam: Asynchronous Masked Optimization with Dynamic Alpha for Selective Updates | Da Chang et.al. | 2501.18094 | null |
2025-01-30 | Normative Evaluation of Large Language Models with Everyday Moral Dilemmas | Pratik S. Sachdeva et.al. | 2501.18081 | null |
2025-01-30 | FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models | Spencer Mateega et.al. | 2501.18062 | null |
2025-01-29 | RL-based Query Rewriting with Distilled LLM for online E-Commerce Systems | Duy A. Nguyen et.al. | 2501.18056 | null |
2025-01-29 | Current Pathology Foundation Models are unrobust to Medical Center Differences | Edwin D. de Jong et.al. | 2501.18055 | null |
2025-01-29 | A Proximal Operator for Inducing 2:4-Sparsity | Jonas M Kübler et.al. | 2501.18015 | null |
2025-01-29 | Large Language Models Think Too Fast To Explore Effectively | Lan Pan et.al. | 2501.18009 | null |
2025-01-29 | Fault Localization via Fine-tuning Large Language Models with Mutation Generated Stack Traces | Neetha Jambigi et.al. | 2501.18005 | null |
2025-01-29 | InnerThoughts: Disentangling Representations and Predictions in Large Language Models | Didier Chételat et.al. | 2501.17994 | null |
2025-01-29 | Can Generative LLMs Create Query Variants for Test Collections? An Exploratory Study | Marwah Alaofi et.al. | 2501.17981 | link |
2025-01-29 | Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization | Zishun Yu et.al. | 2501.17974 | null |
2025-01-29 | "I Would Never Trust Anything Western": Kumu (Educator) Perspectives on Use of LLMs for Culturally Revitalizing CS Education in Hawaiian Schools | Manas Mhasakar et.al. | 2501.17942 | null |
2025-01-29 | DReSS: Data-driven Regularized Structured Streamlining for Large Language Models | Mingkuan Feng et.al. | 2501.17905 | null |
2025-01-29 | Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning? | Pouya Pezeshkpour et.al. | 2501.17840 | link |
2025-01-29 | Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology | Sobhan Hemati et.al. | 2501.17822 | null |
2025-01-30 | Leveraging Multimodal LLM for Inspirational User Interface Search | Seokhyeon Park et.al. | 2501.17799 | link |
2025-01-29 | BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights | Chan-Jan Hsu et.al. | 2501.17790 | null |
2025-01-29 | AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing | Peter Pak et.al. | 2501.17784 | null |
2025-01-29 | 2SSP: A Two-Stage Framework for Structured Pruning of LLMs | Fabrizio Sandri et.al. | 2501.17771 | link |
2025-01-29 | Generative Unordered Flow for Set-Structured Data Generation | Yangming Li et.al. | 2501.17770 | null |
2025-01-29 | Hybrid Graphs for Table-and-Text based Question Answering using LLMs | Ankush Agarwal et.al. | 2501.17767 | null |
2025-01-29 | On the Partitioning of GPU Power among Multi-Instances | Tirth Vamja et.al. | 2501.17752 | null |
2025-01-29 | Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation | Aitor Arrieta et.al. | 2501.17749 | null |
2025-01-29 | A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches | Ana R. Baião et.al. | 2501.17729 | null |
2025-01-29 | Using Code Generation to Solve Open Instances of Combinatorial Design Problems | Christopher D. Rosin et.al. | 2501.17725 | link |
2025-01-29 | RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts | Eujeong Choi et.al. | 2501.17715 | link |
2025-01-29 | Source-Channel Separation Theorems for Distortion Perception Coding | Chao Tian et.al. | 2501.17706 | null |
2025-01-29 | Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching | Xuzhe Dang et.al. | 2501.17665 | null |
2025-01-30 | In-Context Meta LoRA Generation | Yihua Shao et.al. | 2501.17635 | null |
2025-01-29 | Uncertainty Quantification and Decomposition for LLM-based Recommendation | Wonbin Kweon et.al. | 2501.17630 | link |
2025-01-29 | The Imitation Game According To Turing | Sharon Temtsin et.al. | 2501.17629 | null |
2025-01-29 | Structured Context Recomposition for Large Language Models Using Probabilistic Layer Realignment | Jonathan Teel et.al. | 2501.17617 | null |
2025-01-29 | Semantic Consistency Regularization with Large Language Models for Semi-supervised Sentiment Analysis | Kunrong Li et.al. | 2501.17598 | null |
2025-01-30 | Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models | Behraj Khan et.al. | 2501.17595 | null |
2025-01-29 | GLLM: Self-Corrective G-Code Generation using Large Language Models with User Feedback | Mohamed Abdelaal et.al. | 2501.17584 | null |
2025-01-29 | CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs | Amey Hengle et.al. | 2501.17581 | null |
2025-01-29 | Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding | Marco Pasini et.al. | 2501.17578 | null |
2025-01-29 | Query-Aware Learnable Graph Pooling Tokens as Prompt for Large Language Models | Wooyoung Kim et.al. | 2501.17549 | null |
2025-01-29 | Towards Training-Free Open-World Classification with 3D Generative Models | Xinzhe Xia et.al. | 2501.17547 | null |
2025-01-29 | Is Conversational XAI All You Need? Human-AI Decision Making With a Conversational XAI Assistant | Gaole He et.al. | 2501.17546 | link |
2025-01-29 | Towards Supporting Penetration Testing Education with Large Language Models: an Evaluation and Comparison | Martin Nizon-Deladoeuille et.al. | 2501.17539 | null |
2025-01-29 | Neural Spelling: A Spell-Based BCI System for Language Neural Decoding | Xiaowei Jiang et.al. | 2501.17489 | null |
2025-01-29 | DFPE: A Diverse Fingerprint Ensemble for Enhancing LLM Performance | Seffi Cohen et.al. | 2501.17479 | link |
2025-01-29 | AugmenTest: Enhancing Tests with LLM-Driven Oracles | Shaker Mahmud Khandaker et.al. | 2501.17461 | null |
2025-01-29 | Large Language Models for Single-Step and Multi-Step Flight Trajectory Prediction | Kaiwei Luo et.al. | 2501.17459 | null |
2025-01-29 | Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation | Tiansheng Huang et.al. | 2501.17433 | link |
2025-01-29 | Actions Speak Louder than Words: Agent Decisions Reveal Implicit Biases in Language Models | Yuxuan Li et.al. | 2501.17420 | null |
2025-01-29 | MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs | Ved Sirdeshmukh et.al. | 2501.17399 | link |
2025-01-29 | Learning Free Token Reduction for Multi-Modal LLM | Zihui Zhao et.al. | 2501.17391 | null |
2025-01-29 | Context-Aware Semantic Recomposition Mechanism for Large Language Models | Richard Katrix et.al. | 2501.17386 | null |
2025-01-28 | Deep-and-Wide Learning: Enhancing Data-Driven Inference via Synergistic Learning of Inter- and Intra-Data Representations | Md Tauhidul Islam et.al. | 2501.17347 | null |
2025-01-28 | Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction | Mingyu Derek Ma et.al. | 2501.17326 | null |
2025-01-28 | CardiCat: a Variational Autoencoder for High-Cardinality Tabular Data | Lee Carlin et.al. | 2501.17324 | null |
2025-01-30 | Probing LLM World Models: Enhancing Guesstimation with Wisdom of Crowds Decoding | Yun-Shiuan Chuang et.al. | 2501.17310 | null |
2025-01-28 | "Ownership, Not Just Happy Talk": Co-Designing a Participatory Large Language Model for Journalism | Emily Tseng et.al. | 2501.17299 | null |
2025-01-28 | Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization | Zilu Tang et.al. | 2501.17295 | null |
2025-01-28 | Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology | Peilong Wang et.al. | 2501.17286 | null |
2025-01-30 | From Natural Language to Extensive-Form Game Representations | Shilong Deng et.al. | 2501.17282 | link |
2025-01-28 | Engineering Point Defects in MoS2 for Tailored Material Properties using Large Language Models | Abdalaziz Al-Maeeni et.al. | 2501.17279 | null |
2025-01-28 | Tailored Truths: Optimizing LLM Persuasion with Personalization and Fabricated Statistics | Jasper Timm et.al. | 2501.17273 | link |
2025-01-28 | Integrating Reinforcement Learning and AI Agents for Adaptive Robotic Interaction and Assistance in Dementia Care | Fengpei Yuan et.al. | 2501.17206 | null |
2025-01-28 | SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training | Tianzhe Chu et.al. | 2501.17161 | null |
2025-01-28 | FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data | Deren Lei et.al. | 2501.17144 | link |
2025-01-28 | ASTRAL: Automated Safety Testing of Large Language Models | Miriam Ugarte et.al. | 2501.17132 | null |
2025-01-28 | Optimizing Large Language Model Training Using FP4 Quantization | Ruizhe Wang et.al. | 2501.17116 | null |
2025-01-28 | Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction | Carl-Leander Henneking et.al. | 2501.17112 | null |
2025-01-28 | Goodness of Fit for Bayesian Generative Models with Applications in Population Genetics | Guillaume Le Mailloux et.al. | 2501.17107 | link |
2025-01-28 | Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving | Evgenii Evstafev et.al. | 2501.17084 | null |
2025-01-28 | Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding | Akash Kumar et.al. | 2501.17053 | null |
2025-01-28 | Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models | Minghan Li et.al. | 2501.17039 | null |
2025-01-28 | Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies | Manojkumar Parmar et.al. | 2501.17030 | null |
2025-01-28 | Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs | Alessandro Midolo et.al. | 2501.17024 | link |
2025-01-28 | Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement | Kei Katsumata et.al. | 2501.17022 | link |
2025-01-28 | MIDI-GPT: A Controllable Generative Model for Computer-Assisted Multitrack Music Composition | Philippe Pasquier et.al. | 2501.17011 | null |
2025-01-28 | Large Language Models for Code Generation: The Practitioners Perspective | Zeeshan Rasheed et.al. | 2501.16998 | link |
2025-01-28 | Artificial Intelligence Clones | Annie Liang et.al. | 2501.16996 | null |
2025-01-28 | FedEFM: Federated Endovascular Foundation Model with Unseen Data | Tuong Do et.al. | 2501.16992 | null |
2025-01-28 | Generative quantum combinatorial optimization by means of a novel conditional generative quantum eigensolver | Shunya Minami et.al. | 2501.16986 | null |
2025-01-28 | Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling | Hongzhi Huang et.al. | 2501.16975 | null |
2025-01-28 | Instantiation-based Formalization of Logical Reasoning Tasks using Language Models and Logical Solvers | Mohammad Raza et.al. | 2501.16961 | null |
2025-01-28 | Multiple Abstraction Level Retrieve Augment Generation | Zheng Zheng et.al. | 2501.16952 | null |
2025-01-29 | TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models | Makoto Shing et.al. | 2501.16937 | null |
2025-01-28 | Detecting harassment and defamation in cyberbullying with emotion-adaptive training | Peiling Yi et.al. | 2501.16925 | link |
2025-01-28 | RDMM: Fine-Tuned LLM Models for On-Device Robotic Decision Making with Enhanced Contextual Awareness in Specific Domains | Shady Nasrat et.al. | 2501.16899 | link |
2025-01-28 | Machine-learning semi-local exchange-correlation functionals for Kohn-Sham density functional theory of the Hubbard model | Eoghan Cronin et.al. | 2501.16893 | null |
2025-01-28 | Irony Detection, Reasoning and Understanding in Zero-shot Learning | Peiling Yi et.al. | 2501.16884 | null |
2025-01-28 | Comparing Human and LLM Generated Code: The Jury is Still Out! | Sherlock A. Licorish et.al. | 2501.16857 | null |
2025-01-28 | Adapting Network Information to Semantics for Generalizable and Plug-and-Play Multi-Scenario Network Diagnosis | Tiao Tan et.al. | 2501.16842 | null |
2025-01-28 | Misspellings in Natural Language Processing: A survey | Gianluca Sperduti et.al. | 2501.16836 | null |
2025-01-28 | DIRIGENt: End-To-End Robotic Imitation of Human Demonstrations Based on a Diffusion Model | Josua Spisak et.al. | 2501.16800 | null |
2025-01-28 | Algorithm for Automatic Legislative Text Consolidation | Matias Etcheverry et.al. | 2501.16794 | null |
2025-01-28 | Exponential Family Attention | Kevin Christian Wibisono et.al. | 2501.16790 | link |
2025-01-28 | Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding | Yun Li et.al. | 2501.16786 | null |
2025-01-28 | TORCHLIGHT: Shedding LIGHT on Real-World Attacks on Cloudless IoT Devices Concealed within the Tor Network | Yumingzhi Pan et.al. | 2501.16784 | null |
2025-01-28 | A Stochastic Dynamical Theory of LLM Self-Adversariality: Modeling Severity Drift as a Critical Process | Jack David Carson et.al. | 2501.16783 | null |
2025-01-29 | Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models | Muhammad Atta ur Rahman et.al. | 2501.16769 | null |
2025-01-28 | DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Chenguo Lin et.al. | 2501.16764 | null |
2025-01-28 | HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns | Xinyue Shen et.al. | 2501.16750 | link |
2025-01-28 | Through the Prism of Culture: Evaluating LLMs' Understanding of Indian Subcultures and Traditions | Garima Chhikara et.al. | 2501.16748 | null |
2025-01-28 | LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience | Nimesh Jha et.al. | 2501.16744 | null |
2025-01-28 | Distilling Large Language Models for Network Active Queue Management | Deol Satish et.al. | 2501.16734 | null |
2025-01-28 | xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking | Sunbowen Lee et.al. | 2501.16727 | link |
2025-01-28 | One Head Eight Arms: Block Matrix based Low Rank Adaptation for CLIP-based Few-Shot Learning | Chunpeng Zhou et.al. | 2501.16720 | null |
2025-01-28 | Outlier Synthesis via Hamiltonian Monte Carlo for Out-of-Distribution Detection | Hengzhuang Li et.al. | 2501.16718 | link |
2025-01-28 | 3D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow | Yueen Ma et.al. | 2501.16698 | null |
2025-01-28 | MME-Industry: A Cross-Industry Multimodal Evaluation Benchmark | Dongyi Yi et.al. | 2501.16688 | null |
2025-01-28 | Auto-Differentiating Any LLM Workflow: A Farewell to Manual Prompting | Li Yin et.al. | 2501.16673 | link |
2025-01-28 | VeriFact: Verifying Facts in LLM-Generated Clinical Text with Electronic Health Records | Philip Chung et.al. | 2501.16672 | link |
2025-01-28 | Contextual Reinforcement in Multimodal Token Compression for Large Language Models | Naderdel Piero et.al. | 2501.16658 | null |
2025-01-28 | Large Language Model Critics for Execution-Free Evaluation of Code Changes | Aashish Yadavally et.al. | 2501.16655 | link |
2025-01-28 | Molecular-driven Foundation Model for Oncologic Pathology | Anurag Vaidya et.al. | 2501.16652 | null |
2025-01-28 | DOCS: Quantifying Weight Similarity for Deeper Insights into Large Language Models | Zeping Min et.al. | 2501.16650 | null |
2025-01-28 | An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Koji Inoue et.al. | 2501.16643 | null |
2025-01-28 | CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs | Jinlan Fu et.al. | 2501.16629 | link |
2025-01-28 | Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems | Baraa Hikal et.al. | 2501.16616 | null |
2025-01-28 | Sparse Autoencoders Trained on the Same Data Learn Different Features | Gonçalo Paulo et.al. | 2501.16615 | null |
2025-01-28 | Fine-Tuned Language Models as Space Systems Controllers | Enrico M. Zucchelli et.al. | 2501.16588 | null |
2025-01-27 | AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models | Zheng Lian et.al. | 2501.16566 | null |
2025-01-27 | LoRA-X: Bridging Foundation Models with Training-Free Cross-Model Adaptation | Farzad Farhadzadeh et.al. | 2501.16559 | null |
2025-01-27 | Distributional Information Embedding: A Framework for Multi-bit Watermarking | Haiyun He et.al. | 2501.16558 | null |
2025-01-27 | PackDiT: Joint Human Motion and Text Generation via Mutual Prompting | Zhongyu Jiang et.al. | 2501.16551 | null |
2025-01-27 | PhysAnimator: Physics-Guided Generative Cartoon Animation | Tianyi Xie et.al. | 2501.16550 | null |
2025-01-27 | Sample-Efficient Behavior Cloning Using General Domain Knowledge | Feiyu Zhu et.al. | 2501.16546 | null |
2025-01-27 | Generalized Mission Planning for Heterogeneous Multi-Robot Teams via LLM-constructed Hierarchical Trees | Piyush Gupta et.al. | 2501.16539 | null |
2025-01-27 | Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs | Jean-Charles Noirot Ferrand et.al. | 2501.16534 | null |
2025-01-27 | A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain | Jorge del Pozo Lérida et.al. | 2501.16533 | null |
2025-01-27 | Programming by Examples Meets Historical Linguistics: A Large Language Model Based Approach to Sound Law Induction | Atharva Naik et.al. | 2501.16524 | null |
2025-01-27 | How well can LLMs Grade Essays in Arabic? | Rayed Ghazawi et.al. | 2501.16516 | null |
2025-01-27 | Deception in LLMs: Self-Preservation and Autonomous Goals in Large Language Models | Sudarshan Kamath Barkur et.al. | 2501.16513 | null |
2025-01-27 | Smoothed Embeddings for Robust Language Models | Ryo Hase et.al. | 2501.16497 | null |
2025-01-27 | Explaining GitHub Actions Failures with Large Language Models: Challenges, Insights, and Limitations | Pablo Valenzuela-Toledo et.al. | 2501.16495 | null |
2025-01-27 | Generating customized prompts for Zero-Shot Rare Event Medical Image Classification using LLM | Payal Kamboj et.al. | 2501.16481 | link |
2025-01-27 | Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation | Philip Hughes et.al. | 2501.16467 | null |
2025-01-27 | CoCoNUT: Structural Code Understanding does not fall out of a tree | Claas Beger et.al. | 2501.16456 | link |
2025-01-27 | Detecting Zero-Day Attacks in Digital Substations via In-Context Learning | Faizan Manzoor et.al. | 2501.16453 | null |
2025-01-27 | 360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation | Hamed Firooz et.al. | 2501.16450 | null |
2025-01-27 | DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation | Han Sun et.al. | 2501.16410 | null |
2025-01-27 | Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology | Meiyun Cao et.al. | 2501.16309 | null |
2025-01-27 | RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval | Long Nguyen et.al. | 2501.16303 | null |
2025-01-27 | Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width | Zheng Liu et.al. | 2501.16302 | null |
2025-01-27 | Large Models in Dialogue for Active Perception and Anomaly Detection | Tzoulio Chamiti et.al. | 2501.16300 | link |
2025-01-27 | FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers | Renshan Zhang et.al. | 2501.16297 | null |
2025-01-27 | Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models | Jing Zhang et.al. | 2501.16282 | null |
2025-01-27 | Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation | Jiayi Hong et.al. | 2501.16277 | link |
2025-01-27 | URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT | Long Nguyen et.al. | 2501.16276 | null |
2025-01-27 | A foundation model for human-AI collaboration in medical literature mining | Zifeng Wang et.al. | 2501.16255 | null |
2025-01-27 | Multi-Agent Geospatial Copilots for Remote Sensing Workflows | Chaehong Lee et.al. | 2501.16254 | null |
2025-01-27 | Zero-Shot Decision Tree Construction via Large Language Models | Lucas Carrasco et.al. | 2501.16247 | null |
2025-01-27 | CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation | Xiaochuan Ma et.al. | 2501.16246 | null |
2025-01-27 | Phase Transitions in Large Language Models and the | Youran Sun et.al. | 2501.16241 | null |
2025-01-27 | AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses | Runze Cai et.al. | 2501.16240 | null |
2025-01-28 | Distilling foundation models for robust and efficient models in digital pathology | Alexandre Filiot et.al. | 2501.16239 | null |
2025-01-27 | Language-Based Bayesian Optimization Research Assistant (BORA) | Abdoulatif Cissé et.al. | 2501.16224 | null |
2025-01-27 | Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models | Huayu Li et.al. | 2501.16215 | link |
2025-01-27 | Provence: efficient and robust context pruning for retrieval-augmented generation | Nadezhda Chirkova et.al. | 2501.16214 | null |
2025-01-27 | Raiders of the Lost Dependency: Fixing Dependency Conflicts in Python using LLMs | Antony Bartlett et.al. | 2501.16191 | null |
2025-01-27 | SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting | Wenxuan Xie et.al. | 2501.16178 | link |
2025-01-27 | BAG: Body-Aligned 3D Wearable Asset Generation | Zhongjin Luo et.al. | 2501.16177 | null |
2025-01-27 | Will Systems of LLM Agents Cooperate: An Investigation into a Social Dilemma | Richard Willis et.al. | 2501.16173 | link |
2025-01-27 | MetaDecorator: Generating Immersive Virtual Tours through Multimodality | Shuang Xie et.al. | 2501.16164 | null |
2025-01-27 | CITYWALK: Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge | Yuwei Zhang et.al. | 2501.16155 | null |
2025-01-27 | AdaCoT: Rethinking Cross-Lingual Factual Reasoning through Adaptive Chain-of-Thought | Xin Huang et.al. | 2501.16154 | null |
2025-01-27 | AI Agents for Computer Use: A Review of Instruction-based Computer Control, GUI Automation, and Operator Assistants | Pascal J. Sager et.al. | 2501.16150 | null |
2025-01-27 | PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing | Yuwei Zhang et.al. | 2501.16149 | null |
2025-01-27 | SampleLLM: Optimizing Tabular Data Synthesis in Recommendations | Jingtong Gao et.al. | 2501.16125 | null |
2025-01-27 | Using Generative Models to Produce Realistic Populations of UK Windstorms | Yee Chun Tsoi et.al. | 2501.16110 | null |
2025-01-27 | Integration of LLM Quality Assurance into an NLG System | Ching-Yi Chen et.al. | 2501.16078 | null |
2025-01-27 | PISCO: Pretty Simple Compression for Retrieval-Augmented Generation | Maxime Louis et.al. | 2501.16075 | null |
2025-01-27 | A generative material transformer using Wyckoff representation | Pierre-Paul De Breuck et.al. | 2501.16051 | null |
2025-01-27 | Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation | Xing Zhang et.al. | 2501.16050 | null |
2025-01-27 | PRISMe: A Novel LLM-Powered Tool for Interactive Privacy Policy Assessment | Vincent Freiberger et.al. | 2501.16033 | null |
2025-01-27 | FDLLM: A Text Fingerprint Detection Method for LLMs in Multi-Language, Multi-Domain Black-Box Environments | Zhiyuan Fu et.al. | 2501.16029 | null |
2025-01-27 | Transformability reveals the interplay of dynamics across different network orders | Ming Xie et.al. | 2501.16016 | null |
2025-01-27 | TOPLOC: A Locality Sensitive Hashing Scheme for Trustless Verifiable Inference | Jack Min Ong et.al. | 2501.16007 | null |
2025-01-27 | EDSep: An Effective Diffusion-Based Method for Speech Source Separation | Jinwei Dong et.al. | 2501.15965 | null |
2025-01-27 | Rethinking the Bias of Foundation Model under Long-tailed Distribution | Jiahao Chen et.al. | 2501.15955 | null |
2025-01-27 | Understanding Long Videos via LLM-Powered Entity Relation Graphs | Meng Chu et.al. | 2501.15953 | null |
2025-01-27 | TimeHF: Billion-Scale Time Series Models Guided by Human Feedback | Yongzhi Qi et.al. | 2501.15942 | null |
2025-01-27 | SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub | Benjamin C. Carter et.al. | 2501.15922 | null |
2025-01-27 | Parametric Retrieval Augmented Generation | Weihang Su et.al. | 2501.15915 | link |
2025-01-27 | Robust Mobile Robot Path Planning via LLM-Based Dynamic Waypoint Generation | Muhammad Taha Tariq et.al. | 2501.15901 | null |
2025-01-27 | Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects | Victor Deng et.al. | 2501.15900 | null |
2025-01-27 | Adaptive Width Neural Networks | Federico Errica et.al. | 2501.15889 | null |
2025-01-27 | LCTG Bench: LLM Controlled Text Generation Benchmark | Kentaro Kurihara et.al. | 2501.15875 | link |
2025-01-27 | LLM-attacker: Enhancing Closed-loop Adversarial Scenario Generation for Autonomous Driving with Large Language Models | Yuewen Mei et.al. | 2501.15850 | null |
2025-01-27 | SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model | Delin Qu et.al. | 2501.15830 | null |
2025-01-27 | Aging-aware CPU Core Management for Embodied Carbon Amortization in Cloud LLM Inference | Tharindu B. Hewage et.al. | 2501.15829 | link |
2025-01-27 | MADP: Multi-Agent Deductive Planning for Enhanced Cognitive-Behavioral Mental Health Question Answer | Qi Chen et.al. | 2501.15826 | null |
2025-01-27 | LemmaHead: RAG Assisted Proof Generation Using Large Language Models | Tianbo Yang et.al. | 2501.15797 | null |
2025-01-27 | Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection? | Zhiling Chen et.al. | 2501.15795 | null |
2025-01-27 | Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Yu Li et.al. | 2501.15791 | link |
2025-01-27 | Memorization and Regularization in Generative Diffusion Models | Ricardo Baptista et.al. | 2501.15785 | link |
2025-01-27 | Large Language Models to Diffusion Finetuning | Edoardo Cetin et.al. | 2501.15781 | null |
2025-01-27 | Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages | Ivory Yang et.al. | 2501.15773 | link |
2025-01-27 | GraphICL: Unlocking Graph Learning Potential in LLMs through Structured Prompt Design | Yuanfu Sun et.al. | 2501.15755 | null |
2025-01-27 | IndicMMLU-Pro: Benchmarking the Indic Large Language Models | Sankalp KJ et.al. | 2501.15747 | null |
2025-01-27 | Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning | Michael Xieyang Liu et.al. | 2501.15727 | null |
2025-01-27 | A Survey on Computational Pathology Foundation Models: Datasets, Adaptation Strategies, and Evaluation Tasks | Dong Li et.al. | 2501.15724 | null |
2025-01-27 | On Parallelism in Music and Language: A Perspective from Symbol Emergence Systems based on Probabilistic Generative Models | Tadahiro Taniguchi et.al. | 2501.15721 | null |
2025-01-26 | Adapting Biomedical Abstracts into Plain language using Large Language Models | Haritha Gangavarapu et.al. | 2501.15700 | null |
2025-01-26 | TensorLLM: Tensorising Multi-Head Attention for Enhanced Reasoning and Compression in LLMs | Yuxuan Gu et.al. | 2501.15674 | null |
2025-01-26 | Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting | Yuxin Zhang et.al. | 2501.15641 | null |
2025-01-26 | BoKDiff: Best-of-K Diffusion Alignment for Target-Specific 3D Molecule Generation | Ali Khodabandeh Yalabadi et.al. | 2501.15631 | link |
2025-01-26 | Improving Estonian Text Simplification through Pretrained Language Models and Custom Datasets | Eduard Barbu et.al. | 2501.15624 | null |
2025-01-26 | Rethinking External Slow-Thinking: From Snowball Errors to Probability of Correct Reasoning | Zeyu Gan et.al. | 2501.15602 | link |
2025-01-26 | Evaluating an LLM-Powered Chatbot for Cognitive Restructuring: Insights from Mental Health Professionals | Yinzhou Wang et.al. | 2501.15599 | null |
2025-01-26 | Diffusion Generative Modeling for Spatially Resolved Gene Expression Inference from Histology Images | Sichen Zhu et.al. | 2501.15598 | link |
2025-01-26 | SedarEval: Automated Evaluation using Self-Adaptive Rubrics | Zhiyuan Fan et.al. | 2501.15595 | link |
2025-01-26 | SCP-116K: A High-Quality Problem-Solution Dataset and a Generalized Pipeline for Automated Extraction in the Higher Education Science Domain | Dakuan Lu et.al. | 2501.15587 | link |
2025-01-26 | Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework | Yuhong Sun et.al. | 2501.15581 | null |
2025-01-26 | Instruction Tuning for Story Understanding and Generation with Weak Supervision | Yangshu Yuan et.al. | 2501.15574 | null |
2025-01-26 | Cross-Cultural Fashion Design via Interactive Large Language Models and Diffusion Models | Spencer Ramsey et.al. | 2501.15571 | null |
2025-01-26 | ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer | Lin Yueyu et.al. | 2501.15570 | link |
2025-01-26 | Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Song Chen et.al. | 2501.15558 | null |
2025-01-26 | Advancing Generative Artificial Intelligence and Large Language Models for Demand Side Management with Electric Vehicles | Hanwen Zhang et.al. | 2501.15544 | null |
2025-01-26 | Estimating Committor Functions via Deep Adaptive Sampling on Rare Transition Paths | Yueyang Wang et.al. | 2501.15522 | null |
2025-01-26 | Domain Adaptation from Generated Multi-Weather Images for Unsupervised Maritime Object Classification | Dan Song et.al. | 2501.15503 | null |
2025-01-26 | Unveiling the Potential of Multimodal Retrieval Augmented Generation with Planning | Xiaohan Yu et.al. | 2501.15470 | null |
2025-01-26 | Data-adaptive Safety Rules for Training Reward Models | Xiaomin Li et.al. | 2501.15453 | null |
2025-01-26 | OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas | Xiaoyang Wang et.al. | 2501.15427 | null |
2025-01-26 | Visual Generation Without Guidance | Huayu Chen et.al. | 2501.15420 | link |
2025-01-26 | AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement | Junan Zhang et.al. | 2501.15417 | null |
2025-01-26 | The Potential of Large Language Models in Supply Chain Management: Advancing Decision-Making, Efficiency, and Innovation | Raha Aghaei et.al. | 2501.15411 | null |
2025-01-26 | Semantic Layered Embedding Diffusion in Large Language Models for Multi-Contextual Consistency | Irin Kabakum et.al. | 2501.15405 | null |
2025-01-26 | How Green are Neural Language Models? Analyzing Energy Consumption in Text Summarization Fine-tuning | Tohida Rehman et.al. | 2501.15398 | null |
2025-01-26 | Zero-Shot Interactive Text-to-Image Retrieval via Diffusion-Augmented Representations | Zijun Long et.al. | 2501.15379 | null |
2025-01-26 | How to Mitigate Information Loss in Knowledge Graphs for GraphRAG: Leveraging Triple Context Restoration and Query-Driven Feedback | Manzong Huang et.al. | 2501.15378 | null |
2025-01-26 | Evaluating the Effectiveness of XAI Techniques for Encoder-Based Language Models | Melkamu Abay Mersha et.al. | 2501.15374 | null |
2025-01-26 | Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis | Robinson Umeike et.al. | 2501.15370 | null |
2025-01-26 | Decentralized Low-Rank Fine-Tuning of Large Language Models | Sajjad Ghiasvand et.al. | 2501.15361 | null |
2025-01-26 | Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection | Bo Yang et.al. | 2501.15355 | null |
2025-01-25 | Fairness in LLM-Generated Surveys | Andrés Abeliuk et.al. | 2501.15351 | null |
2025-01-25 | Between Puppet and Actor: Reframing Authorship in this Age of AI Agents | Yuqian Sun et.al. | 2501.15346 | null |
2025-01-25 | Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data | Jiajie Li et.al. | 2501.15326 | null |
2025-01-25 | ToMoE: Converting Dense Large Language Models to Mixture-of-Experts through Dynamic Structural Pruning | Shangqian Gao et.al. | 2501.15316 | null |
2025-01-25 | The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? | Ayo Adedeji et.al. | 2501.15310 | null |
2025-01-25 | You Only Prune Once: Designing Calibration-Free Model Compression With Policy Learning | Ayan Sengupta et.al. | 2501.15296 | null |
2025-01-24 | HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Xin Zhou et.al. | 2501.14729 | link |
2025-01-24 | Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? | Ipek Baris Schlicht et.al. | 2501.14719 | null |
2025-01-24 | Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models | Naihao Deng et.al. | 2501.14717 | null |
2025-01-24 | FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing | James Seale Smith et.al. | 2501.14713 | null |
2025-01-24 | The Karp Dataset | Mason DiCicco et.al. | 2501.14705 | null |
2025-01-24 | Rethinking Table Instruction Tuning | Naihao Deng et.al. | 2501.14693 | null |
2025-01-24 | Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST | Fuping Wu et.al. | 2501.14685 | null |
2025-01-24 | An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations | Shabnam Hassani et.al. | 2501.14683 | null |
2025-01-24 | Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning | Jisi Zhang et.al. | 2501.14680 | null |
2025-01-24 | MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications | Yixing Jiang et.al. | 2501.14654 | link |
2025-01-24 | Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion | Ziyao Xu et.al. | 2501.14649 | link |
2025-01-24 | Towards Scalable Topological Regularizers | Hiu-Tung Wong et.al. | 2501.14641 | null |
2025-01-24 | Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics | Renato Ghisellini et.al. | 2501.14634 | null |
2025-01-24 | Extracting Problem Structure with LLMs for Optimized SAT Local Search | André Schilder et.al. | 2501.14630 | null |
2025-01-24 | Single-neuron deep generative model uncovers underlying physics of neuronal activity in Ca imaging data | Jordi Abante et.al. | 2501.14615 | null |
2025-01-24 | ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations | Tianming Liang et.al. | 2501.14607 | null |
2025-01-24 | Leveraging ChatGPT's Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research | Hamid Sarmadi et.al. | 2501.14546 | null |
2025-01-24 | VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning | Benjamin Callewaert et.al. | 2501.14540 | null |
2025-01-24 | Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models | Zhenguang Zhong et.al. | 2501.14530 | link |
2025-01-24 | Scene Understanding Enabled Semantic Communication with Open Channel Coding | Zhe Xiang et.al. | 2501.14520 | null |
2025-01-24 | Real-world Edge Neural Network Implementations Leak Private Interactions Through Physical Side Channel | Zhuoran Liu et.al. | 2501.14512 | null |
2025-01-24 | Automated Assignment Grading with Large Language Models: Insights From a Bioinformatics Course | Pavlin G. Poličar et.al. | 2501.14499 | null |
2025-01-24 | Evaluating and Improving Graph to Text Generation with Large Language Models | Jie He et.al. | 2501.14497 | link |
2025-01-24 | RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques | Zhengyang Tang et.al. | 2501.14492 | link |
2025-01-24 | Pesti-Gen: Unleashing a Generative Molecule Approach for Toxicity Aware Pesticide Design | Taehan Kim et.al. | 2501.14469 | null |
2025-01-24 | Boundary Value Test Input Generation Using Prompt Engineering with LLMs: Fault Detection and Coverage Analysis | Xiujing Guo et.al. | 2501.14465 | null |
2025-01-24 | Understanding and Mitigating Gender Bias in LLMs via Interpretable Neuron Editing | Zeping Yu et.al. | 2501.14457 | null |
2025-01-24 | Domaino1s: Guiding LLM Reasoning for Explainable Answers in High-Stakes Domains | Xu Chu et.al. | 2501.14431 | null |
2025-01-24 | GraphBC: Improving LLMs for Better Graph Data Processing | Xu Chu et.al. | 2501.14427 | null |
2025-01-24 | CENTS: Generating synthetic electricity consumption time series for rare and unseen scenarios | Michael Fuest et.al. | 2501.14426 | null |
2025-01-24 | DeepFlow: Serverless Large Language Model Serving at Scale | Junhao Hu et.al. | 2501.14417 | null |
2025-01-24 | SKIL: Semantic Keypoint Imitation Learning for Generalizable Data-efficient Manipulation | Shengjie Wang et.al. | 2501.14400 | null |
2025-01-24 | ECTIL: Label-efficient Computational Tumour Infiltrating Lymphocyte (TIL) assessment in breast cancer: Multicentre validation in 2,340 patients with breast cancer | Yoni Schirris et.al. | 2501.14379 | link |
2025-01-24 | DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing | Xinyu Ma et.al. | 2501.14371 | link |
2025-01-24 | Uncovering the bias in the evidence for dynamical dark energy through minimal and generalized modeling approaches | Ziad Sakr et.al. | 2501.14366 | null |
2025-01-24 | FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration | Kai-Tuo Xu et.al. | 2501.14350 | link |
2025-01-24 | Chain-of-Retrieval Augmented Generation | Liang Wang et.al. | 2501.14342 | null |
2025-01-24 | Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts | Clément Desroches et.al. | 2501.14334 | null |
2025-01-24 | Assessing Large Language Models in Comprehending and Verifying Concurrent Programs across Memory Models | Ridhi Jain et.al. | 2501.14326 | null |
2025-01-24 | PAID: A Framework of Product-Centric Advertising Image Design | Hongyu Chen et.al. | 2501.14316 | null |
2025-01-24 | Locality-aware Fair Scheduling in LLM Serving | Shiyi Cao et.al. | 2501.14312 | null |
2025-01-24 | A Zero-Shot LLM Framework for Automatic Assignment Grading in Higher Education | Calvin Yeung et.al. | 2501.14305 | link |
2025-01-24 | MASTER: A Multi-Agent System with LLM Specialized MCTS | Bingzheng Gan et.al. | 2501.14304 | null |
2025-01-24 | Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph | Xujian Liang et.al. | 2501.14300 | link |
2025-01-24 | Multi-stage Large Language Model Pipelines Can Outperform GPT-4o in Relevance Assessment | Julian A. Schnabel et.al. | 2501.14296 | null |
2025-01-24 | Examining Alignment of Large Language Models through Representative Heuristics: The Case of Political Stereotypes | Sullam Jeoung et.al. | 2501.14294 | link |
2025-01-24 | Advances in Temporal Point Processes: Bayesian, Deep, and LLM Approaches | Feng Zhou et.al. | 2501.14291 | null |
2025-01-24 | Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation | Sadegh Mahdavi et.al. | 2501.14275 | link |
2025-01-24 | Siren: A Learning-Based Multi-Turn Attack Framework for Simulating Real-World Human Jailbreak Behaviors | Yi Zhao et.al. | 2501.14250 | link |
2025-01-24 | Humanity's Last Exam | Long Phan et.al. | 2501.14249 | null |
2025-01-24 | Multi-agent KTO: Reinforcing Strategic Interactions of Large Language Model in Language Game | Rong Ye et.al. | 2501.14225 | null |
2025-01-24 | Top Ten Challenges Towards Agentic Neural Graph Databases | Jiaxin Bai et.al. | 2501.14224 | null |
2025-01-24 | TFG-Flow: Training-free Guidance in Multimodal Generative Flow | Haowei Lin et.al. | 2501.14216 | null |
2025-01-24 | Serving Long-Context LLMs at the Mobile Edge: Test-Time Reinforcement Learning-based Model Caching and Inference Offloading | Minrui Xu et.al. | 2501.14205 | null |
2025-01-24 | VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking | Runyi Hu et.al. | 2501.14195 | link |
2025-01-24 | Distributed Multi-Agent Coordination Using Multi-Modal Foundation Models | Saaduddin Mahmud et.al. | 2501.14189 | null |
2025-01-24 | GeoSim.AI: AI assistants for numerical simulations in geomechanics | Yared W. Bekele et.al. | 2501.14186 | null |
2025-01-24 | AI Chatbots as Professional Service Agents: Developing a Professional Identity | Wenwen Li et.al. | 2501.14179 | null |
2025-01-24 | Argos: Agentic Time-Series Anomaly Detection with Autonomous Rule Generation via Large Language Models | Yile Gu et.al. | 2501.14170 | null |
2025-01-24 | Test-Time Code-Switching for Cross-lingual Aspect Sentiment Triplet Extraction | Dongming Sheng et.al. | 2501.14144 | null |
2025-01-23 | Autonomous Structural Memory Manipulation for Large Language Models Using Hierarchical Embedding Augmentation | Derek Yotheringhay et.al. | 2501.14119 | null |
2025-01-23 | Domain-Factored Untrained Deep Prior for Spectrum Cartography | Subash Timilsina et.al. | 2501.14116 | null |
2025-01-23 | MedSlice: Fine-Tuned Large Language Models for Secure Clinical Note Sectioning | Joshua Davis et.al. | 2501.14105 | link |
2025-01-23 | StreamingRAG: Real-time Contextual Retrieval and Generation Framework | Murugan Sankaradas et.al. | 2501.14101 | null |
2025-01-23 | Enhancing Biomedical Relation Extraction with Directionality | Po-Ting Lai et.al. | 2501.14079 | link |
2025-01-23 | LLMs are Vulnerable to Malicious Prompts Disguised as Scientific Language | Yubin Ge et.al. | 2501.14073 | null |
2025-01-23 | Efficient 2D CT Foundation Model for Contrast Phase Classification | Benjamin Hou et.al. | 2501.14066 | null |
2025-01-23 | Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models | Jakob Krogh Petersen et.al. | 2501.14051 | link |
2025-01-23 | LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps | Andrey Palaev et.al. | 2501.14046 | link |
2025-01-23 | Leveraging Large Language Models to Analyze Emotional and Contextual Drivers of Teen Substance Use in Online Discussions | Jianfeng Zhu et.al. | 2501.14037 | null |
2025-01-23 | CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation | Guofeng Cui et.al. | 2501.13927 | null |
2025-01-23 | Improving Video Generation with Human Feedback | Jie Liu et.al. | 2501.13918 | null |
2025-01-23 | Binary Diffusion Probabilistic Model | Vitaliy Kinakh et.al. | 2501.13915 | null |
2025-01-23 | Analysis of Indic Language Capabilities in LLMs | Aatman Vaidya et.al. | 2501.13912 | null |
2025-01-23 | Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models | Linh Tran et.al. | 2501.13904 | null |
2025-01-23 | Exploring Finetuned Audio-LLM on Heart Murmur Features | Adrian Florea et.al. | 2501.13884 | null |
2025-01-23 | The machine learning platform for developers of large systems | Alexey Naikov et.al. | 2501.13881 | null |
2025-01-23 | A RAG-Based Institutional Assistant | Gustavo Kuratomi et.al. | 2501.13880 | null |
2025-01-23 | On the Reasoning Capacity of AI Models and How to Quantify It | Santosh Kumar Radha et.al. | 2501.13833 | null |
2025-01-23 | Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing | Hao Zhang et.al. | 2501.13831 | null |
2025-01-23 | Hallucinations Can Improve Large Language Models in Drug Discovery | Shuzhou Yuan et.al. | 2501.13824 | null |
2025-01-23 | Large Language Model driven Policy Exploration for Recommender Systems | Jie Wang et.al. | 2501.13816 | null |
2025-01-23 | Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change | Mowafak Allaham et.al. | 2501.13802 | null |
2025-01-23 | Parameter-Efficient Fine-Tuning for Foundation Models | Dan Zhang et.al. | 2501.13787 | link |
2025-01-23 | Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling | Tanya Rodchenko et.al. | 2501.13779 | null |
2025-01-23 | Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework | Yoonsang Kim et.al. | 2501.13778 | link |
2025-01-23 | Do Large Language Models Truly Understand Geometric Structures? | Xiaofeng Wang et.al. | 2501.13773 | link |
2025-01-23 | Tune In, Act Up: Exploring the Impact of Audio Modality-Specific Edits on Large Audio Language Models in Jailbreak | Erjia Xiao et.al. | 2501.13772 | null |
2025-01-23 | UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models | Xin Xu et.al. | 2501.13766 | null |
2025-01-23 | EICopilot: Search and Explore Enterprise Information over Large-scale Knowledge Graphs with LLM-driven Agents | Yuhui Yun et.al. | 2501.13746 | null |
2025-01-23 | GPT-HTree: A Decision Tree Framework Integrating Hierarchical Clustering and Large Language Models for Explainable Classification | Te Pei et.al. | 2501.13743 | null |
2025-01-23 | An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities | Zezhou Yang et.al. | 2501.13742 | link |
2025-01-23 | Pseudocode-Injection Magic: Enabling LLMs to Tackle Graph Computational Tasks | Chang Gong et.al. | 2501.13731 | null |
2025-01-23 | RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation | Shi-Qi Yan et.al. | 2501.13726 | null |
2025-01-23 | Musical ethnocentrism in Large Language Models | Anna Kruspe et.al. | 2501.13720 | null |
2025-01-23 | A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation | Dario Serez et.al. | 2501.13718 | null |
2025-01-23 | EventVL: Understand Event Streams via Multimodal Large Language Model | Pengteng Li et.al. | 2501.13707 | null |
2025-01-23 | DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale | Linghao Zhang et.al. | 2501.13699 | null |
2025-01-23 | Question Answering on Patient Medical Records with Private Fine-Tuned LLMs | Sara Kothari et.al. | 2501.13687 | null |
2025-01-23 | HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor | Zihui Wu et.al. | 2501.13677 | link |
2025-01-23 | How to Complete Domain Tuning while Keeping General Ability in LLM: Adaptive Layer-wise and Element-wise Regularization | Shezheng Song et.al. | 2501.13669 | null |
2025-01-23 | LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models | Yizheng Sun et.al. | 2501.13652 | null |
2025-01-23 | Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models | Zhenghao Lin et.al. | 2501.13629 | null |
2025-01-23 | Text-to-SQL based on Large Language Models and Database Keyword Search | Eduardo R. Nascimento et.al. | 2501.13594 | null |
2025-01-23 | Improving Contextual Faithfulness of Large Language Models via Retrieval Heads-Induced Optimization | Lei Huang et.al. | 2501.13573 | null |
2025-01-23 | One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt | Tao Liu et.al. | 2501.13554 | link |
2025-01-23 | LLMs Can Plan Only If We Tell Them | Bilgehan Sel et.al. | 2501.13545 | null |
2025-01-23 | ReasVQA: Advancing VideoQA with Imperfect Reasoning Process | Jianxin Liang et.al. | 2501.13536 | null |
2025-01-23 | RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles | Munachiso Nwadike et.al. | 2501.13491 | null |
2025-01-23 | Adaptive Testing for LLM-Based Applications: A Diversity-based Approach | Juyeon Yoon et.al. | 2501.13480 | null |
2025-01-23 | LDR-Net: A Novel Framework for AI-generated Image Detection via Localized Discrepancy Representation | JiaXin Chen et.al. | 2501.13475 | null |
2025-01-23 | Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge | Haomiao Xiong et.al. | 2501.13468 | link |
2025-01-23 | Spurious Forgetting in Continual Learning of Language Models | Junhao Zheng et.al. | 2501.13453 | link |
2025-01-23 | Softplus Attention with Re-weighting Boosts Length Extrapolation in Large Language Models | Bo Gao et.al. | 2501.13428 | null |
2025-01-23 | Predicting Turbulence Structure In Street-Canyon Flows using Deep Generative Modeling | Tomek Jaroslawski et.al. | 2501.13415 | null |
2025-01-23 | VulnBot: Autonomous Penetration Testing for A Multi-Agent Collaborative Framework | He Kong et.al. | 2501.13411 | link |
2025-01-23 | Towards Intelligent Design: A Self-driven Framework for Collocated Clothing Synthesis Leveraging Fashion Styles and Textures | Minglong Dong et.al. | 2501.13396 | null |
2025-01-23 | Can Large Language Models Understand Preferences in Personalized Recommendation? | Zhaoxuan Tan et.al. | 2501.13391 | link |
2025-01-23 | Do as We Do, Not as You Think: the Conformity of Large Language Models | Zhiyuan Weng et.al. | 2501.13381 | link |
2025-01-23 | Scalable Evaluation Framework for Foundation Models in Musculoskeletal MRI Bridging Computational Innovation with Clinical Utility | Gabrielle Hoyer et.al. | 2501.13376 | null |
2025-01-23 | Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement | Jae-Sung Bae et.al. | 2501.13372 | null |
2025-01-23 | Meta-Feature Adapter: Integrating Environmental Metadata for Enhanced Animal Re-identification | Yuzhuo Li et.al. | 2501.13368 | null |
2025-01-23 | 50 Shades of Deceptive Patterns: A Unified Taxonomy, Multimodal Detection, and Security Implications | Zewei Shi et.al. | 2501.13351 | null |
2025-01-23 | MSF: Efficient Diffusion Model Via Multi-Scale Latent Factorize | Haohang Xu et.al. | 2501.13349 | null |
2025-01-23 | Full-Stack Optimized Large Language Models for Lifelong Sequential Behavior Comprehension in Recommendation | Rong Shan et.al. | 2501.13344 | null |
2025-01-23 | Multi-aspect Knowledge Distillation with Large Language Model | Taegyeong Lee et.al. | 2501.13341 | link |
2025-01-23 | Generative Multi-Form Bayesian Optimization | Zhendong Guo et.al. | 2501.13337 | null |
2025-01-23 | SplitLLM: Hierarchical Split Learning for Large Language Model over Wireless Network | Songge Zhang et.al. | 2501.13318 | null |
2025-01-23 | Representing Visualization Insights as a Dense Insight Network | Jane Hoffswell et.al. | 2501.13309 | null |
2025-01-23 | OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia | Xuelong Geng et.al. | 2501.13306 | link |
2025-01-23 | Watching the AI Watchdogs: A Fairness and Robustness Analysis of AI Safety Moderation Classifiers | Akshit Achara et.al. | 2501.13302 | link |
2025-01-23 | Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents | Shrinidhi Kumbhar et.al. | 2501.13299 | null |
2025-01-23 | RAMQA: A Unified Framework for Retrieval-Augmented Multi-Modal Question Answering | Yang Bai et.al. | 2501.13297 | link |
2025-01-23 | Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols | John Joon Young Chung et.al. | 2501.13284 | null |
2025-01-22 | MEDFORM: A Foundation Model for Contrastive Learning of CT Imaging and Clinical Numeric Data in Multi-Cancer Analysis | Daeun Jung et.al. | 2501.13277 | link |
2025-01-22 | RAG-Reward: Optimizing RAG with Reward Modeling and RLHF | Hanning Zhang et.al. | 2501.13264 | null |
2025-01-22 | Exploring GPT's Ability as a Judge in Music Understanding | Kun Fang et.al. | 2501.13261 | link |
2025-01-22 | Bypassing Array Canaries via Autonomous Function Call Resolution | Nathaniel Oh et.al. | 2501.13256 | link |
2025-01-22 | S-LoRA: Scalable Low-Rank Adaptation for Class Incremental Learning | Yichen Wu et.al. | 2501.13198 | null |
2025-01-22 | Computational modelling of biological systems now and then: revisiting tools and visions from the beginning of the century | Axel Loewe et.al. | 2501.13142 | null |
2025-01-23 | VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding | Boqiang Zhang et.al. | 2501.13106 | link |
2025-01-22 | Robust Representation Consistency Model via Contrastive Denoising | Jiachen Lei et.al. | 2501.13094 | link |
2025-01-22 | Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment | Melissa Kazemi Rad et.al. | 2501.13080 | null |
2025-01-22 | Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning | Bohao Yang et.al. | 2501.13042 | link |
2025-01-22 | Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament | Yantao Liu et.al. | 2501.13007 | link |
2025-01-22 | Neural network enhanced cross entropy benchmark for monitored circuits | Yangrui Hu et.al. | 2501.13005 | null |
2025-01-22 | Large Language Model-Based Semantic Communication System for Image Transmission | Soheyb Ribouh et.al. | 2501.12988 | null |
2025-01-22 | LLM4WM: Adapting LLM for Wireless Multi-Tasking | Xuanyu Liu et.al. | 2501.12983 | null |
2025-01-22 | Low-dimensional adaptation of diffusion models: Convergence in total variation | Jiadong Liang et.al. | 2501.12982 | null |
2025-01-22 | OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models | Chongren Sun et.al. | 2501.12975 | link |
2025-01-22 | Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs | Jan Corazza et.al. | 2501.12972 | null |
2025-01-22 | It's complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act | Kristof Meding et.al. | 2501.12962 | null |
2025-01-22 | Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference | Weizhi Fei et.al. | 2501.12959 | null |
2025-01-22 | GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models | Pengxiang Zhao et.al. | 2501.12956 | null |
2025-01-22 | 3D Object Manipulation in a Single Image using Generative Models | Ruisi Zhao et.al. | 2501.12935 | null |
2025-01-22 | Correctness Assessment of Code Generated by Large Language Models Using Internal Representations | Tuan-Dung Bui et.al. | 2501.12934 | null |
2025-01-22 | DynamicEarth: How Far are We from Open-Vocabulary Change Detection? | Kaiyu Li et.al. | 2501.12931 | null |
2025-01-22 | A Functional Software Reference Architecture for LLM-Integrated Systems | Alessio Bucaioni et.al. | 2501.12904 | null |
2025-01-22 | Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration | Offa Kingsleigh et.al. | 2501.12901 | null |
2025-01-22 | Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback | Yafu Li et.al. | 2501.12895 | link |
2025-01-23 | Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program | Carlton Shepherd et.al. | 2501.12883 | null |
2025-01-22 | WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge | Jingyuan Chen et.al. | 2501.12877 | null |
2025-01-22 | ACEBench: Who Wins the Match Point in Tool Learning? | Chen Chen et.al. | 2501.12851 | null |
2025-01-22 | AMM-Diff: Adaptive Multi-Modality Diffusion Network for Missing Modality Imputation | Aghiles Kebaili et.al. | 2501.12840 | null |
2025-01-22 | Adaptive Retrieval Without Self-Knowledge? Bringing Uncertainty Back Home | Viktor Moskvoretskii et.al. | 2501.12835 | null |
2025-01-22 | Open or Closed LLM for Lesser-Resourced Languages? Lessons from Greek | John Pavlopoulos et.al. | 2501.12826 | link |
2025-01-22 | Enhancing Monocular Depth Estimation with Multi-Source Auxiliary Tasks | Alessio Quercia et.al. | 2501.12824 | null |
2025-01-22 | Certified Guidance for Planning with Deep Generative Models | Francesco Giacomarra et.al. | 2501.12815 | null |
2025-01-22 | Revisit Self-Debugging with Self-Generated Tests for Code Generation | Xiancai Chen et.al. | 2501.12793 | null |
2025-01-22 | LLMs as Repositories of Factual Knowledge: Limitations and Solutions | Seyed Mahed Mousavi et.al. | 2501.12774 | null |
2025-01-22 | NExtLong: Toward Effective Long-Context Training without Long Documents | Chaochen Gao et.al. | 2501.12766 | link |
2025-01-22 | Online Preference Alignment for Language Models via Count-based Exploration | Chenjia Bai et.al. | 2501.12735 | link |
2025-01-22 | Paradigm-Based Automatic HDL Code Generation Using LLMs | Wenhao Sun et.al. | 2501.12702 | null |
2025-01-22 | Training Dialogue Systems by AI Feedback for Improving Overall Dialogue Impression | Kai Yoshida et.al. | 2501.12698 | null |
2025-01-22 | Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering | Qian Tao et.al. | 2501.12697 | null |
2025-01-22 | SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling | Shengshi Yao et.al. | 2501.12696 | null |
2025-01-22 | EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation | Yifan Yu et.al. | 2501.12689 | null |
2025-01-22 | Distillation Quantification for Large Language Models | Sunbowen Lee et.al. | 2501.12619 | link |
2025-01-22 | Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We? | Taiming Wang et.al. | 2501.12617 | null |
2025-01-22 | Kimi k1.5: Scaling Reinforcement Learning with LLMs | Kimi Team et.al. | 2501.12599 | null |
2025-01-22 | Leveraging LLMs to Create a Haptic Devices' Recommendation System | Yang Liu et.al. | 2501.12573 | null |
2025-01-22 | Understanding the LLM-ification of CHI: Unpacking the Impact of LLMs at CHI through a Systematic Literature Review | Rock Yuren Pang et.al. | 2501.12557 | link |
2025-01-21 | Human-like conceptual representations emerge from language prediction | Ningyu Xu et.al. | 2501.12547 | null |
2025-01-21 | How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models? | Mirali Purohit et.al. | 2501.12535 | null |
2025-01-21 | An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts | Dhia Elhaq Rzig et.al. | 2501.12521 | null |
2025-01-21 | A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data | Minh Tran et.al. | 2501.12501 | null |
2025-01-21 | The Journey Matters: Average Parameter Count over Pre-training Unifies Sparse and Dense Scaling Laws | Tian Jin et.al. | 2501.12486 | null |
2025-01-21 | An Empirical Characterization of Outages and Incidents in Public Services for Large Language Models | Xiaoyu Chu et.al. | 2501.12469 | link |
2025-01-21 | Adaptive PII Mitigation Framework for Large Language Models | Shubhi Asthana et.al. | 2501.12465 | null |
2025-01-21 | Empowering AIOps: Leveraging Large Language Models for IT Operations Management | Arthur Vitui et.al. | 2501.12461 | link |
2025-01-21 | Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications | Shubhi Asthana et.al. | 2501.12456 | null |
2025-01-21 | Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation | Dongsheng Zhu et.al. | 2501.12432 | null |
2025-01-21 | FREYR: A Framework for Recognizing and Executing Your Requests | Roberto Gallotta et.al. | 2501.12423 | link |
2025-01-21 | CroMe: Multimodal Fake News Detection using Cross-Modal Tri-Transformer and Metric Learning | Eunjee Choi et.al. | 2501.12422 | null |
2025-01-22 | InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling | Yi Wang et.al. | 2501.12386 | link |
2025-01-21 | Accelerating Pulsar Parameter Estimation Using Convolutional Neural Networks | Greg Olmschenk et.al. | 2501.12383 | null |
2025-01-21 | MMVU: Measuring Expert-Level Multi-Discipline Video Understanding | Yilun Zhao et.al. | 2501.12380 | link |
2025-01-22 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | null |
2025-01-21 | Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists | Thomas F. Eisenmann et.al. | 2501.12374 | link |
2025-01-21 | Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL | Yeounoh Chung et.al. | 2501.12372 | null |
2025-01-21 | Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration | Thomas Walshe et.al. | 2501.12332 | null |
2025-01-21 | Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops | Mohamed Harmanani et.al. | 2501.12331 | link |
2025-01-21 | VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Xianwei Zhuang et.al. | 2501.12327 | link |
2025-01-21 | LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations | Hasan Abu-Rasheed et.al. | 2501.12300 | null |
2025-01-21 | MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks | Qishen Zhou et.al. | 2501.12281 | link |
2025-01-21 | Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Maosong Cao et.al. | 2501.12273 | link |
2025-01-21 | FOCUS: First Order Concentrated Updating Scheme | Yizhou Liu et.al. | 2501.12243 | null |
2025-01-21 | InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models | Pha Nguyen et.al. | 2501.12231 | null |
2025-01-21 | CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning | Yuanheng Fang et.al. | 2501.12226 | null |
2025-01-21 | Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces | Allard Oelen et.al. | 2501.12221 | null |
2025-01-21 | You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense | Wuyuao Mai et.al. | 2501.12210 | null |
2025-01-21 | Explainability for Vision Foundation Models: A Survey | Rémi Kazmierczak et.al. | 2501.12203 | null |
2025-01-22 | Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation | Zibo Zhao et.al. | 2501.12202 | link |
2025-01-21 | BiMarker: Enhancing Text Watermark Detection for Large Language Models with Bipolar Watermarks | Zhuang Li et.al. | 2501.12174 | null |
2025-01-21 | Contextualizing Recommendation Explanations with LLMs: A User Study | Yuanjun Feng et.al. | 2501.12152 | null |
2025-01-21 | Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities | Qirun Dai et.al. | 2501.12147 | null |
2025-01-21 | Do LLMs Provide Links to Code Similar to what they Generate? A Study with Gemini and Bing CoPilot | Daniele Bifolco et.al. | 2501.12134 | null |
2025-01-21 | Evaluating Efficiency and Engagement in Scripted and LLM-Enhanced Human-Robot Interactions | Tim Schreiter et.al. | 2501.12128 | null |
2025-01-21 | Can open source large language models be used for tumor documentation in Germany? -- An evaluation on urological doctors' notes | Stefan Lenz et.al. | 2501.12106 | link |
2025-01-21 | Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis | Weile Luo et.al. | 2501.12084 | null |
2025-01-21 | Phishing Awareness via Game-Based Learning | Argianto Rahartomo et.al. | 2501.12077 | link |
2025-01-21 | PINNsAgent: Automated PDE Surrogation with Large Language Models | Qingpo Wuwu et.al. | 2501.12053 | null |
2025-01-21 | Harnessing Generative Pre-Trained Transformer for Datacenter Packet Trace Generation | Chen Griner et.al. | 2501.12033 | null |
2025-01-21 | Comparative Analysis of Pre-trained Deep Learning Models and DINOv2 for Cushing's Syndrome Diagnosis in Facial Analysis | Hongjun Liu et.al. | 2501.12023 | null |
2025-01-21 | Are Traditional Deep Learning Model Approaches as Effective as a Retinal-Specific Foundation Model for Ocular and Systemic Disease Detection? | Samantha Min Er Yew et.al. | 2501.12016 | null |
2025-01-21 | Rate-Aware Learned Speech Compression | Jun Xu et.al. | 2501.11999 | null |
2025-01-21 | Linear Feedback Control Systems for Iterative Prompt Optimization in Large Language Models | Rupesh Raj Karn et.al. | 2501.11979 | null |
2025-01-21 | Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues | Maya Medjad et.al. | 2501.11977 | link |
2025-01-21 | Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jie Zhao et.al. | 2501.11968 | null |
2025-01-21 | A Hybrid Attention Framework for Fake News Detection with Large Language Models | Xiaochuan Xu et.al. | 2501.11967 | null |
2025-01-21 | TAD-Bench: A Comprehensive Benchmark for Embedding-Based Text Anomaly Detection | Yang Cao et.al. | 2501.11960 | null |
2025-01-21 | Proverbs Run in Pairs: Evaluating Proverb Translation Capability of Large Language Model | Minghan Wang et.al. | 2501.11953 | null |
2025-01-21 | ALoFTRAG: Automatic Local Fine Tuning for Retrieval Augmented Generation | Peter Devine et.al. | 2501.11929 | link |
2025-01-21 | Integrate Temporal Graph Learning into LLM-based Temporal Knowledge Graph Model | He Chang et.al. | 2501.11911 | null |
2025-01-21 | Panoramic Interests: Stylistic-Content Aware Personalized Headline Generation | Junhong Lian et.al. | 2501.11900 | link |
2025-01-22 | Med-R | Keer Lu et.al. | 2501.11885 | null |
2025-01-21 | From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning | Yafu Li et.al. | 2501.11877 | link |
2025-01-21 | LLM-Agents Driven Automated Simulation Testing and Analysis of small Uncrewed Aerial Systems | Venkata Sai Aswath Duvvuru et.al. | 2501.11864 | null |
2025-01-21 | EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents | Zhili Cheng et.al. | 2501.11858 | link |
2025-01-21 | Network-informed Prompt Engineering against Organized Astroturf Campaigns under Extreme Class Imbalance | Nikos Kanakaris et.al. | 2501.11849 | link |
2025-01-21 | A Survey on Memory-Efficient Large-Scale Model Training in AI for Science | Kaiyuan Tian et.al. | 2501.11847 | null |
2025-01-21 | Large Language Models with Human-In-The-Loop Validation for Systematic Review Data Extraction | Noah L. Schroeder et.al. | 2501.11840 | null |
2025-01-21 | PXGen: A Post-hoc Explainable Method for Generative Models | Yen-Lung Huang et.al. | 2501.11827 | null |
2025-01-21 | CogMorph: Cognitive Morphing Attacks for Text-to-Image Models | Zonglei Jing et.al. | 2501.11815 | null |
2025-01-20 | Benchmarking Large Language Models via Random Variables | Zijin Hong et.al. | 2501.11790 | null |
2025-01-20 | Synthetic Data Can Mislead Evaluations: Membership Inference as Machine Text Detection | Ali Naseh et.al. | 2501.11786 | null |
2025-01-20 | Glinthawk: A Two-Tiered Architecture for High-Throughput LLM Inference | Pouya Hamadanian et.al. | 2501.11779 | link |
2025-01-20 | The Value of Nothing: Multimodal Extraction of Human Values Expressed by TikTok Influencers | Alina Starovolsky-Shitrit et.al. | 2501.11770 | null |
2025-01-20 | Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems | Fatemeh Nazary et.al. | 2501.11759 | link |
2025-01-20 | A generalizable 3D framework and model for self-supervised learning in medical imaging | Tony Xu et.al. | 2501.11755 | null |
2025-01-20 | Are generative models fair? A study of racial bias in dermatological image generation | Miguel López-Pérez et.al. | 2501.11752 | null |
2025-01-20 | Optimizing Pretraining Data Mixtures with LLM-Estimated Utility | William Held et.al. | 2501.11747 | null |
2025-01-20 | MedicoSAM: Towards foundation models for medical image segmentation | Anwai Archit et.al. | 2501.11734 | link |
2025-01-20 | Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks | Zhenhailong Wang et.al. | 2501.11733 | null |
2025-01-20 | Explain-Query-Test: Self-Evaluating LLMs Via Explanation and Comprehension Discrepancy | Saeid Asgari Taghanaki et.al. | 2501.11721 | link |
2025-01-20 | YouLeQD: Decoding the Cognitive Complexity of Questions and Engagement in Online Educational Videos from Learners' Perspectives | Nong Ming et.al. | 2501.11712 | link |
2025-01-20 | Towards Detecting Prompt Knowledge Gaps for Improved LLM-guided Issue Resolution | Ramtin Ehsani et.al. | 2501.11709 | null |
2025-01-20 | Trustformer: A Trusted Federated Transformer | Ali Abbasi Tadi et.al. | 2501.11706 | null |
2025-01-20 | Human services organizations and the responsible integration of AI: Considering ethics and contextualizing risk(s) | Brian E. Perron et.al. | 2501.11705 | null |
2025-01-20 | Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling | Zhenyu Hou et.al. | 2501.11651 | link |
2025-01-20 | Trojan Detection Through Pattern Recognition for Large Language Models | Vedant Bhasin et.al. | 2501.11621 | null |
2025-01-20 | Conversation Routines: A Prompt Engineering Framework for Task-Oriented Dialog Systems | Giorgio Robino et.al. | 2501.11613 | null |
2025-01-20 | SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks | Wentao Wan et.al. | 2501.11599 | link |
2025-01-20 | Recurrent Diffusion for Large-Scale Parameter Generation | Kai Wang et.al. | 2501.11587 | link |
2025-01-20 | Open Sourcing GPTs: Economics of Open Sourcing Advanced AI Models | Mahyar Habibi et.al. | 2501.11581 | null |
2025-01-20 | Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution | Zhiyuan You et.al. | 2501.11561 | null |
2025-01-20 | PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation | Jinyu Wang et.al. | 2501.11551 | link |
2025-01-20 | UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion | Zixuan Chen et.al. | 2501.11515 | null |
2025-01-20 | Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges | Vincent Koc et.al. | 2501.11496 | null |
2025-01-20 | Graph-defined Language Learning with LLMs | Huachi Zhou et.al. | 2501.11478 | null |
2025-01-20 | Curiosity-Driven Reinforcement Learning from Human Feedback | Haoran Sun et.al. | 2501.11463 | link |
2025-01-20 | Ontology Matching with Large Language Models and Prioritized Depth-First Search | Maria Taboada et.al. | 2501.11441 | null |
2025-01-20 | One Does Not Simply Meme Alone: Evaluating Co-Creativity Between LLMs and Humans in the Generation of Humor | Zhikun Wu et.al. | 2501.11433 | null |
2025-01-20 | A Survey on Diffusion Models for Anomaly Detection | Jing Liu et.al. | 2501.11430 | link |
2025-01-20 | Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training | Siyu Yuan et.al. | 2501.11425 | link |
2025-01-20 | Neural Contextual Reinforcement Framework for Logical Structure Language Generation | Marcus Irvin et.al. | 2501.11417 | null |
2025-01-20 | Beyond the Hype: Benchmarking LLM-Evolved Heuristics for Bin Packing | Kevin Sim et.al. | 2501.11411 | null |
2025-01-20 | Revisiting Language Models in Neural News Recommender Systems | Yuyue Zhao et.al. | 2501.11391 | link |
2025-01-20 | Towards Advancing Code Generation with Large Language Models: A Research Roadmap | Haolin Jin et.al. | 2501.11354 | null |
2025-01-20 | EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Guankun Wang et.al. | 2501.11347 | link |
2025-01-20 | GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Zhenliang Ni et.al. | 2501.11340 | null |
2025-01-20 | Few-shot Policy (de)composition in Conversational Question Answering | Kyle Erwin et.al. | 2501.11335 | null |
2025-01-20 | Nested Annealed Training Scheme for Generative Adversarial Networks | Chang Wan et.al. | 2501.11318 | null |
2025-01-20 | Advancing Multi-Party Dialogue Systems with Speaker-ware Contrastive Learning | Zhongtian Hu et.al. | 2501.11292 | null |
2025-01-20 | Large Language Model Agents for Radio Map Generation and Wireless Network Planning | Hongye Quan et.al. | 2501.11283 | null |
2025-01-20 | Multi-round, Chain-of-thought Post-editing for Unfaithful Summaries | Yi-Hui Lee et.al. | 2501.11273 | null |
2025-01-20 | Can xLLMs Understand the Structure of Dialog? Exploring Multilingual Response Generation in Complex Scenarios | Zhongtian Hu et.al. | 2501.11269 | null |
2025-01-20 | Code Readability in the Age of Large Language Models: An Industrial Case Study from Atlassian | Wannita Takerngsaksiri et.al. | 2501.11264 | link |
2025-01-20 | Multivariate Wireless Link Quality Prediction Based on Pre-trained Large Language Models | Zhuangzhuang Yan et.al. | 2501.11247 | null |
2025-01-20 | Irony in Emojis: A Comparative Study of Human and LLM Interpretation | Yawen Zheng et.al. | 2501.11241 | null |
2025-01-20 | KPL: Training-Free Medical Knowledge Mining of Vision-Language Models | Jiaxiang Liu et.al. | 2501.11231 | link |
2025-01-20 | Reasoning Language Models: A Blueprint | Maciej Besta et.al. | 2501.11223 | link |
2025-01-20 | Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation | Ivan Lopez et.al. | 2501.11199 | null |
2025-01-19 | Conditional Feature Importance with Generative Modeling Using Adversarial Random Forests | Kristin Blesch et.al. | 2501.11178 | link |
2025-01-17 | FaceXBench: Evaluating Multimodal LLMs on Face Understanding | Kartik Narayan et.al. | 2501.10360 | link |
2025-01-17 | Zero-Shot Monocular Scene Flow Estimation in the Wild | Yiqing Liang et.al. | 2501.10357 | null |
2025-01-17 | Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems | Weibo Gao et.al. | 2501.10332 | null |
2025-01-17 | Large language models for automated scholarly paper review: A survey | Zhenzhen Zhuang et.al. | 2501.10326 | null |
2025-01-17 | HiMix: Reducing Computational Complexity in Large Vision-Language Models | Xuange Zhang et.al. | 2501.10318 | null |
2025-01-17 | Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs | Claudio Di Sipio et.al. | 2501.10313 | null |
2025-01-17 | Computational Protein Science in the Era of Large Language Models (LLMs) | Wenqi Fan et.al. | 2501.10282 | null |
2025-01-17 | Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation | Azat Abdullin et.al. | 2501.10200 | null |
2025-01-17 | Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education | William Hersh et.al. | 2501.10186 | null |
2025-01-17 | Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval | Vera Pavlova et.al. | 2501.10175 | null |
2025-01-17 | Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis | Abhishek Kaushik et.al. | 2501.10134 | null |
2025-01-17 | ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario | Lucen Zhong et.al. | 2501.10132 | link |
2025-01-17 | PaSa: An LLM Agent for Comprehensive Academic Paper Search | Yichen He et.al. | 2501.10120 | link |
2025-01-17 | AI-Generated Music Detection and its Challenges | Darius Afchar et.al. | 2501.10111 | link |
2025-01-17 | LLM Reasoner and Automated Planner: A new NPC approach | Israel Puerta-Merino et.al. | 2501.10106 | null |
2025-01-17 | Universal Actions for Enhanced Embodied Foundation Models | Jinliang Zheng et.al. | 2501.10105 | link |
2025-01-17 | Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Michael Schwingshackl et.al. | 2501.10080 | link |
2025-01-17 | FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Zhaopeng Gu et.al. | 2501.10067 | link |
2025-01-17 | Accelerating Large Language Models through Partially Linear Feed-Forward Network | Gansen Hu et.al. | 2501.10054 | null |
2025-01-17 | AirRAG: Activating Intrinsic Reasoning for Retrieval Augmented Generation via Tree-based Search | Wenfeng Feng et.al. | 2501.10053 | null |
2025-01-17 | Exploring Code Comprehension in Scientific Programming: Preliminary Insights from Research Scientists | Alyssia Chen et.al. | 2501.10037 | null |
2025-01-17 | Mapping scientific communities at scale | Victor Barbier et.al. | 2501.10035 | link |
2025-01-17 | Mitigating Hallucinations on Object Attributes using Multiview Images and Negative Instructions | Zhijie Tan et.al. | 2501.10011 | null |
2025-01-17 | Attention-guided Self-reflection for Zero-shot Hallucination Detection in Large Language Models | Qiang Liu et.al. | 2501.09997 | null |
2025-01-17 | Agent-as-Judge for Factual Summarization of Long Narratives | Yeonseok Jeong et.al. | 2501.09993 | link |
2025-01-17 | RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation | Yuefan Cao et.al. | 2501.09982 | null |
2025-01-17 | GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions | Heda Zuo et.al. | 2501.09972 | null |
2025-01-17 | Explainable artificial intelligence (XAI): from inherent explainability to large language models | Fuseini Mumuni et.al. | 2501.09967 | null |
2025-01-17 | A Survey on Multi-Turn Interaction Capabilities of Large Language Models | Chen Zhang et.al. | 2501.09959 | null |
2025-01-17 | FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs | Zengyi Gao et.al. | 2501.09957 | null |
2025-01-17 | AIRCHITECT v2: Learning the Hardware Accelerator Design Space through Unified Representations | Jamin Seo et.al. | 2501.09954 | link |
2025-01-17 | Sympathy over Polarization: A Computational Discourse Analysis of Social Media Posts about the July 2024 Trump Assassination Attempt | Qingcheng Zeng et.al. | 2501.09950 | null |
2025-01-17 | MultiPruner: Balanced Structure Removal in Foundation Models | J. Pablo Muñoz et.al. | 2501.09949 | link |
2025-01-17 | Steering Large Language Models with Feature Guided Activation Additions | Samuel Soo et.al. | 2501.09929 | null |
2025-01-17 | Towards A Litmus Test for Common Sense | Hugo Latapie et.al. | 2501.09913 | null |
2025-01-17 | Demo: Interactive Visualization of Semantic Relationships in a Biomedical Project's Talent Knowledge Graph | Jiawei Xu et.al. | 2501.09909 | null |
2025-01-17 | Position: Open and Closed Large Language Models in Healthcare | Jiawei Xu et.al. | 2501.09906 | null |
2025-01-17 | FoundationStereo: Zero-Shot Stereo Matching | Bowen Wen et.al. | 2501.09898 | null |
2025-01-17 | Evolving Deeper LLM Thinking | Kuang-Huei Lee et.al. | 2501.09891 | null |
2025-01-17 | Understanding the Effectiveness of LLMs in Automated Self-Admitted Technical Debt Repayment | Mohammad Sadegh Sheikhaei et.al. | 2501.09888 | link |
2025-01-17 | FLORA: Formal Language Model Enables Robust Training-free Zero-shot Object Referring Analysis | Zhe Chen et.al. | 2501.09887 | null |
2025-01-16 | ASTRA: A Scene-aware TRAnsformer-based model for trajectory prediction | Izzeddin Teeti et.al. | 2501.09878 | null |
2025-01-16 | Geometry-Preserving Encoder/Decoder in Latent Generative Models | Wonjun Lee et.al. | 2501.09876 | null |
2025-01-16 | An LLM-Guided Tutoring System for Social Skills Training | Michael Guevarra et.al. | 2501.09870 | null |
2025-01-16 | Fine-grained Testing for Autonomous Driving Software: a Study on Autoware with LLM-driven Unit Testing | Wenhan Wang et.al. | 2501.09866 | null |
2025-01-16 | Optimization is Better than Generation: Optimizing Commit Message Leveraging Human-written Commit Message | Jiawei Li et.al. | 2501.09861 | null |
2025-01-16 | PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery | Shristi Das Biswas et.al. | 2501.09826 | link |
2025-01-16 | Bridging Language Barriers in Healthcare: A Study on Arabic LLMs | Nada Saadi et.al. | 2501.09825 | null |
2025-01-16 | BN-Pool: a Bayesian Nonparametric Approach to Graph Pooling | Daniele Castellana et.al. | 2501.09821 | link |
2025-01-16 | Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems | Soham Roy et.al. | 2501.09801 | null |
2025-01-16 | Computing Optimization-Based Prompt Injections Against Closed-Weights Models By Misusing a Fine-Tuning API | Andrey Labunets et.al. | 2501.09798 | null |
2025-01-16 | GeoManip: Geometric Constraints as General Interfaces for Robot Manipulation | Weiliang Tang et.al. | 2501.09783 | null |
2025-01-16 | SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation | Wanqi Yin et.al. | 2501.09782 | link |
2025-01-16 | VideoWorld: Exploring Knowledge Learning from Unlabeled Videos | Zhongwei Ren et.al. | 2501.09781 | null |
2025-01-16 | Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong | Tairan Fu et.al. | 2501.09775 | null |
2025-01-16 | Distilling Multi-modal Large Language Models for Autonomous Driving | Deepti Hegde et.al. | 2501.09757 | null |
2025-01-16 | Learnings from Scaling Visual Tokenizers for Reconstruction and Generation | Philippe Hansen-Estruch et.al. | 2501.09755 | null |
2025-01-16 | Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues | Youngjoon Jang et.al. | 2501.09754 | null |
2025-01-16 | OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking | Zekun Xi et.al. | 2501.09751 | null |
2025-01-16 | Enhancing Lexicon-Based Text Embeddings with Large Language Models | Yibin Lei et.al. | 2501.09749 | null |
2025-01-16 | Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models | Bihui Jin et.al. | 2501.09745 | null |
2025-01-16 | KU AIGEN ICL EDI@BC8 Track 3: Advancing Phenotype Named Entity Recognition and Normalization for Dysmorphology Physical Examination Reports | Hajung Kim et.al. | 2501.09744 | null |
2025-01-16 | Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps | Nanye Ma et.al. | 2501.09732 | null |
2025-01-16 | A Simple Aerial Detection Baseline of Multimodal Language Models | Qingyun Li et.al. | 2501.09720 | link |
2025-01-16 | Comparative Insights from 12 Machine Learning Models in Extracting Economic Ideology from Political Text | Jihed Ncib et.al. | 2501.09719 | null |
2025-01-16 | CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education | Tianyu Wang et.al. | 2501.09709 | link |
2025-01-16 | Domain Adaptation of Foundation LLMs for e-Commerce | Christian Herold et.al. | 2501.09706 | null |
2025-01-16 | Cueless EEG imagined speech for subject identification: dataset and benchmarks | Ali Derakhshesh et.al. | 2501.09700 | link |
2025-01-16 | Simulated Interactive Debugging | Yannic Noller et.al. | 2501.09694 | null |
2025-01-17 | Towards Large Reasoning Models: A Survey on Scaling LLM Reasoning Capabilities | Fengli Xu et.al. | 2501.09686 | null |
2025-01-16 | Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review | Masatoshi Uehara et.al. | 2501.09685 | null |
2025-01-16 | Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark | Alexis Roger et.al. | 2501.09672 | null |
2025-01-16 | A Survey of Research in Large Language Models for Electronic Design Automation | Jingyu Pan et.al. | 2501.09655 | null |
2025-01-16 | The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models | Jonathan Katzy et.al. | 2501.09653 | null |
2025-01-16 | CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding | Johannes Kirmayr et.al. | 2501.09645 | link |
2025-01-17 | LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading | Kuan-Ming Liu et.al. | 2501.09636 | null |
2025-01-16 | Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework | Yushen Lin et.al. | 2501.09631 | null |
2025-01-16 | Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Chaoqi Wang et.al. | 2501.09620 | link |
2025-01-16 | From Scarcity to Capability: Empowering Fake News Detection in Low-Resource Languages with LLMs | Hrithik Majumdar Shibu et.al. | 2501.09604 | link |
2025-01-16 | Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures | Pratyush Dhingra et.al. | 2501.09588 | null |
2025-01-16 | Text-driven Adaptation of Foundation Models for Few-shot Surgical Workflow Analysis | Tingxuan Chen et.al. | 2501.09555 | null |
2025-01-16 | AI in Support of Diversity and Inclusion | Çiçek Güven et.al. | 2501.09534 | null |
2025-01-16 | Confidence Estimation for Error Detection in Text-to-SQL Systems | Oleg Somov et.al. | 2501.09527 | null |
2025-01-16 | Augmenting a Large Language Model with a Combination of Text and Visual Data for Conversational Visualization of Global Geospatial Data | Omar Mena et.al. | 2501.09521 | null |
2025-01-16 | AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation | Junjie He et.al. | 2501.09503 | null |
2025-01-16 | Omni-Emotion: Extending Video MLLM with Detailed Face and Audio Modeling for Multimodal Emotion Analysis | Qize Yang et.al. | 2501.09502 | null |
2025-01-16 | Evaluating Conversational Recommender Systems with Large Language Models: A User-Centric Evaluation Framework | Nuo Chen et.al. | 2501.09493 | null |
2025-01-16 | Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Zhaocheng Liu et.al. | 2501.09484 | link |
2025-01-16 | Guided Debugging of Auto-Translated Code Using Differential Testing | Shengnan Wu et.al. | 2501.09475 | null |
2025-01-16 | DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Hualie Jiang et.al. | 2501.09466 | link |
2025-01-16 | Pruning for Sparse Diffusion Models based on Gradient Flow | Ben Wan et.al. | 2501.09464 | null |
2025-01-16 | "A Great Start, But...": Evaluating LLM-Generated Mind Maps for Information Mapping in Video-Based Design | Tianhao He et.al. | 2501.09457 | null |
2025-01-16 | Solving the unsolvable: Translating case law in Hong Kong | King-kui Sin et.al. | 2501.09444 | null |
2025-01-16 | Scaling up self-supervised learning for improved surgical foundation models | Tim J. M. Jaspers et.al. | 2501.09436 | link |
2025-01-16 | CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Hwan Heo et.al. | 2501.09433 | link |
2025-01-16 | A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy | Huandong Wang et.al. | 2501.09431 | null |
2025-01-16 | AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring | Xinyi Wang et.al. | 2501.09428 | null |
2025-01-16 | AutoCBT: An Autonomous Multi-agent Framework for Cognitive Behavioral Therapy in Psychological Counseling | Ancheng Xu et.al. | 2501.09426 | null |
2025-01-16 | FASP: Fast and Accurate Structured Pruning of Large Language Models | Hanyu Hu et.al. | 2501.09412 | null |
2025-01-16 | MoE | Lyudong Jin et.al. | 2501.09410 | null |
2025-01-16 | Adaptive Contextual Caching for Mobile Edge Large Language Model Service | Guangyuan Liu et.al. | 2501.09383 | null |
2025-01-16 | Aligning Instruction Tuning with Pre-training | Yiming Liang et.al. | 2501.09368 | null |
2025-01-16 | PICE: A Semantic-Driven Progressive Inference System for LLM Serving in Cloud-Edge Networks | Huiyou Zhan et.al. | 2501.09367 | null |
2025-01-16 | YETI (YET to Intervene) Proactive Interventions by Multimodal AI Agents in Augmented Reality Tasks | Saptarashmi Bandyopadhyay et.al. | 2501.09355 | null |
2025-01-16 | UVRM: A Scalable 3D Reconstruction Model from Unposed Videos | Shiu-hong Kao et.al. | 2501.09347 | null |
2025-01-16 | Rational Tuning of LLM Cascades via Probabilistic Modeling | Michael J. Zellinger et.al. | 2501.09345 | null |
2025-01-16 | SOP-Agent: Empower General Purpose AI Agent with Domain-Specific SOPs | Anbang Ye et.al. | 2501.09316 | null |
2025-01-16 | A Study of In-Context-Learning-Based Text-to-SQL Errors | Jiawei Shen et.al. | 2501.09310 | link |
2025-01-16 | To Retrieve or Not to Retrieve? Uncertainty Detection for Dynamic Retrieval Augmented Generation | Kaustubh D. Dhole et.al. | 2501.09292 | null |
2025-01-16 | LAVCap: LLM-based Audio-Visual Captioning using Optimal Transport | Kyeongha Rho et.al. | 2501.09291 | link |
2025-01-16 | Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding | Kohei Torimi et.al. | 2501.09278 | null |
2025-01-16 | Large Language Model is Secretly a Protein Sequence Optimizer | Yinkai Wang et.al. | 2501.09274 | null |
2025-01-16 | Perspective Transition of Large Language Models for Solving Subjective Tasks | Xiaolong Wang et.al. | 2501.09265 | null |
2025-01-16 | Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition | Takaaki Hori et.al. | 2501.09258 | null |
2025-01-16 | Clone-Robust AI Alignment | Ariel D. Procaccia et.al. | 2501.09254 | null |
2025-01-16 | Split Fine-Tuning for Large Language Models in Wireless Networks | Songge Zhang et.al. | 2501.09237 | null |
2025-01-16 | Foundations of Large Language Models | Tong Xiao et.al. | 2501.09223 | null |
2025-01-16 | Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs | Sanchit Sinha et.al. | 2501.09221 | null |
2025-01-16 | A Simple Graph Contrastive Learning Framework for Short Text Classification | Yonghao Liu et.al. | 2501.09219 | link |
2025-01-16 | Interpretable Droplet Digital PCR Assay for Trustworthy Molecular Diagnostics | Yuanyuan Wei et.al. | 2501.09218 | null |
2025-01-16 | Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning | Yonghao Liu et.al. | 2501.09214 | link |
2025-01-16 | FineMedLM-o1: Enhancing the Medical Reasoning Ability of LLM from Supervised Fine-Tuning to Test-Time Training | Hongzhou Yu et.al. | 2501.09213 | link |
2025-01-15 | Unified Few-shot Crack Segmentation and its Precise 3D Automatic Measurement in Concrete Structures | Pengru Deng et.al. | 2501.09203 | null |
2025-01-15 | Towards Semantics Lifting for Scientific Computing: A Case Study on FFT | Naifeng Zhang et.al. | 2501.09201 | null |
2025-01-15 | Guiding Retrieval using LLM-based Listwise Rankers | Mandeep Rathee et.al. | 2501.09186 | link |
2025-01-15 | The Veln(ia)s is in the Details: Evaluating LLM Judgment on Latvian and Lithuanian Short Answer Matching | Yevhen Kostiuk et.al. | 2501.09164 | null |
2025-01-15 | Evaluating GenAI for Simplifying Texts for Education: Improving Accuracy and Consistency for Enhanced Readability | Stephanie L. Day et.al. | 2501.09158 | null |
2025-01-15 | Towards Multilingual LLM Evaluation for Baltic and Nordic languages: A study on Lithuanian History | Yevhen Kostiuk et.al. | 2501.09154 | null |
2025-01-15 | Few-Shot Adaptation of Training-Free Foundation Model for 3D Medical Image Segmentation | Xingxin He et.al. | 2501.09138 | null |
2025-01-15 | Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG | Aditi Singh et.al. | 2501.09136 | link |
2025-01-15 | HAFix: History-Augmented Large Language Models for Bug Fixing | Yu Shi et.al. | 2501.09135 | link |
2025-01-15 | Multilingual LLMs Struggle to Link Orthography and Semantics in Bilingual Word Processing | Eshaan Tanwar et.al. | 2501.09127 | link |
2025-01-15 | Augmenting Human-Annotated Training Data with Large Language Model Generation and Distillation in Open-Response Assessment | Conrad Borchers et.al. | 2501.09126 | null |
2025-01-15 | Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration Approach | Alireza Ghaffari et.al. | 2501.09107 | null |
2025-01-15 | Tracking the Takes and Trajectories of English-Language News Narratives across Trustworthy and Worrisome Websites | Hans W. A. Hanley et.al. | 2501.09102 | link |
2025-01-15 | Drama Llama: An LLM-Powered Storylets Framework for Authorable Responsiveness in Interactive Narrative | Yuqian Sun et.al. | 2501.09099 | null |
2025-01-15 | SteLLA: A Structured Grading System Using LLMs with RAG | Hefei Qiu et.al. | 2501.09092 | null |
2025-01-15 | Generative diffusion model with inverse renormalization group flows | Kanta Masuki et.al. | 2501.09064 | link |
2025-01-15 | Decompose-ToM: Enhancing Theory of Mind Reasoning in Large Language Models through Simulation and Task Decomposition | Sneheel Sarangi et.al. | 2501.09056 | link |
2025-01-15 | How Do Generative Models Draw a Software Engineer? A Case Study on Stable Diffusion Bias | Tosin Fadahunsi et.al. | 2501.09014 | link |
2025-01-15 | Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians | Ishan Amin et.al. | 2501.09009 | link |
2025-01-15 | Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails | Shaona Ghosh et.al. | 2501.09004 | null |
2025-01-15 | Vision Foundation Models for Computed Tomography | Suraj Pai et.al. | 2501.09001 | null |
2025-01-15 | CrystalGRW: Generative Modeling of Crystal Structures with Targeted Properties via Geodesic Random Walks | Krit Tangsongcharoen et.al. | 2501.08998 | link |
2025-01-15 | VECT-GAN: A variationally encoded generative model for overcoming data scarcity in pharmaceutical science | Youssef Abdalla et.al. | 2501.08995 | link |
2025-01-15 | CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities | Haozhe Xie et.al. | 2501.08983 | link |
2025-01-15 | Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models | Emma Croxford et.al. | 2501.08977 | null |
2025-01-15 | Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models | Karukriti Kaushik Ghosh et.al. | 2501.08974 | null |
2025-01-15 | Analyzing the Ethical Logic of Six Large Language Models | W. Russell Neuman et.al. | 2501.08951 | null |
2025-01-15 | Applying General Turn-taking Models to Conversational Human-Robot Interaction | Gabriel Skantze et.al. | 2501.08946 | null |
2025-01-15 | Disentangling Exploration of Large Language Models by Optimal Exploitation | Tim Grams et.al. | 2501.08925 | null |
2025-01-15 | GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge | Liam Dugan et.al. | 2501.08913 | link |
2025-01-15 | Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning | Qinyu Ma et.al. | 2501.08897 | link |
2025-01-15 | Connecting SPDE to SGMs | Junsu Seo et.al. | 2501.08877 | null |
2025-01-15 | Exploring Task-Level Optimal Prompts for Visual In-Context Learning | Yan Zhu et.al. | 2501.08841 | null |
2025-01-15 | How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering | Christoph Treude et.al. | 2501.08774 | null |
2025-01-15 | Admitting Ignorance Helps the Video Question Answering Models to Answer | Haopeng Li et.al. | 2501.08771 | null |
2025-01-15 | Enhanced Large Language Models for Effective Screening of Depression and Anxiety | June M. Liu et.al. | 2501.08769 | null |
2025-01-15 | Few-Shot Learner Generalizes Across AI-Generated Image Detection | Shiyu Wu et.al. | 2501.08763 | null |
2025-01-15 | Leveraging LLM Agents for Translating Network Configurations | Yunze Wei et.al. | 2501.08760 | null |
2025-01-15 | The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities | Irina Bigoulaeva et.al. | 2501.08716 | link |
2025-01-15 | Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching | Chuangtao Ma et.al. | 2501.08686 | link |
2025-01-15 | RealVVT: Towards Photorealistic Video Virtual Try-on via Spatio-Temporal Consistency | Siqi Li et.al. | 2501.08682 | null |
2025-01-15 | Augmenting Smart Contract Decompiler Output through Fine-grained Dependency Analysis and LLM-facilitated Semantic Recovery | Zeqin Liao et.al. | 2501.08670 | null |
2025-01-15 | MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities | Savya Khosla et.al. | 2501.08648 | null |
2025-01-15 | Reassessing the Role of Chain-of-Thought in Sentiment Analysis: Insights and Limitations | Kaiyuan Zheng et.al. | 2501.08641 | null |
2025-01-15 | SWSC: Shared Weight for Similar Channel in LLM | Binrui Zeng et.al. | 2501.08631 | null |
2025-01-15 | Disjoint Processing Mechanisms of Hierarchical and Linear Grammars in Large Language Models | Aruna Sankaranarayanan et.al. | 2501.08618 | link |
2025-01-15 | RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation | Kaiqu Liang et.al. | 2501.08617 | null |
2025-01-15 | Assessing the Alignment of FOL Closeness Metrics with Human Judgement | Ramya Keerthy Thatikonda et.al. | 2501.08613 | link |
2025-01-15 | Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design | Zhi Zheng et.al. | 2501.08603 | link |
2025-01-15 | AutoRestTest: A Tool for Automated REST API Testing Using LLMs and MARL | Tyler Stennett et.al. | 2501.08600 | null |
2025-01-15 | LlamaRestTest: Effective REST API Testing with Small Language Models | Myeongsoo Kim et.al. | 2501.08598 | null |
2025-01-15 | Sound Scene Synthesis at the DCASE 2024 Challenge | Mathieu Lagrange et.al. | 2501.08587 | null |
2025-01-15 | LoRS: Efficient Low-Rank Adaptation for Sparse Large Language Model | Yuxuan Hu et.al. | 2501.08582 | null |
2025-01-15 | Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation | Jiaqi Huang et.al. | 2501.08580 | link |
2025-01-15 | Information Entropy Invariance: Enhancing Length Extrapolation in Attention Mechanisms | Kewei Li et.al. | 2501.08570 | link |
2025-01-15 | Adaptive Sampled Softmax with Inverted Multi-Index: Methods, Theory and Applications | Jin Chen et.al. | 2501.08563 | link |
2025-01-15 | LAMS: LLM-Driven Automatic Mode Switching for Assistive Teleoperation | Yiran Tao et.al. | 2501.08558 | null |
2025-01-15 | The Devil is in Temporal Token: High Quality Video Reasoning Segmentation | Sitong Gong et.al. | 2501.08549 | null |
2025-01-15 | Comprehensive Subjective and Objective Evaluation Method for Text-generated Video | Zelu Qi et.al. | 2501.08545 | null |
2025-01-15 | Doc-Guided Sent2Sent++: A Sent2Sent++ Agent with Doc-Guided memory for Document-level Machine Translation | Jiaxin Guo et.al. | 2501.08523 | null |
2025-01-14 | Quantifying the Importance of Data Alignment in Downstream Model Performance | Krrish Chawla et.al. | 2501.08496 | null |
2025-01-14 | Benchmarking Classical, Deep, and Generative Models for Human Activity Recognition | Md Meem Hossain et.al. | 2501.08471 | null |
2025-01-14 | Selective Attention Merging for low resource tasks: A case study of Child ASR | Natarajan Balaji Shankar et.al. | 2501.08468 | link |
2025-01-14 | Time series forecasting for multidimensional telemetry data using GAN and BiLSTM in a Digital Twin | Joao Carmo de Almeida Neto et.al. | 2501.08464 | null |
2025-01-14 | Large Language Models For Text Classification: Case Study And Comprehensive Review | Arina Kostina et.al. | 2501.08457 | null |
2025-01-14 | Tag&Tab: Pretraining Data Detection in Large Language Models Using Keyword-Based Membership Inference Attack | Sagiv Antebi et.al. | 2501.08454 | null |
2025-01-14 | Religious Bias Landscape in Language and Text-to-Image Models: Analysis, Detection, and Debiasing Strategies | Ajwad Abrar et.al. | 2501.08441 | null |
2025-01-14 | SEAL: Speaker Error Correction using Acoustic-conditioned Large Language Models | Anurag Kumar et.al. | 2501.08421 | null |
2025-01-14 | Nonlinear Modeling of a PEM Fuel Cell System; a Practical Study with Experimental Validation | Seyed Mehdi Rakhtala et.al. | 2501.08420 | null |
2025-01-14 | Ensemble of Large Language Models for Curated Labeling and Rating of Free-text Data | Jiaxing Qiu et.al. | 2501.08413 | link |
2025-01-14 | OptiChat: Bridging Optimization Models and Practitioners with Large Language Models | Hao Chen et.al. | 2501.08406 | link |
2025-01-14 | Towards Best Practices for Open Datasets for LLM Training | Stefan Baack et.al. | 2501.08365 | null |
2025-01-14 | Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise | Ryan Burgert et.al. | 2501.08331 | link |
2025-01-14 | PokerBench: Training Large Language Models to become Professional Poker Players | Richard Zhuang et.al. | 2501.08328 | link |
2025-01-14 | Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks | Miran Heo et.al. | 2501.08326 | null |
2025-01-14 | ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations | Ziyuan Huang et.al. | 2501.08324 | null |
2025-01-14 | Exploring Robustness of Multilingual LLMs on Real-World Noisy Data | Amirhossein Aliakbarzadeh et.al. | 2501.08322 | link |
2025-01-14 | Enhancing Automated Interpretability with Output-Centric Feature Descriptions | Yoav Gur-Arieh et.al. | 2501.08319 | link |
2025-01-14 | MiniMax-01: Scaling Foundation Models with Lightning Attention | MiniMax et.al. | 2501.08313 | null |
2025-01-14 | HALoGEN: Fantastic LLM Hallucinations and Where to Find Them | Abhilasha Ravichander et.al. | 2501.08292 | null |
2025-01-14 | LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding | Hongyu Li et.al. | 2501.08282 | link |
2025-01-14 | Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing | Pulkit Arora et.al. | 2501.08276 | null |
2025-01-14 | Addressing the sustainable AI trilemma: a case study on LLM agents and RAG | Hui Wu et.al. | 2501.08262 | null |
2025-01-14 | Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models | Yifu Qiu et.al. | 2501.08248 | null |
2025-01-14 | Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints | Jonathan Nöther et.al. | 2501.08246 | null |
2025-01-14 | CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset | Jiawei Du et.al. | 2501.08238 | null |
2025-01-14 | Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings | Paul Joe Maliakel et.al. | 2501.08219 | null |
2025-01-14 | ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems | Mohita Chowdhury et.al. | 2501.08208 | null |
2025-01-14 | ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving | Zain Ul Abedin et.al. | 2501.08203 | null |
2025-01-14 | CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation | Jinjun Peng et.al. | 2501.08200 | link |
2025-01-14 | OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training | Yijiong Yu et.al. | 2501.08197 | link |
2025-01-14 | PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving | Ahmet Caner Yüzügüler et.al. | 2501.08192 | null |
2025-01-14 | A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation | Steven Landgraf et.al. | 2501.08188 | null |
2025-01-15 | A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following | Yin Fang et.al. | 2501.08187 | link |
2025-01-14 | Potential and Perils of Large Language Models as Judges of Unstructured Textual Data | Rewina Bedemariam et.al. | 2501.08167 | null |
2025-01-14 | I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution | Soohyeon Choi et.al. | 2501.08165 | null |
2025-01-14 | Multiple-Input Variational Auto-Encoder for Anomaly Detection in Heterogeneous Data | Phai Vu Dinh et.al. | 2501.08149 | null |
2025-01-14 | Refusal Behavior in Large Language Models: A Nonlinear Perspective | Fabian Hildebrandt et.al. | 2501.08145 | link |
2025-01-14 | Bootstrapping Corner Cases: High-Resolution Inpainting for Safety Critical Detect and Avoid for Automated Flying | Jonathan Lyhs et.al. | 2501.08142 | null |
2025-01-14 | Revisiting Birds Eye View Perception Models with Frozen Foundation Models: DINOv2 and Metric3Dv2 | Seamie Hayes et.al. | 2501.08118 | null |
2025-01-15 | Consistency of Responses and Continuations Generated by Large Language Models on Social Media | Wenlu Fan et.al. | 2501.08102 | null |
2025-01-14 | Hierarchical Autoscaling for Large Language Model Serving with Chiron | Archit Patke et.al. | 2501.08090 | null |
2025-01-14 | Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | Nert Keser et.al. | 2501.08083 | null |
2025-01-14 | CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Guoliang He et.al. | 2501.08071 | link |
2025-01-14 | A Roadmap to Guide the Integration of LLMs in Hierarchical Planning | Israel Puerta-Merino et.al. | 2501.08068 | null |
2025-01-14 | Exploring Narrative Clustering in Large Language Models: A Layerwise Analysis of BERT | Awritrojit Banerjee et.al. | 2501.08053 | null |
2025-01-14 | TriAdaptLoRA: Brain-Inspired Triangular Adaptive Low-Rank Adaptation for Parameter-Efficient Fine-Tuning | Yao Liang et.al. | 2501.08008 | null |
2025-01-14 | LLM-Enhanced Holonic Architecture for Ad-Hoc Scalable SoS | Muhammad Ashfaq et.al. | 2501.07992 | null |
2025-01-14 | Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness | Jiaxing Zhao et.al. | 2501.07978 | null |
2025-01-14 | Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models | Yifang Xu et.al. | 2501.07972 | null |
2025-01-14 | Self-Instruct Few-Shot Jailbreaking: Decompose the Attack into Pattern and Behavior Learning | Jiaqi Hua et.al. | 2501.07959 | link |
2025-01-14 | AI Guide Dog: Egocentric Path Prediction on Smartphone | Aishwarya Jadhav et.al. | 2501.07957 | null |
2025-01-14 | Advice for Diabetes Self-Management by ChatGPT Models: Challenges and Recommendations | Waqar Hussain et.al. | 2501.07931 | null |
2025-01-14 | Gandalf the Red: Adaptive Security for LLMs | Niklas Pfister et.al. | 2501.07927 | link |
2025-01-14 | VENOM: Text-driven Unrestricted Adversarial Example Generation with Diffusion Models | Hui Kuurila-Zhang et.al. | 2501.07922 | link |
2025-01-14 | Large Language Model Interface for Home Energy Management Systems | François Michelon et.al. | 2501.07919 | null |
2025-01-14 | Bridge-SR: Schrödinger Bridge for Efficient SR | Chang Li et.al. | 2501.07897 | null |
2025-01-14 | Leveraging Metamemory Mechanisms for Enhanced Data-Free Code Generation in LLMs | Shuai Wang et.al. | 2501.07892 | null |
2025-01-14 | ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process Rewarding | Zhongxiang Sun et.al. | 2501.07861 | null |
2025-01-14 | Optimizing Language Models for Grammatical Acceptability: A Comparative Study of Fine-Tuning Techniques | Shobhit Ratan et.al. | 2501.07853 | null |
2025-01-14 | Unveiling Provider Bias in Large Language Models for Code Generation | Xiaoyu Zhang et.al. | 2501.07849 | null |
2025-01-14 | Reasoning with Graphs: Structuring Implicit Knowledge to Enhance LLMs Reasoning | Haoyu Han et.al. | 2501.07845 | null |
2025-01-14 | A Driver Advisory System Based on Large Language Model for High-speed Train | Y. C. Luo et.al. | 2501.07837 | null |
2025-01-14 | Flow: A Modular Approach to Automated Agentic Workflow Generation | Boye Niu et.al. | 2501.07834 | null |
2025-01-14 | Real-time Verification and Refinement of Language Model Text Generation | Joonho Ko et.al. | 2501.07824 | null |
2025-01-14 | 3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene Understanding | Haomiao Xiong et.al. | 2501.07819 | link |
2025-01-14 | A Multi-Encoder Frozen-Decoder Approach for Fine-Tuning Large Language Models | Kaustubh D. Dhole et.al. | 2501.07818 | null |
2025-01-14 | Agent-Centric Projection of Prompting Techniques and Implications for Synthetic Training Data for Large Language Models | Dhruv Dhamani et.al. | 2501.07815 | null |
2025-01-14 | Talk to Right Specialists: Routing and Planning in Multi-agent System for Question Answering | Feijie Wu et.al. | 2501.07813 | null |
2025-01-14 | CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation | Ruwei Pan et.al. | 2501.07811 | null |
2025-01-14 | Visual Language Models as Operator Agents in the Space Domain | Alejandro Carrasco et.al. | 2501.07802 | null |
2025-01-14 | Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding | Zhaokai Wang et.al. | 2501.07783 | link |
2025-01-14 | Symmetry-Aware Generative Modeling through Learned Canonicalization | Kusha Sareen et.al. | 2501.07773 | null |
2025-01-14 | Large Language Models for Knowledge Graph Embedding Techniques, Methods, and Challenges: A Survey | Bingchen Liu et.al. | 2501.07766 | null |
2025-01-14 | On the Statistical Capacity of Deep Generative Models | Edric Tam et.al. | 2501.07763 | link |
2025-01-13 | Advancing Student Writing Through Automated Syntax Feedback | Kamyar Zeinalipour et.al. | 2501.07740 | null |
2025-01-13 | Democratizing Text-to-Image Masked Generative Models with Compact Text-Aware One-Dimensional Tokens | Dongwon Kim et.al. | 2501.07730 | null |
2025-01-13 | LLMic: Romanian Foundation Language Model | Vlad-Andrei Bădoiu et.al. | 2501.07721 | null |
2025-01-13 | CDS: Data Synthesis Method Guided by Cognitive Diagnosis Theory | Haokun Zhao et.al. | 2501.07674 | null |
2025-01-13 | Enhancing Talent Employment Insights Through Feature Extraction with LLM Finetuning | Karishma Thakrar et.al. | 2501.07663 | null |
2025-01-13 | Large Language Models for Interpretable Mental Health Diagnosis | Brian Hyeongseok Kim et.al. | 2501.07653 | null |
2025-01-13 | BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations | Weixi Feng et.al. | 2501.07647 | null |
2025-01-13 | GPT as a Monte Carlo Language Tree: A Probabilistic Perspective | Kun-Peng Ning et.al. | 2501.07641 | null |
2025-01-13 | SafePowerGraph-LLM: Novel Power Grid Graph Embedding and Optimization with Large Language Models | Fabien Bernier et.al. | 2501.07639 | null |
2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563 | null |
2025-01-13 | Imagine while Reasoning in Space: Multimodal Visualization-of-Thought | Chengzu Li et.al. | 2501.07542 | null |
2025-01-13 | ML Mule: Mobile-Driven Context-Aware Collaborative Learning | Haoxiang Yu et.al. | 2501.07536 | null |
2025-01-13 | Investigating Large Language Models in Inferring Personality Traits from User Conversations | Jianfeng Zhu et.al. | 2501.07532 | null |
2025-01-13 | RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment | Difei Gu et.al. | 2501.07525 | link |
2025-01-13 | Parallel Key-Value Cache Fusion for Position Invariant RAG | Philhoon Oh et.al. | 2501.07523 | null |
2025-01-13 | Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards | Yangsibo Huang et.al. | 2501.07493 | null |
2025-01-13 | TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models | Thales Sales Almeida et.al. | 2501.07482 | null |
2025-01-13 | A Survey of Embodied AI in Healthcare: Techniques, Applications, and Opportunities | Yihao Liu et.al. | 2501.07468 | null |
2025-01-13 | Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI | Rolf Pfister et.al. | 2501.07458 | null |
2025-01-13 | Enhancing LLM's Ability to Generate More Repository-Aware Unit Tests Through Precise Contextual Information Injection | Xin Yin et.al. | 2501.07425 | null |
2025-01-13 | Initial Findings on Sensor based Open Vocabulary Activity Recognition via Text Embedding Inversion | Lala Shakti Swarup Ray et.al. | 2501.07408 | null |
2025-01-13 | OCORD: Open-Campus Object Removal Dataset | Shuo Zhang et.al. | 2501.07397 | null |
2025-01-13 | Simulating the Hubbard Model with Equivariant Normalizing Flows | Dominic Schuh et.al. | 2501.07371 | null |
2025-01-13 | Emergent effects of scaling on the functional hierarchies within large language models | Paul C. Bogdan et.al. | 2501.07359 | null |
2025-01-13 | Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring | Buse Sibel Korkmaz et.al. | 2501.07324 | link |
2025-01-13 | FinerWeb-10BT: Refining Web Data with LLM-Based Line-Level Filtering | Erik Henriksson et.al. | 2501.07314 | link |
2025-01-13 | The Lessons of Developing Process Reward Models in Mathematical Reasoning | Zhenru Zhang et.al. | 2501.07301 | null |
2025-01-13 | GestLLM: Advanced Hand Gesture Interpretation via Large Language Models for Human-Robot Interaction | Oleg Kobzarev et.al. | 2501.07295 | null |
2025-01-13 | LLM-Net: Democratizing LLMs-as-a-Service through Blockchain-based Expert Networks | Zan-Kai Chong et.al. | 2501.07288 | null |
2025-01-13 | Lifelong Learning of Large Language Model based Agents: A Roadmap | Junhao Zheng et.al. | 2501.07278 | link |
2025-01-13 | Bridging Smart Meter Gaps: A Benchmark of Statistical, Machine Learning and Time Series Foundation Models for Data Imputation | Amir Sartipi et.al. | 2501.07276 | null |
2025-01-13 | Transforming Role Classification in Scientific Teams Using LLMs and Advanced Predictive Analytics | Wonduk Seo et.al. | 2501.07267 | null |
2025-01-13 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion | Li Liang et.al. | 2501.07260 | link |
2025-01-13 | EdgeTAM: On-Device Track Anything Model | Chong Zhou et.al. | 2501.07256 | null |
2025-01-13 | Large Language Models: New Opportunities for Access to Science | Jutta Schnabel et.al. | 2501.07250 | null |
2025-01-13 | Breaking Memory Limits: Gradient Wavelet Transform Enhances LLMs Training | Ziqing Wen et.al. | 2501.07237 | link |
2025-01-13 | Touched by ChatGPT: Using an LLM to Drive Affective Tactile Interaction | Qiaoqiao Ren et.al. | 2501.07224 | link |
2025-01-13 | Pre-Trained Large Language Model Based Remaining Useful Life Transfer Prediction of Bearing | Laifa Tao et.al. | 2501.07191 | null |
2025-01-13 | Unveiling Code Clone Patterns in Open Source VR Software: An Empirical Study | Huashan Chen et.al. | 2501.07165 | null |
2025-01-13 | AlphaNet: Scaling Up Local Frame-based Atomistic Foundation Model | Bangchen Yin et.al. | 2501.07155 | link |
2025-01-13 | LLM360 K2: Scaling Up 360-Open-Source Large Language Models | Zhengzhong Liu et.al. | 2501.07124 | null |
2025-01-13 | How GPT learns layer by layer | Jason Du et.al. | 2501.07108 | link |
2025-01-13 | ADKGD: Anomaly Detection in Knowledge Graphs with Dual-Channel Training | Jiayang Wu et.al. | 2501.07078 | link |
2025-01-13 | D3MES: Diffusion Transformer with multihead equivariant self-attention for 3D molecule generation | Zhejun Zhang et.al. | 2501.07077 | link |
2025-01-13 | Value Compass Leaderboard: A Platform for Fundamental and Validated Evaluation of LLMs Values | Jing Yao et.al. | 2501.07071 | null |
2025-01-13 | Enhancing Image Generation Fidelity via Progressive Prompts | Zhen Xiong et.al. | 2501.07070 | link |
2025-01-13 | Logic Meets Magic: LLMs Cracking Smart Contract Vulnerabilities | ZeKe Xiao et.al. | 2501.07058 | null |
2025-01-13 | SFC-GAN: A Generative Adversarial Network for Brain Functional and Structural Connectome Translation | Yee-Fan Tan et.al. | 2501.07055 | null |
2025-01-13 | PoAct: Policy and Action Dual-Control Agent for Generalized Applications | Guozhi Yuan et.al. | 2501.07054 | null |
2025-01-13 | ROSAnnotator: A Web Application for ROSBag Data Analysis in Human-Robot Interaction | Yan Zhang et.al. | 2501.07051 | link |
2025-01-13 | Unveiling the Potential of Text in High-Dimensional Time Series Forecasting | Xin Zhou et.al. | 2501.07048 | link |
2025-01-13 | Explore the Use of Time Series Foundation Model for Car-Following Behavior Analysis | Luwei Zeng et.al. | 2501.07034 | null |
2025-01-13 | A Proposed Large Language Model-Based Smart Search for Archive System | Ha Dung Nguyen et.al. | 2501.07024 | null |
2025-01-13 | Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps | Henry Li et.al. | 2501.06999 | link |
2025-01-13 | LEO: Boosting Mixture of Vision Encoders for Multimodal Large Language Models | Mozhgan Nasr Azadani et.al. | 2501.06986 | link |
2025-01-13 | Combining LLM decision and RL action selection to improve RL policy for adaptive interventions | Karine Karine et.al. | 2501.06980 | null |
2025-01-12 | How is Google using AI for internal code migrations? | Stoyan Nikolov et.al. | 2501.06972 | null |
2025-01-12 | Enhancing Patient-Centric Communication: Leveraging LLMs to Simulate Patient Perspectives | Xinyao Ma et.al. | 2501.06964 | null |
2025-01-12 | Comparison of Autoencoders for tokenization of ASL datasets | Vouk Praun-Petrovic et.al. | 2501.06942 | null |
2025-01-12 | Super-Resolution of 3D Micro-CT Images Using Generative Adversarial Networks: Enhancing Resolution and Segmentation Accuracy | Evgeny Ugolkov et.al. | 2501.06939 | link |
2025-01-12 | Harnessing Large Language Models for Disaster Management: A Survey | Zhenyu Lei et.al. | 2501.06932 | null |
2025-01-12 | Monolithic 3D FPGAs Utilizing Back-End-of-Line Configuration Memories | Faaiq Waqar et.al. | 2501.06921 | null |
2025-01-12 | Risk-Averse Finetuning of Large Language Models | Sapana Chaudhary et.al. | 2501.06911 | link |
2025-01-12 | Deep Learning and Foundation Models for Weather Prediction: A Survey | Jimeng Shi et.al. | 2501.06907 | null |
2025-01-12 | A Foundational Generative Model for Breast Ultrasound Image Analysis | Haojun Yu et.al. | 2501.06869 | null |
2025-01-12 | Transfer Learning of Tabular Data by Finetuning Large Language Models | Shourav B. Rabbani et.al. | 2501.06863 | null |
2025-01-12 | A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context | Noureldin Zahran et.al. | 2501.06859 | null |
2025-01-12 | SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training | Tianjin Huang et.al. | 2501.06842 | link |
2025-01-12 | An efficient approach to represent enterprise web application structure using Large Language Model in the service of Intelligent Quality Engineering | Zaber Al Hassan Ayon et.al. | 2501.06837 | null |
2025-01-12 | X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding | Wenqi Zhou et.al. | 2501.06835 | null |
2025-01-12 | LLMs Model Non-WEIRD Populations: Experiments with Synthetic Cultural Agents | Augusto Gonzalez-Bonorino et.al. | 2501.06834 | link |
2025-01-12 | GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing | Ruizhe Ou et.al. | 2501.06828 | null |
2025-01-12 | Leveraging Taxonomy and LLMs for Improved Multimodal Hierarchical Classification | Shijing Chen et.al. | 2501.06827 | null |
2025-01-12 | Event Argument Extraction with Enriched Prompts | Chen Liang et.al. | 2501.06825 | link |
2025-01-12 | A Study on Educational Data Analysis and Personalized Feedback Report Generation Based on Tags and ChatGPT | Yizhou Zhou et.al. | 2501.06819 | null |
2025-01-12 | RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models | Keyan Chen et.al. | 2501.06809 | link |
2025-01-12 | Semantic-CD: Remote Sensing Image Semantic Change Detection towards Open-vocabulary Setting | Yongshuo Zhu et.al. | 2501.06808 | null |
2025-01-12 | MPCache: MPC-Friendly KV Cache Eviction for Efficient Private Large Language Model Inference | Wenxuan Zeng et.al. | 2501.06807 | null |
2025-01-12 | Bridging the Fairness Gap: Enhancing Pre-trained Models with LLM-Generated Sentences | Liu Yu et.al. | 2501.06795 | null |
2025-01-12 | 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes | Mahmoud Ahmed et.al. | 2501.06785 | link |
2025-01-12 | Cost-Effective Robotic Handwriting System with AI Integration | Tianyi Huang et.al. | 2501.06783 | null |
2025-01-12 | Eliza: A Web3 friendly AI Agent Operating System | Shaw Walters et.al. | 2501.06781 | link |
2025-01-12 | VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning | Ji Soo Lee et.al. | 2501.06761 | link |
2025-01-12 | Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation | Shunfan Zheng et.al. | 2501.06741 | null |
2025-01-12 | ZOQO: Zero-Order Quantized Optimization | Noga Bar et.al. | 2501.06736 | null |
2025-01-12 | Better Prompt Compression Without Multi-Layer Perceptrons | Edouardo Honig et.al. | 2501.06730 | null |
2025-01-12 | Measuring the Robustness of Reference-Free Dialogue Evaluation Systems | Justin Vasselli et.al. | 2501.06728 | link |
2025-01-12 | Integrated Sensing and Edge AI: Realizing Intelligent Perception in 6G | Zhiyan Liu et.al. | 2501.06726 | null |
2025-01-12 | DRDT3: Diffusion-Refined Decision Test-Time Training Model | Xingshuai Huang et.al. | 2501.06718 | null |
2025-01-12 | ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian | Mykyta Syromiatnikov et.al. | 2501.06715 | link |
2025-01-12 | Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management | Liu Qianli et.al. | 2501.06709 | null |
2025-01-12 | Evaluating Sample Utility for Data Selection by Mimicking Model Weights | Tzu-Heng Huang et.al. | 2501.06708 | null |
2025-01-12 | AIOpsLab: A Holistic Framework to Evaluate AI Agents for Enabling Autonomous Clouds | Yinfang Chen et.al. | 2501.06706 | null |
2025-01-12 | Fine-tuning ChatGPT for Automatic Scoring of Written Scientific Explanations in Chinese | Jie Yang et.al. | 2501.06704 | null |
2025-01-12 | Large Language Models, Knowledge Graphs and Search Engines: A Crossroads for Answering Users' Questions | Aidan Hogan et.al. | 2501.06699 | null |
2025-01-12 | DVM: Towards Controllable LLM Agents in Social Deduction Games | Zheng Zhang et.al. | 2501.06695 | null |
2025-01-12 | TAPO: Task-Referenced Adaptation for Prompt Optimization | Wenxin Luo et.al. | 2501.06689 | link |
2025-01-12 | Generative AI in Education: From Foundational Insights to the Socratic Playground for Learning | Xiangen Hu et.al. | 2501.06682 | null |
2025-01-12 | Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving | Haoxiang Gao et.al. | 2501.06680 | null |
2025-01-11 | Challenging reaction prediction models to generalize to novel chemistry | John Bradshaw et.al. | 2501.06669 | link |
2025-01-11 | Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training | Sanjit Kakarla et.al. | 2501.06658 | link |
2025-01-11 | FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings | Tong Liu et.al. | 2501.06645 | null |
2025-01-11 | Scaling Down Semantic Leakage: Investigating Associative Bias in Smaller Language Models | Veronika Smilga et.al. | 2501.06638 | link |
2025-01-11 | Quantifying Relational Exploration in Cultural Heritage Knowledge Graphs with LLMs: A Neuro-Symbolic Approach | Mohammed Maree et.al. | 2501.06628 | null |
2025-01-11 | Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks | Amr Almorsi et.al. | 2501.06625 | null |
2025-01-11 | Denoising Diffusion Probabilistic Model for Radio Map Estimation in Generative Wireless Networks | Xuanhao Luo et.al. | 2501.06604 | null |
2025-01-11 | ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation | Xuanle Zhao et.al. | 2501.06598 | link |
2025-01-11 | ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning | Xiangru Tang et.al. | 2501.06590 | link |
2025-01-11 | Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping | Muru Zhang et.al. | 2501.06589 | link |
2025-01-10 | LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Omkar Thawakar et.al. | 2501.06186 | link |
2025-01-10 | PEACE: Empowering Geologic Map Holistic Understanding with MLLMs | Yangyu Huang et.al. | 2501.06184 | null |
2025-01-10 | VideoAuteur: Towards Long Narrative Video Generation | Junfei Xiao et.al. | 2501.06173 | null |
2025-01-10 | GenMol: A Drug Discovery Generalist with Discrete Diffusion | Seul Lee et.al. | 2501.06158 | null |
2025-01-10 | Multilingual Performance of a Multimodal Artificial Intelligence System on Multisubject Physics Concept Inventories | Gerd Kortemeyer et.al. | 2501.06143 | null |
2025-01-10 | Supervision policies can shape long-term risk management in general-purpose AI models | Manuel Cebrian et.al. | 2501.06137 | link |
2025-01-10 | Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | Yuya Asano et.al. | 2501.06129 | null |
2025-01-10 | Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Fabian David Schmidt et.al. | 2501.06117 | link |
2025-01-10 | From Conversation to Automation: Leveraging Large Language Models to Analyze Strategies in Problem Solving Therapy | Elham Aghakhani et.al. | 2501.06101 | null |
2025-01-10 | Photokinetics of Photothermal Reactions | Mounir Maafi et.al. | 2501.06057 | null |
2025-01-10 | AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discovery | Johann Wenckstern et.al. | 2501.06039 | link |
2025-01-10 | Addressing speaker gender bias in large scale speech translation systems | Shubham Bansal et.al. | 2501.05989 | null |
2025-01-10 | Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Eklavya Sarkar et.al. | 2501.05987 | link |
2025-01-10 | Exploring LLMs for Automated Pre-Testing of Cross-Cultural Surveys | Divya Mani Adhikari et.al. | 2501.05985 | null |
2025-01-10 | Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea | Eunjung Cho et.al. | 2501.05981 | null |
2025-01-10 | Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory | Yunmeng Shu et.al. | 2501.05965 | null |
2025-01-10 | Effective faking of verbal deception detection with target-aligned adversarial attacks | Bennett Kleinberg et.al. | 2501.05962 | null |
2025-01-10 | Reusable specimen-level inference in computational pathology | Jakub R. Kaczmarzyk et.al. | 2501.05945 | link |
2025-01-10 | DiffuSETS: 12-lead ECG Generation Conditioned on Clinical Text Reports and Patient-Specific Information | Yongfan Lai et.al. | 2501.05932 | link |
2025-01-10 | LLMs Reproduce Stereotypes of Sexual and Gender Minorities | Ruby Ostrow et.al. | 2501.05926 | null |
2025-01-10 | Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction | Petraq Nako et.al. | 2501.05925 | null |
2025-01-10 | Valley2: Exploring Multimodal Models with Scalable Vision-Language Design | Ziheng Wu et.al. | 2501.05901 | link |
2025-01-10 | Prompt engineering and its implications on the energy consumption of Large Language Models | Riccardo Rubei et.al. | 2501.05899 | link |
2025-01-10 | Affordably Fine-tuned LLMs Provide Better Answers to Course-specific MCQs | Bianca Raimondi et.al. | 2501.05891 | link |
2025-01-10 | Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs | Dabing Cheng et.al. | 2501.05884 | null |
2025-01-10 | VideoRAG: Retrieval-Augmented Generation over Video Corpus | Soyeong Jeong et.al. | 2501.05874 | null |
2025-01-10 | ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability | Antonin Poché et.al. | 2501.05855 | link |
2025-01-10 | Understanding Impact of Human Feedback via Influence Functions | Taywon Min et.al. | 2501.05790 | link |
2025-01-10 | Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models | You Li et.al. | 2501.05767 | null |
2025-01-10 | Controlling Large Language Models Through Concept Activation Vectors | Hanyu Zhang et.al. | 2501.05764 | null |
2025-01-10 | StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation | Shangjin Zhai et.al. | 2501.05763 | null |
2025-01-10 | CognoSpeak: an automatic, remote assessment of early cognitive decline in real-world conversational speech | Madhurananda Pahar et.al. | 2501.05755 | null |
2025-01-10 | Semantic Exploration with Adaptive Gating for Efficient Problem Solving with Language Models | Sungjae Lee et.al. | 2501.05752 | null |
2025-01-10 | TB-Bench: Training and Testing Multi-Modal AI for Understanding Spatio-Temporal Traffic Behaviors from Dashcam Images/Videos | Korawat Charoenpitaks et.al. | 2501.05733 | link |
2025-01-10 | Enabling Scalable Oversight via Self-Evolving Critic | Zhengyang Tang et.al. | 2501.05727 | null |
2025-01-10 | I Can't Share Code, but I need Translation -- An Empirical Study on Code Translation through Federated LLM | Jahnavi Kumar et.al. | 2501.05724 | null |
2025-01-10 | How to Enable Effective Cooperation Between Humans and NLP Models: A Survey of Principles, Formalizations, and Beyond | Chen Huang et.al. | 2501.05714 | null |
2025-01-10 | Multi-Step Reasoning in Korean and the Emergent Mirage | Guijin Son et.al. | 2501.05712 | null |
2025-01-10 | EmotiCrafter: Text-to-Emotional-Image Generation based on Valence-Arousal Model | Yi He et.al. | 2501.05710 | null |
2025-01-10 | Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains | Vighnesh Subramaniam et.al. | 2501.05707 | null |
2025-01-10 | Debugging Without Error Messages: How LLM Prompting Strategy Affects Programming Error Explanation Effectiveness | Audrey Salmon et.al. | 2501.05706 | null |
2025-01-10 | Facilitate Collaboration between Large Language Model and Task-specific Model for Time Series Anomaly Detection | Feiyi Chen et.al. | 2501.05675 | null |
2025-01-10 | Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration | Zuyuan Zhang et.al. | 2501.05673 | null |
2025-01-10 | Cascaded Self-Evaluation Augmented Training for Efficient Multimodal Large Language Models | Zheqi Lv et.al. | 2501.05662 | null |
2025-01-10 | Collaboration of Large Language Models and Small Recommendation Models for Device-Cloud Recommendation | Zheqi Lv et.al. | 2501.05647 | null |
2025-01-10 | Iconicity in Large Language Models | Anna Marklová et.al. | 2501.05643 | null |
2025-01-10 | HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection | Anant Mehta et.al. | 2501.05631 | link |
2025-01-10 | The Impact of Model Scaling on Seen and Unseen Language Performance | Rhitabrat Pokharel et.al. | 2501.05629 | null |
2025-01-09 | Harnessing Large Language Model for Virtual Reality Exploration Testing: A Case Study | Zhenyu Qi et.al. | 2501.05625 | null |
2025-01-09 | Exploring Large Language Models for Translating Romanian Computational Problems into English | Adrian Marius Dumitran et.al. | 2501.05601 | null |
2025-01-09 | Physics-Driven Learning for Inverse Problems in Quantum Chromodynamics | Gert Aarts et.al. | 2501.05580 | null |
2025-01-09 | Exploring Large Language Models (LLMs) through interactive Python activities | Eugenio Tufino et.al. | 2501.05577 | link |
2025-01-09 | LLMQuoter: Enhancing RAG Capabilities Through Efficient Quote Extraction From Large Contexts | Yuri Facanha Bezerra et.al. | 2501.05554 | link |
2025-01-09 | The dynamics of meaning through time: Assessment of Large Language Models | Mohamed Taher Alrefaie et.al. | 2501.05552 | null |
2025-01-09 | Infecting Generative AI With Viruses | David Noever et.al. | 2501.05542 | null |
2025-01-09 | NSChat: A Chatbot System To Rule Them All | Zenon Lamprou et.al. | 2501.05541 | null |
2025-01-09 | ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding | Xingyu Fu et.al. | 2501.05452 | null |
2025-01-09 | Relative Pose Estimation through Affine Corrections of Monocular Depth Priors | Yifan Yu et.al. | 2501.05446 | link |
2025-01-09 | Consistent Flow Distillation for Text-to-3D Generation | Runjie Yan et.al. | 2501.05445 | null |
2025-01-09 | Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark | Yunzhuo Hao et.al. | 2501.05444 | null |
2025-01-09 | A survey of textual cyber abuse detection using cutting-edge language models and large language models | Jose A. Diaz-Garcia et.al. | 2501.05443 | null |
2025-01-09 | Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation | Xuyi Meng et.al. | 2501.05427 | null |
2025-01-09 | Using LLMs to Infer Non-Binary COVID-19 Sentiments of Chinese Micro-bloggers | Jerry Chongyi Hu et.al. | 2501.05423 | null |
2025-01-09 | Seeing Sound: Assembling Sounds from Visuals for Audio-to-Image Generation | Darius Petermann et.al. | 2501.05413 | null |
2025-01-10 | Atlas: A Novel Pathology Foundation Model by Mayo Clinic, Charité, and Aignostics | Maximilian Alber et.al. | 2501.05409 | null |
2025-01-09 | TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts | Yu-Hao Huang et.al. | 2501.05403 | null |
2025-01-09 | Mechanistic understanding and validation of large AI models with SemanticLens | Maximilian Dreyer et.al. | 2501.05398 | null |
2025-01-09 | FairCode: Evaluating Social Bias of LLMs in Code Generation | Yongkang Du et.al. | 2501.05396 | link |
2025-01-09 | Large Physics Models: Towards a collaborative approach with Large Language Models and Foundation Models | Kristian G. Barman et.al. | 2501.05382 | null |
2025-01-09 | Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance | Dimitrios Gerogiannis et.al. | 2501.05379 | null |
2025-01-09 | Accelerated Diffusion Models via Speculative Sampling | Valentin De Bortoli et.al. | 2501.05370 | null |
2025-01-09 | Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction | Hantao Lou et.al. | 2501.05336 | link |
2025-01-09 | "What's Happening"- A Human-centered Multimodal Interpreter Explaining the Actions of Autonomous Vehicles | Xuewen Luo et.al. | 2501.05322 | null |
2025-01-09 | Comparison Study: Glacier Calving Front Delineation in Synthetic Aperture Radar Images With Deep Learning | Nora Gourmelon et.al. | 2501.05281 | link |
2025-01-09 | CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models | Fabian Hörst et.al. | 2501.05269 | link |
2025-01-09 | Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal | Wanli Ma et.al. | 2501.05265 | null |
2025-01-09 | CallNavi: A Study and Challenge on Function Calling Routing and Invocation in Large Language Models | Yewei Song et.al. | 2501.05255 | null |
2025-01-09 | From Scientific Texts to Verifiable Code: Automating the Process with Transformers | Changjie Wang et.al. | 2501.05252 | null |
2025-01-09 | RAG-WM: An Efficient Black-Box Watermarking Approach for Retrieval-Augmented Generation of Large Language Models | Peizhuo Lv et.al. | 2501.05249 | null |
2025-01-09 | Deriving Coding-Specific Sub-Models from LLMs using Resource-Efficient Pruning | Laura Puccioni et.al. | 2501.05248 | null |
2025-01-09 | Online Prompt and Solver Selection for Program Synthesis | Yixuan Li et.al. | 2501.05247 | null |
2025-01-09 | Optimizing Estonian TV Subtitles with Semi-supervised Learning and LLMs | Artem Fedorchenko et.al. | 2501.05234 | null |
2025-01-09 | Harnessing Large Language and Vision-Language Models for Robust Out-of-Distribution Detection | Pei-Kang Lee et.al. | 2501.05228 | null |
2025-01-09 | Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes | Ludwic Leonard et.al. | 2501.05226 | null |
2025-01-09 | Leveraging Large Language Models for Zero-shot Lay Summarisation in Biomedicine and Beyond | Tomas Goldsack et.al. | 2501.05224 | null |
2025-01-09 | A Novel Approach to Scalable and Automatic Topic-Controlled Question Generation in Education | Ziqing Li et.al. | 2501.05220 | null |
2025-01-09 | Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration | Xuyang Liu et.al. | 2501.05179 | link |
2025-01-09 | Emergence of human-like polarization among large language model agents | Jinghua Piao et.al. | 2501.05171 | null |
2025-01-09 | Bringing Order Amidst Chaos: On the Role of Artificial Intelligence in Secure Software Engineering | Matteo Esposito et.al. | 2501.05165 | null |
2025-01-09 | Biomedical Relation Extraction via Adaptive Document-Relation Cross-Mapping and Concept Unique Identifier | Yufei Shang et.al. | 2501.05155 | null |
2025-01-09 | DriVLM: Domain Adaptation of Vision-Language Models in Autonomous Driving | Xuran Zheng et.al. | 2501.05081 | null |
2025-01-09 | Multimodal-to-Text Prompt Engineering in Large Language Models Using Feature Embeddings for GNSS Interference Characterization | Harshith Manjunath et.al. | 2501.05079 | null |
2025-01-09 | Analyzing Memorization in Large Language Models through the Lens of Model Attribution | Tarun Ram Menta et.al. | 2501.05078 | link |
2025-01-09 | A Text-Based Knowledge-Embedded Soft Sensing Modeling Approach for General Industrial Process Tasks Based on Large Language Model | Shuo Tong et.al. | 2501.05075 | null |
2025-01-09 | Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning | Huabin Liu et.al. | 2501.05069 | null |
2025-01-09 | LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding | Jiaxing Zhao et.al. | 2501.05067 | null |
2025-01-09 | Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators | Leonard Lupin-Jimenez et.al. | 2501.05058 | null |
2025-01-09 | LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models | Zengqi Peng et.al. | 2501.05057 | null |
2025-01-09 | On the Generalizability of Transformer Models to Code Completions of Different Lengths | Nathan Cooper et.al. | 2501.05051 | null |
2025-01-09 | SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution | Chengxing Xie et.al. | 2501.05040 | link |
2025-01-09 | Enhancing Human-Like Responses in Large Language Models | Ethem Yağız Çalık et.al. | 2501.05032 | null |
2025-01-09 | ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Ronghao Dang et.al. | 2501.05031 | link |
2025-01-09 | A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications | Ofir Marom et.al. | 2501.05030 | null |
2025-01-09 | TreeKV: Smooth Key-Value Cache Compression with Tree Structures | Ziwei He et.al. | 2501.04987 | null |
2025-01-09 | SpaLLM-Guard: Pairing SMS Spam Detection Using Open-source and Commercial LLMs | Muhammad Salman et.al. | 2501.04985 | null |
2025-01-09 | V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer | Hangzhou He et.al. | 2501.04975 | link |
2025-01-09 | Demystifying Domain-adaptive Post-training for Financial LLMs | Zixuan Ke et.al. | 2501.04961 | link |
2025-01-09 | Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments | Yifan Xu et.al. | 2501.04947 | null |
2025-01-09 | Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models | Qingyu Ren et.al. | 2501.04945 | link |
2025-01-09 | Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency | Shiji Zhao et.al. | 2501.04931 | null |
2025-01-09 | Investigating Numerical Translation with Large Language Models | Wei Tang et.al. | 2501.04927 | null |
2025-01-09 | FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching | Jun-Hak Yun et.al. | 2501.04926 | null |
2025-01-09 | HaVen: Hallucination-Mitigated LLM for Verilog Code Generation Aligned with HDL Engineers | Yiyao Yang et.al. | 2501.04908 | link |
2025-01-09 | JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis | Jun-Hyeok Cha et.al. | 2501.04904 | null |
2025-01-09 | ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries | Keke Huang et.al. | 2501.04901 | null |
2025-01-09 | SUGAR: Leveraging Contextual Confidence for Smarter Retrieval | Hanna Zubkova et.al. | 2501.04899 | null |
2025-01-08 | Leveraging Log Probabilities in Language Models to Forecast Future Events | Tommaso Soru et.al. | 2501.04880 | null |
2025-01-08 | Real-Time Textless Dialogue Generation | Long Mai et.al. | 2501.04877 | link |
2025-01-08 | Modelling complex proton transport phenomena -- Exploring the limits of fine-tuning and transferability of foundational machine-learned force fields | Malte Grunert et.al. | 2501.04876 | null |
2025-01-08 | Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware | Brandon J Walton et.al. | 2501.04848 | null |
2025-01-08 | Do Code LLMs Understand Design Patterns? | Zhenyu Pan et.al. | 2501.04835 | null |
2025-01-08 | On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability | Andreas Vogelsang et.al. | 2501.04810 | null |
2025-01-08 | IQPopt: Fast optimization of instantaneous quantum polynomial circuits in JAX | Erik Recio-Armengol et.al. | 2501.04776 | link |
2025-01-08 | Efficient and Responsible Adaptation of Large Language Models for Robust and Equitable Top-k Recommendations | Kirandeep Kaur et.al. | 2501.04762 | null |
2025-01-08 | Improving Human-Robot Teaching by Quantifying and Reducing Mental Model Mismatch | Phillip Richter et.al. | 2501.04755 | null |
2025-01-08 | EditAR: Unified Conditional Generation with Autoregressive Models | Jiteng Mu et.al. | 2501.04699 | null |
2025-01-08 | Re-ranking the Context for Multimodal Retrieval Augmented Generation | Matin Mortaheb et.al. | 2501.04695 | null |
2025-01-08 | SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images | Zixuan Huang et.al. | 2501.04689 | null |
2025-01-08 | URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics | Ruilin Luo et.al. | 2501.04686 | link |
2025-01-08 | Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations | Archita Srivastava et.al. | 2501.04675 | null |
2025-01-08 | Assessing Language Comprehension in Large Language Models Using Construction Grammar | Wesley Scivetti et.al. | 2501.04661 | null |
2025-01-08 | Multi-task retriever fine-tuning for domain-specific and efficient RAG | Patrice Béchard et.al. | 2501.04652 | null |
2025-01-08 | FlairGPT: Repurposing LLMs for Interior Designs | Gabrielle Littlefair et.al. | 2501.04648 | null |
2025-01-08 | Knowledge Retrieval Based on Generative AI | Te-Lun Yang et.al. | 2501.04635 | null |
2025-01-08 | "Can you be my mum?": Manipulating Social Robots in the Large Language Models Era | Giulio Antonio Abbo et.al. | 2501.04633 | null |
2025-01-09 | MedCoDi-M: A Multi-Prompt Foundation Model for Multimodal Medical Data Generation | Daniele Molino et.al. | 2501.04614 | null |
2025-01-08 | Quantum-inspired Embeddings Projection and Similarity Metrics for Representation Learning | Ivan Kankeu et.al. | 2501.04591 | link |
2025-01-08 | Boosting Salient Object Detection with Knowledge Distillated from Large Foundation Models | Miaoyang He et.al. | 2501.04582 | null |
2025-01-08 | InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection | Yuhang Liu et.al. | 2501.04575 | link |
2025-01-09 | OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis | Run Luo et.al. | 2501.04561 | link |
2025-01-08 | The Impostor is Among Us: Can Large Language Models Capture the Complexity of Human Personas? | Christopher Lazik et.al. | 2501.04543 | null |
2025-01-08 | Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time | Uri Berger et.al. | 2501.04513 | null |
2025-01-08 | CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection | Ruijun Feng et.al. | 2501.04510 | null |
2025-01-08 | Integrating remote sensing data assimilation, deep learning and large language model for interactive wheat breeding yield prediction | Guofeng Yang et.al. | 2501.04487 | null |
2025-01-08 | When LLMs Struggle: Reference-less Translation Evaluation for Low-resource Languages | Archchana Sindhujan et.al. | 2501.04473 | null |
2025-01-08 | Hidden Entity Detection from GitHub Leveraging Large Language Models | Lu Gan et.al. | 2501.04455 | link |
2025-01-08 | Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions | Doaa Mahmud et.al. | 2501.04437 | null |
2025-01-08 | Federated Fine-Tuning of LLMs: Framework Comparison and Research Directions | Na Yan et.al. | 2501.04436 | null |
2025-01-08 | End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | H. M. Shadman Tabib et.al. | 2501.04425 | null |
2025-01-08 | SEO: Stochastic Experience Optimization for Large Language Models | Jitao Xu et.al. | 2501.04393 | null |
2025-01-08 | iFADIT: Invertible Face Anonymization via Disentangled Identity Transform | Lin Yuan et.al. | 2501.04390 | null |
2025-01-08 | DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications | Feng Liu et.al. | 2501.04366 | link |
2025-01-08 | Understanding Before Reasoning: Enhancing Chain-of-Thought with Iterative Summarization Pre-Prompting | Dong-Hai Zhu et.al. | 2501.04341 | link |
2025-01-09 | Navigating the Designs of Privacy-Preserving Fine-tuning for Large Language Models | Haonan Shi et.al. | 2501.04323 | null |
2025-01-08 | Who Does the Giant Number Pile Like Best: Analyzing Fairness in Hiring Contexts | Preethi Seshadri et.al. | 2501.04316 | link |
2025-01-08 | RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation | Jun Liu et.al. | 2501.04315 | null |
2025-01-08 | Your Fix Is My Exploit: Enabling Comprehensive DL Library API Fuzzing with Large Language Models | Kunpeng Zhang et.al. | 2501.04312 | null |
2025-01-08 | LLM4SR: A Survey on Large Language Models for Scientific Research | Ziming Luo et.al. | 2501.04306 | link |
2025-01-08 | Multimodal Graph Constrastive Learning and Prompt for ChartQA | Yue Dai et.al. | 2501.04303 | null |
2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | null |
2025-01-08 | An Analysis of Model Robustness across Concurrent Distribution Shifts | Myeongho Jeon et.al. | 2501.04288 | null |
2025-01-08 | Mapping the Edge of Chaos: Fractal-Like Boundaries in The Trainability of Decoder-Only Transformer Models | Bahman Torkamandi et.al. | 2501.04286 | null |
2025-01-08 | Separate Source Channel Coding Is Still What You Need: An LLM-based Rethinking | Tianqi Ren et.al. | 2501.04285 | null |
2025-01-08 | OpenIN: Open-Vocabulary Instance-Oriented Navigation in Dynamic Domestic Environments | Yujie Tang et.al. | 2501.04279 | null |
2025-01-08 | Exploring the Expertise of Large Language Models in Materials Science and Metallurgical Engineering | Christophe Bajan et.al. | 2501.04277 | link |
2025-01-08 | Robotic Programmer: Video Instructed Policy Code Generation for Robotic Manipulation | Senwei Xie et.al. | 2501.04268 | null |
2025-01-08 | Scaling Large Language Model Training on Frontier with Low-Bandwidth Partitioning | Lang Xu et.al. | 2501.04266 | null |
2025-01-08 | IOLBENCH: Benchmarking LLMs on Linguistic Reasoning | Satyam Goyal et.al. | 2501.04249 | link |
2025-01-08 | TransientVerse: A Comprehensive Real-Time Alert and Multi-Wavelength Analysis System for Transient Astronomical Events | Jian-Hua Fang et.al. | 2501.04247 | null |
2025-01-08 | Statistical Uncertainty Quantification for Aggregate Performance Metrics in Machine Learning Benchmarks | Rachel Longjohn et.al. | 2501.04234 | null |
2025-01-07 | Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation | Alireza Salemi et.al. | 2501.04167 | null |
2025-01-07 | AdaptiveCoPilot: Design and Testing of a NeuroAdaptive LLM Cockpit Guidance System in both Novice and Expert Pilots | Shaoyue Wen et.al. | 2501.04156 | link |
2025-01-07 | Multilingual Open QA on the MIA Shared Task | Navya Yarrabelly et.al. | 2501.04153 | null |
2025-01-07 | The angular momentum spiral of the Milky Way disc in Gaia | Rashid Yaaqib et.al. | 2501.04095 | null |
2025-01-07 | More is not always better? Enhancing Many-Shot In-Context Learning with Differentiated and Reweighting Objectives | Xiaoqing Zhang et.al. | 2501.04070 | link |
2025-01-07 | ChronoLLM: A Framework for Customizing Large Language Model for Digital Twins generalization based on PyChrono | Jingquan Wang et.al. | 2501.04062 | null |
2025-01-07 | LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving | Lingdong Kong et.al. | 2501.04005 | null |
2025-01-07 | Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Haobo Yuan et.al. | 2501.04001 | link |
2025-01-07 | RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance | Matin Mortaheb et.al. | 2501.03995 | null |
2025-01-07 | Synthetic Data for Portfolios: A Throw of the Dice Will Never Abolish Chance | Adil Rengim Cetingoz et.al. | 2501.03993 | null |
2025-01-07 | Influences on LLM Calibration: A Study of Response Agreement, Loss Functions, and Prompt Styles | Yuxi Xia et.al. | 2501.03991 | null |
2025-01-07 | (De)-Indexing and the Right to be Forgotten | Salvatore Vilella et.al. | 2501.03989 | null |
2025-01-07 | VLM-driven Behavior Tree for Context-aware Task Planning | Naoki Wake et.al. | 2501.03968 | link |
2025-01-07 | Vision Language Models as Values Detectors | Giulio Antonio Abbo et.al. | 2501.03957 | null |
2025-01-07 | Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States | Jurgita Kapočiūtė-Dzikienė et.al. | 2501.03952 | null |
2025-01-07 | Synthetic Data Privacy Metrics | Amy Steier et.al. | 2501.03941 | null |
2025-01-07 | Not all tokens are created equal: Perplexity Attention Weighted Networks for AI generated text detection | Pablo Miralles-González et.al. | 2501.03940 | null |
2025-01-07 | A precise asymptotic analysis of learning diffusion models: theory and insights | Hugo Cui et.al. | 2501.03937 | link |
2025-01-07 | Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study | Ramya Jonnala et.al. | 2501.03904 | null |
2025-01-07 | LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token | Shaolei Zhang et.al. | 2501.03895 | link |
2025-01-07 | AlphaPO -- Reward shape matters for LLM alignment | Aman Gupta et.al. | 2501.03884 | null |
2025-01-07 | CL3DOR: Contrastive Learning for 3D Large Multimodal Models via Odds Ratio on High-Resolution Point Clouds | Keonwoo Kim et.al. | 2501.03879 | null |
2025-01-07 | Progressive Document-level Text Simplification via Large Language Models | Dengzhao Fang et.al. | 2501.03857 | null |
2025-01-07 | MedFocusCLIP: Improving few shot classification in medical datasets using pixel wise attention | Aadya Arora et.al. | 2501.03839 | null |
2025-01-07 | Deep Sylvester Posterior Inference for Adaptive Compressed Sensing in Ultrasound Imaging | Simon W. Penninga et.al. | 2501.03825 | null |
2025-01-08 | MADation: Face Morphing Attack Detection with Foundation Models | Eduarda Caldeira et.al. | 2501.03800 | link |
2025-01-07 | KAnoCLIP: Zero-Shot Anomaly Detection through Knowledge-Driven Prompt Learning and Enhanced Cross-Modal Integration | Chengyuan Li et.al. | 2501.03786 | null |
2025-01-07 | Context-Alignment: Activating and Enhancing LLM Capabilities in Time Series | Yuxiao Hu et.al. | 2501.03747 | null |
2025-01-07 | Self-adaptive vision-language model for 3D segmentation of pulmonary artery and vein | Xiaotong Guo et.al. | 2501.03722 | null |
2025-01-07 | Motion-Aware Generative Frame Interpolation | Guozhen Zhang et.al. | 2501.03699 | null |
2025-01-07 | SLAM: Towards Efficient Multilingual Reasoning via Selective Language Alignment | Yuchun Fan et.al. | 2501.03681 | link |
2025-01-07 | Effective and Efficient Mixed Precision Quantization of Speech Foundation Models | Haoning Xu et.al. | 2501.03643 | null |
2025-01-07 | CommitShield: Tracking Vulnerability Introduction and Fix in Version Control Systems | Zhaonan Wu et.al. | 2501.03626 | link |
2025-01-07 | LlaMADRS: Prompting Large Language Models for Interview-Based Depression Assessment | Gaoussou Youssouf Kebe et.al. | 2501.03624 | null |
2025-01-07 | Cosmos World Foundation Model Platform for Physical AI | NVIDIA et.al. | 2501.03575 | link |
2025-01-07 | From Code to Compliance: Assessing ChatGPT's Utility in Designing an Accessible Webpage -- A Case Study | Ammar Ahmed et.al. | 2501.03572 | null |
2025-01-07 | What Does a Software Engineer Look Like? Exploring Societal Stereotypes in LLMs | Muneera Bano et.al. | 2501.03569 | null |
2025-01-07 | Applying Large Language Models in Knowledge Graph-based Enterprise Modeling: Challenges and Opportunities | Benedikt Reitemeyer et.al. | 2501.03566 | null |
2025-01-07 | Bridged Semantic Alignment for Zero-shot 3D Medical Image Diagnosis | Haoran Lai et.al. | 2501.03565 | null |
2025-01-07 | PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models | Lingzhi Yuan et.al. | 2501.03544 | null |
2025-01-07 | Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions | Weijieying Ren et.al. | 2501.03540 | null |
2025-01-07 | Deep Learning for Pathological Speech: A Survey | Shakeel A. Sheikh et.al. | 2501.03536 | null |
2025-01-08 | SenseRAG: Constructing Environmental Knowledge Bases with Proactive Querying for LLM-Based Autonomous Driving | Xuewen Luo et.al. | 2501.03535 | null |
2025-01-07 | A generative approach for lensless imaging in low-light conditions | Ziyang Liu et.al. | 2501.03511 | null |
2025-01-07 | A Sequential Optimal Learning Approach to Automated Prompt Engineering in Large Language Models | Shuyang Wang et.al. | 2501.03508 | null |
2025-01-07 | Textualize Visual Prompt for Image Editing via Diffusion Bridge | Pengcheng Xu et.al. | 2501.03495 | null |
2025-01-07 | Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment | Prashant Trivedi et.al. | 2501.03486 | null |
2025-01-07 | Reading with Intent -- Neutralizing Intent | Benjamin Reichman et.al. | 2501.03475 | null |
2025-01-07 | Information-Maximized Soft Variable Discretization for Self-Supervised Image Representation Learning | Chuang Niu et.al. | 2501.03469 | link |
2025-01-07 | MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems | Yannis Katsis et.al. | 2501.03468 | link |
2025-01-07 | ISSR: Iterative Selection with Self-Review for Vocabulary Test Distractor Generation | Yu-Cheng Liu et.al. | 2501.03462 | null |
2025-01-07 | Activating Associative Disease-Aware Vision Token Memory for LLM-Based X-ray Report Generation | Xiao Wang et.al. | 2501.03458 | link |
2025-01-07 | CoReQA: Uncovering Potentials of Language Models in Code Repository Question Answering | Jialiang Chen et.al. | 2501.03447 | null |
2025-01-07 | LLM4CVE: Enabling Iterative Automated Vulnerability Repair with Large Language Models | Mohamad Fakih et.al. | 2501.03446 | null |
2025-01-07 | Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology | Sarah E. Finch et.al. | 2501.03441 | link |
2025-01-06 | SALT: Sales Autocompletion Linked Business Tables Dataset | Tassilo Klein et.al. | 2501.03413 | link |
2025-01-06 | BoundingDocs: a Unified Dataset for Document Question Answering with Spatial Annotations | Simone Giovannini et.al. | 2501.03403 | null |
2025-01-06 | DoubleDiffusion: Combining Heat Diffusion with Denoising Diffusion for Generative Learning on 3D Meshes | Xuyang Wang et.al. | 2501.03397 | link |
2025-01-06 | Evolved Quantum Boltzmann Machines | Michele Minervini et.al. | 2501.03367 | null |
2025-01-06 | CM3T: Framework for Efficient Multimodal Learning for Inhomogeneous Interaction Datasets | Tanay Agrawal et.al. | 2501.03332 | null |
2025-01-06 | LiLMaps: Learnable Implicit Language Maps | Evgenii Kruzhkov et.al. | 2501.03304 | null |
2025-01-06 | A Soft Sensor Method with Uncertainty-Awareness and Self-Explanation Based on Large Language Models Enhanced by Domain Knowledge Retrieval | Shuo Tong et.al. | 2501.03295 | null |
2025-01-06 | Multi-Modal One-Shot Federated Ensemble Learning for Medical Data with Vision Large Language Model | Naibo Wang et.al. | 2501.03292 | null |
2025-01-06 | ADePT: Adaptive Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning | Pengwei Tang et.al. | 2501.03291 | null |
2025-01-06 | CodeVision: Detecting LLM-Generated Code Using 2D Token Probability Maps and Vision Models | Zhenyu Xu et.al. | 2501.03288 | null |
2025-01-06 | BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning | Beichen Zhang et.al. | 2501.03226 | link |
2025-01-06 | Leveraging Explainable AI for LLM Text Attribution: Differentiating Human-Written and Multiple LLMs-Generated Text | Ayat Najjar et.al. | 2501.03212 | null |
2025-01-06 | Detecting AI-Generated Text in Educational Content: Leveraging Machine Learning and Explainable AI for Academic Integrity | Ayat A. Najjar et.al. | 2501.03203 | null |
2025-01-06 | CLIX: Cross-Lingual Explanations of Idiomatic Expressions | Aaron Gluck et.al. | 2501.03191 | null |
2025-01-06 | Semantic Captioning: Benchmark Dataset and Graph-Aware Few-Shot In-Context Learning for SQL2Text | Ali Al-Lawati et.al. | 2501.03166 | link |
2025-01-06 | Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy | Risha Goel et.al. | 2501.03153 | link |
2025-01-06 | Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches | Alhassan Mumuni et.al. | 2501.03151 | null |
2025-01-06 | VicSim: Enhancing Victim Simulation with Emotional and Linguistic Fidelity | Yerong Li et.al. | 2501.03139 | null |
2025-01-07 | PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Mingyang Song et.al. | 2501.03124 | link |
2025-01-06 | CAT: Content-Adaptive Image Tokenization | Junhong Shen et.al. | 2501.03120 | null |
2025-01-06 | LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases | Dylan Bouchard et.al. | 2501.03112 | link |
2025-01-06 | Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling | Aseem Srivastava et.al. | 2501.03088 | null |
2025-01-06 | Retrieval-Augmented TLAPS Proof Generation with Large Language Models | Yuhao Zhou et.al. | 2501.03073 | null |
2025-01-06 | ChronoSense: Exploring Temporal Understanding in Large Language Models with Time Intervals of Events | Duygu Sezen Islakoglu et.al. | 2501.03040 | null |
2025-01-06 | Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | Zhen Li et.al. | 2501.03035 | null |
2025-01-06 | TransPixar: Advancing Text-to-Video Generation with Transparency | Luozhou Wang et.al. | 2501.03006 | link |
2025-01-06 | CALM: Curiosity-Driven Auditing for Large Language Models | Xiang Zheng et.al. | 2501.02997 | link |
2025-01-06 | Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation | Zhi Qu et.al. | 2501.02979 | link |
2025-01-06 | FlipedRAG: Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models | Zhuo Chen et.al. | 2501.02968 | null |
2025-01-07 | Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the Wild | Wanpeng Hu et.al. | 2501.02964 | link |
2025-01-07 | SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jiawei Liu et.al. | 2501.02962 | null |
2025-01-06 | The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple Features | Shi Bin Hoo et.al. | 2501.02945 | link |
2025-01-07 | Inhibition of bacterial growth by antibiotics | Barnabe Ledoux et.al. | 2501.02944 | null |
2025-01-06 | Deep Generative Model-Aided Power System Dynamic State Estimation and Reconstruction with Unknown Control Inputs or Data Distributions | Jianhua Pei et.al. | 2501.02928 | null |
2025-01-06 | DeCon: Detecting Incorrect Assertions via Postconditions Generated by a Large Language Model | Hao Yu et.al. | 2501.02901 | link |
2025-01-06 | FoundPAD: Foundation Models Reloaded for Face Presentation Attack Detection | Guray Ozgur et.al. | 2501.02892 | link |
2025-01-06 | MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs | Hui Sun et.al. | 2501.02885 | null |
2025-01-06 | IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment | Yiming Zhang et.al. | 2501.02869 | null |
2025-01-06 | Large Language Models for Video Surveillance Applications | Ulindu De Silva et.al. | 2501.02850 | null |
2025-01-06 | Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification | Yubo Wang et.al. | 2501.02844 | null |
2025-01-06 | Foundations of GenIR | Qingyao Ai et.al. | 2501.02842 | null |
2025-01-06 | An Infrastructure Software Perspective Toward Computation Offloading between Executable Specifications and Foundation Models | Dezhi Ran et.al. | 2501.02829 | null |
2025-01-06 | InfiFusion: A Unified Framework for Enhanced Cross-Model Reasoning via LLM Fusion | Zhaoyi Yan et.al. | 2501.02795 | null |
2025-01-06 | CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation | Yuanhong Chen et.al. | 2501.02786 | null |
2025-01-06 | GeAR: Generation Augmented Retrieval | Haoyu Liu et.al. | 2501.02772 | null |
2025-01-06 | Visual Large Language Models for Generalized and Specialized Applications | Yifan Li et.al. | 2501.02765 | link |
2025-01-06 | Ultrasound-QBench: Can LLMs Aid in Quality Assessment of Ultrasound Imaging? | Hongyi Miao et.al. | 2501.02751 | null |
2025-01-06 | Artificial Intelligence in Creative Industries: Advances Prior to 2025 | Nantheera Anantrasirichai et.al. | 2501.02725 | null |
2025-01-06 | KG-CF: Knowledge Graph Completion with Context Filtering under the Guidance of Large Language Models | Zaiyi Zheng et.al. | 2501.02711 | null |
2025-01-06 | QuIM-RAG: Advancing Retrieval-Augmented Generation with Inverted Question Matching for Enhanced QA Performance | Binita Saha et.al. | 2501.02702 | null |
2025-01-06 | EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models | Andrés Villa et.al. | 2501.02699 | null |
2025-01-05 | GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Weikang Bian et.al. | 2501.02690 | null |
2025-01-05 | Decoding specialised feature neurons in LLMs with the final projection layer | Harry J Davies et.al. | 2501.02688 | null |
2025-01-05 | From thermodynamics to protein design: Diffusion models for biomolecule generation towards autonomous protein engineering | Wen-ran Li et.al. | 2501.02680 | null |
2025-01-05 | A New Interpretation of the Certainty-Equivalence Approach for PAC Reinforcement Learning with a Generative Model | Shivaram Kalyanakrishnan et.al. | 2501.02652 | null |
2025-01-05 | Representation Learning of Lab Values via Masked AutoEncoder | David Restrepo et.al. | 2501.02648 | link |
2025-01-05 | Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense | Yang Ouyang et.al. | 2501.02629 | link |
2025-01-05 | Cracks in The Stack: Hidden Vulnerabilities and Licensing Risks in LLM Pre-Training Datasets | Mahmoud Jahanshahi et.al. | 2501.02628 | null |
2025-01-05 | HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning | Saleh Ashkboos et.al. | 2501.02625 | null |
2025-01-05 | LLMs Help Alleviate the Cross-Subject Variability in Brain Signal and Language Alignment | Yifei Liu et.al. | 2501.02621 | null |
2025-01-05 | TAPAS: Thermal- and Power-Aware Scheduling for LLM Inference in Cloud Platforms | Jovan Stojkovic et.al. | 2501.02600 | null |
2025-01-05 | LeetDecoding: A PyTorch Library for Exponentially Decaying Causal Linear Attention with CUDA Implementations | Jiaping Wang et.al. | 2501.02573 | link |
2025-01-05 | Multi-LLM Collaborative Caption Generation in Scientific Documents | Jaeyoung Kim et.al. | 2501.02552 | link |
2025-01-05 | Transformers Simulate MLE for Sequence Generation in Bayesian Networks | Yuan Cao et.al. | 2501.02547 | null |
2025-01-05 | Evaluating Large Language Models Against Human Annotators in Latent Content Analysis: Sentiment, Political Leaning, Emotional Intensity, and Sarcasm | Ljubisa Bojic et.al. | 2501.02532 | null |
2025-01-05 | Towards New Benchmark for AI Alignment & Sentiment Analysis in Socially Important Issues: A Comparative Study of Human and LLMs in the Context of AGI | Ljubisa Bojic et.al. | 2501.02531 | null |
2025-01-05 | Vision-Driven Prompt Optimization for Large Language Models in Multimodal Generative Tasks | Leo Franklin et.al. | 2501.02527 | null |
2025-01-05 | Unified Guidance for Geometry-Conditioned Molecular Generation | Sirine Ayadi et.al. | 2501.02526 | null |
2025-01-05 | Layout2Scene: 3D Semantic Layout Guided Scene Generation via Geometry and Appearance Diffusion Priors | Minglin Chen et.al. | 2501.02519 | null |
2025-01-05 | CHAIR-Classifier of Hallucination as Improver | Ao Sun et.al. | 2501.02518 | link |
2025-01-05 | ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use | Junjie Ye et.al. | 2501.02506 | null |
2025-01-05 | Learning when to rank: Estimation of partial rankings from sparse, noisy comparisons | Sebastian Morel-Balbi et.al. | 2501.02505 | null |
2025-01-05 | ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling | Chaojie Mao et.al. | 2501.02487 | null |
2025-01-05 | LLMPC: Large Language Model Predictive Control | Gabriel Maher et.al. | 2501.02486 | link |
2025-01-05 | Decoding News Bias: Multi Bias Detection in News Articles | Bhushan Santosh Shah et.al. | 2501.02482 | null |
2025-01-05 | Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine | Yishen Liu et.al. | 2501.02471 | null |
2025-01-05 | Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera | Yuliang Guo et.al. | 2501.02464 | null |
2025-01-05 | Towards Omni-RAG: Comprehensive Retrieval-Augmented Generation for Large Language Models in Medical Applications | Zhe Chen et.al. | 2501.02460 | null |
2025-01-05 | Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | Hyunwoo Ko et.al. | 2501.02448 | null |
2025-01-05 | RTLMarker: Protecting LLM-Generated RTL Copyright via a Hardware Watermarking Framework | Kun Wang et.al. | 2501.02446 | null |
2025-01-05 | A Statistical Hypothesis Testing Framework for Data Misappropriation Detection in Large Language Models | Yinpeng Cai et.al. | 2501.02441 | null |
2025-01-05 | Efficient Deployment of Large Language Models on Resource-constrained Devices | Zhiwei Yao et.al. | 2501.02438 | null |
2025-01-05 | FOLDER: Accelerating Multi-modal Large Language Models with Enhanced Performance | Haicheng Wang et.al. | 2501.02430 | link |
2025-01-05 | GenTREC: The First Test Collection Generated by Large Language Models for Evaluating Information Retrieval Systems | Mehmet Deniz Türkmen et.al. | 2501.02408 | null |
2025-01-04 | Who Wrote This? Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration Inequalities | Tara Radvand et.al. | 2501.02406 | null |
2025-01-04 | Graph-Aware Isomorphic Attention for Adaptive Dynamics in Transformers | Markus J. Buehler et.al. | 2501.02393 | link |
2025-01-04 | Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations | Kangyu Zhu et.al. | 2501.02385 | null |
2025-01-04 | Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison | Tsz Kin Lam et.al. | 2501.02370 | null |
2025-01-04 | Thinking with Many Minds: Using Large Language Models for Multi-Perspective Problem-Solving | Sanghyun Park et.al. | 2501.02348 | null |
2025-01-04 | Exploring the Capabilities and Limitations of Large Language Models for Radiation Oncology Decision Support | Florian Putz et.al. | 2501.02346 | null |
2025-01-04 | UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility | Yonglin Tian et.al. | 2501.02341 | link |
2025-01-04 | AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference | Zhuomin He et.al. | 2501.02336 | link |
2025-01-04 | Validity Arguments For Constructed Response Scoring Using Generative Artificial Intelligence Applications | Jodi M. Casabianca et.al. | 2501.02334 | null |
2025-01-04 | Beyond Log-Concavity and Score Regularity: Improved Convergence Bounds for Score-Based Generative Models in W2-distance | Marta Gentiloni-Silveri et.al. | 2501.02298 | null |
2025-01-04 | Explicit vs. Implicit: Investigating Social Bias in Large Language Models through Self-Reflection | Yachao Zhao et.al. | 2501.02295 | null |
2025-01-04 | Digital Deep Joint Source-Channel Coding with Blind Training for Adaptive Modulation and Power Control | Yongjeong Oh et.al. | 2501.02273 | null |
2025-01-04 | What Kind of Visual Tokens Do We Need? Training-free Visual Token Pruning for Multi-modal Large Language Models from the Perspective of Graph | Yutao Jiang et.al. | 2501.02268 | link |
2025-01-04 | Unsupervised Class Generation to Expand Semantic Segmentation Datasets | Javier Montalvo et.al. | 2501.02264 | null |
2025-01-04 | Financial Named Entity Recognition: How Far Can LLM Go? | Yi-Te Lu et.al. | 2501.02237 | link |
2025-01-04 | Survey on Question Answering over Visually Rich Documents: Methods, Challenges, and Trends | Camille Barboule et.al. | 2501.02235 | null |
2025-01-04 | Leveraging Large Language Models and Machine Learning for Smart Contract Vulnerability Detection | S M Mostaq Hossain et.al. | 2501.02229 | null |
2025-01-04 | Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation | Shijie Wang et.al. | 2501.02226 | null |
2025-01-04 | Can ChatGPT implement finite element models for geotechnical engineering applications? | Taegu Kim et.al. | 2501.02199 | null |
2025-01-04 | EvoPath: Evolutionary Meta-path Discovery with Large Language Models for Complex Heterogeneous Information Networks | Shixuan Liu et.al. | 2501.02192 | null |
2025-01-04 | On LLM-Enhanced Mixed-Type Data Imputation with High-Order Message Passing | Jianwei Wang et.al. | 2501.02191 | link |
2025-01-04 | The Application of Large Language Models in Recommendation Systems | Peiyang Yu et.al. | 2501.02178 | null |
2025-01-04 | The Efficiency vs. Accuracy Trade-off: Optimizing RAG-Enhanced LLM Recommender Systems Using Multi-Head Early Exit | Huixue Zhou et.al. | 2501.02173 | null |
2025-01-04 | Personalized Graph-Based Retrieval for Large Language Models | Steven Au et.al. | 2501.02157 | link |
2025-01-04 | Table as Thought: Exploring Structured Thoughts in LLM Reasoning | Zhenjie Sun et.al. | 2501.02152 | null |
2025-01-04 | Plasma-CycleGAN: Plasma Biomarker-Guided MRI to PET Cross-modality Translation Using Conditional CycleGAN | Yanxi Chen et.al. | 2501.02146 | null |
2025-01-03 | VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction | Chaoyou Fu et.al. | 2501.01957 | link |
2025-01-03 | Metadata Conditioning Accelerates Language Model Pre-training | Tianyu Gao et.al. | 2501.01956 | link |
2025-01-03 | MADGEN -- Mass-Spec attends to De Novo Molecular generation | Yinkai Wang et.al. | 2501.01950 | null |
2025-01-03 | Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap | Weizhi Zhang et.al. | 2501.01945 | link |
2025-01-03 | Bridging Classification and Segmentation in Osteosarcoma Assessment via Foundation and Discrete Diffusion Models | Manh Duong Nguyen et.al. | 2501.01932 | link |
2025-01-03 | Virgo: A Preliminary Exploration on Reproducing o1-like MLLM | Yifan Du et.al. | 2501.01904 | link |
2025-01-03 | EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation | Siyuan Huang et.al. | 2501.01895 | null |
2025-01-03 | Turning Logic Against Itself: Probing Model Defenses Through Contrastive Questions | Rachneet Sachdeva et.al. | 2501.01872 | link |
2025-01-03 | Multi-Agent Conversational Online Learning for Adaptive LLM Response Identification | Xiangxiang Dai et.al. | 2501.01849 | link |
2025-01-03 | MoColl: Agent-Based Specific and General Model Collaboration for Image Captioning | Pu Yang et.al. | 2501.01834 | null |
2025-01-03 | Time Series Language Model for Descriptive Caption Generation | Mohamed Trabelsi et.al. | 2501.01832 | null |
2025-01-03 | Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models | Yanjiang Liu et.al. | 2501.01830 | null |
2025-01-03 | SDPO: Segment-Level Direct Preference Optimization for Social Agents | Aobo Kong et.al. | 2501.01821 | link |
2025-01-03 | BERT4MIMO: A Foundation Model using BERT Architecture for Massive MIMO Channel State Information Prediction | Ferhat Ozgur Catak et.al. | 2501.01802 | link |
2025-01-03 | Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation | Mohammad Khalil et.al. | 2501.01793 | link |
2025-01-03 | Efficient LLM Inference with Activation Checkpointing and Hybrid Caching | Sanghyeon Lee et.al. | 2501.01792 | null |
2025-01-03 | Nonparametric estimation of a factorizable density using diffusion models | Hyeok Kyu Kwon et.al. | 2501.01783 | null |
2025-01-03 | SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation | Mingjie Li et.al. | 2501.01765 | null |
2025-01-03 | Adverse Weather Conditions Augmentation of LiDAR Scenes with Latent Diffusion Models | Andrea Matteazzi et.al. | 2501.01761 | null |
2025-01-03 | MusicGen-Stem: Multi-stem music generation and edition through autoregressive modeling | Simon Rouard et.al. | 2501.01757 | null |
2025-01-03 | Automating Legal Concept Interpretation with LLMs: Retrieval, Generation, and Evaluation | Kangcheng Luo et.al. | 2501.01743 | null |
2025-01-03 | How Toxic Can You Get? Search-based Toxicity Testing for Large Language Models | Simone Corbo et.al. | 2501.01741 | null |
2025-01-03 | AR4D: Autoregressive 4D Generation from Monocular Videos | Hanxin Zhu et.al. | 2501.01722 | null |
2025-01-03 | Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models | Guosheng Zhang et.al. | 2501.01720 | null |
2025-01-03 | LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries | Michal Kuk et.al. | 2501.01711 | null |
2025-01-03 | MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders | Jiajun Cao et.al. | 2501.01709 | null |
2025-01-03 | AgentRefine: Enhancing Agent Generalization through Refinement Tuning | Dayuan Fu et.al. | 2501.01702 | null |
2025-01-03 | Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models | Lei Tang et.al. | 2501.01679 | null |
2025-01-03 | Practical Secure Inference Algorithm for Fine-tuned Large Language Model Based on Fully Homomorphic Encryption | Zhang Ruoyan et.al. | 2501.01672 | null |
2025-01-03 | BARTPredict: Empowering IoT Security with LLM-Driven Cyber Threat Prediction | Alaeddine Diaf et.al. | 2501.01664 | null |
2025-01-03 | Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning | Danni Peng et.al. | 2501.01653 | null |
2025-01-03 | MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments | Cai Yin et.al. | 2501.01652 | link |
2025-01-03 | HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding | Heqing Zou et.al. | 2501.01645 | null |
2025-01-03 | iCBIR-Sli: Interpretable Content-Based Image Retrieval with 2D Slice Embeddings | Shuhei Tomoshige et.al. | 2501.01642 | null |
2025-01-03 | Uncertainty and Energy based Loss Guided Semi-Supervised Semantic Segmentation | Rini Smita Thakur et.al. | 2501.01640 | null |
2025-01-03 | A non-ergodic framework for understanding emergent capabilities in Large Language Models | Javier Marin et.al. | 2501.01638 | null |
2025-01-03 | Revisiting Data Analysis with Pre-trained Foundation Models | Chen Liang et.al. | 2501.01631 | null |
2025-01-03 | ICPC: In-context Prompt Compression with Faster Inference | Ziyang Yu et.al. | 2501.01625 | null |
2025-01-03 | PSYCHE: A Multi-faceted Patient Simulation Framework for Evaluation of Psychiatric Assessment Conversational Agents | Jingoo Lee et.al. | 2501.01594 | null |
2025-01-03 | (WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges | Mohamed Hisham Abdellatif et.al. | 2501.01588 | null |
2025-01-02 | Predicting the Performance of Black-box LLMs through Self-Queries | Dylan Sam et.al. | 2501.01558 | link |
2025-01-02 | Enhancing User Engagement in Large-Scale Social Annotation Platforms: Community-Based Design Interventions and Implications for Large Language Models (LLMs) | Jumana Almahmoud et.al. | 2501.01545 | null |
2025-01-02 | Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information | Rasul Tutnov et.al. | 2501.01544 | null |
2025-01-02 | Denoising Diffused Embeddings: a Generative Approach for Hypergraphs | Shihao Wu et.al. | 2501.01541 | null |
2025-01-02 | BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery | Kanishk Gandhi et.al. | 2501.01540 | link |
2025-01-02 | SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers | Bhavna Gopal et.al. | 2501.01529 | null |
2025-01-02 | Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search | Shuangtao Li et.al. | 2501.01478 | null |
2025-01-02 | Unifying Specialized Visual Encoders for Video Language Models | Jihoon Chung et.al. | 2501.01426 | link |
2025-01-02 | Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Jingfeng Yao et.al. | 2501.01423 | link |
2025-01-02 | Multi-Modal Video Feature Extraction for Popularity Prediction | Haixu Liu et.al. | 2501.01422 | null |
2025-01-02 | Deep Discrete Encoders: Identifiable Deep Generative Models for Rich Data with Discrete Latent Layers | Seunghyun Lee et.al. | 2501.01414 | null |
2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
2025-01-02 | OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios | Xize Cheng et.al. | 2501.01384 | null |
2025-01-02 | ScarNet: A Novel Foundation Model for Automated Myocardial Scar Quantification from LGE in Cardiac MRI | Neda Tavakoli et.al. | 2501.01372 | link |
2025-01-02 | Aligning Large Language Models for Faithful Integrity Against Opposing Argument | Yong Zhao et.al. | 2501.01336 | link |
2025-01-02 | CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language Models | Johan Wahréus et.al. | 2501.01335 | link |
2025-01-02 | Decoding Knowledge in Large Language Models: A Framework for Categorization and Comprehension | Yanbo Fang et.al. | 2501.01332 | null |
2025-01-02 | The Prompt Alchemist: Automated LLM-Tailored Prompt Optimization for Test Case Generation | Shuzheng Gao et.al. | 2501.01329 | null |
2025-01-03 | Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking | Xiaoxue Cheng et.al. | 2501.01306 | null |
2025-01-02 | Large Language Models for Mental Health Diagnostic Assessments: Exploring The Potential of Large Language Models for Assisting with Mental Health Diagnostic Assessments -- The Depression and Anxiety Case | Kaushik Roy et.al. | 2501.01305 | null |
2025-01-02 | Does a Large Language Model Really Speak in Human-Like Language? | Mose Park et.al. | 2501.01273 | null |
2025-01-02 | ProgCo: Program Helps Self-Correction of Large Language Models | Xiaoshuai Song et.al. | 2501.01264 | null |
2025-01-03 | CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings | Shanghaoran Quan et.al. | 2501.01257 | null |
2025-01-02 | Digital Guardians: Can GPT-4, Perspective API, and Moderation API reliably detect hate speech in reader comments of German online newspapers? | Manuel Weber et.al. | 2501.01256 | null |
2025-01-02 | Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion | Qiyuan He et.al. | 2501.01246 | null |
2025-01-02 | SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization | Yongle Huang et.al. | 2501.01245 | link |
2025-01-02 | Face-Human-Bench: A Comprehensive Benchmark of Face and Human Understanding for Multi-modal Assistants | Lixiong Qin et.al. | 2501.01243 | null |
2025-01-02 | Automated Self-Refinement and Self-Correction for LLM-based Product Attribute Value Extraction | Alexander Brinkmann et.al. | 2501.01237 | link |
2025-01-03 | TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer | Jiayu Li et.al. | 2501.01216 | null |
2025-01-02 | Harnessing Multi-Agent LLMs for Complex Engineering Problem-Solving: A Framework for Senior Design Projects | Abdullah Mushtaq et.al. | 2501.01205 | null |
2025-01-02 | HetGCoT-Rec: Heterogeneous Graph-Enhanced Chain-of-Thought LLM Reasoning for Journal Recommendation | Runsong Jia et.al. | 2501.01203 | null |
2025-01-02 | LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge | Kyoungkook Kang et.al. | 2501.01197 | null |
2025-01-02 | Bridging the Early Science Gap with Artificial Intelligence: Evaluating Large Language Models as Tools for Early Childhood Science Education | Annika Bush et.al. | 2501.01192 | null |
2025-01-02 | Towards Interactive Deepfake Analysis | Lixiong Qin et.al. | 2501.01164 | link |
2025-01-02 | TexAVi: Generating Stereoscopic VR Video Clips from Text Descriptions | Vriksha Srihari et.al. | 2501.01156 | null |
2025-01-02 | A3: Android Agent Arena for Mobile GUI Agents | Yuxiang Chai et.al. | 2501.01149 | null |
2025-01-03 | BlockDialect: Block-wise Fine-grained Mixed Format for Energy-Efficient LLM Inference | Wonsuk Jang et.al. | 2501.01144 | link |
2025-01-02 | Embodied AI-Enhanced Vehicular Networks: An Integrated Large Language Models and Reinforcement Learning Method | Ruichen Zhang et.al. | 2501.01141 | null |
2025-01-02 | Graph2text or Graph2token: A Perspective of Large Language Models for Graph Learning | Shuo Yu et.al. | 2501.01124 | null |
2025-01-02 | MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification | Jimin Park et.al. | 2501.01110 | null |
2025-01-03 | MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization | Haina Zhu et.al. | 2501.01108 | link |
2025-01-02 | Graph Generative Pre-trained Transformer | Xiaohui Chen et.al. | 2501.01073 | null |
2025-01-02 | Dynamic Attention-Guided Context Decoding for Mitigating Context Faithfulness Hallucinations in Large Language Models | Yanwen Huang et.al. | 2501.01059 | null |
2025-01-02 | Risks of Cultural Erasure in Large Language Models | Rida Qadri et.al. | 2501.01056 | null |
2025-01-02 | Dynamic Scaling of Unit Tests for Code Reward Modeling | Zeyao Ma et.al. | 2501.01054 | null |
2025-01-02 | Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs | Linhao Huang et.al. | 2501.01042 | null |
2025-01-02 | Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models | Bin Wang et.al. | 2501.01034 | link |
2025-01-02 | ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning | Wonduk Seo et.al. | 2501.01031 | null |
2025-01-03 | KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model | Xinshuo Hu et.al. | 2501.01028 | link |
2025-01-02 | MDSF: Context-Aware Multi-Dimensional Data Storytelling Framework based on Large language Model | Chengze Zhang et.al. | 2501.01014 | null |
2025-01-02 | FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving | Zihao Ye et.al. | 2501.01005 | link |
2025-01-02 | Exploring Information Processing in Large Language Models: Insights from Information Bottleneck Theory | Zhou Yang et.al. | 2501.00999 | null |
2025-01-02 | Optimizing Noise Schedules of Generative Models in High Dimensions | Santiago Aranguri et.al. | 2501.00988 | null |
2025-01-02 | Are LLMs effective psychological assessors? Leveraging adaptive RAG for interpretable mental health screening through psychometric practice | Federico Ravenda et.al. | 2501.00982 | link |
2025-01-01 | IGGA: A Dataset of Industrial Guidelines and Policy Statements for Generative AIs | Junfeng Jiao et.al. | 2501.00959 | null |
2025-01-01 | Generative AI and LLMs in Industry: A text-mining Analysis and Critical Evaluation of Guidelines and Policy Statements Across Fourteen Industrial Sectors | Junfeng Jiao et.al. | 2501.00957 | null |
2025-01-01 | Incremental Dialogue Management: Survey, Discussion, and Implications for HRI | Casey Kennington et.al. | 2501.00953 | null |
2025-01-01 | SPADE: Enhancing Adaptive Cyber Deception Strategies with Generative AI and Structured Prompt Engineering | Shihab Ahmed et.al. | 2501.00940 | null |
2025-01-01 | Diffusion Policies for Generative Modeling of Spacecraft Trajectories | Julia Briden et.al. | 2501.00915 | null |
2025-01-01 | Aligning LLMs with Domain Invariant Reward Models | David Wu et.al. | 2501.00911 | link |
2025-01-01 | Population Aware Diffusion for Time Series Generation | Yang Li et.al. | 2501.00910 | link |
2025-01-01 | Large Language Model Based Multi-Agent System Augmented Complex Event Processing Pipeline for Internet of Multimedia Things | Talha Zeeshan et.al. | 2501.00906 | null |
2025-01-01 | Text2Earth: Unlocking Text-driven Remote Sensing Image Generation with a Global-Scale Dataset and a Foundation Model | Chenyang Liu et.al. | 2501.00895 | null |
2025-01-01 | Evaluating Time Series Foundation Models on Noisy Periodic Time Series | Syamantak Datta Gupta et.al. | 2501.00889 | null |
2025-01-01 | Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization | Weiqi Wu et.al. | 2501.00888 | link |
2025-01-01 | Representation in large language models | Cameron C. Yetman et.al. | 2501.00885 | null |
2025-01-01 | Agentic Systems: A Guide to Transforming Industries with Vertical AI Agents | Fouad Bousetouane et.al. | 2501.00881 | null |
2025-01-01 | Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction | Teng Hu et.al. | 2501.00880 | null |
2025-01-01 | TrustRAG: Enhancing Robustness and Trustworthiness in RAG | Huichi Zhou et.al. | 2501.00879 | link |
2025-01-01 | LUSIFER: Language Universal Space Integration for Enhanced Multilingual Embeddings with Large Language Models | Hieu Man et.al. | 2501.00874 | link |
2025-01-01 | Exploring Structured Semantic Priors Underlying Diffusion Score for Test-time Adaptation | Mingjia Li et.al. | 2501.00873 | link |
2025-01-01 | Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Shoutao Guo et.al. | 2501.00868 | link |
2025-01-01 | Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era | Mihnea C. Moldoveanu et.al. | 2501.00867 | null |
2025-01-01 | Alzheimer's disease detection based on large language model prompt engineering | Tian Zheng et.al. | 2501.00861 | null |
2025-01-01 | LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions | Adam Ishay et.al. | 2501.00830 | null |
2025-01-01 | An LLM-Empowered Adaptive Evolutionary Algorithm For Multi-Component Deep Learning Systems | Haoxiang Tian et.al. | 2501.00829 | null |
2025-01-01 | LLM-Powered Multi-Agent System for Automated Crypto Portfolio Management | Yichen Luo et.al. | 2501.00826 | null |
2025-01-01 | Multimodal Large Models Are Effective Action Anticipators | Binglu Wang et.al. | 2501.00795 | link |
2025-01-01 | Shifting-Merging: Secure, High-Capacity and Efficient Steganography via Large Language Models | Minhao Bai et.al. | 2501.00786 | null |
2025-01-01 | NMM-HRI: Natural Multi-modal Human-Robot Interaction with Voice and Deictic Posture via Large Language Model | Yuzhi Lai et.al. | 2501.00785 | null |
2025-01-01 | REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization | Huyen Nguyen et.al. | 2501.00779 | null |
2025-01-01 | FitCF: A Framework for Automatic Feature Importance-guided Counterfactual Example Generation | Qianli Wang et.al. | 2501.00777 | null |
2025-01-01 | Using Large Language Model to Support Flexible and Structural Inductive Qualitative Analysis | Jie Gao et.al. | 2501.00775 | null |
2025-01-01 | An AI-powered Bayesian generative modeling approach for causal inference in observational studies | Qiao Liu et.al. | 2501.00755 | null |
2025-01-01 | Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Cheonsu Jeong et.al. | 2501.00750 | null |
2025-01-01 | DIVE: Diversified Iterative Self-Improvement | Yiwei Qin et.al. | 2501.00747 | link |
2025-01-01 | Dynamics of Adversarial Attacks on Large Language Model-Based Search Engines | Xiyang Hu et.al. | 2501.00745 | null |
2025-01-01 | A Distributional Evaluation of Generative Image Models | Edric Tam et.al. | 2501.00744 | null |
2025-01-01 | New Agegraphic Dark Energy Model in Modified Symmetric Teleparallel Theory | Madiha Ajmal et.al. | 2501.00721 | null |
2025-01-01 | Knowledge-Guided Prompt Learning for Deepfake Facial Image Detection | Hao Wang et.al. | 2501.00700 | null |
2025-01-01 | Adjoint sharding for very long context training of state space models | Xingzi Xu et.al. | 2501.00692 | null |
2025-01-01 | Labels Generated by Large Language Model Helps Measuring People's Empathy in Vitro | Md Rakibul Hasan et.al. | 2501.00691 | null |
2025-01-01 | IGC: Integrating a Gated Calculator into an LLM to Solve Arithmetic Tasks Reliably and Efficiently | Florian Dietz et.al. | 2501.00684 | null |
2024-12-31 | Grade Inflation in Generative Models | Phuc Nguyen et.al. | 2501.00664 | null |
2024-12-31 | Finding Missed Code Size Optimizations in Compilers using LLMs | Davide Italiano et.al. | 2501.00655 | null |
2024-12-31 | Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models | Suttisak Wizadwongsa et.al. | 2501.00651 | null |
2024-12-31 | Efficient Standardization of Clinical Notes using Large Language Models | Daniel B. Hier et.al. | 2501.00644 | null |
2024-12-31 | Enabling New HDLs with Agents | Mark Zakharov et.al. | 2501.00642 | null |
2024-12-31 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2024-12-31 | VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM | Yuqian Yuan et.al. | 2501.00599 | link |
2024-12-31 | Setting Standards in Turkish NLP: TR-MMLU for Large Language Model Evaluation | M. Ali Bayram et.al. | 2501.00593 | null |
2024-12-31 | Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method | Zhenpeng Huang et.al. | 2501.00584 | null |
2024-12-31 | Causal Graph Guided Steering of LLM Values via Prompts and Sparse Autoencoders | Yipeng Kang et.al. | 2501.00581 | null |
2024-12-31 | AI and Quantum Computing in Binary Photocatalytic Hydrogen Production | Dennis Delali Kwesi Wayo et.al. | 2501.00575 | null |
2024-12-31 | VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling | Xinhao Li et.al. | 2501.00574 | link |
2024-12-31 | Probing Visual Language Priors in VLMs | Tiange Luo et.al. | 2501.00569 | null |
2024-12-31 | Robust and Adaptive Optimization under a Large Language Model Lens | Dimitris Bertsimas et.al. | 2501.00568 | null |
2024-12-30 | Distributed Mixture-of-Agents for Edge Inference with Large Language Models | Purbesh Mitra et.al. | 2412.21200 | link |
2024-12-31 | HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation | Zhaojian Yu et.al. | 2412.21199 | link |
2024-12-30 | The Gaussian Kicked Rotor: Periodic forcing with finite-width pulses and the role of shifting the kick | Jonathan Berkheim et.al. | 2412.21186 | null |
2024-12-30 | Facilitating large language model Russian adaptation with Learned Embedding Propagation | Mikhail Tikhomirov et.al. | 2412.21140 | link |
2024-12-30 | ExpShield: Safeguarding Web Text from Unauthorized Crawling and Language Modeling Exploitation | Ruixuan Liu et.al. | 2412.21123 | null |
2025-01-02 | Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation | Yuanbo Yang et.al. | 2412.21117 | null |
2024-12-30 | Varformer: Adapting VAR's Generative Prior for Image Restoration | Siyang Wang et.al. | 2412.21063 | link |
2024-12-30 | VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation | Jiazheng Xu et.al. | 2412.21059 | link |
2024-12-30 | Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense | Yuyang Zhou et.al. | 2412.21051 | link |
2024-12-30 | E2EDiff: Direct Mapping from Noise to Data for Enhanced Diffusion Models | Zhiyu Tan et.al. | 2412.21044 | null |
2024-12-30 | Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration | Wanglong Lu et.al. | 2412.21042 | link |
2024-12-30 | TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Chia-Yu Hung et.al. | 2412.21037 | link |
2024-12-30 | GePBench: Evaluating Fundamental Geometric Perception for Multimodal Large Language Models | Shangyu Xing et.al. | 2412.21036 | null |
2024-12-30 | MapQaTor: A System for Efficient Annotation of Map Query Datasets | Mahir Labib Dihan et.al. | 2412.21015 | link |
2024-12-31 | Verbosity-Aware Rationale Reduction: Effective Reduction of Redundant Rationale via Principled Criteria | Joonwon Jang et.al. | 2412.21006 | null |
2024-12-30 | Plug-and-Play Training Framework for Preference Optimization | Jingyuan Ma et.al. | 2412.20996 | null |
2024-12-30 | KARPA: A Training-free Method of Adapting Knowledge Graph as References for Large Language Model's Reasoning Path Aggregation | Siyuan Fang et.al. | 2412.20995 | null |
2024-12-30 | Efficiently Serving LLM Reasoning Programs with Certaindex | Yichao Fu et.al. | 2412.20993 | null |
2024-12-30 | QuantumLLMInstruct: A 500k LLM Instruction-Tuning Dataset with Problem-Solution Pairs for Quantum Computing | Shlomo Kashani et.al. | 2412.20956 | null |
2024-12-30 | AGON: Automated Design Framework for Customizing Processors from ISA Documents | Chongxiao Li et.al. | 2412.20954 | null |
2024-12-30 | Ontology-grounded Automatic Knowledge Graph Construction by LLM under Wikidata schema | Xiaohan Feng et.al. | 2412.20942 | null |
2024-12-30 | Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering | Junxiao Xue et.al. | 2412.20927 | null |
2024-12-30 | ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation | Ting Zhang et.al. | 2412.20901 | null |
2024-12-30 | Towards Compatible Fine-tuning for Vision-Language Model Updates | Zhengbo Wang et.al. | 2412.20895 | null |
2024-12-30 | DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models | Xiaolin Hu et.al. | 2412.20891 | null |
2024-12-30 | Enhancing Annotated Bibliography Generation with LLM Ensembles | Sergio Bermejo et.al. | 2412.20864 | null |
2024-12-30 | Are LLMs Really Not Knowledgable? Mining the Submerged Knowledge in LLMs' Memory | Xingjian Tao et.al. | 2412.20846 | null |
2024-12-30 | Disentangling Preference Representation and Text Generation for Efficient Individual Preference Alignment | Jianfei Zhang et.al. | 2412.20834 | link |
2024-12-30 | Retrieval-Augmented Generation for Mobile Edge Computing via Large Language Model | Runtao Ren et.al. | 2412.20820 | null |
2024-12-30 | TimeRAF: Retrieval-Augmented Foundation model for Zero-shot Time Series Forecasting | Huanyu Zhang et.al. | 2412.20810 | null |
2024-12-30 | Pre-trained Audio Transformer as a Foundational AI Tool for Gravitational Waves | Chayan Chatterjee et.al. | 2412.20789 | null |
2024-12-31 | SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity | Pengfei Jing et.al. | 2412.20787 | null |
2024-12-30 | Large Language Model Enabled Multi-Task Physical Layer Network | Tianyue Zheng et.al. | 2412.20772 | null |
2024-12-30 | Attributing Culture-Conditioned Generations to Pretraining Corpora | Huihan Li et.al. | 2412.20760 | link |
2024-12-30 | M | Bei Yan et.al. | 2412.20718 | link |
2024-12-30 | HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images | Sungik Choi et.al. | 2412.20704 | null |
2024-12-30 | UBER: Uncertainty-Based Evolution with Large Language Models for Automatic Heuristic Design | Zijie Chen et.al. | 2412.20694 | null |
2024-12-30 | Learning to Rank Pre-trained Vision-Language Models for Downstream Tasks | Yuhe Ding et.al. | 2412.20682 | null |
2024-12-30 | Align Attention Heads Before Merging Them: An Effective Way for Converting MHA to GQA | Qingyun Jin et.al. | 2412.20677 | null |
2024-12-30 | Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner | Yitong Zhou et.al. | 2412.20662 | link |
2024-12-30 | Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis | Yousef Yeganeh et.al. | 2412.20651 | null |
2024-12-30 | SafeSynthDP: Leveraging Large Language Models for Privacy-Preserving Synthetic Data Generation Using Differential Privacy | Md Mahadi Hasan Nahid et.al. | 2412.20641 | null |
2024-12-30 | Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble | Yongchang Li et.al. | 2412.20637 | null |
2024-12-30 | EVOLVE: Emotion and Visual Output Learning via LLM Evaluation | Jordan Sinclair et.al. | 2412.20632 | null |
2024-12-29 | Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Yulin Fei et.al. | 2412.20613 | link |
2024-12-29 | NLP-based Regulatory Compliance -- Using GPT 4.0 to Decode Regulatory Documents | Bimal Kumar et.al. | 2412.20602 | null |
2024-12-29 | MATEY: multiscale adaptive foundation models for spatiotemporal physical systems | Pei Zhang et.al. | 2412.20601 | null |
2024-12-29 | Controlling Out-of-Domain Gaps in LLMs for Genre Classification and Generated Text Detection | Dmitri Roussinov et.al. | 2412.20595 | link |
2024-12-29 | Towards Neural No-Resource Language Translation: A Comparative Evaluation of Approaches | Madhavendra Thakur et.al. | 2412.20584 | null |
2024-12-29 | Counterfactual Samples Constructing and Training for Commonsense Statements Estimation | Chong Liu et.al. | 2412.20563 | null |
2024-12-29 | Distributionally Robust Optimization via Iterative Algorithms in Continuous Probability Spaces | Linglingzhi Zhu et.al. | 2412.20556 | null |
2024-12-29 | The Impact of Prompt Programming on Function-Level Code Generation | Ranim Khojah et.al. | 2412.20545 | link |
2024-12-29 | Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Xingshuai Huang et.al. | 2412.20519 | null |
2024-12-29 | Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning | Hang Ni et.al. | 2412.20505 | null |
2024-12-29 | ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding | Xiao Wang et.al. | 2412.20504 | link |
2024-12-29 | TokenRing: An Efficient Parallelism Framework for Infinite-Context LLMs via Bidirectional Communication | Zongwu Wang et.al. | 2412.20501 | link |
2024-12-29 | Multimodal Variational Autoencoder: a Barycentric View | Peijie Qiu et.al. | 2412.20487 | null |
2024-12-29 | JADE: Joint-aware Latent Diffusion for 3D Human Generative Modeling | Haorui Ji et.al. | 2412.20470 | null |
2024-12-29 | Improving Vision-Language-Action Models via Chain-of-Affordance | Jinming Li et.al. | 2412.20451 | null |
2024-12-29 | Enhancing Entertainment Translation for Indian Languages using Adaptive Context, Style and LLMs | Pratik Rakesh Singh et.al. | 2412.20440 | null |
2024-12-29 | Image Augmentation Agent for Weakly Supervised Semantic Segmentation | Wangyu Wu et.al. | 2412.20439 | null |
2024-12-29 | Unlocking adaptive digital pathology through dynamic feature learning | Jiawen Li et.al. | 2412.20430 | null |
2024-12-29 | AmalREC: A Dataset for Relation Extraction and Classification Leveraging Amalgamation of Large Language Models | Mansi et.al. | 2412.20427 | null |
2024-12-29 | Bringing Objects to Life: 4D generation from 3D objects | Ohad Rahamim et.al. | 2412.20422 | null |
2024-12-29 | Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection | Kalin Kopanov et.al. | 2412.20414 | null |
2024-12-29 | Multi-Objective Large Language Model Unlearning | Zibin Pan et.al. | 2412.20412 | link |
2024-12-29 | Open-Sora: Democratizing Efficient Video Production for All | Zangwei Zheng et.al. | 2412.20404 | link |
2024-12-29 | Natural Language Fine-Tuning | Jia Liu et.al. | 2412.20382 | link |
2024-12-29 | Protégé: Learn and Generate Basic Makeup Styles with Generative Adversarial Networks (GANs) | Jia Wei Sii et.al. | 2412.20381 | null |
2024-12-29 | FairDiffusion: Enhancing Equity in Latent Diffusion Models via Fair Bayesian Perturbation | Yan Luo et.al. | 2412.20374 | link |
2024-12-29 | LLM2: Let Large Language Models Harness System 2 Reasoning | Cheng Yang et.al. | 2412.20372 | link |
2025-01-02 | Enhancing Code LLMs with Reinforcement Learning in Code Generation: A Survey | Junqiao Wang et.al. | 2412.20367 | null |
2024-12-29 | HindiLLM: Large Language Model for Hindi | Sanjay Chouhan et.al. | 2412.20357 | null |
2024-12-29 | Distilling Desired Comments for Enhanced Code Review with Large Language Models | Yongda Yu et.al. | 2412.20340 | null |
2024-12-29 | Mind the Data Gap: Bridging LLMs to Enterprise Data Integration | Moe Kayali et.al. | 2412.20331 | null |
2024-12-29 | GreenLLM: Disaggregating Large Language Model Serving on Heterogeneous GPUs for Lower Carbon Emissions | Tianyao Shi et.al. | 2412.20322 | null |
2024-12-29 | Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain | Shintaro Ozaki et.al. | 2412.20309 | null |
2024-12-28 | FaGeL: Fabric LLMs Agent empowered Embodied Intelligence Evolution with Autonomous Human-Machine Collaboration | Jia Liu et.al. | 2412.20297 | null |
2024-12-28 | Deep Generalized Schrödinger Bridges: From Image Generation to Solving Mean-Field Games | Guan-Horng Liu et.al. | 2412.20279 | null |
2024-12-28 | Scoring with Large Language Models: A Study on Measuring Empathy of Responses in Dialogues | Henry J. Xie et.al. | 2412.20264 | link |
2024-12-28 | Leveraging Large Language Models for Enhancing Autonomous Vehicle Perception | Athanasios Karagounis et.al. | 2412.20230 | null |
2024-12-28 | LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning | Shuguang Chen et.al. | 2412.20227 | null |
2024-12-28 | Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation | Yeonhong Park et.al. | 2412.20185 | null |
2024-12-28 | LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System | Hyucksung Kwon et.al. | 2412.20166 | null |
2024-12-28 | StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN | Andrzej Bedychaj et.al. | 2412.20164 | null |
2024-12-28 | Topic-Aware Knowledge Graph with Large Language Models for Interoperability in Recommender Systems | Minhye Jeon et.al. | 2412.20163 | null |
2024-12-28 | Multi-Modality Driven LoRA for Adverse Condition Depth Estimation | Guanglei Yang et.al. | 2412.20162 | null |
2024-12-28 | Defending Against Network Attacks for Secure AI Agent Migration in Vehicular Metaverses | Xinru Wen et.al. | 2412.20154 | null |
2024-12-28 | Efficient Multi-Agent Collaboration with Tool Use for Online Planning in Complex Table Question Answering | Wei Zhou et.al. | 2412.20145 | null |
2024-12-28 | TradingAgents: Multi-Agents LLM Financial Trading Framework | Yijia Xiao et.al. | 2412.20138 | null |
2024-12-28 | M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation | Zhaopeng Feng et.al. | 2412.20127 | link |
2024-12-28 | Functional Lower Bounds in Algebraic Proofs: Symmetry, Lifting, and Barriers | Tuomas Hakoniemi et.al. | 2412.20114 | null |
2024-12-28 | ST | Jiedong Zhuang et.al. | 2412.20105 | null |
2024-12-28 | On the Validity of Traditional Vulnerability Scoring Systems for Adversarial Attacks against LLMs | Atmane Ayoub Mansour Bahar et.al. | 2412.20087 | null |
2024-12-31 | Extract Information from Hybrid Long Documents Leveraging LLMs: A Framework and Dataset | Chongjian Yue et.al. | 2412.20072 | null |
2024-12-28 | On the Compositional Generalization of Multimodal LLMs for Medical Imaging | Zhenyang Cai et.al. | 2412.20070 | link |
2024-12-28 | VELoRA: A Low-Rank Adaptation Approach for Efficient RGB-Event based Recognition | Lan Chen et.al. | 2412.20064 | link |
2024-12-28 | MADiff: Text-Guided Fashion Image Editing with Mask Prediction and Attention-Enhanced Diffusion | Zechao Zhan et.al. | 2412.20062 | null |
2024-12-28 | Comparative Analysis of Listwise Reranking with Large Language Models in Limited-Resource Language Contexts | Yanxin Shen et.al. | 2412.20061 | null |
2024-12-28 | "My life is miserable, have to sign 500 autographs everyday": Exposing Humblebragging, the Brags in Disguise | Sharath Naganna et.al. | 2412.20057 | null |
2024-12-27 | Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | Kumud Tripathi et.al. | 2412.19785 | null |
2024-12-27 | Can AI Help with Your Personal Finances? | Oudom Hean et.al. | 2412.19784 | null |
2024-12-27 | Tensor Network Estimation of Distribution Algorithms | John Gardiner et.al. | 2412.19780 | null |
2024-12-27 | Fortran2CPP: Automating Fortran-to-C++ Migration using LLMs via Multi-Turn Dialogue and Dual-Agent Integration | Le Chen et.al. | 2412.19770 | link |
2024-12-27 | Generative Video Propagation | Shaoteng Liu et.al. | 2412.19761 | null |
2024-12-27 | On dual-projectively equivalent connections associated to second order superintegrable systems | Andreas Vollmer et.al. | 2412.19739 | null |
2024-12-27 | Can Large Language Models Adapt to Other Agents In-Context? | Matthew Riemer et.al. | 2412.19726 | null |
2024-12-27 | From Elements to Design: A Layered Approach for Automatic Graphic Design Composition | Jiawei Lin et.al. | 2412.19712 | null |
2024-12-27 | Toward Adaptive Reasoning in Large Language Models with Thought Rollback | Sijia Chen et.al. | 2412.19707 | link |
2024-12-27 | A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization | Jingchun Lian et.al. | 2412.19685 | null |
2024-12-27 | Boosting Private Domain Understanding of Efficient MLLMs: A Tuning-free, Adaptive, Universal Prompt Optimization Framework | Jiang Liu et.al. | 2412.19684 | null |
2024-12-27 | CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs | Siyu Wang et.al. | 2412.19663 | null |
2024-12-27 | Asymmetrical Reciprocity-based Federated Learning for Resolving Disparities in Medical Diagnosis | Jiaqi Wang et.al. | 2412.19654 | link |
2024-12-27 | FreStega: A Plug-and-Play Method for Boosting Imperceptibility and Capacity in Generative Linguistic Steganography for Real-World Scenarios | Kaiyi Pang et.al. | 2412.19652 | null |
2024-12-27 | Xmodel-2 Technical Report | Wang Qun et.al. | 2412.19638 | null |
2024-12-27 | IMTP: Search-based Code Generation for In-memory Tensor Programs | Yongwon Shin et.al. | 2412.19630 | null |
2024-12-27 | Signatures of prediction during natural listening in MEG data? | Sahel Azizpour et.al. | 2412.19622 | null |
2024-12-27 | Gradient Weight-normalized Low-rank Projection for Efficient LLM Training | Jia-Hong Huang et.al. | 2412.19616 | link |
2024-12-27 | SocRATES: Towards Automated Scenario-based Testing of Social Navigation Algorithms | Shashank Rao Marpally et.al. | 2412.19595 | null |
2024-12-27 | Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following | Yuxiao Yang et.al. | 2412.19562 | null |
2024-12-27 | Diverse Rare Sample Generation with Pretrained GANs | Subeen Lee et.al. | 2412.19543 | link |
2024-12-27 | Lévy Score Function and Score-Based Particle Algorithm for Nonlinear Lévy–Fokker–Planck Equations | Yuanfei Huang et.al. | 2412.19520 | null |
2024-12-27 | Estimation of System Parameters Including Repeated Cross-Sectional Data through Emulator-Informed Deep Generative Model | Hyunwoo Cho et.al. | 2412.19517 | null |
2024-12-27 | Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs | Zhe Yang et.al. | 2412.19513 | link |
2024-12-27 | Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging | Hua Farn et.al. | 2412.19512 | null |
2024-12-27 | Parameter Efficient Fine-Tuning for Deep Learning-Based Full-Waveform Inversion | Koustav Ghosal et.al. | 2412.19510 | null |
2024-12-27 | MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Shiyao Li et.al. | 2412.19509 | link |
2024-12-27 | DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-27 | Casevo: A Cognitive Agents and Social Evolution Simulator | Zexun Jiang et.al. | 2412.19498 | link |
2024-12-27 | Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation | Chengyang Ye et.al. | 2412.19492 | link |
2024-12-27 | Focusing Image Generation to Mitigate Spurious Correlations | Xuewei Li et.al. | 2412.19457 | null |
2024-12-27 | Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models | Hyeonseok Moon et.al. | 2412.19450 | link |
2024-12-27 | Feature Alignment-Based Knowledge Distillation for Efficient Compression of Large Language Models | Shuo Wang et.al. | 2412.19449 | null |
2024-12-27 | A Survey on Large Language Model Acceleration based on KV Cache Management | Haoyang Li et.al. | 2412.19442 | link |
2024-12-27 | Low-Rank Contextual Reinforcement Learning from Heterogeneous Human Feedback | Seong Jin Lee et.al. | 2412.19436 | null |
2024-12-27 | Temporal Context Consistency Above All: Enhancing Long-Term Anticipation by Learning and Enforcing Temporal Constraints | Alberto Maté et.al. | 2412.19424 | null |
2024-12-27 | Gx2Mol: De Novo Generation of Hit-like Molecules from Gene Expression Profiles via Deep Learning | Chen Li et.al. | 2412.19422 | link |
2024-12-27 | MINIMA: Modality Invariant Image Matching | Xingyu Jiang et.al. | 2412.19412 | link |
2024-12-27 | MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic Scenarios | Jiaqi Fan et.al. | 2412.19406 | null |
2024-12-27 | An Engorgio Prompt Makes Large Language Model Babble on | Jianshuo Dong et.al. | 2412.19394 | link |
2024-12-26 | Large Language Models for Market Research: A Data-augmentation Approach | Mengxin Wang et.al. | 2412.19363 | null |
2024-12-26 | Dynamic Skill Adaptation for Large Language Models | Jiaao Chen et.al. | 2412.19361 | null |
2024-12-26 | Identifying Split Vacancies with Foundation Models and Electrostatics | Seán R. Kavanagh et.al. | 2412.19330 | null |
2024-12-26 | Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment | Ziang Yan et.al. | 2412.19326 | link |
2024-12-26 | Performance Control in Early Exiting to Deploy Large Models at the Same Cost of Smaller Ones | Mehrnaz Mofakhami et.al. | 2412.19325 | null |
2024-12-26 | From Interests to Insights: An LLM Approach to Course Recommendations Using Natural Language Queries | Hugh Van Deventer et.al. | 2412.19312 | link |
2024-12-26 | Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries | Roberto Amoroso et.al. | 2412.19304 | null |
2024-12-26 | RecLM: Recommendation Instruction Tuning | Yangqin Jiang et.al. | 2412.19302 | link |
2024-12-26 | RAG with Differential Privacy | Nicolas Grislain et.al. | 2412.19291 | link |
2024-12-26 | Time Series Foundational Models: Their Role in Anomaly Detection and Prediction | Chathurangi Shyalika et.al. | 2412.19286 | link |
2024-12-26 | PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing | Michael Bezick et.al. | 2412.19284 | null |
2024-12-26 | MEDEC: A Benchmark for Medical Error Detection and Correction in Clinical Notes | Asma Ben Abacha et.al. | 2412.19260 | link |
2024-12-26 | VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis | Jaemin Jung et.al. | 2412.19259 | null |
2024-12-26 | Sentiment trading with large language models | Kemal Kirtac et.al. | 2412.19245 | null |
2024-12-26 | SeaMo: A Multi-Seasonal and Multimodal Remote Sensing Foundation Model | Xuyang Li et.al. | 2412.19237 | null |
2024-12-26 | Large Language Models Meet Graph Neural Networks: A Perspective of Graph Mining | Yuxin You et.al. | 2412.19211 | null |
2024-12-26 | Multi-Attribute Constraint Satisfaction via Language Model Rewriting | Ashutosh Baheti et.al. | 2412.19198 | null |
2024-12-26 | Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models | Haonan He et.al. | 2412.19191 | null |
2024-12-26 | Evolutionary de-homogenization using a generative model for optimizing solid-porous infill structures considering the stress concentration issue | Shuzhi Xu et.al. | 2412.19154 | null |
2024-12-26 | AskChart: Universal Chart Understanding through Textual Enhancement | Xudong Yang et.al. | 2412.19146 | link |
2024-12-26 | SILC-EFSA: Self-aware In-context Learning Correction for Entity-level Financial Sentiment Analysis | Senbin Zhu et.al. | 2412.19140 | link |
2024-12-26 | PlanLLM: Video Procedure Planning with Refinable Large Language Models | Dejie Yang et.al. | 2412.19139 | link |
2024-12-26 | Advanced Knowledge Transfer: Refined Feature Distillation for Zero-Shot Quantization in Edge Computing | Inpyo Hong et.al. | 2412.19125 | link |
2024-12-26 | Discrete vs. Continuous Trade-offs for Generative Models | Jathin Korrapati et.al. | 2412.19114 | null |
2024-12-26 | SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values | Yunfan Zhang et.al. | 2412.19113 | null |
2024-12-26 | Stochastic normalizing flows for Effective String Theory | Michele Caselle et.al. | 2412.19109 | null |
2024-12-26 | "I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities | Jiawei Yu et.al. | 2412.19102 | null |
2024-12-26 | Integrating Artificial Open Generative Artificial Intelligence into Software Supply Chain Security | Vasileios Alevizos et.al. | 2412.19088 | null |
2024-12-26 | Mask Factory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation | Haotian Qian et.al. | 2412.19080 | null |
2024-12-26 | CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Jingyi Zheng et.al. | 2412.19037 | link |
2024-12-26 | Repository Structure-Aware Training Makes SLMs Better Issue Resolver | Zexiong Ma et.al. | 2412.19031 | null |
2024-12-26 | Modality-Projection Universal Model for Comprehensive Full-Body Medical Imaging Segmentation | Yixin Chen et.al. | 2412.19026 | link |
2024-12-26 | Channel-Aware Optimal Transport: A Theoretical Framework for Generative Communication | Xiqiang Qu et.al. | 2412.19025 | null |
2024-12-26 | Relation-aware Hierarchical Prompt for Open-vocabulary Scene Graph Generation | Tao Liu et.al. | 2412.19021 | null |
2024-12-26 | Let the Rule Speak: Enhancing In-context Learning Debiasing with Interpretability | Ruixi Lin et.al. | 2412.19018 | null |
2024-12-25 | How Propense Are Large Language Models at Producing Code Smells? A Benchmarking Study | Alejandro Velasco et.al. | 2412.18989 | null |
2024-12-25 | ModelGrow: Continual Text-to-Video Pre-training with Model Expansion and Language Understanding Enhancement | Zhefan Rao et.al. | 2412.18966 | null |
2024-12-25 | Musings About the Future of Search: A Return to the Past? | Jimmy Lin et.al. | 2412.18956 | null |
2024-12-25 | A Power-Efficient Hardware Implementation of L-Mul | Ruiqi Chen et.al. | 2412.18948 | null |
2024-12-25 | MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models | Kaiwen Zuo et.al. | 2412.18947 | null |
2024-12-25 | Amuse: Human-AI Collaborative Songwriting with Multimodal Inspirations | Yewon Kim et.al. | 2412.18940 | null |
2024-12-25 | Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference | Libo Zhang et.al. | 2412.18934 | null |
2024-12-25 | UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation | Lunhao Duan et.al. | 2412.18928 | null |
2024-12-25 | Exemplar-condensed Federated Class-incremental Learning | Rui Sun et.al. | 2412.18926 | null |
2024-12-25 | Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Yi-Chia Chen et.al. | 2412.18917 | link |
2024-12-25 | AdaEAGLE: Optimizing Speculative Decoding via Explicit Modeling of Adaptive Draft Structures | Situo Zhang et.al. | 2412.18910 | null |
2024-12-25 | CoEvo: Continual Evolution of Symbolic Solutions Using Large Language Models | Ping Guo et.al. | 2412.18890 | link |
2024-12-25 | MotionMap: Representing Multimodality in Human Pose Forecasting | Reyhaneh Hosseininejad et.al. | 2412.18883 | null |
2024-12-25 | Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models | Meltem Aksoy et.al. | 2412.18863 | null |
2024-12-25 | Improving the Readability of Automatically Generated Tests using Large Language Models | Matteo Biagiola et.al. | 2412.18843 | null |
2024-12-25 | LoGFiLM: Fine-Tuning A Large Language Model for Automated Generation of Log Statements | Hao Zhang et.al. | 2412.18835 | null |
2024-12-25 | Structured Speaker-Deficiency Adaptation of Foundation Models for Dysarthric and Elderly Speech Recognition | Shujie Hu et.al. | 2412.18832 | null |
2024-12-25 | RapGuard: Safeguarding Multimodal Large Language Models via Rationale-aware Defensive Prompting | Yilei Jiang et.al. | 2412.18826 | null |
2024-12-25 | CausalTAD: Causal Implicit Generative Model for Debiased Online Trajectory Anomaly Detection | Wenbin Li et.al. | 2412.18820 | link |
2024-12-25 | LLM-assisted vector similarity search | Md Riyadh et.al. | 2412.18819 | null |
2024-12-25 | DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search | Lei Yang et.al. | 2412.18811 | null |
2024-12-25 | Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation | Xinkai Du et.al. | 2412.18800 | null |
2024-12-25 | Torque-Aware Momentum | Pranshu Malviya et.al. | 2412.18790 | null |
2024-12-25 | Attack-in-the-Chain: Bootstrapping Large Language Models for Attacks Against Black-box Neural Ranking Models | Yu-An Liu et.al. | 2412.18770 | link |
2024-12-25 | The Impact of Input Order Bias on Large Language Models for Software Fault Localization | Md Nakhla Rafi et.al. | 2412.18750 | null |
2024-12-24 | Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models | Zehan Wang et.al. | 2412.18605 | link |
2024-12-24 | Long-Form Speech Generation with Spoken Language Models | Se Jin Park et.al. | 2412.18603 | link |
2024-12-24 | Decentralized Intelligence in GameFi: Embodied AI Agents and the Convergence of DeFi and Virtual Ecosystems | Fernando Jia et.al. | 2412.18601 | link |
2024-12-24 | ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Hongjie Li et.al. | 2412.18600 | null |
2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
2024-12-24 | A Paragraph is All It Takes: Rich Robot Behaviors from Interacting, Trusted LLMs | OpenMind et.al. | 2412.18588 | null |
2024-12-24 | Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control | Sergey Sedov et.al. | 2412.18582 | null |
2024-12-24 | Zero-resource Speech Translation and Recognition with LLMs | Karel Mundnich et.al. | 2412.18566 | null |
2024-12-24 | Distilling Fine-grained Sentiment Understanding from Large Language Models | Yice Zhang et.al. | 2412.18552 | link |
2024-12-24 | Token-Budget-Aware LLM Reasoning | Tingxu Han et.al. | 2412.18547 | link |
2024-12-24 | PLD-Tree: Persistent Laplacian Decision Tree for Protein-Protein Binding Free Energy Prediction | Xingjian Xu et.al. | 2412.18541 | null |
2024-12-24 | Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Derong Xu et.al. | 2412.18537 | link |
2024-12-24 | Automated Code Review In Practice | Umut Cihan et.al. | 2412.18531 | null |
2024-12-24 | Large Language Model guided Deep Reinforcement Learning for Decision Making in Autonomous Driving | Hao Pang et.al. | 2412.18511 | null |
2024-12-24 | Think or Remember? Detecting and Directing LLMs Towards Memorization or Generalization | Yi-Fu Fu et.al. | 2412.18497 | null |
2024-12-24 | GeFL: Model-Agnostic Federated Learning with Generative Models | Honggu Kang et.al. | 2412.18460 | null |
2024-12-24 | 3DGraphLLM: Combining Semantic Graphs and Large Language Models for 3D Scene Understanding | Tatiana Zemskova et.al. | 2412.18450 | link |
2024-12-24 | Is Large Language Model Good at Triple Set Prediction? An Empirical Study | Yuan Yuan et.al. | 2412.18443 | null |
2024-12-24 | Gaussian entropic optimal transport: Schrödinger bridges and the Sinkhorn algorithm | O. Deniz Akyildiz et.al. | 2412.18432 | null |
2024-12-24 | GUI Testing Arena: A Unified Benchmark for Advancing Autonomous GUI Testing Agent | Kangjia Zhao et.al. | 2412.18426 | null |
2024-12-24 | Research on the Proximity Relationships of Psychosomatic Disease Knowledge Graph Modules Extracted by Large Language Models | Zihan Zhou et.al. | 2412.18419 | null |
2024-12-24 | Muse: A Multimodal Conversational Recommendation Dataset with Scenario-Grounded User Profiles | Zihan Wang et.al. | 2412.18416 | null |
2024-12-24 | Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English | Avinash Anand et.al. | 2412.18415 | link |
2024-12-24 | Discovery of 2D Materials via Symmetry-Constrained Diffusion Model | Shihang Xu et.al. | 2412.18414 | null |
2024-12-24 | A Statistical Framework for Ranking LLM-Based Chatbots | Siavash Ameli et.al. | 2412.18407 | link |
2024-12-24 | Extract Free Dense Misalignment from CLIP | JeongYeon Nam et.al. | 2412.18404 | link |
2024-12-24 | RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction | Wu Xiaoping et.al. | 2412.18390 | null |
2024-12-24 | MR-COGraphs: Communication-efficient Multi-Robot Open-vocabulary Mapping System via 3D Scene Graphs | Qiuyi Gu et.al. | 2412.18381 | null |
2024-12-24 | Defining and Detecting the Defects of the Large Language Model-based Autonomous Agents | Kaiwen Ning et.al. | 2412.18371 | link |
2024-12-24 | Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering | Zhongjian Hu et.al. | 2412.18351 | null |
2024-12-24 | M-Ped: Multi-Prompt Ensemble Decoding for Large Language Models | Jiaxin Guo et.al. | 2412.18299 | null |
2024-12-24 | Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight | Xi Ding et.al. | 2412.18298 | link |
2024-12-24 | Pirates of the RAG: Adaptively Attacking LLMs to Leak Knowledge Bases | Christian Di Maio et.al. | 2412.18295 | null |
2024-12-24 | DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation | Junyi Lu et.al. | 2412.18291 | null |
2024-12-24 | Improved Feature Generating Framework for Transductive Zero-shot Learning | Zihan Ye et.al. | 2412.18282 | null |
2024-12-24 | GDM4MMIMO: Generative Diffusion Models for Massive MIMO Communications | Zhenzhou Jin et.al. | 2412.18281 | null |
2024-12-24 | Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization | Jiacai Liu et.al. | 2412.18279 | null |
2024-12-24 | GenAI Content Detection Task 2: AI vs. Human -- Academic Essay Authenticity Challenge | Shammur Absar Chowdhury et.al. | 2412.18274 | null |
2024-12-24 | Annotating References to Mythological Entities in French Literature | Thierry Poibeau et.al. | 2412.18270 | null |
2024-12-24 | Investigating Large Language Models for Code Vulnerability Detection: An Experimental Study | Xuefeng Jiang et.al. | 2412.18260 | link |
2024-12-24 | AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction | Pufan Zou et.al. | 2412.18255 | null |
2024-12-24 | An Automatic Graph Construction Framework based on Large Language Models for Recommendation | Rong Shan et.al. | 2412.18241 | link |
2024-12-24 | Combining GPT and Code-Based Similarity Checking for Effective Smart Contract Vulnerability Detection | Jango Zhang et.al. | 2412.18225 | null |
2024-12-24 | Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Peijin Xie et.al. | 2412.18224 | link |
2024-12-24 | ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation | Mengyang Wu et.al. | 2412.18216 | link |
2024-12-24 | Adapting Large Language Models for Improving TCP Fairness over WiFi | Shyam Kumar Shrestha et.al. | 2412.18200 | null |
2024-12-24 | Robustness-aware Automatic Prompt Optimization | Zeru Shi et.al. | 2412.18196 | link |
2024-12-24 | VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks | Shiduo Zhang et.al. | 2412.18194 | null |
2024-12-24 | TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization | Yucong Luo et.al. | 2412.18185 | null |
2024-12-24 | Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation | Yucong Luo et.al. | 2412.18176 | null |
2024-12-24 | INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent | Haohang Li et.al. | 2412.18174 | null |
2024-12-24 | Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models | Xiaomeng Hu et.al. | 2412.18171 | null |
2024-12-24 | KunServe: Elastic and Efficient Large Language Model Serving with Parameter-centric Memory Management | Rongxin Cheng et.al. | 2412.18169 | null |
2024-12-24 | Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence | Yinbin Han et.al. | 2412.18164 | null |
2024-12-24 | VISION: A Modular AI Assistant for Natural Human-Instrument Interaction at Scientific User Facilities | Shray Mathur et.al. | 2412.18161 | null |
2024-12-24 | Semantics Disentanglement and Composition for Versatile Codec toward both Human-eye Perception and Machine Vision Task | Jinming Liu et.al. | 2412.18158 | null |
2024-12-24 | Smooth-Foley: Creating Continuous Sound for Video-to-Audio Generation Under Semantic Guidance | Yaoyun Zhang et.al. | 2412.18157 | null |
2024-12-24 | scReader: Prompting Large Language Models to Interpret scRNA-seq Data | Cong Li et.al. | 2412.18156 | null |
2024-12-24 | GeneSUM: Large Language Model-based Gene Summary Extraction | Zhijian Chen et.al. | 2412.18154 | null |
2024-12-24 | CoAM: Corpus of All-Type Multiword Expressions | Yusuke Ide et.al. | 2412.18151 | null |
2024-12-24 | EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation | Shuhao Han et.al. | 2412.18150 | link |
2024-12-24 | Dense-Face: Personalized Face Generation Model via Dense Annotation Prediction | Xiao Guo et.al. | 2412.18149 | null |
2024-12-24 | Ensuring Consistency for In-Image Translation | Chengpeng Fu et.al. | 2412.18139 | null |
2024-12-24 | LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment | Binrui Zeng et.al. | 2412.18135 | null |
2024-12-24 | VisionLLM-based Multimodal Fusion Network for Glottic Carcinoma Early Detection | Zhaohui Jin et.al. | 2412.18124 | null |
2024-12-24 | AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation | Hao Wen et.al. | 2412.18116 | null |
2024-12-24 | AIGT: AI Generative Table Based on Prompt | Mingming Zhang et.al. | 2412.18111 | null |
2024-12-24 | SlimGPT: Layer-wise Structured Pruning for Large Language Models | Gui Ling et.al. | 2412.18110 | null |
2024-12-24 | Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach | Jing Bi et.al. | 2412.18108 | null |
2024-12-24 | Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels | Mingcong Song et.al. | 2412.18106 | null |
2024-12-24 | EvoPat: A Multi-LLM-based Patents Summarization and Analysis Agent | Suyuan Wang et.al. | 2412.18100 | null |
2024-12-24 | Real-world Deployment and Evaluation of PErioperative AI CHatbot (PEACH) -- a Large Language Model Chatbot for Perioperative Medicine | Yu He Ke et.al. | 2412.18096 | null |
2024-12-24 | Molly: Making Large Language Model Agents Solve Python Problem More Logically | Rui Xiao et.al. | 2412.18093 | null |
2024-12-24 | Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner | Aizierjiang Aiersilan et.al. | 2412.18086 | link |
2024-12-24 | Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models | Xuan Lin et.al. | 2412.18084 | link |
2024-12-24 | Improving Factuality with Explicit Working Memory | Mingda Chen et.al. | 2412.18069 | null |
2024-12-24 | LMRPA: Large Language Model-Driven Efficient Robotic Process Automation for OCR | Osama Hosam Abdellaif et.al. | 2412.18063 | link |
2024-12-24 | Lla-VAP: LSTM Ensemble of Llama and VAP for Turn-Taking Prediction | Hyunbae Jeon et.al. | 2412.18061 | null |
2024-12-24 | An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM | Wen Wen et.al. | 2412.18060 | null |
2024-12-23 | Factuality or Fiction? Benchmarking Modern LLMs on Ambiguous QA with Citations | Maya Patel et.al. | 2412.18051 | null |
2024-12-23 | AA-SGAN: Adversarially Augmented Social GAN with Synthetic Data | Mirko Zaffaroni et.al. | 2412.18038 | link |
2024-12-23 | Generating refactored code accurately using reinforcement learning | Indranil Palit et.al. | 2412.18035 | null |
2024-12-23 | More than Chit-Chat: Developing Robots for Small-Talk Interactions | Rebecca Ramnauth et.al. | 2412.18023 | null |
2024-12-23 | Trustworthy and Efficient LLMs Meet Databases | Kyoungmin Kim et.al. | 2412.18022 | null |
2024-12-23 | StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs | Hailin Chen et.al. | 2412.18011 | null |
2024-12-23 | CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models | Ruibo Tu et.al. | 2412.17970 | link |
2024-12-23 | LMV-RPA: Large Model Voting-based Robotic Process Automation | Osama Abdellatif et.al. | 2412.17965 | link |
2024-12-23 | Dynamic Multi-Agent Orchestration and Retrieval for Multi-Source Question-Answer Systems using Large Language Models | Antony Seabra et.al. | 2412.17964 | null |
2024-12-23 | Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models | Ge Zhang et.al. | 2412.17963 | null |
2024-12-23 | Contrato360 2.0: A Document and Database-Driven Question-Answer System using Large Language Models and Agents | Antony Seabra et.al. | 2412.17942 | null |
2024-12-23 | BenCzechMark: A Czech-centric Multitask and Multimetric Benchmark for Large Language Models with Duel Scoring Mechanism | Martin Fajcik et.al. | 2412.17933 | null |
2024-12-23 | Causal Composition Diffusion Model for Closed-loop Traffic Generation | Haohong Lin et.al. | 2412.17920 | null |
2024-12-23 | Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning | Orson Mengara et.al. | 2412.17908 | null |
2024-12-23 | LLM-Driven Feedback for Enhancing Conceptual Design Learning in Database Systems Courses | Sara Riazi et.al. | 2412.17892 | null |
2024-12-23 | ChatGarment: Garment Estimation, Generation and Editing via Large Language Models | Siyuan Bian et.al. | 2412.17811 | null |
2024-12-23 | Reconstructing People, Places, and Cameras | Lea Müller et.al. | 2412.17806 | null |
2024-12-23 | Automating the Search for Artificial Life with Foundation Models | Akarsh Kumar et.al. | 2412.17799 | link |
2024-12-23 | ResearchTown: Simulator of Human Research Community | Haofei Yu et.al. | 2412.17767 | link |
2024-12-23 | ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback | Wei Zhang et.al. | 2412.17754 | null |
2024-12-23 | Deliberation in Latent Space via Differentiable Cache Augmentation | Luyang Liu et.al. | 2412.17747 | null |
2024-12-23 | YuLan-Mini: An Open Data-efficient Language Model | Yiwen Hu et.al. | 2412.17743 | link |
2024-12-23 | Reasoning to Attend: Try to Understand How Token Works | Rui Qian et.al. | 2412.17741 | link |
2024-12-23 | Knowledge Editing through Chain-of-Thought | Changyue Wang et.al. | 2412.17727 | link |
2024-12-23 | Understanding the Logic of Direct Preference Alignment through Logic | Kyle Richardson et.al. | 2412.17696 | null |
2024-12-23 | Large Language Model Safety: A Holistic Survey | Dan Shi et.al. | 2412.17686 | link |
2024-12-23 | A Bias-Free Training Paradigm for More General AI-generated Image Detection | Fabrizio Guillaro et.al. | 2412.17671 | null |
2024-12-23 | Generating Completions for Fragmented Broca's Aphasic Sentences Using Large Language Models | Sijbren van Vaals et.al. | 2412.17669 | link |
2024-12-23 | Detecting anxiety and depression in dialogues: a multi-label and explainable approach | Francisco de Arriba-Pérez et.al. | 2412.17651 | null |
2024-12-23 | SCBench: A Sports Commentary Benchmark for Video LLMs | Kuangzhi Ge et.al. | 2412.17637 | null |
2024-12-23 | ANID: How Far Are We? Evaluating the Discrepancies Between AI-synthesized Images and Natural Images through Multimodal Guidance | Renyang Liu et.al. | 2412.17632 | link |
2024-12-23 | Tracking the Feature Dynamics in LLM Training: A Mechanistic Study | Yang Xu et.al. | 2412.17626 | null |
2024-12-23 | Be More Diverse than the Most Diverse: Online Selection of Diverse Mixtures of Generative Models | Parham Rezaei et.al. | 2412.17622 | link |
2024-12-23 | Emerging Security Challenges of Large Language Models | Herve Debar et.al. | 2412.17614 | null |
2024-12-23 | Towards Foundation Models on Graphs: An Analysis on Cross-Dataset Transfer of Pretrained GNNs | Fabrizio Frasca et.al. | 2412.17609 | null |
2024-12-23 | EasyTime: Time Series Forecasting Made Easy | Xiangfei Qiu et.al. | 2412.17603 | null |
2024-12-23 | LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context | Kai Ruan et.al. | 2412.17596 | link |
2024-12-23 | Leveraging Memory Retrieval to Enhance LLM-based Generative Recommendation | Chengbing Wang et.al. | 2412.17593 | null |
2024-12-23 | HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data | Ting Zhou et.al. | 2412.17574 | link |
2024-12-23 | S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field | Zixi Liang et.al. | 2412.17561 | link |
2024-12-23 | GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Chao Zeng et.al. | 2412.17560 | null |
2024-12-23 | A Survey of Query Optimization in Large Language Models | Mingyang Song et.al. | 2412.17558 | null |
2024-12-23 | Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing | Prakash Aryan et.al. | 2412.17548 | link |
2024-12-23 | Retention Score: Quantifying Jailbreak Risks for Vision Language Models | Zaitang Li et.al. | 2412.17544 | null |
2024-12-23 | Constructing Fair Latent Space for Intersection of Fairness and Explainability | Hyungjun Joo et.al. | 2412.17523 | null |
2024-12-23 | DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak | Hao Wang et.al. | 2412.17522 | null |
2024-12-23 | Improving the Noise Estimation of Latent Neural Stochastic Differential Equations | Linus Heck et.al. | 2412.17499 | null |
2024-12-23 | Is ChatGPT Massively Used by Students Nowadays? A Survey on the Use of Large Language Models such as ChatGPT in Educational Settings | Jérémie Sublime et.al. | 2412.17486 | null |
2024-12-23 | Power- and Fragmentation-aware Online Scheduling for GPU Datacenters | Francesco Lettich et.al. | 2412.17484 | link |
2024-12-23 | A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression | Chenlong Deng et.al. | 2412.17483 | null |
2024-12-23 | A Survey on Multi-Generative Agent System: Recent Advances and New Frontiers | Shuaihang Chen et.al. | 2412.17481 | link |
2024-12-23 | CALLIC: Content Adaptive Learning for Lossless Image Compression | Daxin Li et.al. | 2412.17464 | null |
2024-12-23 | Developmental Predictive Coding Model for Early Infancy Mono and Bilingual Vocal Continual Learning | Xiaodan Chen et.al. | 2412.17456 | null |
2024-12-23 | Applying LLM and Topic Modelling in Psychotherapeutic Contexts | Alexander Vanin et.al. | 2412.17449 | null |
2024-12-23 | Measuring Contextual Informativeness in Child-Directed Text | Maria Valentini et.al. | 2412.17427 | link |
2024-12-23 | Multimodal Preference Data Synthetic Alignment with Reward Model | Robert Wijaya et.al. | 2412.17417 | link |
2024-12-23 | VidCtx: Context-aware Video Question Answering with Image Models | Andreas Goulas et.al. | 2412.17415 | null |
2024-12-23 | Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance | Muhammad Reza Qorib et.al. | 2412.17408 | link |
2024-12-23 | Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning | Huchen Jiang et.al. | 2412.17397 | null |
2024-12-23 | WarriorCoder: Learning from Expert Battles to Augment Code Large Language Models | Huawen Feng et.al. | 2412.17395 | null |
2024-12-23 | Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement | Hyeonjin Kim et.al. | 2412.17387 | link |
2024-12-23 | Interweaving Memories of a Siamese Large Language Model | Xin Song et.al. | 2412.17383 | link |
2024-12-23 | MineAgent: Towards Remote-Sensing Mineral Exploration with Multimodal Large Language Models | Beibei Yu et.al. | 2412.17339 | null |
2024-12-23 | A Dual-Perspective Metaphor Detection Framework Using Large Language Models | Yujie Lin et.al. | 2412.17332 | link |
2024-12-23 | Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit Distance | Nicolas Devatine et.al. | 2412.17321 | null |
2024-12-23 | CodeV: Issue Resolving with Visual Data | Linhao Zhang et.al. | 2412.17315 | link |
2024-12-23 | Prompting in the Wild: An Empirical Study of Prompt Evolution in Software Repositories | Mahan Tafreshipour et.al. | 2412.17298 | null |
2024-12-23 | Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples | Taewoong Kim et.al. | 2412.17288 | link |
2024-12-23 | LLM4AD: A Platform for Algorithm Design with Large Language Model | Fei Liu et.al. | 2412.17287 | link |
2024-12-23 | Enabling Time-series Foundation Model for Building Energy Forecasting via Contrastive Curriculum Learning | Rui Liang et.al. | 2412.17285 | null |
2024-12-23 | Unlocking Cross-Lingual Sentiment Analysis through Emoji Interpretation: A Multimodal Generative AI Approach | Rafid Ishrak Jahan et.al. | 2412.17255 | link |
2024-12-23 | SyNeg: LLM-Driven Synthetic Hard-Negatives for Dense Retrieval | Xiaopeng Li et.al. | 2412.17250 | null |
2024-12-23 | EM-MIAs: Enhancing Membership Inference Attacks in Large Language Models through Ensemble Modeling | Zichen Song et.al. | 2412.17249 | null |
2024-12-23 | On the Generalization Ability of Machine-Generated Text Detectors | Yule Liu et.al. | 2412.17242 | link |
2024-12-23 | Brain-to-Text Benchmark '24: Lessons Learned | Francis R. Willett et.al. | 2412.17227 | link |
2024-12-23 | CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder | Lichen Ma et.al. | 2412.17225 | null |
2024-12-22 | Better Think with Tables: Leveraging Tables to Enhance Large Language Model Comprehension | Jio Oh et.al. | 2412.17189 | null |
2024-12-22 | Foundation Model for Lossy Compression of Spatiotemporal Scientific Data | Xiao Li et.al. | 2412.17184 | null |
2024-12-22 | Enhancing Item Tokenization for Generative Recommendation through Self-Improvement | Runjin Chen et.al. | 2412.17171 | null |
2024-12-22 | Generative Diffusion Modeling: A Practical Handbook | Zihan Ding et.al. | 2412.17162 | null |
2024-12-22 | LLM-based relevance assessment still can't replace human relevance assessment | Charles L. A. Clarke et.al. | 2412.17156 | null |
2024-12-22 | LLM Agent for Fire Dynamics Simulations | Leidong Xu et.al. | 2412.17146 | null |
2024-12-22 | Hate Speech Detection and Target Identification in Devanagari Languages via Parameter Efficient Fine-Tuning of LLMs | Rushendra Sidibomma et.al. | 2412.17131 | null |
2024-12-22 | Lies, Damned Lies, and Distributional Language Statistics: Persuasion and Deception with Large Language Models | Cameron R. Jones et.al. | 2412.17128 | null |
2024-12-22 | Learning to Adapt to Low-Resource Paraphrase Generation | Zhigen Li et.al. | 2412.17111 | null |
2024-12-22 | DreamOmni: Unified Image Generation and Editing | Bin Xia et.al. | 2412.17098 | null |
2024-12-22 | Analysis on LLMs Performance for Code Summarization | Md. Ahnaf Akib et.al. | 2412.17094 | null |
2024-12-22 | SAIL: Sample-Centric In-Context Learning for Document Information Extraction | Jinyu Zhang et.al. | 2412.17092 | link |
2024-12-22 | SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults | Jinzhi Wang et.al. | 2412.17077 | null |
2024-12-22 | The HalluRAG Dataset: Detecting Closed-Domain Hallucinations in RAG Applications Using an LLM's Internal States | Fabian Ridder et.al. | 2412.17056 | link |
2024-12-22 | DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately | Huiwen Wu et.al. | 2412.17053 | null |
2024-12-22 | ViLBias: A Framework for Bias Detection using Linguistic and Visual Cues | Shaina Raza et.al. | 2412.17052 | link |
2024-12-22 | Modular Conversational Agents for Surveys and Interviews | Jiangbo Yu et.al. | 2412.17049 | null |
2024-12-22 | Why Do Speech Language Models Fail to Generate Semantically Coherent Outputs? A Modality Evolving Perspective | Hankun Wang et.al. | 2412.17048 | null |
2024-12-22 | Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation | Luoxu Jin et.al. | 2412.17042 | null |
2024-12-22 | HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories | Eric Hedlin et.al. | 2412.17040 | null |
2024-12-22 | Shadow-Frugal Expectation-Value-Sampling Variational Quantum Generative Model | Kevin Shen et.al. | 2412.17039 | null |
2024-12-22 | Shaping the Safety Boundaries: Understanding and Defending Against Jailbreaks in Large Language Models | Lang Gao et.al. | 2412.17034 | null |
2024-12-22 | MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge | Jie He et.al. | 2412.17032 | null |
2024-12-22 | FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos | Zhengqian Wu et.al. | 2412.17022 | link |
2024-12-22 | GAS: Generative Auto-bidding with Post-training Search | Yewen Li et.al. | 2412.17018 | null |
2024-12-22 | Robustness of Large Language Models Against Adversarial Attacks | Yiyi Tao et.al. | 2412.17011 | null |
2024-12-22 | InterDance: Reactive 3D Dance Generation with Realistic Duet Interactions | Ronghui Li et.al. | 2412.16982 | null |
2024-12-22 | On Fusing ChatGPT and Ensemble Learning in Discontinuous Named Entity Recognition in Health Corpora | Tzu-Chieh Chen et.al. | 2412.16976 | null |
2024-12-22 | Cannot or Should Not? Automatic Analysis of Refusal Composition in IFT/RLHF Datasets and Refusal Behavior of Black-Box LLMs | Alexander von Recum et.al. | 2412.16974 | null |
2024-12-22 | Multifaceted User Modeling in Recommendation: A Federated Foundation Models Approach | Chunxu Zhang et.al. | 2412.16969 | link |
2024-12-22 | System-2 Mathematical Reasoning via Enriched Instruction Tuning | Huanqia Cai et.al. | 2412.16964 | null |
2024-12-22 | Aristotle: Mastering Logical Reasoning with A Logic-Complete Decompose-Search-Resolve Framework | Jundong Xu et.al. | 2412.16953 | null |
2024-12-22 | A Career Interview Dialogue System using Large Language Model-based Dynamic Slot Generation | Ekai Hashimoto et.al. | 2412.16943 | null |
2024-12-22 | Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering | Zhongjian Hu et.al. | 2412.16936 | null |
2024-12-22 | Towards a Unified Paradigm: Integrating Recommendation Systems as a New Language in Large Models | Kai Zheng et.al. | 2412.16933 | null |
2024-12-22 | Enhancing Supply Chain Transparency in Emerging Economies Using Online Contents and LLMs | Bohan Jin et.al. | 2412.16922 | null |
2024-12-22 | Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection | Yuhang Gan et.al. | 2412.16918 | null |
2024-12-22 | Self-Corrected Flow Distillation for Consistent One-Step and Few-Step Text-to-Image Generation | Quan Dao et.al. | 2412.16906 | null |
2024-12-22 | Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model | Songjun Tu et.al. | 2412.16878 | link |
2024-12-20 | HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding | Chenxin Tao et.al. | 2412.16158 | null |
2024-12-20 | Can Generative Video Models Help Pose Estimation? | Ruojin Cai et.al. | 2412.16155 | null |
2024-12-20 | Offline Reinforcement Learning for LLM Multi-Step Reasoning | Huaijie Wang et.al. | 2412.16145 | link |
2024-12-20 | Can LLMs Obfuscate Code? A Systematic Analysis of Large Language Models into Assembly Code Obfuscation | Seyedreza Mohseni et.al. | 2412.16135 | null |
2024-12-20 | Data-Driven Mechanism Design: Jointly Eliciting Preferences and Information | Dirk Bergemann et.al. | 2412.16132 | null |
2024-12-20 | PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics | Daniil Larionov et.al. | 2412.16120 | null |
2024-12-20 | Deciphering the Underserved: Benchmarking LLM OCR for Low-Resource Scripts | Muhammad Abdullah Sohail et.al. | 2412.16119 | link |
2024-12-20 | PruneVid: Visual Token Pruning for Efficient Video Large Language Models | Xiaohu Huang et.al. | 2412.16117 | link |
2024-12-20 | The Content Moderator's Dilemma: Removal of Toxic Content and Distortions to Online Discourse | Mahyar Habibi et.al. | 2412.16114 | null |
2024-12-20 | Logical Consistency of Large Language Models in Fact-checking | Bishwamittra Ghosh et.al. | 2412.16100 | null |
2024-12-20 | The Evolution of LLM Adoption in Industry Data Curation Practices | Crystal Qian et.al. | 2412.16089 | null |
2024-12-20 | Efficient MedSAMs: Segment Anything in Medical Images on Laptop | Jun Ma et.al. | 2412.16085 | link |
2024-12-20 | Formal Mathematical Reasoning: A New Frontier in AI | Kaiyu Yang et.al. | 2412.16075 | null |
2024-12-20 | The Only Way is Ethics: A Guide to Ethical Research with Large Language Models | Eddie L. Ungless et.al. | 2412.16022 | link |
2024-12-20 | Legommenders: A Comprehensive Content-Based Recommendation Library with LLM Support | Qijiong Liu et.al. | 2412.15973 | link |
2024-12-20 | From General to Specific: Tailoring Large Language Models for Personalized Healthcare | Ruize Shi et.al. | 2412.15957 | null |
2024-12-20 | Trust Calibration in IDEs: Paving the Way for Widespread Adoption of AI Refactoring | Markus Borg et.al. | 2412.15948 | null |
2024-12-20 | Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation | Gautier Evennou et.al. | 2412.15939 | link |
2024-12-20 | Large Language Model assisted Hybrid Fuzzing | Ruijie Meng et.al. | 2412.15931 | null |
2024-12-20 | MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection | Andrea Moglia et.al. | 2412.15925 | link |
2024-12-20 | RiTTA: Modeling Event Relations in Text-to-Audio Generation | Yuhang He et.al. | 2412.15922 | link |
2024-12-20 | Less is More: Towards Green Code Large Language Models via Unified Structural Pruning | Guang Yang et.al. | 2412.15921 | null |
2024-12-20 | Development of a Large-scale Dataset of Chest Computed Tomography Reports in Japanese and a High-performance Finding Classification Model | Yosuke Yamagishi et.al. | 2412.15907 | null |
2024-12-20 | Evaluation of Reliability Criteria for News Publishers with Large Language Models | Manuel Pratelli et.al. | 2412.15896 | null |
2024-12-20 | TelcoLM: collecting data, adapting, and benchmarking language models for the telecommunication domain | Camille Barboule et.al. | 2412.15891 | null |
2024-12-20 | AI-in-the-loop: The future of biomedical visual analytics applications in the era of AI | Katja Bühler et.al. | 2412.15876 | null |
2024-12-20 | Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback | Jiaming Ji et.al. | 2412.15838 | link |
2024-12-20 | WebLLM: A High-Performance In-Browser LLM Inference Engine | Charlie F. Ruan et.al. | 2412.15803 | link |
2024-12-20 | Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | Sungjin Park et.al. | 2412.15797 | null |
2024-12-20 | GraphSeqLM: A Unified Graph Language Framework for Omic Graph Learning | Heming Zhang et.al. | 2412.15790 | null |
2024-12-20 | Linguistic Features Extracted by GPT-4 Improve Alzheimer's Disease Detection based on Spontaneous Speech | Jonathan Heitz et.al. | 2412.15772 | link |
2024-12-20 | Extracting Interpretable Task-Specific Circuits from Large Language Models for Faster Inference | Jorge García-Carrasco et.al. | 2412.15750 | link |
2024-12-20 | Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models | Shamus Sim et.al. | 2412.15748 | null |
2024-12-20 | VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models | Dexter Neo et.al. | 2412.15739 | null |
2024-12-20 | AutoLife: Automatic Life Journaling with Smartphones and LLMs | Huatao Xu et.al. | 2412.15714 | null |
2024-12-20 | Contrastive Learning for Task-Independent SpeechLLM-Pretraining | Maike Züfle et.al. | 2412.15712 | link |
2024-12-20 | Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback | Niklas Ippisch et.al. | 2412.15702 | null |
2024-12-20 | Code Review Automation Via Multi-task Federated LLM -- An Empirical Study | Jahnavi Kumar et.al. | 2412.15676 | null |
2024-12-20 | Adaptable and Precise: Enterprise-Scenario LLM Function-Calling Capability Training Pipeline | Guancheng Zeng et.al. | 2412.15660 | null |
2024-12-20 | Synthetic Tabular Data Generation for Imbalanced Classification: The Surprising Effectiveness of an Overlap Class | Annie D'souza et.al. | 2412.15657 | null |
2024-12-20 | MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula | Sieun Hyeon et.al. | 2412.15655 | link |
2024-12-20 | Beyond Human Data: Aligning Multimodal Large Language Models by Iterative Self-Evolution | Wentao Tan et.al. | 2412.15650 | null |
2024-12-20 | Darkit: A User-Friendly Software Toolkit for Spiking Large Language Model | Xin Du et.al. | 2412.15634 | link |
2024-12-20 | Can Input Attributions Interpret the Inductive Reasoning Process Elicited in In-Context Learning? | Mengyu Ye et.al. | 2412.15628 | null |
2024-12-20 | JailPO: A Novel Black-box Jailbreak Framework via Preference Optimization against Aligned LLMs | Hongyi Li et.al. | 2412.15623 | null |
2024-12-20 | Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage | Zhi Gao et.al. | 2412.15606 | null |
2024-12-20 | Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks | Brian J Chan et.al. | 2412.15605 | link |
2024-12-20 | Dynamic Label Name Refinement for Few-Shot Dialogue Intent Classification | Gyutae Park et.al. | 2412.15603 | null |
2024-12-20 | Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation | Xiaoqiang Kang et.al. | 2412.15594 | link |
2024-12-20 | NeSyCoCo: A Neuro-Symbolic Concept Composer for Compositional Generalization | Danial Kamali et.al. | 2412.15588 | link |
2024-12-20 | To Rely or Not to Rely? Evaluating Interventions for Appropriate Reliance on Large Language Models | Jessica Y. Bo et.al. | 2412.15584 | null |
2024-12-20 | A Deep Probabilistic Framework for Continuous Time Dynamic Graph Generation | Ryien Hosseini et.al. | 2412.15582 | null |
2024-12-20 | Score-based Generative Diffusion Models for Social Recommendations | Chengyi Liu et.al. | 2412.15579 | link |
2024-12-20 | QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning | Xinyang Tong et.al. | 2412.15576 | null |
2024-12-20 | J-EDI QA: Benchmark for deep-sea organism-specific multimodal LLM | Takero Yoshida et.al. | 2412.15574 | null |
2024-12-20 | Continual Learning Using a Kernel-Based Method Over Foundation Models | Saleh Momeni et.al. | 2412.15571 | link |
2024-12-20 | DefFiller: Mask-Conditioned Diffusion for Salient Steel Surface Defect Generation | Yichun Tai et.al. | 2412.15570 | link |
2024-12-20 | In-context Continual Learning Assisted by an External Continual Learner | Saleh Momeni et.al. | 2412.15563 | null |
2024-12-20 | NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional Reasoning | Zheyuan Zhang et.al. | 2412.15547 | null |
2024-12-20 | MRAG: A Modular Retrieval Framework for Time-Sensitive Question Answering | Zhang Siyue et.al. | 2412.15540 | null |
2024-12-20 | XRAG: eXamining the Core -- Benchmarking Foundational Components in Advanced Retrieval-Augmented Generation | Qianren Mao et.al. | 2412.15529 | link |
2024-12-20 | HREF: Human Response-Guided Evaluation of Instruction Following in Language Models | Xinxi Lyu et.al. | 2412.15524 | link |
2024-12-20 | PreNeT: Leveraging Computational Features to Predict Deep Neural Network Training Time | Alireza Pourali et.al. | 2412.15519 | link |
2024-12-20 | Stylish and Functional: Guided Interpolation Subject to Physical Constraints | Yan-Ying Chen et.al. | 2412.15507 | null |
2024-12-20 | Mitigating Social Bias in Large Language Models: A Multi-Objective Approach within a Multi-Agent Framework | Zhenjie Xu et.al. | 2412.15504 | link |
2024-12-20 | Humanlike Cognitive Patterns as Emergent Phenomena in Large Language Models | Zhisheng Tang et.al. | 2412.15501 | null |
2024-12-20 | TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use | Junjie Ye et.al. | 2412.15495 | link |
2024-12-20 | PolySmart and VIREO @ TRECVid 2024 Ad-hoc Video Search | Jiaxin Wu et.al. | 2412.15494 | null |
2024-12-20 | GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators | Hengjia Li et.al. | 2412.15491 | null |
2024-12-20 | Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | Saehyung Lee et.al. | 2412.15484 | null |
2024-12-20 | Continual Learning Using Only Large Language Model Prompting | Jiabao Qiu et.al. | 2412.15479 | null |
2024-12-19 | TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models | Ammar N. Abbas et.al. | 2412.15462 | null |
2024-12-19 | Northeastern Uni at Multilingual Counterspeech Generation: Enhancing Counter Speech Generation with LLM Alignment through Direct Preference Optimization | Sahil Wadhwa et.al. | 2412.15453 | null |
2024-12-19 | AI-Enhanced Sensemaking: Exploring the Design of a Generative AI-Based Assistant to Support Genetic Professionals | Angela Mastrianni et.al. | 2412.15444 | null |
2024-12-19 | SKETCH: Structured Knowledge Enhanced Text Comprehension for Holistic Retrieval | Aakash Mahalingam et.al. | 2412.15443 | null |
2024-12-19 | Time Will Tell: Timing Side Channels via Output Token Count in Large Language Models | Tianchen Zhang et.al. | 2412.15431 | null |
2024-12-19 | MoEtion: Efficient and Reliable Checkpointing for Mixture-of-Experts Models at Scale | Swapnil Gandhi et.al. | 2412.15411 | null |
2024-12-19 | Deciphering Social Behaviour: a Novel Biological Approach For Social Users Classification | Edoardo Allegrini et.al. | 2412.15410 | null |
2024-12-19 | Systematic Evaluation of Long-Context LLMs on Financial Concepts | Lavanya Gupta et.al. | 2412.15386 | null |
2024-12-19 | Automatic Extraction of Metaphoric Analogies from Literary Texts: Task Formulation, Dataset Construction, and Evaluation | Joanne Boisson et.al. | 2412.15375 | link |
2024-12-19 | Automated Root Cause Analysis System for Complex Data Products | Mathieu Demarne et.al. | 2412.15374 | null |
2024-12-19 | Large Language Models on Small Resource-Constrained Systems: Performance Characterization, Analysis and Trade-offs | Liam Seymour et.al. | 2412.15352 | link |
2024-12-19 | Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models | Reza Shirkavand et.al. | 2412.15341 | null |
2024-12-19 | Complete background cosmology of parity-even quadratic metric-affine gravity | Thomas Dyer et.al. | 2412.15329 | null |
2024-12-19 | OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving | Shuo Xing et.al. | 2412.15208 | link |
2024-12-19 | MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark | Qihao Zhao et.al. | 2412.15194 | link |
2024-12-19 | LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation | Weijia Shi et.al. | 2412.15188 | null |
2024-12-19 | Tiled Diffusion | Or Madar et.al. | 2412.15185 | null |
2024-12-19 | Data for Mathematical Copilots: Better Ways of Presenting Proofs for Machine Learning | Simon Frieder et.al. | 2412.15184 | null |
2024-12-19 | STRAP: Robot Sub-Trajectory Retrieval for Augmented Policy Learning | Marius Memmel et.al. | 2412.15182 | null |
2024-12-19 | HPC-Coder-V2: Studying Code LLMs Across Low-Resource Parallel Languages | Aman Chaturvedi et.al. | 2412.15178 | null |
2024-12-19 | Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying | Federico Castagna et.al. | 2412.15177 | link |
2024-12-19 | Rethinking Uncertainty Estimation in Natural Language Generation | Lukas Aichberger et.al. | 2412.15176 | null |
2024-12-19 | Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Yatai Ji et.al. | 2412.15156 | link |
2024-12-19 | Language Models as Continuous Self-Evolving Data Engineers | Peidong Wang et.al. | 2412.15151 | null |
2024-12-19 | Jet: A Modern Transformer-Based Normalizing Flow | Alexander Kolesnikov et.al. | 2412.15129 | null |
2024-12-19 | Adaptive Pruning for Large Language Models with Structural Importance Awareness | Haotian Zheng et.al. | 2412.15127 | null |
2024-12-19 | Outcome-Refining Process Supervision for Code Generation | Zhuohao Yu et.al. | 2412.15118 | link |
2024-12-19 | Qwen2.5 Technical Report | Qwen et.al. | 2412.15115 | link |
2024-12-19 | Associative memory inspires improvements for in-context learning using a novel attention residual stream architecture | Thomas F Burns et.al. | 2412.15113 | link |
2024-12-19 | Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation | Yang Tian et.al. | 2412.15109 | link |
2024-12-19 | Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability | Xiangsen Chen et.al. | 2412.15101 | null |
2024-12-19 | Nano-ESG: Extracting Corporate Sustainability Information from News Articles | Fabian Billert et.al. | 2412.15093 | link |
2024-12-19 | Learning Disentangled Equivariant Representation for Explicitly Controllable 3D Molecule Generation | Haoran Liu et.al. | 2412.15086 | null |
2024-12-19 | ScamChatBot: An End-to-End Analysis of Fake Account Recovery on Social Media via Chatbots | Bhupendra Acharya et.al. | 2412.15072 | null |
2024-12-19 | ConfliBERT: A Language Model for Political Conflict | Patrick T. Brandt et.al. | 2412.15060 | link |
2024-12-19 | LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps | Felix Friedrich et.al. | 2412.15035 | null |
2024-12-19 | DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space | Mang Ning et.al. | 2412.15032 | link |
2024-12-19 | Large Language Models and Code Security: A Systematic Literature Review | Enna Basic et.al. | 2412.15004 | null |
2024-12-19 | HSEvo: Elevating Automatic Heuristic Design with Diversity-Driven Harmony Search and Genetic Algorithm Using LLMs | Pham Vu Tuan Dat et.al. | 2412.14995 | link |
2024-12-19 | RoboCup@Home 2024 OPL Winner NimbRo: Anthropomorphic Service Robots using Foundation Models for Perception and Planning | Raphael Memmesheimer et.al. | 2412.14989 | null |
2024-12-19 | Chain-of-MetaWriting: Linguistic and Textual Analysis of How Small Language Models Write Young Students Texts | Ioana Buhnila et.al. | 2412.14986 | null |
2024-12-19 | AI and Cultural Context: An Empirical Investigation of Large Language Models' Performance on Chinese Social Work Professional Standards | Zia Qi et.al. | 2412.14971 | null |
2024-12-19 | Movie2Story: A framework for understanding videos and telling stories in the form of novel text | Kangning Li et.al. | 2412.14965 | null |
2024-12-19 | Knowledge Injection via Prompt Distillation | Kalle Kujanpää et.al. | 2412.14964 | null |
2024-12-19 | Effective Method with Compression for Distributed and Federated Cocoercive Variational Inequalities | Daniil Medyakov et.al. | 2412.14935 | null |
2024-12-19 | RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response | Junyu Luo et.al. | 2412.14922 | link |
2024-12-19 | Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation | Zexiong Ma et.al. | 2412.14905 | null |
2024-12-19 | Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering | Peize Li et.al. | 2412.14880 | null |
2024-12-19 | Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering | Imed Keraghel et.al. | 2412.14867 | null |
2024-12-19 | Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling | Junyi Li et.al. | 2412.14860 | null |
2024-12-19 | DS | Hongling Xu et.al. | 2412.14849 | link |
2024-12-19 | Mapping and Influencing the Political Ideology of Large Language Models using Synthetic Personas | Pietro Bernardelle et.al. | 2412.14843 | null |
2024-12-19 | Helping LLMs Improve Code Generation Using Feedback from Testing and Static Analysis | Greta Dolcetti et.al. | 2412.14841 | null |
2024-12-19 | Progressive Multimodal Reasoning via Active Retrieval | Guanting Dong et.al. | 2412.14835 | null |
2024-12-19 | Answer Set Networks: Casting Answer Set Programming into Deep Learning | Arseny Skryagin et.al. | 2412.14814 | link |
2024-12-19 | ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis | Zeao Tu et.al. | 2412.14809 | link |
2024-12-19 | Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning | Ziang Ye et.al. | 2412.14780 | null |
2024-12-19 | ALKAFI-LLAMA3: Fine-Tuning LLMs for Precise Legal Understanding in Palestine | Rabee Qasem et.al. | 2412.14771 | null |
2024-12-19 | PsyDraw: A Multi-Agent Multimodal System for Mental Health Screening in Left-Behind Children | Yiqun Zhang et.al. | 2412.14769 | link |
2024-12-19 | CodeRepoQA: A Large-scale Benchmark for Software Engineering Question Answering | Ruida Hu et.al. | 2412.14764 | link |
2024-12-19 | Query pipeline optimization for cancer patient question answering systems | Maolin He et.al. | 2412.14751 | null |
2024-12-19 | Active Inference and Human-Computer Interaction | Roderick Murray-Smith et.al. | 2412.14741 | null |
2024-12-19 | On Verbalized Confidence Scores for LLMs | Daniel Yang et.al. | 2412.14737 | link |
2024-12-19 | Creation of AI-driven Smart Spaces for Enhanced Indoor Environments -- A Survey | Aygün Varol et.al. | 2412.14708 | null |
2024-12-19 | LLMs as mediators: Can they diagnose conflicts accurately? | Özgecan Koçak et.al. | 2412.14675 | null |
2024-12-19 | Analysis and Visualization of Linguistic Structures in Large Language Models: Neural Representations of Verb-Particle Constructions in BERT | Hassane Kissane et.al. | 2412.14670 | null |
2024-12-19 | IOHunter: Graph Foundation Model to Uncover Online Information Operations | Marco Minici et.al. | 2412.14663 | link |
2024-12-19 | Unveiling Uncertainty: A Deep Dive into Calibration and Performance of Multimodal Large Language Models | Zijun Chen et.al. | 2412.14660 | link |
2024-12-19 | Length Controlled Generation for Black-box LLMs | Yuxuan Gu et.al. | 2412.14656 | null |
2024-12-19 | Learning to Generate Research Idea with Dynamic Control | Ruochen Li et.al. | 2412.14626 | null |
2024-12-19 | How good is GPT at writing political speeches for the White House? | Jacques Savoy et.al. | 2412.14617 | null |
2024-12-19 | Beyond Guilt: Legal Judgment Prediction with Trichotomous Reasoning | Kepu Zhang et.al. | 2412.14588 | null |
2024-12-19 | HiCM | Minkuk Kim et.al. | 2412.14585 | null |
2024-12-19 | Simulation-Free Hierarchical Latent Policy Planning for Proactive Dialogues | Tao He et.al. | 2412.14584 | null |
2024-12-19 | CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation | Youngwon Lee et.al. | 2412.14581 | null |
2024-12-19 | DiffSim: Taming Diffusion Models for Evaluating Visual Similarity | Yiren Song et.al. | 2412.14580 | link |
2024-12-19 | Sliding Windows Are Not the End: Exploring Full Ranking with Long-Context Large Language Models | Wenhan Liu et.al. | 2412.14574 | link |
2024-12-19 | ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model | Shunlin Lu et.al. | 2412.14559 | null |
2024-12-19 | The Current Challenges of Software Engineering in the Era of Large Language Models | Cuiyun Gao et.al. | 2412.14554 | null |
2024-12-19 | Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models | Xiao Cui et.al. | 2412.14528 | link |
2024-12-19 | Cal-DPO: Calibrated Direct Preference Optimization for Language Model Alignment | Teng Xiao et.al. | 2412.14516 | link |
2024-12-19 | Relational Programming with Foundation Models | Ziyang Li et.al. | 2412.14515 | null |
2024-12-19 | PA-RAG: RAG Alignment via Multi-Perspective Preference Optimization | Jiayi Wu et.al. | 2412.14510 | link |
2024-12-19 | Do Large Language Models Defend Inferentialist Semantics?: On the Logical Expressivism and Anti-Representationalism of LLMs | Yuzuki Arai et.al. | 2412.14501 | null |
2024-12-19 | Guided Diffusion Model for Sensor Data Obfuscation | Xin Yang et.al. | 2412.14499 | null |
2024-12-19 | FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis | Abdullah Khan et.al. | 2412.14492 | link |
2024-12-19 | Moving Beyond LDA: A Comparison of Unsupervised Topic Modelling Techniques for Qualitative Data Analysis of Online Communities | Amandeep Kaur et.al. | 2412.14486 | null |
2024-12-19 | DirectorLLM for Human-Centric Video Generation | Kunpeng Song et.al. | 2412.14484 | null |
2024-12-19 | Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs | Koshiro Saito et.al. | 2412.14471 | null |
2024-12-19 | Agent-SafetyBench: Evaluating the Safety of LLM Agents | Zhexin Zhang et.al. | 2412.14470 | link |
2024-12-19 | From Human Annotation to LLMs: SILICON Annotation Workflow for Management Research | Xiang Cheng et.al. | 2412.14461 | null |
2024-12-19 | LEDiff: Latent Exposure Diffusion for HDR Generation | Chao Wang et.al. | 2412.14456 | null |
2024-12-19 | Are Longer Prompts Always Better? Prompt Selection in Large Language Models for Recommendation Systems | Genki Kusano et.al. | 2412.14454 | null |
2024-12-19 | Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation | Shengqi Liu et.al. | 2412.14453 | null |
2024-12-19 | ORBIT: Cost-Effective Dataset Curation for Large Language Model Domain Adaptation with an Astronomy Case Study | Eric Modesitt et.al. | 2412.14436 | link |
2024-12-19 | All-in-One Tuning and Structural Pruning for Domain-Specific LLMs | Lei Lu et.al. | 2412.14426 | null |
2024-12-19 | FedPIA -- Permuting and Integrating Adapters leveraging Wasserstein Barycenters for Finetuning Foundation Models in Multi-Modal Federated Learning | Pramit Saha et.al. | 2412.14424 | null |
2024-12-19 | Enhancing Diffusion Models for High-Quality Image Generation | Jaineet Shah et.al. | 2412.14422 | null |
2024-12-18 | ChainRank-DPO: Chain Rank Direct Preference Optimization for LLM Rankers | Haowei Liu et.al. | 2412.14405 | null |
2024-12-18 | Clinical Trials Ontology Engineering with Large Language Models | Berkan Çakır et.al. | 2412.14387 | null |
2024-12-18 | ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | William Han et.al. | 2412.14373 | link |
2024-12-18 | Memorization Over Reasoning? Exposing and Mitigating Verbatim Memorization in Large Language Models' Character Understanding Evaluation | Yuxuan Jiang et.al. | 2412.14368 | null |
2024-12-18 | Surrealistic-like Image Generation with Vision-Language Models | Elif Ayten et.al. | 2412.14366 | link |
2024-12-18 | ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals | Utkarsh Saxena et.al. | 2412.14363 | link |
2024-12-18 | A Unifying Information-theoretic Perspective on Evaluating Generative Models | Alexis Fox et.al. | 2412.14340 | null |
2024-12-18 | Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation | Benjamin Steenhoek et.al. | 2412.14308 | null |
2024-12-18 | Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs | David Restrepo et.al. | 2412.14304 | null |
2024-12-18 | Fake News Detection: Comparative Evaluation of BERT-like Models and Large Language Models with Generative AI-Annotated Data | Shaina Raza et.al. | 2412.14276 | link |
2024-12-18 | Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces | Jihan Yang et.al. | 2412.14171 | link |
2024-12-18 | MetaMorph: Multimodal Understanding and Generation via Instruction Tuning | Shengbang Tong et.al. | 2412.14164 | null |
2024-12-18 | TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks | Frank F. Xu et.al. | 2412.14161 | link |
2024-12-18 | Advanced Reasoning and Transformation Engine for Multi-Step Insight Synthesis in Data Analytics with Large Language Models | Atin Sakkeer Hussain et.al. | 2412.14146 | null |
2024-12-18 | LLMs can realize combinatorial creativity: generating creative ideas via LLMs for scientific research | Tianyang Gu et.al. | 2412.14141 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2025-01-31 | Inference-Time Text-to-Video Alignment with Diffusion Latent Beam Search | Yuta Oshima et.al. | 2501.19252 | null |
2025-01-31 | | Saul Santos et.al. | 2501.19098 | link |
2025-01-30 | Every Image Listens, Every Image Dances: Music-Driven Image Animation | Zhikang Dong et.al. | 2501.18801 | null |
2025-01-30 | MAMS: Model-Agnostic Module Selection Framework for Video Captioning | Sangho Lee et.al. | 2501.18269 | null |
2025-01-28 | Exploring the Role of Explicit Temporal Modeling in Multimodal Large Language Models for Video Understanding | Yun Li et.al. | 2501.16786 | null |
2025-01-28 | CascadeV: An Implementation of Wurstchen Architecture for Video Generation | Wenfeng Lin et.al. | 2501.16612 | link |
2025-01-27 | AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models | Zheng Lian et.al. | 2501.16566 | null |
2025-01-27 | Understanding Long Videos via LLM-Powered Entity Relation Graphs | Meng Chu et.al. | 2501.15953 | null |
2025-01-26 | TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding | Xingjian Zhang et.al. | 2501.15513 | link |
2025-01-26 | "See What I Imagine, Imagine What I See": Human-AI Co-Creation System for 360 | Yunge Wen et.al. | 2501.15456 | null |
2025-01-25 | HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding | Jiaxing Zhao et.al. | 2501.15111 | null |
2025-01-25 | VideoPure: Diffusion-based Adversarial Purification for Video Recognition | Kaixun Jiang et.al. | 2501.14999 | link |
2025-01-11 | HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators | Le Chen et.al. | 2501.14794 | null |
2025-01-24 | VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking | Runyi Hu et.al. | 2501.14195 | link |
2025-01-24 | ENTER: Event Based Interpretable Reasoning for VideoQA | Hammad Ayyubi et.al. | 2501.14194 | null |
2025-01-30 | Temporal Preference Optimization for Long-Form Video Understanding | Rui Li et.al. | 2501.13919 | null |
2025-01-23 | Improving Video Generation with Human Feedback | Jie Liu et.al. | 2501.13918 | null |
2025-01-23 | ReasVQA: Advancing VideoQA with Imperfect Reasoning Process | Jianxin Liang et.al. | 2501.13536 | null |
2025-01-23 | Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge | Haomiao Xiong et.al. | 2501.13468 | link |
2025-01-23 | EchoVideo: Identity-Preserving Human Video Generation by Multimodal Feature Fusion | Jiangchuan Wei et.al. | 2501.13452 | null |
2025-01-28 | VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding | Boqiang Zhang et.al. | 2501.13106 | link |
2025-01-21 | Taming Teacher Forcing for Masked Autoregressive Video Generation | Deyu Zhou et.al. | 2501.12389 | null |
2025-01-22 | InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling | Yi Wang et.al. | 2501.12386 | link |
2025-01-21 | MMVU: Measuring Expert-Level Multi-Discipline Video Understanding | Yilun Zhao et.al. | 2501.12380 | link |
2025-01-22 | Video Depth Anything: Consistent Depth Estimation for Super-Long Videos | Sili Chen et.al. | 2501.12375 | null |
2025-01-21 | InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model | Yuhang Zang et.al. | 2501.12368 | link |
2025-01-20 | GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Zhenliang Ni et.al. | 2501.11340 | null |
2025-01-20 | CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal Concatenation | Zheng Chong et.al. | 2501.11325 | null |
2025-01-23 | HFGCN: Hypergraph Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition | Pengcheng Dong et.al. | 2501.11007 | null |
2025-01-18 | EMO2: End-Effector Guided Audio-Driven Avatar Video Generation | Linrui Tian et.al. | 2501.10687 | null |
2025-01-17 | DiffuEraser: A Diffusion Model for Video Inpainting | Xiaowen Li et.al. | 2501.10018 | link |
2025-01-17 | RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation | Yuefan Cao et.al. | 2501.09982 | null |
2025-01-16 | VideoWorld: Exploring Knowledge Learning from Unlabeled Videos | Zhongwei Ren et.al. | 2501.09781 | null |
2025-01-16 | Learnings from Scaling Visual Tokenizers for Reconstruction and Generation | Philippe Hansen-Estruch et.al. | 2501.09755 | null |
2025-01-14 | Do generative video models learn physical principles from watching videos? | Saman Motamed et.al. | 2501.09038 | link |
2025-01-15 | Ouroboros-Diffusion: Exploring Consistent Content Generation in Tuning-free Long Video Diffusion | Jingyuan Chen et.al. | 2501.09019 | null |
2025-01-15 | RepVideo: Rethinking Cross-Layer Representation for Video Generation | Chenyang Si et.al. | 2501.08994 | null |
2025-01-15 | Admitting Ignorance Helps the Video Question Answering Models to Answer | Haopeng Li et.al. | 2501.08771 | null |
2025-01-31 | Comprehensive Subjective and Objective Evaluation Method for Text-generated Video | Zelu Qi et.al. | 2501.08545 | null |
2025-01-14 | Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models | Weichen Fan et.al. | 2501.08453 | null |
2025-01-14 | 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering | Meenakshi Krishnan et.al. | 2501.08370 | null |
2025-01-14 | Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks | Miran Heo et.al. | 2501.08326 | null |
2025-01-14 | GameFactory: Creating New Games with Generative Interactive Videos | Jiwen Yu et.al. | 2501.08325 | null |
2025-01-14 | Diffusion Adversarial Post-Training for One-Step Video Generation | Shanchuan Lin et.al. | 2501.08316 | null |
2025-01-17 | LayerAnimate: Layer-specific Control for Animation | Yuxue Yang et.al. | 2501.08295 | null |
2025-01-14 | FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors | Yabo Zhang et.al. | 2501.08225 | link |
2025-01-14 | Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness | Jiaxing Zhao et.al. | 2501.07978 | null |
2025-01-24 | Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding | Liping Yuan et.al. | 2501.07888 | link |
2025-01-14 | AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation | Sitong Gong et.al. | 2501.07810 | link |
2025-01-13 | BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations | Weixi Feng et.al. | 2501.07647 | null |
2025-01-13 | Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss | Xinyu Zhang et.al. | 2501.07563 | null |
2025-01-17 | MECD+: Unlocking Event-Level Causal Graph Discovery for Video Reasoning | Tieyuan Chen et.al. | 2501.07227 | null |
2025-01-13 | TimeLogic: A Temporal Logic Benchmark for Video QA | Sirnam Swetha et.al. | 2501.07214 | null |
2025-01-13 | Video Quality Assessment for Online Processing: From Spatial to Temporal Sampling | Jiebin Yan et.al. | 2501.07087 | null |
2025-01-12 | X-LeBench: A Benchmark for Extremely Long Egocentric Video Understanding | Wenqi Zhou et.al. | 2501.06835 | null |
2025-01-12 | VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning | Ji Soo Lee et.al. | 2501.06761 | link |
2025-01-11 | Qffusion: Controllable Portrait Video Editing via Quadrant-Grid Attention Learning | Maomao Li et.al. | 2501.06438 | null |
2025-01-10 | MEt3R: Measuring Multi-View Consistency in Generated Images | Mohammad Asim et.al. | 2501.06336 | null |
2025-01-10 | Multi-subject Open-set Personalization in Video Generation | Tsai-Shien Chen et.al. | 2501.06187 | null |
2025-01-10 | VideoAuteur: Towards Long Narrative Video Generation | Junfei Xiao et.al. | 2501.06173 | null |
2025-01-13 | Valley2: Exploring Multimodal Models with Scalable Vision-Language Design | Ziheng Wu et.al. | 2501.05901 | link |
2025-01-10 | Zero-shot Shark Tracking and Biometrics from Aerial Imagery | Chinmay K Lalgudi et.al. | 2501.05717 | null |
2025-01-10 | From My View to Yours: Ego-Augmented Learning in Large Vision Language Models for Understanding Exocentric Daily Living Activities | Dominick Reilly et.al. | 2501.05711 | link |
2025-01-09 | OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? | Yifei Li et.al. | 2501.05510 | link |
2025-01-08 | Tuning-Free Long Video Generation via Global-Local Collaborative Diffusion | Yongjia Ma et.al. | 2501.05484 | null |
2025-01-09 | Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces | Aniruddha Mahapatra et.al. | 2501.05442 | null |
2025-01-09 | Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning | Huabin Liu et.al. | 2501.05069 | null |
2025-01-09 | LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding | Jiaxing Zhao et.al. | 2501.05067 | null |
2025-01-09 | LongViTU: Instruction Tuning for Long-Form Video Understanding | Rujie Wu et.al. | 2501.05037 | null |
2025-01-09 | ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Ronghao Dang et.al. | 2501.05031 | link |
2025-01-08 | ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning | Yuzhou Huang et.al. | 2501.04698 | null |
2025-01-08 | Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs | Zeyi Huang et.al. | 2501.04336 | null |
2025-01-08 | H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving | Siran Chen et.al. | 2501.04302 | null |
2025-01-08 | LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition | Bowen Hao et.al. | 2501.04204 | null |
2024-12-18 | FlexCache: Flexible Approximate Cache System for Video Diffusion | Desen Sun et.al. | 2501.04012 | null |
2025-01-07 | Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers | Yuechen Zhang et.al. | 2501.03931 | link |
2025-01-09 | Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control | Zekai Gu et.al. | 2501.03847 | link |
2025-01-07 | Motion-Aware Generative Frame Interpolation | Guozhen Zhang et.al. | 2501.03699 | null |
2025-01-06 | License Plate Images Generation with Diffusion Models | Mariia Shpir et.al. | 2501.03374 | null |
2025-01-03 | Classifier-Guided Captioning Across Modalities | Ariel Shaulov et.al. | 2501.03183 | null |
2025-01-06 | Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation | Guy Yariv et.al. | 2501.03059 | null |
2025-01-20 | TransPixeler: Advancing Text-to-Video Generation with Transparency | Luozhou Wang et.al. | 2501.03006 | link |
2025-01-06 | MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models | Wenyi Hong et.al. | 2501.02955 | null |
2025-01-06 | Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising | Yunlong Yuan et.al. | 2501.02741 | null |
2025-01-05 | GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Weikang Bian et.al. | 2501.02690 | null |
2025-01-29 | Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey | Zongxia Li et.al. | 2501.02189 | link |
2025-01-10 | Gender Bias in Text-to-Video Generation Models: A case study of Sora | Mohammad Nadeem et.al. | 2501.01987 | null |
2024-12-30 | FrameFusion: Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models | Tianyu Fu et.al. | 2501.01986 | link |
2025-01-03 | JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Qili Wang et.al. | 2501.01798 | link |
2025-01-03 | HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding | Heqing Zou et.al. | 2501.01645 | null |
2025-01-07 | VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control | Yuanpeng Tu et.al. | 2501.01427 | null |
2025-01-02 | Unifying Specialized Visual Encoders for Video Language Models | Jihoon Chung et.al. | 2501.01426 | link |
2025-01-03 | Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions | Xincheng Shuai et.al. | 2501.01425 | null |
2025-01-02 | Multi-Modal Video Feature Extraction for Popularity Prediction | Haixu Liu et.al. | 2501.01422 | null |
2025-01-02 | On Unifying Video Generation and Camera Pose Estimation | Chun-Hao Paul Huang et.al. | 2501.01409 | null |
2025-01-29 | Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Cheonsu Jeong et.al. | 2501.00750 | null |
2025-01-03 | DreamDrive: Generative 4D Scene Modeling from Street View Images | Jiageng Mao et.al. | 2501.00601 | null |
2025-01-08 | VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM | Yuqian Yuan et.al. | 2501.00599 | link |
2024-12-31 | Online Video Understanding: A Comprehensive Benchmark and Memory-Augmented Method | Zhenpeng Huang et.al. | 2501.00584 | null |
2024-12-31 | Fine-grained Video-Text Retrieval: A New Benchmark and Method | Yifan Xu et.al. | 2501.00513 | null |
2024-12-31 | OV-HHIR: Open Vocabulary Human Interaction Recognition Using Cross-modal Integration of Large Language Models | Lala Shakti Swarup Ray et.al. | 2501.00432 | null |
2025-01-09 | Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding | Yue Fan et.al. | 2501.00358 | null |
2024-12-30 | Detection-Fusion for Knowledge Graph Extraction from Videos | Taniya Das et.al. | 2501.00136 | link |
2024-12-30 | LTX-Video: Realtime Video Latent Diffusion | Yoav HaCohen et.al. | 2501.00103 | link |
2024-12-30 | Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model | Yifei Huang et.al. | 2412.21080 | link |
2024-12-30 | VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation | Jiazheng Xu et.al. | 2412.21059 | link |
2024-12-30 | Hierarchical Banzhaf Interaction for General Video-Language Representation Learning | Peng Jin et.al. | 2412.20964 | link |
2024-12-30 | ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation | Ting Zhang et.al. | 2412.20901 | null |
2024-12-30 | Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling | Min Zhang et.al. | 2412.20725 | null |
2025-01-05 | ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding | Xiao Wang et.al. | 2412.20504 | link |
2024-12-29 | Open-Sora: Democratizing Efficient Video Production for All | Zangwei Zheng et.al. | 2412.20404 | link |
2024-12-28 | DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Xijun Wang et.al. | 2412.20042 | null |
2025-01-17 | MVTamperBench: Evaluating Robustness of Vision-Language Models | Amit Agarwal et.al. | 2412.19794 | null |
2024-12-27 | Generative Video Propagation | Shaoteng Liu et.al. | 2412.19761 | null |
2024-12-30 | VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models | Tao Wu et.al. | 2412.19645 | null |
2024-12-30 | DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT | Xiaotao Hu et.al. | 2412.19505 | link |
2024-12-26 | Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries | Roberto Amoroso et.al. | 2412.19304 | null |
2024-12-25 | Accelerating Diffusion Transformers with Dual Feature Caching | Chang Zou et.al. | 2412.18911 | link |
2024-12-24 | Video Is Worth a Thousand Images: Exploring the Latest Trends in Long Video Generation | Faraz Waseem et.al. | 2412.18688 | null |
2024-12-24 | Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models | Jinhui Yi et.al. | 2412.18609 | link |
2024-12-24 | DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers | Yuntao Chen et.al. | 2412.18607 | null |
2024-12-24 | ZeroHSI: Zero-Shot 4D Human-Scene Interaction by Video Generation | Hongjie Li et.al. | 2412.18600 | null |
2024-12-24 | DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation | Minghong Cai et.al. | 2412.18597 | link |
2024-12-23 | Large Motion Video Autoencoding with Cross-modal Video VAE | Yazhou Xing et.al. | 2412.17805 | null |
2024-12-23 | VidTwin: Video VAE with Decoupled Structure and Dynamics | Yuchi Wang et.al. | 2412.17726 | link |
2024-12-23 | HumanVBench: Exploring Human-Centric Video Understanding Capabilities of MLLMs with Synthetic Benchmark Data | Ting Zhou et.al. | 2412.17574 | link |
2024-12-23 | VidCtx: Context-aware Video Question Answering with Image Models | Andreas Goulas et.al. | 2412.17415 | null |
2024-12-23 | FFA Sora, video generation as fundus fluorescein angiography simulator | Xinyuan Wu et.al. | 2412.17346 | null |
2024-12-23 | Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory | Xingyao Li et.al. | 2412.17254 | null |
2024-12-22 | SubstationAI: Multimodal Large Model-Based Approaches for Analyzing Substation Equipment Faults | Jinzhi Wang et.al. | 2412.17077 | null |
2025-01-08 | Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation | Luoxu Jin et.al. | 2412.17042 | null |
2024-12-22 | FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos | Zhengqian Wu et.al. | 2412.17022 | link |
2024-12-22 | Video Domain Incremental Learning for Human Action Recognition in Home Environments | Yuanda Hu et.al. | 2412.16946 | null |
2024-12-21 | GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space | Souhaib Attaiki et.al. | 2412.16717 | null |
2024-12-21 | TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models | Haocheng Huang et.al. | 2412.16700 | null |
2024-12-21 | VAST 1.0: A Unified Framework for Controllable and Consistent Video Generation | Chi Zhang et.al. | 2412.16677 | null |
2024-12-25 | Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance | Beiyuan Zhang et.al. | 2412.16495 | null |
2024-12-18 | ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping | Youxin Pang et.al. | 2412.16212 | null |
2024-12-17 | Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation | Yiping Wang et.al. | 2412.16211 | null |
2024-12-20 | PruneVid: Visual Token Pruning for Efficient Video Large Language Models | Xiaohu Huang et.al. | 2412.16117 | link |
2024-12-20 | DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization | Zihan Ding et.al. | 2412.15689 | null |
2024-12-23 | CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training | Xiuli Bi et.al. | 2412.15646 | link |
2024-12-20 | PolySmart @ TRECVid 2024 Medical Video Question Answering | Jiaxin Wu et.al. | 2412.15514 | null |
2024-12-19 | AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation | Moayed Haji-Ali et.al. | 2412.15191 | null |
2024-12-19 | Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM | Yatai Ji et.al. | 2412.15156 | link |
2024-12-19 | Parallelized Autoregressive Visual Generation | Yuqing Wang et.al. | 2412.15119 | null |
2024-12-19 | Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Yucheng Hu et.al. | 2412.14803 | null |
2024-12-19 | HiCM | Minkuk Kim et.al. | 2412.14585 | null |
2024-12-19 | Consistent Human Image and Video Generation with Spatially Conditioned Diffusion | Mingdeng Cao et.al. | 2412.14531 | link |
2024-12-19 | DirectorLLM for Human-Centric Video Generation | Kunpeng Song et.al. | 2412.14484 | null |
2024-12-18 | Learning from Massive Human Videos for Universal Humanoid Pose Control | Jiageng Mao et.al. | 2412.14172 | null |
2024-12-18 | Autoregressive Video Generation without Vector Quantization | Haoge Deng et.al. | 2412.14169 | link |
2024-12-18 | VideoDPO: Omni-Preference Alignment for Video Diffusion Generation | Runtao Liu et.al. | 2412.14167 | null |
2024-12-29 | AKiRa: Augmentation Kit on Rays for optical video generation | Xi Wang et.al. | 2412.14158 | null |
2024-12-18 | SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation | Tong Chen et.al. | 2412.14018 | null |
2024-12-18 | InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models | Cong Wei et.al. | 2412.14006 | link |
2024-12-18 | Do Language Models Understand Time? | Xi Ding et.al. | 2412.13845 | link |
2024-12-19 | G-VEval: A Versatile Metric for Evaluating Image and Video Captions Using GPT-4o | Tony Cheng Tong et.al. | 2412.13647 | link |
2024-12-18 | Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning | Yunbin Tu et.al. | 2412.13543 | null |
2024-12-18 | Real-time One-Step Diffusion-based Expressive Portrait Videos Generation | Hanzhong Guo et.al. | 2412.13479 | link |
2024-12-18 | SAVGBench: Benchmarking Spatially Aligned Audio-Video Generation | Kazuki Shimada et.al. | 2412.13462 | null |
2024-12-17 | CompactFlowNet: Efficient Real-time Optical Flow Estimation on Mobile Devices | Andrei Znobishchev et.al. | 2412.13273 | null |
2025-01-07 | MotionBridge: Dynamic Video Inbetweening with Flexible Controls | Maham Tanveer et.al. | 2412.13190 | null |
2024-12-17 | VidTok: A Versatile and Open-Source Video Tokenizer | Anni Tang et.al. | 2412.13061 | link |
2024-12-17 | FocusChat: Text-guided Long Video Understanding via Spatiotemporal Information Filtering | Zheng Cheng et.al. | 2412.12833 | null |
2024-12-17 | Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning | Shiping Ge et.al. | 2412.12791 | link |
2024-12-17 | ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries | Wangyu Xue et.al. | 2412.12675 | null |
2024-12-16 | Can video generation replace cinematographers? Research on the cinematic language of generated video | Xiaozhe Li et.al. | 2412.12223 | null |
2024-12-16 | CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding | Guo Chen et.al. | 2412.12075 | null |
2024-12-16 | InterDyn: Controllable Interactive Dynamics with Video Diffusion Models | Rick Akkerman et.al. | 2412.11785 | null |
2024-12-16 | Generative Inbetweening through Frame-wise Conditions-Driven Video Generation | Tianyi Zhu et.al. | 2412.11755 | link |
2024-12-16 | VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting | Muhammet Furkan Ilaslan et.al. | 2412.11621 | link |
2024-12-16 | Exploring Temporal Event Cues for Dense Video Captioning in Cyclic Co-learning | Zhuyang Xie et.al. | 2412.11467 | null |
2024-12-15 | Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video Recognition | Yulin Wang et.al. | 2412.11228 | link |
2024-12-15 | GenLit: Reformulating Single-Image Relighting as Video Generation | Shrisha Bharadwaj et.al. | 2412.11224 | null |
2024-12-15 | DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes | Jinxiu Liu et.al. | 2412.11100 | null |
2024-12-15 | Overview of TREC 2024 Medical Video Question Answering (MedVidQA) Track | Deepak Gupta et.al. | 2412.11056 | null |
2024-12-20 | Video Diffusion Transformers are In-Context Learners | Zhengcong Fei et.al. | 2412.10783 | link |
2024-12-14 | Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives | Ji-jun Park et.al. | 2412.10720 | null |
2024-12-13 | SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device | Yushu Wu et.al. | 2412.10494 | null |
2024-12-12 | VCA: Video Curious Agent for Long Video Understanding | Zeyuan Yang et.al. | 2412.10471 | null |
2024-12-17 | SweetTokenizer: Semantic-Aware Spatial-Temporal Tokenizer for Compact Visual Discretization | Zhentao Tan et.al. | 2412.10443 | null |
2024-12-11 | COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework | Xin Dong et.al. | 2412.10435 | null |
2024-12-13 | Apollo: An Exploration of Video Understanding in Large Multimodal Models | Orr Zohar et.al. | 2412.10360 | null |
2024-12-16 | TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation | Xingrui Wang et.al. | 2412.10275 | null |
2024-12-19 | AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era | Yudong Jiang et.al. | 2412.10255 | link |
2024-12-13 | B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens | Zhuqiang Lu et.al. | 2412.09919 | link |
2024-12-16 | IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs | Sosuke Yamao et.al. | 2412.09907 | null |
2024-12-13 | LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity | Hongjie Wang et.al. | 2412.09856 | null |
2024-12-13 | MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion | Xunnong Xu et.al. | 2412.09828 | null |
2024-12-17 | ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation | Ali Athar et.al. | 2412.09754 | null |
2024-12-11 | Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model | Junqi You et.al. | 2412.09647 | null |
2024-12-16 | Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models | Fan Zhang et.al. | 2412.09645 | link |
2024-12-12 | Doe-1: Closed-Loop Autonomous Driving with Large World Model | Wenzhao Zheng et.al. | 2412.09627 | link |
2024-12-12 | OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation | Weiqi Li et.al. | 2412.09623 | null |
2024-12-12 | PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models | Chenyu Yang et.al. | 2412.09613 | null |
2024-12-12 | Owl-1: Omni World Model for Consistent Long Video Generation | Yuanhui Huang et.al. | 2412.09600 | link |
2024-12-12 | LiftImage3D: Lifting Any Single Image to 3D Gaussians with Video Generation Priors | Yabo Chen et.al. | 2412.09597 | null |
2024-12-12 | Neptune: The Long Orbit to Benchmarking Long Video Understanding | Arsha Nagrani et.al. | 2412.09582 | link |
2024-12-12 | Video Creation by Demonstration | Yihong Sun et.al. | 2412.09551 | null |
2024-12-12 | Agent-based Video Trimming | Lingfeng Yang et.al. | 2412.09513 | null |
2024-12-12 | UFO: Enhancing Diffusion-Based Video Generation with a Uniform Frame Organizer | Delong Liu et.al. | 2412.09389 | link |
2024-12-12 | T-SVG: Text-Driven Stereoscopic Video Generation | Qiao Jin et.al. | 2412.09323 | null |
2024-12-12 | InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption | Tiehan Fan et.al. | 2412.09283 | null |
2024-12-12 | Foundation Models and Adaptive Feature Selection: A Synergistic Approach to Video Question Answering | Sai Bhargav Rongali et.al. | 2412.09230 | null |
2024-12-12 | LVMark: Robust Watermark for latent video diffusion models | MinHyuk Jang et.al. | 2412.09122 | null |
2024-12-12 | Enhancing Facial Consistency in Conditional Video Generation via Facial Landmark Transformation | Lianrui Mu et.al. | 2412.08976 | null |
2024-12-12 | Mojito: Motion Trajectory and Intensity Control for Video Generation | Xuehai He et.al. | 2412.08948 | null |
2024-12-11 | Generative Semantic Communication: Architectures, Technologies, and Applications | Jinke Ren et.al. | 2412.08642 | null |
2024-12-13 | Physical Informed Driving World Model | Zhuoran Yang et.al. | 2412.08410 | null |
2024-12-11 | FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks | Chongkai Gao et.al. | 2412.08261 | null |
2024-12-11 | VSD2M: A Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation | Zhiqiang Yuan et.al. | 2412.08259 | null |
2024-12-10 | 3DSRBench: A Comprehensive 3D Spatial Reasoning Benchmark | Wufei Ma et.al. | 2412.07825 | null |
2024-12-11 | UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics | Xi Chen et.al. | 2412.07774 | null |
2024-12-10 | From Slow Bidirectional to Fast Causal Video Generators | Tianwei Yin et.al. | 2412.07772 | null |
2024-12-10 | SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints | Jianhong Bai et.al. | 2412.07760 | link |
2024-12-10 | 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Xiao Fu et.al. | 2412.07759 | null |
2024-12-10 | Multi-Shot Character Consistency for Text-to-Video Generation | Yuval Atzmon et.al. | 2412.07750 | null |
2024-12-10 | StyleMaster: Stylize Your Video with Artistic Generation and Translation | Zixuan Ye et.al. | 2412.07744 | null |
2024-12-10 | STIV: Scalable Text and Image Conditioned Video Generation | Zongyu Lin et.al. | 2412.07730 | null |
2024-12-10 | ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer | Jinyi Hu et.al. | 2412.07720 | link |
2024-12-10 | GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning | Yicheng Wang et.al. | 2412.07704 | null |
2024-12-10 | Multimodal Contextualized Support for Enhancing Video Retrieval System | Quoc-Bao Nguyen-Le et.al. | 2412.07584 | null |
2024-12-19 | Multi-Scale Contrastive Learning for Video Temporal Grounding | Thong Thanh Nguyen et.al. | 2412.07157 | null |
2024-12-09 | SafeWatch: An Efficient Safety-Policy Following Video Guardrail Model with Transparent Explanations | Zhaorun Chen et.al. | 2412.06878 | null |
2024-12-09 | VidMusician: Video-to-Music Generation with Semantic-Rhythmic Alignment via Hierarchical Visual Features | Sifei Li et.al. | 2412.06296 | null |
2024-12-11 | Towards Long Video Understanding via Fine-detailed Video Story Generation | Zeng You et.al. | 2412.06182 | null |
2024-12-08 | Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training | Zhenghong Zhou et.al. | 2412.06029 | null |
2024-12-08 | FlexDiT: Dynamic Token Density Control for Diffusion Transformer | Shuning Chang et.al. | 2412.06028 | null |
2024-12-10 | Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation | Hyeonho Jeong et.al. | 2412.06016 | null |
2024-12-08 | Accelerating Video Diffusion Models via Distribution Matching | Yuanzhi Zhu et.al. | 2412.05899 | null |
2024-12-08 | MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation | Shuwei Shi et.al. | 2412.05848 | null |
2024-12-08 | Semi-Supervised Contrastive Learning for Controllable Video-to-Music Retrieval | Shanti Stewart et.al. | 2412.05831 | null |
2024-12-08 | Self-Guidance: Boosting Flow and Diffusion Generation on Their Own | Tiancheng Li et.al. | 2412.05827 | null |
2024-12-07 | Combining Genre Classification and Harmonic-Percussive Features with Diffusion Models for Music-Video Generation | Leonardo Pina et.al. | 2412.05694 | null |
2024-12-11 | Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model | Lening Wang et.al. | 2412.05280 | link |
2024-12-17 | Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling | Zhe Chen et.al. | 2412.05271 | link |
2024-12-06 | Mind the Time: Temporally-Controlled Multi-Event Video Generation | Ziyi Wu et.al. | 2412.05263 | null |
2024-12-11 | LinVT: Empower Your Image-level Large Language Model to Understand Videos | Lishuai Gao et.al. | 2412.05185 | link |
2024-12-06 | Beyond Boxes: Mask-Guided Spatio-Temporal Feature Aggregation for Video Object Detection | Khurram Azeem Hashmi et.al. | 2412.04915 | null |
2024-12-06 | UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving | Rui Chen et.al. | 2412.04842 | link |
2024-12-12 | Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model | Keunwoo Peter Yu et.al. | 2412.04729 | null |
2024-12-05 | Using Diffusion Priors for Video Amodal Segmentation | Kaihua Chen et.al. | 2412.04623 | null |