[공지사항]을 빙자한 안부와 근황

Show more

Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

The unknotting number, hard unknot diagrams, and reinforcement learning

Hierarchical Reinforcement Learning for Temporal Abstraction of Listwise Recommendation

Enhancing Natural Language Inference Performance with Knowledge Graph for COVID-19 Automated Fact-Checking in Indonesian Language

CVPT: Cross Visual Prompt Tuning

Proficient Graph Neural Network Design by Accumulating Knowledge on Large Language Models

Stimulating Imagination: Towards General-purpose "Something Something Placement"

Why Does New Knowledge Create Messy Ripple Effects in LLMs?

A Mathematical Framework and a Suite of Learning Techniques for Neural-Symbolic Systems

How to Leverage Predictive Uncertainty Estimates for Reducing Catastrophic Forgetting in Online Continual Learning

Towards the Next Frontier in Speech Representation Learning Using Disentanglement

Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for Large Language Models Aligned with Human Cognitive Principles

Which Experiences Are Influential for RL Agents? Efficiently Estimating The Influence of Experiences

Oversmoothing Alleviation in Graph Neural Networks: A Survey and Unified View

OCK: Unsupervised Dynamic Video Prediction with Object-Centric Kinematics

Benchmarking Mobile Device Control Agents across Diverse Configurations

Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation

Generalized Consistency Trajectory Models for Image Manipulation

Generative Models and Connected and Automated Vehicles: A Survey in Exploring the Intersection of Transportation and AI

Defending Against Unforeseen Failure Modes with Latent Adversarial Training

A Survey of the Evolution of Language Model-Based Dialogue Systems: Data, Task and Models

PFB-Diff: Progressive Feature Blending Diffusion for Text-driven Image Editing

Transformers and Ensemble methods: A solution for Hate Speech Detection in Arabic languages

Abductive forgetting

Recognizing and Eliciting Weakly Single Crossing Profiles on Trees

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Information-Theoretic Aggregation of Ethical Attributes in Simulated-Command

Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning

Grounding Methods for Neural-Symbolic AI

Modeling Deontic Modal Logic in the s(CASP) Goal-directed Predicate Answer Set Programming System

LumiCRS: Asymmetric Contrastive Prototype Learning for Long-Tail Conversational Recommender Systems

THE-Tree: Can Tracing Historical Evolution Enhance Scientific Verification and Reasoning?

A Practical Guide for Evaluating LLMs and LLM-Reliant Systems

SciSage: A Multi-Agent Framework for High-Quality Scientific Survey Generation

The Ultimate Test of Superintelligent AI Agents: Can an AI Balance Care and Control in Asymmetric Relationships?

SeePhys: Does Seeing Help Thinking? -- Benchmarking Vision-Based Physics Reasoning

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

Visually Guided Decoding: Gradient-Free Hard Prompt Inversion with Language Models

Large Language Models are Autonomous Cyber Defenders

From Mind to Machine: The Rise of Manus AI as a Fully Autonomous Digital Agent

DiCE-Extended: A Robust Approach to Counterfactual Explanations in Machine Learning

A Vision for Auto Research with LLM Agents

A Library of LLM Intrinsics for Retrieval-Augmented Generation

Palatable Conceptions of Disembodied Being

Combinatorial Optimization for All: Using LLMs to Aid Non-Experts in Improving Optimization Algorithms

Practical Principles for AI Cost and Compute Accounting

Empowering LLMs with Logical Reasoning: A Comprehensive Survey

SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions

The Elicitation Game: Evaluating Capability Elicitation Techniques

Doing More with Less: A Survey on Routing Strategies for Resource Optimisation in Large Language Model-Based Systems

Smarter Together: Combining Large Language Models and Small Models for Physiological Signals Visual Inspection

Large Language Models Powered Multiagent Ensemble for Mitigating Hallucination and Efficient Atrial Fibrillation Annotation of ECG Reports

CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

Decision support system for Forest fire management using Ontology with Big Data and LLMs

zkFL: Zero-Knowledge Proof-based Gradient Aggregation for Federated Learning

Diffusion Beats Autoregressive in Data-Constrained Settings

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

The Impact of Language Mixing on Bilingual LLM Reasoning

GUI-G$^2$: Gaussian Reward Modeling for GUI Grounding

FASTGEN: Fast and Cost-Effective Synthetic Tabular Data Generation with LLMs

Look, Focus, Act: Efficient and Robust Robot Learning via Human Gaze and Foveated Vision Transformers

Operationalizing AI for Good: Spotlight on Deployment and Integration of AI Models in Humanitarian Work

Do AI models help produce verified bug fixes?

True Multimodal In-Context Learning Needs Attention to the Visual Context

ConformalSAM: Unlocking the Potential of Foundational Segmentation Models in Semi-Supervised Semantic Segmentation with Conformal Prediction

Small LLMs Do Not Learn a Generalizable Theory of Mind via Reinforcement Learning

Romance, Relief, and Regret: Teen Narratives of Chatbot Overreliance

Learning Null Geodesics for Gravitational Lensing Rendering in General Relativity

Dynamics is what you need for time-series forecasting!

Supernova: Achieving More with Less in Transformer Architectures

Deep-Learning Investigation of Vibrational Raman Spectra for Plant-Stress Analysis

Left Leaning Models: AI Assumptions on Economic Policy

DiffuMeta: Algebraic Language Models for Inverse Design of Metamaterials via Diffusion Transformers

DialogueForge: LLM Simulation of Human-Chatbot Dialogue

Explainable Anomaly Detection for Electric Vehicles Charging Stations

BEnchmarking LLMs for Ophthalmology (BELO) for Ophthalmological Knowledge and Reasoning

Is Large Language Model Performance on Reasoning Tasks Impacted by Different Ways Questions Are Asked?

Compositional Understanding in Signaling Games

CoLD: Counterfactually-Guided Length Debiasing for Process Reward Models

LINR-PCGC: Lossless Implicit Neural Representations for Point Cloud Geometry Compression

Missing value imputation with adversarial random forests -- MissARF

SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models

Towards Explainable Anomaly Detection in Shared Mobility Systems

Leveraging Context for Multimodal Fallacy Classification in Political Debates

Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training

Uncovering Critical Features for Deepfake Detection through the Lottery Ticket Hypothesis

Why can't Epidemiology be automated (yet)?

Accelerating HEC-RAS: A Recurrent Neural Operator for Rapid River Forecasting

Multi-Stage Prompt Inference Attacks on Enterprise LLM Systems

Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario

Unequal Voices: How LLMs Construct Constrained Queer Narratives

GeMix: Conditional GAN-Based Mixup for Improved Medical Image Augmentation

On the Role of AI in Managing Satellite Constellations: Insights from the ConstellAI Project

PhysGym: Benchmarking LLMs in Interactive Physics Discovery with Controlled Priors

RARE-UNet: Resolution-Aligned Routing Entry for Adaptive Medical Image Segmentation

Off-Policy Corrected Reward Modeling for Reinforcement Learning from Human Feedback

ASPERA: A Simulated Environment to Evaluate Planning for Complex Action Execution

GR-3 Technical Report

The Constitutional Controller: Doubt-Calibrated Steering of Compliant Agents

The Emergence of Deep Reinforcement Learning for Path Planning

The New LLM Bottleneck: A Systems Perspective on Latent Attention and Mixture-of-Experts

SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

Created by

Haebom

저자

Kaiyuan Zhang, Siyuan Cheng, Hanxi Guo, Yuetian Chen, Zian Su, Shengwei An, Yuntao Du, Charles Fleming, Ashish Kundu, Xiangyu Zhang, Ninghui Li

개요

본 논문은 파인튜닝된 대규모 언어 모델(LLM)의 멤버십 추론 공격(MIA) 취약성에 대한 최초의 종합적인 연구를 수행합니다. 실험 분석 결과, 파인튜닝 과정에서의 손실 감소가 MIA의 효과성을 높이는 주요 원인임을 밝혔습니다. 이를 해결하기 위해, 저자들은 SOFT (Selective data Obfuscation in LLM Fine-Tuning) 라는 새로운 방어 기법을 제안합니다. SOFT는 조정 가능한 매개변수를 사용하여 유용성 유지와 개인 정보 보호 사이의 균형을 맞추는 영향력 있는 데이터 선택을 활용하여 개인 정보 유출을 완화합니다. 여러 LLM 아키텍처와 규모, 6개의 다양한 도메인에 걸쳐 광범위한 실험을 수행한 결과, SOFT는 경쟁력 있는 모델 성능을 유지하면서 개인 정보 위험을 효과적으로 줄여 파인튜닝된 LLM에서 민감한 정보를 보호하는 실용적이고 확장 가능한 솔루션을 제공함을 보여줍니다.

시사점, 한계점

•

시사점:

◦

파인튜닝된 LLM의 MIA 취약성에 대한 최초의 종합적인 연구 결과를 제시.

◦

손실 감소가 MIA의 효과성에 미치는 영향을 실증적으로 밝힘.

◦

파인튜닝된 LLM의 개인 정보 보호를 위한 효과적이고 확장 가능한 방어 기법인 SOFT를 제안.

◦

SOFT가 개인 정보 위험을 줄이면서 경쟁력 있는 모델 성능을 유지함을 실험적으로 입증.

•

한계점:

◦

본 연구에서 제시된 SOFT의 성능은 특정 데이터셋과 LLM 아키텍처에 국한될 수 있음.

◦

다양한 MIA 공격 유형에 대한 SOFT의 일반화 성능에 대한 추가 연구가 필요.

◦

SOFT의 조정 가능한 매개변수 설정에 대한 최적화 전략에 대한 추가 연구가 필요.

Made with Slashpage