Daily Arxiv

This page curates AI-related papers published worldwide.
All content is summarized using Google Gemini, and the site is operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Created by
  • Haebom

Authors

Jiacheng Liu, Sewon Min, Luke Zettlemoyer, Yejin Choi, Hannaneh Hajishirzi

Outline

This paper aims to show that n-gram language models remain useful even in the era of large language models (LLMs) by modernizing them with a corpus of 5 trillion tokens. In particular, the authors develop an infinite n-gram (∞-gram) model, in which n can be made arbitrarily large, and the infini-gram engine, which computes ∞-gram probabilities with millisecond-level latency using a suffix array. With this engine, they analyze human-written and machine-generated text, confirming the ∞-gram model's fairly high next-token prediction accuracy (47%) and its ability to reduce LLM perplexity when the two are combined. The analysis of machine-generated text also reveals irregularities that point to deficiencies in Transformer positional embeddings and in LLM pre-training.
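To make the mechanism concrete, the following is a minimal Python sketch of the two ideas described above: counting pattern occurrences by binary search over a suffix array, and backing off to the longest context suffix that actually appears in the reference corpus. It is an illustrative toy, not the authors' infini-gram engine; the corpus and function names are assumptions made for this example.

```python
# Minimal sketch of the ∞-gram computation on a toy corpus (not the authors' engine).
# Idea from the paper: back off to the longest suffix of the context that occurs in
# the reference data, and obtain counts by binary search over a suffix array.
# The corpus and all function names below are hypothetical, for illustration only.

def build_suffix_array(tokens):
    """Suffix start positions, sorted lexicographically by the suffix they begin."""
    return sorted(range(len(tokens)), key=lambda i: tokens[i:])

def _first_at_least(tokens, sa, pattern, strictly_greater):
    """Index of the first suffix whose length-|pattern| prefix is >= (or >) pattern."""
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        prefix = tuple(tokens[sa[mid]:sa[mid] + len(pattern)])
        if prefix < pattern or (strictly_greater and prefix == pattern):
            lo = mid + 1
        else:
            hi = mid
    return lo

def count(tokens, sa, pattern):
    """Number of occurrences of `pattern` in the corpus, via two binary searches."""
    pattern = tuple(pattern)
    return (_first_at_least(tokens, sa, pattern, True)
            - _first_at_least(tokens, sa, pattern, False))

def infinigram_prob(tokens, sa, context, next_token):
    """P(next_token | context) using the longest context suffix seen in the corpus."""
    for start in range(len(context) + 1):           # start = 0 is the longest suffix
        suffix = tuple(context[start:])
        denom = count(tokens, sa, suffix)
        if denom > 0:                               # effective n = len(suffix) + 1
            return count(tokens, sa, suffix + (next_token,)) / denom
    return 0.0

corpus = "the cat sat on the mat the cat sat on the hat".split()
sa = build_suffix_array(corpus)
print(infinigram_prob(corpus, sa, ("on", "the"), "mat"))    # -> 0.5 on this toy corpus
```

The actual engine builds its suffix-array index over trillions of tokens ahead of time, which is what makes millisecond-level queries possible; the toy version above simply rebuilds everything in memory.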

Takeaways, Limitations

Takeaways:
Re-evaluation of n-gram models by building an n-gram language model at the scale of 5 trillion tokens.
Development of the ∞-gram model and the infini-gram engine, which improve on traditional n-gram models and open up new kinds of analysis.
Demonstration that text analysis with the ∞-gram model can surface LLM limitations, and that combining ∞-gram estimates with LLMs can improve their performance (a simplified interpolation sketch follows this list).
Millisecond-level ∞-gram probability queries that make real-time applications feasible.
Limitations:
The performance evaluation of the ∞-gram model presented in this study may be limited to a specific dataset.
The computational efficiency of the infini-gram engine may vary depending on the data size and n value.
Although shortcomings of LLMs are pointed out, specific measures for improving them are not provided.
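As a companion to the takeaway on combining the ∞-gram model with neural LLMs, below is a simplified interpolation sketch: it mixes the two models' next-token probabilities with a fixed weight and measures perplexity on a held-out sequence. The fixed weight `lam` and the probability lists are assumptions for illustration; the paper's actual interpolation scheme is more refined.

```python
import math

# Simplified illustration of combining ∞-gram estimates with a neural LLM to lower
# perplexity. A fixed mixing weight `lam` is an assumption for this sketch, and the
# probability lists below are placeholders rather than real model outputs.

def interpolated_prob(p_neural, p_infinigram, lam=0.5):
    """Linear mixture of a neural-LM probability and an ∞-gram probability."""
    return lam * p_neural + (1.0 - lam) * p_infinigram

def perplexity(neural_probs, infinigram_probs, lam=0.5, eps=1e-12):
    """Perplexity of a held-out sequence under the interpolated model.
    Element i of each list is that model's probability of token i given its context."""
    log_sum = 0.0
    for p_n, p_i in zip(neural_probs, infinigram_probs):
        log_sum += math.log(max(interpolated_prob(p_n, p_i, lam), eps))
    return math.exp(-log_sum / len(neural_probs))

# Toy example: when the ∞-gram assigns higher probability to the observed tokens,
# the mixture's perplexity drops below the neural LM's own perplexity.
neural = [0.20, 0.05, 0.30, 0.10]
inf    = [0.60, 0.01, 0.50, 0.40]
print(perplexity(neural, inf))          # mixture perplexity
print(perplexity(neural, neural))       # the neural LM's own perplexity
```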