eBandit: Kernel-Driven Reinforcement Learning for Adaptive Video Streaming

📰 ArXiv cs.AI

arXiv:2604.08791v1 (announce type: cross)

Abstract: User-space Adaptive Bitrate (ABR) algorithms cannot see the transport layer signals that matter most, such as minimum RTT and instantaneous delivery rate, and they respond to network changes only after damage has already propagated to the playout buffer. We present eBandit, a framework that relocates both network monitoring and ABR algorithm selection into the Linux kernel using eBPF. A lightweight epsilon-greedy Multi-Armed Bandit (MAB) runs ins
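The abstract names an epsilon-greedy Multi-Armed Bandit as the selector among ABR algorithms. As a rough illustration of that general technique only, here is a minimal user-space Python sketch. It is not the paper's eBPF implementation: the class name, the arm labels, the reward signal, and the default epsilon are all hypothetical choices made for this example.

```python
import random


class EpsilonGreedyBandit:
    """Generic epsilon-greedy bandit (illustrative; not eBandit's kernel code)."""

    def __init__(self, arms, epsilon=0.1, seed=None):
        self.arms = list(arms)
        self.epsilon = epsilon  # probability of exploring a random arm
        self.counts = {arm: 0 for arm in self.arms}    # pulls per arm
        self.values = {arm: 0.0 for arm in self.arms}  # running mean reward per arm
        self.rng = random.Random(seed)

    def select(self):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.arms)
        return max(self.arms, key=lambda arm: self.values[arm])

    def update(self, arm, reward):
        # Incremental mean: new_mean = old_mean + (reward - old_mean) / n
        self.counts[arm] += 1
        n = self.counts[arm]
        self.values[arm] += (reward - self.values[arm]) / n


if __name__ == "__main__":
    # Hypothetical arm labels standing in for candidate ABR algorithms.
    bandit = EpsilonGreedyBandit(
        ["throughput_based", "buffer_based", "hybrid"], epsilon=0.1, seed=0
    )
    arm = bandit.select()
    # Hypothetical reward; a real system would derive it from measured signals.
    bandit.update(arm, reward=1.0)
    print(arm, bandit.values[arm])
```

The incremental-mean update keeps only a count and a running average per arm, which is the kind of constant-memory bookkeeping that suits a constrained environment; how eBandit itself stores state or computes reward is not stated in the truncated abstract above.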

Published 13 Apr 2026
