Low-rank extended Kalman filtering for online learning of neural networks from streaming data

Peter Chang,Gerardo Duràn-Martín, Alexander Y Shestopaloff, Matt Jones,Kevin Murphy

CoRR(2023)

引用 0|浏览9
暂无评分
摘要
We propose an efficient online approximate Bayesian inference algorithm for estimating the parameters of a nonlinear function from a potentially non-stationary data stream. The method is based on the extended Kalman filter (EKF), but uses a novel low-rank plus diagonal decomposition of the posterior precision matrix, which gives a cost per step which is linear in the number of model parameters. In contrast to methods based on stochastic variational inference, our method is fully deterministic, and does not require step-size tuning. We show experimentally that this results in much faster (more sample efficient) learning, which results in more rapid adaptation to changing distributions, and faster accumulation of reward when used as part of a contextual bandit algorithm.
更多
查看译文
关键词
kalman filtering,online learning,neural
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要