UPDF AI

Information Theory of Decisions and Actions

Naftali Tishby,D. Polani

2011 · DOI: 10.1007/978-1-4419-1452-1_19
310 Citations

TLDR

Using a graphical model, a recursive Bellman optimality equation for information measures is derived, in analogy to reinforcement learning; from this, new algorithms for calculating the optimal trade-off between the value-to-go and the required information- to-go are obtained, unifying the ideas behind the Bellman and the Blahut–Arimoto iterations.