D. P. Bertsekas and J. N. Tsitsiklis, Neuro-dynamic programming, Athena Scientific, 1996.

K. Blennow and O. Sallnäs, WINDA-a system of models for assessing the probability of wind damage to forest stands within a landscape, Ecological Modelling, vol.175, pp.87-99, 2004.

C. Boutilier, R. Dearden, and M. Goldszmidt, Stochastic Dynamic Programming with Factored Representations, Artificial Intelligence, vol.121, issue.1, pp.49-107, 2000.

C. Claus and C. Boutilier, The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, Proceedings of AAAI/IAAI pp, pp.746-752, 1998.

M. A. Finney, Modeling the spread and behavior of prescribed natural fires, Proceedings of 12th Conference on Fire and Forest Meteorology, pp.138-143, 1994.

N. Forsell and R. Sabbadin, Approximate Linear-Programming Algorithms for Graph-Based Markov Decision Processes, Proceedings of ECAI, pp.590-599, 2006.

N. Forsell, P. Wikström, F. Garcia, R. Sabbadin, K. Blennow et al., Management of the risk of wind damage in forestry: a graph-based Markov decision process approach, Annals of operations research, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00833150

B. A. Gardiner, B. Marshall, A. Achim, R. E. Belcher, and C. J. Wood, The stability of different silvicultural systems: a wind-tunnel investigation, Forestry, vol.78, issue.5, pp.471-484, 2005.

C. Guestrin, D. Koller, R. Parr, and S. Venkataraman, Efficient Solution Algorithms for Factored MDPs, Journal of Artificial Intelligence Research, vol.19, pp.399-468, 2003.

C. Guestrin, M. G. Lagoudakis, and R. Parr, Coordinated Reinforcement Learning, Proceedings of ICML, pp.227-234, 2002.

J. R. Kok and N. A. Vlassis, Collaborative Multiagent Reinforcement Learning by Payoff Propagation, Journal of Machine Learning Research, issue.7, pp.1789-1828, 2006.

H. Meilby, N. Strange, and B. J. Thorsen, Optimal spatial harvest planning under risk of windthrow, Forest Ecology and Management, vol.149, pp.15-31, 2001.

N. Peyrard and R. Sabbadin, Mean Field Approximation of the Policy Iteration Algorithm for Graph-Based Markov Decision Processes, Proceedings of ECAI, pp.595-599, 2006.
URL : https://hal.archives-ouvertes.fr/hal-02755289

M. L. Puterman, Markov Decision Processes, 1994.

J. G. Schneider, W. K. Wong, A. W. Moore, and M. A. Riedmiller, Distributed Value Functions, Proceedings of ICML, pp.371-378, 1999.

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 1998.