All you need to know about Markov Decision processes, value- and policy-iteation as well as about Q learning approach