POMDP FAQ

What is a POMDP?

POMDP is an acronym for a partially observable Markov decision process. This is a mathematical model that can capture the domain dynamics that include uncertainty in action effects and uncertainty in perceptual stimuli. Once a problem is captured as POMDP, it them becomes more ammendable for solution using optimization techniques.

Does this have anything to do with Markov Chains or HMMs?

Sure does. These are all stochastic, discrete state, discrete time models. Borrowing from Michael Littman's nifty explanatory grid:

Markov Models		Do we have control over the state transitons?
Markov Models		NO	YES
Are the states completely observable?	YES	Markov Chain	MDP Markov Decision Process
Are the states completely observable?	NO	HMM Hidden Markov Model	POMDP Partially Observable Markov Decision Process