A graduate course on data mining that emphasizes and iterative process and statistical reasoning. Topics such as cluster validity are also given.