Data mining is used to discover patterns and relationships in data. Emphasis is on large complex data sets such as those in very large databases or through web mining. Topics: decision trees, association rules, clustering, case based methods, and data visualization. Prereqs: Introductory courses in statistics or probability (e.g., Stats 60), linear algebra (e.g., Math 51), and computer programming (e.g., CS 105).
Summer Session Online visiting students will be enrolled in STATS 202U. Matriculated Stanford students taking the courses online enroll in STATS 202.
Probability at the level of Stats 116 and familiarity with linear algebra