This page is subject to change before the start of the course.
This course focuses on both concepts and practice. We will introduce (a) the core data mining concepts and (b) practical skills for applying data mining techniques to solve real-world problems.
Topics to be covered:
See the course schedule for weekly topics.
This course will use Python for coding. We will use Jupyter Notebook for creating reproducible data science documents.
Prerequisites for this course include prior experience with Python programming and a foundational understanding of Linear Algebra and Probability.
Your grade will be based on homework, quizzes, and major assessments, giving you several opportunities to demonstrate your progress and understanding.
You will complete one in-person exam (15%) around Week 8. For the second major assessment, you may choose the format that works best for you. This flexibility is meant to give you different ways to demonstrate your learning and strengths.
You may miss up to two classes during the semester without penalty. Each additional absence will lower your final grade by one point. Students who attend every class will receive extra credit added to their final grade. If you expect challenges with attendance, please let me know early so we can explore possible solutions together.
All assignments are due on their scheduled dates. Late submissions will receive a deduction of 10% per day, with the exception of the final project paper, which must be submitted on time. Please ensure all written work follows the required format provided in class.
See the university policies page.