CMPT 732 Lecture Notes

  1. Course Introduction [“Course Introduction” slides]
  2. Hadoop Concepts [“Hadoop Concepts” slides]
  3. Python Preliminaries [“Python Preliminaries” slides]
  4. Spark Concepts [“Spark Concepts” slides]
  5. Spark DataFrames Concepts [“Spark DataFrames Concepts” slides]
  6. Cloud & Data Management [“Cloud & Data Management” slides]
  7. NoSQL & Cassandra [“NoSQL & Cassandra” slides]
  8. Data Management [“Data Management” slides]
  9. Spark Machine Learning [“Spark Machine Learning” slides]
  10. Spark Streaming [“Spark Streaming” slides]
  11. Small Data [“Small Data” slides]
  12. Other DataFrame Tools [“Other DataFrame Tools” slides]
  13. Other Big Data Tools [“Other Big Data Tools” slides]
  14. NumPy/Pandas Speed [“NumPy/Pandas Speed” slides] (not covered but left for interest)

Course home page.

Schedule

Week Deliverables (*) Lecture Date First Slide
1 Assign 0 in lab Sep 5
2 Assign 1 Sep 9
3 Assign 2 Sep 16
4 Assign 3 Sep 23
5 Assign 4 Sep 30
6 Assign 5 Oct 7
7 Assign 6 Oct 14
8 Assign 7, Quiz 1 Oct 21
9 Assign 8 Oct 28
10 Assign 9, Quiz 2 Nov 4
11 Assign 10 Nov 11
12 Nov 18
13 Quiz 3 Nov 25
14 Project

* Check CourSys for the actual due dates and times.