ID2221 Data Intensive Computing Platforms @ KTH
- Spark Scala: In this lab assignment we practice the basics of data intensive programming by setting up HDFS, HBase, Hadoop MapReduce, Spark, and Spark SQL, and implementing simple applications on them.
- Review questions 1: distributed file systems and NoSQL databases
- Review questions 2: data-parallel processing systems
- TensorFlow: A system for large-scale machine learning
- MLlib: Fast Training of GLMs using Spark MLlib