Skip to content

LMAPcoder/Distributed-Data-Analytics-Lab

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

40 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Distributed-Data-Analytics-Lab

This repository contains a series of machine learning projects from the Distributed Data Analytics lab course, which is part of the Data Analytics Master's program at Hildesheim Universität.

Content:

  1. Introduction to Message Passing Interface (MPI)
    • Point-to-point communication
    • Collective communication
  2. K-mean with MPI
  3. Gradient descent with MPI
  4. Counting neighbors with MapReduce program within Hadoop
  5. TextRank with MapReduce program within Hadoop
  6. RGB Image into grayscale with MapReduce program within Hadoop
  7. Introduction to Pytorch
    • Regression on Wine quality dataset
  8. Convolutional Architecture in Pytorch
    • Image classification of Flower dataset
    • Loading dataset on the RAM
  9. Multiprocessing in Pytorch
    • Image classification of Flower dataset with multiple workers
    • Tracking and visualization performance on Tensorboard
  10. More exercises with Pytorch
    • Parameters update without optimizer
    • Regression with Neural Network on California Housing dataset
    • Learning rate optimization
    • Activation functions: ReLU/TanH
  11. More exercises with Pytorch
    • Binary classification on a9a dataset
    • Multiprocessing with dataset partitions
    • Binary classification on Gisette dataset
    • Image classification on Flower dataset
    • Effect of batch size on training time
    • Effect of batch normalization on training accuracy
    • Depth-wise Separable Convolutional Layer

About

Machine learning projects using distributed computing

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published