# Data Science Resources

For interview preparation and learning.

Table of Contents:

- Interview Preparation
  - Questions
    - Data Science
    - Machine Learning
    - Deep Learning
    - SQL
    - NLP
    - Programming
  - Behavioural interview
    - Courses
    - Questions
    - Mock Interviews and Pieces of Advice
  - English
  - Home Assignments
    - Tips
    - Resources
  - Courses
  - Other
- Algorithms and Data Structures
  - Platforms
  - Courses
  - Resources
  - Articles
  - Books
- Python
  - Clean Code
  - Theory
  - Questions
  - Other
  - Practice
- SQL
  - Courses
  - Practice
- Machine Learning
  - Sites
  - Courses
  - Books
  - Cheatsheets
  - Articles
  - Applied ML
  - Blogs
  - Feature Engineering
    - Tutorials
    - Blog posts
  - Other
- MLOps
  - General
  - Other
- Deep Learning
  - Books
  - Courses
  - Tutorials
  - Blogs & Blog posts
  - Other
  - Generative AI
- NLP
  - Books
  - Courses
  - General
  - Large Language Models (LLMs) / Transformers
  - Embeddings
  - Reading papers with AI
  - Prompt Engineering
  - Tutorials
  - Blog posts
  - Articles

    - Word2Vec, Mikolov et al., Efficient Estimation of Word Representations in Vector Space
    - FastText, Bojanowski et al., Enriching Word Vectors with Subword Information
    - Attention, Bahdanau et al., Neural Machine Translation by Jointly Learning to Align and Translate
    - Transformers, Vaswani et al., Attention Is All You Need (see the attention sketch at the end of this README)
    - BERT, Devlin et al., BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
    - GPT-2, Radford et al., Language Models are Unsupervised Multitask Learners
    - GPT-3, Brown et al., Language Models are Few-Shot Learners
    - LaBSE, Feng et al., Language-agnostic BERT Sentence Embedding
    - CLIP, Radford et al., Learning Transferable Visual Models From Natural Language Supervision
    - RoPE, Su et al., RoFormer: Enhanced Transformer with Rotary Position Embedding
    - LoRA, Hu et al., LoRA: Low-Rank Adaptation of Large Language Models
    - InstructGPT, Ouyang et al., Training language models to follow instructions with human feedback
    - Scaling laws, Hoffmann et al., Training Compute-Optimal Large Language Models
    - FlashAttention, Dao et al., FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
    - NLLB, NLLB Team, No Language Left Behind: Scaling Human-Centered Machine Translation
    - Q8, Dettmers et al., LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
    - Self-Instruct, Wang et al., Self-Instruct: Aligning Language Models with Self-Generated Instructions
    - Alpaca, Taori et al., Alpaca: A Strong, Replicable Instruction-Following Model
    - LLaMA, Touvron et al., LLaMA: Open and Efficient Foundation Language Models

  - Packages

    - Turbo-Alignment: a library designed to streamline the fine-tuning and alignment of large language models, using advanced techniques to improve efficiency and scalability.
    - LitGPT: every LLM is implemented from scratch, with no abstraction layers and full control, making the implementations fast, minimal, and performant at scale (usage sketch at the end of this README).

- Computer Vision
- Graphs
- Reinforcement Learning
- RecSys
  - Courses
  - Books
  - Other
  - Packages
- Time Series
- Big Data
  - Books
  - Other
- System Design
  - Machine Learning System Design
- Math
  - General
  - Linear Algebra
  - Probability and Statistics
- A/B Tests
  - General
  - Blog posts
  - Metrics
  - Other
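
The NLP Articles list above cites several attention-centric papers (Bahdanau et al., Vaswani et al., FlashAttention). As a companion to those entries, here is a minimal NumPy sketch of scaled dot-product attention, `Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V`, as defined in Attention Is All You Need; the array shapes and variable names are illustrative, not taken from any paper's code.

```python
import numpy as np

def softmax(x, axis=-1):
    # Subtract the per-row max for numerical stability before exponentiating.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V (Vaswani et al., 2017)."""
    d_k = Q.shape[-1]
    scores = Q @ K.swapaxes(-1, -2) / np.sqrt(d_k)  # (seq_q, seq_k) similarity scores
    weights = softmax(scores, axis=-1)              # each query's weights over keys sum to 1
    return weights @ V                              # weighted average of value vectors

# Toy example: 4 query tokens attending over 6 key/value tokens, d_k = 8.
rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(4, 8)), rng.normal(size=(6, 8)), rng.normal(size=(6, 8))
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 8)
```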
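
And for LitGPT from the NLP Packages list, a minimal usage sketch based on the quickstart shown in its README (`LLM.load` plus `llm.generate`); the checkpoint name is only an example, and the API may change between releases, so verify against the current LitGPT docs.

```python
# Sketch assuming LitGPT's documented Python quickstart API; verify against current docs.
from litgpt import LLM

llm = LLM.load("microsoft/phi-2")           # example checkpoint; weights download on first use
text = llm.generate("What do Llamas eat?")  # single-prompt text generation
print(text)
```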