Skip to content

Collection of useful data science topics along with code and articles

Notifications You must be signed in to change notification settings

daiki-ikeuchi/Data-science

 
 

Repository files navigation

View on GitHub View on Medium Daily Data Science Tips

Collection of useful data science topics along with code and articles in my data science blog.

If you want to received updates of these blogs in your mailbox, you can subscribe to my Medium newsletter. To received bite-sized Python and daily data science tips in your mailbox, you can subscribe to Data Science Simplified.

Contents

  1. Data Science Tools
  2. Productive Tools
  3. Tools for Deployment
  4. Speed-up Tools
  5. Math Tools
  6. Machine Learning
  7. Natural Language Processing
  8. Computer Vision
  9. Time Series
  10. Feature Engineering
  11. Visualization
  12. Mathematical Programming
  13. Scraping
  14. Python
  15. Terminal
  16. Linear Algebra
  17. Data Structure
  18. Statistics
  19. Applications
  20. Learning Tips
  21. Productive Tips
  22. VSCode
  23. Book Review
  24. Data Science Portfolio

Data Science Tools

Title Article Repository
How to Create Fake Data with Faker link link
Introduction to Schema: A Python Libary to Validate your Data link link
Introduction to DVC: Data Version Control Tool for Machine Learning Projects link link
Introduction to Datasette: Explore and Publish Your Data in One Line of Code link
Introduction to Datapane: A Python Library to Build Interactive Reports link
Datapane’s New Features: Create a Beautiful Dashboard in Python in a Few Lines of Code link link
Introduction to Hydra.cc: A Powerful Framework to Configure your Data Science Projects link
Pytest for Data Scientists link link
4 Lessor-Known Yet Awesome Tips for Pytest link link
Introduction to Weight & Biases: Track and Visualize your Machine Learning Experiments in 3 Lines of Code link link
Kedro — A Python Framework for Reproducible Data Science Project link link
Orchestrate a Data Science Project in Python With Prefect link link
Introduction to Deepnote: Real-time Collaboration on Jupyter Notebook link
Great Expectations: Always Know What to Expect From Your Data link link

Productive Tools

Title Article Repository
How to Share your Python Objects Across Different Environments in One Line of Code link link
How to Share your Jupyter Notebook in 3 Lines of Code with Ngrok link
3 Tools to Track and Visualize the Execution of your Python Code link link
2 Tools to Automatically Reload when Python Files Change link link
How to Strip Outputs and Execute Interactive Code in a Python Script link link
Pydash: A Kitchen Sink of Missing Python Utilities link link
4 pre-commit Plugins to Automate Code Reviewing and Formatting in Python link link
Write Clean Python Code Using Pipes link link
Introducing FugueSQL — SQL for Pandas, Spark, and Dask DataFrames link link
DeepDiff — Recursively Find and Ignore Trivial Differences Using Python link link

Tools for Deployment

Title Article Repository
How to Effortlessly Publish your Python Package to PyPI Using Poetry link link
Typer: Build Powerful CLIs in One Line of Code using Python link link

Speed-up Tools

Title Article Repository
Cython-A Speed-Up Tool for your Python Function link link
Train your Machine Learning Model 150x Faster with cuML link link

Math Tools

Title Article Repository
SymPy: Symbolic Computation in Python link link
How to Create Mathematical Animations like 3Blue1Brown Using Python link link

Machine Learning

Title Article Repository
How to Monitor And Log your Machine Learning Experiment Remotely with HyperDash link link
How to Efficiently Fine-Tune your Machine Learning Models link link
How to Learn Non-linear Dataset with Support Vector Machines link link
Introduction to IBM Federated Learning: A Collaborative Approach to Train ML Models on Private Data link link
3 Steps to Improve your Efficiency when Hypertuning ML Models link
human-learn: Create a Human Learning Model by Drawing link link
Patsy: Build Powerful Features with Arbitrary Python Code link link
SHAP: Explain Any Machine Learning Model in Python link link

Natural Language Processing

Title Article Repository
Sentiment Analysis of LinkedIn Messages link link
Find Common Words in Article with Python Module Newspaper and NLTK link link
How to Tokenize Tweets with Python link link
How to Solve Analogies with Word2Vec link link
What is PyTorch link link
Convolutional Neural Network in Natural Language Processing link link
Supercharge your Python String with TextBlob link link
pyLDAvis: Topic Modelling Exploration Tool That Every NLP Data Scientist Should Know link link
Streamlit and spaCy: Create an App to Predict Sentiment and Word Similarities with Minimal Domain Knowledge link link
Build a Robust Conversational Assistant with Rasa link link
I Analyzed 2k Data Scientist and Data Engineer Jobs and This is What I Found link link

Computer Vision

Title Article Repository
How to Create an App to Classify Dogs Using fastai and Streamlit link link

Time Series

Title Article Repository
Kats: a Generalizable Framework to Analyze Time Series Data in Python link link
How to Detect Seasonality, Outliers, and Changepoints in Your Time Series link link

Feature Engineering

Title Article Repository
3 Ways to Extract Features from Dates with Python link link
Similarity Encoding for Dirty Categories Using dirty_cat link link

Visualization

Title Article Repository
How to Embed Interactive Charts on your Articles and Personal Website link link
What I Learned from Scraping 15k Data Science Articles on Medium link link
How to Create Interactive Plots with Altair link link
How to Create a Drop-Down Menu and a Slide Bar for your Favorite Visualization Tool link link
I Scraped more than 1k Top Machine Learning Github Profiles and this is what I Found link link
Top 6 Python Libraries for Visualization: Which one to Use? link link
Introduction to Yellowbrick: A Python Library to Visualize the Prediction of your Machine Learning Model link link
Visualize Gender-Specific Tweets with Scattertext link link
Visualize Your Team’s Projects Using Python Gantt Chart link link
How to Create Bindings and Conditions Between Multiple Plots Using Altair link link
How to Sketch your Data Science Ideas With Excalidraw link
Pyvis: Visualize Interactive Network Graphs in Python link link
Build and Analyze Knowledge Graphs with Diffbot link
Observe The Friend Paradox in Facebook Data Using Python link link
What skills and backgrounds do data scientists have in common? link link
Visualize Similarities Between Companies With Graph Database link link
Visualize GitHub Social Network with PyGraphistry link link
Find the Top Bootcamps for Data Professionals From Over 5k Profiles link link

Mathematical Programming

Title Article Repository
How to choose stocks to invest in with Python link link
Maximize your Productivity with Python link link
How to Find a Good Match with Python link link
How to Solve a Staff Scheduling Problem with Python link link
How to Find Best Locations for your Restaurants with Python link link
How to Schedule Flights in Python link link
How to Solve a Production Planning and Inventory Problem in Python link link

Scraping

Title Article Repository
Web Scrape Movie Database with Beautiful Soup link link
top-github-scraper: Scrape Top Github Users and Repositories Based On a Keyword in One Line of Code link link

Python

Title Article Repository
Numpy Tricks for your Data Science Projects link link
Timing for Efficient Python Code link link
How to Use Lambda for Efficient Python Code link link
Python Tricks for Keeping Track of Your Data link link
Boost Your Efficiency With Specialized Dictionary Implementations in Python link link
Dictionary as an Alternative to If-Else link link
How to Use Zip to Manipulate a List of Tuples link link
Get the Most out of Your Array With These Four Numpy Methods link link
3 Python Tricks to Read, Create, and Run Multiple Files Automatically link link
How to Exclude the Outliers in Pandas DataFrame link link
Python Clean Code: 6 Best Practices to Make Your Python Functions More Readable link link
3 Techniques to Effortlessly Import and Execute Python Modules link link
Simplify Your Functions with Functools’ Partial and Singledispatch link link

Terminal

Title Article Repository
How to Create and View Interactive Cheatsheets on the Command-line link
Understand CSV Files from your Terminal with XSV link
Prettify your Terminal Text With Termcolor and Pyfiglet link link
Stop Using Print to Debug in Python. Use Icecream Instead link
Rich: Generate Rich and Beautiful Text in the Terminal with Python link link
Create a Beautiful Dashboard in your Terminal with Wtfutil link link
3 Tools to Monitor and Optimize your Linux System link
Ptpython: A Better Python REPL link link
fd: a Simple but Powerful Tool to Find and Execute Files on the Command Line link
Speed Up your Command-Line Navigation with These 3 Tools link
Python and Data Science Snippets on the Command Line link link

Linear Algebra

Title Article Repository
How to Build a Matrix Module from Scratch link link
Linear Algebra for Machine Learning: Solve a System of Linear Equations link link

Data Structure

Title Article Repository
Convex Hull: An Innovative Approach to Gift-Wrap your Data link link
How to Visualize Social Network With Graph Theory link link
How to Search Data with KDTree link link
How to Find the Nearest Hospital with a Voronoi Diagram link link

Statistics

Title Article Repository
Can Datasets of a Dinosaur and a Circle have Identical Statistics? link link
Introduction to One-Way ANOVA: A Test to Compare the Means between More than Two Groups link link
Bayes’ Theorem, Clearly Explained with Visualization link link
Detect Change Points with Bayesian Inference and PyMC3 link link

Applications

Title Article Repository
How to Create an Interactive Startup Growth Calculator with Python link link
Streamlit and spaCy: Create an App to Predict Sentiment and Word Similarities with Minimal Domain Knowledge link link
PyWebIO: Write Interactive Web App in Script Way Using Python link link
PyWebIO 1.3.0: Add Tabs, Pin Input, and Update an Input Based on Another Input link link
Simulate Real-life Events in Python Using SimPy link link
Create an App to Deal with Boredom Using PyWebIO link link

Learning Tips

Title Article Repository
How to Learn Data Science when Life does not Give You a Break link
How to Accelerate your Data Science Career by Putting yourself in the Right Environment link
To become a Better Data Scientist, you need to Think like a Programmer link
How not to be Overwhelmed with Data Science link

Productive Tips

Title Article Repository
How to Organize your Data Science Articles with Github link link
How to Create Reusable Command-Line link
5 Reasons why you should Switch from Jupyter Notebook to Scripts link
3 Ways to Get Notified with Python link link
7 Reasons Why you Should Start Documenting your Code link

VSCode

Title Article Repository
How to Leverage Visual Studio Code for your Data Science Projects link
Top 4 Code Viewers for Data Scientist in VSCode link
Incorporate the Best Practices for Python with These Top 4 VSCode Extensions link
Boost Your Efficiency with Customized Code Snippets on VSCode link
Top 9 Keyboard Shortcuts in VSCode for Data Scientists link

Book Review

Title Article Repository
Python Machine Learning: A Comprehensive Handbook for Machine Learning link

Data Science Portfolio

Title Article Repository
How to Create an Elegant Website for your Data Science Portfolio in 10 minutes link
Build an Impressive Github Profile in 3 Steps link

Supporters

Special thanks to these supporters for supporting this project!

About

Collection of useful data science topics along with code and articles

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 96.8%
  • HTML 3.2%