A step-by-step guide to building a highly scalable data streaming pipeline in Python. The article is available here
- Install Redis and run it locally.
- Clone the repository.
git clone https://github.com/iam-mhaseeb/Data-Streaming-Pipeline.git
- Install the requirements.
pip install -r requirements.txt
- You are good to go!
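Before starting the pipeline, it can help to confirm that Redis is actually answering. The snippet below is only a sketch: it assumes Redis listens on the default localhost:6379 without authentication, and the helper name redis_is_reachable is hypothetical, not part of this repository. It sends an inline PING over a plain socket and checks for the +PONG reply.

```python
import socket


def redis_is_reachable(host="localhost", port=6379, timeout=1.0):
    """Return True if a server at host:port answers an inline PING with +PONG."""
    try:
        with socket.create_connection((host, port), timeout=timeout) as conn:
            # Redis accepts inline commands terminated by CRLF.
            conn.sendall(b"PING\r\n")
            return conn.recv(16).startswith(b"+PONG")
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

Calling redis_is_reachable() with no arguments checks the default local instance. A server that requires a password replies with an error instead of +PONG, so the function returns False in that case.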
- Start the producer quotes_spider:
cd producer
scrapy crawl quotes
- Start the consumer quotes_consumer:
cd consumer
python quotes_consumer.py
- Watch data flow through the pipeline as soon as the producer generates it.
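For orientation, the producer-to-consumer flow can be sketched roughly as below. This README does not show how the spider and consumer actually talk to Redis, so the sketch makes its own assumptions: a Redis list used as a FIFO queue (LPUSH on one side, BRPOP on the other) through the redis-py client, with JSON-encoded items. The key name quotes and the functions make_client, publish and consume are hypothetical and may not match the repository's code.

```python
import json


def make_client(host="localhost", port=6379):
    """Create a redis-py client (assumption: installed via `pip install redis`)."""
    import redis  # imported lazily so the helpers below work with any compatible client
    return redis.Redis(host=host, port=port, decode_responses=True)


def publish(client, item, key="quotes"):
    """Producer side: serialize one item and push it onto the head of the list."""
    client.lpush(key, json.dumps(item))


def consume(client, key="quotes", timeout=1):
    """Consumer side: yield items until the list stays empty for `timeout` seconds."""
    while True:
        # BRPOP blocks until an item arrives; it returns (key, value) or None on timeout.
        reply = client.brpop(key, timeout=timeout)
        if reply is None:
            return
        _key, raw = reply
        yield json.loads(raw)
```

With a local Redis running, one process could call publish(make_client(), {"text": "..."}) while another loops over consume(make_client()). Pushing on the head and popping from the tail preserves arrival order, which is why the pair LPUSH/BRPOP is used here.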
- Muhammad Haseeb - Initial work - Muhammad Haseeb
This project is licensed under the MIT License.