A minimal scrapping script of 60 lines to get the open source tools and SaaS names behind the world's best startups.
You can simply clone and run the scrapping script : scrap.py
What things you need to install to use this?
- Python 2.7
- BeautifulSoup 4
- requests
Requests module : To install Requests, simply run this simple command in your terminal of choice:
$ pipenv install requests
BeaufilSoup Ubuntu
$ apt-get install python-bs4 (for Python 2)
Python package BeautifulSoup
$ easy_install beautifulsoup4
$ pip install beautifulsoup4
tech_inside("zomato")
Output : {
"DevOps": ["GitHub", "Git", "Docker", "npm", "New Relic", "Jenkins", "Vim", "Travis CI", "Ansible", "Sentry", "Docker Compose", "Kubernetes"],
"Business Tools": ["G Suite", "Slack", "HipChat"],
"Utilities": ["Google Analytics", "Google Drive", "Postman", "SendGrid"],
"Application and Data": ["nginx", "JavaScript", "PHP", "Node.js", "jQuery", "HTML5", "MySQL", "Python", "React", "Amazon EC2", "Redis", "MongoDB", "Java", "Amazon S3", "Ubuntu", "RabbitMQ", "Flask", "Amazon EC2 Container Service", "Scala", "Google Compute Engine", "Debian", "Memcached"]
}
- Python 2.7 - Core programming language used
- BeautifulSoup - Scrapping library
- Requests - Used to make get a html doc
- JSON JSON response
- Data scrapped from : Stackshare