Web application for OmniScrapper management.
It main goal is to provide:
- Ability to define scrappers in UI
- Ability to test defined scrappers from UI
- Ability to schedule scrapping schedule
If you want just to see how the application works, you can use dockerized setup to start project with a single command.
Run
docker-compose up -d postgres redis kafka web
then run
docker-compose up -d scheduler runner
in the directory of project, and they visit localhost:2300.
Dockerized version has already defines scrapper for HackerNews website, which will be started right after services start.
- Copy env variables
cp .env.development.sample .env.development
- Run
make db-prepare
it will create db, run migrations and fill database with data - To start the server run
make server
Run rake db:seed
to fill the database with sample data ready for testing.
It will add one real site, schema and task for scrapping. Feel free to go to tasks page and run sample scrapping.