Notion API - get Awwwards Users votes

Scheduled web-scraping script that saves Awwwards votes to a Notion database. Ready to be deployed on Heroku.

Installation

Clone the repository with Git to install the script.

Features

Saves all votes assigned to a specific Awwwards page in a Notion table. The following fields are scraped for each vote:

Name (text)
Status (select)
Country (text)
Website (link)
Vote (Number)
Design (Number)
Usability (Number)
Creativity (Number)
Content (Number)
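
As an illustration of how the fields above could map onto Notion page properties, here is a minimal sketch. The helper name toNotionProperties and the sample values are hypothetical, and treating Name as the database's title property is an assumption; the template's actual property types may differ.

```javascript
// Hypothetical helper: converts one scraped vote into Notion page properties.
// Property names mirror the field list above; mapping "Name" to the
// database's title property is an assumption.
function toNotionProperties(vote) {
  const text = (content) => [{ text: { content } }];
  return {
    Name: { title: text(vote.name) },
    Status: { select: { name: vote.status } },
    Country: { rich_text: text(vote.country) },
    Website: { url: vote.website },
    Vote: { number: Number(vote.note) },
    Design: { number: Number(vote.design) },
    Usability: { number: Number(vote.usability) },
    Creativity: { number: Number(vote.creativity) },
    Content: { number: Number(vote.content) },
  };
}

// Example input with made-up values, shaped like the scraper's output.
const sample = {
  name: "Jane Doe", status: "Jury", country: "France",
  website: "https://example.com", note: "7.5",
  design: "8", usability: "7", creativity: "8", content: "7",
};
const properties = toNotionProperties(sample);
console.log(properties.Vote); // { number: 7.5 }

// With the official client, the result could then be sent as:
//   const { Client } = require("@notionhq/client");
//   const notion = new Client({ auth: process.env.NOTION_KEY });
//   await notion.pages.create({
//     parent: { database_id: process.env.NOTION_DATABASE_ID },
//     properties,
//   });
```

Scraped values arrive as strings, so the numeric fields are converted with Number before being sent.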

Built with

Node.js
Notion API (official @notionhq/client)
Puppeteer
AMQP

Usage

Using this script requires a Notion account. Then, if needed, create a new integration. Duplicate this template, share it with your integration, enter the Awwwards URL you want to follow (e.g. https://www.awwwards.com/sites/[SITE]), save the database ID in .env, and insert the table in any page where you want to display a view of it (filtered or not).
Save the Notion API variables (NOTION_KEY and NOTION_DATABASE_ID) to the .env file and to Heroku.
Modify the job schedule inside the ./clock.js file (the default cron expression is */50 * * * *).

const JOBS = [{
  name: "Cron process 1",
  message: { "taskName": "getNotes", "queue": "worker-queue" },
  cronTime: "*/50 * * * *",  // config -> https://www.npmjs.com/package/node-cron
  repeat: 1
},
//{... other Job}
];
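
When choosing a cronTime value, it helps to see which minutes a step expression actually matches. The sketch below is a hypothetical helper (matchedMinutes is not part of the project) and assumes the scheduler follows standard cron step semantics, where */N in the minute field matches every minute of the hour divisible by N.

```javascript
// Expands the minute field of a cron expression into the minutes it matches.
// Supports only "*", "*/N" and a single number, which is enough for this example.
function matchedMinutes(cronTime) {
  const minuteField = cronTime.trim().split(/\s+/)[0];
  const all = Array.from({ length: 60 }, (_, i) => i);
  if (minuteField === "*") return all;
  const step = /^\*\/(\d+)$/.exec(minuteField);
  if (step) return all.filter((m) => m % Number(step[1]) === 0);
  return [Number(minuteField)];
}

console.log(matchedMinutes("*/50 * * * *")); // [ 0, 50 ]
console.log(matchedMinutes("*/15 * * * *")); // [ 0, 15, 30, 45 ]
```

Under that assumption, the default expression fires at minutes 0 and 50 of every hour rather than strictly every 50 minutes. A step that divides 60 evenly (such as */30) gives regular intervals.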

You can try the script locally with the following command: $ npm run start. If it works, the Notion table should fill up after a few seconds.
Then deploy to Heroku (using the Heroku CLI) and don't forget to apply the configuration described below.
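
Before a local run or a deploy, it can be useful to confirm that both Notion variables are set. The following is a minimal sketch of such a check; missingVars and REQUIRED_VARS are hypothetical names, not part of the project.

```javascript
// Hypothetical pre-flight check: lists required variables that are missing or empty.
const REQUIRED_VARS = ["NOTION_KEY", "NOTION_DATABASE_ID"];

function missingVars(env) {
  return REQUIRED_VARS.filter((name) => !env[name] || env[name].trim() === "");
}

// Example with a partial environment (placeholder value, not a real key).
console.log(missingVars({ NOTION_KEY: "secret_placeholder" })); // [ 'NOTION_DATABASE_ID' ]

// In the real script one might call: missingVars(process.env)
```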

Notion API

Discover the Notion API here.

Heroku

Heroku must be configured to run these scripts.

Variables:
- NOTION_DATABASE_ID
- NOTION_KEY
Add-on:
- Add the free CloudAMQP add-on in the Resources tab
Buildpacks (Settings, in this exact order):
- heroku/nodejs
- https://github.com/jontewks/puppeteer-heroku-buildpack (required for Puppeteer)
Procfile (required):
- worker: node worker.js
- clock: node clock.js
Dynos: 2 (free) dynos must be started:
- $ heroku ps:scale worker=1
- $ heroku ps:scale clock=1
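
The Procfile starts two processes, and the CloudAMQP add-on together with the "queue" field of each job suggests that the clock hands tasks to the worker through an AMQP queue. The sketch below shows one way that hand-off might look. It is an assumption, not the project's actual code: publishJob is a hypothetical name, and the channel is assumed to expose the amqplib channel methods assertQueue and sendToQueue. A fake in-memory channel is used so the example runs without a broker.

```javascript
// Hypothetical sketch of what the clock process might do for each job:
// publish the job's message to its queue. `channel` is expected to look like
// an amqplib channel (assertQueue / sendToQueue); using amqplib is an assumption.
async function publishJob(channel, job) {
  const { queue } = job.message;
  await channel.assertQueue(queue, { durable: true });
  const body = Buffer.from(JSON.stringify(job.message));
  return channel.sendToQueue(queue, body, { persistent: true });
}

// In-memory stand-in for a channel so the sketch runs without a broker.
const sent = [];
const fakeChannel = {
  assertQueue: async (queue) => ({ queue }),
  sendToQueue: (queue, content) => {
    sent.push({ queue, content });
    return true;
  },
};

const job = {
  name: "Cron process 1",
  message: { taskName: "getNotes", queue: "worker-queue" },
};

const done = publishJob(fakeChannel, job).then(() => {
  console.log(sent[0].queue, sent[0].content.toString());
});
```

On the other side, the worker would consume from the same queue and run the task named in taskName.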

Warning concerning Awwwards DOM

Because the scraper reads the Awwwards DOM through Puppeteer, it may break in the future. If the script can no longer retrieve some information from the Awwwards page, the targeted selectors probably need to be updated. You can do this in the ./app.js file.

return { 
  name: el.querySelector(".info > .rows > .row a").textContent,
  status: el.querySelector(".list-number-awards .tooltip-text").textContent,
  website: el.querySelector(".info > .rows .row:last-child a").textContent,
  country: el.querySelector(".info > .rows .row strong:last-child").textContent,
  note: el.querySelector(".note").textContent,
  design: el.querySelector(".list-circle-notes > .design").dataset.note,
  usability: el.querySelector(".list-circle-notes > .usability").dataset.note,
  creativity: el.querySelector(".list-circle-notes > .creativity").dataset.note,
  content: el.querySelector(".list-circle-notes > .content").dataset.note,
}
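
The selectors above throw a TypeError as soon as one of them no longer matches, because querySelector returns null. A more defensive variant could return null for the missing field instead. This is only a sketch: textOf and noteOf are hypothetical helpers, and the fake element exists solely so the example runs outside a browser.

```javascript
// Hypothetical helpers: read a value from the first element matching `selector`,
// returning null instead of throwing when Awwwards changes its markup.
function textOf(el, selector) {
  const node = el.querySelector(selector);
  return node ? node.textContent.trim() : null;
}

function noteOf(el, selector) {
  const node = el.querySelector(selector);
  return node && node.dataset ? (node.dataset.note ?? null) : null;
}

// Minimal fake element so the sketch runs outside a browser.
const fakeNodes = {
  ".note": { textContent: " 7.5 " },
  ".list-circle-notes > .design": { dataset: { note: "8" } },
};
const fakeEl = { querySelector: (selector) => fakeNodes[selector] || null };

console.log(textOf(fakeEl, ".note")); // 7.5
console.log(noteOf(fakeEl, ".list-circle-notes > .design")); // 8
console.log(textOf(fakeEl, ".info > .rows > .row a")); // null
```

If the extraction runs inside a Puppeteer page.evaluate callback, such helpers would have to be defined inside that callback, since functions from the Node.js scope are not available in the browser context.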

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

License

MIT
