QScraper 2.0

Specifications for the next-generation QScraper

Draft 1.2

Overview

QScraper originally started as a simple 'script-like' program that would perform its task and save a single data file (.json) recording the results. This approach is inefficient, restrictive, and difficult to build around and as such has prompted the need for QScraper 2.0 to take shape.

Platform

Azure vs AWS vs Heroku

Currently the Platform of choice looks to be Azure. Amazon Web Services and Heroku have both been appealing but have particular drawbacks that I would like to avoid.

On Azure we would utilize the offered 'Cloud Services' and 'Mobile Services' products. With Cloud Services we would build our various scraper modules (explained below) and the API service layer into one 'app'.

Programming Language

.NET vs NodeJS

Currently .NET is appealing due to the power of JSON.NET and the familiarity with .NET (versus say Ruby). Alternatively, if we wanted to be super hipster we could go the NodeJS route, but having you learn Javascript (NodeJS) from scratch would not be a quick task.

Requirements

Provide a API for all future Deal Flux apps to utilize
Provide Push Notifications to client apps
Be capable of hosting a static informational website
Be highly scalable
Be highly configurable (Ex. all responses would be GZIP compressed)
Be modular, allowing for each deal source module that we create to be 'dropped' in and automatically added to the list of services available through the API.
Be lightweight, as per #3, this needs to be lightweight and scalable. While hardware (Azure) can compensate for poor performance of an application to some extent, it will be costly and due to the mass amount of requests that Deal Flux client apps will generate this needs to be able to handle thousands of requests every few minutes (just as a general benchmark).
Be Restful Compliant
Support for language/region-designated results
Perserve description formatting.

Outline of API Structure

Currently the idea is to break each 'deal source' into their own API route and then provide a consistent set of options/commands that are available on any given deal source API route/request. Furthermore there should be an available bulk request.

dealfluxapp.com/v2/{deal-source-name}/{deal-title}/

v2

Just present as a basic form of versioning for the API allowing us to easily and quickly make significant changes in future API rollouts if need be while still maintaining full support for previous apps (non-upgraded apps)

deal-source-name

This would be a all lower case, no special character, and hyphenated name for the deal source.

Ex. 'woot', '1saleaday', 'yugster', etc.

deal-title

This would be a specified category name within a given deal source. For example on Woot there are various different deal pages/categories.

Ex. 'tech', 'home', 'tools-and-garden', etc.

These would be different per each deal source as no two deal sources have the same amount or types of pages/categories.

Deal Sources Supported

TeeFury
NeweggFlash
Amazon
- Gold Box Deals (include upcoming deals http://www.amazon.com/gp/goldbox/all-deals?ie=UTF8&gbNodeId=468642)
- Amazon Featured Deals (Video Games)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
QScraper		QScraper
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

QScraper 2.0

Specifications for the next-generation QScraper

Draft 1.2

Overview

Platform

Azure vs AWS vs Heroku

Programming Language

.NET vs NodeJS

Requirements

Outline of API Structure

v2

deal-source-name

deal-title

Deal Sources Supported

Woot!

1SaleADay

Yugster

DailySteals

Dell Daily Deals

About

Releases

Packages

Languages

wonderkiln/qscraper

Folders and files

Latest commit

History

Repository files navigation

QScraper 2.0

Specifications for the next-generation QScraper

Draft 1.2

Overview

Platform

Azure vs AWS vs Heroku

Programming Language

.NET vs NodeJS

Requirements

Outline of API Structure

v2

deal-source-name

deal-title

Deal Sources Supported

Woot!

1SaleADay

Yugster

DailySteals

Dell Daily Deals

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages