- Introduction/Overview
- Objective
- Methodology/Approach
- Installation/Requirements
- File Descriptions
- Data Collection and Sources
- Results/Conclusions
- License
For this project, I worked on understanding the results of an A/B test run by an e-commerce website. The company has developed a new web page to try and increase the number of users who "convert," meaning the number of users who decide to pay for the company's product. My goal is to work through the test results to help the company understand if they should implement this new page, keep the old page, or perhaps run the experiment longer to make their decision.
To analyze A/B test results to determine the more effective web page design for increasing user conversion rates.
A/B testing is one of the common technique used for analyzing which web designs drive more traffic. Many companies use A/B testing to test feature changes, layouts and even colors to increase a metric that measures a interest from the users.
The way this method works is, a group of people called control group
is shown the old version of the web page and a group of people called experiment group
is shown the new version of the web page. Based on how both groups respond, we can determine if new page is better and should be launched. It turns out that it is just an application of Hypothesis Testing
.
We can set up hypothesis in this way (This is often related to what your problem is about):
NULL
: The new page is no better or worse than the old page.ALTERNATIVE
: The new page is better than the old page.
Once we have our hypothesis set, we can use the data to see which hypothesis is supported.
My approach includes interpreting the A/B test results to suggest whether or not the E-Commerce company has to implement the new web page to increase conversion rate in the following ways:
- Using descriptive statistics
- Hypothesis Testing
- Z_test
- Logistic Regression
Please refer to the ABTestResultsForEcommerce.ipynb
notebook for detailed info on how these individual analyses are performed.
The following libraries and tools are used in this project:
- Numpy
- Pandas
- Statsmodels
- Matplotlib
- The code should run with no issues using Python version
3.*.
This project contains the following files:
ab_data.csv
: holds the A/B test results data.countries.csv
: Another file holding country information for each user inab_data.csv
fileAB-Test-Insights-For-E-Commerce-Conversion.ipynb
: Jupyter notebook containing Python code and briefly describing each step.AB-Test-Insights-For-E-Commerce-Conversion.html
: HTML version of the notebook.
This project contains two datasets:
-
ab_data.csv
: This file contains the A/B test results. The structure of the file is as followsuser_id
: Unique identifier for each usertimestamp
: The date and time when the user visited the pagegroup
: Indicates whether the user was in the control or treatment grouplanding_page
: Shows the version of the page the user landed on (old_page or new_page)converted
: Indicates whether the user converted (1) or not (0)
-
countries.csv
: This file contains country information corresponding to each user inab_data.csv
. It helps in understanding geographical influences in the A/B test results.
According to the analysis:
- All analyses suggested sticking with the old web page as the increase in conversion rate with the new web page doesn't seem statistically significant.
- The results can alter if the A/B test is conducted longer.
Licensed under MIT License