Skip to content

Exploratory and/or Explanatory Data Analysis (EDA) in ecommerce data using python. This analysis aims to provide a basic overview and understanding of the various transaction information in a e-commerce company.

Notifications You must be signed in to change notification settings

novianggita/EDA-in-E-Commerce-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Project Portfolio EDA in E-Commerce Data using Python

EDA stands for Exploratory and Explanatory Data Analysis. I am using an e-commerce dataset containing various online transaction information. This dataset has 8 variables and 4870 rows. Here is the description of each variable:

  1. InvoiceNo: A unique number/code for each sale or purchase
  2. StockCode: A unique number/code for each product
  3. Description: The name or description of each product
  4. Quantity: The quantity of products bought/sold
  5. InvoiceDate: The date the invoice was issued
  6. UnitPrice: The price per unit of the product
  7. CustomerID: A unique number for each customer
  8. Country: The country where the transaction took place or the location of the customer who made the purchase

The purpose of this analysis is to provide a basic overview and deep understanding of the transaction information conducted from December 2010 to December 2011. Here are some aspects being analyzed:

  • Products with the highest sales
  • Countries with the highest sales
  • Correlation between product prices and the quantity sold
  • Customers with the highest purchases

Some insights obtained include:

  1. November 2011 had the highest sales period
  2. The United Kingdom dominated the sales compared to other countries. The highest sales occurred in November 2011
  3. There is a weak and negative correlation between Unit Price and Quantity. This means that products with low Unit Prices are not always sold in high quantities, and/or products with high Unit Prices are not always sold in low quantities
  4. Some customers purchased a few items but incurred high costs (total price)

These insights can help the company to:

  1. Maximize profit in countries with high sales volumes and apply the same sales strategies in countries with low sales volumes
  2. Maintain customer loyalty by providing rewards to customers with the highest purchases
  3. Evaluate products with low sales volumes
  4. Identify market segments for specific products so that recommendation features can be accurately applied to each customer segment

This analysis still needs further development, especially in terms of:

  1. Evaluating the causes of the significant sales increase in November and the significant decrease in sales in the following month (December)
  2. Classifying the types of products with low sales and identifying the reasons behind it.

If you have any suggestions or feedback, please don't hesitate to contact to me in direct message on Email or LinkedIn: [email protected] or https://www.linkedin.com/in/novia-anggita-aprilianti/

#python #EDA #featuresengineering #datacleaning #ecommerce

About

Exploratory and/or Explanatory Data Analysis (EDA) in ecommerce data using python. This analysis aims to provide a basic overview and understanding of the various transaction information in a e-commerce company.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published