Student Branch
Software
Given is a transnational data set that contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based non-store online retail. The company mainly sells unique all-occasion gifts. Many customers of the company are wholesalers. Answer the following questions by analyzing the dataset. 1. How much missing data are there in the dataset? 2. How many transactions were canceled? What is the total quantity of canceled transactions? 3. Which country has the highest gross total price? Hint: sumof (quantity x unit price) 4. Plot the overall trend in total sales. 5. Which products have the highest sales, and in which countries? Give a plot 6. Do some customers have higher average transaction values or quantities than others? 7. What are the most popular products by sales? 8. What are the most popular products, and how do sales of these products vary over time? 9. Analyse and visualize total sales per year, month, transactions per week, transactions per day, and transactions per hour 10. Can we predict customer churn or lifetime value based on past purchasing behavior?
Data Attribute Information:
● InvoiceNo: Invoice number. Nominal, a 6-digit integral number uniquely assigned to each transaction. If this code starts with the letter 'c', it indicates a cancellation.
● StockCode: Product (item) code. Nominal, a 5-digit integral number uniquely assigned to each distinct product.
● Description: Product (item) name. Nominal.
● Quantity: The quantities of each product (item) per transaction. Numeric.
● InvoiceDate: Invoice Date and time. Numeric, the day and time when each transaction was generated.
● UnitPrice: Unit price. Numeric, Product price per unit in sterling.
● CustomerID: Customer number. Nominal, a 5-digit integral number uniquely assigned to each customer.
● Country: Country name. Nominal, the name of the country where each customer resides. Link to Dataset: https://drive.google.com/file/d/14qR5DTrKwr-oERbqlSB2rVQiSzI_VmRL/view?usp=sharing