Coder Social home page Coder Social logo

project1a1b / big-mart-sales-prediction Goto Github PK

View Code? Open in Web Editor NEW
7.0 1.0 2.0 1.72 MB

Building Big Mart Sales Prediction model

Python 0.04% Jupyter Notebook 99.73% HTML 0.23%
bigmart-sales-prediction machine-learning prediction python-3 regression data-science datacleaning sales

big-mart-sales-prediction's Introduction

Big-Mart-Sales-Prediction

The aim is to build a predictive model and find out the sales of each product at a particular store. Create a model by which Big Mart can analyse and predict the outlet production sales.

A perfect project to learn Data Analytics and apply Machine Learning algorithms (Linear Regression, Random Forest Regressor, XG Boost) to predict the outlet production sales.

Mini Project 2-A

Dataset Description

BigMart has collected sales data from the year 2013, for 1559 products across 10 stores in different cities. Where the dataset consists of 12 attributes like Item Fat, Item Type, Item MRP, Outlet Type, Item Visibility, Item Weight, Outlet Identifier, Outlet Size, Outlet Establishment Year, Outlet Location Type, Item Identifier and Item Outlet Sales. Out of these attributes response variable is the Item Outlet Sales attribute and remaining attributes are used as the predictor variables.

The data-set is also based on hypotheses of store level and product level. Where store level involves attributes like:- city, population density, store capacity, location, etc and the product level hypotheses involves attributes like:- brand, advertisement, promotional offer, etc.

Dataset

https://www.kaggle.com/datasets/brijbhushannanda1979/bigmart-sales-data/download?datasetVersionNumber=1

Dataset Details

The data has 8523 rows of 12 variables.

Variable - Details

  • Item_Identifier- Unique product ID
  • Item_Weight- Weight of product
  • Item_Fat_Content - Whether the product is low fat or not
  • Item_Visibility - The % of total display area of all products in a store allocated to the particular product
  • Item_Type - The category to which the product belongs
  • Item_MRP - Maximum Retail Price (list price) of the product
  • Outlet_Identifier - Unique store ID
  • Outlet_Establishment_Year- The year in which store was established
  • Outlet_Size - The size of the store in terms of ground area covered
  • Outlet_Location_Type- The type of city in which the store is located
  • Outlet_Type- Whether the outlet is just a grocery store or some sort of supermarket
  • Item_Outlet_Sales - Sales of the product in the particulat store. This is the outcome variable to be predicted.

Setup

1. Install jupyter notebook -

pip install jupyter notebook

2. Install Pycharm -

https://www.jetbrains.com/pycharm/download/download-thanks.html?platform=windows&code=PCC

3. Install python librarys -

pip install pandas, pip install numpy, pip install matplotlib, pip install klib, pip install seaborn, pip install Sklearn, pip install joblib, pip install xgboost pip install flask

Project Flow

We will handle this problem in a structured way.

  • Loading Packages and Data
  • Data Structure and Content
  • Exploratory Data Analysis
  • Missing Value Treatment
  • Feature Engineering
  • Encoding Categorical Variables
  • Label Encoding
  • PreProcessing Data
  • Modeling
  • Linear Regression
  • RandomForest Regressor
  • XGBoost
  • Deployment

Contributors

o  MAYUR KYATHAM

o  PRERNA SHAKWAR 

o  UTSAV KUNTALWAD

o  SRUSHTI SAWANT

big-mart-sales-prediction's People

Contributors

prerna-shakwar avatar project1a1b avatar srushtisawant15 avatar utsavreddy9 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.