Ecommerce product categorization

Introduction

Product categorization is the task of classifying products as belonging to one or more categories from a given taxonomy.It helps customers navigate an ecommerce store with ease. It deals with organizing our ecommerce products into categories and tags that give us a system to get customers to the exact product they are looking for quicker. This includes creating categories, tags, attributes and more to create a hierarchy for similar products. It is a field of study within natural language processing (NLP).

Data

This project deals with a huge E-commerce text dataset for 4 categories - "Electronics", "Household", "Books" and "Clothing & Accessories".

Procedure

Basic NLP steps for categorizing the E-commerce dataset include:-

Data preprocessing
Data Visualisation ( exploratory data analysis )
Text Preprocessing such as tokenization,stemming,lemmatization,stopword removal,POS tagging
Model Building using ML Classifiers such as Multinomial Bayesian Classifier,SVM,Decison Tree,Random forest,Logistic regression.
Testing
Prediction

Results

We use TF-IDF vectorizer on the normalized product descriptions for text vectorization on Multinomial Bayes Classifier , and perform hyperparameter tuning . The tuned model obtains a validation accuracy of 0.9203. We employ the tuned model with the highest validation accuracy(Support Vector Machine) to predict the labels of the test observations and obtained a test accuracy of 0.96. We present the confusion matrix depicting the test set performance of the model.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
E commerce Search engine and recommendation system (1).ipynb		E commerce Search engine and recommendation system (1).ipynb
Ecommerce_product_categorization.ipynb		Ecommerce_product_categorization.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Ecommerce product categorization

Introduction

Data

Procedure

Results

About

Uh oh!

Releases

Packages

Uh oh!

Languages

pranavvb03/ecommerce-product-categorization

Folders and files

Latest commit

History

Repository files navigation

Ecommerce product categorization

Introduction

Data

Procedure

Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages