python

Cross-Validation in scikit-learn

Cross-validation is a statistical method used in Machine Learning for estimating the performance of models. It is very important to…

4 years ago

Feature Scaling: MinMax, Standard and Robust Scaler

Feature Scaling is performed during the Data Preprocessing step. Also known as normalization, it is a method that is used…

4 years ago

Outlier Detection using Isolation Forests

For a dataset, an outlier is a data point that behaves differently from the other data points. Outliers cause huge…

4 years ago

Discovering Hidden Themes of Documents

Latent Semantic Analysis using Python Discovering topics are very useful for various purposes such as for clustering documents, organizing online…

4 years ago

Predicting Customer Lifetime Value in Python

Learn how to calculate Customer Life Time Value in Python. Italian economist Vilfredo Pareto states that 80% of the effect…

4 years ago

Introduction to Customer Segmentation in Python

In this tutorial, you’re going to learn how to implement customer segmentation using RFM(Recency, Frequency, Monetary) analysis from scratch in…

4 years ago

Introduction to Factor Analysis in Python

In this tutorial, you’ll learn the basics of factor analysis and how to implement it in Python. Factor Analysis (FA)…

4 years ago

Dimensionality Reduction using tSNE

tSNE stands for t-distributed Stochastic Neighbor Embedding. It is a dimensionality reduction technique and is extremely useful for visualizing datasets…

4 years ago

Dimensionality Reduction using PCA

Dimensionality refers to the number of input variables (or features) of the dataset. Data with a large number of features…

4 years ago

Evaluating Clustering Methods

Predicting optimal clusters is of utmost importance in Cluster Analysis. For a given data, we need to evaluate which Clustering…

4 years ago