START GUIDE

P.S. I made this banner

The previous article was about finding the best-performing machine learning algorithm for the given dataset.

These techniques are often the first step after exploratory data analysis to check whether the input features in a given dataset have enough predictive power. They are also an efficient way to explore…


START GUIDE

The human brain responds better to, and retains more information from, simple diagrams or visual content than text or numbers. Representing a complex dataset graphically is therefore an effective way to derive crucial insights and learn more about the data. Furthermore, the popularity of data visualization techniques can…


START GUIDE

Classification algorithms are machine learning techniques that categorize data into classes. Classification is a kind of supervised machine learning, in which algorithms learn from labeled data. Because the algorithms learn from labeled data, the distribution of classes plays an important role. For example, training algorithms on…


START GUIDE

Imagine a situation where you want to test whether a given dataset has sufficient features to train machine learning algorithms, or to compare different algorithms’ performance on that dataset. Both cases are common in data science.

Usually, to test the features, one can train models…


START GUIDE

Data scientists widely use EDA to understand datasets for decision-making and data cleaning. EDA reveals crucial information about the data, such as hidden patterns, outliers, variance, covariance, and correlations between features. This information is essential for designing hypotheses and creating better-performing models.


START GUIDE

This article shows how to create fantastic art using artificial neural networks.

A convolutional neural network may contain several stacked layers. Images fed as input to the network travel through successive layers, and the final decision is made by the output layer. However, several questions remain, such as

  • How…


PYTHON DATA ANALYSIS LIBRARY: PANDAS

Pandas is a Python data analysis library that has cemented its place in the data science world. Articles on the internet about the top Python libraries for data science include Pandas as one of their favorites. The Pandas library offers several functions that can speed up data wrangling and exploratory data analysis…


MACHINE LEARNING MODEL DEPLOYMENT

Spending a considerable amount of time optimizing the ML model is one of the most common misconceptions and pitfalls behind unsuccessful ML projects. Teams with successful ML projects instead invest time in gathering data, building efficient data pipelines to avoid training-serving skew, and building reliable model-serving infrastructure. …


A/B Testing

Studies conducted by big companies have shown that changing even a minor feature, such as the response time by a few milliseconds, the color of a button, the welcome image, or the fonts, can significantly affect website traffic. A relatable example is posting a picture on social media. Why specific…


STOCK PRICE FORECASTING MODELS

Forecasting stock prices is an exciting topic, and the number of articles published on the internet shows its popularity. However, many of them suffer from a fundamental error. This article describes some common pitfalls to avoid when creating a multi-step prediction model for stock prices.

Pitfall 1: Shuffling time-series data

Time-series…

Rahul Pandey

Google Certified ML Engineer | Exploring possibilities of ML in Photovoltaics
