Open in app

Sign In

Write

Sign In

Joan Ngugi
Joan Ngugi

196 Followers

Home

About

Published in Analytics Vidhya

·Pinned

Deploy Machine Learning Web Applications with Streamlit

As a data scientist or machine learning engineer, it is not enough to leave your machine learning models in your notebooks. You will want your end-users to seamlessly interact with your model in a creative and easy way. Streamlit is a Python-based library that allows the creation and deployment of…

Streamlit

7 min read

Deploy Machine Learning Web Applications with Streamlit
Deploy Machine Learning Web Applications with Streamlit
Streamlit

7 min read


Published in MLearning.ai

·Jul 28, 2022

P-values in statistics simplified and the DataScience application.

Pre-Requisite — Hypothesis and Hypothesis Test A hypothesis is an assumption about an expected association. Example: Increasing apple fruit consumption will result in a decreased frequency of visits to a doctor. Businesses that give customers loyalty points have more customer loyalty than businesses that don’t. In the research or statistics world, your objective is to determine…

Data Science

7 min read

P-values in statistics simplified and the DataScience application.
P-values in statistics simplified and the DataScience application.
Data Science

7 min read


Published in MLearning.ai

·Jun 10, 2022

Evaluating classification models simplified.

If you cannot measure it, you cannot improve it. ~ Lord Kelvin Any project that someone does can either be called a success or not. The machine learning world is no different either and we need a way to know when to stop and say our model is a success. Introduction

Classification Models

8 min read

Evaluating classification models simplified.
Evaluating classification models simplified.
Classification Models

8 min read


Published in MLearning.ai

·May 30, 2022

Principal Component Analysis(PCA) Simplified

Link to the code: Github Problem Statement Imagine you had a dataset with 1000 features. To visualize all these features and to try to explain the relationships between these features would be a nightmare. Moreover, your model runs the risk of Overfitting. …

Pca

8 min read

Principal Component Analysis(PCA) Simplified
Principal Component Analysis(PCA) Simplified
Pca

8 min read


Published in MLearning.ai

·May 21, 2022

Handling Missing Values — Data Science

DataSet and Notebook used in this article can be found here: Complete Notebook Link : Handling Missing Values DataSet Link: Melbourne Housing Dataset Introduction A perfect data set is usually a big win for any data scientist or machine learning engineer. Unfortunately, more often than not, datasets to be used to solve different data science use-cases will have missing data. …

Data Science

5 min read

Handling Missing Values — Data Science
Handling Missing Values — Data Science
Data Science

5 min read


Published in Analytics Vidhya

·May 17, 2022

Understanding Naive Bayes Algorithm

Naive Bayes is a machine learning model used for classification. The Naive Bayes technique is based on probabilities. The probability of an event happening or not happening can be calculated using historical data. Let’s Dissect the meaning of Naive Bayes. It is called Naive because it is based on the Naive Assumption that each input variable is…

Naive Bayes

6 min read

Understanding Naive Bayes Algorithm
Understanding Naive Bayes Algorithm
Naive Bayes

6 min read


Published in MLearning.ai

·Apr 30, 2022

Understanding a Linear Regression Algorithm with Example.

DataSet and Notebook used in this article can be found here: Complete Notebook Link: Multiple Linear Regression Model DataSet Link: Ecommerce Customers Let’s start on Linear Regression with a few scenarios: Finance companies predicting the top factors that cause a customer to default on a loan. Sports companies analyzing which variations of training have an effect on player performance. Factors affecting the economic growth of a country.

Linear Regression

10 min read

Understanding a Linear Regression Algorithm with Example.
Understanding a Linear Regression Algorithm with Example.
Linear Regression

10 min read


Apr 21, 2022

Understanding AWS Glue for ETL

In the big data world, the biggest problem for many companies might be getting insights from data before it’s outdated. If you need to process the different types of data with speed, and efficiency to get the best value from the data, then you need to leverage ETL tools and…

Aws Glue

6 min read

Understanding AWS Glue for ETL
Understanding AWS Glue for ETL
Aws Glue

6 min read


Feb 11, 2021

Story Telling With Data — Data Visualization

At the time of writing this article Google handles an average of 3.8 million searches per minute across the globe. That translates to 5.6 billion searches per day, or 2 trillion searches per year! When we look at Twitter there are 500 million tweets per day and around 200 billion…

Data Visualization

6 min read

Story Telling With Data — Data Visualization
Story Telling With Data — Data Visualization
Data Visualization

6 min read


Published in Analytics Vidhya

·Feb 9, 2021

Deploy Machine Learning Model using Flask to Heroku — Beginners(Part 3)

This tutorial is part 3 and the final part of the guide on how to deploy Machine Learn Models using Flask to Heroku via GitHub. The first part was a refresher on a logistic regression model on JupyterNotebook. The second part was on how to structure your flask application and…

Heroku

3 min read

Deploy Machine Learning Model using Flask to Heroku — Beginners(Part 3)
Deploy Machine Learning Model using Flask to Heroku — Beginners(Part 3)
Heroku

3 min read

Joan Ngugi

Joan Ngugi

196 Followers

Big Data & Analytics, Data Science, Machine Learning, Data Engineering | ngugijoan.com

Following
  • Tim Denning

    Tim Denning

  • Feng Li

    Feng Li

  • Dariusz Gross #DATAsculptor

    Dariusz Gross #DATAsculptor

  • Darius Foroux

    Darius Foroux

  • Amy @GrabNGoInfo

    Amy @GrabNGoInfo

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech