Personal Blog
  • Home
  • Data Camp Courses
  • Shiny Apps
  • About
Categories
All (27)
analysis (1)
code (1)

Web Scraping in R

Given the following ordered list (using the ol element), which is stored as list_raw_html:
Jul 5, 2024

Introduction to Deep Learning with Keras

You’re going to build a simple neural network to get a feeling of how quickly it is to accomplish this in Keras.
Mar 4, 2024
Mburu

 

Introduction to the OpenAI API

Software applications, web browser experiences, and even whole products are being built on top of the OpenAI API. In this exercise, you’ll be able to explore an application…
Jan 23, 2024

 

Introduction to Shell

An operating system like Windows, Linux, or Mac OS is a special kind of program. It controls the computer’s processor, hard drive, and network connection, but its most…
Jan 19, 2024

 

Python Data Science Toolbox (Part 1)

In the video, you learned of another standard Python datatype, strings. Recall that these represent textual data. To assign the string ‘DataCamp’ to a variable company, you…
Oct 5, 2023

Data Manipulation with pandas

When you get a new DataFrame to work with, the first thing you need to do is explore it and see what it contains. There are several useful methods and attributes for this.
Sep 24, 2023

Intermediate Python

With matplotlib, you can create a bunch of different plots in Python. The most basic plot is the line plot. A general recipe is given here.
Sep 19, 2023

 

Introduction to Python

Python Basics
Sep 16, 2023
Mburu

Intermediate Regression in R

In Introduction to Regression in R, you learned to fit linear regression models with a single explanatory variable. In many cases, using only one explanatory variable limits…
Jun 6, 2023
Mburu

Survival Analysis R

In this course, we will frequently use the GBSG2 dataset. This dataset contains information on breast cancer patients and their survival. In this exercise, we will take a…
May 13, 2023

 

Foundations of Probability in R

In these exercises, you’ll practice using the rbinom() function, which generates random “flips” that are either 1 (“heads”) or 0 (“tails”).
Feb 5, 2023

Introduction to Statistics with R

In this chapter, you’ll be working with the 2018 Food Carbon Footprint Index from nu3. The food_consumption dataset contains information about the kilograms of food consumed…
Jan 17, 2023

Functions in R

One way to make your code more readable is to be careful about the order you pass arguments when you call functions, and whether you pass the arguments by position or by name.
Dec 10, 2022

Missing mechanisms examples

Handling Missing Data with Imputations in R

Missing data is a common problem and dealing with it appropriately is extremely important. Ignoring the missing data points or filling them incorrectly may cause the models…
Dec 3, 2022

 

Anomaly Detection in R

In this exercise, you’ll explore the river dataset which will be used throughout this chapter to illustrate the use of common anomaly detection techniques. The river data is…
Nov 3, 2022
Mburu

Sampling in R

Sampling is an important technique in your statistical arsenal. It isn’t always appropriate though—you need to know when to use it and when to work with the whole dataset.
Sep 9, 2022

degit 3.

Introduction to advanced dimensionality reduction

code
analysis
You will use the MNIST dataset in several exercises through the course. Let’s do some data exploration to gain a better understanding. Remember that the MNIST dataset…
Apr 8, 2022
Mburu

Census data in r with tidycensus

tidycensus is an R package designed to return data from the US Census Bureau ready for use within the Tidyverse.
Feb 23, 2022

Modeling with tidymodels in R

The rsample package is designed to create training and test datasets. Creating a test dataset is important for estimating how a trained model will likely perform on new…
Dec 10, 2021

Communicating with Data in the Tidyverse

In the video, you have learned that the inner_join() function of dplyr needs to be given a “key” on which two data frames are joined. Actually, multiple keys that need to…
Nov 8, 2021
Mburu

The reduction in weekly working hours in Europe

Looking at the development between 1996 and 2006
The International Labour Organization (ILO) has many data sets on working conditions. For example, one can look at how weekly working hours have been decreasing in many…
Nov 8, 2021
Mburu

Writing Efficient R Code

One of the relatively easy optimizations available is to use an up-to-date version of R. In general, R is very conservative, so upgrading doesn’t break existing code.…
Aug 13, 2021
Mburu

Scalable Data Processing in R

If you are processing all elements of two data sets, and one data set is bigger, then the bigger data set will take longer to process. However, it’s important to realize…
Apr 13, 2021
Mburu

Multiple Linear and Logistic Regression in R

We use the lm() function to fit linear models to data. In this case, we want to understand how the price of MarioKart games sold at auction varies as a function of not only…
Apr 13, 2021
Mburu

Network analysis in R

Here you will learn how to create an igraph ‘object’ from data stored in an edgelist. The data are friendships in a group of students. You will also learn how to make a…
Jun 19, 2020
Mburu

String manipulation with stringr in r

Let’s get started by entering some strings in R. In t he video you saw that you use quotes to tell R to interpret something as a string. Both double quotes (“) and single…
May 31, 2020
Mburu

ML with tree based models in r

Let’s get started and build our first classification tree. A classification tree is a decision tree that performs a classification (vs regression) task. You will train a…
Apr 4, 2020
Mburu
No matching items