data-science Tutorials

11. Missing Values in R

By: Karthik Janar : 2018-05-01

Description: Missing values play an important role in statistics and data analysis. Often, missing values must not be ignored, but rather they should be carefully studied to see if there's an underlying pattern or cause for their missingness.


12. Logical and Character Vectors in R

By: Karthik Janar : 2018-05-01

Description: The simplest and most common data structure in R is the vector. Vectors come in two different flavors: atomic vectors and lists. An atomic vector contains exactly one data type, whereas a list may contain multiple data types. Numeric vectors are one type of atomic vector. Other types of atomic vectors include logical, character, integer, and complex. In this tutorial, we'll take a closer look at logical and character vectors.


13. Generating Sequence numbers in R - seq(), rep() c() etc.

By: Karthik Janar : 2018-04-30

Description: In this tutorial, you'll learn how to create sequences of numbers in R using functions such as seq(), rep(), c() etc. The simplest way to create a sequence of numbers in R is by using the : operator


14. File handling commands in R

By: Karthik Janar : 2018-04-30

Description: In this tutorial, you'll learn how to examine your local workspace in R and begin to explore the relationship between your workspace and the file system of your machine. Because different operating systems have different conventions with regards to things like file paths, the outputs of these commands may vary across machines.


15. Discrete Vs Continuous Random Variables in R

By: Karthik Janar : 2018-04-30

Description: Random in statistics does not mean ‘haphazard' but it means a kind of order that emerges in the long run. For example we term unpredictable things that happen in our life as ‘random'. But we rarely see enough repetition of the same random phenomenon to observe a long term regulatrity that probability describes about 'random'ness.


16. Basic Building Blocks of R

By: Karthik Janar : 2018-04-13

Description: In this tutorial, we will explore some basic building blocks of the R programming language. To complete this tutorial, let us use R Studio to get our hands on experience. If you have not installed "R Studio" yet, download and install it. Open R Studio and just follow the instructions as below. In its simplest form, R can be used as an interactive calculator.


17. Overview and History of R

By: Karthik Janar : 2018-04-12

Description: In the world of data science, two popular programming languages are used today. R and Python. For a good comparison of R and Python read this tutorial. Personally I prefer R over Python because of its rich set of freely available packages and its visualization capabilities. R is based on an older language named S. S was originally developed at Bell Labs by John Chambers in 1976 as an internal statistical analysis environment using fortran libraries. In 1988 it was rewritten in C.


18. What are the different types of Analysis a Data Scientist would do?

By: William Alexander : 2018-04-12

Description: This tutorial discusses some basic concepts of Data Science. Data Scientist's Job always starts with a question that needs to be answered. There are few different kinds of questions a data scientist would ask which defines the goal of the analysis. It starts with descriptive, and then it goes to exploratory, inferential, predictive, causal, and mechanistic


19. Git and GitHub - a complete startup guide for a beginner

By: Karthik Janar : 2018-04-11

Description: For anyone who is serious about doing any work with computers needs to understand some basic concept of files and folders and some understanding of command line interface and commands. It is essential to have some hands on to use CLI. Furthermore if you are interested in doing programming or analytics then version control becomes very important. Git and GitHub provide very good toolset to create, maintain and manage your files and folders and their versions as well as share them with others.


20. Data Analytics - Which programming language to learn. R vs Python

By: Ashley J : 2017-09-24

Description: Often people ask which one is better to learn? R or Python. Python is better for for data manipulation and repeated tasks, while R is good for ad hoc analysis and exploring datasets. For example, take text analysis, where you want to deconstruct paragraphs into words or phrases and then identify patterns. In this use case R is better suited and makes it simple. On the other hand, take for example, pulling the data, to running automated analyses over and over, to producing visualizations like maps and charts from the results then Python is better suited.