Programming Tutorials

Logical and Character Vectors in R

By: Karthik Janar in data-science Tutorials on 2018-05-01  

The simplest and most common data structure in R is the vector. Vectors come in two different flavors: atomic vectors and lists. An atomic vector contains exactly one data type, whereas a list may contain multiple data types. Numeric vectors are one type of atomic vector. Other types of atomic vectors include logical, character, integer, and complex. In this tutorial, we"ll take a closer look at logical and character vectors.

Logical vectors can contain the values TRUE, FALSE, and NA (for 'not available"). These values are generated as the result of logical 'conditions". Let's experiment with some simple conditions.

First, create a numeric vector num_vect that contains the values 0.5, 55, -10, and 6.

num_vect <- c(0.5,55,-10,6)

Now, create a variable called tf that gets the result of num_vect < 1, which is read as 'num_vect is less than 1".

tf <- num_vect < 1

This results in a vector of 4 logical values. Print the contents of tf now.

tf
## [1]  TRUE FALSE  TRUE FALSE

The statement num_vect < 1 is a condition and tf tells us whether each corresponding element of our numeric vector num_vect satisfies this condition.

The first element of num_vect is 0.5, which is less than 1 and therefore the statement 0.5 < 1 is TRUE. The second element of num_vect is 55, which is greater than 1, so the statement 55 < 1 is FALSE. The same logic applies for the third and fourth elements.

Let's try another. Type num_vect >= 6 without assigning the result to a new variable.

num_vect >= 6
## [1] FALSE  TRUE FALSE  TRUE

This time, we are asking whether each individual element of num_vect is greater than OR equal to 6. Since only 55 and 6 are greater than or equal to 6, the second and fourth elements of the result are TRUE and the first and third elements are FALSE.

The < and >= symbols in these examples are called 'logical operators". Other logical operators include >, <=, == for exact equality, and != for inequality.

If we have two logical expressions, A and B, we can ask whether at least one is TRUE with A | B (logical 'or" a.k.a. 'union") or whether they are both TRUE with A & B (logical 'and" a.k.a. 'intersection"). Lastly, !A is the negation of A and is TRUE when A is FALSE and vice versa.

It's a good idea to spend some time playing around with various combinations of these logical operators until you get comfortable with their use. We"ll do a few examples here to get you started.

(3 > 5) & (4 == 4)
## [1] FALSE
(TRUE == TRUE) | (TRUE == FALSE)
## [1] TRUE
((111 >= 111) | !(TRUE)) & ((4 + 1) == 5)
## [1] TRUE

Working with logical statements in R takes practice.

Character vectors are also very common in R. Double quotes are used to distinguish character objects, as in the following example.

Create a character vector that contains the following words: "My", "name", "is". Remember to enclose each word in its own set of double quotes, so that R knows they are character strings. Store the vector in a variable called my_char and print its contents

my_char <- c("My", "name", "is")
my_char
## [1] "My"   "name" "is"

Right now, my_char is a character vector of length 3. Let's say we want to join the elements of my_char together into one continuous character string (i.e.  a character vector of length 1). We can do this using the paste() function.

paste(my_char, collapse = " ")
## [1] "My name is"

The collapse argument to the paste() function tells R that when we join together the elements of the my_char character vector, we"d like to separate them with single spaces.

To add (or 'concatenate") your name to the end of my_char, use the c() function like this: c(my_char, "your_name_here"). Store the result in a new variable called my_name.

my_name <- c(my_char, "Karthik")
my_name
## [1] "My"      "name"    "is"      "Karthik"

Now, use the paste() function once more to join the words in my_name together into a single character string using collapse = " "!

paste(my_name, collapse = " ")
## [1] "My name is Karthik"

In this example, we used the paste() function to collapse the elements of a single character vector. paste() can also be used to join the elements of multiple character vectors.

In the simplest case, we can join two character vectors that are each of length 1 (i.e.  join two words). Try paste("Hello", "world!", sep = " "), where the sep argument tells R that we want to separate the joined elements with a single space.

paste("Hello", "World!", sep = " ")
## [1] "Hello World!"

For a slightly more complicated example, we can join two vectors, each of length 3. Use paste() to join the integer vector 1:3 with the character vector c("X", "Y", "Z"). This time, use sep = "" to leave no space between the joined elements.

paste(c(1:3), c("X", "Y", "Z"), sep = "")
## [1] "1X" "2Y" "3Z"

What do you think will happen if our vectors are of different length? Vector recycling! Try paste(LETTERS, 1:4, sep = "-"), where LETTERS is a predefined variable in R containing a character vector of all 26 letters in the English alphabet.

paste(LETTERS, 1:4, sep = "-")
##  [1] "A-1" "B-2" "C-3" "D-4" "E-1" "F-2" "G-3" "H-4" "I-1" "J-2" "K-3"
## [12] "L-4" "M-1" "N-2" "O-3" "P-4" "Q-1" "R-2" "S-3" "T-4" "U-1" "V-2"
## [23] "W-3" "X-4" "Y-1" "Z-2"

Since the character vector LETTERS is longer than the numeric vector 1:4, R simply recycles, or repeats, 1:4 until it matches the length of LETTERS. Also worth noting is that the numeric vector 1:4 gets 'coerced" into a character vector by the paste() function. Coercion here means that the numbers 1, 2, 3, and 4 in the output above are no longer numbers to R, but rather characters "1", "2", "3", and "4".






Add Comment

* Required information
1000

Comments

No comments yet. Be the first!

Most Viewed Articles (in data-science )

Latest Articles (in data-science)