#2 Data Classes (CloudStat)

[This article was first published on CloudStat, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

As stated in CloudStat Intro, we know that CloudStat is based on R Language, an object orientated language, everything in R is an object. Each object has a class. The simplest data objects are one-dimensional arrays called vectors, consisting of any number of elements. For example, the calculation:

Input:

1+1

Output

> 1+1

[1] 2

results in a vector, from the numeric class (as it contains a number), with just one element. Note that the command “1+1” is itself and object of the expression class

The simplest elements produce vectors of the following classes:

  • logical: The values T (or TRUE) and F (or FALSE).
  • integer: Integer values such as 2 or -5.
  • numeric: Floating-point real numbers (double-precision by default).
    Numerical values can be written as
    whole numbers (for example, 2, -5),
    decimal fractions (2.38, -23.125), or in
    scientific notation (2.38e57, 23e-98).
  • complex: Complex numbers of the form a + bi, where a and b are integers or numeric (for example, 5 + 4.67i).
  • character: character strings enclosed by matching double quotes (“) or apostrophes ( ’), for example, “CloudStat”, ’data analysis’.

Example: #2 Data Classes Example

> 1+1

[1] 2

> 

> T; F #Logical

[1] TRUE

[1] FALSE

> 

> 2; -5 #Integer

[1] 2

[1] -5

> 2; -5 #Whole number

[1] 2

[1] -5

> 2.38; -23.125 #Decimal fractions

[1] 2.38

[1] -23.125

> 2.38e57; 23e-98 #Scientific notation

[1] 2.38e+57

[1] 2.3e-97

> 5 + 4.67i #Complex number

[1] 5+4.67i

> "CloudStat"; "data analysis" #Character

[1] "CloudStat"

[1] "data analysis"

> 

    Two other elements which are particularly useful are:

    • factors: These represent labelled observations. For example sex is a factor, generally incorporating two levels: male and female. These are generally used to represent qualitative effects in models.
    • ordered factors: A factor where the levels are ordered. For example there may be three responses to a question about quality, high, medium or low, but each level is not necessarily on a linear scale.

    #2 Data Classes Example

    Source: An Introduction to R: Examples for Actuaries by Nigel De Silva

      To leave a comment for the author, please follow the link and comment on their blog: CloudStat.

      R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
      Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

      Never miss an update!
      Subscribe to R-bloggers to receive
      e-mails with the latest R posts.
      (You will not see this message again.)

      Click here to close (This popup will not appear again)