This cheat sheet gives an overview of the most common statistics functions for spreadsheets in English and Dutch (LibreOffice Calc, Excel), and for the R programming language.
x
denotes a cell range (spreadsheet) or list/array/table (R).
Function | R | Spreadsheet (EN) | Spreadsheet (NL) |
---|---|---|---|
Mean, average | mean(x) | =AVERAGE(x) | =GEMIDDELDE(x) |
Population variance | – | =VAR.P(x) | =VAR.P(x) |
Population standard deviation | – | =STDEV.P(x) | =STDEV.P(x) |
Sample variance | var(x) | =VAR(x) , =VAR.S(x) | =VAR(x) , =VAR.S(x) |
Sample standard deviation | sd(x) | =STDEV(x) , =STDEV.S(x) | =STDEV(x) , =STDEV.S(x) |
Median | median(x) | =MEDIAN(x) | =MEDIAAN(x) |
Minimum | min(x) | =MIN(x) | =MIN(x) |
Maximum | max(x) | =MAX(x) | =MAX(x) |
Quartile | – | =QUARTILE(x, type) † | =KWARTIEL(x, type) † |
Percentile | quantile(x, alphas) ‡ | =PERCENTILE(x, alpha) ‡ | =PERCENTIEL(x, alpha) ‡ |
† type
: 0 = min, 1 = 25% (1st quartile) , 2 = 50% (median), 3 = 75% (3rd quartile), 1 = max
‡ alpha
is a number in [0, 1] denoting the percentile rank (0 = minimum, .5 = median, 1 = max). In R, you can specify an array of the desired percentiles, e.g. quantile(x, c(0, .33, .67, 1))
.
Data Transformation with data.table:: CHEAT SHEET Manipulate columns with j Functions for data.tables data.table is an extremely fast and memory efficient package for transforming data in R. It works by converting R’s native data frame objects into data.tables with new and enhanced functionality. The basics of working with data.tables are. R has 657 built in color names To see a list of names: colors These colors are displayed on P. R color cheatsheet Finding a good color scheme for presenting data can be challenging. This color cheatsheet will help! R uses hexadecimal to represent colors Hexadecimal is a base-16 number system used to describe color. R Language Tutorials for Advanced Statistics. The whole list of colors are displayed at your R console in the color function. Here are few of my suggestions for nice looking colors and backgrounds.
R Style Guide - This resource is more than a cheat sheet. Google's internal R user community put together this guide for clean R code that covers syntax & conventions that are unique to R. I include it here because I've refered to it quite a bit in my own work. Your code will be easy to read & maintain if you follow these guidelines. R Reference Card: A handy R cheat sheet covering many, many functions. RStudio Cheat Sheet: A cheatsheet for the RStudio environment: Data Import Cheat Sheet: A cheatsheet for the RStudio environment: R Markdown Cheat Sheet: A cheatsheet for writing documents with RMarkdown: RStudio Cheat Sheets: R Studio provides the above cheat sheets, as.
x
denotes the cell range (spreadsheet) or list/array/table (R) containing values of the independent variable.y
denotes the cell range (spreadsheet) or list/array/table (R) containing values of the dependent variable.
Function | R | Spreadsheet (EN) | Spreadsheet (NL) |
---|---|---|---|
Pearson’s correlation coefficient (R) | cor(x, y) | =PEARSON(y, x) | =PEARSON(y, x) |
Determination coefficient (R²) | =RSQ(y, x) | =R.KWADRAAT(y, x) | |
Covariance | cov(x, y) | =COVAR(x, y) | COVARIANTIE.S(x, y) |
Statistics For Beginners Pdf
X
is a normally distributed stochastic variable with meanm
and standard deviations
, orX ~ Nor(m, s)
.x
is a number drawn fromX
.P(X < x)
is the probability that a number is drawn fromX
smaller thanx
(left tail probability)
Z
is the standard normal distribution, orZ ~ Nor(0, 1)
.z
is a number drawn fromZ
.P(Z < z)
is the probability that a number is drawn fromZ
smaller thanz
(left tail probability)
Function | R | Spreadsheet (EN) | Spreadsheet (NL) |
---|---|---|---|
z-transformation | z <- (x - m)/s | =STANDARDIZE(x, m, s) | =NORMALISEREN(x, m, s) |
P(Z < z) | pnorm(z) | =NORMSDIST(z) | =STAND.NORM.VERD(z) |
P(X < x) | pnorm(x, m, s) | =NORMDIST(x, m, s) | =NORM.VERD(x, m, s) |
z so P(Z < z) = p | qnorm(p) | =NORM.S.INV(p) | =NORM.S.INV(p) |
x so P(X < x) = p | qnorm(p, m, s) | =NORMINV(p, m, s) | =NORM.INV.N(p, m, s) |
R Program Cheat Sheet
- Van Der Elst, J. (2012). Statistiek met Excel. Derde druk. Uitgeverij De Boeck.
Comments are closed.