Package 'adventr' reference manual

Title:	Interactive R Tutorials to Accompany Field (2016), "An Adventure in Statistics"
Description:	Interactive 'R' tutorials written using 'learnr' for Field (2016), "An Adventure in Statistics", <ISBN:9781446210451>. Topics include general workflow in 'R' and 'Rstudio', the 'R' environment and 'tidyverse', summarizing data, model fitting, central tendency, visualising data using 'ggplot2', inferential statistics and robust estimation, hypothesis testing, the general linear model, comparing means, repeated measures designs, factorial designs, multilevel models, growth models, and generalized linear models (logistic regression).
Authors:	Andy Field [aut, cre]
Maintainer:	Andy Field <[email protected]>
License:	GPL-3
Version:	0.1.10
Built:	2025-03-05 04:35:57 UTC
Source:	https://github.com/profandyfield/adventr

adventr: tutorials in R for An Adventure in Statistics (Field, 2016).

Description

The adventr package contains a series of tutorials that accompany my books An Adventure in Statistics and Discovering statistics using R. These tutorials contain abridged sections from my books so there are some copyright considerations but I offer them under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. (Basically you can use them for teaching and non-profit activities but not meddle with them or claim them as your own work.)

Who is the package aimed at?

Anyone teaching from or reading An Adventure in Statistics or Discovering statistics using R might find them useful.

What is covered?

The tutorials do not teach the background theory (the books do): it is assumed you have either attended my lecture or read the relevant chapter in the aforementioned books (or someone else's) The aim of this tutorial is to augment the theory that you already know by guiding you through fitting linear models using R and RStudio and asking you questions to test your knowledge along the way.

Running a tutorial

To run a tutorial execute:

learnr::run_tutorial("name_of_tutorial", package = "adventr")

replacing name_of_tutorial with the name of the tutorial (see below). For example, to run the tutorial on data basics execute:

learnr::run_tutorial("adventr_02", package = "adventr")

A list of tutorials

adventr_02: Data basics in R and RStudio. This tutorial looks at the basic concepts in R, such as installing packages, setting the working directory, getting help, getting data into R, creating variables, creating data frames and tibbles, reading in data files, manipulating variables within a tibble/data frame.
adventr_03: Summarizing data (introducing ggplot2)
adventr_04: Fitting models (central tendency)
adventr_05: Presenting data (summarizing groups and more ggplot2)
adventr_08: Inferential statistics and robust estimation (covers Chapter 9 too)
adventr_11 Hypothesis testing
adventr_14: The general linear model
adventr_15: Comparing two means
adventr_15_rm: Comparing two means (repeated measures)
adventr_16: Comparing several means
adventr_16_rm: Comparing several means (repeated measures)
adventr_17: Factorial designs
adventr_mlm: Multilevel models (not covered in the book)
adventr_growth: Growth models (not covered in the book)
adventr_log: Logistic regression (not covered in the book)

References

Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.
Field, A. P., Miles, J. N. V., & Field, Z. C. (2012). Discovering statistics using R: And sex and drugs and rock 'n' roll. London: Sage.

Alice's gene data

Description

A dataset relating to the following fictitious experiment: Participants who had facial burns were recruited. One-week pre-test, a gene C-10XFMG (the so-called chameleon gene) was introduced into all participants.Two days prior to test, a genetic toggle switch was introduced into half of the participants. Half of the participants were asked to look at a photograph of themselves from before their injury and to imagine the cells in their faces changing to become like the photo. The remainder acted as a control and were asked to look at a picture of a same-sex stranger and to try to change their face to become the person in the picture. All participants looked at the picture for 6 sessions of 20 minutes each. At the end of the sessions their faces were scanned into a computer and compared to the face in the photograph. Facial recognition software produced a precise resemblance measure as a percentage (100 in the photograph, 0 The data are taken from Chapter 17 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

alice_dat
alice_dat

Format

A tibble with 32 rows and 4 variables:

id: Participant ID
picture: A factor describing whether a participant tried to resemble a picture of themselves (Self), or a matched stranger (Other)
gene: A factor describing whether a participant received the toggle switch (C-gene + Toggle) or not (C-gene)
resemblance: How closely their face resembled the picture (100% = the participants face is exactly like the face in the photograph, 0% the person in the photo bears no resemblance at all to the participant)

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Calcite mask data

Description

A dataset relating to the following fictitious experiment, which is an extension of the invisibility paste data described elsewhere: "... we asked whether we could manipulate the face perceived by the participant when viewing someone wearing calcite invisibility paste. Participants were all so-called Chippers, who had ID chips installed in their brains. Each participant met 40 people who had calcite paste on their face. For each of these people we took a photo of their face (the 'actual' face), and found a photo of a different face matched for sex and age (the 'different' face). We could send these images to the participant's visual cortex using their ID chip. For 20 of the encounters an image of the person's 'actual' face was transmitted to the participant's ID chip during the encounter, for the remaining 20 encounters the 'different' face was sent to the chip. During each encounter the participant was shown the photos of the actual and different face and pointed to the one depicting whom they thought they had just met. As in Experiment 1, participants scored a point for each person they correctly identified: a score of 0 would mean that they never chose the correct photo, a score of 20 would mean that they always chose the correct photo." The data are taken from Chapter 15 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

calcite_dat
calcite_dat

Format

A tibble with 20 rows and 3 variables:

id: Participant ID
actual: How many of the 20 people the participant correctly identified when a picture of the 'actual' person was transmitted to the participant's ID chip
different: How many of the 20 people the participant correctly identified when a picture of a 'different' person was transmitted to the participant's ID chip

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Zombie garlic data

Description

A dataset relating to whether zombies can be repelled by garlic. It is taken from Chapter 14 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

garlic_dat
garlic_dat

Format

A tibble with 10 rows and 3 variables:

id: Victim ID
latency: The time taken for a zombie to approach the victim in seconds
garlic: The number of bulbs of garlic worn by the victim

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Subset of Ha et al. (2010) data

Description

A dataset containing a random sample of 20 female participants from Ha, T., Overbeek, G., & Engels, R. C. M. E. (2010). Effects of Attractiveness and Social Status on Dating Desire in Heterosexual Adolescents: An Experimental Study. Archives of Sexual Behavior, 39(5), 1063-1071. doi: 10.1007/s10508-009-9561-z. It is usedin Chapter 3 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

ha_dat
ha_dat

Format

A tibble with 20 rows and 13 variables:

id: Participant ID
age: Age of the participant, years
Sex: Biological sex of the individual
hi_salary: Rating (out of 10) of the importance of a high salary in a potential romantic partner
fin_ed: Rating (out of 10) of the importance of a potential romantic partner having finished their education
kind: Rating (out of 10) of the importance of a potential romantic partner being kind
humour: Rating (out of 10) of the importance of a potential romantic partner having a sense of humour
ambitious: Rating (out of 10) of the importance of a potential romantic partner being ambitious
wants_child: Rating (out of 10) of the importance of a potential romantic partner wanting children
romantic: Rating (out of 10) of the importance of a potential romantic partner being romantic
attractive: Rating (out of 10) of the importance of a potential romantic partner being physically attractive
creativity: Rating (out of 10) of the importance of a potential romantic partner being creative
honest: Rating (out of 10) of the importance of a potential romantic partner being honest

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Memory implanting data

Description

A dataset relating to the following fictitious experiment: "We wanted to see whether our failure to replace memories was due to the type of conflicting information used. Five participants engaged in three different scripted encounters with strangers during which 10 unusual things happened. After each encounter participants were asked to privately recall what had happened. During the period of recall for one encounter they were left alone (control), during the recall of another their ID chip was used to implant a verbal description of the encounter in which the 10 unusual events were replaced with different events, and during the recall of the final encounter their ID chip was used to implant visual images of 10 different things to what they actually experienced. Each participant, therefore, experienced three encounters and a different type of recall manipulation followed each one; the type of manipulation associated with a particular encounter was counterbalanced. Finally, participants recalled the event for a second time and we measured how many of the 10 unusual events that they originally experienced that they could recall." The data are taken from Chapter 16 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

implant_dat
implant_dat

Format

A tibble with 5 rows and 4 variables:

id: Participant ID
no_interference: How many of the 10 unusual events were remembered when there was no interference
verbal: How many of the 10 unusual events were remembered when verbal information was used as interference
visual: How many of the 10 unusual events were remembered when visual images were used as interference

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

JIG:SAW employee data.

Description

A dataset containing information about the characteristics of different groups of employees within the corporation called JIG:SAW and non-employees. It is taken from Chapter 5 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

jig_dat
jig_dat

Format

A tibble with 240 rows and 7 variables:

id: employee ID
employee: whether or not the employee works for JIG:SAW
job_type: Categories of employee
footspeed: Footspeed of the individual, miles per hour
strength: Maximal push force of the individual, Newtons
vision: Visual accuity
sex: Biological sex of the individual

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Invisibility paste data

Description

A dataset relating to the following fictitious experiment: "Twenty-eight participants were tested individually. Each participant sat on a revolving chair in the middle of a circle of 20 people. The 20 people all wore a light deflecting aid on their face. For half of the participants (N = 14) the 20 people wore a standard calcite-based mask, whereas the other half (N = 14) saw people wearing 'invisibility paste', a transparent cream containing calcite micro-crystals. Participants looked at each person and were given two photographs, one depicting the person they were looking at and one depicting a different person matched for age and sex. Participants had to decide which person they were looking at, before turning to the next person and being given two different photographs. Participants scored a point for each person they identified correctly: a score of 0 would mean they always chose the incorrect photo, a score of 20 would mean that they always chose the correct photo." The data are taken from Chapter 15 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

mask_dat
mask_dat

Format

A tibble with 28 rows and 3 variables:

id: Participant ID
mask: Whether the person was assigned to the calcite mask condition or the invisibility paste condition
recognition: How many of the 20 people they correctly identified

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Memory erasing data with covariate

Description

A dataset relating to the following fictitious experiment from Chapter 16 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage: "Thirty-six participants with ID chips were tested individually. All participants met a stranger who had a 5-minute scripted conversation with them containing 10 critical pieces of information. A week later, the participants were asked to recall the encounter for 5 minutes, and then after a 10-minute break wrote everything that they could remember about the original encounter. During the 5-minute initial recall, one of three things happened: the control group (N = 12) received no intervention; the erase group (N = 12) had a pulse of electricity sent to their brain via their ID chip; and the replace group (N = 12) had conflicting verbal descriptions sent to their ID chip. The outcome was how many of the 10 critical pieces of information the participants wrote down after the recall phase."

Usage

mem_cov_dat
mem_cov_dat

Format

A tibble with 36 rows and 7 variables:

id: Participant ID
recall: How many of the 10 critical pieces of information the participant identified
group: To which of the no intervention, memory erase or memory replace conditions the participant was assigned
working_mem: Variable that measures working memory capacity as the individual's digit span

Details

This version of the data includes an additional variable that measured working memory in each participant (as their digit span).

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Memory erasing data

Description

A dataset relating to the following fictitious experiment: "Thirty-six participants with ID chips were tested individually. All participants met a stranger who had a 5-minute scripted conversation with them containing 10 critical pieces of information. A week later, the participants were asked to recall the encounter for 5 minutes, and then after a 10-minute break wrote everything that they could remember about the original encounter. During the 5-minute initial recall, one of three things happened: the control group (N = 12) received no intervention; the erase group (N = 12) had a pulse of electricity sent to their brain via their ID chip; and the replace group (N = 12) had conflicting verbal descriptions sent to their ID chip. The outcome was how many of the 10 critical pieces of information the participants wrote down after the recall phase." The data are taken from Chapter 16 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

memory_dat
memory_dat

Format

A tibble with 36 rows and 7 variables:

id: Participant ID
group: To which of the no intervention, memory erase or memory replace conditions the participant was assigned
recall: How many of the 10 critical pieces of information the participant identified
erase_dummy: Dummy variable that compares the erase condition to no intervention
replace_dummy: Dummy variable that compares the replace condition to no intervention
contrast_1: Variable that uses contrast codes to compare the memory conditions (combined) to no intervention
contrast_2: Variable that uses contrast codes to compare the erase condition to the replace condition

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Alice's RAS scores

Description

A dataset containing Alice's ratings on the Relationship Assessment Scale (RAS) over 10 weeks. It is taken from Chapter 4 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

ras_dat
ras_dat

Format

A tibble with 10 rows and 2 variables:

week: The week in which the rating was given
rating: Alice's score on the relaationship assessment scale

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Zombie recovery data

Description

A dataset not from Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage, but continuing the theme of the story. At the end of the book it is revealed that Alice used her C-gene therapy to restore the code 1318 workers to a human state. This dataset relates to her third attempt. It contains data from 400 code 1318 workers. Workers were randomly assigned to two arms of a trial (C-gene therapy and C-gene therapy with a toggle switch). They were also given one of five different doses of C-gene: 0 (no dose) or 1, 2, 3, or 4 doses. The outcome was whether (or not) the worker had fully recovered after the treatment.

Usage

recovery_dat
recovery_dat

Format

A tibble with 400 rows and 4 variables:

id: Participant ID
treatment: character vector that codes which arm of the trial the participant was randomized to (no toggle switch or toggle switch)
dose: The dose of C-gene used during treatment (0 doses, 1, 2, 3, or 4 doses)
recovered: Character vector that expresses whether or not the participant had recovered following treatment (Recovered or Not Recovered)

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Zombie rehabilitation data

Description

A dataset not from Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage, but continuing the theme of the story. At the end of the book it is revealed that Alice used her C-gene therapy to restore the code 1318 workers to a human state. This dataset relates to her first attempt. It contains data from 190 code 1318 workers treated at 10 different clinics. Workers were randomly assigned to two arms of the trial (wait list vs.C-gene therapy) and the outcome was how much they resembled their pre-zombie state (as a percentage). The zombification occurred as a result of these workers entering JIG:SAWs genetic enhancement programme (GEP), and additional variables quantify how long it has been since they entered this programme (i.e., how long have they been a zombie) and whether they were originally exposed to the low- or high-intensity GEP.

Usage

rehab_dat
rehab_dat

Format

A tibble with 190 rows and 6 variables:

id: Participant ID
id_clin: Factor that codes which of 10 clinics the participant attended
intervention: Factor that codes which arm of the trial the participant was randomized to (0 = wait list, 1 = gene therapy)
resemblance: How closely their face resembled their pre-zombified state (100% = the participants face is exactly like their original face, 0% the person bears no resemblance to their pre-zombified face)
gep_level: Factor that codes whether the participant was originally exposed to the low- (0) or high-intensity (1) genetic enhancement program at JIG\:SAW
tse_months: How many months since the participant completed the genetic enhancement program that resulted in their zombification

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Zombie rehabilitation longitudinal data

Description

Usage

rehab_growth_dat
rehab_growth_dat

Format

A tibble with 564 rows and 5 variables:

id: Participant ID
intervention: character vector that codes which arm of the trial the participant was randomized to (wait list or gene therapy)
resemblance: How closely their face resembled their pre-zombified state (100% = the participants face is exactly like their original face, 0% the person bears no resemblance to their pre-zombified face)
time: Character vector that expresses when resemblance was measured as "t0" (baseline), "t1" (1 month follow-up), "t6" (6-month follow up) and "t12" (12-month follow-up)
time_num: integer vector exptressing time in months from baseline (0, 1, 6, 12) as a number

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Zombie immobility data

Description

A dataset relating to the degree to which zombies can be rendered immobile by repetitive transcranial magnetic stimulation (rTMS) and zapping with a taser. It is taken from Chapter 14 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

taser_dat
taser_dat

Format

A tibble with 156 rows and 4 variables:

id: Victim ID
immobility: The duration of the zombie's immobility, seconds
r_tms: The frequency of rTMS, Hz
taser: The output of the taser, kV

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Teddy therapy data.

Description

A dataset based on two ficticious studies looking at whether cuddling a teddy bear (compared to the cardboard box that the teddy was packaged in) affects self-reported self-esteem. It is taken from Chapter 10 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

teddy_dat
teddy_dat

Format

A tibble with 220 rows and 4 variables:

id: Participant ID
group: whether the participant cuddled a teddy bear or the cardboard box that contained the teddy bear
study_n: Factor that distinguishes the two studies in the data set. The first was based on a total N of 20 and the second was based on a total N of 200
self_esteem: Self-reported self-esteem scores

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Subset of Zhang data (female sample N = 20)

Description

A dataset extracted from Ha, T., Overbeek, G., & Engels, R. C. M. E. (2010). Effects of Attractiveness and Social Status on Dating Desire in Heterosexual Adolescents: An Experimental Study. Archives of Sexual Behavior, 39(5), 1063-1071. doi:10.1007/s10508-009-9561-z. The study looked at accuracy on a maths test when performed under the participant's own name or a fake name. It is taken from Chapter 15 of Field, A. P. (2016). An adventure in statistics: the reality enigma. London: Sage.

Usage

zhang_female_dat
zhang_female_dat

Format

A tibble with 20 rows and 2 variables:

name: Specifies whether participants completed the test under their own name or a fake name
accuracy: Accuracy on a maths test, percent

Source

https://www.discoveringstatistics.com/books/an-adventure-in-statistics/

Package 'adventr'

Help Index

adventr: tutorials in R for An Adventure in Statistics (Field, 2016).

Description

Who is the package aimed at?

What is covered?

Running a tutorial

A list of tutorials

References

Alice's gene data

Description

Usage

Format

Source

Calcite mask data

Description

Usage

Format

Source

Zombie garlic data

Description

Usage

Format

Source

Subset of Ha et al. (2010) data

Description

Usage

Format

Source

Memory implanting data

Description

Usage

Format

Source

JIG:SAW employee data.

Description

Usage

Format

Source

Invisibility paste data

Description

Usage

Format

Source

Memory erasing data with covariate

Description

Usage

Format

Details

Source

Memory erasing data

Description

Usage

Format

Source

Alice's RAS scores

Description

Usage

Format

Source

Zombie recovery data

Description

Usage

Format

Source

Zombie rehabilitation data

Description

Usage

Format

Source

Zombie rehabilitation longitudinal data

Description

Usage

Format

Source

Zombie immobility data

Description

Usage

Format

Source