Package 'resampledata3'

Title: Data Sets for "Mathematical Statistics with Resampling and R" (3rd Ed)
Description: Data sets for Chihara and Hesterberg (2022, ISBN: 978-1-119-87404-1) "Mathematical Statistics with Resampling in R" (3rd Ed).
Authors: Laura Chihara [aut], Tim Hesterberg [aut, cre]
Maintainer: Tim Hesterberg <[email protected]>
License: CC0
Version: 1.0
Built: 2025-02-28 05:01:25 UTC
Source: https://github.com/cran/resampledata3

Help Index


Data Sets for "Mathematical Statistics with Resampling and R" (3rd Ed)

Description

Data sets for Chihara and Hesterberg (2022, ISBN: 978-1-119-87404-1) "Mathematical Statistics with Resampling in R" (3rd Ed). https://github.com/lchihara/MathStatsResamplingR

Examples

# For a list of datasets do:
library(help = resampledata3)

Alcohol content and calories of beers

Description

Alcohol content and calories for a sample of ale and lager beers.

Usage

Alelager

Format

A data frame with 31 observations on the following 4 variables.

ID

Subject ID

Type

Beer: ale or lager

Alcohol

Percentage alcohol content

Calories

Number of calories


Arsenic levels of wells in Bangladesh

Description

Levels of arsenic, chlorine and cobalt in a sample of 271 wells in Bangladesh.

Usage

Bangladesh

Format

A data frame with 271 observations on the following 3 variables.

Arsenic

Arsenic level, ppb

Chlorine

Chlorine level, ppb

Cobalt

Cobalt level, ppb

Source

https://www2.bgs.ac.uk/groundwater/health/arsenic/Bangladesh/data.html

References

Reproduced with the permission of the British Geological Survey, copyright UKRI. All Rights Reserved.


Beer and hotwings consumption

Description

Beer and hotwings consumption by a sample of patrons at a Minneapolis bar.

Usage

Beerwings

Format

A data frame with 30 observations on the following 4 variables.

ID

Subject ID

Hotwings

Number of hotwings consumed

Beer

Ounces of beers consumed

Gender

Gender of patron (M/F)

Source

Data collected by Nicole Catchpole in 2004 (private communication).


Price of textooks at a college bookstore

Description

Price of textbooks at a college bookstore.

Usage

BookPrices

Format

A data frame with 44 observations on the following 3 variables.

Subject

Biology Chemistry Computer Science Economics Educational Studies Geology Mathematics Physics Political Science Psychology SOAN

Area

Classification of subject as either Math & Science or Social Sciences

Price

Price in U.S.~dollars

Source

Data collected by R.~Hien and S.~Becker in 2010 (private communication).


Fish supply and demand for bushmeat in Ghana

Description

Fish supply (kg) and demand for bushmeat in Ghana.

Usage

Bushmeat

Format

A data frame with 30 observations on the following 4 variables.

Fish

Fish supply (in kg.) per capita

Biomass

Biomass

Year

Year

Change

Percent change in biomass

Details

Biomass of large mammals was calculated for each year by multiplying the number of animals observed in 700 walking counts of 10 to 15 km each by species-specific body weights. The products of these calculations were then summed across all species.

Source

Brashares, Arces, Sam, Coppolillo, Sinclaire, Balmford, Bushmeat hunting, wildlife declines, and fish supply in West Africa, Science. 2004 Nov 12.


Cafeteria

Description

Nutritional data on meals served in a college cafeteria.

Usage

Cafeteria

Format

A data frame with 41 observations on the following 9 variables.

ID

a numeric vector

Type

type of meal, Meat or Vegetarian

Calories

number of calories

Carbohydrates

number of carbohydrates

Fiber

fiber content

Fat

fat content

Cholesterol

cholesterol

Protein

protein

Sodium

sodium

Source

Stephenson (private communication).


Cereals

Description

Nutritional data on a sample of cereals.

Usage

Cereals

Format

A data frame with 43 observations on the following 5 variables.

ID

a numeric vector

Age

target consumer, adult or children

Shelf

location of cereal, bottom, middle, or top shelf

Sodiumgram

sodium content in grams

Proteingram

protein content in grams


Challenger

Description

Data on O-rings in 23 space shuttle flights prior to the Challenger shuttle disaster of January 1986.

Usage

Challenger

Format

A data frame with 23 observations on the following 3 variables.

Date

Data of launch

Temperature

Air temperature at launch (F)

Incident

Binary variable, 1 if one of the 0-rings on one of the booster rockets was damaged, 0 otherwise

Source

https://archive.ics.uci.edu/ml/datasets/Challenger+USA+Space+Shuttle+O-Ring

References

Dala, S.~R., Fowlkes, E.~B., Hoadley, B (1989). Risk analysis of the space shuttle: pre-Challenger prediction of failure. J.~American Statistical Association, 84, 945-957.


ChiMarathonMen

Description

Times from a sample of men who completed the Chicago marathon in 2015.

Usage

data("ChiMarathonMen")

Format

A data frame with 80 observations on the following 4 variables.

name

Name of competitor

Division

Age group

Finish

Finish time

FinishMin

Time in minutes

Source

https://chicago-history.r.mikatiming.com/


Cuckoos

Description

Female cuckoos lay their eggs on the ground and then move them to the nests of other birds. Latter gathered data on the lengths of the cuckoo eggs found in these foster nests.

Usage

data("Cuckoos")

Format

A data frame with 120 observations on the following 2 variables.

Eggs

Lengths of eggs (mm) of cuckoos

Bird

Species of birds: HedgeSparrow, MeadowPipit, PiedWagtail, Robin, TreePipit, Wren

Source

Tippett, L. H. C. (1952). The Methods of Statistics, 4th Edition. Wiley.

References

Latter, O. (1902). An enquiry into the dimensions of the Cuckoo's egg and the relation of the variations to the size of eggs of the foster-parent, with notes on coloration. Biometrika 1 (2): 164-176.


Diving 2017

Description

Scores of 12 female divers (10 m platform) in the 2017 FINA World Championships.

Usage

data("Diving2017")

Format

A data frame with 12 observations on the following 4 variables.

Name

Name of competitor

Country

Country

Semifinal

Score in the semi-finals

Final

Score in the finals.

Details

Competitors perform 5 dives in each round and the sum of these 5 dives determines who moves on to the next round.

Source

https://www.fina.org/competitions/213/17th-fina-world-championships-2017/results?disciplines=DV


Eyes

Description

Measurements of eyes of 40 people.

Usage

data("Eyes")

Format

A data frame with 40 observations on the following 6 variables.

ID

Subject ID

age

Age of subject

hand

Dominant hand of subject, left or right

eye

Dominant eye of subject, left or right

leftPD

Left pupillary distance (mm)

rightPD

Right pupillary distance (mm)

Source

Westfield (private communication).


Fatalities

Description

A random sample of driver fatalities in 2009 in Pennsylvania.

Usage

Fatalities

Format

A data frame with 100 observations on the following 3 variables.

ID

Subject ID

Alcohol

Alcohol involved? 1 = yes, 0 = no

Age

Age

Details

The drivers were driving a car, SUV, or light pickup truck (vehicles such as motor homes, convertibles, or commercial vehicles are excluded).

Source

http://www.nhtsa.gov/FARS


Mercury content in a sample of fish in Minnesota

Description

Mercury levels (ppm) in a sample of fish caught in Minnesota

Usage

FishMercury

Format

A data frame with 30 observations on the following variable.

Mercury

Mercury level in ppm

Source

Minnesota pollution control agency.


Length of delays of airline flights

Description

Length of delays for flights on American Airlines and United Airlines in 2009

Usage

data("FlightDelays")

Format

A data frame with 4029 observations on the following 10 variables.

ID

Subject ID

Carrier

Airline: American Airlines AA or United Airlines UA

FlightNo

Flight number

Destination

Destination: BNA, DEN, DFW, IAD, MIA, ORD, STL

DepartTime

Departure time: 4-8am 4-8pm 8-Mid 8-Noon Noon-4pm

Day

Day of week

Month

Month: May or June

FlightLength

Length of flight

Delay

Delay time (in minutes)

Delayed30

Delayed more than 30 minutes? No or Yes

Details

All departures of AA or UA flights from LaGuardia Airport in May or June of 2009.

Source

https://www.bts.gov/topics/airlines-and-airports/quick-links-popular-air-carrier-statistics


Births of girls in Alaska or Wyoming

Description

Data on births of a random sample of girls in Alaska or Wyoming in 2004.

Usage

data("Girls2004")

Format

A data frame with 80 observations on the following 6 variables.

ID

Subject ID

State

State: AK or WY

MothersAge

Age of mother: 15-19, 20-24, 25-29, 30-34, 35-39, 40-44

Smoker

Mother a smoker? No or Yes

Weight

Weight of baby (grams)

Gestation

Gestation time (weeks)

Source

http://wonder.cdc.gov/natality-current.html


Groceries

Description

Prices of a sample of grocery items at Target or Walmart.

Usage

Groceries

Format

A data frame with 30 observations on the following 4 variables.

Product

Grocery item

Size

Package size

Target

Price at Target

Walmart

Price at Walmart


General Social Survey 2018

Description

General Social Survey data from 2018

Usage

GSS2018

Format

A data frame with 2348 observations on the following 17 variables.

ID

Subject ID

Region

Midwest, Northeast, South, West

GenderNow

Gender of subject: A gender not listed here, Man, Not applicable, Transgender, Woman

Age

Age

Marital

Marital status: Divorced, Married, Never married, Separated, Widowed

Degree

Education: Bachelor Graduate, High school Junior college, Less than high school

Employed

Employed? No or Yes

Income

Income level

Polviews

Political views:Conservative, Extremely liberal, Extremely conservative, Liberal, Moderate, Slightly conservative, Slightly liberal

Pres16

Voted for whom in presidential election of 2016? Clinton, Other, Trump

DeathPenalty

Opinion on death penalty: Favor, Oppose

Courts

How courts deal with criminals: About right, Dont know, Not harsh enough, Too harsh

Attend

Attendance at religious services: Monthly, Never, Occasionally, Weekly

Postlife

Believe in life after death? Dont know, No, Yes

Happy

General happiness level: Not too happy, Pretty happy, Very happy

Satfin

Satisfaction with financial situation: More or less, Not at all, Satisfied

Energy

Government spending on developing alternative energy sources: About right, Dont know, Too little, Too much

Source

https://gss.norc.org


Ice Cream

Description

Nutritional information on a sample of ice cream.

Usage

data("IceCream")

Format

A data frame with 39 observations on the following 7 variables.

Brand

Brand of ice cream

VanillaCalories

Calories in vanilla

VanillaFat

Fat (gm) in vanilla ice cream

VanillaSugar

Sugar (gm) in vanilla ice cream

ChocolateCalories

Calories in chocolate ice cream

ChocolateFat

Fat (gm) in chocolate ice cream

ChocolateSugar

Sugar (gm) in chocolate ice cream


Birth weight of boys born in Illinois

Description

Birth weight of boys born in Illinois.

Usage

ILBoys

Format

A data frame with 241 observations on the following 2 variables.

MothersAge

Age range of mother: 15-19, 20-24, 25-29

Weight

Weight of baby (gm)

Details

Random sample of boys born to mothers in Illinois in 2004. Births are restricted to single births only and gestation lengths of at least 37 weeks.


Illiteracy

Description

Data on female illiteracy in a sample of countries where illiteracy is more than 5%.

Usage

Illiteracy

Format

A data frame with 94 observations on the following 4 variables.

ID

Country ID

Country

Name of country

Illit

Percentage of women over 15 years old who are illiterate (2003)

Births

Number of births per woman in that country (2005)

Source

www.unesco.org, www.data.worldbank.org


Lottery

Description

Winning lottery numbers for Fantasy 5 in California.

Usage

Lottery

Format

A data frame with 500 observations on the following variable.

Win

Number

Details

In Fantasy 5, a lottery game in California, a player tries to match 5 numbers chosen from 1 through 39. This data are the winning numbers for the daily games from 5 May 2010 through 15 August 2010.

Source

http://www.calottery.com/play/draw-games/fantasy-5


Math Anxiety

Description

Data from a study on math anxiety in a sample of primary and secondary school students in Italy

Usage

MathAnxiety

Format

A data frame with 599 observations on the following 6 variables.

Age

Age

Gender

Gender: Boy, Girl

Grade

Grade: Secondary, Primary

AMAS

Score on Abbreviated Math Anxiety Scale

RCMAS

Score on Revised Abbreviated Math Anxiety Scale

Arith

Score on arithmetic test

Source

Hill, Mammarella, Devine, et al (2016). Maths anxiety in primary and secondary school students: gender differences, developmental changes and anxiety specificity. Learning and Individual Differences 48: 45-53


Carbon dioxide levels collected by Mauna Loa Observatory

Description

Average CO2 levels (ppm) for the month of May from 1990 to 2010.

Usage

Maunaloa

Format

A data frame with 21 observations on the following 3 variables.

ID

Subject ID

Year

Year

Level

Carbon dioxide level (ppm)

Source

https://www.esrl.noaa.gov/gmd/ccgg/trends


Minnesota groundwater

Description

Measurements on water quality in wells in Minnesota.

Usage

MnGroundwater

Format

A data frame with 895 observations on the following 10 variables.

County

Minnesota county

Aquifer.Group

Type of aquifer: buried Quaternary, Cambrian, Cretaceous, Devonian, Ordovician, Precambrian, surficial Quaternary

Water.Level

Water level

Alkalinity

Alkalinity

Aluminum

Aluminum

Arsenic

Arsenic

Chloride

Chloride

Lead

lead

pH

pH level

Basin.Name

Basin name

Source

Minnesota Pollution Control Agency


Mobile Ads

Description

Google experiment on effectiveness of certain recommendations for bidding on ads.

Usage

MobileAds

Format

A data frame with 655 observations on the following 40 variables.

Campaign

a numeric vector

m.impr_post

a numeric vector

m.impr_pre

a numeric vector

m.click_post

a numeric vector

m.click_pre

a numeric vector

m.cost_post

a numeric vector

m.cost_pre

a numeric vector

m.conv_post

a numeric vector

m.conv_pre

a numeric vector

m.value_post

a numeric vector

m.value_pre

a numeric vector

m.cpm_pre

a numeric vector

m.cpm_post

a numeric vector

m.cpc_pre

a numeric vector

m.cpc_post

a numeric vector

m.cpa_pre

a numeric vector

m.cpa_post

a numeric vector

m.cpr_pre

a numeric vector

m.cpr_post

a numeric vector

mult.change

a numeric vector

d.impr_post

a numeric vector

d.impr_pre

a numeric vector

d.click_post

a numeric vector

d.click_pre

a numeric vector

d.cost_post

a numeric vector

d.cost_pre

a numeric vector

d.conv_post

a numeric vector

d.conv_pre

a numeric vector

d.value_post

a numeric vector

d.value_pre

a numeric vector

d.cpm_pre

a numeric vector

d.cpm_post

a numeric vector

d.cpc_pre

a numeric vector

d.cpc_post

a numeric vector

d.cpa_pre

a numeric vector

d.cpa_post

a numeric vector

d.cpr_pre

a numeric vector

d.cpr_post

a numeric vector

error.cpr_pre

a numeric vector

error.cpr_post

a numeric vector

Details

Subset of experimental data for one advertiser. See Chihara and Hesterberg textbook for more information.

Source

Ed Lee (Google)

References

Chihara and Hesterberg, Mathematical Statistics with Resampling and R (2022). Wiley.


Nasdaq stock data

Description

Opening and closing stock prices for a random sample of 50 stock funds on NASDAQ on 1 December 2017.

Usage

Nasdaq

Format

A data frame with 50 observations on the following 4 variables.

Symbol

Stock symbol

Open

Opening price

Close

Closing price

Volume

Number of shares traded

Source

https://finance.yahoo.com


NBA 2016-2017 season

Description

Basketball statistics for a sample of NBA players from 4 teams for the 2016-2017 season.

Usage

data("NBA1617")

Format

A data frame with 68 observations on the following 13 variables.

Name

Player name

Position

Position: C (center), PF (power forward), PG (point guard), SF (small forward), SG (shooting guard)

Team

Team: Brooklyn, Charlotte, Cleveland, San Antonio

Games

Number of games played

Minutes

Number of minutes plyaed

PercFG

Field goal percentage

Perc3P

3-point field goal percentage

Perc2P

2-point field goal percentage

PercFT

Free throw percentage

OffReb

Offensive rebounds

DefReb

Defensive rebounds

Assists

Assists

Blocks

Blocks

Details

Players in this data set played a minimum of 100 minutes during the 2016-2017 season.

Source

https://www.basketball-reference.com/


Birth weights of babies born

Description

Birth weights of babies born in North Carolina in 2004

Usage

NCBirths2004

Format

A data frame with 1009 observations on the following 7 variables.

ID

Subject ID

MothersAge

Mother's age level

Smoker

Mother a smoker? codeNo, Yes

Alcohol

Mother consumed alcohol during pregnancy? No, Yes

Gender

Baby's gender

Weight

Baby's weight (gm)

Gestation

Gestation length (weeks)

Details

Babies in this random sample had a gestation period of at least 37 weeks and were single births (that is, not one of a twin or triplet).

Source

http://wonder.cdc.gov/natality-current.html

References

Chihara and Hesterberg, Mathematical Statistics with Resampling and R, 2022 (Wiley).


Olympics 2012

Description

Data on a sample of athletes competing in the 2012 London Olympics.

Usage

Olympics2012

Format

A data frame with 42 observations on the following 7 variables.

Name

Name of athlete

Country

Country

Age

Age

Sex

Sex: F, M

Height

Height (inches)

Weight

weight (lb)

Sport

Sport


Oscars

Description

Age and gender of Academy Award winners

Usage

Oscars

Format

A data frame with 188 observations on the following 6 variables.

Year

Year of award

Actor

Name of actor

Movie

Movie

Gender

Gender: Man, Woman

Birthyear

Birth year of actor

Age

Age at time of award

Source

https://www.oscars.org/


Philadelphia Phillies data 2009

Description

Baseball data for Philadelphia Phillies during the 2009 season.

Usage

Phillies2009

Format

A data frame with 162 observations on the following 8 variables.

Date

Date of game

Location

Game played where: Away, Home

Outcome

Outcome of game: Lose, Win

Outcome2

Outcome recoded: 1=win, 0 = lose

Hits

Number of hits

Doubles

Number of doubles

Homeruns

Number of homeruns

StrikeOuts

Number of strikeouts

Source

https://www.baseball-reference.com/


Quakes

Description

Time between earthquakes for all earthquakes of magnitude 6 or greater (1970-2009).

Usage

data("Quakes")

Format

A data frame with 805 observations on the following 2 variables.

ID

Subject ID

TimeDiff

Time (days)

Source

http://earthquakes.usgs.gov/earthquakes/eqarchives


Quetzal

Description

Heights of nests and snags for the quetzal (bird).

Usage

Quetzal

Format

A data frame with 21 observations on the following 3 variables.

Country

Country: Costa Rica, Guatemala

Nest

Height of nest (meters)

Snag

Height of snag (meters)

Details

The quetzal typically nests in abandoned woodpecker nests in dead tree trunks (snags).

Source

Siegfried, D., Linville, D., Hille, D. (2010). Analysis of nest sites and the resplendent quetzal (pharomachrus mocinno): relationship between nest and snag heights. Wilson Journal of Ornithology 122: 608-11.


Rangers and Twins baseball players (2016 season)

Description

Data on baseball players (excluding pitchers) who played for the Texas Rangers or Minnesota Twins.

Usage

data("RangersTwins2016")

Format

A data frame with 27 observations on the following 17 variables.

Name

Name of player

Team

Team: Rangers, Twins

Pos

Player's position

Age

Age in years

Games

Number of games played

AtBats

Number of at bats

Runs

Runs

Hits

Hits

Doubles

Doubles

Triples

Triples

HR

Homeruns

RBI

Runs batted in

SB

Stolen bases

CS

Caught stealing

BB

Base on balls

SO

Strike outs

BA

Batting average

Details

Data on baseball players (excluding pitchers) who played for the Texas Rangers or Minnesota Twins. These players played at least 50 games. During the 2016 season, the Rangers had the best winning percentage (0.586) in the American League while the Twins had the worst (0.364)

Source

www.baseball-reference.com


Recidivism

Description

Recidivism data from Iowa.

Usage

Recidivism

Format

A data frame with 17022 observations on the following 7 variables.

Gender

Gender: F, M

Age

Age group: 25-34, 35-44, 45-54, 55 and Older, Under 25

Age25

Over or Under 25 years of age? Over 25, Under 25

Offense

Type of offense: Felony Misdemeanor

Recid

Recidivated? No, Yes

Type

Reason: New (new crime), No Recidivism (did not recidivate), Tech (technical violation, such as a parole violation)

Days

Number of days to recidivism; NA if no recidivism

Details

All offenders convicted of either a misdemeanor or felony who were released from an Iowa prison during the 2010 fiscal year ending in June.

Source

https://data.iowa.gov/Public-Safety/3-Year-Recidivism-for-Offenders-Released-from-Pris/mw8r-vqy4


Salaries of baseball players

Description

Salaries of a random sample of baseball players from 1985 and 2015.

Usage

Salaries

Format

A data frame with 70 observations on the following 3 variables.

League

League: American National

Salary

Salary (in millions) in 2015 dollars

Year

Year: 1985 or 2015


Service times at a college snack bar.

Description

Time to be served at a college snack bar.

Usage

Service

Format

A data frame with 174 observations on the following 2 variables.

ID

Subject ID

Times

Time in minutes

Source

Haynor, Lojovich, Syed (private communication, 2010).


Skateboard experiment

Description

Measurement of testosterone levels in males in a skateboard experiment.

Usage

Skateboard

Format

A data frame with 71 observations on the following 3 variables.

Age

Age in years

Experimenter

Treatment (gender of experimenter): Female, Male

Testosterone

Testosterone level

Details

Results from an experiment where male skateboarders performed tricks in front of either a female or male.

Source

Ronay and Hippel (2010). The presence of an attractive woman elevates testosterone and physical risk taking in young men. Social Psychological and Personality Science 1:57-64.


Figure skating scores for men from the 2010 Winter Olympics.

Description

Short and free skate scores for male figure skaters in the 2010 Winter Olympics (Vancouver).

Usage

Skating2010

Format

A data frame with 24 observations on the following 5 variables.

Country

Country of skater

Name

Name

Short

Short program score

Free

Free skate score

Total

Total

Source

https://skatingscores.com/0910/oly/


Spruce data

Description

Measurements from an experiment on the growth of black spruce seedlings.

Usage

Spruce

Format

A data frame with 72 observations on the following 9 variables.

Tree

Subject ID

Competition

Treatment: C (competition), NC (no competition)

Fertilizer

Treatment: F (fertilizer), NF (no fertilizer)

Height0

Height of seedling at start

Height5

Height of seedling after 5 years

Diameter0

Diameter of seedling at start

Diameter5

Diameter of seedling after 5 years

Ht.change

Change in height

Di.change

Change in diameter

Details

Experiment on growth of black spruce seedlings under treatments of fertilizer-no fertilizer, competition- no competition (weeding).

Source

Camill, Chihara, Adams, et al (2010). Early life history transitions and recruitment of Picea mariana in thawed boreal permafrost peatlands. Ecology 2:448-459.


Starcraft

Description

Number of wins by a sample of Korean players in Starcraft, a strategy video game.

Usage

Starcraft

Format

A data frame with 45 observations on the following 4 variables.

ID

Subject ID

Race

Chosen race of player: Protoss, Terran, Zerg

Age

Age of player

Wins

Number of wins

Source

Evans, private communication. http://www.teamliquid.net/tipd/players


Titanic

Description

Subset of Titanic data

Usage

Titanic

Format

A data frame with 658 observations on the following 3 variables.

ID

Subject ID

Survived

Survival status: 1 = survived, 0 = died

Age

Age of passenger

Details

Subset of passenger data on the Titanic.

Source

https://data.world/nrippner/titanic-disaster-dataset


Turbine

Description

Average daily wind speeds (2010) from Carleton College turbine.

Usage

data("Turbine")

Format

A data frame with 168 observations on the following 4 variables.

Date2010

Date

AveKW

Average kilowatts

AveSpeed

Average speed (m/s)

Production

Energy output (kilowatt hours)

Source

Carleton College, Northfield MN.

References

Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R. (Wiley)


TV commercial lengths

Description

Lengths of television commercials on basic and extended cable TV channel.s

Usage

data("TV")

Format

A data frame with 20 observations on the following 3 variables.

ID

Subject ID

Times

Time (min)

Cable

Cable: Basic, Extended

Details

Lengths of TV commercials during any given half-hour time period.

Source

Rodgers, Robinson (private communication).


Texas birth weights

Description

Weights of babies born in Texas in 2004.

Usage

TXBirths2004

Format

A data frame with 1587 observations on the following 8 variables.

ID

Subject ID

MothersAge

Mother's age: 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, under 15

Smoker

Mother smokes? No, Yes

Gender

Gender of baby: Female, Male

Weight

Weight of baby (g)

Gestation

Gestation length (weeks)

Number

Baby a single birth (1), twin (2), etc.

Multiple

Part of multiple birth (eg twin, triple)?: No, Yes

Details

Random sample of babies born in Texas in 2004.

Source

http://wonder.cdc.gov/natality-current.html


Verizon repair times

Description

Repair times by Verizon for its customers or customers of other telephone companies.

Usage

Verizon

Format

A data frame with 1687 observations on the following 2 variables.

Time

Repair time (h)

Group

Customer: CLEC (competing local exchange carrier), ILEC (incumbent local exchange carrier)

Details

Verizon is responsible for providing repair service to both its customers (ILEC) and its competitors (ILEC).

References

Chihara and Hesterberg (2022). Mathematical Statistics with Resamplng and R (Wiley).


Volleyball data

Description

Data on a sample of Division I women volleyball teams.

Usage

Volleyball2009

Format

A data frame with 30 observations on the following 4 variables.

Team

Team

HitPercent

Hitting percentage

Assts

Assists

Kills

Kills

Source

http://www.ncaa.org/championships/statistics/womens-volleyball-statistics


Walleye

Description

Lengths and weights of a sample of walleye caught in Minnesota lakes (1990's).

Usage

Walleye

Format

A data frame with 60 observations on the following 2 variables.

Length

Length (inches)

Weight

Weight (pounds)

Source

Monson, Minnesota Pollution Control Agency (private communication)


Watertable

Description

Relationship between the depth of the watertable and survival status of black spruce seedings.

Usage

Watertable

Format

A data frame with 360 observations on the following 2 variables.

Depth

Depth of watertable (cm)

Alive

Status of seedling: 1 = alive, 0 = dead

Details

Part of the data from an experiment to study factors associated with the growth of black spruce seedlings under various treatments. Status of seedling at the end of the second year of the experiment is noted here.

Source

Camill, Chihara, Adams, et al (2010). Early life history transitions and recruitment of Picae mariana in thawed boreal permafrost peatlands. Ecology 2:448-459.

References

Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R (Wiley).