Help for package educationR

Type:

Package

Title:

A Comprehensive Collection of Educational Datasets

Version:

0.1.0

Maintainer:

Renzo Caceres Rossi <arenzocaceresrossi@gmail.com>

Description:

Provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics. This package serves as a resource for educational researchers, data analysts, and statisticians to explore and analyze data in the field of education.

License:

GPL-3

URL:

https://github.com/lightbluetitan/educationr, https://lightbluetitan.github.io/educationr/

BugReports:

https://github.com/lightbluetitan/educationr/issues

Encoding:

UTF-8

LazyData:

true

Suggests:

ggplot2, dplyr, knitr, rmarkdown, testthat (≥ 3.0.0)

RoxygenNote:

7.3.2

VignetteBuilder:

knitr

NeedsCompilation:

Packaged:

2024-12-03 23:09:29 UTC; renzocrossi

Author:

Renzo Caceres Rossi [aut, cre]

Depends:

R (≥ 3.5.0)

Repository:

CRAN

Date/Publication:

2024-12-05 18:50:04 UTC

educationR: A Comprehensive Collection of Educational Datasets

Description

This package provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics.

Details

educationR: A Comprehensive Collection of Educational Datasets

A Comprehensive Collection of Educational Datasets.

Author(s)

Maintainer: Renzo Cáceres Rossi arenzocaceresrossi@gmail.com

Math Achievement Test Scores by Gender

Description

This dataset, Achieve_tbl_df, is a tibble containing information about math achievement test scores for 25 high school students, categorized by gender. The dataset provides insights into the distribution of scores between male and female students.

Usage

data(Achieve_tbl_df)

Format

A tibble with 25 observations and 2 variables:

score: Math achievement test score of the student (integer).
gender: Gender of the student (factor with 2 levels: "Male", "Female").

Details

The dataset name has been kept as 'Achieve_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Math Test Scores versus Anxiety Scores

Description

This dataset, Anxiety_tbl_df, is a tibble containing information about math test scores and anxiety levels before the test for 20 students. The dataset provides insights into the relationship between anxiety levels and math test performance.

Usage

data(Anxiety_tbl_df)

Format

A tibble with 20 observations and 2 variables:

anxiety: Anxiety score of the student before taking the math test (integer).
math: Math test score of the student (integer).

Details

The dataset name has been kept as 'Anxiety_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Grad Rates: Athletes vs. Non-Athletes Big Ten Conf.

Description

This dataset, Bigten_tbl_df, is a tibble containing information about the graduation rates of student athletes and nonathletes in the Big Ten Conference. The dataset includes data from two academic years, showing the graduation rates by school and athlete status (athletes vs nonathletes).

Usage

data(Bigten_tbl_df)

Format

A tibble with 44 observations and 4 variables:

school: Name of the school (character).
year: Year of the data (factor with 2 levels).
rate: Graduation rate percentage (integer).
status: Athlete status (character, either 'Athlete' or 'Nonathlete').

Details

The dataset name has been kept as 'Bigten_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

NCAA Graduation Rates Report, 2000.

Test Scores on First Exam in Biology Class

Description

This dataset, Biology_tbl_df, is a tibble containing the test scores of 30 students on their first exam in a biology class. The dataset provides insight into the distribution of scores among the students.

Usage

data(Biology_tbl_df)

Format

A tibble with 30 observations and 1 variable:

score: Test scores on the first biology exam (integer).

Details

The dataset name has been kept as 'Biology_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Education Level of Blacks by Gender

Description

This dataset, Blackedu_tbl_df, is a tibble containing information about the education level of Black individuals, categorized by gender. The dataset includes 3800 observations and provides insights into the distribution of education levels across different gender groups.

Usage

data(Blackedu_tbl_df)

Format

A tibble with 3800 observations and 2 variables:

gender: Gender of the individual (factor with 2 levels).
education: Education level of the individual (factor with 5 levels).

Details

The dataset name has been kept as 'Blackedu_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Bureau of Census data.

Books Read & Spelling Scores: 17 Third Graders

Description

This dataset, Books_tbl_df, is a tibble containing information about the number of books read and spelling scores for 17 third-grade students. The dataset provides insights into the relationship between the number of books read and spelling scores in this group of students.

Usage

data(Books_tbl_df)

Format

A tibble with 17 observations and 2 variables:

book: Number of books read by the student (integer).
spelling: Spelling score of the student (integer).

Details

The dataset name has been kept as 'Books_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Detroit Auto Workers: Education Levels

Description

This dataset, Detroit_tbl_df, is a tibble containing data on the educational levels of a sample of 40 auto workers in Detroit. The dataset includes information on the highest level of education attained by each worker.

Usage

data(Detroit_tbl_df)

Format

A tibble with 40 observations and 1 variable:

educ: Educational level of the auto worker (integer).

Details

The dataset name has been kept as 'Detroit_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Dev Students: 2-Year & 4-Year College Demographics

Description

This dataset, Develop_tbl_df, is a tibble containing demographic data on students enrolled in developmental education programs at 2-year and 4-year colleges. The dataset includes information on the racial background of the students and whether they are enrolled in 2-year or 4-year colleges.

Usage

data(Develop_tbl_df)

Format

A tibble with 5656 observations and 2 variables:

race: Racial background of the student (factor with 5 levels).
college: Type of college the student is enrolled in (factor with 2 levels: 2-year or 4-year).

Details

The dataset name has been kept as 'Develop_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from "Research in Development Education" (1994), V. 11, 2.

Fall '95 Developmental Math: Failed Student Scores

Description

This dataset, Devmath_tbl_df, is a tibble containing test scores for students who failed developmental mathematics in the fall semester of 1995. The dataset includes the scores of these students as part of a simulated study.

Usage

data(Devmath_tbl_df)

Format

A tibble with 40 observations and 1 variable:

score: Test scores of students who failed developmental mathematics (integer).

Details

The dataset name has been kept as 'Devmath_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package. Data provided by Dr. Anita Kitchens.

Dyslexia: Data on College Students Diagnosed with Dyslexia

Description

This dataset, Dyslexia_tbl_df, is a tibble containing data on a group of college students diagnosed with dyslexia. The dataset includes various personal characteristics such as age, gender, handedness, weight, height, and number of children, along with the number of words they were able to read correctly.

Usage

data(Dyslexia_tbl_df)

Format

A tibble with 8 observations and 7 variables:

words: Number of words read correctly (integer).
age: Age of the student (integer).
gender: Gender of the student (character).
handed: Handedness of the student (character).
weight: Weight of the student (integer).
height: Height of the student (integer).
children: Number of children the student has (integer).

Details

The dataset name has been kept as 'Dyslexia_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Engineering Grads Salary: 10 Years Post-Graduation

Description

This dataset, Engineer_tbl_df, is a tibble containing salary data for engineering graduates 10 years after graduation. The dataset includes information on the salary of the graduates and the type of university they graduated from (categorized into three types of universities).

Usage

data(Engineer_tbl_df)

Format

A tibble with 51 observations and 2 variables:

salary: Salary of the engineering graduate 10 years after graduation (integer).
university: Type of university the graduate attended (factor with 3 levels).

Details

The dataset name has been kept as 'Engineer_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

College entrance exam scores for 24 high school seniors

Description

This dataset, Entrance_tbl_df, is a tibble containing the college entrance exam scores of 24 high school seniors. The dataset includes information on their exam scores.

Usage

data(Entrance_tbl_df)

Format

A tibble with 24 observations and 1 variable:

score: College entrance exam scores (integer).

Details

The dataset name has been kept as 'Entrance_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

High School GPA versus College GPA

Description

This dataset, GPA_college_tbl_df, is a tibble containing data on the relationship between high school GPA and college GPA for 10 students. The dataset is valuable for analyzing academic performance correlations across different educational levels.

Usage

data(GPA_college_tbl_df)

Format

A tibble with 10 observations and 2 variables:

hsgpa: High school GPA (numeric).
collgpa: College GPA (numeric).

Details

The dataset name has been kept as 'GPA_college_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Before and after German copying errors post-course

Description

This dataset, German_tbl_df, is a tibble containing data on the number of errors made by 20 students when copying a German passage, both before and after participating in an experimental German course. The dataset provides insights into language learning and the effectiveness of the course.

Usage

data(German_tbl_df)

Format

A tibble with 20 observations and 3 variables:

student: Identifier for the student (character).
when: Indicates whether the errors were recorded "before" or "after" the experimental course (character).
errors: Number of errors made by the student when copying the German passage (integer).

Details

The dataset name has been kept as 'German_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Test Grades in a Beginning Statistics Class

Description

This dataset, Grades_stats_tbl_df, is a tibble containing test grades for a beginning statistics class. It includes the grades of 29 students and provides insights into the performance distribution in an introductory statistics course.

Usage

data(Grades_stats_tbl_df)

Format

A tibble with 29 observations and 1 variable:

grades: Test grades (integer).

Details

The dataset name has been kept as 'Grades_stats_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Graduation Rates for Student Athletes in the Southeastern Conf.

Description

This dataset, Graduate_tbl_df, is a tibble containing data on the graduation rates for student athletes in various schools within the Southeastern Conference. It includes the name of the school, a code for identification, and the graduation rate as a percentage.

Usage

data(Graduate_tbl_df)

Format

A tibble with 12 observations and 3 variables:

school: Name of the school (character).
code: Code for the school (character).
percent: Graduation rate as a percentage (integer).

Details

The dataset name has been kept as 'Graduate_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Study Habits of Students in Two Matched School Districts

Description

This dataset, Habits_tbl_df, is a tibble containing data on the study habits of students in two matched school districts. It includes variables related to the number of hours students from each district spent on studying, the difference between the two groups, and the significance of this difference.

Usage

data(Habits_tbl_df)

Format

A tibble with 11 observations and 4 variables:

A: Number of study hours in the first school district (integer).
B: Number of study hours in the second school district (integer).
differ: Difference in study hours between the two school districts (integer).
signrks: Significance of the difference in study hours (numeric).

Details

The dataset name has been kept as 'Habits_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Effects of Open Education Reported by Hedges and Olkin (1985)

Description

This dataset, HedgesOlkin85_df, is a data frame containing data from Hedges and Olkin's 1985 study on the effects of open education. The dataset includes variables related to attitudes toward school and reading achievement in students, based on the meta-analysis reported in their work. This data was used to investigate the effects of open education.

Usage

data(HedgesOlkin85_df)

Format

A data frame with 4 observations and 6 variables:

study: Study identifier (numeric).
d_att: Effect size for attitude toward school (numeric).
d_ach: Effect size for reading achievement (numeric).
var_att: Variance of the attitude effect size (numeric).
cov_att_ach: Covariance between attitude and achievement (numeric).
var_ach: Variance of the achievement effect size (numeric).

Details

The dataset name has been kept as 'HedgesOlkin85_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.

Source

The dataset was taken from the metaSEM package.

References

Hedges, L. V., & Olkin, I. (1985). Statistical methods for meta-analysis. Orlando, FL: Academic Press.

Homework Hours: Private vs. Public High Schools

Description

This dataset, Homework_tbl_df, is a tibble containing data on the number of hours per week high school students spend on homework. The dataset compares students from private and public schools, providing insights into the study habits and academic workload of students in these two types of schools.

Usage

data(Homework_tbl_df)

Format

A tibble with 30 observations and 2 variables:

school: Type of school the student attends, either private or public (character).
time: Number of hours per week the student spends on homework (numeric).

Details

The dataset name has been kept as 'Homework_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Job satisfaction and stress level for 9 school teachers

Description

This dataset, Jobsat_tbl_df, is a tibble containing data on job satisfaction and stress levels for 9 school teachers. The dataset provides insights into how teachers' stress levels relate to their job satisfaction, which can be valuable for understanding workplace dynamics and improving teacher well-being.

Usage

data(Jobsat_tbl_df)

Format

A tibble with 9 observations and 2 variables:

wspt: Stress level of the teacher, measured on a scale (integer).
satisfaction: Job satisfaction level of the teacher (numeric).

Details

The dataset name has been kept as 'Jobsat_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

First Grade Reading Scores: With vs. Without Kindergarten

Description

This dataset, Kinder_tbl_df, is a tibble containing data on the reading scores of first grade children who attended kindergarten versus those who did not. The dataset provides insights into the impact of attending kindergarten on early reading abilities, which can help in evaluating the effectiveness of early childhood education programs.

Usage

data(Kinder_tbl_df)

Format

A tibble with 8 observations and 3 variables:

pair: Pair identifier for the group of children (integer).
kinder: Reading score for children who attended kindergarten (integer).
nokinder: Reading score for children who did not attend kindergarten (integer).

Details

The dataset name has been kept as 'Kinder_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Reading skills of 24 matched low ability students

Description

This dataset, Lowabil_tbl_df, is a tibble containing data on the reading skills of 24 students with low abilities, who were matched based on certain characteristics. The dataset compares the performance of the students in two different groups: the experimental group and the control group. This can be used to evaluate the effectiveness of an intervention or treatment in improving reading skills.

Usage

data(Lowabil_tbl_df)

Format

A tibble with 12 observations and 3 variables:

pair: Pair identifier for the matched students (integer).
experiment: Reading score for students in the experimental group (integer).
control: Reading score for students in the control group (integer).

Details

The dataset name has been kept as 'Lowabil_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Standardized Math Test Scores for 30 Students

Description

This dataset, Math_scores_tbl_df, is a tibble containing the standardized math test scores of 30 students. The dataset is useful for analyzing the distribution of math scores and exploring factors that might influence math performance in educational settings.

Usage

data(Math_scores_tbl_df)

Format

A tibble with 30 observations and 1 variable:

score: Standardized math test score (integer).

Details

The dataset name has been kept as 'Math_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Incoming Freshmen Math Competency at a Community College

Description

This dataset, Mathcomp_tbl_df, is a tibble containing the standardized math competency scores for a group of 31 entering freshmen at a small community college. The dataset provides insights into the initial math skills of incoming students, which could be useful for evaluating preparedness and designing interventions to improve academic success.

Usage

data(Mathcomp_tbl_df)

Format

A tibble with 31 observations and 1 variable:

score: Standardized math competency score (integer).

Details

The dataset name has been kept as 'Mathcomp_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Math Proficiency and SAT Math Scores by State

Description

This dataset, Mathpro_tbl_df, is a tibble containing information on math proficiency percentages, SAT math scores, and group classifications for 51 states in the United States. The dataset provides insights into how math proficiency and SAT scores vary by state, allowing for comparative analysis and exploration of regional trends in mathematics education.

Usage

data(Mathpro_tbl_df)

Format

A tibble with 51 observations and 4 variables:

state: Name of the state (character).
sat_math: Average SAT math score for the state (integer).
profic: Math proficiency percentage for the state (numeric).
group: Group classification for the state (integer).

Details

The dataset name has been kept as 'Mathpro_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Graduate School Applicants' Miller Personality Test Scores

Description

This dataset, Miller_scores_tbl_df, is a tibble containing the Miller Personality Test scores for 25 college students applying for graduate school. This dataset allows for an exploration of personality traits and their potential correlation with academic or professional success.

Usage

data(Miller_scores_tbl_df)

Format

A tibble with 25 observations and 1 variable:

miller: Miller Personality Test score for a student (integer).

Details

The dataset name has been kept as 'Miller_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Music Recognition Learning in Identical Twins

Description

This dataset, Music_tbl_df, is a tibble containing improvement scores for identical twins taught music recognition using two different techniques. Each pair of twins was taught using both methods, and the differences in their improvement scores were recorded.

Usage

data(Music_tbl_df)

Format

A tibble with 12 observations and 3 variables:

method1: Improvement scores using the first music recognition technique (integer).
method2: Improvement scores using the second music recognition technique (integer).
differ: Difference in improvement scores between the two methods (integer).

Details

The dataset name has been kept as 'Music_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Relationship Between PSAT and SAT Scores

Description

This dataset, PSAT_SAT_tbl_df, is a tibble containing paired data on PSAT and SAT scores for a sample of students. It allows for the exploration of the relationship between performance on the PSAT and SAT exams.

Usage

data(PSAT_SAT_tbl_df)

Format

A tibble with 7 observations and 2 variables:

psat: PSAT scores (integer).
sat: SAT scores (integer).

Details

The dataset name has been kept as 'PSAT_SAT_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

State University Freshmen: Parental Education Profiles

Description

This dataset, Parented_tbl_df, is a tibble containing information about the education levels of parents of 200 entering freshmen at a state university. The dataset provides insights into the distribution of parental education backgrounds and their possible influence on students' academic paths.

Usage

data(Parented_tbl_df)

Format

A tibble with 200 observations and 2 variables:

education: Education level of the parent (factor with 6 levels).
parent: Parent type (factor with 2 levels: "mother" or "father").

Details

The dataset name has been kept as 'Parented_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Racial Prejudice Scores of High School Students

Description

This dataset, Prejudic_tbl_df, is a tibble containing scores measuring racial prejudice among a sample of 25 high school students. The dataset provides insights into attitudes related to racial prejudice within this demographic.

Usage

data(Prejudic_tbl_df)

Format

A tibble with 25 observations and 1 variable:

prejud: Racial prejudice score (integer).

Details

The dataset name has been kept as 'Prejudic_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Effects of Learning Methods on Statistics Students

Description

This dataset, Program_stats_tbl_df, is a tibble containing information about the effects of four different methods of programmed learning on statistics students. It includes the learning method used and the corresponding scores of the students.

Usage

data(Program_stats_tbl_df)

Format

A tibble with 44 observations and 2 variables:

method: Programmed learning method (character).
score: Score obtained by the student (integer).

Details

The dataset name has been kept as 'Program_stats_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Quiz vs Lecture Pulse Rates

Description

This dataset, QuizPulse10_df, is a data frame containing paired data on pulse rates for 10 students during a quiz and a lecture. The dataset allows for the exploration of how pulse rates differ during these two activities.

Usage

data(QuizPulse10_df)

Format

A data frame with 10 observations and 3 variables:

Student: Student ID (integer).
Quiz: Pulse rate during the quiz (integer).
Lecture: Pulse rate during the lecture (integer).

Details

The dataset name has been kept as 'QuizPulse10_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.

Source

The dataset was taken from the Lock5Data package.

Reading Scores vs IQ Scores

Description

This dataset, Readiq_tbl_df, is a tibble containing paired data on reading scores and IQ scores for a sample of individuals. It allows for the exploration of the relationship between reading ability and IQ scores.

Usage

data(Readiq_tbl_df)

Format

A tibble with 14 observations and 2 variables:

reading: Reading scores (integer).
iq: IQ scores (integer).

Details

The dataset name has been kept as 'Readiq_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Schizophrenic Learning Ability After Tranquilizer Dose

Description

This dataset, Schizoph_tbl_df, is a tibble containing standardized exam scores for 13 patients. The data was collected to investigate the learning ability of schizophrenics after being given a specified dose of a tranquilizer.

Usage

data(Schizoph_tbl_df)

Format

A tibble with 13 observations and 1 variable:

score: Standardized exam scores (integer).

Details

The dataset name has been kept as 'Schizoph_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Hours Studied by Freshmen

Description

This dataset, Study_freshmen_tbl_df, is a tibble containing the number of hours studied per week by a sample of 50 freshmen. It allows for the exploration of study habits and potentially provides insights into the relationship between study time and academic performance among college freshmen.

Usage

data(Study_freshmen_tbl_df)

Format

A tibble with 50 observations and 1 variable:

hours: Number of hours studied per week (integer).

Details

The dataset name has been kept as 'Study_freshmen_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Tennessee Self-Concept Scores for Gifted Students

Description

This dataset, Tennessee_gifted_df, is a data frame containing the self-concept scores for 20 gifted high school students in Tennessee. It allows for the exploration of self-concept in the context of gifted students and may provide insights into their self-perception and potential academic achievement.

Usage

data(Tennessee_gifted_df)

Format

A data frame with 20 observations and 1 variable:

score: Self-concept scores for gifted students (numeric).

Details

The dataset name has been kept as 'Tennessee_gifted_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Textbook Costs for Different Fields

Description

This dataset, TextbookCosts_df, is a data frame containing information about the number of textbooks and their total costs for different fields of study. It allows for the exploration of textbook expenses in various academic disciplines.

Usage

data(TextbookCosts_df)

Format

A data frame with 40 observations and 3 variables:

Field: Field of study (factor with 4 levels).
Books: Number of textbooks (integer).
Cost: Total cost of textbooks (integer).

Details

The dataset name has been kept as 'TextbookCosts_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.

Source

The dataset was taken from the Lock5Data package.

Student Admissions at UC Berkeley

Description

This dataset, UCBAdmissions_table, is a contingency table containing aggregate data on applicants to graduate school at UC Berkeley in 1973. The data is classified by admission status, gender, and department, and it provides insights into admissions patterns for the six largest departments.

Usage

data(UCBAdmissions_table)

Format

A contingency table with 24 cells (2 × 2 × 6), classified by:

Admit: Admission status (Admitted, Rejected).
Gender: Gender of the applicants (Male, Female).
Dept: Department (A, B, C, D, E, F).

The table entries are the number of applicants.

Details

The dataset name has been kept as 'UCBAdmissions_table' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.

Source

The dataset was taken from the datasets package.

Ability and Intelligence Tests

Description

This dataset, ability_list, is a list containing information about six ability and intelligence tests administered to 112 individuals. The dataset provides a covariance matrix, the means (centers) of the variables, and the number of observations.

Usage

data(ability_list)

Format

A list with 3 components:

cov: A covariance matrix (numeric matrix) of dimensions 6x6, representing the relationships between six different tests.
center: A numeric vector of length 6 containing the mean scores for each of the six tests.
n.obs: The total number of observations (integer), which is 112.

Details

The dataset name has been kept as 'ability_list' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'list' indicates that the dataset is a list object. The original content has not been modified in any way.

Source

Generated for educational purposes.

Absenteeism from School in New South Wales

Description

This dataset, absenteeism_tbl_df, is a tibble containing information about absenteeism from school and certain demographic characteristics of children in rural New South Wales, Australia. The dataset includes data from 146 randomly sampled students during a particular school year, providing insights into the relationships between absenteeism and variables such as ethnicity, sex, age group, and learning categories.

Usage

data(absenteeism_tbl_df)

Format

A tibble with 146 observations and 5 variables:

eth: Ethnicity of the student (factor with 2 levels).
sex: Sex of the student (factor with 2 levels).
age: Age group of the student (factor with 4 levels).
lrn: Learning category of the student, e.g., slow learner or not (factor with 2 levels).
days: Number of days the student was absent from school (integer).

Details

The dataset name has been kept as 'absenteeism_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

Data collected from a study in rural New South Wales, Australia.

Community college housing (simulated data)

Description

This dataset, cchousing_tbl_df, is a tibble containing simulated data on housing prices for students at a community college. The dataset provides the housing prices for 75 students, offering insights into the distribution of housing prices in this educational setting.

Usage

data(cchousing_tbl_df)

Format

A tibble with 75 observations and 1 variable:

price: Housing price for the student (numeric).

Details

The dataset name has been kept as 'cchousing_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

College Credits

Description

This dataset, credits_tbl_df, is a tibble containing simulated data on the number of college credits taken by students each semester. The dataset includes data from 100 students, providing insights into the distribution of credits taken by students in a college setting.

Usage

data(credits_tbl_df)

Format

A tibble with 100 observations and 1 variable:

credits: Number of college credits taken by the student (integer).

Details

The dataset name has been kept as 'credits_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package (simulated data).

US Crime Rates vs. High School Dropout

Description

This dataset, crime_degree_tbl_df, is a tibble containing data on crime rates and the percentage of the population without a high school degree in 51 U.S. states. The dataset includes information on the crime rate and the percentage of the population without a high school degree for each state.

Usage

data(crime_degree_tbl_df)

Format

A tibble with 51 observations and 3 variables:

state: State name (character).
nodegree: Percentage of the population without a high school degree (numeric).
crime: Crime rate (numeric).

Details

The dataset name has been kept as 'crime_degree_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the BSDA package.

Drug use of students and parents

Description

This dataset, drug_use_tbl_df, is a tibble containing data on the drug use of students and their parents. The dataset summarizes 445 student-parent pairs, with each pair indicating whether the student and/or their parent has used drugs, specifically marijuana.

Usage

data(drug_use_tbl_df)

Format

A tibble with 445 observations and 2 variables:

student: Whether the student has used drugs (factor with 2 levels).
parents: Whether the parent has used drugs (factor with 2 levels).

Details

The dataset name has been kept as 'drug_use_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package. Ellis GJ and Stone LH. 1979. Marijuana Use in College: An Evaluation of a Modeling Explanation. Youth and Society 10:323-334.

Exam Scores: Class of 19 Students

Description

This dataset, exams_tbl_df, is a tibble containing the exam scores of a class of 19 students. The dataset includes information on their performance in a specific exam.

Usage

data(exams_tbl_df)

Format

A tibble with 19 observations and 1 variable:

scores: Exam scores of students (integer).

Details

The dataset name has been kept as 'exams_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Female College Student Heights

Description

This dataset, fheights_tbl_df, is a tibble containing the heights of 24 female college students, measured in inches. The dataset provides insight into the physical characteristics of a specific demographic group.

Usage

data(fheights_tbl_df)

Format

A tibble with 24 observations and 1 variable:

heights: Heights of female college students, measured in inches (integer).

Details

The dataset name has been kept as 'fheights_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Analytical Skills of Young Gifted Children

Description

This dataset, gifted_tbl_df, is a tibble containing data on the analytical skills of 36 young gifted children, along with several factors that may influence these skills. The dataset includes measures such as parental IQ, early developmental milestones, and television habits.

Usage

data(gifted_tbl_df)

Format

A tibble with 36 observations and 8 variables:

score: Analytical skills score of the child (integer).
fatheriq: IQ of the father (integer).
motheriq: IQ of the mother (integer).
speak: Age in months when the child first said "mummy" or "daddy" (integer).
count: Age in months when the child first counted to 10 successfully (integer).
read: Average number of hours per week the child's parents read to the child (numeric).
edutv: Average number of hours per week the child watched educational TV programs in the past three months (numeric).
cartoons: Average number of hours per week the child watched cartoons on TV in the past three months (numeric).

Details

The dataset name has been kept as 'gifted_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package. Graybill, F.A. & Iyer, H.K., (1994) *Regression Analysis: Concepts and Applications*, Duxbury, p. 511-6.

Survey of Duke Students on GPA, Studying, and More

Description

This dataset, gpa_tbl_df, is a tibble containing survey data from 55 Duke University students. It includes information on students' GPA, weekly study hours, average hours of sleep per night, time spent going out per week, and gender. The dataset provides valuable insights into the relationship between academic performance and lifestyle habits.

Usage

data(gpa_tbl_df)

Format

A tibble with 55 observations and 5 variables:

gpa: Grade Point Average (numeric).
studyweek: Number of hours spent studying per week (integer).
sleepnight: Average hours of sleep per night (numeric).
out: Average hours spent going out per week (numeric).
gender: Gender of the student (factor with levels "Male" and "Female").

Details

The dataset name has been kept as 'gpa_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Relationship Between Watching TV and Grades (Simulated data)

Description

This dataset, gradestv_tbl_df, is a tibble containing simulated data for analyzing the relationship between the number of hours per week students watch TV and their grades in a statistics class. It provides a simple dataset for exploring correlations or regression models in educational settings.

Usage

data(gradestv_tbl_df)

Format

A tibble with 25 observations and 2 variables:

tv: Number of hours per week students watch TV (integer).
grades: Grade obtained in a statistics class (integer).

Details

The dataset name has been kept as 'gradestv_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Survey of Duke students and the area of their major

Description

This dataset, major_survey_tbl_df, is a tibble containing information about the GPAs of 218 Duke students and their academic major. The dataset provides insights into the relationship between GPA and the field of study, allowing for an exploration of how academic performance varies across different majors.

Usage

data(major_survey_tbl_df)

Format

A tibble with 218 observations and 2 variables:

gpa: Grade point average of the student (numeric).
major: Academic major of the student (factor with 3 levels).

Details

The dataset name has been kept as 'major_survey_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Simulated data for SAT score improvement

Description

This dataset, sat_improve_tbl_df, is a tibble containing simulated data on SAT score improvements for a sample of students who took a course from an SAT score improvement company. It allows for the exploration of score improvements from students who enrolled in the course.

Usage

data(sat_improve_tbl_df)

Format

A tibble with 30 observations and 1 variable:

sat_improve: SAT score improvement (integer).

Details

The dataset name has been kept as 'sat_improve_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Final exam scores for twenty students

Description

This dataset, stats_scores_tbl_df, is a tibble containing the final exam scores of 20 students. The data provides insights into the academic performance of the students in their final exam for a statistics course.

Usage

data(stats_scores_tbl_df)

Format

A tibble with 20 observations and 1 variable:

scores: Final exam scores (integer).

Details

The dataset name has been kept as 'stats_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

Sleep for 110 college students (simulated)

Description

This dataset, student_sleep_tbl_df, is a tibble containing the number of hours that 110 college students slept in a single night. The data allows for the analysis of sleep patterns among college students, potentially useful for understanding the relationship between sleep and academic performance, well-being, or other factors.

Usage

data(student_sleep_tbl_df)

Format

A tibble with 110 observations and 1 variable:

hours: Number of hours slept (numeric).

Details

The dataset name has been kept as 'student_sleep_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.

Source

The dataset was taken from the openintro package.

educationR: A Comprehensive Collection of Educational Datasets

Description

Details

Author(s)

See Also

Math Achievement Test Scores by Gender

Description

Usage

Format

Details

Source

Math Test Scores versus Anxiety Scores

Description

Usage

Format

Details

Source

Grad Rates: Athletes vs. Non-Athletes Big Ten Conf.

Description

Usage

Format

Details

Source

Test Scores on First Exam in Biology Class

Description

Usage

Format

Details

Source

Education Level of Blacks by Gender

Description

Usage

Format

Details

Source

Books Read & Spelling Scores: 17 Third Graders

Description

Usage

Format

Details

Source

Detroit Auto Workers: Education Levels

Description

Usage

Format

Details

Source

Dev Students: 2-Year & 4-Year College Demographics

Description

Usage

Format

Details

Source

Fall '95 Developmental Math: Failed Student Scores

Description

Usage

Format

Details

Source

Dyslexia: Data on College Students Diagnosed with Dyslexia

Description

Usage

Format

Details

Source

Engineering Grads Salary: 10 Years Post-Graduation

Description

Usage

Format

Details

Source

College entrance exam scores for 24 high school seniors

Description

Usage

Format

Details

Source

High School GPA versus College GPA

Description

Usage