| Type: | Package | 
| Title: | A Comprehensive Collection of Educational Datasets | 
| Version: | 0.1.0 | 
| Maintainer: | Renzo Caceres Rossi <arenzocaceresrossi@gmail.com> | 
| Description: | Provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics. This package serves as a resource for educational researchers, data analysts, and statisticians to explore and analyze data in the field of education. | 
| License: | GPL-3 | 
| URL: | https://github.com/lightbluetitan/educationr, https://lightbluetitan.github.io/educationr/ | 
| BugReports: | https://github.com/lightbluetitan/educationr/issues | 
| Encoding: | UTF-8 | 
| LazyData: | true | 
| Suggests: | ggplot2, dplyr, knitr, rmarkdown, testthat (≥ 3.0.0) | 
| RoxygenNote: | 7.3.2 | 
| VignetteBuilder: | knitr | 
| NeedsCompilation: | no | 
| Packaged: | 2024-12-03 23:09:29 UTC; renzocrossi | 
| Author: | Renzo Caceres Rossi [aut, cre] | 
| Depends: | R (≥ 3.5.0) | 
| Repository: | CRAN | 
| Date/Publication: | 2024-12-05 18:50:04 UTC | 
educationR: A Comprehensive Collection of Educational Datasets
Description
This package provides a comprehensive collection of datasets related to education, covering topics such as student performance, learning methods, test scores, absenteeism, and other educational metrics.
Details
educationR: A Comprehensive Collection of Educational Datasets
 
A Comprehensive Collection of Educational Datasets.
Author(s)
Maintainer: Renzo Cáceres Rossi arenzocaceresrossi@gmail.com
See Also
Useful links:
Math Achievement Test Scores by Gender
Description
This dataset, Achieve_tbl_df, is a tibble containing information about math achievement test scores for 25 high school students, categorized by gender. The dataset provides insights into the distribution of scores between male and female students.
Usage
data(Achieve_tbl_df)
Format
A tibble with 25 observations and 2 variables:
- score
- Math achievement test score of the student (integer). 
- gender
- Gender of the student (factor with 2 levels: "Male", "Female"). 
Details
The dataset name has been kept as 'Achieve_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Math Test Scores versus Anxiety Scores
Description
This dataset, Anxiety_tbl_df, is a tibble containing information about math test scores and anxiety levels before the test for 20 students. The dataset provides insights into the relationship between anxiety levels and math test performance.
Usage
data(Anxiety_tbl_df)
Format
A tibble with 20 observations and 2 variables:
- anxiety
- Anxiety score of the student before taking the math test (integer). 
- math
- Math test score of the student (integer). 
Details
The dataset name has been kept as 'Anxiety_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Grad Rates: Athletes vs. Non-Athletes Big Ten Conf.
Description
This dataset, Bigten_tbl_df, is a tibble containing information about the graduation rates of student athletes and nonathletes in the Big Ten Conference. The dataset includes data from two academic years, showing the graduation rates by school and athlete status (athletes vs nonathletes).
Usage
data(Bigten_tbl_df)
Format
A tibble with 44 observations and 4 variables:
- school
- Name of the school (character). 
- year
- Year of the data (factor with 2 levels). 
- rate
- Graduation rate percentage (integer). 
- status
- Athlete status (character, either 'Athlete' or 'Nonathlete'). 
Details
The dataset name has been kept as 'Bigten_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
NCAA Graduation Rates Report, 2000.
Test Scores on First Exam in Biology Class
Description
This dataset, Biology_tbl_df, is a tibble containing the test scores of 30 students on their first exam in a biology class. The dataset provides insight into the distribution of scores among the students.
Usage
data(Biology_tbl_df)
Format
A tibble with 30 observations and 1 variable:
- score
- Test scores on the first biology exam (integer). 
Details
The dataset name has been kept as 'Biology_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Education Level of Blacks by Gender
Description
This dataset, Blackedu_tbl_df, is a tibble containing information about the education level of Black individuals, categorized by gender. The dataset includes 3800 observations and provides insights into the distribution of education levels across different gender groups.
Usage
data(Blackedu_tbl_df)
Format
A tibble with 3800 observations and 2 variables:
- gender
- Gender of the individual (factor with 2 levels). 
- education
- Education level of the individual (factor with 5 levels). 
Details
The dataset name has been kept as 'Blackedu_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Bureau of Census data.
Books Read & Spelling Scores: 17 Third Graders
Description
This dataset, Books_tbl_df, is a tibble containing information about the number of books read and spelling scores for 17 third-grade students. The dataset provides insights into the relationship between the number of books read and spelling scores in this group of students.
Usage
data(Books_tbl_df)
Format
A tibble with 17 observations and 2 variables:
- book
- Number of books read by the student (integer). 
- spelling
- Spelling score of the student (integer). 
Details
The dataset name has been kept as 'Books_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Detroit Auto Workers: Education Levels
Description
This dataset, Detroit_tbl_df, is a tibble containing data on the educational levels of a sample of 40 auto workers in Detroit. The dataset includes information on the highest level of education attained by each worker.
Usage
data(Detroit_tbl_df)
Format
A tibble with 40 observations and 1 variable:
- educ
- Educational level of the auto worker (integer). 
Details
The dataset name has been kept as 'Detroit_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Dev Students: 2-Year & 4-Year College Demographics
Description
This dataset, Develop_tbl_df, is a tibble containing demographic data on students enrolled in developmental education programs at 2-year and 4-year colleges. The dataset includes information on the racial background of the students and whether they are enrolled in 2-year or 4-year colleges.
Usage
data(Develop_tbl_df)
Format
A tibble with 5656 observations and 2 variables:
- race
- Racial background of the student (factor with 5 levels). 
- college
- Type of college the student is enrolled in (factor with 2 levels: 2-year or 4-year). 
Details
The dataset name has been kept as 'Develop_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from "Research in Development Education" (1994), V. 11, 2.
Fall '95 Developmental Math: Failed Student Scores
Description
This dataset, Devmath_tbl_df, is a tibble containing test scores for students who failed developmental mathematics in the fall semester of 1995. The dataset includes the scores of these students as part of a simulated study.
Usage
data(Devmath_tbl_df)
Format
A tibble with 40 observations and 1 variable:
- score
- Test scores of students who failed developmental mathematics (integer). 
Details
The dataset name has been kept as 'Devmath_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package. Data provided by Dr. Anita Kitchens.
Dyslexia: Data on College Students Diagnosed with Dyslexia
Description
This dataset, Dyslexia_tbl_df, is a tibble containing data on a group of college students diagnosed with dyslexia. The dataset includes various personal characteristics such as age, gender, handedness, weight, height, and number of children, along with the number of words they were able to read correctly.
Usage
data(Dyslexia_tbl_df)
Format
A tibble with 8 observations and 7 variables:
- words
- Number of words read correctly (integer). 
- age
- Age of the student (integer). 
- gender
- Gender of the student (character). 
- handed
- Handedness of the student (character). 
- weight
- Weight of the student (integer). 
- height
- Height of the student (integer). 
- children
- Number of children the student has (integer). 
Details
The dataset name has been kept as 'Dyslexia_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Engineering Grads Salary: 10 Years Post-Graduation
Description
This dataset, Engineer_tbl_df, is a tibble containing salary data for engineering graduates 10 years after graduation. The dataset includes information on the salary of the graduates and the type of university they graduated from (categorized into three types of universities).
Usage
data(Engineer_tbl_df)
Format
A tibble with 51 observations and 2 variables:
- salary
- Salary of the engineering graduate 10 years after graduation (integer). 
- university
- Type of university the graduate attended (factor with 3 levels). 
Details
The dataset name has been kept as 'Engineer_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
College entrance exam scores for 24 high school seniors
Description
This dataset, Entrance_tbl_df, is a tibble containing the college entrance exam scores of 24 high school seniors. The dataset includes information on their exam scores.
Usage
data(Entrance_tbl_df)
Format
A tibble with 24 observations and 1 variable:
- score
- College entrance exam scores (integer). 
Details
The dataset name has been kept as 'Entrance_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
High School GPA versus College GPA
Description
This dataset, GPA_college_tbl_df, is a tibble containing data on the relationship between high school GPA and college GPA for 10 students. The dataset is valuable for analyzing academic performance correlations across different educational levels.
Usage
data(GPA_college_tbl_df)
Format
A tibble with 10 observations and 2 variables:
- hsgpa
- High school GPA (numeric). 
- collgpa
- College GPA (numeric). 
Details
The dataset name has been kept as 'GPA_college_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Before and after German copying errors post-course
Description
This dataset, German_tbl_df, is a tibble containing data on the number of errors made by 20 students when copying a German passage, both before and after participating in an experimental German course. The dataset provides insights into language learning and the effectiveness of the course.
Usage
data(German_tbl_df)
Format
A tibble with 20 observations and 3 variables:
- student
- Identifier for the student (character). 
- when
- Indicates whether the errors were recorded "before" or "after" the experimental course (character). 
- errors
- Number of errors made by the student when copying the German passage (integer). 
Details
The dataset name has been kept as 'German_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Test Grades in a Beginning Statistics Class
Description
This dataset, Grades_stats_tbl_df, is a tibble containing test grades for a beginning statistics class. It includes the grades of 29 students and provides insights into the performance distribution in an introductory statistics course.
Usage
data(Grades_stats_tbl_df)
Format
A tibble with 29 observations and 1 variable:
- grades
- Test grades (integer). 
Details
The dataset name has been kept as 'Grades_stats_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Graduation Rates for Student Athletes in the Southeastern Conf.
Description
This dataset, Graduate_tbl_df, is a tibble containing data on the graduation rates for student athletes in various schools within the Southeastern Conference. It includes the name of the school, a code for identification, and the graduation rate as a percentage.
Usage
data(Graduate_tbl_df)
Format
A tibble with 12 observations and 3 variables:
- school
- Name of the school (character). 
- code
- Code for the school (character). 
- percent
- Graduation rate as a percentage (integer). 
Details
The dataset name has been kept as 'Graduate_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Study Habits of Students in Two Matched School Districts
Description
This dataset, Habits_tbl_df, is a tibble containing data on the study habits of students in two matched school districts. It includes variables related to the number of hours students from each district spent on studying, the difference between the two groups, and the significance of this difference.
Usage
data(Habits_tbl_df)
Format
A tibble with 11 observations and 4 variables:
- A
- Number of study hours in the first school district (integer). 
- B
- Number of study hours in the second school district (integer). 
- differ
- Difference in study hours between the two school districts (integer). 
- signrks
- Significance of the difference in study hours (numeric). 
Details
The dataset name has been kept as 'Habits_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Effects of Open Education Reported by Hedges and Olkin (1985)
Description
This dataset, HedgesOlkin85_df, is a data frame containing data from Hedges and Olkin's 1985 study on the effects of open education. The dataset includes variables related to attitudes toward school and reading achievement in students, based on the meta-analysis reported in their work. This data was used to investigate the effects of open education.
Usage
data(HedgesOlkin85_df)
Format
A data frame with 4 observations and 6 variables:
- study
- Study identifier (numeric). 
- d_att
- Effect size for attitude toward school (numeric). 
- d_ach
- Effect size for reading achievement (numeric). 
- var_att
- Variance of the attitude effect size (numeric). 
- cov_att_ach
- Covariance between attitude and achievement (numeric). 
- var_ach
- Variance of the achievement effect size (numeric). 
Details
The dataset name has been kept as 'HedgesOlkin85_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.
Source
The dataset was taken from the metaSEM package.
References
Hedges, L. V., & Olkin, I. (1985). Statistical methods for meta-analysis. Orlando, FL: Academic Press.
Homework Hours: Private vs. Public High Schools
Description
This dataset, Homework_tbl_df, is a tibble containing data on the number of hours per week high school students spend on homework. The dataset compares students from private and public schools, providing insights into the study habits and academic workload of students in these two types of schools.
Usage
data(Homework_tbl_df)
Format
A tibble with 30 observations and 2 variables:
- school
- Type of school the student attends, either private or public (character). 
- time
- Number of hours per week the student spends on homework (numeric). 
Details
The dataset name has been kept as 'Homework_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Job satisfaction and stress level for 9 school teachers
Description
This dataset, Jobsat_tbl_df, is a tibble containing data on job satisfaction and stress levels for 9 school teachers. The dataset provides insights into how teachers' stress levels relate to their job satisfaction, which can be valuable for understanding workplace dynamics and improving teacher well-being.
Usage
data(Jobsat_tbl_df)
Format
A tibble with 9 observations and 2 variables:
- wspt
- Stress level of the teacher, measured on a scale (integer). 
- satisfaction
- Job satisfaction level of the teacher (numeric). 
Details
The dataset name has been kept as 'Jobsat_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
First Grade Reading Scores: With vs. Without Kindergarten
Description
This dataset, Kinder_tbl_df, is a tibble containing data on the reading scores of first grade children who attended kindergarten versus those who did not. The dataset provides insights into the impact of attending kindergarten on early reading abilities, which can help in evaluating the effectiveness of early childhood education programs.
Usage
data(Kinder_tbl_df)
Format
A tibble with 8 observations and 3 variables:
- pair
- Pair identifier for the group of children (integer). 
- kinder
- Reading score for children who attended kindergarten (integer). 
- nokinder
- Reading score for children who did not attend kindergarten (integer). 
Details
The dataset name has been kept as 'Kinder_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Reading skills of 24 matched low ability students
Description
This dataset, Lowabil_tbl_df, is a tibble containing data on the reading skills of 24 students with low abilities, who were matched based on certain characteristics. The dataset compares the performance of the students in two different groups: the experimental group and the control group. This can be used to evaluate the effectiveness of an intervention or treatment in improving reading skills.
Usage
data(Lowabil_tbl_df)
Format
A tibble with 12 observations and 3 variables:
- pair
- Pair identifier for the matched students (integer). 
- experiment
- Reading score for students in the experimental group (integer). 
- control
- Reading score for students in the control group (integer). 
Details
The dataset name has been kept as 'Lowabil_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Standardized Math Test Scores for 30 Students
Description
This dataset, Math_scores_tbl_df, is a tibble containing the standardized math test scores of 30 students. The dataset is useful for analyzing the distribution of math scores and exploring factors that might influence math performance in educational settings.
Usage
data(Math_scores_tbl_df)
Format
A tibble with 30 observations and 1 variable:
- score
- Standardized math test score (integer). 
Details
The dataset name has been kept as 'Math_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Incoming Freshmen Math Competency at a Community College
Description
This dataset, Mathcomp_tbl_df, is a tibble containing the standardized math competency scores for a group of 31 entering freshmen at a small community college. The dataset provides insights into the initial math skills of incoming students, which could be useful for evaluating preparedness and designing interventions to improve academic success.
Usage
data(Mathcomp_tbl_df)
Format
A tibble with 31 observations and 1 variable:
- score
- Standardized math competency score (integer). 
Details
The dataset name has been kept as 'Mathcomp_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Math Proficiency and SAT Math Scores by State
Description
This dataset, Mathpro_tbl_df, is a tibble containing information on math proficiency percentages, SAT math scores, and group classifications for 51 states in the United States. The dataset provides insights into how math proficiency and SAT scores vary by state, allowing for comparative analysis and exploration of regional trends in mathematics education.
Usage
data(Mathpro_tbl_df)
Format
A tibble with 51 observations and 4 variables:
- state
- Name of the state (character). 
- sat_math
- Average SAT math score for the state (integer). 
- profic
- Math proficiency percentage for the state (numeric). 
- group
- Group classification for the state (integer). 
Details
The dataset name has been kept as 'Mathpro_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Graduate School Applicants' Miller Personality Test Scores
Description
This dataset, Miller_scores_tbl_df, is a tibble containing the Miller Personality Test scores for 25 college students applying for graduate school. This dataset allows for an exploration of personality traits and their potential correlation with academic or professional success.
Usage
data(Miller_scores_tbl_df)
Format
A tibble with 25 observations and 1 variable:
- miller
- Miller Personality Test score for a student (integer). 
Details
The dataset name has been kept as 'Miller_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Music Recognition Learning in Identical Twins
Description
This dataset, Music_tbl_df, is a tibble containing improvement scores for identical twins taught music recognition using two different techniques. Each pair of twins was taught using both methods, and the differences in their improvement scores were recorded.
Usage
data(Music_tbl_df)
Format
A tibble with 12 observations and 3 variables:
- method1
- Improvement scores using the first music recognition technique (integer). 
- method2
- Improvement scores using the second music recognition technique (integer). 
- differ
- Difference in improvement scores between the two methods (integer). 
Details
The dataset name has been kept as 'Music_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Relationship Between PSAT and SAT Scores
Description
This dataset, PSAT_SAT_tbl_df, is a tibble containing paired data on PSAT and SAT scores for a sample of students. It allows for the exploration of the relationship between performance on the PSAT and SAT exams.
Usage
data(PSAT_SAT_tbl_df)
Format
A tibble with 7 observations and 2 variables:
- psat
- PSAT scores (integer). 
- sat
- SAT scores (integer). 
Details
The dataset name has been kept as 'PSAT_SAT_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
State University Freshmen: Parental Education Profiles
Description
This dataset, Parented_tbl_df, is a tibble containing information about the education levels of parents of 200 entering freshmen at a state university. The dataset provides insights into the distribution of parental education backgrounds and their possible influence on students' academic paths.
Usage
data(Parented_tbl_df)
Format
A tibble with 200 observations and 2 variables:
- education
- Education level of the parent (factor with 6 levels). 
- parent
- Parent type (factor with 2 levels: "mother" or "father"). 
Details
The dataset name has been kept as 'Parented_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Racial Prejudice Scores of High School Students
Description
This dataset, Prejudic_tbl_df, is a tibble containing scores measuring racial prejudice among a sample of 25 high school students. The dataset provides insights into attitudes related to racial prejudice within this demographic.
Usage
data(Prejudic_tbl_df)
Format
A tibble with 25 observations and 1 variable:
- prejud
- Racial prejudice score (integer). 
Details
The dataset name has been kept as 'Prejudic_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Effects of Learning Methods on Statistics Students
Description
This dataset, Program_stats_tbl_df, is a tibble containing information about the effects of four different methods of programmed learning on statistics students. It includes the learning method used and the corresponding scores of the students.
Usage
data(Program_stats_tbl_df)
Format
A tibble with 44 observations and 2 variables:
- method
- Programmed learning method (character). 
- score
- Score obtained by the student (integer). 
Details
The dataset name has been kept as 'Program_stats_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Quiz vs Lecture Pulse Rates
Description
This dataset, QuizPulse10_df, is a data frame containing paired data on pulse rates for 10 students during a quiz and a lecture. The dataset allows for the exploration of how pulse rates differ during these two activities.
Usage
data(QuizPulse10_df)
Format
A data frame with 10 observations and 3 variables:
- Student
- Student ID (integer). 
- Quiz
- Pulse rate during the quiz (integer). 
- Lecture
- Pulse rate during the lecture (integer). 
Details
The dataset name has been kept as 'QuizPulse10_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.
Source
The dataset was taken from the Lock5Data package.
Reading Scores vs IQ Scores
Description
This dataset, Readiq_tbl_df, is a tibble containing paired data on reading scores and IQ scores for a sample of individuals. It allows for the exploration of the relationship between reading ability and IQ scores.
Usage
data(Readiq_tbl_df)
Format
A tibble with 14 observations and 2 variables:
- reading
- Reading scores (integer). 
- iq
- IQ scores (integer). 
Details
The dataset name has been kept as 'Readiq_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Schizophrenic Learning Ability After Tranquilizer Dose
Description
This dataset, Schizoph_tbl_df, is a tibble containing standardized exam scores for 13 patients. The data was collected to investigate the learning ability of schizophrenics after being given a specified dose of a tranquilizer.
Usage
data(Schizoph_tbl_df)
Format
A tibble with 13 observations and 1 variable:
- score
- Standardized exam scores (integer). 
Details
The dataset name has been kept as 'Schizoph_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Hours Studied by Freshmen
Description
This dataset, Study_freshmen_tbl_df, is a tibble containing the number of hours studied per week by a sample of 50 freshmen. It allows for the exploration of study habits and potentially provides insights into the relationship between study time and academic performance among college freshmen.
Usage
data(Study_freshmen_tbl_df)
Format
A tibble with 50 observations and 1 variable:
- hours
- Number of hours studied per week (integer). 
Details
The dataset name has been kept as 'Study_freshmen_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Tennessee Self-Concept Scores for Gifted Students
Description
This dataset, Tennessee_gifted_df, is a data frame containing the self-concept scores for 20 gifted high school students in Tennessee. It allows for the exploration of self-concept in the context of gifted students and may provide insights into their self-perception and potential academic achievement.
Usage
data(Tennessee_gifted_df)
Format
A data frame with 20 observations and 1 variable:
- score
- Self-concept scores for gifted students (numeric). 
Details
The dataset name has been kept as 'Tennessee_gifted_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Textbook Costs for Different Fields
Description
This dataset, TextbookCosts_df, is a data frame containing information about the number of textbooks and their total costs for different fields of study. It allows for the exploration of textbook expenses in various academic disciplines.
Usage
data(TextbookCosts_df)
Format
A data frame with 40 observations and 3 variables:
- Field
- Field of study (factor with 4 levels). 
- Books
- Number of textbooks (integer). 
- Cost
- Total cost of textbooks (integer). 
Details
The dataset name has been kept as 'TextbookCosts_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.
Source
The dataset was taken from the Lock5Data package.
Student Admissions at UC Berkeley
Description
This dataset, UCBAdmissions_table, is a contingency table containing aggregate data on applicants to graduate school at UC Berkeley in 1973. The data is classified by admission status, gender, and department, and it provides insights into admissions patterns for the six largest departments.
Usage
data(UCBAdmissions_table)
Format
A contingency table with 24 cells (2 × 2 × 6), classified by:
- Admit
- Admission status (Admitted, Rejected). 
- Gender
- Gender of the applicants (Male, Female). 
- Dept
- Department (A, B, C, D, E, F). 
The table entries are the number of applicants.
Details
The dataset name has been kept as 'UCBAdmissions_table' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The original content has not been modified in any way.
Source
The dataset was taken from the datasets package.
Ability and Intelligence Tests
Description
This dataset, ability_list, is a list containing information about six ability and intelligence tests administered to 112 individuals. The dataset provides a covariance matrix, the means (centers) of the variables, and the number of observations.
Usage
data(ability_list)
Format
A list with 3 components:
- cov
- A covariance matrix (numeric matrix) of dimensions 6x6, representing the relationships between six different tests. 
- center
- A numeric vector of length 6 containing the mean scores for each of the six tests. 
- n.obs
- The total number of observations (integer), which is 112. 
Details
The dataset name has been kept as 'ability_list' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'list' indicates that the dataset is a list object. The original content has not been modified in any way.
Source
Generated for educational purposes.
Absenteeism from School in New South Wales
Description
This dataset, absenteeism_tbl_df, is a tibble containing information about absenteeism from school and certain demographic characteristics of children in rural New South Wales, Australia. The dataset includes data from 146 randomly sampled students during a particular school year, providing insights into the relationships between absenteeism and variables such as ethnicity, sex, age group, and learning categories.
Usage
data(absenteeism_tbl_df)
Format
A tibble with 146 observations and 5 variables:
- eth
- Ethnicity of the student (factor with 2 levels). 
- sex
- Sex of the student (factor with 2 levels). 
- age
- Age group of the student (factor with 4 levels). 
- lrn
- Learning category of the student, e.g., slow learner or not (factor with 2 levels). 
- days
- Number of days the student was absent from school (integer). 
Details
The dataset name has been kept as 'absenteeism_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
Data collected from a study in rural New South Wales, Australia.
Community college housing (simulated data)
Description
This dataset, cchousing_tbl_df, is a tibble containing simulated data on housing prices for students at a community college. The dataset provides the housing prices for 75 students, offering insights into the distribution of housing prices in this educational setting.
Usage
data(cchousing_tbl_df)
Format
A tibble with 75 observations and 1 variable:
- price
- Housing price for the student (numeric). 
Details
The dataset name has been kept as 'cchousing_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
College Credits
Description
This dataset, credits_tbl_df, is a tibble containing simulated data on the number of college credits taken by students each semester. The dataset includes data from 100 students, providing insights into the distribution of credits taken by students in a college setting.
Usage
data(credits_tbl_df)
Format
A tibble with 100 observations and 1 variable:
- credits
- Number of college credits taken by the student (integer). 
Details
The dataset name has been kept as 'credits_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package (simulated data).
US Crime Rates vs. High School Dropout
Description
This dataset, crime_degree_tbl_df, is a tibble containing data on crime rates and the percentage of the population without a high school degree in 51 U.S. states. The dataset includes information on the crime rate and the percentage of the population without a high school degree for each state.
Usage
data(crime_degree_tbl_df)
Format
A tibble with 51 observations and 3 variables:
- state
- State name (character). 
- nodegree
- Percentage of the population without a high school degree (numeric). 
- crime
- Crime rate (numeric). 
Details
The dataset name has been kept as 'crime_degree_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the BSDA package.
Drug use of students and parents
Description
This dataset, drug_use_tbl_df, is a tibble containing data on the drug use of students and their parents. The dataset summarizes 445 student-parent pairs, with each pair indicating whether the student and/or their parent has used drugs, specifically marijuana.
Usage
data(drug_use_tbl_df)
Format
A tibble with 445 observations and 2 variables:
- student
- Whether the student has used drugs (factor with 2 levels). 
- parents
- Whether the parent has used drugs (factor with 2 levels). 
Details
The dataset name has been kept as 'drug_use_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package. Ellis GJ and Stone LH. 1979. Marijuana Use in College: An Evaluation of a Modeling Explanation. Youth and Society 10:323-334.
Exam Scores: Class of 19 Students
Description
This dataset, exams_tbl_df, is a tibble containing the exam scores of a class of 19 students. The dataset includes information on their performance in a specific exam.
Usage
data(exams_tbl_df)
Format
A tibble with 19 observations and 1 variable:
- scores
- Exam scores of students (integer). 
Details
The dataset name has been kept as 'exams_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Female College Student Heights
Description
This dataset, fheights_tbl_df, is a tibble containing the heights of 24 female college students, measured in inches. The dataset provides insight into the physical characteristics of a specific demographic group.
Usage
data(fheights_tbl_df)
Format
A tibble with 24 observations and 1 variable:
- heights
- Heights of female college students, measured in inches (integer). 
Details
The dataset name has been kept as 'fheights_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Analytical Skills of Young Gifted Children
Description
This dataset, gifted_tbl_df, is a tibble containing data on the analytical skills of 36 young gifted children, along with several factors that may influence these skills. The dataset includes measures such as parental IQ, early developmental milestones, and television habits.
Usage
data(gifted_tbl_df)
Format
A tibble with 36 observations and 8 variables:
- score
- Analytical skills score of the child (integer). 
- fatheriq
- IQ of the father (integer). 
- motheriq
- IQ of the mother (integer). 
- speak
- Age in months when the child first said "mummy" or "daddy" (integer). 
- count
- Age in months when the child first counted to 10 successfully (integer). 
- read
- Average number of hours per week the child's parents read to the child (numeric). 
- edutv
- Average number of hours per week the child watched educational TV programs in the past three months (numeric). 
- cartoons
- Average number of hours per week the child watched cartoons on TV in the past three months (numeric). 
Details
The dataset name has been kept as 'gifted_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package. Graybill, F.A. & Iyer, H.K., (1994) *Regression Analysis: Concepts and Applications*, Duxbury, p. 511-6.
Survey of Duke Students on GPA, Studying, and More
Description
This dataset, gpa_tbl_df, is a tibble containing survey data from 55 Duke University students. It includes information on students' GPA, weekly study hours, average hours of sleep per night, time spent going out per week, and gender. The dataset provides valuable insights into the relationship between academic performance and lifestyle habits.
Usage
data(gpa_tbl_df)
Format
A tibble with 55 observations and 5 variables:
- gpa
- Grade Point Average (numeric). 
- studyweek
- Number of hours spent studying per week (integer). 
- sleepnight
- Average hours of sleep per night (numeric). 
- out
- Average hours spent going out per week (numeric). 
- gender
- Gender of the student (factor with levels "Male" and "Female"). 
Details
The dataset name has been kept as 'gpa_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Relationship Between Watching TV and Grades (Simulated data)
Description
This dataset, gradestv_tbl_df, is a tibble containing simulated data for analyzing the relationship between the number of hours per week students watch TV and their grades in a statistics class. It provides a simple dataset for exploring correlations or regression models in educational settings.
Usage
data(gradestv_tbl_df)
Format
A tibble with 25 observations and 2 variables:
- tv
- Number of hours per week students watch TV (integer). 
- grades
- Grade obtained in a statistics class (integer). 
Details
The dataset name has been kept as 'gradestv_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Survey of Duke students and the area of their major
Description
This dataset, major_survey_tbl_df, is a tibble containing information about the GPAs of 218 Duke students and their academic major. The dataset provides insights into the relationship between GPA and the field of study, allowing for an exploration of how academic performance varies across different majors.
Usage
data(major_survey_tbl_df)
Format
A tibble with 218 observations and 2 variables:
- gpa
- Grade point average of the student (numeric). 
- major
- Academic major of the student (factor with 3 levels). 
Details
The dataset name has been kept as 'major_survey_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Simulated data for SAT score improvement
Description
This dataset, sat_improve_tbl_df, is a tibble containing simulated data on SAT score improvements for a sample of students who took a course from an SAT score improvement company. It allows for the exploration of score improvements from students who enrolled in the course.
Usage
data(sat_improve_tbl_df)
Format
A tibble with 30 observations and 1 variable:
- sat_improve
- SAT score improvement (integer). 
Details
The dataset name has been kept as 'sat_improve_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Final exam scores for twenty students
Description
This dataset, stats_scores_tbl_df, is a tibble containing the final exam scores of 20 students. The data provides insights into the academic performance of the students in their final exam for a statistics course.
Usage
data(stats_scores_tbl_df)
Format
A tibble with 20 observations and 1 variable:
- scores
- Final exam scores (integer). 
Details
The dataset name has been kept as 'stats_scores_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.
Sleep for 110 college students (simulated)
Description
This dataset, student_sleep_tbl_df, is a tibble containing the number of hours that 110 college students slept in a single night. The data allows for the analysis of sleep patterns among college students, potentially useful for understanding the relationship between sleep and academic performance, well-being, or other factors.
Usage
data(student_sleep_tbl_df)
Format
A tibble with 110 observations and 1 variable:
- hours
- Number of hours slept (numeric). 
Details
The dataset name has been kept as 'student_sleep_tbl_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the educationR package and assists users in identifying its specific characteristics. The suffix 'tbl_df' indicates that the dataset is a tibble. The original content has not been modified in any way.
Source
The dataset was taken from the openintro package.