Increasing representation and diversity in health research: A protocol of the MYHealth Research Training Program for high school students ========================================================================================================================================= * Samantha A. Chuisano * Jane Rafferty * Alison Allen * Tammy Chang * Matthew Diemer * Kara Harris * Lisa M. Vaughn * Daphne C. Watkins * Melissa DeJonckheere ## Abstract **Background** Despite decades of calls for increased diversity in the health research workforce, disparities exist for many populations, including Black, Indigenous, and People of Color individuals, those from low-income families, and first-generation college students. To increase representation of historically marginalized populations, there is a critical need to develop programs that strengthen their path toward health research careers. High school is a critically important time to catalyze interest and rebuild engagement among youth who may have previously felt excluded from science, technology, engineering, and mathematics (STEM) and health research careers. **Methods** The overall objective of the MYHealth program is to engage high school students in a community-based participatory research program focused on adolescent health. Investigators will work alongside community partners to recruit 9th through 12th graders who self-identify as a member of a group underrepresented in STEM or health research careers (e.g., based on race and ethnicity, socioeconomic status, first generation college student, disability, etc.). MYHealth students are trained to be co-researchers who work alongside academic researchers, which will help them to envision themselves as scientists capable of positively impacting their communities through research. Implemented in three phases, the MYHealth program aims to foster a continuing interest in health research careers by developing: 1) researcher identities, 2) scientific literacy, 3) scientific self-efficacy, and 4) teamwork and leadership self-efficacy. In each phase, students will build knowledge and skills in research, ethics, data collection, data analysis, and dissemination. Students will directly collaborate with and be mentored by a team that includes investigators, community advisors, scientific advisors, and youth peers. **Discussion** Each year, a new cohort of up to 70 high school students will be enrolled in MYHealth. We anticipate the MYHealth program will increase interest and persistence in STEM and health research among groups that have been historically excluded in health research careers. Keywords * children and adolescents * education * research * science * technology * engineering & math (STEM) * health disparities * community-based participatory research (CBPR) * identity * health sciences research ## Introduction Despite decades of calls for increased diversity in the research workforce, several groups continue to be excluded and thereby underrepresented in Science, Technology, Engineering, and Math (STEM) careers in the United States [1]. In 2019, Black, Indigenous, and People of Color (BIPOC) individuals were awarded only 11.7% of STEM research doctorates, despite making up about a third of the U.S. population and the workforce [1]. Likewise, while women comprised 47% of the workforce, they received only 42.2% of STEM doctorates [1]. Parental education is also associated with postsecondary enrollment, which is an important step on the trajectory toward many STEM professions. College enrollment directly after high school is lower among children whose parents did not graduate high school (50%), attend college (57%), or graduate college (67%), compared to those whose parents have completed a bachelor’s degree (84%) [2]. Low-income youth also continue to graduate high school and enroll in college at lower rates (65% vs. 83%) than their middle- and high-income peers [2]. While these figures have improved slightly in the last decade, significant work is still needed to address these inequities [3, 4]. An initial and persistent interest in science is driven largely by social and environmental factors, making those from historically excluded groups more vulnerable to barriers such as family factors (e.g., attitudes toward science careers, parent interest in scientific activities), pedagogical factors (e.g., teaching culture, classroom culture), extracurricular activities (e.g., science clubs, out-of-school experiences), and social factors (e.g., models of science interest, peer culture), which in turn impact one’s expectations and beliefs that they can succeed in science [5, 6]. High school and adolescence are critical windows to intervene in developing an interest in STEM. This period of the life course is centered on identity development and is when youth begin to understand their skills, interests, and values and begin to make career decisions [6]. When youth are encouraged to practice science and strengthen their beliefs about their efficacy, they are more likely to express interest, achieve, and persist in science. However, this is not the most likely experience for youth from historically excluded groups, who may experience school practices that do not nurture science interests and capacities [2, 7] and have access to fewer out-of-school science experiences, mentorship, and science role models [2, 5]. Despite school requirements for science curricula, some high school students still lack meaningful and relevant opportunities to engage in science and research [8, 9]. Research suggests that teens from historically excluded groups often view school science as “hard” and “discouraging” and do not see science as relevant to their everyday lives [5, 9-11]. Moreover, students may not know any science professionals in their daily lives, making science and research seem abstract. When paired with a lack of advocates, mentors, and support systems, this leaves students from historically excluded groups with more negative experiences of science [11]. On the other hand, when students experience success with science, receive support from parents and other adults, and develop self-efficacy in science, they are more likely to have positive attitudes toward science [8]. Out-of-school experiences, including summer research experiences and after-school programs can help students develop an interest in science, improve science self-efficacy, increase their science knowledge and skills, and build commitment and persistence in high school that may improve enrollment and retention in undergraduate and graduate school programs in the future [12-14]. Critical to the success and effectiveness of programs is exposure to developmentally- and personally-relevant research opportunities, mentorship and social support, and experiences that encourage empowerment, agency, and a sense of purpose [9]. Increasing the participation and success of researchers from historically excluded groups is essential for strengthening innovation and global leadership in health research [15]. Diverse and inclusive scientific teams not only generate novel research questions and methods, but by bringing their life experiences and perspectives, they also offer new problem-solving approaches to persistent issues in research [16]. Furthermore, tackling the innumerable health disparities faced by historically excluded groups requires the support of diverse scientific teams. Thus, the need for a diverse health research workforce serves as a comprehensive theoretical and empirical foundation for the Michigan Youth Health (MYHealth) program [6, 11, 12, 17]. The overall goal of the MYHealth research training program is to address the persistent problem of underrepresentation of historically excluded groups in health research careers. This project combines an innovative, youth-centered research program with a community-based participatory research (CBPR) approach to partner with high school students as co-researchers. The training program aims to help youth build persistence toward health research careers by developing: 1) researcher identities, 2) scientific literacy, 3) scientific self-efficacy, and 4) teamwork and leadership self-efficacy. Engaging high school students in a research training program is advantageous because postsecondary decisions and occupational interests are often crystallized and plans formulated during adolescence [9]. ### Conceptual overview Because students from excluded groups are more likely to encounter external barriers (e.g., fewer STEM role models, less access to rigorous STEM coursework) that are associated with dropping out of the STEM and health research pipeline, it is imperative to understand how to support their persistence toward science careers effectively [18-20]. Based on an ethnically diverse sample, Chemers and colleagues demonstrated that researcher identity, science self-efficacy, and leadership and teamwork self-efficacy mediated the relationship between science support experiences and persistence in science and health research careers [18]. Extrapolating from these findings, researchers have validated the effectiveness of interventions that incorporate a range of out-of-school programs that provide opportunities for research experiences, mentoring, and community involvement [12-14]. Building on Chemers’ framework for understanding persistence in science, MYHealth integrates social cognitive theory, expectancy-value theory of achievement motivation, and CBPR. The MYHealth framework is outlined in Fig 1. ![Fig 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/03/2023.02.02.23285366/F1.medium.gif) [Fig 1.](http://medrxiv.org/content/early/2023/02/03/2023.02.02.23285366/F1) Fig 1. MYHealth conceptual framework, adapted from Chemers et al., 2011 ### Social cognitive theory (SCT) SCT seeks to explain human learning and motivation and has been applied to domain-specific learning (e.g., science learning) across fields [21-25]. SCT identifies three components that interact to influence learning: personal characteristics (e.g., self-efficacy), environmental characteristics (e.g., school, support, community), and behavior (e.g., pursuit of opportunities). These dynamics have been extensively studied in relation to science interest, achievement, and career choice [18, 26, 27], and among historically excluded youth [28, 29]. Research demonstrates that students from historically excluded groups have fewer opportunities to have their science self-efficacy beliefs nurtured; these dynamics, in turn, suppress motivation and science self-efficacy beliefs and ultimately contribute to fewer youth pursuing science degrees and research careers [30-32]. ### Expectancy-value theory of achievement motivation Linked to SCT, the core of the expectancy-value theory of achievement motivation is that students’ achievement and decisions related to achievement are determined by expectancies for success in that domain and the perceived value of that domain in their lives [33, 34]. According to this theory, achieving and maintaining an interest in science results from students’ beliefs about their likelihood of success in science and the value they assign to science. Applied to MYHealth, expectancy-value theory posits that students from excluded groups can be supported to view science and research careers as attainable and important through opportunities that: 1) increase positive perceptions related to their science and research skills, characteristics, and competencies and 2) increase positive perceptions related to their personal values and goals. Taken together, science- and research-related SCT and expectancy-value theory help us to understand the processes that contribute to persistence in science and research and associated careers, particularly among historically excluded students. ### Community-based participatory research (CBPR) CBPR is built on core principles that support partnership, capacity building, co-learning, and action toward equity and social justice [26, 35, 36] and has decades of research demonstrating its effectiveness in creating partnerships and enhancing the quality and rigor of research [24, 37-41]. CBPR follows the research cycle and includes building a research team (with nontraditional research partners, like youth), identifying the needs and priorities of a community, asking questions, gathering data, interpreting data from the lens of the community, and implementing action strategies toward health and social justice. Using CBPR as a model for engaging youth as co-researchers benefits youth directly (e.g., research training, communication skills, problem-solving and critical thinking skills, leadership, preparedness for college), their communities (e.g., community literacy, improved programs and services, improved policy), and research quality (e.g., improved recruitment, retention, sustainability) [42, 43]. Given its overall inclusive and equitable approach, CBPR is a compelling and effective methodology for engaging students from marginalized groups and supporting their long-term interest in science. In summary, the MYHealth program builds on the theoretical underpinnings of SCT, expectancy-value theory of achievement motivation, and a CBPR approach to create an out-of-school research experience for youth that includes experiences with an applied adolescent health research project, instrumental and socioemotional mentoring from academic researchers and a community of learners, and community involvement through research projects that impact youth and their communities. Through these program components, we will impact four primary psychological and cognitive processes: researcher identity, science self-efficacy, leadership and teamwork self-efficacy, and scientific literacy. By using MYHealth to develop these domains among youth co-researchers, we will achieve our long-term goal of increasing persistence toward health research careers. ### Overview of the MYHealth program The MYHealth program is a research training program with the long-term goal of increasing the representation of groups traditionally excluded from biomedical, behavioral, and clinical research, including individuals from low-income backgrounds, first-generation college students, and racial and ethnic minority youth. The MYHealth program will enroll 4 successive 12-month cohorts of high school students who self-identify as underrepresented in STEM. The MYHealth program achieves its overall aim via multiple phases each year, as outlined in Fig 2. The Summer Launch, Impact Projects, and Peer Leadership phases each have specific goals, activities, outputs, and outcomes that are summarized in Table 1 and detailed in the sections that follow. The program curriculum was developed through an iterative process of piloting and feedback from high school students from the local population. A condensed curriculum was presented to a pilot cohort of participants and the refined curriculum is detailed below. Each year, modifications will be made based on our formative evaluation. View this table: [Table 1.](http://medrxiv.org/content/early/2023/02/03/2023.02.02.23285366/T1) Table 1. MYHealth program struture ![Fig 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2023/02/03/2023.02.02.23285366/F2.medium.gif) [Fig 2.](http://medrxiv.org/content/early/2023/02/03/2023.02.02.23285366/F2) Fig 2. MYHealth program overview ### Summer launch The Summer Launch will be a 10-day training in CBPR and health research methodologies for up to 70 students each year. The overall goal is to train high school students to be co-researchers who can work alongside academic researchers on studies of adolescent health to develop their interest in STEM and incubate researcher identities. A sample of Summer Launch curriculum topics and program constructs are outlined in Table 2. The Summer Launch is designed to take place in person on the University of Michigan campus to build excitement and energy related to research, create community among students from different communities in Southeast Michigan, and introduce youth to college campuses. The Summer Launch will be adapted and conducted virtually when meeting in person is not possible due to COVID-19 restrictions. View this table: [Table 2.](http://medrxiv.org/content/early/2023/02/03/2023.02.02.23285366/T2) Table 2. Sample summer launch agenda topics aligned with MYHealth constructs Throughout Summer Launch, students will learn about research methods, CBPR, and various adolescent health studies (Table 2). We will create applied learning experiences using MyVoice, a national text message poll of 14-24-year-olds to understand adolescents’ views on a range of health-related topics [44]. For all discussions, MyVoice will serve as a specific data source to illustrate research elements in the context of a real-world project. For example, when learning about data collection, youth will review existing MyVoice question sets and participate in a MyVoice pilot, receiving SMS messages like MyVoice participants. Consistent with our CBPR approach that trains youth to be co-researchers, the Summer Launch agenda and activities will align with equity, inclusion, and social justice principles. Each topic will consider the importance of ethical and responsible conduct of research—from designing a study with the well-being and needs of participants in mind, to obtaining consent, to fairly representing data and disseminating it to all relevant stakeholders. ### Impact projects At the end of the Summer Launch, all students are invited to apply to participate in the Impact Projects phase, a nine-month program that aims to build on the overall momentum, skills, and interests developed in the previous phase. When applying, students will specify their interest in various adolescent health topics to help guide the direction of the Impact Projects. We anticipate that up to 30 high school students will complete Impact Projects in the following academic year. Impact Projects are guided by several key principles, including equitable partnership as co-researchers, positioning youth as experts of their own experiences, studying a relevant and meaningful topic to youth, and sharing findings with diverse audiences. By engaging in these collaborative activities, the students will develop their researcher identities, enhance their sense of scientific self-efficacy, and hone their scientific literacy. Students will meet virtually, approximately twice per month, in small teams that include a faculty mentor, an undergraduate and/or graduate student research assistant, and up to two Peer Leaders who are MYHealth alumni. Projects are conducted virtually to broaden access to MYHealth for high school students across schools and communities. To effectively engage high school students, meetings will be highly interactive, including group processes and applied exercises. See Table 3 for a sample of the Impact Projects agenda. Activities will mimic the research process, including three key phases: 1) Study Design; 2) Data Collection and Analysis; and 3) Dissemination. These phases will include ongoing training in CBPR, health research methods, and the responsible conduct of research. Though most of the training and activities will occur in small groups, each team will also share their progress with other Impact Project teams and program faculty to receive feedback and practice research communication. View this table: [Table 3.](http://medrxiv.org/content/early/2023/02/03/2023.02.02.23285366/T3) Table 3. Impact projects agenda topics by stage ### Peer leadership The overall aim of the Peer Leadership phase is to provide ongoing social and academic support, increase motivation, and improve self-efficacy for students from historically excluded groups pursuing STEM and health research careers [28, 41]. Peer Leaders will receive additional coaching and support from program faculty, including additional training in CBPR and health research methodologies. Selection criteria for being a Peer Leader include having an aptitude and interest in becoming a researcher. We anticipate that up to 10 of the 30 participating Impact Project students each year will be Peer Leaders in the following year. Peer Leaders will interact closely with MYHealth high school students and program faculty during the Summer Launch and Impact Projects. Peer Leaders will be part of Impact Project teams and will provide coaching and guidance to their team members, work alongside faculty mentors, and maintain logs of research activities and progress for the program evaluation. Peer Leaders will be encouraged to contribute to dissemination efforts, including 1) contributing to manuscripts and presentations to share evaluation findings; 2) supporting resource-sharing efforts through in-person presentations to local STEM programs; 3) supporting resource-sharing efforts through the website and social media strategies. ### Methods A mixed methods approach will be used to collect multiple quantitative and qualitative measures of implementation and outcomes over five years. An explanatory sequential design (quantitative followed by qualitative) will be implemented wherein survey responses will inform final interview questions to evaluate the impact of the MYHealth program on participants’ interest in and persistence in STEM research. The Institutional Review Board at the University of Michigan approved the evaluation study of the MYHealth program (HUM00213914, HUM00214694, HUM00214949). All MYHealth participants will consent or assent to participate in the research and parental consent will be collected for minors. Participants will be assigned a random identifier at the start of the program to protect their anonymity and confidentiality throughout data analysis. Because survey questions are focused on attitudes and beliefs about science and research and are not sensitive in nature, there is minimal risk to the participants throughout the course of the study. ### Setting The MYHealth program focuses on recruiting high school students based in Southeast Michigan including, but not limited to, the Detroit, Southfield, Flint, East Lansing, Ypsilanti, and the surrounding areas. These communities in Southeastern Michigan will be well-suited for recruitment of high school students from historicall excluded groups and have a clear need for additional science training that is more engaging, relevant, and impactful than traditional classroom science. During the 2016-2017 school year, science proficiency rates in these communities lagged behind other schools in Southeastern Michigan and the state of Michigan. For example, several focal communities had science proficiency rates of 11th grade high school students between 9% and 11%. When looking at economically disadvantaged districts, these numbers dropped below 8%. While state proficiency assessments are not the sole determinant of science interest, enrollment in science courses in college, or future STEM careers [45], high school science achievement can alter youth’s perceptions of their capacity to engage in science [6, 18, 46]. ### Recruitment High school students will be recruited through ongoing partnerships with community partners who have established networks of schools, teachers, and communities and will connect eligible students and families to the MYHealth program. These partners include university centers that serve students and schools, local school districts, and an academic support program. Our community partners will distribute recruitment materials directly (e.g., information sheets, flyers, emails) through their networks. Interested students and their families will be directed to complete an online application form or contact our research team via email or telephone. The application form (administered via the Qualtrics secure platform) asks interested high school students and their families to provide contact information, state their interest in MYHealth, and explain why they think they would be a good fit for the program. Strong applicants will demonstrate an interest in health, community engagement, research, or the capacity to work in teams. ### Eligibility criteria Incoming high school students living in Southeast Michigan who self-identify as members of an underrepresented group in science or health will be invited to participate. No racial or ethnic groups will be excluded from participating in the study. ### Data collection and outcome measures Numerous scales that have been previously validated with youth will be used to measure youth experiences with STEM and research [8], researcher identity [38, 40], interest in and persistence in STEM careers [24], scientific literacy [37, 39], and leadership, and teamwork-self-efficacy [18]. Measures were tested during a pilot phase and then revisions were incorporated. Table 4 presents an overview of the key measures and the critical time points at which they are collected. View this table: [Table 4.](http://medrxiv.org/content/early/2023/02/03/2023.02.02.23285366/T4) Table 4. Overview of evaluation instruments and data collection timepoints ### Surveys An online survey will be administered online via Qualtrics at multiple timepoints during each program phase, including pre- and post-Summer Launch, pre-, mid-, and post-Impact Projects, pre- and post-Peer Leadership, and one year after completing all program activities (Table 4). The one-year follow-up survey will track youths’ continued interest and persistence in STEM research careers, knowledge and skills related to STEM research, and how the program has influenced their engagement in other research activities. ### Engagement and participation Each year students’ reactions to the Summer Launch and the Impact Project programs will be collected to make ongoing course improvements. In addition, we will design Peer Leader and graduate research assistant logs to facilitate reflections about what worked in each meeting and what could be improved before the next session for both the Summer Launch and the Impact Project programs. A research assistant will track program hours and attendance to gauge dosage [47] in the Impact Projects. Logs will be used to track the implementation of CBPR processes at monthly meetings. The logs will consist of a *Research Process Checklist* [48] to understand progress in the research design process (e.g., stated research problem, defined variables, and specified procedures to analyze data). Finally, the graduate research assistant will complete an observational rating scale, *Youth Participatory Action Research Process Template* [49], to track CBPR processes demonstrated by the Peer Leader and Impact Project students in each small group, including training and practice of research skills, promoting strategic thinking, group work, networking, communication skills, and power sharing. In addition, Evaluators will work with the graduate research assistants to complete the *Critical Thinking Rubric for PBL [50]* at three points during the academic year. In the spring of each year, Impact Project students will disseminate research findings. The graduate research assistant will use the *Presentation Rubric for PBL* to capture how students are communicating research processes and findings with youth, community, policy, and academic audiences [51]. ### Semi-structured interviews and/or focus groups Each year at the end of the Summer Launch, up to 10 students will be invited to participate in a semi-structured interview or focus group [52]. We will use a semi-structured interview guide that explores student experiences during the Summer Launch, including perceptions of research and STEM careers, experiences with diversity, equity, and inclusion during MYHealth, and possible barriers to participation and continued engagement. Interviews will be recorded and transcribed. Similarly, at the end of the Impact Project each year, a sample of up to 10 students will be asked to participate in semi-structured interviews or a focus group to understand processes that occurred during implementation with each cohort. Questions will probe topics such as students’ perceptions of STEM and health research careers, their experiences with diversity, equity, and inclusion on the Impact Projects, and possible barriers to participation and continued engagement. All interviews will be recorded and transcribed for analysis. ### Analyses Overall, there are parallel strategies to evaluate all three program components: the Summer Launch, Impact Projects, and Peer Leadership. A Repeated Measures Analysis of Variance will be conducted to analyze the quantitative items in the pre-, post-, and follow-up surveys. Small-scale analyses will be performed on each program phase, and a longer-term evaluation will be conducted on longitudinal data. Over time, between-subjects factors may be added to account for differences between cohorts, or disaggregate data by demographic factors (e.g., gender, ethnicity). The final reporting will utilize appropriate statistical significance and associated effect sizes. Interview data will be analyzed with an inductive Outcome Harvesting approach [54, 55], in which stakeholders identify individual and group impacts that mattered most, and speculate on program elements that led to these outcomes. Codes will be applied to all interview transcripts and then synthesized into related themes and conclusions. Findings will be used to inform future years of the MYHealth program. ## Results We anticipate that up to 280 total high school students from historically excluded groups will be enrolled in MYHealth across four years: up to 280 will have completed the Summer Launch, a subset of up to 120 will have completed the Impact Projects, and a smaller subset of up to 40 will have become Peer Leaders. Primary quantitative outcomes for this evaluation are Researcher Identity and STEM research career interest. Our power analyses indicate that we need 90 students each in the Summer Launch and Impact Projects to detect a minimum effect size of 0.3 which is considered small by conventional interpretations [56]. This assumes 80% power at 5% Type 1 error (two-tailed t-test). Given our anticipated sample, we will have sufficient students to make conclusions about changes in Researcher Identity and STEM career interest even with attrition. For the qualitative components of the program evaluation, we will ensure that we have a sufficient sample size to achieve thematic saturation. In qualitative approaches, thematic saturation refers to the point at which there is adequate evidence to develop themes and no new insights are gathered with continued data collection. For interview studies, thematic saturation is often estimated to be achieved between 6-20 interviews [57, 58]. ## Discussion MYHealth is limited by our small cohort size and geography. The cohort size for each phase is limited to ensure that students receive adequate time with faculty mentors and support for their small group projects. Over the four years of the project, we will be able to understand the impact of the research training program. Currently, only students from Southeast Michigan can participate. If successful, future iterations of MYHealth could leverage our virtual program to extend to another setting outside of Southeast Michigan. The first complete year of the MYHealth program began in the Summer of 2022, and will re-open annually for the following four years. We anticipate data will become available through peer-reviewed publications beginning in the first two years and on several more instances throughout the course of the project. ## Conclusion The MYHealth program capitalizes on Social Cognitive Theory, Expectancy Value Theory, and a CBPR approach to design and implement a program that aims to address the continuing lack of diversity within the STEM workforce. The program assesses the role and intersection of personal characteristics (e.g., researcher identity, self-efficacy), social and environmental factors, and behavioral influences to spark and maintain interest in STEM careers. ## Data Availability No datasets were generated or analysed during the current study. All relevant data from this study will be made available upon study completion. ## Authors’ contributions All authors, MD, JR, AA, TC, MDi, KH, LMV, DCW contributed equally to the the work by conceptualizing the study aims and writing, reviewing and editing the study protocol. ## Footnotes * & These authors wrote the first and final drafts of the protocol and all other authors contributed to the drafting process. * **Data availability:** Data will be made available upon study completion. * Received February 2, 2023. * Revision received February 2, 2023. * Accepted February 3, 2023. * © 2023, Posted by Cold Spring Harbor Laboratory This pre-print is available under a Creative Commons License (Attribution 4.0 International), CC BY 4.0, as described at [http://creativecommons.org/licenses/by/4.0/](http://creativecommons.org/licenses/by/4.0/) ## References 1. 1.National Center for Science and Engineering Statistics (NCSES) [Internet]. Women, Minorities, and Persons with Disabilities in Science and Engineering. Directorate for Social, Behavioral and Economic Sciences. Alexandria, VA: National Science Foundation; 2021 [cited 2022 May 5]. Report No.: Special Report NSF-21-321:[Available from: [https://ncses.nsf.gov/pubs/nsf21321/](https://ncses.nsf.gov/pubs/nsf21321/). 2. 2.National Center for Science and Engineering Statistics (NCSES) [Internet]. Science and Engineering Indicators. National Science Board. Alexandria, VA: National Science Foundation; 2021 [cited 2022 May 5]. Available from: [https://ncses.nsf.gov/pubs/nsb20196/data](https://ncses.nsf.gov/pubs/nsb20196/data). 3. 3.NIH research: Action needed to ensure workforce diversity strategic goals are achieved. Bethesda (MD): US Governement Accountability Office (GAO); 2018. Report No.: GAO-18-545. 4. 4.Fry R, Kennedy B, Funk C. STEM jobs see uneven progress in increasing gender, racial and ethnic diversity. Washington (DC): Pew Research Center Science & Society; 2021. 5. 5.Barton AC, Tan E, Rivet A. Creating hybrid spaces for engaging school science among urban middle school girls. Am Educ Res J. 2008;45(1):68–103. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3102/0002831207308641&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000253585500003&link_type=ISI) 6. 6.Britner SL, Pajares F. Sources of science self-efficacy beliefs of middle school students. J Res Sci Teach. 2006;43(5):485–99. 7. 7.Vedder-Weiss D, Fortus D. Adolescents’ declining motivation to learn science: Inevitable or not? Res Sci Teach. 2011;48(2):199–216. 8. 8.Aschbacher PR, Li E, Roth EJ. Is science me? High school students’ identities, participation and aspirations in science, engineering, and medicine. J Res Sci Teach. 2010;47(5):564–82. 9. 9.Barton AC, Tan E. Funds of knowledge and discourses and hybrid space. J Res Sci Teach. 2009;46(1):50–73. 10. 10.Kang H, Calabrese Barton A, Tan E, D. Simpkins S, Rhee H-y, Turner C. How do middle school girls of color develop STEM identities? Middle school girls’ participation in science activities and identification with STEM careers. Sci Educ. 2019;103(2):418–39. 11. 11.Zacharia Z, Barton AC. Urban middle-school students’ attitudes toward a defined science. Sci Educ. 2004;88(2):197–222. 12. 12.Dirks C, Cunningham M. Enhancing diversity in science: Is teaching science process skills the answer? CBE–Life Sci Educ. 2006;5:218–26. 13. 13.McLean NA, Fraser M, Primus NA, Joseph MA. Introducing students of color to health sciences research: An evaluation of the health disparities summer internship program. J Community Health. 2018;43(5):1002–10. 14. 14.Salto LM, Rigg ML, De Leon DD, Casiano CA, De Leon M. Underrepresented minority high school and college students report STEM-pipeline sustaining gains after participating in the Loma Linda University summer health disparities research program. PLoS One. 2014;9(9). 15. 15.National Academy of Sciences, National Academy of Engineering, and Institute of Medicine. Expanding Underrepresented Minority Participation: America’s Science and Technology Talent at the Crossroads. Washington (DC): The National Academies Press; 2011. 16. 16.Swartz TH, Palermo AS, Masur SK, Aberg JA. The science and value of diversity: Closing the gaps in our understanding of inclusion and diversity. J Infect Dis. 2019;220(Suppl 2):s33–s41. 17. 17.Andersen L, Ward TJ. Expectancy-value models for the STEM persistence plans of ninth-grade, high-ability students: A comparison between Black, Hispanic, and White students. Sci Educ. 2014;98(2):216–42. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/sce.21092&link_type=DOI) 18. 18.Chemers MM, Zurbriggen EL, Syed M, Goza BK, Bearman S. The role of efficacy and identity in science career committment among underrepresented minority students. J Soc Issues. 2011;67(3):469–91. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1540-4560.2011.01710.x&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000295093000004&link_type=ISI) 19. 19.Hazari Z, Sadler PM, Sonnert G. The science identity of college students: Exploring the intersection of gender, race, and ethnicity. J Coll Sci Teach. 2013;42(5):82–91. 20. 20.Hazari Z, Sonnert G, Sadler PM, Shanahan M-C. Connecting high school physics experiences, outcome expectations, physics identity, and physics career choice: A gender study. J Res Sci Teach. 2010;47(8):978–1003. 21. 21.Bandura A. Social Foundations of Thought and Action: A Social Cognitive Theory. Englewood Cliffs (NJ): Prentice-Hall; 1986. 22. 22.Bandura A. Human agency in social cognitive theory. Am Psychol. 1989;44(9):1175–84. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1037/0003-066X.44.9.1175&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=2782727&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=A1989AN65800001&link_type=ISI) 23. 23.Bandura A. Social cognitive theory: An agentic perspective. Annu Rev Psychol. 2001;52(1):1–26. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1146/annurev.psych.52.1.1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=11148297&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000167463100003&link_type=ISI) 24. 24.Bryan RR, Glynn SM, Kittleson JM. Motivation, achievement, and advanced placement intent of high school students learning science: Motivation of high school students learning science. Sci Educ. 2011;95(6):1049–65. 25. 25.Wang X. Why students choose STEM majors: Motivation, high school learning, and postsecondary context of support. Am Educ Res J. 2013;50(5):1081–121. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3102/0002831213488622&link_type=DOI) 26. 26.Minkler M, Wallerstein N. Community-based participatory research for health: From process to outcomes. 2nd ed. Hoboken (NJ): Jossey-Bass; 2010. 27. 27.van der Zee T, Reich J. Open education science. AERA Open. 2018;4(3):2332858418787466. 28. 28.Hurtado S, Cabrera NL, Lin MH, Arellano L, Espinosa LL. Diversifying science: Underrepresented student experiences in structured research programs. Res High Educ J. 2009;50:189–214. 29. 29.Trujillo G, Tanner KD. Considering the role of affect in learning: monitoring students’ self-efficacy, sense of belonging, and science identity. CBE Life Sci Educ. 2014;13(1):6– 15. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1187/cbe.13-12-0241&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24591497&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom) 30. 30.Byars-Winston A, Estrada Y, Howard C, Davis D, Zalapa J. Influence of social cognitive and ethnic variables on academic goals of underrepresented students in science and engineering: A multiple-groups analysis. J Couns Psychol. 2010;57(2). 31. 31.Fouad NA, Santana MC. SCCT and underrepresented populations in STEM fields: Moving the needle. J Career Assess. 2017;25(1):24–39. 32. 32.Navarro RL, Flores LY, Worthington RL. Mexican American middle school students’ goal intentions in mathematics and science: A test of social cognitive career theory. J Couns Psychol. 2007;54(3). 33. 33.Eccles J. Achievement and Achievement Motives: Psychological and Sociological Approaches. In: Spence JT, editor. Expectancies, Values, and Academic Behaviors. San Francisco (CA): Freeman; 1983. p. 75–146. 34. 34.Wigfield A, Eccles JS. Expectancy–value theory of achievement motivation. Contemp Educ Psychol. 2000;25(1):68–81. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1006/ceps.1999.1015&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=10620382&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000084720200004&link_type=ISI) 35. 35.Israel BA. Methods in Community-Based Participatory Research for Health. San Francisco (CA): Jossey-Bass; 2005. 36. 36.Jacquez F, Vaughn LM, Wagner E. Youth as partners, participants or passive recipients: A review of children and adolescents in Community-Based Participatory Research (CBPR). Am J Community Psychol. 2013;51:176-89. 37. 37.Arnold ME, Bourdeau VD, Nott BD. Measuring science inquiry skills in youth development programs: The science process skills inventory. J Youth Dev. 2018;8(1):15. 38. 38.Bathia S, Morell L, Smith R, Koo B, Wilson M. Measuring high school students’ self-perception as a researcher: Development and validation of the researcher identity scale. American Educational Research Association (AERA) Annual Meeting; San Francisco (CA). 2020. 39. 39.Bourdeau V, Arnold ME. Science Process Skills Inventory [Internet]. Corvallis (OR): 4-H Youth Devleopment Education, Oregon State University; 2009 [cited 2022 5 May]. Available from: [http://www.pearweb.org/atis/data/documents/000/000/010/Science\_Process\_Skills\_Inventory\_1.pdf](http://www.pearweb.org/atis/data/documents/000/000/010/Science\_Process_Skills_Inventory_1.pdf). 40. 40.Morell L, Bathia S, Koo B, Smith R, Wilson M. The construction and validation of a research identity scale. American Educational Research Association (AERA) Annual Meeting; San Francisco (CA). 2019. 41. 41.Palmer RT, Maramba DC, Dancy TE. A qualitative investigation of factors promoting the retention and persistence of students of color in STEM. J Negro Educ. 2011;80(4):491– 504. 42. 42.DeJonckheere M, Vaughn LM, Bruck D. Youth-led participatory action research: A collaborative methodology for health, education, and social change. Sage Res Meth Cases. 2017. 43. 43.Suleiman AB, Soleimanpour S, London J. Youth action for health through youth-led research. J Community Pr. 2006;14(1-2):125–45. 44. 44.DeJonckheere M, Nichols LP, Moniz MH, Sonneville KR, Vydiswaran VV, Zhao X, et al. MyVoice national text message survey of youth aged 14 to 24 years: Study protocol. JMIR Res Protoc. 2017;6(12):e8502. 45. 45.Maltese AV, Tai RH. Pipeline persistence: Examining the association of educational experiences with earned degrees in STEM among U.S. students. Sci Edu. 2011;95(5):877–907. 46. 46.Chen JA, Usher EL. Profiles of the sources of science self-efficacy. Learning and Individual Differences. 2013;24:11–21. 47. 47.O’Donnell L, O’Donnell C, Wardlaw DM, Stueve A. Risk and resiliency factors influencing suicidality among urban African American and Latino youth. Am J Community Psychol. 2004;33(1-2):37–49. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/B:AJCP.0000014317.20704.0b&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15055753&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000188709900005&link_type=ISI) 48. 48.Freed MN, Ryan JM, Hess RK. Handbook of Statistical Procedures and Their Computer Applications to Education and the Behavioral Sciences. United Kingdom: Macmillan Publishing Co, Inc; 1991. 49. 49.Ozer EJ, Douglas L. Assessing the key processes of youth-led participatory research: Psychometric analysis and application of an observational rating scale. Youth Soc. 2015;47(1):29–50. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0044118X12468011&link_type=DOI) 50. 50.PBLWorks [Internet]. 6-12 Critical Thinking Rubric (non-CCSS) Novato (CA). 2020 [cited 2022 5 May]. Available from: [https://my.pblworks.org/resource/document/6\_12\_critical\_thinking\_rubric\_non\_ccss](https://my.pblworks.org/resource/document/6\_12\_critical\_thinking_rubric_non_ccss). 51. 51.PBLWorks [Internet]. 9-12 Presentation Rubric for PBL (non-CCSS Novato (CA). 2020 [cited 2022 5 May]. Available from: [https://my.pblworks.org/system/files/documents/PBLWorks-9-12-Presentation-Rubric-Non-CCSS.pdf](https://my.pblworks.org/system/files/documents/PBLWorks-9-12-Presentation-Rubric-Non-CCSS.pdf). 52. 52.DeJonckheere M, Vaughn LM. Semistructured interviewing in primary care research: A balance of relationship and rigour. FMCH. 2019;7(2). 53. 53.Ozer EJ, Schotland M. Psychological empowerment among urban youth: Measure development and relationship to psychosocial functioning. Health Edu & Behav. 2011;38(4):348–56. 54. 54.Abboud R, Claussen C. The use of Outcome Harvesting in learning-oriented and collaborative inquiry approaches to evaluation: An example from Calgary, Alberta. Eval Program Plann. 2016;59:47-54. 55. 55.Wilson-Grau R. Outcome harvesting: Better Evaluation; 2015 [cited 2022 5 May]. Available from: [https://www.betterevaluation.org/en/plan/approach/outcome\_harvesting](https://www.betterevaluation.org/en/plan/approach/outcome_harvesting). 56. 56.Maher JM, Markey JC, Ebert-May D. The other half of the story: Effect size analysis in quantitative research. CBE Life Sci Educ. 2013;12(3):345–51. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1187/cbe.13-04-0082&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24006382&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom) 57. 57.Guest G, Bunce A, Johnson L. How many interviews are enough? An experiment with data saturation and variability. Field Methods. 2006;18(1):59–82. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1525822X05279903&link_type=DOI) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000242853100004&link_type=ISI) 58. 58.Malterud K, Siersma VD, Guassora AD. Sample size in qualitative interview studies: Guided by information power. Qual Health Res. 2016;26(13):1753–60. [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/1049732315617444&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26613970&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F02%2F03%2F2023.02.02.23285366.atom)