ABSTRACT
Purpose The Westlake BioBank for Chinese (WBBC) pilot cohort is a population-based prospective study with its major purpose to better understand the effect of genetic and environmental factors on growth and development from adolescents to adults. The dataset comprises a wide range of demographics, anthropometric measures, physical activity, sleep quality, age at menarche and bone mineral density.
Participants A total of 14,726 participants (4,751 males and 9975 females) aged 14 to 25 years were recruited and the baseline survey was carried out from 2017 to 2019. The pilot cohort contains rich range of information regarding of demographics, anthropometric measurements, blood pressure and heart rate, lifestyle and sleep patterns, biological, clinical and health outcomes.
Findings to date There were most of the participants were Chinese Han ethnic (97.78%), and more than 60% of them were originally from rural areas. For anthropometry measurements, the mean height and weight were 172.92 cm and 65.81 kg for men, and 160.13 cm and 52.85 kg for women; the mean waist, hip and thigh circumference were 75.51cm, 90.75cm and 51.56 cm in men, and 71.67cm, 89.75cm and 51.68 among women. Results indicated that the prevalence of underweight in female was much higher than male, but the prevalence of overweight and obesity in female was lower than male. The mean of serum 25(OH)D level in the 14,726 young participants was 22.39 ng/ml, and male had higher level of serum 25(OH)D than female, overall, 33.47% of the participants had vitamin D deficiency and even more participants suffered from vitamin D insufficiency (58.17%). The proportion of deficiency in women was much higher than that in men (41.83 vs. 16.35%).
Future plans WBBC is designed as a prospective cohort study and provides a unique and rich data set analyzing health trajectories of Chinese adolescents to young adults.
Introduction
The initial goal of setting up this cohort is trying to bring up more attentions to the skeletal health in adolescence and adults, and thus to provide strategy in prevention of osteoporosis in the elderly. Osteoporosis, a disease of increased bone fragility, is a systemic osteopathy characterized by a decrease in bone density/quality and the destruction of bone microstructure caused by genetic and environmental factors1. Bone mineral density (BMD), the bone mineral content (BMC) in bone tissue, is recognized as the most important predictor of osteoporosis. Currently, approximately 200 million people worldwide suffer from osteoporosis, and 83.9 million of which are in China 2. As the trend of global aging is becoming more and more obvious, more attention should be paid to the bone health not only in the elderly, but also in adolescence and young adults. Population-based studies have shown that roughly half of the boys and one-third of the girls would undergo a fracture by age of 18 and 1/5 would have two or more fractures3,4. Epidemiologic studies have shown that a 10% increase in peak bone mass (PBM) at the population level reduces the risk of fracture later in life by 50%5. In fact, bone mass attained in early life was considered to be the most important modifiable determinant of lifelong skeletal health. A longitudinal data have shown that more than 94% of BMD was acquired at the age of sixteen6 and approximately 40% to 60% of adult bone mass was accrued during the adolescent years in both women and men7.
Any condition interfering with optimal peak bone mass accrual can, therefore, increase fracture risk later in life, adolescence is critical period for skeletal mineralization. The bone mass gain during adolescence is influenced by multiple factors, including genetic factors, ethnicity, and environmental factors, such as alcohol intake, cigarette smoking, physical activity, endocrine status (e.g. vitamin D), diet (calcium and protein intake) and other factors5,8-13. As environmental and behavioral factors account for 20% to 40% of adult peak bone mass14,15, the early identification of the factors associated with poor bone health and the provision of reliable counseling might help children and teenagers take action to maximize BMD before their PBM was completed.
Overweight and obesity are a major public health problem16. Elevated body mass index (BMI) in adolescence had been associated with several obesity-related morbidities in adult life, such as diabetes, metabolic syndromes and some types of cancer17. And obesity in adolescence conferred very high risks for obesity in adults18; 70% of overweight adolescents had one or more concomitant conditions such as high blood pressures and fasting insulin, which were also risk factors for cardiovascular disease, and 23% of those accompanied with three or more concomitant conditions19. Following rapid economic development since the 1980s, China experiences a rapidly increasing of overweight and obesity among children and adolescents20,21. In 2019, a cross-sectional study22 found that the prevalence of overweight in college students (aged 18-26 years) was 8.0%, and the prevalence of obesity was 3.5%. A recent study from 12 provinces in China showed that the prevalence of overweight and obesity were 14.0% and 10.5% in boys, and 9.7% and 7.1% in girls, respectively23.
Vitamin D deficiency is becoming a public health problem in both developed and developing countries24,25. Besides its effect on musculoskeletal system, Vitamin D showed pleiotropic effect on human health, such as cardiovascular diseases26, common infectious diseases27 and autoimmune diseases28. Serum 25(OH)D is a good indicator of vitamin D storage and is an optimal method of assessing vitamin D levels24. According to the Endocrine Society clinical practice guidelines, vitamin D levels were defined as deficiency [25(OH)D<20 ng/mL], insufficiency [25(OH)D: 20-29 ng/mL)] and sufficiency [25(OH)D≥30 ng/mL] respectively29. Many people in central and western Europe had vitamin D concentration of 11–20 ng/mL in winter 30. Studies from other countries, including Canada31, Japan32, Australia33 and Iran34, presented similar situations, with high prevalence of vitamin D insufficiency in different ethnicities. A study in north China found that more than 40% of adolescent girls had Vitamin D-deficiency in the winter35. Another study in Shanghai showed that more than one-third newborns had plasma 25(OH)D less than 20 ng/mL36. Even in Hong Kong (latitude 22° north), 72% of young adults were reported to have vitamin D deficiency37.
The overall goal of the Westlake Biobank for Chinese (WBBC) pilot cohort is to recruit individuals at their late adolescence/young adulthood. The biological samples such as whole blood, serum, urine and faeces were collected, genomic DNA were extracted and the DNA sequence information were acquired through sequencing technique. A long questionnaire with questions concerning the environmental factors such as nutrition, sleep quality, physical activity, medication etc was provided. These data will help us to understand the association between the genetics, environmental factors, microbiome and health statue of adolescence population. With a broad range of phenotype collection on many aspects of participants’ daily life, a wide range of scientific questions could be addressed. The main purpose of this particular paper is to profile the cohort, therefore, only limited findings were described, such as: (1) what are the prevalence of underweight, overweight, obesity and vitamin D deficiency in Chinese late adolescence? What is the reference value of serum vitamin D level in the young people? (2) what is the difference between male and female in term of height, weight, blood pressure, lifestyle and bone health in the young people.
Cohort description
Sampling Design
The Westlake Biobank for Chinese (WBBC) pilot study was collected in three main regions in China (Hanghzou city of Zhejiang province, Shangrao city of Jiangxi province and Yantai city of Shandong province), but the participants covered all around the country (Table 1 and Figure 1). The baseline survey was carried out from 2017 to 2019. The target population was young people aged 14–25 years who were college students and available for follow-up studies. In the first phase of baseline (WBBC pilot 1), the participants were recruited from two colleges at Zhejiang province and Jiangxi province in Southeast China from September 2017 to March 2018 (Figure 2), and 1,258 and 2,769 participants were from Zhejiang and Jiangxi provinces, respectively, and 1,263 participants were from other 25 provinces of China (Table 1). From September 2018 to December 2018, the second phase of WBBC pilot project was initiated (WBBC pilot 2), the participants were recruited at the same college in Jiangxi province and a college in Shandong province in Northeast China (Figure 2). There were 2,920 participants from Shandong province, 2,032 participants from Jiangxi province and 1,306 participants from other 28 provinces of China in WBBC pilot 2 (Table 1). From September 2019, the WBBC pilot project phase 3 (WBBC pilot 3) recruited participants from the same college in Jiangxi province (Figure 2), most of the participants (2,504) of WBBC pilot 3 were from Jiangxi province and 6,74 participants were from other 26 provinces of China (Table 1). All participants provided their Chinese unique national identity (ID) number for unique reference at the health examination center in the campus. The inclusive criteria were: a). All study participants signed the informed consent form before taking part in the survey. b). Participants should complete the physical examination, bone mineral density scan, blood test and questionnaires. And the exclusion criteria were: a). No informed consent. b). Participants missed most of the items in data collection. In WBBC pilot 3, the urine and feces of the participants were collected, therefore, participants taking antibiotics should be excluded.
Data Collection Procedures
The WBBC pilot study is a multidisciplinary study and contains rich range of information regarding of demographics, anthropometric measurements, blood pressure and heart rate, lifestyle and sleep patterns, biological, clinical and health outcomes. Genomic data are available for 7,033 participants. Data and samples were collected via examinations, questionnaire and venipuncture (Table 2).
Measurements of anthropometric parameters
Anthropometric data included height, body weight, bust, waist, hip and thigh circumference, resting blood pressure, heart rate and hand grip. Height was measured to the nearest 0.1 cm with participants’ light-weight clothes and shoes off; weight was measured to the nearest 0.01 kg with the weight scale (Ultrasonic surveying instrument, Beryl BYH01, China) calibrated daily before each series of measurements. Bust, waist, hip and thigh circumference were measured to the nearest 0.5 cm by using a measuring tape with the subject standing comfortably. Resting blood pressure and heart rate were measured on the left arm supported at heart-level sitting position using electronic sphygmomanometers (Yuwell YE660A, China). To ensure accurate data, the participants were asked to have rested for at least 5 minutes and have no excessive physical activity, tea or alcohol intake or smoking for at least one night. Using a handgrip dynamometer (CAMRY EH101, China), grip strength with both hands were tested for most of the participants in WBBC pilot 2 and WBBC pilot 3. In order to get more accurate results, the participants should make sure the arm that’s being tested was at a 90 degree angle at the elbow38 until the test was finished. Details of the methods and instruments used for measurements of anthropometric parameters were provided in Table 3.
Biochemistry assessment
Participants came to the examination center in each college in the morning with at least 8 h of overnight fasting, about 25 ml of venous blood samples were collected for routine blood measurements, biochemical indexes, DNA extraction and so on. Venous blood samples were collected using ethylenediamine tetraacetic acid dipotassium (EDTAK2) anticoagulation tube (3×5.0 ml) and vacuum tube without anticoagulation (2×5.0 ml). Serum and plasma samples were separated from whole blood through centrifugation for 10 min at the relative centrifugal force 3000 g (Figure 3). Serum samples were forwarded to test biochemical indexes that included serum 25(OH)D level, serum calcium level, fasting blood glucose, kidney function test, hepatic function test, blood lipids [triglyceride, cholesterol, low-density lipoprotein (LDL), high-density lipoprotein (HDL)], triiodothyronine (T3), thyroid and parathyroid function and bone turnover markers (Table 2). Details of the platforms used for biochemical analysis are provided in Table 3. We also reserved serum samples (2×0.5ml) for each participant at - 80□for future use. Figure 3 displays the detail of the flow diagram of blood separation and detection of main blood biochemical indexes.
Questionnaire-based assessments
Baseline data collection for participants included a self-completion questionnaire. In Table 2, a list of core questions within the aforementioned domains, was provided. The questionnaire included social and demographic measures data (e.g. age, sex, ethnicity, family economic status and born place), menstrual history (for female), lifestyle (e.g. physical activity, smoking status, alcohol, tea and coffee intake), additional supplement (e.g. calcium and vitamin D), health status and other information. Sleep duration and sleep quality were assessed by the Pittsburgh Sleep Quality Index (PSQI)39. This is composed of 19 questions which reflect seven major components, all seven components are then summed up to create a scale from 0–21 points.
Bone mineral density assessment
Bone mineral density (BMD) is expressed in terms of bone mass per cm2 (g/cm2) and were assessed by dual-energy X-ray absorptiometry40 (DXA, Discovery QDR 4500; Hologic Inc., Waltham, MA, USA). In WBBC pilot, the main sites for BMD evaluation were lumbar spine (L1–L4), femur (femoral neck and total hip) and distal third of the radius.
Whole Genome sequencing and genotyping
In the process of collecting fasting venous blood, 5-ml whole blood was used for isolation of genomic DNA. These genomic DNA were used for whole genome sequencing and genotyping (Figure 3). Whole genome sequencing was completed by NovaSeq 6000 system (Illumina Co., Ltd), and for now, 1,192 participants have been sequenced at mean depth of 14x, with highest depth of 65x. A Chinese specific reference panel will be constructed for imputation for Chinese population. Whole genome genotyping was completed with Infinium Asian Screening Array (ASA) (Illumina Co., Ltd), and 5,841 participants have been genotyped in approximately 700,000 SNPs.
Follow-up and outcome measures
We are seeking funding to follow the cohort to examine development and growth of the participants, and to investigate the effect of environmental factors on later outcomes. An important area of future research will focus on the development of bone mineral density and body weight from late adolescence to adulthood. Figure 2 shows the overall study plan. Follow-up surveys will be conducted according to the design of the subsequent research projects, the participants will be invited for re-survey with repeat interviews, including the questionnaire, anthropometric measurements, grip strength and bone mineral density collection as those used in the baseline stage and the data of nutritional status by food frequency questionnaire.
We have started a pilot follow-up study for WBBC pilot 2 since December 2019, and 1,303 participants had completed all examinations (Figure 2). The collected information included height, weight, grip strength and the updated questionnaire. Besides, we retested bone mineral density at spine (L1-L4), hip and distal third of the radius.
Statistical analysis
To test differences in means and proportions between male and female, we used T-test and Chi-square tests for continuous and categorical variables, respectively. All variables were presented by unadjusted proportions for categorical variables and unadjusted means with standard deviations (SD) for continuous variables. The variables demonstrating a p-value of less than 0.05 were considered statistically significant. All statistical analyses were analyzed using Stata 12.0 software.
Findings to date
This pilot cohort of WBBC is a large longitudinal survey conducted among adolescents and young adults in China. We surveyed 14,726 young people aged 14–25 years who were college students and available for completing follow-up studies. The baseline survey was carried out from 2017 to 2019, including WBBC pilot 1 (5,290 participants), WBBC pilot 2 (6,258 participants) and WBBC pilot 3 (3,178 participants).
We have several ongoing projects under WBBC pilot study. One of the most significant ongoing projects is the study of Chinese population structure. In WBBC pilot study, the data from 1,192 samples with whole genome sequencing and 5,841 samples with whole genome genotyping were available, collaborating with another research group in China, we finally achieved approximate ∼10,000 Chinese samples with whole genome data, covering 30 provincial region of China. In addition, WBBC pilot study will provide a haplotype reference panel to improve the imputation accuracy of Chinse GWAS study, since our previous study41 demonstrated that the existing reference panels, such as the 1000 Genome Phase3 panel42 and the HRC (Haplotype Reference Consortium) panel43, were not best fit for imputation for Chinese population, especially for the rare variant imputation.
Given the extensive range of data collection in the WBBC study, it is not feasible to present all the results, only limited findings were described in the present study. In summary, a total of 17,407 college students were invited, of whom, 14,983 (86.07 %) responded. After removing participants with missing data and invalid data, the final study included an effective sample size of 14,726 (84.60%) adolescents and young adults (with age from 14 to 25 years, and mean age at 18.5 years). Table 4 provides an overview of socio-demographic, anthropometry, cardiovascular system, lifestyle, grip strength and BMD characteristics of the WBBC pilot participants at baseline. Briefly, within the 14,726 sample, there were more females than males (67.74 vs. 32.26%), with mean age of 18.48 years for women and 18.55 years for men, respectively. Most of the participants were Chinese Han ethnic (97.78), and more than 60% of them were originally from rural areas (60.24% of males and 69.62% of females). For anthropometry measurements, the mean height and weight were 172.92 cm and 65.81 kg for men, and 160.13 cm and 52.85 kg for women; the mean waist, hip and thigh circumference were 75.51cm, 90.75cm and 51.56 cm in men, and 71.67cm, 89.75cm and 51.68 among women. The mean systolic blood pressure (SBP), diastolic blood pressure (DBP) and heart rate (HR) in participants were 113 mmHg, 71 mmHg and 86 beats/minute, respectively. In the cohort, only 5.49% of the participants were current smokers and 38.82% of them were regular drinkers. Regarding the current smoking status, there was significant difference between men and women (16.29 vs. 1.64%, p < 0.001). As for alcohol consumption, the proportion of current drinker in boys and girls were 62.03% and 29.19%, respectively, which is much higher in males (p < 0.001). The mean sleeping time estimated in women were higher than men (8.34 vs. 7.97 hours, p < 0.001). As for grip strength, the data collection was started from WBBC pilot 2, the mean of grip strength in boys were much higher than girls (grip-left: 36.66 vs. 27.38kg and grip-right: 39.90 vs. 29.76 kg, both p < 0.001).
Height and weight were measured using the standardized procedures. Body mass index (BMI) was calculated based on the formula: weight in kilograms divided by height in meters-squared (kg/m2). According to the Working Group on Obesity in China (WGOC)44, participants were defined as underweight (<□18.5 kg/m2), normal weight (18.5-23.9 kg/m2), overweight (24-27.9 kg/m2) and obese (≥□28 kg/m2). Therefore, the WBBC pilot study provided an overall prevalence of underweight, overweight and obesity among young participants of 24.26%, 11.50% and 5.03%, respectively (Table 5). The prevalence of underweight in female was much higher than male (26.41% vs. 19.70%, p < 0.0001), but the prevalence of overweight in female was much lower than male (9.03% vs. 16.71%, p < 0.0001) (Table 5), similarly, the prevalence of obesity in female (3.21%) was lower than in male (8.88%) (p < 0.0001) (Table 5). Waist circumference (WC) is good indicator of abdominal visceral fat distribution and is a strong predictor of diabetes mellitus and cardiovascular disease 45. It is meaningful to investigate the WC along with BMI among adolescents and young people. In WBBC pilot study, central obesity was defined as WC≥85 cm for men and as WC≥80 cm for women based on the recommendations of the WGOC 44. In the cohort of 12,396 participants, the prevalence of central obesity was 14.62%, which was higher in male than in female (19.10% vs. 12.55%, p < 0.0001) (Table 5).
In WBBC pilot study, the prevalence of underweight were high in both male (19.70 %) and female (26.41%), though the prevalence of moderate and severe underweight decreased from 9.2% in 1975 to 8.4% in 2016 in girls and from 14.8% in 1975 to 12.4% in 2016 in boys in the world46. This phenomenon may due to the modern aesthetics of human stature that thinness is preferred, especially in China47,48. Recently, a study involving 2,023 young female participants (70.5% subjects aged 20-25 years) from eight Chinese universities48 showed that 30.55% of the participants were underweight, and 57.39% of them would like to be much thinner, which would lead to more underweight individuals. Therefore, future studies should not only pay attention to the problem of obesity/overweight, but also to the underweight issue in young people.
Using simple anthropometric indices of body composition, such as BMI and WC, has been considered as a practical and valuable approach to the assessment of obesity for a long time. Waist-to-hip ratios (WHR), waist-to-height ratios (WHeR), a body shape index (ABSI)49 and body roundness index (BRI)50 were also as parameters of body fat and visceral adipose tissue volume. In WBBC pilot cohort, we had collected several anthropometric measures including height, weight, bust, waist, hip and thigh circumference and these data could help us examine the usefulness of these anthropometric parameters and identify the optimal cut-off of the parameters to evaluate overweight and obesity among adolescence and young people in future study. It is noteworthy that vitamin D deficiency in females was significantly worse than in males. This may due to the modern aesthetics of Chinese culture that paler skin is preferred, especially in females. A questionnaire related to vitamin D and sun exposure was conducted at a university in Nanjing, China and found that 75.0 % of the students lacked sun exposure because they would like to avoid dark skin, and most of students (82.7 %) used sun protection, and sunscreen use were more popular in females51, but it was reported that using the amount of sun cream recommended by World Health Organization exponentially suppressed vitamin D synthesis in the skin52. In WBBC pilot study, the mean serum 25(OH)D level was 22.39 ± 5.33 ng/ml for all the participants (male: 25.15 ng/mL and female: 21.05 ng/mL, p < 0.0001) (Table 6). Overall, 33.47% of the participants had vitamin D deficiency and even more participants suffered from vitamin D insufficiency (58.17%) (Table 6). In addition, the proportion of women with sufficient vitamin D was much lower than that of men (3.69 vs. 17.91%, p < 0.0001), while the proportion of deficiency in women was much higher than that in men (41.83 vs. 16.35%, p < 0.0001) (Table 6). Most of the participants (86.87%) preferred to stay indoors in spare time, the females were less willing to do exercise than males (53.68% vs 70.59%) (Table 7), and 44.58% of females hardly had outdoor activities, only 5.89% of females often had outdoor activities every week (Table 7). These results jointly suggested that the females had not enough sun exposure. Although food sources of vitamin D were not commonly recognized, only 10–20% of vitamin D in human bodies was obtained through food sources53. In WBBC pilot study, there was only 2.24% of the participants used vitamin D supplements (3.12% in male and 1.87% in female, p = 0.00098) (Table 6) and this was consistent with Zhou et al51, which found that only 5.6 % of the students used vitamin D supplements in a university of Nanjing, China.
Strengths and limitations
The WBBC pilot study was designed to develop a longitudinal database on health-related factors for young people. Firstly, this was a comprehensive cohort study to pay attention to the health of adolescence and young adults, including over ∼14,000 participants (Table 1). Secondly, the WBBC pilot cohort is rich in longitudinal phenotypic data and archived biospecimens, including whole blood and serum as well as DNA samples. These resources will facilitate to digitize the information of the genomic, proteomic and metabolomic and microbiome of the participants, and further investigate the association of the information with the development of adolescence and young adults. Thirdly, this cohort has now accrued sufficient events for reliable analyses, and the major findings on adiposity, BMD, sleep quality, nutrition and growth for adolescence are expected over the next few years.
There are several limitations to this cohort design. First, although the participants covered all around the country, most of them were mainly from 3 provinces (Jiangxi, Shandong and Zhejiang), and the selection of participants was not random, but in a specific population (college students). Second, the participants were collected in 3 phases at different time points, and the phenotypic information was updated in the later phases, for example, the items in the questionnaire of different phases were not always the same, therefore, the phenotypic data were not always uniform in 3 phases.
Data Availability
The data is not freely available in the public domain, but specific proposals and ideas for future collaboration would be very welcome.
Collaboration
Participants have agreed to provide their pseudonymized data being made available to other approved researchers. The WBBC pilot study welcomes and offers global collaboration. The data is not freely available in the public domain, but specific proposals and ideas for future collaboration would be very welcome. Applicants for collaboration and more information are encouraged to contact Dr. Hou-Feng Zheng (Email address: zhenghoufeng{at}westlake.edu.cn), the person in charge of this project.
Funding
This study was supported by the National Natural Science Foundation of China (81871831), and by the Zhejiang Provincial Natural Science Foundation for Distinguished Young Scholars of China (LR17H070001), and by the Westlake Biobank for Chinese (WBBC) funds from Westlake University.
Contributors
Hou-Feng Zheng gained funding and conceived of the study, and all authors were involved in the design and collection data of the study. Xiao-Wei Zhu and Hou-Feng Zheng analyzed the data and wrote the paper and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.
Conflict of interest
Jun-Quan Liu and Yi Sun are employees of Hangzhou Kingmed Diagnostics Co., Ltd. The other authors have no conflict of interest to declare.
Patient and public involvement
No patients were involved in the described linkage between existing registries providing an anonymous data set.
Ethics Approval and Consent to Participate
The study protocol and informed consent procedure were approved by the Ethics Committees at Westlake University. All study participants signed the informed consent form before taking part in the study.
Acknowledgements
We gratefully acknowledge all the participants of this project, and thank all the people who helped us in the sample recruitment, including volunteers, laboratory technicians, nurses and clerical workers. We also thank the Westlake University Supercomputer Center for the facility support and technical assistance. In addition, we would like to thank Shanghai AvanTech BioSciences Co., Ltd for their support and assistance in sample storage and management.