The Relationship between Deteriorating Mental Health Conditions and Longitudinal Behavioral Changes in Google and YouTube Usages among College Students in the United States during COVID-19: Observational Study ================================================================================================================================================================================================================= * Anis Zaman * Boyu Zhang * Ehsan Hoque * Vincent Silenzio * Henry Kautz ## Abstract **Background** Mental health problems among the global population are worsened during the coronavirus disease (COVID-19). Yet, current methods for screening mental health issues rely on in-person interviews, which can be expensive, time-consuming, blocked by social stigmas and quarantines. Meanwhile, how individuals engage with online platforms such as Google Search and YouTube undergoes drastic shifts due to COVID-19 and subsequent lockdowns. Such ubiquitous daily behaviors on online platforms have the potential to capture and correlate with clinically alarming deteriorations in mental health profiles of users in a non-invasive manner. **Objective** The goal of this study is to examine, among college students in the United States, the relationship between deteriorating mental health conditions and changes in user behaviors when engaging with Google Search and YouTube during COVID-19. **Methods** This study recruited a cohort of undergraduate students (N=49) from a U.S. college campus during January 2020 (prior to the pandemic) and measured the anxiety and depression levels of each participant. The anxiety level was assessed via the General Anxiety Disorder-7 (GAD-7). The depression level was assessed via the Patient Health Questionnaire-9 (PHQ-9). This study followed up with the same cohort during May 2020 (during the pandemic), and the anxiety and depression levels were assessed again. The longitudinal Google Search and YouTube history data of all participants were anonymized and collected. From individual-level Google Search and YouTube histories, we developed 5 signals that can quantify shifts in online behaviors during the pandemic. We then assessed the differences between groups with and without deteriorating mental health profiles in terms of these features. **Results** Of the 49 participants, 41% (n=20) of them reported a significant increase (increase in the PHQ-9 score ≥ 5) in depression, denoted as DEP; 45% (n=22) of them reported a significant increase (increase in the GAD-7 score ≥ 5) in anxiety, denoted as ANX. Of the 5 features proposed to quantify online behavior changes, statistical significances were found between the DEP and non-DEP groups for all of them (*P*≤.01, effect sizes ![Graphic][1] ranging between 0.130 to 0.320); statistical significances were found between the ANX and non-ANX groups for 4 of them (*P*≤.02, effect sizes ![Graphic][2] ranging between 0.115 to 0.231). Significant features included late-night online activities, continuous usages and time away from the internet, porn consumptions, and keywords associated with negative emotions, social activities, and personal affairs. **Conclusions** The results suggested strong discrepancies between college student groups with and without deteriorating mental health conditions in terms of behavioral changes in Google Search and YouTube usages during the COVID-19. Though further studies are required, our results demonstrated the feasibility of utilizing pervasive online data to establish non-invasive surveillance systems for mental health conditions that bypasses many disadvantages of existing screening methods. Keywords * mental health * anxiety * depression * Google Search * YouTube * pandemic * COVID-19 ## Introduction ### Background Globally, mental health problems such as depression, anxiety, and suicide ideations are severely worsened during the coronavirus disease (COVID-19) [1–3], specifically for college students [4,5–7]. Yet, current methods for screening mental health issues and identifying vulnerable individuals rely on in-person interviews. Such assessments can be expensive, time-consuming, and blocked by social stigmas, not to mention the reluctancy induced by travel restrictions and exposure risks. It has been reported that very few patients in need were correctly identified and received proper mental health treatments on time under the current healthcare system [8,9]. Even with emerging Telehealth technologies and online surveys, the screening requires patients to actively reach out to care providers. At the same time, because of the lockdown enforced by the global pandemic outbreak, people’s engagements with online platforms underwent notable changes, particularly in search engine trends [10–12], exposures to media reports [13,14], and through quotidian smartphone usages for COVID-19 information [5]. Reliance on the internet has significantly increased due to the overnight change in lifestyles, for example, working and remote learning, imposed by the pandemic on society. The sorts of content consumed, the time and duration spent online, and the purpose of online engagements may be influenced by COVID-19. Furthermore, the digital footprints left by online interactions may reveal information about these changes in user behaviors. Most importantly, such ubiquitous online footprints may provide useful signals of deteriorating mental health profiles of users during COVID-19. They may capture insights into what was going on in the mind of the user through a non-invasive manner, especially since Google and YouTube Searches are short and succinct and can be quite rich in providing the in the moment cognitive state of a person. On one hand, online engagements can cause fluctuations in mental health. On the other hand, having certain mental health conditions can cause certain types of online behaviors. This opens up possibilities for potential healthcare frameworks that leverage pervasive computing approaches to monitor mental health conditions and deliver interventions on-time. ### Prior Work Extensive researches have been conducted on a population level, correlating mental health problems with user behaviors on social platforms [15,16], especially among young adolescents. Researchers monitored Twitter to understand mental health profiles of the general population such as suicide ideations [17] and depressions [18]. Similar researches have been done with Reddit, where anxiety [19], suicide ideations [17], and other general disorders were studied [20,21]. Another popular public platform is Facebook, and experiments have been done studying anxiety, depression, body shaming, and stress online [22,23]. However, such studies were limited to macro observations and failed to identify individuals in need of mental health assistance. In addition, it has been shown that college student communities rely heavily on YouTube for both academic and entertainment purposes [24,25]. Yet, abundant usages may lead to compulsive YouTube engagements [26], and researchers have found that social anxiety is associated with YouTube consumptions in a complex way [27]. During COVID-19, multiple studies have reported deteriorating mental health conditions in various communities [1–3,28], such as nation-wise [29,30], across the healthcare industry [31,32], and among existing mental health patients [33]. Besides, online behaviors during COVID-19 have been explored, especially for web searches related to the pandemic [10–12] and abnormal TV consumptions during the lockdown [13]. Many of the behavioral studies also discussed the effects of online interactions on the spread, misinformation, knowledge, and protective measures of COVID-19, including the roles of YouTube [34–36] and other platforms [37]. [38] investigated hate speech targeting the Chinese and Asian communities on Twitter during COVID-19. Ubiquitous data has been proved to be useful in detecting mental health conditions. Mobile sensor data, such as GPS logs [39,40], electrodermal activity, sleep behavior, motion, and phone usage patterns [41,42] has been applied in investigating depressive symptoms. [43] found that individual private Google Search histories can be used to detect low self-esteem conditions among college students. [5] examined the longitudinal changes in mental health and smartphone usages through ecological momentary assessments (EMAs) during COVID-19 among college populations. Yet, none of the previous studies evaluated the relationship *between* individual online behaviors (Google Search and YouTube) and the deterioration in mental health conditions during COVID-19. ### Goal of This Study It has been shown that online platforms preserve useful information about the mental health conditions of users, and COVID-19 is jeopardizing the mental well-being of the global community. Thus, we demonstrate the richness of online engagement logs and how it can be leveraged to uncover alarming mental health conditions during COVID-19. In this study, we aim to examine whether the changes in user behaviors during COVID-19 have a relationship with deteriorating mental health profiles. We focus on Google Search and YouTube usages, and we investigate if the behavior shifts when engaging with these two platforms signify worsened mental health conditions. We hypothesize that late-night activities, compulsive and continuous usages, time away from online platforms, porn and news consumptions, and keywords related to health, social engagements, personal affairs, and negative emotions may play a role in deteriorating mental health conditions. The scope of the study covers undergraduate students in the U.S. We envision this project as a pilot study: it may lay a foundation for mental health surveillance and help delivery frameworks based on pervasive computing and ubiquitous online data. Compared to traditional interviews and surveys, such a non-invasive system may be cheaper, efficient, and avoid being blocked by social stigmas while notifying caregivers on-time about individuals at risk. ## Methods ### Recruitment and Study Design We recruited a cohort of undergraduate students, all of whom were at least 18 years old and have an active Google account for at least 2 years, from the University of Rochester River Campus, Rochester, NY, U.S.A. Participation was voluntary, and individuals had the option to opt-out of the study at any time, although we did not encounter any such cases. We collected individual-level longitudinal online data (Google Search and YouTube) in the form of private history logs from the participants. For every participant, we measured the depression and anxiety levels via the clinically validated Patient Health Questionnaire-9 (PHQ-9) and Generalized Anxiety Disorder-7 (GAD-7), respectively. Basic demographic information was also recorded. There were in total two rounds of data collection: the first round during January 2020 (prior to the pandemic) and the second round during May 2020 (during the pandemic). During each round, for each participant, the anxiety and depression scores were assessed, and the change in mental health conditions was calculated in the end. The entire individual online history data up untill the date of participation was also collected in both rounds from the participants. Figure 2 gives an illustration of the recruitment timeline and two rounds of data collections. All individuals participated in both rounds and were compensated with 10-dollar Amazon gift cards during each round of participation. Given the sensitivity and proprietary nature of private Google Search and YouTube histories, we leveraged the Google Takeout web interface [44] to share the data with the research team. Prior to any data cleaning and analysis, all sensitive information such as the name, email, phone number, social security number, and credit card information was automatically removed via the Data Loss Prevention (DLP) API [45] of Google Cloud. For online data and survey response storage, we utilized a HIPAA- compliant cloud-based secure storing pipeline. The whole study design, pipelines, and survey measurements involved were similar to our previous setup in [43] and have been approved by the Institutional Review Board (IRB) of the University of Rochester. ### Online Data Processing and Feature Extractions The Google Takeout platform enables users to share the entire private history logs associated with their Google accounts, and as long as the account of the user was logged in, all histories would be recorded regardless of which device the individual was using. Each activity in Google Search and YouTube engagement logs were timestamped, signifying when the activity happened to the precision of seconds. Besides, for each Google Search, the history log contained the query text input by the user. It also recorded the URL if the user directly input a website address to the search engine. For each YouTube video watched by the user, the history log contained the URL to the video. If the individual directly searched with keyword(s) on the YouTube platform, the history log also recorded the URL to the search results. In order to capture the change in online behaviors for the participants, we first introduced a set of features that quantifies certain aspects of how individuals interact with Google Search and YouTube. The set of features was calculated for each participant separately. Individual-level behavior changes were then obtained by examining the variations of the feature between January to mid-March of 2020 (prior to the outbreak) and mid-March to May of 2020 (after the outbreak). Concretely, we defined 5 features and cut the longitudinal data of each participant into two segments by mid-March, around the time of the COVID-19 outbreak in the U.S and campus lockdown. The two segments spanned 2.5 months before and after mid-March, respectively, and data before January 2020 was discarded. The same feature was extracted from both segments of data, and the change was calculated. Such change was referred to as the behavior shifts during the pandemic and lockdown. Figure 2 gives an illustration of data segmentations and feature engineering pipelines. #### Late Night Activities We defined late-night activities (LNA) as the activities happened between 10:00 P.M. and 5:00 A.M. of the next day, regardless of Google Search or YouTube. For each participant, we counted the numbers of late-night activities before (*LNAbefore*) and after the outbreak (*LNAbefore*), respectively. We then calculated the percentage change of late-night activities and used it as a behavior shift feature: *Equation 1. The percentage change of late nigh activities before and after the COVID-19 outbreak*. ![Formula][3] For the rest of the study, any mentioned percentage or relative changes of features were calculated the same way as above. #### Inactivity Periods We defined inactivity periods as the periods of time where no Google Search nor YouTube activity was performed. We set a threshold of 7 hours, and we identified all the inactivity periods that were longer than 7 hours for each participant from the online data log. Moreover, we looked at how these inactivity periods were distributed across 24 hours. We obtained the mid-point hour mark for each inactivity period: for example, an inactivity period started at 11 P.M. and ended at 7 A.M. has a mid-point of 3 A.M. With normalization, we received a discrete distribution of inactivity period midpoints over the 24-hour bins. It represented how the time away from Google Search and YouTube of an individual was distributed in a 24-hour period. Such distribution was calculated on the data segments before *(Qbefore*) and after *(Qafter*) the outbreak, respectively. Figure 1 showcases two normalized inactivity midpoint distributions before and after the outbreak: after the outbreak, most of the inactive periods of participant 1 shifted to later hours of the dawn, which was most likely to be a delay in bedtime; for participant 2, the morning inactivity moved earlier, and new inactive periods during the afternoon appeared after the outbreak. One possible explanation could be that participant 2 started to take naps after noon, resulting in midpoints around 5 P.M. ![Figure 1.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/08/29/2020.08.22.20178640/F1.medium.gif) [Figure 1.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/F1) Figure 1. The normalized inactivity midpoint distributions over 24 hours before and after the outbreak of COVID-19 of two example participants. The threshold for the inactivity is 7 hours. Brighter blocks indicate higher frequencies in that hour. To estimate the difference before and after the outbreak, we calculated the KL-divergence [46] between the two distributions for each participant: *Equation 2. The KL divergence of inactivity distributions before and after the COVID-19 outbreak*. ![Formula][4] The KL-divergence is strictly greater than or equals to 0, and it equals to 0 only when the two distributions are identical. #### Short Event Intervals We defined a short event interval (SEI) as the period of time that is less than 5 minutes between two adjacent events. It usually occurs when one is consuming several YouTube videos or searching for related content in a roll. We counted the total numbers of short event intervals for each participant before (*SEIbefore*) and after (*SEIafter*) the outbreak, respectively. We calculated the percentage change of SEI the same way as Equation 1 and used it as a behavioral feature. #### LIWC Attributes The Linguistic Inquiry and Word Count (LIWC) is a toolkit used to analyze various emotions, cognitive processes, social concerns, and psychological dimensions in a given text by counting the numbers of specific words [47]. It has been widely applied in researches involving social media and mental health. For the complete list of linguistic and psychological dimensions LIWC measures, see [47(pp3-4)]. We segmented the data log for each participant by mid-March as two blobs of texts and analyzed the words using LIWC: for Google Search, we input the raw query text; for YouTube, we input the video title. We considered the ‘Personal Concerns’, ‘Negative Emotion’, ‘Health/illness’, and ‘Social Words’ LIWC dimensions. LIWC categorized words associated with work, leisure, home, money, and religion as ‘Personal Concerns’. In the ‘Negative Emotion’ dimension, LIWC included words related to anxiety, anger, and sadness. Whereas, in the ‘Social Words’ dimension, LIWC included family, friends, and gender references. The LIWC output the count of words falling in each dimension among the whole text. We quantified the shift in behavior by calculating the percentage change of words in each dimension after the outbreak. #### Google Search and YouTube Categories We labeled each Google Search query with a category using the Google NLP API [48]. We utilized the official YouTube API to retrieve the information of videos watched by the participants, including the title, duration, number of likes and dislikes, and default YouTube category tags. For a comprehensive list of Google NLP category labels and default YouTube category tags, please refer to [49,50]. There were several categories overlapping with the LIWC dimensions, such as ‘Health’ and ‘Finance’, and we regarded the LIWC dimensions as a more well-studied standard. Instead, we focused on the number of activities belonging to the ‘Adult’ and ‘News’ categories, which were not presented in the LIWC. We calculated the relative changes of activities in these two categories as the behavior shifts for each participant, the same as Equation 1. ![Figure 2.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/08/29/2020.08.22.20178640/F2.medium.gif) [Figure 2.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/F2) Figure 2. The study recruitment procedure and feature engineering process. ### Measurement Outcomes #### Measurements for Changes in Online Behaviors There were in total 5 scalar continuous dependent variables measuring various aspects of the changes in online behavior for each participant, as defined above. These variables were extracted from two segments of the online data logs, namely the data before and after the pandemic outbreak. For the *Inactivity Periods*, the measurement was the KL-divergence between inactivity distributions. For the rest 4 behavioral features, the measurements were all in percentage changes. #### Measurements for Mental Health Conditions For both rounds of the data collection, anxiety levels were assessed using the GAD-7 survey, and depression levels were assessed using the PHQ-9 survey. With two rounds of surveys reported before and after the outbreak, the change in mental health conditions of each participant was obtained. According to [51,52], an increase greater than or equals to 5 in the GAD-7 score may be clinically alarming. Therefore, individuals with an increase ≥ 5 in GAD-7 scores were labeled as the ANX group; the rest were labeled as the non-ANX group. Similarly, as stated in [53], an increase greater than or equals to 5 in the PHQ-9 score may indicate the need for medical interventions. Hence, individuals with an increase ≥ 5 in PHQ-9 scores were labeled as the DEP group; the rest were labeled as the non-DEP group. #### Demographics and Covariates Besides the online data and mental health surveys, we also collected basic demographic information such as school year, gender, and nationality. ### Statistical Analysis Before any analysis of mental health conditions, in order to eliminate the possibility of annual confounding factors interfering with the shifts in online behaviors, two-tailed paired independent *t*-tests were performed. We inspected that, in terms of the five quantitative features, whether the online behavior changes happened every year, such as due to seasonal factors, or only during COVID-19 for the whole study population. As mentioned above, we collected the entire Google history log back to the registration date of the Google accounts of all participants. Thus, we computed the online behaviors changes in both 2020 and 2019 for all participants, spanning 2.5 months before and after the mid-March of each year. The behavior changes were dependent between 2020 and 2019 for the same participant. Viewing the cohort as a whole and measured twice, two-tailed paired independent *t*-tests were performed on all 5 behavior features. For the main experiment, chi-square tests were first performed to investigate the differences in demographics: school year, gender, and nationality. After that, analyses of covariance were conducted to explore the discrepancy between the DEP and non-DEP groups with each of the 5 online behavior features while controlling significant demographic covariates. The same was performed between the ANX and non-ANX groups. Notice that, in this observational study, the independent variable was the binary group, i.e., whether or not the individual had a significant increase in the GAD-7 (or PHQ-9) score. The dependent variables were the 5 behavior changes extracted from the longitudinal individual online data. Experiments were carried out in a one-on-one fashion: anxiety or depression condition was the single independent variable, and one of the 5 online behavior changes was the single dependent variable each time. Since multiple hypotheses were tested and some dependent variables might be moderately correlated, a Holm’s sequential Bonferroni procedure was performed with an original significance level a=0.05 to deal with the family-wise error rates. ## Results ### Study Population Statistics We recruited 49 (N=49) participants in total, and all of them participated in both rounds of the study (response rate=100%). On average, each participant made 2,446 (95% CI 2,120.22-2,481.75) Google Searches and 2,985 (95% CI 2,576.57-3,393.43) YouTube interactions from January to March 14th, and 2578 (95% CI 2,165.16-2,990.96) Google Searches and 3146 (95% CI 2,758.79-3,533.24) YouTube interactions from March 14th to the end of May. Of the 49 participants, 41% (n=20) of them reported an increase in the PHQ-9 score ≥ 5 (the DEP group); 45% (n=22) of them reported an increase in the GAD-7 score ≥ 5 (the ANX group). 37% (n=18) of the participants belonged to the ANX and DEP group simultaneously. Of the 49 participants, 61% (n=30) of the them were female; 35% (n=17) of the them were male; the rest 4% (n=2) reported non-binary genders. First and second-year students occupied 63% (n=31) of the whole cohort, and the rest were third and fourth-year students (n=24). 80% (n=39) of the participants were U.S. citizens, and the rest (n=10) were international students. A complete breakdown of demographics and group separations are given in Table 1. View this table: [Table 1.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/T1) Table 1. Demographics of the study population. ### Evaluation Outcomes The distributions of female participants were not well-stratified. 77% (n=17/22) of the ANX group were female while 48% (n=13/27) of the non-ANX group were female; 85% (n=17/20) of the DEP group were female while 45% (n=13/29) of the non-DEP group were female (Table 1). This observation among female students is consistent with the statistics reported in [4]. Chi-square tests showed that being female had a significant difference between the ANX and non-ANX group ![Graphic][5]; it also had a significant difference between the DEP and non-DEP group ![Graphic][6]. Meanwhile, being a U.S. citizen did not show a significant difference in deteriorating anxiety ![Graphic][7] nor depression ![Graphic][8]; being a lower-class student (first or second-year) did not show a significant difference in deteriorating anxiety ![Graphic][9] nor depression ![Graphic][10]. Thus, the gender factor was controlled for the rest of the study. The two-tailed paired independent *t*-tests mentioned at the beginning of Statistical Analysis was designed to rule out seasonal factors in online behavior changes but focus on COVID-19 before any of the main experiments, and they reported *P*<.001 for all 5 quantitative features. Hence, the presence of annual or seasonal factors accountable for online behavior changes was neglectable, and it was safe to carry out the following main experiment. This is consistent with one of the main conclusions in [5] that, when comparing the longitudinal data between different years, behaviors during COVID-19 shifted drastically. For each group (ANX, non-ANX, DEP, and non-DEP), the average percentage changes in *Late Night Activities, Short Event Intervals, LIWC Attributes*, and *Google Search and YouTube Categories* were all positive increases. ### Depression Group Analysis Analyses of covariance were performed to investigate the online behavior differences between the DEP and the non-DEP groups, ruling out the gender factor. We dummy-coded the categorical gender factor as a continuous covariate. For *Late Night Activities*, the DEP group (mean=9.70%, 95% CI 8.72%-10.68%) had a higher relative increase than the non-DEP group (mean=7.54%, 95% CI 6.59%-8.49%), and a significant difference was found (*P*=. 005, ![Graphic][11], *F*1,46=8.53). For *Inactivity Periods*, the DEP group (mean=0.86, 95% CI 0.65-1.06) had a lower divergence, i.e., fewer variations in how the time away from Google products was distributed in a day, than the non-DEP group (mean=1.32, 95% CI 1.20-1.44), and a significant difference was found (*P*<.001, ![Graphic][12], *F*1,46= 21.55). The DEP group (mean=18.46%, 95% CI 16.51%-20.40%) had more increase in *Short Event Intervals* than the non-DEP group (mean=14.62%, 95% CI 13.28%-15.96%), and a significant difference was found (*P*=.002, ![Graphic][13], *F*1,46=10.34). For the *LIWC Attributes*, the DEP group (mean=9.52%, 95% CI 8.36%-10.78%) had a higher relative increase in ‘Personal Concern’ keywords than the non-DEP group (mean=7.24%, 95% CI 6.50%-7.98%), and a significant difference was found (*P*=001, ![Graphic][14], *F*1,46=11.45). Similarly, for the prevalence of ‘Negative Words’ (*P*=.006, ![Graphic][15], *F*1,46= 8.28), the DEP group (mean=4.26%, 95% CI 3.62%-4.89%) increased more than the non-DEP group (mean=2.97%, 95% CI 2.52%-3.41%); for ‘Social Words’ (*P*=. 006, ![Graphic][16], *F*1,46= 8.22), the DEP group (mean=3.57%, 95% CI 2.55%-4.58%) increased less than the non-DEP group (mean=5.44%, 95% CI 4.76%-6.12%). ‘Health/illness’ (*P*=.69, ![Graphic][17], *F*1,46=0.17) did not show any significant group difference with a 95% CI. For *Google Search and YouTube Categories*, ‘Adult’ contents showed a significant difference (*P*=. 01, ![Graphic][18], *F*1,46=6.85): the DEP group (mean=13.45%, 95% CI 11.23%-15.68%) had a greater increasing consumption than the non-DEP group (mean=9.90%, 95% CI 8.46%-11.35%). ‘News’ contents did not show any significant group difference (*P*=.39, ![Graphic][19], *F*1,46= 0.75). Table 2 summarizes these findings in detail. Figure 3 shows the distributions of the percentage increases in online behavior features except for the inactivity divergence in the two groups. View this table: [Table 2.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/T2) Table 2. The means, SDs, and statistical test results for the DEP and non-DEP groups. ![Figure 3.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/08/29/2020.08.22.20178640/F3.medium.gif) [Figure 3.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/F3) Figure 3. Comparisons of the 4 online behavior changes measured in percentage increases, except the inactivity distributions, between the DEP (red) and the non-DEP (blue) groups. Boxes cover the 25th and 75th percentiles, and whiskers represent the range of the group. Horizontal solid lines in the boxes represent the medians. #### Anxiety Group Analysis Similar trends were found between the ANX and non-ANX groups, partially due to the overlapping with the DEP and non-DEP populations. For *Late Night Activities* (*P*=.001, ![Graphic][20], *F*1,46=13.85), the ANX group (mean=9.82%, 95% CI 9.04%- 10.61%) had a higher percentage increase than the non-ANX group (mean=7.28%, 95% CI 6.27%-8.29%). For *Inactivity Periods* (*P*=.01, ![Graphic][21], *F*1,46=7.19), the ANX group (mean=0.96, 95% CI 0.76-1.16) had a lower divergence, i.e., fewer alterations in the pattern of inactive periods in a 24-hour period, than the non-ANX group (mean=1.27, 95% CI 1.13-1.42). The ANX group (mean=18.09%, 95% CI 16.22%-19.95%) had more increase in *Short Event Intervals* than the non-ANX group (mean=14.64%, 95% CI13.21%-16.06%), and a significant difference was found (*P*=.007, ![Graphic][22], *F*1,46=8.05). For the *LIWC Attributes*, the ANX group (mean=9.15%, 95% CI 7.88%-10.41%) had a higher relative increase in ‘Personal Concern’ keywords than the non-ANX group (mean=7.37%, 95% CI 6.61%-8.14%), and this difference was statistically significant (*P*=.02, ![Graphic][23], *F*1,46=5.99). We found a similar result for ‘Negative Words’ (*P*=.01, ![Graphic][24], *F*1,46=6.59) where the ANX group (mean=4.12%, 95% CI 3.48%-4.76%) had higher usages than the non-ANX group (mean=2.98%, 95% CI 2.53%-3.43%). ‘Health/illness’ (*P*=.52, ![Graphic][25], *F*1,46=0.42) and ‘Social Words’ (*P*=.10, ![Graphic][26], *F*1,46=2.77) did not show any significant group difference with a 95% CI. For *Google Search and YouTube Categories*, neither ‘Adult’ (*P*=.25, ![Graphic][27], *F*1,46=1.33) nor ‘News’ (*P*=.71, ![Graphic][28], *F*1,46=0.14) content showed any significant group difference. For more details, see Table 3. Figure 4 shows the distributions of the percentage increases in online behavior features except for the inactivity divergence in the two groups. View this table: [Table 3.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/T3) Table 3. The means, SDs, and statistical test results for the ANX and non-ANX groups. ![Figure 4.](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/08/29/2020.08.22.20178640/F4.medium.gif) [Figure 4.](http://medrxiv.org/content/early/2020/08/29/2020.08.22.20178640/F4) Figure 4. Comparisons of the 4 online behavior changes measured in percentage increases, except the inactivity distributions, between theANX (yellow) and the non-ANX (green) groups. Boxes cover the 25th and 75th percentiles, and whiskers represent the range of the group. Horizontal solid lines in the boxes represent the medians. ## Discussion In this study, we collected longitudinal individual-level Google Search and YouTube data from college students, and we measured their anxiety (GAD-7) and depression (PHQ-9) levels before and after the outbreak of COVID-19. We then developed explainable features from the online data logs and quantified the online behavior shifts of the participants during the pandemic. We also calculated the change in mental health conditions for all participants. Our experiment examined the differences between groups with and without deteriorating mental health profiles in terms of these online behavior features. To the best of our knowledge, we are the first to conduct observational studies on how mental health problems and Google Search and YouTube usages of college students are related during COVID-19. ### Principal Results Our results showed significant differences between groups of college students with and without worsened mental health profiles in terms of online behavior changes during the pandemic. The features we developed based on online activities were all explainable and preserved certain levels of interpretability. For example, the *Short Event Intervals* and *Inactivity Periods* measured the consecutive usages and time away from Google Search and YouTube, which were inspired by previous studies on excessive YouTube usages [26], internet addictions [54], and positive associations with social anxiety among college students [27]. Our results indicated that individuals with meaningful increasing anxiety or depressive disorders during the pandemic tended to have long usage sessions (multiple consecutive activities with short time intervals) when engaging with Google Search and YouTube. Moreover, ANX and DEP individuals tended to maintain their regular time-away-from-internet patterns regardless of the lockdown as the KL-divergence was low. One possible reason could be that depressed people tend to spend more time at home as regular lifestyles [40,55], and thereby, after the lockdown, the living environment did not alter much. We further found that the majority of the inactivity periods longer than 7 hours had midpoints around 5 to 6 A.M. for all individuals, which were most likely to be the sleeping period. Well-established previous researches stated that depressed individuals have more disrupted sleeping patterns and less circadian lifestyles [40,56,57], but they are not validated for special periods such as COVID-19. We instead focused on comparing the distributions of time away from Google before and after the outbreak of COVID-19, and we had an emphasis on the behavior *changes* of groups with and without worsened mental disorders. Besides, the increase in *Late Night Activities* corresponded with previous studies in sleep deprivation and subsequent positive correlations with mental health deteriorations [58,59]. Our results demonstrated that individuals with significant worsened anxiety or depressive symptoms during the pandemic were indeed likely to stay up late and engage more online. The above three features captured the temporal aspects of user online behaviors, and they have shown statistically significant differences between groups. Additionally, our analysis found that there was a significant difference in the amount of adult and porn consumption between individuals with and without worsening depression, which adheres to previous findings that people suffering from depression and loneliness are likely to consume more pornographies [60,61]. For the LIWC features, ‘Personal Concern’ and ‘Negative Emotion’ keywords appeared more frequently among students in the ANX group, and previous research showed that negative YouTube videos tended to receive more attention from vulnerable individuals [62]. For the DEP group, ‘Social Words’ became less prevalent than the non-DEP group. This was consistent with studies on patterns of social withdrawal and depression [40,63,64], and social interactions and isolations have been recognized by [65] as one of the priorities in mental illness prevention, especially during COVID-19 [30]. These attributes captured the semantic aspect of user online behaviors. The prevalence of personal affair, social activity, and negative keywords as well as porn consumption have shown statistically significant differences between groups. Many researchers have reported that there has been a significant boost in health and news-related topics, at the population level, in various online platforms during COVID-19. This is partly due to additional measures taken by individuals, various stakeholders, and agencies with regards to preventive measures [11,35,36], daily statistics [10,12,13], and healthcare (mis)information [34,36,37], However, unlike many, our investigation was carried out considering individual-level Google Search and YouTube engagement logs, and our analysis did not reveal any significant spikes in ‘News’ and ‘Health/illness’ category between the groups of individuals with deteriorating anxiety and depression during the pandemic. One possible explanation for such observation can be due to the target population (college students) of our study who may prefer to follow news from other popular platforms such as social media. Finally, COVID-19 has shaken the foundation of human society and forced us to alter daily lifestyles. The world was not ready for such a viral outbreak. Since there is no cure for COVID-19, it, or an even more deadly viral disease, may resurface at different capacities in the near future. Society may be forced to rely on technologies even more and employ remote learning, working, and socializing for a longer period of time. It is important that we learn from our experience of living through the initial COVID-19 outbreak and take necessary measures to uncover the changes in online behaviors, investigating how that can be leveraged to understand and monitor various mental health conditions of individuals in the least invasive manner. Furthermore, we hope our work paves the path for technology stakeholders to consider incorporating various mental health assessment monitoring systems using user engagements, following users’ consent in a privacy-preserving manner. They can periodically share the mental health monitoring assessment report with respective users based on their online activities, education, and informing users about their current mental health. This can eventually encourage individuals to acknowledge the importance of mental health and take better care of themselves. ### Limitations First, while most of the online behavioral features we developed showed significant differences between groups of students with and without deteriorating anxiety and depressive disorders during COVID-19, our study cohort only represented a small portion of the whole population suffering from mental health difficulties. Therefore, further studies are required to investigate if the significant behavioral changes still hold among more general communities, not limiting to college students. Nonetheless, we argue that the explainable features we constructed, such as late-night activities, continuous usages, inactivity, pornography, and certain keywords, can remain behaviorally representative and be applied universally across experiments exploring the relationship between mental health and online activities during the pandemic. Second, in this work, we studied the relationship between user online behaviors and the fluctuations in mental health conditions during COVID-19. Any causal relationship between online behavior and mental disorders is beyond the scope of this work. As one can readily imagine, online behavioral changes could both contribute to or be caused by deteriorating anxiety or depressive disorders. Moreover, though we included preliminary demographic information as covariates, there remains the possibility of other confounding factors. In fact, both the shifts in online behaviors and deteriorating mental health profiles may be due to common factors such as living conditions, financial difficulties, and other health problems during the pandemic. Nor there was any causal direction implied between COVID-19 and online behavior changes, which was introduced in the first paragraph of Statistical Analysis as a precaution before the main experiments. ## Ethical and Privacy Concerns Albeit a pilot study, our results indicated that it is possible to build an anxiety and depression surveillance system based on passively collected private Google data histories during COVID-19. Such non-invasive systems shall be subject to rigorous data security and anonymity checks. Necessary measures need to be in place to ensure personal safety and privacy concerns when collecting sensitive and proprietary data such as Google Search logs and YouTube histories. Even in pilot studies, participants shall preserve full rights over their data: they may choose to opt-out of the study at any stage and remove any data shared in the system. Moreover, anonymity and systematic bias elimination shall be enforced. As an automatic medical screening system based on pervasive data, it has been extensively studied that such frameworks are prone to implicit machine learning bias during data collection or training phases [66–68]. Black-box methods should be avoided as they are known to be vulnerable to adversarial attacks and produce unexplainable distributional representations [69,70]. Anonymizing data and obscuring identity information should be the first step in data debiasing. In the end, to what extent should caregivers trust a clinical decision made by machines remains an open question. We believe that possible pervasive computing frameworks shall play the role of a smart assistant, at most, to the care providers. Any final intervention or help delivery decision should be made by healthcare professionals who understand both the mental health problems and the limitations of automatic detection systems in clinical settings. ## Data Availability Due to the IRB restriction, the data is not available for the general public. ## Conflicts of Interest Disclose any personal financial interests related to the subject matters discussed in the manuscript here. For example, authors who are owners or employees of Internet companies that market the services described in the manuscript will be disclosed here. If none, indicate with “none declared”. Abbreviations : ANX : the group of participants reported an increase in the GAD-7 score ≥ 5 COVID-19 : coronavirus disease DEP : the group of participants reported an increase in the PHQ-9 score ≥ 5 EMA : ecological momentary assessment LIWC : Linguistic Inquiry and Word Count LNA : Late night activity non-ANX : the group of participants did NOT report an increase in the GAD-7 score ≥ 5 non-DEP : the group of participants did NOT report an increase in the PHQ-9 score ≥ 5 SEI : Short event interval ## Acknowledgments This research was supported in part by grant W911NF-15-1-0542 and W911NF-19-1-0029 with the US Defense Advanced Research Projects Agency (DARPA) and the Army Research Office (ARO). We acknowledge the contributions by Michael Giardino, Adira Blumenthal, and Ariel Hirschhorn at the beginning phase of the project. ## Footnotes * Added ORCID for all authors. * Received August 22, 2020. * Revision received August 28, 2020. * Accepted August 29, 2020. * © 2020, Posted by Cold Spring Harbor Laboratory The copyright holder for this pre-print is the author. All rights reserved. The material may not be redistributed, re-used or adapted without the author's permission. ## References 1. 1.Torales J, O’Higgins M, Castaldelli-Maia JM, Ventriglio A. The outbreak of COVID-19 coronavirus and its impact on global mental health. Int J Soc Psychiatry. 2020;66(4):317-320. doi:10.1177/0020764020915212 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0020764020915212&link_type=DOI) 2. 2.Pfefferbaum B, North CS. Mental Health and the Covid-19 Pandemic. New England Journal of Medicine. 2020;():null. doi:10.1056/NEJMp2008017 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1056/NEJMp2008017&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32283003&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 3. 3.Rajkumar RP. COVID-19 and mental health: A review of the existing literature. Asian Journal of Psychiatry. 2020;52:102066. doi:10.1016/j.ajp.2020.102066 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajp.2020.102066&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 4. 4.Elmer T, Mepham K, Stadtfeld C. Students under lockdown: Comparisons of students’ social networks and mental health before and during the COVID-19 crisis in Switzerland. PLOS ONE. 2020;15(7):e0236337. doi:10.1371/journal.pone.0236337 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0236337&link_type=DOI) 5. 5.Huckins JF, DaSilva AW, Wang W, et al. Mental Health and Behavior of College Students During the Early Phases of the COVID-19 Pandemic: Longitudinal Smartphone and Ecological Momentary Assessment Study. Journal of Medical Internet Research. 2020;22(6):e20185. doi:10.2196/20185 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/20185&link_type=DOI) 6. 6.Liu X, Liu J, Zhong X. Psychological State of College Students During COVID-19 Epidemic. Social Science Research Network; 2020. doi:10.2139/ssrn.3552814 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2139/ssrn.3552814&link_type=DOI) 7. 7.W C, Z F, G H, et al. The psychological impact of the COVID-19 epidemic on college students in China. Psychiatry Res. 2020;287:112934-112934. doi:10.1016/j.psychres.2020.112934 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.psychres.2020.112934&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32229390&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 8. 8.Kessler RC, Chiu WT, Demler O, Walters EE. Prevalence, Severity, and Comorbidity of Twelve-month DSM-IV Disorders in the National Comorbidity Survey Replication (NCS-R). Arch Gen Psychiatry. 2005;62(6):617-627. doi:10.1001/archpsyc.62.6.617 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archpsyc.62.6.617&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15939839&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000229628400006&link_type=ISI) 9. 9.Wang PS, Berglund P, Olfson M, Pincus HA, Wells KB, Kessler RC. Failure and delay in initial treatment contact after first onset of mental disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry. 2005;62(6):603-613. doi:10.1001/archpsyc.62.6.603 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archpsyc.62.6.603&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=15939838&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000229628400004&link_type=ISI) 10. 10.Mavragani A. Tracking COVID-19 in Europe: Infodemiology Approach. JMIR Public Health and Surveillance. 2020;6(2):e18941. doi:10.2196/18941 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/18941&link_type=DOI) 11. 11.Walker A, Hopkins C, Surda P. Use of Google Trends to investigate loss-of-smell-related searches during the COVID-19 outbreak. International Forum of Allergy & Rhinology. 2020;10(7):839-847. doi:10.1002/alr.22580 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/alr.22580&link_type=DOI) 12. 12.Rovetta A, Bhagavathula AS. COVID-19-Related Web Search Behaviors and Infodemic Attitudes in Italy: Infodemiological Study. JMIR Public Health and Surveillance. 2020;6(2):e19374. doi:10.2196/19374 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/19374&link_type=DOI) 13. 13.Dixit A, Marthoenis M, Arafat SMY, Sharma P, Kar SK. Binge watching behavior during COVID 19 pandemic: A cross-sectional, cross-national online survey. Psychiatry Res. 2020;289:113089. doi:10.1016/j.psychres.2020.113089 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.psychres.2020.113089&link_type=DOI) 14. 14.Gao J, Zheng P, Jia Y, et al. Mental health problems and social media exposure during COVID-19 outbreak. PLOS ONE. 2020;15(4):e0231924. doi:10.1371/journal.pone.0231924 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0231924&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 15. 15.Seabrook EM, Kern ML, Rickard NS. Social Networking Sites, Depression, and Anxiety: A Systematic Review. JMIR Mental Health. 2016;3(4):e50. doi:10.2196/mental.5842 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/mental.5842&link_type=DOI) 16. 16.Lin L yi, Sidani JE, Shensa A, et al. Association Between Social Media Use and Depression Among U.s. Young Adults. Depression and Anxiety. 2016;33(4):323-331. doi:10.1002/da.22466 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/da.22466&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26783723&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 17. 17.De Choudhury M, Kiciman E, Dredze M, Coppersmith G, Kumar M. Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media. Proc SIGCHI Conf Hum Factor Comput Syst. 2016;2016:2098-2110. doi:10.1145/2858036.2858207 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1145/2858036.2858207&link_type=DOI) 18. 18.De Choudhury M, Counts S, Horvitz E. Social media as a measurement tool of depression in populations. In: Proceedings of the 5th Annual ACM Web Science Conference. WebSci ‘13. Association for Computing Machinery; 2013:47–56. doi:10.1145/2464464.2464480 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1145/2464464.2464480&link_type=DOI) 19. 19.Shen JH, Rudzicz F. Detecting Anxiety through Reddit. In: Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology — From Linguistic Signal to Clinical Reality. Association for Computational Linguistics; 2017:58–65. doi:10.18653/v1/W17-3107 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.18653/v1/W17-3107&link_type=DOI) 20. 20.Silveira Fraga B, Couto da Silva AP, Murai F. Online Social Networks in Health Care: A Study of Mental Disorders on Reddit. In: 2018IEEE/WIC/ACM International Conference on Web Intelligence (WI).; 2018:568-573. doi:10.1109/WI.2018.00-36 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/WI.2018.00-36&link_type=DOI) 21. 21.Gaur M, Kursuncu U, Alambo A, et al. “Let Me Tell You About Your Mental Health ! ”: Contextualized Classification of Reddit Posts to DSM-5 for Web-based Intervention. In: Proceedings of the 27th ACM International Conference on Information and Knowledge Management. CIKM ‘18. Association for Computing Machinery; 2018:753–762. doi:10.1145/3269206.3271732 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1145/3269206.3271732&link_type=DOI) 22. 22.Frost RL, Rickwood DJ. A systematic review of the mental health outcomes associated with Facebook use. Computers in Human Behavior. 2017;76:576-600. doi:10.1016/j.chb.2017.08.001 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chb.2017.08.001&link_type=DOI) 23. 23.Zhang R. The stress-buffering effect of self-disclosure on Facebook: An examination of stressful life events, social support, and mental health among college students. Computers in Human Behavior. 2017;75:527-537. doi:10.1016/j.chb.2017.05.043 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chb.2017.05.043&link_type=DOI) 24. 24.Fleck BKB, Beckman LM, Sterns JL, Hussey HD. YouTube in the Classroom: Helpful Tips and Student Perceptions. Journal of Effective Teaching. 2014;14(3):21-37. 25. 25.Moghavvemi S, Sulaiman A, Jaafar NI, Kasem N. Social media as a complementary learning tool for teaching and learning: The case of youtube. The International Journal of Management Education. 2018;16(1):37-42. doi:10.1016/j.ijme.2017.12.001 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ijme.2017.12.001&link_type=DOI) 26. 26.Klobas JE, McGill TJ, Moghavvemi S, Paramanathan T. Compulsive YouTube usage: A comparison of use motivation and personality effects. Computers in Human Behavior. 2018;87:129-139. doi:10.1016/j.chb.2018.05.038 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chb.2018.05.038&link_type=DOI) 27. 27.de Bérail P, Guillon M, Bungener C. The relations between YouTube addiction, social anxiety and parasocial relationships with YouTubers: A moderated-mediation model based on a cognitive-behavioral framework. Computers in Human Behavior. 2019;99:190-204. doi:10.1016/j.chb.2019.05.007 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chb.2019.05.007&link_type=DOI) 28. 28.Gunnell D, Appleby L, Arensman E, et al. Suicide risk and prevention during the COVID-19 pandemic. The Lancet Psychiatry. 2020;7(6):468-471. doi:10.1016/S2215-0366(20)30171-1 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2215-0366(20)30171-1&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=32330430&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 29. 29.Agteren J van, Bartholomaeus J, Fassnacht DB, et al. Using Internet-Based Psychological Measurement to Capture the Deteriorating Community Mental Health Profile During COVID-19: Observational Study. JMIR Mental Health. 2020;7(6):e20696. doi:10.2196/20696 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/20696&link_type=DOI) 30. 30.Holmes EA, O’Connor RC, Perry VH, et al. Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science. The Lancet Psychiatry. 2020;7(6):547-560. doi:10.1016/S2215-0366(20)30168-1 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2215-0366(20)30168-1&link_type=DOI) 31. 31.Greenberg N, Docherty M, Gnanapragasam S, Wessely S. Managing mental health challenges faced by healthcare workers during covid-19 pandemic. BMJ. 2020;368. doi:10.1136/bmj.m1211 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoiYm1qIjtzOjU6InJlc2lkIjtzOjE3OiIzNjgvbWFyMjZfNC9tMTIxMSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA4LzI5LzIwMjAuMDguMjIuMjAxNzg2NDAuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 32. 32.Chen Q, Liang M, Li Y, et al. Mental health care for medical staff in China during the COVID-19 outbreak. The Lancet Psychiatry. 2020;7(4):e15-e16. doi:10.1016/S2215-0366(20)30078-X [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2215-0366(20)30078-X&link_type=DOI) 33. 33.Yao H, Chen J-H, Xu Y-F. Patients with mental health disorders in the COVID-19 epidemic. The Lancet Psychiatry. 2020;7(4):e21. doi:10.1016/S2215-0366(20)30090-0 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2215-0366(20)30090-0&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 34. 34.Li HO-Y, Bailey A, Huynh D, Chan J. YouTube as a source of information on COVID-19: a pandemic of misinformation? BMJ Global Health. 2020;5(5):e002604. doi:10.1136/bmjgh-2020-002604 [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NToiYm1qZ2giO3M6NToicmVzaWQiO3M6MTE6IjUvNS9lMDAyNjA0IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMDgvMjkvMjAyMC4wOC4yMi4yMDE3ODY0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 35. 35.Basch CE, Basch CH, Hillyer GC, Jaime C. The Role of YouTube and the Entertainment Industry in Saving Lives by Educating and Mobilizing the Public to Adopt Behaviors for Community Mitigation of COVID-19: Successive Sampling Design Study. JMIR Public Health and Surveillance. 2020;6(2):e19145. doi:10.2196/19145 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/19145&link_type=DOI) 36. 36.Basch CH, Hillyer GC, Meleo-Erwin ZC, Jaime C, Mohlman J, Basch CE. Preventive Behaviors Conveyed on YouTube to Mitigate Transmission of COVID-19: Cross-Sectional Study. JMIR Public Health and Surveillance. 2020;6(2):e18807. doi:10.2196/18807 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/18807&link_type=DOI) 37. 37.Pennycook G, McPhetres J, Zhang Y, Lu JG, Rand DG. Fighting COVID-19 Misinformation on Social Media: Experimental Evidence for a Scalable Accuracy-Nudge Intervention. PsycholSci. 2020;31(7):770-780. doi:10.1177/0956797620939054 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0956797620939054&link_type=DOI) 38. 38.Lyu H, Chen L, Wang Y, Luo J. Sense and Sensibility: Characterizing Social Media Users Regarding the Use of Controversial Terms for COVID-19. IEEE Transactions on Big Data. Published online 2020:1-1. doi:10.1109/TBDATA.2020.2996401 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/TBDATA.2020.2996401&link_type=DOI) 39. 39.Saeb S, Lattie EG, Schueller SM, Kording KP, Mohr DC. The relationship between mobile phone location sensor data and depressive symptom severity. PeerJ. 2016;4:e2537. doi:10.7717/peerj.2537 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7717/peerj.2537&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 40. 40.Saeb S, Zhang M, Karr CJ, et al. Mobile Phone Sensor Correlates of Depressive Symptom Severity in Daily-Life Behavior: An Exploratory Study. J Med Internet Res. 2015;17(7):e175. doi:10.2196/jmir.4273 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/jmir.4273&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=26180009&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 41. 41.Ghandeharioun A, Fedor S, Sangermano L, et al. Objective assessment of depressive symptoms with machine learning and wearable sensors data. In: 2017 Seventh International Conference on Affective Computing and Intelligent Interaction (ACII).; 2017:325-332. doi:10.1109/ACII.2017.8273620 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1109/ACII.2017.8273620&link_type=DOI) 42. 42.Wang R, Wang W, daSilva A, et al. Tracking Depression Dynamics in College Students Using Mobile Phone and Wearable Sensing. Proc ACM Interact Mob Wearable Ubiquitous Technol. 2018;2(1):43:1–43:26. doi:10.1145/3191775 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1145/3191775&link_type=DOI) 43. 43.Zaman A, Acharyya R, Kautz H, Silenzio V. Detecting Low Self-Esteem in Youths from Web Search Data. In: The World Wide Web Conference on - WWW ’19. ACM Press; 2019:2270-2280. doi:10.1145/3308558.3313557 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1145/3308558.3313557&link_type=DOI) 44. 44.Google. Google Takeout. Accessed August 7, 2020. [https://takeout.google.com/](https://takeout.google.com/) 45. 45.Google. Cloud Data Loss Prevention. Accessed August 7, 2020. [https://cloud.google.com/dlp](https://cloud.google.com/dlp) 46. 46.Kullback S, Leibler RA. On Information and Sufficiency. Ann Math Statist. 1951;22(1):79-86. doi:10.1214/aoms/1177729694 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1214/aoms/1177729694&link_type=DOI) 47. 47.Pennebaker JW, Boyd RL, Jordan K, Blackburn K. The Development and Psychometric Properties of LIWC2015. Published online September 15, 2015. Accessed August 2, 2020. [https://repositories.lib.utexas.edu/handle/2152/31333](https://repositories.lib.utexas.edu/handle/2152/31333) 48. 48.Google. Classifying Content. Accessed August 7, 2020. [https://cloud.google.com/natural-language/docs/classifying-text](https://cloud.google.com/natural-language/docs/classifying-text) 49. 49.Google. Content Categories. Accessed August 7, 2020. [https://cloud.google.com/natural-language/docs/categories](https://cloud.google.com/natural-language/docs/categories) 50. 50.Google. VideoCategories. Accessed August 7, 2020. [https://developers.google.com/youtube/v3/docs/videoCategories](https://developers.google.com/youtube/v3/docs/videoCategories) 51. 51.Spitzer RL, Kroenke K, Williams JBW, Löwe B. A Brief Measure for Assessing Generalized Anxiety Disorder: The GAD-7. Arch Intern Med. 2006;166(10):1092-1097. doi:10.1001/archinte.166.10.1092 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/archinte.166.10.1092&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=16717171&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000237720500008&link_type=ISI) 52. 52.Rutter LA, Brown TA. Psychometric Properties of the Generalized Anxiety Disorder Scale-7 (GAD-7) in Outpatients with Anxiety and Mood Disorders. J Psychopathol Behav Assess. 2017;39(1):140-146. doi:10.1007/s10862-016-9571-9 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10862-016-9571-9&link_type=DOI) 53. 53.Kroenke K. Enhancing the clinical utility of depression screening. CMAJ. 2012;184(3):281-282. doi:10.1503/cmaj.112004 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiY21haiI7czo1OiJyZXNpZCI7czo5OiIxODQvMy8yODEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wOC8yOS8yMDIwLjA4LjIyLjIwMTc4NjQwLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 54. 54.Kuss DJ, Griffiths MD, Binder JF. Internet addiction in students: Prevalence and risk factors. Computers in Human Behavior. 2013;29(3):959-966. doi:10.1016/j.chb.2012.12.024 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.chb.2012.12.024&link_type=DOI) 55. 55.Arai A, Ishida K, Tomimori M, Katsumata Y, Grove JS, Tamashiro H. Association between lifestyle activity and depressed mood among home-dwelling older people: A community-based study in Japan. Aging & Mental Health. 2007;11(5):547-555. doi:10.1080/13607860601086553 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/13607860601086553&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17882593&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 56. 56.Kronfeld-Schor N, Einat H. Circadian rhythms and depression: human psychopathology and animal models. Neuropharmacology. 2012;62(1):101-114. doi:10.1016/j.neuropharm.2011.08.020 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neuropharm.2011.08.020&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21871466&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 57. 57.Raoux N, Benoit O, Dantchev N, et al. Circadian pattern of motor activity in major depressed patients undergoing antidepressant therapy: Relationship between actigraphic measures and clinical course. Psychiatry Research. 1994;52(1):85-98. doi:10.1016/0165-1781(94)90122-8 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/0165-1781(94)90122-8&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=8047624&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 58. 58.Tarokh L, Saletin JM, Carskadon MA. Sleep in adolescence: physiology, cognition and mental health. Neurosci Biobehav Rev. 2016;70:182-188. doi:10.1016/j.neubiorev.2016.08.008 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.neubiorev.2016.08.008&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=27531236&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 59. 59.Konjarski M, Murray G, Lee VV, Jackson ML. Reciprocal relationships between daily sleep and mood: A systematic review of naturalistic prospective studies. Sleep Med Rev. 2018;42:47-58. doi:10.1016/j.smrv.2018.05.005 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.smrv.2018.05.005&link_type=DOI) 60. 60.Willoughby BJ, Busby DM, Young-Petersen B. Understanding Associations between Personal Definitions of Pornography, Using Pornography, and Depression. Sex Res Soc Policy. 2019;16(3):342-356. doi:10.1007/s13178-018-0345-x [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s13178-018-0345-x&link_type=DOI) 61. 61.Yoder VC, III TBV, Amin K. Internet Pornography and Loneliness: An Association? Sexual Addiction & Compulsivity. 2005;12(1):19-44. doi:10.1080/10720160590933653 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/10720160590933653&link_type=DOI) 62. 62.Keelan J, Pavri-Garcia V, Tomlinson G, Wilson K. YouTube as a Source of Information on Immunization: A Content Analysis. JAMA. 2007;298(21):2482-2484. doi:10.1001/jama.298.21.2482 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jama.298.21.2482&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=18056901&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000251361900013&link_type=ISI) 63. 63.Girard JM, Cohn JF, Mahoor MH, Mavadati SM, Hammal Z, Rosenwald DP. Nonverbal social withdrawal in depression: Evidence from manual and automatic analyses. Image and Vision Computing. 2014;32(10):641-647. doi:10.1016/j.imavis.2013.12.007 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.imavis.2013.12.007&link_type=DOI) 64. 64.Katz SJ, Conway CC, Hammen CL, Brennan PA, Najman JM. Childhood Social Withdrawal, Interpersonal Impairment, and Young Adult Depression: A Mediational Model. J Abnorm Child Psychol. 2011;39(8):1227. doi:10.1007/s10802-011-9537-z [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10802-011-9537-z&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=21744059&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 65. 65.Leigh-Hunt N, Bagguley D, Bash K, et al. An overview of systematic reviews on the public health consequences of social isolation and loneliness. Public Health. 2017;152:157-171. doi:10.1016/j.puhe.2017.07.035 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.puhe.2017.07.035&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=28915435&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 66. 66.Challen R, Denny J, Pitt M, Gompels L, Edwards T, Tsaneva-Atanasova K. Artificial intelligence, bias and clinical safety. BMJ Qual Saf. 2019;28(3):231-237. doi:10.1136/bmjqs-2018-008370 [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6MzoicWhjIjtzOjU6InJlc2lkIjtzOjg6IjI4LzMvMjMxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMDgvMjkvMjAyMC4wOC4yMi4yMDE3ODY0MC5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 67. 67.Gianfrancesco MA, Tamang S, Yazdany J, Schmajuk G. Potential Biases in Machine Learning Algorithms Using Electronic Health Record Data. JAMA Intern Med. 2018;178(11):1544-1547. doi:10.1001/jamainternmed.2018.3763 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1001/jamainternmed.2018.3763&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30128552&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 68. 68.Wang W, Kiik M, Peek N, et al. A systematic review of machine learning models for predicting outcomes of stroke with structured data. PLOS ONE. 2020;15(6):e0234722. doi:10.1371/journal.pone.0234722 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0234722&link_type=DOI) [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F08%2F29%2F2020.08.22.20178640.atom) 69. 69.Kim H, Jung DC, Choi BW. Exploiting the Vulnerability of Deep Learning-Based Artificial Intelligence Models in Medical Imaging: Adversarial Attacks. J Korean Soc Radiol. 2019;80(2):259. doi:10.3348/jksr.2019.80.2.259 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.3348/jksr.2019.80.2.259&link_type=DOI) 70. 70.Ma X, Niu Y, Gu L, et al. Understanding adversarial attacks on deep learning based medical image analysis systems. Pattern Recognition. Published online May 1, 2020:107332. doi:10.1016/j.patcog.2020.107332 [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.patcog.2020.107332&link_type=DOI) [1]: /embed/inline-graphic-1.gif [2]: /embed/inline-graphic-2.gif [3]: /embed/graphic-1.gif [4]: /embed/graphic-3.gif [5]: /embed/inline-graphic-3.gif [6]: /embed/inline-graphic-4.gif [7]: /embed/inline-graphic-5.gif [8]: /embed/inline-graphic-6.gif [9]: /embed/inline-graphic-7.gif [10]: /embed/inline-graphic-8.gif [11]: /embed/inline-graphic-9.gif [12]: /embed/inline-graphic-10.gif [13]: /embed/inline-graphic-11.gif [14]: /embed/inline-graphic-12.gif [15]: /embed/inline-graphic-13.gif [16]: /embed/inline-graphic-14.gif [17]: /embed/inline-graphic-15.gif [18]: /embed/inline-graphic-16.gif [19]: /embed/inline-graphic-17.gif [20]: /embed/inline-graphic-18.gif [21]: /embed/inline-graphic-19.gif [22]: /embed/inline-graphic-20.gif [23]: /embed/inline-graphic-21.gif [24]: /embed/inline-graphic-22.gif [25]: /embed/inline-graphic-23.gif [26]: /embed/inline-graphic-24.gif [27]: /embed/inline-graphic-25.gif [28]: /embed/inline-graphic-26.gif