Cohort Identification Using Semantic Web Technologies: Triplestores as Engines for Complex Computable Phenotyping

Emily R. Pfaff; Robert Bradford; Marshall Clark; James P. Balhoff; Rujin Wang; John S. Preisser; Kellie Walters; Matthew E. Nielsen

doi:10.1101/2021.12.02.21267186

ABSTRACT

Background Computable phenotypes are increasingly important tools for patient cohort identification. As part of a study of risk of chronic opioid use after surgery, we used a Resource Description Framework (RDF) triplestore as our computable phenotyping platform, hypothesizing that the unique affordances of triplestores may aid in making complex computable phenotypes more interoperable and reproducible than traditional relational database queries.

To identify and model risk for new chronic opioid users post-surgery, we loaded several heterogeneous data sources into a Blazegraph triplestore: (1) electronic health record data; (2) claims data; (3) American Community Survey data; and (4) Centers for Disease Control Social Vulnerability Index, opioid prescription rate, and drug poisoning rate data. We then ran a series of queries to execute each of the rules in our “new chronic opioid user” phenotype definition to ultimately arrive at our qualifying cohort.

Results Of the 4,163 patients in the denominator, our computable phenotype identified 248 patients as new chronic opioid users after their index surgical procedure. After validation against charts, 228 of the 248 were revealed to be true positive cases, giving our phenotype a PPV of 0.92.

Conclusion We successfully used the triplestore to execute the new chronic opioid user phenotype logic, and in doing so noted some advantages of the triplestore in terms of schemalessness, interoperability, and reproducibility. Future work will use the triplestore to create the planned risk model and leverage the additional links with ontologies, and ontological reasoning.

BACKGROUND

Electronic health record (EHR)-based computable phenotypes (hereafter “computable phenotypes”) are algorithms used to query “data captured in the delivery of healthcare… to identify individuals or populations (i.e., cohorts) with conditions or events relevant to” some use case.(1) An increasing number of clinical studies rely upon one or more computable phenotyping methods to define cohorts of interest.(2)

Despite its importance, computable phenotyping suffers from a number of inefficiencies and pitfalls. Most methods rely solely upon the data available in the EHR, which is imperfect, suffers from missingness, may not represent the “true” patient state, and can be semantically ambiguous.(1) Another major barrier to achieving robust, sharable, machine readable computable phenotypes is the way in which most healthcare institutions choose to structure and store their EHR data—in siloed, relational data warehouses, disconnected from other potentially relevant data sources and structured according to a custom or vendor-specific data model.

In this study, we describe our efforts to address some of these pitfalls as we executed a computable phenotype to identify previously opioid-naïve patients who became new chronic opioid users after a surgical procedure. This research question had complex data needs, with both an intricate phenotype requiring access to both EHR and claims data, as well as a need to link in community-level data for use in the eventual statistical model. Considering this complexity, we wanted to test whether this computable phenotype could be successfully run with SPARQL queries against a resource description framework (RDF) triplestore, rather than our standard method of running SQL queries against a relational database. In addition to determining whether the triplestore-driven phenotyping and data linkage approach was indeed feasible, we also sought to determine whether this approach is actually advantageous for certain complex phenotyping and linkage efforts.

Computable Phenotyping

In the years following the 2009 HITECH Act, which incentivized and greatly increased usage of EHRs in American health care institutions, computable phenotyping has become a common mechanism to efficiently identify patient cohorts for clinical operations, quality improvement, and research purposes. The ideal computable phenotype is shareable, data model agnostic, semantically unambiguous, machine readable, and publicly accessible. This ideal has not yet been achieved at scale, but some version of this ideal has been advocated by many.(1,3–7)

However, there are many caveats to consider when using EHR data to define populations. The most essential point is that “the EHR is not a direct reflection of the patient and physiology, but a reflection of the recording process inherent in healthcare with noise and feedback loops.”(8) As such, important data can be missing or inaccurately recorded, the data may not reflect the full complexity of a patient’s condition, and the data may be biased toward high utilizers (i.e., sicker patients will generate more data).(4,8,9) Moreover, the context in which EHR data is collected, which can either be clinical or administrative, has a major impact on the meaning of the collected data.(1,10) EHR-driven computable phenotyping, then, requires acknowledgment and understanding of these caveats in balance with its power and efficiency as a method, and may benefit from the inclusion of complementary, non-EHR data sources (such as claims or community data) to fill in some of the gaps.

Triplestores for Storing and Analyzing Clinical Data

A triplestore is the storage mechanism for data formatted in RDF. RDF is a W3C standard for data exchange on the web, represents data as a graph, and is heavily intertwined with the concept of the semantic web. Triplestores store data as triples, or statements in the format subject – predicate – object. (E.g., Michael Strauss (subject) is a (predicate) physician (object)). This restrictive format is at the same time quite flexible in its simplicity, as a wide variety of information can be encoded as triples. The rigidity of the triple format, however, is what allows the data contained in triples to be understood and consumed by different agents across the web—in other words, enabling interoperability. A number of prior studies have used triplestores for storing and analyzing EHR data, with a variety of rationales for doing so,(11–21) though this approach appears to exist outside the computable phenotyping mainstream. Some of the specific affordances of triplestores for clinical data are listed below.

Linking with Ontologies and Terminologies

Triplestores are a natural fit for working with ontologies and terminologies, many of which are published in RDF or Web Ontology Language (OWL) format and can be easily ingested by a triplestore. An ontology is a systematic representation of knowledge; a way to consistently classify and define data across users, databases, institutions, and nations. Terminologies are controlled vocabularies in a given domain, such as ICD (the International Statistical Classification of Diseases and Related Health Problems) for diagnoses and procedures, or LOINC (Logical Observation Identifiers Names and Codes) for laboratory tests and results. Ontologies and terminologies provide common terms and definitions for, in this case, clinical concepts. Ontologies also allow for computational logical reasoning on the relationships between concepts, which is not the case with terminologies.(22)

As data get larger in volume and more heterogeneous in structure, use of ontologies and terminologies to label data is increasingly important to ascribe valid, consistent meaning to data points across disparate data sources. Moreover, mapping clinical data to an ontology helps to promote interoperability and complex querying based on logical reasoning.

Schemalessness

Schemalessness, or tolerance of loosely modeled data, is a hallmark feature of triplestores and all “Not Only SQL,” or NoSQL, database technologies. Because clinical data is dynamic, changing frequently with new discoveries, new regulations, and advancements in EHRs, an adaptable data model is highly advantageous. Moreover, tolerance of heterogeneity makes it easier to link EHR data with external data sources (e.g., genomic data, geographically-based data, sensor data), which have their own structures distinct from that of the clinical data. In contrast, altering the data model of a relational database can come at a high cost, either requiring adding columns to existing tables (and creating sparsity within existing tuples) or adding new tables.

It is worth pointing out that having no data model at all would be as disadvantageous as having too rigid a model. With no model, data validity suffers (e.g., with three different ways to store blood pressure measurements, critical data could be easily missed in a query), and interoperability would be impossible. To achieve balance, one can use a standard clinical data model (e.g., openEHR, HL7, etc.) in tandem with a NoSQL database. Such models are specifically designed to accommodate changing requirements, and fit nicely within different NoSQL technologies. This approach allows a database to be technically schemaless, but with guardrails.

Modeling Highly Related Data

Triplestores also excel at storing and querying “highly related” data. This includes hierarchical data, where one needs to retrieve, for example, the parent of the parent of the parent of a given concept. This may also include situations where the relationship between data is as important to model and expand upon as the data itself, such as a relationship between a gene and a disease that changes in the presence of certain drugs, environmental exposures, or other gene variants. Triplestores (and other graph databases) excel in this area specifically, due to their ability to materialize relationships as queryable concepts. Clinical data is relationship dense—diseases have causes, medications have side-effects, treatments have outcomes—and may be particularly well-suited to the graph structure for this reason.

Each of these affordances is a rationale behind our decision to use a triplestore to execute a particularly complex computable phenotype—one that required linkage of heterogeneous data sources and semantic harmonization across different datasets. Though our use case does not take full advantage of all of these capabilities, it serves as a starting point for more elaborate phenotyping use cases in the future.

METHODS

The study motivating our triplestore-driven phenotyping approach aims to devise a model to determine a previously opioid-naïve patient’s risk of becoming a new chronic opioid user after a surgical procedure resulting in an opioid prescription for post-operative pain. Before beginning work on the risk model, it was first necessary to define the computable phenotype to identify existing postsurgical new chronic opioid users, whose shared characteristics would enable us to define the risk model for future patients.

The data sources used for both the phenotyping and the risk modeling effort included (1) UNC Health Care System EHR data; (2) claims data from a major commercial insurer in the Southeast; (3) American Community Survey (ACS) data (2012-2016);(23) and (4) CDC Social Vulnerability Index (SVI),(24) opioid prescription rate,(25) and drug poisoning rate data (all 2016).(26)

Phenotype Definition

The criteria used to define new postsurgical chronic opioid users (adapted by author MN based on a prior study(27)) depends upon first applying criteria to the base population of surgical patients so that those with recent history of opioid use or prior opioid use disorder are excluded. These criteria are:

Patient has one CPT (procedure) code in the range 10000 – 69990 (the “surgery” range) between 4/14/2014 and 3/31/2017.
Patient does NOT have a second CPT code in the surgery range within six months (after) the index surgery.
Patient has at least one opioid prescription between 30 days prior to the index surgery and 14 days after the index surgery.
Patient does NOT have an opioid prescription between 1 year prior and 30 days prior to the index surgery (i.e., the patient was “opioid naïve” before surgery).
Patient does not have a diagnosis code for “opioid use disorder” (ICD-9: 304.0X, 304.7X; ICD-10: F11.X) prior to the index surgery.
Patient had insurance with the commercial insurer from whom we obtained claims data at the time of the index surgery. (Not relevant to the patient’s status as a chronic opioid user, but necessary in this study to ensure linkage with available claims data.)

Among the resulting set of previously opioid-naïve patients, we define new chronic opioid users as:

Patient has at least one opioid prescription between 90 and 180 days after the index surgery (suggesting that they are still using opioids well after the surgery at which they were originally prescribed).

This rule-based computable phenotype is possible to implement with SQL code against EHR data— however, stopping at this single source of data would result in a skewed picture. This is because medication data is such an essential part of this phenotype—and as such, we care not only about medications that were prescribed by UNC physicians, but any medications that the patient may have received, from anywhere. In order to make use of non-UNC medication data, we needed to incorporate insurance claims data, which allowed us to work with a superset of patient medication data—all medications billed to the patient’s insurance, regardless of prescriber. Merging claims-based medication data with EHR-based medication data introduced one level of structural heterogeneity.

To add further heterogeneity, we also recognized the need to incorporate community-level data sources in order to access socioeconomic data about the cohort for use in the risk model. While publicly available, these data sources are not available within UNC’s clinical data warehouse, and would have to be downloaded and modeled appropriately for incorporation with the EHR and claims data.

Considering the challenges of incorporating these disparate data sources for unified analysis, as well as the affordances described earlier, our team decided on a triplestore approach.

Data Transformation Pipeline

As the three data sources are highly heterogeneous, we developed a transformation pipeline to take each data source from its raw form to RDF. This pipeline is illustrated in Figure 1, and described in detail in the text below.

Figure 1.

Data transformation pipeline to take EHR, claims, and geographically based datasets from their native formats to RDF, and integrate them in a Blazegraph triplestore.

EHR Data

UNC’s EHR data is sourced from the Carolina Data Warehouse for Health (CDWH), UNC Health’s enterprise clinical data warehouse. An analyst was tasked with extracting a cohort meeting the specifications of the rule-based computable phenotype listed above, with the exception of the last rule. (As we wanted to compare patients who did and did not become new chronic opioid users for the risk model, we did not want to limit the cohort only to new chronic users at this point.) Because of the limitations of the EHR data, the resulting cohort was known to contain false positives (i.e., patients who qualified for the cohort using EHR data only, but would later be excluded based on claims data).

We extracted EHR data for this cohort in the domains of patient demographics, encounter details, diagnoses, procedures, medications, smoking status, and pain scores. To convert the extracted data to HL7 Fast Healthcare Interoperability Resources (FHIR) format (version 3.0.1) we used CAMP FHIR,(28,29) an open-source data conversion utility built to convert EHR data to FHIR format. Once the data were in FHIR format, we were able to use an open-source FHIR-to-RDF conversion utility(30) to output the data in RDF and ready it for loading into the triplestore.

Claims Data

We produced a finder file containing identifiers for all of the patients identified from the EHR data, which an analyst used to perform a deterministic linkage with the claims cohort. Matching claims data were extracted in the domains of plan membership details, medications, diagnoses, and procedures. The latter three domains were converted to FHIR format using a Python script, and then converted to RDF using the same conversion utility as the EHR data. The plan membership data were manually converted to RDF using R,(31) as it did not fit cleanly within the FHIR specification.

Community Data

Though non-clinical data was not required to run the computable phenotype to identify existing chronic users, the research team did want to include socioeconomic and other geographically-based factors in the eventual risk model. The chosen datasets (ACS and CDC SVI, opioid prescribing rate, and drug poisoning rate data) are all publicly available, but do not have a unified structure. For the purpose of querying these data within our triplestore, we minted custom URIs to represent the predicates required to represent the structure of the data (e.g., livesInTract).

Other Data

Four additional datasets were loaded in order to aid querying and make connections between datasets in the triplestore. First was a patient-to-census-tract crosswalk, extracted from the EHR. Second, we loaded a census-tract-to-county crosswalk, to enable linkage with both tract-based data (e.g., ACS) and county-based data (e.g., CDC opioid prescribing rates).

Third, in order to consistently query medication data across EHR and claims data, we loaded a crosswalk between National Drug Codes (NDC) and RxNorm codes. Claims data uses NDC codes, which are representative of the exact medication dispensed by a pharmacy for a submitted claim, while UNC’s EHR medications are RxNorm coded. Using FHIR, it is possible to tie multiple codes to a single Medication resource where they share a common meaning. To accomplish this, we used a Python program called SMOREs(32) to translate between NDC and RxNorm and stored both values within the same Medication resource that was subsequently converted to RDF.

Finally, we loaded a subset of the Department of Veterans Affairs National Drug File (VANDF), crosswalked to RxNorm codes. This was loaded to make use of VANDF’s categorization scheme for medications in our queries, allowing us to query by high-level medication categories (opioids, benzodiazepines) rather than matching on hundreds of individual RxNorm codes.

After each data source (EHR, claims, community, other) was converted to RDF, we loaded each RDF file into a Blazegraph(33) triplestore.

Phenotyping with SPARQL

We wrote a series of SPARQL queries to execute the same rule-based logic as the original SQL queries that extracted the EHR-based cohort. Because the data were now integrated, running the same logic a second time could (and did) exclude patients who had originally qualified for the EHR cohort. Examples of such situations included patients who didn’t have a disqualifying pre-surgery opioid prescription in UNC’s EHR data, but did have one (or more) reflected in the claims data, and patients who only had one surgical procedure in a six-month period in UNC’s EHR data, but whose claims data showed an additional (disqualifying) procedure at another institution.

A final SPARQL query was written to implement the final rule of the phenotype (the presence of an opioid prescription between 90 and 180 days after the index surgery), which would define our outcome variable for each patient in the cohort: CHRONIC_USER_YN.

RESULTS

The number of triples loaded into Blazegraph from each data domain are listed in Table 1.

View this table:

Table 1.

All data domains included in the Blazegraph triplestore, with number of triples.

We initially loaded 15,954 patients that qualified as previously opioid-naïve patients with a qualifying surgical procedure, executed against EHR data only. Once the matching claims data were loaded and the computable phenotype (except for the final rule) was re-executed over the integrated data using SPARQL, the number of qualifying patients dropped to 4,163. The reasons for this drop are illustrated in Figure 2.

Figure 2.

Illustration of cohort narrowing as phenotype rules were applied in the triplestore.

Of the 4,163 patients in the final cohort, 248 patients were identified as new chronic opioid users after their index surgical procedure. Patient identifiers for this group were given to two clinicians to manually chart review in order to validate the actual presence of new chronic opioid use in the patient’s record. Of the 248 patients identified by the phenotype, 228 were found to be true new chronic users through chart review. The remaining 20 patients qualified for the computable phenotype, but upon review of their charts, the procedure defined as their index surgery (which qualified them for the cohort) appeared not to have truly been a “surgery,” despite being coded in the surgical range. One possible explanation for this is that the CPT “surgery” range of 10000 through 69900 is very broad, and may have resulted in the computable phenotype considering certain events as “surgeries” when a clinician would not see them as such. Based on these results, this computable phenotype has a positive predictive value of 0.92 for identifying new chronic opioid users after surgery.

Of the 248 patients identified as new chronic opioid users, 16 were identified by EHR data alone, 39 were identified by both EHR and claims data, and 193 were identified by claims data alone. (“Alone,” in this case, means that while the patient had data in both the EHR and claims, the opioid prescription(s) that qualified them as a new chronic user between 90 and 180 days from their index surgery] was only present in one of the two sources.)

4,121 of the 4,163 patients were able to be linked with a census tract and county, based on their address recorded in the EHR as of March 31, 2017 (the end of the study period). This enables us to connect these patients with a selection of socioeconomic variables (including Social Vulnerability Index, drug poisoning rate, and unemployment rate) for potential inclusion in the new chronic opioid user risk model.

DISCUSSION

Primary Results

Our main finding is that triplestores are an effective platform for complex computable phenotyping, enabling cross-querying among heterogeneous datasets and offering some functionality beyond that of a relational database. For this particular phenotyping project, combining claims data with EHR data was particularly advantageous, allowing us to identify many more patients qualifying for the phenotype than using EHR data alone. Converting the EHR data and the claims data to a common format (FHIR, then RDF) and loading it in a triplestore made integration and cross-querying very straightforward, essentially like querying a single source of data.

It is worth noting that for a dataset of this size and scope, use of this triplestore-driven approach was not a requirement. Rather, we view this work as a successful proof of concept for future work (detailed below) involving phenotypes that require (or could benefit from) the inclusion of much larger and varied ontologies and datasets (e.g., the Human Phenotype Ontology, the entirety of SNOMED CT, the Gene Ontology, etc.).

Advantages of the Triplestore Approach

Schemalessness

The triplestore’s schemalessness proved particularly valuable in the context of the community datasets. Beyond converting from CSV to RDF format, almost no transformation was needed to prepare these data sources for ingestion in the triplestore. Should we have found additional publicly available data sources that we wanted to use, it would have taken very little effort to incorporate them. This capability holds a lot of promise for future studies, particularly due to the recognition of the importance of social determinants of health variables, which are often absent in EHR and claims data, in clinical research.(34–38)

Interoperability and Reproducibility

The FHIR representation standard is becoming increasingly supported by major EHR vendors—in effect, forcing institutions to standardize their data. In addition to defining specific resources and fields, FHIR enforces the use of established code sets (e.g., LOINC, SNOMED CT, ICD-9/-10) or FHIR-specific value sets in many of its fields in order to maximize standardization. However, as FHIR data exists in flat files (JSON- or XML-formatted), it is not inherently “queryable” unless additional processing steps are taken. In this case, loading FHIR data in a triplestore enables its use for computable phenotyping.

The interoperability benefits of FHIR, compared with the interoperability (by definition) of semantic web technologies, enabled us to create a phenotype that should be transferrable to any other site with few local changes, so long as the institution can output EHR data in FHIR format. (This particular study offered a proof of this concept, in that FHIR enabled us to make EHR and claims data interoperable despite their native formats being completely different.) As FHIR moves toward becoming a lingua franca for EHR data, FHIR-based phenotypes such may gain reproducibility advantages over phenotypes that run against institution-, data network- or vendor-specific data models.

Limitations and Lessons Learned

Because the FHIR format creates an enormous number of triples (as shown in Table 1), it would seem to be a best practice to narrow down the initial cohort as much as possible before transformation to RDF. It would be less feasible to try to persist an entire clinical data warehouse in RDF. Similarly, within a given data domain (particularly a large one, like Condition), it would be more efficient to pare down the set of diagnoses to only those needed for the given research question prior to loading. With that said, if a study’s objective is less targeted and more exploratory, this may not be desirable. For such a use case, sufficient resources to load and compute over all those triples would be the primary concern.

Additionally, as with any use case that involves data transformation, there is always a chance that information can be lost over the course of the transformation pipeline. For this reason, we took the time to validate our final cohort against the source data system (the EHR), and would advocate doing so for future cohorts identified in this way.

Future Work

Now that the computable phenotype has been executed, statistical analysis has commenced in order to establish a risk model that can be applied to patients de novo. This work will occur outside of the triplestore. In parallel with this effort, we identified several new directions to take the triplestore phenotyping methodology that will make this framework valuable for future use cases.

We see great potential in leveraging additional ontologies (other than the medication ontologies used in this study) in future work. Ontologies allow users to work with information at both the “data level” and the “knowledge level.”(13) As data at the knowledge level is not often stored in the EHR itself, the ability to link to external sources containing that knowledge adds richness to a clinical data store.(15,18,39)

“OWL reasoning,” another capability afforded by RDF, is an extension of this knowledge level, in that new assertions can be concluded based on existing assertions in the triplestore.(21,40,41) For this project specifically, we used a small subset of VANDF to query medications at the knowledge level to enable easier querying (e.g., querying by rolled-up categories rather than hundreds of individual medication codes). With this success in mind, we believe that linking with ontologies and leveraging OWL reasoning could potentially reveal new hypotheses to explore, via patterns in the data that may be hidden at the individual medication code level.

Use of standard ontologies also can enable linkage with other external datasets, in addition to the community data we used here. Additional external data sources may include one or more of the datasets available on the web as Linked Open Data, or LOD, an open data repository expressed in triples. Many of the datasets available as LOD are biomedical in nature, and could allow one to link EHR data with public data to discover, for example, gene-disease associations, genotype-phenotype associations, or drug-drug interactions.(11,12,14,18)

CONCLUSION

Though computable phenotypes are written, executed, and shared with great frequency today, computable phenotyping as a discipline has not yet reached the ideal of consistently producing phenotypes that are interoperable, reproducible, and capable of integrating heterogeneous data sources. Some of these challenges may be addressed by exploring alternatives to siloed, relational data warehouses for storage and analysis of clinical data. Semantic web technologies may be such an alternative.

Data Availability

The electronic health record and claims data analyzed in this study are not publicly available due to restrictions of the Health Insurance Portability and Accountability Act (HIPAA). American Community Survey data (2012-2016) is available from the US Census Bureau at https://www.census.gov/acs/www/data/data-tables-and-tools/data-profiles/2016. Social vulnerability index data (2016) are available from the Centers for Disease Control at https://svi.cdc.gov/data-and-tools-download.html. US county opioid prescribing and drug poisoning rate data (2016) are available from the Centers for Disease Control at https://www.cdc.gov/drugoverdose/maps/rxcounty2016.html and https://data.cdc.gov/NCHS/NCHS-Drug-Poisoning-Mortality-by-County-United-Sta/pbkm-d27e, respectively.

https://www.census.gov/acs/www/data/data-tables-and-tools/data-profiles/2016

https://www.cdc.gov/drugoverdose/maps/rxcounty2016.html

https://svi.cdc.gov/data-and-tools-download.html

https://data.cdc.gov/NCHS/NCHS-Drug-Poisoning-Mortality-by-County-United-Sta/pbkm-d27e

DECLARATIONS

Ethics

The work described was approved by the Institutional Review Board at UNC Chapel Hill (#18-1154).

Consent to Publication

Not applicable.

Availability of Data and Material

Competing Interests

The authors have no competing interests to disclose.

Funding

The project described was supported by the National Center for Advancing Translational Sciences (NCATS), National Institutes of Health, through Grant Award Number UL1TR002489. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH. The database infrastructure used for claims data portion of this project was funded by the Cecil G. Sheps Center for Health Services Research; the Department of Health Policy and Management, UNC Gillings School of Global Public Health; the CER Strategic Initiative of UNC’s Clinical and Translational Science Award (UL1TR002489); and the UNC School of Medicine.

Author contributions

E.R.P. drafted the initial manuscript. M.E.N. defined the phenotype and provided clinical expertise during analysts. E.R.P., R.B., and M.C. performed data cleaning, transformation, and analysis. J.P.B. provided ontological and technical expertise during analysis. R.W. and J.S.P. provided statistical consulting and analysis. K.W. selected appropriate SES variables and performed literature review. All authors edited the manuscript draft and approved the final content.

Acknowledgements

The authors thank Charan Mohan, Mark Ehlers, and Nessim Abu-Saif for performing manual chart reviews to assess phenotype performance, Adam Lee for serving as an EHR honest broker, Lily Wang for serving as a claims honest broker, and James Champion for his assistance with implementing CAMP FHIR as part of our data transformation pipeline.

LIST OF ABBREVIATIONS

ACS: American Community Survey
CDC: Centers for Disease Control and Prevention
CDWH: Carolina Data Warehouse for Health
CPT: Current Procedural Terminology
EHR: Electronic health record(s)
FHIR: Fast Healthcare Interoperability Resources
HITECH Act: Health Information Technology for Economic and Clinical Health Act
HL7: Health Level 7
ICD: International Statistical Classification of Diseases and Related Health Problems
JSON: JavaScript Object Notation
LOD: Linked Open Data
LOINC: Logical Observation Identifiers Names and Codes
NDC: National Drug Code
NoSQL: Not only SQL
OWL: Web Ontology Language
PPV: Positive predictive value
RDF: Resource Description Framework
SPARQL: SPARQL Protocol and RDF Query Language
SQL: Structured query language
SVI: Social Vulnerability Index
UNC: University of North Carolina at Chapel Hill
URI: Uniform resource identifier
VANDF: Department of Veterans Affairs National Drug File
W3C: World Wide Web Consortium
XML: Extensible Markup Language

REFERENCES

1.↵
Richesson RL, Hammond WE, Nahm M, Wixted D, Simon GE, Robinson JG, et al. Electronic health records based phenotyping in next-generation clinical trials: a perspective from the NIH Health Care Systems Collaboratory. J Am Med Inform Assoc. 2013 Dec;20(e2):e226–31.
OpenUrl CrossRef PubMed
2.↵
Shivade C, Raghavan P, Fosler-Lussier E, Embi PJ, Elhadad N, Johnson SB, et al. A review of approaches to identifying patient phenotype cohorts using electronic health records. J Am Med Inform Assoc. 2014 Apr;21(2):221–230.
OpenUrl CrossRef PubMed
3.↵
Mo H, Thompson WK, Rasmussen LV, Pacheco JA, Jiang G, Kiefer R, et al. Desiderata for computable representations of electronic health records-driven phenotype algorithms. J Am Med Inform Assoc. 2015 Nov;22(6):1220–1230.
OpenUrl CrossRef PubMed
4.↵
Wei W-Q, Denny JC. Extracting research-quality phenotypes from electronic health records to support precision medicine. Genome Med. 2015 Apr 30;7(1):41.
OpenUrl CrossRef PubMed
5.
Jiang G, Kiefer RC, Rasmussen LV, Solbrig HR, Mo H, Pacheco JA, et al. Developing a data element repository to support EHR-driven phenotype algorithm authoring and execution. J Biomed Inform. 2016 Jul 5;62:232–242.
OpenUrl CrossRef
6.
Pacheco JA, Rasmussen LV, Kiefer RC, Campion TR, Speltz P, Carroll RJ, et al. A case study evaluating the portability of an executable computable phenotype algorithm across multiple institutions and electronic health record environments. J Am Med Inform Assoc. 2018 Nov 1;25(11):1540–1546.
OpenUrl
7.↵
Deans AR, Lewis SE, Huala E, Anzaldo SS, Ashburner M, Balhoff JP, et al. Finding our way through phenotypes. PLoS Biol. 2015 Jan 6;13(1):e1002033.
OpenUrl CrossRef PubMed
8.↵
Hripcsak G, Albers DJ. Next-generation phenotyping of electronic health records. J Am Med Inform Assoc. 2013 Jan 1;20(1):117–121.
OpenUrl CrossRef PubMed
9.↵
Xu J, Rasmussen LV, Shaw PL, Jiang G, Kiefer RC, Mo H, et al. Review and evaluation of electronic health records-driven phenotype algorithm authoring tools for clinical and translational research. J Am Med Inform Assoc. 2015 Nov;22(6):1251–1260.
OpenUrl CrossRef PubMed
10.↵
Morley KI, Wallace J, Denaxas SC, Hunter RJ, Patel RS, Perel P, et al. Defining disease phenotypes using national linked electronic health records: a case study of atrial fibrillation. PLoS One. 2014 Nov 4;9(11):e110900.
OpenUrl CrossRef PubMed
11.↵
Pathak J, Kiefer RC, Bielinski SJ, Chute CG. Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank. J Biomed Semantics. 2012 Dec 17;3(1):10.
OpenUrl CrossRef PubMed
12.↵
Pathak J, Kiefer RC, Chute CG. Using linked data for mining drug-drug interactions in electronic health records. Stud Health Technol Inform. 2013;192:682–686.
OpenUrl PubMed
13.↵
Fernández-Breis JT, Maldonado JA, Marcos M, Legaz-García M del C, Moner D, Torres-Sospedra J, et al. Leveraging electronic healthcare record standards and semantic web technologies for the identification of patient cohorts. J Am Med Inform Assoc. 2013 Dec;20(e2):e288–96.
OpenUrl CrossRef PubMed
14.↵
Odgers DJ, Dumontier M. Mining Electronic Health Records using Linked Data. AMIA Jt Summits Transl Sci Proc. 2015 Mar 23;2015:217–221.
OpenUrl
15.↵
Mirhaji P, Zhu M, Vagnoni M, Bernstam EV, Zhang J, Smith JW. Ontology driven integration platform for clinical and translational research. BMC Bioinformatics. 2009 Feb 5;10 Suppl 2:S2.
OpenUrl CrossRef PubMed
16.
Maldonado JA, Marcos M, Fernández-Breis JT, Parcero E, Boscá D, Legaz-García MDC, et al. A platform for exploration into chaining of web services for clinical data transformation and reasoning. AMIA Annu Symp Proc. 2016;2016:854–863.
OpenUrl
17.
Papatheodorou I, Crichton C, Morris L, Maccallum P, METABRIC Group, Davies J, et al. A metadata approach for clinical data management in translational genomics studies in breast cancer. BMC Med Genomics. 2009 Nov 30;2:66.
OpenUrl CrossRef PubMed
18.↵
Kawazoe Y, Imai T, Ohe K. A Querying Method over RDF-ized Health Level Seven v2.5 Messages Using Life Science Knowledge Resources. JMIR Med Inform. 2016 Apr 5;4(2):e12.
OpenUrl
19.
H. Dhayne, R. Kilany, R. Haque, Y. Taher. SeDIE: A Semantic-Driven Engine for Integration of Healthcare Data. 2018. p. 617–622.
20.
Sun H, Depraetere K, De Roo J, Mels G, De Vloed B, Twagirumukiza M, et al. Semantic processing of EHR data for clinical research. J Biomed Inform. 2015 Dec;58:247–259.
OpenUrl
21.↵
Liu S, Ni Y, Mei J, Li H, Xie G, Hu G, et al. iSMART: Ontology-based Semantic query of CDA documents. AMIA Annu Symp Proc. 2009 Nov 14;2009:375–379.
OpenUrl PubMed
22.↵
Haendel MA, Chute CG, Robinson PN. Classification, ontology, and precision medicine. N Engl J Med. 2018 Oct 11;379(15):1452–1462.
OpenUrl
23.↵
2016 Data Profiles | American Community Survey | U.S. Census Bureau [Internet]. [cited 2019 Oct 28]. Available from: https://www.census.gov/acs/www/data/data-tables-and-tools/data-profiles/2016/
24.↵
The Social Vulnerability Index (SVI): Data and Tools Download | CDC [Internet]. [cited 2019 Oct 28]. Available from: https://svi.cdc.gov/data-and-tools-download.html
25.↵
U.S. County Prescribing Rates, 2016 | Drug Overdose | CDC Injury Center [Internet]. [cited 2019 Oct 28]. Available from: https://www.cdc.gov/drugoverdose/maps/rxcounty2016.html
26.↵
NCHS -Drug Poisoning Mortality by County: United States | Data | Centers for Disease Control and Prevention [Internet]. [cited 2019 Oct 28]. Available from: https://data.cdc.gov/NCHS/NCHS-Drug-Poisoning-Mortality-by-County-United-Sta/pbkm-d27e
27.↵
Brummett CM, Waljee JF, Goesling J, Moser S, Lin P, Englesbe MJ, et al. New persistent opioid use after minor and major surgical procedures in US adults. JAMA Surg. 2017 Jun 21;152(6):e170504.
OpenUrl PubMed
28.↵
Pfaff ER, Champion J, Bradford RL, Clark M, Xu H, Fecho K, et al. Fast healthcare interoperability resources (FHIR) as a meta model to integrate common data models: development of a tool and quantitative validation study. JMIR Med Inform. 2019 Oct 16;7(4):e15199.
OpenUrl CrossRef
29.↵
GitHub - NCTraCSIDSci/camp-fhir [Internet]. [cited 2019 Sep 15]. Available from: https://github.com/NCTraCSIDSci/camp-fhir
30.↵
GitHub - BD2KOnFHIR/fhirtordf: Python based FHIR to RDF conversion utility [Internet]. [cited 2019 Sep 15]. Available from: https://github.com/BD2KOnFHIR/fhirtordf
31.↵
R: The R Project for Statistical Computing [Internet]. [cited 2019 Dec 4]. Available from: https://www.r-project.org/
32.↵
NCTraCSIDSci/SMOREs: Synergistic Medication Organizer for RxNorm and Extras (SMOREs) [Internet]. [cited 2020 Jan 16]. Available from: https://github.com/NCTraCSIDSci/SMOREs
33.↵
Blazegraph Documentation at Github [Internet]. [cited 2019 Nov 1]. Available from: https://blazegraph.com/
34.↵
Hatef E, Searle KM, Predmore Z, Lasser EC, Kharrazi H, Nelson K, et al. The impact of social determinants of health on hospitalization in the veterans health administration. Am J Prev Med. 2019 Jun;56(6):811–818.
OpenUrl PubMed
35.
Gardner BJ, Pedersen JG, Campbell ME, McClay JC. Incorporating a location-based socioeconomic index into a de-identified i2b2 clinical data warehouse. J Am Med Inform Assoc. 2019 Apr 1;26(4):286–293.
OpenUrl
36.
Bazemore AW, Cottrell EK, Gold R, Hughes LS, Phillips RL, Angier H, et al. Community vital signs”: incorporating geocoded social determinants into electronic records to promote patient and population health. J Am Med Inform Assoc. 2016 Mar;23(2):407–412.
OpenUrl CrossRef PubMed
37.
World Health Organization Commission on Social Determinants of Health. Closing the gap in a generation: health equity through action on the social’ ‘ determinants of health. Final Report of the Commission on Social Determinants of Health. [Internet]. Geneva: World Health Organization; 2008 [cited 2019 Dec 13]. Available from: https://www.who.int/social_determinants/thecommission/finalreport/en/
38.↵
Committee on the Recommended Social and Behavioral Domains and Measures for Electronic Health Records, Board on Population Health and Public Health Practice, Institute of Medicine. Capturing social and behavioral domains in electronic health records: phase 1. Washington (DC): National Academies Press (US); 2014.
39.↵
Pacaci A, Gonul S, Sinaci AA, Yuksel M, Laleci Erturkmen GB. A semantic transformation methodology for the secondary use of observational healthcare data in postmarketing safety studies. Front Pharmacol. 2018 Apr 30;9:435.
OpenUrl
40.↵
Pierce CD, Booth D, Ogbuji C, Deaton C, Blackstone E, Lenat D. SemanticDB: A Semantic Web Infrastructure for Clinical Research and Quality Reporting. CURRENT BIOINFORMATICS. 2012 Sep;7(3):267–277.
OpenUrl
41.↵
Wang H, Zhou T, Tian L, Qian Y, Li J. Creating hospital-specific customized clinical pathways by applying semantic reasoning to clinical data. J Biomed Inform. 2014 Dec;52:354–363.
OpenUrl

View the discussion thread.

Posted December 05, 2021.

Download PDF

Data/Code

Citation Tools

Subject Area

Health Informatics

Subject Areas

All Articles

Addiction Medicine (400)
Allergy and Immunology (711)
Anesthesia (204)
Cardiovascular Medicine (2961)
Dentistry and Oral Medicine (335)
Dermatology (250)
Emergency Medicine (443)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (1048)
Epidemiology (12768)
Forensic Medicine (12)
Gastroenterology (829)
Genetic and Genomic Medicine (4604)
Geriatric Medicine (420)
Health Economics (731)
Health Informatics (2935)
Health Policy (1069)
Health Systems and Quality Improvement (1088)
Hematology (390)
HIV/AIDS (927)
Infectious Diseases (except HIV/AIDS) (14119)
Intensive Care and Critical Care Medicine (850)
Medical Education (429)
Medical Ethics (116)
Nephrology (472)
Neurology (4384)
Nursing (237)
Nutrition (641)
Obstetrics and Gynecology (813)
Occupational and Environmental Health (737)
Oncology (2282)
Ophthalmology (648)
Orthopedics (258)
Otolaryngology (326)
Pain Medicine (279)
Palliative Medicine (83)
Pathology (502)
Pediatrics (1199)
Pharmacology and Therapeutics (507)
Primary Care Research (499)
Psychiatry and Clinical Psychology (3781)
Public and Global Health (6969)
Radiology and Imaging (1537)
Rehabilitation Medicine and Physical Therapy (910)
Respiratory Medicine (917)
Rheumatology (442)
Sexual and Reproductive Health (445)
Sports Medicine (385)
Surgery (491)
Toxicology (60)
Transplantation (212)
Urology (182)

[1] 1.↵
Richesson RL, Hammond WE, Nahm M, Wixted D, Simon GE, Robinson JG, et al. Electronic health records based phenotyping in next-generation clinical trials: a perspective from the NIH Health Care Systems Collaboratory. J Am Med Inform Assoc. 2013 Dec;20(e2):e226–31.
OpenUrl CrossRef PubMed

[2] 2.↵
Shivade C, Raghavan P, Fosler-Lussier E, Embi PJ, Elhadad N, Johnson SB, et al. A review of approaches to identifying patient phenotype cohorts using electronic health records. J Am Med Inform Assoc. 2014 Apr;21(2):221–230.
OpenUrl CrossRef PubMed

[3] 3.↵
Mo H, Thompson WK, Rasmussen LV, Pacheco JA, Jiang G, Kiefer R, et al. Desiderata for computable representations of electronic health records-driven phenotype algorithms. J Am Med Inform Assoc. 2015 Nov;22(6):1220–1230.
OpenUrl CrossRef PubMed

[4] 4.↵
Wei W-Q, Denny JC. Extracting research-quality phenotypes from electronic health records to support precision medicine. Genome Med. 2015 Apr 30;7(1):41.
OpenUrl CrossRef PubMed

[5] 5.
Jiang G, Kiefer RC, Rasmussen LV, Solbrig HR, Mo H, Pacheco JA, et al. Developing a data element repository to support EHR-driven phenotype algorithm authoring and execution. J Biomed Inform. 2016 Jul 5;62:232–242.
OpenUrl CrossRef

[6] 6.
Pacheco JA, Rasmussen LV, Kiefer RC, Campion TR, Speltz P, Carroll RJ, et al. A case study evaluating the portability of an executable computable phenotype algorithm across multiple institutions and electronic health record environments. J Am Med Inform Assoc. 2018 Nov 1;25(11):1540–1546.
OpenUrl

[7] 7.↵
Deans AR, Lewis SE, Huala E, Anzaldo SS, Ashburner M, Balhoff JP, et al. Finding our way through phenotypes. PLoS Biol. 2015 Jan 6;13(1):e1002033.
OpenUrl CrossRef PubMed

[8] 8.↵
Hripcsak G, Albers DJ. Next-generation phenotyping of electronic health records. J Am Med Inform Assoc. 2013 Jan 1;20(1):117–121.
OpenUrl CrossRef PubMed

[9] 9.↵
Xu J, Rasmussen LV, Shaw PL, Jiang G, Kiefer RC, Mo H, et al. Review and evaluation of electronic health records-driven phenotype algorithm authoring tools for clinical and translational research. J Am Med Inform Assoc. 2015 Nov;22(6):1251–1260.
OpenUrl CrossRef PubMed

[10] 10.↵
Morley KI, Wallace J, Denaxas SC, Hunter RJ, Patel RS, Perel P, et al. Defining disease phenotypes using national linked electronic health records: a case study of atrial fibrillation. PLoS One. 2014 Nov 4;9(11):e110900.
OpenUrl CrossRef PubMed

[11] 11.↵
Pathak J, Kiefer RC, Bielinski SJ, Chute CG. Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank. J Biomed Semantics. 2012 Dec 17;3(1):10.
OpenUrl CrossRef PubMed

[12] 12.↵
Pathak J, Kiefer RC, Chute CG. Using linked data for mining drug-drug interactions in electronic health records. Stud Health Technol Inform. 2013;192:682–686.
OpenUrl PubMed

[13] 13.↵
Fernández-Breis JT, Maldonado JA, Marcos M, Legaz-García M del C, Moner D, Torres-Sospedra J, et al. Leveraging electronic healthcare record standards and semantic web technologies for the identification of patient cohorts. J Am Med Inform Assoc. 2013 Dec;20(e2):e288–96.
OpenUrl CrossRef PubMed

[14] 14.↵
Odgers DJ, Dumontier M. Mining Electronic Health Records using Linked Data. AMIA Jt Summits Transl Sci Proc. 2015 Mar 23;2015:217–221.
OpenUrl

[15] 15.↵
Mirhaji P, Zhu M, Vagnoni M, Bernstam EV, Zhang J, Smith JW. Ontology driven integration platform for clinical and translational research. BMC Bioinformatics. 2009 Feb 5;10 Suppl 2:S2.
OpenUrl CrossRef PubMed

[16] 16.
Maldonado JA, Marcos M, Fernández-Breis JT, Parcero E, Boscá D, Legaz-García MDC, et al. A platform for exploration into chaining of web services for clinical data transformation and reasoning. AMIA Annu Symp Proc. 2016;2016:854–863.
OpenUrl

[17] 17.
Papatheodorou I, Crichton C, Morris L, Maccallum P, METABRIC Group, Davies J, et al. A metadata approach for clinical data management in translational genomics studies in breast cancer. BMC Med Genomics. 2009 Nov 30;2:66.
OpenUrl CrossRef PubMed

[18] 18.↵
Kawazoe Y, Imai T, Ohe K. A Querying Method over RDF-ized Health Level Seven v2.5 Messages Using Life Science Knowledge Resources. JMIR Med Inform. 2016 Apr 5;4(2):e12.
OpenUrl

[19] 19.
H. Dhayne, R. Kilany, R. Haque, Y. Taher. SeDIE: A Semantic-Driven Engine for Integration of Healthcare Data. 2018. p. 617–622.

[20] 20.
Sun H, Depraetere K, De Roo J, Mels G, De Vloed B, Twagirumukiza M, et al. Semantic processing of EHR data for clinical research. J Biomed Inform. 2015 Dec;58:247–259.
OpenUrl

[21] 21.↵
Liu S, Ni Y, Mei J, Li H, Xie G, Hu G, et al. iSMART: Ontology-based Semantic query of CDA documents. AMIA Annu Symp Proc. 2009 Nov 14;2009:375–379.
OpenUrl PubMed

[22] 22.↵
Haendel MA, Chute CG, Robinson PN. Classification, ontology, and precision medicine. N Engl J Med. 2018 Oct 11;379(15):1452–1462.
OpenUrl

[23] 23.↵
2016 Data Profiles | American Community Survey | U.S. Census Bureau [Internet]. [cited 2019 Oct 28]. Available from: https://www.census.gov/acs/www/data/data-tables-and-tools/data-profiles/2016/

[24] 24.↵
The Social Vulnerability Index (SVI): Data and Tools Download | CDC [Internet]. [cited 2019 Oct 28]. Available from: https://svi.cdc.gov/data-and-tools-download.html

[25] 25.↵
U.S. County Prescribing Rates, 2016 | Drug Overdose | CDC Injury Center [Internet]. [cited 2019 Oct 28]. Available from: https://www.cdc.gov/drugoverdose/maps/rxcounty2016.html

[26] 26.↵
NCHS -Drug Poisoning Mortality by County: United States | Data | Centers for Disease Control and Prevention [Internet]. [cited 2019 Oct 28]. Available from: https://data.cdc.gov/NCHS/NCHS-Drug-Poisoning-Mortality-by-County-United-Sta/pbkm-d27e

[27] 27.↵
Brummett CM, Waljee JF, Goesling J, Moser S, Lin P, Englesbe MJ, et al. New persistent opioid use after minor and major surgical procedures in US adults. JAMA Surg. 2017 Jun 21;152(6):e170504.
OpenUrl PubMed

[28] 28.↵
Pfaff ER, Champion J, Bradford RL, Clark M, Xu H, Fecho K, et al. Fast healthcare interoperability resources (FHIR) as a meta model to integrate common data models: development of a tool and quantitative validation study. JMIR Med Inform. 2019 Oct 16;7(4):e15199.
OpenUrl CrossRef

[29] 29.↵
GitHub - NCTraCSIDSci/camp-fhir [Internet]. [cited 2019 Sep 15]. Available from: https://github.com/NCTraCSIDSci/camp-fhir

[30] 30.↵
GitHub - BD2KOnFHIR/fhirtordf: Python based FHIR to RDF conversion utility [Internet]. [cited 2019 Sep 15]. Available from: https://github.com/BD2KOnFHIR/fhirtordf

[31] 31.↵
R: The R Project for Statistical Computing [Internet]. [cited 2019 Dec 4]. Available from: https://www.r-project.org/

[32] 32.↵
NCTraCSIDSci/SMOREs: Synergistic Medication Organizer for RxNorm and Extras (SMOREs) [Internet]. [cited 2020 Jan 16]. Available from: https://github.com/NCTraCSIDSci/SMOREs

[33] 33.↵
Blazegraph Documentation at Github [Internet]. [cited 2019 Nov 1]. Available from: https://blazegraph.com/

[34] 34.↵
Hatef E, Searle KM, Predmore Z, Lasser EC, Kharrazi H, Nelson K, et al. The impact of social determinants of health on hospitalization in the veterans health administration. Am J Prev Med. 2019 Jun;56(6):811–818.
OpenUrl PubMed

[35] 35.
Gardner BJ, Pedersen JG, Campbell ME, McClay JC. Incorporating a location-based socioeconomic index into a de-identified i2b2 clinical data warehouse. J Am Med Inform Assoc. 2019 Apr 1;26(4):286–293.
OpenUrl

[36] 36.
Bazemore AW, Cottrell EK, Gold R, Hughes LS, Phillips RL, Angier H, et al. Community vital signs”: incorporating geocoded social determinants into electronic records to promote patient and population health. J Am Med Inform Assoc. 2016 Mar;23(2):407–412.
OpenUrl CrossRef PubMed

[37] 37.
World Health Organization Commission on Social Determinants of Health. Closing the gap in a generation: health equity through action on the social’ ‘ determinants of health. Final Report of the Commission on Social Determinants of Health. [Internet]. Geneva: World Health Organization; 2008 [cited 2019 Dec 13]. Available from: https://www.who.int/social_determinants/thecommission/finalreport/en/

[38] 38.↵
Committee on the Recommended Social and Behavioral Domains and Measures for Electronic Health Records, Board on Population Health and Public Health Practice, Institute of Medicine. Capturing social and behavioral domains in electronic health records: phase 1. Washington (DC): National Academies Press (US); 2014.

[39] 39.↵
Pacaci A, Gonul S, Sinaci AA, Yuksel M, Laleci Erturkmen GB. A semantic transformation methodology for the secondary use of observational healthcare data in postmarketing safety studies. Front Pharmacol. 2018 Apr 30;9:435.
OpenUrl

[40] 40.↵
Pierce CD, Booth D, Ogbuji C, Deaton C, Blackstone E, Lenat D. SemanticDB: A Semantic Web Infrastructure for Clinical Research and Quality Reporting. CURRENT BIOINFORMATICS. 2012 Sep;7(3):267–277.
OpenUrl

[41] 41.↵
Wang H, Zhou T, Tian L, Qian Y, Li J. Creating hospital-specific customized clinical pathways by applying semantic reasoning to clinical data. J Biomed Inform. 2014 Dec;52:354–363.
OpenUrl

Cohort Identification Using Semantic Web Technologies: Triplestores as Engines for Complex Computable Phenotyping

ABSTRACT

BACKGROUND

Computable Phenotyping

Triplestores for Storing and Analyzing Clinical Data

Linking with Ontologies and Terminologies

Schemalessness

Modeling Highly Related Data

METHODS

Phenotype Definition

Data Transformation Pipeline

EHR Data

Claims Data

Community Data

Other Data

Phenotyping with SPARQL

RESULTS

DISCUSSION

Primary Results

Advantages of the Triplestore Approach

Schemalessness

Interoperability and Reproducibility

Limitations and Lessons Learned

Future Work

CONCLUSION

Data Availability

DECLARATIONS

Ethics

Consent to Publication

Availability of Data and Material

Competing Interests

Funding

Author contributions

Acknowledgements

LIST OF ABBREVIATIONS

REFERENCES

Citation Manager Formats

Subject Area