Abstract
Background Autopsy studies have provided valuable insights into the pathophysiology of COVID-19. It remains controversial whether the clinical presentation is due to direct organ damage by SARS-CoV-2 or to secondary effects, e.g. an overshooting immune response. SARS-CoV-2 detection in tissues by RT-qPCR and immunohistochemistry (IHC) or electron microscopy (EM) can help answer these questions, but a comprehensive evaluation of these applications is missing.
Methods We assessed publications using IHC and EM for SARS-CoV-2 detection in autopsy tissues. We systematically evaluated commercially available antibodies against the SARS-CoV-2 spike protein and nucleocapsid, dsRNA, and non-structural protein Nsp3 in cultured cell lines and COVID-19 autopsy tissues. In a multicenter study, we evaluated specificity, reproducibility, and inter-observer variability of SARS-CoV-2 nucleocapsid staining. We correlated RT-qPCR viral tissue loads with semiquantitative IHC scoring. We used qualitative and quantitative EM analyses to refine criteria for ultrastructural identification of SARS-CoV-2.
Findings Publications show high variability in the detection and interpretation of SARS-CoV-2 abundance in autopsy tissues by IHC or EM. In our study, we show that IHC using antibodies against SARS-CoV-2 nucleocapsid yields the highest sensitivity and specificity. We found a positive correlation between presence of viral proteins by IHC and RT-qPCR-determined SARS-CoV-2 viral RNA load (r=-0.83, p-value <0.0001). For EM, we refined criteria for virus identification and also provide recommendations for optimized sampling and analysis. 116 of 122 publications misinterpret cellular structures as virus using EM or show only insufficient data. We provide publicly accessible digitized EM and IHC sections as a reference and for training purposes.
Interpretation Since detection of SARS-CoV-2 in human autopsy tissues by IHC and EM is difficult and frequently incorrect, we propose criteria for a re-evaluation of available data and guidance for further investigations of direct organ effects by SARS-CoV-2.
Key messages
Detection of SARS-CoV-2 proteins by IHC in autopsy tissues is less sensitive in comparison to SARS-CoV-2 RNA detection by RT-qPCR.
For determination of SARS-CoV-2 protein positive cells by IHC in autopsy tissues, detection of spike protein is less sensitive than nucleocapsid protein.
Correct identification of SARS-CoV-2 particles in human samples by EM is limited to the respiratory system.
Interpretation of IHC and EM should follow substantiated consensus criteria to enhance accuracy.
Existing datasets describing SARS-CoV-2 presence in human autopsy tissues need to be critically re-evaluated.
Introduction
In clinical routine, quantitative reverse transcription PCR (RT-qPCR) and rapid antigen detection tests from nasopharyngeal swabs are robust, standardized, and validated tools for screening or diagnosing severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections. In contrast, in situ SARS-CoV-2 detection methods such as immunohistochemistry (IHC) and electron microscopy (EM) in patient tissues are much less investigated and validated, yet autopsy studies use these methods to investigate mechanisms of organ damage and organ tropism of SARS-CoV-2 1-6.
Detection of pathogen-specific antigens by immunohistochemical staining is a potent diagnostic tool allowing for spatial correlation of pathological changes with presence of the pathogen. At the beginning of the COVID-19 pandemic, positive controls for this newly emerging disease were not available, neither for diagnostic nor for research applications. The urgent need to publish data on the distribution of SARS-CoV-2 in tissues of deceased patients with mentioned technical limitations led to inconsistencies in interpretation of SARS-CoV-2 localization and distribution by IHC and EM 1-10.
Diagnostic EM is the only technique to directly visualize and detect intact SARS-CoV-2 particles 11,12. It thereby validates other in situ viral-detection techniques, detecting viral proteins or RNA, and enables cellular and subcellular localization of virus particles 11. EM of virus molecules or particles in model systems expanded our understanding of SARS-CoV-2 structural and cellular biology, yet did not provide information on the distribution of the virus in human tissues 13-17.
Due to complex sample processing procedures in electron microscopy and challenges in recording and interpretation of micrographs, misinterpretation of structures as SARS-CoV-2 particles in patient tissues occurred 18-20. In autopsy tissues, virus structures have to be distinguished from other structures of cells. For this task, sufficient structural preservation of the tissue and a suitable sampling strategy to detect infected cells are needed 8,18,21. Recommendations for identifying SARS-CoV-2 particles by EM are mainly based on virus particles in cell cultures 22,23 which only partially reflect the situation in autopsy tissues. In fact, misinterpretations or findings with insufficient evidence have still been published 3,24. Here we collated available data on in situ detection of SARS-CoV-2 in human tissue samples focussing on immunohistochemical detection of SARS-CoV-2 proteins and EM detection of intact SARS-CoV-2 particles. Furthermore, we determined optimally suited SARS-CoV-2 antibodies and evaluated their sensitivity and specificity in a multicenter approach, which allowed a correlation between the viral load detected by RT-qPCR and by IHC. Additionally, we refined criteria for identifying SARS-CoV-2 by IHC and EM and provide a publicly accessible repository of entirely digitized light- and electron microscopical sections showing examples and pitfalls for in situ detection of SARS-CoV-2.
Material and methods
Sample processing for immunohistochemistry
We generated paraffin cell blocks from SARS-CoV-2-infected and un-infected Vero cells (see Supplementary Methods for details) and processed these as if they were autopsy samples regarding fixation time in 10% formalin and paraffin embedding. Furthermore, SARS-CoV-2-infected and un-infected Vero cells were grown on coverslips and fixed in 4% formaldehyde solution. Autopsy samples from lungs and respiratory mucosa from COVID-19 patients and controls were formalin-fixed and paraffin-embedded (FFPE) using standard laboratory procedures from three study centers. Besides hematoxylin and eosin staining, immunohistochemistry with several antibodies against different SARS-CoV-2 proteins (see Supplementary Table 2) was performed using a Ventana Benchmark XT autostainer at two centers. For the multicenter study evaluating IHC to SARS-CoV-2 nucleocapsid, we used antibody N#9. Staining specificity and intensity were evaluated by eight independent pathologists/neuropathologists at four different centers (Pathology Aachen, Neuropathology Berlin, Neuropathology Hamburg, Pathology Paris). Observers received instructions and a set of stained slides and were then asked to categorize each slide in a blinded fashion, using a published four-tiered semiquantitative approach (none (0), slight (+), moderate (++), and severe (+++)) 25 (Figure 3). Training slides were annotated, uploaded, and are publicly available using OMERO 26. Correlation of RT-qPCR values and scoring (Pearson r) was analyzed using GraphPad Prism (GraphPad Software Version 8, La Jolla, USA) with 34 pairs. Every case was rated by seven investigators. Cases with ≥70% agreement amongst raters (at least 5 of 7 raters) were classified as positive (“+” or more) or negative, respectively. Cases with unclear classification into positive or negative due to inconsistent ratings (less than 60% agreement) were defined as non-classifiable (n=2; one control / one COVID-19).
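The agreement rule described above can be illustrated with a minimal Python sketch. The function name `classify_case` and the example ratings are hypothetical and not taken from the study. The sketch assumes that scores are encoded as 0 to 3 (0 = none, 1 to 3 = "+" to "+++") and uses a single 70% cutoff, which is sufficient for seven raters because 4 of 7 (57%) and 5 of 7 (71%) are adjacent agreement levels.

```python
def classify_case(ratings, threshold=0.7):
    """Classify one case from per-rater semiquantitative scores (0-3).

    A rating of 1 or more ("+" or more) counts as positive, 0 as negative.
    Returns (label, agreement), where agreement is the share of raters
    in the majority category.
    """
    if not ratings:
        raise ValueError("at least one rating is required")
    n_pos = sum(1 for score in ratings if score >= 1)
    n_neg = len(ratings) - n_pos
    agreement = max(n_pos, n_neg) / len(ratings)
    if agreement < threshold:
        # Inconsistent ratings: neither category reaches the cutoff.
        return "non-classifiable", agreement
    label = "positive" if n_pos > n_neg else "negative"
    return label, agreement


# Hypothetical example: 5 of 7 raters see at least "+" staining.
print(classify_case([1, 2, 1, 0, 3, 1, 0]))
```

With other rater counts, agreement values between 60% and 70% become possible, and the gap between the two thresholds stated above would need an explicit rule.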
Sensitivity was calculated as Sensitivity = True positive / (True positive + False negative). Specificity was calculated as Specificity = True negative / (True negative + False positive) (see Supplementary Table 5).
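As a worked illustration of these two definitions, the following Python sketch computes both measures from case counts. The function names are our own. The specificity example uses the counts reported in the Results (40 true negatives, 1 false positive); the counts in the sensitivity example are hypothetical and serve only to show the call.

```python
def sensitivity(true_pos, false_neg):
    """Sensitivity = TP / (TP + FN)."""
    total = true_pos + false_neg
    if total == 0:
        raise ValueError("no positive reference cases")
    return true_pos / total


def specificity(true_neg, false_pos):
    """Specificity = TN / (TN + FP)."""
    total = true_neg + false_pos
    if total == 0:
        raise ValueError("no negative reference cases")
    return true_neg / total


# Counts reported for antibody #9: 40 true negatives, 1 false positive.
print(round(specificity(40, 1), 2))  # 0.98

# Hypothetical counts, not taken from the study.
print(round(sensitivity(10, 5), 2))
```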
Sample processing for electron microscopy
29 autopsy samples derived from different tissues of 16 patients and SARS-CoV-2-infected Vero cells were processed according to standard protocols (Supplementary Methods; Supplementary Table 6). For qualitative and quantitative analyses of infected cells and viral particles, autopsy lung tissue (Supplementary Table 6, case 1) and FFPE re-embedded olfactory mucosa (case 2) with a high viral RNA load, as determined by RT-qPCR, and successful detection of particles by EM were used.
Large-scale electron microscopy and transmission electron microscopy
Four large sections prepared from different resin blocks of two autopsy lung samples (case 1) and two sections prepared from different resin blocks from SARS-CoV-2-infected Vero cells were completely digitized at 3-4 nm pixel size as recently described 27 to screen for SARS-CoV-2 particles (Supplementary Material). The screening datasets were processed via Fiji software/TrakEM2 plugin 28 and nip2 software to generate high-resolution tif files 27 for in-depth analysis using QuPath software 0.2.0. 29. In the autopsy lung, in total, 15 infected cells were detected, all located within two of the four digitized sections (dataset 1 and 2 on www.nanotomy.org). Subsequently, the infected cells were annotated and digitized at 1 nm pixel size for quantitative analysis in QuPath. Selected regions showing extracellular particles and non-infected cells were also digitized. To assess the heterogeneity of coronavirus (CoV) particles and their distribution in infected cells, four different categories of particles were specified; dark type 1 particles, bright type 2 particles, deformed type 3 particles, and extracellular type 4 particles (Figure 5D-S). These types were counted manually in QuPath. Also, total area (in μm2) of infected cells, area of the nucleus, and area of luminal structures were measured with the polygon tool to determine the density of intracellular CoV particles per μm2 cytoplasmic area. Intracellular CoV particles were analysed for particle diameter (largest diameter without spikes), ribonucleoprotein (RNP) diameter (smallest diameter, granular or elongated profiles), and diameter of tubular structures (cells 2 and 15). Measurements were performed with the line annotation tool, analysis was done in Excel (Office Home and Student 2019; Microsoft Corporation, Redmond, USA). Average, standard deviation, minimal and maximal values were determined for CoV diameter, RNP diameter, and tubular structure diameter. No measurements were excluded during data processing.
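The morphometric quantities described above can be sketched in Python as follows. This is an illustration, not the workflow used in the study (which relied on QuPath and Excel). It assumes that cytoplasmic area is obtained by subtracting the nuclear and luminal areas from the total cell area, and that the standard deviation is the sample standard deviation. All function names and input values are hypothetical.

```python
import statistics


def particle_density(n_particles, total_area, nucleus_area, luminal_area):
    """Intracellular particles per um^2 of cytoplasmic area.

    Assumption: cytoplasmic area = total cell area - nucleus - luminal
    structures, all given in um^2.
    """
    cytoplasm = total_area - nucleus_area - luminal_area
    if cytoplasm <= 0:
        raise ValueError("cytoplasmic area must be positive")
    return n_particles / cytoplasm


def summarize(values):
    """Average, sample standard deviation, minimum and maximum."""
    return {
        "mean": statistics.mean(values),
        "sd": statistics.stdev(values),
        "min": min(values),
        "max": max(values),
    }


# Hypothetical cell profile: 100 particles, 60 um^2 total area,
# 15 um^2 nucleus, 5 um^2 luminal structures.
print(particle_density(100, 60.0, 15.0, 5.0))  # 2.5

# Hypothetical particle diameters in nm.
print(summarize([80, 90, 100]))
```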
In FFPE re-embedded olfactory mucosa (case 2), we restricted quantitative analysis to measurement of virus particle diameter due to limited preservation. The maximal diameter of intracellular particles (without spikes), based on TEM images, was measured using Fiji 30 and the “straight line” tool.
We roughly compared viral RNA load measured by RT-qPCR with virus particle number per section and cell to estimate the likelihood of finding CoV particles via thin section EM (Supplementary Methods).
Search and analysis of publications demonstrating immunohistochemical and/or ultrastructural detection of SARS-CoV-2
We defined specific search strategies to find scientific publications using immunohistochemical datasets from autopsy studies with antibodies to SARS-CoV-2 proteins. Additionally, we searched for all scientific publications claiming ultrastructural proof of SARS-CoV-2 particles in infected human tissues (see Supplementary Methods).
Results
Analysis of publications demonstrating immunohistochemical detection of SARS-CoV-2 in human samples
We curated papers with immunohistochemical datasets for inclusion into our overview table (Supplementary Table 1). Detection of SARS-CoV-2 spike and nucleocapsid by immunohistochemistry in autopsy tissues showed differences regarding virus protein amounts and interpretation of data. Furthermore, results of adequate positive and negative controls were rarely reported (Supplementary Table 1).
Assessing commercially available antibodies against SARS-CoV-2 proteins
Of the 13 commercially available anti-SARS-CoV-2 protein antibodies that were evaluated in this study, three target the spike protein, two target non-structural protein 3 (Nsp3), seven target the nucleocapsid protein, and one targets double-strand RNA (dsRNA) (Supplementary Table 2).
Firstly, we tested all antibodies on FFPE SARS-CoV-2 infected and un-infected Vero cells (Supplementary Figure 1) simultaneously processed to avoid batch effects. Of the three different spike-protein targeting antibodies, one gave a specific signal with no background, one showed signal, but also produced nonspecific staining. Of note, one widely used antibody (#3) did not produce a signal at all. Both antibodies against Nsp3 generated nonspecific high background staining in un-infected cells. An antibody against dsRNA generated a faint signal in infected cells. Of the seven antibodies detecting SARS-CoV-2 nucleocapsid, five produced specific staining on infected cells with minimal background in un-infected cells. One widely used antibody showed high nonspecific background in un-infected cells (#12), while one antibody did not produce any signal (#13) (Supplementary Figure 1).
To further investigate the specificity of antibodies that have been widely used before in research, but did not perform well in our FFPE cell-based evaluation, we infected Vero cells with SARS-CoV-2, fixed them in formalin, and stained them without prior embedding into paraffin. To determine viral protein distribution, cells were double-stained with one of the specific nucleocapsid antibodies (#7 or #9) and one of the antibodies in question against spike #3, Nsp3 #5, dsRNA #6, and nucleocapsid #12, respectively (Supplementary Figure 2). Un-infected cells served as control. While antibodies #5, #6, and #12 produced specific staining, antibody #3 did not produce a detectable signal again (Supplementary Figure 2). Since antibodies #4 and #13 did not produce specific signals in FFPE cell blocks or fixed cells, they were not further considered.
To assess the performance of anti-SARS-CoV-2 antibodies in COVID-19 autopsy tissues, we tested nine of the antibodies on FFPE autopsy tissues from COVID-19 patients and controls. We chose lung and respiratory mucosa as these tissue compartments harbor high SARS-CoV-2 viral loads and we stained consecutive sections with our panel of antibodies (Figure 1; Supplementary Figure 3). Again, two out of three anti-spike antibodies produced specific staining, while one gave no signal (#3). Of the six different nucleocapsid antibodies, five antibodies showed specific and robust staining. However, the background staining in COVID-19-positive tissues was higher in the polyclonal (#7) and one monoclonal (#11) compared to the other monoclonal antibodies tested in our study. Background staining caused by presence of SARS-CoV-2 proteins in cellular debris or tissue artifacts was higher in lung tissue when compared to that of respiratory mucosa of COVID-19 patients (Figure 1; Supplementary Figure 3). Interestingly, signals obtained with antibodies targeting nucleocapsid protein were more abundant in tissues when compared to signals obtained using anti-SARS-CoV-2 spike protein antibodies (Figure 1; Supplementary Figure 3). To evaluate this further we performed double immunofluorescence staining on Vero cells, lung, and respiratory mucosa and consistently observed more nucleocapsid positive cells than spike-protein positive cells. Moreover, nucleocapsid signals were more abundant than spike-protein within double-positive cells and they are evenly distributed throughout individual infected cells (Figure 2). To exclude an antibody-specific effect caused by spike antibody targeting spike 2 (#1), we evaluated several anti-spike antibodies. However, reduced abundance of spike SARS-CoV-2 in contrast to nucleocapsid in cells and tissues could also be confirmed using antibodies against spike S1 and receptor-binding domain (RBD) (Supplementary Figure 4).
Of note, two widely used and published antibodies did not perform well in our analyses. One antibody (anti-N, #12) produced unacceptably high background staining even in control tissues (Figure 1; Supplementary Figures 1,3), while another widely used antibody (anti-spike, #3) did not result in measurable staining in SARS-CoV-2-infected Vero cells or COVID-19 tissues (Figure 1; Supplementary Figures 1-3). In summary, six anti-SARS-CoV-2 protein antibodies reliably work on FFPE autopsy tissues from COVID-19 patients, with antibodies against nucleocapsid protein providing better results. Our recommendations for the usage of antibodies for the detection of SARS-CoV-2 proteins in human autopsy tissues are summarized in Table 1 and Supplementary Table 3.
A multicenter study assessing SARS-CoV-2 immunohistochemistry
We selected one monoclonal antibody against nucleocapsid (#9) which showed a very reliable signal in cells and human autopsy tissues with minimal background staining (Figure 1A,B; Supplementary Figures 1,3), to perform a blinded multi-centric study. The goals were (1) to assess, whether IHC staining in human lung autopsy tissue by experienced pathologists is a suitable method to detect SARS-CoV-2 protein and (2) to investigate the correlation between SARS-CoV-2 load as measured by RT-qPCR and IHC (Figure 3).
Three centers contributed human autopsy lung tissues from COVID-19 deceased individuals which were stained against nucleocapsid. Anonymized patient details are summarized in Supplementary Tables 4a-c. Abundance of SARS-CoV-2 nucleocapsid in lung tissue of COVID-19 patients was highly variable, and the protein was present in a clustered, inhomogeneous pattern (Figure 3; Supplementary Figure 5). SARS-CoV-2 viral loads were determined by RT-qPCR of consecutive sections of the same paraffin blocks of the stained tissue samples (Supplementary Table 5). Lung tissues from control patients without pathological lung changes, from patients dying of non-COVID-19 related Acute Respiratory Distress Syndrome (ARDS), and from patients dying with influenza infections were included since lung tissue intrinsically presents with high background staining (Figure 5; patients in Supplementary Tables 4a-c). Slides were evaluated in a blinded fashion by pathologists from four different centers and scored in a semiquantitative manner (see Figure 3 for overview and examples; Supplementary Table 5).
We assessed false positive and false negative ratings in lung tissues. The utilized monoclonal antibody (#9) had a sensitivity of 0.71 (Supplementary Table 5). Extensive tissue damage was seen in COVID-19 lungs, where pre-necrotic epithelial cells and hyaline membranes obscured interpretation of signals. The specificity for antibody #9 was high at 0.98 (40 true negatives / (40 true negatives + 1 false positive); see Supplementary Table 5). A correlation between RT-qPCR-defined SARS-CoV-2 viral loads and the presence of immunohistochemically detected SARS-CoV-2 nucleocapsid protein was seen in tissues with high SARS-CoV-2 viral loads (Figure 3). COVID-19 tissues with low viral RNA loads were often rated false negative (n=7, with a mean Ct value of 28.7). These cases could be true negatives, since tissues with only a low RNA signal probably contain no SARS-CoV-2 protein; in our calculations, however, they count as false negatives. Only two of 71 cases were not classifiable, with an interrater agreement of less than 60%. The overall interrater reliability, documented as interrater agreement frequency, was 62%, and reliability increased up to 83% if only trained raters with experience in SARS-CoV-2 IHC were included (Supplementary Table 5) 31. The majority of cases with interrater discrepancy (all raters) were controls (n=14) compared to COVID-19 (n=9), and almost always these cases were rated with single positive cells (controls and COVID-19 cases). In contrast, COVID-19 cases with a high viral RNA load and a high IHC score were classified correctly.
Thus, our multicenter study showed that detection of SARS-CoV-2 proteins in human autopsy tissues is feasible yet is tainted with technical and interpretational difficulties, highlighting the importance of proper controls in autopsy studies and training of evaluators.
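The correlation of RT-qPCR values with semiquantitative IHC scores (Pearson r, see Methods) can be sketched in Python as shown below. The data are invented for illustration and are not study data. The sketch assumes that the RT-qPCR readout is a Ct value; because lower Ct values correspond to higher viral RNA loads, a direct relationship between viral load and IHC score then appears as a negative r against Ct.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical pairs: Ct value and IHC score (0 = none ... 3 = severe).
ct_values = [18, 22, 26, 30, 34]
ihc_scores = [3, 3, 2, 1, 0]
print(pearson_r(ct_values, ihc_scores))  # negative: high load = low Ct
```

Since the IHC score is ordinal, a rank-based coefficient would be a reasonable alternative; Pearson r is shown here because it is the measure named in the Methods.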
Ultrastructural analysis of SARS-CoV-2 in human tissues
Guidance for identifying SARS-CoV-2 has been provided previously, however without detailed consideration of the specific challenges of autopsy tissues 20,22,23. Key elements guiding ultrastructural identification of SARS-CoV-2 include the presence of a membrane envelope, surface projections (spikes), a granular interior (ribonucleoprotein), and diameters from 60 to 140 nm 20. We searched for SARS-CoV-2 viral particles in the lung, olfactory mucosa (removed directly under the lamina cribrosa, but ultrastructurally only parts with ciliated and goblet cells were identifiable), medulla oblongata, kidney, trachea, and myocardium in autopsy tissues of 16 RT-qPCR-positive COVID-19 patients (Supplementary Table 6).
Overall, virus particles could be identified in 15 cells within the large-scale screening datasets of lung tissue of one patient. All virus-containing cells were found in two of the four digitized sections. For simplicity, we refer to these cells as infected cells, although intracellular virus particles in e.g. macrophages may mainly be phagocytosed particles. Virus particles were well-preserved with distinct substructures (Figure 4A,B), and virus particle-containing cells could be identified as type 2 pneumocytes and alveolar macrophages based on morphological criteria (Supplementary Table 7). Potential viral mimics such as swollen mitochondria, vesicles of rough endoplasmic reticulum (rER), and coated vesicles could be distinguished from viral particles (Supplementary Figure 6) 23,32-34. We recorded all 15 infected cells in the lung at very high resolution (Figure 5, repository datasets on www.nanotomy.org) for morphometric analyses and found 1557 intracellular and 144 extracellular viral particles. Infected cells showed virus loads ranging from 4 to 620 intracellular particles per sectioned cell profile (or 0.07 to 5.44 intracellular particles per μm2 cytoplasmic area, Supplementary Figure 7). Intracellular particles showed a mean diameter of 87 nm (±13 nm; n=1369; 55 to 177 nm). The RNP showed a mean diameter of 7.2 nm (±1.6 nm; n=433; 3.6 to 13 nm). According to our quantitative analysis and calculations (see Supplementary Methods), individual cells may contain thousands (possibly up to 40,000; cell 7) of CoV particles. Based on a rough approximation from viral RNA load, we can expect 0.006 (case 5) or 500 (case 1) CoV particles per mm2 ultrathin section area for low and high RNA load, respectively, which implies a huge difference in the likelihood of particle detection (see calculations in Supplementary Methods). These calculations also reflect our findings acquired by IHC analyses.
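To give a feeling for what these particle densities mean for detection, the following Python sketch converts a density per mm2 into the expected particle count in a section and into the chance of encountering at least one particle. It is not the calculation from the Supplementary Methods. It assumes, for illustration only, that particles are scattered independently (Poisson model) and that the screened section area is 1 mm2; both are our assumptions, and the function names are hypothetical.

```python
import math


def expected_particles(density_per_mm2, section_area_mm2):
    """Expected number of particles in the screened section area."""
    return density_per_mm2 * section_area_mm2


def prob_at_least_one(density_per_mm2, section_area_mm2):
    """P(at least one particle) under an assumed Poisson model."""
    lam = expected_particles(density_per_mm2, section_area_mm2)
    # 1 - exp(-lam), computed with expm1 for accuracy at small lam.
    return -math.expm1(-lam)


# Densities from the text; the 1 mm^2 section area is an assumption.
for label, density in [("low RNA load", 0.006), ("high RNA load", 500)]:
    print(label, prob_at_least_one(density, 1.0))
```

Because virus particles were found clustered in few cells, the independence assumption overestimates the chance of detection in a given section; the sketch only conveys the order-of-magnitude gap between low and high RNA loads.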
No typical replication compartments such as double-membrane vesicles and budding viruses were found, most probably because of the limited structural preservation. However, four cells of the autopsy lung demonstrated peculiar tubular structures which are possibly associated with CoV infection (Supplementary Figure 8) 32.
Virus particles could also be identified in several ciliated cells within olfactory mucosa of one patient (Supplementary Table 6; Figure 4C,D). However, virus particles were less well-preserved due to the FFPE-embedding and paraffin extraction procedure prior to re-embedding for EM. Thus, their identification relied on comparison of more particles within each cell as well as presence of typical membrane compartments with multiple isomorphic particles enclosed. Virus particles appeared more condensed than virus particles after standard preparation for thin section EM with a mean diameter of 73 nm (± 7 nm; n=175; 58 to 108 nm; intracellular particles).
Of note, no virus particles were found in 26 samples of 14 patients in different tissues (Supplementary Table 6).
Based on reliable reference publications on ultrastructural in situ detection of CoV in human samples 21,32,35-39, our previous work on SARS-CoV-2 in autopsy tissues 8,18, cell culture 16, MERS 40, and the results presented here, we developed refined criteria for identification of CoV particles in autopsy samples (Table 2).
Analysis of publications demonstrating ultrastructural evidence for the presence of SARS-CoV-2 in human samples
We surveyed publications (published April 2020 to November 2021) using ultrastructural findings as proof of virus particles in human samples (Supplementary Tables 8,9) and re-evaluated the data using our refined criteria for virus identification. Six publications presented sufficient data to prove the presence of virus particles, while 116 publications misinterpreted different cellular structures as virus or showed only insufficient evidence for the presence of virus particles (Supplementary Figure 9). In total, only 63 of 292 electron micrographs (22%) showed sufficient structural preservation and image quality necessary for identifying structural details such as enveloped viruses. Structures misinterpreted as virus particles were mostly compatible with coated vesicles, vesicles of rough endoplasmic reticulum, multivesicular bodies, and autolytic mitochondria. Thirty-eight publications discussed the challenges of SARS-CoV-2 identification by EM (Supplementary Table 10).
Discussion
One key question regarding severe to lethal COVID-19 is whether the clinical presentation including considerable organ damage is due to direct organ targeting of SARS-CoV-2 or downstream effects such as an overshooting immune response. During the COVID-19 pandemic, autopsy-driven research, using multimodal approaches, attempted to define organ tropism of the virus and to unravel organ-specific pathomechanisms 8,9,25,41-43, yet a critical and systematic study investigating the limitations of in situ detection of SARS-CoV-2 in autopsy tissues has not been performed.
Studies using autopsy material to determine organ tropism, and especially to identify organ-specific target cell types of SARS-CoV-2, have produced conflicting results for most organ systems, hampering research progress 3,8-10,44,45. For instance, published data include studies revealing direct infection of neurons by SARS-CoV-2 with substantial neuroinvasion 46, single infected cells in a subset of patients 25, but also the absence of virus and COVID-19-specific alterations 47. Validations are complicated by the use of various, often not well-evaluated antibodies in different studies and the lack of sufficient positive and negative controls.
Here we determined the limitations of SARS-CoV-2 detection by IHC and thin section EM using a defined set of control tissues including FFPE highly susceptible human cell lines and autopsy tissues with expectable high SARS-CoV-2 viral load. Finally, we performed a multicenter study assessing how well IHC performs in detecting SARS-CoV-2 proteins in autopsy tissues.
Assessment of a wide range of commercially available antibodies directed at SARS-CoV-2 proteins showed that only a subset of these can be reliably used in autopsy tissues (Supplementary Figure 3). Interestingly, antibodies against nucleocapsid protein showed highest sensitivity. This may be because, of all SARS-CoV-2 proteins, nucleocapsid protein is produced at the highest levels during the lifecycle of SARS-CoV-2 in cells 48. Correspondingly, we found much less spike protein than nucleocapsid protein, based on both the amount per cell and the general abundance in affected tissues. This should be considered when interpreting studies proving multi-organ tropism and claiming specific target cell types for SARS-CoV-2, especially by using anti-spike antibodies 1,3,46,49,50.
We found a discrepancy between viral RNA loads as determined by RT-qPCR and detection of SARS-CoV-2 proteins by immunohistochemistry. In fact, in tissues with low viral RNA loads immunohistochemistry is not a reliable method to determine organ tropism or target cell type, as the interpretation of the rare immunosignals is difficult and nonspecific staining may be falsely interpreted as a positive signal. In agreement with this, we found low inter-observer reliability in tissues with low viral RNA loads (Supplementary Table 5). This may have to do with the fact that the distribution of the virus is uneven, even in highly affected organ systems such as lungs (Figure 3). The absence of viral proteins, on the other hand, cannot and should not be used as an argument for the absence of SARS-CoV-2-related tissue pathology, as autopsy tissues can only provide an incomplete snapshot of what has occurred in the sometimes very long clinical phase of the disease 51. Recent studies have tried to address this point by studying autopsy tissues at different stages of COVID-19 44. Our study also disclosed that it is crucial to choose suitable control tissues for immunohistochemistry not only based on high viral RNA loads but also on tissue integrity. Lung, for example, the tissue with the highest viral RNA loads, cannot be considered an optimal control tissue as it tends to produce false-positive signals and difficult-to-interpret staining patterns. This may be because extensive tissue damage seen in COVID-19 lungs leads to pre-necrotic epithelial cells and hyaline membranes, both prone to false-positive signals. Also, this study shows that lungs often contain SARS-CoV-2 protein-harboring cell debris and mucus, which make signals difficult to interpret (Figure 3). However, these limitations may also be valid for other (autopsy) tissues.
Thus, it is warranted to carefully discriminate between direct SARS-CoV-2 virus presence or infection and inflammation-related tissue pathology, since the latter might considerably contribute to false-positive signals in IHC.
Using EM to detect intact SARS-CoV-2 particles in autopsy tissues comes with specific challenges such as a relatively low sensitivity as compared to e.g. RT-qPCR, limitations in cell-type identification, and high inter-observer variability depending on EM expertise. We found virus particles only in a minor fraction of patient samples (with comparatively high viral RNA load), and these virus particles were spatially highly confined. In fact, individual cells may contain up to tens of thousands of CoV particles and hundreds of thousands of RNA copies. Thus, even in samples with a high SARS-CoV-2 load, few infected cells (∼20 or less per 10,000 cells), possibly also mobile cells, could make up a significant fraction of the total viral load. This result aligns with light microscopic findings suggesting a focal infection and argues for the complementarity of both methods so as not to misguide research 18,21. However, quantity of viral load may be cell type-specific and also depend on the disease phase. Moreover, presence of intracellular particles in e.g. macrophages may variably be a result of phagocytosis and not infection, thus indicating a need for further research on this correlation.
Based on our data, we provide recommendations on a suitable strategy for identifying virus-infected cells in Table 2. We slightly expanded and detailed previously published criteria 20,22,23 primarily based on examinations of virus particles produced by cell culture.
If these refined criteria for SARS-CoV-2 identification were applied to journal publications, 116 of 122 publications do not sufficiently prove the presence of viral particles in various human tissues. This problem has already been discussed 20,23 and resulted in specific recommendations for the correct detection of CoV particles. However, a general decline of diagnostic EM over the last decades with loss of expertise occurred 18,52, further aggravated by the unfamiliarity of most EM facilities with in-situ-detection of viruses 23. Both probably complicate transfer of these recommendations into practice, as also indicated by the lack of general quality standards 53 of many published EM data. This unfortunate and long-standing decline of diagnostic EM is further illustrated by the fact that also during the SARS-CoV pandemic in the early 2000s, detection of the virus by EM was tainted with technical and interpretational difficulties, and non-viral particles in different organs were used to propose a multi-organ tropism of SARS-CoV 54-56. These misinterpretations were then perpetuated early on in the SARS-CoV-2 pandemic 57-64. Importantly, misinterpretations of different cellular structures as virus also occurred in cell culture and organoids 46,65.
However, we emphasize that diagnostic EM is a valuable method for virus detection if appropriate standards are applied 20. It should always be pursued to validate other techniques of virus detection to gain reliable information on tropism and virus-induced alterations. Generally, virus detection by EM in a routine diagnostic setting is achievable if the recommendations and criteria provided in our work are considered. Moreover, it is necessary to learn visual patterns of virus within complex tissue samples to speed up the screening process. Our large-scale datasets, corresponding to approximately 130,000 conventional electron micrographs, may help in acquiring these visual pattern recognition skills, which cannot be acquired using small sets of preselected conventionally published electron micrographs lacking cellular and microanatomical context. As demonstrated by our Supplementary Video, virus particles can be found and identified in the complex structural matrix of lung tissue by spanning the scales from millimeter to nanometer. This technique 27,66 also provides a promising approach for fast and precise ultrastructural in silico analysis for future pandemics, especially in light of innovative high-throughput EM imaging approaches 67. In summary, usage of autopsy tissues with in situ detection of SARS-CoV-2 is valuable if interpreted within the limits of all applied methods and tissues. In the early phase of the COVID-19 pandemic, for various reasons, researchers have not fully abided by this, opening the door for misinterpretation and overestimation of SARS-CoV-2 multi-organ tropism. The consensus criteria formulated here can provide guidance to improve the quality of autopsy-based SARS-CoV-2 research. The main limitation of our study is its limited scope regarding assessed tissues and anti-SARS-CoV-2 antibodies.
Data Availability
All data produced in the present study are available upon reasonable request to the authors.
Funding
This work was supported by the German Registry of COVID-19 Autopsies (www.DeRegCOVID.ukaachen.de), funded by the Federal Ministry of Health (ZMVI1-2520COR201), and by the Federal Ministry of Education and Research within the framework of the network of university medicine (DEFEAT PANDEMICs, 01KX2021). ACH was supported by Berlin University Alliance GC2 Global Health (Corona Virus Pre-Exploration Project), BMBF (RAPID and Organo-Strat 01KX2021) as well as DFG (SFB-TR 84, B6 / Z1a), HR by DFG (RA 2491/1-1), SB by DFG SFB 1365 C04, S01 and NIH 2R01DK05149-19A1, subaward 1016678, while SK was funded by the German Center for Infectious Research (TTU.01.929).
Acknowledgements
SK (head) and KH (technician) are running the Core Facility for Experimental Pathology of the UKE (“Mouse Patho”). We thank Annette Gries, Hanna Jania and Gudrun Holland for excellent technical assistance and the UMIF/UKE for using their microscopes. Our condolences to the families and all those who have lost their loved ones during the ongoing pandemic. We thank all relatives who made the difficult decision to give their permission for autopsy and research.
Footnotes
# joint senior authors