E-biopsy proteomics distinguishes 4T1 tumor samples from healthy breast tissue
We first consider e-biopsy extracted proteomic profiles in the context of distinguishing between 4T1 tumor and healthy murine breast tissue, regardless of sampling site. Differential analysis of protein expression (paired two-sided t-test with n = 5, the two replicates from each location were averaged) was performed for three pairs of e-biopsy extracts: 4T1 tumor center vs healthy breast, 4T1 tumor periphery vs healthy breast, and 4T1 tumor middle vs healthy breast. We found 13 proteins harvested by e-biopsy (Table 1) strongly overexpressed (p-value < 0.01, Methods) in all sampled locations in the tumor vs normal breast (no underexpressed proteins were identified under the same criteria). An intersection of this size has a p-value < 1.5e-06. These 13 proteins therefore represent an FDR of 3.5e-04 (Methods). Moreover, releasing the p-value cutoff to 0.05 results in a set of 242 (238 overexpressed and 4 under-expressed) differentially expressed proteins, corresponding to FDR of 2.3e-03. In further analysis we call these 242 proteins – potential 4T1 biomarkers.
Table 1
List of proteins extracted with e-biopsy that differentiate 4T1 tumor from healthy breast tissue samples for all tumor sampling locations. This is a list of genes that are overexpressed in all 3 tumor locations together (Center/Midway/Peripheral; with both tumor location replicas averaged into a single value) compared to healthy breast samples (paired t-test p-value < 0.01 at each location). A two-sided, paired t-test was applied to each of the 4,519 known proteins sampled from 5 mice via e-biopsy. No underexpressed proteins were identified using the same criterion, namely p<0.01 in all three comparisons.
Genes
|
Peripheral > Healthy Breast
|
Middle > Healthy Breast
|
Center > Healthy Breast
|
p-value
|
t-stats
|
p-value
|
t-stats
|
p-value
|
t-stats
|
Vim
|
8.0E-03
|
4.9
|
1.7E-03
|
7.5
|
1.4E-03
|
7.9
|
Hnrnpa2b1
|
5.6E-04
|
10.0
|
5.7E-04
|
9.9
|
4.7E-04
|
10.5
|
Pabpc1
|
5.9E-03
|
5.3
|
8.6E-03
|
4.8
|
3.2E-03
|
6.3
|
Nucb1
|
7.4E-03
|
5.0
|
3.4E-03
|
6.2
|
2.5E-03
|
6.7
|
Serpinh1
|
3.8E-03
|
6.0
|
8.6E-03
|
4.8
|
7.8E-04
|
9.2
|
Hnrnpa1
|
2.9E-03
|
6.5
|
1.8E-03
|
7.3
|
9.4E-03
|
4.7
|
Slc25a24
|
3.2E-03
|
6.4
|
1.3E-03
|
8.1
|
2.4E-03
|
6.8
|
Plin3
|
9.9E-03
|
4.6
|
4.5E-03
|
5.8
|
9.7E-03
|
4.6
|
Hmgb1
|
3.7E-04
|
11.1
|
6.4E-03
|
5.2
|
3.7E-03
|
6.1
|
Rcn1
|
3.8E-03
|
6.0
|
1.3E-03
|
8.0
|
1.3E-03
|
8.0
|
Cnpy2
|
3.8E-03
|
6.0
|
9.4E-04
|
8.8
|
4.4E-03
|
5.8
|
Mat2b
|
1.5E-03
|
7.7
|
3.0E-04
|
11.7
|
2.5E-03
|
6.8
|
Bax
|
6.3E-03
|
5.2
|
1.2E-04
|
14.9
|
2.6E-03
|
6.7
|
The distribution of the observed differential expression scores, computed from comparing protein measurements in the healthy breast versus the different 4T1 tumor locations, is statistically significant, manifesting an overabundance of differentially expressed proteins. As shown in the overabundance plots (Fig. 2), there are more differentially expressed proteins observed in our data than expected under a random null model43.
Gene Ontology (GO) analysis of 4,519 e-biopsy-extracted proteins was used to further examine the differential expression between control (healthy breast) and three 4T1 tumor locations in terms of cellular processes, functions, and components. Here we present the most significant (p-value < 1E-06 at all locations simultaneously) differentially regulated processes, functions, and components (Tables 2-7) and discuss several interesting observations. All identified processes, functions, and components that are simultaneously enriched in all positions with p-value < 1E-06 are presented online in https://github.com/GolbergLab/eBiopsy4T1.
Notably, the analysis of the GO terms revealed several significant (p-value < 1E-06 in all locations) down-regulated cellular processes in 4T1 tumor (Table 2). Among them there are many immunoglobulin-related processes, which is consistent with earlier reports44, who found that intra-tumoral injection of allogeneic IgG combined with other factors induced nearly complete eradication of lung metastases from 4T1. The down-regulation of various peptidase inhibitors-related cellular functions (Table 3) in 4T1 is also consistent with previous works45. Moreover, the tumor’s extracellular component in all three locations was downregulated compared to the healthy breast (Table 4), which is expected in aggressive and invasive tumors such as 4T146. Furthermore, we found that many biosynthesis-related cellular processes are up-regulated in 4T1 (Table 5), which is consistent with the tumor’s need for enhanced replication rates47. These findings are also supported by many upregulated cellular functions (Table 6) and cellular components (Table 7).
These data show that in vivo e-biopsy extraction of proteins yields statistically significant and biologically different profiles when comparing various locations in 4T1 tumors to healthy breast tissue in mice.
Table 2
GO Process terms significantly underexpressed in all 4T1 tumor locations vs healthy breast. Two-sided, paired t-test p-value < 1E-06 for all 3 tumor locations simultaneously. Highlighted cells indicate maximal p-value amongst the three different locations.
Description
|
Center
p-value
|
Middle
p-value
|
Peripheral
p-value
|
protein activation cascade
|
3.22E-18
|
1.73E-20
|
5.40E-22
|
complement activation
|
8.94E-16
|
8.47E-19
|
3.35E-20
|
humoral immune response
|
4.67E-14
|
1.09E-14
|
1.41E-16
|
complement activation, classical pathway
|
7.83E-14
|
3.00E-15
|
3.27E-17
|
immune response
|
1.25E-12
|
6.54E-14
|
2.51E-13
|
phagocytosis, recognition
|
2.01E-11
|
3.20E-11
|
6.07E-13
|
response to bacterium
|
1.48E-13
|
5.56E-11
|
6.66E-13
|
immunoglobulin production
|
1.07E-11
|
9.08E-11
|
2.92E-12
|
production of molecular mediator of immune response
|
4.21E-11
|
3.17E-10
|
1.16E-11
|
blood coagulation
|
9.27E-10
|
3.91E-10
|
4.44E-10
|
Coagulation
|
9.27E-10
|
3.91E-10
|
4.44E-10
|
defense response to bacterium
|
1.25E-12
|
1.15E-09
|
1.55E-11
|
Hemostasis
|
5.76E-09
|
2.53E-09
|
1.37E-09
|
immune system process
|
1.02E-08
|
1.35E-08
|
1.92E-09
|
defense response
|
2.11E-08
|
1.25E-08
|
2.28E-10
|
B cell receptor signaling pathway
|
4.98E-08
|
3.14E-09
|
3.93E-09
|
phagocytosis, engulfment
|
1.60E-07
|
1.60E-07
|
1.50E-08
|
activation of immune response
|
1.71E-07
|
4.62E-07
|
5.06E-09
|
Table 3
GO Function terms significantly underexpressed in all 4T1 tumor locations. Two-sided, paired t-test p-value < 1E-06 for all 3 tumor locations simultaneously. Highlighted cells indicate maximal p-value between 3 areas.
Description
|
Center
p-value
|
Middle
p-value
|
Peripheral
p-value
|
endopeptidase inhibitor activity
|
1.25E-09
|
1.51E-08
|
2.24E-09
|
peptidase inhibitor activity
|
2.23E-09
|
2.21E-08
|
4.50E-09
|
serine-type endopeptidase inhibitor activity
|
3.93E-09
|
6.20E-08
|
2.27E-08
|
endopeptidase regulator activity
|
3.40E-08
|
1.24E-07
|
8.67E-08
|
peptidase regulator activity
|
1.82E-07
|
2.71E-07
|
5.09E-07
|
Table 4
GO Component terms significantly underexpressed in all 4T1 tumor locations. Two-sided, paired t-test p-value < 1E-06 for all 3 tumor locations simultaneously. Highlighted cells indicate maximal p-value between 3 areas.
Description
|
Center
p-value
|
Middle
p-value
|
Peripheral
p-value
|
extracellular region part
|
5.62E-21
|
9.83E-27
|
7.23E-28
|
extracellular space
|
3.25E-20
|
3.63E-25
|
1.09E-22
|
extracellular region
|
3.28E-15
|
1.66E-17
|
4.03E-18
|
immunoglobulin complex
|
4.51E-12
|
6.80E-13
|
8.62E-14
|
immunoglobulin complex, circulating
|
4.51E-12
|
6.80E-13
|
8.62E-14
|
side of membrane
|
2.04E-09
|
3.83E-07
|
3.22E-08
|
cornified envelope
|
9.30E-10
|
2.14E-08
|
6.68E-07
|
Table 5
GO Process terms significantly overexpressed in all 4T1 tumor locations. Two-sided, paired t-test p-value < 1E-06 for all 3 tumor locations simultaneously. Highlighted cells indicate maximal p-value between 3 areas.
Description
|
Center
p-value
|
Middle
p-value
|
Peripheral
p-value
|
translation
|
1.42E-14
|
9.60E-14
|
2.20E-19
|
peptide biosynthetic process
|
9.80E-14
|
8.59E-13
|
1.99E-17
|
cellular macromolecule biosynthetic process
|
4.96E-12
|
5.99E-15
|
9.25E-17
|
amide biosynthetic process
|
3.74E-11
|
2.47E-10
|
4.08E-14
|
macromolecule biosynthetic process
|
3.13E-10
|
3.06E-14
|
2.39E-15
|
peptide metabolic process
|
6.68E-12
|
1.22E-09
|
1.53E-13
|
cellular nitrogen compound biosynthetic process
|
1.90E-08
|
1.40E-09
|
1.15E-10
|
Table 6
GO Function terms significantly overexpressed in all 4T1 tumor locations. Two-sided, paired t-test p-value < 1E-06 for all 3 tumor locations simultaneously. Highlighted cells indicate maximal p-value between 3 areas.
Description
|
Center
p-value
|
Middle
p-value
|
Peripheral
p-value
|
RNA binding
|
7.48E-14
|
2.07E-21
|
4.56E-26
|
structural constituent of ribosome
|
1.65E-13
|
1.02E-17
|
1.39E-32
|
nucleic acid binding
|
1.79E-10
|
1.32E-21
|
2.92E-24
|
heterocyclic compound binding
|
4.95E-08
|
2.36E-14
|
1.46E-12
|
organic cyclic compound binding
|
2.47E-07
|
4.46E-13
|
2.79E-11
|
mRNA binding
|
9.43E-07
|
1.58E-09
|
4.56E-14
|
Table 7
GO Component terms significantly overexpressed in all 4T1 tumor locations. Two-sided, paired t-test p-value < 1E-06 for all 3 tumor locations simultaneously. Highlighted cells indicate maximal p-value between 3 areas.
Description
|
Center
p-value
|
Middle
p-value
|
Peripheral
p-value
|
intracellular part
|
1.46E-12
|
1.02E-13
|
1.26E-13
|
ribosomal subunit
|
6.01E-12
|
3.36E-16
|
2.33E-32
|
cytosolic part
|
3.61E-11
|
3.48E-15
|
4.12E-20
|
ribosome
|
2.42E-10
|
6.54E-15
|
1.21E-26
|
ribonucleoprotein complex
|
2.75E-10
|
6.04E-18
|
5.50E-27
|
organelle
|
3.28E-09
|
2.03E-09
|
1.42E-11
|
intracellular organelle
|
3.76E-09
|
2.45E-09
|
2.37E-12
|
nucleus
|
1.02E-08
|
1.15E-16
|
6.31E-15
|
protein-containing complex
|
1.36E-08
|
2.87E-14
|
6.26E-11
|
cytoplasm
|
1.39E-12
|
5.85E-08
|
1.31E-07
|
intracellular non-membrane-bounded organelle
|
3.47E-09
|
2.17E-07
|
1.42E-11
|
non-membrane-bounded organelle
|
4.92E-09
|
3.06E-07
|
1.58E-11
|
Reproducibility of in vivo molecular harvesting with e-biopsy
To study the reproducibility of our in vivo e-biopsy extraction method, we harvested liquids from 6 positions: 2 in the center, 2 in the middle, and 2 at the periphery (Fig. 1) from 4T1 tumors in vivo in 5 mice. In total, 4,262 proteins (with positive LFQ intensity in at least one e-biopsy sample) out of 4,519 total proteins were considered in this analysis. We found that the expression levels of proteins extracted from all locations in the tumor are highly correlated when comparing the location replicates (Table 8).
Table 8
E-biopsy reproducibility in vivo. Pearson and Spearman correlation coefficients of the LFQ intensities of e-biopsy-extracted proteins, at various position replicates from 4T1 tumor, as quantified by LC-MS/MS. 4,262 proteins were considered from 5 mice (Fig. 1). All p-values < 10E-324.
Location Replicates
|
Center
|
Middle
|
Periphery
|
Average Pearson Correlation
|
0.928
|
0.944
|
0.845
|
Average Spearman Correlation
|
0.787
|
0.835
|
0.769
|
In vivo e-biopsy supports mapping of 4T1 intratumor proteome spatial heterogeneity
To study the intratumor heterogeneity, we compared (two-sided, paired t-test) expression levels of proteins extracted in vivo by e-biopsy from three different tumor locations – center, midway, and periphery – in five animals. We found (Table S4) that 26 of 4,519 genes are significantly overexpressed, and 15 of 4,519 genes are significantly underexpressed in the center (compared to both other zones); 111 are overexpressed and 2 underexpressed in the middle; finally, 18 are overexpressed and 99 underexpressed at the periphery (significance here is defined by p-value < 0.05). This represents FDR of 2.7e-01, 1.0e-01 and 9.7e-02 respectively (Methods).
Next, we intersected the genes from the above analysis (41 from the center, 113 from the middle and 117 from the periphery) with the set of 242 potential 4T1 biomarkers (over/under-expressed in each of three 4T1 tumor locations compared to healthy breast with p-value < 0.05, Methods and text before Table 1). We found (Table 9) 2 such genes in the center, 3 in the middle and 3 in the periphery. This represents FDR of 7.1e-4, 4.7e-4 and 4.7e-4 respectively (Methods).
Specifically, we found (Table 9) that the gene Glrx is significantly overexpressed in the 4T1 tumor center compared to the tumor’s middle area and periphery (Fig. 3). We also found that Rbb7, Pkn1, and Ppme1 are overexpressed in the middle area of the tumor compared to its center and periphery (Fig. 3). No uniquely overexpressed potential tumor biomarker genes have been identified at the tumor periphery.
In the opposite direction, we found (Table 9) that the gene Ppme1 is significantly underexpressed in the 4T1 tumor center compared to tumor middle and tumor periphery (Fig. 3). Moreover, we found that Prkcsh, Tra2a, and Shoc2 are underexpressed at the tumor’s periphery compared to its center and mid-zones (Fig. 3). No uniquely underexpressed potential tumor biomarker genes have been identified in the middle area tumor zone.
Table 9
Potential 4T1 tumor biomarkers that significantly differ (Methods) between three tumor areas. C – Center; M – Middle; P – Periphery; HB – Healthy Breast
|
|
|
C vs M
|
C vs P
|
M vs P
|
C vs NB
|
M vs NB
|
P vs NB
|
Area
|
Direction
|
Gene
|
p-value
|
t-stats
|
p-value
|
t-stats
|
p-value
|
t-stats
|
p-value
|
t-stats
|
p-value
|
t-stats
|
p-value
|
t-stats
|
Glrx
|
UP
|
center
|
2.49E-02
|
2.7
|
0.0
|
2.5
|
not relevant
|
2.25E-03
|
7.0
|
1.65E-02
|
4.0
|
4.91E-02
|
2.8
|
Rbbp7
|
UP
|
middle
|
8.82E-03
|
-3.3
|
not relevant
|
3.13E-02
|
2.5
|
3.51E-02
|
3.1
|
1.22E-02
|
4.3
|
4.43E-02
|
2.9
|
Ppme1
|
UP
|
middle
|
1.40E-03
|
-4.5
|
not relevant
|
7.46E-04
|
5.0
|
2.70E-02
|
3.4
|
1.25E-02
|
4.3
|
2.37E-02
|
3.6
|
Pkn1
|
UP
|
middle
|
1.24E-02
|
-3.1
|
not relevant
|
2.02E-02
|
2.8
|
4.91E-02
|
2.8
|
4.34E-03
|
5.8
|
3.55E-02
|
3.1
|
Ppme1
|
DOWN
|
center
|
1.40E-03
|
-4.5
|
4.70E-02
|
-2.3
|
not relevant
|
2.70E-02
|
3.4
|
1.25E-02
|
4.3
|
2.37E-02
|
3.6
|
Prkcsh
|
DOWN
|
periphery
|
not relevant
|
2.09E-02
|
2.8
|
2.08E-02
|
2.8
|
1.01E-02
|
4.6
|
2.55E-02
|
3.5
|
2.68E-02
|
3.4
|
Tra2a
|
DOWN
|
periphery
|
not relevant
|
2.33E-02
|
2.7
|
1.75E-02
|
2.9
|
3.70E-02
|
3.1
|
1.77E-02
|
3.9
|
1.77E-02
|
3.9
|
Shoc2
|
DOWN
|
periphery
|
not relevant
|
1.02E-02
|
3.2
|
3.42E-02
|
2.5
|
4.02E-03
|
5.9
|
2.02E-02
|
3.7
|
2.17E-02
|
3.7
|
The distribution of the observed differential expression scores, computed from comparing protein measurements between 4T1 tumor locations, is statistically significant (Fig. 4). The overabundance plots show that more differentially expressed proteins are observed in our data than would be expected under a random null model 43. We also show that in vivo measurements obtained by e-biopsy are consistent with those obtained from the standard ex vivo lysis method (Supplementary information).
Gene Ontology (GO) analysis of 4,519 e-biopsy-extracted proteins was used to further examine the differential expression between all three locations in the 4T1 tumor in terms of various cellular processes, functions, and components. Here we present the most significant (p-value < 1e-06 at each location simultaneously) differentially regulated processes, functions, and components (Table 10) and discuss several interesting observations. All the identified processes, functions, and components that differ with p-value < 1e-06 are presented online in https://github.com/GolbergLab/eBiopsy4T1. In the comparison between Center and Periphery, no significant overexpressed function and components were identified in the Center. In the comparison between Middle and Periphery, no significant overexpressed processes, functions, and components were identified in the Middle (all three were overexpressed at the Periphery).
Table 10
GO terms significantly differentially regulated per each 4T1 tumor location. Two-sided, paired t-test p-value < 1e-06 with two other tumor locations simultaneously.
Region
|
Direction
|
Description
|
Center p-value
|
Middle p-value
|
Peripheral p-value
|
Center
|
High
|
PROCESS: blood coagulation
|
|
6.96E-09
|
4.94E-07
|
Center
|
High
|
PROCESS: coagulation
|
|
6.96E-09
|
4.94E-07
|
Center
|
Low
|
PROCESS: cytoplasmic translation
|
|
1.90E-11
|
5.60E-08
|
Center
|
Low
|
FUNCTION: structural constituent of ribosome
|
|
8.60E-11
|
5.36E-12
|
Center
|
Low
|
COMPONENT: cytosolic large ribosomal subunit
|
|
1.42E-11
|
1.54E-12
|
Center
|
Low
|
COMPONENT: ribosomal subunit
|
|
9.67E-11
|
1.02E-11
|
Center
|
Low
|
COMPONENT: ribosome
|
|
2.81E-11
|
4.06E-10
|
Center
|
Low
|
COMPONENT: cytosolic part
|
|
1.83E-11
|
2.52E-09
|
Center
|
Low
|
COMPONENT: cytoplasmic part
|
|
5.45E-09
|
2.33E-08
|
Center
|
Low
|
COMPONENT: ribonucleoprotein complex
|
|
2.78E-14
|
3.03E-08
|
Center
|
Low
|
COMPONENT: large ribosomal subunit
|
|
1.27E-07
|
1.59E-08
|
Center
|
Low
|
COMPONENT: intracellular
|
|
1.44E-09
|
5.09E-07
|
Periphery
|
High
|
COMPONENT: cytoplasmic part
|
2.33E-08
|
7.71E-07
|
|
Notably, in analyzing GO terms we found a significant decrease in ribosomal activity toward the 4T1 tumor’s central region. This finding is consistent with previous work that showed that tumorigenicity was associated with profound alterations in ribosomal biogenesis and function, leading to the decreased translation of mRNA of tumor suppressor p53 and the reduced control of translational fidelity48. Also, GO analysis yielded a significant increase in blood coagulation toward the center of the tumor, consistent with increased vacularisation49.
The findings above show that proteomic profiling of in vivo e-biopsy samples can detect and potentially characterize 4T1 tumor heterogeneity. Specifically, the differences in protein expression profiles for the different sampled tumor regions are statistically significant. In addition, we identified enriched biological changes in cellular processes, functions, and components when comparing the 4T1 tumor’s center, middle, and periphery regions.