Using the WES data of TCGA-HCC and NMF algorithm, we attempted to decipher the mutational patterns of HCC and explore the relationship between COL1A2 and mutation patterns in HCC. The optimal k factorization of 5 was selected (Supplementary Fig.
2), and five mutational signatures were identified and matched in the COSMIC database (Fig.
5a). The three major mutational signatures were annotated with “Defective DNA mismatch repair”, “Aflatoxin exposure”, and “Aristolochic acid exposure”, and the abundance of each mutational signature in TCGA-HCC was shown in a pie chart (Fig.
5b). We reviewed the medical records of the TCGA-HCC cohort and extracted 184 HCC samples with history of alcohol consumption, hepatitis or NAFLD, and the distribution of each mutational signature was depicted in a stacked barplot (Fig.
5c). Using chi-square test, we found that NAFLD-HCC is characterized with high COL1A2 expression (Fig.
5d). With quantiles of COL1A2 mRNA expression, 46 HCC samples were assigned to the COL1A2-lowest and -highest group, respectively. Oncoplots of the two groups demonstrated that CTNNB1 acts as the most frequently mutated gene in the COL1A2-low cohort, with the mutation frequency up to 48% (left panel, Fig.
5e). In contrast, CTNNB1 is rarely mutated in the COL1A2-high cohort (right panel, Fig.
5e). In the integrated analyses of TCGA-HCC, MSK-HCC and INSERM-HCC cohorts, the gene-pair of TP53 and CTNNB1 is shown significantly mutually exclusive (Fig.
5f). Furthermore, COL1A2 mRNA expression is significantly elevated in CTNNB1-wild type HCC samples compared to CTNNB1-mutated ones (Fig.
5g). Among nine representative oncogenic pathways, the WNT signaling pathway is the most frequently affected one in the COL1A2-low cohort (Fig.
5h). GOBP analysis was used to further explore the altered pathways in CTNNB1-WT/COL1A2-high samples. A functional network showed the five most important pathways in CTNNB1-WT/COL1A2-high HCC samples were termed “vasculature development”, “chemotaxis”, “ECM organization”, “positive regulation of locomotion”, and “positive regulation of cell adhesion” (Fig.
5i). These evidences demonstrated that the CTNNB1/COL1A2 axis might play a role in the fibrosis and inflammation severity during NAFLD-HCC progression.