본문으로 건너뛰기
← 뒤로

Integrated multi-omics analysis reveals comprehensive metabolism-driven tumor heterogeneity and immune microenvironment in hepatocellular carcinomas.

1/5 보강
Scientific reports 📖 저널 OA 97.6% 2021: 24/24 OA 2022: 32/32 OA 2023: 45/45 OA 2024: 140/140 OA 2025: 938/938 OA 2026: 718/767 OA 2021~2026 2026 Vol.16(1) OA
Retraction 확인
출처

Li Y, Luo Z, Zhang B, Zhu W, Zhang J, Li Z

📝 환자 설명용 한 줄

[UNLABELLED] Metabolic reprogramming is a well-recognized hallmark of cancer, characterized by its remarkable flexibility in activating alternative pathways in the absence of specific regulators or su

이 논문을 인용하기

↓ .bib ↓ .ris
APA Li Y, Luo Z, et al. (2026). Integrated multi-omics analysis reveals comprehensive metabolism-driven tumor heterogeneity and immune microenvironment in hepatocellular carcinomas.. Scientific reports, 16(1). https://doi.org/10.1038/s41598-026-42856-7
MLA Li Y, et al.. "Integrated multi-omics analysis reveals comprehensive metabolism-driven tumor heterogeneity and immune microenvironment in hepatocellular carcinomas.." Scientific reports, vol. 16, no. 1, 2026.
PMID 41781663 ↗

Abstract

[UNLABELLED] Metabolic reprogramming is a well-recognized hallmark of cancer, characterized by its remarkable flexibility in activating alternative pathways in the absence of specific regulators or substrates. Non-negative Matrix Factorization (NMF) was employed to identify tumor heterogeneity at the terms of metabolism, progression-related pathways and molecular subtypes of HCC based on gene set variation analysis (GSVA) and stratified survival analysis. The inherent heterogeneities of metabolism landscape which includes genomes, methelomic and transcriptomic data, proteomes, phosphoproteomes, mutational and immune microenvironment landscape between metabolic subtypes were performed based on multi-omics analyses. NMF has led three distinct survival-associated subtypes (NMF cluster 1, iHCC1; NMF cluster 2, iHCC2; NMF cluster 3, iHCC3) based on metabolic gene expression, and GSVA revealed remakeable metabolic differences. Additionally, multi-omics analysis further revealed the unique landscape of different metabolic subtypes, encompassing transcriptome, epigenetic and post-transcriptional modifications (PTMs) at both bulk and single cell seq-RNA level. Furthermore, 39 subtype-specific variables for identifying metabolic subtypes were screened using four feature selection algorithms and preliminarily validated on 8 machine learning models. We then built and verified a nomogram to guide the individualized strategy for HCC patients, utilizing a combination of metabolic signatures and clinical characteristics. Finally, we preliminarily identified the potential contribution of Aldolase B (ALDOB) in metabolic reprogramming triggered by epigenetic and PTMs. Overall, the research defined robust subtypes and further revealed potential targets linking metabolism with immune microenvironment and non-mutational epigenetic modifications, thereby advancing our understanding of metabolic heterogeneity for application in HCC diagnosis and clinical risk stratification.

[SUPPLEMENTARY INFORMATION] The online version contains supplementary material available at 10.1038/s41598-026-42856-7.

🏷️ 키워드 / MeSH 📖 같은 키워드 OA만

같은 제1저자의 인용 많은 논문 (5)

📖 전문 본문 읽기 PMC JATS · ~83 KB · 영문

Introduction

Introduction
HCC is the predominant form of primary liver cancer worldwide, with cholangiocarcinoma being the subsequent type, accounting for over 90% of all cases and ranks as the fourth leading cause of cancer-derived mortality worldwide1.The prevalence and risk factors for HCC include hepatitis B/ C virus infection, prolonged alcohol consumption, and nonalcoholic fatty liver disease (NAFLD) associated with metabolic syndrome (such as obesity) or dyslipidemia in individuals with type 2 diabetes mellitus (T2DM). The pathophysiology of HCC is a complex, multi-step, and multi-stage process driven by a intricate interplay of genetic susceptibility, viral and non-viral risk factors, microenvironmental substrates, immune cell activity, and chronic inflammation2. Metabolic abnormalities are a significant hallmark of cancer, as tumor cells undergo metabolic reprogramming to support their rapid proliferation. Simultaneously, metabolic alterations can further facilitate tumorigenesis and development through a specific interplay between metabolites and genes/ proteins3,4. Compared to normal tissues, cancer cells and tumors exhibit a diverse range of metabolic heterogeneity. This heterogeneity is characterized by varied metabolic activity within the tumor, as well as distinct metabolic preferences among tumor cells, resulting in disparities in drug response. The metabolic landscape consisting of genomes, transcriptomes, proteomes, metabolites, and their internal interplays between them is highly heterogeneous across cancer types, predominant histologic subtypes, and even single-cell levels under specific conditions3,5,6.
The liver plays a crucial role in metabolizing three major nutrients in the body: namely glucose, lipids, and amino acids. During its progression, HCC exhibits various alterations in metabolic pathways, including heightened aerobic glycolysis and glucose consumption, augmented de novo fatty acid synthesis and pentose phosphate pathways, accelerated glutamine catabolism, and disrupted redox homeostasis. These metabolic changes provide the necessary energy and biological macromolecular raw materials for tumor cell growth and proliferation. Various enzymes and signaling molecules involved in multiple metabolic processes of HCC cells are indispensable7–9. Given that metabolic abnormalities precede the onset and progression of HCC, it is conceivable that these metabolic imbalances may be detectable in the early stages of HCC or even in precancerous lesions6,10. Consequently, changes in the activity of key enzymes within metabolism-related pathways and the metabolite content present opportunities for early diagnosis and novel therapeutic interventions for HCC. However, the metabolism of HCC tumor cells, such as genetic heterogeneity, exhibits obvious heterogeneity owing to different genetic backgrounds and tumor microenvironments. In essence, tumor cell metabolism does not exhibit universal alterations11,12. Instead, tumor cells generate diverse genetic variations during metastasis and treatment, leading to changes in their metabolic profiles in various directions13,14. HCC exhibits a complex network of metabolic pathways, including glucose metabolism, lipid metabolism, and amino acid metabolism. Targeting these pathways precisely is challenging due to their intricate interactions. HCC produces a variety of metabolic products during its progression, such as lactate, pyruvate, and fatty acids. These metabolites play pivotal roles in tumorigenesis and progression15. However, interventions targeting these metabolites are still in the early stages of research. The metabolic microenvironment of HCC is heterogeneous, with different tumor regions potentially exhibiting distinct metabolic features16,17. This heterogeneity poses significant challenges for treatment. Precision treatment targeting this heterogeneity is an essential research area. Yet, for HCC, precision treatment targeting its specific metabolic pathways remains a challenge18. Despite significant advancements in our understanding of HCC metabolism, several gaps remain in previous research. One critical area that has been overlooked is the metabolic heterogeneity within HCC tumors. Previous studies have primarily focused on identifying common metabolic alterations in HCC, but have failed to address the significant variability in metabolic profiles between different patients and even within the same tumor. This metabolic heterogeneity can lead to variable treatment responses and the emergence of drug resistance, highlighting the need for more personalized and targeted therapeutic approaches. Consequently, a comprehensive exploration and comprehension of metabolic heterogeneity in HCC are critical for elucidating the molecular basis of metabolic abnormalities, accelerating the development of new metabolism-specific drugs, and promoting risk stratification and individualized management strategies for HCC patients.
Over the past decades, considerable efforts have been made to categorize HCC into several predominant molecular subtypes, each characterized by unique genomic alterations, mutational patterns, diverse prognoses, and/or activation of oncogenic pathways. To delve into the heterogeneity of HCC, numerous stratification strategies have been devised. Certain clinically significant subtypes have emerged as closely tied to specific pathways, histopathological features, and immune microenvironment characteristics19–21. Although previous studies have revealed several subtype-specific characteristics of HCC, it was difficult to reveal the comprehensive characterization of prognosis and progression with only a single or a small number of biomarkers. Therefore, as a result of the small sample size and a deficiency of multi-omics data, few studies have comprehensively investigated HCC from the perspective of metabolic disturbances and heterogeneity22,23. The present study aimed to comprehensively investigate which metabolic flow alters the overall metabolic preference and transcriptional signatures in HCC. Utilizing multi-omics data derived from 4 independent HCC cohorts, we successfully identified 3 metabolic subtypes. These 3 metabolic pathway-specific subtypes have well-marked metabolic preferences and gene expression patterns, DNA methylation level, protein phosphorylation level, clinicopathologic features, and distinct prognosis. It is worthwhile to recognize the metabolic pattern associated with prognosis based on distinctly metabolic characteristics, which may facilitate biological insights in identifying potential therapeutic targets, as well as clinically unique subclasses that may affect the management strategies for HCC patients (Graphical Abstract).

Materials and methods

Materials and methods

Data retrieval, pre-processing, and validation cohorts
RNA-seq gene expression profiling and associated phenotype data for HCC and non-cancerous liver tissue were obtained as transcripts per kilobase million values (TPM) from the Cancer Genome Atlas (TCGA, http://cancergenome.nih.gov/) and the International Cancer Genome Consortium (ICGC, www.icgc.org, http://lifeome.net/database/hccdb/download.html), and metabolic genes were selected based on MSigDB. To validate associations between metabolism-associated gene patterns and metabolic subtypes, an additional cohort holding of 221 HCC microarray samples and corresponding materials attained from GSE14520 were utilized24. The three HCC datasets were then combined into a single metadata set, and data were ComBat corrected for batch effects25. While overall survival (OS) and recurrence-free survival (RFS) has been regarded as the standard primary clinical endpoint, and those without defined endpoints will be excluded. Figure S1 indicated principal component analysis (PCA) and t-distributed stochastic neighbor embedding (t-SNE) analyses before and after batch effect correction. Proteome and phosphoproteome data of 159 HCC patients were approved and provided by the researchers in supplementary materials26. The Random Forest model in multiple imputations by chained equations (MICE) was applied for the missing data imputation. Proteins and phosphorylation sites with more than 80% missing values were eliminated before imputation to ensure that each sample contained sufficient data for future processing. The imputation method was implemented in the R package mice (version 3.16.0)27,28. A total of 983 HCC patients were enrolled in the study, and the baseline characteristics were summarized in Table S1.

DNA methylation data pre-processing and preliminary screening
TCGA provided a gene-specific DNA methylation profile of HCC patients. Illumina Human Methylation 450 Beadchip (450K array) was used to measure genome-wide methylation. Array quality assessment, data preprocessing, normalization, and filtration were performed in the R package Minfi (version 1.52.1). A total of 394201 CpG sites with applicable values throughout the genome were subjected to the following criteria: (i) probes with detection P-value ≥ 0.01 and < 3 beads in at least 5% of samples were eliminated; (ii) probes on the non-sex chromosomes and in CpG context; (iii) non-overlapping probes with single nucleotide polymorphisms; (iv) probes with stable genomic sites in the human genome29,30. We then executed data normalization and the β-value calculation in the R package minfi and ChAMP (version 2.36.0)30–32. The determination of DNA methylation levels (β-values) involved the calculation of the ratio of intensities between methylated and unmethylated alleles, which spanned from 0 (completely unmethylated) to 1 (completely methylated), respectively (Figure S2A-C). A total of 47474 probes from the DNA methylation chip were identified and annotated to 3124 gene signatures associated with metabolism (Figure S2D).

Non-negative matrix factorization (NMF) for subtype identification
Genes with a median absolute deviation (MAD) ≥ 0.5 were incorporated, resulting in a total of 3124 metabolism-associated genes were regarded as input matrix for clustering analysis in the R package NMF (version 0.27). To determine the optimal factorization rank (k), we performed NMF based on 50 random starting points for k values ranging from 2 to 10. Various separation qualities including silhouette width, cophenetic coefficients, and dispersion were evaluated for each factorization rank. The k values corresponding to a sudden decline in the magnitude of the cophenetic coefficient were considered as the optimal number of clusters. The samples were then assigned to HCC subtype using the optimal k value on the coefficient matrix, and 100 iteration runs were performed from fixed random initial conditions. PCA and t-SNE plots were used to detect differences in expression between subgroups.

Gene set enrichment analysis (GSEA) and Gene set variation analysis (GSVA) analysis
GSEA analysis was performed in GSEA tool (https://www.gsea-msigdb.org/gsea/index.jsp, version: 4.3.2), and GSVA analysis was implemented in R package GSVA (version 1.52.3). Normalized enrichment scores (NES) were used to investigate the enrichment variation of biological processes in different phenotypes. Gene sets with a false discovery rate (FDR) q value and a nominal P value of < 0.05 were considered statistically significant in GSEA33,34. The normalized enrichment scores ranged from -1 to 1, corresponding to the relative enrichment score in each sample in GSVA. A Wilcoxon signed ranks test (P value < 0.05) was performed to compare enrichment scores in phenotype-matched samples for each gene set. A total of 140 metabolism-associated gene signatures, 31 cancer-relevant pathways, and 44 HCC subclass-relevant signatures were obtained from previously published studies using the MsigDB (Table S2)5,26,35,36. Subtype-specific pathways between at least two subgroups were included in further analysis.

Quantification of immune cell infiltration
The deconvolution algorithm took into account the transcriptome profile of heterogeneous samples by conceptualizing it as a convolution resulting from the presence of various cell types. It estimated the unknown cell abundance using a signature matrix describing cell type-specific expression profiles. Therefore, the relative proportions of 22 human immune cell subsets can be estimated quantitatively using a leukocyte gene signatures matrix (CIBERSORT deconvolution algorithm, https://cibersortx.stanford.edu/)37,38. The gene expression profile of HCC based on three RNA-seq data portal was used as the input matrix for CIBERSORT, which calculated empirically defined deconvoluting global p-values for each sample.

Benefit prediction of immunotherapy and targeted therapy between HCC subtypes
Based on SubMap algorithm (Gene Pattern, https://www.genepattern.org/), available data from patients treated with immunotherapies were utilized to indirectly predict the potential benefits of three HCC subtypes by measuring the resemblance in gene expression patterns between subtypes and individuals who responded positively to anti-CTLA4, anti-PD1, Pembrolizumab, anti-PD-L1 or anti-MAGE-A3 therapy39–42.

Identification of subtype-specific gene signatures and fuzzy C-means clustering
The R package Limma (version 3.60.3) was employed to identify differentially expressed genes (DEGs) within four subgroups, namely Normal (Control), iHCC1, iHCC2, and iHCC3, respectively. The samples are divided into two groups, for each subgroup, one containing sample from this subgroup and the other collecting the remaining samples from the data cohort. The Student’s t-test was employed to conduct a comparative analysis between the two groups. Benjamini-Hochberg method was used for multiple inspections and correction of the obtained P values. Significant DEGs were assigned using a P value < 0.05 and a log2-transformed fold change of ≥ |±1.5|. Only genes exhibited significant differences across all 6 possible comparisons were considered as subgroup-specific DEGs. The fuzzy C-means algorithm of R package Mfuzz (version 2.66.0) was implemented to perform clustering analysis of DEGs according to HCC evolution orders, which concentrated genes with similar expression trends into one group. Normal (Control), iHCC1, iHCC2, and iHCC3 were sequentially assigned points 1, 2, 3, and 4, sequentially. Finally, DEGs were divided into 4 clusters with a membership threshold of 0.2543,44.

Single-cell RNA sequencing data processing
The raw gene expression matrix was imported and subjected to processing in the R package Seurat (version 4.0.4). HCC scRNA-seq dataset was obtained from the GEO dataset45. HTseq calculated unique mapping reads and counts from the same UMI (Unique Molecular Identifier), which were then merged to obtain transcript counts for each gene from a single cell. Cells with < 8000 counts, 500 genes and > 20% of total UMIs mapping to the mitochondrial genome were excluded from the following analyses. After filtration, the scRNA-seq matrix was merged, and R package Harmony (version 1.2.0) (lambda = 1, max.iter.harmony = 20) was used to remove the batch effect46,47. The matrix was subsequently subjected to scaling and normalization using a library size factor so that every cell had 10,000 counts and was log2-transformed. The variance-to-mean ratio was then mployed to identify highly variable genes (HVGs) using the FindVariableFeatures function in R package Seurat (version 4.0.4). Significant genes (FDR ≤ 1e-3) were used to perform PCA analysis to reveal biologically significant variables, as determined by the application of. To ascertain the optimal cluster numbers, the FindClusters function (resolution = 0.8) was used48,49. Cells were projected into two dimensions using a Uniform Manifold Approximation and Projection (UMAP) dimensionality50. Cell types were identified using lineage-specific signatures45. The CellMarker (version 2.0) database was used for manual verification and correction51. The final annotation results were manually verified and visualized by UMAP.

Mutation landscape analysis
The DNA somatic mutation data for HCC patients were obtained from TCGA database. Additionally, three other mutation datasets are from cbioportal (Ulsan2014, FR2015) (https://www.cbioportal.org/)52,53 and the Proteogenomic cohort. Somatic variant analysis was conducted to analyze and visualize the Mutation Annotation Format (MAF) using R package maftools (version 2.20.0)54. A waterfall plot with variant frequencies depicted the mutation landscape.

Machine learning for subtype classifications and performance evaluation
R package caret (version 6.0-94) was utilized for model training and appraisal55,56. In brief, we initially determined ternary classification objectives based on metabolic subtypes, followed by conducting feature selection for each objective. Subtype-specific signatures were employed initially as an input matrix for iHCC identification. AdaBoost, XgBoost, Random Forest, and Boruta algorithms were adopted as the feature dimensionality reduction and elimination algorithms to identify the most significant features57,58. Venn diagrams were used to demonstrate the intersection of different algorithms and were regarded as detection features that should be incorporated into the final model construction. Subsequently, for each objective, we developed machine learning (ML) models based on the selected features. Furthermore, to evaluate the overall performance of the prediction models, a 5-fold cross-validation was conducted within the training set. Each fold underwent the aforementioned training procedure and was tested on the reserved portion of the training set. The specific algorithms were Decision Tree (DT), support vector machine (SVM), Random Forest (RF), linear discriminant analysis (LDA), k-nearest neighbor (KNN), extreme gradient boosting (XgBoost), Adaptive Boosting (AdaBoost), and MultiLR, respectively59–61. The HCC samples were randomly divided into 3 groups based on the subtype assignment collected from the NMF cluster (k = 3) so that each subtype was represented in 3 subgroups. The random segmentation was repeated ten times to obtain a reliable performance evaluation. The samples were randomly split into training (70%) and testing (30%) sets for each run, respectively. The parameters that yielded the highest cross-validation AUC for each classifier model were selected. Model performance comparison was employed based on the per-subtype confusion matrix and balanced accuracy (to adjust for the class imbalance problem)62,63. Cohen’s kappa was used to compare overall ML model performance for consistency inspection and classification accuracy measurement61.

Development and external validation of metabolism-associated prognostic model
In the discovery cohort, whether the candidate signatures were relevant to the survival status was evaluated through univariate Cox proportional hazards regression analysis. Subsequently, the Elastic Net Regression was employed for further feature filtering. Stepwise Cox regression was utilized to identify an optimal subset for prognostic model construction. The risk score represented the weighted sum of each signature expression, considering their respective coefficients. The restricted cubic spline (RCS) was applied to visualize the optimal threshold, enabling the classification of all patients into high- and low-risk groups.

Human samples and immunohistochemistry assay
Resected HCC tissues and noncancerous liver tissues were obtained and granted by the clinical trial ethics committee (No. 2023023) diagnosed first by radiology or histology at the First Affiliated Hospital of Bengbu Medical University. The study has obtained written informed consent from participants or their legal guardians (for patients lacking autonomous capacity) and complies with the Declaration of Helsinki. Paraffin-embedded human HCC samples were cut in 5 μm sections, which were processed for histological staining and immunohistochemically (IHC) analysis was performed using a previously described method.

Statistics and reproducibility
Statistical analyses were conducted utilizing the R package (version 4.0.4, RStudio 2024.04.2 Build 764). The Fisher exact and Mann-Whitney U tests were used to compare categorical and continuous variables, respectively. Unless explicitly mentioned, all statistical tests (paired Wilcoxon rank-sum test (Student’s t-test, Mann-Whitney U-test), etc.) were two-sided, and adjusted P values were calculated using the Benjamini & Hochberg method for each comparison. Statistical significance was determined at a threshold of 0.05 or lower. Significance levels are as follows: *P < 0.05; ** P < 0.01; ***P < 0.001; N.S., insignificant.

Results

Results

Gene set variation analysis (GSEA) displays metabolic dysregulation during HCC development
We obtained the gene expression profiles of HCC and normal liver tissues in TCGA, and searched metabolism-related genes in MsigDB (Figure 1A). Amino acid, glucose, lipids, nucleotide, vitamin metabolisms, and other related metabolic pathways were significantly dysregulated in HCC (Figure 1B, Table S3). Table S3 provided a comprehensive description of the difference in molecular pathways as determined by the GSEA analysis. NES and FDR q-value revealed an enrichment of metabolic pathways involved in different tissues. Activation of multiple metabolic pathways, particularly those related to nucleotides (such as the reactome metabolism of nucleotides), has been noted in HCC. The upregulation of these pathways is associated with increased levels of DNA synthesis and proliferation of tumor cells, suggesting that adaptive metabolic changes take place during the development of HCC. Anabolism and catabolism are two key biological processes involved in these metabolic pathways. A total of 3214 non-redundant metabolism-related genes were obtained through the extraction of genes from these gene sets. Furthermore, two other HCC RNA-seq profiles were obtained from ICGC and GEO databases, and HCC-related proteome/ phosphoproteome data were obtained from materials by other investigators. According to metabolic genes, expression profiles at gene and protein levels were extracted from these 4 HCC cohorts. Heatmaps revealed the metabolic genes/ proteins difference in noncancerous liver tissues of patients with HCC (Figure S3).

NMF clustering identified distinct subtype-specific prognosis in HCC
In TCGA cohort, 371 HCC samples were categorized into 3 metabolic subtypes with different prognosis (iHCC1, iHCC2, and iHCC3) using unsupervised clustering based on the NMF algorithm (Figure 2A). The consensus matrix heatmap and silhouette coefficient revealed distinct boundaries between different subtypes, indicating that the clustering results of the samples were robust and reliable (Figure 2B). Among them, the iHCC3 subtype had the worst prognosis, with shorter overall survival (OS) (Figure 2C, left) and recurrence-free survival (RFS) (Figure 2C, right) than iHCC1 and iHCC2. The three metabolic subtypes were distinguished from one another using PCA and t-SNE dimensionality reduction techniques, revealing unique internal differences among the subtypes. (Figure 2D). We performed NMF clustering in two external RNA-seq cohorts (ICGC, and GEO), a Proteogenomic, and a merged cohort, to determine whether HCC can be robustly divided into three metabolic subtypes. These findings indicate that there are still three robust metabolic subtypes of HCC. Similarly, the iHCC3 subtype had the worst prognosis in different RNA-seq cohorts (Figure S4 and Figure S5) and the Proteogenomic cohort (Figure 3), with shorter OS and RFS times than the other subtypes. The Submap mapping algorithm was performed to compare metabolism-related gene (protein) expression patterns among the 3 metabolic subtypes. The iHCC2 and iHCC3 subtypes were significantly similar across different cohorts (Bonferroni corrected P < 0.05), indicating the robustness of iHCC2 and iHCC3 subtypes in HCC. In ICGC and Proteogenomic cohorts, iHCC1 was not significant (Bonferroni corrected P > 0.05) (Figure S6A). In addition, we investigated the differential expression pattern of metabolism-related genes (proteins) among different metabolic subtypes. The heatmap illustrated clear disparities in the abundance of metabolic genes (proteins). Unique subtype-specific signatures in HCC, which may have driven the evolution of the different metabolic subtypes (Figure S6B).

Characterizing different characteristics of HCC metabolic subtypes
The Chi-square test demonstrated a significant correlation between clinicopathological characteristics and HCC subtypes across different HCC cohorts. In TCGA cohort, iHCC3 had a higher histologic grade (G3/ G4) (P < 0.001), and a higher alpha-fetoprotein (AFP) level (P < 0.001) (Figure 4A, Table S4). In GEO cohort, iHCC3 was associated with a worse Child-Pugh score (P < 0.001), higher AFP level (P < 0.001), and higher pathologic stage (Stage III/ IV) (P = 0.0379) (Figure 4A, Table S4). Higher histologic grade (G3/ G4) (P < 0.001) and pathologic stage (Stage III/ IV) (P = 0.00435) were associated with iHCC3 in the ICGC cohort (Figure 4A, Table S4). AFP (P < 0.001), γ-Glutamyl Transferase (GGT) levels (P = 0.00317), and tumor purity (P = 0.0354) were higher in the iHCC3 in the Proteogenomic cohort (Figure 4A, Table S4). Consistently, higher AFP levels were strongly associated with the iHCC3 subtype (Figure 4B), and higher AFP generally predicted shorter OS or RFS survival (Figure 4C).

Inter-tumor metabolic heterogeneity in HCC and distinct subtype-specific metabolism alterations
To further investigate the metabolic alterations between subtypes, we employed GSVA to quantify relevant metabolic pathways, followed by differential analysis to identify subtype-specific pathways. The heatmap revealed that iHCC3 had high nucleotide metabolic activity (for example, pyrimidine and purine metabolism) in the TCGA cohort, whereas pathways rather than nucleotide metabolism (Amino acid, glucose, lipids, vitamin metabolisms, and drug related metabolic pathways) were activated in iHCC2, a pattern consistent across other cohorts as well (Figure 5A). The overall landscape of iHCC1 was similar to that of iHCC2, but subtype-specific characteristics were preserved. In particular, ascorbate and aldarate, xenobiotics by cytochrome P450, heme and porphyrin, Krebs cycle, phenylalanine, porphyrins, pyruvate metabolism, TCA (Tricarboxylic acid) cycle, pentose and glucuronate interconversions, and fatty acid metabolism pathways were enriched in iHCC2 (Figure S7). Furthermore, we conducted a comprehensive review of the existing literatures to identify gene sets associated with HCC progression and previously reported HCC molecular subclasses. Through GSVA, we observed the activation of numerous pathways related to HCC progression and carcinogenicity in iHCC3, which aligned with the presence of subtypes with poorer prognosis. In detail, pathways associated with tumor metastasis, vascular invasion, recurrence, and EPCAM expression were up-regulated in the iHCC3 subtype (Figure 5B). The Wnt, Notch/PI3K-AKT/ mTOR, TGF-β/ Smads, and hypoxia/ autophagy-associated pathways were also activated in iHCC3 (Figure 5A, Table S5).

Fuzzy C-means clustering reveals a dynamic expression landscape of subtype-specific signatures
Four groups (Control group, iHCC1, iHCC2, iHCC3) of genes were clustered according to their distinct temporal expressing patterns using the fuzz C-means clustering algorithm. Some DEGs were specifically up-regulated or down-regulated between controls and iHCC subtypes (Figure 6A). The analysis of metabolic gene mapping to their respective pathways reveals that genes associated with nucleotide metabolism exhibited elevated expression levels in the iHCC3 subtype, while showing a significant decrease in expression in the control group. Genes related to other metabolic pathways were highly expressed in the control group and lowly expressed in the iHCC3 subtype. Consistently, similar trend was observed in GEO and ICGC cohorts (Figure 6B, top). Moreover, aberrant hypomethylation of promoters associated nucleotide metabolism and increased levels of proteins and phosphorylation were detected in iHCC3. Conversely, in normal liver tissues, genes involved in metabolic pathways other than nucleotide metabolism were hypermethylated and decreased protein abundance was accompanied by a low phosphorylation level (Figure 6B, bottom).

Global landscape of metabolic heterogeneity at single-cell resolution
In addition, we endeavored to evaluate the metabolic heterogeneity of HCC at the single-cell level. UMAP dimensionality reduction identified 37 major clusters subdivided into eight major cell populations according to ElbowPlot and clustering tree in PCA algorithm, following the correction of batch effects (five CD8+ T clusters (C0, C1, C20, C27, C36), six Myeloid clusters (C2, C4, C5, C11, C29, C30), twelve Hepatocyte clusters (C3, C7, C9, C12, C14, C16, C18, C21, C25, C26, C31, C34), two Endothelial clusters (C6, C33), six CD4+ T clusters (C8, C19, C22, C24, C28, C32), one Fibroblast cluster (C10), one Cholangiocyte cluster (C13), and 4 B cell cluster (C15, C17, C23, C35)) (Figure S8). Canonical lineage markers were specifically expressed among different major cell populations (Figure S9). To further investigate the metabolic heterogeneity of HCC at the single-cell level, we initially analyzed the panoramic view of metabolic gene expression in these 37 primary clusters using UMAP based on 120 metabolic and HCC-related pathways. GSVA analysis revealed the cell-cluster-specific differences between 37 cells clusters in different pathways (Figure 7A). After mapping these cell clusters to the major cell populations, hepatocytes and cholangiocyte cells exhibited significant metabolic characteristics in amino acid, glucose, lipids, vitamin metabolisms, and drug related metabolic pathways. Conversely, CD4+ T cells displayed heightened nucleotide metabolic activity (Figure 7B). Then, 12 hepatocyte clusters were re-clustered into 15 major clusters, based on 3214 metabolic genes, instead of HVGs (Figure 7C). It was further identified that hepatocytes derived from normal tissues (NTL) were grouped into one cell population within the designated region (a part of the dotted circle). Conversely, hepatocytes derived from tumor tissues (PT, MNL, and PVTT) were clustered into independent cell populations and separated from their tissue of origin, indicating that cells derived from HCC tumors also exhibited metabolic heterogeneity between individual patients (Figure 7D). Additionally, Figure 7D suggested that malignant hepatocyte clusters derived from the same patient (HCC7T and HCC7P, HCC08T and HCC08P, HCC10T and HCC10L) do not completely overlap, suggesting the presence of metabolic heterogeneity among them. The application of clustering analysis following UMAP dimensionality reduction demonstrated that the malignant hepatocyte cells exhibited discernible clusters, which aligned with the primary origin of their respective tumors in HCC (i.e. from which tumor site, stage or viral infection status the cells were derived) (Figure S10). Since hepatocyte cells derived from normal tissues were clustered and exhibited no observable interpatient heterogeneity within the same cell types, suggesting that metabolic gene expression in malignant cells was predominantly determined by patient-specific factors. Single-cell GSVA analysis depicted a striking difference in verious metabolic and HCC-related pathways between tumor cells derived from 15 hepatocyte clusters (Figure 7E). Furthermore, we tried to characterize the unique global metabolic landscape of different Patients, Samples, Tissues, Sites, Viral infection status and Stage, ultimately revealing the presence of location-specific metabolic heterogeneity (Figure S11A-E). The Pearson correlation also suggested three major subsets of hepatocyte clusters (Figure S11F) and the NMF algorithm also divided malignant hepatocytes into three major clusters (Figure S11G). Figure S12 showed that patients or tumor tissues with similar metabolic status are divided into the same NMF cluster (Figure S12A), whereas metabolic genes exhibited different global structures among different clusters (Figure S12B). To further determine major contributors to intra-tumoral metabolic heterogeneity of HCC (i.e. variation of metabolic characteristics among malignant cells from different HCC patients), CD4+ T, CD8+ T, B cell, and Myeloid cell clusters were then re-clustered based on 3214 metabolic genes. The UMAP clustering analysis revealed that there was no discernible heterogeneity among different patients in terms of CD4+ T, CD8+ T, and B cell populations (Figure S13A-C). Figure S13D illustrated that the re-clustered Myeloid cells exhibited a higher degree of distinctiveness in their clustering patterns across different patients. Consequently, these cells were subsequently redefined into eight prominent cell clusters (Figure S13E), and GSVA analysis revealed high metabolic activity in myeloid-2, 3, and 6 (Figure S13F). We subsequently endeavored to identify the specific variation of metabolic pathway among the different myeloid cell clusters. Following this, we conducted enrichment scoring utilizing established markers for recognized cell types. Through this approach, we identified 11 major Myeloid populations (including both cytotoxic lymphocyte cells (CTL), Proliferative CD4 T cells, regulatory T cells (Treg), Mature B cells, MARCO+ macrophage, Dendritic cells (DCs), mucosal-associated invariant T cells (MAIT), resident memory T cells (Trm), Monocyte-derived macrophage, Plasma B cells, and MMP9+ macrophage) (Figure S14A-B). GSVA analysis demonstrated that Proliferative CD4 T cells showed higher metabolic activity compared to other major cell populations, followed by MMP9+ macrophage, MARCO+ macrophage, and Monocyte derived macrophage (Figure S14C). Stacked barplots showed significant depletion of CTL cells and enrichment of MARCO+ macrophage, Treg, and Proliferative CD4 T cells in tumor tissues (PT, MNL, and PVTT) as compared to normal tissues (NTL) (Figure S14D). Enrichment analysis revealed that multiple amino acid and vitamin-related metabolic pathways were activated in the CTL cell population (Figure S14E). We conducted GSVA analyses on each macrophage cell cluster and observed that metabolic heterogeneity in all macrophage cell types was primarily driven by variations in metabolic activities. More specifically, lipid-related metabolic pathways were enriched in the MARCO+ macrophage cluster, while the MMP9+ macrophage cluster demonstrated a predominant enrichment of pathways associated with nucleotide metabolism (Figure S14F). Collectively, these findings indicated that hepatocyte and Myeloid cells derived from HCC tumor exhibited higher metabolic plasticity, potentially resulting in patient-specific metabolic reprogramming of malignant cells as opposed to other supporting cells in the TME. Our comprehensive analysis determined significant metabolic heterogeneity in HCC, which may be related to metabolic reprogramming during HCC occurrence and development.

Immune microenvironment heterogeneity among the three HCC subtypes
Since iHCC3 held the worst prognosis, we further characterized their immune landscape across the 22 human hematopoietic cell phenotypes with the CIBERSORT algorithm from 3 HCC cohorts (Figure 8A-C). Specifically, CD8+ T cells and M0 Macrophages exhibited lower abundance in iHCC3, while a higher abundance of naive CD4+ T cells and M2 Macrophages in the TCGA cohort (P < 0.05). Consistently, this conclusion was also applicable in the GEO and ICGC cohorts (Table S6). We conducted additional analysis to examine the characteristics of immune checkpoint genes that may be targeted in these three subtypes. Figure 8D indicated that iHCC3 had significantly higher expression of 14 immune checkpoint signatures (excluding IL-6) compared to iHCC1 and iHCC2. This implied that different metabolic subtypes may yield disparate outcomes in forthcoming immune checkpoint inhibitor therapy, which needed further investigation. Our study compared the gene expression patterns between different subtypes of HCC and other patients undergoing various immunotherapies, including anti-CTLA4, anti-PD1, Pembrolizumab, anti-PD-L1, and anti-MAGE-A3 therapies. The comparison was conducted using the TCGA and GEO cohorts, employing the Submap algorithm. Bonferroni-corrected P-values indicated iHCC3 was more promising to respond to anti-PD-L1 therapy, whereas iHCC2 was more favorable to the Pembrolizumab treatment (Figure 8E-F). Based on the Genomics of Cancer Drug Sensitivity (GDSC) database, we predicted the sensitivity of each subtype to 198 drugs using the R package oncoPredict (version 0.2). Figure 8G highly responded to the effectiveness of anti-pyrimidine drugs in iHCC3. Notably, PI3K/AKT, growth factor, GSK/ ERK/ TKI, and Wnt/ TGF-β pathway inhibitors may be applicable in different metabolic subtypes (Figure S15). These results emphasized the potential role of specific TME in tumor metabolic heterogeneity.

Mutation landscape of metabolic subtypes in HCC
Metabolic reprogramming and epigenetic modifications, which were inextricably linked and reciprocally regulate each other26,64. We analyzed the global mutational landscape in four independent cohorts. TP53 (22%-58%), TTN (21%-37%), MUC16 (12%-23%), and CTNNB1 (18%-36%) were frequently observed as missense mutation in HCC (Figure S16A). Further analysis revealed that iHCC3 had a higher frequency of TP53 mutations (iHCC1: 31/94 (24.8%), iHCC2: 37/116 (24.2%), iHCC3: 42/51 (45.2%), P < 0.001), while CTNNB1 mutations were more frequently observed in iHCC1 (iHCC1: 49/76 (39.2%), iHCC2: 20/133 (13.1%), and iHCC3: 22/71 (23.7%), P < 0.001) (Table S7). Stratified survival curves demonstrated that patients with TP53 mutations had significantly shorter OS (P = 0.005) and RFS (P = 0.012). Conversely, patients with CTNNB1 mutations exhibited a statistically significantly favorable prognosis than CTNNB1-wild-type patients in each HCC subtypes (OS: P = 0.01, RFS: P = 0.002), respectively (Figure S16B). Proteins enriched in CTNNB1-mutated and TP53-mutated HCC tissues have been associated with several metabolic pathways incorporating lipids, glycolysis/ gluconeogenesis, amino acid metabolism, and drug-related metabolism26,65,66. We performed a GSEA analysis among CTNNB1-mutated and CTNNB1-wild-type samples to investigate the molecular changes associated with CTNNB1 mutation. Amino acid, fatty acid, and drug-related metabolism pathways were enriched in CTNNB1-mutant tissues (Table S8). In contrast, TP53 mutations were primarily associated with the activation of pyrimidine, polyamine, and glucose metabolism (Figure S16C, FDR q-value < 0.05) (Table S8).

Machine learning models for metabolic subtypes classification and performance evaluation
An attempt was made to identify subtype-specific signatures in the three HCC subtypes, based on AdaBoost, XgBoost, Random Forest, and Boruta algorithms. A total of 39 genes were selected for further analysis on the basis of the intersection of the four feature selection algorithms (Figure S17A). Utilizing these features, we built a multi-gene classifier based on eight machine-learning algorithms. The XgBoost model demonstrated superior performance in terms of balanced accuracy, overall accuracy, and Kappa score compared to the other seven machine learning models, resulting in improved classification performance on the test set (96.9∼98.9% balanced accuracy, 0.973 overall accuracies with 95.8% Kappa score) (Table S9). These findings indicated that the multi-gene classifier was promising in identifying various metabolic subtypes in HCC, it needed to be validated in real-world datasets. Figure S17B and S17C demonstrated that these features are demarcated and independent of each other between 3 HCC subtypes and normal liver tissues, implying that subtype-specific genes may drive different metabolic subtypes. These results may reveal the potential role of subtype-specific gene signatures in driving the metabolic landscape of different HCC patients. Furthermore, we identified the expression of ALDOB, GNMT, HAAO, HMGCS2, IMPDH2, MTHFD1L, PYCR2, SLC10A1, and TRMT61A were significantly negatively correlated with CpG methylation (Figure S17D). Moreover, their expression levels and CpG site methylation levels significantly differed among normal liver tissues and metabolic subtypes. We hypothesized that this differential expression might be driven by DNA methyltransferases between different metabolic subtypes (Figure S17E), which needs to be further validated at the histopathological and cytological levels. Both bulk RNA-seq and single cell RNA-seq analyses confirmed the dysregulation of these nine genes in HCC (Figure S17F, Figure S18).

Development and validation of a machine learning model as a novel stratification strategy for HCC patients
To further expand the clinical availability of metabolic subtypes, 39 subtype-specific signatures were employed as an input matrix to construct a prognostic model. A total of 30 genes reached the threshold of P < 0.2 in univariate cox regression, then 15 non-zero variables were further identified through Elastic Net regression analysis (Figure 9A). Finally, DHX37, FBL, HMGCS2, ALDOB, NR1H3, and TRMT61A were identified by stepwise multivariate regression analysis in the final model. The model exported the tumor metabolic index (TMI) of each patient by the formula below: (0.602∗DHX37) + (0.040∗FBL) + (-0.0571 ∗ HMGCS2) + (- 0.383 ∗ ALDOB) + (-0.018 ∗ NR1H3) + (0.15 ∗ TRMT61A). The RCS curve effectively determined the optimal threshold value at which the hazard ratio (HR) equaled 1 (Figure 9B). 173 HCC patients were assigned into the high-risk group, while the other 198 were classified into the low-risk group. The KM plot revealed a 194% increased risk of death in the high-risk group compared with the low-risk group (Log-Rank P < 0.001). The ROC curve (AUC) showed that the TMI model provided better prognostic prediction efficiency at every temporal node (Figure 9C). Three independent validation cohorts confirmed the robustness of the TMI model, revealing that high TMI had a higher risk of mortality (GEO cohort: HR = 1.82, P = 0.005; ICGC cohort: HR = 4.82, P < 0.001; Proteogenomic cohort: HR = 3.71, P < 0.001) (Figure 9D and 9E), and AUC indicated the prediction efficiency of the model at different time points (Figure 9F). Furthermore, in four independent cohorts, both univariate and multivariate Cox regression analyses identified TMI as a significant independent prognostic factor (P < 0.05) (Table S10). To provide better clinical applicability, we developed a multivariate nomogram based on risk score, and various clinical features (Figure 10A) to intuitively understand the 1-, 2-, 3- and 5-year survival probabilities of each HCC patient. A good degree of consistency between the calibration curve and the ideal curve was observed (Figure 10B). Time-dependent AUC demonstrated better accuracy compared to clinical models, TMI, and other single features at different time nodes (Figure 10C-D). Furthermore, the application of decision curve analysis (DCA) demonstrated that the Nomogram model exhibited a superior net benefit across various hierarchical strategies (Figure 10E).

ALDOB represents a potential prognostic biomarker and therapeutic target in HCC
Previous results have demonstrated a significant down-regulation of ALDOB in HCC, particularly in iHCC3. Furthermore, the KM plot analysis revealed a correlation between decreased ALDOB expression and reduced OS and RFS (Figure S19A). Further analysis also demonstrated dramatically decreased phosphorylation of ALDOB at eight sites (Figure S19B). We also observed that the methylation level of multiple CG sites on ALDOB was negatively correlated with gene expression (Figure S19C), and there were numerous CpG islands in the promoter region, suggesting that DNA methylation may mediate the deficiency of ALDOB (Figure S19D). DNA methyltransferases (DNMTs) are a family of “writer” enzymes responsible for DNA methylation. As the three active enzymes that maintain DNA methylation, our results showed that they were significantly negatively correlated with ALDOB expression and had opposite expression pattern to ALDOB in normal liver tissues and three HCC subtypes (Figure S19E-F). Further, we observed a continuous decrease in ALDOB with HCC progression in four cohorts (Figure 11A). In HPA database, ALDOB showed weak or moderate protein levels in HCC tissues compared to normal liver tissues (Figure 11B-C). We verified the low protein levels of ALDOB in HCC by immunohistochemical staining (IHC) in real world cohort (Abcam, ab153828, 1:500) (Figure 11D). In previous results, we observed significant upregulation of the Notch, PI3K/ AKT, mTOR, and Wnt pathways, which were closely associated with lipid metabolism (Figure S20A). Furthermore, GSEA analysis showed that high levels of DNMTs and low levels of ALDOB were closely associated with activation of Notch, PI3K/ AKT, mTOR and Wnt pathways and dysregulation of lipid metabolism pathways (Figure S20B). The above results suggest that the DNMTs/ALDOB/Notch/PI3K-AKT/mTOR metabolic axis may play a role in the occurrence and development of HCC, which needs further verification.

Discussion

Discussion
HCC is one of the most common categories of liver cancer, and effective treatment often varies significantly among patients owing to inherent tumor heterogeneity. A comprehensive and hierarchical approach to managing HCC patients is anticipated, aiming to elucidate the molecular basis of heterogeneity and identify potential therapeutic targets. Early-stage diagnosis and management of HCC can be effectively achieved through a hierarchical framework based on pathological and histological classification. Recognizing high-risk HCC patients to reduce mortality and customize individualized treatment before the emergence of severe symptoms and complications remains a critical task. In this study, we stratified and characterized each HCC subtype by molecular signatures using an unsupervised algorithm (NMF) based on the expression profiles of metabolic genes and/ or proteins. Integrated multi-omics analysis identified 3 distinct HCC subtypes, each with significant differences in metabolic and HCC-related pathways projecting at the genomic, transcriptomic, proteomic, and phosphoproteomic levels. These subtypes differ in clinical outcomes due to altered metabolic activities, Wnt, Notch/PI3K-AKT/mTOR, TGF-β/ Smads, and hypoxia/ autophagy-associated pathways, etc.
Compared to previous studies, patients classified under the iHCC3 subtype, which is associated with the poorest survival and disease progression, exhibited several indicators of reduced survival and an increased likelihood of disease recurrence. In contrast, in the high- and moderate-survival groups, iHCC1 and iHCC2 were associated with signatures of low recurrence, high survival, and tumor cell differentiation. Furthermore, the highest levels of AFP and GGT were observed in iHCC3, accompanied by a higher pathological stage (Stage III/IV) or grade (G3/G4). Regarding prognosis and pathway activities, iHCC1 exhibited similarities to iHCC2 overall, while still maintaining subtype-specific characteristics. Notably, with the exception of nucleotide metabolism, nearly all metabolic pathways showed significant down-regulation in iHCC3 compared to iHCC1 and iHCC2. Subsequent analysis suggested that this phenomenon may be linked to hypermethylation and reduced protein phosphorylation of the relevant metabolic signatures. Previous studies have also reported the down-regulation of multiple metabolic enzymes to be associated with a poor prognosis in HCC67–70. Similarly, phosphorylation of proteins associated with metabolic enzymes played a crucial part in the metabolic reprogramming of HCC, including phosphoproteomic alterations in glycolysis, fatty acid-lipid metabolism, bile acid metabolism, TCA cycle, and xenobiotic metabolism26,71. We also conducted a pan-cancer GSVA investigation. HCC demonstrated the most distinct metabolic pattern separated from other 14 common cancer types (Figure S21), highlighting the significance of remarkable metabolic alterations in HCC.
Previously reported HCC molecular subclasses, including Yamshita’s classification (EpCAM+, EpCAM-), Chiang’s classification (CTNNB1, Proliferation, IFN-related, polysomy 7, and an unannotated class), Lee’s classification, Minguez’s classification, Sakai’s classification, Woo’s classification, and Hoshida’s classification (termed S1, S2, and S3) have all been used to compare with our subtype classification. Generally speaking, our classification validates previously reported HCC subclasses while retaining their characteristics. Specifically, iHCC3 corresponded to the EpCAM+ subtype, accompanied by hepatic progenitor cell features and high AFP levels72. iHCC1 corresponds to Chiang’s CTNNB1 subclass, whereas iHCC3 corresponds to the proliferation subclass, with IFN-related and polysomy 7 subclasses showing a stronger match with iHCC273. The high frequency of CTNNB1 mutations in the iHCC1 subtype suggested a potential benefit from inhibitors targeting Wnt signaling26,74. Hoshida’s S1 and S2 subtypes are more associated with iHCC3, characterized by high proliferation, low differentiation and activation of MYC, AKT, and Wnt75. In addition, iHCC3 has been associated with vascular invasion76, intrahepatic metastasis77, higher mononuclear cell infiltration78, and a disappointing outcome79,80. The metabolic subtypes in our study were consistent with the previously identified molecular subtypes.
Metabolic reprogramming is driven not only by a series of oncogenic alterations but also through complex interactions with the tumor ecosystem, influencing tumorigenesis and progression. It is also worth noting that the three metabolic subtypes differed significantly in the immune microenvironment. Prognostic features include the abundance and characteristics of immune cells in the TME and response to immunotherapy. Notably, the iHCC3 subtype is characterized by low levels of CD8 T cells and high infiltration of M2 macrophages, which have been associated with poorer outcomes in previous studies45,81–83. In addition, the differential expression of immune checkpoint signatures predicted the response of different metabolic subtypes to immunotherapy84,85. Moreover, the findings of the present study demonstrating the reciprocal interactions between the anti-tumor microenvironment and metabolic heterogeneity in HCC may provide mechanistic insights into the benefits of immunotherapy. However, the robustness of the predictive capability of the metabolic phenotyping assay needs to be further validated prospectively in a larger patient cohort before translating to the clinic, including responses to immunotherapy and small molecule inhibitors.
The present study investigated a range of subtype-specific gene signatures among 3 iHCC subtypes. We attempted to determine the optimal combinations of features to improve outcome prediction and identify reproducible metabolic subtypes in HCC. Our findings indicate that the XgBoost model provides better discrimination and can recognize different metabolic subtypes with high sensitivity and specificity using a reduced set of 39 signatures. These results provided a potential combination of diagnostic biomarkers for metabolic subtype classification (Table S5). These signatures represent different metabolic proclivities that may contribute to metabolic heterogeneity in HCC. Furthermore, significant alterations caused by hypermethylation or hypomethylation of genes may lead to the reprogramming of HCC (ALDOB86,87, GNMT88, HAAO89, HMGCS290, IMPDH291, MTHFD1L92, PYCR226,93, SLC10A126,94, and TRMT61A95) were observed, suggesting the potential role of DNA methylation in the metabolic reprogramming of tumors96–98. Finally, we identified the DNMTs/ ALDOB/ Notch/ PI3K-AKT/ mTOR metabolic axis as a potential marker and therapeutic target for HCC patients. Meanwhile the internal validation was performed in an unforeseeable test set. Further validation in external cohorts would be beneficial before clinical translation. We aimed to capture multi-omics-integrated data, including radiomics, metabolomics, pathomics and even hematological tests. These multi-omics data should be prospectively extracted from new samples drawn from a larger population and used for independent validation. This would facilitate the ML model more robust and generally applicable for a multi-centres disposition99,100. The refined ML model can be used as a webserver-available classifier in primary care to identify HCC patients. Those identified as high risk will be referred to a medical center with appropriate expertise for an individualized appraisal, if necessary. Overall, this presents as an extremely promising pilot study which needs further validation in a larger cohort.
Regarding intra-tumor metabolic heterogeneity, our findings suggest that the major hypometabolic subtype (iHCC3) has significant prognostic significance in HCC, regardless of the presence or absence of other subordinate subtypes. Further investigation into the metabolic subtypes of HCC in advanced cases through the analysis of tumor biopsies or biofluid-based metabolomics is necessary to understand intra- and inter-tumoral metabolic dysregulation that may aid in early detection and treatment of HCC. Therefore, we conducted extensive GSVA analysis on CCLE cell lines and an additional 81 HCC cell lines based on metabolic and HCC-related pathways101. Our PCA and t-SNE results show that cell lines derived from HCC exhibit distinctly different metabolic profiles from other cell lines (hepatocyte clusters are independent of other tumor cell clusters of different origins) (Figure S22A and 22B). NMF revealed that 81 HCC cell lines replicated the typical three metabolic subtypes (Figure S22C). NMF revealed that the 81 HCC cell lines replicated the typical three metabolic subtypes, with the iHCC3 subtype showing good similarity in specific pathway activity to an independent bulk-seq RNA panel. This suggests that a subtype with high nucleic acid metabolic activity may consistently exist in many HCC cell lines, regardless of the specific cell line or HCC cohort. These results suggest that cell lines such as iHCC3 (JHH2, CLC5, SNU761) may serve as representative cell lines for subtype 3 in subsequent in vitro experiments (Figure S22D). Furthermore, we validated the expression of nine characteristic genes in different cell lines, which also showed good consistency with other HCC cohorts (Figure S22E). Future, according to the results of metabolomics, we analyzed the distribution levels of different metabolites in the three metabolic subtypes through NMF in TIGER-LC cohort, and the results showed that the distribution of different metabolites in the three metabolic subtypes was different (Figure S23A)21. In addition, subtype-specific metabolites that were different in at least two HCC subtypes were considered as subtype-specific metabolites. The enrichment degree of these subtype-specific metabolites in these HCC subtypes was analyzed, and some specific primary and secondary metabolites were found to be more abundant in the iHCC subtype (Figure S23B-D). In addition, we also analyzed the effects of different metabolites on the prognosis of HCC patients, and the abundance of metabolites including N-acetyltryptophan and taurocholate had a significant impact on the prognosis of HCC patients (Figure S24). These results further confirmed the impact of metabolic heterogeneity on the risk stratification of HCC patients, which will greatly promote our understanding of the specific mechanisms of metabolic heterogeneity in future studies.
Bulk RNA-seq evaluates the average gene expression levels across a mixture of various cell types, thereby obscuring the distinctions between cell types within the same sample. This observation was further supported by an extensive analysis of pathway activity distributions in single cells and bulk samples, which demonstrated greater variability in pathway activities among different types of single cells compared to the variability between bulk tumors and normal tissues. Single-cell sequencing revealed that the metabolic activity of various pathways differs in each HCC patient compared to normal hepatocytes. The above results suggest that metabolic heterogeneity may not be completely the same in different subtypes or even within the same subtype, and each HCC patient has its own unique characteristics. This will encourage us to take full account of each independent individual when conducting heterogeneity studies. Tumor heterogeneity plays a crucial role in cancer progression and clinical treatment, and is an important factor in tumor treatment resistance and progression. A comprehensive exploration of tumor heterogeneity at the single-cell level is anticipated to greatly enhance overall tumor survival rates. Similarly, the integration of single-cell multi-omics analysis, akin to bulk multi-omics analyses, is expected to offer substantial insights into unraveling the complexities of cellular heterogeneity.
Due to the high metabolic adaptability of tumor cells, when any metabolic pathway encounters obstacles, tumor cells will automatically switch or enable other pathways, so as to avoid stress damage. Consequently, therapeutic strategies for tumor metabolic regulation should be combined to block or regulate multiple metabolic pathways simultaneously. Both tumor cells and stromal cells undergo rapid metabolic adaptations during tumor proliferation and metastasis. Tumor cells and their “neighbors” maintain their biosynthetic and energy metabolic needs through metabolic coordination or competition, while evading immune surveillance or therapeutic intervention. There is a dual mechanism in tumor evolution. The balance between mutation level and selective metabolic pressure determines the trajectory of tumor evolution, leading to different temporal sequences of accumulation of different somatic cell aberrations. In addition to genomic evolution, cancer cells also evolve through epigenetic and metabolic adaptations and may exhibit a more complex adaptive landscape than dual mechanisms. Elucidation of the interplay between the different types of genetic and non-genetic changes occurring in cancer cells will help reveal the weaknesses of tumors and improve clinical treatment outcomes. The causes of metabolic heterogeneity are multifaceted. Metabolic dysregulation in HCC encompasses systemic perturbations influenced by a multitude of factors, including age, gender, dietary patterns, circadian rhythms, and others. The complex interplay of genetic, epigenetic, microbiome, lifestyle, dietary, and environmental factors contributes to metabolic heterogeneity102,103. In future studies, metabolic heterogeneity will encourage in-depth exploration from a more comprehensive perspective.
In summary, the integration of tumor metabolism and multi-omics holds immense promise for advancing risk stratification management, treatment, and prognostic monitoring of HCC patients. However, several challenges and deficiencies remain to be addressed. Specifically, the acquisition and analysis of certain omics data are constrained by factors such as sample quality, data accuracy, and reproducibility. Although multi-omics data offers more comprehensive information, its accuracy in prognostic monitoring still requires enhancement. To overcome these limitations, our future research will prioritize the optimization of metabolic therapy, ensuring a more precise targeting of tumor cells while minimizing toxicity to normal tissues. Additionally, there will be a focus on deepening the integration of multi-omics data, leveraging advanced algorithms and tools to improve data processing accuracy and efficiency. By establishing comprehensive databases containing large samples and diverse omics data, we aim to provide richer resources for tumor stratification, treatment, and prognostic monitoring. In conclusion, the integration of tumor metabolism and multi-omics represents a significant step forward in the management of HCC patients. However, continued research and innovation are essential to overcome current challenges and deficiencies, ensuring that we can provide the most precise and effective treatment for these patients.

Conclusions

Conclusions
On the whole, these observations above highlight distinct discrepancies in metabolic and HCC-related signaling pathways across three HCC subtypes. These differences were thought to result from high tumor heterogeneity among tumors and appeared to be a factor influencing factor in determining the prognosis. In a comprehensive multi-omics analysis, we identified several potential subtype-specific therapeutic targets involved in regulating various metabolic networks. Revealing the mechanistic differences between HCC subtypes and identifying subtype-specific drug targets should accelerate efficient treatment strategies and precision medicine.

Supplementary Information

Supplementary Information

출처: PubMed Central (JATS). 라이선스는 원 publisher 정책을 따릅니다 — 인용 시 원문을 표기해 주세요.

🏷️ 같은 키워드 · 무료전문 — 이 논문 MeSH/keyword 기반

🟢 PMC 전문 열기