Identification of race-associated metabolite biomarkers for hepatocellular carcinoma in patients with liver cirrhosis and hepatitis C virus infection
Identification of race-associated metabolite biomarkers for hepatocellular carcinoma in patients with liver cirrhosis and hepatitis C virus infection
Cristina Di Poto 0 1 2 3
Shisi He 0 1 2 3
Rency S. Varghese 0 1 2 3
Yi Zhao 0 1 3
Alessia Ferrarini 0 1 2 3
Shan Su 0 1 2 3
Abdullah Karabala 0 1 3
Mesfin Redi 0 1 3 5
Hassen Mamo 0 1 3
Amol S. Rangnekar 0 1 3
Thomas M. Fishbein 0 1 3
Alexander H. Kroemer 0 1 3
Mahlet G. Tadesse 0 1 3
Rabindra Roy 0 1 2 3
Zaki A. Sherif 0 1 3 4
Deepak Kumar 0 1 3
Habtom W. Ressom 0 1 2 3
☯ These authors contributed equally to this work. 0 1 3
0 1 3
0 Funding: Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health (https://
1 Data Availability Statement: The data obtained in this study is accessible at the NIH Common Fund's Data Repository and Coordinating Center (supported by NIH grant, U01-DK097430) website , the Metabolomics Workbench (
2 Department of Oncology, Lombardi Comprehensive Cancer Center, Georgetown University Medical Center, Washington DC, United States of America, 2 Department of Biostatistics, Johns Hopkins University Bloomberg School of Public Health , Baltimore , Maryland, United States of America, 3 MedStar Georgetown University Hospital and Georgetown University Medical Center , Washington, DC , United States of America
3 Editor: Anand S Mehta, Drexel University College of Medicine , UNITED STATES
4 Department of Biochemistry & Molecular Biology, College of Medicine, Howard University , Washington DC , United States of America, 8 Julius L. Chambers Biomedical/Biotechnology Research Institute, North Carolina Central University , Durham, North Carolina , United States of America
5 Department of Chemistry, Addis Ababa University , Addis Ababa , Ethiopia , 5 Department of Microbial, Cellular and Molecular Biology, Addis Ababa University , Addis Ababa , Ethiopia , 6 Department of Mathematics and Statistics, Georgetown University , Washington DC , United States of America
Disparities in hepatocellular carcinoma (HCC) incidence and survival have been observed between ethnic groups including African-Americans (AA) and European-Americans (EA). The evaluation of the changes in the levels of metabolites in samples stratified by race could provide a snapshot of ethnically diverse disease related pathways and identify reliable biomarkers. In this study, we considered AA and EA to investigate metabolites that may be associated with HCC in a race-specific manner. The levels of 46 metabolites in plasma samples, collected from patients recruited at MedStar Georgetown University Hospital, were analyzed by Agilent GC-qMS in selected ion monitoring (SIM) mode. A least absolute shrinkage and selection operator (LASSO) regression model was applied to select metabolites with significant changes in HCC vs. cirrhosis in three groups: (1) AA and EA combined; (2) AA separately; and (3) EA separately. In addition, metabolites that distinguish HCC cases from cirrhosis in these three groups were selected by excluding those without HCV infection. The performances of the metabolites selected by LASSO in each group were evaluated through a leave-one-out cross-validation. We identified race-specific metabolites that differentiated HCC cases from cirrhotic controls, yielding better area under the receiver operating characteristics (ROC) curve (AUC) compared to alpha-fetoprotein (AFP), the serological marker widely used for the diagnosis of HCC. This study sheds light on metabolites that could potentially be used as biomarkers for HCC by
grants-funding) under Award Numbers
U01CA185188 and R01CA143420. The funders
had no role in study design, data collection and
analysis, decision to publish, or preparation of the
Competing interests: The authors have declared
that no competing interests exist.
monitoring their levels in high-risk population of cirrhotic patients in a race-specific
Hepatocellular carcinoma (HCC) is the most common type of liver cancer. An estimated
40,710 new cases of liver cancer (including intrahepatic bile duct cancers) will be diagnosed in
the US during 2017, approximately three-fourths of which will be HCC [
]. Most of the HCC
patients are diagnosed at late stage when treatment is no more effective, making HCC the
most lethal type of liver cancer with an overall 5-year survival rate of approximately 15% [
Worldwide, HCC is the fifth most common cancer and the third leading cause of cancer
Persistent infections by HBV or HCV are the main recognized risk factors for HCC [4±8]
with the cancer developing faster once the viral-related cirrhosis is established [
Epidemiological studies and clinical trials have identified additional demographic, clinical,
pharmacological, genetics and life style factors that further affect the likelihood of HCC and can be
used in clinical practice to identify at-risk patients through stratified analysis .
Longitudinal analysis of cancer follow-up data, stratified by race, has shown higher incidence and
mortality rate in African-Americans (AA) affected by HCC [
]. Few reports have addressed
racial differences in survival for patients with HCC. In a study where AA and
European-Americans (EA) diagnosed with HCC were examined during a 10-year period from 1992 through
2001, it was found that AA were 4% to 20% more likely to die of localized HCC after adjusting
for age, sex, and treatment status [
]. Race/ethnicity-specific differences in disease
progression and HCC risk may occur even when the underlying liver disease etiology is the same,
necessitating a more aggressive disease management and monitoring among certain patient
Other than liver imaging, current diagnosis of HCC relies on the measurement of the level
of the serum biomarker, α-fetoprotein (AFP). However, when considering AFP levels in
patients with HCV infection, AFP appears to be insensitive for the diagnosis of HCC in AA. In
a case-control study of 163 HCC patients with HCV infection and 149 control patients with
HCV-related cirrhosis, the sensitivity of AFP for the diagnosis of HCC in AA with HCV
infection was reported to be lower than that of patients of all other ethnic groups combined [
Thus, serological biomarkers that take into account differences due to race are highly desired.
Metabolomics has been broadly used for biomarker discovery for many human diseases
including cancer [
]. Metabolites, end products of intercellular pathways can potentially
serve as indicators of the overall physiological status as well as the response to host and
environmental stimuli [
]. Although, it would be difficult to measure concentrations of all
metabolites in a biological system by a single analytical method due to their significant chemical
diversity and concentration range, the recognition of cancer metabolism (other than somatic
mutation) as a hallmark of cancer as first identified by Otto Warburg [
], makes the utility of
metabolomics indispensable for the study of cancer biology and hepatocellular carcinogenesis.
Considering the disparities that exist for liver diseases and HCC, the evaluation of the changes
in the levels of metabolites in samples from a homogeneous racial group could lead to the
identification of more reliable race-specific biomarkers than those obtained through
2 / 16
We previously conducted several metabolomic studies aimed at identifying HCC
biomarkers in cirrhotic patients [21±26]. Particularly, in a targeted analysis of metabolites in sera from
two study cohorts (Egyptian and US) by using multiple reaction monitoring (MRM), we
evaluated HCC biomarkers through stratified analysis by race, gender and alcohol cirrhosis. Two
metabolites (3sulfo-glycochenodeoxycholic acid and 3β, 6β-Dihydroxy-5β-cholan-24-oic acid)
were selected based on their significance to both cohorts. While both metabolites
discriminated HCC cases from cirrhotic controls in males and EA, they were insignificant in females
and AA. 3sulfo-glycochenodeoxycholic acid was significant in patients with alcoholic cirrhosis
and 3β, 6β-Dihydroxy-5β-cholan-24-oic acid in non-alcoholic cirrhosis, which may also
include non-alcoholic steatohepatitis (NASH). These analyses revealed that those clinical
covariates are important factors in biomarker discovery.
In this paper, we investigate race-stratified analysis of plasma metabolites measured by gas
chromatography coupled with selected ion monitoring mass spectrometry (GC-SIM-MS). The
plasma samples were collected from patients recruited at MedStar Georgetown University
Hospital (MGUH), Washington, DC. We compared the levels of the metabolites in plasma
samples from AA and EA to identify those that distinguish HCC cases from the cirrhotic
controls in a race-specific manner. A panel of metabolites was selected using least absolute
shrinkage and selection operator (LASSO) logistic regression [
] by considering data from the
following groups: (1) AA and EA combined with and without adjustment for race; (2) AA
only; and (3) EA only. The analyses were repeated by considering only those with HCV
(HCV+). We observed that the combination of LASSO-selected metabolites in each group
leads to better prediction in distinguishing HCC cases from cirrhotic controls, compared to
AFP. In parallel, a multiple support vector machine recursive feature elimination
] model was applied to rank the metabolites in each group, considering only those
that are HCV+. While LASSO is used for feature selection by penalizing some features,
MSVM-RFE ranks the features allowing better prediction performance than LASSO by
controlling the number of selected features [
]. We found that the top metabolites in MSVM-RFE
were also selected in LASSO, so the overlapping ones between the two methods were
individually compared with AFP.
The discovery of exclusive biomarkers in HCC is still a formidable task mainly due to the
heterogeneity of the clinical symptoms of cancer and the various etiologic agents that initiate
the pathological liver disorders such as abnormal structural nodules with peripheral fibrosis,
cirrhosis, chronic inflammation and fatty liver disease that are precursors to end stage liver
disease. To our knowledge, this is the first metabolomic study that seeks to identify race-specific
biomarkers by comparing the levels of plasma metabolites in HCC cases against cirrhotic
patients by stratifying AA and EA. Replication of these findings may contribute to
understanding of the racial disparity in HCC and also in improving diagnosis of HCC through
Materials and methods
Adult patients were recruited from the Hepatology Clinic at MedStar Georgetown University
Hospital (MGUH). All participants provided informed consent to a protocol approved by the
Institutional Review Board (IRB) at Georgetown University. The patients were diagnosed to
have liver cirrhosis on the basis of established clinical, laboratory and/or imaging criteria.
Cases were diagnosed to have HCC based on well-established diagnostic imaging criteria
and/or histology. Clinical stages for HCC cases were determined based on the
tumor-nodemetastasis (TNM) staging system. Controls were required to be HCC free for at least 6 months
from the time of study entry. Race information was collected from patients' self-report. The
3 / 16
Fisher exact test was used for categorical variables. Wilcoxon rank sum test was used for continuous variables that do not have a symmetric distribution. T-test was
used for continuous variables with symmetric distribution.
characteristics of AA and EA, selected from these patients, are summarized in Table 1, whereas
the characteristics of AA and EA that are HCV+ are shown in Table 2. The HCV+ participants
were predominantly genotype 1a and 1b with no statistically significant difference between
AA and EA.
The levels of 46 metabolites in plasma samples from AA and EA in Table 1, and additional
15 subjects (Asian, Hispanic/Latino, Other), were analyzed by GC-SIM-MS. Details on sample
collection, metabolite extraction, GC-SIM-MS data acquisition, pre-processing, and
metabolite ID verification, can be found in Di Poto et al. [
]. In the following sections, we summarize
the stratified analysis we performed to identify race-associated biomarkers for HCC.
Metabolite selection by LASSO. The metabolite levels corresponding to AA and EA were
retrieved from the GC-SIM-MS data previously reported [
]. A LASSO regression model was
applied to select a set of metabolites for each of three groups: (1) AA and EA combined (with
Fisher exact test was used for categorical variables. Wilcoxon rank sum test was used for continuous variables that do not have a symmetric distribution. T-test was
used for continuous variables with symmetric distribution.
4 / 16
and without adjustment for race); (2) AA only; and (3) EA only, based on their association
with HCC or cirrhotic disease status. These analyses were repeated by considering HCV+ AA
and EA. For LASSO models, the tuning parameter λ was chosen by cross-validation using the
R function cv.glmnet. After estimating the coefficient β in (I), metabolites could be selected for
each cohort [
b0 xiT b log 1 eb0xiTb
Yi is the status of disease, xi is the matrix of metabolites for each patient and N is the number of
Metabolite ranking by MSVM-RFE. We used multiple support vector machine recursive
feature elimination (MSVM-RFE) model to rank all metabolites. MSVM-RFE computed
feature-ranking score of multiple linear SVMs and reported the average ranking for each
]. Area under the curve (AUC) of the receiver operating characteristics (ROC) curve
was used to decide the cut-off of top metabolites. The individual metabolite, overlapping in the
two methods of LASSO and MSVM-RFE, was compared with AFP in terms of fold change
(median values), p value (univariate t-test), AUC (95%CI).
Performance evaluation of predictors. Logistic regression models were built to evaluate
the performance of the predictors in HCV+ patients. Leave-one-out cross validation was used
to calculate AUC for the ROC curve in logistic regression using a set of metabolites selected in
two different ways: (i) selected metabolites from LASSO; and (ii) top-ranked metabolites in
MSVM-RFE. The AUC for the ROC curve was then calculated based on testing data in the
cross validation within each model. By R package pROC [
], the confidence interval of AUC
with Delong`s method, sensitivity and specificity with bootstrap method at 0.5 threshold were
performed for the three groups (AA and EA with adjustment for race, AA only and EA only),
Performance of AFP
We evaluated the performance of AFP as a classifier for HCC in AA and EA combined, AA
only, and EA only groups, for all subjects (HCV + and HCV-) as well as for HCV+ only
subjects. As shown in Fig 1, with the exception of the p-value for AA and EA combined (Panel A),
AFP was unable to distinguish HCC from cirrhosis in AA (Panel B and B1) or EA (Panel C
and C1), emphasizing the need for more potent HCC markers particularly for subjects with
Metabolite selection by LASSO
LASSO regression conducted on AA and EA combined (adjusted or not for race), AA only,
and EA only groups, selected a combination of metabolites whose expression levels jointly
differentiate HCC cases from cirrhotic controls. The list of metabolites selected in each of the
above three groups and considering those with or without HCV infection, is provided in S1
Table. In the comparison between the AA and EA with and without HCV,
alpha-D-glucosamine 1-phosphate, palmitic acid, putrescine, tagatose, tyrosine and urea were selected for AA,
and of those metabolites only alpha-D-glucosamine 1-phosphate and tyrosine were also
selected in the AA and EA combined group when adjusted for race. For EA, glycine, glyceric
acid, isoleucine, linoleic acid, oxalic acid, serine, sorbose and threitol were selected; of those,
all, except threitol, were selected also in AA and EA combined when adjusted by race. While
considering only the HCV+ subjects, palmitic acid, putrescine and tagatose were selected
5 / 16
Fig 1. Performance of AFP as a classifier for HCC. The panels show dot plots (circle for liver cirrhotic, diamond for HCC; the horizontal line
represents median) and ROC curve (AUC; 95% CI) for AFP for AA and EA combined, AA only, and EA only groups. The left panels (A, B, C)
correspond to HCV +/- and the right panels (A1, B1, C1) to HCV+ only.
again while ethanolamine and valine were added for AA and of those putrescine and valine
were selected in the AA and EA combined group when adjusted for race. For EA, glycine,
glyceric acid and threitol were selected again while lactulose, lauric acid, sorbose and tyrosine
were added; of those, all metabolites, except lactulose and sorbose, were selected also in the AA
and EA combined group when adjusted by race.
Metabolite ranking by MSVM-RFE
MSVM-RFE was used to rank all metabolites. Due to the high percentage of HCV+ in AA, this
analysis was conducted for subjects that are HCV+ only. The complete list of the ranked
metabolites is provided in S2 Table. The plot of AUC values for the ROC curves using the
ranked metabolites, from 1 to 10, from MSVM-RFE are shown in Fig 2. The individual AUC
values are provided in the S3 Table. As shown in Fig 2, AUC > 0.9 is achieved with n = 5
metabolites in a panel for all three groups (AA and EA combined, AA only and EA only). The
model shows better performance with metabolites selected for AA and EA separately instead
of combining them. Specifically, AUC = 0.917 for AA and EA combined, AUC = 1 for AA and
AUC = 1 for EA were achieved using MSVM-RFE. The five metabolites selected by
MSVMRFE were also selected by LASSO in all three groups. The metabolites overlapping in the two
methods (LASSO and MSVM-RFE) were individually compared against AFP via AUC
(Table 3). These metabolites include amino acids and their derivatives (valine, ethanolamine,
glutamic acid, phenylalanine, alpha-D-glucosamine 1-phospate, glycine), fatty acids (lauric
6 / 16
Fig 2. Plot of the AUC values for the ROC curves corresponding to selected top n metabolites (n = 1,. . .,10), using
MSVM-RFE, for AA and EA combined, AA only, and EA only.
acid, glyceric acid, linoleic acid) and a vitamin (alpha tocopherol). Among these metabolites,
those found to be significant by univariate t-test in the comparison between AA and EA within
HCC and CIRR groups respectively are indicated. As shown in Table 3, the majority of the
metabolites selected in this study showed better AUC in one or all three groups (AA and EA,
AA, EA) than AFP. The AUC for valine and alpha tocopherol are the highest in AA only group
whereas phenylalanine and glycine in EA. Although AFP tends to have high fold change, it has
large variability as shown in Tables 1 and 2. This variability has impacted AFP's statistical
significance and AUC compared to the metabolites listed in Table 3.
Performance evaluation of predictors
Leave-one-out cross-validation examined the prediction performance based on these selected
metabolites in LASSO and the top-ranked metabolites in MSVM-RFE model. By performing
LASSO, 20 metabolites were selected for the AA and EA combined group with adjustment of
race factor, 11 for AA and 13 for EA. The top 5 metabolites in all three groups from
MSVMRFE were also selected and evaluated. Table 4 presents the results based on the prediction of
the left out sample in the leave-one-out cross-validation where the remaining N-1 samples
from the training set were used to estimate the logistic regression coefficients. The
performance of the selected metabolites in distinguishing HCC cases from cirrhotic was evaluated
using AUC (95% CI), sensitivity and specificity, respectively. As shown in Table 4, the panel of
metabolites selected by LASSO, performed well particularly for EA. These metabolites include
alpha tocopherol, alpha-D-glucosamine 1-phosphate, glutamic acid, glyceric acid, glycine,
lactulose, lauric acid, linoleic acid, oxalic acid, phenylalanine, sorbose, threitol, and tyrosine. The
top five metabolites selected by MSVM-RFE, valine glutamic acid, linoleic acid, ethanolamine,
and alpha tocopherol, demonstrated better prediction for AA.
7 / 16
selected & top 5
Selected metabolites and AFP along with their fold change (FC), p-value (univariate t-test), and AUC are listed. Fold changes are calculated as HCC vs. cirrhosis.
Upward arrows indicate metabolites with increased level in HCC vs. cirrhosis (positive FC). Down-ward arrows indicate metabolites with increased level in cirrhosis vs.
HCC (negative FC). Metabolites with a small fold change (- 1.10 FC + 1.10) are reported without arrow. AUC values 0.75 are highlighted in bold.
p<0.05 by univariate t-test in the comparison between AA and EA within HCC group.
p<0.05 by univariate t-test in the comparison between AA and EA within CIRR group.
Among the metabolites selected by LASSO regression, those that were top ranked by
MSVM-RFE and showed a consistent fold change between HCC cases from the cirrhotic
controls across the three groups (AA and EA combined, AA only, and EA only) are: alpha
tocopherol, selected in all three groups; whereas valine for AA and glycine for EA. Further biological
investigation will focus on them. Fig 3 depicts the individual dot plots for alpha tocopherol
(Fig 3A), valine (Fig 3B), and glycine (Fig 3C) in each group (AA and EA combined, AA only,
EA only) respectively, showing the changes of the metabolites level from cirrhosis to HCC
groups. Furthermore, we looked into the staging for HCC subjects and depicted the
correspondent dot plots from cirrhosis, HCC stage I to HCC stage II. These results are available in
8 / 16
Fig 3. Individual dot plots for alpha tocopherol, valine, and glycine in each group. The individual dot plot, for
alpha tocopherol, valine and glycine in AA and EA combined, AA, and EA groups are shown in Fig 3A, 3B, 3C
respectively (blue circle dots for liver cirrhotic, red diamond dots for HCC; the horizontal line represents the median
S1 Fig. In Fig 4 the individual ROC curves for alpha tocopherol (Fig 4A), valine (Fig 4B) and
glycine (Fig 4C) and the ones in combination with AFP are shown in each group (AA and EA
combined, AA only, EA only), respectively. Candidate metabolites selected in each group
(alpha tocopherol for AA and EA combined, valine for AA and glycine for EA) show better
9 / 16
Fig 4. Individual and combined with AFP ROC curves for alpha tocopherol, valine and glycine in each group. ROC curves for alpha tocopherol,
valine and glycine in AA and EA combined, AA, and EA groups are shown in Fig 4A, 4B and 4C respectively (black square dot line for AFP, red triangle
dot line for alpha tocopherol, valine and glycine and blue circle dot line for the combination of AFP and the corresponding metabolite).
performance than AFP only, and the combination with AFP, in distinguishing HCC cases
from cirrhotic controls.
To investigate to what degree these metabolites fluctuate in a healthy population, we
analyzed sera from the patients presented in Table 1 along with sera from healthy volunteers
recruited at MGUH and Howard University Hospital. The levels of valine and glycine were
confirmed to have changed significantly in sera from EA vs. AA HCC and cirrhotic patients
(p-value = 0.02 for glycine and p-value = 0.009 for valine), whereas alpha-tocopherol had a
pvalue of 0.08 in EA vs. AA HCC patients. On the other hand, as anticipated, the changes in the
levels of these metabolites were statistically insignificant in sera from EA vs. AA healthy
volunteers (p-value = 0.65 for alpha-tocopherol, p-value = 0.85 for valine, p-value = 0.35 for glycine).
Furthermore, we observed that the variability of the metabolite levels in sera from the healthy
volunteers was far less than the variability in sera from cirrhotic and HCC patients, when the
healthy and patient groups were frequency-matched by age and gender. This demonstrates
that the metabolite levels tend to have less fluctuation in the healthy group compared to the
Discussion and conclusion
The use of metabolomics to identify potential biomarkers of HCC is greatly advantageous to
patients and healthcare providers because the dysregulation of metabolites may be an early
10 / 16
indication of dysfunctional metabolic pathways that could offer valuable insight into the
mechanism of HCC initiation, development or progression. In this study, we investigated plasma
metabolites that may be associated with HCC in a race-specific manner by considering AA
and EA from a cohort that we previously examined [
]. The levels of selected plasma
metabolites were measured by GC-SIM-MS. LASSO regression was conducted to select
HCC-associated metabolites in a stratified analysis of AA and EA combined, adjusted or not for race, AA
only, and EA only, with or without HCV infection. MSVM-RFE was used to rank the
metabolites based on their ability to distinguish HCC cases from cirrhotic controls. The metabolites
overlapping between the ones selected by LASSO and the top five ranked by MSVM-RFE were
taken into consideration. Several metabolites including alpha tocopherol for AA and EA
combined, valine for AA only, and glycine for EA only exhibited better performance than AFP.
AFP has been extensively used as a biomarker for HCC. However, its performance in HCC
surveillance has been generally low [
]. In adults, the human AFP gene is silenced by
methylation processes and its glycoprotein product reappears only in instances of hepatic damage or
tissue regeneration as well as in solid tumors, which makes AFP a non-specific marker for
HCC diagnosis [
]. We evaluated the performance of AFP as a classifier (Fig 1). Although
in the AA and EA combined group AFP is statistically significant (p-value = 0.0269), its
performance declines when evaluated in AA only and EA only groups. Gupta et al.  examined
the performance of AFP as a marker for HCC patients with HCV over a period of 30 years. By
considering the most commonly reported cut-off for AFP (200 μg/L), they reported AFP's
sensitivity to range from 20% to 45% and specificity from 99% to 100%. The authors stated that
AFP appears to have limited utility in identifying HCC in patients with HCV. However, they
underlined the limitations of the studies, attesting the need for a prospective study designed to
limit bias and define whether a screening strategy can provide clinically important benefits.
We found statistically significant changes in alpha-tocopherol in HCC cases versus cirrhotic
controls similar to our previous report [
], which was conducted on an HCV+ Egyptian
cohort. Alpha-tocopherol is the most prevalent form of vitamin E that is readily absorbed in
mammalian tissues. Many studies including trials have reported the beneficial effects of
vitamin E supplementation and higher serum levels of alpha-tocopherol and retinol contributing
to increased antioxidant function and reduced risk of liver cancer [
]. This enhanced
benefit was also observed in trials involving non-diabetic individuals with NASH (non-alcoholic
steatohepatitis)  and even became the basis for recent clinical recommendations from the
American Association for the Study of Liver Diseases (AASLD) [
]. However, data from
recent clinical trials have shown less benefit for alpha-tocopherol in preventing cancer [40±42]
or having a prophylactic effect on hepatocarcinogenesis in patients with liver cirrhosis or a
chronic HCV infection [
]. The anti-cancer action of this fat-soluble type of vitamin E
could be related to its similar uptake and absorption to the dietary cholesterol, preventing the
uptake and delivery of the excess of cholesterol to tissues . On the other hand, the
abnormal LDL cholesterol metabolism has shown an association with many forms of cancer [46±
49]. Therefore, although the variability in the type of cancer, stage of disease and method of
treatment in these clinical reports, studying the effect of alpha-tocopherol [
] may be a reason
for these controversial results, studies demonstrating the anti-cancer actions of vitamin E
should not be disregarded [
As candidate biomarkers selected from the analysis based on the larger cohort [
reported down- and up-regulation of glycine and valine, respectively, in HCC cases versus
cirrhotic controls. In this study, we found that glycine, which is consistently downregulated in
HCC cases, is particularly specific to EA whereas valine, consistently upregulated in HCC, is
specific to AA. A study conducted on Egyptian subjects, with multivariate statistical analysis of
H-NMR spectroscopy data, generated from urine specimen, revealed down-regulation of
11 / 16
glycine in HCC cases when compared to cirrhosis subjects [
]. Although these results are not
consistent across various studies, as extensively discussed in ref. [
], glycine shows
consistency in differentiating patients with tumors. This nonessential amino acid is involved in
several synthetic reactions, including protein synthesis, and is also a key component of a central
methylation reaction within cells. Its involvement in tumorigenesis could be related to the
aberrant DNA methylation mechanism [
]. In contrast, valine, which is an essential amino
acid, was shown to occur at higher levels in HCC cases versus cirrhotic controls, and exhibited
similar pattern in our previous GC-MS-based study conducted on an HCV + Egyptian cohort
]. Valine is one of the Branched-chain amino acids (BCAAs). BCAAs levels are carefully
regulated by an enzymatic system that quickly responds to conditions of excess or deficiency,
playing a crucial role in cancer development [
]. A recent study in hematopoietic stem cell
renewal demonstrated that valine plays a crucial role in the creation of blood stem cells and its
deficiency or absence in the diet of leukemic mice led to the starvation and death of blood
cancer cells [
]. This suggests that the observed increase in the level of valine in the HCC cases
may be, in part, due to the steady requirement of this amino acid by the cancer cells for growth
and proliferation. Furthermore, not only are metabolites the end products of gene or protein
expression, but they are also a manifestation of the relationship that exists between the genome
and the internal cellular environment. In that vein, metabolites can be the cause or the result
of carcinogenesis in tumor cells such as HCC. Although further studies are warranted for the
functional characterization of the metabolic environment and the determination of the
relationship between metabolite changes and stage / histologic tumor grade of HCC, identifying
critical indicators such as valine may be a significant diagnostic method for the early
identification of the disease in clinical practice.
Alpha-tocopherol, glycine and valine were previously reported in HCC related
metabolomics studies, conducted by our group and others using additional human specimen,
complementary metabolomics platforms and multivariate analysis. In addition, validation of the
genetic ancestry of the participants should be performed using a panel of ancestry informative
markers. We also acknowledge the possibility of confounding variables other than HCV (such
as NASH) for the development of HCC in our study patients.
In summary, we have demonstrated the inability of AFP in discriminating HCC cases from
cirrhotic controls, particularly when its performance is evaluated in HCV+ race specific
groups. Among the metabolites selected by LASSO, glycine and valine showed better
performance than AFP in EA and AA, respectively. Following further validation in a large cohort of
patients and healthy controls matched by their demographic characteristics, the metabolites
discovered in this study could contribute to better understanding of the development of HCC
and allow early detection of HCC in patients with liver cirrhosis and HCV in a race-specific
S1 Table. List of LASSO selected metabolites in each race group and viral infection.
S2 Table. List of the ranked GC-SIM-MS targeted metabolites by MSVM-RFE in HCV+.
S3 Table. List of individual AUC values for the ROC curves using the ranked metabolites, from 1 to 10, by MSVM-RFE.
12 / 16
S1 Fig. Individual dot plots for alpha tocopherol, valine, and glycine in each group. The
individual dot plot, for alpha tocopherol, valine and glycine in AA and EA combined, AA, and
EA groups are shown in S1A, S1B, S1C Fig respectively (blue circle dots for liver cirrhotic, red
diamond dots for HCC; the horizontal line represents the median level). The changes of the
metabolites level are shown from cirrhosis, HCC stage I to HCC stage II.
The authors wish to thank Catherine Lopez from MedStar Georgetown University Hospital
and Georgetown University Medical Center for her help in the recruitment of the subjects
involved in the study and Dr. Md Islam from the Genomics and Epigenomics Shared Resource
for his help in sample processing. Research reported in this publication was supported by
the National Cancer Institute of the National Institutes of Health under Award Numbers
U01CA185188 and R01CA143420.
Conceptualization: Habtom W. Ressom.
Data curation: Cristina Di Poto, Shisi He, Rency S. Varghese, Yi Zhao, Alessia Ferrarini.
Formal analysis: Cristina Di Poto, Shisi He, Yi Zhao, Mahlet G. Tadesse.
Methodology: Cristina Di Poto, Alessia Ferrarini.
Project administration: Cristina Di Poto.
Resources: Abdullah Karabala, Mesfin Redi, Hassen Mamo, Amol S. Rangnekar, Thomas M.
Fishbein, Alexander H. Kroemer.
Supervision: Cristina Di Poto.
Validation: Cristina Di Poto.
Visualization: Rency S. Varghese.
Writing ± original draft: Cristina Di Poto, Shisi He.
Writing ± review & editing: Cristina Di Poto, Alessia Ferrarini, Shan Su, Mahlet G. Tadesse,
Rabindra Roy, Zaki A. Sherif, Deepak Kumar, Habtom W. Ressom.
13 / 16
14 / 16
15 / 16
1. Siegel RL , Miller KD , Jemal A . Cancer statistics, 2017 . CA: A Cancer Journal for Clinicians . 2017 ; 67 ( 1 ):7± 30 .
2. Mittal S , Kanwal F , Ying J , Chung R , Sada YH , Temple S , et al. Effectiveness of surveillance for hepatocellular carcinoma in clinical practice: A United States cohort . J Hepatol . 2016 ; 65 ( 6 ): 1148 ± 54 . https:// doi.org/10.1016/j.jhep. 2016 . 07 .025 PMID: 27476765
3. Torre LA , Bray F , Siegel RL , Ferlay J , Lortet-Tieulent J , Jemal A . Global cancer statistics, 2012 . CA: a cancer journal for clinicians . 2015 ; 65 ( 2 ): 87 ± 108 .
4. Korba B , Shetty K , Medvedev A , Viswanathan P , Varghese R , Zhou B , et al. Hepatitis C virus Genotype 1a core gene nucleotide patterns associated with hepatocellular carcinoma risk . J Gen Virol . 2015 ; 96 ( 9 ): 2928 ± 37 . https://doi.org/10.1099/jgv.0.000219 PMID: 26296571
5. Mohd Hanafiah K , Groeger J , Flaxman AD , Wiersma ST . Global epidemiology of hepatitis C virus infection: New estimates of age-specific antibody to HCV seroprevalence . Hepatology . 2013 ; 57 ( 4 ): 1333 ± 42 . https://doi.org/10.1002/hep.26141 PMID: 23172780
6. Hatzakis A , Wait S , Bruix J , Buti M , Carballo M , Cavaleri M , et al. The state of hepatitis B and C in Europe: report from the hepatitis B and C summit conference* . J Viral Hepat . 2011 ; 18 ( s1 ): 1 ± 16 .
7. Kim W , Loomba R , Berg T , Aguilar Schall RE , Yee LJ , Dinh PV , et al. Impact of long-term tenofovir disoproxil fumarate on incidence of hepatocellular carcinoma in patients with chronic hepatitis B . Cancer . 2015 ; 121 ( 20 ): 3631 ±8. https://doi.org/10.1002/cncr.29537 PMID: 26177866
8. Marcellin P. Hepatitis B and hepatitis C in 2009 . Liver International. 2009 ; 29 ( s1 ): 1 ± 8 .
9. Tansel A , Katz LH , El-Serag HB , Thrift AP , Parepally M , Shakhatreh MH , et al. Incidence and Determinants of Hepatocellular Carcinoma in Autoimmune Hepatitis: A Systematic Review and Meta-analysis . Clin Gastroenterol Hepatol . 2017 Feb 12 .
10. Ko C , Park W , Park S , Kim S , Windisch MP , Ryu W. The FDA approved drug irbesartan inhibits HBVinfection in HepG2 cells stably expressing sodium taurocholate co-transporting polypeptide . Antivir Ther (Lond) . 2015 ; 20 ( 8 ): 835 ± 42 .
11. Singal AG , El-Serag HB . Hepatocellular Carcinoma from Epidemiology to Prevention: Translating Knowledge into Practice . Clinical Gastroenterology and Hepatology . 2015 .
12. Alcaraz K , Bertaut T , Fedewa S , Gansler T , Goding Sauer A , McMahon C , et al. American Cancer Society. Cancer Facts & Figures for African Americans 2016 ± 2018 . American Cancer Society; 2016 .
13. Zeng C , Wen W , Morgans AK , Pao W , Shu X , Zheng W. Disparities by race, age, and sex in the improvement of survival for major cancers: results from the National Cancer Institute Surveillance, Epidemiology, and End Results (SEER) Program in the United States, 1990 to 2010 . JAMA oncology. 2015 ; 1 ( 1 ): 88 ± 96 . https://doi.org/10.1001/jamaoncol. 2014 .161 PMID: 26182310
14. Sloane D , Chen H , Howell C . Racial disparity in primary hepatocellular carcinoma: tumor stage at presentation, surgical treatment and survival . J Natl Med Assoc . 2006 Dec; 98 ( 12 ): 1934 ± 9 . PMID: 17225837
15. Yu L , Sloane DA , Guo C , Howell CD . Risk factors for primary hepatocellular carcinoma in black and white Americans in 2000 . Clinical Gastroenterology and Hepatology . 2006 ; 4 ( 3 ): 355 ± 60 . https://doi.org/ 10.1016/j.cgh. 2005 . 12 .022 PMID: 16527700
16. Ha J , Yan M , Aguilar M , Bhuket T , Tana MM , Liu B , et al. Race/ethnicity-specific disparities in cancer incidence, burden of disease, and overall survival among patients with hepatocellular carcinoma in the United States . Cancer . 2016 ; 122 ( 16 ): 2512 ± 23 . https://doi.org/10.1002/cncr.30103 PMID: 27195481
17. Nguyen MH , Garcia RT , Simpson PW , Wright TL , Keeffe EB . Racial differences in effectiveness of alpha-fetoprotein for diagnosis of hepatocellular carcinoma in hepatitis C virus cirrhosis . Hepatology . 2002 Aug; 36 ( 2 ): 410 ±7. https://doi.org/10.1053/jhep. 2002 .34744 PMID: 12143050
18. Liesenfeld DB , Habermann N , Owen RW , Scalbert A , Ulrich CM . Review of mass spectrometry-based metabolomics in cancer research . Cancer Epidemiol Biomarkers Prev . 2013 Dec; 22 ( 12 ): 2182 ± 201 . https://doi.org/10.1158/ 1055 - 9965 .EPI- 13 -0584 PMID: 24096148
19. Shen J , Yan L , Liu S , Ambrosone CB , Zhao H . Plasma metabolomic profiles in breast cancer patients and healthy controls: by race and tumor receptor subtypes . Translational oncology . 2013 ; 6 ( 6 ): 757 ± 65 . PMID: 24466379
20. Hanahan D , Weinberg RA . Hallmarks of cancer: the next generation . Cell . 2011 ; 144 ( 5 ): 646 ± 74 . https:// doi.org/10.1016/j.cell. 2011 . 02 .013 PMID: 21376230
21. Ressom HW , Xiao JF , Tuli L , Varghese RS , Zhou B , Tsai TH , et al. Utilization of metabolomics to identify serum biomarkers for hepatocellular carcinoma in patients with liver cirrhosis . Anal Chim Acta. 2012 Sep 19 ; 743 : 90 ± 100 . https://doi.org/10.1016/j.aca. 2012 . 07 .013 PMID: 22882828
22. Xiao JF , Varghese RS , Zhou B , Nezami Ranjbar MR , Zhao Y , Tsai TH , et al. LC-MS based serum metabolomics for identification of hepatocellular carcinoma biomarkers in Egyptian cohort . J Proteome Res. 2012 Dec 7 ; 11 ( 12 ): 5914 ± 23 . https://doi.org/10.1021/pr300673x PMID: 23078175
23. Xiao JF , Zhao Y , Varghese RS , Zhou B , Di Poto C , Zhang L , et al. Evaluation of metabolite biomarkers for hepatocellular carcinoma through stratified analysis by gender, race, and alcoholic cirrhosis . Cancer Epidemiol Biomarkers Prev . 2014 Jan; 23 ( 1 ): 64 ± 72 . https://doi.org/10.1158/ 1055 - 9965 .EPI- 13 -0327 PMID: 24186894
24. Nezami Ranjbar MR , Luo Y , Di Poto C , Varghese RS , Ferrarini A , Zhang C , et al. GC-MS based plasma metabolomics for identification of candidate biomarkers for hepatocellular carcinoma in Egyptian cohort . PloS one . 2015 ; 10 ( 6 ):e0127299. https://doi.org/10.1371/journal.pone. 0127299 PMID: 26030804
25. Ferrarini A , Di Poto C , Varghese RS , Nezami Ranjbar MR , Alonso DE , Binkley J , et al. A multi-platform approach for metabolomic analysis of human liver tissues . Chromatography Today . 2015 :Aug/Sep, 2015 .
26. Di Poto C , Ferrarini A , Zhao Y , Varghese RS , Tu C , Zuo Y , et al. Metabolomic Characterization of Hepatocellular Carcinoma in Patients with Liver Cirrhosis for Biomarker Discovery . Cancer Epidemiol Biomarkers Prev . 2016 Dec 2 .
27. Tibshirani R . Regression shrinkage and selection via the lasso . Journal of the Royal Statistical Society. Series B (Methodological) . 1996 : 267 ± 88 .
28. Duan K , Rajapakse JC , Wang H , Azuaje F . Multiple SVM-RFE for gene selection in cancer classification with expression data . IEEE transactions on nanobioscience . 2005 ; 4 ( 3 ): 228 ± 34 . PMID: 16220686
29. Wei P , Hu Q , Ma P , Su X . Robust feature selection based on regularized brownboost loss . KnowledgeBased Syst . 2013 ; 54 : 180 ± 98 .
30. Hastie T , Qian J . Glmnet vignette. 2014 .
31. Robin X , Turck N , Hainard A , Tiberti N , Lisacek F , Sanchez J , et al. pROC: an open-source package for R and S to analyze and compare ROC curves . BMC Bioinformatics . 2011 ; 12 ( 1 ): 77 .
32. White DL , Richardson P , Tayoub N , Davila JA , Kanwal F , El-Serag HB . The Updated Model: An Adjusted Serum Alpha-Fetoprotein-Based Algorithm for Hepatocellular Carcinoma Detection With Hepatitis C Virus-Related Cirrhosis . Gastroenterology. 2015 Dec; 149 ( 7 ): 1986 ± 7 . https://doi.org/10.1053/j. gastro. 2015 . 10 .004 PMID: 26519622
33. Kuo MT , Iyer B , Wu JR , Lapeyre JN , Becker FF . Methylation of the alpha-fetoprotein gene in productive and nonproductive rat hepatocellular carcinomas . Cancer Res . 1984 Apr; 44 ( 4 ): 1642 ± 7 . PMID: 6200217
34. Mizejewski G . Alpha-fetoprotein (AFP) and inflammation: is AFP an acute and/or chronic phase reactant ? Journal of Hematology & Thromboembolic Diseases . 2015 .
35. Gupta S , Bent S , Kohlwes J . Test characteristics of alpha-fetoprotein for detecting hepatocellular carcinoma in patients with hepatitis C. A systematic review and critical analysis . Ann Intern Med . 2003 Jul 1 ; 139 ( 1 ): 46 ± 50 . PMID: 12834318
36. Ito Y , Suzuki K , Ishii J , Hishida H , Tamakoshi A , Hamajima N , et al. A population-based follow-up study on mortality from cancer or cardiovascular disease and serum carotenoids, retinol and tocopherols in Japanese inhabitants . Asian Pacific Journal of Cancer Prevention . 2006 ; 7 ( 4 ): 533 . PMID: 17250424
37. Zhang W , Shu X , Li H , Yang G , Cai H , Ji B , et al. Vitamin intake and liver cancer risk: a report from two cohort studies in China . J Natl Cancer Inst . 2012 ; 104 ( 15 ): 1174 ± 82 .
38. Sanyal AJ , Chalasani N , Kowdley KV , McCullough A , Diehl AM , Bass NM , et al. Pioglitazone, vitamin E, or placebo for nonalcoholic steatohepatitis . N Engl J Med . 2010 ; 362 ( 18 ): 1675 ± 85 . https://doi.org/10. 1056/NEJMoa0907929 PMID: 20427778
39. Chalasani N , Younossi Z , Lavine JE , Diehl AM , Brunt EM , Cusi K , et al. The diagnosis and management of non-alcoholic fatty liver disease: Practice Guideline by the American Association for the Study of Liver Diseases, American College of Gastroenterology, and the American Gastroenterological Association . Hepatology. 2012 ; 55 ( 6 ): 2005 ±23. https://doi.org/10.1002/hep.25762 PMID: 22488764
40. Taylor PR , Greenwald P. Nutritional interventions in cancer prevention . Journal of clinical oncology . 2005 ; 23 ( 2 ): 333 ± 45 . https://doi.org/10.1200/JCO. 2005 . 06 .190 PMID: 15637396
41. Caraballoso M , Sacristan M , Serra C , Bonfill X . Drugs for preventing lung cancer in healthy people . Cochrane Database Syst Rev . 2003 ; 2 .
42. Ostlund RE Jr. Phytosterols in human nutrition . Annu Rev Nutr . 2002 ; 22 ( 1 ): 533 ± 49 .
43. Bjelakovic G , Nikolova D , Gluud LL , Simonetti RG , Gluud C . Antioxidant supplements for prevention of mortality in healthy participants and patients with various diseases . The Cochrane Library . 2012 .
44. Takagi H , Kakizaki S , Sohara N , Sato K , Tsukioka G , Tago Y , et al. Pilot clinical trial of the use of alphatocopherol for the prevention of hepatocellular carcinoma in patients with liver cirrhosis . Int J Vitam Nutr Res . 2003 Nov; 73 ( 6 ): 411 ±5. https://doi.org/10.1024/ 0300 - 9831 . 73 .6.411 PMID: 14743544
45. Tucker J , Townsend D . Alpha-tocopherol: roles in prevention and therapy of human disease . Biomedicine & pharmacotherapy . 2005 ; 59 ( 7 ): 380 ± 7 .
46. Freeman MR , Solomon KR . Cholesterol and prostate cancer . J Cell Biochem . 2004 ; 91 ( 1 ): 54 ± 69 . https://doi.org/10.1002/jcb.10724 PMID: 14689582
47. Molony T . Use of cholesterol drugs may decrease breast cancer risk . J Dent Hyg . 2003 Fall; 77 ( 4 ): 214 .
48. Michaud DS , Giovannucci E , Willett WC , Colditz GA , Fuchs CS . Dietary meat, dairy products, fat, and cholesterol and pancreatic cancer risk in a prospective study . Am J Epidemiol . 2003 ; 157 ( 12 ): 1115 ± 25 . PMID: 12796048
49. Siemianowicz K , Gminski J , Stajszczyk M , Wojakowski W , Goss M , Machalski M , et al. Serum HDL cholesterol concentration in patients with squamous cell and small cell lung cancer . Int J Mol Med . 2000 ; 6 ( 3 ): 307 ± 18 . PMID: 10934294
50. Shariff MI , Gomaa AI , Cox IJ , Patel M , Williams HR , Crossey MM , et al. Urinary metabolic biomarkers of hepatocellular carcinoma in an Egyptian population: a validation study . Journal of proteome research . 2011 ; 10 ( 4 ): 1828 ±36. https://doi.org/10.1021/pr101096f PMID: 21275434
51. Wahid B , Ali A , Rafique S , Idrees M . New Insights into the Epigenetics of Hepatocellular Carcinoma . Biomed Res Int . 2017 ; 2017 :1609575. https://doi.org/10.1155/ 2017 /1609575 PMID: 28401148
52. O 'Connell TM. The complex role of branched chain amino acids in diabetes and cancer . Metabolites . 2013 ; 3 ( 4 ): 931 ± 45 . https://doi.org/10.3390/metabo3040931 PMID: 24958258
53. Taya Y , Ota Y , Wilkinson AC , Kanazawa A , Watarai H , Kasai M , et al. Depleting dietary valine permits nonmyeloablative mouse hematopoietic stem cell transplantation . Science . 2016 :aag3145.