Back to Journals » Infection and Drug Resistance » Volume 12

Computational analysis of naturally occurring resistance-associated substitutions in genes NS3, NS5A, and NS5B among 86 subtypes of hepatitis C virus worldwide

Authors Wu R, Geng D, Chi X, Wang X, Gao X, Xu H, Shi Y, Guan Y, Wang Y, Jin J, Ding Y, Niu J

Received 6 June 2019

Accepted for publication 22 August 2019

Published 19 September 2019 Volume 2019:12 Pages 2987—3015


Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Dr Joachim Wink

Download Article [PDF] 

Ruihong Wu,1 Dongfeng Geng,2 Xiumei Chi,1 Xiaomei Wang,1 Xiuzhu Gao,1 Hongqin Xu,1 Ying Shi,1 Yazhe Guan,1 Yang Wang,1 Jinglan Jin,1 Yanhua Ding,3 Junqi Niu1

1Department of Hepatology, First Hospital of Jilin University, Changchun, Jilin Province 130021, People’s Republic of China; 2Centre for Reproductive Medicine, Centre for Prenatal Diagnosis, First Hospital of Jilin University, Changchun, Jilin Province 130021, People’s Republic of China; 3Phase I Clinical Research Center, The First Hospital of Jilin University, Changchun, Jilin Province 130021, People’s Republic of China

Correspondence: Yanhua Ding
Phase I Clinical Research Center, The First Hospital of Jilin University, Changchun, Jilin Province 130021, People’s Republic of China
Tel +86 4 318 878 2168
Email [email protected]

Junqi Niu
Department of Hepatology, The First Hospital of Jilin University, Changchun, Jilin Province 130021, People’s Republic of China
Tel +86 4 318 187 5101
Email [email protected]

Background and objective: Direct-acting antivirals (DAA) facing resistance continue to be used in some areas worldwide. Thus, identifying hepatitis C virus (HCV) genotypes/subtypes and loci with certain prevalent resistance-associated substitutions (RASs) deserves attention. We investigated the global and regional frequencies of naturally occurring RASs among all confirmed HCV subtypes (n=86) and explored co-occurring and mutually exclusive RAS pairs within and between genes NS3, NS5A, and NS5B.
Methods: A total of 213,908 HCV sequences available as of July 10, 2019 were retrieved from the NCBI nucleotide database. After curation, 17,312 NS3, 8,478 NS5A, and 25,991 NS5B sequence fragments from DAA-naïve patients were screened for RASs. MEGA 6.0 was used to translate aligned nucleotide sequences into amino acid sequences, and RAS pairs were identified by hypergeometric analysis.
Results: RAS prevalence varied significantly among HCV subtypes. For example, D168E, highly resistanct to all protease inhibitors except voxilaprevir, was nearly absent in all subtypes except in 43.48% of GT5a sequences. RASs in NS3 exhibiting significantly different global distribution included Q80K in GT1a with the highest frequency in North America (54.49%), followed by in Europe (22.66%), Asia (6.98%), Oceania (6.62%), and South America (1.03%). The prevalence of NS3 S122G in GT1b was highest in Asia (26.6%) and lowest in Europe (2.64%). NS5A L28M, R30Q, and Y93H in GT1b, L31M in GT2b, and NS5B C316N in GT1b was most prevalent in Asia. A150V in GT3a, associated with sofosbuvir treatment failure, was most prevalent in Asia (44.09%), followed by Europe (31.19%), Oceania (24.29%), and North America (19.05%). Multiple mutually exclusive or co-occurring RAS pairs were identified, including Q80K+R155K and R155K+D168G in GT1a and L159F+C316N and R30Q (NS5A)+C316N (NS5B) in GT1b.
Conclusion: Our data may be of special relevance for those countries where highly effective antivirals might not be available. Considering the specific RASs prevalence will help the clinicians to make optimal treatment choices. The RASs pairs would benefit anti-HCV drug development.

Keywords: hepatitis C virus, direct-acting antiviral, resistance-associated substitution, subtype


Infection with hepatitis C virus (HCV) is a global public health problem. Between 130 and 170 million people are HCV chronically infected1 and up to 4 million individuals are newly infected with HCV annually.2 Persistent HCV infection induces high risk for developing severe liver diseases, such as liver cirrhosis and hepatocellular carcinoma.2,3 HCV is an enveloped positive-sense single-stranded RNA virus whose replication can be robust; model-based calculations indicate production of 1012 virions/day.4,5 This high-level replication and a lack of viral RNA polymerase proofreading contribute to HCV’s genetic divergence. The virus is currently classified into seven major genotypes and 86 subtypes according to the International Committee on Taxonomy of Viruses (June 2017) ( with ~30% divergence at the genotype level and ~15% divergence at the subtype level.6 HCV clearance, which is associated with reduced rates of de novo hepatocellular carcinoma,7 is strongly dependent on HCV genotype/subtype when induced by interferon-based antiviral therapy with sustained virologic response (SVR) rates of approximately 50–80%.810

HCV therapy has been revolutionized with the advent of direct-acting antivirals (DAA) that directly target HCV gene products, including NS3 protease inhibitors, NS5A inhibitors, nucleos(t)ide inhibitors (NI), and non-nucleoside inhibitors (NNI) of the NS5B RNA-dependent RNA polymerase.11 Generally, DAA based regimens yield highly promising SVR rates (>90%). However, virologic failure still occurs and has been associated with the emergence of HCV variants with resistance-associated substitutions (RASs), which impair drug susceptibility. Notably, RASs can occur naturally in a genotype/subtype-dependent manner before DAA-induced selective pressure occurs.1217 For example, the SVR rate of daclatasvir/asunaprevir was severely attenuated due to baseline RASs (65.4% with RASs vs 94.3% without RASs).18 Moreover, due to the negative effects of RAS Q80K on the efficacy of simeprevir, clinical guidelines recommend pre-treatment screening in patients infected with HCV subtype GT1a.19,20 Thus, assessing the prevalence of naturally occurring RASs in different HCV genotypes/subtypes and determining their global geographic distribution will help optimize the selection of therapeutic regimens.

Several studies have assessed the prevalence of naturally occurring RASs in HCV genes NS3, NS5A, and/or NS5B from DAA-naïve patients but have focused on particular subtypes2125 or have used HCV sequence databases2628 containing a relatively small number of sequences covering very few subtypes. Welzel et al29 performed the largest study to date including 46 subtypes across 5 geographic regions, but the RAS distributions determined in that study were based on clinical trials from regional medical centers in primarily developed countries and thus may not reflect the global HCV RAS landscape.

To address this knowledge gap, the aims of our current study were to (1) investigate the global and regional prevalence of HCV RASs among all confirmed HCV subtypes (n=86) by mining HCV sequences in NCBI nucleotide database and Los Alamos HCV database, (2) explore the RASs pairs showing significant more or less co-occurrence.

Materials and methods

HCV datasets

HCV genomic sequences available as of July 10, 2019 were retrieved from the NCBI nucleotide database ( in GenBank (full) format using the following searching criteria: the title contained the words “hepatitis C virus” and the organism was “hepatitis C virus”. The following information was extracted for each sequence: accession number, times of sampling and publication, HCV genotype/subtype, geographic region, and treatment. If the above-mentioned parameters were not available, we extracted the information from publications linked with the sequences. A total of 213,908 HCV genomic sequences were ultimately retrieved. Only one sequence from any duplicate sets and the sequence obtained from the last visit for patients with multiple visits was retained for further analysis. Sequence exclusion criteria were as follows: (1) no NS3, NS5A, or NS5B fragments present; (2) low quality; (3) from non-human hosts; (4) different clone sequences from the same patient; (5) sequences that encoded non-functional proteins; (6) sequences without any available subtype information or with mixed-genotypes; (7) groups of sequences with ambiguous DAA treatment information (eg, “some DAA-treated patients”) that did not specify which patients/sequences were DAA-treated; and (8) sequences from DAA-treated patients. Sequences were confirmed to be from DAA-treated patients based on the NCBI database description and/or the linked publications. Finally, 17,312 NS3, 8,478 NS5A, and 25,991 NS5B sequences were retained for further analysis.

All nucleotide sequences were aligned using the Los Alamos HCV Database (LANL;, which also provided curated HCV subtype and geographic region information for some sequences. All sequences were aligned against the H77 reference sequence (GenBank accession no. NC_004102). The aligned nucleotide sequences in FASTA format were downloaded and then translated into their corresponding amino acid sequences with MEGA 6.0 software (Center for Evolutionary Medicine and Informatics, Tempe, AZ, USA) and manually checked and edited as necessary. The MEGA 6.0 output table was further analyzed with R (version 2.10.0) to calculate allele frequencies for each RAS. We focused only on the defined genomic regions relevant to drug resistance, including the first 630 amino acids in NS3, the first 100 amino acids in NS5A, and all 591 amino acids in NS5B.


RASs were defined by a combination of substitutions summarized in three review papers,3032 and others recently reported associated with DAA treatment failure and/or conferred a ≥2-fold change in susceptibility compared with a reference strain via in vitro replicon assays.3339

NS3 RASs included V36A/G/L/M, Q41K/R, F43C/I/L/S/V, T54A/S, V55A/I, Y56F/H/N, Q80G/H/K/L/R, S122D/G/N/R/T, S138T, R155C/G/I/K/L/N/Q/S/T/W, A156G/H/K/L/M/S/T/V, V158A, A166T, D168/A/C/E/F/G/H/I/K/L/N/Q/R/S/T/V/Y, I170T/V, and L175M.

NS5A RASs included K24/A/E/G/N/Q/R, S24F/H/T, Q24K/T, T24A/S, K26E, M28A/G/I/K/S/T/V, L28A/F/I/M/S/T/V, F28C/M/S/V,Q30D/E/G/H/I/K/L/N/R/S/T/Y, R30C/E/G/H/K/N/Q/S/T, A30G/H/K/V, L30A/F/G/H/Q/R/S, L31F/I/M/P/V/W, M31F/I/L/V, P32A/L/Q/R/S, S38F, Q54H, H58D/L/N, P58A/D/G/L/R/S/T, T58A/D/G/H/L/N/S, E62D/L, A92K/P/T, C92A/K/N/R/S/T, E92K, Y93C/F/H/I/L/N/R/S/T/W, and T93A/H/I/N/S.

NS5B RASs included A150V, L159F, G188D, K206E, E237G, N244I, S282G/R/T, M289I/L, L314H, C316F/H/N/Y, L320F, V321A/I, S368T, A395G, N411S, M414I/T/V, N444K, C445F, E446K/Q, Y448C/H, C451S, A553T/V, G554S, S556G/N/R, G558R, D559G/N, Y561H, and S565F.

Statistical analysis

Differences in RAS prevalence among geographic regions were determined using Fisher’s exact test. Probabilities (P-values) of observing a pair of RASs together in no fewer or no greater than n sequences by random chance were calculated using the hypergeometric test. Statistical analyses were performed using R (version 2.10.0).

A P value <0.05 was considered to be statistically significant.


Prevalence of naturally occurring NS3 RASs in 86 HCV subtypes

The prevalence of naturally occurring NS3 RASs in different HCV subtypes is shown in Table 1. Majority of NS3 RASs were absent or have very low frequencies (<0.5%), and only several RASs including Q80K, S122G/T/N and D168E were observed in a high rate of sequences in a subtype dependent manner (Figure S1A). The RAS Q80K confers low-level resistance to simeprevir in vitro and is associated with a reduced treatment response in vivo. We found Q80K-positive sequences in 31.74% of HCV subtype GT1a sequences (2277/7178) but only 1.14% of sequences in subtype GT1b (81/7176). This RAS was found frequently in subtypes GT1d (86.67%, 13/15), 5a (100%, 46/46), and 6a (98.28%, 402/409) but was very rare in subtype 3a (0.24%, 2/820). The RAS was also observed in 16.67% (1/6) of subtype 1i sequences. All GT4 and other GT1, 3, 5, 6, 7 subtypes harbored no Q80K-positive sequences. Q80R, which confers resistance to simeprevir/asunaprevir/faldaprevir, was rarely present in G1a (0.49%, 35/7178), G1b (0.3%, 21/7176), 3i (16.67%, 1/6), 4d (1.45%, 1/69), and 6a (0.24%, 1/409). Similarly, R155K, which carries variants associated with resistance to protease inhibitors such as simeprevir, asunaprevir, paritepravir, vaniprevir, and faldaprevir, was rarely present in 0.96% (69/7164) of GT1a, 16.67% (1/6) 1h, and 0.25% (2/805) 3a sequences. A156L/T/V, the only RASs conferred high resistance (>100-fold) to voxilaprevir (a potent pan-genotypic second generation of protease inhibitor), were not detected except in 1a, and 1b with frequencies <0.05%. The RASs at position D168, highly resistant to all protease inhibitors except voxilaprevir, were rare (approximately 1%) in nearly all HCV subtypes, whereas D168E occurred in 43.48% (20/46) of GT5a sequences. RASs at position 122, which confer resistance to simeprevir, asunaprevir and/or voxilaprevir in GT1a and/or GT1b, was highly prevalent in GT5a (122T, 73.9%), GT6a (122N, 76.3%) and GT1b (122G, 9.34%). S122R (confers resistance to simeprevir and asunaprevir) was exclusively detected in GT2 and with GT2 subtype-specific frequencies (i.e., 100% in 2b, 2c, 2d, 2e, 2f, 2i, 2j, 2l, 2m, 2q and 2t but only 1.89% (1/53) in 2a and 0% in 2r and 2u). These present of these RASs may limit the use of some inhibitors for treating the corresponding subtypes. V36L, associated with resistance to asunaprevir, paritepravir, and faldaprevir, was uncommon in GT1a (1.50%) and 1b (0.96%) but more frequent (13.33% to 100%) in five GT1 subtypes including 1d, 1e, 1g, 1i, and 1l. This RAS was also found in almost all GT2, 3, 4, 5, and 7 sequences, as well in several GT6 subtypes. Another asunaprevir/paritepravir/faldaprevir RAS, V36M, was only observed in GT1a (0.48%) and 1b (0.03%). T54S was infrequent in GT1a (3.02%) and 1b (2.01%). The frequency of V170A was extremely low (<0.1%) in GT1a or GT1b but significantly varied among other subtypes with respect to frequency. Lastly, three RASs (Q41R, F43L/S, Y56H) were only found in GT1a or GT1b and at an extremely low prevalence. (<0.1%)

Table 1 Prevalence of naturally occurring NS3 RASs in 86 HCV subtypes

Table 1 (Continued).

Prevalence of naturally occurring NS5A RASs in 86 HCV subtypes

The prevalence of naturally occurring NS5A RASs in different HCV subtypes is shown in Table 2. Similar to NS3, most RASs were absent or have very low frequencies (<0.5%) (Figure S1B). RAS Y93H was associated with reduced NS5A-targeted DAA efficacy, with or without L31M/V/I, in GT1b-infected patients.40 Y93H appeared in sequences of subtypes GT1a (0.41%, 12/2928), 1b (4.25%, 80/1882), 1c (25%,1/4), 1m (50%, 1/2), 3a (1.35%, 14/1114), 4a (3.33%,1/30), 4b (50%,1/2), 4g (33.33%,1/3), 7a (100%,2/2) and 7b (100%,1/1). Other substitutions at this position, such as Y93C/F/N/S, were uncommon in GT1a, 1b, 2a, 3a, and 6a (0.03%-1.92%), but were prevalent in other subtypes, including GT1c, 1g, 1m, 4w, 6e, 6m, 6n, 6o, 6u, 6v, and 6xe (15.38%-100%). L31M, which confers resistance to daclatasvir/omibitasvir/ledipasvir, has been associated with reduced elbasvir/grazoprevir efficacy in patients with HCV-GT1a infection.17 L31M was rare in GT1a (0.65%,19/2928) and 1b (2.63%,49/1865) sequences and absent in GT3a, 5a, and all GT6 subtypes except in one GT6a sequence. In contrast, this RAS was frequently detected (≥50%) in subtypes G1d, 1e, 1l, 1m, 3b, and a majority of GT2 and 4 subtypes. A30K, which is associated with daclatasvir resistance, was only detected in 2.25% of GT3a sequences but was found in nearly 100% of sequences from other GT3 subtypes. The most commonly observed RAS in GT1b was Q54H (26.76%, daclatasvir resistance). RASs L28M (daclatasvir/ombitasvir resistance) and R30Q (daclatasvir resistance) were identified in 2.37% and 4.66% of GT1b sequences, respectively.

Table 2 Prevalence of naturally occurring NS5A RASs in 86 HCV subtypes

Table 2 (Continued).

Prevalence of naturally occurring NS5B NI-specific and NNI-specific RASs in 86 HCV subtypes

The prevalence of naturally occurring NI-specific NS5B RASs and NNI-specific RASs in different HCV subtypes is shown in Table 3. Except for a few RASs with high rates, others have very low rates (Figure S1C). A150V has recently been found to be associated with a reduced response to treatment with sofosbuvir and ribavirin, with or without pegylated interferon in GT3a infected patients.34 A150V is highly prevalent in sequences of GT3a (31.5%, 103/327). L159F was found in 11.19% (297/2655) of GT1b sequences but in only 0.09% (2/2346) of GT1a sequences. S282T, the only known variant conferring sofosbuvir resistance in vitro, rarely appeared in GT1a (0.19%, 10/5182), 1b (0.15%, 11/7440), 2b (0.22%, 1/455), 3a (0.03%, 1/3003) and 4a (0.35%, 3/857).

Table 3 Prevalence of naturally occurring NS5B RASs in 86 HCV subtypes

Table 3 (Continued).

All observed NNI-specific RASs are associated with dasabuvir. C316N was common in sequences of GT1b (43.09%, 3179/7377), GT4f (81.61%, 71/87), 4b (14.29%, 2/14), and 1e (10.17%,6/59). C316H was observed in GT1b (1.19%, 88/7377) and 5–10% of several GT4 subtypes but was more prevalent in GT4r (60.32%, 76/126). The frequency of S556G was higher in GT1b than in GT1a (11.77% vs.0.79%) and was found in 6h (85.71%, 6/7), 6e (5.26%, 1/19), and GT2, 3, 4, 5 and 7 subtypes (nearly 100%). However, this RAS was absent in other GT1 subtypes and GT6 subtypes, although S556N, a closely related variant, was harbored by GT4r (75%, 3/4). S556R was found in GT1a (0.34%) and in several GT6 subtypes (6a, 6e, 6n, 6o, 6p, 6q, 6s, 6t, 6u, 6xc, and 6xf).

Geographical distribution of RASs

Country of origin information was available for approximately 70% of the analyzed sequences. We classified these sequences into Asia, Europe, North America, Central America, South America, Former USSR, Oceania, Africa, Caribbean, or Middle East clusters according to geographic region definitions in the Los Alamos HCV Database. The majority of RASs in most HCV subtypes were similarly distributed among different geographic regions worldwide. NS3 RASs with distinctly variable prevalence by geographic region including Q8OK in GT1a, V36L in GT1a, S122G in GT1b and so on (Figure 1, P<0.05). Q80K in GT1a was mostly prevalent in North America (54.49%, 679/1246), followed by Europe (22.66%, 246/1090), Asia (6.98%, 3/43), Oceania (6.62%, 9/136), and South America (1.03%, 4/390). NS5A RASs (L28M, R30Q, Q54H, and Y93H in GT1b, L31M in GT2b, and E62L in GT3a) varied significantly in geographic prevalence (Figure 2, all P-values <0.05). L28M, R30Q, and Y93H in GT1b showed the highest prevalence in Asia. NS5B RASs exhibited distinct global distribution patterns are present in Figure 3. C316N/H was found mostly in Asia (73.20%, 1923/2627), followed by in the Former USSR (63.77%, 213/334), Europe (31.82%, 415/1304), North America (22.81%, 52/228), South America (21.39%, 182/851), Oceania (10.64%, 5/47), the Middle East (21.82%, 24/110), Africa (4.49%, 7/156), Central America (0%, 0/10), and the Caribbean (0%, 0/12). In contrast, Asia had the lowest prevalence of L159F in GT1b (0.62%, 2/322), while S556G in GT1b commonly appeared in Oceania (26.92%, 7/26) but infrequently in North America (4.49%, 8/178). A150V in GT3a was most prevalent in Asia (44.09%), followed by Europe (31.19%), Oceania (24.29%), and North America (19.05%).

Figure 1 The global and regional frequency of naturally occurring NS3 RASs that showed unequal distribution by geographic regions. In each plot, except for the first bar representing the global prevalence, geographic regions were arranged in descending order according to the frequency of RASs. Sequences were clustered into Asia, Europe, North America, Central America, South America, Former USSR, Oceania, Africa, Caribbean or Middle East. In each plot, regions with <10 sequences were not shown. Region definition was according to the Los Alamos HCV Database. “A” in North A, South A and Central A denotes America.

Figure 2 NS5A RASs rate with significantly different frequencies among different geographic regions. In each plot, the first bar represents the global prevalence and geographic regions were arranged in descending order according to the frequency of RASs. Sequences were clustered into Asia, Europe, North America, Central America, South America, Former USSR, Oceania, Africa, Caribbean or Middle East. In each plot, only regions with at least 10 sequences were shown. Region definition was according to the Los Alamos HCV Database. “A” in North A, South A and Central A denotes America.

Figure 3 NS5B RASs rate with significantly different frequencies among different geographic regions. In each plot, the first bar represents the global prevalence and geographic regions were arranged in descending order according to the frequency of RASs. Sequences were clustered into Asia, Europe, North America, Central America, South America, Former USSR, Oceania, Africa, Caribbean or Middle East. In each plot, only regions with at least 10 sequences were shown. Region definition was according to the Los Alamos HCV Database. “A” in North A, South A and Central A denotes America.

Naturally occurring combined RASs

The associations of RASs within and between the HCV NS3, NS5A, and NS5B genes were investigated using a hypergeometric test to detect significantly more or less frequent RAS co-occurrences. This analysis identified pairs of variants that may result in improved or reduced fitness, and RASs were defined as co-occurring or mutually exclusive based on their observed frequencies. Subtypes GT1a and 1b were separately analyzed. For each pair of RASs, only overlapping sequences were used.

Dozens of RAS pairs were identified. In GT1a, RAS combinations within NS3 is shown in Figure 4A. Q80K has three partners (R155K, D168E, and T54S), but the frequencies of all their combinations was significantly lower than expected. Q80K and R155K were respectively observed in 31.87% (2275/7137) and 0.96% (69/7137) of GT1a sequences, but the pair was only found in 0.056% (4/7137) of sequences compared with the expected level (0.308%). Thus, these RASs were considered a mutually exclusive pair. All of these RASs, except Q80K, showed significantly higher co-occurrences with other RASs than expected. For example, R155K tends to be present with V36M, D168G, and T54S. Co-occurring pairs in NS5A included L31M+Y93C and Q30H+Y93H (Figure 4B), and those in NS5B included NI-L159F+NNI-S556G and NNI-M414T+NNI-S556G (Figure 4C). We also identified some co-occurring RASs pairs among NS3, NS5A, and NS5B, including NS3-Q80K+NA5A-M28T/V, NS3-T54S+NS5A-Q30H/L and NS3-V36M+NS5B-A553V and so on (Figure 4D). Two RASs from different regions co-occurred rarely in this study (one or two of the 734 sequences). This means that each of these RASs occurred rarely, but they tend to be co-occurred.

Figure 4 Co-occurring and mutually exclusive RAS pairs within NS3 (A), NS5A (B) and NS5B (C) and among regions (D) in GT1a; within NS3 (E), NS5A (F) and NS5B (G) and among regions (H) in GT-1b. Bold line represents HCV genome, the figures on the line indicate the location of amino acid, the characters above the location indicate the wild type amino acids, and the characters below the location indicate RASs. The solid, and dashed line connecting two RASs indicate co-occurring RAS pairs (significantly more frequent appearance than expected) and mutually exclusive RAS pairs (significantly infrequent occurrence than expected), respectively. Amino acid residue position is numbered relative to the first amino acid of the NS3, NS5A, or NS5B region.

For GT1b, co-occurring pairs in NS3, NS5A, and NS5B were shown in Figure 4E, F, and G respectively. NS3 RAS pairs included T54S+Q80L, Y56F+S122T and so on. NS5A pairs included Y93H+Q54H and L28M+R30Q. Within NS5B, similar to GT1a, pairs NI-L159F+NNI-S556G and NNI-M414I/T+NNI-S556G were identified. Other pairs included C316H+V321I. 95.4% of sequences with 316H have 321I. Co-occurring pairs between NS3, NS5A, and NS5B, included NS5A-L28M+NS5B-C316N, NS5A-R30Q+NS5B-C316N, and NS3-V36L+NS5B-S556N (Figure 4H).


This study investigated naturally occurring RASs among all 86 confirmed HCV subtypes using nucleotide sequences from multiple public databases. We analyzed the frequency and distribution of RASs based on HCV subtype and global geographic regions. In addition, co-occurring RAS and mutually exclusive RAS pairs were identified in subtypes GT1a and 1b within or between the NS3, NS5A, and NS5B genes.

The frequency of Q80K in NS3 varied by both HCV subtype and geographic regions. This RAS was detected in nearly one-third of HCV GT1a sequences and was particularly prevalent in North America, which corroborates findings from previous studies. The NS3 R155K and D168E substitutions, which confer resistance to simeprevir and cross-resistance to other NS3/4A protease inhibitors, appeared in 0.96% and 0.23% of HCV GT1a sequences, respectively. Frequencies of Q80K+R155K and Q80K+D168E were lower than expected, and the pairs appeared to be mutually exclusive. However, these observations contrast with those from another study in which 83% (29/35) of patients infected with HCV GT1a harboring Q80K who experienced virologic failure with simeprevir plus Peg-IFNα/RBV developed virus with a treatment-emergent R155K.41 R155 enables protein conformation favorable for interactions with the quinoline moiety of simeprevir (TMC435), and the salt bridge network between Q80, R155, and D168 within the complex stabilizes this conformation. The R155K mutation results in loss of the salt bridge between residue 155 and Asp168, which leads to reduced simeprevir efficacy.42

Y93H was most prevalent in GT1b (dominant in Asia), and present in 1.35% GT3a and the two 7a and one 7b sequences. Its distribution may be associated with polymorphism of some immune genes. Nguyen et al suggested that the frequency of Y93H in patients who were IFNL3 rs12979860 CC major homozygotes (30%, 3/10) was higher than in the non-CC group in Ireland (11.1%, 4/36),43 and Asian patients also had a high frequency of IFNL3 rs12979860 CC (approximately 90%).44 This association may explain the prevalence of Y93H in Asia. Y93H variants reduce viral sensitivity to ledipasvir in the GT3 HCV subtype.45 The low frequency of Y93H in GT3a is consistent with previous reports, which found only 1 or 2 patients carrying Y93H of among approximately 50 patients at baseline.4648

The NI-specific RAS S282T, the only RAS to confer in vitro resistance to sofosbuvir, was detected in 27 sequences from GT1a, GT1b, GT2b, GT2h, GT3a, and GT4a. Although this RAS was widely distributed geographically, the most recent notation of this RAS in the searched databases was from 2008. This finding suggests S282T may be deleterious to HCV fitness and could explain why S282T has not been recently identified in samples from clinical trials and has only been found in a few patients with viral relapse in recent years.4952 The first case involving the S282T variant was reported when viral breakthrough occurred at week 12 in a patient infected with genotype GT3a.49

L159F is not associated with reduced sofosbuvir susceptibility, although this RAS was frequently detected with C316N. The high frequency of this double mutation was reported in untreated Brazilian patients infected with GT1b.53 The combination of L159F with C316N was also frequently found in GT1b-infected patients who failed to respond to sofosbuvir/ribvirin or other sofosbuvir-based regimens.54 Notably, we found a very high prevalence of C316N, but a very low occurrence of L159F, in Asia. As demonstrated in a study of Japanese patients, deep sequencing showed that 30.0% of patients with C316N also carried L159F, indicating that the variant is present but not easily detected due to low abundance.55 S556G significantly co-occurred with C316N in GT1b sequences, which reflects results from a previous study showed this combination after treatment failure with three DAAs (paritaprevir, ombitasvir, and dasabuvir) in GT1b-infected patients.56

Two important points need to be addressed. The first one is whether the observed RASs are the result of natural HCV variation or of transmission from patients who selected RASs during DAA treatment is unclear. This information was not available in overwhelming studies. In clinical practice, it is difficult to determine the source of infection. The second one is about sequences with highly similarity. Although we have excluded all the clones from the same individuals and only kept one sequence from DAA naive patients with multiple visits, we cannot rule out the possibility that there may be some sequences originated from the same individual at different time points but are not specified in NCBI or related publications.

In conclusion, we obtained the knowledge about the geographic and subtype specific prevalence for an updated list of RASs. Our data may be of special relevance for those countries where highly effective antivirals might not be available. Considering the geographic and subtype specific RASs prevalence will help the clinicians to make optimal treatment choices. The RASs pairs both mutually exclusive and co-occurring would benefit anti-HCV drug development. In addition, given that the emergence of RASs is a growing issue in the setting of current treatment with DAAs, the results provide valuable data on the baseline prevalence, which can be used to monitor for increasing antiviral resistance in the future.


DAA, direct-acting antiviral; GT, genotype; HCV, hepatitis C virus; NI, nucleos(t)ide inhibitors; NNI, non-nucleoside inhibitors; RAS, resistance-associated substitution; SVR, sustained virologic response.


We are grateful to Dr. Hongmei Mo for the suggestion. This work was supported by the Natural Science Foundation of China [grant number 81301415], Natural Science and Technology Major Project [grant number 2014ZX10002002] and Program for JLU Science and Technology Innovative Research Team [grant number 2017TD-08].

Author contributions

All authors contributed to data analysis, drafting and revising the article, gave final approval of the version to be published, and agree to be accountable for all aspects of the work.


The authors report no conflicts of interest in this work.


1. Lavanchy D. The global burden of hepatitis C. Liver Int. 2009;29 Suppl 1:74–81. doi:10.1111/j.1478-3231.2008.01934.x

2. Westbrook RH, Dusheiko G. Natural history of hepatitis C. J Hepatol. 2014;61(1 Suppl):S58–S68. doi:10.1016/j.jhep.2014.07.012

3. El-Serag HB. Hepatocellular carcinoma. N Engl J Med. 2011;365(12):1118–1127. doi:10.1056/NEJMra1001683

4. Guedj J, Dahari H, Rong L, et al. Modeling shows that the NS5A inhibitor daclatasvir has two modes of action and yields a shorter estimate of the hepatitis C virus half-life. Proc Natl Acad Sci U S A. 2013;110(10):3991–3996. doi:10.1073/pnas.1203110110

5. Neumann AU, Lam NP, Dahari H, et al. Hepatitis C viral dynamics in vivo and the antiviral efficacy of interferon-alpha therapy. Science. 1998;282(5386):103–107. doi:10.1126/science.282.5386.103

6. Smith DB, Bukh J, Kuiken C, et al. Expanded classification of hepatitis C virus into 7 genotypes and 67 subtypes: updated criteria and genotype assignment web resource. Hepatology. 2014;59(1):318–327. doi:10.1002/hep.26744

7. van der Meer AJ, Veldt BJ, Feld JJ, et al. Association between sustained virological response and all-cause mortality among patients with chronic hepatitis C and advanced hepatic fibrosis. JAMA. 2012;308(24):2584–2593. doi:10.1001/jama.2012.144878

8. Hadziyannis SJ, Sette H Jr, Morgan TR, et al. Peginterferon-alpha2a and ribavirin combination therapy in chronic hepatitis C: a randomized study of treatment duration and ribavirin dose. Ann Intern Med. 2004;140(5):346–355. doi:10.7326/0003-4819-140-5-200403020-00010

9. Mangia A, Santoro R, Minerva N, et al. Peginterferon alfa-2b and ribavirin for 12 vs. 24 weeks in HCV genotype 2 or 3. N Engl J Med. 2005;352(25):2609–2617. doi:10.1056/NEJMoa042608

10. Pang PS, Planet PJ, Glenn JS. The evolution of the major hepatitis C genotypes correlates with clinical response to interferon therapy. PLoS One. 2009;4(8):e6579. doi:10.1371/journal.pone.0006579

11. Asselah T, Boyer N, Saadoun D, Martinot-Peignoux M, Marcellin P. Direct-acting antivirals for the treatment of hepatitis C virus infection: optimizing current IFN-free treatment and future perspectives. Liver Int. 2016;36 Suppl 1:47–57. doi:10.1111/liv.13027

12. McCown MF, Rajyaguru S, Kular S, Cammack N, Nájera I. GT-1a or GT-1b subtype-specific resistance profiles for hepatitis C virus inhibitors telaprevir and HCV-796. Antimicrob Agents Chemother. 2009;53(5):2129–2132. doi:10.1128/AAC.01598-08

13. Cento V, Mirabelli C, Salpini R, et al. HCV genotypes are differently prone to the development of resistance to linear and macrocyclic protease inhibitors. PLoS One. 2012;7(7):e39652. doi:10.1371/journal.pone.0039652

14. Pawlotsky JM. Treatment failure and resistance with direct-acting antiviral drugs against hepatitis C virus. Hepatology. 2011;53(5):1742–1751. doi:10.1002/hep.24262

15. McPhee F, Hernandez D, Yu F, et al. Resistance analysis of hepatitis C virus genotype 1 prior treatment null responders receiving daclatasvir and asunaprevir. Hepatology. 2013;58(3):902–911. doi:10.1002/hep.26388

16. Karino Y, Toyota J, Ikeda K, et al. Characterization of virologic escape in hepatitis C virus genotype 1b patients treated with the direct-acting antivirals daclatasvir and asunaprevir. J Hepatol. 2013;58(4):646–654. doi:10.1016/j.jhep.2012.11.012

17. Komatsu TE, Boyd S, Sherwat A, et al. Regulatory analysis of effects of hepatitis C virus NS5A polymorphisms on efficacy of elbasvir and grazoprevir. Gastroenterology. 2017;152(3):586–597. doi:10.1053/j.gastro.2016.10.017

18. Ji F, Wei B, Yeo YH, et al. Systematic review with meta-analysis: effectiveness and tolerability of interferon-free direct-acting antiviral regimens for chronic hepatitis C genotype 1 in routine clinical practice in Asia. Aliment Pharmacol Ther. 2018;47(5):550–562. doi:10.1111/apt.14507

19. Panel AIHG. Hepatitis C guidance: AASLD-IDSA recommendations for testing, managing, and treating adults infected with hepatitis C virus. Hepatology. 2015;62(3):932–954. doi:10.1002/hep.27950

20. European Association for Study of L. EASL recommendations on treatment of hepatitis C 2015. J Hepatol. 2015;63(1):199–236. doi:10.1016/j.jhep.2015.03.025

21. Bartels DJ, Sullivan JC, Zhang EZ, et al. Hepatitis C virus variants with decreased sensitivity to direct-acting antivirals (DAAs) were rarely observed in DAA-naive patients prior to treatment. J Virol. 2013;87(3):1544–1553. doi:10.1128/JVI.02294-12

22. Paolucci S, Fiorina L, Mariani B, et al. Naturally occurring resistance mutations to inhibitors of HCV NS5A region and NS5B polymerase in DAA treatment-naive patients. Virol J. 2013;10:355. doi:10.1186/1743-422X-10-355

23. Costantino A, Spada E, Equestre M, et al. Naturally occurring mutations associated with resistance to HCV NS5B polymerase and NS3 protease inhibitors in treatment-naive patients with chronic hepatitis C. Virol J. 2015;12:186. doi:10.1186/s12985-015-0414-1

24. Wang Y, Rao HY, Xie XW, et al. Direct-acting antiviral agents resistance-associated polymorphisms in chinese treatment-naive patients infected with genotype 1b hepatitis C virus. Chin Med J (Engl). 2015;128(19):2625–2631. doi:10.4103/0366-6999.166038

25. Li Z, Chen ZW, Li H, Ren H, Hu P. Prevalence of hepatitis C virus-resistant association substitutions to direct-acting antiviral agents in treatment-naive hepatitis C genotype 1b-infected patients in western China. Infect Drug Resist. 2017;10:377–392. doi:10.2147/IDR.S146595

26. Chen ZW, Li H, Ren H, Hu P. Global prevalence of pre-existing HCV variants resistant to direct-acting antiviral agents (DAAs): mining the GenBank HCV genome data. Sci Rep. 2016;6:20310. doi:10.1038/srep20310

27. Kliemann DA, Tovo CV, Da Veiga AB, de Mattos AA, Wood C. Polymorphisms and resistance mutations of hepatitis C virus on sequences in the European hepatitis C virus database. World J Gastroenterol. 2016;22(40):8910–8917. doi:10.3748/wjg.v22.i40.8910

28. Patino-Galindo JA, Salvatierra K, Gonzalez-Candelas F, López-Labrador FX. Comprehensive screening for naturally occurring hepatitis C virus resistance to direct-acting antivirals in the NS3, NS5A, and NS5B genes in worldwide isolates of viral genotypes 1 to 6. Antimicrob Agents Chemother. 2016;60(4):2402–2416. doi:10.1128/AAC.02776-15

29. Welzel TM, Bhardwaj N, Hedskog C, et al. Global epidemiology of HCV subtypes and resistance-associated substitutions evaluated by sequencing-based subtype analyses. J Hepatol. 2017;67(2):224–236. doi:10.1016/j.jhep.2017.03.014

30. Sorbo MC, Cento V, Di Maio VC, et al. Hepatitis C virus drug resistance associated substitutions and their clinical relevance: update 2018. Drug Resist Updat. 2018;37:17–39. doi:10.1016/j.drup.2018.01.004

31. Sarrazin C. The importance of resistance to direct antiviral drugs in HCV infection in clinical practice. J Hepatol. 2016;64(2):486–504. doi:10.1016/j.jhep.2015.09.011

32. Pawlotsky JM. Hepatitis C virus resistance to direct-acting antiviral drugs in interferon-free regimens. Gastroenterology. 2016;151(1):70–86. doi:10.1053/j.gastro.2016.04.003

33. Ng TI, Tripathi R, Reisch T, et al. In vitro antiviral activity and resistance profile of the next-generation hepatitis C virus NS3/4A protease inhibitor glecaprevir. Antimicrob Agents Chemother. 2018;62(1):e01620–17.

34. Wing PAC, Jones M, Cheung M, et al. Amino acid substitutions in genotype 3a hepatitis C virus polymerase protein affect responses to sofosbuvir. Gastroenterology. 2019;157:692–704.e9. doi:10.1053/j.gastro.2019.05.007

35. McPhee F, Ueland J, Vellucci V, et al. Impact of preexisting hepatitis C virus genotype 6 NS3, NS5A, and NS5B polymorphisms on the in vitro potency of direct-acting antiviral agents. Antimicrob Agents Chemother. 2019;63(4). doi:10.1128/AAC.00779-19

36. Cheng G, Tian Y, Doehle B, et al. In vitro antiviral activity and resistance profile characterization of the hepatitis C virus NS5A inhibitor ledipasvir. Antimicrob Agents Chemother. 2016;60(3):1847–1853. doi:10.1128/AAC.02524-15

37. Dvory-Sobol H, Han B, Lu J, et al. In vitro resistance profile of hepatitis C virus NS5A inhibitor velpatasvir in genotypes 1 to 6. J Viral Hepat. 2019;26(8):991–1001. doi:10.1111/jvh.13116

38. Han B, Parhy B, Zhou E, et al. In vitro susceptibility of hepatitis C virus genotype 1 through 6 clinical isolates to the pangenotypic NS3/4A inhibitor voxilaprevir. J Clin Microbiol. 2019;57(4). doi:10.1128/JCM.01844-18

39. Sano T, Akuta N, Suzuki F, et al. Role of NS5A-L31/Y93 double wild-type in failure of glecaprevir/pibrentasvir double therapy in two patients with a history of direct-acting antiviral agent failure: an ultra-deep sequencing analysis. Intern Med. 2019. doi:10.2169/internalmedicine.2604-18

40. Kumada H, Suzuki Y, Ikeda K, et al. Daclatasvir plus asunaprevir for chronic HCV genotype 1b infection. Hepatology. 2014;59(6):2083–2091. doi:10.1002/hep.27113

41. Harrington PR, Komatsu TE, Deming DJ, et al. Impact of hepatitis C virus polymorphisms on direct-acting antiviral treatment efficacy: regulatory analyses and perspectives. Hepatology. 2018;67(6):2430–2448. doi:10.1002/hep.29693

42. Xue W, Pan D, Yang Y, Liu H, Yao X. Molecular modeling study on the resistance mechanism of HCV NS3/4A serine protease mutants R155K, A156V and D168A to TMC435. Antiviral Res. 2012;93(1):126–137. doi:10.1016/j.antiviral.2011.11.007

43. Nguyen LT, Hall N, Sheerin D, Carr M, De Gascun CF. Naturally occurring HCV NS5A/B inhibitor resistance-associated mutations to direct-acting antivirals. Antivir Ther. 2016;21(5):447–453. doi:10.3851/IMP3025

44. Wu R, Chi X, Wang X, et al. IFNL4 ss469415590 polymorphism contributes to treatment decisions in patients with chronic hepatitis C virus genotype 1b, but not 2a, infection. Infect Genet Evol. 2016;39:132–140. doi:10.1016/j.meegid.2016.01.020

45. Hernandez D, Zhou N, Ueland J, Monikowski A, McPhee F. Natural prevalence of NS5A polymorphisms in subjects infected with hepatitis C virus genotype 3 and their effects on the antiviral activity of NS5A inhibitors. J Clin Virol. 2013;57(1):13–18. doi:10.1016/j.jcv.2012.12.020

46. Peiffer KH, Sommer L, Susser S, et al. Interferon lambda 4 genotypes and resistance-associated variants in patients infected with hepatitis C virus genotypes 1 and 3. Hepatology. 2016;63(1):63–73. doi:10.1002/hep.28255

47. Leroy V, Angus P, Bronowicki JP, et al. Daclatasvir, sofosbuvir, and ribavirin for hepatitis C virus genotype 3 and advanced liver disease: a randomized phase III study (ALLY-3+). Hepatology. 2016;63(5):1430–1441. doi:10.1002/hep.28473

48. Bartolini B, Giombini E, Taibi C, et al. Characterization of naturally occurring NS5A and NS5B polymorphisms in patients infected with HCV genotype 3a treated with direct-acting antiviral agents. Viruses. 2017;9(8):212. doi:10.3390/v9080212

49. Walker A, Filke S, Lubke N, et al. Detection of a genetic footprint of the sofosbuvir resistance-associated substitution S282T after HCV treatment failure. Virol J. 2017;14(1):106. doi:10.1186/s12985-017-0779-4

50. Svarovskaia ES, Dvory-Sobol H, Parkin N, et al. Infrequent development of resistance in genotype 1–6 hepatitis C virus-infected subjects treated with sofosbuvir in phase 2 and 3 clinical trials. Clin Infect Dis. 2014;59(12):1666–1674. doi:10.1093/cid/ciu697

51. Donaldson EF, Harrington PR, O’Rear JJ, Naeger LK. Clinical evidence and bioinformatics characterization of potential hepatitis C virus resistance pathways for sofosbuvir. Hepatology. 2015;61(1):56–65. doi:10.1002/hep.27375

52. Hedskog C, Dvory-Sobol H, Gontcharova V, et al. Evolution of the HCV viral population from a patient with S282T detected at relapse after sofosbuvir monotherapy. J Viral Hepat. 2015;22(11):871–881. doi:10.1111/jvh.12405

53. Peres-da-Silva A, Brandao-Mello CE, Lampe E. Prevalence of sofosbuvir resistance-associated variants in Brazilian and worldwide NS5B sequences of genotype-1 HCV. Antivir Ther. 2017;22(5):447–451. doi:10.3851/IMP3131

54. Di Maio VC, Cento V, Lenci I, et al. Multiclass HCV resistance to direct-acting antiviral failure in real-life patients advocates for tailored second-line therapies. Liver Int. 2017;37(4):514–528. doi:10.1111/liv.13327

55. Ito J, Suda G, Yamamoto Y, et al. Prevalence and characteristics of naturally occurring sofosbuvir resistance-associated variants in patients with hepatitis C virus genotype 1b infection. Hepatol Res. 2016;46(13):1294–1303. doi:10.1111/hepr.12685

56. Dietz J, Susser S, Vermehren J, et al. Patterns of resistance-associated substitutions in patients with chronic HCV infection following treatment with direct-acting antivirals. Gastroenterology. 2018;154(4):976–988.e4. doi:10.1053/j.gastro.2017.11.007

Creative Commons License This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.

Download Article [PDF]