Genom DataGenom DataGenomics Data2213-5960Elsevier264842294583633S2213-5960(15)00078-110.1016/j.gdata.2015.05.015Regular ArticleIn silico analysis of consequences of non-synonymous SNPs of Slc11a2 gene in Indian bovinesPatelShreya M.KoringaPrakash G.ReddyBhaskar B.NathaniNeelam M.JoshiChaitanya G.cgjoshi@rediffmail.comDepartment of Animal Biotechnology, College of Veterinary Science and Animal Husbandry, Anand Agricultural University, Anand,388001 Gujarat, IndiaCorresponding author. Tel./fax: + 91 2692 261201. cgjoshi@rediffmail.com305201592015305201557279183201521520152152015© 2015 The Authors2015This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

The aim of our study was to analyze the consequences of non-synonymous SNPs in Slc11a2 gene using bioinformatic tools. There is a current need of efficient bioinformatic tools for in-depth analysis of data generated by the next generation sequencing technologies. SNPs are known to play an imperative role in understanding the genetic basis of many genetic diseases. Slc11a2 is one of the major metal transporter families in mammals and plays a critical role in host defenses. In this study, we performed a comprehensive analysis of the impact of all non-synonymous SNPs in this gene using multiple tools like SIFT, PROVEAN, I-Mutant and PANTHER. Among the total 124 SNPs obtained from amplicon sequencing of Slc11a2 gene by Ion Torrent PGM involving 10 individuals of Gir cattle and Murrah buffalo each, we found 22 non-synonymous. Comparing the prediction of these 4 methods, 5 nsSNPs (G369R, Y374C, A377V, Q385H and N492S) were identified as deleterious. In addition, while tested out for polar interactions with other amino acids in the protein, from above 5, Y374C, Q385H and N492S showed a change in interaction pattern and further confirmed by an increase in total energy after energy minimizations in case of mutant protein compared to the native.

Highlights

22 nsSNPs were predicted to decrease the stability of protein based on I-Mutant.

From these SNPs, 5 was identified as deleterious by SIFT, PROVEAN, and PANTHER.

Y374C, Q385H and N492S were found to be damaging.

AbbreviationsATM, ataxia telangiectasia mutatedBRAF, B-RafCFTR, cystic fibrosis transmembrane conductance regulatorGalNAc-T1, N-acetylgalactosaminyltransferase 1GATK, Genome Analysis Tool KitHBB, hemoglobin betaHMM, Hidden Markov ModelIGF1R, insulin-like growth factor 1 receptorNCBI, National Center for Biotechnology InformationPANTHER, Protein Analysis Through Evolutionary RelationshipsPolyPhen, Polymorphism PhenotypingPROVEAN, Protein Variation Effect AnalyzerRMSD, root-mean-square deviationSIFT, sorting intolerant from tolerantSlc11a2, solute carrier family 11 member 2SNP, single nucleotide polymorphismTMDs, transmembrane domainsTYRP1, tyrosinase-related protein 1KeywordsNon-synonymousPANTHERIon torrent PGMSIFTProtein
Introduction

Single-nucleotide polymorphisms (SNPs) play a major role in understanding the genetic basis of many complex diseases and it is still a major challenge to identify the functional SNPs in a disease-related gene. Non-synonymous SNPs (nsSNPs) cause changes in the amino acid residues and are important factors contributing to the functional diversity of the encoded proteins [1]. Non-synonymous SNPs affect gene regulation by altering DNA and transcriptional binding factors and maintaining the structural integrity of cells and tissues. Also, nsSNPs affect the functional roles of proteins involved in signal transduction of visual, hormonal, and other stimulants [2], [3].

The advents in computational algorithms are useful for predicting the impact of amino-acid substitutions on protein structure and function. The computational tools like SIFT, PolyPhen, I-Mutant, PANTHER are used nowadays for detecting impact of amino acid substitution especially in coding exonic region [2], [4], [5]. Earlier reports have shown that the computational tools precisely predicted the consequences of nsSNPs associated with genes such as IGF1R[6], ATM[7], HBB[8], CFTR[9], BRAF[10], TYRP1[11], and GalNAc-T1.[12].

The mammalian Slc11a1 and Slc11a2 proteins are a large family of secondary metal transporters. Slc11a1 and Slc11a2 function as pH-dependent divalent cation transporters that play a critical role in host defenses against infections and in Fe2 + homeostasis respectively [13]. Slc11a1 is expressed primarily in macrophages and Slc11a2 has a much broader range of tissue expression. The mechanism by which these proteins exert their antimicrobial activity is uncertain. However observation that these proteins transport Fe2 + down a proton gradient suggests that their antimicrobial activity is due to the removal of Fe2 + (or other divalent metals) from the acidic phagosome and bacterial death due to essential micronutrients starvation [14]. Slc11a2is a 90–100 kD transmembrane protein with intracellular N- and C-termini with an even number of 12 transmembrane domains. First ten TMDs constitute the main functional unit of this family of transporters and TMD1 is a highly conserved sequence motif (residues 384–403), in which alterations abrogate transport [15]. The Slc11a2 gene is comprised of 17 exons and spans more than 36 kb. It contains an additional 5′ exon and intron (exon and intron 1) and an additional 3′ exon (exon 17) and intron (intron 16).Slc11a2 proteins play a central role in iron homeostasis and transport is electrogenic caused by proton movement through the transporter(substrate-dependent and substrate-independent H + leak) [16], [17]. A loss-of-function mutation (G185R) is reported to cause very severe microcytic anemia in the mk mouse and in the Belgrade rat.In addition, a number of loss-of-function missense (R416C, G212V, delV114) and splicing mutations have been detected in the human Slc11a2 gene in patients suffering from hypochromic microcytic anemia with serum and liver iron overload [18], [19]. The aim of our study was to identify functional and structural impact of nsSNPs of Slc11a2. From amino acid sequence retrieved from NCBI, 3D model of this protein was constructed using RaptorX protein modeling tool and visualized in PyMOL. SNPs were inserted in the native sequence of protein and its consequences were checked using several computational tools.

Materials and methodsVariant calling

Genetic variation of the Slc11a2gene was analyzed from data obtained by sequencing of exonic regions of this innate immune gene which was studied to screen SNPs. Ten Bos taurus animals of Gir breed and ten Bubalus bubalis animals of Murrah breed were used for genomic DNA extraction (unpublished data). GATK software tools (version 2.8; http://www.broadinstitute.org) were used for genotype calling with recommended parameters. Genotypes were called by the GATK Unifiedgenotyper tool, and variants were filtered by depth 60.

Deleterious nsSNP found by the SIFT program

SIFT performs multiple alignments of a number of peptide sequences until a median conservation for the sequence is reached at the default of 3.0 and then it predicts whether substitution with any of the other amino acids is tolerated or deleterious for every position in the submitted sequence [20]. The SIFT prediction was given as a tolerance index (TI) score ranging from 0.0 to 1.0, which was the normalized probability that the amino acid change was tolerated. A nsSNP with a TI score of V0.05 was considered to be deleterious i.e. an amino acids with probabilities < 0.05 were predicted to be deleterious. We submitted the amino acid sequence of Slc11a2 along with nsSNPs with corresponding amino acid positions.

Validation and functional characterization predicted nsSNPs by PANTHER-cSNP

The functional validation of nsSNPs predicted by SIFT was analyzed by PANTHER (Protein Analysis Through Evolutionary Relationships;www.pantherdb.org/tools/csnp). This tool estimates the likelihood of a particular non-synonymous coding SNP to cause a functional impact on the protein using Hidden Markov Models (HMM) based modeling and evolutionary relationship. It calculates the subPSEC (substitution position-specific evolutionary conservation) score based on an alignment of evolutionarily related proteins [5]. The score of subPESC ≥− 3 was predicted as a less deleterious, while ≤− 3 was predicted as the deleterious effect. Amino acid sequence in FASTA format was uploaded.

Prediction of functional impact of nsSNPs

PROVEAN (Protein Variation Effect Analyzer) is a tool which predicts the impact of an amino acid substitution or indel on the biological function of a protein (http://provean.jcvi.org/index.php).This algorithm allows for the best balanced separation between the deleterious and neutral amino acids, based on a threshold. The score <− 2.5 indicates that the variant is deleterious and >− 2.5 score is considered as a neutral variant [21]. A query peptide sequence of Slc11a2 was provided in FASTA format to the PROVEAN server for predicting the functional impact of the SNPs.

Investigation of mutant protein stability by I-Mutant 2.0

I-Mutant2.0 (http://folding.biofold.org/cgi-bin/i-mutant2.0) is a Support Vector Machine-based web server for the automatic prediction of protein stability changes upon single-site mutations. The input FASTA sequence of protein along with the residues change was provided for analysis of DDG value (kcal/mol) [22]. Also the RI value (reliability index) was computed.

Modeling of native and mutant structure of Slc11a2

RaptorX, a protein structure prediction server,predicts 3D structures for protein sequences lacking close homologs in the Protein Data Bank (PDB).For given FASTA sequence RaptorX predicted its secondary and tertiary structures as well as solvent accessibility and disordered regions. RaptorX also calculates p-value for the relative global quality, GDT (global distance test) and uGDT (un-normalized GDT) for the absolute global quality, and RMSD for the absolute local quality of each residue in the model. The 3D structures were visualized by PyMOL (http://www.pymol.org/) which is an open source molecular isualization tool. Mutant model was also constructed using PyMoL tool.

Model quality & structure assessment and RMSD difference

Model quality was checked both of native and altered protein by Ramachandran plot using software RAMPAGE (mordred.bioc.cam.ac.uk/~rapper/rampage.php) which analyzed residue-by-residue geometry and overall structure geometry. PyMOL was used to locate nsSNPs on protein structure and for analyzing RMS deviation by superimposing both native and mutant structures. Amino acids at the position of SNPs were checked for polar interactions with other amino acids in the protein using PyMOL. In addition, total energy after energy minimization was calculated for each altered model using Swiss PDB viewer.

Binding site and ligand prediction

To find whether these identified nsSNPs are present on any epitope region or any protein binding region, we performed binding site prediction using RaptorX Binding and FT site server which predicted binding site regions in Slc11a2 protein.

ResultsVariant calling

Upon variant calling, total 124 SNPs were observed in Slc11a2 gene (Supplementary Table 1). Among these SNPs, 22 (17.74%) and 74 (59.67%) were found to be non-synonymous and synonymous respectively. The remaining 28 (22.58%) were found to be in the non-coding region, 3 in UTR 5′region and 25 in UTR 3′ region.

Deleterious nsSNP found by the SIFT program

The SIFT identified 8 nsSNPs viz. I114T, G369R, Y374C, A377V, Q385H, M389V, N492S and V497M to be deleterious. Non-synonymous SNPs with SIFT prediction and SIFT score are shown in Table 1.

Validation and functional characterization predicted nsSNPs by PANTHER-cSNP

The results of SIFT were further confirmed by investigating the effect of nsSNPs on protein function using HMM based PANTHER tool. The analysis of 22 non-synonymous mutations revealed that 5 SNPs (G369R, Y374C, A377V, Q385H and N492S) reflected a subPSEC score >−3, thus PANTHER classified them as deleterious. Remaining SNPs of Slc11a2 had a score <− 3 and were classified as tolerated. Non-synonymous SNPs along with PANTHER score are given in Table 1.

Prediction of functional impact of nsSNPs

Further confirmation of effect of nsSNPs on protein was done using PROVEAN tool which revealed 8 from 22 nsSNPs (I114T, G369R, Y374C, A377V, Q385H, M389V, S490F and N492S) to be deleterious. The higher the tolerance index is, the less functional impact a particular amino acid substitution is likely to have, and vice versa. Among the 22 nsSNPs, 8 (36.36%) were found to be deleterious, having a tolerance index score of ≤− 2.5 using PROVEAN tool (Table 1).

Investigation of mutant protein stability by I-Mutant 2.0

To add another layer of confirmation, we also analyzed effect of these nsSNPs using I-Mutant 2.0. It gave result in the form of effect of mutants on stability of protein with reliability index at pH 7.0 and temperature 25 °C. Here in our case, in Slc11a2, all 22 non-synonymous SNPs showed resulting decrease in stability of the protein. All 22 SNPs with reliability index and DDG value are given in Table2.

Analysis of structural model of Slc11a2 protein

The 3D structure of native model generated through RaptorX was visualized using PyMoL. Slc11a2 is having 568 amino acid residues (Supplementary Fig. 1). From these 480(85%) residues were modeled and 65(11%) positions predicted as disordered. Secondary structures revealed 69% helix, 0% beta sheet and 30% loop structures. The solvent accessibility was divided into three states by 2 cut-off values: 10% and 42%. Value less than 10% was identified as buried, larger than 42% value was identified as exposed and if value was between 10% and 42% was identified as medium. Proportions of buried, medium and exposed regions in our protein were 62%, 21% and 15% respectively. Overall uGDT (GDT) value was 143(25%). The uGDT is the unnormalized GDT (global distance test) score. For a protein with > 100 residues, uGDT > 50 is a good indicator. For a protein with < 100 residues, GDT > 50 is a good indicator. If a model has acceptable uGDT (> 50) but lower GDT (< 50), it indicates that only a small portion of the model may be good. P-value is the likelihood of a predicted model being worse than the best of a set of randomly-generated models for this protein (or domain), so P-value evaluates the relative quality of a model. The smaller the p-value, the higher is the quality of the model. For alpha proteins, p-value less than 10− 3 is a good indicator. For manly beta proteins, p-value less than 10− 4 is a good indicator. For this model of Slc11a2, RaptorX predicted p-value of 4.55e − 07. Twenty two mutant models were generated in PyMOL (Supplementary Fig. 2, Supplementary Fig. 3, Supplementary Fig. 4, Supplementary Fig. 5, Supplementary Fig. 6, Supplementary Fig. 7, Supplementary Fig. 8, Supplementary Fig. 9, Supplementary Fig. 10, Supplementary Fig. 11, Supplementary Fig. 12, Supplementary Fig. 13, Supplementary Fig. 14, Supplementary Fig. 15, Supplementary Fig. 16, Supplementary Fig. 17, Supplementary Fig. 18, Supplementary Fig. 19, Supplementary Fig. 20, Supplementary Fig. 21, Supplementary Fig. 22, Supplementary Fig. 23).

Model quality & structure assessment and RMSD difference

Ramachandran plot of native protein showed 442 residues (88.8%) in favored region, 42 residues (8.4%) in allowed region and 14 residues (2.8%) in outlier region (Supplementary Fig. 24). While in case of 22 altered proteins, in case of 21 nsSNPs, Ramachandran plot showed similar pattern as native protein but for nsSNP G369R, one residue from favored region was shifted to outlier region. So in this case, 441 residues (88.6%) were in favored region and 15 residues (3.0%) were in outlier region (Supplementary Fig. 25). While checking RMSD value, it was observed that there was not much deviation from native protein. The higher the RMSD value, the more the deviation between the two structures which in turn changes their functional activity [23]. V108I, R465Q, W477L, and V517I showed somewhat higher RMSD values of 0.053, 0.057, 0.033, and 0.036 respectively which are given in Table 3. While tested for polar interactions, in case of some nsSNPs, there is a change in interaction patterns compared to native protein. SNPs V108I, I114T, T336K, T343A, Y374C, Q385H, N492S and A512V showed different interaction patterns than native protein's amino acids. In V108I, V108 formed two polar interactions with K104 and A112 while I108 in altered protein formed three interactions. Two were with K104, I108 with altered bond lengths and third extra interaction with L105 (Supplementary Fig. 26 & B). I114 in case of I114T having interaction with L110 with a bond length of 2.6 Å, but T114 had additional interaction to same amino acid with 3.3 Å bond length due to change of R group from non-polar to polar (Supplementary Fig. 27A & B). K336 which is having positively charged R group in altered protein forming 3.6 Å long interaction with V334, while T334 proposed polar R group not forming any interaction (Supplementary Fig. 28. & B). SNP T343A showed alteration of R group from polar to non-polar, which changed interactions (Supplementary Fig. 29. & B). Y374 in native protein forming one polar interaction with P370 and one with V378 of 2.9 Å and 2.8 Å length respectively. While altered amino acid C374 forming two polar interactions with P370 of 2.9 Å, 3.0 Å lengths and with V378 of 2.8 Å, 4.8 Å lengths (Fig. 1 & B). In Q385C, Q385 formed three polar interactions with T343 of 2.8 Å, 2.9 Å, and 3.2 Å lengths, one with L381 of 2.9 Å length, one with A382 of 3.1 Å length and one with M389 of 3.0 Å length but altered residue H385 had interaction with L381 of 2.9 Å length and one with V389 of 3.0 Å length because of the change in R group (Fig. 2 & B). Native residue N492 showed one interaction with I488 of 3.0 Å length and one with I491 of 3.2 Å length and altered residue S492 forming two polar interactions with I488 of 2.5 Å and 3.0 Å lengths (Fig. 3 & B).Similarly, A512V showed the change in bond length (Fig. 4 & B). While verified further for energy change, T336K, Y374C, Q385H, M389V, R465Q, W477L, L484V, S490F, N492S, D502G, V510M, A512V and V517I showed higher total energy after energy minimization than native protein which are given in Table 3.

Binding site and ligand prediction

Further when analyzed for binding site regions using several tools, results revealed ligands and ligand binding sites which are shown in Supplementary Tables 2 and 3. However, none of above nsSNPs resided in the above identified binding sites.

Discussion

In order to investigate structural and functional impact of nsSNPs present in coding region of Slc11a2, we performed extensive computational analysis. Non-synonymous SNPs in coding region can cause amino acid change further altering protein function which may lead to susceptibility to disease. Identification of deleterious nsSNPs from tolerant nsSNPs is ideal for analyzing individual susceptibility to disease. It is not necessary that all variants have a major deleterious functional impact and some may be well tolerated. However, nsSNPs which are linked to diseases or other phenotypes often have some molecular significance [4]. They may modify enzyme activity, destabilize protein structures or disrupt protein interactions.

Nowadays, major concern relating to nsSNP in molecular biology and population genetics is to identify and characterize the nsSNPs that are functionally related from those that are not [24]. To determine the functional effects of nsSNPs in Slc11a2 gene, we employed four widely used in silico tools specifically I-Mutant3, SIFT, PROVEAN and PANTHER. If a marker is found to be associated with the disease and the marker is a nsSNP, prediction tools can provide independent evidence as to whether the nsSNP itself contributes to disease. Because carrying out the appropriate assays may be time-consuming, these tools can filter out nsSNPs that are unlikely to affect protein function before experimentation. The difference in the results of these four prediction tools is due to the difference in features utilized by the methods therefore we would expect the outcomes to occur dissimilar at some point [6]. If the prediction results of all four tools for these identified nsSNPs in this ion transport innate immune gene Slc11a2 would be combined, it would provide high reliability. One of the nsSNP G185R was observed which abrogated iron transport in one of the phenotype of Belgrade rats [25]. This SNP was not observed in this analysis.

To test the effect of these nsSNPs on structural stability of protein, protein modeling proved to be an efficient in silico means using several bioinformatic tools. Change in amino acid can be further modeled and this altered modeled protein structure can be utilized during in silico approach to confirm the effect of particular nsSNP on stability of protein before validating in vitro. However here, nsSNPs are not falling in the epitope region according to the results of FT site and RaptorX Binding which identified potential binding sites in the protein structure. But change in amino acid affects polar–polar interactions within the protein molecule itself which further altered energy of stabilization and further destabilized the protein [26]. Here, as observed, in some amino acid changes, number of polar interactions changed which ultimately affected total energy of protein indicating decrease in protein stability. These imperative results indicate that identified nsSNPs in this protein might alter its stability and might affect the protein–protein interaction and metal binding sites.

By comparing the results of above 4 methods and total energy, we can conclude that nsSNPs viz. Y374C, Q385H and N492S should be further confirmed for their association with disordered Slc11a2 function in addition to existing nsSNPs of this gene. However, RMSD values were not that much higher and these nsSNPs were not residing in the metal binding site regions, suggesting that these nsSNPs might not be too strong candidate for disease association of this gene.

Conclusion

Nowadays, the next generation sequencing techniques are generating high throughput of data related to SNPs, but the evaluation of biologically functional SNPs using this in vitro studies is quite tedious, time consuming and economically less significant. On the other way, in silico approach can help us to predict the consequences of mutations and explain their affecting role in biological mechanisms. Out of 22, 8(36.36%) nsSNPs were revealed to be deleterious using SIFT. Similarly PROVEAN identified 8 nsSNPs deleterious. Additionally, I-Mutant3 predicted all 22 substitutions which affected the stability of protein. From the above 7 nsSNPs, PANTHER predicted 5 (22.72%) as damaging. G369R, Y374C, A377V, Q385H and N492S were predicted deleterious using abovementioned tools. Also, these nsSNPs were observed for altered interaction patterns and verified by calculating total energy change after energy minimization which confirmed Y374C, Q385H and N492S as damaging.

The following are the supplementary data related to this article.

Native protein Slc11a2.

Altered protein Slc11a2 (V108I).

Altered protein Slc11a2 (I114T).

Altered protein Slc11a2 (V334A).

Altered protein Slc11a2 (T336K).

Altered protein Slc11a2 (T343A).

Altered protein Slc11a2 (G369R).

Altered protein Slc11a2 (A371S).

Altered protein Slc11a2 (Y374C).

Altered protein Slc11a2 (A377V).

Altered protein Slc11a2 (Q385H).

Altered protein Slc11a2 (M389V).

Altered protein Slc11a2 (R465Q).

Altered protein Slc11a2 (W477L).

Altered protein Slc11a2 (L484V).

Altered protein Slc11a2 (S490F).

Altered protein Slc11a2 (N492S).

Altered protein Slc11a2 (V497M).

Altered protein Slc11a2 (D502G).

Altered protein Slc11a2 (V507A).

Altered protein Slc11a2 (V510M).

Altered protein Slc11a2 (A512V).

Altered protein Slc11a2 (V517I).

Ramachandran plot of native protein.

Number of residues in favored region (~ 98.0% expected):442 (88.8%), number of residues in allowed region (~ 2.0% expected):42 (8.4%), number of residues in outlier region: 14 (2.8%).

Ramachandran plot of altered protein (G369R).

Number of residues in favored region (~ 98.0% expected): 441 (88.6%), number of residues 489 in allowed region (~ 2.0% expected):42 (8.4%), number of residues in outlier region: 15(3.0%).

A. Interaction of native residue with vicinal residue (yellow dotted line) for SNP V108I.

B. Interaction of altered residue with vicinal residue (yellow dotted line) for SNP V108I.

A. Interaction of native residue with vicinal residue (yellow dotted line) for SNP I114T.

B. Interaction of altered residue with vicinal residue (yellow dotted line) for SNP I114T.

A. Interaction of native residue with vicinal residue (yellow dotted line) for SNP K336T.

B. Interaction of altered residue with vicinal residue (yellow dotted line) for SNP K336T.

A. Interaction of native residue with vicinal residue (yellow dotted line) for SNP T343A.

B. Interaction of altered residue with vicinal residue (yellow dotted line) for SNP T343A.

Supplementary Table 1

Interaction of residues.

Supplementary tables

Conflict of interest

The authors have no conflict of interest.

ReferencesYatesC.M.SternbergM.J.The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactionsJ. Mol. Biol.42520133949396323867278RajasekaranR.SudandiradossC.DossC.G.SethumadhavanR.Identification and in silico analysis of functional SNPs of the BRCA1 geneGenomics90200744745217719744GfellerD.ErnstA.JarvikN.SidhuS.S.BaderG.D.Prediction and experimental characterization of nsSNPs altering human PDZ-binding motifsPloS ONE.92014e9450724722214JohnsonM.M.HouckJ.ChenC.Environmental aspects selection for EEE using ANP methodScreening for Deleterious Nonsynonymous Single-nucleotide Polymorphisms in Genes Involved in Steroid Hormone Metabolism and Response14200513261329ThomasP.D.CampbellM.J.KejariwalA.MiH.KarlakB.DavermanR.PANTHER: a library of protein families and subfamilies indexed by functionGenom. Res.13200321292141de AlencarS.A.LopesJ.C.A comprehensive in silico analysis of the functional and structural impact of SNPs in the IGF1R genelJ. Biomed. Biotechnol.2010201071513920625407George Priya DossC.RajithB.Computational refinement of functional single nucleotide polymorphisms associated with ATM genePloS ONE72012e3457322529920AlanaziM.AbduljaleelZ.KhanW.WarsyA.S.ElrobhM.KhanZ.In silico analysis of single nucleotide polymorphism (SNPs) in human beta-globin genePloS one.62011e2587622028795George Priya DossC.RajasekaranR.SudandiradossC.RamanathanK.PurohitR.SethumadhavanR.A novel computational and structural analysis of nsSNPs in CFTR geneGenom. Med.220082332HussainM.R.ShaikN.A.Al-AamaJ.Y.AsfourH.Z.KhanF.S.MasoodiT.A.In silico analysis of Single Nucleotide Polymorphisms (SNPs) in human BRAF geneGenetics5082012188196KamarajB.PurohitR.In silico screening and molecular dynamics simulation of disease-associated nsSNP in TYRP1 gene and its structural consequences in OCA3BioMed Res. Int.20132013697,051MohamoudH.S.HussainM.R.El-HarouniA.A.ShaikN.A.QasmiZ.U.MericanA.F.First comprehensive in silico analysis of the functional and structural consequences of SNPs in human GalNAc-T1 geneComput. Math. Methods Med.20142014904,052CzachorowskiM.Lam-Yuk-TseungS.CellierM.GrosP.Transmembrane topology of the mammalian Slc11a2 iron transporterBiochemistry4820098422843419621945CooperC.A.ShayeghiM.TechauM.E.CapdevilaD.M.MacKenzieS.DurrantC.Analysis of the rainbow trout solute carrier 11 family reveals iron import ≤ pH 7.4 and a functional isoform lacking transmembrane domains 11 and 12FEBS lett.58120072599260417509573PinnerE.GruenheidS.RaymondM.GrosP.Functional complementation of the yeast divalent cation transporter family SMF by NRAMP2, a member of the mammalian natural resistance-associated macrophage protein familyJ. Biol. Chem.272199728,93328,9388995220MackenzieB.UjwalM.L.ChangM.H.RomeroM.F.HedigerM.A.Divalent metal-ion transporter DMT1mediates both H+ coupled Fe2 + and uncoupled fluxesPflügersArchiv - Eur. J. Physiol.4512006544558GunshinH.MackenzieB.BergerU.V.GunshinY.RomeroM.F.BoronW.F.Cloning and characterization of a mammalian proton-coupled metal-ion transporterNature38819974824889242408LeeP.L.GelbartT.WestC.HalloranC.BeutlerE.The human Nramp2 gene: characterization of the gene structure, alternative splicing, promoter region and polymorphismsBlood Cells Mol. Dis.241998199215ss9642100GruenheidS.Canonne-HergauxF.GauthierS.HackamD.J.GrinsteinS.GrosP.The iron transport protein NRAMP2 is an integral membrane glycoprotein that colocalizes with transferrin in recycling endosomesJ. Exp. Med.1891999831-41PaulineC.N.StevenH.SIFT: predicting amino acid changes that affect protein functionNucleic Acids Res.3120033812381412824425ChoiY.SimsG.E.MurphyS.MillerJ.R.ChanA.P.Predicting the functional effect of amino acid substitutions and indelsPloS ONE72012e4668823056405CapriottiE.FariselliP.CasadioR.I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structureNucleic Acid Res.332005306310RevaB.A.FinkelsteinA.V.SkolnickJ.What is the probability of a chance prediction of a protein structure with an RMSD of 6 Å?Fold. Des.319981411479565758George Priya DossC.NagasundaramN.ChakrabortyC.ChenL.ZhuH.Extrapolating the effect of deleterious nsSNPs in the binding adaptability of flavopiridol with CDK7 protein: a molecular dynamics approachHum. Genom.7201310FlemingM.RomandoM.SuM.GarrickL.GarrickM.AndrewsN.Nramp2 is mutated in the anemic Belgrade (b) rat: evidence of a role for Nramp2 in endosomal iron transportPNAS951998114811539448300PengY.ZhaolongL.JohnM.Loss of protein structure stability as a major causative factor in monogenic diseaseJ. Mol. Biol.353200545947316169011Acknowledgment

The authors are thankful to the Department of Biotechnology, Government of India, New Delhi (grant no. BT/PR3111/AAQ/1/474/2011) for providing financial support.

A. Interaction of native residue with vicinal residue (yellow dotted line) for SNP Y374C.

B. Interaction of altered residue with vicinal residue (yellow dotted line) for SNP Y374C.

A. Interaction of native residue with vicinal residue (yellow dotted line) for SNP Q385H.

B. Interaction of altered residue with vicinal residue (yellow dotted line) for SNP Q385H.

A. Interaction of native residue with vicinal residues (yellow dotted line) for SNP N492S.

B. Interaction of altered residue with vicinal residues (yellow dotted line) for SNP N492S.

A. Interaction of native residue with vicinal residues (yellow dotted line) for SNP A512V.

B. Interaction of altered residue with vicinal residues (yellow dotted line) for SNP A512V.

Functional validations of nsSNPs in Slc11a2 using SIFT, PANTHER-cSNP and PROVEAN.

Amino acid changeSIFT predictionSIFTscorePantherpredictionPantherscore(subPSEC)PdeleteriousPROVEANpredictionPROVEAN score(cutoff = − 2.5)
V108ITolerated0.07Does not align to HMMDoes not align to HMMNeutral− 0.503
I114TDeleterious0.01Does not align to HMMDoes not align to HMMDeleterious− 3.318
V334ATolerated0.86Tolerated− 1.5690.193Neutral− 1.325
T336KTolerated0.87Tolerated− 1.2680.150Neutral− 0.385
T343ATolerated1.00Tolerated− 2.194790.30891Neutral0.404
G369RDeleterious0Deleterious− 5.3980.917Deleterious− 7.858
A371STolerated0.12Tolerated− 2.5640.393Neutral− 2.409
Y374CDeleterious0Deleterious− 5.1970.9Deleterious− 8.419
A377VDeleterious0.01Deleterious− 3.864050.70351Deleterious− 3.835
Q385HDeleterious0Deleterious− 4.2370.776Deleterious− 4.899
M389VDeleterious0.02Tolerated− 2.7440.437Deleterious− 3.668
R465QTolerated0.39Tolerated− 2.1320.296Neutral− 0.311
W477LTolerated0.65Tolerated− 1.0240.122Neutral− 2.225
L484VTolerated0.55Tolerated− 1.5400.187Neutral− 1.12
S490FTolerated0.19Tolerated− 1.9730.264Deleterious− 3.106
N492SDeleterious0Deleterious− 4.6730.842Deleterious− 4.744
V497MDeleterious0.03Tolerated− 2.7340.434Neutral− 1.549
D502GTolerated0.34Tolerated− 1.7950.230Neutral− 1.343
V507ATolerated0.84Tolerated− 1.9800.265Neutral1.787
V510MTolerated0.16Tolerated− 2.5020.378Neutral− 1.494
A512VTolerated0.96Tolerated− 0.9270.118Neutral− 0.17
V517ITolerated0.90Tolerated− 1.0280.122Neutral− 0.231

Investigation of mutant protein stability by I-Mutant 2.0.

Protein symbolAmino acid changeAmino acid positionReliability index (RI)DDG value (kcal/mol)Stability prediction
Slc11a2V/I10870.86Decrease
I/T1143− 2.03Decrease
V/A3349− 2.13Decrease
T/K3364− 0.48Decrease
T/A3438− 2.05Decrease
G/R3692− 1.01Decrease
A/S3717− 0.81Decrease
Y/C37410.71Decrease
A/V37720.06Decrease
Q/H38530.27Decrease
M/V3896− 0.17Decrease
R/Q4658− 1.88Decrease
W/L47761.00Decrease
L/V4849− 1.07Decrease
S/F4900− 0.88Decrease
N/S4926− 0.88Decrease
V/M4978− 1.28Decrease
D/G5027− 2.14Decrease
V/A5079− 2.93Decrease
V/M5108− 2.69Decrease
A/V51210.76Decrease
V/I5179− 1.61Decrease

RMSD value and total energy after minimization of altered model.

Amino acid changeRMSD value of altered proteinTotal energy after energy minimization (kJ/mol)
Native protein− 5449.571
V108I0.053− 12,538.697
I114T0.009− 5508.284
V334A0.002− 5551.974
T336K0.001− 5373.680
T343A0.002− 5522.909
G369R0.001− 5682.057
A371S0.002− 5461.978
Y374C0.002− 5419.823
A377V0.000− 5492.021
Q385H0.001− 5153.123
M389V0.002− 5107.444
R465Q0.057− 5345.201
W477L0.033− 5294.021
L484V0.001− 5488.854
S490F0.001− 5314.866
N492S0.001− 5291.899
V497M0.002− 5469.048
D502G0.001− 5381.310
V507A0.002− 5449.402
V510M0.001− 2574.750
A512V0.001− 5448.742