CSIR IndiGenome Resource

CSIR IndiGenome resource of 1029 Indian genomes provides a compendium of genetic variants representing the contemporary Indian population.

Results from the extensive computation analysis of the 1029 sequenced genomes from India carried out by CSIR constituent labs, CSIR-Institute of Genomics and Integrative Biology (IGIB), Delhi and CSIR-Centre for Cellular and Molecular Biology (CCMB), Hyderabad were published in the scientific journal, Nucleic Acid Research recently.

  • The analysis led to the identification of 55,898,122 single nucleotide variants in the India genome dataset.
  • Comparisons with the global genome datasets revealed that 18,016,257 (32.23%) variants were unique and found only in the samples sequenced from India. This emphasizes the need for an India centric population genomic initiative.
  • India is the second largest country in terms of population density with more than 1.3 billion individuals encompassing 17% of the world population.
  • Despite having this rich genetic diversity, India has been under-represented in global genome studies.
  • Further, the population architecture of India has resulted in high prevalence of recessive alleles. In the absence of large-scale whole genome studies from India, these population-specific genetic variants are not adequately captured and catalogued in global medical literature.

IndiGen Program

  • In order to fill the gap of whole genome sequences from different populations in India, CSIR initiated the IndiGen Program in April 2019.
  • Under this program, the whole genome sequencing of 1029 self-declared healthy Indians drawn from across the country has been completed.
  • This has enabled benchmarking the scalability of genome sequencing at population scale in a defined timeline.

The Minister of S&T, ES and MoHFW,  Dr Harsh Vardhan announced the completion of the IndiGen sequence generation efforts on 25th October 2019.

IndiGenomes Data Resource

  • The current IndiGenomes data resource provides a compendium of genetic variants representing the contemporary Indian population with an objective to classify variants involved in mendelian disorders and improve precision medicine outcomes.
  • The resource can also enable the identification of markers for carrier screening, variations causing genetic diseases, prevention of adverse events and provide better diagnosis and optimal therapy through mining data of clinically actionable pharmacogenetic variants.
  • The phased data will allow researchers to build Indian-specific reference genome dataset and efficiently impute haplotype information.
  • This resource can provide useful insights for clinicians and researchers in comprehending genetics not only at the population level but at the individual level.

Related Readings

Image by Pete Linforth from Pixabay

Facebook
Twitter
YouTube