Home   »   Science and Technology UPSC Notes   »   Genome Sequencing

Genome Sequencing, Methods, Challenges and its Projects

Context: A recent initiative by the pharmaceutical companies Regeneron Genetics Center, AstraZeneca, Novo Nordisk, and Roche, along with the Meharry Medical College, Tennessee, plans to sequence five lakh individuals of African ancestry through the Diversity Human Genome Initiative.

Understanding Genome

  • A genome is all of the genetic material in an organism.
  • It is made of DNA (or RNA in some viruses) and includes genes and other elements that control the activity of those genes.
  • Genome v/s Gene: Genome is the entire set of genetic material or DNA, while gene is a specific segment of DNA that codes for a particular protein or RNA molecule.

We’re now on WhatsAppClick to Join

Human genome

  • The human genome is the entire set of deoxyribonucleic acid (DNA) residing in the nucleus of every cell of each human body.
  • The DNA consists of a double-stranded molecule built up by four bases – adenine (A), cytosine (C), guanine (G) and thymine (T). Every base on one strand pairs with a complementary base on the other strand (A with T and C with G).
  • In all, the genome is made up of approximately 3.05 billion such base pairs.

What is genome sequencing?

  • Genome sequencing is figuring out the order of DNA nucleotides, or bases, in a genome—the order of Adenine, Cytosine, Guanines, and Thymine that make up an organism’s DNA.
  • While the sequence or order of base pairs is identical in all humans, compared to that of a mouse or another species, there are differences in the genome of every human being that make them unique.

Methods of Genome Sequencing

  • Major genome sequencing methods are the clone-by-clone method and the whole genome shotgun sequencing.

The clone-by-clone method:

  • It involves constructing a physical map of the genome by cutting the DNA into small fragments, cloning these fragments into bacterial or yeast artificial chromosomes (BACs or YACs), and then mapping the location of each fragment within the genome.
  • The DNA in each BAC or YAC clone is then sequenced, and the sequences are assembled to reconstruct the entire genome.
  • It works well for larger genomes like eukaryotic genomes, but it requires a high-density genome map.

The whole genome shotgun sequencing method:

  • It involves randomly fragmenting the genome into small pieces, sequencing each piece, and then using computational algorithms to assemble the sequences into a complete genome.
  • This method was made possible by the development of high-throughput sequencing technologies, such as next-generation sequencing, which can generate large amounts of sequence data rapidly and inexpensively.
  • It is a faster method of sequencing but is not suitable for larger genomes like eukaryotic genomes as they have several repetitive DNA sequences in which the assembling process is challenging.

Challenges of Whole Genome Sequencing

  • Disclosing and managing incidental and secondary findings: Whole genome sequencing can reveal information about an individual’s genetic predispositions or conditions unrelated to the initial reason for sequencing.
  • Privacy concerns: Whole genome sequencing involves gathering a vast amount of sensitive genetic information, which raises privacy concerns.
  • Psychological impact on families: Learning about genetic risks or conditions through sequencing results can have a significant psychological impact on families.
  • Equity in access and utilization: The availability and affordability of newborn whole genome sequencing may not be equal for all populations, raising concerns about equitable access to this technology.
  • Ethical considerations: The use of whole genome sequencing in newborns raises broader ethical considerations, including issues of justice and fairness in the distribution of benefits, potential harm from misinterpretation of results, and the appropriate use of genetic information.

Applications of genome sequencing

  • Disease risk assessment: Genomic sequencing can identify genetic variants associated with increased risk of certain diseases, such as breast cancer or Alzheimer’s disease.
    • Genome sequencing has been used to evaluate rare disorders, preconditions for disorders, even cancer from the viewpoint of genetics, rather than as diseases of certain organs.
    • Nearly 10,000 diseases — including cystic fibrosis and thalassemia — are known to be the result of a single gene malfunctioning.
  • Ancestry tracing: Genomic sequencing can provide information about an individual’s ancestry and genetic heritage.
  • Cancer diagnosis: Liquid biopsies, where a small amount of blood is examined for DNA markers, could help diagnose cancer long before symptoms appear.
  • Halting disease transmission: In public health, however, sequencing has been used to read the codes of viruses—
    • For Example: One of its first practical usages was in 2014, when a group of scientists from M.I.T and Harvard sequenced samples of Ebola from infected African patients to show how genomic data of viruses could reveal hidden pathways of transmission, which might then be halted, thus slowing or even preventing the infection’s spread.
    • Also, during the COVID-19 pandemic, genomic sequencing helped to identify the virus, track its spread, identify new variants, understand how the virus spreads, and develop vaccines.
  • Pharmacogenomics: Genomic sequencing can identify genetic variants that affect an individual’s response to certain medications.

Concerns associated with genomic sequencing

  • Privacy concerns: As genomic sequencing involves the collection and analysis of genetic data, there are concerns about privacy and the potential misuse of this information.
  • High Costs: Genomic sequencing can be expensive, which can limit access to this technology for some patients or researchers.
  • Misuse: There is a risk that genomic sequencing data could be used for nefarious purposes, such as genetic discrimination or targeting of individuals based on their genetic profile.
  • Psychological impact: Knowing one’s genetic risk factors can have a psychological impact, including increased anxiety or depression. Individuals may also face stigma or discrimination based on their genetic risk factors.
  • Data interpretation and accuracy: Errors in data interpretation or analysis could lead to incorrect diagnoses or treatment decisions.

Global Genome-Sequencing Programmes

Country/Region Initiative Description
Iceland The deCODE Initiative
  • Began in 1996 by deCODE genomics.
  • Enrolled most of the Icelandic population.
  • Enhanced disease genetics understanding and risk assessment.
  • Advanced methods and discussions on genomic data handling and bioethics.
United Kingdom 100K Genome Project
  • Aimed to integrate genomics into routine healthcare.
  • The UK’s 100K initiative showed 18.5% actionable data for direct healthcare benefits.
United States AllofUS Programme Seeks to collect genetic information from a million people, funded by the National Institutes of Health.
European Union 1+ Million Genomes Initiative Recently launched to gather over a million genomes.
Africa Three Million African Genomes An ongoing project to sequence three million African genomes.
United Arab Emirates Emirati Genome Programme Plans to sequence over a million samples, with more than 400,000 already completed.

About the Genome India Project

  • It is a government-led initiative launched in 2019 that aims to sequence the genomes of over 10,000 Indians from diverse socio-economic, geographical and linguistic backgrounds to create a comprehensive genomic database of the Indian population.
  • The project involves about 20 institutions across India and with analysis and coordination done by the Centre for Brain Research at IISc, Bangalore.

Significance of the Genome India Project:

  • India’s 1.3 billion-strong population consists of over 4,600 population groups, many of which are endogamous. Thus, the Indian population harbours distinct variations, with disease-causing mutations often amplified within some of these groups. Creating a database of Indian genomes allows researchers to learn about genetic variants unique to India’s population groups and use that to customise drugs and therapies.
  • The project will also help “unravel the genetic underpinnings of chronic diseases currently on the rise in India, (for) example, diabetes, hypertension, cardiovascular diseases, neurodegenerative disorders, and cancer”.

Other Initiatives by Indian Government

Initiative

Details

Indian Genome Variation Consortium It is a network of researchers and institutions that are working to map the genetic variation in the Indian population and understand its implications for health and disease.
Genome Valley 2.0 Genome Valley 2.0 is a project aimed at creating a genomic database of the Indian population and developing personalized healthcare solutions. It is a joint initiative between the government of Telangana and several research institutions and companies.
IndiGen The IndiGen programme aims to undertake whole genome sequencing of thousands of individuals representing diverse ethnic groups from India. The objective is to enable genetic epidemiology and develop public health technologies applications using population genome data.

Sharing is caring!

Leave a comment

Your email address will not be published. Required fields are marked *