Karen Miga

Last updated
Karen H. Miga
Born
Karen Elizabeth Hayden
Known forTelomere to Telomere Project
Scientific career
Institutions University of California, Santa Cruz
Thesis A Genomic Definition of Centromeres in Complex Genomes  (2011)
Doctoral advisor Huntington Willard
Website www.migalab.com

Karen Elizabeth Hayden Miga is an American geneticist who co-leads the Telomere-to-Telomore (T2T) consortium that released fully complete assembly of the human genome in March 2022. [1] She is an assistant professor of biomolecular engineering at the University of California, Santa Cruz and Associate Director of Human Pangenomics at the UC Santa Cruz Genomics Institute. She was named as "One to Watch" in the 2020 Nature's 10 and one of Time 100’s most influential people of 2022. [2]

Contents

Research and career

In 2012, Miga joined the laboratory of David Haussler at the University of California, Santa Cruz. [3] At UCSC she combined computational and experimental approaches. [4] There she leads the telomere-to-telomere (T2T) consortium, [4] [5] a community based effort that seeks to fully sequence and assemble the human genome. [5] Her research efforts make use of long-read sequencing strategies. [5] [6] She makes use of the Oxford Nanopore Technologies MinION sequencer, which analyses DNA by detecting changes in current flow when DNA passes through nanopores in a membrane. [6]

Miga is the director of the Human Pangenome Production Center that seeks to contribute to the next human pangenome reference map through the creation of 350 T2T diploid genomes. [5] [7] This map will support the development of personalized therapeutics. [5]

In 2022, Miga participated in the closing plenary session of the Clinton Global Initiative meeting, [8] where she discussed the future of equitable genomics research with former President Bill Clinton, U2 lead singer Bono, and Director-General of the World Health Organization (WHO) Tedros Ghebreyesus. 2023, Karen Miga was named a 2023 Searle Scholar [9] to study uncharted heterochromatin regions of the human genome.

Selected publications

Related Research Articles

<span class="mw-page-title-main">Human genome</span> Complete set of nucleic acid sequences for humans

The human genome is a complete set of nucleic acid sequences for humans, encoded as DNA within the 23 chromosome pairs in cell nuclei and in a small DNA molecule found within individual mitochondria. These are usually treated separately as the nuclear genome and the mitochondrial genome. Human genomes include both protein-coding DNA sequences and various types of DNA that does not encode proteins. The latter is a diverse category that includes DNA coding for non-translated RNA, such as that for ribosomal RNA, transfer RNA, ribozymes, small nuclear RNAs, and several types of regulatory RNAs. It also includes promoters and their associated gene-regulatory elements, DNA playing structural and replicatory roles, such as scaffolding regions, telomeres, centromeres, and origins of replication, plus large numbers of transposable elements, inserted viral DNA, non-functional pseudogenes and simple, highly repetitive sequences. Introns make up a large percentage of non-coding DNA. Some of this non-coding DNA is non-functional junk DNA, such as pseudogenes, but there is no firm consensus on the total amount of junk DNA.

<span class="mw-page-title-main">Genomics</span> Discipline in genetics

Genomics is an interdisciplinary field of biology focusing on the structure, function, evolution, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes as well as its hierarchical, three-dimensional structural configuration. In contrast to genetics, which refers to the study of individual genes and their roles in inheritance, genomics aims at the collective characterization and quantification of all of an organism's genes, their interrelations and influence on the organism. Genes may direct the production of proteins with the assistance of enzymes and messenger molecules. In turn, proteins make up body structures such as organs and tissues as well as control chemical reactions and carry signals between cells. Genomics also involves the sequencing and analysis of genomes through uses of high throughput DNA sequencing and bioinformatics to assemble and analyze the function and structure of entire genomes. Advances in genomics have triggered a revolution in discovery-based research and systems biology to facilitate understanding of even the most complex biological systems such as the brain.

<span class="mw-page-title-main">DNA sequencer</span> A scientific instrument used to automate the DNA sequencing process

A DNA sequencer is a scientific instrument used to automate the DNA sequencing process. Given a sample of DNA, a DNA sequencer is used to determine the order of the four bases: G (guanine), C (cytosine), A (adenine) and T (thymine). This is then reported as a text string, called a read. Some DNA sequencers can be also considered optical instruments as they analyze light signals originating from fluorochromes attached to nucleotides.

<span class="mw-page-title-main">DNA sequencing</span> Process of determining the nucleic acid sequence

DNA sequencing is the process of determining the nucleic acid sequence – the order of nucleotides in DNA. It includes any method or technology that is used to determine the order of the four bases: adenine, guanine, cytosine, and thymine. The advent of rapid DNA sequencing methods has greatly accelerated biological and medical research and discovery.

<span class="mw-page-title-main">National Human Genome Research Institute</span> Institute of the National Institutes of Health, located in Bethesda, Maryland, US

The National Human Genome Research Institute (NHGRI) is an institute of the National Institutes of Health, located in Bethesda, Maryland.

<span class="mw-page-title-main">Jim Kent</span> American research scientist and computer programmer

William James Kent is an American research scientist and computer programmer. He has been a contributor to genome database projects and the 2003 winner of the Benjamin Franklin Award.

The Baskin School of Engineering, known simply as Baskin Engineering, is the school of engineering at the University of California, Santa Cruz. It consists of six departments: Applied Mathematics, Biomolecular Engineering, Computational Media, Computer Science and Engineering, Electrical and Computer Engineering, and Statistics.

<span class="mw-page-title-main">Human Genome Project</span> Human genome sequencing programme

The Human Genome Project (HGP) was an international scientific research project with the goal of determining the base pairs that make up human DNA, and of identifying, mapping and sequencing all of the genes of the human genome from both a physical and a functional standpoint. It started in 1990 and was completed in 2003. It remains the world's largest collaborative biological project. Planning for the project started after it was adopted in 1984 by the US government, and it officially launched in 1990. It was declared complete on April 14, 2003, and included about 92% of the genome. Level "complete genome" was achieved in May 2021, with a remaining only 0.3% bases covered by potential issues. The final gapless assembly was finished in January 2022.

<span class="mw-page-title-main">David Haussler</span> American bioinformatician

David Haussler is an American bioinformatician known for his work leading the team that assembled the first human genome sequence in the race to complete the Human Genome Project and subsequently for comparative genome analysis that deepens understanding the molecular function and evolution of the genome.

<span class="mw-page-title-main">Whole genome sequencing</span> Determining nearly the entirety of the DNA sequence of an organisms genome at a single time

Whole genome sequencing (WGS), also known as full genome sequencing, complete genome sequencing, or entire genome sequencing, is the process of determining the entirety, or nearly the entirety, of the DNA sequence of an organism's genome at a single time. This entails sequencing all of an organism's chromosomal DNA as well as DNA contained in the mitochondria and, for plants, in the chloroplast.

<span class="mw-page-title-main">Pan-genome</span> All genes of all strains in a clade

In the fields of molecular biology and genetics, a pan-genome is the entire set of genes from all strains within a clade. More generally, it is the union of all the genomes of a clade. The pan-genome can be broken down into a "core pangenome" that contains genes present in all individuals, a "shell pangenome" that contains genes present in two or more strains, and a "cloud pangenome" that contains genes only found in a single strain. Some authors also refer to the cloud genome as "accessory genome" containing 'dispensable' genes present in a subset of the strains and strain-specific genes. Note that the use of the term 'dispensable' has been questioned, at least in plant genomes, as accessory genes play "an important role in genome evolution and in the complex interplay between the genome and the environment". The field of study of pangenomes is called pangenomics.

<span class="mw-page-title-main">Reference genome</span> Digital nucleic acid sequence database

A reference genome is a digital nucleic acid sequence database, assembled by scientists as a representative example of the set of genes in one idealized individual organism of a species. As they are assembled from the sequencing of DNA from a number of individual donors, reference genomes do not accurately represent the set of genes of any single individual organism. Instead, a reference provides a haploid mosaic of different DNA sequences from each donor. For example, one of the most recent human reference genomes, assembly GRCh38/hg38, is derived from >60 genomic clone libraries. There are reference genomes for multiple species of viruses, bacteria, fungus, plants, and animals. Reference genomes are typically used as a guide on which new genomes are built, enabling them to be assembled much more quickly and cheaply than the initial Human Genome Project. Reference genomes can be accessed online at several locations, using dedicated browsers such as Ensembl or UCSC Genome Browser.

David Wilson Deamer is an American biologist and Research Professor of Biomolecular Engineering at the University of California, Santa Cruz. Deamer has made significant contributions to the field of membrane biophysics. His work led to a novel method of DNA sequencing and a more complete understanding of the role of membranes in the origin of life.

Third-generation sequencing is a class of DNA sequencing methods currently under active development.

<span class="mw-page-title-main">Angela N. Brooks</span> American biologist and geneticist

Angela Brooks is an Assistant Professor of Biomolecular Engineering at University of California, Santa Cruz. She is a member of the Genomics Institute.

Medical genetic ethics is a field in which the ethics of medical genetics is evaluated. Like the other field of medicine, medical genetics also face ethical issues.

The UC Santa Cruz Genomics Institute is a public research institution based in the Jack Baskin School of Engineering at the University of California, Santa Cruz. The Genomics Institute's scientists and engineers work on a variety of projects related to genome sequencing, computational biology, large data analytics, and data sharing. The institute also maintains a number of software tools used by researchers worldwide, including the UCSC Genome Browser, Dockstore, and the Xena Browser.

<span class="mw-page-title-main">COVID-19 Genomics UK Consortium</span> British genomics research consortium

The COVID-19 Genomics UK (COG-UK) consortium was a group of academic institutions and public health agencies in the United Kingdom created in April 2020 to collect, sequence and analyse genomes of SARS-CoV-2 at scale, as part of COVID-19 pandemic response.

<span class="mw-page-title-main">Arang Rhie</span> South Korean bioinformatician

Arang Rhie is a South Korean bioinformatician serving as a staff scientist in the genome informatics section at the National Human Genome Research Institute.

Circular consensus sequencing (CCS) is a DNA sequencing method that is used in conjunction with single-molecule real-time sequencing to yield highly accurate long-read sequencing datasets with read lengths averaging 15–25 kb with median accuracy greater than 99.9%. These long reads, which are created via the formation of consensus sequencing obtained from multiple passes on a single DNA molecule, can be used to improve results for complex applications such as single nucleotide and structural variant detection, genome assembly, assembly of difficult polyploid or highly repetitive genomes, and assembly of metagenomes.

References

  1. "Researchers generate the first complete, gapless sequence of a human genome". Genome.gov. Retrieved 2023-02-07.
  2. Doudna, Jennifer (May 23, 2022). "The 100 Most Influential People of 2022: Michael Schatz, Karen Miga, Evan Eichler, and Adam Phillippy". Time. Retrieved 2023-10-18.
  3. "Speaker - Cell Symposia: 20 Years of the Human Genome: from Sequence to Substance". www.cell-symposia.com. Retrieved 2020-12-20.
  4. 1 2 Isenberg, Sara (2019-10-23). "Are we there yet? Karen Miga on the Next Era of Genomics". Santa Cruz Tech Beat. Retrieved 2020-12-20.
  5. 1 2 3 4 5 "Karen Miga | VIB Conferences". www.vibconferences.be. Retrieved 2020-12-20.
  6. 1 2 "Scientists achieve first complete assembly of human X chromosome". EurekAlert!. Retrieved 2020-12-20.
  7. "Dawn of good, fast and cheap human genome assembly". ScienceDaily. Retrieved 2020-12-20.
  8. "Karen Miga discusses the future of equitable genomics research with Clinton, Bono" . news.ucsc.edu. Retrieved 2023-11-1
  9. SEARLE SCHOLARS PROGRAM NAMES 15 SCIENTISTS AS SEARLE SCHOLARS FOR 2023. searlescholars.org. Retrieved 2023-11-1.