Chromosome-scale assembly of Kazakh individuals using modern genomics approaches
Main Article Content
Authors
Aiziya Paizulla
School of Sciences and Humanities, Nazarbayev University, Kazakhstan
Kuanysh Sarkytbayev
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Anel Ordabayeva
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Asset Daniyarov
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Askhat Molkenov
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Diana Samatkyzy
Laboratory of Genomic and Personalized Medicine, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Saule Rakhimova
Laboratory of Genomic and Personalized Medicine, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Ainur Akilzhanova
Laboratory of Genomic and Personalized Medicine, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Ulykbek Kairov
School of Sciences and Humanities, Nazarbayev University, Kazakhstan.
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan.
Abstract
Background: Population-specific reference genomes are crucial for accurate genomic analysis and precision medicine applications. Currently, there is a lack of high-quality chromosome- level reference genomes specifically representing the Kazakh population, which limits the effectiveness of biomedical investigations and genomic studies in this demographic. This project aims to address this gap by creating a comprehensive chromosomal-level assembly of Kazakh individuals' whole genomes using advanced genomic technologies, including next- generation sequencing (NGS, Illumina), third-generation sequencing (TGS, Oxford Nanopore), optical genome mapping, and Hi-C chromosomal conformation capture.
Materials and methods: We generated chromosome-scale assemblies of Kazakh individuals by integrating multiple modern genomic technologies. Whole-genome sequencing was performed using both long-read and short-read platforms. The assemblies were polished with short-read data, scaffolded with Bionano optical genome maps, and organized at the chromosomal level using Hi-C chromosomal conformation data. Quality assessment was conducted at each stage, and comparative analyses will be performed against other global reference genomes.
Results: The project delivered high-quality, chromosome-level assemblies for individuals of the Kazakh population. These assemblies provide insights into unique genetic features, improve the accuracy of population-specific genomic analyses, and will be deposited in open- access repositories for use in biomedical and bioinformatics research worldwide.
Acknowledgement: This study is supported by a grant AP23490594 from the Committee of Science of the Ministry of Science and Higher Education of the Republic of Kazakhstan.
Keywords: Chromosome-scale assembly; Kazakh genome; long-read sequencing; Bionano optical mapping; Hi-C; de novo assembly; population genomics.