Identification of Recurrent Fusion Genes in Esophageal Squamous Cell Carcinoma Patients from Kazakhstan
Main Article Content
Authors
Aibek Karimov
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Rakhimbek Bektayev
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Ruslan Kalendar
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Aigul Sharip
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Saule Rakhimova
Laboratory of Genomic and Personalized Medicine, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Nazerke Satvaldina
Laboratory of Genomic and Personalized Medicine, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Ainur Seisenova
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Ainur Akilzhanova
Laboratory of Genomic and Personalized Medicine, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Ulykbek Kairov
Laboratory of Bioinformatics and Systems Biology, Center for Life Sciences, National Laboratory Astana, Nazarbayev University, Kazakhstan
Abstract
Background: Esophageal squamous cell carcinoma (ESCC) is one of the most lethal cancers in Central Asia, yet its molecular landscape in Kazakhstan remains poorly explored. However, the fusion-gene landscape in Kazakhstan remains undercharacterized. Fusion transcripts can serve as diagnostic biomarkers or therapeutic targets, motivating a focused survey in a local cohort.
Materials and methods: We analyzed paired tumor and adjacent normal tissues from ESCC patients collected in Kazakhstan (34 normal, 33 tumor samples). Tumor specimens were obtained from patients who had undergone Ivor-Lewis esophagectomy without receiving prior chemotherapy or radiotherapy. Total RNA was sequenced (paired-end), reads aligned to GRCh38 (Gencode v44 / CTAT genome library). Fusion calling was performed with two orthogonal tools (STAR-Fusion v1.13.0 and Arriba v2.4.0). Callsets were intersected to prioritize concordant events; candidate junctions were manually inspected in IGV to remove artifacts. Primers for prioritized fusions were designed and PCR followed by Sanger sequencing is underway for orthogonal validation.
Results: Intersection of STAR-Fusion and Arriba reduced noise and produced a concise set of high-confidence fusion candidates detected in multiple tumors and absent from matched normals. Manual IGV review confirmed strong split- and spanning-read support at exon boundaries for several recurrent events. Notable recurrent partners include MAP4K5-L2HGDH, ADAMTS2-ENSG00000253652, RAB8A-CIB3, BAZ2B-WDSUB1, HUWE1-SUPT3H, and PLEKHA5-KRAS.
Conclusion: Using complementary fusion callers and rigorous manual curation, we generated an assay-ready shortlist of recurrent fusion transcripts in Kazakh ESCC. Pending PCR/Sanger confirmation, these candidates warrant follow-up as potential biomarkers or mechanistic leads for ESCC in Central Asia.
Acknowledgement:
We thank all the participants in this study. This research has been funded by the Science Committee of the Ministry of Science and Higher Education of the Republic of Kazakhstan (Grants No. AP23490594, BR18574184, BR24993023, BR24992841, BR27199879), Nazarbayev University funding CRP grants 021220CRP2222, 211123CRP1608.
Key words: esophageal squamous cell carcinoma, gene fusions, RNA-seq, Kazakhstan