Browse Articles

Article|13 Jun 2023|OPEN
quarTeT: a telomere-to-telomere toolkit for gap-free genome assembly and centromeric repeat identification 
Yunzhi Lin1,2 ,† , Chen Ye3,4 ,† , Xingzhu Li2 , Qinyao Chen2 , Ying Wu2 , Feng Zhang2 , Rui Pan2 , Sijia Zhang2 , Shuxia Chen3,4 , Xu Wang5 , Shuo Cao5 and Yingzhen Wang2 , Yi Yue3,4 , , Yongsheng Liu1,2,4 , , Junyang Yue,2,5 ,
1College of Life Science, Sichuan University, Chengdu, Sichuan 610064, China
2School of Horticulture, Anhui Agricultural University, Hefei, Anhui 230036, China
3School of Information and Computer, Anhui Agricultural University, Hefei, Anhui 230036, China
4State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, Anhui 230036, China
5Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, Guangdong 518124, China
6Key Laboratory of Horticultural Plant Biology Ministry of Education, Huazhong Agricultural University, Wuhan, Hubei 430070, China
*Corresponding author. E-mail: yyyue@ahau.edu.cn,liuyongsheng1122@ahau.edu.cn,yuejy@ahau.edu.cn
Both authors contributed equally to the study.

Horticulture Research 10,
Article number: uhad127 (2023)
doi: https://doi.org/10.1093/hr/uhad127
Views: 145

Received: 07 Apr 2023
Accepted: 05 Jun 2023
Published online: 13 Jun 2023

Abstract

A high-quality genome is the basis for studies on functional, evolutionary, and comparative genomics. The majority of attention has been paid to the solution of complex chromosome structures and highly repetitive sequences, along with the emergence of a new ‘telomere-to-telomere (T2T) assembly’ era. However, the bioinformatic tools for the automatic construction and/or characterization of T2T genome are limited. Here, we developed a user-friendly web toolkit, quarTeT, which currently includes four modules: AssemblyMapper, GapFiller, TeloExplorer, and CentroMiner. First, AssemblyMapper is designed to assemble phased contigs into the chromosome-level genome by referring to a closely related genome. Then, GapFiller would endeavor to fill all unclosed gaps in a given genome with the aid of additional ultra-long sequences. Finally, TeloExplorer and CentroMiner are applied to identify candidate telomere and centromere as well as their localizations on each chromosome. These four modules can be used alone or in combination with each other for T2T genome assembly and characterization. As a case study, by adopting the entire modular functions of quarTeT, we have achieved the Actinidia chinensis genome assembly that is of a quality comparable to the reported genome Hongyang v4.0, which was assembled with the addition of manual handling. Further evaluation of CentroMiner by searching centromeres in Arabidopsis thaliana and Oryza sativa genomes showed that quarTeT is capable of identifying all the centromeric regions that have been previously detected by experimental methods. Collectively, quarTeT is an efficient toolkit for studies of large-scale T2T genomes and can be accessed at http://www.atcgn.com:8080/quarTeT/home.html without registration.