Amjad Ali, Xiomar S. Soares, Eudes Barbosa, Anderson R. Santos, Debmaliyah Barkh, Syeda M. Bakhtiar, Syed S. Hassan, David W. Ussery, Arthur Silva, Anderson Miyoshi, and Vasco Azevedo
Next-generation sequencing (NGS) has enabled the generation of complete genome sequences of pathogenic and commercially important organisms in a short time and at minimal cost. Computational comparative genomics is essential given that we sequence thousands of organisms every day, but our subsequent knowledge is still very limited. However, genomic information from a single genome is not enough to provide insights into the lifestyle and extended view of the gene pool of a species. Multiple genomes could enrich our understanding of the relatedness and variation of organisms. Therefore, comparative genomic analysis remains a powerful tool to identify orthologous genes across species, the presence and absence of specific genes, evolutionary signals, and
candidate regions associated with pathogenicity. In addition, pan-genome strategies together with subtractive genomics help to highlight inter- and intra-species relationships, conserved core, and pan-genome for characterization of virulence factors, drug targets, and vaccine candidates. In this article, we provide an overview of the background of microbial comparative genomics: sequencing technologies, alignment tools, annotation pipelines, databases and resources, visualization and comparative genomics tools, and strategies. Finally, we present comparative genomic and functional analyses based on insights and recent discoveries in the genus Corynebacterium.