LexicMap: efficient sequence alignment against millions of prokaryotic genomes​
GitHub Toggle Dark/Light/Auto mode Toggle Dark/Light/Auto mode Toggle Dark/Light/Auto mode Back to homepage

Motivation

  1. BLASTN is not able to scale to millions of bacterial genomes, it’s slow and has a high memory occupation. For example, it requires >2000 GB for alignment a 2-kb gene sequence against all the 2.34 millions of prokaryotics genomes in Genbank and RefSeq.

  2. Large-scale sequence searching tools only return which genomes a query matches (color), but they can’t return positional information.