"Indexing All Life's Known Biological #Sequences" https://www.biorxiv.org/content/10.1101/2020.10.01.322164v3
"In this work, we take advantage of recently developed, very efficient data structures and algorithms for representing sequence sets. We make Petabases of DNA sequences across all clades of life, including viruses, bacteria, fungi, plants, animals, and humans, fully searchable and make the indexes available to the research community"
The amount of biological sequencing data available in public repositories is growing exponentially, forming an invaluable biomedical research resource.
bioRxiv