Contamination in Reference Sequence Databases: Time for Divide-and-Rule Tactics
Contaminating sequences in public genome databases is a pervasive issue with potentially far-reaching consequences.This problem has attracted much attention in the recent literature and many different tools are now available to detect contaminants.Although these methods are based on diverse algorithms that can sometimes produce widely different est