Are there recommended pipelines/methods for dereplicating LARGE custom databases? #43

MicroBTM · 2025-01-30T17:44:26Z

MicroBTM
Jan 30, 2025

I was hoping to find the process for dereplicating the provided sylph databases (e.g. from the GTDB). In lieu of that, are there any recommended pipelines/methods for dereplicating large custom databases? I'm not a bioinformatician, but I can't make use of the segmented genomes after profiling with gtdb-r220-c200-dbv1.syldb.

Answered by bluenote-1577

Jan 30, 2025

@bryantmurphy The GTDB-R220 database is the species-level dereplicated genomes from GTDB. They have a specific pipeline for dereplication, see https://academic.oup.com/nar/article/50/D1/D785/6370255

For dereplicating large custom databases, see https://github.com/MrOlm/drep (quite popular) or https://github.com/raufs/skDER for possible tools.

View full answer

bluenote-1577 · 2025-01-30T18:16:30Z

bluenote-1577
Jan 30, 2025
Maintainer

@bryantmurphy The GTDB-R220 database is the species-level dereplicated genomes from GTDB. They have a specific pipeline for dereplication, see https://academic.oup.com/nar/article/50/D1/D785/6370255

For dereplicating large custom databases, see https://github.com/MrOlm/drep (quite popular) or https://github.com/raufs/skDER for possible tools.

0 replies

MicroBTM · 2025-01-30T19:54:08Z

MicroBTM
Jan 30, 2025
Author

1 reply

bluenote-1577 Jan 30, 2025
Maintainer

Yes, this is exactly how gtdb-r220-c200... was made. See https://sylph-docs.github.io/taxonomic-profiling-tutorial/: Step 1. for how the provided sylph database was created.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Are there recommended pipelines/methods for dereplicating LARGE custom databases? #43

{{title}}

Replies: 2 comments 1 reply

{{title}}

{{title}}

{{title}}

Select a reply

Are there recommended pipelines/methods for dereplicating LARGE custom databases? #43

MicroBTM Jan 30, 2025

Replies: 2 comments · 1 reply

bluenote-1577 Jan 30, 2025 Maintainer

MicroBTM Jan 30, 2025 Author

bluenote-1577 Jan 30, 2025 Maintainer

MicroBTM
Jan 30, 2025

Replies: 2 comments 1 reply

bluenote-1577
Jan 30, 2025
Maintainer

MicroBTM
Jan 30, 2025
Author

bluenote-1577 Jan 30, 2025
Maintainer