Large genome result with the ultra-long data model #809

Open
ld9866 opened this issue Apr 25, 2025 · 7 comments

Comments

@ld9866

ld9866 commented Apr 25, 2025

Dear developer:

I noticed that the latest version of hifiasm can assemble a genome using only ONT data, but the results are not as good as we expected.

We used about 100X of ONT data (library: 100 kb). The pig genome is about 2.6 Gb in size, but the assembly we obtained was 4.2 Gb and had nearly 2,000 contigs. What could be the reason for this?

Best regards,
Dong

@chhylp123
Owner

What is the N50? Are both haplotypes 4.2 Gb?

@ld9866
Author

ld9866 commented Apr 28, 2025

The contig N50 is about 90 Mb, and the number of contigs is 2775.

When we used the HiFi plus ultra-long mode, the assembly reached 140 Mb with about 130 contigs. I tried the ONT-only mode hoping to get more complete sequences, but it seems that using ONT alone does not work as well.

The haplotypes are about 4.3 Gb and 3.5 Gb, respectively.
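The figures quoted in this thread (total assembly size, contig count, contig N50) can be recomputed directly from an assembly graph. The following is a minimal sketch, assuming the assembly is available as a GFA file whose `S` lines carry either the sequence itself or an `LN:i:` length tag. The function names `n50`, `gfa_contig_lengths`, and `summarize` are illustrative and are not part of hifiasm.

```python
import sys


def n50(lengths):
    """Return the N50 of a collection of contig lengths (0 if empty).

    N50 is the length of the contig at which the running total of
    lengths, sorted from longest to shortest, first reaches half of
    the total assembly size.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0


def gfa_contig_lengths(path):
    """Collect segment lengths from the S lines of a GFA file."""
    lengths = []
    with open(path) as handle:
        for line in handle:
            if not line.startswith("S\t"):
                continue
            fields = line.rstrip("\n").split("\t")
            sequence = fields[2]
            if sequence != "*":
                lengths.append(len(sequence))
                continue
            # Sequence omitted: fall back to the optional LN:i: tag.
            for tag in fields[3:]:
                if tag.startswith("LN:i:"):
                    lengths.append(int(tag[5:]))
                    break
    return lengths


def summarize(path):
    """Return (total size, contig count, N50) for one GFA file."""
    lengths = gfa_contig_lengths(path)
    return sum(lengths), len(lengths), n50(lengths)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage (the path is supplied by the user): python stats.py assembly.gfa
    total, count, value = summarize(sys.argv[1])
    print(f"total={total} contigs={count} N50={value}")
```

Running this on each haplotype graph separately makes it easy to compare the per-haplotype sizes against the expected 2.6 Gb.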

@chhylp123
Owner

Could you please show me the command line you were using, as well as the log file? Thank you so much.

@ld9866
Author

ld9866 commented Apr 28, 2025

Command: /home/lidong/Software/hifiasm-0.25.0/hifiasm -o bm2.ont.only --ont -t 256 BM2.final.combine.fastq.gz

bm2.ont.log

@ld9866
Author

ld9866 commented Apr 28, 2025

I have uploaded the command and the log file. Can you tell what is going on?

@chhylp123
Owner

Got it. Thanks! Are you using R10 data with SUP base calling?

@ld9866
Author

ld9866 commented Apr 28, 2025

Yes, it is R10 data with SUP base calling.
