SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation
Install
GitHub: shenwei356/seqkit
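The install command from the original page did not survive, so what follows is only a minimal sketch. It checks whether `seqkit` is already on the `PATH` and otherwise prints one common installation route. That route assumes a conda setup with the bioconda channel, which may not be the method the author used.

```shell
# Check for an existing SeqKit installation before installing anything.
if command -v seqkit >/dev/null 2>&1; then
    # Print the installed version.
    seqkit version
    status="installed"
else
    # Assumption: conda with the bioconda channel is available on this machine.
    echo "seqkit not found on PATH; one option is: conda install -c bioconda seqkit"
    status="missing"
fi
```

Prebuilt binaries are also published on the GitHub releases page of the repository named above.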
Convert FASTQ to FASTA
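As a rough illustration, the conversion can be done with the `seqkit fq2fa` subcommand. The file names `reads.fq` and `reads.fa` below are hypothetical placeholders, and the tiny input is generated only so that the sketch is self-contained.

```shell
# Create a two-record FASTQ file to experiment with (hypothetical data).
printf '@read1\nACGT\n+\nIIII\n@read2\nTTGA\n+\nIIII\n' > reads.fq

if command -v seqkit >/dev/null 2>&1; then
    # Drop the quality lines and write the records as FASTA.
    seqkit fq2fa reads.fq -o reads.fa
    cat reads.fa
else
    echo "seqkit not found on PATH; skipping conversion" >&2
fi
```

For real data, replace `reads.fq` with the path to your own FASTQ file.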
Remove Duplicate Sequences
-s: Identify duplicates by sequence content rather than by sequence ID (the default).
[input_file]: The path to your input FASTA or FASTQ file.
-o [output_file]: The output file. Replace [output_file] with the desired path for the sequences that remain after duplicate removal.
-D: Write the number of duplicates and the list of their IDs to the specified file.
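Putting the options above together, a minimal sketch with `seqkit rmdup` might look like the following. The file names (`input.fa`, `dedup.fa`, `duplicates.txt`) are hypothetical, and the three-record input is generated only to make the example runnable.

```shell
# Create a FASTA file in which seq1 and seq3 carry the same sequence
# (hypothetical data).
printf '>seq1\nACGTACGT\n>seq2\nTTGACCA\n>seq3\nACGTACGT\n' > input.fa

if command -v seqkit >/dev/null 2>&1; then
    # -s compares records by sequence content; -o names the deduplicated
    # output; -D names the file that records duplicate counts and IDs.
    seqkit rmdup -s input.fa -o dedup.fa -D duplicates.txt
    cat dedup.fa
    cat duplicates.txt
else
    echo "seqkit not found on PATH; skipping deduplication" >&2
fi
```

With this input, `dedup.fa` should keep one copy of the repeated sequence plus `seq2`, and `duplicates.txt` should list the IDs that were treated as duplicates of each other.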