baseq is a basic sequence processing tool for biological
data. It provides simple and efficient functions for common tasks in
molecular biology, such as cleaning sequences, translating DNA/RNA to
protein, and calculating GC content.
You can clean DNA or RNA sequences by removing any non-standard
characters. The universal clean_seq() function
automatically detects the type.
baseq can translate DNA and RNA sequences into protein
sequences in all six reading frames.
Calculate the GC content of a DNA sequence.
baseq provides universal functions to read and write
FASTA and FASTQ files.
# Read a FASTA file into a dataframe
# df <- read_seq("path/to/file.fasta")
# Write a dataframe to a FASTA file
# write_seq(df, "output.fasta")For more details, see the documentation for individual functions.