Build

Description

Build a multiple sequence alignment pangraph.

Options

NameTypeShort FlagLong FlagDescription
minimum lengthIntegerllenminimum block size for alignment graph (in nucleotides)
block junction costFloataalphaenergy cost for introducing block partitions due to alignment merger
block diversity costFloatbbetaenergy cost for interblock diversity due to alignment merger
circular genomesBooleanccirculartoggle if input genomes are circular
pairwise sensitivityStringssensitivitycontrols the pairwise genome alignment sensitivity of minimap 2. Currently only accepts "5", "10" or "20"
maximum self-mapsIntegerxmax-self-mapmaximum number of iterations to perform block self maps per pairwise graph merger
enforce uppercaseBooleanuupper-casetoggle to force genomes to uppercase characters
distance calculatorStringddistance-backendonly accepts "native" or "mash"
alignment kernelStringkalignment-kernelonly accepts "minimap2" or "mmseqs"
kmer length (mmseqs)IntegerKkmer-lengthkmer length, only used for mmseqs2 alignment kernel. If not specified will use mmseqs default.
consistency checkBooleanttesttoggle to activate consistency check: verifies that input genomes can be exactly reconstructed from the graph
verbose modeBooleanvverbosetoggle to activate verbose mode
debug modeBooleanDdebugtoggle to activate debug mode: during merging intermediate graphs are saved in the debug folder
random seedIntrrandom-seedrandom seed for pangraph construction.

Arguments

Expects one or more fasta files. Multiple records within one file are treated as separate genomes Fasta files can be optionally gzipped.

Output

Prints the constructed pangraph as a JSON to stdout.