BLAST
Overview
BLAST: In bioinformatics, BLAST is an algorithm and program for comparing primary biological sequence information, such as the amino-acid sequences of proteins or the nucleotides of DNA and/or RNA sequences.
Using
Use the module name blast
to discover versions available and to load the application.
Multicore
Some of the suite of tools can be used across multicores. Look at the help for each command to identify what it can do. For example look at blastp -help
for the -num_threads
option.
Blast Database
ARCC has a version of the related BLAST database installed on Teton and WilidIirs that is configured so that blast knows where to look for it (so you don't have to worry about this).
Due to the reasonably dynamic nature of this database, ARCC does have a method to update this database on request.
Which Version of the Database?
According to the discussion here, there is no specific version since these are live databases. You can check dates using for example:
[]$ module load gcc/12.2.0 blast-plus/2.13.0
[]$ blastdbcmd -db $BLASTDB/nr -info | head
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
372,811,526 sequences; 135,507,326,095 total residues
Date: Apr 14, 2021 12:42 AM Longest sequence: 74,488 residues
BLASTDB Version: 5
Volumes:
/pfs/tc1/udata/blastdb/db/nr.00
/pfs/tc1/udata/blastdb/db/nr.01
[]$ blastdbcmd -db $BLASTDB/htgs -info | head
Database: Unfinished High Throughput Genomic Sequences; Sequences: phases 0,1 and 2
176,930 sequences; 27,769,248,758 total bases
Date: Jul 29, 2018 4:58 PM Longest sequence: 6,788,435 bases
BLASTDB Version: 4
Volumes:
/pfs/tc1/udata/blastdb/db/htgs.00
/pfs/tc1/udata/blastdb/db/htgs.01
Version: 2.6.10
Since the last update of the database, version 2.6.0 is no longer compatible with this database. If you try running blastp
with this version, you will see the following error message:
BLAST Database error: Error: Not a valid version 4 database.