funannotate
funannotate is a pipeline for genome annotation (built specifically for fungi,
but will also work with higher eukaryotes). Installation, usage, and more
information can be found at http://funannotate.readthedocs.io
To activate, check out a node with qrsh
and run:
1
2
|
bash
source /local/cluster/funannotate/activate.sh
|
To use in SGE_Batch, write a shell script that includes the above source line,
then the commands you would like to run.
Location and version:
1
2
3
4
|
(/local/cluster/funannotate-1.8.7) [Linux@chrom1 ~]$ which funannotate
/local/cluster/funannotate-1.8.7/bin/funannotate
(/local/cluster/funannotate-1.8.7) [Linux@chrom1 ~]$ funannotate --version
funannotate v1.8.7
|
DB Path:
1
|
$FUNANNOTATE_DB=/nfs1/CGRB/databases/funannotate/current
|
help message:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
|
$ funannotate --help
Usage: funannotate <command> <arguments>
version: 1.8.7
Description: Funannotate is a genome prediction, annotation, and comparison pipeline.
Commands:
clean Find/remove small repetitive contigs
sort Sort by size and rename contig headers
mask Repeatmask genome assembly
train RNA-seq mediated training of Augustus/GeneMark
predict Run gene prediction pipeline
fix Fix annotation errors (generate new GenBank file)
update RNA-seq/PASA mediated gene model refinement
remote Partial functional annotation using remote servers
iprscan InterProScan5 search (Docker or local)
annotate Assign functional annotation to gene predictions
compare Compare funannotated genomes
util Format conversion and misc utilities
setup Setup/Install databases
test Download/Run funannotate installation tests
check Check Python, Perl, and External dependencies [--show-versions]
species list pre-trained Augustus species
database Manage databases
outgroups Manage outgroups for funannotate compare
Written by Jon Palmer (2016-2019) nextgenusfs@gmail.com
|
funannotate check message:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
|
$ funannotate check --show-versions
-------------------------------------------------------
Checking dependencies for 1.8.7
-------------------------------------------------------
You are running Python v 3.7.10. Now checking python packages...
biopython: 1.70
goatools: 1.1.6
matplotlib: 2.1.2
natsort: 7.0.1
numpy: 1.20.2
pandas: 0.24.1
psutil: 5.8.0
requests: 2.25.1
scikit-learn: 0.24.2
scipy: 1.4.1
seaborn: 0.11.1
All 11 python packages installed
You are running Perl v b'5.026002'. Now checking perl modules...
Bio::Perl: 1.007002
Carp: 1.38
Clone: 0.42
DBD::SQLite: 1.64
DBD::mysql: 4.046
DBI: 1.642
DB_File: 1.855
Data::Dumper: 2.173
File::Basename: 2.85
File::Which: 1.23
Getopt::Long: 2.5
Hash::Merge: 0.300
JSON: 4.02
LWP::UserAgent: 6.39
Logger::Simple: 2.0
POSIX: 1.76
Parallel::ForkManager: 2.02
Pod::Usage: 1.69
Scalar::Util::Numeric: 0.40
Storable: 3.15
Text::Soundex: 3.05
Thread::Queue: 3.12
Tie::File: 1.02
URI::Escape: 3.31
YAML: 1.29
threads: 2.15
threads::shared: 1.56
All 27 Perl modules installed
Checking Environmental Variables...
$FUNANNOTATE_DB=/nfs1/CGRB/databases/funannotate/current
$PASAHOME=/local/cluster/funannotate-1.8.7/opt/pasa-2.4.1
$TRINITY_HOME=/local/cluster/funannotate-1.8.7/opt/trinity-2.8.5
$EVM_HOME=/local/cluster/funannotate-1.8.7/opt/evidencemodeler-1.1.1
$AUGUSTUS_CONFIG_PATH=/local/cluster/funannotate-1.8.7/config/
$GENEMARK_PATH=/local/cluster/gmes_linux_64
All 6 environmental variables are set
-------------------------------------------------------
Checking external dependencies...
PASA: 2.4.1
CodingQuarry: 2.0
Trinity: 2.8.5
augustus: 3.3.3
bamtools: bamtools 2.5.1
bedtools: bedtools v2.30.0
blat: BLAT v36
diamond: 2.0.8
emapper.py: There was an error retrieving eggnog-mapper DB data: malformed database schema (prots) - near "WITHOUT": syntax error
Maybe you need to run download_eggnog_data.py
emapper-2.1.4-2 / Expected eggNOG DB version: 5.0.2 / Installed eggNOG DB version: unknown / Local diamond version: diamond 2.0.4 / Local MMseqs2 version: 113e3212c137d026e297c7540e1fcd039f6812b1
ete3: 3.1.2
exonerate: exonerate 2.4.0
fasta: no way to determine
glimmerhmm: 3.0.4
gmap: 2017-11-15
gmes_petap.pl: 4.62_lic
hisat2: 2.2.1
hmmscan: HMMER 3.3.2 (Nov 2020)
hmmsearch: HMMER 3.3.2 (Nov 2020)
java: 11.0.8-internal
kallisto: 0.46.1
mafft: v7.480 (2021/May/21)
makeblastdb: makeblastdb 2.2.31+
minimap2: 2.21-r1071
proteinortho: 6.0.31
pslCDnaFilter: no way to determine
salmon: salmon 0.14.1
samtools: samtools 1.10
signalp: 4.1
snap: 2006-07-28
stringtie: 2.1.7
tRNAscan-SE: 2.0.7 (Oct 2020)
tantan: tantan 26
tbl2asn: no way to determine, likely 25.X
tblastn: tblastn 2.2.31+
trimal: trimAl v1.4.rev15 build[2013-12-17]
trimmomatic: 0.39
All 36 external dependencies are installed
|
Funannotate can be used to annotate eukaryotic genomes. It was developed for
smaller genomes (e.g. less than 1GB in size).
software ref: https://github.com/nextgenusfs/funannotate
software ref: http://funannotate.readthedocs.io
research ref: https://zenodo.org/record/4054262