Monarch geneset OGS2.0

DPOGS200914
TranscriptDPOGS200914-TA627 bp
ProteinDPOGS200914-PA208 aa
Genomic positionDPSCF300066 + 460201-462115
RNAseq coverage289x (Rank: top 38%)
Annotation
HeliconiusHMEL0127292e-4677.78% 
BombyxBGIBMGA000682-TA1e-4887.00% 
DrosophilaRab21-PC1e-6162.36% 
EBI UniRef50UniRef50_P352821e-7567.96%Ras-related protein Rab-21 n=60 Tax=root RepID=RAB21_MOUSE
NCBI RefSeqXP_001948825.12e-7866.98%PREDICTED: similar to Ras-related small GTPase [Acyrthosiphon pisum]
NCBI nr blastpgi|3071908474e-8072.22%Ras-related protein Rab-21 [Camponotus floridanus]
NCBI nr blastxgi|3071908471e-7970.81%Ras-related protein Rab-21 [Camponotus floridanus]
Group
Gene OntologyGO:00072641.2e-68small GTPase mediated signal transduction
GO:00055251.2e-68GTP binding
GO:00150311.2e-68protein transport
GO:00160201.7e-30membrane
GO:00071651.7e-30signal transduction
GO:00061841.7e-30GTP catabolic process
GO:00039241.7e-30GTPase activity
GO:00056224e-12intracellular
GO:00068864.5e-05intracellular protein transport
GO:00069134.5e-05nucleocytoplasmic transport
KEGG pathway 
InterPro domain[11-174] IPR0035791.2e-68Small GTPase superfamily, Rab type
[12-172] IPR0018066e-56Small GTPase superfamily
[8-174] IPR0208491.7e-30Small GTPase superfamily, Ras type
[10-164] IPR0052251.4e-26Small GTP-binding protein domain
[13-174] IPR0035784e-12Small GTPase superfamily, Rho type
Orthology groupMCL16147 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200914-TA
ATGACCACAGTCCAAAGTGGTACACACAATTTTAAAGTAGTTTTGTTAGGAGAAGGCTGCGTCGGAAAAACATCTTTATTACTGCGATATATTGAAGACAAATATAATGACAAACATCTTACGACGCTCCAAGCAACCTTTCTCAACAAGAAATTGAACATAAATGGAAAACGCATAAACCTCTCTATTTGGGACACAGCCGGACAGGAGAAATTTCATGCCCTTGGACCAATTTACTATCGGAACTCGAATGGCGCCATACTTGTTTATGATATAACTGACGAGGATTCTTTTGGAAAAGTAAAGAATTGGGTTAAGGAGTTAAGAAAGATGCTCGGCTCCGACATTGTGTTGGTAATTGCAGGCAATAAAATTGATTTGGAGCAGGACAGGACTGTGCCGTTGGAGGAAGCTGAGAGTTATGCGGTAATGGTAGGTGCGAGGCATTTTAATACATCAGCTAAAATGAATCAAGGTGTTGAGGAGTTATTCCTTGAACTGACGAGGGAGATGACGGAAAGATTTGAACAAAATTCTCAAGCCGATGTGTCAAGGACGTCCAGGGTTCTGGTTGTCGAAGATGAAGCCCCACAGCAGTCTTCTTGCTGCTCAGGAATTAGGAACTAA

Protein sequence:

>DPOGS200914-PA
MTTVQSGTHNFKVVLLGEGCVGKTSLLLRYIEDKYNDKHLTTLQATFLNKKLNINGKRINLSIWDTAGQEKFHALGPIYYRNSNGAILVYDITDEDSFGKVKNWVKELRKMLGSDIVLVIAGNKIDLEQDRTVPLEEAESYAVMVGARHFNTSAKMNQGVEELFLELTREMTERFEQNSQADVSRTSRVLVVEDEAPQQSSCCSGIRN-