Monarch geneset OGS2.0

DPOGS212821
Transcript: DPOGS212821-TA, 1419 bp
Protein: DPOGS212821-PA, 472 aa
Genomic position: DPSCF300086 - 311201-314858
RNAseq coverage: 3279x (Rank: top 4%)
Annotation
Heliconius        HMEL010112        0.0  87.13%
Bombyx            BGIBMGA000767-TA  0.0  79.83%
Drosophila        atl-PA            0.0  67.74%
EBI UniRef50      UniRef50_Q9VC57   0.0  67.74%  Atlastin n=12 Tax=Drosophila RepID=ATLAS_DROME
NCBI RefSeq       XP_001663157.1    0.0  69.16%  atlastin [Aedes aegypti]
NCBI nr blastp    gi|157134123      0.0  69.16%  atlastin [Aedes aegypti]
NCBI nr blastx    gi|58395103       0.0  70.24%  AGAP002047-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology     GO:0005525  2.5e-72  GTP binding
                  GO:0003924  2.5e-72  GTPase activity
KEGG pathway 
InterPro domain   [21-228]   IPR015894  2.5e-72  Guanylate-binding protein, N-terminal
                  [231-359]  IPR003191  6.5e-25  Guanylate-binding protein, C-terminal
Orthology group   MCL10455   Multiple-copy universal gene

Nucleotide sequence:

>DPOGS212821-TA
ATGCATCACAAGTACGGCGCTGGCGGGGGTGGCGGCGAGTGGCTGGGGTCGGAGGACGAGCCGCTGGCGGGGTTCAGCTGGCGCGGAGGATCCGAGAGAGACACCACCGGCATACTCATGTGGTCCGAGATATTCAAGGCCACGCTGGAGACTGGGGAGAAGGTAGCCATAATCTTGCTGGACACTCAAGGTGCCTTCGACAGTGAGTCCACGGTCCGCGAGTGTGCGACGGTGTTCGCCCTGTCTACAATGTTGTCCTCAGTTCAGATCTATAACCTGGCCCAGAACATACAGGAGGATGATCTCCAGCACTTGCAGTTGTTCACTGAGTACGGCCGTCTGGCGCTGGAGGACGGGGGGCGGACTCCCTTCCAGCGACTGCAGTTCGTTGTTAGGGACTGGAGCTTCCCTTACCTCCGCGGGATGCCTCGGCTCCCCGGCCAGAAGATACTCCAGAGAAGATTGAAAGTGTCTGACAAACAGCACCCCGAGCTGCAATCCCTCCGGAAGCACATCACGTCTTGTTTCTCGGAGATCGCCTGCTTCCTGATGCCTCACCCCGGACTCAAGGTCGCCACCAGCCCCGACTTCGATGGACGGCTATCTGACATCGAAGGTGAGTTCAAGCGCGCGTTGGTGCAGTTGGTACCGATGCTGCTGGCTCCCCACAACCTGGTCCCCAAACTCATCAACGGACAGCCCGTCAGGGTCAAGGACCTGCTGCAGTACTTCAAGAGCTACATGAACATCTACAGAGGAAACGACTTGCCGGAACCGAAGAGCATGCTCGTGGCGACGGCGGAGGCCAACAACCTGACGGCGGTGGCGGAGGCTCGAGACGTGTACACCACGTTGATGGAGGAGGTGTGTGGCGGCGCGCGGCCCTACCTGCAGACACAGCTGCTGGACATGGAGCATCAGAGGATCCGGGACAAGGCCCTGCACGCCTTCCGCGCCAAGCGCAAGATGGGGGGAGACGAGTTCAGCAAGTCCTATCACGATCAGCTCGTGCAGGACCTCGAGGACCAGTTCTCACAGTTCCGCGCCCACAACGAGAGCAAAAACATCTTTAAGGCGGCTCGGACGCCGTCGGTGCTGTTTGCGGTGGCGCTGGCTTTCTACGTGATGAGCGGAGTGCTGGGACTCCTCGCGCTCTACCCGCTGGCCAACCTCTGCAACCTGGTCATGGGACTGGCGCTGCTCACCTTGGCCTTGTGGGCCTACATCAGGTACAGCGGAGAGATGAGGGAACTCGGTGTCACCATAGACGAGTCCGCCAGCTGGCTGTGGGACAACGTTATGAAGCCGGTGTACCAGGCGGGCATAGAGAAGGGCATGGAGCACGCGGCGGCGGCGGCGGCCGAGCGCGCCATGGGACCTTCGACGCCGAACGCCAATGGCAAATTCAAACAATTGTGA

Protein sequence:

>DPOGS212821-PA
MHHKYGAGGGGGEWLGSEDEPLAGFSWRGGSERDTTGILMWSEIFKATLETGEKVAIILLDTQGAFDSESTVRECATVFALSTMLSSVQIYNLAQNIQEDDLQHLQLFTEYGRLALEDGGRTPFQRLQFVVRDWSFPYLRGMPRLPGQKILQRRLKVSDKQHPELQSLRKHITSCFSEIACFLMPHPGLKVATSPDFDGRLSDIEGEFKRALVQLVPMLLAPHNLVPKLINGQPVRVKDLLQYFKSYMNIYRGNDLPEPKSMLVATAEANNLTAVAEARDVYTTLMEEVCGGARPYLQTQLLDMEHQRIRDKALHAFRAKRKMGGDEFSKSYHDQLVQDLEDQFSQFRAHNESKNIFKAARTPSVLFAVALAFYVMSGVLGLLALYPLANLCNLVMGLALLTLALWAYIRYSGEMRELGVTIDESASWLWDNVMKPVYQAGIEKGMEHAAAAAAERAMGPSTPNANGKFKQL-
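
The transcript and protein entries above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The function names `translate` and `expected_protein_length` are hypothetical helpers written for this illustration and are not part of the database.

```python
# Standard genetic code (NCBI translation table 1), built from the
# conventional TCAG base ordering. Stop codons are written as "-" here
# on the assumption that this matches the record's protein notation.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY--CC-W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, in frame from position 0."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(transcript_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# First and last 18 nt of DPOGS212821-TA, copied from the record above.
print(translate("ATGCATCACAAGTACGGC"))   # start of the protein
print(translate("AAATTCAAACAATTGTGA"))   # end of the protein, including stop
print(expected_protein_length(1419))     # compare with the listed 472 aa
```

Passing the full DPOGS212821-TA sequence to `translate` and comparing the result with DPOGS212821-PA would extend the same check to the whole record.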