Monarch geneset OGS2.0

DPOGS212858
Transcript        DPOGS212858-TA    1566 bp
Protein           DPOGS212858-PA    521 aa
Genomic position  DPSCF300086 + 305003-308850
RNAseq coverage   796x (Rank: top 16%)
Annotation
Heliconius       HMEL010112        0.0     62.91%
Bombyx           BGIBMGA000802-TA  0.0     67.05%
Drosophila       atl-PA            7e-167  56.42%
EBI UniRef50     UniRef50_E0VUN4   7e-167  56.59%  Atlastin, putative n=5 Tax=Eukaryota RepID=E0VUN4_PEDHC
NCBI RefSeq      XP_321004.2       7e-173  59.18%  AGAP002047-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|307173020      1e-173  58.38%  Atlastin-2 [Camponotus floridanus]
NCBI nr blastx   gi|345498404      3e-168  59.80%  PREDICTED: atlastin-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology    GO:0005525  7.1e-84  GTP binding
                 GO:0003924  7.1e-84  GTPase activity
KEGG pathway
InterPro domain  [18-291]   IPR015894  7.1e-84  Guanylate-binding protein, N-terminal
                 [296-422]  IPR003191  7.4e-23  Guanylate-binding protein, C-terminal
Orthology group  MCL10455  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212858-TA
ATGGAGAACTTAGAATCTGGCAGAGGTATTCAGGTGGTGACTCCCGGACCTGAACACACTTTTACCTTAGACGAGGAGGCGCTCGAGACGATCTTACTCAGAGAAGACATTAAGGACAGATCAGTTGTAGTGGTGTCCGTGGCCGGCGCCTTCCGTAAGGGCAAGTCCTTCCTGCTGGACTTCTTTCTTCGATACATGCACCACAAGTATAATCTGCATGGAAGTGGTGTTGACTGGTTGGGGGTGGAGGACGAGCCACTGTCAGGTTTCAGCTGGCGAGGAGGCTCCGAGAGAGACACTACTGGGTTGCTGTTCTGGTCTCAGCCATTCAAGGCTACATTAGACAATGGAGAAAAGGTGGTGATCTTCCTTATGGACACACAAGGCACATTTGATAGCGAAACTACAGTTAAAGAGAATGCCACTGTGTTCGCACTCTCCACCATGTTGTCCTCTGTACAGATATACAACCTGTCACAAAATATTGAAGAAGATGATCTGCAACACCTGCAATTGTTCACTGACTATGGGCGCCTGGCACAAGAGTCATCGCAGGGCGCGCCCTTCCAGCGGCTGCAGATGTTGGTAAGAGACTGGAGCTTTCCGTACGACCACCCTTACGGAGCGGTGGGCGGGCAGCAGCTATTGGACAAACGACTTAAGGTCCACGATGGCCAGCACCCCGAACTGCAATCTCTGCGAGTCCACATCGCGGGCTGCTTCGAGGAACTAGCTTGCTTCCTTATGCCCCACCCCGGACTGAATGTAGCGACTGATCCACACTTCAATGGCAATTTGGCCGCCATAAGTCAAGAATTCAAGCTTTGTTTGAAACAACTCGTACCCATGCTGCTGGGACCAGAGAACTTGATTATAAAAAAGATTGGAGGAAACAAATTGAAATCCAGAGATTTGCTTTTATATTTCAAGTCGTACATCAACATATTTAACGGCGCCGCTCTCCCCGAGCCGAAGACTATTCTAGAGGCAACTGCAGAGGCTAACAATTTATCAGCTATGGCAGAAGCTAAGGAGGTTTACGAGACGCTCATGGAAGAAGTCGCCGGTGGAGCGAAGCCCTATCTGCAGCCGTCGCGGCTCGAGCAGGAACACCATCGAGCGAGGGACAAAGCGCTGCACGGCTTTCGCTCCAAAAAGAAAATGGGCGGAGATGAGATTGCAGAGTCATACGCCGATAGGCTGGTCAAGGAGTTAGAGGAGCAGTACGAGCAGTACCGGTTGCACAACGAGGACAAGAACCTGTTCCGACAGCTGGGAACGGTCGCGGTGCTGCTGATGTTGGCGCTGGCCGTGTACATGGTCGGGATACTAGCGGGCGTGTTGGGTTTCGAGTCCCTTCAGGCCCTGGGGAAGACGGCGGCTATTGTTTTCCTGGTCTTGATGGCGGTTTGCGGTTACTCCAGAGCCACCGGCAACATGCGAGAGCTCACCATACAGCTGGACCAACTGGCGGAATCTGTGATGGACAAAGCTGGCCAGCTGGGGGGATGGCGAACCGCTGCGACAACAGACAGCTACAGCAGGGACAAAAACAAGCAGTCATAG

Protein sequence:

>DPOGS212858-PA
MENLESGRGIQVVTPGPEHTFTLDEEALETILLREDIKDRSVVVVSVAGAFRKGKSFLLDFFLRYMHHKYNLHGSGVDWLGVEDEPLSGFSWRGGSERDTTGLLFWSQPFKATLDNGEKVVIFLMDTQGTFDSETTVKENATVFALSTMLSSVQIYNLSQNIEEDDLQHLQLFTDYGRLAQESSQGAPFQRLQMLVRDWSFPYDHPYGAVGGQQLLDKRLKVHDGQHPELQSLRVHIAGCFEELACFLMPHPGLNVATDPHFNGNLAAISQEFKLCLKQLVPMLLGPENLIIKKIGGNKLKSRDLLLYFKSYINIFNGAALPEPKTILEATAEANNLSAMAEAKEVYETLMEEVAGGAKPYLQPSRLEQEHHRARDKALHGFRSKKKMGGDEIAESYADRLVKELEEQYEQYRLHNEDKNLFRQLGTVAVLLMLALAVYMVGILAGVLGFESLQALGKTAAIVFLVLMAVCGYSRATGNMRELTIQLDQLAESVMDKAGQLGGWRTAATTDSYSRDKNKQS-