Monarch geneset OGS2.0

DPOGS205005
TranscriptDPOGS205005-TA3426 bp
ProteinDPOGS205005-PA1141 aa
Genomic positionDPSCF300123 + 276949-288752
RNAseq coverage3258x (Rank: top 4%)
Annotation
HeliconiusHMEL0094620.072.29% 
BombyxBGIBMGA010235-TA0.065.78% 
DrosophilaCG9297-PB0.081.82% 
EBI UniRef50UniRef50_F5HK420.060.74%AGAP002456-PB n=22 Tax=Coelomata RepID=F5HK42_ANOGA
NCBI RefSeqXP_969014.10.059.01%PREDICTED: similar to sarcalumenin [Tribolium castaneum]
NCBI nr blastpgi|1839792100.069.22%sarcalumenin [Papilio xuthus]
NCBI nr blastxgi|1839792100.068.65%sarcalumenin [Papilio xuthus]
Group
Gene OntologyGO:00055253.8e-09GTP binding
GO:00039243.8e-09GTPase activity
KEGG pathwaycel:W06H8.11e-44 
 K12476 (EHD3)maps-> Endocytosis
InterPro domain[778-937] IPR0014013.8e-09Dynamin, GTPase domain
Orthology groupMCL12524 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205005-TA
ATGGCGCTCAGGGTAGCCCGGGCCAACCGTACAATGTTATATAACGAGGCATGCATTCTCGCAGGGATACCGCTGTGGGATCTTGAAGCTGAAGCCTTCGTAGCGGTATGGAAGCGACGCATGCTTTCCTGGCAGGAGAACGACGGCGATTCCCACCAAAGTGGAGCAAAGGTATGGAGAGAGGAGGCGCATGCCGAAGTCTTGCAGGAGTGGGCAGTGCGCCTACGTGACCCGAGATATGACCGGGATTTGGTGTGGTCTGTCCTTCCCCACCTTAAGCAGTGCGGTGATAGGAGGTTCGGTTCCAGGACGTATCACTTAGCGCAACTGCTTGCCGGGTATGGATGTTTCGGTCGGTGTCTGTGTCGGGTGGTGCGCAGGGAACCTTTACGTACACGAGGATACGGCACAGCACGCCCGTGCCGTATGTCCCGCCTGAGACAATCAACGGGGCGATTTGGTTGTTGCCATAGGGCCGGACCTCTCGTAGCCGGTCGTAGTAAGCGCCACGTAAAGAGTGTAAAGTGTAAAGGTGTAAACAGGAGGCAGAAAGGGAGTGGGAGCAACACACCGCATCTCTCCCCATGCCTCACGGGAGAGGTGGCCGAGGGCGTCGAACCTACGTGGCGGCCTTTGTTACCACGTAGCAAAATCGTCGCGCGTTGGACGGAGCCTATGGGTGGCGGATGGGGAATTTTGTCGCTCATATTACAGGGTTCGGGTTGGTGTGCGTCTCCCATATTCCGAGGCACGATGTTGCCTGCACGGAGTGGAGGGGGGAAGTATCTACCGATCGGGTGCGCGAGTGTGAGTGCAAATGATATAGAGCCCGGTGTAGCTGGAACAGAACAAAATGTGGACCAAGAGATTAAAGTTATGGAGGTTTATGTTGTGCATGTTGTTTGCCGTGTCTCTGTTCGCGGTAGCTGCGAGAGGTACTTGAAAATGATGAATTCCTTACATTTCCTACAAAAGCTCAAGAATATTTTACTGCTGGACACCGACGACGACATTCCTTCAGAATCACAATGCCGTCCATACATTGAAAAAGCACTTAAAGAACTCGAATCGGGGGATACAGAACCAGATTCTAAAGAAAAACTCGAATCTAGTGAAGTAAGTATAGACGACGAAACAAAAGAAACATCAGCAGAAATAGGCGAGGCAGCGGAAGATTCAGACGACAGTAAAGAAAATGCTGAGAGTGAAGAAACAGGAGATGTTCCTTCAGTAGAAGTAGCAGATGAACCTTATGTTTCTCAATCTTCTTACGACGACTCTGATAGTCCTGTAAGTGAAGATTCTGTTGAGAATGAAGCTGAAAACGAAAGTGCTGAAAACGTCGAGCAACAAGCAGATGAGAGTAAAGAGGATGGCGGAGACAGCAAAGAAGAAGCATCTGCTGATGACAAAGAAATCGAAAAAGAAAGTGGCGAAGTAATCGATGACGAAGAAAGTAAAGAAGATTCTAAAGACAGTAAAGAAGACTCTGAAGACACCAAAGAAGAAAAGGAAGAGAGCGAGGAAAGAGATGACAAAGAAGAATCAGATGAAGTTGGTGTTGAAAGCGCTGATGATAACCAAGACTCGGCTGAGCAGGCAGAAGACCAGAGTACAGAACAGGCTGAGTCAGATGAAAAGGTAGAGGAATCTTCTGATGCTTCTGAAGAACGTGATGCTGAAGGGGAAAGTGATGAAGACAAAAAAACAGATGAAGACGACGATGGATCTCAAGAAAAGAAAGAGTCTGAAGAAATAGAGGGAGGAAAGATAGATGACTCCCAAGAAGAAGAAAAACACGACGATGAATCTAAAGAAGATACTGAAGAGGACGGTAAATCCGATGAAGAAAAAGAGAAAGCAGTGGAAGAAGATGAATCGGCAGAAGCAGAAAAATCTGCAGAAGAAGTTCAATCTGCGGAAGAAGAAGAATCTGTAGAAAAAACTGAATCTTCCGAAGACGTTGCTGAGGAAGAGAAGCTATCTGTAGAAGATCTTGCTGGCGAATCCGACGCCGAAAAAGATAGTGGTGAACAATCCGAGGTTACTGGAGATGGAGAAAGCGAAGAAGAAGGTGTACCGGAAGAGGACTTGCTCTTAGAAGGGGAAATTCCAGAGAACTTGCGCTCCCGTGACCATATCATTCAAATCCTAAGATTGGACGAGGAAGCCAGCGAGTCGGAGATGATTATTGAGAAATCGGCCGACATTGTTCTCCGAGATCTGAAACGGCTTTACGAGAACTCGATCAAGCCACTTGAAGCGTTGTACAAATACAGAGATTTGAGTAACAGACATTTCGGTGATCCAGAGATATTCTCCAAGCCATTGGTCCTGTTCATGGGACCGTGGAGCGGCGGAAAGTCCTCCATATTGAATTATCTGACAGGAATAGAGTTCACCGAATGGTCTCTACGTACAGGCGCCGAGCCATCACCAGCGTACTTCAATATCTTAATGCACGGACAAAATCCAGAAGTGCTGGATGGAACTCAACTAGCAGCGGACTGGACGTTCTCCGGCCTGCAGAAGTTTGGACAGGGCTTGGAAGAACGTCTCAGAGGTCTTAGGTATCCCAGTAAGCTACTGGAAAAGGTAAACGTTGTGGAAATACCTGGTATCCTAGAAGTAAGGAAGCAGGTGTCGCGAGTGTTTCCCTTCAACGACGCCTGTCAATGGTTCATTGACCGCGCCGACATCATATTTCTCGTCTACGATCCTTCTAAGCTGGACGTCGGACCGGAAACAGAAGCTATCCTGGACCAGCTCAAAGGCAGAGAGTCACAGACGCGTATCGTCCTAAACAAAGCGGACACCGTGAAGCCGGAGGAGCTGATGCGTGTACAGAGCGCGTTGATCTGGAACATCTCGCCGTTGATGAGCTCGGCGCAGCCTCCCGTCATGTACACGGTGTCGCTGTGGTCCATGCCGCTGGAGGCCGGCGCGCCCGCGAGGCTGCTGCTGGCGCAGGAGAGGGAACTGCTCAGGGACCTGAGACAGGCCATAGACAAGCGGATAGAAAACAAGATAGCGAGCGCCAGGAGATTCGCCGTGCGCGTGAGGAACCACGCTAAGATGGTTGACTGCTATTTGACCACGTACTACAACCACAAGACCATATTTGGAAATAAGAAGGTCATCGCCGACGCGATCATCGACAGCCCACAGAACTACCACATCTACGAGGGACTCAGTACACTTACCAATATATCAAGGTACGACCTCCCCGATCCGGAAACCTACCGGGACTTCTTCCGTCTGAACCCCTTGTACGAGTTCCAGCAGCTGTCGTCCACGTGCACGTATTTCCGCGGCTGTCCCATCAATCGTCTGGACGTAGCGATCGCCTACGACCTGCCCGAGCTGGTCGGCAAATACAAGAAGATGGTAGAAACTGCCACCCCCCAGGGAATGCCCAAAAGTTGA

Protein sequence:

>DPOGS205005-PA
MALRVARANRTMLYNEACILAGIPLWDLEAEAFVAVWKRRMLSWQENDGDSHQSGAKVWREEAHAEVLQEWAVRLRDPRYDRDLVWSVLPHLKQCGDRRFGSRTYHLAQLLAGYGCFGRCLCRVVRREPLRTRGYGTARPCRMSRLRQSTGRFGCCHRAGPLVAGRSKRHVKSVKCKGVNRRQKGSGSNTPHLSPCLTGEVAEGVEPTWRPLLPRSKIVARWTEPMGGGWGILSLILQGSGWCASPIFRGTMLPARSGGGKYLPIGCASVSANDIEPGVAGTEQNVDQEIKVMEVYVVHVVCRVSVRGSCERYLKMMNSLHFLQKLKNILLLDTDDDIPSESQCRPYIEKALKELESGDTEPDSKEKLESSEVSIDDETKETSAEIGEAAEDSDDSKENAESEETGDVPSVEVADEPYVSQSSYDDSDSPVSEDSVENEAENESAENVEQQADESKEDGGDSKEEASADDKEIEKESGEVIDDEESKEDSKDSKEDSEDTKEEKEESEERDDKEESDEVGVESADDNQDSAEQAEDQSTEQAESDEKVEESSDASEERDAEGESDEDKKTDEDDDGSQEKKESEEIEGGKIDDSQEEEKHDDESKEDTEEDGKSDEEKEKAVEEDESAEAEKSAEEVQSAEEEESVEKTESSEDVAEEEKLSVEDLAGESDAEKDSGEQSEVTGDGESEEEGVPEEDLLLEGEIPENLRSRDHIIQILRLDEEASESEMIIEKSADIVLRDLKRLYENSIKPLEALYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSILNYLTGIEFTEWSLRTGAEPSPAYFNILMHGQNPEVLDGTQLAADWTFSGLQKFGQGLEERLRGLRYPSKLLEKVNVVEIPGILEVRKQVSRVFPFNDACQWFIDRADIIFLVYDPSKLDVGPETEAILDQLKGRESQTRIVLNKADTVKPEELMRVQSALIWNISPLMSSAQPPVMYTVSLWSMPLEAGAPARLLLAQERELLRDLRQAIDKRIENKIASARRFAVRVRNHAKMVDCYLTTYYNHKTIFGNKKVIADAIIDSPQNYHIYEGLSTLTNISRYDLPDPETYRDFFRLNPLYEFQQLSSTCTYFRGCPINRLDVAIAYDLPELVGKYKKMVETATPQGMPKS-