Monarch geneset OGS2.0

DPOGS200545
TranscriptDPOGS200545-TA4230 bp
ProteinDPOGS200545-PA1409 aa
Genomic positionDPSCF300119 - 87268-96285
RNAseq coverage3336x (Rank: top 4%)
Annotation
HeliconiusHMEL0168680.057.91% 
BombyxBGIBMGA010785-TA0.051.97% 
DrosophilalqfR-PD3e-13932.11% 
EBI UniRef50UniRef50_UPI00020615F77e-15532.21%UPI00020615F7 related cluster n=1 Tax=unknown RepID=UPI00020615F7
NCBI RefSeqXP_002073575.14e-16131.59%GK13071 [Drosophila willistoni]
NCBI nr blastpgi|1954529588e-16031.59%GK13071 [Drosophila willistoni]
NCBI nr blastxgi|1984523283e-16732.45%GA26538 [Drosophila pseudoobscura pseudoobscura]
Group
KEGG pathwayxtr:1002160837e-36 
 K12471 (EPN)maps-> Endocytosis
InterPro domain[22-163] IPR0089421e-58ENTH/VHS
[25-152] IPR0138096.7e-49Epsin-like, N-terminal
[24-148] IPR0010265.8e-41Epsin domain, N-terminal
[971-1083] IPR0193374e-26Telomere length regulation protein, conserved domain
Orthology groupMCL11575 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200545-TA
ATGGATCGTTTCATAAGTATGTGGAAAGTCAGGGAGCTGGCGGACAAGGTGACGAACGTGGTGATGAACTACACGGAGGTGGAGGGCAAGGTCCGGGAGGCGACCTCGGACGAGGCCTGGGGCCCCACCGGCCAACAGATGCAGGAGCTGGCGCTGGCGACCTTCACATACGAACACTTCCCGGAGGTCATGTCTATGTTGTGGAGGAGAATGTTACATGACAACAAAGCGCATTGGAGAAGGACTTACAAGTGCCTCCTCCTCCTCAGCTACCTGGTGAGGAACGGCTCCGAGCGAGTCGTGACCTCGGCCAGGGAACACATCTACGACCTGAGGTCGCTGGAGAACTACTCCTTCGTCGATGACTTGGGCAAGGACCAGGGCATCAACATAAGGCACAAGGTCCGCGAGCTGATCGACTTCATCCAGGACGACGACAAGCTGCGGGACGAGAGGAAGAAAGCCAAGAAGAACAAGGACAAGTACATAGACGACAAAGATAGAAACGAAGACGACTACGACAGAGAGGACTCGGACGGCGACGACGGACACACCAAACATAACAAAGAAAACGTGTACCGAGACTCCGAGGTGATAGACGAGTGTCCCGTCCCGGCCCGGGACAGCGAGTACACCTCCAGGACGCTCAACATCAGTCTGAGGAGTCCCGCCAGGAACAAGCAGAGCACGCCCGTCAAGAAGATAGACCTGGGAGCGGCGGCCAACTACGGCAAGACTCCCGGGGCTCCCCCCGCCCCCGCGGTGACGCCCGGCCCGCCCCTCACCTCTGGGACCCAGCAGTCACAGGAGCTGCTGGACGAGTTGTTCAAGACCTGCGCACAAACAGACAACACGCCGGCCGGAGAGGACGACTTCGACCCCAGGCACCCTTCAAGCAGACCGGTGAAGAACGATGACTTTGGAGATTTCAGCACAGCCTTCAGCGGAAACGGAAACGACGAGGGGTTCGCCGACTTCACCAGCGCCTTCCACAACAACAACACGCAGACAGTTTCAGCTCCCACCTCCAACCTCCAACTCCTGAGTGAGTTGTCCCCCGCGATGCCCAGTTTGACCCCCGGGTTGACCCCTGGACTAGCTCCGGGCCTGACCCCCGCCCTCGACCCCCTCTCCTCGCACTTCGACAGCGCTCTCAATATAACAGATGGTGACCGACCGACACACAGCGACCGGCTCAGGGCCGAGATGAAGAAGCTGGTTAACATACTACACGTGATGGAGAGGATCAAGAGCGAAGGAGACGTGGCCGACGTGAACGGAAGGATACAAGTCATAAAGAGATACCTCCCGGGGCCCGTCACCGTACAGAAGCTGTCCAGGTGTGACAGCAGGCTCATAGATCACGAGGCCTTGGAGGTGTTCTCCCAGCTGCTGGGCAGCATCGTCCGGGTTCTGTTACCGCATTGGCCGGAGTTCAGGGACGAAGTGGTGTATCTGTTCACGGTGGAGGAAGGCTTCGACGTTAGTAACGAGATACTCACTAACCTGTGCGGGTATGTCAAGGAGGACAGGAACGACGTGGTGCTGGAGGCTCTCGGATATGTGACGCTCAAGTTCGCAAAGAGCGACGCCGTGCTGGCCTCCATAATAGACTGCAGTGTGGCCGGAGAAGACGTGAGGCTCATGACAGACTGGGAGGGCTACGTCCAGTTGCTGACAACGCTACCCGAGAGGATAGCCAACAGACTGGAGATCAAGACTCCGATAGAGTTCTCTCACGAAAACTACTCGTTCATCCTCCTGTTCCAAGTGATCCGCTCCGTGGACTACATGTGTCAGAGCAACTTCTACCAGGGAACCCTGTATAATTTATCCTATTTATCGTACCTGGTGTCCAAATATGTCGTGTACTATATAAAGACGGAGGCCGTTCTGAAATTGTGCGACATGTTGATCGCCTGGACCGACGACAACAACGACGATCCCTACAGGTTCGTGAGGAGGAAGCTGATTCAGACGGTGCTGAATAAACTCAGCAGACAGGCGATCGACAAGCTGGCGCTGATCCTGCTCAAGAGATGTCCGATCGTATACAGCTCAAAAATCCAGCCGATTAAATTATTATTAGGGAATAACCTGGAGCTGAACAAGGACTGGCATGAGATACTCACCTTCAGGATACCGTTCTACGTGCATCCTCAGAACTTCAGGGACACCACCATCCCGGAGAACCTGGTTTACTACATTGCTACCACCAAGAACGCCCTGGACATACTCACCAACCTGATAGTGACGCTCGCAAAGACCTGGGCCGACGTTCATCTCAACAACGTCATCAACATAGACCACCACATGCACACCTCGGTGCTGCTCGTGCTGGCCATCAAGTACAGGATCATAATGTGGAGGCAGAGGAAGGGCGTCTGGAACCTAATCGAAATAAAGAGGATGCTATACAAGGGGATGTCCAAGCATCTAGACATACTCACCACGGAGTTCCGCTGTGTGGGGATGGCCACGGTCGAGATAGTCTGCAAGATGCTGGTCGAGGTGGACGACTCCGACCGCGCGGCTGTGGAGAGACTCAACTTCGAGTTCAACGAGTTGGGGCAAGTGTGCGTCGACATCTACAACACGCTGGTGACCATAACCAACAAGTGCGTGCTGGACGACCGAGCCAAGCCACCCACAGCCGAGCGGAGGCTCATCGACGCCCAGCAACTGATGGACGTTATAGCGGAGAAGGTCACGGACCACGTCGAGAAGCCCGTCCAGAACACGATAGTGACATGCGCTGTCAAGGGACCCCAGCAGACCAAGGAGATCGTCAAAACCATCATATCCGCCAAACTCGACGCTCTCAAGGGCGGCAGGAACCTGGACCTGGACTCCGACGACGACTTGCAGCCCTACGACATGACCAACGACGTCAGCGTCGCCTCCAAGAAGAAACCCAACTACCTGAGGGATCTGTTAGAGGTGGTCCGGGAAGCCAAGGATCAGGAGTCCTTTGAGGCCGCGCTCACCTCGGCAGAGAATCTCGTGAAGAAGCAACTGAAGCACGAAGACGGGAAGCTCGCCATCGAGCTGTTGGACCTGTTCGTGCACCTGGAGGAGAAGTACCACGTGGACAAGTTCAAGAGTCTGAAGTTCAACACGGCAGTGGCCATCGTATGCAGCCAGCCCAGGGTGTGCGCCGAGCATCTGTGCAAGGAGGTGCACAGCGACATCGGCCGCTACTCGATATCCACCAAGATATTCATGTTGGACGTGTTCACCGAGGCGGCCGAGAGGATCGCCGACATCAAGACGGACCCCTCGTACGAGATACACAAGAAGGCCGAGATCATCATCGAGGCCAAGGAGCTGCCGCGCGACGAGGTGCTGAGGCGGAGGCTGCTCAAGAAGACCAAGTTCATACACTCCAAGCGCGCCCACCCCTTCTCCAAAGCCAAGAAGAACCAGTTCGCCCCCGTGTCGGATTACTTCTTCTACCCGCTCATTGCCGGCTTCGGCTACCGCCAGCTGACGCTGAGCCACCACAACCTGAAGCAGGACATCGACAACCTGCTGCTGCTGCGCTACCTGTCGGCGGTGGGCAGCGTGGTGCTGGCCGCCAAGAACTGCCCCAAGTGTCCCGTGTACTGCCGCGAGATCCTGCAGATGGTGCTGTTCCTGCGCTTCACGCCGCACCCCGAGCTGCAGCTGTGCGTCATATCCATCATCGCGGCCATCGCCCTCGCCCTGCCGCAGTCCATGCTGAAGGGCGAGTTCTACGACGTGATGATGGAGCTGTGCTCGTGGGTCATCGACTTACTGACGCACGCCGACCTCTCGCACCGCCTCGGCGGACCCAAATCCGAGGCCACCGTGTTCGCCGGAGAATCACATGTACCTTCGACCCTCATAACTGTCGTTGACGCGCCATCGGTGAGTTTGCAGCCGAGCCTCCAGCCGTGTCGCCTGCAGCCGAACCCTCAGCCGATCCAGCCGAGCGCGGCTCTGTCCCAGCGGAGCGGCGCGGCCGCCGTCAACAACAACCAGTCGAAGCTGGCCCCCCGGCTGGGCGCCACGTGGGCGGACAGCGCGGCCTCCACCATCATCGACGTGGACAACCTGCTGTCCCCGCGCTCGCCCAAGGCTGGGCCCGCGCCCTCCATCAACCAGCTCAAGTCGAACCCCGCCAGTCCCGCCCACAGACCGGCCTGGCCCGTCGCCTCCAACAGCAACAACAACAACAACCTCACCACGGACGACCTGCTGCAATGA

Protein sequence:

>DPOGS200545-PA
MDRFISMWKVRELADKVTNVVMNYTEVEGKVREATSDEAWGPTGQQMQELALATFTYEHFPEVMSMLWRRMLHDNKAHWRRTYKCLLLLSYLVRNGSERVVTSAREHIYDLRSLENYSFVDDLGKDQGINIRHKVRELIDFIQDDDKLRDERKKAKKNKDKYIDDKDRNEDDYDREDSDGDDGHTKHNKENVYRDSEVIDECPVPARDSEYTSRTLNISLRSPARNKQSTPVKKIDLGAAANYGKTPGAPPAPAVTPGPPLTSGTQQSQELLDELFKTCAQTDNTPAGEDDFDPRHPSSRPVKNDDFGDFSTAFSGNGNDEGFADFTSAFHNNNTQTVSAPTSNLQLLSELSPAMPSLTPGLTPGLAPGLTPALDPLSSHFDSALNITDGDRPTHSDRLRAEMKKLVNILHVMERIKSEGDVADVNGRIQVIKRYLPGPVTVQKLSRCDSRLIDHEALEVFSQLLGSIVRVLLPHWPEFRDEVVYLFTVEEGFDVSNEILTNLCGYVKEDRNDVVLEALGYVTLKFAKSDAVLASIIDCSVAGEDVRLMTDWEGYVQLLTTLPERIANRLEIKTPIEFSHENYSFILLFQVIRSVDYMCQSNFYQGTLYNLSYLSYLVSKYVVYYIKTEAVLKLCDMLIAWTDDNNDDPYRFVRRKLIQTVLNKLSRQAIDKLALILLKRCPIVYSSKIQPIKLLLGNNLELNKDWHEILTFRIPFYVHPQNFRDTTIPENLVYYIATTKNALDILTNLIVTLAKTWADVHLNNVINIDHHMHTSVLLVLAIKYRIIMWRQRKGVWNLIEIKRMLYKGMSKHLDILTTEFRCVGMATVEIVCKMLVEVDDSDRAAVERLNFEFNELGQVCVDIYNTLVTITNKCVLDDRAKPPTAERRLIDAQQLMDVIAEKVTDHVEKPVQNTIVTCAVKGPQQTKEIVKTIISAKLDALKGGRNLDLDSDDDLQPYDMTNDVSVASKKKPNYLRDLLEVVREAKDQESFEAALTSAENLVKKQLKHEDGKLAIELLDLFVHLEEKYHVDKFKSLKFNTAVAIVCSQPRVCAEHLCKEVHSDIGRYSISTKIFMLDVFTEAAERIADIKTDPSYEIHKKAEIIIEAKELPRDEVLRRRLLKKTKFIHSKRAHPFSKAKKNQFAPVSDYFFYPLIAGFGYRQLTLSHHNLKQDIDNLLLLRYLSAVGSVVLAAKNCPKCPVYCREILQMVLFLRFTPHPELQLCVISIIAAIALALPQSMLKGEFYDVMMELCSWVIDLLTHADLSHRLGGPKSEATVFAGESHVPSTLITVVDAPSVSLQPSLQPCRLQPNPQPIQPSAALSQRSGAAAVNNNQSKLAPRLGATWADSAASTIIDVDNLLSPRSPKAGPAPSINQLKSNPASPAHRPAWPVASNSNNNNNLTTDDLLQ-