Monarch geneset OGS2.0

DPOGS215936
TranscriptDPOGS215936-TA4722 bp
ProteinDPOGS215936-PA1573 aa
Genomic positionDPSCF300308 - 53824-69159
RNAseq coverage275x (Rank: top 39%)
Annotation
HeliconiusHMEL0076750.070.57% 
BombyxBGIBMGA001866-TA0.058.24% 
DrosophilaDab-PB1e-7983.02% 
EBI UniRef50UniRef50_B4H7C52e-8563.60%GL13283 n=1 Tax=Drosophila persimilis RepID=B4H7C5_DROPE
NCBI RefSeqXP_002026754.14e-8663.60%GL13283 [Drosophila persimilis]
NCBI nr blastpgi|1951719288e-8563.60%GL13283 [Drosophila persimilis]
NCBI nr blastxgi|3867712761e-9635.77%disabled, isoform D [Drosophila melanogaster]
Group
Gene OntologyGO:00055155.1e-42protein binding
KEGG pathwaycin:1001775282e-43 
 K12475 (DAB2)maps-> Endocytosis
InterPro domain[24-164] IPR0119935.1e-42Pleckstrin homology-type
[33-165] IPR0060203.9e-35Phosphotyrosine interaction domain
Orthology groupMCL17573 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215936-TA
ATGTTCAATCACAAGCTACCGACTGACTTTCCAATGCAAACCCTTCGTAAGAAAACCAGCCCTTGCAAATATAAGAACGAGCCGGGTCGATTTCTGGGTGAGGGGGTCTCTTTTCGGGCAAAATTGATCGGAGTTCTCGAGGTACCGGAAGCGAGGGGGGACAGGATGTGCCAGGAAGCGCTGGCTGACCTCAAGATGGCCATCAGAGCCGCCGGGGAGCACAAGCAGAGGATACAAGTACACGTTGCCATCGACGGGCTAAGACTGCGTGATGATAAGACTGGAGATTCTCTGTATCATCACCCAGTCCACAAGATATCCTTCATAGCCCAGGACATGACAGACTCCCGGGCCTTCGGATATATCTTCGGATCACCAGACACTGGACACAGATTCTTCGGCATCAAGACTGATAAAGCTGCTAGTCAAGTAGTTATAGCGATGAGGGATCTCTTTCAAGTGGTTTTCGAATTGAAGAAGAAGGAAGTGGAAATGGCGAAGCAACAGCTCGAGGGGAAGACGGTTAGCAGTCTCGCTAGACATGCCGCTGTCACTGCCACAGATAAAGCCAAGGATTCTCTGTATCATCACCCAGTCCACAAGATATCCTTCATAGCCCAGGACATGACAGACTCCCGGGCCTTCGGATATATCTTCGGATCACCAGACACTGGACACAGATTCTTCGGCATCAAGACTGATAAAGCTGCTAGTCAAGTAGTTATAGCGATGAGGGATCTCTTTCAAGTGGTTTTCGAATTGAAGAAGAAGGAAGTGGAAATGGCGAAGCAACAGCTCGAGGGGAAGACGGTTAGCAGTCTCGCTAGACATGCCGCTGTCACTGCCACAGATAAAGCCAAGGATCGAAGAATTCAATCGGCACGACTAAAACAAACAGCTGCATACCACGAGCGACCACAGCAGTCGCTCAATGTCGGCCAGAGTAAGATGAGAGACAGGCGGCAAATACGACGGTTGAGATGTTTCTGTAAACAAAAATACGTGAGATTTGAGAATGCGCTCGATGAAAACGAATCTAAATCAAGTTGTCGTAGCGTTCCATCCGGGGAGAGCGCTAAGAGCGAGGGTCCGGAAGGAGGAGTGGCGGAGCTGGTAGACTTGGAGCAAGAACTATGTTCACTAAGACGTGGTCTCACTCAAGTTGAAGGACTCACACCTAGTACAGATCCTTTCGGTGACTCCTTTATTGTACCAACTCAGAAGGGCTTACTGCCTCCTCCTCCTCGAGGCCGTGCTCCCCCACGAGCACCATTTTCGCCCAGCAAGCCGAGCTTCGATCTGACCTCCGATATCGAGCCCAAGATAGACTTACCGGATATGGAAATACCAGAACTGCCTGATCTGAGGGAGCCTAAAACCACCATGGTGCGTGGTGGTGGCTACGACGTGTTCACGGACCTCGACCCACTGGGCACCGGACGGAGTAAACCTTACGTTGACAAAAAACTCTTCTTTCAGGAGCTCAAAAATCCACCAAAGAAAGTGCTCAAAGACCTCGTACCGACGACGCATTCCCTTCTATCAGATATCTTGCCGGTAGAGAAATCGGATTACGGGCCGCCCACCACGATGACGTCACTGACTCGTCACTCAGGAGGCACGGTCGCGGTAGCGAAACCGACTCAGCAGACCTTCTCACCGAACTTCTTTACATCGGATCCGTTCGCCGAGACGGATCCCTTTGATAACACGGACCCCTTCTCAGACACGTTCAAGGACGATCCCTTCACCACCATGGCTGAATTCCCTAAGATCAGCACCCTGAGGATGGAGGAGCCGAAGAATAGGGCCAAGGAATCAATACTTGACAAAGACAGTCTGGACGCGGATAAGAACGTGTTCAATGGACCGCTCCAAGTGTCTCTGCCGCCGGAATCCACGCCCAAATCCCCGAGACTAACCAGACAGAACACGGACACAAGTACGATACGCCAGCGACCGACACCTGGTCGTATTAGTGGTGAGGGTGCGTCCCCCCCTCCCCCGCTCCCCCCCAAGAAGACCGCTTCCGCTGACCGCAGCACTAAGCCGCGTCAGTCGTCCCTGGCGACGAGCTCGGAGGACGAGTACCTGTCCCCGGCGCCGCCTCCCCTGCCCGCGGCTCGTAGACTTGACATCACACTCAGCCAGCTCTTGACACTCACCATGGACGACCTGGCCGCTCGACTTCGGGTTCCAGCTGAAACATTGTCTTCAATGACGCTGCCTCAACTGACTGACTATCTGAGATCATATGTGGCTTCCGAGAACGAGAAGGCTCATATACACGTGGAACCGCTGATGAGACCGGAGAAGGAAAAGGTGATGCCAATAATACCCACGACTGTATCAGAGAAATTAGTAACTCTAGCGCCGGTGAGCCAGTTCAGACCTCAGTTCGAAGACAACTTCGCGCCGTCGGACTCCGCGGACACTTTCGTAGCGAACTTTGACGACTTTGATAAGAAAGCCAATCCAACTTACGACAAATACGCCGCCTTCAGAGAAATACAGGAGCAGGAGCTGAAGGCGAAATCCATATTAGACCCGATAGAACCCGAGAAACAAGAAGATGGTGAACTGACCATCATAGAGAAGCTGATCAAGACCGAGGAACAGAAGATCGAGGAACGTAAGGAACTAGACAAGAGTCCGCTGAAGAGTTTGGACGAACTGACGATAAACTCGTTCAACATGTTCAGGAACACCGTCAGCCCGAAACCCATCGACGCCAAGATAGAAGATATAAAGACGGTCATGAAGAACCTGCAGATAGAACAGCAAAGGCGCAGCGTCTCACCCAGGGACAACGGGCTGCCGGACACCAAACAAGAGGACACCGGCGACAGGTACGCGGCCTTGAGGGAAATCACCATAACGGAGCCGCCGGAAGACTTCGAATCGATACCACCCGAGGCGCCCAAGGAGAGGAAGAAATCTGACGAGAAGTCCGACGGGTTTGATAACTCAGATTTCTTTGATTGCATCGATAATAGTTCGCTGTCTTTTAACGTGGAGGACGCGTTCAGAAAGAGTCCCATAGTGAAGGAGAAGGAGGAAGAGAAGAAAGTAGAAGAGAAAAAGGAAGATCCTCCGATAGAAGAGATTCCAGTAAGAGATCTACAACCACCAACGAGACTTTCGACGGGTTCCATCAGTGACGTCGCCAGTGGATCCTCGCCGGATACTAAAGGTGCGGTGGTGGGGGTGGTGGGTGGTCTCACTAGACTGGGCTGGACGGAAGGCTGGTCCTCTGATTCGCGGGAGTCCGAGCCTCGGCAAAGGAGAAGACGGACACACAATAGAGGACCCTCGGTACAGACATCATCATCGTCTCGCGACGTGTCTCCGTGGGACGAGGAGCCGCCGACCAGACTAGCACGACCCCCCTCGCATAGGGACAGGGGAAGGGACTCTAGGCACAATTCATCCGGTTCAAGAGAGTGTCTCGACTCAGGCAGCGGGAGGGAGAAAGAGAAACGAGACAAACGAGGCAGCAGAGAAAGAGACCTGAACAGAGATGGCAGGGACTCTGCGAGAGATAGGAGAGACTCCGGTAGTGGGAGAGATAGGCGTGACGTCAGCAAAGAAAGAGATCACAGAGACCACAGAGAGGGCAGAGACAGAAGAGATAGACGGAGTAGAGAAAGAGAAAAGGAGTCTTGGAACAGAGAGAGGACTTACAGTAGAGAAAGGGACTACAGGGACTCGAGGAGCAGAGAGAGATCTAAGGACTATAGGGACATGGACAGACATAGGAAGACAAGACATTCGTATGATGACGGAGATTACAGTGACGGATGCGGTTCAGGGAGATCTTCACCCAGGGAACGCTGGAACGAAAGATGGAGGAGAGATGAAAGAAACTATGGTTCCTTGGGCTGGAGGGAGACTAGAGGGAGACGGAGGAATGAACTGAGGCAGGCGGAGGACAGCAGTGAGAGGAGATATGGCTGTACTCTCGGCTGGCCCGGACGACGGTCTGCTGGTAGTACAAGGAGAGAGACGGATAGAGAGATCAGAGATTCCAGGGAATCTCGGGACTCGGGGCGGGATTCTGGACGGGATTCGAGAGACCCGCGGGACTCCGGCAGGGAGTCGGGCCGAGACTCCCGCGATCCACGGGATTCTGGACGGGATTCGGGTCGAGAATCCCGAGACGCCAGGGAACCGGTCCGGGATTCACGGGATCCGAGACGACGGCGCAGGGAGGATGCGAGAACTAGACCACGGGATTACAGGTTCTCGAACGACTTCTCTCCTCGTGAGAGAGAACACCCCTCGCCATTTGATAATGATTTCCAAGATACTCCTGTACCGAGCGTGAAGAAGAAGACTCCTTACGCCACTCTGGAAGGAACCAGGTTATTATTCGAGGCGGAGGACAGTACGGCGTCCCCTCTCTCAGCCGGGTCCCTCCGCCGCGAGGGCGAGTCCCCCGCGGCTCGTTTCCGGTTCGACTCCGACTCGGTCTCCCCGCGGTCCATGTTCGAGGACGACTTCACGCCGCGGGCCCCATCCATCGCGGAGGAGGACGAGGGGGACCCGGCCCCGCTGCGGACGAGGCCCCCGCGTGACGTCAGGAAGTCCGACTCGGTGGACATCTTCACTCGGGAGTCCGACCCCTTCGAGGGCGACGCGTTCTTCGCCTGCACCGGCTCGGACCGTGTGGCGCGGAGAGAGAACTGGCCGGGCGACTTCCAGGGCTTCGACAACGTGTAA

Protein sequence:

>DPOGS215936-PA
MFNHKLPTDFPMQTLRKKTSPCKYKNEPGRFLGEGVSFRAKLIGVLEVPEARGDRMCQEALADLKMAIRAAGEHKQRIQVHVAIDGLRLRDDKTGDSLYHHPVHKISFIAQDMTDSRAFGYIFGSPDTGHRFFGIKTDKAASQVVIAMRDLFQVVFELKKKEVEMAKQQLEGKTVSSLARHAAVTATDKAKDSLYHHPVHKISFIAQDMTDSRAFGYIFGSPDTGHRFFGIKTDKAASQVVIAMRDLFQVVFELKKKEVEMAKQQLEGKTVSSLARHAAVTATDKAKDRRIQSARLKQTAAYHERPQQSLNVGQSKMRDRRQIRRLRCFCKQKYVRFENALDENESKSSCRSVPSGESAKSEGPEGGVAELVDLEQELCSLRRGLTQVEGLTPSTDPFGDSFIVPTQKGLLPPPPRGRAPPRAPFSPSKPSFDLTSDIEPKIDLPDMEIPELPDLREPKTTMVRGGGYDVFTDLDPLGTGRSKPYVDKKLFFQELKNPPKKVLKDLVPTTHSLLSDILPVEKSDYGPPTTMTSLTRHSGGTVAVAKPTQQTFSPNFFTSDPFAETDPFDNTDPFSDTFKDDPFTTMAEFPKISTLRMEEPKNRAKESILDKDSLDADKNVFNGPLQVSLPPESTPKSPRLTRQNTDTSTIRQRPTPGRISGEGASPPPPLPPKKTASADRSTKPRQSSLATSSEDEYLSPAPPPLPAARRLDITLSQLLTLTMDDLAARLRVPAETLSSMTLPQLTDYLRSYVASENEKAHIHVEPLMRPEKEKVMPIIPTTVSEKLVTLAPVSQFRPQFEDNFAPSDSADTFVANFDDFDKKANPTYDKYAAFREIQEQELKAKSILDPIEPEKQEDGELTIIEKLIKTEEQKIEERKELDKSPLKSLDELTINSFNMFRNTVSPKPIDAKIEDIKTVMKNLQIEQQRRSVSPRDNGLPDTKQEDTGDRYAALREITITEPPEDFESIPPEAPKERKKSDEKSDGFDNSDFFDCIDNSSLSFNVEDAFRKSPIVKEKEEEKKVEEKKEDPPIEEIPVRDLQPPTRLSTGSISDVASGSSPDTKGAVVGVVGGLTRLGWTEGWSSDSRESEPRQRRRRTHNRGPSVQTSSSSRDVSPWDEEPPTRLARPPSHRDRGRDSRHNSSGSRECLDSGSGREKEKRDKRGSRERDLNRDGRDSARDRRDSGSGRDRRDVSKERDHRDHREGRDRRDRRSREREKESWNRERTYSRERDYRDSRSRERSKDYRDMDRHRKTRHSYDDGDYSDGCGSGRSSPRERWNERWRRDERNYGSLGWRETRGRRRNELRQAEDSSERRYGCTLGWPGRRSAGSTRRETDREIRDSRESRDSGRDSGRDSRDPRDSGRESGRDSRDPRDSGRDSGRESRDAREPVRDSRDPRRRRREDARTRPRDYRFSNDFSPREREHPSPFDNDFQDTPVPSVKKKTPYATLEGTRLLFEAEDSTASPLSAGSLRREGESPAARFRFDSDSVSPRSMFEDDFTPRAPSIAEEDEGDPAPLRTRPPRDVRKSDSVDIFTRESDPFEGDAFFACTGSDRVARRENWPGDFQGFDNV-