Monarch geneset OGS2.0

DPOGS211581
TranscriptDPOGS211581-TA4656 bp
ProteinDPOGS211581-PA1551 aa
Genomic positionDPSCF300084 - 230887-252355
RNAseq coverage1356x (Rank: top 9%)
Annotation
HeliconiusHMEL0169970.065.59% 
BombyxBGIBMGA006517-TA0.071.08% 
Drosophilascrib-PL7e-13846.30% 
EBI UniRef50UniRef50_Q17IJ73e-17645.19%Putative uncharacterized protein (Fragment) n=2 Tax=Aedes aegypti RepID=Q17IJ7_AEDAE
NCBI RefSeqXP_001603194.10.048.43%PREDICTED: similar to CG5462-PH [Nasonia vitripennis]
NCBI nr blastpgi|3454894800.048.43%PREDICTED: protein lap4-like [Nasonia vitripennis]
NCBI nr blastxgi|3454894800.044.48%PREDICTED: protein lap4-like [Nasonia vitripennis]
Group
Gene OntologyGO:00055159.4e-28protein binding
KEGG pathwaydre:1144461e-30 
 K12076 (DLG1)maps-> T cell receptor signaling pathway
InterPro domain[496-616] IPR0014789.4e-28PDZ/DHR/GLGF
Orthology groupMCL10300 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211581-TA
ATGGGTGTGATAAGGAAGAAGAAACGCGAATCGGTATCGGAGCGTGCTGACAGCGAAGTACGTATAGTGAACGACCAGAAATGTATAGAGGTGCGCATAGCACGGGCGGCTGGTGGTCTGGGTCTCAGTATAGCTGGAGGAAGAGGATCCACACCATATGTAGGGGACGACGATGGCATCTTCATATCAAGAGTCACTCCTAATGGACCCGCATACCTCGCTGGGCTTAGGGTCGGTGATAAAGTGCTGTCAGTGAATGGTACGTCAGTGGTAGATGTCGATCATTACTACGCTGTTGAGGTTCTGAAAGCGAGCGGACAGACACTCACTCTGGTTGTTACTAGAGGCTCGCCAACGTCCACTAGAACTCACAGTAGAGCCCCGAGCGGTGCAAGTCATCACTCCATATCCAGCACCACGGACACTGTATCCACATTGGAAAATGGACAGGTTCCGACCGGACGCGGTATATTATCGAAGCCTCTCATTATCAGCAATCAGTACGCTGAGGAGTTCGATCCGGAGATAAAGGAAACTGTCAGCAATCAAACAAATATAGGATACGGAACCAACTATCAAAGACTACAGGCTTCACCTCCACCTATAACACAGCCTTATGTACCACCGGCGTATCAACAGAAGGTGCTAGTTCATACGACGCTGATCCGGGATGGGACTGGTCTTGGGTTTAGTATTGCTGGTGGCAAAGGTTCCCCGGCTTATAGAGAAGACAGTGACGCCATCTATGTATCCCGGATAAGTCCTCAGGGAGCCGCTGCCAAGGATGGGAAAATGTTGGTCGGGGATAAGGTCGTCTCGATCAACGGCGTTGATATGGAGCAGGCGACTCACGAAACAGCTGTCTCCTTACTGACGGGTCACGAACGTTTCGTGAGACTTGTACTACAACGGACAATAACACAGGAACAAGGAGATCTTATACCACGGAAGTCAACATCGGAAGACATAAAGGAGCACAAGACGAGCCTGCAGAACGTGAACATCACGCCGGCACAGAAACCCACCGCCAACCACGTGCCATTGATTGGCAACCATACAGCACCACGCGCGAACACACACGCACCGCAACCTAACACGCAGACAAACACGCAACCAACAACGCAACAACACACACATACACAAAAACCGTTGATCACACAGACACACACATTCGCAGCACCACAGCCAGCCCCAAGAAAACTAAGTCAGACTAATGGAACGGCGAGACCGGCAACTAACAGTACGAGCACGCCTCTAACTCCCGGCGCAAATAAAATGCACGCATCTAGCGACGACGTGTTCGAAGATATCCAGCCGCGGGCGATCACCAGCGAGGACTTCCAAGCGATGATACCGGCGCACTTCCTCGGCGGTGGCGGCGGTGGCGGCGGTGGTAGCGGCGATGGTGGTGACGGTGGCCGCGTGCGTGTGGTGGTCGAGCGCCCGCGGCCGCCAGTGTTACCGCCGCCGCCTTCCACCATCGGCCGCGTCACCGAAACCATCACCAAGTCGACCTTCACCGAGACGACGGTCACACGCATCACTGATAACAAACTCGTCGAGCCACTCATCATTGAAGATGTTATATTGGTTAAGGACGGCGGCTCTCTCGGGTTCAGTATAATCGGTGGTACCGACCACTCCTGTGTGCCATTCGGCGGCAAAGAACCCGGCATATTTATATCACATATTGTTCCGGGCGGCGTGGCGGCTAGATCTGGCAAGCTTCGTATGGGCGACCGTTTGCTTAAGGTGAACGGCACCGATCTCCCCGGGGCCACTCACAGGGACGCCGTCCAACTGCTCCTACAGCCGGGACCCACCCTCACACTCACAGTACGACACGACCCACTACCGCTAGGATTCCAGGAGTTGACCATAATAAAGCAAGAAGGTGAAAAACTGGGTATGCATATAAAGGGAGGTTTGAACGGACAACGCGGCAACCCTAATGATCCAAATGATGAAGGAGTATTTATATCGAAGATAAATAGCGGCGGAGCAGCCCGGAGAGACGGCAGACTGAAGGCTGGCATGCGTCTGTTAGAGGTGAATGGCATTTCCCTTCTCGGTGCTACACACGCGGAAGCTGTAAACGCCCTAAGATCTGCATCGGACGCCCCACTAACACTAGTCGTCTGCCATGGATATTCACGTCCGGAGAAGAGTACTACGGGCAGTGAGAGCGGGACGGCTGATACCGGTGGTTCTCTGTCTCACTCTACCTCCAGTCTGGATAGAGACGAGTCCTTGCATCAACAACAGGAACAACATATCCGACAAGATTTAGTGGAGTTTGAACATGAGAAGCAAGTCGCCGAGGTGAGGGAGAAATCCACCCCTGAAAAGGTGCTAGACATAGTGCATGCTGTAGAGAGTATCGCCCTTGATCCTTCACCACCCACGCCGGAACCGCATCAGAAGACCACTACCGTCGTCATGTCTAAGCACACTCTGCACAACCAGTCGTCTAGTGCGCCGGTTGCGGTGTCACCGTTGGGGGTCACACAACACAAGCCGAAGAGTCCACCTGCTGATAGTCTGGGAACATCAATCGACCCACCGACTAACCCACCGCCGGTCAAACAGAAACCACAAGTGCCTAAGAAGCCACAAATGAAGCGAGTTAGTTTCACATCGGAAGCACAAGATATGACGAACGAATCGAACGAGCGAAACATAACGCATCTATCGCATGATACATCCCCGACGGATACTACTAACGATGCATGTCAGGGAAATAGCTCTTTACTAATAAAACATCGACCCACTAAAATTGTGAATGAAACCAAATCAAGTATCGTACGCCCCGAAGAATTTATCATCCCACCTCCGCCTTTATTCGCGACTAAAGTAAGTTTAAGTCATTTAGTACATGAGGCGGCACCTCAGCTAGACGCCCCCCTCATCACTGTTGTCCCTGAGAGAGTAGTTCTGCCAGAACCTATAGTGGCCGAAACAACTGTGCAGTCTAGTAACTCAACCAATTCACCAGAAACAGAATTCCCTCCACCCGTCAACTACAATGCGCATCCGATTTTAGATAGTTCTTTAAGATTAGATAGGGATCCTCTTGCAGCACCAATATTTTCTGAAGTGCCTTCACCTCAAGTCTTCCGTAATAAATTACCGTCTAGCGAGAGTAGTGAATACTTTGAAGAATTATCAGTTAGTCCTGTAGGAGAGCCAGAGCCACCGGAATTGTTGCATGCTCAATATAAAACTCTTCCTTCCGTATCACACGTACCGAATTCACCTTGCTACTTGCATCCTATGAGAGGCCAGTTCGATGACTCAGATGCCAATTACTACACTTTGCGCGACTTACGAGTTAAGAACATATCGCCGAACATGGTCCATTCTATAACTTATCCTCCTTGTAATCCATACCCCTCGTTAACGACGGTCGCGGGTACATTCAATCCATATACTTGCACGGTCCCATCAAGGGCATCTAGCGACGAATCGTTCTCGCCATCGAAACCGCAACTGTCACCGTCATTAATTAGCTCAGTAGGAAAAGATATAAACGCCAATGTCACACAACTTGTTACGATTAAAGAGGAAGATGCTCCTAAAAGTGTTCAATTAGAAAGAAGAGTCCGTTTCGGCGAAGTGACGACTGCGCCAGAAATTAATAGATTTTCAATGGGTTCCGAAACTGCGAGAGAGGGAAGTACCTCGCCCGCATCCACCCTGAAACCCTCATGGAGCAGCAAAATCAAATCCTTTGCCATCGGCGAGGCTTCCCCACCTCAATCTGGTAATTTTGTGAAAGCATCTGTGAGCGATAAGAAAAAATTCTTTGAGAATGCTATGGAAGAAAGTCACAAATCATCACCCAAGCCAGAGAAAGTATTCGCATTCTTATCAGCGGATGAGGTGGAGAAATTGAAGCAGGAAGAGGAGCGGAAAATGGCGACGCTCTCCCGAGCTGAGCTCGCCTCGTGGTCGCCCGACCGCCAGAGCGGAGACAGCGAGCATTCAGACGATGAACCGTATGAGAACGGTCACAGCGTTTCGTTGGTTGGTGTTTCCCCCTCAGCGAAATCTCAGCGTCGTGCGGGACGCGATGGTTTAGACGCTGAGGCTCGCGCAGCACGACGAGCAGCCTGGCGAGCAGCCCGTCTGCGCTCGCTAGAACAGGACGCGATTGAATCACAAAAAGCAATTAAAAGCATGGTGGGTCCAGCGCATGAAATCATAGGAGAGGTGCCGCCAAGGGATAATTCTCCTTCCGAGACATGGCCCCTGCCTCGTATCGCGTTACGTACAAAGCCGGGGCCAATACTGGCCGTGAAGGAGAAGGAAAAGTTGTTAGACGAGCGAATAACACTGCGAACTGAGGAATACGTCTGTCCGGCTACCGGCGAGACTAAAGTGCGAACCGTCGAGTACATTGAGAAAGTTATTGAGAAAGAGGTGGAGACGACTCAAGAGAAGATAATATCATTGGAATTAACAACGAGTCCGAGTAGTGAGACGGCACCGGATTTAGAATTGGGAGCTAATCTGGAGGCGGAAGAATCTGGGGAGAACTCTCTTCAGCCGGATATCATACAAGTGGTGAATCCCATACTCGATGGCGGAGCCGGTCACGTGACTTCCATACCCGTAGGCCCCGCCCACAGGCTCACCTACACAGATAAACAGTAA

Protein sequence:

>DPOGS211581-PA
MGVIRKKKRESVSERADSEVRIVNDQKCIEVRIARAAGGLGLSIAGGRGSTPYVGDDDGIFISRVTPNGPAYLAGLRVGDKVLSVNGTSVVDVDHYYAVEVLKASGQTLTLVVTRGSPTSTRTHSRAPSGASHHSISSTTDTVSTLENGQVPTGRGILSKPLIISNQYAEEFDPEIKETVSNQTNIGYGTNYQRLQASPPPITQPYVPPAYQQKVLVHTTLIRDGTGLGFSIAGGKGSPAYREDSDAIYVSRISPQGAAAKDGKMLVGDKVVSINGVDMEQATHETAVSLLTGHERFVRLVLQRTITQEQGDLIPRKSTSEDIKEHKTSLQNVNITPAQKPTANHVPLIGNHTAPRANTHAPQPNTQTNTQPTTQQHTHTQKPLITQTHTFAAPQPAPRKLSQTNGTARPATNSTSTPLTPGANKMHASSDDVFEDIQPRAITSEDFQAMIPAHFLGGGGGGGGGSGDGGDGGRVRVVVERPRPPVLPPPPSTIGRVTETITKSTFTETTVTRITDNKLVEPLIIEDVILVKDGGSLGFSIIGGTDHSCVPFGGKEPGIFISHIVPGGVAARSGKLRMGDRLLKVNGTDLPGATHRDAVQLLLQPGPTLTLTVRHDPLPLGFQELTIIKQEGEKLGMHIKGGLNGQRGNPNDPNDEGVFISKINSGGAARRDGRLKAGMRLLEVNGISLLGATHAEAVNALRSASDAPLTLVVCHGYSRPEKSTTGSESGTADTGGSLSHSTSSLDRDESLHQQQEQHIRQDLVEFEHEKQVAEVREKSTPEKVLDIVHAVESIALDPSPPTPEPHQKTTTVVMSKHTLHNQSSSAPVAVSPLGVTQHKPKSPPADSLGTSIDPPTNPPPVKQKPQVPKKPQMKRVSFTSEAQDMTNESNERNITHLSHDTSPTDTTNDACQGNSSLLIKHRPTKIVNETKSSIVRPEEFIIPPPPLFATKVSLSHLVHEAAPQLDAPLITVVPERVVLPEPIVAETTVQSSNSTNSPETEFPPPVNYNAHPILDSSLRLDRDPLAAPIFSEVPSPQVFRNKLPSSESSEYFEELSVSPVGEPEPPELLHAQYKTLPSVSHVPNSPCYLHPMRGQFDDSDANYYTLRDLRVKNISPNMVHSITYPPCNPYPSLTTVAGTFNPYTCTVPSRASSDESFSPSKPQLSPSLISSVGKDINANVTQLVTIKEEDAPKSVQLERRVRFGEVTTAPEINRFSMGSETAREGSTSPASTLKPSWSSKIKSFAIGEASPPQSGNFVKASVSDKKKFFENAMEESHKSSPKPEKVFAFLSADEVEKLKQEEERKMATLSRAELASWSPDRQSGDSEHSDDEPYENGHSVSLVGVSPSAKSQRRAGRDGLDAEARAARRAAWRAARLRSLEQDAIESQKAIKSMVGPAHEIIGEVPPRDNSPSETWPLPRIALRTKPGPILAVKEKEKLLDERITLRTEEYVCPATGETKVRTVEYIEKVIEKEVETTQEKIISLELTTSPSSETAPDLELGANLEAEESGENSLQPDIIQVVNPILDGGAGHVTSIPVGPAHRLTYTDKQ-