Monarch geneset OGS2.0

DPOGS201547
Transcript: DPOGS201547-TA, 3351 bp
Protein: DPOGS201547-PA, 1116 aa
Genomic position: DPSCF300006 + 1880382-1895567
RNAseq coverage: 212x (Rank: top 46%)
Annotation
Heliconius: HMEL006982, 0.0, 74.96%
Bombyx: BGIBMGA002731-TA, 0.0, 65.85%
Drosophila: X11Lbeta-PA, 0.0, 76.07%
EBI UniRef50: UniRef50_E2B9H9, 0.0, 77.32%, Protein lin-10 n=7 Tax=Formicidae RepID=E2B9H9_HARSA
NCBI RefSeq: XP_002432583.1, 0.0, 63.39%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|307211613, 0.0, 77.32%, Protein lin-10 [Harpegnathos saltator]
NCBI nr blastx: gi|270014439, 0.0, 45.48%, hypothetical protein TcasGA2_TC001711 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 1.8e-45, protein binding
KEGG pathway:
InterPro domain: [771-918] IPR011993, 1.8e-45, Pleckstrin homology-type
InterPro domain: [773-914] IPR006020, 4e-36, Phosphotyrosine interaction domain
InterPro domain: [905-1037] IPR001478, 7.3e-20, PDZ/DHR/GLGF
Orthology group: MCL10396, Multiple-copy universal gene
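
The InterPro hits listed above overlap one another along the protein. A minimal sketch of how their spans and overlaps could be computed is given here. It assumes the bracketed coordinates are 1-based, inclusive amino acid positions on DPOGS201547-PA, which the record does not state explicitly. The helper names `span_length` and `overlap` are illustrative only.

```python
# InterPro hits as listed in the record: (accession, description, start, end).
# Assumption: coordinates are 1-based and inclusive on the protein sequence.
DOMAINS = [
    ("IPR011993", "Pleckstrin homology-type", 771, 918),
    ("IPR006020", "Phosphotyrosine interaction domain", 773, 914),
    ("IPR001478", "PDZ/DHR/GLGF", 905, 1037),
]


def span_length(domain):
    """Number of residues covered by a domain hit (inclusive coordinates)."""
    _, _, start, end = domain
    return end - start + 1


def overlap(a, b):
    """Number of residues shared by two domain hits, or 0 if they are disjoint."""
    start = max(a[2], b[2])
    end = min(a[3], b[3])
    return max(0, end - start + 1)


if __name__ == "__main__":
    for dom in DOMAINS:
        print(dom[0], dom[1], span_length(dom), "aa")
    for i in range(len(DOMAINS)):
        for j in range(i + 1, len(DOMAINS)):
            shared = overlap(DOMAINS[i], DOMAINS[j])
            print(DOMAINS[i][0], "/", DOMAINS[j][0], "share", shared, "aa")
```

Under that coordinate assumption, the Phosphotyrosine interaction domain hit lies entirely within the Pleckstrin homology-type hit, and both run a few residues into the start of the PDZ/DHR/GLGF hit.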
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201547-TA
ATGCAGCACCAGCACGATAGACTAAGTACTGGTCTGGAAGGAAGAGACATGATTAGCGTAAACAACGCGTCCTCTGATGAAGAGCAAGACCAAGTCATCCTTAACACTAAAGGTGGCACTTACAAGCTACGGGACTCGAGGATAATAGAAATCGCCGGAGGCAGAGAACTGTACTCACAAAGTAGGGCGAAGGCTGTGAGAAAGTTGCGACATAGCACCACTCACAAACACAACCTCAGGAACAAAGTGCGTTCACTCAACAAAGACGCCGTAGACTATACCAATGATAATAACATCAATAATGAAATAACTATACACAGAATTAATAGTGAACCAAAAAAGATGATATCACGCCATACATCACCCGTAACATACACGAATGGTGATCGGGAAGCGACTATAGAACAGAATCCCATGGGGATACCAAAGAACAGTTCACCTATACTAGACCCACATAAAATAAATGACGTGAGCACAAAAAGCAGTCCGATAGCAGTCGTGGCTGATGGGGAAGTTATGGTTTTTGATGAAAACGATGATTGGAAAGGTTTGAGGATGGACCCAGACGCTAGTGACCACGAAATAGATTTGAGTGTTAAAAGGACTTTAGACAATATGAAAAACGGTGATTGTGACATGTTGAACAATAAAAGTGAGTTGAGTTGTGATGAGAGTGAGTCTTCGAGGTCGGAGGATGTCAGGTTGAGTGAAGGGAGTCCGACTGCCGTATGGCTGTCGGGGGATGAAGATAAGAACAGGGTCACTAGGGTCATAGGGGAACTGCCGATAGCTGAGTACGAAGGCTCCCCGAGAAGATATGGCATCGGGTCACAGAAGGCTCCTGTCAGGTCACCGCGACCAGGGTTCCCTCAGAGAGTACTTCCTGAGAGTAGAGATGCAACGCCTCCGCCCGGCTCAGCGGCTTTCGATTATTTGTATGAGTTCTCGGAAACAAGAAAGGTCCTAGAAGAGTTCTTTAGATGTCCCACTTTACCTGGACAGCAGCAGAATATTAATGACGAGATAGTCAATTTCCAGGAACTCGATTACGAACTTCGAAGACAAACGCAAGATTGTCCGTCAGAAAATGGCATCGACGGGAGCGAGTCGGATTGGCCTCAGAACGAACCAATAGAATCTCCACACGCCTTGAATAATCACACGGACTTCTTGGGACTACAGCAAAAGCGTGGTCGCTTCCCGTCTCCAGAAATATCAGCGCTACAGCCGGGTGACGGCGCTAGTGTTTGCGGCTCCCCACCTGGATCTAGCTCACGCAGTGACCTTTCAGTGGCCGGCGCAGAAAGCGAAATGCCCGTCAAAGCCTGTGTCAGATACCCCGCGATGTTTCCTGCCAACGAATCTGAGATACAGCTGTCGATGGCCATCATTGAGACGGACTTAGCAGAAATGCAAGTTCAAGAAGGACTTCGCACGCTGCCCATAGTCGAGGACGGTCTCTCATCGGGACACACGTCAGACGCGGACAATAACAATGCCATTTTACAGACGCATTCTGCGACGATCGAGGAACAAGTGGACAGTGTCAATAACACTGACCTCGTCCTCACCGACGCTGATCTGCACTCCTTGGATCCGTTAGCTATGGACGTTTGTATGGGGAAAAGGCCCTCGTCGGCTCAAAGTACGGAATCAGTTGAAATAAAACCAGCAAGCGGCGATGAAGACGAGGCTGACACAGATTTGGAGACGGACAGACTGCTCGGCCAGCAGAGGACTGATGATCAAGGATTCTTTGATGATAAAGTTGAGGTACATCGTGAGGATGACGATGACTCCGATAGAAAAGATGATAATGATGTGCTCGGTTTAAACGTTAACGAAAATGACACCTCCGACCAGGGCTCTTGTTTTCTAAGTGTCGAAACAGAATATAAAACAAAATTTAATTCAAACGATGAAATAAGTAAAGCTGGTTGTAAAGAAAACGTTCGCGATACAATCACTGTGGAAGGAAGTGATGTATTAAGTGATTCAGG
CTGGCGTAAACCTAAGTCTCGTACTGTGATGCCGTCGGGTGGGTCAAGGGACGCTCATGATGATCACAGGGAAGACGAAGTGCATAATGGAAAGGATAAGGACGGGAAAAAGAAAAGCAAGAACAAGGAAGATTGGGCCGCAGTCTTGATAGAGGGCGTGCTGTTCAGAGCCCGTTATCTAGGCTCCACCCAATTGGCGTGTGAAGGTCAACCGACCAAAGCTACAAGGATGCTGCAAGCTGAAGAAGCTGTGTCTCGGATCAAGTGGGGCTGCGAGGGACCCTACTCGGGGGTGTACGCGGCACCCGCGGCCGTGTTCCGCCTCTCATTCCTCGGCTCCGTCGAGGTCGACGAGGACTCGAGACGGCGGAAGAGGAGACCCAAAAAGAATATGGTCGAGGAGGCCGTCACCAAAATTAAGGCTCCAGAAGGTGAAAATCAGCCGAGCACCGAAGTTGATCTTTTCATATCAACTGAGAAGATTATGGTCCTTAACACTGAGTTGAAGGAAATAATGATGGATCATGCTTTGAGAACGATATCTTATATAGCTGATATAGGGGACCTGGTTGTGTTGATGGCAAGGAGACGGTTTGTACCCCACGAGAATGATTCCGATCAACCAAAATTGAACCGTACGCCTAAGATGATATGTCATGTTTTCGAAAGCGAAGAGGCCCAGTTCATAGCTCAGTCCATAGGCCAAGCTTTCCAAGTTGCGTATATGGAGTTCTTGAAGGCCAACGGTATAGAGGACCATAGCTTTGTGAAAGAAATGGACTATCAGGAAGAATTGCAGAAAGAGGTGGTTGTCCCGAAGACGAAGGGAGAGATCCTCGGAGTGGTTGTGGTGGAATCGGGCTGGGGTTCCATGCTGCCCACTGTGGTCATTGCGAATCTCGCACCAGCCGGAGCGGCCGCGAGGTGTGGACAGCTCAATATTGGTGATCAGATAATCGCGATAAACGGTGTAAGTCTCGTGGGTCTGCCGCTGTCTACATGTCAGACTTATATTAAGAACTCCAAGAACCAGACGGTGGTGAAGCTGACGGTGGTGCCGTGCGCGCCCGTCGTCGAGGTGAAGATCAAACGACCCGACACCAAGTACCAACTGGGATTCAGCGTTCAGAACGGCGTGATCTGCAGTCTCCTCCGCGGCGGTATAGCTGAGCGCGGCGGCGTACGCGTCGGTCATCGCATCATAGAGATCAACTCTCAGAGCGTGGTGGCCGTCCCCCACGAGAGGATCGTCAACCTGCTGGCCACCTCCGTCGGAGAGATCCTAATGAAGACGATGCCGACGTCCATGTTCCGGCTGCTGACCGGCCAGGAGAACCCGGTGTTCATTTAA

Protein sequence:

>DPOGS201547-PA
MQHQHDRLSTGLEGRDMISVNNASSDEEQDQVILNTKGGTYKLRDSRIIEIAGGRELYSQSRAKAVRKLRHSTTHKHNLRNKVRSLNKDAVDYTNDNNINNEITIHRINSEPKKMISRHTSPVTYTNGDREATIEQNPMGIPKNSSPILDPHKINDVSTKSSPIAVVADGEVMVFDENDDWKGLRMDPDASDHEIDLSVKRTLDNMKNGDCDMLNNKSELSCDESESSRSEDVRLSEGSPTAVWLSGDEDKNRVTRVIGELPIAEYEGSPRRYGIGSQKAPVRSPRPGFPQRVLPESRDATPPPGSAAFDYLYEFSETRKVLEEFFRCPTLPGQQQNINDEIVNFQELDYELRRQTQDCPSENGIDGSESDWPQNEPIESPHALNNHTDFLGLQQKRGRFPSPEISALQPGDGASVCGSPPGSSSRSDLSVAGAESEMPVKACVRYPAMFPANESEIQLSMAIIETDLAEMQVQEGLRTLPIVEDGLSSGHTSDADNNNAILQTHSATIEEQVDSVNNTDLVLTDADLHSLDPLAMDVCMGKRPSSAQSTESVEIKPASGDEDEADTDLETDRLLGQQRTDDQGFFDDKVEVHREDDDDSDRKDDNDVLGLNVNENDTSDQGSCFLSVETEYKTKFNSNDEISKAGCKENVRDTITVEGSDVLSDSGWRKPKSRTVMPSGGSRDAHDDHREDEVHNGKDKDGKKKSKNKEDWAAVLIEGVLFRARYLGSTQLACEGQPTKATRMLQAEEAVSRIKWGCEGPYSGVYAAPAAVFRLSFLGSVEVDEDSRRRKRRPKKNMVEEAVTKIKAPEGENQPSTEVDLFISTEKIMVLNTELKEIMMDHALRTISYIADIGDLVVLMARRRFVPHENDSDQPKLNRTPKMICHVFESEEAQFIAQSIGQAFQVAYMEFLKANGIEDHSFVKEMDYQEELQKEVVVPKTKGEILGVVVVESGWGSMLPTVVIANLAPAGAAARCGQLNIGDQIIAINGVSLVGLPLSTCQTYIKNSKNQTVVKLTVVPCAPVVEVKIKRPDTKYQLGFSVQNGVICSLLRGGIAERGGVRVGHRIIEINSQSVVAVPHERIVNLLATSVGEILMKTMPTSMFRLLTGQENPVFI-
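
The transcript and protein lengths in the record agree with each other: 3351 bp is 1117 codons, that is, 1116 amino acids plus one stop codon, which the protein sequence writes as a trailing `-`. The sketch below shows one way to check this, assuming the standard genetic code and reading frame 1. The `translate` function is a hypothetical helper, and only the first and last few codons of DPOGS201547-TA are used as inputs to keep the example short.

```python
# Standard genetic code, built in the conventional TCAG order.
# Stop codons are written as "-" to match the record's protein sequence.
BASES = "TCAG"
AMINO = "FFLLSSSSYY--CC-WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def translate(nucleotides):
    """Translate a coding sequence in frame 1; the length must be a multiple of 3."""
    seq = "".join(nucleotides.split()).upper()  # tolerate line-wrapped FASTA
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    first_codons = "ATGCAGCACCAGCACGATAGACTAAGTACT"  # start of DPOGS201547-TA
    last_codons = "CCGGTGTTCATTTAA"                  # end of DPOGS201547-TA
    print(translate(first_codons))
    print(translate(last_codons))
    print(3351 // 3 - 1, "amino acids expected before the stop codon")
```

Passing the full nucleotide sequence to `translate` (with the line break inside the sequence removed, which the function already handles) should reproduce the listed protein sequence if the assumptions hold.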