Monarch geneset OGS2.0

DPOGS211012
TranscriptDPOGS211012-TA5145 bp
ProteinDPOGS211012-PA1714 aa
Genomic positionDPSCF300004 + 1226803-1239660
RNAseq coverage161x (Rank: top 52%)
Annotation
HeliconiusHMEL0060400.051.55% 
BombyxBGIBMGA006495-TA0.050.39% 
DrosophilaCG14303-PA3e-2021.73% 
EBI UniRef50UniRef50_UPI00021A75955e-4828.66%UPI00021A7595 related cluster n=3 Tax=unknown RepID=UPI00021A7595
NCBI RefSeqXP_001649356.15e-4026.82%hypothetical protein AaeL_AAEL014694 [Aedes aegypti]
NCBI nr blastpgi|3838602192e-4823.13%PREDICTED: RING finger protein 17-like [Megachile rotundata]
NCBI nr blastxgi|3287800531e-8521.65%PREDICTED: RING finger protein 17-like [Apis mellifera]
Group
Gene OntologyGO:00036762.9e-14nucleic acid binding
GO:00082709.9e-07zinc ion binding
GO:00056229.9e-07intracellular
KEGG pathway 
InterPro domain[673-789] IPR0081918.8e-19Maternal tudor protein
[1579-1638] IPR0029992.9e-14Tudor domain
[135-177] IPR0003159.9e-07Zinc finger, B-box
Orthology groupMCL15040 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211012-TA
ATGGAAGGAAAAAATAAGAAATGTTGTCCGAAGTGTTGTCAGTTTTACGAGATTAATTATTTGAATTCAAAACCCGTGGGTGGTAATGTACCACTCTTTCTTTCATGCGGCCACACCATATGCGAAAACTGTGTTAGGACTCTCATAAAATGTGAACCATTGATCAATTGTCCAATATGTGAAAACAAAACAAACATATCTTCATCTGACATAGCTTTAATAGCACTGAACAAGTTTGCCCTATATGATTTCTTTCCTATAAATAATAATATGTTAGGTGAACTTATATTGCCGGATATAAAGCCGAACATTAATACATCTAGTGCAGATGATGAATATTTTCTTAATATCACAGAACTAATTAAAACTACAGAATCATCAAAAGATGTTAAACCAGAAAGTGCTTTTTGTCATATACACAAGAGCAAATTATTGGATTACTATTGCTATGATTGCGTCAAAGCTATATGTATGGATTGTTTAATGGTTGGCGGAGACAAATCTTGTAAGAACCACAATATCACATCCATTGAGCAAGTGAATGAATCTATGTTGAGTGAAGTCACCGAATTGGCACCAACGGTTGAGGATACACTTAAAAGACTTACAAAAACGGCATTTGATATTGGAAAGACATTAACAAGACTTAATGACGATAATAATTCTGACGTATTCACAAAACTTTTGAATGAAGTTGAGCAAAACTATTCGAAACTAAATGCTGCCATACAAAAACAAAAAGACTATATTGTTGCCAACATAATAAAAATGAAATTAATTGAAATAGATTCACTTAACGAAGCTAAATATAGTGTTGCTGAATCAATACAGAAAGCGAAATACACTTTGGATACAATTAATTCTATAGATACAAAAAAATTAAAAGAGGTAAATATGTCAGCAGTTTTAAAGGACGCCAAAGAAATTGTTAACACACCATGGTATTTAAATAGAGAGGAAACAGAGAGTTCATTAAGAATAAAGATAAACGACAATATCATACAAATGATAGACAGCTTCATACAACTAGAGGGGAGTGAATGTCATACGTACACTTTAAGTACTACCAGAGAACTTGTTGAAAAAAATGTGGATATCCCAAAAGCACCACCTACTATAGTGTATCCCCCTGAGATTGTTAAAGATGTGCGAGAAACGGTTAAATGCAAGACAGGAAAGGACAAAGATAATGTCAAAGCGACGAGTTTCATAAAAAAAATACCACAATACAAGAGCAAAATGGGATCGTGTTCTTCACTGAATTCTATAAACAGTGACACATCGCACAAAAGTTATCAGTCATATGATCAAACACCCACAATATCGTCTTATCCAGACGACCAACATTCGAAGCAATTTTTTGAAGGATCTCAAGAGCTGATATATATATCACATATCGTGGATCCTCACAATTTCTACATACAAAGGGCCTGTCATCAGTCTAATATCAAGGAAATGTTGAGGGAGTTCAAAAATGCGGTTTCAATGCCAAGGCCGTCTGTCAGCCATGTTACTGAAGGTAAAATGTATCTGGTATTTAGTAAAGTTGACAATATGTGGCAGCGTTGTGAAGTGTTGTCAGTGGACAAACGAAACGTTAACAAGCCGATATACAAAGTATTCTGTGTCGACTTTGGCTGTACTGAATTCGTTACCATAGATAAATTGCGTCTATTACAACGCGCCCGTGTTCAAAACCCTCCACACTTCGCTTTCAACTGCAGACTGGCCAACTGTGAGCCAATTAACGGAAGTTGGACCAGCGAAGACTCCATACTTATTCAGAATATCATTGACAATAAACAGGCTGTGCTTCATGTGCATCAATTACGATCGAATACCACGGGAGGCGTGTCTTTGGAAGGAGATGTTATCACGGTGGAACACGCAGTTAATGTAGCGCGGGCACTCGCCTTCCATGGACGGGCGAGGATACCTCATGCGACGAAATATCCAAAAATAAAGGCCATGACAGAGAAACCAAAACTGTTTATGAGTAATAACGATTTCAAACAAGGCACCGTGGAAGATGTTTATATCACGCATATTATGAGCCCGGATCATTTTTATGTTAGAAAGCAACACCTTCAAAGTGTTTACGAAAATCTATGTGAAGAATTGGATCACGAGTACAGTTTAAGTTCACAAAATGACTGTATATATTTACCAGAAAAAGACATGGTGGTGGTTGCTCACTGTACCCGTTGGTCCCGTGCTGTGATCCGTGAGCTGCCGGGTCGTGGTCGTGTGCGTGTGATGTGTGTAGACACTGGAGTATCGGAACTGGTGCATTGGACCGCATTGAGGAGACTGAAGACTAAGTTCACTGTACTGAGGGCGCTGGCAACTGAATGTCATTTGGCGGGAGTAACGCCTCTAAATAAAAAATGGAGCCCGGCTTCCGTATCGCTGCTACAAGAGTTTCAAGACAAGTTATTAGAGCTTTGTGTTGAAGACAATCGCAATAAAAACTCGTTGGGTGTCACACTCAACGACACAAGTGACGAAAGTAATGTTGTGTGCATCAACACGCTAATGATTAAACATAAGTTCGCTGCAACTGAATGTCATTTGGCGGGAGTGACGCCTCTAAATAAAAAATGGAGCCCGGCTTCCGTCTCGCTGCTACAAGAGTTTCAAGACAAGTTACTAGAGCTTTGTGTTGAAGACAATCGCAATAAAAACTCGTTGGGTGTCACACTCAACGACACGAGTGACGAAAGTAATGTTGTGTGCATCAACACGCTAATGATTAAACATAAGTTCGCTGTGAGTTTTGGACTTTTTATGTTTAACAAAAACACGGATATGGACGATCTAGTCATCACTAACAAATCGCCGCTCGATGAACCCAAACCTGTTATGAAAAGCGAAAAAAAAATTACAATCCTCAAAAAAGATACAAATATAGAAAATAAAACAGATGAAAAAAATTTGGAAGCGAAGGATAAAGGACCTCTCAGACTTGAAGCGCACATCCTGAATTATCAATCGCCATCACTCCTATATGTGTCTTTGGTGCATCAACAGAAAACATTCAATGAACTGTTTGAGAAAATACAAAAGTATTACACTACTAAGAAAATACAAGGCAAAAATGTGTGGAACGTCGGTGATAGATGTTGTACTCTATGTAATGAGTCGCACACATGGCGCCGGGCGGCCATTTTGGAAATCGAAAATGATAATGCCAAAGTGTTTTATTCTGATTTCGCGTGCGTTGAAACGGTTCCTATATCCGATTTGAGGGAATTATCCCAAGAATTCGGATCTGTAGGTGATGCTGCGATAATGTGCCATCTCTGTGGCGTCACACCAGCTGTCGGTGATGAATGGCCATCGCTTACGAAGGAATACTTAAAGGAATTACTTGACGCGTATAAAAGAGTTTTTATAACTAAAGTCGGTCAATTTAAGGGTAAAAGTATGCCGGTAGAACTGTGGGTGTACCACACGATACAAGGAGGCGCTCTCGAACCGAACAAATCTGAATGGAGATGTCTCAATAAGAAAATAATTGACCAAGGTTTAGGAATTCCTGATAAAAGTGATGAGCTAACTCCTGACTGTGCTACCAACGGCGACGATATGCTGTCTTTCTTAAACATTACCGGTTCGGTTCGTGATTGGCTGCAAATAGAACCGATGCCATTGAAACCACTCAAAATAAAAAGTTGTTCTGATGAAGCCAGCAATAATTCAACGCAAGGGGAAAATCAAAGTGAAAAGTTCGAAAATGTCTCAAATTCAAATACAGTTTTCATATCAGAATGGTTACCACCCGAACCGTTGCCGGCCAATGAGTTCAAAGCTATGCCAACATACATAGACAACGACGGCCTAATTTACCTTCATGACATGTCACAAGAGGATACTTTGGATCTTATCCGCAAAGCGTTGGATGTGCGCTTCAAGAATCCAGATCCAAAAGCTAAGTTCGTCAAGTGGTCGGTAGGCGAGCCTTGTGTGGCGCTTTACTTCTTAGACAATCGCTTTTACAGGGGGAAAATACTAGCGGTCGATAACGAAGAGTCGACATGTCTGGTCCACTACATAGACTACGGCAACGATGAAATCTGTGCGTTCGAAAATCTTAGAAAGAGTATAGCCTTGTACCAAATACCGACTCAGGCGCATAAGTGTGTTCTGAGCAAAATAGAACCCGTCGGCAAGAACTGGGACAGAACGACCTTGGATTATATACATAGATCAATAGTCGAGAAAATATGTTTCGTTAAAGTTAGTGGGGAAGCGATAGGCGATCTGGTTCCTATAGAACTAAAGTACGACAAGTTGTGGGTCAATGATCATCTAGTGGAGTTTGAGATGGCCAAATACACGGACGGCTCCGAGGCCATTGTAAGAAAATACGCCCCAGATATAAAAGATAAGAAAGATAAAAAAGATAAAAAACCAGAACAGCTAATAGAATCAGATTCCGGCCCAGATTATATCATAGGGGACGATAACGTTGACACTTCAACGACACACGATTCTATAAATTTAGGCTCATTGGATGGCAAGGACTGGAACGAAGTAATAGAAATCGAAGAGAACCAAAACAACTTCGTAACTTACACTCCTTATAGTGAAAGGGAGTTCAAATGCACGATAACGGTACTCAACGACGTCAACACACTCGAACTGAATATCGCCTTCGACGACCACGCCGCTAAGACATACGAGGACATGTTCGCAGAACTTCAGAATGATAGCTGTGATGCGATTGGGTTGAATGGTGTTTTTGAGAACAAGGCCTGCGTTGCCCTGTTTCCGGATGACGGTCAATGGTATAGAGCCTCCATTTTACAGTACAGCAGAACTTCAAATAGGGTGAAAGTCAAATATGTAGATTATGGCAACATTCAAGTACTATCTCTGAGTGATGTCAGAGAAATTGACAGGAAATTTGTTGAACTACCCCCGGCTAATTTAACCGTGACGTTGCACGGTGTGAGACCGAATCCGAGCATAGACAAAGTTTGTTTGGTGAAAGTTTACGAACAGACATTCCTAGACAAAGAACCGTTTGACGTTAAAATTATCGATATTATTGATTCGGTGCCCAGCGTCGAGCTGAGGCGGGATGGACATTTGGTTTACGAAAATCTCATACGAGAAAACATTTTTGTCAAATGTGATTGA

Protein sequence:

>DPOGS211012-PA
MEGKNKKCCPKCCQFYEINYLNSKPVGGNVPLFLSCGHTICENCVRTLIKCEPLINCPICENKTNISSSDIALIALNKFALYDFFPINNNMLGELILPDIKPNINTSSADDEYFLNITELIKTTESSKDVKPESAFCHIHKSKLLDYYCYDCVKAICMDCLMVGGDKSCKNHNITSIEQVNESMLSEVTELAPTVEDTLKRLTKTAFDIGKTLTRLNDDNNSDVFTKLLNEVEQNYSKLNAAIQKQKDYIVANIIKMKLIEIDSLNEAKYSVAESIQKAKYTLDTINSIDTKKLKEVNMSAVLKDAKEIVNTPWYLNREETESSLRIKINDNIIQMIDSFIQLEGSECHTYTLSTTRELVEKNVDIPKAPPTIVYPPEIVKDVRETVKCKTGKDKDNVKATSFIKKIPQYKSKMGSCSSLNSINSDTSHKSYQSYDQTPTISSYPDDQHSKQFFEGSQELIYISHIVDPHNFYIQRACHQSNIKEMLREFKNAVSMPRPSVSHVTEGKMYLVFSKVDNMWQRCEVLSVDKRNVNKPIYKVFCVDFGCTEFVTIDKLRLLQRARVQNPPHFAFNCRLANCEPINGSWTSEDSILIQNIIDNKQAVLHVHQLRSNTTGGVSLEGDVITVEHAVNVARALAFHGRARIPHATKYPKIKAMTEKPKLFMSNNDFKQGTVEDVYITHIMSPDHFYVRKQHLQSVYENLCEELDHEYSLSSQNDCIYLPEKDMVVVAHCTRWSRAVIRELPGRGRVRVMCVDTGVSELVHWTALRRLKTKFTVLRALATECHLAGVTPLNKKWSPASVSLLQEFQDKLLELCVEDNRNKNSLGVTLNDTSDESNVVCINTLMIKHKFAATECHLAGVTPLNKKWSPASVSLLQEFQDKLLELCVEDNRNKNSLGVTLNDTSDESNVVCINTLMIKHKFAVSFGLFMFNKNTDMDDLVITNKSPLDEPKPVMKSEKKITILKKDTNIENKTDEKNLEAKDKGPLRLEAHILNYQSPSLLYVSLVHQQKTFNELFEKIQKYYTTKKIQGKNVWNVGDRCCTLCNESHTWRRAAILEIENDNAKVFYSDFACVETVPISDLRELSQEFGSVGDAAIMCHLCGVTPAVGDEWPSLTKEYLKELLDAYKRVFITKVGQFKGKSMPVELWVYHTIQGGALEPNKSEWRCLNKKIIDQGLGIPDKSDELTPDCATNGDDMLSFLNITGSVRDWLQIEPMPLKPLKIKSCSDEASNNSTQGENQSEKFENVSNSNTVFISEWLPPEPLPANEFKAMPTYIDNDGLIYLHDMSQEDTLDLIRKALDVRFKNPDPKAKFVKWSVGEPCVALYFLDNRFYRGKILAVDNEESTCLVHYIDYGNDEICAFENLRKSIALYQIPTQAHKCVLSKIEPVGKNWDRTTLDYIHRSIVEKICFVKVSGEAIGDLVPIELKYDKLWVNDHLVEFEMAKYTDGSEAIVRKYAPDIKDKKDKKDKKPEQLIESDSGPDYIIGDDNVDTSTTHDSINLGSLDGKDWNEVIEIEENQNNFVTYTPYSEREFKCTITVLNDVNTLELNIAFDDHAAKTYEDMFAELQNDSCDAIGLNGVFENKACVALFPDDGQWYRASILQYSRTSNRVKVKYVDYGNIQVLSLSDVREIDRKFVELPPANLTVTLHGVRPNPSIDKVCLVKVYEQTFLDKEPFDVKIIDIIDSVPSVELRRDGHLVYENLIRENIFVKCD-