Monarch geneset OGS2.0

DPOGS215043
Transcript: DPOGS215043-TA, 4434 bp
Protein: DPOGS215043-PA, 1477 aa
Genomic position: DPSCF300208, 194619-207526
RNAseq coverage: 12x (Rank: top 83%)
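
The lengths in this record can be cross-checked with simple arithmetic. The sketch below is not part of the original record. It assumes the transcript consists only of the coding sequence including its stop codon, and that the genomic coordinates are 1-based and inclusive. Both are assumptions about the database's conventions, and the function names are illustrative.

```python
# Values copied from the record; the conventions below are assumptions.
transcript_bp = 4434
protein_aa = 1477
span_start, span_end = 194619, 207526


def expected_cds_length(n_residues: int, has_stop: bool = True) -> int:
    """Length in bp of a coding sequence encoding n_residues amino acids."""
    # Each residue is one codon (3 bp); the stop codon adds one more codon.
    return 3 * (n_residues + (1 if has_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


cds_bp = expected_cds_length(protein_aa)
genomic_bp = span_length(span_start, span_end)

print(cds_bp == transcript_bp)  # True when the transcript is CDS plus stop codon
print(genomic_bp)               # genomic span in bp under the stated assumption
```

Under these assumptions the 1477 residues plus a stop codon account for all 4434 bp of the transcript. The genomic span is considerably longer than the transcript, which would be consistent with introns in the gene model.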
Annotation
Heliconius: HMEL002006, 8e-163, 63.49%
Bombyx: BGIBMGA005675-TA, 3e-128, 61.22%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_E9J2A1, 2e-18, 19.45%, Putative uncharacterized protein (Fragment) n=1 Tax=Solenopsis invicta RepID=E9J2A1_SOLIN
NCBI RefSeq: XP_002430737.1, 2e-15, 23.88%, hypothetical protein Phum_PHUM497560 [Pediculus humanus corporis]
NCBI nr blastp: gi|383864570, 4e-37, 21.66%, PREDICTED: androglobin-like [Megachile rotundata]
NCBI nr blastx: gi|383864570, 2e-43, 21.04%, PREDICTED: androglobin-like [Megachile rotundata]
Group
KEGG pathway: (none listed)
Orthology group: MCL22102, Lepidoptera specific

Nucleotide sequence:

>DPOGS215043-TA
ATGTCTAAAAAATCGGATATGGGACGAAGTCTTGTGGTCTACGAGGTCGACCCCACGGAATGCCCATTTAGGGAATTTCGGGATAATGAACTCTCCACGGAGTTTTGGGGGATGGGTCCATCTGCTTTTCGAACCAGTCATTTTGTTAGCAAAATTTCGACGAAGACTCAAGCAGACTATGTTTGGGTGGATGACCAAACACAGCCCTTGCCCCGATCAGCGAGACAGTATCTTCATGGATGGGTTAGAGCTGACGAATTAGCTTTGAATCGATGGGACGCTGAAGTTGTTATTTTTGAGGATAACGGCAGTGGTAAGATGTCAATCATAGACATGCAGCTCAGCCATGCACAGGTTCTCCTGAGATCATCCTTCTGCAGACGAGTCCTCTCTATGTGCTTCATGCTTGAAAGGGTTGATAGTTTAATTGTGGAACATCAATGGGAGAACTTTGTATTCACAATGGCCCCGGAAGGCTGGAGGTCTCGGTTTCATATTTACTCCCCGGGATTGAAACCTGGTGGTGGACAACAACATCGACCAACACATTCCAAAAATGGTAATAGAGCCCTCGAAAATACTTTGTTGCCTGGTTGTTATCTCGTTCGCCTATTCTATCTCGGCGCCTGGCGCTGCGTTTGGGTCAGTGATCAGGTTCCAGTGGATGCGACTGATTCACCACTCCTGCCATTCTCTCCGCTACTCAGCCGTGCTCCAGCCAAACCAGGCGGTAAACAAGCCCCCGCCATGGTGACTTCCAACGTCGTACATCTATGGCCCTTACTTATATGCAAGGCGTTGTTGAAGCTGGCAGCACCCGACTTGAATTCGGACGAAGACGCGAATTGTGAAGACGAATTGATGCCAGAGTTTGATATATTACACGCCCTAACTGGTTGTTTGAATCTGGTTTATAATATAAAGAACCCCGAGACCATATGGGACATTATAACAAGTGAGGTTCCATTATTTACTTGGGACGATGATGACGATACCCTGGCTAGCACTGTAAAGTCTAGGAGTACGAAGAAACCTGCTACGAAAGAAACTACGGTTGTCCGGAGAGGATCTATGACGTCAATTCTTATTGAAGACTCAAAGAATTCTCCTCCTTACGCTCTCCCGGGAATTACTCCAGGCCACGAAATGAATCTCCTCGTGACTATGGCTAGGGATTTACCACTTAAAAAACCGCTACCTGAACCGGAGGTCCCACTGTGGAAAACATATCGGTGGGTGGATTGGGCGCGTCGACATGGTTTGTACGAGGCTTATGATTGTCCGCGCACCCAGTTTCTGAAAGTCAATGGGTTCTTAAAGCTATCACACGCTCCGCATTTGTTAGATGTCCAAAGTACTGAATCGATTACATTACAATTTAGAGAAGAACATGATAGGACCAATCCACCGCTAAAGAAAGGTATAAGAGATGTAAATCGTTCGCAAACAGCTAATTCAGCAACAGCTCAGCAACTAAAAGAAGAATTGAGGGAATGGATACCGTATCTGACATTATATGAGTTATTAAAGCAGTGCAGCGTTCTGTTTTACCCTTCTATGTACGAGTTCACATCGGTAGCTAGCAACCCTCCAGTTAGAATTACTAAAACGGCTCCTAATAAAGCGCTAGATATAGCTGCTTCTAAGTCATCACCGCTGTACCTGCAAATTGATGGACCTGATGAGAATATTTTAAGAATTTCTCTGAGCGCTCTTCATCCGCGAGTTCTTTTTAACTGTGGCGTTGCGATACTTGATCATATAGAACCAGCTCATTTAGTACTGGAAGTTTTTGAATGGTTTAATGACTGCGAGTTGCCCAGGGCAAAAGCGTATATTCATACCCGAGGATATGATTCTGTTGAAGTTAAATTACAACCAGGAAGGCACTTTTGCAGAGTTTGGGTTCATTCTCGTATGAATTGGCATGCTATGCTTCTGAGTGAATCATCCTTACTACTTGGAACTCGAGATGTGATCCAGTCTGCAGCTGTCAGAGA
TTGCCCGTGGGCAGCACGTTTTTTATGTAATCTCGGCAATGCTTTCCAAAATTGGATAAAAGCAACAAGATCAGCAGTGAACTTATCATTAAATGATAAAGAATTTTACGGATCGTACCAGCCCGATTTACAATGGGATGCAAATATAGTGGGTTATGACAAGGCTTTTCTTCATTGGATGTTCAGACAGGCTCTACAGTCCTTACTATCAAAAAAGCTTGTACGATCAGACTACAATAGCGTATGTTTGGTGTTGAGAAAACATTTTCTTGACCCAGATTTTGGGTTCCCACCTAAGCCAAGGCCTCCATTAAAGCCAGTACGATACGTTGCAGAAGTAGATCCATGTGATTGCGTTATGCCAGAAGTAGAAGAACAAGAAGTTGTAGAAGAAGAAACGGAAGAACAACAGTTACTGGAGGAAATTCCATTGGTTAATCAAGAAACAATGGATCGACTGCTCACTTTACCAAAACCACCTCTAACGTCTCAAGTTTGTGAGCTTGCCACTGAAGAATTGCCCTGTGGAGTTCTGAAAAACGAAAGGGAAAAAACCATCCAAAGACATGAGGCAGCAACAACATTGCAAGCTTATTGGAGAGGAACCTGGGTCCGGCAATGCTTGACACGCATGGTCTCCCTTACACCTGAAATTTTAAAACTTATAATGGATAATGCTTTTGGTAATATGGAGGCACTGTCGTCCCTGATGAATGAGTTCTTCAAAATGTACCCAGGAACGAAAAAGTCGTATTCAGTTGCTTCTGCACTTAGCGGAGTGTACGGGCTCCAGCAACACAGCGGATCTTCACCTATAAGTCCAAAATGCAAATGGGTTCCGTACTTTCAGAGCGTATTTACGTGTCACGCTCCAGTTAAAGTTCACTTGGATGTTCAAAGTTCACTCCAACACAGCACATTGGCTGTTTACAATAATGATACCGGTGAACAGATGCCCCAGGCCTACAACTCTCATATAACATTCATTTTTCAACCAAATGATCATGGTTATACGGTCATGGGTCATGGGACATTGAATCAACCATCTGGAGTTAATAGTGAGGTGCATTGGCAGTTAACAGTACTATCATCGATTGCTGACGTTTTTCACGTATGTGACAATGAGATTGACTCTTGCAAGGAGCTGCCACTCTCACCGGCTAGCAAGCTGCATATTGATGAAATTTTCATTCCTAATCGCAAGAATATATTGGGTGGCATACAAATATTGGTCACAAAACATGATGCTGTTTGTTTTAGAGCAGCTGCTACATCCCCAGAGCTCGAAATGGAAGCGATTTTACGTACCGTGAATCCGGATGGATCTGTGGAAGAATTGGGCAGGTGTTCAGGGACGGGAGAGCTGCAGTGGCCTTACATAAGACTAGAACCGACACTACTCATAGCCAACAACCAATTCAAAAAAGCTTCCACTTCGCAAGCAAACTTGGCATCGACGGCCAAAGAAAACATCACGAGTGCGCGTTCTTTGAGGAGTAAGCAAAAAGCACCGAGTGCTAAGAATAAATCTGCAACTAGGGTTAAAGATATCAAACTTAATTTGGAACCGAAGCAGTATTCAATTGAAGTCGTGGCTCCGAAAGGATGGCCTTTGACGTTGGCGCAGTGGAACAGAGTCGATCAGGTCCGGAATTCTCAGGAGTCTAACAAAGTGGAAGCCGCTCCTGTCAAGAAACCTGTCAAAGATAAGGGTGTATTGAAAGACAAGATCCAATCACCAACGCTATACCAGCCTCAGATAGGAGATGCATATGTGGAACTGGAATGCTCGTTGGCGATCGGAGGTGGTTCGGTGGCGAGGCTTGATGATGAGCGGGATATACAGTTCGCAACAGCAAAAAGAAATTGGGATTTACTTGAGCCTGGTAGGAATGCTAGAGGAGCGCAGATCAGGAAAGAGTTCAGGGCGGATTTTTTAGAGTCCGTACCACCTCCACAGTCTCTGAGTGAACAAAGTTTAGGAGAGGAAATACTGGGAG
AGGATTTATTCGGAGAGGAGAAAACTCTTGAAGTATCAGAAGAAAGCGAGGAAGAGACCAAGTATCTGACGATGCCGGAAATATTGAAGGACAAATTTTTACCGTTGTACTTCATACCATTGTGTACCAAAGAGTACAATGAAGATGAATGCGTCGTTGTCACACCAGAAATGGCTGAAGTAGCGAAGAATGATCGCCAAAATCGCATTGACGCAGCGTTGAAGCGCATGCGTGAGCTGCAGGCTTACAATGAGCAGTATGTGATATACAGGCAACGACAGAGGTGCCACTTACTAGAGAAATTGTTTGTCGATTCTCAATGGAATGAAGAATTAAATGCCGTTCTGGAAGAGAGAGACGACGCTATAGCGAGAGAAGCACTGATTCGATCTCTCTCTGCGACAAAAAAGAAGCAGGAGGCAAAGAAGAAATAA

Protein sequence:

>DPOGS215043-PA
MSKKSDMGRSLVVYEVDPTECPFREFRDNELSTEFWGMGPSAFRTSHFVSKISTKTQADYVWVDDQTQPLPRSARQYLHGWVRADELALNRWDAEVVIFEDNGSGKMSIIDMQLSHAQVLLRSSFCRRVLSMCFMLERVDSLIVEHQWENFVFTMAPEGWRSRFHIYSPGLKPGGGQQHRPTHSKNGNRALENTLLPGCYLVRLFYLGAWRCVWVSDQVPVDATDSPLLPFSPLLSRAPAKPGGKQAPAMVTSNVVHLWPLLICKALLKLAAPDLNSDEDANCEDELMPEFDILHALTGCLNLVYNIKNPETIWDIITSEVPLFTWDDDDDTLASTVKSRSTKKPATKETTVVRRGSMTSILIEDSKNSPPYALPGITPGHEMNLLVTMARDLPLKKPLPEPEVPLWKTYRWVDWARRHGLYEAYDCPRTQFLKVNGFLKLSHAPHLLDVQSTESITLQFREEHDRTNPPLKKGIRDVNRSQTANSATAQQLKEELREWIPYLTLYELLKQCSVLFYPSMYEFTSVASNPPVRITKTAPNKALDIAASKSSPLYLQIDGPDENILRISLSALHPRVLFNCGVAILDHIEPAHLVLEVFEWFNDCELPRAKAYIHTRGYDSVEVKLQPGRHFCRVWVHSRMNWHAMLLSESSLLLGTRDVIQSAAVRDCPWAARFLCNLGNAFQNWIKATRSAVNLSLNDKEFYGSYQPDLQWDANIVGYDKAFLHWMFRQALQSLLSKKLVRSDYNSVCLVLRKHFLDPDFGFPPKPRPPLKPVRYVAEVDPCDCVMPEVEEQEVVEEETEEQQLLEEIPLVNQETMDRLLTLPKPPLTSQVCELATEELPCGVLKNEREKTIQRHEAATTLQAYWRGTWVRQCLTRMVSLTPEILKLIMDNAFGNMEALSSLMNEFFKMYPGTKKSYSVASALSGVYGLQQHSGSSPISPKCKWVPYFQSVFTCHAPVKVHLDVQSSLQHSTLAVYNNDTGEQMPQAYNSHITFIFQPNDHGYTVMGHGTLNQPSGVNSEVHWQLTVLSSIADVFHVCDNEIDSCKELPLSPASKLHIDEIFIPNRKNILGGIQILVTKHDAVCFRAAATSPELEMEAILRTVNPDGSVEELGRCSGTGELQWPYIRLEPTLLIANNQFKKASTSQANLASTAKENITSARSLRSKQKAPSAKNKSATRVKDIKLNLEPKQYSIEVVAPKGWPLTLAQWNRVDQVRNSQESNKVEAAPVKKPVKDKGVLKDKIQSPTLYQPQIGDAYVELECSLAIGGGSVARLDDERDIQFATAKRNWDLLEPGRNARGAQIRKEFRADFLESVPPPQSLSEQSLGEEILGEDLFGEEKTLEVSEESEEETKYLTMPEILKDKFLPLYFIPLCTKEYNEDECVVVTPEMAEVAKNDRQNRIDAALKRMRELQAYNEQYVIYRQRQRCHLLEKLFVDSQWNEELNAVLEERDDAIAREALIRSLSATKKKQEAKKK-