Monarch geneset OGS2.0

DPOGS210055
TranscriptDPOGS210055-TA2715 bp
ProteinDPOGS210055-PA904 aa
Genomic positionDPSCF300017 - 971178-991441
RNAseq coverage226x (Rank: top 44%)
Annotation
HeliconiusHMEL0074110.090.67% 
BombyxBGIBMGA012693-TA0.084.65% 
Drosophilako-PA6e-17043.32% 
EBI UniRef50UniRef50_Q9VP799e-16843.32%Knockout n=9 Tax=Sophophora RepID=Q9VP79_DROME
NCBI RefSeqXP_001973575.11e-16843.25%GG13263 [Drosophila erecta]
NCBI nr blastpgi|1948753113e-16743.25%GG13263 [Drosophila erecta]
NCBI nr blastxgi|510920536e-17643.19%RE48574p [Drosophila melanogaster]
Group
KEGG pathway 
InterPro domain[14-93] IPR0193911.6e-32Storkhead-box protein, winged-helix domain
Orthology groupMCL12294 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210055-TA
ATGCAATTGGCTAGAGTAGGGGACATCGAAGACTGTATCGTGACGCCGACCTACCAGGGCCAGTTCACGCCGCTGCCGGAGGCGCTCTGCGACTGCATCATGGACCTCACGACACAGGGTCAATCAGCGACATTAGAAAGTATAAGGACCTCCTTATCCTCCAAGTTCCCGTCGATGCAGACGCCGTCCCAGGAGGTGGTGTACGACACGCTGGCTCAGCTCATGCAGGAGAGGAAGATATACCAGACCTCGAGGGGCTTCTTCATTGTTACACCAGAACGTCGTCGTTCCCGTTCTCGATCGTCTTCTCGTCACCACTCCACCGAGGATGAGTGCAGCAGTCCTCGTACTATCCTCATGTCGGACCAGGAGGCGTTGCACCAGCTCTACGGAGAGATCACCACTGTCAGGGATGGCGCCGTCACACACCAATGTGTTCAGACGAACCTCGCGGATGTTATATGTGGAGGGAACCCAAACGACAAAGTTCTATACGGTAGACCCAACAAGCGCCGCAGCGCTTCGTTCCCTGCGCCCCGCTCCCTCGACCGGCGCCACTCCCTTCGGATCTTCGGCTCATCCAGCAAGTACTTGCAGCGATGCGCTTCCACACGTAGTTTGCACCACAAGCACAATACGACAGATAGCTCGTCATCCACCGACTACCCACCCAGTGTAGAATCTGCGTCGCCTAAGAAAGTGTCGTTGCTCTCCCGCTTATTCAGACGAAGCGGCCGCAGCAAGCAGACCCGCGCGATGAGCACCTTCTCTGCGCAGTTCCCGCCCACTGAGTGGTTCAACTCCAAGGCGGTGCATCTACACTGTGTCGCGACACAAACTGACTCTAAGGAGTCCTTGCAGAGCCAAACATCAATCGTTTCATCGTATTATGACGGTTCAGAAATCAGCAACCGATCGTCGACATTACCAAGACGACACAAGAGACATCTTTCAACCGAATCAGGTCTAGCGACGTCCATGCAATACGACCACTCGCCCGTGAGGCAGTCGAGCCCTAGTTCAGGCAGTCTCCCGAGGTCAACACTGAGTAGAAGTTCCACGAACAAAACTATATTGGACCATTTTGGATCACCTCAACGAAACAGCGAACCAAAATTAAGAGACAGTCCAAATGGAAGTTTACGGCGTGTGGATTACTCAGATAACTTAATGTCAACTAGTGGGCCTTCAAGCTTAGAGAGCAACACCCATAGAACACCGCGTAAAGATTATAGAAATGGACAATCGGATCTAAATTCGCCTATGAAAAGAGGTTATCACACGAGCGGTAGCAGTGGTCATTCAAGCTTAGAATCACATATATCAGATCGCACTTTGAAACCTTCACCTAGTCATACTAACGGGAACGCTGGCCTTAGCAGAGCACAATCACTAGTTAAAGTCCAAAGCCTTCAAAGTTCACCTAAAAGTCTGCACAAGTGTAGAACTAAACCACCAATTGTTGGAGGTGAAACGGTTAAAAACGGTAAGAGCACATCCCCGAGAACTTCACCAAAAAATTGTCCAAATACTCCCAAAAAAACCGCTACTCCTTTGCATAACAAAACTCTTGGATCCAAAGTCGAGAACCCCGCTACCGCAATGCTTGCTAGTTCTAACTCAAATAACTCCATCACACTGAAAGTAACGACAAACACATTGTCACAAAATGGCGGAACTAACACCAAAGTCTATGTTCAAAATTCACCTGTTCGTTCTGTAATTACTTTTGAAAACGGTAAAATTACTGAGACTAGTAACGGCAGCAACATATATATAATTAATGATGACAGACAAGTCGTGTCCGTCTCCCAAAATGGAGTGTCTAACCAAACGACAAAAAATCCGACAGAAACATCTTTTGATGTATGCGTTGAAAAGAAGAAACAGTTGTACCAAGAACCAGATAAAGAAAAAGTACAAGAAAATAAATTCAACAAAAATTCAATCAACGACACGCGAATGAATAACAACAGAAAATTATCTTTACAGATAGGAGGTGTTGCGAACAATAACAGTTTCATTTACAAAAATCAGTTAAACAATACAAATGACGTAGCCTCAAATCCTTCTTCGCCAGCAATCCATAACAACATTCATAATATAGAAGCGAGCGCTACCAGCAATCCTTCCACGCCAACTAAGACCTATGACAATGTCATTGGATCATTAGGCAATCTAAGATATCAGAAGAATTCTTTAACAACAACGCACGGAGGAATACATGCGAATTTAAACACAAATAGCGACCTTAAAAGCGTTGACTTGCTAAAAAAAGCAGCAGCTTTGCAAATGTTGAATGGGCTTCCAAATATTCATAATAGAATGAGCTCGGAATCTATAGCTAATCTATTATCGAAAGGAATCGATCAGAAACCGACAGTTCTGGGTTCTGAACCCAATCTGGCACTAAAAAACCAAGAAGTAAATAAAGATATAAACATTTCATCTGAAAAGATTCACAGCTTAGAAAACAAACAGAAGCGATACAGCCTAAGTCAAGATGGCAAGAAGGAACACCAAGATTTTTATAACTTTCCAAGCCTAAGCGACTTGAGCTTTAATTTTACTAGTTTAGCTGCACAAAAGATTTTGAAGGGGGTCAGTATAAATAGCGTAGATACGTTGGTAGAACTGAACATGGCCGCCAACAATACAGAGAAGCAAAACAATCGCGATGTAGCTGCAGTATGCACGGACTTCGGCCTTGTATAG

Protein sequence:

>DPOGS210055-PA
MQLARVGDIEDCIVTPTYQGQFTPLPEALCDCIMDLTTQGQSATLESIRTSLSSKFPSMQTPSQEVVYDTLAQLMQERKIYQTSRGFFIVTPERRRSRSRSSSRHHSTEDECSSPRTILMSDQEALHQLYGEITTVRDGAVTHQCVQTNLADVICGGNPNDKVLYGRPNKRRSASFPAPRSLDRRHSLRIFGSSSKYLQRCASTRSLHHKHNTTDSSSSTDYPPSVESASPKKVSLLSRLFRRSGRSKQTRAMSTFSAQFPPTEWFNSKAVHLHCVATQTDSKESLQSQTSIVSSYYDGSEISNRSSTLPRRHKRHLSTESGLATSMQYDHSPVRQSSPSSGSLPRSTLSRSSTNKTILDHFGSPQRNSEPKLRDSPNGSLRRVDYSDNLMSTSGPSSLESNTHRTPRKDYRNGQSDLNSPMKRGYHTSGSSGHSSLESHISDRTLKPSPSHTNGNAGLSRAQSLVKVQSLQSSPKSLHKCRTKPPIVGGETVKNGKSTSPRTSPKNCPNTPKKTATPLHNKTLGSKVENPATAMLASSNSNNSITLKVTTNTLSQNGGTNTKVYVQNSPVRSVITFENGKITETSNGSNIYIINDDRQVVSVSQNGVSNQTTKNPTETSFDVCVEKKKQLYQEPDKEKVQENKFNKNSINDTRMNNNRKLSLQIGGVANNNSFIYKNQLNNTNDVASNPSSPAIHNNIHNIEASATSNPSTPTKTYDNVIGSLGNLRYQKNSLTTTHGGIHANLNTNSDLKSVDLLKKAAALQMLNGLPNIHNRMSSESIANLLSKGIDQKPTVLGSEPNLALKNQEVNKDINISSEKIHSLENKQKRYSLSQDGKKEHQDFYNFPSLSDLSFNFTSLAAQKILKGVSINSVDTLVELNMAANNTEKQNNRDVAAVCTDFGLV-