Monarch geneset OGS2.0

DPOGS210956
Transcript: DPOGS210956-TA, 3624 bp
Protein: DPOGS210956-PA, 1207 aa
Genomic position: DPSCF300004 - 1086032-1098624
RNAseq coverage: 9x (Rank: top 85%)
Annotation
Heliconius: HMEL010128; 0.0; 82.91%
Bombyx: BGIBMGA006383-TA; 0.0; 72.51%
Drosophila: CG13855-PA; 2e-87; 32.87%
EBI UniRef50: UniRef50_D6W9N3; 7e-125; 38.64%; Putative uncharacterized protein (Fragment) n=1 Tax=Tribolium castaneum RepID=D6W9N3_TRICA
NCBI RefSeq: XP_002073739.1; 1e-90; 32.68%; GK14267 [Drosophila willistoni]
NCBI nr blastp: gi|270003003; 2e-124; 38.64%; hypothetical protein TcasGA2_TC030746, partial [Tribolium castaneum]
NCBI nr blastx: gi|270003003; 2e-130; 38.95%; hypothetical protein TcasGA2_TC030746, partial [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL14812 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210956-TA
ATGGATGTGGAAAGGACAATAGGTATTTCTCCAAACTCAGAGTGTTTAAAGGGATCTGTTTCGTGCCAATGTGAATTAGGTCGAGAGAAATTGCTAACCGCTCCGCCTTACAAGCGTAACTTATCTCCAATTGAAGATGATATAAAATCACGTCTACAAACAGCACCGTGTGCAAGAGAACTCGAACAGTATAAATGCCCAGTCAGCCATATTCCAGTGAATTTCAAATCGCCTCTGCACCCTAACGAAGTGTTTAACAAAGTCTTCCCGTCAAACACCCCAATTAAACTAGTTAAACGCAAATTAGCCAAACTATTATCTATTTCAAATAGCAATCTTTTATTAACCAAAAATAGAAATGTATTAAAGGAAACAAGCCTCCTTTCTGAACTAAAAACTGATGCTTTAGGAAATCTTTTTATAGATGTTTTTACTAAAGATTCTGAAGAGTTTTCTTTATCATCTATACCCAAAGAGTCTTATGTTCACGAACTTTTACAAGCAATGATGCCAAAAAAGAAGACTATGCCTTTTATTGCTATTAAATTTAGGGTACACAATCAAAACACCGCATTCACGCGGTCGTACCACTCGATCATGAAAGTTCACGAAGTGAAGAAAAATTTGGCTGGTATATTTCAAACAGCTCCCGATAATATTGTCATCCTAAGAGAAAATCGCCCTTTGAAAGATCGCATGGCACTTTTGGACCTGGATTATGATAAATATGGCATCGTTGAAGTGGATCTATTCACTAAAAACAATGACACTCTTAACCTAGACAAACTTTACAAAGAATTGCCAATGACTGACGTGTTGACAGTAACAGTTCCGTTCGGGAGCACCATTAAACATATAAACGTTGAAGTATTTTCGGAACCGATGCGTAAACCCTTTTTGGGTGGCTATCGAAATGTACACACAGGTATTGTTTATCATCACGCTTACACGCAGACACCTCAAAAAGCTGAAAAATTGCCTCCCGAAAAGAAAAACTGTAGAGACACGCAAACGGCTGAAATGCGAGAGAAAATATATGACACCAATTATAGCCGAGCTACTCAAATGAATACAGTACACGCTTATGTACCAAACGTCACAGATAGAATTATAGTACCGAAACCTTATGAAACTTACGAGGAAATGATCGCAAGACTCAATCACGACCACTACGCTGCAATTATACAACGCGCGTTTAAACACCATCAGTTTAGACAAAAAGTTAAGAGATGGCTTCAGGAATGCATGGAAAGAATTGCTAGAATGGAGGAAGAGCGAAGGCTGGAACGCGAAGCAATGGAAAGAAGACTTAAAAGAGATTTGGTTACTAAGACGTTTCCAAAAACCAGAGAAGACTTTGACCAACTTTACGCAATGGTCGATCGCTGGAAACATGCAGAAATAGCCAGAATATCACAGTTGCATTCCAAGGGTCCAAAAATTGCAGAATTTACACTTTTACTAGACAAAGAGGTTGAACTTCTCCGTTGCATTGAAGCATATCGAGTTAAAGTAAAGGAGGACAGTCGTAAAATTAAAGAAAAACAATTCCTAGAAGAAATATCGAAACCTGTAGCGTGGTATGGACGTGACGGAAAGCTCATTACCATGGACACTGTAGAGATACAAAAGGCAAGGAAATTGAAGGAATTGTATAATTCATTTATAAGAGACGACGTTGAAGTTAAAGAACGCATAGAACTTCTCGTAAACATGAAATTCGCTCTTCAAGAATTCCGTCATCCTTTAGCTGAGGAGACCATTACGCTTTTGGATCGCGAGTGCGATCTCCTTGTAAGAAGATGTGACGACCAACAATTAGAATTTTTAAGGCGACGAGTAGCAGCCTGTGTGTTGCAACTGATAAAAACATCCGAGTTAAACTCCGGCGTGACAAAACGTAAAGAGGTCAGAGATTATAGAAAGATCGAGAACAGTAGATTGCAATTCTGCGAAATGTGTCACCAGGTTAAAATTAATACAGACTTTCCTTTAAACGC
TAAGATGTCAGGTTTCACAGTTTGCACGTCGTGTTCGTGGAAGGATGTATCGGAACGTTGTTGGATTGACATGACCCCGTACAAATTCATTTTGCGAGCCGTCCAACGTGACGAGCGGAAAAGAAAATGTTGGGGTTCTTTGGCTTTTGTTCTTCAGGAGAAGGATATATTTTTCATAGTCGAAAAGCTTTGGCATTCGCATTCAGCAATAAGTGAGTGTACAGAAATGACCGAGTTACGGCTCTGTCGTTGGCGTGTCAATGAAGATTGGTCGCCTTGGAATTGCTTTCTGGTGACAGTACAGGAAATGAAGGCGCACTGTAAATTAGAAGACCCCGAGGCAGTTTATGACGAAGAGTTAGTTCAAAAAGTCCTCAATAAACACAAACTAGCGAAGGCAAACTTTGAACAACTTTTAGCTGTAAATAAAAGGTTTACAGAAAGCGGTGATTGGGCTGGAATTCGTGCACCCGCCATAGTACGAGCCAACGCTGTCGATCGCTGGAAACATGCAGAAATAGCCAGAATATCACAGTTGCATTCCAAGGGTCCAAAAATTGCAGAATTTACACTTTTACTAGACAAAGAGGTTGAACTTCTCCGTTGCATTGAAGCATATCGAGTTAAAGTAAAGGAGGACAGTCGTAAAATTAAAGAAAAACAATTCCTAGAAGAAATATCGAAACCTGTAGCGTGGTATGGACGTGACGGAAAGCTCATTACCATGGACACTGTAGAGATACAAAAGGCAAGGAAATTGAAGGAATTGTATAATTCATTTATAAGAGACGACGTTGAAGTTAAAGAACGCATAGAACTTCTCGTAAACATGAAATTTGCTCTTCAAGAATTCCGTCATCCTTTAGCAGAGGAGACCATTACGCTTTTGGATCGCGAGTGCGATCTCCTTGTAAGAAGATGTGACGACCAACAATTAGAATTTTTAAGGCGACGAGTAGCAGCCTGTGTGTTGCAACTGATAAAAACATCCGAGTTAAACTCCGGCGTGACAAAACGTAAAGAGGTCAGAGATTATAGAAAGATCGAGAACAGTAGATTGCAATTCTGCGAAATGTGTCACCAGGTTAAAATTAATACAGACTTTCCTTTAAACGCTAAGATGTCAGGTTTCACAGTTTGCACGTCGTGTTCGTGGAAGGATGTATCGGAACGTTGTTGGATTGACATGACCCCGTACAAATTCATTTTGCGAGCCGTCCAACGTGACGAGCGGAAAAGAAAATGTTGGGGTTCTTTGGCTTTTGTTCTTCAGGAGAAGGATATATTTTTCATAGTCGAAAAGCTTTGGCATTCGCATTCAGCAATAAGTGAGTGTACAGAAATGACCGAGTTACGGCTCTGTCGTTGGCGTGTCAATGAAGATTGGTCGCCTTGGAATTGCTTTCTGGTGACAGTACAGGAAATGAAGGCGCACTGTAAATTAGAAGACCCCGAGGCAGTTTATGACGAAGAGTTAGTTCAAAAAGTCCTCAATAAACACAAACTAGCGAAGGCAAACTTTGAGCAACTTTTAGCTGTAAATAAAAGGTTTACAGAAAGCGGTGATTGGGCTGGAATTCGTGCACCCGCCATAGTACGAGCCAACGCTGTAGACCGAATATGA

Protein sequence:

>DPOGS210956-PA
MDVERTIGISPNSECLKGSVSCQCELGREKLLTAPPYKRNLSPIEDDIKSRLQTAPCARELEQYKCPVSHIPVNFKSPLHPNEVFNKVFPSNTPIKLVKRKLAKLLSISNSNLLLTKNRNVLKETSLLSELKTDALGNLFIDVFTKDSEEFSLSSIPKESYVHELLQAMMPKKKTMPFIAIKFRVHNQNTAFTRSYHSIMKVHEVKKNLAGIFQTAPDNIVILRENRPLKDRMALLDLDYDKYGIVEVDLFTKNNDTLNLDKLYKELPMTDVLTVTVPFGSTIKHINVEVFSEPMRKPFLGGYRNVHTGIVYHHAYTQTPQKAEKLPPEKKNCRDTQTAEMREKIYDTNYSRATQMNTVHAYVPNVTDRIIVPKPYETYEEMIARLNHDHYAAIIQRAFKHHQFRQKVKRWLQECMERIARMEEERRLEREAMERRLKRDLVTKTFPKTREDFDQLYAMVDRWKHAEIARISQLHSKGPKIAEFTLLLDKEVELLRCIEAYRVKVKEDSRKIKEKQFLEEISKPVAWYGRDGKLITMDTVEIQKARKLKELYNSFIRDDVEVKERIELLVNMKFALQEFRHPLAEETITLLDRECDLLVRRCDDQQLEFLRRRVAACVLQLIKTSELNSGVTKRKEVRDYRKIENSRLQFCEMCHQVKINTDFPLNAKMSGFTVCTSCSWKDVSERCWIDMTPYKFILRAVQRDERKRKCWGSLAFVLQEKDIFFIVEKLWHSHSAISECTEMTELRLCRWRVNEDWSPWNCFLVTVQEMKAHCKLEDPEAVYDEELVQKVLNKHKLAKANFEQLLAVNKRFTESGDWAGIRAPAIVRANAVDRWKHAEIARISQLHSKGPKIAEFTLLLDKEVELLRCIEAYRVKVKEDSRKIKEKQFLEEISKPVAWYGRDGKLITMDTVEIQKARKLKELYNSFIRDDVEVKERIELLVNMKFALQEFRHPLAEETITLLDRECDLLVRRCDDQQLEFLRRRVAACVLQLIKTSELNSGVTKRKEVRDYRKIENSRLQFCEMCHQVKINTDFPLNAKMSGFTVCTSCSWKDVSERCWIDMTPYKFILRAVQRDERKRKCWGSLAFVLQEKDIFFIVEKLWHSHSAISECTEMTELRLCRWRVNEDWSPWNCFLVTVQEMKAHCKLEDPEAVYDEELVQKVLNKHKLAKANFEQLLAVNKRFTESGDWAGIRAPAIVRANAVDRI-