Monarch geneset OGS2.0

DPOGS214870
TranscriptDPOGS214870-TA1305 bp
ProteinDPOGS214870-PA434 aa
Genomic positionDPSCF300091 + 359675-360979
RNAseq coverage204x (Rank: top 47%)
Annotation
HeliconiusHMEL0076140.092.86% 
BombyxBGIBMGA010079-TA0.090.78% 
DrosophilaTeh3-PA3e-17969.57% 
EBI UniRef50UniRef50_Q9VZG84e-17769.57%RT03134p n=27 Tax=Neoptera RepID=Q9VZG8_DROME
NCBI RefSeqXP_312093.10.068.99%AGAP002819-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3123821020.069.23%hypothetical protein AND_05497 [Anopheles darlingi]
NCBI nr blastxgi|3123821020.069.23%hypothetical protein AND_05497 [Anopheles darlingi]
Group
KEGG pathway 
Orthology groupMCL15936 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214870-TA
ATGCCGAAACCCACCCCCGTGGAGGACCTCATAATACCCCCTCAAGACCAAAAAATATGCGGGACCATATGCGTCTGCCAAATGACGGCCATACTCAGCTGCGTCGCGATTGTATACCTGACCGTTGCAATCTACATGCCGTACACTCGGGCGATCGCATCGGGGATCGACCCGACTCCGATCATGTGCACGACGACGAGGGCCGTTAACAAAGAGAATTGCGATTGGGGCTCGTGTGGGGAGTGGTGCTTGAGCAAAACCTCGGGCGCCTGCATTCAGATCTACGTCAACCTCAGGAAGAACGGCTCTTCGTTATTACTGTCCGAATGTGGCAGTGCTGCAAACAAAACCTGTTACGGCATAGACCAAGAGAACGCCAAAAAGTATCATTGTATACGAGACGAATGCAGAAATCTGACTGGAACGTTCAATTGCACGGAAGGAAAATGTATAAACATAACAGACGCTTTCGAATGCGCTTTTAGAGATACAGATCCTCCCTTGAAATGTTCAGGTAGAAGAGGAAAAATAACTTGTATAGACGTCCACGGATTATTTTCATGTAGTAGAGGCACTTGTAGAAAAATAAGAACACCATACAACTGTGACAGGCGCTGTGTTGACATCCCTACTCGTAACAAGAACGTCATAGTATTGAGTGGTGACAGAGTCTTCCTAGCAAAATGTGCTAAATTGGCGCAAGAGGAAGGTGGTAATGTAGTTTGGACTGACTCAGGAGAAGAAGTTTTAATGTTGTCTTGTCATGCAGTTCATAACGGCTCATCGGGGGTGGTAGCTGTAGATTGTATTAACGCGGCTTTGTTGCCCCGTACTGAAATATCCGATCTTACAAATTTTACATATTTACAATATTTATATACCTCAAAGGCAACACCTAATAGACTTATCGCTCCGTCTGAAGTGGAGTTAACATTAGCGAATGACAGCAGATTGATGATAAATCTAGAAGGATGTGTCAATACACTTGCTGATGAATGTAAGGAATTCTTAAAAGATTATGGACGTGATGGTACCGACCACAACGCCAAAGCCAGATTCCCCTGTTTTTATACAGAGAGCAATCCAGACACCGTGGTAGCGAGATTTGATTTGGACGCCACCTATCGTCAATTCATTGTAGCTTTAATACTGCCCACAGTTCTAATTGTAGTTTCGTGTATAACTTTAATGTTGTGTCAAAAGACGGTTGAAGTGGGAGACGATGCAAAGATGCGAATTAAAGGTTGCGGAAGTGGACAAGCTGATATGCAACTATCTCCGAACGATCCTGTATCTCCCCTCTGA

Protein sequence:

>DPOGS214870-PA
MPKPTPVEDLIIPPQDQKICGTICVCQMTAILSCVAIVYLTVAIYMPYTRAIASGIDPTPIMCTTTRAVNKENCDWGSCGEWCLSKTSGACIQIYVNLRKNGSSLLLSECGSAANKTCYGIDQENAKKYHCIRDECRNLTGTFNCTEGKCINITDAFECAFRDTDPPLKCSGRRGKITCIDVHGLFSCSRGTCRKIRTPYNCDRRCVDIPTRNKNVIVLSGDRVFLAKCAKLAQEEGGNVVWTDSGEEVLMLSCHAVHNGSSGVVAVDCINAALLPRTEISDLTNFTYLQYLYTSKATPNRLIAPSEVELTLANDSRLMINLEGCVNTLADECKEFLKDYGRDGTDHNAKARFPCFYTESNPDTVVARFDLDATYRQFIVALILPTVLIVVSCITLMLCQKTVEVGDDAKMRIKGCGSGQADMQLSPNDPVSPL-