Monarch geneset OGS2.0

DPOGS214474
TranscriptDPOGS214474-TA2259 bp
ProteinDPOGS214474-PA752 aa
Genomic positionDPSCF300122 - 477888-489314
RNAseq coverage292x (Rank: top 38%)
Annotation
HeliconiusHMEL0037491e-6968.80% 
BombyxBGIBMGA013359-TA1e-3665.48% 
Drosophilasv-PA9e-2034.85% 
EBI UniRef50UniRef50_D6WHT97e-3943.06%Shaven n=3 Tax=Endopterygota RepID=D6WHT9_TRICA
NCBI RefSeqXP_968041.21e-3943.06%PREDICTED: similar to shaven CG11049-PA, partial [Tribolium castaneum]
NCBI nr blastpgi|2700048863e-3843.06%shaven [Tribolium castaneum]
NCBI nr blastxgi|1892353707e-4442.16%PREDICTED: similar to shaven CG11049-PA, partial [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL31006 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214474-TA
ATGGAAGAATCAGAGGAGATAAACAGTAACAGGAGGATCGTTCGCAACAAGGCAGCAGAGAAAGCAAAACACGCACACAATCAACAGCAGCAACAACAGCAACAGGAACAGGAAATCAGCTCAGCGCAGGGCAGCGTAGTATCAGTAATAACGCATTCTGGCAATGCACCGACAACTGCGGTGCTGTCTCCTACCACGGAGCATGTGAACAACACGGGGAGTTACAGCATCAACGGAATCCTGGGTATAGAGCAGGTGGACGCCTTACACAACGGAAACACCCTCAAGAGGAAACGCCACGAGGACCAAGATGAAAACAGAGATTTCAACAGTCACTCCGAAGACGACGTGAAGAGACAAAGGAGTCATTACAACGGTGACCAGATATATTCCAACCTCTGGTCATCCAAATGGAGTCTGAAGGATGAGTACAAGTTGCTGTCTGAGCTGGGAGGCTCTGCGGGCTATTACGGGGAACTGTTCGACGCACAATATGATCACACACACCACTACACCACGCACTCTGCAGTCCCCGCCAGTGATTCAGGCTCGACTGCTGCGGGGGAACCTCCATCGGGAGCGGGCAGCCCGGACGCCGCGGCGCTGGTTGTACTACAACCACCAGCTGCCTACCCCCCATACGCGCACTATGATGGTACAACTACCCTCGCCAGCGAGTACGCATACGCGGCACCCTACTCACAATACTCCTACGGACCTTATGGTCACTCCGCTACAGGACTTCTATCCAAGTGCAATACTTCCGTCAATAGACCTGGGAGACCACTGCCACAAAACATTGAAAACACGGCGCCGCCTAACAACGACAAGGAGAAGGTCTTCGAATTTTGCGAAATAATTCAAGACCCCAATGACCCGGATTCCGTGAGTATTGTGACGGAAAAATTCTCAGAAGAAGTTATAACTACGAAACAGAAAGGTTCATTCTTTGAAAAGCCCACAGCAAACAAATATAAATACTATAGAGAGGTGATACAACCGTATTCTAAGAAGTTTTGCACAGAGTGTCCTTCTTATGTCCCTCCCCGTAGGAAGGATAGTCCAGCCTGTCCGCCCAGTCCGCCCTGTAAACCTAGTTGCCCCATTATACCCACTCAATCCTGCGAGTCCCCACAACCTAGCCAGACCAGTGAACCACCTAAGCAGAGTGGTCCTGATACCGATTGGTGTACTAATAAATCTATAAACGATAAAGATAATAATTGTGACAGCGGTATGGATTGCAACAACAAATGCAAGATGCATTGCAAGACAACATTTATAAATCCCTTAGACGAAGCTGCAAAAAATCCATGTAAGTCCCCCTGTAAATCCCCAAGCAAAGAAAAATGTTCTCGAACGAAATTAGACTGCGATAAGCCAAAAACACCACCATGTGAAAAGTGCGAAAAAGTAGACGAAGCCCACCTCAAATGCTGTATCGCAGCGTGCATTAAAGATAGCGTAAAGGCAGTCAAAAAAGGTAGTTCAATGTGTGCTAATGAAATTGAAAAAGCTATTGGTGTAAAACACGCGGAAAATTGCGACGACAAACCAACCTATTACGTCCTGGAAAAACCAAGATGTCCGTCTTACACCGGTAGCCGTGGATCATCAACAAGTCTACCAAAAAAGAGCAGTATTCAAGAGTTTTTTGACACTCAAACTCAAACAGTCGCGACAATACAATGTGCACTCAAAGACTTCATGGGTACAGTCTGCAATAAAACGAGAGACGTTATTGAGGTTTTACGAACAGAAAGTTGTAACAAAATAAGGACGTCCGAAGAAAAAGAAAAAAATAGAAATACGAAAGACTGTTCGTATGTAAAATCCTTAACATTGATTTCTGAGAAAATAAAAGATCAAATTAACAAAATAGACACGAGCACCGGAAGTGAAAGCTTAGAAAGAATATTAGAACCGGCTAAGCAAGTTCTAGATAAGGTCGCCTCGGTTGTGTCGTCCACAGTATCTGTAATACAGGAGGCTAATTTTGAAAAGATGGTAGATGATGCCCTCGAAGCGAAATTCAACAAGAGCTCACCCTATGACAGACCGGGAGCATACAAGCCACCAGAACAAGTTGCTCAGGAACACGAACCCAGTAGGAACGCCAACATGTTCACGACTATAAAAACGAAGTTGTTTTCTATATTCGGACAAAGGGAAAGTAGCCATGAGATAGAGGACGATGACGATAATGAGGACGAAGATCTCACAGCTGAAAAATACTTCGACAAATATTGCGACCAGTAA

Protein sequence:

>DPOGS214474-PA
MEESEEINSNRRIVRNKAAEKAKHAHNQQQQQQQQEQEISSAQGSVVSVITHSGNAPTTAVLSPTTEHVNNTGSYSINGILGIEQVDALHNGNTLKRKRHEDQDENRDFNSHSEDDVKRQRSHYNGDQIYSNLWSSKWSLKDEYKLLSELGGSAGYYGELFDAQYDHTHHYTTHSAVPASDSGSTAAGEPPSGAGSPDAAALVVLQPPAAYPPYAHYDGTTTLASEYAYAAPYSQYSYGPYGHSATGLLSKCNTSVNRPGRPLPQNIENTAPPNNDKEKVFEFCEIIQDPNDPDSVSIVTEKFSEEVITTKQKGSFFEKPTANKYKYYREVIQPYSKKFCTECPSYVPPRRKDSPACPPSPPCKPSCPIIPTQSCESPQPSQTSEPPKQSGPDTDWCTNKSINDKDNNCDSGMDCNNKCKMHCKTTFINPLDEAAKNPCKSPCKSPSKEKCSRTKLDCDKPKTPPCEKCEKVDEAHLKCCIAACIKDSVKAVKKGSSMCANEIEKAIGVKHAENCDDKPTYYVLEKPRCPSYTGSRGSSTSLPKKSSIQEFFDTQTQTVATIQCALKDFMGTVCNKTRDVIEVLRTESCNKIRTSEEKEKNRNTKDCSYVKSLTLISEKIKDQINKIDTSTGSESLERILEPAKQVLDKVASVVSSTVSVIQEANFEKMVDDALEAKFNKSSPYDRPGAYKPPEQVAQEHEPSRNANMFTTIKTKLFSIFGQRESSHEIEDDDDNEDEDLTAEKYFDKYCDQ-