Monarch geneset OGS2.0

DPOGS202780
TranscriptDPOGS202780-TA1782 bp
ProteinDPOGS202780-PA593 aa
Genomic positionDPSCF300018 - 969175-971193
RNAseq coverage109x (Rank: top 59%)
Annotation
HeliconiusHMEL0092982e-17258.06% 
BombyxBGIBMGA010501-TA2e-9360.94% 
DrosophilaCpr97Ea-PA2e-1244.87% 
EBI UniRef50UniRef50_C0H6I61e-8960.94%Putative cuticle protein (Fragment) n=1 Tax=Bombyx mori RepID=C0H6I6_BOMMO
NCBI RefSeqXP_001662245.13e-3137.11%hypothetical protein AaeL_AAEL012088 [Aedes aegypti]
NCBI nr blastpgi|2236710914e-8960.94%TPA: putative cuticle protein [Bombyx mori]
NCBI nr blastxgi|2236710911e-9748.87%TPA: putative cuticle protein [Bombyx mori]
Group
Gene OntologyGO:00423021.4e-05structural constituent of cuticle
KEGG pathway 
Orthology groupMCL24725 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202780-TA
ATGAACCTCTTCATGCTGCTCGCTTGTATCATAGCGACGACCACTGCTACCAATTCAACAGAGACTACAACCAATCCTCCAAATAATACCACCCAGGAAATTCTCAAGGCCAGTGAAACTGCGGACCCGACCAAAGCTGAAATTGTAAAACAAATTCGTCGTCTGAACGAAGATGGATCGTATACGATTGGATACGAAGCCAACGATGGAACGTTTAAAATTGAAAGTAGAGATGTTTTGGGTAACGTAAAAGGAACCTTTGGTTATGTGTCAGATGATGGAGAAATAAAACGTGTCACTTACAGTTCCTCTGCTGATAGCACACCGGCATCGGTAACTACTAGTACCACGCCAACTACACCCACCATGGTCGTGAGAGTTAACAAAACTATATCGTCGACGACAAGAAGACCGTTAGCCACTGTAGTGTATCCGACCAGAGGAAGCACCACCACCAGAGGTACAGTTATACAAGCCATTCCGAGACGACGAACCGGGTCAAGCTCCGTGCGCCCTCAAACGACTGATACAACTACGGAAACTCAAAAACAAATCACTGCAAGTAGCTCGAACGTTCATCGAAGAGAAGATCTTCTGAAATCACGGTCTCAATCAGCAAAAACTGCACCTGTAACCTCCAAAGACGATTTATTAACCAAACAGACTACAAGTACGGCGTCTCAGACTCTAAAACCTGTTTACGAGCATACGACAGAAAGAGAGACGGATATTAATAAATCGACAGCTACGAGACGTGAATTATCAGGAGCGAGCGCTAATCATCATATGTTGAATCTTCAACAATCAATGGGAGATGATTCAACTGATGTTTATGGGATTGTGCACATATCAGCACAGAGAGGTTCCGATAAAATATTTTATCAACCCCAATACCGTCGACCTGCGGCTGTTCTTTTCAGGACTCAGGAATATTTGAGAGACAATCCCGGTGCCCCTATTCCCATTGGCAACCAGCGGCCCTTTCTTAACTATGAATATCCAGATAAAATATTGGACTCACAGTATGTAAAAGAATCACAGCAAGTCAATAACAATCAAGAAGCGGAGTCTGGTCCTTATGAATACAGACATAATGATTACAGACCAGCCCCAAGAATTATCCACGTGCCAGTCGACGATAGAGGCGTTCCAATTCAGGGGTACGAAGCGAGATATGTCAACCCATATCGACCACAGCCTTTAATTCAAAGATACGATCCTGTAAACGAAATGCATTCCATTTCAGCACCGGTGAGCACGAGAGATTTCAAACGCTTGCTACATATTTTGATACTAAGACAGAACCGTCTGCAAGCTCTCATGGAGCAGATAATGCCAGAGGCCTACCAGGCGGCTCATTACCGCTCTGAACCGTATCACGCACAATCGCGGCCCTATTCTCGCCACCGAGACGACGACCAATACGACTACAGATATCAACCGCAGTACAGACAAGATTTTTATTCAACACAAGTATCAAACTACGACGATCGCGACTACGAGTCCCATCGCTATTCGCCTCGAAGAAGATTATACTCGCGACCCTATGACGCTCAGGGTTCAGCCTCGGAACATATAGAACAGACACCGGAGTATCTCCCGGTCGAAGTGAGAGAAGCTCTCTTGTTGAAAATGCTGTTGTTGGCTATCAGCCCGGACTTCATGCCGACACCAGCGCCCGCCACCGAGCTGACCACCGCAGCACCAACAAGGAAACAGGTGAGAAACGTTCAAATACTTGGAGAAGAAGGTTCCGACAAAAAAACAAGGCAGGGGCACTAG

Protein sequence:

>DPOGS202780-PA
MNLFMLLACIIATTTATNSTETTTNPPNNTTQEILKASETADPTKAEIVKQIRRLNEDGSYTIGYEANDGTFKIESRDVLGNVKGTFGYVSDDGEIKRVTYSSSADSTPASVTTSTTPTTPTMVVRVNKTISSTTRRPLATVVYPTRGSTTTRGTVIQAIPRRRTGSSSVRPQTTDTTTETQKQITASSSNVHRREDLLKSRSQSAKTAPVTSKDDLLTKQTTSTASQTLKPVYEHTTERETDINKSTATRRELSGASANHHMLNLQQSMGDDSTDVYGIVHISAQRGSDKIFYQPQYRRPAAVLFRTQEYLRDNPGAPIPIGNQRPFLNYEYPDKILDSQYVKESQQVNNNQEAESGPYEYRHNDYRPAPRIIHVPVDDRGVPIQGYEARYVNPYRPQPLIQRYDPVNEMHSISAPVSTRDFKRLLHILILRQNRLQALMEQIMPEAYQAAHYRSEPYHAQSRPYSRHRDDDQYDYRYQPQYRQDFYSTQVSNYDDRDYESHRYSPRRRLYSRPYDAQGSASEHIEQTPEYLPVEVREALLLKMLLLAISPDFMPTPAPATELTTAAPTRKQVRNVQILGEEGSDKKTRQGH-