Monarch geneset OGS2.0

DPOGS202779
TranscriptDPOGS202779-TA1590 bp
ProteinDPOGS202779-PA529 aa
Genomic positionDPSCF300018 - 973998-982457
RNAseq coverage971x (Rank: top 13%)
Annotation
HeliconiusHMEL0092981e-17978.42% 
BombyxBGIBMGA010500-TA5e-14663.64% 
DrosophilaCG8927-PA7e-2164.52% 
EBI UniRef50UniRef50_C0H6I43e-14365.33%Putative cuticle protein n=2 Tax=Obtectomera RepID=C0H6I4_BOMMO
NCBI RefSeqNP_001166749.16e-14465.33%cuticular protein hypothetical 28 [Bombyx mori]
NCBI nr blastpgi|2905632671e-14265.33%cuticular protein hypothetical 28 precursor [Bombyx mori]
NCBI nr blastxgi|2905632670.069.48%cuticular protein hypothetical 28 precursor [Bombyx mori]
Group
Gene OntologyGO:00423022.9e-09structural constituent of cuticle
KEGG pathway 
InterPro domain[127-169] IPR0006182.9e-09Insect cuticle protein
Orthology groupMCL17158 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202779-TA
ATGGCCGACATCCGGAAACCACGGAACGTATACTTTTCAGAAACCGAGTTTGTTGTGATCAATAAAATTATACGGCGATACGAAACGGAACTCTTACCGCCTGTGTTGTGTCTTGTGCTGTCGACGGGTCTGGCGGCCAGAGCTCGTCCGCGAGTGCAGCAACCCATCGATGACTATGACTATCAGCCGCAGTCCCAGGAGAGATCTGGACAGTTGGTGTTGGTTTCTCAAGACGCTTATGACAACTTATATCAGGCACAAAACCGCGCTGACGAGGAGTACGAGCCTCGGGCTGTTCACCGCGGACCAGCACGTATCAAGTCATCGCAACCGCTACAGGAAGCACAGAAGCAGCCTCCAGTTAACGATGACGGCAGCTTTACCTTCGGTTATGAGGCTGCGGACGGATCTTTCAAAGAAGAGACCAGAGGAACTGACTGCGTCGTTCGTGGTAAATACGGTTATGTCGACCCCGATGGTAATAAACGAGAGTTCACTTACGTCTCAGGAAATCCTTGCGATCCCAACAAGCCAGACGATGAGGAACAAGAAGCTCCGTCTAGGGACTCTGCAGAGAGAGACGACCCTGAACCCAACTACCCGTCGAGACCACCCGTAAGGCCGACTACACCACGACCAGCGACTACCTACTTCCAAAACGATTTCAGGGACGCTGATGAAGAACCCGAAGAGGAACCTCTCCAAAATATTAGACCGCGTGTAGTTCAACGTCCACAACAAGCTAGGCCAGTATTACGACCGGCCTACCAAGCTCAACAGATCGCTATTACGCCAAGACCGGTACCGGTGACCACTGCCAGAGCACTCCCACCAGCGACCACCTTCAGACCACAATTAGTCCAAGTCACACCCAAACCCCAAATTCAGTACAGTCCCCAGCCTGAATATTCACCATCCCCCTCACCGGCCGCCATTCTCAATTCCAGACCGGGACAGATCGATTTCGCTGCTGAGTTTGCTAAATTCCATCGTGAAAATCAACAAACTGCTTCAACACCTAGTTCTTTAAGTACAGCAAGCAGAGCTCAGCCTACCGCTGCTGCCGCTATTCCTTCTGGCAATCCTCTATACTCAACTGAATTAGTCTTCGATCCTTCCAGCGGTCAATACGATACTCAACTTTTCCAATCTCTGCCTCAGACAAAGGGAGAACTCAATCTCAACCAACGCCTTCAGCCTTACGTATCTCAGCAGCAGCAGCAGCAGCAGAGACCTTTCATACCTTCCCCACAAATTTCCTCTGGACCATCCGCTCAGGCTCCCATATATAGACAGCAAATACAACAAGTACAACAAGCAAATCCGCAGGAACTTTACCAGAGGCAACAGTCTGAGTCACAATTCCAAAGTTCCCAACAACTTTTTGCCCAGCAGCAGCAAATTCAACAAAGTCAGTTACAAAGAGACAGAGCTGCGGCTAAAGCTCAAGCCCAGAGACTAGCGATCGCTCAGCAGGCGCAGGCTCCTCAGCGGTCTCCGCAAACACCCCAGTACTACTACGTGCCACGAGAGAATCAGTCCACGGGACAGATCGACGCGTTCCTGAGAGGCCACGGCATCCAGCTTTAA

Protein sequence:

>DPOGS202779-PA
MADIRKPRNVYFSETEFVVINKIIRRYETELLPPVLCLVLSTGLAARARPRVQQPIDDYDYQPQSQERSGQLVLVSQDAYDNLYQAQNRADEEYEPRAVHRGPARIKSSQPLQEAQKQPPVNDDGSFTFGYEAADGSFKEETRGTDCVVRGKYGYVDPDGNKREFTYVSGNPCDPNKPDDEEQEAPSRDSAERDDPEPNYPSRPPVRPTTPRPATTYFQNDFRDADEEPEEEPLQNIRPRVVQRPQQARPVLRPAYQAQQIAITPRPVPVTTARALPPATTFRPQLVQVTPKPQIQYSPQPEYSPSPSPAAILNSRPGQIDFAAEFAKFHRENQQTASTPSSLSTASRAQPTAAAAIPSGNPLYSTELVFDPSSGQYDTQLFQSLPQTKGELNLNQRLQPYVSQQQQQQQRPFIPSPQISSGPSAQAPIYRQQIQQVQQANPQELYQRQQSESQFQSSQQLFAQQQQIQQSQLQRDRAAAKAQAQRLAIAQQAQAPQRSPQTPQYYYVPRENQSTGQIDAFLRGHGIQL-