Monarch geneset OGS2.0

DPOGS215910
Transcript        DPOGS215910-TA      1290 bp
Protein           DPOGS215910-PA      429 aa
Genomic position  DPSCF300029 + 458887-464395
RNAseq coverage   2x (Rank: top 92%)
Annotation
Heliconius        HMEL005738          2e-20    54.95%
Bombyx            BGIBMGA000282-TA    2e-22    87.93%
Drosophila        CG34461-PA          8e-21    48.08%
EBI UniRef50      UniRef50_C0H6S4     2e-35    69.05%  Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6S4_BOMMO
NCBI RefSeq       NP_001166679.1      4e-36    69.05%  cuticular protein RR-2 motif 82 [Bombyx mori]
NCBI nr blastp    gi|290560911        8e-35    69.05%  cuticular protein RR-2 motif 82 precursor [Bombyx mori]
NCBI nr blastx    gi|290560911        4e-48    69.77%  cuticular protein RR-2 motif 82 precursor [Bombyx mori]
Group
Gene Ontology     GO:0042302          3.7e-13  structural constituent of cuticle
KEGG pathway
InterPro domain   [72-124] IPR000618  3.7e-13  Insect cuticle protein
Orthology group   MCL44267            Lepidoptera specific
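
The genomic position field can be split into scaffold, strand, and coordinates so that the locus span can be compared with the transcript length. The following is a minimal Python sketch; it assumes the coordinates are 1-based and inclusive (the record does not state the convention), and the function name `parse_position` is hypothetical.

```python
import re

def parse_position(field):
    """Split a 'SCAFFOLD STRAND START-END' field into its four parts."""
    m = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", field.strip())
    if m is None:
        raise ValueError(f"unrecognized position field: {field!r}")
    scaffold, strand, start, end = m.groups()
    return scaffold, strand, int(start), int(end)

# Value copied from the "Genomic position" field of this record.
scaffold, strand, start, end = parse_position("DPSCF300029 + 458887-464395")

# Assumption: coordinates are 1-based and inclusive, hence the +1.
span = end - start + 1
transcript_bp = 1290  # from the "Transcript" field

print(scaffold, strand, span, transcript_bp)
```

Under that assumption the locus spans 5509 bp against a 1290 bp transcript, which would be consistent with intronic sequence, although the record itself does not list exon boundaries.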
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215910-TA
ATGTCGGCCAGTTACGCGATATATCCACATCATCATCACCTGGCTGTTTCCCATCAGCAGGTGATCAAGCACGACGGACCACACCATCCGATACCAATTCATCATCACGTGCCACACCACCACGTTGGAATCATTCACAATCTTCCATTACCCGTACCCATCCACCACGTGCCTCACCATCAAATACACCACGATCATTATGCATTCCCGGAGTACAAGTTTGCGTACTCAGTCCATGACCATCATACTGGTGATGTGAAATCACAGCATGAGTTCCGTCATGGAGACGTGGTGCAAGGCGGATACGAGCTCATCGAACCCGACGGCCGCCAGAGAAAAGTTGAATACAAAGCTGACGATCATTCTGGATTTAACGCTAATGTTTTAATTTCAAAACCATGGGAAGAAGGAAGTAATAACGAAGAAAACGGAGGAGAAAATTCAAATGAAAATAATGAAGGCCAAATGGAAAACGAAGAAAATGGAGGTGAAAATCAAGAAAGAGAAGAAGAGCGTAATCAAGGAAATGAACGAGAGTCTAATCAAAATAATAGCTCAGAAAATGGTGGAAATGAAAATAATAACGAAGAAGAAGGAGAGACAATAAATATAAACAGGAGTCAGGAGAACAACAATGGTAGAAATAATAATAGTGGTGGACGAAATTCAGCAGAAAATTCGGGTGAAGGTCAGGGCCGCGGTGAAAGTGGTAGGGGTTGGCAAAATAACCGAGGAATGGAATGGCAAAGAGGCAACAGTGGTTCTAATGAAAATAGCGAAGAAAGACGCGGCGGAGAAAATAATGGCGGTAGAGCAGGAGGAAATTGGCGCGGACGTGTTAAGTGGAATGGATGGCAAGAGGGTGGCAATCGTGGCGAACGAAATCAGGAAAACAATGAGGGAAGACGAGAAGAAGGTAATGAAAATGAAAACCGTAACGATAGAAGCAGGCAGAGTAGCGAGAGCTCAGAAGTCCAAGAAAACGATCGTGGGCAATGGAGCAATGAGAGACAAGAAAACAACAATCAGGATGAAGAAAATCAGGAAGAAAATGGAGGCGAGCGAAATCGTAACAGTGGAGAAAATGGACAATGGAATGAAAACGAAGGAAACAATGAAAATGGGGGAAACAATGAAAATAGGGGTAGTAATGAAGGCCAGGAAAATAACGGCAAAAGCAATGGCCGTAAAGGCGAGAGGAACGAAAAAGGGAAGCAAGAAGTCACAAAAACCCATTATCACATAATTATTCATCATCCTAAACATCATTACAAGTCAAAACAAAATTAA

Protein sequence:

>DPOGS215910-PA
MSASYAIYPHHHHLAVSHQQVIKHDGPHHPIPIHHHVPHHHVGIIHNLPLPVPIHHVPHHQIHHDHYAFPEYKFAYSVHDHHTGDVKSQHEFRHGDVVQGGYELIEPDGRQRKVEYKADDHSGFNANVLISKPWEEGSNNEENGGENSNENNEGQMENEENGGENQEREEERNQGNERESNQNNSSENGGNENNNEEEGETININRSQENNNGRNNNSGGRNSAENSGEGQGRGESGRGWQNNRGMEWQRGNSGSNENSEERRGGENNGGRAGGNWRGRVKWNGWQEGGNRGERNQENNEGRREEGNENENRNDRSRQSSESSEVQENDRGQWSNERQENNNQDEENQEENGGERNRNSGENGQWNENEGNNENGGNNENRGSNEGQENNGKSNGRKGERNEKGKQEVTKTHYHIIIHHPKHHYKSKQN-
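
The transcript and protein records above can be checked against each other by translating the nucleotide sequence. The following is a minimal Python sketch, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The names `translate` and `CODON_TABLE` are illustrative, and only the first 30 and last 12 nucleotides of the transcript are included here; the full sequence would be read from the FASTA record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(nt):
    """Translate an in-frame nucleotide string; unknown codons become 'X'."""
    nt = nt.upper()
    if len(nt) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE.get(nt[i:i + 3], "X") for i in range(0, len(nt), 3))

# Excerpts copied from >DPOGS215910-TA (not the full transcript).
head = "ATGTCGGCCAGTTACGCGATATATCCACAT"  # first 30 nt
tail = "AAACAAAATTAA"                    # last 12 nt

print(translate(head))  # MSASYAIYPH, the start of >DPOGS215910-PA
print(translate(tail))  # KQN*, the end of the protein plus the stop codon

# Length check from the record: 429 residues plus one stop codon.
assert 3 * (429 + 1) == 1290
```

Both excerpts agree with the listed protein sequence, and the stated lengths are consistent: 429 codons plus a stop codon account for the 1290 bp transcript.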