Monarch geneset OGS2.0

DPOGS203121
Transcript        DPOGS203121-TA       2643 bp
Protein           DPOGS203121-PA       880 aa
Genomic position  DPSCF300094 + 140919-143737
RNAseq coverage   0x (Rank: top 96%)
Annotation
Heliconius        HMEL016045           4e-91    53.57%
Bombyx            BGIBMGA001445-TA     7e-99    50.87%
Drosophila        Cpr47Ee-PA           3e-14    46.43%
EBI UniRef50      UniRef50_C0H6L1      2e-96    50.87%  Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6L1_BOMMO
NCBI RefSeq       NP_001166728.1       3e-97    50.87%  cuticular protein RR-1 motif 20 [Bombyx mori]
NCBI nr blastp    gi|290563249         6e-96    50.87%  cuticular protein RR-1 motif 20 precursor [Bombyx mori]
NCBI nr blastx    gi|290563249         3e-139   37.46%  cuticular protein RR-1 motif 20 precursor [Bombyx mori]
Group
Gene Ontology     GO:0042302           5.4e-13          structural constituent of cuticle
KEGG pathway 
InterPro domain   [180-236] IPR000618  5.4e-13          Insect cuticle protein
Orthology group   MCL21163 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203121-TA
ATGAATAAAATCAATTATCAATTTCTCCCTTCAAGTCAATCAATAGTACCATCTCATTTTCCAACTCCAATTTCTATAATTCTATCCTCAACCCTAAGACCTAATTCGGATGATAATGTTTTTGAGGATGGTGAACAAGAGCATATACATAATCACAACTCGAAGCCAAGTTTCACATCAAGAGATCAGGTCAATCGTACATTTACCAATAAGGATAGAAAAACATTTGAAAGGATAGATAATTTACCAAGTTTAAGACTGTATAATACACAACGTCAAAATGAGAAAGGACGACCCAACCAAGTTGTAATAGGTAATAAGCAATCATCGAGTAATTTTCCGGCAACTCAAAAACCAGCTGCTCGTCGAATGATACTGACAAGTGAAATTCAAAAGGCTGATCCAGGACTAGACGAAACAAAAGCTGGTTCGACTCAAAACGCCCAATACATTCAAGCACCAGGCCATATCGCACGAACAACAACAGAATCAGCTATACCAATAATACGCTTATCTAATGAAATGGATTTGGATGGCAGCTTTAGTTACGAAGCATTAGGAGCTGATCAGACCCATTACGTGCAACATAGTCGCATGGAAAATATGGGTACTGACAAAGAAGAGCAGGTTGTAGAGGGATCTTACTCCTACATCGGGGATGATGGACGTACATATACTGTACATTATATCGCCGACTCTAATGGTTACCGAGCTTCTGGTGACCATCTACCGTCACCACCCCCAGTTCCCGAAATAATACAAAGGGCAATTCAATATAATTTAGCAGAAGAAGCTAGAAAACCTCCACATTTAAGAAATTTATGGGAAAATGAAGGAAGTAGTGATGAAAGTAAAGAAGAAAGCAAACAAAACTTCCTCAGTATACCTTCACAACGAAGTTTGTTTACTGGAAAAACTCCTGAAGCATTTTCCTTTTCAGATAGTTCCAATCCACAAAGCAATCAAATTACATCACGTTCGAATTTAAATATTCAAACTAAACCTAAAAATGAAAATATTCTGCATAATCCATTGAAATTAAATACACCAATTACTCCTCAAATTACTTTTCTGGCATCTCAAGGTGCCCATAGCCCACTGTCACAACAACAAATCCAGACAACAATGCAGAAAGCAAGTAATTCAGATAAATTGCCTCAGTTGATAAACTACGAAGCTGATAAAAAACTTGAACATGAAGGAAATAAGCAACTGTGGAGGTGGCAGTATGGTTATAACGCCAATGAAAACCCTAATAAAAATTCAATATCTAGATCATTTACCGAGGGGGACGATGTCATTATAAATTTCAACGATATGACACCAGAGCAGTACACTCGAATGATGCAGACTCAAATATTAACTCAGACAGATGCAAATTCTAACAATATTGAAAACGACAATGGAAAACATTACCCCCCACCTAATTTCGATTCCAACTTGGAAAATAATTTAAATGAACCTAATACATTTTCACAGACATATAATATAAAACCAGAAACAGGCGTAAATAATAATGACAAAAGAGAAGAAACGCGTTCATCAACACCAACGTATGAAACATTATTTTCAACACCATTATATAGACAAGAATCTAATAAATATAATATTAACCAAAATGCAAATCTTCATACAAATTTACCAGATGCAAAAACGGAATACCAACATACAAAATCAGATGTATTTGAAAATCATAATTTTTCTCCAGATAATCAATTGGTAACTCAAACCTATAATTCTGAAAACCCTCCATTTGAACCCGTGAAAATAAGTAAAAGTAATGAGAATTCAACGGTGTATAAACAATTTGAACAAAAATATAAAATTCAAGCTTTAAACGACGAGGAGCCTTCTCAAGATCACCATAAACATATGTCTACACCTTCATCAAGTTTATATACAAATATTTACGTTACGCCTTACAATTACTATCAATATACAGAACAGAACAGCCATAAACCAAGTTTAAATGAAAATAAACCAGAAACATTCACACAACA
ATCGACTACCACAGAATCATCGGTAGAGCAAATGTTACAAGAAAATATTTTCCTAAGGAATTTATTTAAAGCACCCAATACAATAAATACATCAACAACAGCACTACCTCGGAACGATGATAGCAATATATATAAATTTTATAATCCAGAACCCTTCAATAAACCAAATAATGCAATCGAAAATAAGCCAAATCTTCCCGATAAAACAAAAACGACGAAAGATAAATCTTTAAATCTTAATGAGGTTTTTAACTATGTAATGGCAAAAAATCATTTCGAATCAGCTAAAACAAAGCCTAGAAGTAAATCAACACATTTCATACAATACAACAATAACTTAAGCAAAAAAGAGAATCCTGTATTGTATATACCAACAGTAAAAGAAATAGATAAAAATGAAGACGAAAGTTTTCAAGATGAAAACAAAAATCATTCAAATCTAAACCAACAACAACAGCAAGAATTACATGGTTTAATTAAAAAATATAAAGTCCTTCAACGTCAGAAGAATGCGAGCTCAAAGAGCCAAATTATGTACGAACCCCAACTCCACATAAAGACCTTCCACTCCCCAGGTCTGCCACCGCTAGGACGAGCGGGACCATCTATGAAAACTTACTTCCCACCAACTTTACAGCGATAA

Protein sequence:

>DPOGS203121-PA
MNKINYQFLPSSQSIVPSHFPTPISIILSSTLRPNSDDNVFEDGEQEHIHNHNSKPSFTSRDQVNRTFTNKDRKTFERIDNLPSLRLYNTQRQNEKGRPNQVVIGNKQSSSNFPATQKPAARRMILTSEIQKADPGLDETKAGSTQNAQYIQAPGHIARTTTESAIPIIRLSNEMDLDGSFSYEALGADQTHYVQHSRMENMGTDKEEQVVEGSYSYIGDDGRTYTVHYIADSNGYRASGDHLPSPPPVPEIIQRAIQYNLAEEARKPPHLRNLWENEGSSDESKEESKQNFLSIPSQRSLFTGKTPEAFSFSDSSNPQSNQITSRSNLNIQTKPKNENILHNPLKLNTPITPQITFLASQGAHSPLSQQQIQTTMQKASNSDKLPQLINYEADKKLEHEGNKQLWRWQYGYNANENPNKNSISRSFTEGDDVIINFNDMTPEQYTRMMQTQILTQTDANSNNIENDNGKHYPPPNFDSNLENNLNEPNTFSQTYNIKPETGVNNNDKREETRSSTPTYETLFSTPLYRQESNKYNINQNANLHTNLPDAKTEYQHTKSDVFENHNFSPDNQLVTQTYNSENPPFEPVKISKSNENSTVYKQFEQKYKIQALNDEEPSQDHHKHMSTPSSSLYTNIYVTPYNYYQYTEQNSHKPSLNENKPETFTQQSTTTESSVEQMLQENIFLRNLFKAPNTINTSTTALPRNDDSNIYKFYNPEPFNKPNNAIENKPNLPDKTKTTKDKSLNLNEVFNYVMAKNHFESAKTKPRSKSTHFIQYNNNLSKKENPVLYIPTVKEIDKNEDESFQDENKNHSNLNQQQQQELHGLIKKYKVLQRQKNASSKSQIMYEPQLHIKTFHSPGLPPLGRAGPSMKTYFPPTLQR-