Monarch geneset OGS2.0

DPOGS203120
TranscriptDPOGS203120-TA1212 bp
ProteinDPOGS203120-PA403 aa
Genomic positionDPSCF300094 + 135707-139244
RNAseq coverage442x (Rank: top 28%)
Annotation
HeliconiusHMEL0160453e-3741.88% 
BombyxBGIBMGA001444-TA1e-4639.90% 
DrosophilaCpr49Ah-PA4e-1638.84% 
EBI UniRef50UniRef50_Q8I6Y05e-4243.79%Cuticle protein n=2 Tax=Bombyx mori RepID=Q8I6Y0_BOMMO
NCBI RefSeqNP_001036894.19e-4343.79%cuticular protein RR-1 motif 21 [Bombyx mori]
NCBI nr blastpgi|2236711439e-4439.90%TPA: putative cuticle protein [Bombyx mori]
NCBI nr blastxgi|2236711432e-6140.33%TPA: putative cuticle protein [Bombyx mori]
Group
Gene OntologyGO:00423022.7e-15structural constituent of cuticle
KEGG pathway 
InterPro domain[194-249] IPR0006182.7e-15Insect cuticle protein
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203120-TA
ATGAAGCTGCTTTTCTTTACTATCGCCATCCTAGCATTTTCGCTAGCAGACGAAATCGAAACCAATGACGATCTCCGCGTCGCTGCATCTATTAATGTGCCTGTTCATCATGCACCTCGTAGCTTTCGACGAGTAGTTCAAAGACGCCGAAGCTGGAACCCTCAGAATCCTCAAAATTCTGCCCAGGATCCCGCTAGGCCAGATCCAGTACAAAATTCATATTCCCCTATTGATAACGAACAGATCTCTTATAATCAACCACAGAATAACAATAATCAAAACAATTGGTCAAATAACGGAAATAACGTTCAAAATTCTTGGTCCACCGCTGCTCCCAATACATGGAATCAAAATCAGCCCCAGAATAACTGGAATAATCAGCAAAATGTTTGGGCAAATCAGCCAGCCAACCCGAGCTCTCAAAATAATTGGAATAAACCTGCAAATTCTGGCATGGCAAACAAACCAGCTGTCGCCGGATCCCAAAACAACTGGAACCAAAATCCAAGCCCAAGTTCTTCAACAACCACTCCAATCCCGGTTATCAAAAACGAACAAATCATCGGTGATAATGGCAGCTACAAATATGAATACGAGATCGCTGATGGGACTCATGTTGCCGAAGAAGGCTACTTCACAGACCCTAACACTGAAGATGAAAGCATTGTGAAGAAAGGCTTCTACTCCTTCACCGCTGCTGACGGCAAGGTCTACAGCGTAACCTACTGGGCAGATAAAACTGGCTTCCACGCAGTCGGCGACCATCTCCCTAAACCACCCGCAGTCCCACCCGCGATTCAGGCAGCTCTCGATCAAAACGCTAAAGAGGAAGCAGCGAAAGCTGAAGCTGAAAAGAATAAACAACAGCAGAGCAGCAAGCCTCAGGCTCAGGAACCTCCGAAACCCATTGCTCCCCAACCAATCCAACCAGCGCCTCAACAGCCCATTCAAAATGTTCCCCAACAAGGTTACCCTCAACAAAGCTATCCCCAACAAAGTAACCCCCAACAAGGTTACCCTCAACAAGGATACCCTCAACAGGGTTACCCTCAACAGGGATATCCTCAACAAGGATACCCCCAACAAGGAAACTACCAGCAAGGCTACCCTCAACAAGGATATCCCCAACAGCAACAACAACAAAACTACAACTCCAATGAACAAAACTATTACAATTCTCAAGCTCAGAATTTCCCCTCTTACGGAAAGTAA

Protein sequence:

>DPOGS203120-PA
MKLLFFTIAILAFSLADEIETNDDLRVAASINVPVHHAPRSFRRVVQRRRSWNPQNPQNSAQDPARPDPVQNSYSPIDNEQISYNQPQNNNNQNNWSNNGNNVQNSWSTAAPNTWNQNQPQNNWNNQQNVWANQPANPSSQNNWNKPANSGMANKPAVAGSQNNWNQNPSPSSSTTTPIPVIKNEQIIGDNGSYKYEYEIADGTHVAEEGYFTDPNTEDESIVKKGFYSFTAADGKVYSVTYWADKTGFHAVGDHLPKPPAVPPAIQAALDQNAKEEAAKAEAEKNKQQQSSKPQAQEPPKPIAPQPIQPAPQQPIQNVPQQGYPQQSYPQQSNPQQGYPQQGYPQQGYPQQGYPQQGYPQQGNYQQGYPQQGYPQQQQQQNYNSNEQNYYNSQAQNFPSYGK-