Monarch geneset OGS2.0

DPOGS209413
TranscriptDPOGS209413-TA1470 bp
ProteinDPOGS209413-PA489 aa
Genomic positionDPSCF300346 + 265947-269483
RNAseq coverage28x (Rank: top 76%)
Annotation
HeliconiusHMEL0097961e-9448.71% 
BombyxBGIBMGA012601-TA8e-6453.53% 
DrosophilaCpr67Fb-PA1e-1748.98% 
EBI UniRef50UniRef50_C0H6K22e-6153.53%Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6K2_BOMMO
NCBI RefSeqNP_001166737.13e-6253.53%cuticular protein RR-1 motif 11 [Bombyx mori]
NCBI nr blastpgi|2905608106e-6153.53%cuticular protein RR-1 motif 11 precursor [Bombyx mori]
NCBI nr blastxgi|2905608104e-6253.53%cuticular protein RR-1 motif 11 precursor [Bombyx mori]
Group
Gene OntologyGO:00423025.9e-11structural constituent of cuticle
KEGG pathway 
InterPro domain[152-201] IPR0006185.9e-11Insect cuticle protein
Orthology groupMCL25986 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209413-TA
ATGAACTGGGTATTGTTTGCCGCTCTTACAGTGGCATGTAATGCTGCGAAATTAGACCGTACATATTTACCTCCTGCATCAGCTTCAACAGCTGGTGGAAGTCCAGGTTCAATACAAACTCCGATTGCAAAGTCTGAAATAGAGAATTTGCCACTTGGTAGCTTTGTTAACGAACATGATGGGGTTGTCATTGATGTTGGCGCTGCTGGCATAAAAGCAGTGCAATCCGGCTTAGGAGCTTTAAGAATATCATACGGATCAATGGATTCTAAAGTAGGAGAGGCAGCATTCAGAGGTACAAACCAGGCTCACGTAGCAAACCCGATTTTGGATTACAACCCCGACCTGTCAGCTTTCAATACGCCGAAGAAACCGATGAGATCTGAAATTCAGGTTACACGTGACCGTGGAGCTAGTATTGTTAAGTACCACAATGATAACAACGGTGAGCGTTACAGCTATGGCTATGAAACTGATAATGGAATAAAAGCTGAGGAGAACGGAGTAGCCATCAATGGTGTTCAAGCTGAGGGTGGATTTTCCTATGTGGGTGATGATGGGAAAGTGTACAGTGTCGTCTACACCGCCGATGAGGGTGGTTATCGGCCGATGGGTAATCACCTCCCGACTCCACCACCTATACCGGTGGAGATATTGAGGGCCTTGGAGCAGAACATGAGAGATGAAGCAGCCGGTATTTTTGAGGATGGCTCATACGATGCACGAAAATACAACAATAATGATTATAAGCAAGCTGGGATCAATAATAACTATGATGACAGACAAACAAACATGTTTAATGTTCAATCTGGGCTCATCGCTAATATGAATGGAGGATTCATGAATAACAACCCAGCAGCTGCTGTCAGACCAAACCCAATAGAATTGTTTGGTACACAAACTGAATCTTCGAAATTTGGTGCTGTGAATAAGTTCGGGACAGATTTTAACAAAGGAAGTGGTTTAGGACAAAACCAGATGAACATAAATGCTGAACTTGGCTCAAGAAATGAATATTCACAGAAGCAGAATGCTCAAAATACAATTTCTTCTAATTTTGAGTCCTCGTCTACGAACATGAACGGCAGTAACAGTTTTACGCCCGCGAAGCCTCTTTCACAACAAGCACTTCAGGAAACAATGAATTTAGGACAAGGCCAACTTGATAGCCCAAGGCCAAGTTCATACGATACCTCGATATCATCAGTGACTTCTGAGAAGGTATCTTCAGAAGATAAACAAAAGAATAATCAAGATTCGGGAATAAATTTCTCGATAGAAAGTAATCAACACAAACCAAGCAACAATGGTCAATCATTAATGGCACCGATGCAGAGATTACCAACATATAATATTAACAAAAATACAAGTCCACAATATACGCAATCAACGCAACAAAGTAACGAACAGAATCTTTTCGTGGTCCTCAAGGTTCTCTACAATCTTCTATTTCTAGTCCTTCTCGTTTAG

Protein sequence:

>DPOGS209413-PA
MNWVLFAALTVACNAAKLDRTYLPPASASTAGGSPGSIQTPIAKSEIENLPLGSFVNEHDGVVIDVGAAGIKAVQSGLGALRISYGSMDSKVGEAAFRGTNQAHVANPILDYNPDLSAFNTPKKPMRSEIQVTRDRGASIVKYHNDNNGERYSYGYETDNGIKAEENGVAINGVQAEGGFSYVGDDGKVYSVVYTADEGGYRPMGNHLPTPPPIPVEILRALEQNMRDEAAGIFEDGSYDARKYNNNDYKQAGINNNYDDRQTNMFNVQSGLIANMNGGFMNNNPAAAVRPNPIELFGTQTESSKFGAVNKFGTDFNKGSGLGQNQMNINAELGSRNEYSQKQNAQNTISSNFESSSTNMNGSNSFTPAKPLSQQALQETMNLGQGQLDSPRPSSYDTSISSVTSEKVSSEDKQKNNQDSGINFSIESNQHKPSNNGQSLMAPMQRLPTYNINKNTSPQYTQSTQQSNEQNLFVVLKVLYNLLFLVLLV-