Monarch geneset OGS2.0

DPOGS207987
TranscriptDPOGS207987-TA2154 bp
ProteinDPOGS207987-PA717 aa
Genomic positionDPSCF300090 + 734882-758485
RNAseq coverage3185x (Rank: top 4%)
Annotation
HeliconiusHMEL0060122e-7961.29% 
BombyxBGIBMGA000338-TA1e-4676.15% 
DrosophilaCpr49Aa-PB4e-2768.37% 
EBI UniRef50UniRef50_G8FVQ16e-4877.69%Cuticular protein RR-1 motif 32 n=2 Tax=Endopterygota RepID=G8FVQ1_ANTYA
NCBI RefSeqNP_001166719.14e-4576.15%cuticular protein RR-1 motif 32 [Bombyx mori]
NCBI nr blastpgi|3545495232e-4777.69%cuticular protein RR-1 motif 32 [Antheraea yamamai]
NCBI nr blastxgi|3545495234e-5576.06%cuticular protein RR-1 motif 32 [Antheraea yamamai]
Group
Gene OntologyGO:00423026.2e-18structural constituent of cuticle
KEGG pathway 
InterPro domain[536-591] IPR0006186.2e-18Insect cuticle protein
Orthology groupMCL10403 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207987-TA
ATGAAATCGTTCATTGTATTATCGGCTCTGGTAGCCCTGACCTACGCCGCACCCCAGTTCCAGTACCAACAACAGCCTCAATACCAGAATCAATTCATCCCAATCTTGCGGCAAAACCAGGAAATTAACCCTGATGGATCCTACTCATTCAGCTACGAAACTGGTAATGGAATTAATGCTCAAGAGCAAGGCTACCTCAAAAACCCCGGTATTAAAGACGCTGAAGCACAGGTCGCTCAAGGTTCCTTTAGCTACACCTCCCCCGAGGGTATCCCAATCAACGTAAAATATTACGCTGACGAGACCGGTTTCCACGCCGAAGGTGCTCATCTCCCCGTCCCTCCCCCAATCCCAGAAGCCATTGCCCGCGCTCTCCAGTACATCCAATCCCAACCCCAGCAACCTCAGAACCAATACCAAGGTTTCCGAGGACGCGATATAACCAACGACAAACGTGAGATCTCCTATTCACTGAATATGAGGTTCTATCTTTTGTTTAGTATTGTAATAATTACTGCGTATAGTGCTCCGACAGATGAATCTCAAGCCACAATTTTGTCATACGAATATAACAACGATGGAAATGGAAACTACAATTTCAGTCTATATAAATTGTACAACAACAAAAACATAGTCTTTGCTGTCTATGCACATCATCGAGGCGATTTTAATATTAAAGAGTTTAGAACCATAAAGATGCGCTTCTATCTTTTATTCGGTATAATAGTAACCACTGCGTATAGTGCTCCGACAGACGACGAATCTCAAGCAACGATCTTGACTTACGAATATAATAACGATGGAAGTGGAAACTACAACTTCAGGTATGTCATATCTGATGGCACATTTAGAAAGGAGGATGGAGGTCTGATTAATAACAAAGGAGCGCTTAATTTGGTGGTACGAGGAGAGTATGGATACATAGATCCTGAAGGACATCATCATTACATCAAATATATAGCAGACACCAACGGCTTCCAAGCTCTAAGCGATTACAACGATGTTAGGTTTAATGATAGGCGAATTATCGCTGTTTTACGTGGACAAGTTGACGCTGCGATTTATACTTACGAGGACGTGGAGCTCAGACGGATATTTCCAAATGATTTCCCCCAGGACACTGTCAGAAAGAGTGGGAATTTGAAACCAGAACACTTGGGCCGATATATGGAACTTAGCAATTATTTCAGATATCCCAAGAAGGCATCAAACAGTCACTTTGAAGAATTGTCATGTTACAGAGAGAATAACAGAAAGGTTTGCAGGATAGACGTGGATAGGTATTCCCCTGAAGAGAACGAATACTATTTGACACAAAATTCTGCGAGTCACATAGACAGACGTCGGGTATATTGCTTCATGCTGCTATGTTTTACCGAAGCTGAATTAGACCGAGTGAGAATTATTTGTGAACCTATCGTAATAGTTATCAAGATTGACCCAAAAAGGTTCTGTAAGGAAGGTGTCGGAGGTCAGGCGCACATCGGTTCAACAGACTGTGATTTTCTTGTCTTAACCCTGGTTGCTGTTGCTTCCGCGGCTCCTCAAAATCCACAGGATGTTCAGATCCTACGATATGATTCTGATAATTCTGGCTTAGGTTCATACAGCTTTGCCTGGGAACTATCTGATGGAAGCAAACATGAAGAACAGGGACAGTTAAAGAATCAAGGAACTGAAGCTGAGGCTTTGAGTGTGCAAGGACAGTATGCGTGGGTTGGCCCTGATGGTGTGACATATACTGTCACTTATTTGGCTGACGAAAATGGCTACCAACCCCAGCTCCAACAGAGTCCCGGTGGATCAATACCATCAGCAATAGTACTCGTATGTTTAGTTGCTGCAGTAGCAGCAGCTCCGCCTTCTAGAGTTAACTATGATAACAATAACGTACAAATTCTTTCCTATGAAAACGATAACATTGGTTTAAATAGATACAAATATGGCTTCTCACAGTCAGATGGAACGAAACAGGAGCAACAGGGTGAATTTAGAAGCGACGGTGTTTATGTCGTGAAAGGTTTTTATTCGTGGGTCGGTCCCAACGGTTACCTGTATACCGTCAAATACATTTCTGATGAAAATGGCTATCAACCAGAAATGGAAGAGGCCCCCGGTTATGATTCTGGTCTCATTGCTACCGCTCTCGGCTAA

Protein sequence:

>DPOGS207987-PA
MKSFIVLSALVALTYAAPQFQYQQQPQYQNQFIPILRQNQEINPDGSYSFSYETGNGINAQEQGYLKNPGIKDAEAQVAQGSFSYTSPEGIPINVKYYADETGFHAEGAHLPVPPPIPEAIARALQYIQSQPQQPQNQYQGFRGRDITNDKREISYSLNMRFYLLFSIVIITAYSAPTDESQATILSYEYNNDGNGNYNFSLYKLYNNKNIVFAVYAHHRGDFNIKEFRTIKMRFYLLFGIIVTTAYSAPTDDESQATILTYEYNNDGSGNYNFRYVISDGTFRKEDGGLINNKGALNLVVRGEYGYIDPEGHHHYIKYIADTNGFQALSDYNDVRFNDRRIIAVLRGQVDAAIYTYEDVELRRIFPNDFPQDTVRKSGNLKPEHLGRYMELSNYFRYPKKASNSHFEELSCYRENNRKVCRIDVDRYSPEENEYYLTQNSASHIDRRRVYCFMLLCFTEAELDRVRIICEPIVIVIKIDPKRFCKEGVGGQAHIGSTDCDFLVLTLVAVASAAPQNPQDVQILRYDSDNSGLGSYSFAWELSDGSKHEEQGQLKNQGTEAEALSVQGQYAWVGPDGVTYTVTYLADENGYQPQLQQSPGGSIPSAIVLVCLVAAVAAAPPSRVNYDNNNVQILSYENDNIGLNRYKYGFSQSDGTKQEQQGEFRSDGVYVVKGFYSWVGPNGYLYTVKYISDENGYQPEMEEAPGYDSGLIATALG-