Monarch geneset OGS2.0

DPOGS202347
TranscriptDPOGS202347-TA768 bp
ProteinDPOGS202347-PA255 aa
Genomic positionDPSCF300104 - 632336-639512
RNAseq coverage261x (Rank: top 41%)
Annotation
HeliconiusHMEL0116741e-5349.81% 
BombyxBGIBMGA007611-TA3e-3639.36% 
DrosophilaCpr31A-PA3e-1153.23% 
EBI UniRef50UniRef50_Q170154e-1044.05%Cuticle protein n=31 Tax=Endopterygota RepID=CU01_ANOGA
NCBI RefSeqXP_001238481.15e-1144.05%cuticular protein 5, RR-2 family (AGAP001668-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1187943269e-1044.05%AGAP001668-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1187943321e-0943.18%AGAP001667-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00423023.5e-06structural constituent of cuticle
KEGG pathway 
InterPro domain[191-232] IPR0006183.5e-06Insect cuticle protein
Orthology groupMCL25369 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202347-TA
ATGAAATCTTCAATAGTACTTATATTTTCTCTCCTTGCGAGCTCCAACGCTATATCTCACAGCTTTTCTGGACCAATCTCAGCAGCATCCAAGTCGCAGATCAGATTCGCTATCGAGAACAAATACGACGACACAGGCCCGGGGGCCAAAGGGCCAGAGTACGCTTACAACACCTACAAAACCTTAGAGGAGGCTCTCATCAGCTACCTCGATGATCCCGACACCAAGCTCCCCGAAGGAGAAAGAGAGAGAGCCGTCGCTGCTCTTCAAATACCACCCATACACACACCAATAAGACAATATCCCAAAACGGTCGAGGAGTACACTGGGTACAAACCAACAGAAAGACCAGCACTTAAACAGTACGATTTGCCTAAAGAGTATTACAGACCTGTAGTCGGTGTTAATGAGCACCAAGTAAATAATATTGGGAACAATTACGTCAAAGCCAATGATGGTGTGAGGTTCCACAGAGTACACCCGGTACAGAACAGGCCGGTGGGGTCGGTGTACTACAACAAGCAGCCACAAACCCAGCAGACGTTCAGCTCATTCAATCCTAACCCTAGATACAGCTTTTCTTACGGCGTTCACGATAAATCAACCGGAGACAGTAAATCAGCTCATGAGAGCCGGACTGATGGAGTGGTCACGGGGTACTACACCTTCATGGACGCCGATGGGAAACAACGAACTCAGGTGATGTGGCGTCTACTGGGGCAAGGGTTCAGGGCGACGGTGCAGAGATCCACCTCCGCTATACAGTGA

Protein sequence:

>DPOGS202347-PA
MKSSIVLIFSLLASSNAISHSFSGPISAASKSQIRFAIENKYDDTGPGAKGPEYAYNTYKTLEEALISYLDDPDTKLPEGERERAVAALQIPPIHTPIRQYPKTVEEYTGYKPTERPALKQYDLPKEYYRPVVGVNEHQVNNIGNNYVKANDGVRFHRVHPVQNRPVGSVYYNKQPQTQQTFSSFNPNPRYSFSYGVHDKSTGDSKSAHESRTDGVVTGYYTFMDADGKQRTQVMWRLLGQGFRATVQRSTSAIQ-