Monarch geneset OGS2.0

DPOGS205769
Transcript: DPOGS205769-TA, 1233 bp
Protein: DPOGS205769-PA, 410 aa
Genomic position: DPSCF300255 + 103838-105532
RNAseq coverage: 240x (Rank: top 43%)
Annotation
Heliconius: HMEL007748 (E-value 1e-96, identity 48.06%)
Bombyx: BGIBMGA009990-TA (E-value 1e-77, identity 42.73%)
Drosophila: CG34462-PA (E-value 6e-07, identity 33.73%)
EBI UniRef50: UniRef50_C0H6Y3 (E-value 2e-75, identity 42.73%) Putative cuticle protein n=1 Tax=Bombyx mori RepID=C0H6Y3_BOMMO
NCBI RefSeq: XP_001851262.1 (E-value 4e-20, identity 53.75%) conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp: gi|223671384 (E-value 9e-75, identity 42.73%) TPA: putative cuticle protein [Bombyx mori]
NCBI nr blastx: gi|223671384 (E-value 3e-81, identity 42.50%) TPA: putative cuticle protein [Bombyx mori]
Group
Gene Ontology: GO:0042302 (E-value 1.1e-08) structural constituent of cuticle
KEGG pathway:
InterPro domain: [348-399] IPR000618 (E-value 1.1e-08) Insect cuticle protein
Orthology group: MCL18786, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205769-TA
ATGAGAATACCTATAGAAAACAGTAATAATGATAATTTACAAATAGTCCCCAGTGCAAGTACAACTTCGGAAAGTCTTCAAGTTCAGCCCATTTATTATACTAAAAATGCACAGAATCTGGACAGCTATATAATACATGAGCCCCAAAAGACTCCTCAAATCAAGCAGGAGATAAAAGAAGAAATACCGACACCTGCGACTTATTTATTACCACCATCACCGTACGCACGCAATGAATTCTTCTTAGCAACAACTGAGTCCAACGAGGAAAGCGATTGGTATCCCATTCAGAGTGATTCTCAAAATCAAAATATCAACGAAGGCCAGCCACTTGAAGAGTTGAATCTTAGATCGGGTAAAGAATTGCATAGGGGACAAGTTTTTTCAGTCCCTATAAGGAACCTTCTGCCTCCGAAAGAAAATGCTCCCAATGACTTTATAGTCGTCTCTCCGTCTGTGGAACTAGAATTACCTATAGAAGAAATAGATCATACCATAAACAATCCTTCTGCAGATATAACCATGTTAAGAACACCGAAAACAGAAAAACACCATTTTAAAATAGACAACATACCACATCGTCATATTTCACCCCCTAAATATAATCTTCAAAAATATACAAATCCAACTAAATTGTATCCGAAGAAATATATTGGTGAATTTAAACCTATACCAATACCAATTTCTCAATATAGCGATGGTTCAACGGACATACCTGTCGCTAGACCTGTCAAAGTGATTATTCCTGATGACATCAAAAATGGTTTTGTGCCACAACCTATAGAAAACATTGAAAACTCATTTGACAACAATCAAAAATTAAATCAAAATAATGAAGCGGCCAAGATTACGACCGCAACTGGTGCGCCCCACACTCAAGTTATTCCGTCTGAACAAGATACGTCTGAAACAAACTTCCGACATCCGATACGCGACGCTTATTCAGCACCTGCGAAGCAGATAGCTCAAAAGCATATTCCTTTAAAGTCTGACGGCAAACGCACCGAATTTCGAATGCATGGAATGAAAGGTCCACATAGCTATCAATTTGGTTACGATACTGGAAAGGGGAAAAATCGTCAATTCAGATACGAAGAAAGGGATAATGACGGCCATGTCCGAGGCCATTATGGTTATGTGGATAGAGGCGGGAAACTGCGCGTTGTGAACTACGATGCCGATCCCGTGCACGGTTTCCGGGCTGAGGCGCCGGTGGAAAAAGATACAGAATAA

Protein sequence:

>DPOGS205769-PA
MRIPIENSNNDNLQIVPSASTTSESLQVQPIYYTKNAQNLDSYIIHEPQKTPQIKQEIKEEIPTPATYLLPPSPYARNEFFLATTESNEESDWYPIQSDSQNQNINEGQPLEELNLRSGKELHRGQVFSVPIRNLLPPKENAPNDFIVVSPSVELELPIEEIDHTINNPSADITMLRTPKTEKHHFKIDNIPHRHISPPKYNLQKYTNPTKLYPKKYIGEFKPIPIPISQYSDGSTDIPVARPVKVIIPDDIKNGFVPQPIENIENSFDNNQKLNQNNEAAKITTATGAPHTQVIPSEQDTSETNFRHPIRDAYSAPAKQIAQKHIPLKSDGKRTEFRMHGMKGPHSYQFGYDTGKGKNRQFRYEERDNDGHVRGHYGYVDRGGKLRVVNYDADPVHGFRAEAPVEKDTE-
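
Length check:

The transcript and protein lengths reported in this record can be cross-checked with a little arithmetic. The sketch below assumes the transcript is coding sequence only (it starts with ATG and ends with the stop codon TAA, as the nucleotide sequence here does), so that 1233 bp gives 411 codons, the last of which is the stop. The function name `expected_protein_length` is hypothetical and not part of any MonarchBase tooling.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of residues encoded by a CDS-only transcript.

    Assumes the transcript contains no UTRs, so its length must be a
    multiple of three. If the final codon is a stop codon, it is not
    counted as a residue.
    """
    if transcript_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = transcript_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from this record: 1233 bp transcript, 410 aa protein.
print(expected_protein_length(1233))  # 410
```

The result matches the 410 aa listed for DPOGS205769-PA; the trailing "-" in the protein sequence marks the stop codon.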