Monarch geneset OGS2.0

DPOGS203249
TranscriptDPOGS203249-TA2442 bp
ProteinDPOGS203249-PA813 aa
Genomic positionDPSCF300210 + 104864-109151
RNAseq coverage429x (Rank: top 28%)
Annotation
HeliconiusHMEL0058260.078.21% 
BombyxBGIBMGA007027-TA0.074.59% 
DrosophilaCG8613-PA0.046.06% 
EBI UniRef50UniRef50_E2B5870.051.96%Spermatogenesis-associated protein 20 n=7 Tax=Eumetazoa RepID=E2B587_HARSA
NCBI RefSeqXP_973977.20.055.94%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|2700113410.053.73%hypothetical protein TcasGA2_TC005347 [Tribolium castaneum]
NCBI nr blastxgi|2700113410.053.69%hypothetical protein TcasGA2_TC005347 [Tribolium castaneum]
Group
Gene OntologyGO:00038243.1e-77catalytic activity
KEGG pathway 
InterPro domain[278-681] IPR0089283.1e-77Six-hairpin glycosidase-like
[87-248] IPR0048792.4e-74Domain of unknown function DUF255
[497-655] IPR0123412.5e-29Six-hairpin glycosidase
[98-239] IPR0123361.2e-24Thioredoxin-like fold
Orthology groupMCL11810 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203249-TA
ATGCCTACAGGCCGTGCTGTATTTTTATTGCGACGTTTGTCGACAGGTGATAATCGATTAAAAAATATAAACTTCAATCAAAATACGTCAAAGGATACATTACTGTCTCACCCGGTTTGTATACATCGTGATTTGGCTTACAATAAAAGAATCTCTGGATTTGGTCACTATAATATTAACCCAAAGCTCACAAGAAGTTACAGTGATAACATAATAAAAATGGCTTCTTCAGAATCAAGTGCAACTCCTAAGAAACACACAAATAAATTAGTGAATGAGAAGTCTCCATATTTGCTGCAACATGCTCATAACCCTGTAGATTGGTATCCATGGTGCCAAGAAGCGATTGATAGAGCAAAACAAGAAAATAAACTCATATTTCTGTCTGTGGGTTATTCGACATGCCATTGGTGTCATGTTATGGAAAGGGAGTCATTTGAAAGTGAAGATGTTGCAAAGATAATGAACGAACATTTTATCAACATCAAAGTGGACCGCGAGGAACGTCCTGATTTAGATCGTGTGTACATGCTCTTCGTTATGGCGACCACTGGCGGTGGAGGTTGGCCTATGTCCGTTTTTCTAACACCAGATCTACGTCCGGTGACTGGCGGCACATACTTCCCTCCTGAAGATAGGTGGGGACGGCCAGGATTCAAAACAATCTTGCTCTCCTTGGCTAAGAAGTGGAAAGAAAATCAAACCCAATTCTTGGAGGCCAGCATCAACATAATGGATGCCTTACAGAACATTTCAAATGTTAAAGTCGAAACGAATTCGGTGCCTGGTGAGGCCACTTGGAACAAATGCGTTAGACGGTACATCACCAACTTTGAACCTCATTTCGGTGGCTTTGGAACAGCCCCAAAATTTCCTCAAGCCTCAATATTTAACTTTTTGTTCCATTTCTATGCACGTGATAAACAAAATCCGGAAGGCAAGCAATGTCTAGAAATGTGCTTGCATACTCTGACTAAGATATCAAAGGGCGGTATCCACGACCACGTGGCGAGTGGTTTCGCTCGATATTCTGTTGACAATGATTGGCACGTGCCGCACTTTGAGAAGATGTTGTACGATCAAGCCCAGCTGATGGTCGCCTATACTGATGCATATCTCGCAACAAAAGAGGAATACTATGCTGACGTAGTACGGGATATTGTTAAGTATGTAAATAGAGATTTAAGACATGACTTAGGTGGTTATTATAGTGCAGAGGATGCGGACTCATATCCAGTTTTTGGGGCCGATAAAAAAAAAGAAGGCGCTTTCTGTGTTTGGGAGTATGATGAAATCAATTCGCTGATTGGAGATAAAAAAGTTGGTAACGTCTCGTATTTGGAAATTTTCTGTGATTATTTCAATGTAGAGGAATCAGGTAATGTGTCCCCCGAGAGCGATCCGCACGGAGAACTGACGAATAAAAATGTTTTAATCATTTATGGATCGGAGGAGGAGACCGCTAGTAAATTTGAGATCACGAAAGATCAGTTGAAACAAGTCCTGAAGGAATGTATTGATATCTTGTATGAAGCTCGTTCAAAAAGACCGAGACCACATCTCGATACTAAAATGTTGTGTTCTTGGAATGGTCTTGCAATATCTGGCCTAGCACACGCAGGGCAAGGCTTGGGAGAGAAAAGTTTCGTCGAGGACGCCATAAAAACTGCAAACTTTATCAAAGAACATTTGTATGATCAAGAAAATAAAACGCTCCTCCATTCATGTTACAAAGCAGAAGACGGCAATATTACTCAAACGAACCCACCAATAAAAGGATTTTTGGACGATTATGCGTTTTTAATAAGGGGTCTGCTCGATTTGTACGAAGCGTCTCTAGACTTGCACTGGTTGAACTGGGCGCGTGAATTACAAGAGAAACAGAACGAATTGTTTTGGGATTCAGATAACGGTGGTTATTTCACTTGCTCCGCCGAGGACACTTCCGTTGTTTTGAGATTAAAAGAAGACCAAGATGGCGCAGAGCCTTCAGGCAACAGTGTGTCCTGTCACAATCTTCAGCGCTTAGCAGCTTACGCTGATAAGAGTTCGGCGGAAGAGGGGGGAGATAGAGAGAGGGATATGGCCAAAAAAGTGTTGATGGCATTTGCGAAACGGCTGATTGACTCTCCTACTGCGTCACCGGAGATGATGTCGGCTCTTATGTTCTTTACTGACTCACCGACTCAGGTGTTAATATCTGGTGGTTGTTCTGATCCCCGCACCCTTGCGCTGGTCCGCGCCGTTCGCTCCCGTTTACTTCCAGGTCGAGTGCTGGCCGTCGCAGATCCCAAGGATTCGCCAGCTGGTATGTCGGACATACTATTGAGTCGCATCCGTAGTACTGGGGAAGCTCCTACGGCGTACGTGTGTCGTCGCTACGCGTGTTCGCTGCCCGTCACGAGCGTTCAACAGCTGGAAACGCTGCTCGATGAACCTTAG

Protein sequence:

>DPOGS203249-PA
MPTGRAVFLLRRLSTGDNRLKNINFNQNTSKDTLLSHPVCIHRDLAYNKRISGFGHYNINPKLTRSYSDNIIKMASSESSATPKKHTNKLVNEKSPYLLQHAHNPVDWYPWCQEAIDRAKQENKLIFLSVGYSTCHWCHVMERESFESEDVAKIMNEHFINIKVDREERPDLDRVYMLFVMATTGGGGWPMSVFLTPDLRPVTGGTYFPPEDRWGRPGFKTILLSLAKKWKENQTQFLEASINIMDALQNISNVKVETNSVPGEATWNKCVRRYITNFEPHFGGFGTAPKFPQASIFNFLFHFYARDKQNPEGKQCLEMCLHTLTKISKGGIHDHVASGFARYSVDNDWHVPHFEKMLYDQAQLMVAYTDAYLATKEEYYADVVRDIVKYVNRDLRHDLGGYYSAEDADSYPVFGADKKKEGAFCVWEYDEINSLIGDKKVGNVSYLEIFCDYFNVEESGNVSPESDPHGELTNKNVLIIYGSEEETASKFEITKDQLKQVLKECIDILYEARSKRPRPHLDTKMLCSWNGLAISGLAHAGQGLGEKSFVEDAIKTANFIKEHLYDQENKTLLHSCYKAEDGNITQTNPPIKGFLDDYAFLIRGLLDLYEASLDLHWLNWARELQEKQNELFWDSDNGGYFTCSAEDTSVVLRLKEDQDGAEPSGNSVSCHNLQRLAAYADKSSAEEGGDRERDMAKKVLMAFAKRLIDSPTASPEMMSALMFFTDSPTQVLISGGCSDPRTLALVRAVRSRLLPGRVLAVADPKDSPAGMSDILLSRIRSTGEAPTAYVCRRYACSLPVTSVQQLETLLDEP-