Monarch geneset OGS2.0

DPOGS214008
TranscriptDPOGS214008-TA1227 bp
ProteinDPOGS214008-PA408 aa
Genomic positionDPSCF300313 - 23497-26829
RNAseq coverage500x (Rank: top 25%)
Annotation
HeliconiusHMEL0138170.084.28% 
BombyxBGIBMGA000439-TA3e-1933.92% 
DrosophilaCG2970-PA1e-13477.05% 
EBI UniRef50UniRef50_E2BXV34e-12966.39%Stomatin-like protein 2 n=1 Tax=Harpegnathos saltator RepID=E2BXV3_HARSA
NCBI RefSeqXP_001987049.15e-14073.98%GH21699 [Drosophila grimshawi]
NCBI nr blastpgi|1950283701e-13873.98%GH21699 [Drosophila grimshawi]
NCBI nr blastxgi|1892393995e-13571.78%PREDICTED: similar to AGAP009439-PA [Tribolium castaneum]
Group
Gene OntologyGO:00160201.8e-162membrane
KEGG pathwaydpo:Dpse_GA141458e-130 
 K03364 (CDH1)maps-> Ubiquitin mediated proteolysis
    Cell cycle - yeast
    Progesterone-mediated oocyte maturation
    Cell cycle
InterPro domain[43-366] IPR0019721.8e-162Stomatin
[46-204] IPR0011071.5e-58Band 7 protein
Orthology groupMCL15266 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214008-TA
ATGTTCTCCCGATCTAAATTACTTTTCTCGAGGATCACACCTCTGAAAAATACTTTTAGTGTAGATTTTCAAAATGGGGCCCGTGTGTTGTATGGTATATCATCAGTGAGAAATCGTTCAACAACTCCAATCAATACGATCATAATGTTTGTTCCACAACAAGAGGCGTGGATCGTAGAGAGAATGGGTAAATTTCACAGATTACTAGAGCCAGGACTGAATCTGCTTTGGCCAATTGTAGACAAAATTAAGTATGTGCAGAGTCTAAAAGAAATTGCTATAGATGTTCCAAAGCAGAGTGCAATAACATCTGATAATGTTACTTTAAGTATTGATGGTGTGTTGTATCTACGAATTGTTGATCCATACTTAGCTTCTTACGGAGTAGAAGATCCAGAGTTTGCTATAACACAACTAGCACAAACAACTATGAGGTCAGAGTTGGGACAGATATCACTTGACAAAGTTTTTAGGGAGCGAGAGTCACTCAATGTTTCTATAGTCCATGCCATCAATAAAGCAAGTGAAGCATGGGGCATAACTTGTCTGAGATATGAAATACGGGATATAAAGCTGCCAACCAGGGTACATGAAGCCATGCAAATGCAGGTCGAAGCTGAGAGAAGGAAACGCGCTGCTATATTAGAATCTGAAGGTGTTAGAGCAGCCGATATTAATGTTGCCGAAGGAAAACGTCAATCAAGAATATTGGCTTCCGAGGCCGAGAAGATGGAGCAGATAAATAAAGCGTCAGGTGAAGCTCAAGCCATGTTGGCGGTGGCTGATGCTCGGGCTAAAGGGCTCACCATCATAGGGCAAGCGCTCGCACAAACGGATAGTAAACATGCTGCCGCGCTTACATTGGCCGAGCAGTATGTGTCAGCATTTAATAAACTAGCAAGGACAAACAATACTCTCATACTGCCAGCTAATGCAGGGGATGTTTCAAATTTGGTCGCTCAGGCAATGTCAATATATTCAACGGTAACTTCACAAAGCAACCGCAACCAAGTATCTCATGGTGAGCCGATCATACCTGATATAATGGCTGAGGATCCTATGTACAAACTCAATATGCCAAATGAGAAAGGTCTGACCATGGCTGGACAAGCCGTTCCAAATGAGGACTTAGCGGAGTATTTCTCTGACGATGAGGAGAGAGAAAAGGCATTGCAGACACAGAAAGACAAGAAAAAGCATTTGAACACATTAGAAAAAGATCCATAG

Protein sequence:

>DPOGS214008-PA
MFSRSKLLFSRITPLKNTFSVDFQNGARVLYGISSVRNRSTTPINTIIMFVPQQEAWIVERMGKFHRLLEPGLNLLWPIVDKIKYVQSLKEIAIDVPKQSAITSDNVTLSIDGVLYLRIVDPYLASYGVEDPEFAITQLAQTTMRSELGQISLDKVFRERESLNVSIVHAINKASEAWGITCLRYEIRDIKLPTRVHEAMQMQVEAERRKRAAILESEGVRAADINVAEGKRQSRILASEAEKMEQINKASGEAQAMLAVADARAKGLTIIGQALAQTDSKHAAALTLAEQYVSAFNKLARTNNTLILPANAGDVSNLVAQAMSIYSTVTSQSNRNQVSHGEPIIPDIMAEDPMYKLNMPNEKGLTMAGQAVPNEDLAEYFSDDEEREKALQTQKDKKKHLNTLEKDP-