Monarch geneset OGS2.0

DPOGS207445
TranscriptDPOGS207445-TA2580 bp
ProteinDPOGS207445-PA859 aa
Genomic positionDPSCF300051 - 478776-485024
RNAseq coverage319x (Rank: top 36%)
Annotation
HeliconiusHMEL0148600.069.38% 
BombyxBGIBMGA001188-TA0.067.25% 
DrosophilaCG16979-PA8e-12639.05% 
EBI UniRef50UniRef50_D1ZZB83e-14143.12%Putative uncharacterized protein GLEAN_07479 n=1 Tax=Tribolium castaneum RepID=D1ZZB8_TRICA
NCBI RefSeqXP_970116.15e-14243.12%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|910808119e-14143.12%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|910808113e-13843.12%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[669-851] IPR0124621.3e-50Peptidase C78, ubiquitin fold modifier-specific peptidase 1/ 2
[53-194] IPR0205685.7e-29Ribosomal protein S5 domain 2-type fold
[64-193] IPR0012471.6e-21Exoribonuclease, phosphorolytic domain 1
Orthology groupMCL14359 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207445-TA
ATGATTACTACAAAATATAAGACCGCCTCCTTACCAGCTCAGATACAGTCTATGTCCTCAGTGAAAACTAAAATGACAAGAGCTGTCAAATTTATCAAAACATTAAACACACTTATACATCCAGGTAAATTTTTTAATGACTATATCTCCCGAGACATTCGTCCAGATGGTAGAAAATTTAATGACCAACGTAATATAAAACTTAATGTTAATGCTATTAAGACAGCCGATGCATCAGCAGTCGTAAAATGTGGAAATACCACCGTTGTTTGTGGCATAAAATTGGAATTAGCAAAACCTAAAGCAGAAGAACCAGATGTAGGCTTTTTAATAACTAATGTAGAGCTTCTTCCATTATGTTCATCAAAATTCCGTCCTGGACCTCCCTCAGACCATGCTCAAGTTGTCAGCAATGTTGTATCAGATATAGTTACGAACTCTAAATGTATAGACATGAAAGATTTATGCATAGTGCCGGACAAACTCTCATGGGTGTTATATTGTGATATGGTTTGCTTGGACTATGACGGCAGTGTTGTTGATGCCTGCTTAATAACACTCATGAGCAGCCTCAAAAGCCATCAAGACACCTTTACCTGTGCATGGTTTACCCATTGCTACAAAAGCGTTACATTAACAGACCCAACAGCATATGAGGAGGAGATGTGTGGTGGCGTTGGTGCCAACCTTATAGCTTGTTGGAATAAAGGACAACTTTGTGGAGTATTTAAATTTGGTGGCAGTAATCTTTCAACAGAAAATGAAAAAGAAACTTTAGCCCTTGCAAAGCAAAAGAGCAAACTAGTTGAAAAGCTCAAAATGTCACCACGTCTCAAAATATCAAAATATGTTATTGAGAGATTATCCAAGATCGACACGAACGAAACTACAGGATGCCTTTACGGCCTCATGTACGACAGCACATTGCTGGTTGTTGGGCTAAGTTTAGAGTTTTTTGAGGATGAGAAAAACACATATCGGCAACTACTTCTTCATTTACCTGCTGAAATTGAGTTGTGCGGTGTTGTTAAGTTCACTGGTTCACTATCTATAGAATCTAAACCGAAAGAAATACTCCAGAATGTGGATATAACTGATAATCCTCTGTTTATCATTATAAATAATGAAAAAGATATAAAGGCACATTTCCTCGTACATGATAAATTTGAAGAAACACCATTTGAAGTTCTTCAACCAGAAGATCTCTGGAAACAGTTCACTTATGTACGCTTAAACACAATTTTACCATTAACATGCGAAGCCACTATGACCGGTGTGAAAAATATTATGCAAAATAAGAGAAAGAAGATAGCATCAGGACAGGTATCTTTCCATATAGATGGATCATCAGTATATTTATTTGGAGTAGACTCTGATGTTGGAGTCACAGGTACTAGCGTCGATACTGATATTGGAGAGTTGATTGACTCTCTGAATCCTGAACAGTCCTCTAAGAAGAAGAAGAATATTATAAACAAAGTTGAAGTAATGCCAGTTAATTTAGTTATGAAGGCTACAAAGGATATGTTTTCGGACAAATTAGTTAAAACAGCTGTTAAAATGATGACCACACAGAGGAAACCTGCATTTTGTATAAGTATGCCGCTAAGAGTGGACACATTGGCTCTTGTACATAGAAATACTAAATTATCAGAACTGTACAATGTGTTAGTTGAAGCTACTTGTCGCTCTCTAAAACTATTGGAAAGTGTTCTGTTAGAGCAATTAGGTCAAGAAGGTATTGGAGATGGAGCTGGTTTACGTCTGCCGGAAACATTCCATTATTTGCCCCAAGAACTGGGACACTTTATCACCAGAGTTGTGCCAAAGGATATTCCAGATGAGAGCATGGAGAAGGAAAGGCGCATACTCCATGAGCAACTGGGTTTATCTGTACAAAGGCCTATATTTAGACGGGGAAATGCTTATAATTTCATTATAAGTAAGCTTGTTAATCCACATGAAGCGATAACTGTACATTCTTTGAAGCCTGACGTTCGAGTGGCCTCGGTCAGGGGAAGATACATCTACTATCATTACATGCAAGATAACTTCAACGATGACGGCTGGGGTTGTGCTTATCGTTCTATGCAGACTATATTTTCTTGGTTTAGGTTCCAAGGTTACACATCAGTTGAAATACCCACACATAGAGAAATCCAACAGTGCCTTGTTAATATTGGTGATAAGCCTACATCATTTTTGGGTTCCAAGCAATGGATAGGTTCTACAGAAGTGATGTTTTGTTTGGAGACCCTTTTAGGTGTGCAATCAAGGATTATTTTTGCTAACACTGGATCAGAACTGCTCAATTACACCCCAGAGCTTGTTCATCATTTTGAGAAACATGGAAGCCCTATAATGATAGGCGGAGGAGTATTAGCACATACAATTATCGGGGTGGAATACAACAATATAACAAATGAAACGAGATACCTTATCCTGGACCCACACTATACAGGTGTTGACGATATCAACACTGTTATAAGCAAAGGCTGGTGCGGTTGGAAGAATTCAGACTTTTGGAACAAAACTGCTCACTATAATCTATGTTTGCCACAAACAAAACCATCTATATGA

Protein sequence:

>DPOGS207445-PA
MITTKYKTASLPAQIQSMSSVKTKMTRAVKFIKTLNTLIHPGKFFNDYISRDIRPDGRKFNDQRNIKLNVNAIKTADASAVVKCGNTTVVCGIKLELAKPKAEEPDVGFLITNVELLPLCSSKFRPGPPSDHAQVVSNVVSDIVTNSKCIDMKDLCIVPDKLSWVLYCDMVCLDYDGSVVDACLITLMSSLKSHQDTFTCAWFTHCYKSVTLTDPTAYEEEMCGGVGANLIACWNKGQLCGVFKFGGSNLSTENEKETLALAKQKSKLVEKLKMSPRLKISKYVIERLSKIDTNETTGCLYGLMYDSTLLVVGLSLEFFEDEKNTYRQLLLHLPAEIELCGVVKFTGSLSIESKPKEILQNVDITDNPLFIIINNEKDIKAHFLVHDKFEETPFEVLQPEDLWKQFTYVRLNTILPLTCEATMTGVKNIMQNKRKKIASGQVSFHIDGSSVYLFGVDSDVGVTGTSVDTDIGELIDSLNPEQSSKKKKNIINKVEVMPVNLVMKATKDMFSDKLVKTAVKMMTTQRKPAFCISMPLRVDTLALVHRNTKLSELYNVLVEATCRSLKLLESVLLEQLGQEGIGDGAGLRLPETFHYLPQELGHFITRVVPKDIPDESMEKERRILHEQLGLSVQRPIFRRGNAYNFIISKLVNPHEAITVHSLKPDVRVASVRGRYIYYHYMQDNFNDDGWGCAYRSMQTIFSWFRFQGYTSVEIPTHREIQQCLVNIGDKPTSFLGSKQWIGSTEVMFCLETLLGVQSRIIFANTGSELLNYTPELVHHFEKHGSPIMIGGGVLAHTIIGVEYNNITNETRYLILDPHYTGVDDINTVISKGWCGWKNSDFWNKTAHYNLCLPQTKPSI-