Monarch geneset OGS2.0

DPOGS208050
TranscriptDPOGS208050-TA2934 bp
ProteinDPOGS208050-PA977 aa
Genomic positionDPSCF300203 + 280894-284680
RNAseq coverage283x (Rank: top 39%)
Annotation
HeliconiusHMEL0178060.067.07% 
BombyxBGIBMGA001478-TA0.055.91% 
DrosophilaSmg5-PA3e-3426.28% 
EBI UniRef50UniRef50_UPI0002063D3C4e-10128.63%UPI0002063D3C related cluster n=2 Tax=unknown RepID=UPI0002063D3C
NCBI RefSeqXP_002426268.11e-10129.63%protein SMG5, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3838555548e-10929.66%PREDICTED: protein SMG5-like [Megachile rotundata]
NCBI nr blastxgi|3838555541e-11629.82%PREDICTED: protein SMG5-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain[74-185] IPR0194586.7e-12Telomerase activating protein Est1
Orthology groupMCL14323 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208050-TA
ATGAAGAACGGGTGTGATGATTTGGATGCTATAATATTGGGGCGTAGTGAACGCGCTAAAAAGGTTTACAGGTATGTAAACGAAGTAGCGCGACGTCTTGGTGAGGCGACGGCAAACTGTAAATCTATAACAGAACTTTTTACCACAAAGATTGAATTGGAACGTCAAAAACTAAGAGACAACTGTGAGAAATTATTTTTTCTTGATCCAATTAACTATGGCAAAAAAAGCCTTGAACTTCTGTGGCGAAAAGTCTATTATGACACGGTAAGTGTTGCCAAGAAACTGCGAGAGAATGATAATGGGTGTGACAGTTACTTATTCATGCACCTAGTGGGTGGAATTGGCCACTTCAACCATCTGATGACCAGAGTACATTCAGAAATGAATGTTCAAGTCAAAGAACTTGATTATTTGCCTCTTTATGATGAAGATGATTCCGACCCTACCAACACTACCAGGGATTCAACAGACGAGCAGGGATATCTTGGCAGATTTGTGTTGTATTCATGTCTGATTTATCTCGGTGATCTAAGTCGATACCAAGTTGAGATATTCAATACATTTGACAGCACATTAGCTGCGAGATACTATCTTCAAGCAGCACAACTTGATTTCACTGTGGGCATGCCTTTTAATCAACTTGGAAATCTTTACTTGGACAAAAATTATAATCTAGATTCAGTTTCATACTACATACACTGTCTGAACACTCTTACACCATTTGAGGGTGCGATGGGCAATTTGACAAAAATTTTTGACAAGAATAATCAATTCTGTGAGACGTTGGTCAACACCAAAGTATTGACCCAGCCGGAACACATGCAAGTGACAATTGCGAATTTTTTATCCTTGATTGAGATATGGTACCTTGGTAAAGAAAATGTAGATATATCACAAATATGTAACACTATAGCTCTTCAACTAAAGATTGCTATGGACTTTACCAAAATGCCTCTGCCAGATCCAAACCAGAACTATACTGAGTACGTCCAAGCTATGGAAGAAGAAAATATCAATCCATCTTATTTAAATTCTAGTGTAATTCATAATATTGTCCAAATTTGTTTGTTTACAATAGCAAAACTAAATGATACAGATGAAGTCAAAGCATTTCCGTGTAAAGCATTTACATTGGCATTTTTATCTCAAGTACTTCAGAAACTACTGAAGCAAATAGAATCTTTTGGTTTAAAAAATCCAGCTCATAAGTATGTGTCGAAGTTTGCATCGAAATATAACAAAGAGAAGGAAGCGGAAGTTACGACTTTAGAAGCGGAAAGTGTAGTGAATGGTGAGAAGAATGAAGAAAACACAGAAAATTTGACTGCAATTGAGGAACATAGCCCAAAAGAAACTTGTCCAGAAACTGAGGTGCCTTTGGTAAACAACGGTGACACTAAAAACAGTAAGAAGGCCTTAGCAAAACGACGTCGCCGACGGCGCGTTGCATCATCCGAGAGTTCAGACGTTAGTGACGCGGAAACTGAGAGCAGTCTGATAGGTTCAGACAAATCTGACTCAGAAGACGACCTGTCAGATTCTACTTTCCAATCAGACGATGAGTCTAAAAGCGAAGGATCAGATTATGATGGTTCTGATTGCGAAATTAATGAGGTAAACCCTAACGCTAACGATGAAAACATGACCAATGGTGATATAAACAAGATCGACAAGAAGATCGTCGAAGAGGCAAATAATAACAAAAACAATAAGGAACCCGTAAAAACAGAGAACTCAAAAAATGATTTCATTGACATACAAGCCATTGAACGATTCCTATTTGGTGATAATTTTCTACCAAGCATAAAATTGTTGTTAGATTGGATTCTAACCGAAAAGGAGCTTATATTATCTTGTGGTGACAGCGGCGAATCGCTCTTCCAGTGCGTAGTAGACCTTCTCAACATATTCACTTACTATTTCAATTCAAAGAACTGCGACGTTCCAGCAAATTGTAAGATTCTTGAATACTCTAAGAATATCGCAAAGAAATTAAAACTTGAATTCAAAACATTACCGCTACCAGAGGACATGAAACTTCGGGGAACTAATATTTGCAGATTCGACAAAGATGCAGCTGAATGGCAGATATTAGATAAAATGAAACCAACTCTTGTAGAAGAAAACGTTATAAGAATATTAACATTCATCGATTTTGGATATCAGATCGCTAAAATCGTTCCCAGGATTAGATTCAATAGAACATTGAAAATATTCTATTTCAAGAAAGTTCTGCCTCCAAAGGTGTCCACGAAAGTTAATCATAAGAAGAGCAGGGAATGGCACAATTCAAAGAAACAGTCTGATAGCGACATCCTACGTCGTCTGGGTCGTCTGTGGCTGGCGTCACAGGTCCGTGAACTGGAGCGTACGGGTCAGGAAGTGCCCTCACTGCTGGCCGTGGACTCGGCGACACTACACAAACACCTGCGTAGAGTGAAGCAGCTTGTACGAACTAGGAACTTCATATTCCTAGTACCAACTGTGGTTTTACAAGAGCTCGATGATCTAAAACGCGAGCGCAGTTCAGCTCGGGACGCGATCCGTTGGCTGGAAATACAACTTAAGAGCGGATCAAGATTCCTCAGGACACAGCGACCAGGACAGTCGAAACCAATTCCTCTACTGAAGTACCCTCCAAAAGCGCCGCCGCATGTGCACAATTTCATACAAATATTGGAGTTTTGTAATCACTTTATATCAGATGAGAAACACGCTCTTGGTGGAAATGGTGATCCAGATAACTCCATACACGGGAAAAGTGCGCCGATACTTATCTTATTGGTCGGCAACGAACCAGGCAGCGAGGAACAATATAAAGACTTCAGTCCAACAGGCACGGCTCAAGCGGCGGGAATATCTGTCAAGTTCATCGGTGACTTTTACGCAAAATGGCGGCAAACCTTCCATAAGAACGGGAAAAAAAGATGA

Protein sequence:

>DPOGS208050-PA
MKNGCDDLDAIILGRSERAKKVYRYVNEVARRLGEATANCKSITELFTTKIELERQKLRDNCEKLFFLDPINYGKKSLELLWRKVYYDTVSVAKKLRENDNGCDSYLFMHLVGGIGHFNHLMTRVHSEMNVQVKELDYLPLYDEDDSDPTNTTRDSTDEQGYLGRFVLYSCLIYLGDLSRYQVEIFNTFDSTLAARYYLQAAQLDFTVGMPFNQLGNLYLDKNYNLDSVSYYIHCLNTLTPFEGAMGNLTKIFDKNNQFCETLVNTKVLTQPEHMQVTIANFLSLIEIWYLGKENVDISQICNTIALQLKIAMDFTKMPLPDPNQNYTEYVQAMEEENINPSYLNSSVIHNIVQICLFTIAKLNDTDEVKAFPCKAFTLAFLSQVLQKLLKQIESFGLKNPAHKYVSKFASKYNKEKEAEVTTLEAESVVNGEKNEENTENLTAIEEHSPKETCPETEVPLVNNGDTKNSKKALAKRRRRRRVASSESSDVSDAETESSLIGSDKSDSEDDLSDSTFQSDDESKSEGSDYDGSDCEINEVNPNANDENMTNGDINKIDKKIVEEANNNKNNKEPVKTENSKNDFIDIQAIERFLFGDNFLPSIKLLLDWILTEKELILSCGDSGESLFQCVVDLLNIFTYYFNSKNCDVPANCKILEYSKNIAKKLKLEFKTLPLPEDMKLRGTNICRFDKDAAEWQILDKMKPTLVEENVIRILTFIDFGYQIAKIVPRIRFNRTLKIFYFKKVLPPKVSTKVNHKKSREWHNSKKQSDSDILRRLGRLWLASQVRELERTGQEVPSLLAVDSATLHKHLRRVKQLVRTRNFIFLVPTVVLQELDDLKRERSSARDAIRWLEIQLKSGSRFLRTQRPGQSKPIPLLKYPPKAPPHVHNFIQILEFCNHFISDEKHALGGNGDPDNSIHGKSAPILILLVGNEPGSEEQYKDFSPTGTAQAAGISVKFIGDFYAKWRQTFHKNGKKR-