Monarch geneset OGS2.0

DPOGS200138
Transcript: DPOGS200138-TA, 2217 bp
Protein: DPOGS200138-PA, 738 aa
Genomic position: DPSCF300128 - 387940-391567
RNAseq coverage: 218x (Rank: top 45%)
Annotation
Database          Hit                E-value   Identity   Description
Heliconius        HMEL007558         3e-99     61.20%
Bombyx            BGIBMGA002783-TA   2e-152    68.27%
Drosophila        CG42358-PA         1e-68     41.77%
EBI UniRef50      UniRef50_E2A4B7    3e-117    40.56%     Putative methyltransferase NSUN5 n=1 Tax=Camponotus floridanus RepID=E2A4B7_CAMFO
NCBI RefSeq       XP_968918.2        9e-127    41.62%     PREDICTED: similar to williams-beuren syndrome critical region protein [Tribolium castaneum]
NCBI nr blastp    gi|189241014       2e-125    41.62%     PREDICTED: similar to williams-beuren syndrome critical region protein [Tribolium castaneum]
NCBI nr blastx    gi|189241014       5e-134    36.76%     PREDICTED: similar to williams-beuren syndrome critical region protein [Tribolium castaneum]
Group:
KEGG pathway:
InterPro domain:
[304-520]   IPR001678   3.5e-24   Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
[332-342]   IPR023267   1.8e-14   RNA (C5-cytosine) methyltransferase
[145-205]   IPR019013   1.1e-11   Vacuolar ATPase assembly integral membrane protein VMA21-like domain
Orthology group: MCL13247 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS200138-TA
ATGTTTGAACATTCTGTAAAAGTTCCAAGACATTATAAAGTAGCTGCAAATATTTTTAAAAAGGTTGCCACAGAAGGCGGTAGTGTCAAAAATTTGCTGTACGACGATAAATTAAAGCATTTCAGAACTAATGTGCTTTACGCACTTATAACAGAGACAATTAAACATGCGACTGATATTGATAAAATATTTGAAAATTGTGGTATTTTGGCGAAAGAGCAGCGGCTAGATCCTTGGCTTGCTAAGATTCTTACCGCAGAGTTGCTTTTTGGCAAAAAGGCTCTGCCTGGGAAAAGCAAACCTGAACTGACAATATTATCATATAAAGAACAGTTTGAGAACTTTAGGAGTGAAAATCCAGATGAAGTGAAATCTAAAGAGACTGATAAAGGTCCCCTCATACCGAAACAAACGAAGTTGGCAGCGCTCGCATTAGCCAATTTATTTTTATATAGCGTGGCAATGTTTACATTACCATTCATAGCATTTTTTGGTGTACGTCATGTTCTGACAGACTATTACCCAGTTGATCAATTCACGAGGAATGTTTGGTCGGTTGTATCAGCCGTTGTTGTTGTCAATGTCATAATCGCTATGTACGTGTACAAAGCTTATCATGAGAAAGAATATGATGAACATGGAAATGAAATCGACCAGCATTCATATGGTCCCCATGAAACAAGCAAAAAACCGCGGTACGTCAGGATAAACACGAACCTTTTAACCACATCGGATGCTATAAGAGCATTCCAAGACGAAGGCTACAAGTTCATAAGATGTACATCAGGGTCCTATGATGATTACTTGAAGCAGATCCAGGGTTTGACGGAGTACGACTTCACTCAGGACTACCATGTGAAAACCATGTTTGTGTTTGCACCGGGAACCAAGTTTCACGACCATGATCTGTACTTGAATAATCAAATTATTTTGCAAGATAAGGCTACAGCCCTAGCCGTACACCTGCTCGCCCCGCCATCTGGCAGCACTGTATTAGATATGTGTGCTGCTCCAGGTATGAAGACCACACAAGTTGCTGCATATCTTCGAAACCAGGGTAAGGTATACGCTGTTGAGAGGAACGATCAGAGATATCAAACACTGTGTCAATTAGTCGAGAGCACTTCATCAAAATGTGTTGAGACCATACATAAGGATGTACTGGAGATTAAGAGAGGTGATTTAGATGATGTGGAATACGTCCTGCTGGATCCCAGCTGTTCAGGATCTGGTATGGATTTTTCTGTCCACAACTACATCGAAGACACGAGGCTGGCCAAACTGACCTCGTTGCAAGAGAAATTTCTGAAACACGCAATGAACGCGTTCCCGAATGCAAAGCGCATAGTCTACAGTACGTGCTCGATATTTCCCGAGGAAAATGAACGGGTTGTGACAAACGTTGTGAAGACTTCAAGGGCTAAGTGGAGGGTGCAGGATGTTAAGGAGCTGTTGAAAAACCAGTGGAACAACTACGGTTCAGGAATGTATGGCAGTATGGGTACCAGGTGTCTATATTCTAGACCGGATACCGATATGACAACTGGATTCTTCCTAGCCGTCTTGGACAGAGACCAAAAAGCCCGTGACGATGAGGGGAAAAATCTTAATATTGACGATAATAAAGTCAAAAGTATGAGTAAAGACATCCCTAATGGCAAAGCAGTTCATGAAGCTGAATATGCATCAACCTTAGATGAGGTCAGTGACGTCATAGTGAAGAAAAAAAAGAAAAAAGAGAGAATACGTTCAGAAAATGAGAGCGATATCAAAAATAATGTTACTGAAATAGAATCTGACTTAACGAAAGGTGACGTCGAAACTAAAATGAAGAAAAAGCACAAAAAGAGCAAATCCAAGGACGACGGTAATGATCAAGAGTTCAAACAAAGTGTTACAGAAGAAGTGATTGCAGAATATCATCAGGATATAACAGAAGTAGACCGTTCAGATAGAACAGACAATATTGAAACCAAGAAGAAGAAAAGAAAGAAGAGTAA
AACTTTAGAGCATGATACGGGCGAAGATGACAACTCGAAACAAAATCACGAACCAGAAGACGACGGCCTAGAAGAACCTTCAAAGAAGAAAAAAAAGAAAAAAAAAAGTGAAGAAGAATCTACAGCAAATGATAGTTCTGTTTCCAATCACTTAGATTTTACAGAAGATAATGTAAAAGAGAAGAAAAAGAAGAAAAAGAAAAATCATTTGGATTAA

Protein sequence:

>DPOGS200138-PA
MFEHSVKVPRHYKVAANIFKKVATEGGSVKNLLYDDKLKHFRTNVLYALITETIKHATDIDKIFENCGILAKEQRLDPWLAKILTAELLFGKKALPGKSKPELTILSYKEQFENFRSENPDEVKSKETDKGPLIPKQTKLAALALANLFLYSVAMFTLPFIAFFGVRHVLTDYYPVDQFTRNVWSVVSAVVVVNVIIAMYVYKAYHEKEYDEHGNEIDQHSYGPHETSKKPRYVRINTNLLTTSDAIRAFQDEGYKFIRCTSGSYDDYLKQIQGLTEYDFTQDYHVKTMFVFAPGTKFHDHDLYLNNQIILQDKATALAVHLLAPPSGSTVLDMCAAPGMKTTQVAAYLRNQGKVYAVERNDQRYQTLCQLVESTSSKCVETIHKDVLEIKRGDLDDVEYVLLDPSCSGSGMDFSVHNYIEDTRLAKLTSLQEKFLKHAMNAFPNAKRIVYSTCSIFPEENERVVTNVVKTSRAKWRVQDVKELLKNQWNNYGSGMYGSMGTRCLYSRPDTDMTTGFFLAVLDRDQKARDDEGKNLNIDDNKVKSMSKDIPNGKAVHEAEYASTLDEVSDVIVKKKKKKERIRSENESDIKNNVTEIESDLTKGDVETKMKKKHKKSKSKDDGNDQEFKQSVTEEVIAEYHQDITEVDRSDRTDNIETKKKKRKKSKTLEHDTGEDDNSKQNHEPEDDGLEEPSKKKKKKKKSEEESTANDSSVSNHLDFTEDNVKEKKKKKKKNHLD-