Monarch geneset OGS2.0

DPOGS212221
Transcript: DPOGS212221-TA  2139 bp
Protein: DPOGS212221-PA  712 aa
Genomic position: DPSCF300263 - 224145-230735
RNAseq coverage: 318x (Rank: top 36%)
Annotation
Heliconius: HMEL016799  2e-110  60.43%
Bombyx: BGIBMGA004408-TA  0.0  78.40%
Drosophila: CG6133-PA  0.0  51.68%
EBI UniRef50: UniRef50_Q5TTT3  0.0  54.03%  AGAP002504-PA n=9 Tax=cellular organisms RepID=Q5TTT3_ANOGA
NCBI RefSeq: XP_395050.2  0.0  56.01%  PREDICTED: similar to CG6133-PA [Apis mellifera]
NCBI nr blastp: gi|332021238  0.0  55.33%  tRNA (cytosine-5-)-methyltransferase [Acromyrmex echinatior]
NCBI nr blastx: gi|332021238  0.0  55.33%  tRNA (cytosine-5-)-methyltransferase [Acromyrmex echinatior]
Group
Gene Ontology: GO:0016428  1.7e-13  tRNA (cytosine-5-)-methyltransferase activity
KEGG pathway: olu:OSTLU_27152  3e-108
    K00599 (E2.1.1.-) maps-> Naphthalene and anthracene degradation
        Tyrosine metabolism
        Histidine metabolism
        Selenoamino acid metabolism
InterPro domain: [154-363]  IPR001678  1e-32  Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
    [155-169]  IPR023267  1.2e-22  RNA (C5-cytosine) methyltransferase
    [130-151]  IPR023270  1.7e-13  RNA (C5-cytosine) methyltransferase, NCL1
Orthology group: MCL13472  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212221-TA
ATGGGTAGACGAAACAGAAATGTTAATAAATTCGCACAACGCAAAAGAGAACGGAAAGAACAGGAAAAAAATCCACAACAGAAACCGGCCGACACGCGAAAGCATTATGAAGATATTGTGCGTGAAAATGCAATCTTCGAAGAGTATTATAAGGCGCAGAAAGTTTGTCCTGATGATCAATGGGACGACTTCATGAGAGCTATCAAGAAAGATCTGCCAACAGCATTCAGAATAACGGGCTCCAAATGTGAAACAGATGCGCTACTCAATATAGTTAAGAGCCAATATTTTTCAGAGATATTAAATCATAAACTTAAATTGGATGATGAGAAAGAAGAAGAGGAAATCAAACCTGTTAACTTGCCGTGGTATCCCGGAGGTCTAGTGTGGCAACTGCCAGTATCTCGCACACATATCCGTCGGAATGAACCACTCTATCGTCTCCATAACTTTTTAGTGGCTGAAACAGAAGCGGGCGGTGTTTCACGTCAAGAGGCTGTATCTATGATACCGCCTGTTGTGTTACAAGTTGAACCTCATCATAAGGTATTAGACATGTGTGCTGCACCGGGCTCCAAGACTGCACAACTCATTGAGTTCTTACATTCTGATGAAGACAAAATGCCTACAGGTTTCGTAATGGCCAATGATGTTGACAACAGTCGCTGTTACATGTTGGTGCACCAGGCGAAGAGGTTGAACTCACCCTGCATCATCATCACCAACCATGACTCCGCAGTGTTGCCATCACTGGTTGTGAGTGATGAGGAGAACCCGAGTGCGACGAAGCCGCTTAAGTTCGACCGCGTTCTGTGTGATGTTCCATGTTCCGGAGACGCCACCTTGAGGAAGAACCCTGATATATGGACGAAATGGTCGACCGGAAACGGAAATAACTTACACGGTATTCAGTATAGAATCCTCCGTCGTGGCGTTGAGTTGTTGTCTGTGGGCGGAAGATTGGTCTATTCCACCTGTTCCTTCAACCCTGTGGAGAACGAGGCCGTGGTGCACAGAATCCTTCAGGAGACCGGCGCCAGTGTGACCCTCGTGGATGTACAGGATCTACTGCCCGGACTAAAGTTCCATAAAGGCATGACACATTGGCGGCCGGCGTCTAAAGACATGGTGTTCTACAACAGTTATGATGAGGTTCCAGAGAAATGGCAGACGGTGGTGAGGCCGCAAATGTTCCCTCCCAAGACTGAAGACTTGGACAAATATAATCTGGATAGATGCATAAGAATTCTGCCTCATCACCAAGATACTGGAGGGTTTTTCGTGGCAGTGTTTGAAAAAACCGCCCTCCTGCCATGGGAGAAGGACCCAACCAAGAAACCGGATGTGGCAGCCGATGAACCGGCAGAGGAACCGGAAAAGAAGGAACCACCAAAGAAGAGAAGAAGAATGGGAGGATATAGGGAGGATCCTTTTGTATTTTTCTCCGGTGAAAATGAAGATGTGTTCCCTTCTATCAAGGAATTCTACGATCTTGATACAAAATTCGACCCTACCTGTCTCTTGACGAGATGTCATGTTGGGAAAAAGAAGAATATTTACCTGGTGTCAGCCATGGTGAAAGAAGTTGTACAGAAAAATGAGAATAGTATTAAGATTATAAACACAGGCGTCAAAACATTTGTTAGGTGTGATAATAAAAATATGAAGTGCCCATTCAGACTATCTCAAGAAGGTCTTCAGAGTATAGCCCAGTACATCGGTCCAAAACGACGCGTGACCATTCTTAAGGAGGATCTCATACTAATATTACAATGTGACAACCCTAGCAAACCCCCAGAACTAAAACTGTTCACAGAACACACTCAGAATATGGTGAAAGATTTCGCTACTGGTAGCTGCGTGTTGGAGTATAAGGACACGTCATCAGGGTTGTCACTCCGCCTGGTCGGTTGGCGAGGTGTTCACTCTCTACGCGCGTACACCGCCGCCCCTGACACCGTGCACTACCTAAGACTGTTGGGAGCTGACTACAGCAAATA
TGACGTAAATAAGTTCAAGAAGGCTGCAGAAGTCCCCAAGGATGATAGTATTGAGGTTAGCGGAACAGCCTCTACCAGTGAAAATGATCCTAACAAGACCAACGCTATGGAGACTGAAGAGGGAATGAATGTAACATGA

Protein sequence:

>DPOGS212221-PA
MGRRNRNVNKFAQRKRERKEQEKNPQQKPADTRKHYEDIVRENAIFEEYYKAQKVCPDDQWDDFMRAIKKDLPTAFRITGSKCETDALLNIVKSQYFSEILNHKLKLDDEKEEEEIKPVNLPWYPGGLVWQLPVSRTHIRRNEPLYRLHNFLVAETEAGGVSRQEAVSMIPPVVLQVEPHHKVLDMCAAPGSKTAQLIEFLHSDEDKMPTGFVMANDVDNSRCYMLVHQAKRLNSPCIIITNHDSAVLPSLVVSDEENPSATKPLKFDRVLCDVPCSGDATLRKNPDIWTKWSTGNGNNLHGIQYRILRRGVELLSVGGRLVYSTCSFNPVENEAVVHRILQETGASVTLVDVQDLLPGLKFHKGMTHWRPASKDMVFYNSYDEVPEKWQTVVRPQMFPPKTEDLDKYNLDRCIRILPHHQDTGGFFVAVFEKTALLPWEKDPTKKPDVAADEPAEEPEKKEPPKKRRRMGGYREDPFVFFSGENEDVFPSIKEFYDLDTKFDPTCLLTRCHVGKKKNIYLVSAMVKEVVQKNENSIKIINTGVKTFVRCDNKNMKCPFRLSQEGLQSIAQYIGPKRRVTILKEDLILILQCDNPSKPPELKLFTEHTQNMVKDFATGSCVLEYKDTSSGLSLRLVGWRGVHSLRAYTAAPDTVHYLRLLGADYSKYDVNKFKKAAEVPKDDSIEVSGTASTSENDPNKTNAMETEEGMNVT-
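
Length check: the record lists a 2139 bp transcript and a 712 aa protein. The nucleotide sequence begins with ATG and ends with TGA, and the protein ends with "-", which suggests that the transcript is coding sequence only and includes the stop codon. Under that assumption, the two lengths can be cross-checked with a minimal Python sketch. The function name `cds_length_from_protein` is hypothetical and is not part of any database tool.

```python
def cds_length_from_protein(n_residues: int, has_stop_codon: bool = True) -> int:
    """Return the coding-sequence length (bp) implied by a protein length.

    Assumes three nucleotides per residue, plus one extra codon when the
    coding sequence includes its stop codon.
    """
    codons = n_residues + (1 if has_stop_codon else 0)
    return 3 * codons


# Values copied from the record above.
TRANSCRIPT_BP = 2139
PROTEIN_AA = 712

implied_bp = cds_length_from_protein(PROTEIN_AA)
is_consistent = implied_bp == TRANSCRIPT_BP
print(implied_bp, is_consistent)
```

With the stop codon counted, 712 residues imply 3 x 713 = 2139 bp, which matches the listed transcript length.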