Monarch geneset OGS2.0

DPOGS213490
Transcript: DPOGS213490-TA (1650 bp)
Protein: DPOGS213490-PA (549 aa)
Genomic position: DPSCF300100 + 282256-289519
RNAseq coverage: 183x (Rank: top 49%)
Annotation
Heliconius: HMEL016843; 0.0; 77.31%
Bombyx: BGIBMGA004373-TA; 0.0; 69.58%
Drosophila: Mta70-PA; 1e-129; 82.28%
EBI UniRef50: UniRef50_D6WRZ1; 1e-165; 56.46%; Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WRZ1_TRICA
NCBI RefSeq: XP_002429182.1; 7e-172; 56.50%; conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242017410; 1e-170; 56.50%; conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242017410; 3e-164; 56.48%; conserved hypothetical protein [Pediculus humanus corporis]
Group:
Gene Ontology: GO:0008168; 1.4e-58; methyltransferase activity
Gene Ontology: GO:0006139; 1.4e-58; nucleobase, nucleoside, nucleotide and nucleic acid metabolic process
KEGG pathway:
InterPro domain: [353-514] IPR007757; 1.4e-58; MT-A70
Orthology group: MCL11764; Single-copy universal gene

Nucleotide sequence:

>DPOGS213490-TA
ATGTCTGACGCTTGGGAAGAAATTCAGGCTGTCAAAAGCAAGAGAAACAGTTTGAGAGAGAAGTTAGAGAAAAGGAAGAAGGAGAGACAAAACATTTTAGGAACGAATTTAGCTAGTGGTGATAAAACTGAGAGCGTACCGGCGGCTGCGTCTAGTGGTAAAGAACGCAGCACATCTAGCCCGGCGCCCAGTAAAAGTGACACTCCGTCTATAGACAAATCGAAATTGATACCCTCAGCATCTCTGGAGGTTCGTCTTCTCCAAGTTTTATCGGACATGCAGCTCCAACTCCCGGCCACGGCCAGCGCTTTAATGCCAGGCCTCGGTGATGTCGATACTGGTATTGTGGCCAGCCTGCTACAGAAGTTCGCAACACAGAAACTTATAACTATTAAAGAGAGGACAGAGAACACAACGGATGCTAGTATTGAGGTTGTGAATGCTGAATCTGTTAGACTGGCTGCCGTGTTGGCAGCATTGATGGAAGAACAATCGTCCACAGTCAAAAGGAAAGGTGATAGTGCCTCTGACGAAGGAGCCCATCCGAAAGTCGCTAAATCTGCCACAGATGATAAAAAAACTGGGAAAAGTGATGCTGATATCATGTCTTTATTAGCAATGCCGTCAAGTAGAGAAAAAGCTGTGAAAAGAGTGGGCGAAGAAATTATGGATTTGCTGAGTAAACCGACGGCTAAGGAGAAATCACTCGCGGACAAAATAAAAATAGGAATATTGTGTAATTCACGATTGTATTTGACATGCCTTTGTCCTGATAGCAAACTTAAGGATAGAAAAAAAAATGTACTGAAAATGTTACTCGGCACACATCCACCTCTTAGTTTGTCTGTTTCTATATTTAAACAAATATACGTTCACTACGAGGTGGATAACACAGACCCAAACATAACAGCCCCCAAGACGACCCCAGAACCAGCGGCGAAACCGGGGAATAATGGAACACCCGCAGCGCCCAAGGCTGACGGAGTGCTGACGCTGACCCCCCCACAGTGGATACAGTGCGACCTGAGATATCTAGACATGACGTTCCTTGGTAAGTTTGCGGTGATAATGGCTGATCCCCCATGGGATATCCACATGGAACTGCCGTACGGCACCATGTCTGATGATGAGATGAGGTGTCTCGGAGTGCCGCAGCTGCAGGACAGTGGGCTCATCTTCCTCTGGGTGACCGGAAGAGCCATGGAGTTGGGCAGAGAGTGTCTGAAGCTGTGGGGATACGAGCGTGTGGATGAACTCATCTGGGTGAAGACGAACCAGCTGCAGCGGATCATACGAACCGGGCGCACCGGGCATTGGCTGAACCACGGGAAGGAACACTGTTTGGTGGGCATGAAAGGCAACCCCGAGAATTTGAATCGTGGTCTAGACTGTGACGTCATCGTAGCCGAGGTCCGCGCCACAAGCCACAAACCAGACGAGATATATGGCATAATAGAGAGACTCAGTCCTGGGACGAGAAAGATAGAACTTTTTGGCCGTCCGCATAACGTGCAACCAAACTGGATAACACTAGGCAACCAAGTGGATGGTGTGCGTCTCGTCGACCCTGATCTCATAGCGGCTTTCAAGAAACGCTACCCTGATGGCAATTGTATGGCGCCCCCTCCCCCGGACCCCGGCCTAGCTTAA

Protein sequence:

>DPOGS213490-PA
MSDAWEEIQAVKSKRNSLREKLEKRKKERQNILGTNLASGDKTESVPAAASSGKERSTSSPAPSKSDTPSIDKSKLIPSASLEVRLLQVLSDMQLQLPATASALMPGLGDVDTGIVASLLQKFATQKLITIKERTENTTDASIEVVNAESVRLAAVLAALMEEQSSTVKRKGDSASDEGAHPKVAKSATDDKKTGKSDADIMSLLAMPSSREKAVKRVGEEIMDLLSKPTAKEKSLADKIKIGILCNSRLYLTCLCPDSKLKDRKKNVLKMLLGTHPPLSLSVSIFKQIYVHYEVDNTDPNITAPKTTPEPAAKPGNNGTPAAPKADGVLTLTPPQWIQCDLRYLDMTFLGKFAVIMADPPWDIHMELPYGTMSDDEMRCLGVPQLQDSGLIFLWVTGRAMELGRECLKLWGYERVDELIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGMKGNPENLNRGLDCDVIVAEVRATSHKPDEIYGIIERLSPGTRKIELFGRPHNVQPNWITLGNQVDGVRLVDPDLIAAFKKRYPDGNCMAPPPPDPGLA-
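The two sequences above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the transcript is a complete coding sequence read in frame from the first base, that the standard genetic code applies, and that the trailing "-" in the protein sequence marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: the
# record uses the standard code; "*" marks a stop codon in this table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein's codons plus one stop."""
    return transcript_bp == 3 * (protein_aa + 1)


# First ten codons and last four codons of DPOGS213490-TA, copied from the record.
print(translate("ATGTCTGACGCTTGGGAAGAAATTCAGGCT"))
print(translate("GGCCTAGCTTAA"))
print(lengths_consistent(1650, 549))
```

Under these assumptions, the first ten codons translate to the start of DPOGS213490-PA (MSDAWEEIQA), the last four codons give GLA followed by the stop symbol, and the reported lengths agree: 549 codons plus one stop codon is 550 codons, or 1650 bp.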