Monarch geneset OGS2.0

DPOGS200845
Transcript        DPOGS200845-TA   1362 bp
Protein           DPOGS200845-PA   453 aa
Genomic position  DPSCF300071 - 137563-139983
RNAseq coverage   241x (Rank: top 43%)
Annotation
Heliconius      HMEL012343        0.0     67.26%
Bombyx          BGIBMGA009901-TA  0.0     69.09%
Drosophila      CG12863-PA        3e-65   32.30%
EBI UniRef50    UniRef50_D6WX24   3e-99   42.28%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WX24_TRICA
NCBI RefSeq     XP_970334.2       2e-95   41.96%  PREDICTED: similar to zinc finger, CCHC domain containing 4 [Tribolium castaneum]
NCBI nr blastp  gi|270011607      1e-98   42.28%  hypothetical protein TcasGA2_TC005649 [Tribolium castaneum]
NCBI nr blastx  gi|270011607      2e-101  41.70%  hypothetical protein TcasGA2_TC005649 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain  [157-273]  IPR019369  1.1e-10  DNA methylase, N-6 adenine-specific, eukaryotic
Orthology group  MCL15572  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200845-TA
ATGACTAAGAAAAGGAAGTATGTAACTGCAGAGAAAATTGAAAATTCTGGCCCTGTAGAGGTTGTTGTTGAAGATGTAACATCACATCCTTTATGCTTGCACGGTCCGACTCTCTTATTTGCCTCCCCTAAGGGCCGGTATTTTGCATGTTCTGCATGTAGAAATAAGAAGGACTGTACAGTACACATTGAGGAAGAAGAGTGGAAGAAAGAGGGTGTGAAGAAAAGAAATGAAAAATATTATAGCATTATCCCTAAGATAAACAAAGAAGCTGCATGGAAGAACTTGCAAGAGGTTAAACTTCAACATCCATCAGACAGAGCATACTGCAACACCTGCAAGGAACTGTACATAATATCCAAAAGTAGGAAACATATAAAGGATCACAAAGTTATCCTGTCACTGACCAATGAACAGCTAACACATCCCTCGACCGTACTACCACCCCTCGAGAATGATGGCCATGAAGCACAGTACTTATTCTCTAAGAAAGCTATCAGCACTGTGCTGGGGATATTGACTAATAATAAGATAAGTAATATCCTCTGCATCGGCACTCCAACAATCCACGAAGCAGCTCAAGATCATCCCGAGTTTAACTCCCTGCTCCTGGACTATGATACTCGCCACCACCTGTTTCATCCAACTAACAAGTTTTTATGGTACAATATGTTTAACAACTACCTCTTTGATGGTAATAAGGATGAAAAAATACTTAAGAAATTCATGAAACAGTCTAAAAACAAAGGTCTTGCAATTGTAATGGACCCTCCGTTTGGAGGAAGAGTTGAACCTTTGATACAGACTATAAAAGAACTATCTAACTTGTATAATACATTATGTGAAACTGCTGATAAAATTTTGCCAGTCATATGGGCTTTCCCATACTTTGCAGAGCCTTATATTAAAAATATTATGCCCGAGATCAAAATGCATGATTATCAGGTGGAATATGCCAATCACAAAAAGTTTGGAAATAAGAATGGGGGTCGCAAATTTGGATCACCTGTTAGGTTCTTCACTAACCTACCATTTTCTACCATTGACCTCTCCAATGACAGCAGCTATAAACTATGTGACAAGTGTAAATTCTGGGTGTCCACATCTAACAGACACTGCACCAAGTGCAGAGAATGCACCAGTAAGAATGGCATGACATACAAGCACTGTAACATTTGTAAGAGGTGTGTGAAGCCTACATTTGTACACTGTGAGAAATGTGAACGGTGTTGCCAGGAGAAAGGACATGTTTGCGGCACTATTGTTGTATCACAGTCCTGTTATATTTGCAATGAAAAAGGTCACAAGAAATCAGAATGTCCAGAGAGCAAGGGAGAAAATAAGAAGAAAAAACATAAATAA

Protein sequence:

>DPOGS200845-PA
MTKKRKYVTAEKIENSGPVEVVVEDVTSHPLCLHGPTLLFASPKGRYFACSACRNKKDCTVHIEEEEWKKEGVKKRNEKYYSIIPKINKEAAWKNLQEVKLQHPSDRAYCNTCKELYIISKSRKHIKDHKVILSLTNEQLTHPSTVLPPLENDGHEAQYLFSKKAISTVLGILTNNKISNILCIGTPTIHEAAQDHPEFNSLLLDYDTRHHLFHPTNKFLWYNMFNNYLFDGNKDEKILKKFMKQSKNKGLAIVMDPPFGGRVEPLIQTIKELSNLYNTLCETADKILPVIWAFPYFAEPYIKNIMPEIKMHDYQVEYANHKKFGNKNGGRKFGSPVRFFTNLPFSTIDLSNDSSYKLCDKCKFWVSTSNRHCTKCRECTSKNGMTYKHCNICKRCVKPTFVHCEKCERCCQEKGHVCGTIVVSQSCYICNEKGHKKSECPESKGENKKKKHK-