Monarch geneset OGS2.0

DPOGS214532
TranscriptDPOGS214532-TA1449 bp
ProteinDPOGS214532-PA482 aa
Genomic positionDPSCF300287 + 82045-83844
RNAseq coverage306x (Rank: top 37%)
Annotation
HeliconiusHMEL0100618e-13281.82% 
BombyxBGIBMGA014563-TA1e-17872.12% 
DrosophilaCG2982-PB2e-5539.62% 
EBI UniRef50UniRef50_B0WMG31e-6448.03%Lysine-specific demethylase NO66 n=2 Tax=Culicinae RepID=NO66_CULQU
NCBI RefSeqXP_395039.32e-6750.19%PREDICTED: similar to RIKEN cDNA 2410016O06, partial [Apis mellifera]
NCBI nr blastpgi|3407235549e-7050.97%PREDICTED: lysine-specific demethylase NO66-like [Bombus terrestris]
NCBI nr blastxgi|3407235548e-6950.97%PREDICTED: lysine-specific demethylase NO66-like [Bombus terrestris]
Group
KEGG pathway 
InterPro domain[130-479] IPR0131096.8e-79Cupin 4
[215-354] IPR0227775.8e-22Cupin, JmjC-type
Orthology groupMCL11253 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214532-TA
ATGGATCCCTCGGTTAAAAGACCGGTCTCGAAAAATGTCTCTGCTAAGAACATGAAAGAAAATGCCATGAAAGAAATAAGTTTGAAGCTCAAACGAAAGCAGAAAAAACTCAAGCAATCCAAGAACATTGTGAAGAAAAATTCTCTTAAAAAAAAGAAATCAAAACAAGTGATCAAAAATGAAATGAATGGTCAGGCAGCCAATATGGAATCCTCCAGTTCAAATAATAACCAAATAACAGTAGAACAGACCACGAAAATACCCAAACCACCTATCATTGAAGAAGTTCCAGAACTTGTGCCAGCACCGAAACTCGAAAAGCAGGAGGTTGATAGTGCATGTTCATGTTCAAGTAGTGATGGCTTTGAGTTTACGCCAGTCACAACTCACAGTAAGGAAGAAGGCCTTAAGGTTTTCACATGGATGCTCACACCATTTGATCCTAATGAATTTCTTAAAGAAATATGGGAGAAGAAACCTTTGCATATTGCTCGGAAGAAACCCGACTACTACAAAGATGTAATCTCAACTCCAGTTATTGATAACATGCTGAGGACAGAAAATATTCAATTTACAAAGAATATTGATATTACATCATATGTAGACGGCAAACGAGAAACTCACAACCCTGAAGGTCGGGCTAATCCACACCTCAGTGAAATTGGGGAACCAGTTTTGGAAGTCGTTTTAGAAGCTGGTGACATGTTATATTTCCCGAGAGGCTACATCCATCAAGGGGTGACAATTGATGGTGAACACTCACTTCATGTTACAATCAGCATGTATCAAAAGCATGCCTGGGCAGATCTGCTTGAAAAAATGATTCCAGCGGCTCTACAAATTGCCATAAATGAAAATATAGAATTCAGGGAAGGTTTGCCTCTCGATATTTACGATCATTTTGGTTTGGTCCACTCCGATACTAACACTCCGCGAAAAGCAGAAATGGAAGAGATTGTAAAGAGACTATTTAACAAGATCAAAGATTACTTACCAATAGATGAAGCTGTCGATCAGATGAATAAAAAATTCCAACAAGATGCATTACCTCCCGTCTTAAGCGATTTCGAAAAAGCTGTCACTGTATTTGGTGACTCTGATGTTATGATAGAAAACGGTAAAGTTACCAACAGAGTTGAAATCGGTCTCGACACAAGGATAAGACTACTCCGTAAGAATATTCTAAGAATTGTTTCCGAGGAACGTATCAAGTTGTATTATTATGCTGAAAACGCTTTAGAATATCATGGAGCTGAGCTGCCGTTCCTGGAAATCGAAGAAGATTTGGCCCCAGCTATCGAAACCCTCATAACCACATACCCTGAATATGTTTCTGTTGAGAACCTTGATATTCCAAGTGATTCTGATAAGATTCAAATATCAGATGCTTTATGGAGTCGGGGTCTCATCATGACCGAATATCCTCTGGAAACTATTGACGATGAATAA

Protein sequence:

>DPOGS214532-PA
MDPSVKRPVSKNVSAKNMKENAMKEISLKLKRKQKKLKQSKNIVKKNSLKKKKSKQVIKNEMNGQAANMESSSSNNNQITVEQTTKIPKPPIIEEVPELVPAPKLEKQEVDSACSCSSSDGFEFTPVTTHSKEEGLKVFTWMLTPFDPNEFLKEIWEKKPLHIARKKPDYYKDVISTPVIDNMLRTENIQFTKNIDITSYVDGKRETHNPEGRANPHLSEIGEPVLEVVLEAGDMLYFPRGYIHQGVTIDGEHSLHVTISMYQKHAWADLLEKMIPAALQIAINENIEFREGLPLDIYDHFGLVHSDTNTPRKAEMEEIVKRLFNKIKDYLPIDEAVDQMNKKFQQDALPPVLSDFEKAVTVFGDSDVMIENGKVTNRVEIGLDTRIRLLRKNILRIVSEERIKLYYYAENALEYHGAELPFLEIEEDLAPAIETLITTYPEYVSVENLDIPSDSDKIQISDALWSRGLIMTEYPLETIDDE-