Monarch geneset OGS2.0

DPOGS200115
Transcript  DPOGS200115-TA  4452 bp
Protein  DPOGS200115-PA  1483 aa
Genomic position  DPSCF300044 + 604933-624238
RNAseq coverage  854x (Rank: top 15%)
Annotation
Heliconius  HMEL015554  0.0  76.64%
Bombyx  BGIBMGA004558-TA  0.0  76.70%
Drosophila  lid-PE  0.0  47.52%
EBI UniRef50  UniRef50_F4WHF8  0.0  53.27%  Lysine-specific demethylase 5A n=7 Tax=Myrmicinae RepID=F4WHF8_ACREC
NCBI RefSeq  XP_001603951.1  0.0  52.42%  PREDICTED: similar to retinoblastoma binding protein 2 [Nasonia vitripennis]
NCBI nr blastp  gi|328786362  0.0  48.04%  PREDICTED: lysine-specific demethylase lid isoform 1 [Apis mellifera]
NCBI nr blastx  gi|270014824  0.0  48.12%  hypothetical protein TcasGA2_TC010807 [Tribolium castaneum]
Group
Gene Ontology
GO:0016706  2.3e-52  oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors
GO:0055114  2.3e-52  oxidation-reduction process
GO:0005515  2.5e-42  protein binding
GO:0003677  2.7e-33  DNA binding
GO:0005622  2.7e-33  intracellular
GO:0005634  1.3e-15  nucleus
GO:0008270  1.8e-14  zinc ion binding
KEGG pathway 
InterPro domain
[759-1099]  IPR013637  2.3e-52  Lysine-specific demethylase-like domain
[463-622]  IPR003347  2.5e-42  Transcription factor jumonji/aspartyl beta-hydroxylase
[80-194]  IPR001606  2.7e-33  ARID/BRIGHT DNA-binding domain
[496-605]  IPR013129  1.3e-24  Transcription factor jumonji
[32-73]  IPR003349  2e-20  Transcription factor jumonji, JmjN
[305-379]  IPR011011  3.3e-17  Zinc finger, FYVE/PHD-type
[319-377]  IPR013083  8.2e-16  Zinc finger, RING/FYVE/PHD-type
[695-747]  IPR004198  1.3e-15  Zinc finger, C5HC2-type
[322-368]  IPR001965  1.8e-14  Zinc finger, PHD-type
[322-368]  IPR019787  4.9e-12  Zinc finger, PHD-finger
Orthology group  MCL10247  Multiple-copy universal gene
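
The InterPro rows above can be read programmatically. The following is a minimal sketch, not part of the record: `Domain` and `parse_domain_row` are hypothetical names, the bracketed coordinates are assumed to be 1-based inclusive protein positions, and the third field is assumed to be an E-value. The pattern tolerates missing whitespace between fields, since some exports of this table run the fields together.

```python
import re
from typing import NamedTuple, Optional


class Domain(NamedTuple):
    start: int          # first residue of the match (assumed 1-based, inclusive)
    end: int            # last residue of the match (assumed inclusive)
    interpro_id: str    # e.g. "IPR013637"
    evalue: float       # assumed to be an E-value
    description: str

    @property
    def length(self) -> int:
        # Inclusive coordinates: a match covering 80-194 spans 115 residues.
        return self.end - self.start + 1


# "[start-end]", then an InterPro accession (IPR + 6 digits), then a score in
# scientific notation, then free text. Whitespace between fields is optional.
_ROW = re.compile(
    r"\[(\d+)-(\d+)\]\s*(IPR\d{6})\s*(\d+(?:\.\d+)?e-\d+)\s*(.+)"
)


def parse_domain_row(line: str) -> Optional[Domain]:
    """Parse one InterPro row; return None for lines that are not domain rows."""
    match = _ROW.search(line)
    if match is None:
        return None
    start, end, ipr, evalue, description = match.groups()
    return Domain(int(start), int(end), ipr, float(evalue), description.strip())


if __name__ == "__main__":
    rows = [
        "[759-1099]  IPR013637  2.3e-52  Lysine-specific demethylase-like domain",
        "[80-194]  IPR001606  2.7e-33  ARID/BRIGHT DNA-binding domain",
        "[322-368]  IPR001965  1.8e-14  Zinc finger, PHD-type",
    ]
    domains = [d for d in map(parse_domain_row, rows) if d is not None]
    # List the domains in N-terminal to C-terminal order.
    for d in sorted(domains, key=lambda d: d.start):
        print(d.interpro_id, d.start, d.end, d.length, d.description)
```

Sorting by `start` gives the domain order along the protein, which is convenient when several overlapping zinc-finger signatures hit the same region.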

Nucleotide sequence:

>DPOGS200115-TA
ATGGGAAAATGTGACGAAGCTAATGTCGAAGTTAGTATGGGGGACTCGGCGTCCCCCATGAAACCTGATAATTTTACGTTCACCGTACCCCCTGAGGCCCCTGTCTTCGAGCCAACTCCTGAAGAGTTCCTCGATCCTCTGGCGTATATCAGCAAAATCAGGCCCATTGCTGAGAAGTCAGGGATATGCAAAATAAAACCCCCCGCGCACTGGCAACCTCCGTTTGCTGTGGATGTAGACAGACTGAGATTCACTCCTCGGATACAACGGTTAAATGAGTTAGAGGCAATTACAAGAGTGAAGCTAAACTTCCTGGATCAGATAATCAAGTTCTGGGAGCTGCAAGGCTCGATGCTGAAGATACCAACGGTGGAAAGGAAACCATTAGATTTGTATGCACTACATAAAATTGTCAAAGAAGCTGGTGGTTTTGAAGTATGTTCGGCCGAGAGGAAATGGTCGAAGATCGCTAGACGCATGGGTCATCCACAGGGTAAAGGTATAGGTTCCATATTAAAGAATCACTACGAGAGGATCCTCTACCCGTACGACGTTTTCAAGAGCAGCGGCGCTGTCGGAAACGGAAATGATAGCAAAGAAACGAAGATAGAAGTGAAAGGCCATCAGACACCAGAGAGGGGTCGCAGATCAAGCAAGCCGCCGGCTGCGACTCCGCCACCTGCCCGTCGGTTCAAGAGTTCGGAGCCCGAGCCCGCTGAGGGCAGCCATACTGAGGGGGCGCTGCCCGTGGGCGACGTCAAGAAGACACCCATCAAGGAAGAACCGGGAGAGACAAGGAGGAGTCCACGTGAGAACAAGTGGATGTCTCAGGCCGGGAGTTGTGGAGCGCGAGCTCGCGCCGTCCGCTCCAAACGACTGAGGGAACACCGTTTCAGTAATATGACGGTGTCTGTGACGCCCGCACCTCCCCCCCTACATCCCGACGACCCGCTCGCGAAGTACATGTGTCATATATGCGGGCGCGGTGACATCGAGGAACAGATGCTGTTGTGTGATGGCTGCGACGATTCCTACCACACGTTCTGCCTGGTGCCGCCGCTGGCTGATGTACCCAAGGGAGACTGGAGGTGTCCCGTGTGTCTGGCGGAGGAGGTGTCAAAACCGACCGAGGCCTTCGGTTTTGAGCAAGCGAGCCGCGAATATACACTGCAGCAGTTCGGAGAAATGGCTGATCAGTTCAAATCTGATTACTTCAATATGCCGGTACATATGGTGCCGACGTCGACGGTGGAGCGAGAGTTCTGGCGCGTCGTGTCGTCTATAGATGAGGACGTGACTGTTGAATATGGAGCGGATCTGCACTCCATGGACCATGGATCAGGTTTTCCAACAAAATCCAGTGCCCATCTCTATCCTGGAGAACAGCAATACGCTGAATCCTCATGGAATCTCAACAACCTGCCAGTGCTGGAGGGATCTGTACTGGGACATATAAATGCAGACATATCCGGGATGAAGGTACCTTGGCTGTACGTGGGCATGTGCTTCGCCACGTTCTGCTGGCACAACGAGGATCATTGGAGTTATTCCATCAATTACTTACACTGGGGAGAACCGAAGACATGTGACTGGGACAAGATATTCCAAAGGTGGCAGGTTCAAATCCCACTCACTCCGATGATTTATATACTTTCTATACACAATATTACCATGATATACTATCCCCTTGAATCCTTATTTAAAAGCAAGGTAATTAGGACTGATCAACATGCTGGAGAATTTGTAATAACCTTTCCCCGCGCCTACCACGCCGGTTTCAATCAAGGATATAACTTTGCCGAGGCAGTCAACTTCACACCAGCTGATTGGCTAAAAATGGGTCGCGAGTGCATCACCCACTACTCCACTCTGCGGCGGTATTGCGTGTTCTCCCACGACGAGCTGGTCTGTAAGATGGCGCTTGAGGCGGACTCGCTTAGTCTGACCGTGGCCCTGGCCGCGTACAGAGATATGAGGACAATGTTACATGACGAGAGGAA
GTTAAGGAAGGGATTACTTGACTGGGGTGTAACGGAAGCTGAGCGTGAGGCGTTCGAACTCCTCCCGGACGACGAACGTCAATGTCACGAGTGTAAGACCACGTGTTTCCTGTCCTGCGTCACATGCGCCTGTACCACGCAAATCGCATGTCTGCGACACTACGATCAACTGTGCGGTTGTTCGCCGGCGGAACACAAACTTAGATATCGTTACACACTGGACGAACTCCCGGCGATGCTTGAGAAGCTAAAGCGTAAATCAGAACAGTTCCGTGAGTGGGCCGAGGCCGTGCAGAACGCTTTGGACCCTGATACGCCAAAGACATGTGACCTTGACGGACTCAGGGGTCACTTGAAGAGGGCTCACGATCTGAAGATGCACAAGACGGAACTGGTGCGTGCCTTGGAAACGGCGATAGAGGACGCTGAGAAATGCGCGTCTGTGATACAACAGTTGGATCTGAACAAGATGAGGACGAGGACGAGGCATCACGACCCCAAGTACCGCCTCACCATACACGAGCTGACGCTGTTCGCCGCAGAGATAGACGGACTCGCCTGTGTGCTGCCCGAAGGGTCTGCTGTTAAAGAAGTGCTTCGTCAGACAGCTGAGTTTGAGGACAGAGCGATCGCACTGCTGGGGAGAGACTTGGACGACTGTGATGCTGCCTCCGTCAGGGAACTAGAAGAGGTCGTTGACCTGGGTTCCCGTCTCTGTATAGTCCTGCCCCAGCTAGGAGCGCTCCAGGCTCGTCTCCAGCAAGAGAAGTTCATACTCTCCGTTCGTACACATCGCGAGGACGCCGCCTCTCTCACACCAGAGACCATTGACAAACTGCTGGCTGAAGCAGAAACGGTCTTACCACATAGAAGAGTTGAAACGGAACGGGCTGGGCTGTATAAATTGAAATTACAAGTTGAAGAATGGGAACAGAAAGCCCGGGCTATATTAGACGTGAACCGGGAACGCGATGATGATTCACACACTACGCTGGCTGATTTGGAAGAATTGTTAGCAGCTGCTGATGAAGTGGAAGCAGCGCTGCCCTCTAGACACTCGTTGGCTACAGCCGCTGCACACGCCAAAGACTGGCTTGCCAAGGTGGAGGAAATGCAATCAAAGGAATTATATCCATACATGCACAGTGTTGAAGCTCTGTGCAAACGCGGAGCACAGATACCTGTGGCTCTGCTCGAGAAAAGACATTTGGCGGCAGCCCTACTATCAGCCAGGGACTGGCAGAGAGGAGCTGCTGATATGTTCCTTAAGAAGAACTGGCCTTACTCTTTGTTGGAGGCGTTGTCCCCTCGTTCGGAGTGCGCCCCTCGCCGAAAGAAAGGGGACTCTGAAGGATTATTACAACACTTCAGTGAGGAAGCCACTCCCACTGAGATAGTGGCGGCCTTCAAGCAGGCGGAACAAAGGGAATTGGCTGCTATTAAGGAACTCAGAGCTAAAAATATGCGGAAAGAGGTCCGAACGCCCGCGCCCGGCGTTACTTTCTGCGTGTGTCAGAAGAAACAGTATGGGGTGATGACGCAGTCTGCATGTGTGTTGTCGAGTCGCTCGGAGGAGTCGGAGTCGTCACCGGAGCCCGAGGCGGAGCCTTCGCCCTCGCCGGCCCCTGAACCGAAATTCCTCTGTCCTGATTGCGCACGAACCAAACGACCTAGACTGCATCGGATATTGGCTCTATTGGTTTGGCTACAGAAGCTGCCCGTGCGCTTGGCGGAAGGTGAGGCGTTACAGTGTGTGACTGAGCGGGCGATGGCCTGGCAGGACGCCGCTCGAGCACTACTGGCTTCTCTTCCGGCTACACGCGTGGAGAGGGGGGAGCGCCAGGGGAGGGGAGACGCACACAGGTCGTCAGCGAGTGTGGAACACGCATACAGTGCGACACCTCGCACCAGCACAGCTCGAGTTTCGCCGGTCATGCTACAGAGACTCGAAGATCTCATGATGGAAGGTGACCTGTTAGAGGTTCGTTTGGAGGAACAGC
GTCTCGTATGGGCTGCGGTGTGTGCGGCTCGTGCGGCGGAAGGTCGGCGGGCGGCGCGAGTGTTGGACGCCGGAAGGAGGAAGAGAGCCCAGAGACCAGCTCGACACCATCACCCGCTCAAGAGGACACGCGTCTTACACGGAACAAACACCACAAAGACCACCATGAAGCGCGGCTCGGCCACAACCACCAACTACTTAAATCGGAAGCAGGGCATGTCGTCTGGTTCTGGTCTGATGATGCGGAAGCACTACCTGGCCCGCCAGGAGCGTCGTAAGCGCCCGCTCCCCCAGCGCCCGCCTCCACGGCCCCGCGGCAAGGAGGTGGACTGGGTTCAATGCGACGGCGGCTGCGATCAGTGGTTCCACATGCACTGCGTGGGCCTGAGTCGCGGCGCGCTGCGAGAGGACGACGACTACGTCTGCGGCTCGTGTGCCGAGACACGGAAGTAA

Protein sequence:

>DPOGS200115-PA
MGKCDEANVEVSMGDSASPMKPDNFTFTVPPEAPVFEPTPEEFLDPLAYISKIRPIAEKSGICKIKPPAHWQPPFAVDVDRLRFTPRIQRLNELEAITRVKLNFLDQIIKFWELQGSMLKIPTVERKPLDLYALHKIVKEAGGFEVCSAERKWSKIARRMGHPQGKGIGSILKNHYERILYPYDVFKSSGAVGNGNDSKETKIEVKGHQTPERGRRSSKPPAATPPPARRFKSSEPEPAEGSHTEGALPVGDVKKTPIKEEPGETRRSPRENKWMSQAGSCGARARAVRSKRLREHRFSNMTVSVTPAPPPLHPDDPLAKYMCHICGRGDIEEQMLLCDGCDDSYHTFCLVPPLADVPKGDWRCPVCLAEEVSKPTEAFGFEQASREYTLQQFGEMADQFKSDYFNMPVHMVPTSTVEREFWRVVSSIDEDVTVEYGADLHSMDHGSGFPTKSSAHLYPGEQQYAESSWNLNNLPVLEGSVLGHINADISGMKVPWLYVGMCFATFCWHNEDHWSYSINYLHWGEPKTCDWDKIFQRWQVQIPLTPMIYILSIHNITMIYYPLESLFKSKVIRTDQHAGEFVITFPRAYHAGFNQGYNFAEAVNFTPADWLKMGRECITHYSTLRRYCVFSHDELVCKMALEADSLSLTVALAAYRDMRTMLHDERKLRKGLLDWGVTEAEREAFELLPDDERQCHECKTTCFLSCVTCACTTQIACLRHYDQLCGCSPAEHKLRYRYTLDELPAMLEKLKRKSEQFREWAEAVQNALDPDTPKTCDLDGLRGHLKRAHDLKMHKTELVRALETAIEDAEKCASVIQQLDLNKMRTRTRHHDPKYRLTIHELTLFAAEIDGLACVLPEGSAVKEVLRQTAEFEDRAIALLGRDLDDCDAASVRELEEVVDLGSRLCIVLPQLGALQARLQQEKFILSVRTHREDAASLTPETIDKLLAEAETVLPHRRVETERAGLYKLKLQVEEWEQKARAILDVNRERDDDSHTTLADLEELLAAADEVEAALPSRHSLATAAAHAKDWLAKVEEMQSKELYPYMHSVEALCKRGAQIPVALLEKRHLAAALLSARDWQRGAADMFLKKNWPYSLLEALSPRSECAPRRKKGDSEGLLQHFSEEATPTEIVAAFKQAEQRELAAIKELRAKNMRKEVRTPAPGVTFCVCQKKQYGVMTQSACVLSSRSEESESSPEPEAEPSPSPAPEPKFLCPDCARTKRPRLHRILALLVWLQKLPVRLAEGEALQCVTERAMAWQDAARALLASLPATRVERGERQGRGDAHRSSASVEHAYSATPRTSTARVSPVMLQRLEDLMMEGDLLEVRLEEQRLVWAAVCAARAAEGRRAARVLDAGRRKRAQRPARHHHPLKRTRVLHGTNTTKTTMKRGSATTTNYLNRKQGMSSGSGLMMRKHYLARQERRKRPLPQRPPPRPRGKEVDWVQCDGGCDQWFHMHCVGLSRGALREDDDYVCGSCAETRK-
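
The lengths given in the record header can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, the transcript is assumed to be coding sequence only (it starts with ATG and ends with the stop codon TAA), and the genomic coordinates are assumed to be 1-based and inclusive.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Amino acids encoded by a CDS of cds_bp bases that ends in one stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


if __name__ == "__main__":
    # Values taken from the record header for DPOGS200115.
    print(expected_protein_length(4452))   # 1483, matching the listed protein length
    print(genomic_span(604933, 624238))    # 19306 bp on scaffold DPSCF300044
```

Under these assumptions the 4452 bp transcript accounts for 1484 codons, that is, the 1483 listed residues plus the stop codon, and the locus spans 19306 bp of the scaffold.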