Monarch geneset OGS2.0

DPOGS203332
TranscriptDPOGS203332-TA4617 bp
ProteinDPOGS203332-PA1538 aa
Genomic positionDPSCF300003 - 309561-318936
RNAseq coverage361x (Rank: top 33%)
Annotation
HeliconiusHMEL0029463e-6040.00% 
BombyxBGIBMGA002098-TA4e-9247.66% 
DrosophilaJarid2-PD2e-13138.34% 
EBI UniRef50UniRef50_E3XFT21e-13337.76%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XFT2_ANODA
NCBI RefSeqXP_001845762.12e-13437.86%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700358144e-13337.86%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700358143e-12838.83%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00055151.9e-22protein binding
GO:00036771e-19DNA binding
GO:00056221e-19intracellular
KEGG pathway 
InterPro domain[1260-1375] IPR0131297.3e-23Transcription factor jumonji
[1227-1392] IPR0033471.9e-22Transcription factor jumonji/aspartyl beta-hydroxylase
[981-1101] IPR0016061e-19ARID/BRIGHT DNA-binding domain
[934-975] IPR0033497.5e-13Transcription factor jumonji, JmjN
Orthology groupMCL23644 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203332-TA
ATGAAGACAAAAGTCCAAGCTCAACGCAAGTTCGCTCAAGGTGCCTATGTGCCGCCTGCACTCAACGTCTCTGGCGGAGGAATAGATAGAAGGACTGAGATATCACCTATAAAAGAAAAAAGTGTCTTCCATAATAAAACATCAGCGAGGTCGCTGTTGGATATCGTAGCGTTTAAAAATTTACACAAACACGCCCAACCTATAGTATTGTTGGAACGGCTAGAAAACAGTCATCCTTTGTGCTCAAAGGATCGATCAAATCCAATAGCCTTGGCCAAACCATCAGGCGTTACGATAAACAAAACAAGCCATGAACCGATCCCACCAGTATCCAACGCAACACATAGCTCGCCCATGTCACGTGACAGAACCAGGAACGCTAGGATCATTACGCCCAGGCTACCCATGCACTTGAGATCGCGGGTAGTCACATACGGCGAGAGAAAGAAAGTCTTGGTCTCCAAGAAGAAGAACTTTGAGAATCGCGAGAAGAATATTGTTAAAATACGGAATTCACGTGAATTGAGAAAGAAGAAACGACCCGACAATTATTCTGACGACTTCTCCATTGACGACGACAAGCCTCTAATGTTCTTCAGACAAAAGTCGTTGAGCTTAAAGAAAGATGCATCTATGAAGCAAAGAAAGATCAATAATAAACGCCCAAAGTCAGTTAAAAACTCCGAGCCAAAACCTAGAAATAAGACTAAAATTGTACGACCTGGCCTAAATCTGCAAGGTAAAGATGCAGTGAAACGTAGTTCGTCGTGTTTAGATCAAAGCGCAGTACAAGGACCAGCGTCGTGCAGAACGAATATAAACCCCATACCAGGGCTACCGTCCTCCAATAAAAATATTAGCCCTATCCTGGAAGCAGCGCCTTCCAATAAAGACGTAATCCCTGTCCCAGAAACTTCGCCGACCACTACAAAAACTAACCCCGTTTCTGGTCTGCCAACACGCAACACCAATACAAATACTACACCACTGTCATCACAAGCTAAAGAGGAGGGACAAAATCTTGAACCATCATTTCCAAATAAAAACGTGGACTTTGTTCCTGGAACGGTAACACCAAATGTAAATGCGAATCTTATTAAAGTGCTTTCGACGTCCAATTTGATAGTGGACGCGAATGTTAGACCAGCATTCGCTGGTATGAACATGGACACGAGCGGCGGTCAAGTACCAGACAAAATAGACGTGAACCCTTACTCCTGGCCCGCCTCCCATATGAATATGGAAACTATTCCTGTGCAAACATTCCCTAATGGAAATATGGGACCCATTCCGGTAACGTCCCCACCCGGTATCAATGTGAACCCCATTACGGGACCACAGTCACAGATATACGTGGTGAATGTACCGGGTTCATCGTCAAATATATATGTGGTCCCCGTCCCGGGTGCATCAACCTTTGCATATGCGACCCCCGTCCCAGCACCGACATCGACGAATTTATACGTGGACAGTATCCCCGGGCCATCGTCACTGAACATATACAAAGATCCTGTACCAGGACCGTCATGCACGGGCGTCTATGCTGTCGCTACTTCAGCTCCGTCCACAAATATTTGTTCAGACAATGACCGAGGCCCTTTGTCATCAAATATTTATACGGGCAATGTCTCTGAGCCGTCAGCATCAATGCCATGCGCAACGTCTGGTTCATTGCCATCATGCCTAAATACTCACCCAGCCCTAGCATCAGGTTGTTCATCTCCGAACGCATCCACAGCCTCCACTGCCAGTTCACCATCCTCAAATCAAGTGCCGACAGCTTCTTCGTCAGAAATTTATTCGCATCCCGTCCCAGGAACTTCAAGTTCCAGTGTATGTTCAGACCCCATCCCGAATACATCCTCAAACGGACGTAACAAAAGAACCATAGCAAAGCAGAGACGGAAGTCGTCTACGAAGTCAGCATCATCATTAACAAGGAAACGACCAATGGCGCTGAGAGCACCGTTTTATGGAATAAAGATGATCGAGACCTGGTTGTCATCAGATTTTAGGGTGGTCACTGCACCACCGCCACCGCCACCGCCTAAGAAGAAGTCCAACGACCCATCAGAAGCCAGAGGGGTAACTATGTCCCCAGAACCTGAAATTGACGAGACTGAGCAATGTATCGGGGCAAAATTTTCAACTATAAAGGACATCATAAACTACGGCCCGAGCTATTTCGATTGCCTGTGTCTGCAATGTGACCATTGCAATTATATTTATAAAATGTTGAACAAGTATAAACAGGATTCCAGTGCAGCCAGACATGTAGAGGTCGGGATGAGGAAGCGGAAACACGAGGAATCCTTGACCGTGGAGCGTCAAGAAGCGTCGGATTCCATGGTCATCGAGGTTTCAGACAGTGAGGATATAAATACGATCGACCTTACCGGTGACACCGATGACCTCATGCAAGATAGTGAAGCTTCAAACTGCTCTCCAAAGCCAGCTAACAGTAGCAACCCGCAGCCAGAGAATGGCAGCCAGCAGAATGAAGCGCAACGCGATTCACCAGGTGAAAGGTTGCAGAATCAATTAGCAAGGAATCTAGCGCAGCGGATACAAACACAACAGAGCCAAAGACGAATTCAACAGAGGCAGAGGCAGCGTGTCGGCCGACAGGCGGGTTCCTCGTTACGTGGCCGACAGAGAATTGTACCGAGGGCGGTCCAGCGGCCATTGAGGCGGGTAGTTGTTAATAAAGTTGTGAACAGGAGATCGCTAACGAGGTACTCATACCAGACGAGCAAGCACTCTGATAACGTGTCCAGGATACTGCGAGACAAGATCATCACCGCTCCCGTGTTCTATCCGACTCTAGAGGAATTTTCGGATCCCATGGCCTATCTCGAGAAGATTATGAAATTTACGAAGAAATACGGTATCTTCAAACTAGTGGCGCCAGACGAATATGAACCCACTTGTACTATATCCGAGAACTTTAAATTCAGTACATCCCATCAATACATAGCTCGGTTCTTCAATCGCTGGGGTCCCGCTGCTAGGGAACTATGTACCATGAGGGCTTATTTAGCTACACAGAACGTTCATTTCAAACGAGGGCCTTTGCTAGGTGGTCTAGAGGTCGATCTGCCCAAAGTGTTCCATATCGTCCAAAGACTGGGCGGCTTAAAGACTGTCATGGACAAAAAGAAATGGCATCGCATAGCCGAAGAATTGAACCTTAGAAATTTACGTAACCCGGAGAAGAAGTTTGATAATTTATTCCTAAAGTACTTGTTGCCATACAACAGTCTGACTAAAAGAGAACGGCAGGACATGATGATGAAGGTGGAACAGAGGTGGATCAAGAAGAACAATAGACTGATGAAGCGTGTCGTGAACCCGCTGTACCGGCAGAAACGTATGCTGGGAGAGATCGAGTCGTCCGACGAGGAAGCCGAGGACGAGGACTATCTCACAGAGGCGCTCAATCTCGCCGAAGACTGTAGTCAACTAGGACGGAAGATGGGTCTCGAGGCCTTCAAGAAGATTGCCGGAACCGCATTCAATATGTACTTCCCGGATGAGAAGGCTCAGCCCACTGTAGCTGAGATCGAGGAGAAGTATTGGAACATTGTGCTGCTGGGCACCCAGCATGTGTCGGTTAACACGGCATTCATAGAGAGCGGCGTAGAAGGAAACATATCACCCAAAAGCAAGACGAGTGATGCTATCACCACCAGCCCCTGGTATTTGAAGAATCTGAGTACCGACAAGAGTAATGTCCTACGCTTGCTCGGCTCGTTGGCTGGTATGACAGTGCCGTCGCTCCACGTGGGGATGGTGTTTTCAACCAGTTGCTGGCACCGCGACCCCCATGGGTTACCATGGATGGATTACTTGCACCAAGGAACAGAGAAGATATGGTACGGAGTACCCAGCCACGAAGGGCAGAACTTCCGATGCGCTCTGGAAACTCTCTGCCCTACTCTGTGCCAGAACAAAACATTGTGGCTACCCTCCGAGATCGCAATGACACCGCTCAACCTCCTGTTAGACCGCAACATCAAACTGACACGCTGCGTACAGCGGCCGGGCGAATTCGTGTTCGTCAACCCCCAGGCGTATTCTAGTTCGGTGTCAACTGACTTCACTGTTTCGGAGAGCGTTTACTTCGCTACCGAGTCCTATTTCGAAAACGTCAATCAGGCCTTCCAGGAGTTGAAGGAGAGCTGCGAGCCGTCGTCGTTTTCGCTGGAACAGCTGCTGATATCCGCGGCCAAGGATCCGCATCTATCACCGAACGTTCTAGAACACGTCAACAAACATCTCAACGACATAGTGTCCGAAGAGCTCTCCTTCAGGAGGGCGCTCACCGACCTCAAAGTACCGTTGCTGCTGAATAAAAACCGTCCAACAATATGGAGTGCTCGTGATGACGATGAATGCCAAGTCTGCAGGACGGCGTTGTATTTGTCGCGTGTAACGGGACTTTTCAAGAACGCCTCCGTGTGTCTGCAACACGCTCTGAGACTGATCAACTTGAAGAAAGGGGCCGAGCTGAAGGCGTTGATAGCGACCCTCGCGATGGAAGTGACGATAAGCAATCAAGAGCTACACGACATCGTCGTTAAACTGCAGAGGAGACTGTCGCAGAGAGCGAAGTGA

Protein sequence:

>DPOGS203332-PA
MKTKVQAQRKFAQGAYVPPALNVSGGGIDRRTEISPIKEKSVFHNKTSARSLLDIVAFKNLHKHAQPIVLLERLENSHPLCSKDRSNPIALAKPSGVTINKTSHEPIPPVSNATHSSPMSRDRTRNARIITPRLPMHLRSRVVTYGERKKVLVSKKKNFENREKNIVKIRNSRELRKKKRPDNYSDDFSIDDDKPLMFFRQKSLSLKKDASMKQRKINNKRPKSVKNSEPKPRNKTKIVRPGLNLQGKDAVKRSSSCLDQSAVQGPASCRTNINPIPGLPSSNKNISPILEAAPSNKDVIPVPETSPTTTKTNPVSGLPTRNTNTNTTPLSSQAKEEGQNLEPSFPNKNVDFVPGTVTPNVNANLIKVLSTSNLIVDANVRPAFAGMNMDTSGGQVPDKIDVNPYSWPASHMNMETIPVQTFPNGNMGPIPVTSPPGINVNPITGPQSQIYVVNVPGSSSNIYVVPVPGASTFAYATPVPAPTSTNLYVDSIPGPSSLNIYKDPVPGPSCTGVYAVATSAPSTNICSDNDRGPLSSNIYTGNVSEPSASMPCATSGSLPSCLNTHPALASGCSSPNASTASTASSPSSNQVPTASSSEIYSHPVPGTSSSSVCSDPIPNTSSNGRNKRTIAKQRRKSSTKSASSLTRKRPMALRAPFYGIKMIETWLSSDFRVVTAPPPPPPPKKKSNDPSEARGVTMSPEPEIDETEQCIGAKFSTIKDIINYGPSYFDCLCLQCDHCNYIYKMLNKYKQDSSAARHVEVGMRKRKHEESLTVERQEASDSMVIEVSDSEDINTIDLTGDTDDLMQDSEASNCSPKPANSSNPQPENGSQQNEAQRDSPGERLQNQLARNLAQRIQTQQSQRRIQQRQRQRVGRQAGSSLRGRQRIVPRAVQRPLRRVVVNKVVNRRSLTRYSYQTSKHSDNVSRILRDKIITAPVFYPTLEEFSDPMAYLEKIMKFTKKYGIFKLVAPDEYEPTCTISENFKFSTSHQYIARFFNRWGPAARELCTMRAYLATQNVHFKRGPLLGGLEVDLPKVFHIVQRLGGLKTVMDKKKWHRIAEELNLRNLRNPEKKFDNLFLKYLLPYNSLTKRERQDMMMKVEQRWIKKNNRLMKRVVNPLYRQKRMLGEIESSDEEAEDEDYLTEALNLAEDCSQLGRKMGLEAFKKIAGTAFNMYFPDEKAQPTVAEIEEKYWNIVLLGTQHVSVNTAFIESGVEGNISPKSKTSDAITTSPWYLKNLSTDKSNVLRLLGSLAGMTVPSLHVGMVFSTSCWHRDPHGLPWMDYLHQGTEKIWYGVPSHEGQNFRCALETLCPTLCQNKTLWLPSEIAMTPLNLLLDRNIKLTRCVQRPGEFVFVNPQAYSSSVSTDFTVSESVYFATESYFENVNQAFQELKESCEPSSFSLEQLLISAAKDPHLSPNVLEHVNKHLNDIVSEELSFRRALTDLKVPLLLNKNRPTIWSARDDDECQVCRTALYLSRVTGLFKNASVCLQHALRLINLKKGAELKALIATLAMEVTISNQELHDIVVKLQRRLSQRAK-