Monarch geneset OGS2.0

DPOGS200181
TranscriptDPOGS200181-TA5892 bp
ProteinDPOGS200181-PA1963 aa
Genomic positionDPSCF300128 + 676667-686962
RNAseq coverage1446x (Rank: top 9%)
Annotation
HeliconiusHMEL0217840.090.55% 
BombyxBGIBMGA002931-TA0.088.46% 
DrosophilaMi-2-PB0.067.39% 
EBI UniRef50UniRef50_E2AEH30.070.39%Chromodomain-helicase-DNA-binding protein Mi-2-like protein n=13 Tax=Eumetazoa RepID=E2AEH3_CAMFO
NCBI RefSeqXP_624414.20.071.39%PREDICTED: similar to Chromodomain helicase-DNA-binding protein Mi-2 homolog (ATP-dependent helicase Mi-2) (dMi-2) [Apis mellifera]
NCBI nr blastpgi|1107771980.071.39%PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog [Apis mellifera]
NCBI nr blastxgi|1107771980.072.16%PREDICTED: chromodomain-helicase-DNA-binding protein Mi-2 homolog [Apis mellifera]
Group
Gene OntologyGO:00056344.3e-96nucleus
GO:00036774.3e-96DNA binding
GO:00055244.3e-96ATP binding
GO:00063554.3e-96regulation of transcription, DNA-dependent
GO:00082704.3e-96zinc ion binding
GO:00168184.3e-96hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
GO:00043861e-25helicase activity
GO:00036761e-25nucleic acid binding
GO:00055151.1e-14protein binding
KEGG pathway 
InterPro domain[1727-1898] IPR0129574.3e-96CHD, C-terminal 2
[723-1014] IPR0003301.6e-76SNF2-related
[1358-1514] IPR0094629.7e-70Domain of unknown function DUF1086
[716-925] IPR0140012.5e-27DEAD-like helicase
[1271-1327] IPR0094631.4e-26Domain of unknown function DUF1087
[1071-1155] IPR0016501e-25Helicase, C-terminal
[141-195] IPR0129582.3e-24CHD, N-terminal
[572-649] IPR0161978.9e-20Chromo domain-like
[428-497] IPR0110111.1e-18Zinc finger, FYVE/PHD-type
[426-487] IPR0130833.9e-17Zinc finger, RING/FYVE/PHD-type
[436-479] IPR0019651.1e-14Zinc finger, PHD-type
[596-645] IPR0237809.3e-14Chromo domain
[593-649] IPR0009533.1e-13Chromo domain/shadow
[377-421] IPR0197874.2e-11Zinc finger, PHD-finger
Orthology groupMCL10356 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200181-TA
ATGGCATCAGATGATGAGGTTGATGGATCTTTCGCAGGCGACGAGGACGTAGAAGAAGGTGAGGGTCAGATTGATAACTCTGGGGAGAGTGATGAAGCACCGCCAAAAGAAGATGATGATTACTCTCCAGAAGATGGTAGAAAGAAAAAGAAAGGTAAGAAAAGAAAAGCAAGAGGAGAAGAAAAGAAAGGAAGAAAGAAAAAGAAGAAGCGAAAGAATGAGAGTGAAGATGATGATGACTTTGGTCTGGAGATTGAGGCAGAGGGTGATAGTGATTATGCACTAAGTGCGGTGTCTTCAAGTAAGAAGTCCCGGAAAGGACGTACCAGCAAACACAATACCTCTGCACCAGCGGTACCAGACTCGGGATCAGGTATGCCGACAGTTGAAGAAGTGTGCTCTACATTTGGCCTCACAGACGTTGATATTCAGTATTCAGATGCGGATCTTGAAAATTTAACAAGCTACAAGTTGTTTCAACAACATGTCCGCCCACTTATAGTGAAGGAAAATCCTAAGGTACCAGTGTCGAAGTTGATGATGCTGGTAGCGGCGAAATGGCGTCTGTTCTGTGAGAGAAATCCACATCTGGATGGTGGGAGTCAGAACATGGGCTCCGAAGACAACACTAACACATCCGCCACATCCGACTACACGCCGAAGAGGCCTGGACGTACACCCAAAGAAACTAAAATCGAGGATTCCATTGATGAGCCTGAGGAGGAGGACATGAGCGACGAAGGTACGCCCGCTCCTCGCAAGCGTGGCCGCAAGCCAAAGCACGCGCCACGCGGCAAGCCGGGGCGCAAGCCCAAAGTGCCCACGCTCAAGATCAAGTTCAGTAAACGGAAACGAACCAGCAGTGAAGAAGATCAAGAAGGAAGCGCTGGTGCGAATACAGACTCGGATGAAGAATTTGAGCAGTTATTGGCTGAAGCCGAAGAACCGAGACCAGTGGCCAGTACCACAGAAGAAACCCCGCAACAGAAAGAGGATGATCCGAATTTACCACAACAGCCAAAGAAGAAGGCTAAAACAAAAATCGGCAACAAGAGTAAAAAGAAGAAAAAAACGAAATCAACTAATAAATTTCCAGACGGCGCTGAAGGTGAACAGGAACATCAAGATTACTGCGAGGTGTGTCAACAAGGGGGTGAAATAATTCTATGTGACACATGTCCCCGGGCGTACCATTTGGTGTGTTTGGACCCTGAGTTGGAGGAGACGCCGGAAGGTCGGTGGTCGTGTACATACTGCCAGGCCGAGGGTAATCAGGAACAGGAAGACGATGATGAGCATCAGGAGTTCTGCAGGATTTGCAAAGACGGTGGAGAGTTATTATGTTGCGACTCGTGTCCGTCGGCGTACCATAGATTCTGTTTAAATCCACCATTAGAAGAAGTCCCTGATGGTGAATGGAAATGTCCACGATGTAGTTGTCCACCTTTGGATGGTAAAGTCGCTAAGATTTTAACATGGCGGTGGAAAGAACAGCCGGCCAAGTCAAAGGCACCACGTTCGAGAGAGTTCTTCGTAAAATGGCATGAACGTTCCTACTGGCATTGCAGTTGGATTTCAGAGATCCAGTTGGATGTATTCCATCCCCTGATGTATCGCTACTATATGCGAAAGTCTGATCCCGAAGAACCGCCTAAGTTGGACGACGGGCTCGAGGAACGCGAAGGTCGCAGGCGTATGAAGCACAGCAAGCAACATCACCAAGACAACGATGAGAAACTCCTGGAGGAGAAATACTACAGATACGGCGTGAGACCGGAGTGGCTCATAGTACATCGAGTGATTAATCATCGTACAGCCCGCGATGGTACTACGTACTACCTGGTTAAATGGCGAGATTTATCTTACGATCAGGCCACTTGGGAATCTGAGCACGAGGATATAGCTGGGCTTAAGAATGCCCTTGAATATTATCAGGATATGCGCGCTTATATCACTTCGGAAGGCAAAACTAAGGGGAGTAAAGGCAAAAAGGCCGGTCGTAAGAGTAAGAACAAAGATAACATTGATGACGACGAAAGCAGCAGTGGTCTGCAGTTCAAAGGAAGGAAGTACAACCCGCCACCTGACCGCCCTACCACCAACCTAAACAAGAAGTATGAAGATCAGCCGCCTTTCGTTTATGAAACTGGTATGCAGCTTCATACATACCAGTTGGATGGACTCAATTGGCTGCGGTACTCCTGGGGACAAGGAATTGATACAATACTTGCTGATGAAATGGGTCTCGGCAAAACCATTCAAACTGTGACATTCCTTTACTCGTTGTTTAAGGAAGGTCATTGCAAGGGTCCTTTCTTAGTATCTGTACCCCTTTCAACAATTATCAATTGGGAGAGAGAGTTCGAGCTGTGGGCTCCAGATTTGTACTGTATTACGTATGTCGGTGATAAAGATTCAAGAGCAGTTATTAGAGAAAACGAGCTAACTTTCGACGATGGTGCCAATAGAGGAGGAAGGCCGTCTAAGATAAAGTCTCAAGTGAAATTCAACGTACTCTTAACGTCATATGAACTTATATCAATTGATTCCACGTGTCTCGGTTCTATAGATTGGGCAGTTCTGGTCGTTGACGAAGCTCACAGACTTAAAAGTAACCAGTCTAAATTTTTCCGTCTCCTTGCTGGATATCACATTAATTACAAGCTTCTATTGACGGGAACTCCTTTACAAAACAATCTGGAAGAATTGTTCCATCTTTTGAACTTCTTGAACAAAGATAAGTTTAACGATTTAGCCGCTTTCCAAAATGAATTCGCGGATGTATCGAAAGAGGAACAAGTCAAAAGGCTTCACGAAATGTTAGGGCCGCATATGCTGCGTCGTCTTAAGGCTGACGTATTGAAAAACATGCCAGCGAAATCTGAGTTCATTGTGCGAGTAGAACTGTCACCCATGCAGAAGAAATATTATAAGTACATCTTGACGAGAAACTATGAAGCTTTGAACCCTAAGAGCGGTGGTCAAACTGTGTCGTTACTTAATGTCATGATGGATCTCAAAAAATGTTGCAATCACCCTTATCTGTTCCCTGTAGCGGCAGAAGAAGCACCGCTCGGACCTCATGGCAATTATGAAACCCAAGCTTTGGTTAAGGCATCTGGAAAACTTGTGCTCATGTCTAAAATGTTAAAACAGCTCAAAGAACAAGGGCATAGGGTACTCATATTCTCTCAAATGACGAAAATGTTGGATATTTTAGAAGATTTCTTAGAAGGGGAAGGATATAAATACGAAAGAATTGACGGTGGTATTACAGGAACTATCCGTCAAGAAGCCATTGATAGGTTTAATGCGCCGGGCGCTCAGCAATTCGTTTTCCTTCTCTCAACAAGGGCTGGTGGTCTCGGAATCAATTTAGCCACGGCCGATACTGTAATTATTTACGATTCGGATTGGAATCCTCACAATGATATCCAGGCGTTTTCGCGTGCTCATCGTATCGGTCAAGCCAATAAGGTGATGATTTATCGGTTCGTAACACGTAACAGTGTCGAAGAGAGAGTAACGCAAGTTGCGAAAAGAAAAATGATGTTAACTCACTTGGTTGTGCGTCCGGGGATGGGTGGGAAAGGTGCAAACTTTACTAAACAAGAACTGGACGATATCTTGAGATTCGGTACAGAAGAGTTATTCAAAGAAGAGGAAGGCAAAGAAGAAGCTATACATTACGACGACAGAGCGGTGTCCGAATTACTCGATCGTTCGAAAGAAGGTATCGAGCAGAAGGAATCGTGGGCTAATGAATACCTTAGCTCCTTTAAAGTTGCCAGCTATTCAACCAAAGAAGGCGATGGTGAAGAAGAAGTGGATACTGAGATCATCAAACAAGAGGCCGAGAACACCGACCCCGCTTACTGGATCAAGTTACTTAGGCATCATTACGAACAGCATCAAGAAGATCAGGCGAGAACTCTCGGTAAAGGCAAGAGAGTTCGCAAGCAGGTTAATTACAGCGACGGTATAGTTGCTCAGACTGAAAATAGAGAGGATACTACTTGGCAAGAAAACGGTTCAGATTATAACTCTGATTTCTCCCAGGGCAGTGAAGACGATAAAGAGGACGACGATTTCGATGAAAAGAACGATAACGGTGATTTACTCAGTCGTCGTAGTAAGAGACGTTTGGAGAGAAGGGAAGAAAGAGACAGGCCGTTACCGCCACTGCTTGCTCGTGTCGGTGGAAATATGGAAGTGCTTGGTTTTAACGCACGACAACGTAAGTCGTTCCTCAACGCCATTATGCGATATGGCATGCCGCCGCAGGACGCCTTTAATTCGCAATGGTTGGTCAGAGATCTCAGAGGAAAATCTGAACGGAATTTCAAGGCGTACGTTTCCTTATTCATGAGGCATTTGTGCGAGCCAGGCGCTGATAATGCTGAGACATTTGCGGATGGTGTTCCTAGAGAAGGCTTGTCGAGACAACACGTTCTGACAAGGATTGGTGTCATGAGTCTTATTAGAAAAAAAGTCCAGGAGTTTGAACACATTAACGGATACTACAGTATGCCAGAATTGATAAGGAAGCCTGTAGAGCCAGTAAAAATAGCGGGTGCTGAAAGCGCCGCGCCTAGCCCGGCGCCCTCAACAGCTACACCCATTACTTCAGCGGCTCCATCACCAGCTCCCACTCAAGTCACTCAAGCATCAGGTCAAGCTCCAACACCGGGTAGTTCCGAGAAAGATGAGAAGGAAGAGACTAAAGATGACAAATCTGAAACTAAGGATAAAGAAAAGGATGAACCGATGGACATTAGTGACATTAAAGAAGAAAAAGATAAATTTGACGAAAAGGAAACAAAGGATATTAAAGAAGAGATAAAAGAAGAAAGGAGAATGTCTGTCGATGAGGAGCCCGCCAAAGATGAAGAAAAATCGGAAGAGAAGAAAGACGAAAAGGATGAAAAAATCAAAGAAGAAAATAAAGAGGAATCTGATAAGAAGGATGACGCGAGGAGTGACAAGACCGATTCGGAAGGCTCAGATAAACCGAAAGACGAGAAGAAGGACGAAGATGACGACGACGTGGTGATTGTCAAAGAAGAAGACGAGCTGAAAGTGGAGCGGCGCAAGTTCATGTTTAACATCGCCGACGGCGGCTTCACTGAACTGCACACATTATGGTTGAACGAAGAACGTGCTGCGGCCCCAGGCAGGGAGTACGAAATATGGCACAGGCGACACGATTACTGGCTGCTGGCCGGCATCGTGACGCACGGCTACGGTCGCTGGCAGGACATACAGAACGATCTGCGGTTTGCGATCATCAACGAACCCTTCAAGATGGACGTCGGCAAGGGCAATTTCCTCGAAATCAAGAATAAGTTCTTGGCCAGGCGGTTTAAGTTGCTGGAACAGGCATTGGTTATAGAAGAGCAGCTCCGACGTGCCGCTTACCTCAACCTGACGCAGGATCCCAACCACCCGGCTATGTCGCTGAACGCGCGGTTCGCCGAGGTGGAGTGTCTCGCAGAATCACATCAACACCTCAGCAAGGAATCGCTGGCCGGAAATAAGCCCGCTAACGCCGTTTTGCATAAGGTGTTGAACCAGCTGGAGGAGCTGTTGTCGGATATGAAGTCGGACGTGTCGCGTCTTCCGGCCACGCTGGCTCGCATCCCGCCAGTGGCTCAGCGTCTACAGATGTCGGAGAGGTCTATACTGTCACGGCTCGCAGCCACCGCGGGGAACCCCGTGCCCGCAGCTCAAATGGCCCAGTTCCCTGGTGGTTTCGGGGCGGGCGGTACTCTGCCCGGATTCTCCCCGGCCGCGGCCGCCGCTGCTAACTTCACTAACTTCCGGCCGCAGTACTCTGTCCCCGGACAGCCCGCCGCCGCCGCCACAGCCACCGCCTCCAATGTGAATATCTAA

Protein sequence:

>DPOGS200181-PA
MASDDEVDGSFAGDEDVEEGEGQIDNSGESDEAPPKEDDDYSPEDGRKKKKGKKRKARGEEKKGRKKKKKRKNESEDDDDFGLEIEAEGDSDYALSAVSSSKKSRKGRTSKHNTSAPAVPDSGSGMPTVEEVCSTFGLTDVDIQYSDADLENLTSYKLFQQHVRPLIVKENPKVPVSKLMMLVAAKWRLFCERNPHLDGGSQNMGSEDNTNTSATSDYTPKRPGRTPKETKIEDSIDEPEEEDMSDEGTPAPRKRGRKPKHAPRGKPGRKPKVPTLKIKFSKRKRTSSEEDQEGSAGANTDSDEEFEQLLAEAEEPRPVASTTEETPQQKEDDPNLPQQPKKKAKTKIGNKSKKKKKTKSTNKFPDGAEGEQEHQDYCEVCQQGGEIILCDTCPRAYHLVCLDPELEETPEGRWSCTYCQAEGNQEQEDDDEHQEFCRICKDGGELLCCDSCPSAYHRFCLNPPLEEVPDGEWKCPRCSCPPLDGKVAKILTWRWKEQPAKSKAPRSREFFVKWHERSYWHCSWISEIQLDVFHPLMYRYYMRKSDPEEPPKLDDGLEEREGRRRMKHSKQHHQDNDEKLLEEKYYRYGVRPEWLIVHRVINHRTARDGTTYYLVKWRDLSYDQATWESEHEDIAGLKNALEYYQDMRAYITSEGKTKGSKGKKAGRKSKNKDNIDDDESSSGLQFKGRKYNPPPDRPTTNLNKKYEDQPPFVYETGMQLHTYQLDGLNWLRYSWGQGIDTILADEMGLGKTIQTVTFLYSLFKEGHCKGPFLVSVPLSTIINWEREFELWAPDLYCITYVGDKDSRAVIRENELTFDDGANRGGRPSKIKSQVKFNVLLTSYELISIDSTCLGSIDWAVLVVDEAHRLKSNQSKFFRLLAGYHINYKLLLTGTPLQNNLEELFHLLNFLNKDKFNDLAAFQNEFADVSKEEQVKRLHEMLGPHMLRRLKADVLKNMPAKSEFIVRVELSPMQKKYYKYILTRNYEALNPKSGGQTVSLLNVMMDLKKCCNHPYLFPVAAEEAPLGPHGNYETQALVKASGKLVLMSKMLKQLKEQGHRVLIFSQMTKMLDILEDFLEGEGYKYERIDGGITGTIRQEAIDRFNAPGAQQFVFLLSTRAGGLGINLATADTVIIYDSDWNPHNDIQAFSRAHRIGQANKVMIYRFVTRNSVEERVTQVAKRKMMLTHLVVRPGMGGKGANFTKQELDDILRFGTEELFKEEEGKEEAIHYDDRAVSELLDRSKEGIEQKESWANEYLSSFKVASYSTKEGDGEEEVDTEIIKQEAENTDPAYWIKLLRHHYEQHQEDQARTLGKGKRVRKQVNYSDGIVAQTENREDTTWQENGSDYNSDFSQGSEDDKEDDDFDEKNDNGDLLSRRSKRRLERREERDRPLPPLLARVGGNMEVLGFNARQRKSFLNAIMRYGMPPQDAFNSQWLVRDLRGKSERNFKAYVSLFMRHLCEPGADNAETFADGVPREGLSRQHVLTRIGVMSLIRKKVQEFEHINGYYSMPELIRKPVEPVKIAGAESAAPSPAPSTATPITSAAPSPAPTQVTQASGQAPTPGSSEKDEKEETKDDKSETKDKEKDEPMDISDIKEEKDKFDEKETKDIKEEIKEERRMSVDEEPAKDEEKSEEKKDEKDEKIKEENKEESDKKDDARSDKTDSEGSDKPKDEKKDEDDDDVVIVKEEDELKVERRKFMFNIADGGFTELHTLWLNEERAAAPGREYEIWHRRHDYWLLAGIVTHGYGRWQDIQNDLRFAIINEPFKMDVGKGNFLEIKNKFLARRFKLLEQALVIEEQLRRAAYLNLTQDPNHPAMSLNARFAEVECLAESHQHLSKESLAGNKPANAVLHKVLNQLEELLSDMKSDVSRLPATLARIPPVAQRLQMSERSILSRLAATAGNPVPAAQMAQFPGGFGAGGTLPGFSPAAAAAANFTNFRPQYSVPGQPAAAATATASNVNI-