Monarch geneset OGS2.0

DPOGS213223
TranscriptDPOGS213223-TA7020 bp
ProteinDPOGS213223-PA2339 aa
Genomic positionDPSCF300114 + 500340-511165
RNAseq coverage691x (Rank: top 19%)
Annotation
HeliconiusHMEL0170760.067.88% 
BombyxBGIBMGA007419-TA7e-12062.72% 
Drosophilaenok-PA6e-13262.65% 
EBI UniRef50UniRef50_D6WX357e-16142.68%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WX35_TRICA
NCBI RefSeqXP_970807.11e-16142.68%PREDICTED: similar to enoki mushroom CG11290-PA [Tribolium castaneum]
NCBI nr blastpgi|910888413e-16042.68%PREDICTED: similar to enoki mushroom CG11290-PA [Tribolium castaneum]
NCBI nr blastxgi|910888410.036.24%PREDICTED: similar to enoki mushroom CG11290-PA [Tribolium castaneum]
Group
Gene OntologyGO:00056341.5e-84nucleus
GO:00063551.5e-84regulation of transcription, DNA-dependent
GO:00167471.5e-84transferase activity, transferring acyl groups other than amino-acyl groups
GO:00036778.5e-08DNA binding
GO:00063348.5e-08nucleosome assembly
GO:00007868.5e-08nucleosome
GO:00055155.2e-05protein binding
GO:00082705.2e-05zinc ion binding
KEGG pathway 
InterPro domain[646-918] IPR0161811.5e-117Acyl-CoA N-acyltransferase
[701-882] IPR0027171.5e-84MOZ/SAS-like protein
[92-163] IPR0119911.5e-08Winged helix-turn-helix transcription repressor DNA-binding
[96-160] IPR0058188.5e-08Histone H1/H5
[174-252] IPR0110112.9e-06Zinc finger, FYVE/PHD-type
Orthology groupMCL22157 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213223-TA
ATGAGCGAACCGGACGACGTTGGAAAAGAAGTGTGGAAGCGTTGGATTCTTGAGGCGATACATAAGATACGATCGCAGAAGCAGAGGCCGAGTGTGGAGAGGATATGTCACGCAATAAGACAGCATCACAACTACCACGAAGATGTTGTTTCGGAGCGTCTGGAGAACGCTGTGCGTGAGGGAGTAGTGCTGAAAGTGTATAATAAGGGACAAAGTTCGTACAAGGATCCAGGGGGGTTGCAAAATAAGGTGTTACGCATATCGGCAGATGTAGATGTTTCGAGGGCGGTCGCCAAAGCAGTGCGAGAACTCGGTGAGCGAGATGGATCCAGCTTGAAGACTATCGAGAAGTATCTGCGGCAAGCTTACCAAGTGTCCGTAGAGGAAAACATCGAGGTTCGGAATATATTGCGAGGCGCAGCGAAGCGTGCAGTGGCTAGGGGCCTGCTCATTCATCACGCGGGTAACTATAAAGCTGCCGAGAGACCCTTACAAGCCTCGGAGCGCATTTCAAAACAAAGGAAGTTGCAAGATTCTCCAGAAGAAAAGCCTAGCCCTGGTGGGGTGCCGGTTTGTGCTGAATGTCTTGGAACAGATGCTAAGAATAGACTCGGGGTCACTGAGGCACTCATCTGCTGTGCCCAGTGTAAATCATACGCCCACCCTACCTGCCTCAACCTGCTCGAATACATCAATTTGACAACATTGAAGAGTGTCCGTTGGTGGTGCGGGGAGTGTTCGCGGTGTAGTTCGTGTGGTTTAAGCGGTGAGTGGGCGTCGCGCTGCAGCGGCTGCCCTCGAGCCGCCCACGCACGCTGCCACGCTCCGCCCTGGCGCTGCGACCGCTGCTCACGGACCAGAACGGCGATGGGAGGGACGCCGAAGAAGCGGAAGTCGAAGACAGCAGCGAGACGCATCGAATCAGACGACGAGCCCTCCGACGGTAATAGCTCGCCGTACGCACGCCTGCCGTCCGAACAGAGAATGTCCAAGGAGAAGCAAAAGTTTTTCAAATTCTCCGCTTTTAATCTCGTAAAACGGCGTCGCTGTAGGGAGTCGTCGAGCGAGTTTGAGGGCTGGGGTGGCGTGCGCGTGACGCGGGTGCAACGCCTCGAGCTGACCGTGGAGTCCTCGCCGCCCGTCCCGCCGCAGCCCGAGCCCACCATCTTCGAGCGCCTGGCCACGGACGCGTCGCCCGACGGCGCCTGGGGCTTCGCCGCGGAGGCTCGCAAGCAGCGCCTCGTAGAGTCCCCGCCTAAAGTCGTACCCGCGCGGTCGCCGCCCAGTCCGCCCGCTAGGCACAGGCGCAGGAGTAGATGCGGAGATCGTCTCTTAACCACGCTATTCGACGGTCTATCCGAGTTTTATTCTGTTCGAACGGCGTCTCGGTCGCAGTCTCGCCATCGCCCGGTTCGGAAGAACAGTGAGGACCGACAAGTACTTAAAGAGACGAGGAGTACGTTCAAACGGTACCAGGCGTCGCTGAACAGGGAGCCTAGTGATTCCCGGCCCATAGAAGACACTGGCTGTAAGGATGCGGGGCGGGACGATCGATCGAGGGAGAGCGAATCTATAGAGAGAGTGCCTTACGAGGCGAGACGGGCCGAGAGGGGCCGGAGCGTGCGGTCTCAGAGCGGCGCAGAGGGCTTCGGGCTCGGCGGGGAAAAGTTGAGTGCGTCGGCGCTGGTGGTGCGCGCGGCGGAGGGCAAGCGGGGTGGGGCGGGAGTGTGCGCGGGGGCGGGCGAGGACGCTCAGCAGCACAAGCTGGCGCGCGCGCTGGGCCCGGCCACCAATCAGACGGAAGGTAAGCGGTTGCCGGCCGGTGTGACTGAGGCCGACGCGGAGTTGTTTAAGCAGGCTCGTGAGGCGAGCGGCGTGGACGGCGGCCAGTCTCCCACTCCCGGCCCAACTCCTGGGGTCGGGCCCCCGCCGCCTCGCTGTCCCTCCGCCATCGAGTTCGGACAGTGGGAGATCGAGACCTGGTACTCCAGCCCCTTCCCGCAGGAATATGCGAGGTTGCCAAAGTTATTCCTATGTGAATTTTGTCTTAAGTATGCGAAGAGTCGAGCGGTTCTCATGCGTCACCTGGACAAGTGTCTGTGGCGACATCCGCCCGCTACGGAGATATACCGCTGCGGCGACATATCTGTGTTCGAAGTCGACGGGAACGCGAACAAAATATACTGTCAGAATCTCTGCCTCCTCGCCAAACTGTTTCTGGACCACAAGACGTTGTATTATGACGTCGAGCCGTTCTTGTTTTACGTACTGACCAAAAACGACAGCAAAGGCTGCCATCTTGTCGGATATTTCTCTAAAGAGAAACATTGTCAGCAGAAATATAATGTGTCGTGTATAATGACGATGCCTCAATATCAAAGACAGGGTTATGGGAGGTTCCTCATCCATTTCAGTTATTTGTTGTCAAAAGAGGAAGGTCAAGCGGGTACTCCAGAGAAGCCTCTGTCAGATTTGGGAAGGGTTTCATATCATGCCTACTGGAAATCGGTGATACTGGAATATCTCCACGACCACAGAGACAAACCGTTCACATTCGAAGACATCGCTCTTTCGACTGGAATGCACATGAATGACATCGCTGTAACATTTCAGCTGCTCGGCTTTGTGAGATACGTTCCCGACAAGGATGATATCAAATTAGGCATTTGCGTAGATTGGAAGAGGGTCGAGAATCACGTGAAGAAGCTGAACAGTAGACCTCGCCTGGAAATAGATCCCGAATGCTTAAGGTGGACGCCATTATTGACGCCGACCATCAATCCCTTCAGGTCCCCGGATGAAAATTCAGCAGATCAGGACACAGAAAATGATGAAGGGGAATTGAAAACAGAAAGTGAAAATACGGAAACGGAACCCGAAACCATTCCTTCGAAGTCCAGCGCGAAAACAAAGTCCGCTGACAAAGACTCGCTAGAACAACCGATAGCCGTCACGTCATCCGGACGGAAGAGGACGCGACCTCTCAAATATAGCGAAACTACGTATCAAACTACGCCAACGTTAAGTGACGCGACACGGAAACGAAAACGAGACGCGAATAGAAAGCTCTCCGAGAGCGTCGATGAAGAAAGGAAAGAAGACAGTACACCTCGACGACAGAGGAGCAAGAGTGTCGGTCGTAAGGCTAAAGTCGTACAAGAAGAAGTCATAGAAGATCTACCGAGGAATAGGCGGAAAGCTAAAGAACAAGCGTACGCGTCAGATAGTCAAGAGAGCCAGGAAAGTAATGATGTTCGTACTGAAGAACCCGCGCCTGTGAGGATGGGTAAGGCAAAAACGAAACGGAAACTCGCCTGGAGAGGAAGACAGACGAAACGCCAGACGGGAAGTAAAACAGTTACAAGCAATGAATCACCAGTAGCGAAGAAGGCCAAGGTTAATACCGAGGAAAACGTTGGCTACAAAACTGACGACTCCTCGCATCCGACTGCTGAACCTGTCGAGGAGAAGACACCAGTAGAACGAAAGGAGAAGTCCGAAAAGGACAAGAATTCAGAAGCAAGCTCAGAAGATTCGTCAGGAGAGGCCGATGACGAGATGGACGTAGACGAAGATAGAGCCAGTGTGCCCAGCAAACCTCCGACACCGCGCCACTTAGAGGAGAACAGCACGGACCATCACACCAGTGACATGGAGTTGGACAGCATACACATGGACTCCCCGAAATCGATTGCTGAAAAAGAGCAAGTGATTAACGACAGCAAAGACAAAGCCGAGGACGAACCGCCCGCCACAGGAGACGGGTCGCAGGCGGCACACGCGGAAAATAACGACAAGATTACCAAGAATGCCAATGATAGTGACAAAATTACCAAGGATGCGAACAGTGACAGAAATTCACCCGCGAAAGTGGTTCTCGATGATAAAGACACAATTGTCATATCAGAATCCGATGATAACAATAGCCAGAGCTGCCCGCTACCATCACCCAAGCCAAATAACATAGCTCCGGCACAAGTCGCCGTATCCGATTCCAAAGATGTTAAAGACGTTGCAGAAAACGAAAGCCCTACTAAAGTTATACCCATAGTGTCCACAGAAAATTCCCATATCGTATTAGACGCGAAGGGACATAGACTGCCTCACTTTGAAACTATCGTCGTGGAGACGGAATCAGACAACGCCCATTCACCAGCGATAGGTGTGTCTCCCAAGAAGAACGACAAGACGGTCATAAGCGAATCGCCGAAACAAAAGAAATCTCCAGAAAAAAATACAGAATCAGCCTACGAGCAGAAGAAGTGCGACATGAGGAAGGAAACCGTGATCCATCATCAAAATATAGAAGAGAGTAAAGGGCCGGAGAAGAAGACGCTCAATAAAATAGAAACCATCATACAGAACTTGGATCCACATAGAGATATGACGAACAGTATGAAGAAGCCCGACCCTCCTAAAGTTGACACGTTCAACGAAATGGATTCAAGGAAACTACAAATGCCCGTGACCAGTAAAGAGAGCGAAACATCTTTCCGGAACGACATGTTGCATGTGAGGAAGGATGAAATCGCGGTTTCCAGAAGCATCATCGAAAACACCTTGAACAACTATCAAAACTCAGTGAGCAACTCGATGACCATTCAGCCCATAAACTGTCTACAACTGCAGCATTCCTATATGAATCACAGGACGAGTGTCAGTAAGGAATCGCAGGCGGTGCAGACTGACAAACTACACGTAAAGACCAGTGATCCCAACAGAGTGATTAATAATATGCCTGAACCGCCTGTTATAAGTAAAAGTACGACTAAACTAGAAGTGCCCCATCCGATCACGTCACCAGTTGTAAACCCTGTTATTCCCAAAGTGCCCAATGTATCCGACATAAGTTTTTTACCCAGATGTCAGAGTGCGAATGCTGCTGTTAACTTGGGTATGGAAAGACCAGACTTAGATAACAATTTCACGAACAGTATACAAAGTTCCCTCAGCGGTCCTATCAACATAGGTCAAACTAATTCCCAAGATAAAAACGACCCCAACCAAAAACCTAGGGAGAAGAGCAAGCTCAGAGACGTACGAGTGAACTCCGCTCACAGTAAGATCGAAAAATCTGAAAAGAAATCAACAAAAAACGAGACACCGAGGAGTACGCCGGAGCCGAAGATGTTTTTTCCAGATCAGAATGCCATCGCCAGCCATCGGAAGACAGAGACCTCGGTACCGACAAACGTGATAAACAGCGTGTCCGTGACGAGCAAACTAGACACGAAGACCAGCGAGGCTCCCAAGAAACAAGATTTTTTTAGGAAAGAGAAATCTAGTACCACTAAATGCGAAACTAAAAATACAATCAAACACGACAAAACCTGTTCGTCCCAGTTGAAGGCGGCAGAGCAAAACGATTTGAATAAAATGTTGCCCAAGTTCAAATATGATAACGAATTAGTGCCTAAAGCGGATTACGCCATGAACCAAATACCGAGCTATCACACGACGCATGCGCAGTACGCGCAGTGGCCGACATGGGACCATACCAGGTTACAGGGGACGTGGGACAACAGGTTCCTGGACATGAAGAACAATGATAAAAACTACTTGGAGAAATTCCAGGGCTTTAATTTACCTCAGTTGGATCAAATGCAAAAATCACCACAGAAACTCCATCCAAAGTACGACCATAATCTCGCTTACGGGGCTCTATCTAGCGGCCTCTACGCCACCACGGGGCTACCGCATTTCAAAGAGACTAAGGCGCCAACCTCCAAAGCCTCGGAATGCCCTCAAAAGAACGACTGCAAACCGACCAAGCAGTCCAAAACGACGACCGCCTGTCAGACACAGTGCGACAAGAAATCCCAGTCGCACACCAACGACACGCACATGAAGCAGATGATGCAACGGCAAGTGAACAAACAACAGGAGGTGGCGACCAGCTGCGCAGACTTCAGGCAGCAGAGCCCGCAGCTACTGACGTCGCCGGGCTGCAAGACGCCCTTCAACCAGAACGGGCCCTGCGACAAGGACATAGACAAGAAAGCCCGCCAGCAGAAGAAGGACGAGTCGCCCAAAGATACGAGCAAGGAGGAGATGTGCGAGGCCGTCAGTCCCGCTCTACAGTCTATGGGAGTCTACACACCAGACTCCACGAGCAACTCGGTCCACTCGGTCCAGTACCCCGCCTGCGAGCTGGACGTCAGCCAGCTAGGCCTGGAGTCGCCCACCAGCATCAGCTCCGACCTCGCCTCGCCCTGCTCCATGATGCACATGCATCCCGCACCCAGTCCGCAGTACCCGCACTCCTCCATACACATACCGTCCATCATGAGCCAACCCAACCAACCGCCCAAACAACAGAAGATCAACAACAGGAACAGGAGTACGGGCAGTGCGGGCGCGGCATCTAGTGACAAGAGCGCTCGTGGGTCCGGCACGCCCCCCGCGCGACACCGCGTCACGCCGCCACACGCGCCGCACCCTCCACACGCACCCGTCTCGCACGGCGGTAGTGTGATGCAGGGCGGAGGGTATCAGGGCGGGTACCTGTCCTTCCAGCAGCAGCAGCAGTACCACGCGGGCTGGCCGCCGTCCTGCTCGCTGGCCAAGCTGCAGCAGATGGCTGACGCGCCACAACATCCGCCGCACACGCCACCAGCCACCGCACAGTACGGGCAGCAGGCTGGGACGCCGCCGGCCGGTCACTATCACGCGCCCAAATACTACGCGCCCGCCCACAACCAGATAGAATCGCCCAGGAACACACGGAATGCGCCCAGTAACCTAAGTCCTATGCAGCACGTCCAAATGGGACCAGGATCTCGGATGTCTCCGAATCTGAACACACACATCATCAGCCAGTACGGGCTGAACGGGTACCGTGTACCCCCGCAGCAACAGTTCAACAATCTGCCAGTTCAGATGATGAACGTCCAGCCGGGTGTGCAGTACCCGGGACCGGATCCTCGTGCGCAACAACCCAACGTGTACGCGTACGCGGGTTACATAAACCCACCACCGCCTCTCACTATGCAAACTTTGAACTCGACTATGCGTCGGTAG

Protein sequence:

>DPOGS213223-PA
MSEPDDVGKEVWKRWILEAIHKIRSQKQRPSVERICHAIRQHHNYHEDVVSERLENAVREGVVLKVYNKGQSSYKDPGGLQNKVLRISADVDVSRAVAKAVRELGERDGSSLKTIEKYLRQAYQVSVEENIEVRNILRGAAKRAVARGLLIHHAGNYKAAERPLQASERISKQRKLQDSPEEKPSPGGVPVCAECLGTDAKNRLGVTEALICCAQCKSYAHPTCLNLLEYINLTTLKSVRWWCGECSRCSSCGLSGEWASRCSGCPRAAHARCHAPPWRCDRCSRTRTAMGGTPKKRKSKTAARRIESDDEPSDGNSSPYARLPSEQRMSKEKQKFFKFSAFNLVKRRRCRESSSEFEGWGGVRVTRVQRLELTVESSPPVPPQPEPTIFERLATDASPDGAWGFAAEARKQRLVESPPKVVPARSPPSPPARHRRRSRCGDRLLTTLFDGLSEFYSVRTASRSQSRHRPVRKNSEDRQVLKETRSTFKRYQASLNREPSDSRPIEDTGCKDAGRDDRSRESESIERVPYEARRAERGRSVRSQSGAEGFGLGGEKLSASALVVRAAEGKRGGAGVCAGAGEDAQQHKLARALGPATNQTEGKRLPAGVTEADAELFKQAREASGVDGGQSPTPGPTPGVGPPPPRCPSAIEFGQWEIETWYSSPFPQEYARLPKLFLCEFCLKYAKSRAVLMRHLDKCLWRHPPATEIYRCGDISVFEVDGNANKIYCQNLCLLAKLFLDHKTLYYDVEPFLFYVLTKNDSKGCHLVGYFSKEKHCQQKYNVSCIMTMPQYQRQGYGRFLIHFSYLLSKEEGQAGTPEKPLSDLGRVSYHAYWKSVILEYLHDHRDKPFTFEDIALSTGMHMNDIAVTFQLLGFVRYVPDKDDIKLGICVDWKRVENHVKKLNSRPRLEIDPECLRWTPLLTPTINPFRSPDENSADQDTENDEGELKTESENTETEPETIPSKSSAKTKSADKDSLEQPIAVTSSGRKRTRPLKYSETTYQTTPTLSDATRKRKRDANRKLSESVDEERKEDSTPRRQRSKSVGRKAKVVQEEVIEDLPRNRRKAKEQAYASDSQESQESNDVRTEEPAPVRMGKAKTKRKLAWRGRQTKRQTGSKTVTSNESPVAKKAKVNTEENVGYKTDDSSHPTAEPVEEKTPVERKEKSEKDKNSEASSEDSSGEADDEMDVDEDRASVPSKPPTPRHLEENSTDHHTSDMELDSIHMDSPKSIAEKEQVINDSKDKAEDEPPATGDGSQAAHAENNDKITKNANDSDKITKDANSDRNSPAKVVLDDKDTIVISESDDNNSQSCPLPSPKPNNIAPAQVAVSDSKDVKDVAENESPTKVIPIVSTENSHIVLDAKGHRLPHFETIVVETESDNAHSPAIGVSPKKNDKTVISESPKQKKSPEKNTESAYEQKKCDMRKETVIHHQNIEESKGPEKKTLNKIETIIQNLDPHRDMTNSMKKPDPPKVDTFNEMDSRKLQMPVTSKESETSFRNDMLHVRKDEIAVSRSIIENTLNNYQNSVSNSMTIQPINCLQLQHSYMNHRTSVSKESQAVQTDKLHVKTSDPNRVINNMPEPPVISKSTTKLEVPHPITSPVVNPVIPKVPNVSDISFLPRCQSANAAVNLGMERPDLDNNFTNSIQSSLSGPINIGQTNSQDKNDPNQKPREKSKLRDVRVNSAHSKIEKSEKKSTKNETPRSTPEPKMFFPDQNAIASHRKTETSVPTNVINSVSVTSKLDTKTSEAPKKQDFFRKEKSSTTKCETKNTIKHDKTCSSQLKAAEQNDLNKMLPKFKYDNELVPKADYAMNQIPSYHTTHAQYAQWPTWDHTRLQGTWDNRFLDMKNNDKNYLEKFQGFNLPQLDQMQKSPQKLHPKYDHNLAYGALSSGLYATTGLPHFKETKAPTSKASECPQKNDCKPTKQSKTTTACQTQCDKKSQSHTNDTHMKQMMQRQVNKQQEVATSCADFRQQSPQLLTSPGCKTPFNQNGPCDKDIDKKARQQKKDESPKDTSKEEMCEAVSPALQSMGVYTPDSTSNSVHSVQYPACELDVSQLGLESPTSISSDLASPCSMMHMHPAPSPQYPHSSIHIPSIMSQPNQPPKQQKINNRNRSTGSAGAASSDKSARGSGTPPARHRVTPPHAPHPPHAPVSHGGSVMQGGGYQGGYLSFQQQQQYHAGWPPSCSLAKLQQMADAPQHPPHTPPATAQYGQQAGTPPAGHYHAPKYYAPAHNQIESPRNTRNAPSNLSPMQHVQMGPGSRMSPNLNTHIISQYGLNGYRVPPQQQFNNLPVQMMNVQPGVQYPGPDPRAQQPNVYAYAGYINPPPPLTMQTLNSTMRR-