Monarch geneset OGS2.0

DPOGS212999
TranscriptDPOGS212999-TA4548 bp
ProteinDPOGS212999-PA1515 aa
Genomic positionDPSCF300024 - 384716-399872
RNAseq coverage412x (Rank: top 29%)
Annotation
HeliconiusHMEL0046990.085.77% 
BombyxBGIBMGA006909-TA0.083.08% 
DrosophilaCG12090-PA0.051.39% 
EBI UniRef50UniRef50_E2BB040.061.27%DEP domain-containing protein 5 n=5 Tax=Formicidae RepID=E2BB04_HARSA
NCBI RefSeqXP_001605551.10.059.91%PREDICTED: similar to ENSANGP00000023755 [Nasonia vitripennis]
NCBI nr blastpgi|3454818740.060.46%PREDICTED: DEP domain-containing protein 5-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3454818740.060.31%PREDICTED: DEP domain-containing protein 5-like isoform 1 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[96-383] IPR0220467.1e-61Protein of unknown function DUF3608
[1056-1145] IPR0119913e-06Winged helix-turn-helix transcription repressor DNA-binding
Orthology groupMCL11160 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212999-TA
ATGAAGTCTTTTAAATTAATTGTTCATCAATCTAGTTTTAGCACCGAAGACCTTATTATTAATCTTAAAGATTATCCTGGATTGAAAGAGAAAGATATTGTCGAAATATATCATCCTGAAAATGATTATCCGCGGCTGCTCTTACAAGTTACAAAAGTGGATGCACCAGGGCGAGGCAGAGATTCAATCAGTGTTGAGCAAAGCATTGCAACAACATTTCAATTACGAACATTTGCTGATGTTTATATTAATATAGTGAATGTCCCTGATGTGGCTCTGGATTCAGTTGAATTAACATTTAAGGATCAATATCTAGGAAGATCAGAAATGTGGAGACTGAAAAACCACTTAGTGGGTACTTGTGTATATTTAAACAAAAAGATAGAATATTGTGGTGGTGCAGTCCGTTGTCAGGTTTATGAAATGTGGTCACAAGGTGATAGAGTGGCATGCGGCGTTATCACCGAGGACACCAAAATTGTGTTTAGATCATCGACCTCCATGGTTTATTTGTTTATACAAATGTCATCGGAAATGTGGGATTTTGATATACATGGTGATTTGTATTTTGAAAAAGCTGTAAATGGATTTCTTGCTGATTTATTTACTAAGTGGAAGAAAAATGGGAGTAACCATGAAGTGACAATTGTTCTATTTTCAAGAACATTTTACAAAGCAAAAACTTTAGAAGAATTTCCTCAGCACATGAAAGAATGTTTACAAAAAGATTACAGGGGCAGGTTTTATGAAGATTTTTACAGGGTAGCCGTACAAAATGAGAGATATGAAGACTGGTCCAATGTTCTTCTTCAGTTAAGAAGATTGTTCACAGACTATCAGAAGATTGTACTTCAATACCACGAGAGACCGAATATGGAAATACCCACGGCAATAAACTCAACGGCTGCCCAGGGGAACTTTCTAGAGGTTCTGAATATGAGTTTGAACGTATTTGAAAAACATTACCTGGACAGATGTTTCGATAGGACGGGTCAACTTAGTGTGGTCATAACACCGGGCGTCGGCGTCTTTGAAGTGGATAGGGAACTGACGAATGTTACCAAACAGAGGATCATTGATAACGGCGTGGGAAGCGACCTCGTCTGCGTCGGCGAGCAACCACTACACGCTGTACCATTACTGAAGTTCCACAACAAAGACAGCACAATAAATTCCATAGACGATTACTCTATGCCGCATTGGATCAATCTATCATTCTATTCCACGAACAAGAAGGTTGCATATTCTAATTTCATCCCAAGGATCAAGCTGCCGCCGCGCAAGAGCAATGAACCTCTGAAGAAAATGTACGAAGACGAGAGTAAAGGCAAGTTGTTGAAGGAGGACGACTATATGCACAATTCTATATTCGATTACGACGCTTACGACGCTCAGGTGTTTCAATTGCCACCAGCTCACAGCACATGTGTACAAAGAGTCGCTCGTACAAAGAAGACTTCGGTGGCTGGTCTAGAAGGTATAAGCCGTAGAAGCTCACCCTCGATACATCACAGAAAGATGTCGGATCCCGATATACATCACAGTCTCGGTTCTGACATACTGAGTAACAGTAAATACAGCGTGACTGACAGCAGTGACACAACGGATTCTCCGACATCGATGAGTCGCGCGTCTCCAAAGAGTTCAGTTTCCAACAAACCCGTTATAAGAACTGGGAGGGCGCTCATCAATCCGTTCGATCCATCACACGTGACGGTCAAACTTACCAGCAACAGGCGACGGTGGACGCATATATTCCCTAAAGGTCCAACCGGCGTGTTGATTCAACAGCACCATTACCAGGCTCGTCCAGCCGGGGAACCGGCCAGGGAAACGGAAACCAAGGATAACGGTTCGATAGTTAACAACCACTCCGTCTATCAAGTTGACGGTAATATTATTGGAAGGAAGCGCATGATAAGCACTTCCGGTGGTTTGGGTGTCTTGGGTACATTGGGGACTCCGTCTACAACAAACAACGCCTCCCTAGCGTTACTTTGGGGCGCTACCGGCGAACAAGAGTGGACGCCAGCTTTAACCACCGGCGTGGACTGGAAATCGTTGACTATACCGGCGTGTCTTCCGATAACCACGGATTACTTCCCTGATAAGAGATCTCTGCAGAATGATTATCTCGTTTCCGACTATAACCTTTTGCCTGATGATGTGAATGCTGATTTTGCCCAAAACAGAGCAATATACAAAGAACCGCTTACGACTATGGAGGTGTTTAAGGAACTTGTATCTCAGAGATTAGCGCAGGGCTTCCAACTGATAGTGGGAGTGAATGAAAACGAAGTGATTGAATCCCACTGTCCATCAACCCACACCCCACCACCGTCCAAAGTCGCACCACAGAGTAGCAAACTCACGAACGCGCCCACAAAACGCTATTTACTGTCTATAGGAAGAATATTCCACAAGCTGACTCTGGTTGGATCCACGATCACCGTCACTCGTTACAGGCCAAGACATCCCTACCCACCGTTCAACATCCACTACCGCTACCGTTTCCACGCCCCCTACCACGACACGTACGAGGTGTCCTGGGTCTCCTTCACCACCGAGAAACTCGAGAACTACAACTGGAACTACATGGACCACTATATATGTACTAGGGGTGACACGGACTTCTTTCTTGTTGAGTCACTGAAATATTGGCGTTTTAGAACACTTCTCCTGCCTTTATACAATCCAGCAACGAAAACAATACTCGAGGATGACTCTACCCACTGCGATATTTATCCAACACCAACCAGGCAGGATCTCGACCATCTCACCGAAGGATTCCTTAAGATGACAGAGTCTTACTTCAATAAGGTTAAACGTCCTAACAAGCAGAGGGTGGCGTGCGGCGGCGACGCGGGGCTCACTCGACGGCGGCACTCCACCTCAACCTCCATCCTCACGAGGACCGTTCAGGGCGGGGCTTCGGGTTCCCCGTTCAGAGAGCGCGTTGGCAGCACCCGACTTCCGGATCGACCGAGGCTTAGGATCGAGGCTATCGAATTGGCTGAGAGATATGGCAATACATTTAATAAACAGGTTTCACCTACTAAGACACTAAATACTAGCCCTGGGACCAAAGCCGCAGCTAAAACAGTGCTCGAGAGATCGCAATCACAGTCCGGGGAATGCGACGACACCTACGACGATGGCGTTATTGAACCGAAGTTAAAACCGAACGCCACGTTATCCGAGGTTATAGAGCGGATGCGTCACCAGACCTTGGGCGTCGGCTTCCTGCAGCAGACTGTCAGTCTACCGTCTCACACCTTCGTATCAATATACGCTATACATTGGCTACAGGCCAATATGGAGAATGTTACTTATGAAAAAGCAACTAACATTATGGAGAAACTTCTTCAGGAAAAAATGATATGCCATGCGTCCGGTGATACTCTCAAGCGTTATGTTGTTGGTTATTATATGTATCATATATTACCACAAAAGAAAGATAAAGAGCTTTCGGATTACGTGAAACCTCTAGGGGACTTACAGAGCTTTGAAAACGAGTGGATGGAAGTAGAGGTCCTTGGGCCTAGATCTCCACTACAAACAACAGAGACGAGCTCGCGAGCAATTGATATATCTGGCACGAGCCCTGTCACCGATGATTCCGGCATGCCCGCCTTCCTATGTGATAATATAGATCCGAATTATATGCAATTAGGCAGTGACGATATGCCGCTATACAAGAACACTCACCTGGATATCGATGTGAACAACAAGAGTGATCGTATTGAGTGGGGCCACGCTCGCTACCAGGCCACCTTCAGACCGGACCAGGCCTACGAGATGTGCATACAGTGGGCCATCGCTAGCGGGAACATCGTGGCTGAATTGATATTCGGTTGGGCTCGCAAGGCGCAGAGTTGTCGTCTCCAAATGGTGCCAATACCGGCTGACCCACTGGCGTTACCCTTCACAGAGAAATCTGATCCATTAAGAGGTCCTATATACGTGCCTTTAAATGAAGAACCTCTATTAAGAGGAAAGACTTCTTTGTTCGAAGGTTTTCCAGAGGAAACCTGGTTAGAAAGGTTGTTCCTGTTCCAAGAGGCTATAGTCGGTAGATTCGGTTTCATAAAATGCACGGTGGAGAGTACTTCACACGCTGCGGGGGTGGGGGACCATTTGTACGTTCACGTGACTGGCAACATGTTCATACTGATACCAACAACAGTTAAATTAGAACAGAAGTGTTTGAGGACGAAGCCAGCGAATAAACCGGTTAATGCCAGCAGATATCCTGTGCACTCGGATGTGGCGCCGAGTCCGCACGAAGGTTACATCACACGACATGTCAGCGGGAAAAATAAAGACGACTATGATAACAGCAGACGGATGGGTTTCTTGTGGTCGTGGAATCACATGATATCTAAGAAATGGAAGTGGTCCCAAACACCGGCTACGGGAGAAGAAGGATTCCAGATGAGAATGCTCAGAGATTTCAAACATTTCTGTGCAAATCATGAACAGAGATTGAGTTTATTCTGGGACTCGTGCTGGGAAATCAGAGAGAGAGCAAATGGAATTAAGTTCTGA

Protein sequence:

>DPOGS212999-PA
MKSFKLIVHQSSFSTEDLIINLKDYPGLKEKDIVEIYHPENDYPRLLLQVTKVDAPGRGRDSISVEQSIATTFQLRTFADVYINIVNVPDVALDSVELTFKDQYLGRSEMWRLKNHLVGTCVYLNKKIEYCGGAVRCQVYEMWSQGDRVACGVITEDTKIVFRSSTSMVYLFIQMSSEMWDFDIHGDLYFEKAVNGFLADLFTKWKKNGSNHEVTIVLFSRTFYKAKTLEEFPQHMKECLQKDYRGRFYEDFYRVAVQNERYEDWSNVLLQLRRLFTDYQKIVLQYHERPNMEIPTAINSTAAQGNFLEVLNMSLNVFEKHYLDRCFDRTGQLSVVITPGVGVFEVDRELTNVTKQRIIDNGVGSDLVCVGEQPLHAVPLLKFHNKDSTINSIDDYSMPHWINLSFYSTNKKVAYSNFIPRIKLPPRKSNEPLKKMYEDESKGKLLKEDDYMHNSIFDYDAYDAQVFQLPPAHSTCVQRVARTKKTSVAGLEGISRRSSPSIHHRKMSDPDIHHSLGSDILSNSKYSVTDSSDTTDSPTSMSRASPKSSVSNKPVIRTGRALINPFDPSHVTVKLTSNRRRWTHIFPKGPTGVLIQQHHYQARPAGEPARETETKDNGSIVNNHSVYQVDGNIIGRKRMISTSGGLGVLGTLGTPSTTNNASLALLWGATGEQEWTPALTTGVDWKSLTIPACLPITTDYFPDKRSLQNDYLVSDYNLLPDDVNADFAQNRAIYKEPLTTMEVFKELVSQRLAQGFQLIVGVNENEVIESHCPSTHTPPPSKVAPQSSKLTNAPTKRYLLSIGRIFHKLTLVGSTITVTRYRPRHPYPPFNIHYRYRFHAPYHDTYEVSWVSFTTEKLENYNWNYMDHYICTRGDTDFFLVESLKYWRFRTLLLPLYNPATKTILEDDSTHCDIYPTPTRQDLDHLTEGFLKMTESYFNKVKRPNKQRVACGGDAGLTRRRHSTSTSILTRTVQGGASGSPFRERVGSTRLPDRPRLRIEAIELAERYGNTFNKQVSPTKTLNTSPGTKAAAKTVLERSQSQSGECDDTYDDGVIEPKLKPNATLSEVIERMRHQTLGVGFLQQTVSLPSHTFVSIYAIHWLQANMENVTYEKATNIMEKLLQEKMICHASGDTLKRYVVGYYMYHILPQKKDKELSDYVKPLGDLQSFENEWMEVEVLGPRSPLQTTETSSRAIDISGTSPVTDDSGMPAFLCDNIDPNYMQLGSDDMPLYKNTHLDIDVNNKSDRIEWGHARYQATFRPDQAYEMCIQWAIASGNIVAELIFGWARKAQSCRLQMVPIPADPLALPFTEKSDPLRGPIYVPLNEEPLLRGKTSLFEGFPEETWLERLFLFQEAIVGRFGFIKCTVESTSHAAGVGDHLYVHVTGNMFILIPTTVKLEQKCLRTKPANKPVNASRYPVHSDVAPSPHEGYITRHVSGKNKDDYDNSRRMGFLWSWNHMISKKWKWSQTPATGEEGFQMRMLRDFKHFCANHEQRLSLFWDSCWEIRERANGIKF-