Monarch geneset OGS2.0

DPOGS214256
TranscriptDPOGS214256-TA5037 bp
ProteinDPOGS214256-PA1678 aa
Genomic positionDPSCF300014 + 1480361-1493027
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0113720.088.04% 
BombyxBGIBMGA005973-TA0.074.82% 
DrosophilaCG42533-PF0.042.79% 
EBI UniRef50UniRef50_UPI00022CAB890.048.10%UPI00022CAB89 related cluster n=3 Tax=unknown RepID=UPI00022CAB89
NCBI RefSeqXP_394253.30.047.66%PREDICTED: similar to CG6630-PA [Apis mellifera]
NCBI nr blastpgi|3407092480.048.13%PREDICTED: LOW QUALITY PROTEIN: dedicator of cytokinesis protein 9-like [Bombus terrestris]
NCBI nr blastxgi|3504251750.048.10%PREDICTED: dedicator of cytokinesis protein 9-like [Bombus impatiens]
Group
Gene OntologyGO:00055156.4e-17protein binding
KEGG pathway 
InterPro domain[145-250] IPR0119936.4e-17Pleckstrin homology-type
[144-255] IPR0018497.5e-16Pleckstrin homology domain
[35-114] IPR0218164e-15Protein of unknown function DUF3398
Orthology groupMCL10747 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214256-TA
ATGCCTCCAGTGATATCCCGCAATAAAAAGAATGGCGCTGAGTATGAGAATGTACTAAGTACTAAGTGCAATGTGCTTGATTTTGAGAAATACATTCAAGAAAACAAAACACTTCTTTTAAATGATCCATTGAGGGAAATACTTCTCTATCCATCTGATGATGTTTCTAGTGTTGTGCTACCTCGTCGTTGGAGAACTGTGACAACAGCTGTACCAGATGTCAGAACTGCATCAACATGTCCTCTCCTAACGAGACAGGCATTACTTAGTTACGCCTCTAGCTGGAACCTTGTTCATTATAAATACAGCAACTATTCCGGATCTTATCTTAATTTACCAAGACCGTTAAATGATAAGCTGCTGGAAGAAGTGTATGATATTGATGCTGAAACAGAGAAAGATCATGAGTCCCAAGCTAAGGTGGACGTTGGAACCAAAGAGGGCTACCTGCTCAAAGGACCAGAAATCGGTTCTGATAGGATATTCAGCAATCTCGCTTCAAAATCGTTTAAGAAACGCTATTGCTCAATGGTCAGAGAGGCGGACGGAGCTTACATACTCGAAGTGTATAAAGACGAAAAGAAAACTGACACTAAACTGACTATTGTTATGGATTTTTGCTCAGAAGTTGTTAAGAATACAAAGCGCGCGCGTTACTGTTTCGAACTACGGATGTCCGGCAACAAGGGATATACCTTTGCTGCCGAAAACGAAGAAGAAATGAACGAATGGATAAAGGCTTTCGAGGCGGCGCTAAAGAAGAACCAGGACCAGAGTATCGATCAGTCTGAAGAGACTTTAGACAGAGATGCAGACGTATCTCTAACAGCGGAGCCGCCTCCTCCCATCTACGGAACCCTGAAAGGCTTGGAGCATAGCATGAACCCGCTACTGATGCGGTACTCGAGGGAGACTGACCTCTCAATAGCTGCGGCTAGGAACGACTCTCGTTCTAATATATTCACCTTGCCATATAAAAGGGCTCCTTCCCCCGAGCCGCAGTTGGAACCGTTCAAAGAACACTTTGGCCAAAGAATTTTGCTCAAATGCGAAAGCTTAAAATTTAGATTACAAGCACCGATAGATGGTGACAAGGAACTGCTGTGCCAAGTGGAGCCCTACTTCACTTATCTAGCGCTCTACGATGCCCGTAGTGGCAAAAAACTCACCGAAAATTTTCACTTCGATTTAAACCACGCGGCCGTGAGAGATCTAGTCAAAGAGACGGAGTGCACAAAATCATTAGTAGAAAATTTTAGCTATGACGTTAAACTGGACGTCAAACAGTTATCCGATGAATGGTTCAAATCAAAGAGACAGGCTTTACTATCAGTAAATAATCCACATCCAGACTTATTTTTAGTTGTTAAAATTGATAAGATCTTACAAGGACATGTGAGTCAGGTGTTAGAGCCGTACATGAAGGCCACGAAAGACCCTCGGCTGGGACTCAAAGTTCACAAAACTGTCCAAGCCTACGCCAACCGAGTTGGCAACTATCGCATGCCGTTTGCTTGGGCGATGAAACCTTTATTTAAATTATATAGCAACGATCTCGACACGACGCCAGAATTTCCGGCAATCTATCGTCAGGAACCGAACAAAATGTCCGATGAAGATTTATTGAAAATATTATCGGATTACCGAAAACCGGAAAAACTTGCAAAATTGACTGTAATACCGGGATGGCTCAGCATCAATATACAGCAGAGCAACGAACAAATACCAAATTGTATGACAAGCGCGTACGCAGCACTAAAGCCTTTCCCGCTATCACCTCTGTCGCCGCTCACGTTGGAGTTGGCAACACTGAATGCAGACGCCGAACAGCCTTACGCATCCTACATCCACCACTTATACGTGCGACCGTTATCACTCAGCTTCGAGTCGCAGAAAATGTTCGTGAGAGCCAGGAACATTGCTTGCTCTATTGAACTCAAGGAAAGCGATGTCGGTGATGTGAAACCATTGCAGGTTATATATGGGCGTATAGGCATGACGACCCAGCACCGGTGTTCAGTACTACATCACAACACGAATCCCAGTTGGTGTGATGAAGCAAAGATCCGTCTGCCAGCAACCATAACACCGTCGCATCATCTTTTGTTCACCTTTCATCATATTTCTTGCGACTTGGCAAAGAAGAATGACACCAATGTGGAAACTTGTATAGGATATGCTTGGGTGCCATTGCTGAAAAACGATAAGCTCATCGATGAGTTCATCAATCTGCCGGTGGCGACACATTTGCCCTCGGGTTATCTATCAATTCAGCCCCTTGGTCTAGGAAAAGGAAACGCCGGTCCAGAGGTGGTGTGGGTTGAAAGCGAGAAGCCACTATTTCGTTGCCAGCTAGTGCTTGACTCAACTGTGGCAACTCGTGACGTGCATTTGCACAATCTATTCTGCCAAATAGAGAGATTGATAAAGAGCAGTTCCCCTCCCACTTCCCCCGGCGCTCCGCCGTGGCACGACGTGTGTAACGCGCTCAAGGGCGCACACGCTGTTAATCTCAGTTCGCTGATAGCTTTCCTTCCTACGATCTTCAATCAGCTTTTCGATTTGATGACAATTGAAAAAGGATACACTTCAGACATGGGCTACCAAGTTGTGAAATTGATAGTACATTATGTTCATCTCATACATGATTTTGGCAGAAAAGATTTACTCGATAGTTATGTTAAGTATGTCTTCAATTGCGTAGAATTTAAGCTACACACAGTTCTGACAGCACCACTGTACATGTTCGTTGATCCTAATCAACAAGACTTTCTATTGTGTCACAAATTTATGCAGTACTCAAGTTTTTTCTTTGACATTATCGTGAAAAGCATGGCACAATACCTGATTAACACCGGTCGGATAAAGATGTCCCGAAACGAGAGATTTCATAACGATCTTTTAGAGAACATCGACAGGCTCGTAACGACCGTGGAACCCACGTATATATTACAACAACCGATGCAGACTCACATATTTAATAAAAATCTCGCCGTGTTCCTTAAGTCTTGTCTTTCGTTCATGGACCGTGGTTTCGTATTTAGACAAATAAAGAAATATTTGGAGAAATTTAAGGCCTGCGACCCAAAAGCTTTGTTCGATTTTAAATTCACATTTCTCCAAACTATATGCTCCCACGAGCACTACGTTCCTTTCAATTTGCCTCTACAAGCTAATAAAAATGGCAAAGATGAGAATGAAGATCCATCAAAGCTAAGATTGTCTGAAGACTTTATAATGAGACATTTTCTAGCTGGTATCCTATTGAAACAGGTGGAGCAATCCCTCCGCGAGGTGCCCAGCAAACGTCGTGTGTCGCTGGGCGTGCTGCGCGCGTTACTCACCAAGCACGAGCACGACGACCGCTACAGGACCAGGCAAGCCCGTGCGCGACTGGCACAGCTGTATGCACCGTGGCTCACAGTCGTGCTTGACAATGCCCACCGGCTCGTTACTAAAACCTTGGCCAGTCCGGTATTAGAAAATGGTCATGACAGAGTGGATGGAGACGGCACCTGTATTCCCGCAGTCAGCAGTTCACAGAAAGAAGCCGCCTCCGCTAACAGCACACCACGCAAAAACAGGCTCACCTTACACTTCGATCACACGCCGTTGCGTAACTCGACTCACTTTAAGGAACCTCCGAATATGTATGGGAAAGATAACGCAACTAACATGTCGCAAAGCTCCCTAGAATCAGTGTCCACTATGTCCGGGGGAGACTCGTTGCCACGTAACGCTCGATTAGATTTGAGCGAAATCGGTGACCAAGTGAACAGGTTCGGCGTTCTATCTCCGGAAGAAGTGCGAGATGTCTTGCTATGTTTCCTGTTCGTACTCAAGTACCTGGACGACGAACGACTATTGGAGTGGTGGCGGACGCACTCTCGTGTCCAACAACAGGCCTTCTTTAACGTATTAGAGATATGTGCGGAACAATTTCAATATGTGGGAAGAAAGAAGATTTTGTCCGAATTCAGTTTACCGTCACCTGATTGTAACGGTAAACTTAAACCAGCTAAAGCCAGGACTCTGCCCGCCAGGATGAGTCCACCAGACTTCTCGAAGGAGCCGCCCATCGTTGTCGAAAAAAATAACACTGTCAATAGAGAAAACCTTGTCAACCCAACAGTCACAAGTGAACTTGAAATGATGTCCGAACATGCAGTCTTATCCGCCGGTTGCTTAGCAACGGAAGTAGGGCTAATTATTATAGACCGAGCGTGTCTTTTTATGAGGAACTTAGGACCGGCAGAAGGAAACGCCGCTAAGCCGTATCTAAAATTACTGCAGTGCCCTCAAAGCGAAACTCTATATAAGCATCTATTCGCCGCCCTCCGGGCTTATATCAATCAGTACTCAGAAACACTGTTCGAAGGCGGCAGTTCAGTTTGTGCCGGTGTGGTGTGCGCTGTGGTCCGTCTGTGCGCGGCGCGGGCTGCCTGGCTGCGGCGAGAAGCAGCCGCCGCCCTCTACCTGCTCATGAGGGCTAACTTCCAGCACGCCGAGAAGGGAAACCTCACTAGAGTACATCTACAGGTGATAATAGCCGTGTCCAAACTGCTGGATAGTTCTACAGTATTGAACTGCAAGCGATTCCAGGAATCCCTATCAGTGATAAACTGCTTCGCTACCGGAGATAAAGCTATGAAAGGGACTGGTCTCTCAAGTGAGGTGGCGGAACTAATGCGTCGCGTGCGCACAGTGGTGGGGGCGCGGGCCAGGCTGGCCGGAGTGGGCGGGGCCGCCGGACACGGTGCTGGAAGCGTTCATCTATCGGAGCTACAGCACGCGCTGGCAGCTTCGTGTCGTGCCACGCCTCGCCTACGACACGCCTGGCTCGCGACACTGGCGGAACACAACGTGCGACACAACGACCACGACGAGGCCATGTGTTGCCAACTTCACATAGCTGCTTTGATAGCGGAATATCTCAAACTGCGAGGAACTCAAACTTGGGGCTCCGAGGCTTTCGCGGATTTGTCCTTAAACATACCTAAAGATGAAATCGGCCTTAAGATAGACGAGGGTAAGTAA

Protein sequence:

>DPOGS214256-PA
MPPVISRNKKNGAEYENVLSTKCNVLDFEKYIQENKTLLLNDPLREILLYPSDDVSSVVLPRRWRTVTTAVPDVRTASTCPLLTRQALLSYASSWNLVHYKYSNYSGSYLNLPRPLNDKLLEEVYDIDAETEKDHESQAKVDVGTKEGYLLKGPEIGSDRIFSNLASKSFKKRYCSMVREADGAYILEVYKDEKKTDTKLTIVMDFCSEVVKNTKRARYCFELRMSGNKGYTFAAENEEEMNEWIKAFEAALKKNQDQSIDQSEETLDRDADVSLTAEPPPPIYGTLKGLEHSMNPLLMRYSRETDLSIAAARNDSRSNIFTLPYKRAPSPEPQLEPFKEHFGQRILLKCESLKFRLQAPIDGDKELLCQVEPYFTYLALYDARSGKKLTENFHFDLNHAAVRDLVKETECTKSLVENFSYDVKLDVKQLSDEWFKSKRQALLSVNNPHPDLFLVVKIDKILQGHVSQVLEPYMKATKDPRLGLKVHKTVQAYANRVGNYRMPFAWAMKPLFKLYSNDLDTTPEFPAIYRQEPNKMSDEDLLKILSDYRKPEKLAKLTVIPGWLSINIQQSNEQIPNCMTSAYAALKPFPLSPLSPLTLELATLNADAEQPYASYIHHLYVRPLSLSFESQKMFVRARNIACSIELKESDVGDVKPLQVIYGRIGMTTQHRCSVLHHNTNPSWCDEAKIRLPATITPSHHLLFTFHHISCDLAKKNDTNVETCIGYAWVPLLKNDKLIDEFINLPVATHLPSGYLSIQPLGLGKGNAGPEVVWVESEKPLFRCQLVLDSTVATRDVHLHNLFCQIERLIKSSSPPTSPGAPPWHDVCNALKGAHAVNLSSLIAFLPTIFNQLFDLMTIEKGYTSDMGYQVVKLIVHYVHLIHDFGRKDLLDSYVKYVFNCVEFKLHTVLTAPLYMFVDPNQQDFLLCHKFMQYSSFFFDIIVKSMAQYLINTGRIKMSRNERFHNDLLENIDRLVTTVEPTYILQQPMQTHIFNKNLAVFLKSCLSFMDRGFVFRQIKKYLEKFKACDPKALFDFKFTFLQTICSHEHYVPFNLPLQANKNGKDENEDPSKLRLSEDFIMRHFLAGILLKQVEQSLREVPSKRRVSLGVLRALLTKHEHDDRYRTRQARARLAQLYAPWLTVVLDNAHRLVTKTLASPVLENGHDRVDGDGTCIPAVSSSQKEAASANSTPRKNRLTLHFDHTPLRNSTHFKEPPNMYGKDNATNMSQSSLESVSTMSGGDSLPRNARLDLSEIGDQVNRFGVLSPEEVRDVLLCFLFVLKYLDDERLLEWWRTHSRVQQQAFFNVLEICAEQFQYVGRKKILSEFSLPSPDCNGKLKPAKARTLPARMSPPDFSKEPPIVVEKNNTVNRENLVNPTVTSELEMMSEHAVLSAGCLATEVGLIIIDRACLFMRNLGPAEGNAAKPYLKLLQCPQSETLYKHLFAALRAYINQYSETLFEGGSSVCAGVVCAVVRLCAARAAWLRREAAAALYLLMRANFQHAEKGNLTRVHLQVIIAVSKLLDSSTVLNCKRFQESLSVINCFATGDKAMKGTGLSSEVAELMRRVRTVVGARARLAGVGGAAGHGAGSVHLSELQHALAASCRATPRLRHAWLATLAEHNVRHNDHDEAMCCQLHIAALIAEYLKLRGTQTWGSEAFADLSLNIPKDEIGLKIDEGK-