Monarch geneset OGS2.0

DPOGS203087
TranscriptDPOGS203087-TA6114 bp
ProteinDPOGS203087-PA2037 aa
Genomic positionDPSCF300228 - 52597-99922
RNAseq coverage548x (Rank: top 23%)
Annotation
HeliconiusHMEL0023526e-16259.06% 
BombyxBGIBMGA002321-TA0.070.40% 
Drosophilambc-PB2e-15436.89% 
EBI UniRef50UniRef50_D6X2E20.040.02%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X2E2_TRICA
NCBI RefSeqXP_972351.10.040.02%PREDICTED: similar to myoblast city CG10379-PA [Tribolium castaneum]
NCBI nr blastpgi|3838661990.034.74%PREDICTED: dedicator of cytokinesis protein 1-like isoform 1 [Megachile rotundata]
NCBI nr blastxgi|3838661990.034.69%PREDICTED: dedicator of cytokinesis protein 1-like isoform 1 [Megachile rotundata]
Group
Gene OntologyGO:00510202.5e-30GTPase binding
GO:00055252.5e-30GTP binding
GO:00050852.5e-30guanyl-nucleotide exchange factor activity
GO:00055157.1e-10protein binding
GO:00054885.7e-05binding
KEGG pathwaycin:1001766410.0 
 K13708 (DOCK1)maps-> Shigellosis
    Regulation of actin cytoskeleton
    Bacterial invasion of epithelial cells
    Focal adhesion
InterPro domain[1554-1730] IPR0107032.5e-30Dedicator of cytokinesis
[12-68] IPR0014527.1e-10Src homology-3 domain
[13-65] IPR0115112.7e-07Variant SH3
Orthology groupMCL10111 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203087-TA
ATGACTGTGTGGCAAGAACTCGAAAATGAAAACTGCTATGCCGTCGCGCTGTACAATTTCAATTCACCATCAACTGTCCACCTGCCGCTGGAGGTGGGCGAGCTGGTTCACTTGACGAGGGAAACGAAAGACTGGTACTGGGGCAGCAGTCTCAGGAGAAACAAGAGTGGAGCCTTCCCCAAGAAATATGTCGTCATAAGAGACTGTGTCGTTGACAGATGTGGAGAGACAGTGGTAGCTTCAGCTGGTGGCAGCGGGGTTGTCCACGACATAGCGGTCACGTTACGCGAATGGCTGCAACATTGGAAGGTTTTATATACGACAAATGATGAGCGTTTCAAATTCATGGAAGTTAGCATGAGGGCTCTTTTGGAGTTAAGAGCTCAGGCCGCGTCCGGAGCCCTGCCTGTAGATCAGCTCCGAAGAGTCGCGAGGACGGCTGTCTTCACCATCGACAAGGGGAATAGGACTTTAGGAATGGAGTTGGCGGTCAGAACGATCAGCGGCCAGCTGGTGGATCCGTTAACGACGTCTACCTTCAAACTGAACGCCTTACACGAGGAGGCCAACTCCAGGATAGATAAGAACATGGAAACGACCCCGACGAGTCGTCCTTCTAGTCAAATAACCGCCGCCCGCTCCTATACTATAGTGGCTCACGTGAATAATTTCGTGTGCCGTTTGAGAGACGCCGCCGAGTTATCTCTAGCGCTCCACGACGGCGCCGGCAATAAACTGACGGAAAACGTGCTCCTGAAGTGGCCGCCGGGACATCTGGCAGCACAGGAACTGTTCACCGCGCCTTACGTGGTGTTCACGGATCTGGGCAGCGACATAAAGAAAGAGCGTCTCTTCCTGGTATGCCACGTGGTCAGGATCGGGTCCATGGAGCCGCAGAACGTTGATCACAGACGCAGTTCGATATCTCCGTCGGTGTCCGGGAGCGGCATGCGGCGGCCGTGCGGAGTCGCGTGCGCGGACATCACACAGCACCTCGCCAACGACCACTCCGACAAGGAGATCCAGCTGCCGTTTTACGCATGTGAGAAGGAGAACATGGATTCTCTTCTCAAGAAAGTTATTGGGAACCTGAAGGATCAGAAACACTCCCAAGGACTGTGGGTGTTCCTTCAGATGGTGGAGGGAGATCTACAACAGGTCCGTCGCGAGAAACCTCACATGGTGGTAGGCAACACGGTGTTCGCTCGGAAGATGGGCTTCCCTGAAGTGATCTTGCCAGGGGACGCCCGCAATGACCTGTACGTGACTCTGTGTGGCGGGTCATTCTCTAAGGGGGGCAAGTCCAGCGAGAGGAACATAGAACTGGTGGCCAGAGTCGTGGATAAGAACGGAAAAACGCTACCGGGTGTAATATCAGTGGGCGCGGGCTCTCCGCTGGTGGACGAGTACACCTCCGTGGTCTACTACCACGAGGACCGGCCCCGCTGGCAGGAAGTGTTCAAGGTGTGTCTGAACATAGAAGAGTTCAAAGAGGCTCATATGGTGTTCCTGTGCCGTCACCGCAGCTCCAACGAGGCGAAGGATCGCGCCGAGAAGCACTTCGCCCTCGCCTTCCTCAGACTCATGCAAAAAGAGGGAACCACCATCCCCGACACCACGCACAACCTGGCCGTGTACAAGATAGAACATAAGAGGACATGTTCCTCCTCGGAGCTGGACACGGAGGCGTCGTCAGTGTGTCTCGGGCTGCCGTCCAGGCGGGACGAGCTGCCCGCGGGCTGCGAGCGAGGCCTCACCCGGGGCCCGCTCTCACTCTTACACAGAGACACGCTCTCTGTAACCACCAAGCTCTGCTCCACTAAACTCACACAGAGAGAGGAGATTCTAGGCGTGCTGAAGTGGAGCAATCATCAGACAGACGGGTCATTGAAGAGTTCTCTCACCAGTCTACTCCGAGTGCCAAACGACGAGCTCGTCAAGTTCCTGCAAGATATACTGGATGCATTATTCAGTATACTAACACAGGTAGAGGAAGACGATGATTACACCGAAGACAGCTACTCGGTGCTGGTGCTGGACTGTCTCCTGCAGGTGATATCCCTGGTAGCGGATCACAAGTACCAGCACTTCCTGCCAGTACTCCAGGTGTACATCGACACCAGCTTCTGTGACACGCTGGCTTATGAGCGGCTGATATCCATCATGGTGTGGGCGATTCGTTCCGCGGAGCGAGGCGAGGGCGCGTCCAAACGACTGCTGCAGTGTATGAAGTGTTTGGAGAGCCTGGCGCGGGTGCTGGTGAGGTCTCGGCAGCTCAGGGCGGCGTTGGCCGGCGCTTCACCTGGACACTGCACCCAGCTACAGACTCTGCTGGAGGCGCTCGTTGGCCTCATGAGATGCGGGGACTCCGCCCTCACGTGTCAGGGCTCCGCGCTGAAGTACCTACCGCACGCCATACCTCACATGATCAAAATATACGACGACACGCAACTCTGCGAATATCTAGTCTGGGCATTGGAGGCCCTCCCTCTGTCGCGTCTCTCGAACCAACGTCTTCACGCCCTGCTGGAGTTAGTTAGGGGCCCCCTGGGGGCCTCGGCGGGCCCCCGGGCCAAATTACTACCACACTTAGCCAGCACCCTGAGGGCCCTATTGAGGAATCCTGCTGATAGGCCATACAAAGTGTTCAGGCACATAAAATATTATTTACTTGTTATAGATAATAGTTATAGTATCGAAAAATATGTAACAATGTCAAGTAACGATGACCAAGCCATACATCCTCTCCAAACTATTAACGAGTCCTCTGCGACAGTCCAAGATGTAATAACTATCACACATAATTCTAATAAAACCATGTCGTACATGGTCAATCACTCGCTATATGACCGTCCTCCGTCCGCCGGCGTGGTGAGTGTTGGTATGTCACAGTTGGACACGCCGCACAAGTTACGATCAGCGGACAAGGCCGCCAGGTTGCTGGGAGCCGACACCGCCAGCCTCCTAGATCACACGCAGCAGATGCAGATCGTGGAGTTGTGTGTGGAAACTTTAGGCGAAGTAGTTTCTCTTCTCGCGAGAGATGATGTCGGTCCAGTGGCTGCGGATAGAGGAGAACTAGCGAGGTCTCTGTTACCGACGGTGTTGAGGATAGCGAACAACATCATGAAGGACAGGAATAAGGAAGACGGAACACCTCATGATGATCTACTTCCGCGTAAGTTAATATGCGTGCTGCTAGACATGATGCGTCAGATGTCTTTAGAGCAGTACACGCTGGTGGTGCGTTCGTTGGGCGGGGGCAGAGCCCTGGCCGCGGATGCTCTGTCGTTCACCGGAGCCCTACTCGCACGACCCGTGTTCAAACAGCACTGGGCTGACATGCTGCATCTACAGCACTACGTTATGCTGCACACTTTGAGATTATTAGCGACGTCGCTGCGGGAGGATCTCAACGAGGAGAATTCAGACCAATCGCACGTGCACTCCACGTTACAAGACTGGTTCGTTACATTGAGCGCCTTGGCGTGCTCCAAGCCGTTGCAGCTGGAAACATTACCGGCGGCGAGGAGGCAGAGAGCCGTCATACTTTACGGAGACATCAGGAGAAGCGCCGCGAGCTTGATGGCCGACATTTGGTTCAGTCTCGGTAAAGAACATACTTACATCAGCGGCCAATTCAATAGGAAGTTTTCATCAATTTGTAAATTGTACATTATTATTTCTTACAAAAAAAAAAAAAAAAAAAATCTTTTTCCATTTGTCGTCAGTGAAGGGAATCTACTTCGTCTGGAGAACGAGCTGATCGACAAGCTGGACGTGCTGGCGGAGTCCGGCCTGGGAGACGCGGCGTGGCGGGCGAAGTTCGTGTCCATGTGCGGGGCGGCGTGCGTGGCGGGCGACGGCGGCGAGCTGAGTGGGGCGGGCGGCGCCCTGGTGGCCGCGGCCGCCCGACAGCTGGACGCCTTGCTACAGTACAGAGCGGCGCCCACACACCACAGGATGTACCTCGTCACGAGCGTACTGAGGTTCTACGAACAGATCAAACGACCTCACATGTACATACGCTACGTTCATCGCCTGGTGTCTATGCACCGCTCGTCTCAACACTGGGCGGAGGCCGGTCTAACGTTGCAACTACACGCCAAACTGTTGGACTGGAGCGAGACTCCCCTCCCTCCCCGCCTCCGACACCCCGCCTGTGACGACTATCACACGCACCTGGATCTGAAGATAGCGTTATACCAGGAGGTGGCGAGTCTGTTGGAGATGGGTCACCAGTGGGAGCTGGCGGTGGAGGTCATCAAGGAGCTGGTGTGCGTGTACGAGGACCGCGGCCAGGGGTACGCCGCCCTGGGCGCGCTGCATGACCAGCTAGCCAGGCTGTATCGAGCGCCCCTGGAGCACACGCGGCTACAACCCGCTTACTTCCGGGTGCGCTACCTGGGGAGGGGATTCCCCGAACATTTGAGACACCCCAAGGTAATGAGGAGCAGGTGCAATTGTCGGGGATCTTTAAAAATTGGTTCAGAAAATCCAAATGAACAGCAACCAGCTTTGTCGCACCCCAACCAACTGAGAGTTGTCGCATCACAGACGAAAACGAAAGACTCATTCCTACAAATCAACGCCGTGACGCCGGTGATGGCGGACAAGTGGAAGAAGTTCCTGAGCCGCGCCGTGGACCACCAGGTCGTTAACTACTACGAGCATAACAACGTTAACACGTTCGTGTACTCCCGGCCCTTCCATAGGAACGATGACTGTATCATCGACCCCCGCGACGACCGCACCGGCGCCGGAGAGTTCGCTACCAGGTGGCTGGAGAGGACGGAGCTCACCACAGCACACGACCTGCCCGGTATCCTCCGCTGGTTTCCCATCGTATCGACCAGGACTTACTGGGTGTGTCCCCTGGAGGTGGCCGTCGAGACGATGACTCACACCAATAAAGAGCTCAAAGGGCTCATCCACTCGACTCTCTCCCCGTCCGCTCCACTACACCCTCTCACTATGAGGCTTCAGGGCATACTAGATTCGGCGGTCCAGGGAGGGTTAGCGCAGTACGAGAAGGCATTCCTCGCCCCCGCCTACCTCGAGAGGAGACCGGAGCACAGGGAACTGCTCGGAAAACTGAAAGATCTCATCGCGCAACAGATACCGCTACTCAAATATGGTCTGGAGGTGCACGCGTCCCGGGAACCCATACGAGACCTTCACGCGAGACTTATTAATTGTTTCCGAGAAAAATATGGAAGACAGGTGTACGAGGGTGACTCTGACCCTCCAGAGGTCCAGCTCAGACCTTCCGCGAGACGAGACAACGACAACAGACTCTCAGACCTCTCTGCGGGCAACGAATTATTAGCGACGTCGCTGCGGGAGGATCTCAATGAGGAGAATTCAGACCAATCGCACGTGCACTCCACGTTACAAGACTGGTTCGTTACATTGAGCGCCTTGGCGTGCTCCAAGCCGTTGCAGCTGGAAACATTACCGGCGGCGAGGAGGCAGAGAGCCGGGTTCGTATACCGCGGCAACTCGTGCGACATGCTGCATAACTTCAAGGAGCGGATGTTGGACGAGTGGCCGGAAGCGGATGTGCTGCTCAAACTGGACGAACCCGGCGCTGACGTCACAGACTCGGACGGACAGTTCCTACAAATCAACGCCGTGACGCCGGTGATGGCGGACAAGTGGAAGAAGTTCCTGAGCCGCGCGGTGGACCACCAGGTCGTTAACTACTACGAGCATAACAACGTTAACACGTTCGTGTACTCCCGGCCCTTCCATAGGAACGATGACTGTATCATCGACCCCCGCGACGACCGCACCGGCGCCGGAGAGTTCGCTACCAGGTGGCTGGAGAGGACCGAGCTCACCACAGCACACGACCTGCCCGGACGACACCGAAATCTAAATTCTCCTTCAACTCGGCGATCAGTTGGACCAGGAACGACGGACTCCAACCAGGAGGAGGAGGAGCCGCCGCCTCTACCGCAGAAACAGTCCCGCAGTATGGAACACGACGAGGAAACGACCGTCAGGCACAACTATGACCTGGTGCTGGCGCCCAGGACCTCCTACCTGTACTCCTCCAAGAGAGACAGGGCGCCCACACCGCCGCCCAAGAAGAGGAACCAACAGCTGTAG

Protein sequence:

>DPOGS203087-PA
MTVWQELENENCYAVALYNFNSPSTVHLPLEVGELVHLTRETKDWYWGSSLRRNKSGAFPKKYVVIRDCVVDRCGETVVASAGGSGVVHDIAVTLREWLQHWKVLYTTNDERFKFMEVSMRALLELRAQAASGALPVDQLRRVARTAVFTIDKGNRTLGMELAVRTISGQLVDPLTTSTFKLNALHEEANSRIDKNMETTPTSRPSSQITAARSYTIVAHVNNFVCRLRDAAELSLALHDGAGNKLTENVLLKWPPGHLAAQELFTAPYVVFTDLGSDIKKERLFLVCHVVRIGSMEPQNVDHRRSSISPSVSGSGMRRPCGVACADITQHLANDHSDKEIQLPFYACEKENMDSLLKKVIGNLKDQKHSQGLWVFLQMVEGDLQQVRREKPHMVVGNTVFARKMGFPEVILPGDARNDLYVTLCGGSFSKGGKSSERNIELVARVVDKNGKTLPGVISVGAGSPLVDEYTSVVYYHEDRPRWQEVFKVCLNIEEFKEAHMVFLCRHRSSNEAKDRAEKHFALAFLRLMQKEGTTIPDTTHNLAVYKIEHKRTCSSSELDTEASSVCLGLPSRRDELPAGCERGLTRGPLSLLHRDTLSVTTKLCSTKLTQREEILGVLKWSNHQTDGSLKSSLTSLLRVPNDELVKFLQDILDALFSILTQVEEDDDYTEDSYSVLVLDCLLQVISLVADHKYQHFLPVLQVYIDTSFCDTLAYERLISIMVWAIRSAERGEGASKRLLQCMKCLESLARVLVRSRQLRAALAGASPGHCTQLQTLLEALVGLMRCGDSALTCQGSALKYLPHAIPHMIKIYDDTQLCEYLVWALEALPLSRLSNQRLHALLELVRGPLGASAGPRAKLLPHLASTLRALLRNPADRPYKVFRHIKYYLLVIDNSYSIEKYVTMSSNDDQAIHPLQTINESSATVQDVITITHNSNKTMSYMVNHSLYDRPPSAGVVSVGMSQLDTPHKLRSADKAARLLGADTASLLDHTQQMQIVELCVETLGEVVSLLARDDVGPVAADRGELARSLLPTVLRIANNIMKDRNKEDGTPHDDLLPRKLICVLLDMMRQMSLEQYTLVVRSLGGGRALAADALSFTGALLARPVFKQHWADMLHLQHYVMLHTLRLLATSLREDLNEENSDQSHVHSTLQDWFVTLSALACSKPLQLETLPAARRQRAVILYGDIRRSAASLMADIWFSLGKEHTYISGQFNRKFSSICKLYIIISYKKKKKKNLFPFVVSEGNLLRLENELIDKLDVLAESGLGDAAWRAKFVSMCGAACVAGDGGELSGAGGALVAAAARQLDALLQYRAAPTHHRMYLVTSVLRFYEQIKRPHMYIRYVHRLVSMHRSSQHWAEAGLTLQLHAKLLDWSETPLPPRLRHPACDDYHTHLDLKIALYQEVASLLEMGHQWELAVEVIKELVCVYEDRGQGYAALGALHDQLARLYRAPLEHTRLQPAYFRVRYLGRGFPEHLRHPKVMRSRCNCRGSLKIGSENPNEQQPALSHPNQLRVVASQTKTKDSFLQINAVTPVMADKWKKFLSRAVDHQVVNYYEHNNVNTFVYSRPFHRNDDCIIDPRDDRTGAGEFATRWLERTELTTAHDLPGILRWFPIVSTRTYWVCPLEVAVETMTHTNKELKGLIHSTLSPSAPLHPLTMRLQGILDSAVQGGLAQYEKAFLAPAYLERRPEHRELLGKLKDLIAQQIPLLKYGLEVHASREPIRDLHARLINCFREKYGRQVYEGDSDPPEVQLRPSARRDNDNRLSDLSAGNELLATSLREDLNEENSDQSHVHSTLQDWFVTLSALACSKPLQLETLPAARRQRAGFVYRGNSCDMLHNFKERMLDEWPEADVLLKLDEPGADVTDSDGQFLQINAVTPVMADKWKKFLSRAVDHQVVNYYEHNNVNTFVYSRPFHRNDDCIIDPRDDRTGAGEFATRWLERTELTTAHDLPGRHRNLNSPSTRRSVGPGTTDSNQEEEEPPPLPQKQSRSMEHDEETTVRHNYDLVLAPRTSYLYSSKRDRAPTPPPKKRNQQL-