Monarch geneset OGS2.0

DPOGS208548
Transcript        DPOGS208548-TA    4398 bp
Protein           DPOGS208548-PA    1465 aa
Genomic position  DPSCF300064 + 1014970-1023012
RNAseq coverage   112x (Rank: top 59%)
Annotation
Heliconius        HMEL017880        0.0      60.84%
Bombyx            BGIBMGA010336-TA  0.0      78.24%
Drosophila        capu-PF           1e-107   42.83%
EBI UniRef50      UniRef50_D6X070   4e-139   55.17%  Cappuccino n=1 Tax=Tribolium castaneum RepID=D6X070_TRICA
NCBI RefSeq       XP_969724.1       8e-140   55.17%  PREDICTED: similar to formin 1,2/cappuccino [Tribolium castaneum]
NCBI nr blastp    gi|91089831       2e-138   55.17%  PREDICTED: similar to formin 1,2/cappuccino [Tribolium castaneum]
NCBI nr blastx    gi|242024882      0.0      38.04%  formin 1,2/cappuccino, putative [Pediculus humanus corporis]
Group
Gene Ontology     GO:0003779        6.1e-68  actin binding
                  GO:0016043        6.1e-68  cellular component organization
                  GO:0030036        6.1e-68  actin cytoskeleton organization
KEGG pathway      dme:Dmel_CG3399   1e-105
                  K02184 (FMN2)     maps-> Dorso-ventral axis formation
InterPro domain   [1016-1416]  IPR015425  1.5e-89  Actin-binding FH2
                  [1015-1430]  IPR003104  6.1e-68  Actin-binding FH2/DRF autoregulatory
Orthology group   MCL17005          Insect specific
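
The lengths and coordinates in this record can be cross-checked with simple arithmetic. The sketch below assumes that the genomic and InterPro ranges are 1-based and inclusive, and that the 4398 bp transcript is coding sequence ending in a single stop codon; the record states neither. The function names are illustrative only.

```python
def cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (an assumption here)."""
    return end - start + 1


# Values copied from the record.
transcript_bp = 4398
protein_aa = 1465
genomic_range = (1014970, 1023012)
domains = {"IPR015425": (1016, 1416), "IPR003104": (1015, 1430)}

# Does the protein length account for the whole transcript?
print(cds_length(protein_aa) == transcript_bp)

# Genomic span covered by the gene model on scaffold DPSCF300064.
print(span_length(*genomic_range))

# Domain lengths, and whether each range fits inside the protein.
for accession, (dom_start, dom_end) in domains.items():
    print(accession, span_length(dom_start, dom_end), dom_end <= protein_aa)
```

Under these assumptions, 3 x (1465 + 1) = 4398 matches the transcript length, the genomic span is 8043 bp, and both domain ranges fall within the 1465-residue protein.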
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208548-TA
ATGGGCAATACACAAGCAGGTGAAAAACAACATAAAACCGGAAAGTCACCAGCCAAAGGTAAACATTTTATAAGAAATTTAAACAGAAAAAGTTCCGGTAAAGAAACTAAGAAAAATGCTCGTAAAAAGAGCATCGGAAGGCGAGAAACTCCAGATAAAGAGTTCGATAAAACATCAGAATCCGATAACAATGAAACTATTGAGGCATCTGATAACGACACGGTCGAGTGCGTGTTCAAAACATCGACTGGCGAGGGGGGGGCGGGAACTAGTGTGCAGTCGGAGGTAACGGTGACCAGGTGCGAGTACTCCCAGCGGTCACCGACCCGCGACCACTCACACGCCGTCTCACCCGCCGAGAAATCCTCCTCGGACTCAGTATTCACGGACCCTTTAACGCCGCTCGCCGTAGAAGTCAACCAGTGTTATTACTCCGCCGAGAGCGACAGCGCTCCCGAAGAACCGCTCGTCACGAACCTCGCTCCATGCATCGATGCGCAACAGGTGACCTTGGCAGTCGAAGCCGGACAGACAACTATGCCTCACGATGATACTGCCACCCCGCTGTCATTTCATGACAGCATAGAAAAAGAAGAAAAACTTGACATTTCCGATGAGATTAGTGATAGAGATAATAATGAAACTGTTATGGGTGGCATTTTATCAAAGTCAAAGGAGGAAACCTTACCGCATAAGGATCATGAGTGTAATCATGAATTTCTGGAGAATCGGTTGAAAGCTTGTCCGGGACAAACCTCATTTACTATATCCAGGCACAGGAAAGTGGAGCTTCCACCGGTCGCAGATACCTCTCTTAATATTCTGGATGATGATAAAGAAAAGCGTCATTCCAGTGTCAGTGATGGAGCGGTCCCCGATTCAAATGTTTTACGAAAAGTTGCATCTCTCACCCTCGATAAGCACACAGAGTCGAGGGTCGTACGTCCCAAGTTCGTACCGGAGAAACTTGATTTCCAACTATACCAAAAGTTTGAAGGGCAAATGCTTCTTAATTGGTTCGCATCATCCACATCTGAAGATAATCATTTGCGGAACATTCTCAACAGTCAGGATCTCAAAACACTTGGGGTACAATACTGTACGCACTTGCTGGCGGCCGGTGTCTTAAGACAGATACCTGACAAGGATGCTCCAACTGAGAAAATATTTAAGCCTAATCTAATGTATTATTGGTCACACATGGAGATGCCAGCATCTCAGCCAGTCACACCGGGAAGATTAGACATGTCCTCATGGCCGCCTGAACGTGATACACTTAACTTAAAAACACAAACACAAATCACCAACTATTGTACTATGGAGAACCAATATAACAAACAGCACAGTGAAGATGTTAATGACATAGCTGAGGCAAAGTTTGTTATATCACAGTTAAAAAGGAAACTTCAAGAATTAGAATACCAGTTGGAAAATTATAAAATGAGTCCACAAATTGGATTAATTAATAAAAAGATTAATAACACATTTATGTCGACTCCAGAAAATTTATCGAACAGGAACAAAATTGATTTAGTCAGTAGAGAGGTTCAAACAATAAATGATAGCGAAGTTATTAGGCTGTCAGACAAACCTGTGATAAATTATAAAGATGCAAGTACAGGTGTAAATGAAGATTGCAATTTAGAATTAAGTAGATTAGGTAAAAACAATGAATTGTATAATAACGATTTACTAGGTAAATCAGAAAAGAGATCTGAAAAAATAGAAAATTATTGTAAATTAAATAAAAGTCGAGATACTAATAACGCTTATCGGAATGAAGATATAACAAATATAAGTGGTAACGGCAATAGAAAGCTTAGTAATAGAGAAAGAGATTTACTAATCACGAATCTTACAAATCAACTATATAGTATAGATGATAATAATATTAATAATACATGTGAAAGCTCCTCGGATGCTTCATTCTTAAGTACAACAACAGAACTATTAAGTAGCAGTAGTAACAGATTCTTGGACATCAATCAGGACAATATATC
TAACTCATCTACAGAATTTCTTTGTAGAAATGATGATAAGAAAACTGATTGTAATATACAATCATCAGATTTGAAAAATAGCATTATCCCAATACAAACTATAGAAGCAAGAGATGTTATGTTATCAGATATGAACCTCACATTGAAAGGTTCTGCTAAAAGTAATGATTGTCAATCTGCTGCTACACCACCGCCTCCATACCCTGGGATGCTGCCATCTCCCCCACCTGCGCCAGGGACAATCCAAGATCAGTCACTTTGCAAGGAATCAAAATCACCTGACATAAAACAACAGTCAACAACACCAGCTAATCAACACAAAATCGATTTACTTGATGACAGTAAAGGTTCCCCAACACAAGATATTGTTAAAAGTTCCAGTATTGATGAATCTCCAAAATCAAACAGAAATCTAGACTGTTTCAGAGAAATCGAAAATAATGACTTAAAAGCTAAAAACTTCAATTCAGTCTCTTCAAACCAATATGATGAAAGGAAAAGAAATAGTATAAATTCCTCACTACCTCTCTCCAATGATAACGTGTTACCACCGCCACAATCTCAAGAGTCCACAGTATTATCTCCTCAAGATATTAAAGTAGACGATACTAAAACGCCAGATATCCCTGGTCTAGAACCGCTGTCTTTAAAGTTGGAGTTACCGAGTACAGAATCACCAAAAACAACGATAAGTCCGCATTCACCAGTAAAAGTTGTGGGTCCTCCACCCCCACCTATGCCTGGAACTGACGGTCTCACACCAGTAATGCCAATTATGGGCCCCCCTCCCCCGCCAATGCTTGGAATGGGTCCCCCTCCTCCACCGATGCCTGGAATGGGCCCCCCTCCCCCACCAATGCCTGGAATGGGGCCCCCTCCTCCCCCTCTGCCCGGTATGGGCCCTCCACCACCTCCCATGACTGGTTTAGGACCACCATCGCCACTAACACCTGGAATGCCTCCGCCGCCTCCATTAAATGCTGGACCTGTACCATTCCCAGCGCCACCCGTGGGTGGATGGAATATGCAAAGAGCAACTCTGAGGAAAACTCCCATAAAGCCTGCGGCTCCCATGAAGCCTTTATATTGGACGAGGATTTTAGCACCACCGATTCCCCCATCATGTCAAGGCGATCCAGAAACATCGGGATTCAAACCGCTCTGGCTAGAAATAGAAGAAGCTAAATTGGACAACATAGACGAATTCGCTAATCTATTTTCACGTCAAGTCGTAAAAGCTCCCGTTAAGAAAAAGGTTGAAGTGAAGAGTAAAATACAACCCATCAAGATTTTGGACAGCAAACGTTCGCAGAACGTCGGGATTCTAGCTCAGAGTCTGCACGTTGAGTTCTCGGAGATCGAGAACGCTATCTATAACTTCGACACTTCCGTCGTCAGTTTGGAGGCTCTGCAGCAGATTTATGAATTGAGAGCCAAAGATGAAGAGCTGATGATGATCAAGGAGCACTTGAAGACTAAACCCGGCGTGCCACTGGATAAGCCTGAGGCGTTCCTTCATGACTTATCCGGAATCCACAATTTCGCGGAGAGAATATCTTGTTTTACATTCCAAGCGGAGTTCGACGATGCAGCAAACACGATAATGCACAAATTAGATAATCTGAAGCACACTTGCGAGTTTCTTGTGACGAACGAATCTCTTAAGCAGCTGTTTGCTATCATACTGGCTCTCGGCAATTACATGAACGGGGGGAATGGTCAAAGAGGACAGGCTGATGGATTCGGTCTAGAGATACTTTCTAAATTGAAAGACGTTAAGTCGAAGCAGTCCCATATAACGCTACTACACTTCATAGTACGTACGTACATGCGCGTTTCGTCACTTGGAGCCCTACCGGTGCCGGAGCCCGGGGACGTTGCCCGCGCCGCCGCCCTAGACTTCGCAGAGGTCGCCACGAGTCTGAACACGCTGCGCACTAACCTTGACGAGTGCCGAGAAAAGGTTGAAAAAGTTATCGAGACGGATGCACGGATACAAA
AGACGGAAGACGAAAACGCGAGTTGTGAAAGTAAAAAAAGACTGGAAGTGTTCAAAGACAAAATGACGGCGTTCCTGAACGCTGCGGGAGAAAAACTTAAGACGGAAAACGATAACTTATCAGAGTGTAAAAATAAGTTTATAGCTACAGTGAAATTCTATCAATATTCTCCAAAATGTGGCAAAGTTGAAGATTGTGAACCGAAGGAGTTTTTTTCCCTTTGGACGTCGTTTTGCAGTGATTTTAAAGACATTTATAAAAAAGAAGAACAAATAGCTATTAAAGAAAAATTAAAAGAAACTAAAAAGCTACAGTGCGAAAGAAAAGCGAATACCCAACCAAAAAAAGAGGGTGGTTTGAAAGCCCGTCTTCAGAAGTTGTCTAGCACAAAAAAATGA

Protein sequence:

>DPOGS208548-PA
MGNTQAGEKQHKTGKSPAKGKHFIRNLNRKSSGKETKKNARKKSIGRRETPDKEFDKTSESDNNETIEASDNDTVECVFKTSTGEGGAGTSVQSEVTVTRCEYSQRSPTRDHSHAVSPAEKSSSDSVFTDPLTPLAVEVNQCYYSAESDSAPEEPLVTNLAPCIDAQQVTLAVEAGQTTMPHDDTATPLSFHDSIEKEEKLDISDEISDRDNNETVMGGILSKSKEETLPHKDHECNHEFLENRLKACPGQTSFTISRHRKVELPPVADTSLNILDDDKEKRHSSVSDGAVPDSNVLRKVASLTLDKHTESRVVRPKFVPEKLDFQLYQKFEGQMLLNWFASSTSEDNHLRNILNSQDLKTLGVQYCTHLLAAGVLRQIPDKDAPTEKIFKPNLMYYWSHMEMPASQPVTPGRLDMSSWPPERDTLNLKTQTQITNYCTMENQYNKQHSEDVNDIAEAKFVISQLKRKLQELEYQLENYKMSPQIGLINKKINNTFMSTPENLSNRNKIDLVSREVQTINDSEVIRLSDKPVINYKDASTGVNEDCNLELSRLGKNNELYNNDLLGKSEKRSEKIENYCKLNKSRDTNNAYRNEDITNISGNGNRKLSNRERDLLITNLTNQLYSIDDNNINNTCESSSDASFLSTTTELLSSSSNRFLDINQDNISNSSTEFLCRNDDKKTDCNIQSSDLKNSIIPIQTIEARDVMLSDMNLTLKGSAKSNDCQSAATPPPPYPGMLPSPPPAPGTIQDQSLCKESKSPDIKQQSTTPANQHKIDLLDDSKGSPTQDIVKSSSIDESPKSNRNLDCFREIENNDLKAKNFNSVSSNQYDERKRNSINSSLPLSNDNVLPPPQSQESTVLSPQDIKVDDTKTPDIPGLEPLSLKLELPSTESPKTTISPHSPVKVVGPPPPPMPGTDGLTPVMPIMGPPPPPMLGMGPPPPPMPGMGPPPPPMPGMGPPPPPLPGMGPPPPPMTGLGPPSPLTPGMPPPPPLNAGPVPFPAPPVGGWNMQRATLRKTPIKPAAPMKPLYWTRILAPPIPPSCQGDPETSGFKPLWLEIEEAKLDNIDEFANLFSRQVVKAPVKKKVEVKSKIQPIKILDSKRSQNVGILAQSLHVEFSEIENAIYNFDTSVVSLEALQQIYELRAKDEELMMIKEHLKTKPGVPLDKPEAFLHDLSGIHNFAERISCFTFQAEFDDAANTIMHKLDNLKHTCEFLVTNESLKQLFAIILALGNYMNGGNGQRGQADGFGLEILSKLKDVKSKQSHITLLHFIVRTYMRVSSLGALPVPEPGDVARAAALDFAEVATSLNTLRTNLDECREKVEKVIETDARIQKTEDENASCESKKRLEVFKDKMTAFLNAAGEKLKTENDNLSECKNKFIATVKFYQYSPKCGKVEDCEPKEFFSLWTSFCSDFKDIYKKEEQIAIKEKLKETKKLQCERKANTQPKKEGGLKARLQKLSSTKK-
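
To illustrate how the protein sequence relates to the nucleotide sequence, here is a minimal sketch that translates the first 33 nucleotides of DPOGS208548-TA with the standard genetic code. It assumes the transcript is read in frame from its first base. `translate` is an illustrative helper, not part of any tooling associated with this record. The stop symbol defaults to "-" to match the way the record writes the end of the protein sequence.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in the order T, C, A, G
# (first base varies slowest, third base fastest). "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a DNA string in frame 1; trailing partial codons are ignored."""
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3
    residues = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 33 nt of the transcript, copied from the nucleotide sequence above.
head = "ATGGGCAATACACAAGCAGGTGAAAAACAACAT"
print(translate(head))  # MGNTQAGEKQH, the first 11 residues of DPOGS208548-PA
```

Applying the same function to the full transcript should reproduce the protein sequence, provided the three sequence lines are joined into one string first.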