Monarch geneset OGS2.0

DPOGS215226
TranscriptDPOGS215226-TA4278 bp
ProteinDPOGS215226-PA1425 aa
Genomic positionDPSCF300143 + 518117-527783
RNAseq coverage405x (Rank: top 30%)
Annotation
HeliconiusHMEL0102790.072.67% 
BombyxBGIBMGA008634-TA0.077.03% 
Drosophilaform3-PB0.039.09% 
EBI UniRef50UniRef50_E2AC020.047.50%FH2 domain-containing protein 1 n=2 Tax=Camponotus floridanus RepID=E2AC02_CAMFO
NCBI RefSeqXP_002134824.10.043.36%GA23590 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|3071814250.047.50%FH2 domain-containing protein 1 [Camponotus floridanus]
NCBI nr blastxgi|1892417990.047.41%PREDICTED: similar to formin 3 CG33556-PB [Tribolium castaneum]
Group
Gene OntologyGO:00037796.2e-54actin binding
GO:00160436.2e-54cellular component organization
GO:00300366.2e-54actin cytoskeleton organization
GO:00054882.6e-45binding
KEGG pathwayhsa:816242e-35 
 K05745 (DIAPH3, DRF3)maps-> Regulation of actin cytoskeleton
InterPro domain[298-723] IPR0154252.3e-101Actin-binding FH2
[322-774] IPR0031046.2e-54Actin-binding FH2/DRF autoregulatory
[1-220] IPR0160242.6e-45Armadillo-type fold
[49-223] IPR0104727.7e-18Diaphanous FH3
Orthology groupMCL12875 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215226-TA
ATGGAGAGCCGTGTGGGTCTCGACTACATCGTAGAGCATGCGGAGTACGCCGGCAAACTCGCGGCGGCGCTCATGACGCCAACAGCCGCTGTGAAAAAACAAGTCTTTGAACTTCTATCAGCGCTCTGTGTGTACAACGCCGACGGATACGCCAGGGCAGTCGACACGCTCGACAGATACAAGACACTGAAGGGTGATCGCTACCGTCTGTCCGTGGTGGTAGAGGAACTGAAACAAGCTACCACCATCGACTACAAAACAGCGCTCGTGGCATTCGTCAACTGTCTGATAATATCCGCCCCCCGGCTGCCGGACAGGATACGAGTGAGGAATGAGTTTATTGGTGAGTCTACGAGCGAGTTACTTCGATTAGAACACGAAGCTGCTTCTCATCCGAATCTCGGCGTTCAGCTGGACGTCTTCGAGGAGCAGAGAGAGAGTGACGAGGCTCACGGACCTGGAGGAATCAATCTCAACTCACACCTCGACGTCTTCTACGCTATACTCAAACAGGTATCAGACACCCCCCAAGAGATCCCGTTCCTCAGTATATTGCAACACTTACTCCGAATAGACCCTAAGGAAGCTGTCAGTGATATAGTGTGGGATACGGCTGAGACACTCGTACACAGAGCCACCCTGCTGGAGACGAGAGAAGATGCTGCTAAACTACTACGAGCTCCGAGCGTTCAGACCAAAATGGTGTGCGCGTGTCAACACCGTGAGGCCGGCTCCGCGAGGAAGCAGAGTCTACAGCGAGCGCTGTCACCACCACCCGCACCACCAGCTCCGCCAGCTCCGCCCGCGCCACCTGGATCTCACGCCCCCCTCCCTCCCTTACCGCCCGCTCCCCCAGCTCCCCCTGCCCCGCCGGGTCCTCCCCGCTCGGGTCCCCCTCCTCCTCCTCCCGCGCCGTCCACCCCCACCCCACCTCCGCCGCCAGTGGTGGACGTGAAGCTGCCGCAGCAGGAGACTCCTCTACCGAAAACGAAAATGAAGACGATCAACTGGAACAAGATCCCCAACAGCAAAATCGTGGGCCAGAACAACATCTGGTCGCTGGTAGCGTCGAGTCACAAACACTCGCCTAAAGCAGAGCTGGACTGGACTGAGATCGAGGGGCTGTTTTGTCAACAGCTCCAGCCGCCAGGGTCCGCAGGCTCGTCCCCCCGTCTCGGGCGCAGTCCCGTCTGCGACAGTTCAGGAGAACGAAAGCCCCGCAAGGAACCCTCGGAGATCACACTCCTCGACGGGAAGCGCAGCCTCAACGTTAACATTTTCTTGAAGCAGTTCCGCAGTTCCAACGAGGAGATCATACAAATGATCCGTGAAGGAGCTCACGACGACATCGGCGCAGAGAAACTCCGCGGACTCTTAAAGATACTGCCCGAGATCGACGAGTGCGAGATGCTGAAATCCTTCTCCGGTGACGTCACCAAGCTCGGGAACGCCGAGAAATTCCTCCTGCAACTCATCCAGCTGCCCAACTACCGTGTTCGCGTGGAGGCTCTCCTGCTGAAGGAGGAGTGGTCGTCCACCGCAGGCGCACTGGAGACAGCGGTCAACGCGCTCCTGGTGGCGGGAGATGACCTCATGTCCTCGAGAGCCATACAGGAGGTGCTATACATCCTGCTGGTGGCGGGTAACTTCCTGAACGCTGGAGGGTACGCGGGCGGAGCAGCCGGGGTCAAGCTGTCCTCGCTACAGAAGCTCACCGACATCCGCGCCAACAAACCGGGGATGAATCTGATGCACTACGTCGCCATGCAAGCTGAACGGAAGAACAAGGAGCTGGTGCACTTCGCGGATGACATACGAGTGCTGGAGGAGGCCTCGAAGGCCAGCGTGGAACAGCTGCACAACGAAATACACACCCTCGCCAACAGGATCCACACCTTGAAGAGAGATCTGCATCACACCAGCGAAGACATCCGCCTCCAGTCGGGGGATTTCCTCCAGGTAGCGGAACGCGAGGTGGCAGCCCTGAAAAAGGATATGGAGGAAGTGGAGGGGATGAGGAAACAGCTCGCGGAGTTCTTCTGTGAGGATCCGGTGTCCTTTAAACTGGAAGAGTGCTTCAAGACGTTCGTGTTATTCTGCACCAAGTTCCGTTCGGCCGTCGCAGACAACGAGAGACGGCGAACGCTGGAACAACAAGCGGCGGCCAGGAACAGGAACAGGAACAAGATGCAGAAGAAGACCGGGGACGTGCTGGGGAACAACGACTTAGGCGGCAGTGTGTGCAGTACTCCTGTGTCGGAGAGCGAGTCCCTGATGGATTCTCTGCTGCTGGACATCAGGAACGGTCTCGGCAGACGGTCGCTAAGAAAACCCCACGAACCATCCCCGTCCCCCGACGCGACTCCACCCGGTAGTCTCCGCCGTCGTTCCCGCGCCTCCCCGGACGAGGACGGGCTGATGGAGTTCCTCCGCCACGCCTCCCCCGCGGCCGACGACCACAGGGAAAGAAGCGCCTGGGGAAGCCTCGATCGCTCCTGGTCTCGGCGGGCTCCGGTCCGCGCTCGTCTGGAGCTTCCGGACCGAGAGCGTGCGCCCCCCTCAACCCCGCAACCCTCCACCACCCCCGGGGACAACCCCAAACCCAAGGAGTGGCGTCAGAAGATCGAGTCCTGGCTTCGCGCGAACAGCGAGGAGGAGCGCCGGGCGCCGTCCCCGAGCGCGGCTGCCCCGGCCTCGTCCCTGGCCCCGGCCCGGGCCCGCTCCCTCCGAGTGCTGCATCGGCGCTCCTTGGAAAACGACTCCGAGAGCGAGCGCAGTACACTGGACACGCTGACGGAGGGCGGCGGCGCGGCGGCGGGAGAGAGGGGCTGGAGACCGGAAGTGCCCACAGACACGGACCTCGTGAAGGCCATCGAGGTCGTCGAAGATGTTCAGCCGAAAACAATAGAGAACAGGATACCGTGGAGAAAGACGGAGGAGAACGAGGAGATACGGCGACTGAGGAGACAGAGGTCACGACCGCAGATGGAGTCACAACCACTCGTGGCCATCACGGAGGAGAGGAGGAGGCTGCCGGACATCACCGTCACCAGGGATACAGTACCGACAGACAACTCACGTAGAGGGGAGGTCGATTCGGAGAATTTAGAGACCCCGCCCGTACAGAGGAAGATGTTCAGTCCGCCGCCTGACAGAGAACCCTGCAGAAGATTCCTACCGGCGATATCCAACACCAATACCAACGATAAGGCGTCCCCAGAGATGAAGGACCTCTGTCAGGAGATACTGGGCGATGGACAGTTCGACAGATTCTCAGCCGCGAGAAGAACGAGGCGGTACAAGAGAAGTACGGAGACCAGCTCCCCAGAAGACGAGAAGAAGAGCGCCTCGGAGCTAGTGACGGAAACACAAGTCACAAGACCTGCGACACTAGAAGTACAGGCCGCGTACCCGGCGGAAGAAACGAAGGAGGTCACGAGACCCTCGGAGACCAGAGACGACACAGAGAACAGGTTGAAGCGGTGGCAGGAACGATTGAAAAACCAAAGCAAAGACAAAACGCCAGCCAAAGACAAGGTGCCGTACTCGAGAATGAGGCGGCAGACCTCCATCAACCAAGAAGACGTCCAGAAAGCTATCAGGGAGCTGAAGTCTCCGACGCAGTCTCCCGCGGGCGTGTGGTCCAGGAACGCTTACAGGAGGTCTTTCAACGCTAAAACGGACAAGGTAACGCCCACAGCCCCCGCTCCTGCCCCCGCCGCCGCCACCGGCAGCGGCACTCACGACAGAACAACGTCCCCGAGGATCTTGAAAGTGAAAAGCGAACACGAACTCAACGACGAGGGCTTCGAGGAGACGCAGAGTCTGAACTCGGAGAGCGCCTCGCAGGGAGCGTCCTCGGGCTGCGGGGTCGACTGCGAGTCACCCGTTCCCAAGAAAAACACAGAAATCCCAAAGAAAATAGCTACCAAAGACACGAGGCCGCCTCCGCGGACGCCGCTCGACCCCAGGCGAAGTCTCCCGCGGAGGCCGACCTCTCTGCGGGTAGAGCGCTCCGCGTCCAGGGCGTCTCTGAGGAGCTCCCGGAGTTCCTTGAACAGCTCGGCGTCCGTGGCGACCGTCAAGCGAGCGCCCACCATCAAACCCGTCCCCAAGCCGATCCCCAAACCGGCGCCCAGGGTACCGGCCTCCCGGTCCTCGTCCAGCGGCAGCTCCATCGGCACCTCCAGGCCCAAGGCCGACAAGACGTCCGGCTTCATGAGGCCCACGCAGGCCTCCAAGGTCCGAGGGTCCGCGGCGAAGACGAGCGCCAAGTGA

Protein sequence:

>DPOGS215226-PA
MESRVGLDYIVEHAEYAGKLAAALMTPTAAVKKQVFELLSALCVYNADGYARAVDTLDRYKTLKGDRYRLSVVVEELKQATTIDYKTALVAFVNCLIISAPRLPDRIRVRNEFIGESTSELLRLEHEAASHPNLGVQLDVFEEQRESDEAHGPGGINLNSHLDVFYAILKQVSDTPQEIPFLSILQHLLRIDPKEAVSDIVWDTAETLVHRATLLETREDAAKLLRAPSVQTKMVCACQHREAGSARKQSLQRALSPPPAPPAPPAPPAPPGSHAPLPPLPPAPPAPPAPPGPPRSGPPPPPPAPSTPTPPPPPVVDVKLPQQETPLPKTKMKTINWNKIPNSKIVGQNNIWSLVASSHKHSPKAELDWTEIEGLFCQQLQPPGSAGSSPRLGRSPVCDSSGERKPRKEPSEITLLDGKRSLNVNIFLKQFRSSNEEIIQMIREGAHDDIGAEKLRGLLKILPEIDECEMLKSFSGDVTKLGNAEKFLLQLIQLPNYRVRVEALLLKEEWSSTAGALETAVNALLVAGDDLMSSRAIQEVLYILLVAGNFLNAGGYAGGAAGVKLSSLQKLTDIRANKPGMNLMHYVAMQAERKNKELVHFADDIRVLEEASKASVEQLHNEIHTLANRIHTLKRDLHHTSEDIRLQSGDFLQVAEREVAALKKDMEEVEGMRKQLAEFFCEDPVSFKLEECFKTFVLFCTKFRSAVADNERRRTLEQQAAARNRNRNKMQKKTGDVLGNNDLGGSVCSTPVSESESLMDSLLLDIRNGLGRRSLRKPHEPSPSPDATPPGSLRRRSRASPDEDGLMEFLRHASPAADDHRERSAWGSLDRSWSRRAPVRARLELPDRERAPPSTPQPSTTPGDNPKPKEWRQKIESWLRANSEEERRAPSPSAAAPASSLAPARARSLRVLHRRSLENDSESERSTLDTLTEGGGAAAGERGWRPEVPTDTDLVKAIEVVEDVQPKTIENRIPWRKTEENEEIRRLRRQRSRPQMESQPLVAITEERRRLPDITVTRDTVPTDNSRRGEVDSENLETPPVQRKMFSPPPDREPCRRFLPAISNTNTNDKASPEMKDLCQEILGDGQFDRFSAARRTRRYKRSTETSSPEDEKKSASELVTETQVTRPATLEVQAAYPAEETKEVTRPSETRDDTENRLKRWQERLKNQSKDKTPAKDKVPYSRMRRQTSINQEDVQKAIRELKSPTQSPAGVWSRNAYRRSFNAKTDKVTPTAPAPAPAAATGSGTHDRTTSPRILKVKSEHELNDEGFEETQSLNSESASQGASSGCGVDCESPVPKKNTEIPKKIATKDTRPPPRTPLDPRRSLPRRPTSLRVERSASRASLRSSRSSLNSSASVATVKRAPTIKPVPKPIPKPAPRVPASRSSSSGSSIGTSRPKADKTSGFMRPTQASKVRGSAAKTSAK-