Monarch geneset OGS2.0

DPOGS207157
TranscriptDPOGS207157-TA3816 bp
ProteinDPOGS207157-PA1271 aa
Genomic positionDPSCF300001 + 4375929-4392887
RNAseq coverage251x (Rank: top 42%)
Annotation
HeliconiusHMEL0130410.058.97% 
BombyxBGIBMGA000615-TA0.072.22% 
DrosophilaFhos-PC1e-15957.37% 
EBI UniRef50UniRef50_E2ACZ90.049.29%FH1/FH2 domain-containing protein 3 (Fragment) n=5 Tax=Formicidae RepID=E2ACZ9_CAMFO
NCBI RefSeqXP_001956875.13e-17855.07%GF24355 [Drosophila ananassae]
NCBI nr blastpgi|3071808450.049.29%FH1/FH2 domain-containing protein 3 [Camponotus floridanus]
NCBI nr blastxgi|1892359780.050.27%PREDICTED: similar to CG32030 CG32030-PA [Tribolium castaneum]
Group
Gene OntologyGO:00037794.2e-42actin binding
GO:00160434.2e-42cellular component organization
GO:00300364.2e-42actin cytoskeleton organization
GO:00054889.8e-22binding
KEGG pathwayptr:4527527e-19 
 K05745 (DIAPH3, DRF3)maps-> Regulation of actin cytoskeleton
InterPro domain[738-1162] IPR0154251.9e-87Actin-binding FH2
[776-1233] IPR0031044.2e-42Actin-binding FH2/DRF autoregulatory
[179-426] IPR0160249.8e-22Armadillo-type fold
Orthology groupMCL13041 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207157-TA
ATGATTGAATTAGTGAAAGTTGGTGATAATAATAGAGTTAGCGGCGCGATGAGGGAAATAGGAGCGGACTCGTTCGTATGTCGCGTTCAGTACTTGAACGATTTGGATCCATTTATGGATTACAACGTAAGGGAGCCTCCCAGGCCGCTGTACCACACCTTCAACACGACCATCCCCCTATCATACCAAATAGCCGCTGTGCATCGTCTGCTGCAAGCGCCACACAGGCTGGACGATGCTACATTGCAAGTTTTCAAAGACGGTGACTACGGACCGTATTTGGATCTTGATTCGACTCTCGGAGAGCAAGATGAAGAACTGGAAGGATTACAAGAGAGAGTCGATAGAGCCACTATTGTGGGCGACATCAGCAGACAGGGATCGTTAGATTCAGGGATTGCGGACGTGGAGTATTCCCTCATCCCTCATCCCTTCATCCTGGATTCCGAAGATGACATCTCCGAGGACGACTTCCAGGAAGCTTTGGATTATATTGAAGACCGCAAAAATTCTTTGGTGCTACGTACGCAACTCTCTGTTCGAGTCCATGCTATTATTGAGAGGCTGCTCCACTCCCAGGGTCGGGAACTCCGGCGGGCATTATTCACCCTTAAGCAGATCCTGCAATCAGACAAGGATCTGGTACATGAATTCGTTGCTAATAAAGGCCTGGATTGCCTTATGCAAGTTGCCAATATGGCAGATCATAACTACTTGGATTATATATTGAGGGCTCTTGGACAGATTCTTCTCTATGTGGATGGAATGCATGGTGTGATGAATCACAAGCGCTGTATTCAATGGCTGTATTCACTCATATCTAGCAAATTAAGGCACGTCGTCAAGACAGCTCTGAAACTCCTCTTAGTTTTCGTTGAATATACCGAAAAGAACTGCCTTCTCTTTATTGAGGCAATCGTGGCCGTTGACACTTCCAACGCTAGGCAACCGTGGTACAACGTGATGAAAATTCTTCAGGACTTCGATGCTTCTGACACCGAATTGCTCATCTACGTGACAACTTTAATCAACAGATGTTTGAATAACATCCCCGATAGAGATCTGTATTATGATCAAGTTGATTCCCTGCAGGATCAGGGGATCGATGACATTATACAGTTATATATGTCCAAGCAAGGAACGGACCTAGATTTGTTACGTCAGTTGCAGATATTCGAAGCTGTACTTCTTTATGAGGACGGTGACGAAACGGGTACAGCCCTCAAGCAACTTGATGAGTCTGTGATAACATCTTTACGAAAGAGGAGTCATAATCTAAGCACAACAACAAGAAGAAAGTCAAAGAAACAGGAAGAGAGGGAAAAAGCAATCTCACCAGTCATGATGGAATCCAATATTATAACGAGTACTCCGAAAAGACCCCAGTACTTGCCTTCCATACTGGATACAAAAGAAAATAAAGAATTAAGTACCGCCCTAAAAAGAAGAAGGGACAGATACGCAAGACAAGCTCATCACATGATACAGCAGCAAGAATTGATGAAAAGCACAAGCCCTAACGGATACAGTTATAGCGATTTTGATAGTAGTGAATACAACTTTAGCAGTATAAGTCGTAGCAGTTACAGCTCGAATTCATTCATATCTAGCAAGTACCCTAATAGCTGTAGCACGATGAATGGAGACAGTAATTCAGTTAATGAGGAAGAAGACTTTAATAGGAAGAATCTACCCAGCCTTTTAGTTAACGGCAACCAAAGATTTACAACTGGTTTAAAGCAAAGAATACAAAATGGAACGCTAGGTCATAGTGTAAACACAGCGGACTCTCTCGTTGCAATGCGGCAGAAACAAATAGATGTTAATCCCGAGCCCAAGGATGACAGGGAGATACTGTTAAATAGAGAACACAGTATTAAGGATATCGCTCAAAGACTCACCAGCCCTTTGAATTCAGCCACAGACGAGAAACCTATCAGAGTATCTGATATGGCTGGGATTGTTTCAAAAGCCAAAGAGGAATTAGCCAAATCTAAGTCAAAAGAGATCATAAAAAGTCCTACTATAGAAAAATCCCCAAAGGTACATGAGATTAAAGTATCAGAAAACGATTTGCACTGGGAAGAATTAAAGAAAGGCTGTCTAAATCGTGAATTCCACTTGTGCGACTTGGACTTTTCCGATCTGCGACATGATTCCGACGACGAAATGGAATGTCATCAGCTTCAACGAGTGTGTGCTTCTAATGGACCACCACCTCCTCCTCCCAGTATGCTACCCCCTATGAACCTGCCCCCACCGCCGCCAACTAATTTCCCTACTTTGCCAAAAAATACATCAAATCCTCCAACTGAGACAGACTCAGCTAATTCCACTTTAAAGAAGAATAAAAAGACTGTGAAATTGTTTTGGAGAGAGATTCAGGAGGTACCTGTTCCGGCTCCAGTAAAAACCAAAATTGGAGGATCCATTTGGGACGATCTTCCCCAAGTCGCCTTAGATACAAACATGTTGGAACATCTTTTTGAATCTAGGTCTAATGATCTCATTATAAAGAAACTAATGGAACCAAAGAGAAATCTGATCTTGGACGCTAAGCGATCAAACGCTATCAATATAGCGATGAAAAAATTACCAACTCCGCAAACAATAAAGGCAGCCATTATGAAGATGGATGCCACTGTCATCGGAAGAGAGGGCATTGAGAAGTTACTGACCATGCTGCCTACTCAAGAAGAAAAAGTTAAGATTCAGGAAGCACAGTATGCGAACCCTGACCTTCCCCTGGGCAGTGCGGAGCAGTTCCTTCTCACCCTCGCTTCCATCAATGAGCTTTCTTCGAGGCTCAAGCTGTGGGTCTTCAAGCTGGACTTTGACAATTTGGAAAAAGAGATAGCCGAACCCCTAATGGACCTTAAGCAGGGTATTGAACTGCTTAAAGTAAACAAGACTTTTAAGGTAATCCTTGCAACCTTGAGGTCAGTTGGCAGTTTTCTAAATGGCACGCAAGTCAAGGGATTCCGACTGGATTATCTTTCAAAGGTTATGGAGGTCAAAGATACTGTTCACAAACATCCTCTATTATACCATATTTGCGAGATGATAATAGAGAAGTTCCCAGATACAACAGATTTCTTCAGCGAGGTTGGTCCCGTGATACGAGCTTCGAAAGTTGACTTCGAAGTCCTTAGTTCCAACCTTGTGAAACTAGAAGCAGACTGCAAAGCTTCCTGGGATCATATGAAACGTGTCGCTAAGCACGACAGCTCCCAAATTTTTAAGACCAAGATCAACGAATTCCTCACCGATGCCGCCGAAAGGATCATCTTGTTATCACTTATCAAGAAGAGGGTTATGAGCAGATACACCAAGTTTCTGATATACGTGGGGATGGCATCGGATGACATACTGCGTTCCAAGCCCTCAGAGCTGTTGAAGGTGATCTCGGAGTTTGCGTTGGAGTACCGGACCACCAGAGAGAGAGTGTTGCAGCAGTTAGAGAAGAGGGCCAATCATCGAGAGAGGAATAAGACTAGGGGGAAAATGATTATCGATATGGGCAACTACAGTGGGAAGTCGGGCGATGCCTGCGCGGACACCGCCCTCAAAGAGCTGTTGCGAGCTGATGGACCGGAGCGTCACATCAGACGGAACGCACACCCTGTGTTGAATGGCGACGCGACAGCTGACGAAGAAATCATAAAAAGCCTTGTACGTTGTCCAGCGTCCAAGAGACGTCCGCTCGCTAGGGACCGTCGCCGAAACAGGCTCGTCGATAGAAACTCCAGTTATCATCATGTTATTTATTGTAATGAAATCTCAGCATCAGTTAGCAGAGTTGAATTTAACGAATAA

Protein sequence:

>DPOGS207157-PA
MIELVKVGDNNRVSGAMREIGADSFVCRVQYLNDLDPFMDYNVREPPRPLYHTFNTTIPLSYQIAAVHRLLQAPHRLDDATLQVFKDGDYGPYLDLDSTLGEQDEELEGLQERVDRATIVGDISRQGSLDSGIADVEYSLIPHPFILDSEDDISEDDFQEALDYIEDRKNSLVLRTQLSVRVHAIIERLLHSQGRELRRALFTLKQILQSDKDLVHEFVANKGLDCLMQVANMADHNYLDYILRALGQILLYVDGMHGVMNHKRCIQWLYSLISSKLRHVVKTALKLLLVFVEYTEKNCLLFIEAIVAVDTSNARQPWYNVMKILQDFDASDTELLIYVTTLINRCLNNIPDRDLYYDQVDSLQDQGIDDIIQLYMSKQGTDLDLLRQLQIFEAVLLYEDGDETGTALKQLDESVITSLRKRSHNLSTTTRRKSKKQEEREKAISPVMMESNIITSTPKRPQYLPSILDTKENKELSTALKRRRDRYARQAHHMIQQQELMKSTSPNGYSYSDFDSSEYNFSSISRSSYSSNSFISSKYPNSCSTMNGDSNSVNEEEDFNRKNLPSLLVNGNQRFTTGLKQRIQNGTLGHSVNTADSLVAMRQKQIDVNPEPKDDREILLNREHSIKDIAQRLTSPLNSATDEKPIRVSDMAGIVSKAKEELAKSKSKEIIKSPTIEKSPKVHEIKVSENDLHWEELKKGCLNREFHLCDLDFSDLRHDSDDEMECHQLQRVCASNGPPPPPPSMLPPMNLPPPPPTNFPTLPKNTSNPPTETDSANSTLKKNKKTVKLFWREIQEVPVPAPVKTKIGGSIWDDLPQVALDTNMLEHLFESRSNDLIIKKLMEPKRNLILDAKRSNAINIAMKKLPTPQTIKAAIMKMDATVIGREGIEKLLTMLPTQEEKVKIQEAQYANPDLPLGSAEQFLLTLASINELSSRLKLWVFKLDFDNLEKEIAEPLMDLKQGIELLKVNKTFKVILATLRSVGSFLNGTQVKGFRLDYLSKVMEVKDTVHKHPLLYHICEMIIEKFPDTTDFFSEVGPVIRASKVDFEVLSSNLVKLEADCKASWDHMKRVAKHDSSQIFKTKINEFLTDAAERIILLSLIKKRVMSRYTKFLIYVGMASDDILRSKPSELLKVISEFALEYRTTRERVLQQLEKRANHRERNKTRGKMIIDMGNYSGKSGDACADTALKELLRADGPERHIRRNAHPVLNGDATADEEIIKSLVRCPASKRRPLARDRRRNRLVDRNSSYHHVIYCNEISASVSRVEFNE-