DPGLEAN19578 in OGS1.0

New model in OGS2.0  DPOGS215226
Genomic Position  scaffold59:+ 21342-31008
CDS Length  4308
Paired RNAseq reads  2702
Single RNAseq reads  6456
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA008634 (7e-91)
Best Drosophila hit  formin 3, isoform B (6e-143)
Best Human hit  FH2 domain-containing protein 1 (4e-69)
Best NR hit (blastp)  PREDICTED: similar to formin 3 CG33556-PB [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (5e-180)
Gene Ontology terms
GO:0003779 actin binding
GO:0007424 open tracheal system development
GO:0030036 actin cytoskeleton organization
GO:0035147 branch fusion, open tracheal system
InterPro families
IPR014768 GTPase-binding/formin homology 3
IPR015425 Actin-binding FH2
IPR003104 Actin-binding FH2/DRF autoregulatory
IPR016024 Armadillo-type fold
IPR010472 Diaphanous FH3
Orthology group  MCL13090

Nucleotide sequence:

ATGGAGAGCCGTGTGGGTCTCGACTACATCGTAGAGCATGCGGAGTACGCCGGCAAACTC
GCGGCGGCGCTCATGACGCCAACAGCCGCTGTGAAAAAACAAGTCTTTGAACTTCTATCA
GCGCTCTGTGTGTACAACGCCGACGGATACGCCAGGGCAGTCGACACGCTCGACAGATAC
AAGACACTGAAGGGTGATCGCTACCGTCTGTCCGTGGTGGTAGAGGAACTGAAACAAGCT
ACCACCATCGACTACAAAACAGCGCTCGTGGCATTCGTCAACTGTCTGATAATATCCGCC
CCCCGGCTGCCGGACAGGATACGAGTGAGGAATGAGTTTATTGGACTCGGTTTGCTCTCT
ACGCTTAGCAGTTTAAGACACGAAGCTGCTTCTCATCCGAATCTCGGCGTTCAGCTGGAC
GTCTTCGAGGAGCAGAGAGAGAGTGACGAGGCTCACGGACCTGGAGGAATCAATCTCAAC
TCACACCTCGACGTCTTCTACGCTATACTCAAACAGGTATCAGACACCCCCCAAGAGATC
CCGTTCCTCAGTATATTGCAACACTTACTCCGAATAGACCCTAAGGAAGCTGTCAGTGAT
ATAGTGTGGGATACGGCTGAGACACTCGTACACAGAGCCACCCTGCTGGAGACGAGAGAA
GATGCTGCTAAACTACTACGAGCTCCGAGCGTTCAGACCAAAATGGTGTGCGCGTGTCAA
CACCGTGAGGCCGGCTCCGCGAGGAAGCAGAGTCTACAGCGAGCGCTGTCACCACCACCC
GCACCACCAGCTCCGCCAGCTCCGCCCGCGCCACCTGGATCTCACGCCCCCCTCCCTCCC
TTACCGCCCGCTCCCCCAGCTCCCCCTGCCCCGCCGGGTCCTCCCCGCTCGGGTCCCCCT
CCTCCTCCTCCCGCGCCGTCCACCCCCACCCCACCTCCGCCGCCAGTGGTGGACGTGAAG
CTGCCGCAGCAGGAGACTCCTCTACCGAAAACGAAAATGAAGACGATCAACTGGAACAAG
ATCCCCAACAGCAAAATCGTGGGCCAGAACAACATCTGGTCGCTGGTAGCGTCGAGTCAC
AAACACTCGCCTAAAGCAGAGCTGGACTGGACTGAGATCGAGGGGCTGTTTTGTCAACAG
ACTGAGAGAAGAATTTTAACTCTCCAGCTCCAGCCGCCAGGGTCCGCAGGCTCGTCCCCC
CGTCTCGGGCGCAGTCCCGTCTGCGACAGTTCAGGAGAACGAAAGCCCCGCAAGGAACCC
TCGGAGATCACACTCCTCGACGGGAAGCGCAGCCTCAACGTTAACATTTTCTTGAAGCAG
TTCCGCAGTTCCAACGAGGAGATCATACAAATGATCCGTGAAGGAGCTCACGACGACATC
GGCGCAGAGAAACTCCGCGGACTCTTAAAGATACTGCCCGAGATCGACGAGTGCGAGATG
CTGAAATCCTTCTCCGGTGACGTCACCAAGCTCGGGAACGCCGAGAAATTCCTCCTGCAA
CTCATCCAGCTGCCCAACTACCGTGTTCGCGTGGAGGCTCTCCTGCTGAAGGAGGAGTGG
TCGTCCACCGCAGGCGCACTGGAGACAGCGGTCAACGCGCTCCTGGTGGCGGGAGATGAC
CTCATGTCCTCGAGAGCCATACAGGAGGTGCTATACATCCTGCTGGTGGCGGGTAACTTC
CTGAACGCTGGAGGGTACGCGGGCGGAGCAGCCGGGGTCAAGCTGTCCTCGCTACAGAAG
CTCACCGACATCCGCGCCAACAAACCGGGGATGAATCTGATGCACTACGTCGCCATGCAA
GCTGAACGGAAGAACAAGGAGCTGGTGCACTTCGCGGATGACATACGAGTGCTGGAGGAG
GCCTCGAAGGCCAGCGTGGAACAGCTGCACAACGAAATACACACCCTCGCCAACAGGATC
CACACCTTGAAGAGAGATCTGCATCACACCAGCGAAGACATCCGCCTCCAGTCGGGGGAT
TTCCTCCAGGTAGCGGAACGCGAGGTGGCAGCCCTGAAAAAGGATATGGAGGAAGTGGAG
GGGATGAGGAAACAGCTCGCGGAGTTCTTCTGTGAGGATCCGGTGTCCTTTAAACTGGAA
GAGTGCTTCAAGACGTTCGTGTTATTCTGCACCAAGTTCCGTTCGGCCGTCGCAGACAAC
GAGAGACGGCGAACGCTGGAACAACAAGCGGCGGCCAGGAACAGGAACAGGAACAAGATG
CAGAAGAAGACCGGGGACGTGCTGGGGAACAACGACTTAGGCGGCAGTGTGTGCAGTACT
CCTGTGTCGGAGAGCGAGTCCCTGATGGATTCTCTGCTGCTGGACATCAGGAACGGTCTC
GGCAGACGGTCGCTAAGAAAACCCCACGAACCATCCCCGTCCCCCGACGCGACTCCACCC
GGTAGTCTCCGCCGTCGTTCCCGCGCCTCCCCGGACGAGGACGGGCTGATGGAGTTCCTC
CGCCACGCCTCCCCCGCGGCCGACGACCACAGGGAAAGAAGCGCCTGGGGAAGCCTCGAT
CGCTCCTGGTCTCGGCGGGCTCCGGTCCGCGCTCGTCTGGAGCTTCCGGACCGAGAGCGT
GCGCCCCCCTCAACCCCGCAACCCTCCACCACCCCCGGGGACAACCCCAAACCCAAGGAG
TGGCGTCAGAAGATCGAGTCCTGGCTTCGCGCGAACAGCGAGGAGGAGCGCCGGGCGCCG
TCCCCGAGCGCGGCTGCCCCGGCCTCGTCCCTGGCCCCGGCCCGGGCCCGCTCCCTCCGA
GTGCTGCATCGGCGCTCCTTGGAAAACGACTCCGAGAGCGAGCGCAGTACACTGGACACG
CTGACGGAGGGCGGCGGCGCGGCGGCGGGAGAGAGGGGCTGGAGACCGGAAGTGCCCACA
GACACGGACCTCGTGAAGGCCATCGAGGTCGTCGAAGATGTTCAGCCGAAAACAATAGAG
AACAGGATACCGTGGAGAAAGACGGAGGAGAACGAGGAGATACGGCGACTGAGGAGACAG
AGGTCACGACCGCAGATGGAGTCACAACCACTCGTGGCCATCACGGAGGAGAGGAGGAGG
CTGCCGGACATCACCGTCACCAGGGATACAGTACCGACAGACAACTCACGTAGAGGGGAG
GTCGATTCGGAGAATTTAGAGACCCCGCCCGTACAGAGGAAGATGTTCAGTCCGCCGCCT
GACAGAGAACCCTGCAGAAGATTCCTACCGGCGATATCCAACACCAATACCAACGATAAG
GCGTCCCCAGAGATGAAGGACCTCTGTCAGGAGATACTGGGCGATGGACAGTTCGACAGA
TTCTCAGCCGCGAGAAGAACGAGGCGGTACAAGAGAAGTACGGAGACCAGCTCCCCAGAA
GACGAGAAGAAGAGCGCCTCGGAGCTAGTGACGGAAACACAAGTCACAAGACCTGCGACA
CTAGAAGTACAGGCCGCGTACCCGGCGGAAGAAACGAAGGAGGTCACGAGACCCTCGGAG
ACCAGAGACGACACAGAGAACAGGTTGAAGCGGTGGCAGGAACGATTGAAAAACCAAAGC
AAAGACAAAACGCCAGCCAAAGACAAGGTGCCGTACTCGAGAATGAGGCGGCAGACCTCC
ATCAACCAAGAAGACGTCCAGAAAGCTATCAGGGAGCTGAAGTCTCCGACGCAGTCTCCC
GCGGGCGTGTGGTCCAGGAACGCTTACAGGAGGTCTTTCAACGCTAAAACGGACAAGGTA
ACGCCCACAGCCCCCGCTCCTGCCCCCGCCGCCGCCACCGGCAGCGGCACTCACGACAGA
ACAACGTCCCCGAGGATCTTGAAAGTGAAAAGCGAACACGAACTCAACGACGAGGGCTTC
GAGGAGACGCAGAGTCTGAACTCGGAGAGCGCCTCGCAGGGAGCGTCCTCGGGCTGCGGG
GTCGACTGCGAGTCACCCGTTCCCAAGAAAAACACAGAAATCCCAAAGAAAATAGCTACC
AAAGACACGAGGCCGCCTCCGCGGACGCCGCTCGACCCCAGGCGAAGTCTCCCGCGGAGG
CCGACCTCTCTGCGGGTAGAGCGCTCCGCGTCCAGGGCGTCTCTGAGGAGCTCCCGGAGT
TCCTTGAACAGCTCGGCGTCCGTGGCGACCGTCAAGCGAGCGCCCACCATCAAACCCGTC
CCCAAGCCGATCCCCAAACCGGCGCCCAGGGTACCGGCCTCCCGGTCCTCGTCCAGCGGC
AGCTCCATCGGCACCTCCAGGCCCAAGGCCGACAAGACGTCCGGCTTCATGAGGCCCACG
CAGGCCTCCAAGGTCCGAGGGTCCGCGGCGAAGACGAGCGCCAAGTGA

Protein sequence:

MESRVGLDYIVEHAEYAGKLAAALMTPTAAVKKQVFELLSALCVYNADGYARAVDTLDRY
KTLKGDRYRLSVVVEELKQATTIDYKTALVAFVNCLIISAPRLPDRIRVRNEFIGLGLLS
TLSSLRHEAASHPNLGVQLDVFEEQRESDEAHGPGGINLNSHLDVFYAILKQVSDTPQEI
PFLSILQHLLRIDPKEAVSDIVWDTAETLVHRATLLETREDAAKLLRAPSVQTKMVCACQ
HREAGSARKQSLQRALSPPPAPPAPPAPPAPPGSHAPLPPLPPAPPAPPAPPGPPRSGPP
PPPPAPSTPTPPPPPVVDVKLPQQETPLPKTKMKTINWNKIPNSKIVGQNNIWSLVASSH
KHSPKAELDWTEIEGLFCQQTERRILTLQLQPPGSAGSSPRLGRSPVCDSSGERKPRKEP
SEITLLDGKRSLNVNIFLKQFRSSNEEIIQMIREGAHDDIGAEKLRGLLKILPEIDECEM
LKSFSGDVTKLGNAEKFLLQLIQLPNYRVRVEALLLKEEWSSTAGALETAVNALLVAGDD
LMSSRAIQEVLYILLVAGNFLNAGGYAGGAAGVKLSSLQKLTDIRANKPGMNLMHYVAMQ
AERKNKELVHFADDIRVLEEASKASVEQLHNEIHTLANRIHTLKRDLHHTSEDIRLQSGD
FLQVAEREVAALKKDMEEVEGMRKQLAEFFCEDPVSFKLEECFKTFVLFCTKFRSAVADN
ERRRTLEQQAAARNRNRNKMQKKTGDVLGNNDLGGSVCSTPVSESESLMDSLLLDIRNGL
GRRSLRKPHEPSPSPDATPPGSLRRRSRASPDEDGLMEFLRHASPAADDHRERSAWGSLD
RSWSRRAPVRARLELPDRERAPPSTPQPSTTPGDNPKPKEWRQKIESWLRANSEEERRAP
SPSAAAPASSLAPARARSLRVLHRRSLENDSESERSTLDTLTEGGGAAAGERGWRPEVPT
DTDLVKAIEVVEDVQPKTIENRIPWRKTEENEEIRRLRRQRSRPQMESQPLVAITEERRR
LPDITVTRDTVPTDNSRRGEVDSENLETPPVQRKMFSPPPDREPCRRFLPAISNTNTNDK
ASPEMKDLCQEILGDGQFDRFSAARRTRRYKRSTETSSPEDEKKSASELVTETQVTRPAT
LEVQAAYPAEETKEVTRPSETRDDTENRLKRWQERLKNQSKDKTPAKDKVPYSRMRRQTS
INQEDVQKAIRELKSPTQSPAGVWSRNAYRRSFNAKTDKVTPTAPAPAPAAATGSGTHDR
TTSPRILKVKSEHELNDEGFEETQSLNSESASQGASSGCGVDCESPVPKKNTEIPKKIAT
KDTRPPPRTPLDPRRSLPRRPTSLRVERSASRASLRSSRSSLNSSASVATVKRAPTIKPV
PKPIPKPAPRVPASRSSSSGSSIGTSRPKADKTSGFMRPTQASKVRGSAAKTSAK
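
The CDS length of 4308 nt corresponds to 1436 codons, that is, the 1435 residues of the protein sequence plus a terminal stop codon. The sketch below shows one way to check that relationship and to spot-check the translation. It assumes the standard genetic code (NCBI translation table 1). The function name `translate` and the two fragments (the first and last lines of the nucleotide sequence above) are chosen for illustration and are not part of the original record.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record's CDS is nuclear and uses this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAGAGCCGTGTGGGTCTCGACTACATCGTAGAGCATGCGGAGTACGCCGGCAAACTC"
last_line = "CAGGCCTCCAAGGTCCGAGGGTCCGCGGCGAAGACGAGCGCCAAGTGA"

n_term = translate(first_line)   # should match the start of the protein
c_term = translate(last_line)    # should match the end, followed by '*'

# 4308 nt / 3 = 1436 codons; subtracting the stop codon gives the protein length.
protein_length = 4308 // 3 - 1
```

Both fragments are in frame because every full nucleotide line holds 60 bases, a multiple of 3. To check the whole record, join all nucleotide lines into one string and pass it to `translate`.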