DPGLEAN15673 in OGS1.0

New model in OGS2.0  DPOGS207120
Genomic Position  scaffold1:+ 3001018-3012328
CDS Length  1539
Paired RNAseq reads  2058
Single RNAseq reads  5723
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012489 (7e-123)
Best Drosophila hit  coro, isoform B (0.0)
Best Human hit  coronin-1C isoform 1 (1e-151)
Best NR hit (blastp)  GH19864 [Drosophila grimshawi] (0.0)
Best NR hit (blastx)  GH19864 [Drosophila grimshawi] (0.0)
GeneOntology terms
GO:0003779 actin binding
GO:0015629 actin cytoskeleton
InterPro families
IPR001680 WD40 repeat
IPR015505 Coronin
IPR015943 WD40/YVTN repeat-like-containing domain
IPR015049 Domain of unknown function DUF1900
IPR015048 Domain of unknown function DUF1899
IPR019781 WD40 repeat, subgroup
IPR019775 WD40 repeat, conserved site
IPR019782 WD40 repeat 2
IPR017986 WD40-repeat-containing domain
Orthology group  MCL10603

Nucleotide sequence:

ATGTCGTTCAGAGTGGTTCGTAGTTCAAAATTCCGCCATGTATATGGTCAGGCCCTTAAA
AGGGAACAGTGTTATGATAACATCAGAGTATCGAAAAGTTCGTGGGACTCTACGTTTTGT
GCTGTAAACCCCACTTTCCTGGCCATCATTGTAGAGTCAGCCGGCGGCGGAGCCTTCATA
GTTTTACCGCATAATAAGGTCGGCCGCATACCAGCAGATCATCCTCTAGTGGGAGGACAC
AAGGGTCCAGTGTTGGACATAGCGTGGTGCCCCCACAACGACAACGTCATCGCCAGCGGC
TCCGAGGACTGCGTTGTTAAGGTATGGCAAATACCGGACGGTGGACTGACCCGTACGTTG
ACGGAGCCTGTGGTGGATCTCGTGTATCATCAGCGACGCGTGGGCTTGGTGCTATGGCAT
CCAACCGCTCAGAACGTGTTGCTCACTGCTGGCTCCGATAACCAGATAGCGATCTGGAAT
GTTGGCACTGGCGAGGTCTTGCTGAGTCTGGACTGTCACCCTGACCTCATCTACTCCGCT
TGCTGGAACTGGACGGGCTCCAAGCTGCTCACCACCTGCCGTGACAAGAAGATTAGGATA
ATAGATCCTCGCAAGGGAGAAGTGGAATCAGAGGCCATAGCTCACGAAGGCAGCAAAGCG
TCCAGAGCAATCTTTTTAAAGCATGGACTGGTGTTTACCACTGGATTCAGTCGCATGTCC
GAGCGTCAGTACACTCTCCGTACACCGGACGCCCTCGGAGAACCTATCGTGACGGTGGAG
ATTGACACAAGTAACGGAGTCATGTTCCCACTCTACGATCCCGACACCAATCTCATCTAC
CTCTGCGGGAAGGGCGATTCGGTCATCAGATATTTTGAGGTCACCCCAGAGCCGCCTTTC
GTCCACTACATTAACACCTTCCAAACACCGGACCCACAGAGAGGTATTGGTATGATGCCC
AAGCGCGGCTGTGACGTAGCTACGTGCGAAATAGCGAAGTTTTACAGACTTAACAACTCT
GGTCTCTGTCAGGTGGTTTCGATGACCGTGCCGCGTAAGTCTGAGTTGTTCCAAGAGGAC
TTGTACCCTGACACATTGTCCGATGAAGCTTCATTGACGGCCGACGAGTGGCTCGCGGGT
GAAGACGCCGAACCCTGCACCATGTCGCTGAAGGGTGGTTACGTAGCGGGAAGGGCGCAC
AACCTCACCGTGACCAAGAGGAACGCGCTGGCGACCGCCAGGGATAAGGAAAAGGAGAAG
GAGAAGGAAAAGGATAAGGAGCCCGAGAGGAGCCCCACCCCGGGCCAGAGGGACACACCC
GCCACGCCGGCCGCCACCCCACCGCCAGCCTTCACCGCTATGGTGGAGAAACAACTATCG
GACCTGGTGGAAGAGATCCGTAAGCTGAAATCGGTTATAGTGAAGCAAGAGAACCGTATA
CGGGCACTAGAGGCTACGGTTAAGGGACAAGTGGCTGCAGCCACACCAGTACCCGCTGAT
CACAACCACGACGACAACATGGCGCCCGACGAGGTCTGA

Protein sequence:

MSFRVVRSSKFRHVYGQALKREQCYDNIRVSKSSWDSTFCAVNPTFLAIIVESAGGGAFI
VLPHNKVGRIPADHPLVGGHKGPVLDIAWCPHNDNVIASGSEDCVVKVWQIPDGGLTRTL
TEPVVDLVYHQRRVGLVLWHPTAQNVLLTAGSDNQIAIWNVGTGEVLLSLDCHPDLIYSA
CWNWTGSKLLTTCRDKKIRIIDPRKGEVESEAIAHEGSKASRAIFLKHGLVFTTGFSRMS
ERQYTLRTPDALGEPIVTVEIDTSNGVMFPLYDPDTNLIYLCGKGDSVIRYFEVTPEPPF
VHYINTFQTPDPQRGIGMMPKRGCDVATCEIAKFYRLNNSGLCQVVSMTVPRKSELFQED
LYPDTLSDEASLTADEWLAGEDAEPCTMSLKGGYVAGRAHNLTVTKRNALATARDKEKEK
EKEKDKEPERSPTPGQRDTPATPAATPPPAFTAMVEKQLSDLVEEIRKLKSVIVKQENRI
RALEATVKGQVAAATPVPADHNHDDNMAPDEV