DPGLEAN22628 in OGS1.0

New model in OGS2.0DPOGS202081 
Genomic Positionscaffold1148:- 5965-18275
See gene structure
CDS Length4098
Paired RNAseq reads  1343
Single RNAseq reads  3374
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011306 (0.0)
Best Drosophila hit  rolling pebbles, isoform C (0.0)
Best Human hitprotein TANC2 (0.0)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC002665 [Tribolium castaneum] (0.0)
Best NR hit (blastx)  hypothetical protein TcasGA2_TC002665 [Tribolium castaneum] (0.0)
GeneOntology terms




  
GO:0007520 myoblast fusion
GO:0005515 protein binding
GO:0005737 cytoplasm
GO:0008270 zinc ion binding
GO:0005912 adherens junction
GO:0009986 cell surface
InterPro families


  
IPR020683 Ankyrin repeat-containing domain
IPR002110 Ankyrin repeat
IPR013026 Tetratricopeptide repeat-containing
IPR011990 Tetratricopeptide-like helical
Orthology groupMCL11267

Nucleotide sequence:

ATGCCCTTTGATAAGGGTAAAAAGAGAAAGTTAATTGACACATGCGGACATGAGCGCTGC
TATTCCTGTATGTTCAGAAACGAAGCCTGCCCAATTTGTGCAAGAAAAAGTCAGGGAAGA
CGTCCAGTTATGGAGAGATATACCCCTTCTCCACAGCGACAAGTGGATCATGAATGGCAA
TCACCGATGCGACTACCAAAACCACCGAAGCCTTCGAGTCTCGCTCAAAGCTGCCCCACA
CCCCCTCATACGAGAAGAAGATTCTTCCTTAGTCCTAAATCCTTGCGCAGTCCATTCGGC
CAGCGAAGCCGCCATTCTCACGAAAACCACGTGCCTCTATCAGGGTTACCAGAAGAAGGT
CCTAGGAACGCAGCCTGGACGAGCTTGGTGTTTAATAAGATAAGATCGTTGTGGTCAGCG
CAGTCCTCAGTGCCTCAAGGACTCAACCAATTGACAGGCACGGAAACACAATATGACGAA
GGAGGTCATATCAAACAAGGTTACGAGACAAGACGTCAAAATGACTTGTACATGCGGTTG
GGATTACTTCTTGGAGAGCGACGTGGATCCAGAAACAAATCCCGGGACAGTTGCACATCT
CTGGCCTCATTGGACGCTCATACTCTAGCCTCTCACAATACCAGTCCAGTGTCAACTCTA
ACTGGATCGTCAGAAGTGGATGCTGCGACACCACTTGGTAGGGATTCTTTAGGATCACTA
GCCTCAATGTCACTCTCTGCCGCCAGCAATTGTTCATCATCAAGTCCAGGAAGCAGACGG
CATTCTGTTAACACCTTGCAAAATGGACGAGAAGAATTGACACGGATGTCAAGTGGATTC
TTTAAGAACAGAAAAACAGCAGCACGGAGATCAGCTCGTGTCAACAGCAAACAGTCGTCA
TCTTCGTCAGAAATAAAGAAAGTTCATCCAACTCCGCAACTGACATTGAGACCACTATTC
TTCGAAGTGCCGGCAACCAATAACGACACTTGCTTTTCTGGACGACACTGGCTTATGAGA
GACATGGAAAAAGCTTTGGAATCTTCTTCACCTGGTATAATGATATCAGGCTGTCCAGGT
ACAGGCAAAACTGCTTTAATACTTCAGCTGGTCGAATATTCATGTTTCGGCCGCAAAAGG
AATTATCAGTATCAAGAATTAAGAGAGCAGTCAGATATTAGAGAAATGCTGCCAGAAGAA
ATAGCAGCAGGGATGATCACACAACTAGCATCACAAGTTGTTGCTTATCATTTCTGCCAA
GCAGACAACAACAGCACTTGCCTTGTTGGCGAATTTGTACATTCCCTGGCGGCCCAACTA
TGTCAAGCACCAAGACTACAGGCATATAGAGAATACCTACTTAGCGAACCACATTTGCTA
TCCTGCCTTTCATTAAAAGAATGTATAGCCGATCCAGACTTAGCCTTTATGAGAGGCATT
ATAGAACCTCTTATAATATTAAGAAGAAATGGAAGTATAGATTCAAGTAATAGTATTATA
CTTGTTGATGGGCTCTGTGAAGCTGAATATCACCGACCCGATCACGGTTATACTGTTGCT
TCATTTCTTATAAGACATGTACCAGAAATGCCAGCATGGCTTAAAGTTGTAGCCACCATA
AGAAGTCAATTTCTGGAACTAACAAAGCAACTACCATACACAAGGTTCAGTCTAAATGAA
TGTGACAATGTCCAAAAAGATCTATTGGAATATTTTAATGCCAGGGTACAAGCAGCCCCA
ATTATAGAAACAAATATTAAAAGTTCCACGGGGAAATCCGAAGGAGTTCATAATTCTGTC
ATGAAGTTTGCCCAATATGTTATTCATCTCAGTCAAGGGTCATTCCTGTTTCTAAAATTA
ATTTTAGACCTTCTTGAACGCAGTCATATAGTCGTAAAGTCGACTAACTACAAAGTTGTG
CCAATTTCGTTAGCTCAAATATTTTTGCTGCAATTCAATTTAAGATTCCCCACGGTACAA
TCTTTTGAAAAAGTAACCCACATTTTAAGTGTTTGTCTAGCAGCACTGTATCCTCTCACC
TTGGTAGAGATTTATTACTCTGTAAATTCTCTTCTTGTCGACACTTACTTGCCGTGGGAA
GAATTTTGTCACAGATTTGAAAGCCTATCCGATTTCTTGGTGAAAAGAATCGATAATACT
TACATGTTCTTCCACCCATCATTCAGAGAATGGTTAATACGACGCGATGATAATGAGAGT
CCAAAATTTCTATGTGACCTGCGGGCTGGTCACTGCGGTATTGCTTTTAGACTTGCTAGA
GTGCAAGCGCCTCTAGACCCAGAAAAGTCTATGGAACTCGGACACCACATTTTAAAAGCT
CATATGTACAGAAATATGGGACCAGCACAGTTAGGACTATGTCCGAGAGATTTACAAGCA
ATGATGGTAGCGTCGAGCTCTTCGAATGTAGGCGAAGCAGTAGCTAATTTACGTAACGTA
TATACTCCAAATGTAAAAGTATCGCGTCTCATGCTGCTGGCTGGTGGATCACCTAATCAA
ATTACTGATTGTCTTGGAAATGCTCCTCTATTATGTATGTATGCATATCAAGGAATTATA
TCAATGGTGGGATTACTGATTGAATTTGGAGCTGATTTAGAAATGACAAACTCGCAAGGA
TGTTCAGCTTTATCATTAGCTTGTCAGAGAGGTCACACCGATGTTGCGAGGATGTTGATA
GCATCAGGCGCATCTTTAAGTCACACTGATACAGCCGAACAAACACCTCTCGTCCACGCA
GCAAAGAATGGTCATAGAGATACAGTAATTTACCTGCTGGGTTGTCAAACTGGTAAAGAC
GATCGAAACTCAATAGAAATAGACGAAGGCAACATTGAACAACTAGTTCCCGGATCAAGA
CATGCTCTGATAGCGGCAGCTCAAAACGGTCATTTGGATATTGTCGAGTATCTTCTAGAT
ACAGCTGAATTAATCCCCGACGGTATTTGTCCAGTAACAGGTGAGACAGCACTGACAGCT
GCTTGCTCTACTGGTAACGCTGCCATCGCTGATGCTCTCCTAATTCGAGGAGCTACGCCA
TACTCATTAAATGCCAGACAAATGTCACCTTTGGCCCTAGCAGCTAAAAATGGCAGAACA
GCATTAGTTTTACGACTCCTGGATTCTGGAGCTGATGTTATGGGGTCGAGTGGGAAAATA
CCATTAATTTTAGCAGCTGCGGAGGGTCATTCAGATGTTGTTGAAATGCTTTTAGGTCAT
GGAGCTGATCCCAATGCTGTGGATGGTGATGGCATATCTGCTTTAGGTTGGGCAAGTCTG
AGATCTAGAATACCCACGGTAGTAATGCTTTTAGACAAAGGAGCAAATATAGAGCAAGCT
GACAGTAGCGGCCGTACACCGTTAGGACTAGCTTGCGGTGGACCGGCGGAGCTAGCGGAA
CTTCTTTTAGAACGTGGCGCATCACTAGAACGTGGAGACCACAGCGGCTTACGACCATTA
GATCGCGCCATCGGACAGAGGAATGTACCGATAGTAAATTGCTTTCTACGGAAAGGAGCG
AAACTCGGTCCAACGACATGGGTAATGGCGTCAGGAAAACCAGAATTTATGCTCATCCTA
CTCAACAAACTTCTGGAAGACGGTAACATTTTATACCGCAAGAACAGGCCGTCTGAAGCT
GCTCATAGATATCAATACGCCCTCAAGAAGATCTCTCCGCTCATCAGCGATGACGTCACC
AACGCCCAGGAACACTTGAACGTTTTCGTGCAGCTTAAAACCAATCTGCTGCTAAATTTA
TCGAGATGCAAACGAAAACTTAATGAACCATCAGAGGCTTTGGATTTAGCCGCCCGCGCG
TCCGTGTTACGTCCGAACGCTTTCGAATGTTCCTACGCCATGGCGAGAGCGATACTTGCT
CTGAACAAACCATCAGATGCTCTTCCTCATGCTAGACGAGCTTTACTCCTCGCTCCACAG
ACAGATCTATCAGCCATGAGAACCTTGAAAGCCCTTCAACAAGAAATTCTGACGCGTATT
AATGCCGGTACACAAAGTTTAAACGGTGACACACGATCTTTAAGAAATTTTGACAGCATT
AGTCTAAACATGCCTTAA

Protein sequence:

MPFDKGKKRKLIDTCGHERCYSCMFRNEACPICARKSQGRRPVMERYTPSPQRQVDHEWQ
SPMRLPKPPKPSSLAQSCPTPPHTRRRFFLSPKSLRSPFGQRSRHSHENHVPLSGLPEEG
PRNAAWTSLVFNKIRSLWSAQSSVPQGLNQLTGTETQYDEGGHIKQGYETRRQNDLYMRL
GLLLGERRGSRNKSRDSCTSLASLDAHTLASHNTSPVSTLTGSSEVDAATPLGRDSLGSL
ASMSLSAASNCSSSSPGSRRHSVNTLQNGREELTRMSSGFFKNRKTAARRSARVNSKQSS
SSSEIKKVHPTPQLTLRPLFFEVPATNNDTCFSGRHWLMRDMEKALESSSPGIMISGCPG
TGKTALILQLVEYSCFGRKRNYQYQELREQSDIREMLPEEIAAGMITQLASQVVAYHFCQ
ADNNSTCLVGEFVHSLAAQLCQAPRLQAYREYLLSEPHLLSCLSLKECIADPDLAFMRGI
IEPLIILRRNGSIDSSNSIILVDGLCEAEYHRPDHGYTVASFLIRHVPEMPAWLKVVATI
RSQFLELTKQLPYTRFSLNECDNVQKDLLEYFNARVQAAPIIETNIKSSTGKSEGVHNSV
MKFAQYVIHLSQGSFLFLKLILDLLERSHIVVKSTNYKVVPISLAQIFLLQFNLRFPTVQ
SFEKVTHILSVCLAALYPLTLVEIYYSVNSLLVDTYLPWEEFCHRFESLSDFLVKRIDNT
YMFFHPSFREWLIRRDDNESPKFLCDLRAGHCGIAFRLARVQAPLDPEKSMELGHHILKA
HMYRNMGPAQLGLCPRDLQAMMVASSSSNVGEAVANLRNVYTPNVKVSRLMLLAGGSPNQ
ITDCLGNAPLLCMYAYQGIISMVGLLIEFGADLEMTNSQGCSALSLACQRGHTDVARMLI
ASGASLSHTDTAEQTPLVHAAKNGHRDTVIYLLGCQTGKDDRNSIEIDEGNIEQLVPGSR
HALIAAAQNGHLDIVEYLLDTAELIPDGICPVTGETALTAACSTGNAAIADALLIRGATP
YSLNARQMSPLALAAKNGRTALVLRLLDSGADVMGSSGKIPLILAAAEGHSDVVEMLLGH
GADPNAVDGDGISALGWASLRSRIPTVVMLLDKGANIEQADSSGRTPLGLACGGPAELAE
LLLERGASLERGDHSGLRPLDRAIGQRNVPIVNCFLRKGAKLGPTTWVMASGKPEFMLIL
LNKLLEDGNILYRKNRPSEAAHRYQYALKKISPLISDDVTNAQEHLNVFVQLKTNLLLNL
SRCKRKLNEPSEALDLAARASVLRPNAFECSYAMARAILALNKPSDALPHARRALLLAPQ
TDLSAMRTLKALQQEILTRINAGTQSLNGDTRSLRNFDSISLNMP