DPGLEAN07817 in OGS1.0

New model in OGS2.0  DPOGS210430
Genomic Position  scaffold758:- 99366-107002
CDS Length  2577
Paired RNAseq reads  792
Single RNAseq reads  1904
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012653 (7e-06)
Best Drosophila hit  muskelin, isoform A (2e-78)
Best Human hit  muskelin isoform 1 (9e-77)
Best NR hit (blastp)  PREDICTED: similar to Muskelin [Monodelphis domestica] (3e-165)
Best NR hit (blastx)  PREDICTED: similar to muskelin [Nasonia vitripennis] (3e-91)
Gene Ontology terms
GO:0005515 protein binding
GO:0005737 cytoplasm
InterPro families
IPR010565 Muskelin, N-terminal
IPR011498 Kelch repeat type 2
IPR015915 Kelch-type beta propeller
IPR006594 LisH dimerisation motif
IPR008979 Galactose-binding domain-like
Orthology group  MCL15317

Nucleotide sequence:

ATGGATAATAAATGTGAAACGGTTAAACTCTCTTACACCGTACATAGATATTCAAGTTAT
TCTGCCAATTATTTACCGGAAAATATTTTGATAAACAAACCGTCGGACCAGCTTTCGAGA
TGGTTTACAGATAGTTCGACACCCTGCCAGTATATATTGTTAAAACTGGAAAATCCAGCT
ATAGTGGAGTCAATAACATTTGGTAAATATATAAAGGCACATGTCAGTGATCTTAAAAAG
TTTCAAATACTTGGAGGCACAGATGAAAACAACTTGTCATTGTTATTAAAAGCAGGTTTA
AAAAAGGATAATAGTACGGAAACTTTCACATTAAGGCACAGAACCTCAGAAGGATTATTT
TTTCCAGTGCAATACATAAAAATTGTGCCCCTTCAGTCTTGGGGACCAGCATACAATTAC
ACTATATGGTATGTGGAATTAAATGGGAAAAATCAAGAGGATTTTGTAAATAATGCTCTT
GAAACTATCAGCTTGCGTAAAGAAGAAGAGGCAGTTCGAATACTATTGAAGCATCTAAGA
CGAAGACGCTACAAAGATGCATTTGAAGCCTTGACCCGCGAGAGTGGCGTTCAGTTAGAG
GGTCCCATGCAAGGAAGGCTATGGAATGCGCTCGTTGAAAACGGAGATTATGAACTTGCC
GAGCAAATATTTGAAGAGGCTGCTAACAGTGGTGAGTTGGATTGGTACCTGTCCTGGCAG
CCCTATACTCCGGAGTGGAAGCAGTTAGTGCCGTGTTCTAGTGTTGTGGAGGAGCGCCTG
TACTCGGGGGACACGTCTCCTCCCCCACACGCCAGGGCTCAAGATCTGTCCTGCGCAGAC
AGACAACCTCAGATAGATCCATTGGAGGCTAATGGTGTGGAGGCTGGTGGAGTGGATGGG
GCCGTGGGGGGTGCAGGGTGTGTAAGGGGGCCGGGGGGTGACGGTGAGAGTGCAGGGGGT
GGGGAGGCAGCGGCCTCCCCGTGTCTATTACGACCGGTGCCGCGTGGTGGACACCAGCTT
GTAGTGGATCCCAATACAGGTCTTTTATACCTATTCGGTGGTTGGAACGGCGAGGAGGAT
CTAGACGACCTGTGGAGTTTCGACCCTTCGACTGAGTCCTGGAGGCTGTTGTGTCTTCAC
AGCGGAGCAGTTGGCGGTCCTTCGCCTCGCTCCTGTCACAAGATGGTCTTTGATCCTGTA
CACGAGAAGCTGTACACACTCGGGCGGTATCTGGATAATGTGCAACGAGTTCCAGAGAAT
ATGAAGTGCGAGCTGTACTCGTACAGCGTCCGCGAGGGCGTCTGGCGAGTGTCGTGTGAG
GACACGGCGGCCGCGGGCGGTCCACGGCTGGTGTTCGACCATCAGATGTGTCTCGACGCC
AGCGAGCAGACCATATACGTGTTCGGAGGACGAGTACTGCCAGCCAACACTGAGGAGTTG
TCGAGCCCCCAGTACTCCGGTCTGTTCTCGTACCACATCGAGTCTAACGTGTGGAGGTCG
TTAGAGCCCGAGCCTCCCCACCCCGCGCCTCCTCACCGCGTCTCACACTCCATGCTGTTC
CATCCGGTCCAACGACGGCTGTACATGTTCGCTGGTCAGCGTAACAAGGAACAACTAGTG
GACATGTGGTGGTGGGACTGTGAGGAGGCGCCGCCACGGCCGCACGCCCTGTGCACCGCG
CCCCCTCGGGCTCCGCCCCCCCAGGGCTTCACACAGAGGGCGACCATTGACCCACACACC
GACGAGATACATGTACTGTCGGGTATGAGCAAGGAGAAGGACAAGTGCGTGTACAATACT
CTGTGGGTGTTTTCCCTACGACGTATGACCTGGAGTTGCTTGTATAGGAACGACTCAGTG
GAGCCCTCGGAGCCGCGGCCGAGGTTCGCGCACCAGTTCGTCTACGATCCCGTCAGGAAG
GTGCATTATCTGTTCGGTGGTAACCCTGGCCTGACGTCGAGTCCTCGCCTCCGCCTGGAC
GATCTGTGGTCTTTGCGTCTCCACCGCGCGGGCTGCGGCGGAGTGAGGGCGAAGGCTCGC
GCGGCGCTCAGGGAAGCTCGCTACAGGGAGCTCGCCGCCGCCCCTGAACGTGCGCCCGCC
GCTCTACACTACCTGAGACACGCGCTCAGTGAGGCCATCGACCACAGCAGCTCGCTTCAG
GTGGCGCATTTCCAGAAGCTAGCCACGGTACTGTTCTCAGGGAGGACGAGTCCAGACGAA
GATGAGCTCCTGTACGGATCGGAAGCGGCGGGTGCCGGGACTGGGAGACCTCTTGACGTT
GGTCGCGTGAGGACTCTGAGAAGGAGGAGGGCGGCCGCCAGGAGGGACGCTTACCACGCC
GACCTGCCGCGGGCCCTGCTCTCAGTGCTTGGCACGGACGAGCCGCTAGTGGAGCCTAGT
CCGTTCGAAGACGAGTGCCCGTCCCCCGAGCCGGCAGAGGAGCCCCGGGATTCCCGCTCC
AACGAGACCTGGGAGACTCGCGCCGCTCGCATACGTCTGTACGACCGTCTGTGCCGCTAC
TTCCGCCCGTCTGCGGTGCCGCCAGCTGACGACATTCAAAACCTCGTCCACTTGTAA

Protein sequence:

MDNKCETVKLSYTVHRYSSYSANYLPENILINKPSDQLSRWFTDSSTPCQYILLKLENPA
IVESITFGKYIKAHVSDLKKFQILGGTDENNLSLLLKAGLKKDNSTETFTLRHRTSEGLF
FPVQYIKIVPLQSWGPAYNYTIWYVELNGKNQEDFVNNALETISLRKEEEAVRILLKHLR
RRRYKDAFEALTRESGVQLEGPMQGRLWNALVENGDYELAEQIFEEAANSGELDWYLSWQ
PYTPEWKQLVPCSSVVEERLYSGDTSPPPHARAQDLSCADRQPQIDPLEANGVEAGGVDG
AVGGAGCVRGPGGDGESAGGGEAAASPCLLRPVPRGGHQLVVDPNTGLLYLFGGWNGEED
LDDLWSFDPSTESWRLLCLHSGAVGGPSPRSCHKMVFDPVHEKLYTLGRYLDNVQRVPEN
MKCELYSYSVREGVWRVSCEDTAAAGGPRLVFDHQMCLDASEQTIYVFGGRVLPANTEEL
SSPQYSGLFSYHIESNVWRSLEPEPPHPAPPHRVSHSMLFHPVQRRLYMFAGQRNKEQLV
DMWWWDCEEAPPRPHALCTAPPRAPPPQGFTQRATIDPHTDEIHVLSGMSKEKDKCVYNT
LWVFSLRRMTWSCLYRNDSVEPSEPRPRFAHQFVYDPVRKVHYLFGGNPGLTSSPRLRLD
DLWSLRLHRAGCGGVRAKARAALREARYRELAAAPERAPAALHYLRHALSEAIDHSSSLQ
VAHFQKLATVLFSGRTSPDEDELLYGSEAAGAGTGRPLDVGRVRTLRRRRAAARRDAYHA
DLPRALLSVLGTDEPLVEPSPFEDECPSPEPAEEPRDSRSNETWETRAARIRLYDRLCRY
FRPSAVPPADDIQNLVHL
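
The nucleotide and protein sequences above can be checked against each other. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model. The names `translate`, `CODON_TABLE`, `cds_start` and `cds_end` are illustrative and not part of the database record. Only the first 21 nt and the final 57 nt of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this table is the right one for the gene model in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nt and final 57 nt of the nucleotide sequence in this record.
cds_start = "ATGGATAATAAATGTGAAACG"
cds_end = "TTCCGCCCGTCTGCGGTGCCGCCAGCTGACGACATTCAAAACCTCGTCCACTTGTAA"

print(translate(cds_start))
print(translate(cds_end))
# A 2577 nt CDS holds 859 codons: 858 amino acids plus one stop codon.
print(2577 // 3 - 1)
```

The two translated fragments correspond to the first seven and last eighteen residues of the protein sequence, and the length arithmetic agrees with the 858 residues listed.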