New model in OGS2.0 | DPOGS210430  |
---|---|
Genomic Position | scaffold758:- 99366-107002 |
CDS Length | 2577 |
Paired RNAseq reads   | 792 |
Single RNAseq reads   | 1904 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012653 (7e-06) |
Best Drosophila hit   | muskelin, isoform A (2e-78) |
Best Human hit | muskelin isoform 1 (9e-77) |
Best NR hit (blastp)   | PREDICTED: similar to Muskelin [Monodelphis domestica] (3e-165) |
Best NR hit (blastx)   | PREDICTED: similar to muskelin [Nasonia vitripennis] (3e-91) |
GeneOntology terms    | GO:0005515 protein binding; GO:0005737 cytoplasm |
InterPro families    | IPR010565 Muskelin, N-terminal; IPR011498 Kelch repeat type 2; IPR015915 Kelch-type beta propeller; IPR006594 LisH dimerisation motif; IPR008979 Galactose-binding domain-like |
Orthology group | MCL15317 |
Nucleotide sequence:
ATGGATAATAAATGTGAAACGGTTAAACTCTCTTACACCGTACATAGATATTCAAGTTAT
TCTGCCAATTATTTACCGGAAAATATTTTGATAAACAAACCGTCGGACCAGCTTTCGAGA
TGGTTTACAGATAGTTCGACACCCTGCCAGTATATATTGTTAAAACTGGAAAATCCAGCT
ATAGTGGAGTCAATAACATTTGGTAAATATATAAAGGCACATGTCAGTGATCTTAAAAAG
TTTCAAATACTTGGAGGCACAGATGAAAACAACTTGTCATTGTTATTAAAAGCAGGTTTA
AAAAAGGATAATAGTACGGAAACTTTCACATTAAGGCACAGAACCTCAGAAGGATTATTT
TTTCCAGTGCAATACATAAAAATTGTGCCCCTTCAGTCTTGGGGACCAGCATACAATTAC
ACTATATGGTATGTGGAATTAAATGGGAAAAATCAAGAGGATTTTGTAAATAATGCTCTT
GAAACTATCAGCTTGCGTAAAGAAGAAGAGGCAGTTCGAATACTATTGAAGCATCTAAGA
CGAAGACGCTACAAAGATGCATTTGAAGCCTTGACCCGCGAGAGTGGCGTTCAGTTAGAG
GGTCCCATGCAAGGAAGGCTATGGAATGCGCTCGTTGAAAACGGAGATTATGAACTTGCC
GAGCAAATATTTGAAGAGGCTGCTAACAGTGGTGAGTTGGATTGGTACCTGTCCTGGCAG
CCCTATACTCCGGAGTGGAAGCAGTTAGTGCCGTGTTCTAGTGTTGTGGAGGAGCGCCTG
TACTCGGGGGACACGTCTCCTCCCCCACACGCCAGGGCTCAAGATCTGTCCTGCGCAGAC
AGACAACCTCAGATAGATCCATTGGAGGCTAATGGTGTGGAGGCTGGTGGAGTGGATGGG
GCCGTGGGGGGTGCAGGGTGTGTAAGGGGGCCGGGGGGTGACGGTGAGAGTGCAGGGGGT
GGGGAGGCAGCGGCCTCCCCGTGTCTATTACGACCGGTGCCGCGTGGTGGACACCAGCTT
GTAGTGGATCCCAATACAGGTCTTTTATACCTATTCGGTGGTTGGAACGGCGAGGAGGAT
CTAGACGACCTGTGGAGTTTCGACCCTTCGACTGAGTCCTGGAGGCTGTTGTGTCTTCAC
AGCGGAGCAGTTGGCGGTCCTTCGCCTCGCTCCTGTCACAAGATGGTCTTTGATCCTGTA
CACGAGAAGCTGTACACACTCGGGCGGTATCTGGATAATGTGCAACGAGTTCCAGAGAAT
ATGAAGTGCGAGCTGTACTCGTACAGCGTCCGCGAGGGCGTCTGGCGAGTGTCGTGTGAG
GACACGGCGGCCGCGGGCGGTCCACGGCTGGTGTTCGACCATCAGATGTGTCTCGACGCC
AGCGAGCAGACCATATACGTGTTCGGAGGACGAGTACTGCCAGCCAACACTGAGGAGTTG
TCGAGCCCCCAGTACTCCGGTCTGTTCTCGTACCACATCGAGTCTAACGTGTGGAGGTCG
TTAGAGCCCGAGCCTCCCCACCCCGCGCCTCCTCACCGCGTCTCACACTCCATGCTGTTC
CATCCGGTCCAACGACGGCTGTACATGTTCGCTGGTCAGCGTAACAAGGAACAACTAGTG
GACATGTGGTGGTGGGACTGTGAGGAGGCGCCGCCACGGCCGCACGCCCTGTGCACCGCG
CCCCCTCGGGCTCCGCCCCCCCAGGGCTTCACACAGAGGGCGACCATTGACCCACACACC
GACGAGATACATGTACTGTCGGGTATGAGCAAGGAGAAGGACAAGTGCGTGTACAATACT
CTGTGGGTGTTTTCCCTACGACGTATGACCTGGAGTTGCTTGTATAGGAACGACTCAGTG
GAGCCCTCGGAGCCGCGGCCGAGGTTCGCGCACCAGTTCGTCTACGATCCCGTCAGGAAG
GTGCATTATCTGTTCGGTGGTAACCCTGGCCTGACGTCGAGTCCTCGCCTCCGCCTGGAC
GATCTGTGGTCTTTGCGTCTCCACCGCGCGGGCTGCGGCGGAGTGAGGGCGAAGGCTCGC
GCGGCGCTCAGGGAAGCTCGCTACAGGGAGCTCGCCGCCGCCCCTGAACGTGCGCCCGCC
GCTCTACACTACCTGAGACACGCGCTCAGTGAGGCCATCGACCACAGCAGCTCGCTTCAG
GTGGCGCATTTCCAGAAGCTAGCCACGGTACTGTTCTCAGGGAGGACGAGTCCAGACGAA
GATGAGCTCCTGTACGGATCGGAAGCGGCGGGTGCCGGGACTGGGAGACCTCTTGACGTT
GGTCGCGTGAGGACTCTGAGAAGGAGGAGGGCGGCCGCCAGGAGGGACGCTTACCACGCC
GACCTGCCGCGGGCCCTGCTCTCAGTGCTTGGCACGGACGAGCCGCTAGTGGAGCCTAGT
CCGTTCGAAGACGAGTGCCCGTCCCCCGAGCCGGCAGAGGAGCCCCGGGATTCCCGCTCC
AACGAGACCTGGGAGACTCGCGCCGCTCGCATACGTCTGTACGACCGTCTGTGCCGCTAC
TTCCGCCCGTCTGCGGTGCCGCCAGCTGACGACATTCAAAACCTCGTCCACTTGTAA
Protein sequence:
MDNKCETVKLSYTVHRYSSYSANYLPENILINKPSDQLSRWFTDSSTPCQYILLKLENPA
IVESITFGKYIKAHVSDLKKFQILGGTDENNLSLLLKAGLKKDNSTETFTLRHRTSEGLF
FPVQYIKIVPLQSWGPAYNYTIWYVELNGKNQEDFVNNALETISLRKEEEAVRILLKHLR
RRRYKDAFEALTRESGVQLEGPMQGRLWNALVENGDYELAEQIFEEAANSGELDWYLSWQ
PYTPEWKQLVPCSSVVEERLYSGDTSPPPHARAQDLSCADRQPQIDPLEANGVEAGGVDG
AVGGAGCVRGPGGDGESAGGGEAAASPCLLRPVPRGGHQLVVDPNTGLLYLFGGWNGEED
LDDLWSFDPSTESWRLLCLHSGAVGGPSPRSCHKMVFDPVHEKLYTLGRYLDNVQRVPEN
MKCELYSYSVREGVWRVSCEDTAAAGGPRLVFDHQMCLDASEQTIYVFGGRVLPANTEEL
SSPQYSGLFSYHIESNVWRSLEPEPPHPAPPHRVSHSMLFHPVQRRLYMFAGQRNKEQLV
DMWWWDCEEAPPRPHALCTAPPRAPPPQGFTQRATIDPHTDEIHVLSGMSKEKDKCVYNT
LWVFSLRRMTWSCLYRNDSVEPSEPRPRFAHQFVYDPVRKVHYLFGGNPGLTSSPRLRLD
DLWSLRLHRAGCGGVRAKARAALREARYRELAAAPERAPAALHYLRHALSEAIDHSSSLQ
VAHFQKLATVLFSGRTSPDEDELLYGSEAAGAGTGRPLDVGRVRTLRRRRAAARRDAYHA
DLPRALLSVLGTDEPLVEPSPFEDECPSPEPAEEPRDSRSNETWETRAARIRLYDRLCRY
FRPSAVPPADDIQNLVHL
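
The nucleotide and protein sequences above can be cross-checked against each other and against the listed CDS length. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene model and that the CDS ends with a single stop codon. The function names `translate` and `expected_protein_length` are illustrative and are not part of any database tooling. Only the first and last lines of the nucleotide sequence are used here; the last line starts in frame because the 42 preceding lines of 60 bases total 2520, a multiple of 3.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this table is appropriate for the nuclear gene model in this record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int) -> int:
    """Residue count for a CDS that ends with exactly one stop codon."""
    return cds_length // 3 - 1


# First and last lines of the nucleotide sequence listed in this record.
first_line = "ATGGATAATAAATGTGAAACGGTTAAACTCTCTTACACCGTACATAGATATTCAAGTTAT"
last_line = "TTCCGCCCGTCTGCGGTGCCGCCAGCTGACGACATTCAAAACCTCGTCCACTTGTAA"

print(translate(first_line))          # MDNKCETVKLSYTVHRYSSY
print(translate(last_line))           # FRPSAVPPADDIQNLVHL*
print(expected_protein_length(2577))  # 858
```

The translated fragments match the start and end of the protein sequence listed above, and the expected length of 858 residues agrees with the protein as shown (14 lines of 60 residues plus a final line of 18).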