New model in OGS2.0 | DPOGS204963  |
---|---|
Genomic Position | scaffold2685:+ 3675-16343 |
CDS Length | 3657 |
Paired RNAseq reads   | 724 |
Single RNAseq reads   | 1944 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011132 (9e-164) |
Best Drosophila hit   | CG10011 (4e-76) |
Best Human hit | ankyrin repeat domain-containing protein 50 isoform 1 (1e-71) |
Best NR hit (blastp)   | hypothetical protein TcasGA2_TC015164 [Tribolium castaneum] (2e-158) |
Best NR hit (blastx)   | PREDICTED: similar to ankyrin repeat domain 50 [Apis mellifera] (2e-107) |
GeneOntology terms   | GO:0005488 binding |
InterPro families    | IPR002110 Ankyrin repeat; IPR020683 Ankyrin repeat-containing domain |
Orthology group | MCL12717 |
Nucleotide sequence:
ATGCCGATATACAGTCAGGTGGCGCCCAAGTATAGGGCAAAAAACGCTAAAGGCAACTCC
ACGCGGGAGAATAGGGGTAATCCGAGAGAATTCGACCCCCTGAATTACAACCACGGGGGC
AATATACAGCTTCTGAACGCGCCGATGGAGAGTAGAATTGAGAAACTAGAAAAGAGGGAA
TCGAGACGCAAAGAAGAGACCAGAGTAAGAAGAACACAGAACAGAAGTGACGCCAACGAG
GGGAAGGTCGAAGCGTATTTGAGGCAAAACCAGGTCAGCATAAGGCAGCAGCAGAACAGA
CATTCCGCTGTCAGACACGCCGAACACCGGGTTAATTTACGGTCACTCCCCCTGAACGAG
ATAACATCAAGGCAGGAGCGTACTCCGACCGGCCACCGGGTATTCCCCGAGAGAGAGGTC
TCCAGGGACAGTAGACGACAGAGAGACAGGGAAAGGGACTTCGATTTTGATGGATCCCCG
CTGAGTCCAATAGCTCGTGACAAGGACAGAGTGAAGAAGTCTTCAACACGGGTGTCGGAT
GTTAGTGACAAGAGGAGGTTCTTCTGTAGGGAGTGGGTGTTCCAGAAGATAGCGCACTGC
TTAGAACAACGAGCGGTCAGCAAAACTGCTGGAACCCTTATATTGGGTGACGCTGGCAGT
GGGAAGACTGCCTTATGTCAAGAGCTGAGCGCCCCGGGCACAGGTCCTCAGGCTAGACAA
CAGCGAGCGTTGAACAGGCGAATGCTCGCCAGACACTTCTTGCAGGGTCCAGGAGACTGC
AGCCAAAGACCTGGAGAATTCGTCCGTTCACTGGCGATGCAGATCCTTTCACATTCTGAA
CATGCAAAACCTGATGACAGATCAGATAGAAATTCCTTGGAAGAGAATTTTGTTCAGCGG
TTCCGTGATATGGGTGAGGATTGTGAGGAGAGCTCCAAACCACTGCTGGCTGATAGTGAA
GATCAACGCACAGACGAGGAGGAGACGAGCTCTAGACAGAGGACAGCGGACAGATTACAC
GAGAGAGAAGATTATACAGATTGTATTAATGACGCCGAGCTCAGTGGACATATGCTACCG
GAAATACTGCCGAAGGCGGATGTGAAGCCAACGAATCCGTTCGTCAGCGACCAGAATGAC
CTCCGTCTATACGAGAACCACGAAAACTTGTTCCTCCGCAACATATCACAGCGACAGTCC
AAAGAACTGCGGAATTCAAGACTTCTACGTCAAAGTTCTGAACCGCTTCAGGAGAAGAGA
CCTTCCGTGCTCCAGAAGAGTCTGTCGAACGATCAAGAAGAGAAGAAAGAGATGAACAGC
CCTCCAAAATCAAGAATACCAGTCGCCAACTTCCGGTATCCGAATAAAAGTGGCTTACGT
CCTGATAGTTCCCCGAAAAAGGATCCCAAAGAACTTCCCGAGTACCAGAACATAGTCCAC
GACGTTAAAGACGATCCGCAGACTGAGATGGAGCTACTTCTGGAGAAGAAGCGATCAGCA
TCTGAAGAGGAACCGCCTCCGATCCCCAGCCTGCCCGTCAATCCCAGGACTTTGATAGCC
AACGCCTACTACGAGAAGCTGCTATCGGAAACGGAAATCCAGCAGGCCTTGCTGCCCCAG
AACCTGGACAAGAATCCCGATGAGTGCTTCAAGAAGGCCATACTGTTTCCGTTACTGGAG
ATAGATCCACCTAAGCAGTGCTTGTTTTTACTTATTGACGCTATTGACGAGGGCGCCACT
AACGACGGCGATGGCAGCGAGGGCAGCGTGGCCGGGGTGGTCGGCCGTCACCAGCACCTG
CTACCGCACTGGCTGCTGCTGGTGGCCACTGCGAGGAGACACTCCCGCCTGGCTAGGGTG
TTCACCGGTTTCCGTAAGATAACCTTGGATGAGCTGTGTCGAGCCCATGTGGCCGCGGAC
GTCCAGCGCTACGTGTTGGCTCGCCTGGACAACGAACCTAGACTAAGGGCGCGCGTGTCC
AGCGACGCGGCGGCGGCGGCCTCGGCGGCGGCCGCCCTCGACCATCTCCGCATCAAGAGC
GACGGATGCTTACTGTACCTCGAGAAGGTGCTGGACGGTGTAGCCGACGGTTTTATAGCT
CTGCGTGAAATAAGAGAGATCCCAGGTACACTCAACGGACTGTACTTGTGGCTCGCGCAG
AGGCTGTTCCACGGACGGAGATTCAATAAGGTCCGGCTGGTGTTGGACGTGCTGCTGGCC
GCTCGCTGCGGTGTGACTGAGGACATGCTGTACAAGTGTCTCCTCACTAAGGAGTACAGC
GTCACCAGGGAGGACTTTAACCGACGGATGCATCTGTTGAGGAGGATAGTGTCCGTGGAC
CGCTCGACGGGCTTCGTGGCGATCTTCCACCGCTCCTTCTCCTCGTGGCTGGTGGACGTG
AAGCACTGCACCCGGCGCTACTTGTGCGACGTCTCCGCGGGTCACGCCGCCCTCGCCATG
CATTACACTCTAGAAGCCAGAAGACTGTCAGCTCTCGAGATCCATCACTACGTGTATCAC
ATGACGCAGCTGGAGCAACACCTGGCCTCGCTCAAGAAGGGGAAGCTCGGCTGTGAGCCG
GTGGAGCTTCATACTCTGGTGCTGCTCTGGGTGTTGGACTCCGGCTGCCAGGTGGAGGCG
GCGCTCCAACATGACAGAGGACAGATTGAGGAGAAAATCGAAGATAAAGATCAAGATCCG
GAGTCTGAGGGAAAGGAGTCGACTTCCTGTAAATCATTGGAACAGTCCGCTCTGGAGAAC
ATAATGCCGGAGCTGGTGAACGGCAGCACTCCCAGGTGGCCGAGGGACAGGAGGGTGATG
CGGGCCCTCATGGAGCTCAGCAGGACGGATTCGGTCCCCACGGAACCCGAGGAAGACGTC
AATGATCTGCTGTCCACTGAGAAGGCGCTGGAGAGTGAAGAGAACGCGACCGGGGACGAG
CACGATGAGGCGCTACTCCTGGATCCGGGAACTGTTCATGAGTTAGCAGCGAGAGGAGAT
GAAGACGCGTTATCAGTTTTATTGAAGCGTCGTCCTGAGCTGGCTCAGTCGGTGGACGCG
GCGGGGGCCACGGCCTTGCACGCCGCGGCCCGGGCGGGGCGCGCCGGGACCGCAGCCCTC
CTGCTCAAGGCGGGCGCTTCGCCCTCCGTAGCGGACGGCGACGGCTGGAGCCCGCTCAGA
GCTGCGGCCTGGGCTGGAGCTGGAGCCCGCTCAGGGCTGCGGCCTGGGCTGGCCACGTGG
AGGTTTTTATGTTTACTTGTGGTGGACGTGCTCCTGGAACATGGTTGCGACGTGGACTGC
GTGGACGCCGACAACAGGACCGCCTTGAGGGCTGCGGCGTGGTCCGGCCACGAGGCCGTG
GTGGCGCGGCTCTTGGCCGGGGGGGCGGAGGCCGAGCGCGCGGACGCCGAGGGCCGCACG
CCCCTAATGGCCGCCGCTTACATGGGACACGCGGACATCGTGAGATTGCTGCTGGACGCC
GGCGCCGGCACCGACCACGCCGATCATGACGGTCGTACGGCGCTGTCGGTGGCGGCTCTA
TGCAGAGCTGGTGGGGCGTGCGCCGCCTTACTCCTGGAGCGAGGAGCGGACCCCGGCCGA
GCGGACAGGGACCGCGCGACCCCCCTCTTAGTGGCCGCCTTTGAAGGACACACGTGA
Protein sequence:
MPIYSQVAPKYRAKNAKGNSTRENRGNPREFDPLNYNHGGNIQLLNAPMESRIEKLEKRE
SRRKEETRVRRTQNRSDANEGKVEAYLRQNQVSIRQQQNRHSAVRHAEHRVNLRSLPLNE
ITSRQERTPTGHRVFPEREVSRDSRRQRDRERDFDFDGSPLSPIARDKDRVKKSSTRVSD
VSDKRRFFCREWVFQKIAHCLEQRAVSKTAGTLILGDAGSGKTALCQELSAPGTGPQARQ
QRALNRRMLARHFLQGPGDCSQRPGEFVRSLAMQILSHSEHAKPDDRSDRNSLEENFVQR
FRDMGEDCEESSKPLLADSEDQRTDEEETSSRQRTADRLHEREDYTDCINDAELSGHMLP
EILPKADVKPTNPFVSDQNDLRLYENHENLFLRNISQRQSKELRNSRLLRQSSEPLQEKR
PSVLQKSLSNDQEEKKEMNSPPKSRIPVANFRYPNKSGLRPDSSPKKDPKELPEYQNIVH
DVKDDPQTEMELLLEKKRSASEEEPPPIPSLPVNPRTLIANAYYEKLLSETEIQQALLPQ
NLDKNPDECFKKAILFPLLEIDPPKQCLFLLIDAIDEGATNDGDGSEGSVAGVVGRHQHL
LPHWLLLVATARRHSRLARVFTGFRKITLDELCRAHVAADVQRYVLARLDNEPRLRARVS
SDAAAAASAAAALDHLRIKSDGCLLYLEKVLDGVADGFIALREIREIPGTLNGLYLWLAQ
RLFHGRRFNKVRLVLDVLLAARCGVTEDMLYKCLLTKEYSVTREDFNRRMHLLRRIVSVD
RSTGFVAIFHRSFSSWLVDVKHCTRRYLCDVSAGHAALAMHYTLEARRLSALEIHHYVYH
MTQLEQHLASLKKGKLGCEPVELHTLVLLWVLDSGCQVEAALQHDRGQIEEKIEDKDQDP
ESEGKESTSCKSLEQSALENIMPELVNGSTPRWPRDRRVMRALMELSRTDSVPTEPEEDV
NDLLSTEKALESEENATGDEHDEALLLDPGTVHELAARGDEDALSVLLKRRPELAQSVDA
AGATALHAAARAGRAGTAALLLKAGASPSVADGDGWSPLRAAAWAGAGARSGLRPGLATW
RFLCLLVVDVLLEHGCDVDCVDADNRTALRAAAWSGHEAVVARLLAGGAEAERADAEGRT
PLMAAAYMGHADIVRLLLDAGAGTDHADHDGRTALSVAALCRAGGACAALLLERGADPGR
ADRDRATPLLVAAFEGHT
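
Consistency check: the listed CDS length of 3657 nt includes the terminal TGA stop codon, so it should encode (3657 - 3) / 3 = 1218 residues, which matches the protein sequence above (20 lines of 60 residues plus a final line of 18). The arithmetic can be sketched as below. The function name `expected_protein_length` is hypothetical, chosen for illustration only, and is not part of any database tooling.

```python
def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of residues encoded by a CDS of the given length.

    Assumes the CDS is in frame from its first base; if includes_stop is
    True, the final codon is a stop codon and contributes no residue.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


# CDS Length from the record above
print(expected_protein_length(3657))  # 1218
```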