New model in OGS2.0 | DPOGS200959  |
---|---|
Genomic Position | scaffold548:+ 12501-22058 |
CDS Length | 2268 |
Paired RNAseq reads   | 886 |
Single RNAseq reads   | 2228 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010165 (0.0) |
Best Drosophila hit   | CG2747, isoform C (0.0) |
Best Human hit | HEAT repeat-containing protein 5B (4e-163) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL012079 [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | AGAP002215-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms   | GO:0005488 binding |
InterPro families | IPR011989 Armadillo-like helical; IPR016024 Armadillo-type fold |
Orthology group | MCL10889 |

Nucleotide sequence:
ATGGAGGTGTCACATTCTTTGACTCTGAATGAGGCGGCTCTTCAGCAACTTCCCGACGAT
AAGAAACCGCACTTCATTTTTGAATGGCTTCGGTTCCTGGACAAAGTTCTTGTAGCTGCC
CATAAGTCTGATCTCAAAAACTGCCAGCAAAAGCTTGTCGCTCAGCTCGTTAATCTTTTC
CGTGAGAATCTCGGAGTGCCAGCCAGGAGGCTGCTCGCGAGATGTCTGGCCACACTCTTC
TCCGTCGGTGACACATTCCTGTTGTTTGAAACAGTCAATAAGTGCAATGACATCATAAAA
GTTAAGGATGACTCTCCATCGTACCTGCCAACTAGACTTGCTGCTATATGCTGTCTGGGC
AGCATGTATGAAAAGTTGGGCAGGATGATGGGACGTTCATATGAAGAAACTGTCACCACT
CTAGTGAGGTCCATGAAAAGTGCTGAGTCTCAGACCAGGCTGGAGATCATGATTACACTG
GACAAGATCTGTGTGGGTATGGGCACAGCAGCAGGCTCCGCCCTCCGTGAGGTGTCACGT
GCCGCTCGTTCCTCGGTCCAACACGAGAGGGCGCCAGTTGTACGAGCGGCAGCAGCGACA
GCATTGAAACACGTACTGCCGCTGATGAGACTCACTCCCGCTGATGTTGACGCTATTGCC
GCTGCTTGTCTAAGAGCGGCACAGACTGCTGATTATGCTTTGAGATGCTCTGTAGCAGAT
CTCCTAGGAGCTTTGGTAGCAGCCACTCAGGCCGGAGAAACAAATTCTATCAACACCAAA
ACTAAAATAGGGATGCAATCTGTTCACCAAAACAAGAAAGACCCGCCAAAGAATGTCATA
TCACTGGACGAGGTTTTAAATCTCCTAATGGGTGCCTTCCTACGAGGAGGCACCTCCTTC
CTTAAGGGTGAGATCATCAAGTCTGGCTCCGCTGTCAACAGGGAAGTCCGTGTTTGTGTT
ACGCATGCATACGTGATCTTTGTTCAGAACATGGGAGGTGTTTGGTTAGAGCGGAATTTG
ACAACTTTCTTGAGCCACGTGTTAGACCTGGTCGCAAACCCTAAGGCGGCCAGTTCTCAC
GTGGATGCTGTGTACTCCAGAAAATGTATCAATTATATATTAAAGAACACTCTGGGAAAG
ATGCTCGGAGAGAAAGCCCAAGCGTCCGCTTGTCGGGAAATAATACAGATTATAGCAAAG
CAGATGAACAGTATCGACTTCAACCCGGAAAATGCTAAGGATTGTAACCAGGAGACGCTG
TTCAGTCAACATCTGTTAGTGTGTGCGTTGACGCAGGCGGGCTCCTTGTGTTTGGCTCTG
GGCACAGCGCTACATAACCTCGTATCTGACCCAGGCTTGCACCTCATAGATACCATATTC
TCGGTGTTGGAACACCCGTGTGTAGCTGCGAGGCTGGCGGCGGCGTCGTGCGCGAGGTCG
CTGGGGCGCGCTCACCCGGCGCTGTTGACGCCGCTGTTGGACAGGTGTGCTGACGCCTTG
GACGTGCCGAGACCTACACCGCATAGGATATCCGGTTATTCTGCCGCGTACGCTGCCATC
CTGGGTGCCGTTCAATGGTCACCGCTCGGTGTGCCTCACGGTCGCGGTAAGGTCGCCTTC
AACGCGGCTGAACAACTACTAAGGTCGGCTGGGCAGAGCAGCCGGCTAACGGCCGCTAGA
ACTAACGCCGGGTGGCTTATAGTCGGTGCTATATGTACCCTGGGAGTTCCTGTTGTGCGT
GGCCTTCTTCCGCGGATGTTGTTACTGTGGAGGAACAGTTTCCCTCGATCAGCCAAAGAA
CTGGAGTCGGAAAAAGCCAGAGGCGATGCGTTTACTTGGCAGGTAACTCTGGAAGGTCGT
GCTGGTGCGTTGTCTGCTCTACATAGTTTGTTAATACACTGTCCGTCACTCGTCAACAAT
GAGGACACCGCCAAACGCCTCGCGCAGCCGATAGATGGCGCTATTGCTGTCTTGACCAAC
GTGGGTCAGGTTGTGCGTAATTATGGGGGGTCACTGAAAGCCCCCGCTGCCCTGCTCCGT
CTGCGTCTATACCAGGCGTGTGCTGCGCTGGGGTCGTTGTCAGCCCCCGCGGCTCCCCTA
CTAAGGTTATTGGCTGCCGAGCTGGCCGGCTCCGCTGACCCCGGAGGCGCTATCGTCGCA
ACTGGTATGTTAAGAAATGCGATGCATCCTCGCGATACGATCCTACTGGGTGAAGAAGCT
TGGATATATGAGACTGATCACGCTGAAATAGAAGAACAGGTTCGTTAA

Protein sequence:
MEVSHSLTLNEAALQQLPDDKKPHFIFEWLRFLDKVLVAAHKSDLKNCQQKLVAQLVNLF
RENLGVPARRLLARCLATLFSVGDTFLLFETVNKCNDIIKVKDDSPSYLPTRLAAICCLG
SMYEKLGRMMGRSYEETVTTLVRSMKSAESQTRLEIMITLDKICVGMGTAAGSALREVSR
AARSSVQHERAPVVRAAAATALKHVLPLMRLTPADVDAIAAACLRAAQTADYALRCSVAD
LLGALVAATQAGETNSINTKTKIGMQSVHQNKKDPPKNVISLDEVLNLLMGAFLRGGTSF
LKGEIIKSGSAVNREVRVCVTHAYVIFVQNMGGVWLERNLTTFLSHVLDLVANPKAASSH
VDAVYSRKCINYILKNTLGKMLGEKAQASACREIIQIIAKQMNSIDFNPENAKDCNQETL
FSQHLLVCALTQAGSLCLALGTALHNLVSDPGLHLIDTIFSVLEHPCVAARLAAASCARS
LGRAHPALLTPLLDRCADALDVPRPTPHRISGYSAAYAAILGAVQWSPLGVPHGRGKVAF
NAAEQLLRSAGQSSRLTAARTNAGWLIVGAICTLGVPVVRGLLPRMLLLWRNSFPRSAKE
LESEKARGDAFTWQVTLEGRAGALSALHSLLIHCPSLVNNEDTAKRLAQPIDGAIAVLTN
VGQVVRNYGGSLKAPAALLRLRLYQACAALGSLSAPAAPLLRLLAAELAGSADPGGAIVA
TGMLRNAMHPRDTILLGEEAWIYETDHAEIEEQVR