New model in OGS2.0 | DPOGS210234  |
---|---|
Genomic Position | scaffold42:9053-12576 (- strand) |
CDS Length | 1449 bp |
Paired RNAseq reads   | 271 |
Single RNAseq reads   | 724 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA002373 (1e-17) |
Best Drosophila hit   | CG32343, isoform C (5e-25) |
Best Human hit | GA-binding protein subunit beta-1 isoform beta 1 (1e-29) |
Best NR hit (blastp)   | AGAP006384-PA [Anopheles gambiae str. PEST] (3e-55) |
Best NR hit (blastx)   | AGAP006384-PA [Anopheles gambiae str. PEST] (1e-36) |
GeneOntology terms    | GO:0005575 cellular_component; GO:0003674 molecular_function; GO:0008150 biological_process |
InterPro families    | IPR020683 Ankyrin repeat-containing domain; IPR002110 Ankyrin repeat |
Orthology group | MCL16379 |
Nucleotide sequence:
ATGACTTCAGATCTCTGTGAAGAAGATGTAGTCGTTCGATTATCCTCACCTCATATAATA
TCATCAGAGACGATAGTACCAGCTGCCAGTAATGTTGGCGGTGGACGAATACAGACCGGA
GGAGTGGAGCTGGGTCGCAGACTGCTCTTAGCAGCCAGAGCAGGAGATACCGCTACTGTA
CTTGATCTCATGGCCAAAGGTGCACCATTTACCACTGACTGGCTGGGTACATCACCGCTG
CACCTGGCTGCTGCCAACAACCATGTGGAGACATGCGGTGTATTACTGAGGGCGGGTGTG
TCTCGGGATGCTCGGACTAAAGTTGAACGAACACCGCTGCACCTGGCCGCACATGCTGGG
CATGCCGCTGTAGTTGCACTGCTGCTCGACCATGGAGCTATGGTGGACTGTCGCGACATG
CTCCACATGACGCCGCTGCACTGGGCGAGTGCTCGAGGTCACGTGGCCGTGGTCCGCGAG
CTAGTGTGTCGCGGCGCGGATTTGCTCGCTCGCTGCAAGTTCAGGAAGACGCCGCGCTGC
CTCGCCGTCCGCGCCGGGGCCAGTGACGTCATGGCTGTCCTCGACCAAGCTGCCAAGGAA
CACGACCGACCCACAGTGACTGAGGAAACGCCAAAGATTCAACATTTTGAAACAATCCAA
AGACTACAGGAGGTCAGACAGCAGACCAAAACCAAGCCTCCGGAGAAGACTATCGTAATA
GAATCTAAGACTGAGCCGGCGTCGGGTCTGTCCGGGGCGGCGTTACTCCGCGCACACGGC
ATCACTCTCCTACCCCGGGACCGCGGCTCCACTGTACTCAGCGCACTGAGGAGCGGACGG
ACCGTCGTACTGTCCGATGCCGGGAAGCTGATGTTGAAGGAGAGCACCAACGCCCCGGTG
ATGGTCAGCGCCACCAGCGCCTCTGTGGACGCGAGCAACAACACAGCCAGCAACAGTCAG
TCAAGCTTGCCCACAACTAACATAGTGACCAGTTCAAACATCACCGACGCTAAAGGGGTC
ATGGTCCGAGCGAGGACTCTCAACACCATCAAGGGCGTCAAAGGCTTGCAAATGCTCTCC
GTCAACAGATCCGACCACACTGTTAAGAAGGTCATCAGTTCACATGACTTGCAGAAAGTT
AAATTACTCGGCGTGAAAGAGAACAAGTCACCCCGCCGTCCAGCTCTCAAGATCCTTCTC
AACAAAGCCAACCTCACACGACTACTAGCCAACACCACTAACGCTTCTACCACCAACAAC
ACACAGATATCGATCGAGCCTTCCGGCGAGCTGAGCGAGTCGCCGGTTCAAAGTGACGCG
GTGATGGAGGACGCGTCGGAATCGTCTCTGAGGGTTCAACTGCAACAAGCGCACGCCGCC
CTGGCCAGCCTGGCCGCAGAGTTACGACACTGTAAGGCTAAACTGGCCAAATACGAACAC
ACGCACTGA
Protein sequence:
MTSDLCEEDVVVRLSSPHIISSETIVPAASNVGGGRIQTGGVELGRRLLLAARAGDTATV
LDLMAKGAPFTTDWLGTSPLHLAAANNHVETCGVLLRAGVSRDARTKVERTPLHLAAHAG
HAAVVALLLDHGAMVDCRDMLHMTPLHWASARGHVAVVRELVCRGADLLARCKFRKTPRC
LAVRAGASDVMAVLDQAAKEHDRPTVTEETPKIQHFETIQRLQEVRQQTKTKPPEKTIVI
ESKTEPASGLSGAALLRAHGITLLPRDRGSTVLSALRSGRTVVLSDAGKLMLKESTNAPV
MVSATSASVDASNNTASNSQSSLPTTNIVTSSNITDAKGVMVRARTLNTIKGVKGLQMLS
VNRSDHTVKKVISSHDLQKVKLLGVKENKSPRRPALKILLNKANLTRLLANTTNASTTNN
TQISIEPSGELSESPVQSDAVMEDASESSLRVQLQQAHAALASLAAELRHCKAKLAKYEH
TH
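
The nucleotide and protein sequences above can be cross-checked against each other. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. It translates the first and last lines of the CDS and compares them with the ends of the listed protein.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGACTTCAGATCTCTGTGAAGAAGATGTAGTCGTTCGATTATCCTCACCTCATATAATA"
last_line = "ACGCACTGA"

print(translate(first_line))  # MTSDLCEEDVVVRLSSPHII
print(translate(last_line))   # TH*

# 1449 nucleotides = 483 codons = 482 residues plus one stop codon.
print(1449 // 3 - 1)          # 482
```

The translated fragments match the start (`MTSDLCEEDVVVRLSSPHII`) and end (`TH`) of the protein sequence, and the stated CDS length of 1449 corresponds to the 482 residues listed plus the terminal TGA stop codon.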