New model in OGS2.0 | DPOGS204082  |
---|---|
Genomic Position | scaffold4047:3378-8086 (- strand) |
CDS Length | 1380 |
Paired RNAseq reads   | 62 |
Single RNAseq reads   | 197 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010817 (4e-62) |
Best Drosophila hit   | CG9400, isoform A (2e-14) |
Best Human hit | ND |
Best NR hit (blastp)   | gvag protein precursor [Anopheles funestus] (2e-20) |
Best NR hit (blastx)   | gvag protein precursor [Anopheles funestus] (5e-20) |
GeneOntology terms   | GO:0005576 extracellular region |
InterPro families    | IPR014044 CAP domain; IPR001283 Allergen V5/Tpx-1-related |
Orthology group | MCL21719 |
Nucleotide sequence:
ATGCGGAGTGACGACCAGACATATTGCACTTTGAGATATCGAAGGCTTTGTATGGGGAAG
GGATCACACGTCGCTTGTCAATTTCCTTTGGCGGGCGCGGGGGCTTCGTGTAGCAACTAC
ACTAAAATCAAATTCACTAATGTTTTAAAGCATTTCGTGACGAGTTATATAAACAGACGA
CGTCAGCGGATAGCGTCGGGTTCCGAACGTGTCCGCGGCGGTGCTCCTTTACCGCGACCG
GAAGTATGGGACAAAGAGCTAGCCTTTCTGGCCCAAAGGCTGGCAGACCAATGTAACTTC
GTTCATGATGATTGCCGAGCTACAGTTCGTTATCCTTACGCTGGTCAGAGCGTGGGTGAA
GTGCATTGGAGAGGTACAGAGGAGCTCAGCCTCCAACGAGCGATCAGACGCGTGTTGGAC
GCCTGGTGGGGGGAGAGGAGACGGGTCCAGCCGGAACAGCTCATAACCCCCTTCAGACTT
ACTAACAAAGGCAGTGTTTGGGGTCACTTCAGCCAATTGGCGGTGTGGTCTCTTAGGGCT
GTCGGTTGCGGTGCCGTCATCCACGGATGGGATTATACTCGCCTGTTGTTAGTCTGCGAC
TTCTCTCACACCAACATGTTGGGACAGAGGACCATATCCCCGGGACCTCTGGCCCCGTGT
CCGATACACACTGTGAGGAAACAAAGAAGTCCTTATCCTTTATTGTGTGCTCCCATTAAG
CGATCCTTAGACACCGAAAATGAAGAAAACGATCTTAACAACGATTATCAGAACACACCA
GACTATGACGGGATACGTGACACGGAAATAACAAAAAGATATTCTATGAACACGTATAAA
TATGACGAAACAACTAAAAAGAATATGCTGTCAATTCCAAGGAGATATACATGGCTGAAA
GATTCAGAAATAAGTGATCAGAAAACGTCTGATTTGCTGAAACACAGGATCAAACAAATG
AAATTGTGGAATAGTGTGAGGACGAGTATAAATTTGAGAAAGTATAACCGATGGGAGTCA
TGGCCGACACACGCAGACATGAGAGAAGATCAAGAGAACAAAGGGGAAGCAAAGCTGTTC
AGGGCTCACAAAAAATATAACGCGCATAAAGATAACTATCGATTCTCCGAGCATTCTAGA
AGAGTCACTACCGAGGCTGGATATCAATCACTTCAAGAGTTTACCTCAAGGCATAGATGG
AAACAAACAAGAATAAGACAAATGAGACCTGGAGCAAAGGCTTTATTGAATAAGCCGCTC
GGAAAACTTCCCCAAAAGCCAAGCATCGCGTCTTTGATGATTGACCAAGATGACGTCGAG
CAGCTCTACAGAGACACAGGGTTTCACCTCTACTTGAAGAAACAAACACGAATTGAATAA
Protein sequence:
MRSDDQTYCTLRYRRLCMGKGSHVACQFPLAGAGASCSNYTKIKFTNVLKHFVTSYINRR
RQRIASGSERVRGGAPLPRPEVWDKELAFLAQRLADQCNFVHDDCRATVRYPYAGQSVGE
VHWRGTEELSLQRAIRRVLDAWWGERRRVQPEQLITPFRLTNKGSVWGHFSQLAVWSLRA
VGCGAVIHGWDYTRLLLVCDFSHTNMLGQRTISPGPLAPCPIHTVRKQRSPYPLLCAPIK
RSLDTENEENDLNNDYQNTPDYDGIRDTEITKRYSMNTYKYDETTKKNMLSIPRRYTWLK
DSEISDQKTSDLLKHRIKQMKLWNSVRTSINLRKYNRWESWPTHADMREDQENKGEAKLF
RAHKKYNAHKDNYRFSEHSRRVTTEAGYQSLQEFTSRHRWKQTRIRQMRPGAKALLNKPL
GKLPQKPSIASLMIDQDDVEQLYRDTGFHLYLKKQTRIE
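
The record's CDS length and its two sequences can be cross-checked against each other. The sketch below is illustrative only and is not part of the annotation pipeline. It assumes the standard genetic code, and `translate` and `CODON_TABLE` are names made up for this example. It translates the first and last 60-nucleotide rows of the sequence above and derives the expected protein length from the reported CDS length of 1380.

```python
from itertools import product

# Standard genetic code (assumption: no alternative translation table applies).
# Codons are enumerated in T, C, A, G order with the third base varying fastest,
# which matches the conventional 64-letter amino acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last rows of the nucleotide sequence, copied from the record.
first_row = "ATGCGGAGTGACGACCAGACATATTGCACTTTGAGATATCGAAGGCTTTGTATGGGGAAG"
last_row = "CAGCTCTACAGAGACACAGGGTTTCACCTCTACTTGAAGAAACAAACACGAATTGAATAA"

print(translate(first_row))  # MRSDDQTYCTLRYRRLCMGK
print(translate(last_row))   # QLYRDTGFHLYLKKQTRIE*

# 1380 nt = 460 codons; the final codon is a stop, leaving 459 residues.
print(1380 // 3 - 1)         # 459
```

Both translated rows agree with the start and end of the protein sequence listed above, and the protein block contains 459 residues, which matches the CDS length once the terminal stop codon is excluded.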