New model in OGS2.0 | DPOGS210573  |
---|---|
Genomic Position | scaffold3410:253-5862 (- strand) |
CDS Length | 1002 bp |
Paired RNAseq reads   | 125 |
Single RNAseq reads   | 747 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA009754 (3e-43) |
Best Drosophila hit   | ND |
Best Human hit | thyroglobulin precursor (1e-06) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL009862 [Aedes aegypti] (2e-25) |
Best NR hit (blastx)   | AGAP007053-PA [Anopheles gambiae str. PEST] (2e-28) |
GeneOntology terms | GO:0005179 hormone activity; GO:0005515 protein binding; GO:0005576 extracellular region; GO:0005615 extracellular space; GO:0005783 endoplasmic reticulum; GO:0005794 Golgi apparatus; GO:0006590 thyroid hormone generation; GO:0009268 response to pH; GO:0015705 iodide transport; GO:0031641 regulation of myelination; GO:0032403 protein complex binding; GO:0032496 response to lipopolysaccharide; GO:0042403 thyroid hormone metabolic process; GO:0042446 hormone biosynthetic process; GO:0045056 transcytosis; GO:0048471 perinuclear region of cytoplasm; GO:0051087 chaperone binding |
InterPro families   | IPR000716 Thyroglobulin type-1 |
Orthology group | MCL18144 |
Nucleotide sequence:
ATGACCAAAGAAAGCGGTTTGAAATTGTTTAGGAAACGGTGGGGTTTTCTTAAGTCGCAG
GTCTATGCCAACAAGCCGCAAACCACCGATGCCCTCAAAGTCAACATACGCCACGCCATC
GATCAAATACAGCCCGATTTGTGCGCCAGAGTCATCGAAAATTGGACCTTTCGTGTGCGC
GCCACCAACCGAAGCCGTGGCGATAAAGGAGCTGTGTGTAGTATTGGAGGACCTGGTACA
GGCATGACTGTGGGCAGATGTGGGGAAGGTCTTACCTGTGACAACACAACCAGAGTGTGT
GTACGAATGAAGACGAAATGCCATGACGCCCAAGACGACTACGACGCTCGTGAAGCCCGC
TCCCAGACCGGTTTCACTGAAGTTCGGCCCGAATGTGATGATAAAGGAAAGTTTTTATCC
TACGTCTGCGTGCCTTCACAGACATGTTTCTGTCAATCCGAAGACGGCGAGAGGATTTTC
GGTGAAGTGGCCAACACCGGAAGCGTATCGATGCCTTGCGGATGTTCAAGAATGTTCCAC
AAAATCCAGAAAACCATTTCCAATAGCGTTCCCTACCCAGTCGTAACATTGCGATGTACA
TCCGACGGCAACTTCAACCCGGTGCAATGCTTTGACAGGAAATGTCATTGTGTTGATAAA
ATAACTGGGATCAAGACTGGCACAGAAGTCATAGATTTGGATGAGAAAGCGATTACAGAT
TTGCCATGTTATGAAGCTGATTTGGATCTTTTTCGTCCGAGAAACGTATCCCAGCGTCCA
TTCCAATACACGACGCCATGCTACGAGAGCGTCAACGAAAGGCGGCAACTGATAGACCAA
AGAAAATGCCCTTCATGGTCCTGTTTCCGTACAGAGTCACCGGTCGAGGTTGAGGAAAAC
TTCTTGTTGTCGCGCAATGATTTGCAGCGAGAGGTGTTGGGTCACCAGGTGATTGCACAA
AGAGGCCGTGGAGCAGAAAAGGGAGTCAGTCTCATCCGTTAA
Protein sequence:
MTKESGLKLFRKRWGFLKSQVYANKPQTTDALKVNIRHAIDQIQPDLCARVIENWTFRVR
ATNRSRGDKGAVCSIGGPGTGMTVGRCGEGLTCDNTTRVCVRMKTKCHDAQDDYDAREAR
SQTGFTEVRPECDDKGKFLSYVCVPSQTCFCQSEDGERIFGEVANTGSVSMPCGCSRMFH
KIQKTISNSVPYPVVTLRCTSDGNFNPVQCFDRKCHCVDKITGIKTGTEVIDLDEKAITD
LPCYEADLDLFRPRNVSQRPFQYTTPCYESVNERRQLIDQRKCPSWSCFRTESPVEVEEN
FLLSRNDLQREVLGHQVIAQRGRGAEKGVSLIR
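
The nucleotide and protein sequences above can be checked against each other: a 1002 bp CDS holds 334 codons, which is 333 residues plus a stop codon, matching the protein shown. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1). The function name `translate` is illustrative and is not part of the database.

```python
# Standard genetic code (NCBI translation table 1), assumed here.
# Codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, dropping one terminal stop codon."""
    # Remove line breaks and spaces so a pasted multi-line block works.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Unknown codons (e.g. containing N) are reported as X.
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First five codons of the record's CDS.
print(translate("ATGACCAAAGAAAGC"))  # MTKES
# Last three codons of the record's CDS, ending in the TAA stop.
print(translate("ATCCGTTAA"))  # IR
```

Pasting the full nucleotide block into `translate` and comparing the result with the protein block (with line breaks removed) gives a quick consistency check for the record.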