DPGLEAN22095 in OGS1.0

New model in OGS2.0  DPOGS210573
Genomic Position  scaffold3410:- 253-5862
CDS Length  1002
Paired RNAseq reads  125
Single RNAseq reads  747
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA009754 (3e-43)
Best Drosophila hit  ND
Best Human hit  thyroglobulin precursor (1e-06)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL009862 [Aedes aegypti] (2e-25)
Best NR hit (blastx)  AGAP007053-PA [Anopheles gambiae str. PEST] (2e-28)
GeneOntology terms
GO:0005179 hormone activity
GO:0005515 protein binding
GO:0005576 extracellular region
GO:0005615 extracellular space
GO:0005783 endoplasmic reticulum
GO:0005794 Golgi apparatus
GO:0006590 thyroid hormone generation
GO:0009268 response to pH
GO:0015705 iodide transport
GO:0031641 regulation of myelination
GO:0032403 protein complex binding
GO:0032496 response to lipopolysaccharide
GO:0042403 thyroid hormone metabolic process
GO:0042446 hormone biosynthetic process
GO:0045056 transcytosis
GO:0048471 perinuclear region of cytoplasm
GO:0051087 chaperone binding
InterPro families  IPR000716 Thyroglobulin type-1
Orthology group  MCL18144
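
The Genomic Position field in the table above can be split into its parts programmatically. The following is a minimal sketch, assuming the value has the form `scaffold:strand start-end`, that the `-` after the colon denotes the strand, and that the coordinates are 1-based and inclusive; none of this is confirmed by the record itself. The names `GenomicPosition` and `parse_position` are hypothetical helpers introduced for illustration.

```python
import re
from typing import NamedTuple


class GenomicPosition(NamedTuple):
    """Parsed form of a position string (hypothetical helper type)."""
    scaffold: str
    strand: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not
        # something the record states).
        return self.end - self.start + 1


# scaffold name, colon, strand sign, optional whitespace, start-end
_POSITION_RE = re.compile(r"^(\S+):([+-])\s*(\d+)-(\d+)$")


def parse_position(text: str) -> GenomicPosition:
    """Split a string such as 'scaffold3410:- 253-5862' into fields."""
    match = _POSITION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return GenomicPosition(scaffold, strand, int(start), int(end))


if __name__ == "__main__":
    pos = parse_position("scaffold3410:- 253-5862")
    print(pos.scaffold, pos.strand, pos.start, pos.end, pos.span)
```

Under these assumptions the locus spans 5610 bases on the scaffold, which is longer than the 1002-base CDS; the difference would be consistent with non-coding sequence inside the gene model.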

Nucleotide sequence:

ATGACCAAAGAAAGCGGTTTGAAATTGTTTAGGAAACGGTGGGGTTTTCTTAAGTCGCAG
GTCTATGCCAACAAGCCGCAAACCACCGATGCCCTCAAAGTCAACATACGCCACGCCATC
GATCAAATACAGCCCGATTTGTGCGCCAGAGTCATCGAAAATTGGACCTTTCGTGTGCGC
GCCACCAACCGAAGCCGTGGCGATAAAGGAGCTGTGTGTAGTATTGGAGGACCTGGTACA
GGCATGACTGTGGGCAGATGTGGGGAAGGTCTTACCTGTGACAACACAACCAGAGTGTGT
GTACGAATGAAGACGAAATGCCATGACGCCCAAGACGACTACGACGCTCGTGAAGCCCGC
TCCCAGACCGGTTTCACTGAAGTTCGGCCCGAATGTGATGATAAAGGAAAGTTTTTATCC
TACGTCTGCGTGCCTTCACAGACATGTTTCTGTCAATCCGAAGACGGCGAGAGGATTTTC
GGTGAAGTGGCCAACACCGGAAGCGTATCGATGCCTTGCGGATGTTCAAGAATGTTCCAC
AAAATCCAGAAAACCATTTCCAATAGCGTTCCCTACCCAGTCGTAACATTGCGATGTACA
TCCGACGGCAACTTCAACCCGGTGCAATGCTTTGACAGGAAATGTCATTGTGTTGATAAA
ATAACTGGGATCAAGACTGGCACAGAAGTCATAGATTTGGATGAGAAAGCGATTACAGAT
TTGCCATGTTATGAAGCTGATTTGGATCTTTTTCGTCCGAGAAACGTATCCCAGCGTCCA
TTCCAATACACGACGCCATGCTACGAGAGCGTCAACGAAAGGCGGCAACTGATAGACCAA
AGAAAATGCCCTTCATGGTCCTGTTTCCGTACAGAGTCACCGGTCGAGGTTGAGGAAAAC
TTCTTGTTGTCGCGCAATGATTTGCAGCGAGAGGTGTTGGGTCACCAGGTGATTGCACAA
AGAGGCCGTGGAGCAGAAAAGGGAGTCAGTCTCATCCGTTAA

Protein sequence:

MTKESGLKLFRKRWGFLKSQVYANKPQTTDALKVNIRHAIDQIQPDLCARVIENWTFRVR
ATNRSRGDKGAVCSIGGPGTGMTVGRCGEGLTCDNTTRVCVRMKTKCHDAQDDYDAREAR
SQTGFTEVRPECDDKGKFLSYVCVPSQTCFCQSEDGERIFGEVANTGSVSMPCGCSRMFH
KIQKTISNSVPYPVVTLRCTSDGNFNPVQCFDRKCHCVDKITGIKTGTEVIDLDEKAITD
LPCYEADLDLFRPRNVSQRPFQYTTPCYESVNERRQLIDQRKCPSWSCFRTESPVEVEEN
FLLSRNDLQREVLGHQVIAQRGRGAEKGVSLIR
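
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene model; `translate` and `CODON_TABLE` are hypothetical names introduced for illustration, and the function stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# at each position (third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    # Remove line breaks and other whitespace, then normalise case.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First six codons and last four codons of the CDS listed above.
    print(translate("ATGACCAAAGAAAGCGGT"))  # MTKESG
    print(translate("CTCATCCGTTAA"))        # LIR
```

The two fragments translate to `MTKESG` and `LIR`, matching the start and end of the protein sequence. The lengths are also consistent: a 1002-base CDS holds 334 codons, that is, 333 residues plus the terminal stop codon.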