DPGLEAN16038 in OGS1.0

New model in OGS2.0DPOGS210128 
Genomic Positionscaffold3424:+ 1532-6415
See gene structure
CDS Length1350
Paired RNAseq reads  492
Single RNAseq reads  1345
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA000235 (2e-133)
Best Drosophila hit  PAPS synthetase, isoform B (7e-160)
Best Human hitbifunctional 3'-phosphoadenosine 5'-phosphosulfate synthase 1 (1e-161)
Best NR hit (blastp)  AGAP001256-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP001256-PA [Anopheles gambiae str. PEST] (2e-178)
GeneOntology terms




  
GO:0004781 sulfate adenylyltransferase (ATP) activity
GO:0000103 sulfate assimilation
GO:0016301 kinase activity
GO:0005524 ATP binding
GO:0016772 transferase activity, transferring phosphorus-containing groups
GO:0005575 cellular_component
InterPro families

  
IPR014729 Rossmann-like alpha/beta/alpha sandwich fold
IPR002650 Sulphate adenylyltransferase
IPR015947 Pseudouridine synthase/archaeosine transglycosylase-like
Orthology groupMCL10774

Nucleotide sequence:

GGCTTCACTGGCATAACTCAGGAGTATGAACGTCCTGAGGCTCCAGAGCTTGTCATTCAG
ACAGTTGGACGCTCCATCGAAGAGTCCACCATAGAAGTGGTGCGACTCCTCGAATCACAG
GGTATTATACCACGCTACAATGAAAATGACTCAGGTGTTGAAGAGCTCTTCATTTACGGA
AACAGACTTAGCAGTGCTAAGGAAGAGGCGGCCAGGTTGCCGCAAATAGAACTCTCATTT
TTGGACTTGCAATGGGTTCAGGTGTTATCTGAAGGTTGGGCCTACCCTCTTAAAGGTTTT
ATGAGGGAATCCGAATATTTGCAAGCGCTACATTCCAACTGCTTTACACTACCAGATGGG
ACCTTGGTAAACCAATCTGTACCAATCGTGTTGCCAGTGGCCACGACCACTAAGGAGCGC
CTCACTGGTTCCACGGCCATCGCATTGGTCCACGATGGCCGAACCATCGCCATTATGAGA
AACCCCGAGTTCTACCCTCATAGGAAACAGGAGAGGTGCTGTCGGCAGTTCGGAATATAT
AACACAGGACATCCCTATATCAAAATGATCGAGGAGTCTGGGGACTGGCTGGTGGGCGGT
AACCTGGAAGTGTTCGAACGTATTCAGTGGAATGACGGCCTAGACTCTTACAGACTGACG
CCCAACGAACTGAGGCAGAGGTTCAAGGACATGGATGCTGATGCTGTGTTTGCATTCCAG
CTTCGTAACCCTATCCACAACGGCCACGCCCTCCTGATGCAAGACACTCAAAAACAACTC
ATCGAGAGAGGATACAAGAAACCAGTACTGCTATTACACCCCCTTGGCGGCTGGACTAAA
GACGATGATGTTCCCCTGTCGGTGCGCGTGATACAACACAAGGCGGTCTTGAATGAACGA
GTGCTGGACCCTGAACATACCGTGCTGGCGATCTTTCCATCTCCAATGATGTACGCCGGA
CCCACGGAGGTCCAATGGCATGCTAAGTGCCGTATGAACGCTGGCGCTAACCACTATATA
GTGGGTCGTGACCCCGCTGGATTGCCGCACCCTAACGGCGGCGGTGACCTCTACGACCCC
CGACACGGTGCTATCGTACTGGCAGCCGCACCCGGACTGGATGATCTTGAGATCATACCA
TTCCGAGTAGCAGCGTATGATTCATCCGTCGGGAAGATGGCATTCTTTGATCCCACTCGT
AAGGAAGACTTCGACTTCATATCCGGCACCAGGATGAGGGGTCTTGCTAAAGCTGGAAAG
GAGCCACCGAAAGGTTTCATGGCTCCCAGCGCCTGGAAGGTCCTCTCAGAATACTACCAG
TCGCTTAAATCTAAAATGGAAACCAATTAA

Protein sequence:

GFTGITQEYERPEAPELVIQTVGRSIEESTIEVVRLLESQGIIPRYNENDSGVEELFIYG
NRLSSAKEEAARLPQIELSFLDLQWVQVLSEGWAYPLKGFMRESEYLQALHSNCFTLPDG
TLVNQSVPIVLPVATTTKERLTGSTAIALVHDGRTIAIMRNPEFYPHRKQERCCRQFGIY
NTGHPYIKMIEESGDWLVGGNLEVFERIQWNDGLDSYRLTPNELRQRFKDMDADAVFAFQ
LRNPIHNGHALLMQDTQKQLIERGYKKPVLLLHPLGGWTKDDDVPLSVRVIQHKAVLNER
VLDPEHTVLAIFPSPMMYAGPTEVQWHAKCRMNAGANHYIVGRDPAGLPHPNGGGDLYDP
RHGAIVLAAAPGLDDLEIIPFRVAAYDSSVGKMAFFDPTRKEDFDFISGTRMRGLAKAGK
EPPKGFMAPSAWKVLSEYYQSLKSKMETN