Monarch geneset OGS2.0

DPOGS210128
TranscriptDPOGS210128-TA2487 bp
ProteinDPOGS210128-PA828 aa
Genomic positionDPSCF300017 + 1703687-1720512
RNAseq coverage180x (Rank: top 49%)
Annotation
HeliconiusHMEL0211442e-14170.99% 
BombyxBGIBMGA000234-TA0.080.42% 
DrosophilaPapss-PD4e-17063.86% 
EBI UniRef50UniRef50_O432524e-17263.68%Bifunctional 3'-phosphoadenosine 5'-phosphosulfate synthase 1 n=238 Tax=cellular organisms RepID=PAPS1_HUMAN
NCBI RefSeqXP_321893.40.066.89%AGAP001256-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583023280.066.89%AGAP001256-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479655942e-17866.89%AGAP001256-PC [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00001032.8e-106sulfate assimilation
GO:00047812.8e-106sulfate adenylyltransferase (ATP) activity
GO:00055241.3e-72ATP binding
GO:00163011.3e-72kinase activity
GO:00167721.3e-72transferase activity, transferring phosphorus-containing groups
KEGG pathwayaga:AgaP_AGAP0012560.0 
 K13811 (PAPSS)maps-> Purine metabolism
    Selenoamino acid metabolism
    Sulfur metabolism
InterPro domain[443-816] IPR0026502.8e-106Sulphate adenylyltransferase
[606-824] IPR0147292.5e-81Rossmann-like alpha/beta/alpha sandwich fold
[59-213] IPR0028911.3e-72Adenylylsulphate kinase, C-terminal
[420-591] IPR0159477.8e-46PUA-like domain
Orthology groupMCL11023 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210128-TA
ATGAGTTTGGAGTTGAGAAATAAAAAAATTGCAGAGACATTTGAGAAATTCAAAATTGCGCAATGGGGACACAACAATAACATCGCGTGTGCCCAAGTCGCTACAAATGTAGTTGAACAGAAGCATCAGGTGTCGAGAGCTAAGAGGAGCAAAGCTCTTGGAAGCCGTGCCTTTAGAGGCAGCACGATTTGGTTTACTGGACTCAGCGGCGCCGGCAAGACTAGTATAGCGTTTGCACTTGAAGCCTATCTCGTTTCTAAAGGTATACCAGCTTACGGTCTAGACGGAGACAACATCAGGACTGGTCTCAACAAGAACCTCGGCTTTTCTAAGGAAGACAGAGAAGAGAATATTCGTAGAGTCGCAGAAGTAGCTAAACTCTTCGCTGACAGCGGCGTCGTTTGCTTGTGCAGTTTTGTCTCGCCCTTTGCTGAGGACAGGGAGGTAGCTCGTCGCATTCACACTGACTCCGAGTTGCCGTTCTTCGAAGTGTTCATAGACACGCCGCTGGAAGTATGCGAACAGAGAGATACCAAGGGCCTCTACAAGAAGGCTAGGGAGGGACAGATTAAGGGCTTCACTGGCATAACTCAGGAGTATGAACGTCCTGAGGCTCCAGAGCTTGTCATTCAGACAGTTGGACGCTCCATCGAAGAGTCCACCATAGAAGTGGTGCGACTCCTCGAATCACAGGGTATTATACCACGCTACAATGAAAATGACTCAGGTGTTGAAGAGCTCTTCATTTACGGAAACAGACTTAGCAGTGCTAAGGAAGAGGCGGCCAGGTTGCCGCAAATAGAACTCTCATTTTTGGACTTGCAATGGGTTCAGGTGTTATCTGAAGGTTGGGCCTACCCTCTTAAAGGCTTTATGAGGGAATCCGAATATCTGCAAGCGCTACATTCCAACTGCTTTACACTACCAGATGGGACCTTGGTAAACCAATCTGTACCAATCGTGTTGCCAGTGGCCACGACCACTAAGGAGCGCCTCACTGGTTCCACGGCCATCGCATTGGTCCACGATGGCCGAACCATCGCCATTATGAGAAACCCCGAGTTCTACCCTCATAGGAAACAGGAGAGGTGCTGTCGGCAGTTCGGAATATATAACACAGGACATCCCTATATCAAAGGCTTCACTGGCATAACTCAGGAGTATGAACGTCCTGAGGCTCCAGAGCTTGTCATTCAGACAGTTGGACGCTCCATCGAAGAGTCCACCATAGAAGTGGTGCGACTCCTCGAATCACAGGGTATTATACCACGCTACAATGAAAATGACTCAGGTGTTGAAGAGCTCTTCATTTACGGAAACAGACTTAGCAGTGCTAAGGAAGAGGCGGCCAGGTTGCCGCAAATAGAACTCTCATTTTTGGACTTGCAATGGGTTCAGGTGTTATCTGAAGGTTGGGCCTACCCTCTTAAAGGTTTTATGAGGGAATCCGAATATTTGCAAGCGCTACATTCCAACTGCTTTACACTACCAGATGGGACCTTGGTAAACCAATCTGTACCAATCGTGTTGCCAGTGGCCACGACCACTAAGGAGCGCCTCACTGGTTCCACGGCCATCGCATTGGTCCACGATGGCCGAACCATCGCCATTATGAGAAACCCCGAGTTCTACCCTCATAGGAAACAGGAGAGGTGCTGTCGGCAGTTCGGAATATATAACACAGGACATCCCTATATCAAAATGATCGAGGAGTCTGGGGACTGGCTGGTGGGCGGTAACCTGGAAGTGTTCGAACGTATTCAGTGGAATGACGGCCTAGACTCTTACAGACTGACGCCCAACGAACTGAGGCAGAGGTTCAAGGACATGGATGCTGATGCTGTGTTTGCATTCCAGCTTCGTAACCCTATCCACAACGGCCACGCCCTCCTGATGCAAGACACTCAAAAACAACTCATCGAGAGAGGATACAAGAAACCAGTACTGCTATTACACCCCCTTGGCGGCTGGACTAAAGACGATGATGTTCCCCTGTCGGTGCGCGTGATACAACACAAGGCGGTCTTGAATGAACGAGTGCTGGACCCTGAACATACCGTGCTGGCGATCTTTCCATCTCCAATGATGTACGCCGGACCCACGGAGGTCCAATGGCATGCTAAGTGCCGTATGAACGCTGGCGCTAACCACTATATAGTGGGTCGTGACCCCGCTGGATTGCCGCACCCTAACGGCGGCGGTGACCTCTACGACCCCCGACACGGTGCTATCGTACTGGCAGCCGCACCCGGACTGGATGATCTTGAGATCATACCATTCCGAGTAGCAGCGTATGATTCATCCGTCGGGAAGATGGCATTCTTTGATCCCACTCGTAAGGAAGACTTCGACTTCATATCCGGCACCAGGATGAGGGGTCTTGCTAAAGCTGGAAAGGAGCCACCGAAAGGTTTCATGGCTCCCAGCGCCTGGAAGGTCCTCTCAGAATACTACCAGTCGCTTAAATCTAAAATGGAAACCAATTAA

Protein sequence:

>DPOGS210128-PA
MSLELRNKKIAETFEKFKIAQWGHNNNIACAQVATNVVEQKHQVSRAKRSKALGSRAFRGSTIWFTGLSGAGKTSIAFALEAYLVSKGIPAYGLDGDNIRTGLNKNLGFSKEDREENIRRVAEVAKLFADSGVVCLCSFVSPFAEDREVARRIHTDSELPFFEVFIDTPLEVCEQRDTKGLYKKAREGQIKGFTGITQEYERPEAPELVIQTVGRSIEESTIEVVRLLESQGIIPRYNENDSGVEELFIYGNRLSSAKEEAARLPQIELSFLDLQWVQVLSEGWAYPLKGFMRESEYLQALHSNCFTLPDGTLVNQSVPIVLPVATTTKERLTGSTAIALVHDGRTIAIMRNPEFYPHRKQERCCRQFGIYNTGHPYIKGFTGITQEYERPEAPELVIQTVGRSIEESTIEVVRLLESQGIIPRYNENDSGVEELFIYGNRLSSAKEEAARLPQIELSFLDLQWVQVLSEGWAYPLKGFMRESEYLQALHSNCFTLPDGTLVNQSVPIVLPVATTTKERLTGSTAIALVHDGRTIAIMRNPEFYPHRKQERCCRQFGIYNTGHPYIKMIEESGDWLVGGNLEVFERIQWNDGLDSYRLTPNELRQRFKDMDADAVFAFQLRNPIHNGHALLMQDTQKQLIERGYKKPVLLLHPLGGWTKDDDVPLSVRVIQHKAVLNERVLDPEHTVLAIFPSPMMYAGPTEVQWHAKCRMNAGANHYIVGRDPAGLPHPNGGGDLYDPRHGAIVLAAAPGLDDLEIIPFRVAAYDSSVGKMAFFDPTRKEDFDFISGTRMRGLAKAGKEPPKGFMAPSAWKVLSEYYQSLKSKMETN-