Monarch geneset OGS2.0

DPOGS214352
TranscriptDPOGS214352-TA1209 bp
ProteinDPOGS214352-PA402 aa
Genomic positionDPSCF300020 + 452130-453338
RNAseq coverage1784x (Rank: top 7%)
Annotation
HeliconiusHMEL0200380.099.75% 
BombyxBGIBMGA003972-TA0.098.51% 
DrosophilaSelD-PB0.088.69% 
EBI UniRef50UniRef50_P499031e-16773.14%Selenide, water dikinase 1 n=114 Tax=Eukaryota RepID=SPS1_HUMAN
NCBI RefSeqNP_001037388.10.098.26%selenophosphate synthetase 1 [Bombyx mori]
NCBI nr blastpgi|1129838620.098.26%selenophosphate synthetase 1 [Bombyx mori]
NCBI nr blastxgi|1129838620.098.26%selenophosphate synthetase 1 [Bombyx mori]
Group
Gene OntologyGO:00055241.6e-157ATP binding
GO:00047561.6e-157selenide, water dikinase activity
GO:00038243.8e-15catalytic activity
KEGG pathwaytca:6571120.0 
 K01008 (E2.7.9.3, selD)maps-> Selenoamino acid metabolism
InterPro domain[39-391] IPR0045361.6e-157Selenide water dikinase
[220-390] IPR0109184.1e-24AIR synthase-related protein, C-terminal
[69-207] IPR0161883.8e-15PurM, N-terminal-like
[95-190] IPR0007284.1e-10AIR synthase-related protein
Orthology groupMCL13442 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214352-TA
ATGTCGTATCAATCTAGTGTAGCGCAGGATTCTCTTTCTGCAGCACAACTAGAAATGGCCGGTAACCCCAATGCTTTGGCTCTACGTCGGCCATTCGACCCTGTGGCACACGATTTGGAAGCTAGTTTTCGTCTAACAAGATTTGCCGATTTAAAAGGAAGAAGCTGTAAAGTACCGCAAGATGTTTTATCTAAACTCGTTGAATCATTACAACAAGATTACTCCCAACAAGACCAGGATCAGTTTATGCATGTTGCGATCCCGCGTATTGGCATTGGCTTGGACTGTTCAGTTACACCGCTAAGACATGGTGGACTATGTTTGGTGCAAACTACTGACTTCTTTTATCCTTTAGTGGACGATCCATATATGATGGGAAAAATTGCCTGTGCAAACGTTCTTAGCGATTTATATGCTATGGGCGTGACTGAATGTGATAATATGCTGATGCTTCTTGGCGTATCAACTAAGATGACCGAGAAGGAACGAGATGTTGTTATTCCTCTTATTATGCGTGGCTTCAAGGATTCAGCGCTAGAAGCTGGTACGTCTGTGACTGGAGGTCAAACTGTCATTAACCCTTGGTGCACTATTGGAGGAGTAGCCACTACTATCTGTCAGCCCAACGAATATATTGTGCCCGACAATGCTGTTATGGGTGATGTCTTAGTTCTCACAAAGCCTCTTGGTACCCAGGTTGCTGTGAATGCTCATCAATGGCTCGATCAGCCTGAACGTTGGAATAGAATAAAATTAGTTGTGTCAGAAGAAGATGTTAGAAAGGCTTATCATCGTGCTATGGACTCTATGAGTCGGCTAAATCGTATTGCAGCTAGACTTATGCACAAATATAATGCCCACGGTTCAACTGATGTTACCGGATTTGGTCTTCTTGGCCACGCTCAGAACCTTGCCTCTCATCAGAAAAATGAAGTTTCATTTGTTATTCACAACTTACCAGTGATAGCCAAAATGGCAGCTGTTGCTAAGGCTTGCGGAAACATGTTCCAACTCTTACAGGGTCATGCACCTGAGACATCTGGAGGACTTTTAATCTGTTTGCCTCGTGAGCAAGCTGCAGCATACTGTAAGGATATTGAGAAGCAAGAGGGATACCAGGCATGGATTATTGGCATTGTTGAAAAGGGCAACCGTACAGCTAGAATTATTGACAAACCTCGAGTTATTGAAGTGCCAGCTAAAGATTAA

Protein sequence:

>DPOGS214352-PA
MSYQSSVAQDSLSAAQLEMAGNPNALALRRPFDPVAHDLEASFRLTRFADLKGRSCKVPQDVLSKLVESLQQDYSQQDQDQFMHVAIPRIGIGLDCSVTPLRHGGLCLVQTTDFFYPLVDDPYMMGKIACANVLSDLYAMGVTECDNMLMLLGVSTKMTEKERDVVIPLIMRGFKDSALEAGTSVTGGQTVINPWCTIGGVATTICQPNEYIVPDNAVMGDVLVLTKPLGTQVAVNAHQWLDQPERWNRIKLVVSEEDVRKAYHRAMDSMSRLNRIAARLMHKYNAHGSTDVTGFGLLGHAQNLASHQKNEVSFVIHNLPVIAKMAAVAKACGNMFQLLQGHAPETSGGLLICLPREQAAAYCKDIEKQEGYQAWIIGIVEKGNRTARIIDKPRVIEVPAKD-