Monarch geneset OGS2.0

DPOGS203686
Transcript: DPOGS203686-TA; 2643 bp
Protein: DPOGS203686-PA; 880 aa
Genomic position: DPSCF300010 - 2075217-2082364
RNAseq coverage: 591x (Rank: top 22%)
Annotation
Heliconius: HMEL013320; 0.0; 82.27%
Bombyx: BGIBMGA003480-TA; 0.0; 77.33%
Drosophila: Ars2-PA; 2e-134; 62.23%
EBI UniRef50: UniRef50_E0VY01; 0.0; 49.01%; Arsenite-resistance protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0VY01_PEDHC
NCBI RefSeq: XP_002430995.1; 0.0; 49.01%; arsenite-resistance protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|383847249; 0.0; 49.27%; PREDICTED: serrate RNA effector molecule homolog [Megachile rotundata]
NCBI nr blastx: gi|157135683; 0.0; 52.77%; arsenite-resistance protein [Aedes aegypti]
Group
KEGG pathway 
InterPro domain: [675-824] IPR007042; 8e-68; Arsenite-resistance protein 2
InterPro domain: [148-261] IPR021933; 1.6e-37; Protein of unknown function DUF3546
Orthology group: MCL13278; Single-copy universal gene

Nucleotide sequence:

>DPOGS203686-TA
ATGGCTGATAGCGACGACGAGTATGACCGCAAGAGACGCGATAAATTCCGTGGTGAAAGAGGAGCAGCAGAAGGCAGCAGTTATCGAAACAGCGATAGACGAGAGGAACGAGGACGCGGTAGAGAGGAATGGTCAGAAAGGTCTCGTGGAGGTAGATCAGGACCCGACTACAGAGATTATAGAGGTGGAGCAAGTGCAAGTCGTGGGTATTCTCCAGTTAGAGGTGAAGGGCCTCCTAGCAAACGAATCAGACCAGACTGGCCAGTTGATGATAGAAGATATGGCGGTATGCCTCATGACTCATATGGCTCATATGGTTGGGCCCACGACCACTTTGGACCGCATCCTGCTCATCAAGGATATGGTCAACCAATGCCTCCTGTACCAGCCAGAGATGCTGTTCTGCCGATGGGACCAACTGATGGGCCACCTTCAATGATGTCATTCAAGGCTTTCCTGGCTGCTCAAGACGATGCTATCACCGTTGATGACGCTATACAGAAGTATAATGAATACAAACTTGAATTTAGGAGGCAGCAATTAAATGAATTTTTTGTAGCACACAAAGATGAGGAGTGGTTCAAGATAAAATACCATCCAGAGGAATCGGTGAAACGTAAAGAGGAGCAACTAGCTGCGCTTAAGAACCGACTTAACGTATTTCTTGAGCTCCTGGAACAACGCGAATTGGACAAAGTATCAGTGGATGTTGACAAGTCGGACAAGTTGATACGTTTACTTGACACTGTTGTTATTAAATTAGAAGGCGGAACAGAGGAAGATCTTAAAGCTCTCGACGAACCCAACCCAGCGGAAAATGCAAATGACAAACAAGATAAGAATGACACGAATAAGGCAATTGTAATTGAAGACGATGCGGTCAAAGAAATTAAGGATGAGAAAGACACCGAACAGAACAAGGATAAGATTAATTCAACTGAGGAAAAAAAGGAAGATTCTCCAAAAAAAACCGCACCGCTTACGATGGAGATCGACCCTCACCTCCGTCAGCTACAAGAACAGGCCAAACTTTTTTCTCGTTACAACAGTGTGCCTGGAACGGAATCGGAACAAGTTGTACCCGAGAAAGAACCTCTTTGGAAGATACGTTTTATGAATGATGTTCCTCACACAGCTCCGCCAGGTTCTTCATCGAGCTCATCATCATCGAGTTCATCGTCATCAAGTTCGGAAGACGAAGGGGAAACCGGAACGCGCAGGAAATCAAAATCCAAGTCTAAATCCAAAACTCCCGACAAGTCGCCGAAACAAAAGGAACGGACCGCGTCACCGAGCGCAGAGAAAGTCGTCGAAATAAAAGACAAGGAGGCGAGCAATGATAATAATGAAACTTCCATTGATGTGACGGAGAAGAAAGAATCCAGGGCTCTCCATAAAACAACTTCCATTTTCTTAAGAAATCTAGCTCCAACGATCACGAAGGCTGAAGTAGAAGCTATGTGTAAACGTTACGGTGGGTTCCTGCGCGTGGCGCTCGCAGACCCACTGCCCGAAAGACGATGGTTCCGCAGAGGTTGGGTCACGTTCCGACGAGAGGTCAACATCAAGGACATCTGCTGGAATCTTAATAATATAAGGCTTCGCGAGTGTGAGTTGGGTGCGATAGTAAACCGCGATCTTCAGCGTCGTATCCGAGCTGTCTCCGGCGTCACTTTGGAGCGAGCGGTGTTGCGGGCTGACGCTAGACTCGCGGCTAGACTCGCACACCATCTAGACACTAGGTCTAGACTGTGGGACGGACCTGGTGAAGATGGACCGCAGACTGAGAACTTCAGTTTGAGCTCCAAAAACCCGGTGCTACATAAGATAACGGAACATCTCATAGAAGAGGCTTCAACGGAGGAAGAAGAGTTGCTCGGTCTGGAGGCGTCGTCGGAGGCCGCGGCGCATGAACAACCGGATCCGGAACTCATCAAAGTGTTGGACCGCCTAGTGTTGTACCTTCGTATTGTACACTCTGTGGACTATTACAATCATTG
TGAATATCCATACGAGGACGAGATGCCAAACCGTTGTGGTATTATGCATGCGCGCTCCGGACCTCCTCCTAACAAGCCCACTCAGCAGGAGATCCAAGATTATATTAAAACTTTCGAAGGTAAAATGTCAGCTTTTCTGCAAGATGTCAAACCGCTGACAGACGAAGAGCTGCAGAAACTAGGAATTAAGGACTCCGAGGCAGAAGTAGAAAAGTTCATTCAAGCTAACACTCAAGAGCTGTCTCAAGACAAATGGTTGTGCCCACTCAGCGGTAAAAAGTTCAAGGGACCAGATTTCATAAGAAAGCACATCTTCAATAAACACGCTGAAAAGGTGGATGAGGTCCGCCGCGAGGTGTCGTACTTCAACGCGTACGTTAAAGACGTGCGACGTCCCCAACAACCCGAGCAACCGGCTCGCGCCGCGCCACAACCCGTGCACGCGCCGCCGGCACATCCATATAGTGGAGCTGGTGGAGCAGGCGGTGGTCGCGGCTGGGGATGGGGCGGCTGGGCACCACCCGCGCCTTACATGCCCAGACACCCGCGGTTCTCGAGACCCAGGGCCGGTGCCGCGGAGTTCCGTCCGGTGATACACTATCGCGACTTGGACGCGCCGCGGGAACCCGACGAGTTCATTTAA

Protein sequence:

>DPOGS203686-PA
MADSDDEYDRKRRDKFRGERGAAEGSSYRNSDRREERGRGREEWSERSRGGRSGPDYRDYRGGASASRGYSPVRGEGPPSKRIRPDWPVDDRRYGGMPHDSYGSYGWAHDHFGPHPAHQGYGQPMPPVPARDAVLPMGPTDGPPSMMSFKAFLAAQDDAITVDDAIQKYNEYKLEFRRQQLNEFFVAHKDEEWFKIKYHPEESVKRKEEQLAALKNRLNVFLELLEQRELDKVSVDVDKSDKLIRLLDTVVIKLEGGTEEDLKALDEPNPAENANDKQDKNDTNKAIVIEDDAVKEIKDEKDTEQNKDKINSTEEKKEDSPKKTAPLTMEIDPHLRQLQEQAKLFSRYNSVPGTESEQVVPEKEPLWKIRFMNDVPHTAPPGSSSSSSSSSSSSSSSEDEGETGTRRKSKSKSKSKTPDKSPKQKERTASPSAEKVVEIKDKEASNDNNETSIDVTEKKESRALHKTTSIFLRNLAPTITKAEVEAMCKRYGGFLRVALADPLPERRWFRRGWVTFRREVNIKDICWNLNNIRLRECELGAIVNRDLQRRIRAVSGVTLERAVLRADARLAARLAHHLDTRSRLWDGPGEDGPQTENFSLSSKNPVLHKITEHLIEEASTEEEELLGLEASSEAAAHEQPDPELIKVLDRLVLYLRIVHSVDYYNHCEYPYEDEMPNRCGIMHARSGPPPNKPTQQEIQDYIKTFEGKMSAFLQDVKPLTDEELQKLGIKDSEAEVEKFIQANTQELSQDKWLCPLSGKKFKGPDFIRKHIFNKHAEKVDEVRREVSYFNAYVKDVRRPQQPEQPARAAPQPVHAPPAHPYSGAGGAGGGRGWGWGGWAPPAPYMPRHPRFSRPRAGAAEFRPVIHYRDLDAPREPDEFI-