Monarch geneset OGS2.0

DPOGS212010
Transcript: DPOGS212010-TA, 1554 bp
Protein: DPOGS212010-PA, 517 aa
Genomic position: DPSCF300136 + 379427-411955
RNAseq coverage: 419x (Rank: top 29%)
Annotation
Source           Hit                  E-value  Identity  Description
Heliconius       HMEL005852           3e-135   69.55%
Bombyx           BGIBMGA004530-TA     1e-179   82.09%
Drosophila       CG33181-PG           9e-132   54.46%
EBI UniRef50     UniRef50_A8JV32      1e-129   54.46%    CG33181, isoform C n=21 Tax=Endopterygota RepID=A8JV32_DROME
NCBI RefSeq      XP_311188.4          3e-141   57.18%    AGAP000663-PA [Anopheles gambiae str. PEST]
NCBI nr blastp   gi|347964239         7e-140   57.18%    AGAP000663-PA [Anopheles gambiae str. PEST]
NCBI nr blastx   gi|347964239         1e-137   57.45%    AGAP000663-PA [Anopheles gambiae str. PEST]
Group
Gene Ontology    GO:0006812           1.2e-23            cation transport
                 GO:0008324           1.2e-23            cation transmembrane transporter activity
KEGG pathway
InterPro domain  [146-278] IPR006667  1.2e-23            MgtE magnesium transporter, integral membrane
Orthology group  MCL16935                                Insect specific

Nucleotide sequence:

>DPOGS212010-TA
ATGAAATCCAGCATTATAAAACTGACACTCGTTTCAATAGCCGAGAATTGCATGCAGCTGAAAGGCATGGTTGGAGCAGCCGATGTCGTCAAGCCAGCCGATAAAGTTGAATTCTATTCACCCCAGGTCAAAGAAACACCGATCTATACTATCAGCGATGATATTGGTAGTCCGGATAGTACATTCACAATGTCGTCAATAAATTCATCCGGGGACGTCAAAAGTCCTGGGGATCCACCGGAGAAGAAATTCGCTGAATCCGATCCAGAGAAGCAACCTTTATCCGGGAAAAAGGAAGAGAGATGGTGGACAACCCTGATGCAGATAGCTGTTCCATTCTTCATAGCCGGTTGTGGAACGATAGGATCCGGCTTGGTGTTGGGTAGTGTTAAAAACTGGGAGGTATTTCTGCAAGTCAAAGCTATTTTCGTGTTAGTTCCATCCCTGTCCGGATTAAAAGGTAACTTGGACATGTGTCTCGCTTCACGGCTCTCAACTCAGGCTAACCTTGGTAATATGAGATTACCACGAGAAGTTATATCAATGGTCGTCGGCAATATATCGCTGGTGCAGGTTCAAGCTATAGTAGCGGCGACGGTGGTGTCTATGTTCGCTGTGGTGGTCAACACAATAACAGACAGACAGTTTAATGGCAGCTACGTACTGTTATTGATCGCTTCAGCTGTCTTCACTGCTACCACCACTTGTTTCGTTTTAGATTTCGTTATGGTAATAGTGATATTCGCCTCACAAAAGTTCAAGGTGAATCCCGACAATGTGGCGACACCTTTAGCCGCGTCAATAGGCGATATCGTTAGTAACTCAGTATTGGCTGTCACAGCTGCATATATGTTCGAACAGATAAAGATATCGCTATGGCAACCGATTGCACTTCTATGTGTATACTACAGTCTTCTTCCCATTTGGGTGTTTATTGGTTGGAAGAATAAATATACGAAGGCCGTCCTCACCACTGGATGGACCCCCGTTATATCCGCCCTGTTTATTAGTGGTGTTGGTGGTATCGTTCTGGACCAAGCTGTGGATCAATATCCGGGATACGAAGTTTTCCAACCGATAGTGAACGGTATCGGTGGCAACTTAGTTTGTGTTCAATCATCACGTCTAGCAACACACTTACACCAGACAGCCATACCCGGCATACTACCGGAGAATACGAGGATCATCGAATGGCCGTGGAGGACGTTGTTCTACGGAACTCCTGCTGCAAAAATTGCGAGAATGCTCCTAATACTGGCCGTTTGTGGGCAAATAGTGTTCATGATTGTCGCCGATGTCGTGTTCAGAGGATTTGTCTCGCTGCAGTTTGTCTTCGGTGTCACTTACGTGTTATGTTCAGTTATTCAAGTTATGGTTTTACTCTACCTTTCTTACATCCTAATACACTTTATGTGGAAGAAGAAGAAGGATCCAGACAACGCCGCCATTCCGTACTTGACGGCTCTCGGCGATCTCTTGGGTTCTGTTTTCCTAGGTCTAGCGTTCTTAATTCTATCAATATTCGGCATGGAATACGGTAATAACGTACAATAA

Protein sequence:

>DPOGS212010-PA
MKSSIIKLTLVSIAENCMQLKGMVGAADVVKPADKVEFYSPQVKETPIYTISDDIGSPDSTFTMSSINSSGDVKSPGDPPEKKFAESDPEKQPLSGKKEERWWTTLMQIAVPFFIAGCGTIGSGLVLGSVKNWEVFLQVKAIFVLVPSLSGLKGNLDMCLASRLSTQANLGNMRLPREVISMVVGNISLVQVQAIVAATVVSMFAVVVNTITDRQFNGSYVLLLIASAVFTATTTCFVLDFVMVIVIFASQKFKVNPDNVATPLAASIGDIVSNSVLAVTAAYMFEQIKISLWQPIALLCVYYSLLPIWVFIGWKNKYTKAVLTTGWTPVISALFISGVGGIVLDQAVDQYPGYEVFQPIVNGIGGNLVCVQSSRLATHLHQTAIPGILPENTRIIEWPWRTLFYGTPAAKIARMLLILAVCGQIVFMIVADVVFRGFVSLQFVFGVTYVLCSVIQVMVLLYLSYILIHFMWKKKKDPDNAAIPYLTALGDLLGSVFLGLAFLILSIFGMEYGNNVQ-
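
The transcript and protein records above can be cross-checked against each other: a 517 aa protein plus one stop codon corresponds to 3 × 517 + 3 = 1554 bp of coding sequence. The following is a minimal Python sketch of such a check, assuming the standard genetic code applies and that the transcript sequence is the full coding sequence. The helper names `translate` and `expected_cds_length` are hypothetical and not part of any database tooling. Note that this sketch writes the stop codon as `*`, whereas the protein record above uses `-`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding the protein plus one stop codon."""
    return 3 * protein_length + 3


if __name__ == "__main__":
    # First 30 nt and last 9 nt of DPOGS212010-TA, copied from the record above.
    print(translate("ATGAAATCCAGCATTATAAAACTGACACTC"))
    print(translate("GTACAATAA"))
    print(expected_cds_length(517))
```

Passing the full DPOGS212010-TA sequence to `translate` and replacing the trailing `*` with `-` should allow a direct string comparison with DPOGS212010-PA.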