Monarch geneset OGS2.0

DPOGS205618
Transcript: DPOGS205618-TA, 1329 bp
Protein: DPOGS205618-PA, 442 aa
Genomic position: DPSCF300023 - 968076-984698
RNAseq coverage: 606x (Rank: top 21%)
Annotation
Heliconius: HMEL007344  0.0  86.30%
Bombyx: BGIBMGA001130-TA  5e-162  82.91%
Drosophila: ZnT63C-PD  3e-144  66.28%
EBI UniRef50: UniRef50_Q7K3K5  5e-142  66.28%  LD22804p n=32 Tax=Metazoa RepID=Q7K3K5_DROME
NCBI RefSeq: XP_001957004.1  2e-146  67.13%  GF10205 [Drosophila ananassae]
NCBI nr blastp: gi|322789348  4e-153  65.84%  hypothetical protein SINV_09120 [Solenopsis invicta]
NCBI nr blastx: gi|322789348  4e-154  65.84%  hypothetical protein SINV_09120 [Solenopsis invicta]
Group
Gene Ontology: GO:0055085  1.2e-121  transmembrane transport
Gene Ontology: GO:0016021  1.2e-121  integral to membrane
Gene Ontology: GO:0006812  1.2e-121  cation transport
Gene Ontology: GO:0008324  1.2e-121  cation transmembrane transporter activity
KEGG pathway 
InterPro domain: [1-392]  IPR002524  1.2e-121  Cation efflux protein
Orthology group: MCL12626  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205618-TA
ATGGGGAAGTTTTCGGGTAAGAAGTGTCGTTTACTGTCCATGCTATGGTTAACAGGGACATTTTTCTTTGTGGAGCTCATAGTCGGTTATGTTACGAACTCTATGGCATTGGTCGCTGATTCCTTCCACATGTTGAGTGATGTCGCAGCTCTTGTTATTGCATTTTTATCTGTTAAGATGTCACCAAAGAAATGGTCGAAGAATACCTTTGGTTGGGCGCGTGCGGAAGTTTTGGGAGCTCTTGTGAATGCTGTGTTCCTGGTAGCGCTGTGCTTCAGCATCACAGTGGAGGCTGTGCAGAGATTCATCCGAGCTGAGATGATACATAACGCTCAGCTGCTTGTAGCTGTCGGCACTTTAGGACTTGTTCTCAACATCATTGGACTGTTTCTGTTCCACGAGCACGGTAGTAGCCATGGTCATAGCCACGGGGTGGTGCCACCACCATCCAACGTCCGCCACCTGTCGGAGCTGGTGAACAGCAACGCGGATATGGCACTGGGGCATGCTACCACCGACACTGAGGAGACAGACGAGATGGTACCACCGAAAGTGGTCAAGATACCAAACGACCAAACACCCAAGACACATTCAGACCCTGGCAATCTGAACATGAAGGGTGTCTTCCTGCACGTGCTGTCTGATGCGTTGGGTTCCTTAATAGTCGTAAGCTCAGCGCTCGTGGTGTGGCTGACTGAGTGGCGATACAAGTACTACATCGACCCGGCACTCAGTATAGTGCTGGTTATTCTGATACTGGCATCAGTCTGGCCGCTGTTGAGGGAATCAGCTCTCATACTGCTGCAGACAGTGCCGACACATATACAGGTGGATGCAATCCAAAGACGTCTCCTGGAGAAGGTGGACGGTGTGCTGGCGGTGCACGAGTTTCACGTCTGGCAGTTAGCTGGAGATAGAATTATAGCCAGCGCTCACATACGGTGTAGGAACCTGTCGGAGTACATGAAGATAGCTGAGAAAGTTAAGGAGTTCTTCCACAACGAGGGCATACACTCGACCACCATACAGCCGGAGTTTGTGGAACTGCCGCTGGATGGGAACGAGATCACTAGCGGCGCTTCAGCGGAGGCCCCGTGTGCCTTACATTGCCCCCCCAACGACCTCTGTCACAACGCTACTTGCTGTGGACCACATCAGGAGAAACAGAGCACCCCCTATTTATGTCGTCAGAGAAACATGGGATCGCGAGCAGAGCCTTTGAACTCTGACCTCGATAGCTCCGGGAATCCCACCGCTAGGAATCTTGACGCTTTGGAATGCGGATCACTGTTGCCTTTCCCGATGCTAGATGGACGTTTGACAGCTTGA

Protein sequence:

>DPOGS205618-PA
MGKFSGKKCRLLSMLWLTGTFFFVELIVGYVTNSMALVADSFHMLSDVAALVIAFLSVKMSPKKWSKNTFGWARAEVLGALVNAVFLVALCFSITVEAVQRFIRAEMIHNAQLLVAVGTLGLVLNIIGLFLFHEHGSSHGHSHGVVPPPSNVRHLSELVNSNADMALGHATTDTEETDEMVPPKVVKIPNDQTPKTHSDPGNLNMKGVFLHVLSDALGSLIVVSSALVVWLTEWRYKYYIDPALSIVLVILILASVWPLLRESALILLQTVPTHIQVDAIQRRLLEKVDGVLAVHEFHVWQLAGDRIIASAHIRCRNLSEYMKIAEKVKEFFHNEGIHSTTIQPEFVELPLDGNEITSGASAEAPCALHCPPNDLCHNATCCGPHQEKQSTPYLCRQRNMGSRAEPLNSDLDSSGNPTARNLDALECGSLLPFPMLDGRLTA-
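
Consistency check (illustrative sketch):

The transcript and protein lengths listed above agree if the transcript is a complete coding sequence: 442 codons plus one stop codon give 1329 bp. The Python sketch below shows one way to check this and to translate a coding sequence with the standard genetic code. It assumes the transcript starts at the ATG and contains only A, C, G and T, and it writes the stop codon as "-" to match the protein record. The names CODON_TABLE, translate and expected_cds_length are hypothetical helpers, not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G,
# third position varying fastest, to match the amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Assumption: the input is in frame, starts at the first codon,
    and uses only the letters A, C, G, T (ambiguity codes such as N
    would raise KeyError).
    """
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding the protein plus one stop codon."""
    return (protein_length + 1) * 3


if __name__ == "__main__":
    # First 30 nt and last 9 nt of DPOGS205618-TA, copied from the record.
    print(translate("ATGGGGAAGTTTTCGGGTAAGAAGTGTCGT"))
    print(translate("ACAGCTTGA"))
    print(expected_cds_length(442))
```

Passing the full DPOGS205618-TA sequence to translate should reproduce the DPOGS205618-PA record, including the trailing "-", if the assumptions above hold.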