Monarch geneset OGS2.0

DPOGS211328
TranscriptDPOGS211328-TA1038 bp
ProteinDPOGS211328-PA345 aa
Genomic positionDPSCF300125 + 251543-257956
RNAseq coverage227x (Rank: top 44%)
Annotation
HeliconiusHMEL0164755e-11871.15% 
BombyxBGIBMGA004955-TA9e-6353.44% 
DrosophilaZip3-PA6e-7343.59% 
EBI UniRef50UniRef50_F4WYB22e-7349.20%Zinc transporter ZIP1 n=5 Tax=Formicidae RepID=F4WYB2_ACREC
NCBI RefSeqXP_001659748.13e-7444.61%zinc/iron transporter [Aedes aegypti]
NCBI nr blastpgi|3800294714e-7648.30%PREDICTED: zinc transporter ZIP3-like [Apis florea]
NCBI nr blastxgi|3838572357e-7851.11%PREDICTED: zinc transporter ZIP3-like [Megachile rotundata]
Group
Gene OntologyGO:00160203.6e-55membrane
GO:00550853.6e-55transmembrane transport
GO:00468733.6e-55metal ion transmembrane transporter activity
GO:00300013.6e-55metal ion transport
KEGG pathway 
InterPro domain[1-299] IPR0036893.6e-55Zinc/iron permease
Orthology groupMCL34877 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211328-TA
ATGGTTTCCCTCTTCGCCATATCGATGGCTGTTGGTATAGCGCCCATGTTAATATCGGTCAAATTTGGCTGGTTCACACAATCAGATGGGGAAATACGTTCCAGCAAACTTGTAATGGGACTGTTGTCTTTCGGCGGAGGGGTCCTTTTTGCTACTACATTCATGCATTTGTTACCGGAAGTTGCAGAAAATATTAAGGAATTACAAGAAACCGGTGTTATACCAGAGATACCTTTGTACCTGGCTTCGCTCGTCATGTGCTGTGGGTTCTTCATGATGTATTTAGTTGAAGAGCTTGTCCATGCGTACATAAACAGCCACCAGAACAAAGACGCCAATACCAGCTTCACACGCGTCTTGAGCATACGAAGAAAGAGCAATGAGACAGTGGAAACCAATGAACCAGTTACGAAAAACGTAGAAGCCAATTATGGTGACAGGCACTTGCCTCTGAGTGGTGATGACACGACTGTAACAGCTTTAAGGGGACTGTTGATTGTGCTAGCTTTGTCTATACACGAGCTGTTTGAAGGATTAGCGGTTGGTCTGGAGTCATCTGTGAGGAATGTTTGGTATATGTTTGGTGCGGTTTCCGCTCACAAATATATTATCGCGTTCTGCATCGGAGTCGAGTTATTGGCAGCTGGCACGAAGAGATGGTTGTCAGTTGTCTATGTTTTTACATTTTCCTTCGTGTCAGCTTTGGGGATCGCAGTTGGGATCCTGCTGGTAGGTGGCGCTGGAGCGACTGCAGCCGGTATATCATCTGTTGTGCTTCAGGGTCTGGCTTGCGGAACCTTAATGTACGTGGTATTCTTTGAAGTGTGGCGTCAAGACAGAACCGGCCTCTTGCAGTTTGTGTGCTCAGTGGTTGGTTTCGCGATCATGGTTGGACTGCAGACGGTTGCGGAATTATGTTTGTATTTAGTTTATATTGATATATGTGAAGTAATGAGTACGAGACATCATATTATGAGAGGATACTTAGTCGTTGATTCGTTCGATTTAAATCGATTCAGTGAAGTTCGTATTATATGA

Protein sequence:

>DPOGS211328-PA
MVSLFAISMAVGIAPMLISVKFGWFTQSDGEIRSSKLVMGLLSFGGGVLFATTFMHLLPEVAENIKELQETGVIPEIPLYLASLVMCCGFFMMYLVEELVHAYINSHQNKDANTSFTRVLSIRRKSNETVETNEPVTKNVEANYGDRHLPLSGDDTTVTALRGLLIVLALSIHELFEGLAVGLESSVRNVWYMFGAVSAHKYIIAFCIGVELLAAGTKRWLSVVYVFTFSFVSALGIAVGILLVGGAGATAAGISSVVLQGLACGTLMYVVFFEVWRQDRTGLLQFVCSVVGFAIMVGLQTVAELCLYLVYIDICEVMSTRHHIMRGYLVVDSFDLNRFSEVRII-