Monarch geneset OGS2.0

DPOGS211947
TranscriptDPOGS211947-TA1077 bp
ProteinDPOGS211947-PA358 aa
Genomic positionDPSCF300011 + 920922-925209
RNAseq coverage213x (Rank: top 46%)
Annotation
HeliconiusHMEL0177581e-16679.94% 
BombyxBGIBMGA000896-TA1e-12868.48% 
DrosophilaCG5130-PB2e-3927.83% 
EBI UniRef50UniRef50_E2C7772e-4430.95%Zinc transporter 1 n=3 Tax=Formicidae RepID=E2C777_HARSA
NCBI RefSeqXP_001943438.15e-4528.89%PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]
NCBI nr blastpgi|3838621592e-4428.50%PREDICTED: zinc transporter 1-like [Megachile rotundata]
NCBI nr blastxgi|3071933282e-4330.95%Zinc transporter 1 [Harpegnathos saltator]
Group
Gene OntologyGO:00550854.7e-32transmembrane transport
GO:00160214.7e-32integral to membrane
GO:00068124.7e-32cation transport
GO:00083244.7e-32cation transmembrane transporter activity
KEGG pathway 
InterPro domain[18-283] IPR0025244.7e-32Cation efflux protein
Orthology groupMCL24772 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211947-TA
ATGGCTATGAAAGAATGGTTGCAGTGGCTCCCACCTCCTCGCTCCTTGATGGCACTGCTCATAGCCATCACAGGCTTCGGCGGTCGCATGTGTGCAGCTTACGTCACCCACTCACCGACTCTCCTAGTGGATGCCTGTCACTCCCTGTGCAGGCTGGTGGGCCTCGTCACCACACTGCTGGCGTATAAGTACCAAAGGGCTGATGAAGGAGCCGGGCGCGAGGGTCGTCTTCGGAACACCTTCGGCTGGACCCGTATCGAGGTGGTTGGGAGGTTGTCCGTCCACGTGCTCTTCGCCTCCTTCGCTTTAGCGCTGGTGGTGAACGCTCTCCAGCTGGGAGTCCATTCTTCACACGTCACCCCACCCAAATTCCCCCGGGTCATAGTCCTTAGCGCTGTTGTTGGACTGTTGTTACACGCTACTAACTATATGCTACTCGCTGGTCGAGAGTTAAGTTACAGTCGTCGGCTGAGTATTGCCGAAGGTGGAGATGTGGTTCTGAAGAGTGGAACCGCTGAGCCTGTACTGGCACACGCGCCCACCGATATAGCCAGCAGTTTGTTCGTGATGGCCGCCGGTCTGACGCTAGAGTGGGAGCCGGTAGCCGCGAGGATAGCTGACCCGGCGCTGTCTGCAGCCGCCGCTATCACCCTCGTTATATTCAACTATCCATTCATGCGTTCCGCCGGGCTGGTGTTGTTGCAGACCGTGCCGGAGGGTCTGGGCGCGGGTTCCCTCCGCGAGGCCGCCCTCCGAGTCCGAGGAGTTCTCGCCGTCCACGAGCTACACGTGTGGCAACTTCACCGAGACAGGATTGTTGCCACGGCTCACATATTCTACGAATCACCTGAGGACTACCTGAGTAGCGCCGGCCTCGTCTGTGATGTATTCAAGCGCCACGGGATCAGCTTGGTGACTCTTCAACCGGAGTTCATAGTCTCAGCGGATTCAGATGCGGAAGAGAGGAAGATGTTAATCGAATTCGCGAACACGGCCTGCTCCTGTCCTTGCGCCAAGGACTGCTCGGCCCCGCGATGCTGCCAGGCCCCGCGGCGACCGTCCGTCACACGCGTCTGA

Protein sequence:

>DPOGS211947-PA
MAMKEWLQWLPPPRSLMALLIAITGFGGRMCAAYVTHSPTLLVDACHSLCRLVGLVTTLLAYKYQRADEGAGREGRLRNTFGWTRIEVVGRLSVHVLFASFALALVVNALQLGVHSSHVTPPKFPRVIVLSAVVGLLLHATNYMLLAGRELSYSRRLSIAEGGDVVLKSGTAEPVLAHAPTDIASSLFVMAAGLTLEWEPVAARIADPALSAAAAITLVIFNYPFMRSAGLVLLQTVPEGLGAGSLREAALRVRGVLAVHELHVWQLHRDRIVATAHIFYESPEDYLSSAGLVCDVFKRHGISLVTLQPEFIVSADSDAEERKMLIEFANTACSCPCAKDCSAPRCCQAPRRPSVTRV-