Monarch geneset OGS2.0

DPOGS206381
Transcript        DPOGS206381-TA    1245 bp
Protein           DPOGS206381-PA    414 aa
Genomic position  DPSCF300192 - 76896-85159
RNAseq coverage   1180x (Rank: top 11%)
Annotation
Heliconius      HMEL009009        0.0     84.26%
Bombyx          BGIBMGA005779-TA  7e-143  80.12%
Drosophila      ZnT35C-PA         2e-111  49.07%
EBI UniRef50    UniRef50_G6D936   0.0     98.07%  Putative uncharacterized protein n=9 Tax=Coelomata RepID=G6D936_DANPL
NCBI RefSeq     XP_624588.1       5e-142  59.27%  PREDICTED: similar to CG31860-PA [Apis mellifera]
NCBI nr blastp  gi|307180025      3e-147  66.02%  UDP-glucose:glycoprotein glucosyltransferase [Camponotus floridanus]
NCBI nr blastx  gi|307180025      1e-149  66.35%  UDP-glucose:glycoprotein glucosyltransferase [Camponotus floridanus]
Group
Gene Ontology    GO:0055085          1.1e-133  transmembrane transport
                 GO:0016021          1.1e-133  integral to membrane
                 GO:0006812          1.1e-133  cation transport
                 GO:0008324          1.1e-133  cation transmembrane transporter activity
KEGG pathway
InterPro domain  [64-413] IPR002524  1.1e-133  Cation efflux protein
Orthology group  MCL10833            Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206381-TA
ATGACCGAGGACACCAAGCCGTTATTGCTATTCAAAGATGCTAAAGCCAGCTACGGAACCGATAACCCGGCGAGGGGTCGGAGGGTCATCTTCTGCGTCCACGGGAACCCTTCCACGGGATGCTGCGCTGTCATTGAGACCAGCGCGGACGGTGATGAAGTTGCAAGAACGAACAGTATTACTGAAGAGAGACATTGTCACAGATCAAGGAACGAAGAAATAGACAAGCGGGCCCGAAGAAAACTTATTATAGCGAGCGTGCTGTGTGTTATATTTATGATCGGGGAAATTGTAGGTGGTTATTTATCTAACAGCTTAGCCATAGCAACGGATGCTGCACATCTGCTGACAGACTTCGCTAGCTTCATGATATCACTGTTCTCATTGTGGGTGGCCAGCAGGCCCGCCACCAGACGGATGCCGTTCGGGTGGTACCGTGCGGAGGTGATCGGCGCCCTGACCTCGGTCCTCCTTATCTGGGTGGTGACGGGCATCCTCCTGTACATGGCCGTCCAGCGAGTCATCTACAAGTCATTCGAGATAGACGCCACCGTCATGCTCATCACGTCCGCCGTCGGAGTCGCCGTGAACCTAGTTATGGGTCTGACGCTACACCAACATGGACACAGTCACGGAGGACAGGCGGGACACGGACACAGCCATGGAGGGGCCAACCCGGTGCTCAATAATAAGGAGCGTGTGGACTCGGACGCCGAGAGCTCGTCGTCTCACACCCAGGAGGTTCACTCTCACACTCACGGCGAGAACATCAACGTGCGGGCAGCCTTCATCCACGTGCTGGGGGACTTCCTGCAGAGCTTCGGGGTCCTGGTCGCTGCTATCGTCATATACTTCAAGCCGGAGTGGAGCCTGGTGGACCCTATCTGCACGTTCCTGTTCTCGGTGCTGGTGCTGCTCACCACATACAACATCATCAAGGACGCCCTGCTGGTGCTCATGGAGGGCTCTCCGCGCGGCGTGGACTTCCAGGAGGTGGCCAACACGTTCCTGTCTCTCCCGGGCGTGGTCCGTGTACACAACCTGCGGATGTGGGCGTTGTCTCTCGACAAGACCGCGCTCGCCGCTCATCTCGCTATACGGAGCGGAGTGAGTCCCCAGAAGGTTCTAGAACAGGCGACGCGTCTCGTTCACGAGAAATACAACTTCTTCGAGATGACGCTGCAGATCGAGGAGTTCAGTGACGTCATGGAGCAGTGCAGACAATGTGAGATGCCCAGCGCCTAG

Protein sequence:

>DPOGS206381-PA
MTEDTKPLLLFKDAKASYGTDNPARGRRVIFCVHGNPSTGCCAVIETSADGDEVARTNSITEERHCHRSRNEEIDKRARRKLIIASVLCVIFMIGEIVGGYLSNSLAIATDAAHLLTDFASFMISLFSLWVASRPATRRMPFGWYRAEVIGALTSVLLIWVVTGILLYMAVQRVIYKSFEIDATVMLITSAVGVAVNLVMGLTLHQHGHSHGGQAGHGHSHGGANPVLNNKERVDSDAESSSSHTQEVHSHTHGENINVRAAFIHVLGDFLQSFGVLVAAIVIYFKPEWSLVDPICTFLFSVLVLLTTYNIIKDALLVLMEGSPRGVDFQEVANTFLSLPGVVRVHNLRMWALSLDKTALAAHLAIRSGVSPQKVLEQATRLVHEKYNFFEMTLQIEEFSDVMEQCRQCEMPSA-
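
The transcript and protein records above are related by translation: 1245 bp corresponds to 414 codons plus one stop codon. The following is a minimal sketch of that check, not part of the original record. It assumes the standard genetic code applies and that the trailing "-" in the protein record marks the stop codon; the function name `translate` and the `stop_symbol` parameter are illustrative choices. Only the first and last few codons of DPOGS206381-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used by default because
    the protein record above appears to end with that character.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


# First ten codons and last five codons of DPOGS206381-TA, copied from the record.
first_codons = "ATGACCGAGGACACCAAGCCGTTATTGCTA"
last_codons = "ATGCCCAGCGCCTAG"

print(translate(first_codons))
print(translate(last_codons))

# Length consistency: 414 residues plus one stop codon, three bases each.
print(3 * (414 + 1) == 1245)
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS206381-PA record would extend the same check to the whole gene model.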